JP2017083922A

JP2017083922A - System parameter identification apparatus, system parameter identification method, and computer program therefor

Info

Publication number: JP2017083922A
Application number: JP2015208362A
Authority: JP
Inventors: 徳晃廣瀬; Noriaki Hirose; 竜介但馬; Ryusuke Tajima
Original assignee: Toyota Central R&D Labs Inc
Current assignee: Toyota Central R&D Labs Inc
Priority date: 2015-10-22
Filing date: 2015-10-22
Publication date: 2017-05-18

Abstract

PROBLEM TO BE SOLVED: To disclose a technology capable of improving identification accuracy of a parameter when identifying the parameter of a system subjected to control while using a particle filter.SOLUTION: In a system parameter identification device 10, a particle filtering execution part 24 allocates a state vector of a system to each of a plurality of particles, calculates a weight of each of the particles on the basis of an observed value and an estimate of each of the particles and updates the particle on the basis of the weight. Therefore, the state vector is estimated and the parameter is identified. In the particle filtering execution part 24, the particle is updated at an interval of (n) steps ((n) is an integer equal to or greater than 2) from a first step within any period, and the weight of each of the particles is calculated on the basis of observed values and estimates that are acquired in at least two steps among the (n) steps included in one update cycle in which the particle is updated.SELECTED DRAWING: Figure 1

Description

本明細書に開示する技術は、システムパラメータ同定装置、システムパラメータ同定方法、及びそのためのコンピュータプログラムに関する。詳しくは、制御対象となるシステムが有するパラメータを同定するための技術に関する。 The technology disclosed in this specification relates to a system parameter identification device, a system parameter identification method, and a computer program therefor. Specifically, the present invention relates to a technique for identifying parameters of a system to be controlled.

非特許文献１には、パーティクルフィルタを用いて制御対象となるシステムのパラメータを同定する技術が開示されている。パーティクルフィルタとは、所与の観測値と各パーティクルの推定値に基づいて各パーティクルの重みを求め、その重みに基づいてパーティクルを更新する手順を繰り返すことで、推定したい値を求める公知の手法である。 Non-Patent Document 1 discloses a technique for identifying a parameter of a system to be controlled using a particle filter. The particle filter is a known method for obtaining the value to be estimated by repeating the procedure of obtaining the weight of each particle based on a given observation value and the estimated value of each particle, and updating the particle based on the weight. is there.

益田哲也、杉江俊治、“粒子フィルタによる線形近似システムのパラメータ同定”、システム制御情報学会論文誌、ｖｏｌ．２４、Ｎｏ．１０、ｐ２５０−２５４、２０１１Tetsuya Masuda, Toshiharu Sugie, “Parameter Identification of Linear Approximation System Using Particle Filter”, Transactions of the Institute of Systems, Control and Information Engineers, vol. 24, no. 10, p250-254, 2011

非特許文献１では、システムの出力がステップ毎に観測され、パーティクルは、ステップ毎に更新される。即ち、パーティクルの更新周期は、システムの出力のステップ周期と等しい。この場合、各パーティクルの重みは、１個のステップにおける観測値と推定値とに基づいて算出される。このような手法では、実際のパラメータ（以下、真値とも称する）とは異なる値を有するパラメータに対してパーティクルの重みが大きくなる場合がある。このため、真値ではないパラメータを同定することが起こり得ることとなり、真値の同定精度が低い。 In Non-Patent Document 1, the output of the system is observed for each step, and the particles are updated for each step. That is, the particle update period is equal to the step period of the system output. In this case, the weight of each particle is calculated based on the observed value and the estimated value in one step. In such a method, there are cases where the weight of particles increases for parameters having values different from actual parameters (hereinafter also referred to as true values). For this reason, it is possible to identify a parameter that is not a true value, and the accuracy of identifying a true value is low.

本明細書では、パーティクルフィルタを用いて制御対象となるシステムのパラメータを同定するに際して、パラメータの同定精度を向上できる技術を開示する。 The present specification discloses a technique capable of improving parameter identification accuracy when identifying a parameter of a system to be controlled using a particle filter.

本明細書は、パーティクルフィルタを用いて制御対象のシステムのパラメータを同定するシステムパラメータ同定装置を開示する。このシステムパラメータ同定装置は、入力値取得部と、観測値取得部と、第１関数記憶部と、第２関数記憶部と、状態ベクトル算出部と、推定値算出部と、パーティクルフィルタ実行部と、を備える。入力値取得部は、システムへの入力値を、任意の期間においてステップ毎に取得可能である。観測値取得部は、システムからの出力の観測値を、任意の期間においてステップ毎に取得可能である。第１関数記憶部は、任意の期間内の任意のステップにおけるシステムの状態ベクトルを、任意のステップの１つ前のステップにおけるシステムの状態ベクトルに基づいて算出する第１関数を記憶している。システムの状態ベクトルは、システムの状態量と、システムのパラメータと、入力値に関連するシステムノイズとを要素に含んでいる。第２関数記憶部は、任意のステップにおけるシステムの状態ベクトルに基づいて、任意のステップにおけるシステムの出力の推定値を算出する第２関数を記憶している。状態ベクトル算出部は、任意のステップの１つ前のステップにおけるシステムの状態ベクトルを第１関数記憶部に記憶されている第１関数に入力して、任意のステップにおけるシステムの状態ベクトルを算出する。推定値算出部は、任意のステップにおけるシステムの状態ベクトルを第２関数記憶部に記憶されている第２関数に入力して、任意のステップにおけるシステムの出力の推定値を算出する。パーティクルフィルタ実行部は、複数のパーティクルのそれぞれに、システムの状態ベクトルを割り当て、観測値取得部から取得した観測値と、パーティクル毎に推定値算出部から取得した推定値とに基づいて、各パーティクルの重みを算出し、その重みに基づいてパーティクルを更新することでシステムの状態ベクトルを推定してシステムのパラメータを同定する。パーティクルフィルタ実行部では、パーティクルは、任意の期間内の最初のステップからｎステップ目毎に（ｎ：２以上の整数）更新され、かつ、各パーティクルの重みは、パーティクルが更新される１個の更新周期内に含まれるｎ個のステップのうちの少なくとも２個のステップにおいて取得された観測値と推定値とに基づいて算出される。 The present specification discloses a system parameter identification device that identifies a parameter of a system to be controlled using a particle filter. The system parameter identification device includes an input value acquisition unit, an observation value acquisition unit, a first function storage unit, a second function storage unit, a state vector calculation unit, an estimated value calculation unit, a particle filter execution unit, . The input value acquisition unit can acquire an input value to the system for each step in an arbitrary period. The observation value acquisition unit can acquire the observation value of the output from the system for each step in an arbitrary period. The first function storage unit stores a first function that calculates a system state vector at an arbitrary step within an arbitrary period based on a system state vector at a step immediately preceding the arbitrary step. The system state vector includes system state quantities, system parameters, and system noise related to input values. The second function storage unit stores a second function that calculates an estimated value of the output of the system at an arbitrary step based on the state vector of the system at the arbitrary step. The state vector calculation unit inputs the system state vector in the step immediately before the arbitrary step to the first function stored in the first function storage unit, and calculates the system state vector in the arbitrary step. . The estimated value calculating unit inputs a system state vector at an arbitrary step to the second function stored in the second function storage unit, and calculates an estimated value of the system output at the arbitrary step. The particle filter execution unit assigns a system state vector to each of a plurality of particles, and based on the observation values acquired from the observation value acquisition unit and the estimation values acquired from the estimation value calculation unit for each particle, The system state vector is estimated by updating the particles based on the weight and the system parameters are identified. In the particle filter execution unit, the particles are updated every n steps (n: an integer of 2 or more) from the first step in an arbitrary period, and the weight of each particle is the same as that of the particle being updated. It is calculated based on the observed value and the estimated value acquired in at least two of the n steps included in the update period.

上記のシステムパラメータ同定装置では、パーティクルの更新周期が、ステップ周期のｎ倍（ｎ：２以上の整数）である。そして、各パーティクルの重みは、パーティクルが更新される１個の更新周期内に含まれるｎ個のステップのうちの少なくとも２個のステップにおいて取得された観測値と推定値とに基づいて算出される。このため、各パーティクルの重みを１個のステップにおける観測値と推定値とに基づいて算出する場合と比較して、真値とは異なるパラメータに対してパーティクルの重みが大きくなってしまうことが起こり難くなる。パーティクルの重みは、より真値に近いパラメータに対して大きくなり易くなる。この結果、制御対象となるシステムのパラメータの同定精度を向上できる。 In the system parameter identification device described above, the particle update period is n times the step period (n is an integer of 2 or more). The weight of each particle is calculated based on the observed value and the estimated value acquired in at least two of the n steps included in one update cycle in which the particle is updated. . For this reason, compared with the case where the weight of each particle is calculated based on the observed value and the estimated value in one step, the particle weight may increase for a parameter different from the true value. It becomes difficult. Particle weights tend to be large for parameters closer to true values. As a result, the identification accuracy of the parameters of the system to be controlled can be improved.

また、本明細書は、パーティクルフィルタを用いて制御対象のシステムのパラメータを同定する新規なシステムパラメータ同定方法を開示する。このシステムパラメータ同定方法は、入力値取得工程と、観測値取得工程と、状態ベクトル算出工程と、推定値算出工程と、パーティクルフィルタ実行工程とを備える。入力値取得工程では、システムへの入力値を、任意の期間においてステップ毎に取得する。観測値取得工程では、システムからの出力の観測値を、任意の期間においてステップ毎に取得する。状態ベクトル算出工程では、システムの状態量と、システムのパラメータと、入力値に関連するシステムノイズとを要素に含む、システムの状態ベクトルであって、任意の期間内の任意のステップにおけるシステムの状態ベクトルを、任意のステップの１つ前のステップにおけるシステムの状態ベクトルに基づいて算出する第１関数に、１つ前のステップにおけるシステムの状態ベクトルを入力して任意のステップにおけるシステムの状態ベクトルを算出する。推定値算出工程では、任意のステップにおけるシステムの状態ベクトルに基づいて任意のステップにおけるシステムの出力の推定値を算出する第２関数に、任意のステップにおけるシステムの状態ベクトルを入力して任意のステップにおけるシステムの出力の推定値を算出する。パーティクルフィルタ実行工程では、複数のパーティクルのそれぞれに、システムの状態ベクトルを割り当て、観測値取得工程で取得した観測値と、パーティクル毎に推定値算出工程で取得した推定値とに基づいて、各パーティクルの重みを算出し、その重みに基づいてパーティクルを更新することでシステムの状態ベクトルを推定してシステムのパラメータを同定する。パーティクルフィルタ実行工程では、パーティクルは、任意の期間内の最初のステップからｎステップ目毎に（ｎ：２以上の整数）更新され、かつ、各パーティクルの重みは、パーティクルが更新される１個の更新周期内に含まれるｎ個のステップのうちの少なくとも２個のステップにおいて取得された観測値と推定値とに基づいて算出される。このシステムパラメータ同定方法によると、制御対象となるシステムのパラメータの同定精度を適切に向上できる。 The present specification also discloses a novel system parameter identification method for identifying parameters of a system to be controlled using a particle filter. This system parameter identification method includes an input value acquisition step, an observation value acquisition step, a state vector calculation step, an estimated value calculation step, and a particle filter execution step. In the input value acquisition process, an input value to the system is acquired for each step in an arbitrary period. In the observation value acquisition step, the observation value of the output from the system is acquired for each step in an arbitrary period. In the state vector calculation process, a system state vector including elements of system state quantities, system parameters, and system noise related to input values, and the state of the system at any step within an arbitrary period The system state vector in an arbitrary step is input by inputting the system state vector in the previous step into a first function that calculates the vector based on the system state vector in the previous step of the arbitrary step. calculate. In the estimated value calculating step, the system state vector at any step is input to the second function that calculates the estimated value of the system output at the arbitrary step based on the system state vector at the arbitrary step. Compute an estimate of the system output at. In the particle filter execution step, a system state vector is assigned to each of a plurality of particles, and each particle is based on the observation value acquired in the observation value acquisition step and the estimation value acquired in the estimation value calculation step for each particle. The system state vector is estimated by updating the particles based on the weight and the system parameters are identified. In the particle filter execution step, particles are updated every n steps (n: an integer of 2 or more) from the first step in an arbitrary period, and the weight of each particle is the same as that of the particle being updated. It is calculated based on the observed value and the estimated value acquired in at least two of the n steps included in the update period. According to this system parameter identification method, the identification accuracy of the parameters of the system to be controlled can be appropriately improved.

また、本明細書は、パーティクルフィルタを用いて制御対象のシステムのパラメータを同定するための新規なコンピュータプログラムを開示する。このコンピュータプログラムは、コンピュータに、入力値取得処理と、観測値取得処理と、状態ベクトル算出処理と、推定値算出処理と、パーティクルフィルタ実行処理と、を実行させる。入力値取得処理では、システムへの入力値を、任意の期間においてステップ毎に取得する。観測値取得処理では、システムからの出力の観測値を、任意の期間においてステップ毎に取得する。状態ベクトル算出処理では、システムの状態量と、システムのパラメータと、入力値に関連するシステムノイズとを要素に含む、システムの状態ベクトルであって、任意の期間内の任意のステップにおけるシステムの状態ベクトルを、任意のステップの１つ前のステップにおけるシステムの状態ベクトルに基づいて算出する第１関数に、１つ前のステップにおけるシステムの状態ベクトルを入力して任意のステップにおけるシステムの状態ベクトルを算出する。推定値算出処理では、任意のステップにおけるシステムの状態ベクトルに基づいて任意のステップにおけるシステムの出力の推定値を算出する第２関数に、任意のステップにおけるシステムの状態ベクトルを入力して任意のステップにおけるシステムの出力の推定値を算出する。パーティクルフィルタ実行処理では、複数のパーティクルのそれぞれに、システムの状態ベクトルを割り当て、観測値取得処理で取得した観測値と、パーティクル毎に推定値算出処理で取得した推定値とに基づいて、各パーティクルの重みを算出し、その重みに基づいてパーティクルを更新することでシステムの状態ベクトルを推定してシステムのパラメータを同定する。パーティクルフィルタ実行処理では、パーティクルは、任意の期間内の最初のステップからｎステップ目毎に（ｎ：２以上の整数）更新され、かつ、各パーティクルの重みは、パーティクルが更新される１個の更新周期内に含まれるｎ個のステップのうちの少なくとも２個のステップにおいて取得された観測値と推定値とに基づいて算出される。このコンピュータプログラムによると、コンピュータに、パラメータを同定するための処理を適切に実行させることができる。 The present specification also discloses a novel computer program for identifying parameters of a system to be controlled using a particle filter. This computer program causes a computer to execute an input value acquisition process, an observation value acquisition process, a state vector calculation process, an estimated value calculation process, and a particle filter execution process. In the input value acquisition process, an input value to the system is acquired for each step in an arbitrary period. In the observation value acquisition process, the observation value of the output from the system is acquired for each step in an arbitrary period. In the state vector calculation process, a system state vector including elements of system state quantities, system parameters, and system noise related to input values, and the state of the system at any step within an arbitrary period The system state vector in an arbitrary step is input by inputting the system state vector in the previous step into a first function that calculates the vector based on the system state vector in the previous step of the arbitrary step. calculate. In the estimated value calculation process, the system state vector at an arbitrary step is input to a second function that calculates an estimated value of the system output at an arbitrary step based on the system state vector at an arbitrary step. Compute an estimate of the system output at. In the particle filter execution processing, a system state vector is assigned to each of a plurality of particles, and each particle is based on the observation value acquired by the observation value acquisition processing and the estimation value acquired by the estimation value calculation processing for each particle. The system state vector is estimated by updating the particles based on the weight and the system parameters are identified. In the particle filter execution processing, the particles are updated every n steps (n: an integer of 2 or more) from the first step in an arbitrary period, and the weight of each particle is the same as that of the particle being updated. It is calculated based on the observed value and the estimated value acquired in at least two of the n steps included in the update period. According to this computer program, it is possible to cause the computer to appropriately execute processing for identifying parameters.

本明細書が開示する技術の詳細、及び、さらなる改良は、発明を実施するための形態及び実施例にて詳しく説明する。 Details of the technology disclosed in this specification and further improvements will be described in detail in the detailed description and examples.

実施例１のシステムパラメータ同定装置と外部記憶媒体のブロック図。1 is a block diagram of a system parameter identification device and an external storage medium according to a first embodiment. システムパラメータの同定工程を示すフローチャート。The flowchart which shows the identification process of a system parameter. ロボットアームへの入力トルクの時系列データ。Time series data of the input torque to the robot arm. ロボットアームからの出力角度の時系列データ。Time series data of the output angle from the robot arm. クーロン摩擦力γのパラメータノイズζ_３の分散β_３ ^２の分散計算式のグラフ。The graph of the dispersion | distribution formula of dispersion | distribution (beta) ₃ ² of parameter noise (zeta) ₃ of Coulomb frictional force (gamma). 粘性摩擦係数ｄのパラメータノイズζ_４の分散β_４ ^２の分散計算式のグラフ。Dispersion equation graph of dispersion beta ₄ ² parameters noise zeta ₄ of viscous friction coefficient d. 慣性モーメントＪのパラメータノイズζ_５の分散β_５ ^２の分散計算式のグラフ。Dispersion equation graph of dispersion beta ₅ ² parameters noise zeta ₅ of inertia J. 最初のステップ及びパーティクル更新時のステップにおける各パラメータノイズζ_１〜ζ_５の分散β_１ ^２〜β_５ ^２を表したグラフ。The first step and the graph showing the variance β ₁ ^² ~β ₅ ² of each parameter noise ζ ₁ ~ζ ₅ at step when the particle update. 実験１のパーティクル更新時のステップにおける各パラメータのパーティクルの分布を表したグラフ。The graph showing the particle distribution of each parameter in the step at the time of particle update in Experiment 1. 実験２のパーティクル更新時のステップにおける各パラメータのパーティクルの分布を表したグラフ。The graph showing the distribution of the particle | grains of each parameter in the step at the time of the particle update of Experiment 2. FIG. 実施例２の制御器とシステムのブロック図。FIG. 3 is a block diagram of a controller and system according to a second embodiment.

以下に説明する実施例の主要な特徴を列記しておく。なお、以下に記載する技術要素は、それぞれ独立した技術要素であって、単独であるいは各種の組合せによって技術的有用性を発揮するものであり、出願時請求項記載の組合せに限定されるものではない。 The main features of the embodiments described below are listed. The technical elements described below are independent technical elements and exhibit technical usefulness alone or in various combinations, and are not limited to the combinations described in the claims at the time of filing. Absent.

（特徴１）本明細書が開示するシステムパラメータ同定装置は、システムのパラメータにパラメータノイズを付与するノイズ付与部をさらに備えていてもよい。このノイズ付与部は、パーティクルが更新されるステップにおいて、システムのパラメータにパラメータノイズを付与してもよい。この構成によると、パーティクルが局所解（ローカルミニマム）に収束することを抑制でき、同定精度をさらに向上できる。 (Feature 1) The system parameter identification device disclosed in the present specification may further include a noise applying unit that applies parameter noise to the system parameters. The noise adding unit may add parameter noise to the system parameters in the step of updating the particles. According to this configuration, the particles can be prevented from converging on a local solution (local minimum), and the identification accuracy can be further improved.

（特徴２）本明細書が開示するシステムパラメータ同定装置では、パラメータノイズの分散が可変であってもよい。この構成によると、パーティクルの拡散の度合いを変更できる。 (Feature 2) In the system parameter identification device disclosed in this specification, the variance of the parameter noise may be variable. According to this configuration, the degree of particle diffusion can be changed.

（特徴３）本明細書が開示するシステムパラメータ同定装置では、任意の期間内においてパーティクルがｍ回（ｍ：２以上の整数）更新されるとすると、ｊ回目（ｊ：１〜ｍ−１までの任意の整数）に更新されるときのステップで付与されるパラメータノイズの分散が、ｎ・ｊ＋１ステップ目からｎ・ｊ＋ｎステップ目までのｎ個のステップのうちの少なくとも１個のステップにおいて取得された入力値又は観測値に基づいて決定されてもよい。この構成によると、パーティクルが不要に拡散してしまうことを抑制でき、効率的にパラメータを同定できる。 (Characteristic 3) In the system parameter identification device disclosed in the present specification, if particles are updated m times (m is an integer equal to or greater than 2) within an arbitrary period, the j-th time (j: 1 to m−1). The variance of the parameter noise given in the step at the time of updating to (an arbitrary integer) is obtained in at least one of the n steps from the n · j + 1 step to the n · j + n step. It may be determined based on the input value or the observed value. According to this structure, it can suppress that a particle | grain spread | diffuses unnecessarily and a parameter can be identified efficiently.

（特徴４）本明細書が開示するシステムパラメータ同定装置では、各パーティクルの重みは、上記少なくとも２個のステップにおける同時確率から算出されてもよい。この構成によると、パーティクルの重みは、より真値に近いパラメータに対して大きくなるため、同定精度をさらに向上できる。 (Feature 4) In the system parameter identification device disclosed in this specification, the weight of each particle may be calculated from the joint probability in the at least two steps. According to this configuration, since the weight of the particle is increased with respect to a parameter closer to the true value, the identification accuracy can be further improved.

（特徴５）システムを制御する制御器が、上記に記載のシステムパラメータ同定装置を備えていてもよい。この制御器の制御パラメータは、パーティクルフィルタ実行部により更新されたパーティクルの分散及び平均に基づいて決定されてもよい。この構成によると、制御パラメータを効率的に決定することができる。 (Feature 5) A controller for controlling the system may include the system parameter identification device described above. The control parameter of this controller may be determined based on the dispersion and average of particles updated by the particle filter execution unit. According to this configuration, the control parameter can be determined efficiently.

図１〜３を参照してシステムパラメータ同定装置１０について説明する。同定装置１０は、パーティクルフィルタを用いて制御対象となるシステム（例えば、ロボットアーム）のパラメータＰを同定する。同定装置１０には、システムのパラメータＰを同定するための各種処理を実行するコンピュータが搭載されている。コンピュータは、演算処理を行うＣＰＵ、演算処理のデータが一時的に記憶されるＲＡＭ、及びＣＰＵによって実行される演算プログラムが記憶されたＲＯＭを備えている。ＣＰＵがＲＯＭに記憶された演算プログラムを実行することで、ＣＰＵは、後述する状態ベクトル算出部、推定値算出部等として機能する。 The system parameter identification device 10 will be described with reference to FIGS. The identification device 10 identifies a parameter P of a system (for example, a robot arm) to be controlled using a particle filter. The identification device 10 is equipped with a computer that executes various processes for identifying the parameter P of the system. The computer includes a CPU that performs arithmetic processing, a RAM that temporarily stores arithmetic processing data, and a ROM that stores arithmetic programs executed by the CPU. When the CPU executes the arithmetic program stored in the ROM, the CPU functions as a state vector calculation unit, an estimated value calculation unit, and the like, which will be described later.

同定装置１０のコンピュータは、入力値取得部１２と、観測値取得部１４と、第１関数記憶部１６と、第２関数記憶部１８と、状態ベクトル算出部２０と、推定値算出部２２と、パーティクルフィルタ実行部２４と、分散計算式記憶部２６と、ノイズ決定部２８と、を備える。 The computer of the identification apparatus 10 includes an input value acquisition unit 12, an observation value acquisition unit 14, a first function storage unit 16, a second function storage unit 18, a state vector calculation unit 20, an estimated value calculation unit 22, , A particle filter execution unit 24, a dispersion calculation formula storage unit 26, and a noise determination unit 28.

同定装置１０は、オフラインで使用される。即ち、外部記憶媒体３０には、所定期間（ｔ＝０〜Ｔ）における入力値データ３２と観測値データ３４とが予め記憶されている。入力値データ３２は、システムへの入力値を示す時系列データである。観測値データ３４は、入力に対するシステムからの出力の観測値を示す時系列データである。
入力値取得部１２は、外部記憶媒体３０に記憶されている入力値データ３２から、ステップ毎に（即ち、一定の周期で）入力値ｕを取得する。取得された入力値は、ノイズ決定部２８及び状態ベクトル算出部２０に出力される。この処理は、ＣＰＵで実施される。以下では、ｋステップ目（ｋ＝０〜Ｔ／Ｔ_ｓ、Ｔ_ｓ：ステップ周期）の入力値ｕをｕ_ｋと表し、他の値の表記についても同様とする。
観測値取得部１４は、外部記憶媒体３０に記憶されている観測値データ３４から、ステップ毎に観測値Ｙ_ｋ ^ｒを取得する。取得された観測値Ｙ_ｋ ^ｒは、ノイズ決定部２８及びパーティクルフィルタ実行部２４に出力される。この処理は、ＣＰＵで実施される。なお、入力値取得部１２及び観測値取得部１４は、所定期間の一部の期間における入力値データ３２及び観測値データ３４を取得してもよい。 The identification device 10 is used offline. That is, the external storage medium 30 stores in advance input value data 32 and observation value data 34 for a predetermined period (t = 0 to T). The input value data 32 is time series data indicating an input value to the system. The observation value data 34 is time-series data indicating the observation value of the output from the system with respect to the input.
The input value acquisition unit 12 acquires the input value u from the input value data 32 stored in the external storage medium 30 for each step (that is, at a constant cycle). The acquired input value is output to the noise determination unit 28 and the state vector calculation unit 20. This process is performed by the CPU. In the following, the input value u at the k-th step (k = 0 to T / T _s , T _s : step period) is represented as u _k, and the same applies to the notation of other values.
The observation value acquisition unit 14 acquires the observation value Y _k ^r for each step from the observation value data 34 stored in the external storage medium 30. The acquired observation value Y _k ^r is output to the noise determination unit 28 and the particle filter execution unit 24. This process is performed by the CPU. Note that the input value acquisition unit 12 and the observation value acquisition unit 14 may acquire the input value data 32 and the observation value data 34 in a part of a predetermined period.

第１関数記憶部１６は、ＲＯＭに設けられており、式３で表される第１関数を記憶している。第１関数は、制御対象となるシステムの状態方程式と、隣接ステップ間のパラメータＰの関係式を統合した関数であり、詳細には、以下のように定義される。 The first function storage unit 16 is provided in the ROM and stores the first function represented by Expression 3. The first function is a function that integrates the state equation of the system to be controlled and the relational expression of the parameter P between adjacent steps, and is defined in detail as follows.

制御対象となるシステムの状態方程式は、式１によって定義される。
ｆは既知の非線形関数、Ｘ_ｋはｋステップ目のシステムの状態量、η_ｋはｋステップ目のシステムノイズである。式１から明らかなように、ｋ＋１ステップ目の状態量Ｘ_ｋ＋１は、ｋステップ目における状態量Ｘ_ｋ及びシステムノイズη_ｋに基づいて算出される。なお、本実施例では、システムノイズη_ｋとして、平均が入力値ｕ_ｋ、分散がα^２（定数）であるシステムノイズが用いられる。 The state equation of the system to be controlled is defined by Equation 1.
f is a known nonlinear function, X _k is the state quantity of the system at the k step, and η _k is the system noise at the k step. As is apparent from Equation 1, the state quantity X _{k + 1} at the ( _{k + 1) th} step is calculated based on the state quantity X _{k at the} kth step and the system noise η _k . In the present embodiment, system noise having an average input value u _k and variance α ² (constant) is used as the system noise η _k .

隣接ステップ間のパラメータＰの関係式は、式２によって定義される。
ζ_ｋは、ｋステップ目のパラメータＰ_ｋに対するパラメータノイズである。ζ_ｋの平均はゼロ、分散はβ^２である。本実施例では分散β^２は可変であるが、定数であってもよい。分散β^２については後で詳述する。 The relational expression of the parameter P between adjacent steps is defined by Expression 2.
ζ _k is parameter noise for the parameter P _{k at} the k-th step. The mean of ζ _k is zero and the variance is β ² . In this embodiment, the variance β ² is variable, but may be a constant. It will be described in detail later dispersion β ^2.

式１と式２を統合することにより、式３で表される新たな状態方程式（第１関数）が定義される。
Ｚ_ｋは、ｋステップ目のシステムの状態ベクトルであり、Ｚ_ｋ＝（Ｘ_ｋ ^Ｔ，Ｐ_ｋ ^Ｔ）^Ｔとして定義される。ｆ_ａは式１、２から導出される既知の非線形関数である。状態ベクトルＺ_ｋは、状態量Ｘ_ｋ、パラメータＰ_ｋ、システムノイズη_ｋ、パラメータノイズζ_ｋを要素に含んでいる。また、式３から明らかなように、ｋ＋１ステップ目の状態ベクトルＺ_ｋ＋１は、ｋステップ目における状態ベクトルＺ_ｋ、システムノイズη_ｋ、及びパラメータノイズζ_ｋに基づいて算出される。 By integrating Equation 1 and Equation 2, a new state equation (first function) represented by Equation 3 is defined.
Z _k is a state vector of the system in the k-th step, and is defined as Z _k = (X _k ^T , P _k ^T ) ^T. f _a is a known nonlinear function derived from Equations 1 and 2. The state vector Z _k includes a state quantity X _k , a parameter P _k , a system noise η _k , and a parameter noise ζ _k as elements. As is clear from Equation 3, the state vector Z _{k + 1} at the ( _{k + 1) th} step is calculated based on the state vector Z _{k at the} kth step, the system noise η _k , and the parameter noise ζ _k .

第２関数記憶部１８は、ＲＯＭに設けられており、式４で表される出力方程式（以下、第２関数とも称する）を記憶している。
Ｙ_ｋはｋステップ目のシステムの出力の推定値、Ｈ_ｋはｋステップ目の既知の行列、σ_ｋはｋステップ目の観測ノイズである。σ_ｋの平均は予め設定されており、分散はρ^２（定数）である。式４から明らかなように、ｋステップ目の推定値Ｙ_ｋは、ｋステップ目における状態ベクトルＺ_ｋ及び観測ノイズσ_ｋに基づいて算出される。 The second function storage unit 18 is provided in the ROM and stores an output equation (hereinafter also referred to as a second function) represented by Expression 4.
Y _k is an estimated value of the output of the system at the k step, H _k is a known matrix at the k step, and σ _k is an observation noise at the k step. The average of σ _k is set in advance, and the variance is ρ ² (constant). As is apparent from Equation 4, the estimated value Y _k at the k-th step is calculated based on the state vector Z _k and the observation noise σ _k at the k-th step.

状態ベクトル算出部２０は、後述するパーティクルフィルタ実行部２４で発生させたＳ個のパーティクルのそれぞれについて、ｋステップ目における状態ベクトルＺ_ｋ ^ｉ（ｉ＝１〜Ｓ）と、入力値取得部１２から取得した入力値ｕ_ｋに所定の分散α^２を持たせたシステムノイズη_ｋと、ノイズ決定部２８から取得したパラメータノイズζ_ｋを、第１関数記憶部１６に記憶されている第１関数（式３参照）に入力して、ｋ＋１ステップ目の状態ベクトルＺ(〜)_ｋ＋１ ^ｉを算出する（なお、実際の表記ではチルダ「〜」は文字の上に付される）。状態ベクトル算出部２０で算出された状態ベクトルＺ(〜)_ｋ＋１ ^ｉは、推定値算出部２２に出力される。この処理はＣＰＵで実施される。なお、本実施例では、第１関数に入力して算出された状態ベクトルにはチルダ「〜」を付すことで、パーティクルフィルタ実行部２４で推定された状態ベクトルと区別する。 For each of S particles generated by a particle filter execution unit 24 to be described later, the state vector calculation unit 20 determines the state vector Z _k ⁱ (i = 1 to S) at the _k- ^th step and the input value acquisition unit 12. acquired the system noise eta _k which gave a predetermined dispersion alpha ² to the input values u _k and the parameter noise zeta _k obtained from the noise determining unit 28, the first function stored in the first function storage section 16 ( Then, the state vector Z (˜) _{k + 1} ⁱ at the ( _{k + 1} ) th step is calculated (refer to the tilde “˜” in the actual notation). The state vector Z (˜) _{k + 1} ⁱ calculated by the state vector calculation unit 20 is output to the estimated value calculation unit 22. This process is performed by the CPU. In the present embodiment, the state vector calculated by inputting to the first function is distinguished from the state vector estimated by the particle filter execution unit 24 by adding a tilde “˜”.

推定値算出部２２は、各パーティクルについて、状態ベクトル算出部２０から取得したｋ＋１ステップ目の状態ベクトルＺ(〜)_ｋ＋１ ^ｉを、第２関数記憶部１８に記憶されている第２関数（式４参照）に入力して、ｋ＋１ステップ目のシステムの出力の推定値Ｙ(〜)_ｋ＋１ ^ｉを算出する。このとき、観測ノイズσ_ｋはゼロとして推定値Ｙ(〜)_ｋ＋１ ^ｉが算出される。推定値算出部２２で算出された推定値Ｙ(〜)_ｋ＋１ ^ｉは、パーティクルフィルタ実行部２４に出力される。この処理はＣＰＵで実施される。なお、本実施例では、第２関数に入力して算出された推定値にはチルダ「〜」を付す。 For each particle, the estimated value calculation unit 22 uses the second function (formula 4) stored in the second function storage unit 18 for the k + 1 step state vector Z (˜) _{k + 1} ⁱ acquired from the state vector calculation unit 20. And the estimated value Y (˜) _{k + 1} ⁱ of the system output of the ( _{k + 1} ) ^{th step} is calculated. At this time, the estimated value Y (˜) _{k + 1} ⁱ is calculated with the observation noise σ _k being zero. The estimated value Y (˜) _{k + 1} ⁱ calculated by the estimated value calculation unit 22 is output to the particle filter execution unit 24. This process is performed by the CPU. In the present embodiment, a tilde “˜” is added to the estimated value calculated by inputting to the second function.

パーティクルフィルタ実行部２４は、最初のステップ（即ち、ｋ＝０）において、Ｓ個のパーティクルを発生させる。パーティクルは所定の範囲で一様に分布している。ここで、「所定の範囲」とは、同定すべきパラメータＰの真値を含む範囲である。各パーティクルには、所定の状態ベクトルＺ_０ ^ｉが割り当てられる。以下では、最初のステップにおける状態ベクトルＺ_０ ^ｉを、初期状態ベクトルとも称する。
また、パーティクルフィルタ実行部２４は、最初のステップからｎステップ目毎に、パーティクルを更新する。具体的には、パーティクルフィルタ実行部２４は、以下の手順でパーティクルを更新する。（１）まず、パーティクルの更新がｊ回目であるｋ＝ｎ・ｊステップ目（ｊ＝１〜ｍ、ｍ：所定期間Ｔ内でパーティクルが更新される回数、ｎ・ｍ＝Ｔ／Ｔ_ｓ）において、観測値取得部１４から、ｊ−１回目のパーティクルの更新後に取得されたｎ個の観測値Ｙ^ｒ（即ち、Ｙ_{ｎ（ｊ−１）＋１} ^ｒからＹ_{ｎ（ｊ−１）＋ｎ} ^ｒまでのｎ個の観測値Ｙ^ｒ）を取得する。また、推定値算出部２２から、各パーティクルについて、ｊ−１回目のパーティクルの更新後に取得されたｎ個の推定値Ｙ(〜)^ｉ（即ち、Ｙ(〜)_{ｎ（ｊ−１）＋１} ^ｉからＹ(〜)_{ｎ（ｊ−１）＋ｎ} ^ｉまでのｎ個の推定値Ｙ(〜)^ｉ）を取得する。（２）次に、これらｎ個の観測値Ｙ^ｒ及び推定値Ｙ(〜)^ｉを次の式５で表される重み計算式に入力して、各パーティクルの重みｗ^ｉを算出する。
なお、式５のＷは次の式６で定義される。
（３）続いて、パーティクル数が、重みの高い領域で多く、重みの低い領域で少なくなるようにパーティクルを更新して、ｋ＝ｎ・ｊステップ目の状態ベクトルＺ_ｎ・ｊ ^ｉを推定して、ｋ＝ｎ・ｊステップ目のシステムパラメータＰ_ｎ・ｊを同定する。なお、この状態ベクトルＺ_ｎ・ｊ ^ｉは、ｎ個の観測値Ｙ^ｒ及び推定値Ｙ(〜)^ｉに基づいて推定されたものであり、第１関数から算出されたｋ＝ｎ・ｊステップ目の状態ベクトルＺ(〜)_ｎ・ｊ ^ｉとは異なる。
パーティクルフィルタ実行部２４で推定された状態ベクトルＺ_ｎ・ｊ ^ｉは、状態ベクトル算出部に出力される。これらの処理は、ＣＰＵで実施される。 The particle filter execution unit 24 generates S particles in the first step (that is, k = 0). The particles are uniformly distributed within a predetermined range. Here, the “predetermined range” is a range including the true value of the parameter P to be identified. A predetermined state vector Z ₀ ⁱ is assigned to each particle. Hereinafter, the state vector Z ₀ ⁱ in the first step is also referred to as an initial state vector.
Further, the particle filter execution unit 24 updates the particles every nth step from the first step. Specifically, the particle filter execution unit 24 updates particles according to the following procedure. (1) First, k = n · j-th step in which particles are updated (j = 1 to m, m: number of times particles are updated within a predetermined period T, n · m = T / T _s ) N observation values Y ^r (that is, Y _{n (j−1) +1} ^r to Y _{n (j−1) + n} ^r ₎ acquired from the observation value acquisition unit 14 after the j−1th particle update. N observation values Y ^r ) are obtained. Further, n estimated values Y (˜) ⁱ (that is, Y (˜) _{n (j−1) +1} ⁱ acquired after the j−1th update of the particles from the estimated value calculation unit 22 for each particle. acquires _{Y (~) n (j-} 1) + n i to the n-number of estimated values Y (~) ⁱ⁾ from. (2) Next, the n observed values Y ^r and the estimated values Y (˜) ⁱ are input to the weight calculation formula expressed by the following formula 5 to calculate the weight w ^{i of} each particle.
Note that W in Expression 5 is defined by Expression 6 below.
(3) Subsequently, the particles are updated so that the number of particles is large in the high weight region and small in the low weight region, and the state vector Z _{n · j} ⁱ at the k = n · j step is estimated. Thus, the system parameter P _{n · j} at the k = n · j step is identified. The state vector Z _{n · j} ⁱ is estimated based on the n observed values Y ^r and the estimated value Y (˜) ⁱ , and k = n · j steps calculated from the first function. It is different from the eye state vector Z (˜) _{n · j} ⁱ .
The state vector Z _{n · j} ⁱ estimated by the particle filter execution unit 24 is output to the state vector calculation unit. These processes are performed by the CPU.

分散計算式記憶部２６は、ＲＯＭに設けられており、パラメータノイズζ_ｋの分散β^２を記述する計算式を記憶している。この計算式は、入力値ｕ又は観測値Ｙ^ｒの関数であり、パラメータ毎に予め設定されている（後述）。なお、この計算式は、入力値ｕ又は観測値Ｙ^ｒの関数に限られず、入力値ｕ又は観測値Ｙ^ｒから導出される値（例えばＹ^ｒ）の関数であってもよい。 The variance calculation formula storage unit 26 is provided in the ROM, and stores a calculation formula describing the variance β ² of the parameter noise ζ _k . This calculation formula is a function of the input value u or the observed value ^Yr , and is preset for each parameter (described later). Incidentally, this formula is not limited to the function of the input values u or observed value Y ^r, may be a function of the value (e.g., Y ^r) derived from the input value u or observed value Y ^r.

ノイズ決定部２８は、ｋ＝ｎ・ｊステップ目（但し、ｊ≠ｍ）において、入力値取得部１２又は観測値取得部１４から、ｊ回目のパーティクル更新後の最初のステップからｊ＋１回目のパーティクル更新時のステップまでのｎ個の入力値ｕ又はｎ個の観測値Ｙ^ｒを取得する（即ち、ｕ_{ｎ・ｊ＋１}からｕ_{ｎ・ｊ＋ｎ}までのｎ個の入力値ｕ又はＹ_{ｎ・ｊ＋１} ^ｒからＹ_{ｎ・ｊ＋ｎ} ^ｒまでのｎ個の観測値Ｙ^ｒを取得する）。そして、分散式記憶部から取得した分散式にｎ個の入力値ｕ又はｎ個の観測値Ｙ^ｒから選択された１個の入力値ｕ又は観測値Ｙ^ｒを入力して、分散β^２を算出し、ｋ＝ｎ・ｊステップ目のパラメータノイズζ_ｎ・ｊを決定する。一方、ｋ＝ｎ・ｊ以外のステップでは、上記処理は実行されず、パラメータノイズζは一律にゼロと設定される。このようにして決定されたパラメータノイズζ_ｋは、状態ベクトル算出部２０に出力される。この処理はＣＰＵで実施される。
上述したことから明らかなように、パーティクルの更新が行われないステップ（ｋ＝ｎ・ｊ以外のステップ）では、ζ_ｋ＝０である。即ち、パーティクルの更新が行われないステップでは、Ｐ_ｋ＋１＝Ｐ_ｋが成立する。このため、ノイズ決定部２８は、パーティクルの更新を行うステップにおいてのみパラメータノイズを決定し、状態ベクトル算出部２０に出力するということもできる。
なお、上記の構成では、ノイズ決定部２８はパーティクルの更新を行うステップにおいてのみパラメータノイズを決定したが、この構成に限られない。ノイズ決定部２８は、最初のステップ（ｋ＝０）においても、パラメータノイズζ_０を決定してもよい。この場合、パラメータノイズζ_０の分散β^２は、ｋ＝１〜ｎステップ目までのｎ個の入力値ｕ又はｎ個の観測値Ｙ^ｒから選択された１個の入力値ｕ又は観測値Ｙ^ｒを分散式に入力することにより算出される。なお、ノイズ決定部２８が「ノイズ付与部」の一例に相当する。 In the k = n · j-th step (where j ≠ m), the noise determination unit 28 receives the j + 1-th particle from the first step after the j-th particle update from the input value acquisition unit 12 or the observation value acquisition unit 14. Obtain n input values u or n observed values Y ^r up to the step at the time of update (ie, from _n input values u or Y _{n · j + 1} ^r from u _{n · j + 1} to u _{n · j + n} _N observation values ^Yr up to ^Yn _{· j +} ^nr are acquired). Then, enter the one input value u or observed value Y ^r selected from n input values u or n observations Y ^r in the obtained dispersion equation from the dispersion equation storing section, a dispersion beta ² The parameter noise ζ _{n · j} at the k = n · j step is determined. On the other hand, in steps other than k = n · j, the above process is not executed, and the parameter noise ζ is uniformly set to zero. The parameter noise ζ _k determined in this way is output to the state vector calculation unit 20. This process is performed by the CPU.
As is apparent from the above description, ζ _k = 0 in the steps where the particles are not updated (steps other than k = n · j). That is, P _{k + 1} = P _k is established in the step where the particle is not updated. For this reason, the noise determination unit 28 can also determine the parameter noise only in the step of updating the particle and output it to the state vector calculation unit 20.
In the above configuration, the noise determination unit 28 determines the parameter noise only in the step of updating the particles. However, the configuration is not limited to this configuration. The noise determination unit 28 may determine the parameter noise ζ ₀ also in the first step (k = 0). In this case, the variance β ² of the parameter noise ζ ₀ is one input value u or observation value Y selected from n input values u or n observation values Y ^r from k = 1 to n steps. It is calculated by inputting ^r into the dispersion formula. The noise determining unit 28 corresponds to an example of “noise applying unit”.

次に、図２を参照して同定装置１０のコンピュータがシステムのパラメータＰを同定する処理について説明する。まず、ステップＳ２の入力値取得工程では、入力値取得部１２が、外部記憶媒体３０に記憶されている入力値データ３２から、ステップ毎に全期間の入力値ｕｋを取得する。次に、ステップＳ４の観測値取得工程では、観測値取得部１４が、外部記憶媒体３０に記憶されている観測値データ３４から、ステップ毎に全期間の観測値Ｙ_ｋ ^ｒを取得する。続いて、ステップＳ６の初期状態ベクトル決定工程では、パーティクルフィルタ実行部２４が、最初のステップ（ｋ＝０）においてＳ個のパーティクルを発生させ、各パーティクルに初期状態ベクトルＺ_０ ^ｉ（ｉ＝１〜Ｓ）を割り当てる。なお、ステップＳ２〜Ｓ６の工程は、順不同である。 Next, a process in which the computer of the identification apparatus 10 identifies the system parameter P will be described with reference to FIG. First, in the input value acquisition process of step S <b> 2, the input value acquisition unit 12 acquires the input value uk for the entire period for each step from the input value data 32 stored in the external storage medium 30. Next, in the observation value acquisition step of step S4, the observation value acquisition unit 14 acquires the observation values Y _k ^r for the entire period from the observation value data 34 stored in the external storage medium 30 for each step. Subsequently, in the initial state vector determination step in step S6, the particle filter execution unit 24 generates S particles in the first step (k = 0), and the initial state vector Z ₀ ⁱ (i = 1) is generated for each particle. ~ S). Note that the steps S2 to S6 are in no particular order.

続いて、ステップＳ８の状態ベクトル算出工程では、状態ベクトル算出部２０が、ステップＳ６で割り当てられたパーティクル毎の初期状態ベクトルＺ_０ ^ｉと、システムノイズη_０（即ち、平均が入力値ｕ_０、分散がα^２となるノイズ）と、ζ_０＝０を第１関数に入力して、状態ベクトルＺ(〜)_１ ^ｉを算出する。次いで、状態ベクトル算出部２０は、Ｚ(〜)_１ ^ｉと、η_１と、ζ_１＝０を第１関数に入力して、Ｚ(〜)_２ ^ｉを算出する。状態ベクトル算出部２０は、この処理をｋ＝ｎステップ目までｎ回繰り返して、パーティクル毎にｎ個の状態ベクトルＺ(〜)_１ ^ｉ〜Ｚ(〜)_ｎ ^ｉを算出する。なお、ｋ＝０〜ｎ−１ステップ目までのζ_ｋは０である。続いて、ステップＳ１０の推定値算出工程では、推定値算出部２２が、ステップＳ８で算出されたｎ個の状態ベクトルＺ(〜)_１ ^ｉ〜Ｚ(〜)_ｎ ^ｉを第２関数に入力して、パーティクル毎にｎ個の推定値Ｙ(〜)_１ ^ｉ〜Ｙ(〜)_ｎ ^ｉを算出する。 Subsequently, in the state vector calculation step of step S8, the state vector calculation unit 20 performs the initial state vector Z ₀ ^{i for} each particle assigned in step S6 and the system noise η ₀ (that is, the average is the input value u ₀ , The noise having variance α ² ) and ζ ₀ = 0 are input to the first function, and the state vector Z (˜) ₁ ⁱ is calculated. Next, the state vector calculation unit 20 inputs Z (˜) ₁ ⁱ , η ₁ and ζ ₁ = 0 to the first function, and calculates Z (˜) ₂ ⁱ . State vector calculation unit 20, the process is repeated n times until k = n-th step, each particle n states vector Z (~) to calculate a _{^{_{^{1 i ~Z (~) n i}}}} . Note that ζ _k is 0 from k = 0 to the (n−1) th step. Subsequently, the estimated value calculation step of step S10, the estimated value calculating section 22 inputs the n state vector Z calculated in step S8 (~) _{1 i} ^~Z the (~) _n ⁱ to the second function Te, n pieces of estimated values Y (~) for each particle to calculate the _{^{_{^{1 i ~Y (~) n i}}}} .

続いて、ステップＳ１２のパーティクルフィルタ実行工程では、１回目のパーティクルの更新を行う。具体的には、パーティクルフィルタ実行部２４が、ステップＳ４で取得した全期間の観測値Ｙ_ｋ ^ｒのうちの、ｋ＝１〜ｎステップ目までのｎ個の観測値Ｙ_１ ^ｒ〜Ｙ_ｎ ^ｒと、ステップＳ１０で取得したｎ個の推定値Ｙ(〜)_１ ^ｉ〜Ｙ(〜)_ｎ ^ｉを重み計算式（式５参照）に入力して、各パーティクルの重みｗ^ｉを算出する。そして、パーティクルフィルタ実行部２４は、重みｗ^ｉに基づいてパーティクルを更新して、パーティクル毎にｋ＝ｎステップ目の状態ベクトルＺ_ｎ ^ｉを推定する。このときのパーティクルの分布により、ｋ＝ｎステップ目のパラメータＰが同定される。 Subsequently, in the particle filter execution step in step S12, the first particle update is performed. Specifically, the particle filter execution unit 24, of the observed value _Y ^{k r} of the total period obtained in step S4, k = n number of observations _Y ¹ r of 1~n until th step _{to Y} ^{n r} If, type acquired n pieces of estimated value Y in step S10 (~) ₁ ⁱ to Y a (~) _n ⁱ weight calculation formula (see equation 5), calculates the weight ^{w i} of each particle. Then, the particle filter execution unit 24 updates the particles based on the weight w ^i, estimates the k = n th step of the state vector Z _n ⁱ for each particle. The parameter P at the k = n step is identified by the particle distribution at this time.

続いて、ステップＳ１４では、パーティクルの更新が最後であるか（即ち、ｊ＝ｍであるか）否かを判定する。パーティクルの更新が最後ではない場合（ステップＳ１４でＮＯ）は、ステップＳ１６に進む。ステップＳ１６の分散決定工程では、ノイズ決定部２８が、ｋ＝ｎ＋１〜２ｎステップ目までのｎ個の入力値ｕ_ｎ＋１〜ｕ_２ｎ、又は、ｎ個の観測値Ｙ_ｎ＋１ ^ｒ〜Ｙ_２ｎ ^ｒ（或いは、入力値ｕ_ｋ又は観測値Ｙ_ｋ ^ｒから導出される値）から選択された１個の入力値ｕ_ｋ又は観測値Ｙ_ｋ ^ｒを、分散計算式記憶部２６に記憶されている計算式に入力して分散β^２を算出し、ｋ＝ｎステップ目のパラメータノイズζ_ｎを決定する。 Subsequently, in step S14, it is determined whether or not the particle update is the last (that is, j = m). If the particle update is not the last time (NO in step S14), the process proceeds to step S16. In the variance determining step of step S16, the noise determining unit 28 performs n input values u _{n + 1 to} u _2n up to k = _{n + 1 to} _2n steps or n observed values Y _{n + 1} ^{r to} Y _2n ^r (or the input values u _k or observed values Y _k 1 input values selected from ^r value derived from) u _k or observed values Y _k ^r, the calculation expression stored in the distributed computing equation storing section 26 The variance β ² is input to calculate the parameter noise ζ _n at the k = nth step.

続いて、ステップＳ１８の状態ベクトル算出工程では、状態ベクトル算出部２０が、ステップＳ１２で推定されたｋ＝ｎステップ目の状態ベクトルＺ_ｎ ^ｉと、η_ｎと、ステップＳ１６で決定されたパラメータノイズζ_ｎを、第１関数に入力してｋ＝ｎ＋１ステップ目の状態ベクトルＺ(〜)_ｎ＋１ ^ｉを算出する。ステップＳ１８の処理が終了すると、同定装置１０のコンピュータは、ステップＳ８の処理に戻る。ステップＳ８では、状態ベクトル算出部２０が、ｋ＝ｎ＋２〜２ｎステップ目までのｎ−１個の状態ベクトルＺ(〜)_ｎ＋２ ^ｉ〜Ｚ(〜)_２ｎ ^ｉを算出する。続いて、ステップＳ１０では、ステップＳ１８及びステップＳ８で算出されたｎ個の状態ベクトルＺ(〜)_ｎ＋１ ^ｉ〜Ｚ(〜)_２ｎ ^ｉから、ｎ個の推定値Ｙ(〜)_ｎ＋１ ^ｉ〜Ｙ(〜)_２ｎ ^ｉが算出され、ステップＳ１２では、２回目のパーティクルの更新が行われる。以下、上述した処理を繰り返し、ステップＳ１２でｍ回目（即ち、最後）のパーティクルの更新が行われてｋ＝ｎ・ｍステップ目の状態ベクトルＺ_ｎ・ｍ ^ｉが推定されると、同定装置１０のコンピュータは、ステップＳ１４でＹＥＳと判定し、パラメータＰの同定処理を終了する。このときのパーティクルの分布により、ｋ＝ｎ・ｍステップ目のパラメータＰが同定される。 Subsequently, in the state vector calculation step in step S18, the state vector calculation unit 20 performs the k = n-th state vector Z _n ⁱ estimated in step S12, η _n, and the parameter noise determined in step S16. ζ _n is input to the first function to calculate the state vector Z (˜) _{n + 1} ⁱ of the k = n + 1 step. When the process of step S18 ends, the computer of the identification apparatus 10 returns to the process of step S8. In step S <b> 8, the state vector calculation unit 20 calculates n−1 state vectors Z (˜) _{n + 2} ^{i to} Z (˜) _2n ^{i up} to k = _{n + 2} to _2n steps. Subsequently, in step S10, n estimated values Y (to) _{n + 1} ⁱ to Y ((n) are calculated from the n state vectors Z (to) _{n + 1} ^{i to} Z (to) _2n ⁱ calculated in steps S18 and S8. ~) _2n ⁱ is calculated, in step S12, the second particle update occurs. Thereafter, the above-described processing is repeated, and when the m-th (that is, last) particle update is performed in step S12 and the state vector Z _{n · m} ⁱ at the k = n · m step is estimated, the identification apparatus 10 In step S14, the parameter P identification processing ends. The parameter P at the k = n · m step is identified by the particle distribution at this time.

上記の説明から明らかなように、パーティクルが更新されるステップ（ｋ＝ｎ・ｊ、ｊ＝１〜ｍ）では、状態ベクトルは、状態ベクトル算出部２０によって算出される状態ベクトルＺ(〜)_ｎ ^ｉと、パーティクルフィルタ実行部２４によって推定される状態ベクトルＺ_ｎ ^ｉの２種類が存在する。そして、ステップＳ１８の工程では、第１関数に入力される状態ベクトルとして、後者の状態ベクトルＺ_ｎ ^ｉが用いられる。なお、上記のようにパラメータの同定がオフラインで行われる場合は、ステップＳ１６の工程のように都度パラメータノイズζ_ｎ・ｊを決定する代わりに、予めｍ−１個の分散β^２を算出してパラメータノイズζ_ｎ・ｊを決定しておいてもよい。 As is clear from the above description, in the step of updating the particles (k = n · j, j = 1 to m), the state vector is the state vector Z (˜) _n calculated by the state vector calculation unit 20. and ^i, 2 kinds of state vector Z _n ⁱ estimated by a particle filter execution unit 24 is present. In the process of step S18, the latter state vector Z _n ⁱ is used as the state vector input to the first function. When parameter identification is performed off-line as described above, instead of determining the parameter noise ζ _{n · j} each time as in step S16, m−1 variances β ² are calculated in advance. The parameter noise ζ _{n · j} may be determined.

上記の同定装置１０では、パーティクルの更新周期が、ステップ間隔Ｔ_ｓのｎ倍である。そして、各パーティクルの重みｗ^ｉは、パーティクルが更新される１個の更新周期内に含まれるｎ個のステップにおいて取得された観測値Ｙ^ｒと推定値Ｙ(〜)^ｉとに基づいて算出される。このため、各パーティクルの重みを１個のステップにおける観測値Ｙ^ｒと推定値Ｙ(〜)^ｉとに基づいて算出する場合と比較して、パーティクルの重みｗ^ｉは、より真値に近いパラメータに対して大きくなり易くなる。この結果、制御対象となるシステムのパラメータの同定精度を向上できる。 In the identification device 10 described above, the particle update cycle is n times the step interval T _s . Then, the weight w ^{i of} each particle is calculated based on the observed value Y ^r and the estimated value Y (˜) ⁱ acquired in n steps included in one update cycle in which the particle is updated. The For this reason, compared with the case where the weight of each particle is calculated based on the observed value Y ^r and the estimated value Y (˜) ⁱ in one step, the particle weight w ⁱ is a parameter closer to the true value. It becomes easy to become large. As a result, the identification accuracy of the parameters of the system to be controlled can be improved.

また、上記の同定装置１０は、ノイズ決定部２８を備える。ノイズ決定部２８は、パーティクルが更新されるステップ（ｋ＝ｎ・ｊ、但し、ｊ≠ｍ）において、パラメータノイズζを決定して状態ベクトル算出部２０に出力する。この構成によると、パーティクルが局所解（ローカルミニマム）に収束することを抑制でき、同定精度をさらに向上できる。 The identification device 10 includes a noise determination unit 28. The noise determination unit 28 determines the parameter noise ζ and outputs it to the state vector calculation unit 20 in the step (k = n · j, where j ≠ m) when the particle is updated. According to this configuration, the particles can be prevented from converging on a local solution (local minimum), and the identification accuracy can be further improved.

また、上記の同定装置１０では、ノイズ決定部２８はパラメータノイズζの分散β^２を、分散計算式記憶部２６に記憶されている分散計算式を用いて決定するため、分散β^２は可変となる。この構成によると、分散β^２に応じてパーティクルの拡散の度合いを変更できる。 Further, in the identification device 10 described above, the noise determination unit 28 determines the variance β ² of the parameter noise ζ using the variance calculation formula stored in the variance calculation formula storage unit 26, so that the variance β ² is variable. Become. According to this configuration, to change the degree of diffusion of the particles according to the dispersion beta ^2.

特に、上記の同定装置１０では、パーティクルのｊ回目の更新時のステップで第１関数に入力されるパラメータノイズζ_ｎ・ｊの分散β^２が、ｋ＝ｎ・ｊ＋１ステップ目からｎ・ｊ＋ｎステップ目までのｎ個のステップのうちの１個のステップにおいて取得された入力値ｕ又は観測値Ｙ^ｒに基づいて決定される。この構成によると、パラメータを同定するための情報の多寡に応じて分散β^２を決定できる。このため、情報が少ないときに大きな分散β^２を設定することに起因してパーティクルが不要に拡散してしまうという事態の発生を抑制でき、効率的にパラメータを同定できる。 In particular, in the identification device 10 described above, the variance β ^{2 of the} parameter noise ζ _{n · j} input to the first function in the step at the j-th update of the particle is the n · j + n step from the k = n · j + 1 step. It is determined based on the input value u or the observed value Y ^r acquired in one of the n steps up to the eye. According to this configuration, it can be determined a dispersion beta ² according to amount of information for identifying the parameters. Therefore, information due to that set a large dispersion beta ² when less can suppress the occurrence of a situation that the particles will be unnecessarily spread, can be efficiently identified parameters.

また、上記の同定装置１０では、各パーティクルの重みｗ^ｉが、ｎ個のステップにおける同時確率から算出される。このため、パーティクルの重みｗ^ｉは、より真値に近いパラメータに対して大きくなり、同定精度をさらに向上できる。 In the identification device 10 described above, the weight w ^{i of} each particle is calculated from the simultaneous probabilities in n steps. For this reason, the weight w ⁱ of the particle becomes larger with respect to the parameter closer to the true value, and the identification accuracy can be further improved.

次に、ロボットアームのシミュレーションモデルを用いて同定装置のパラメータ同定精度を検証した具体例について説明する。この具体例では、同定装置は、実験１と、実験１の比較例としての実験２を実施する。実験１では、上述したノイズ決定処理が行われる（即ち、パラメータノイズζの分散β^２が可変である）。実験２では、ノイズ決定処理が行われない（即ち、パラメータノイズζの分散β^２が定数である）。このロボットアームは、単軸運動モデルであり、ロボットアームの軸を回転軸として回転運動する。同定装置１０が同定するロボットアームのパラメータＰは、ｍ、φ、γ、ｄ、Ｊの５個である。ｍは転がり摩擦のヒステリシスの膨らみを表す係数、φは転がり出し変位領域幅、γはクーロン摩擦力、ｄは粘性摩擦係数、Ｊは慣性モーメントを表す。 Next, a specific example in which the parameter identification accuracy of the identification device is verified using a simulation model of the robot arm will be described. In this specific example, the identification apparatus performs Experiment 1 and Experiment 2 as a comparative example of Experiment 1. In Experiment 1, above the noise determination process is performed (i.e., variance beta ² parameters noise ζ is variable). In Experiment 2, noise determination processing is not performed (that is, the variance β ² of the parameter noise ζ is a constant). This robot arm is a single-axis motion model, and rotates about the axis of the robot arm as a rotation axis. The robot arm parameters P identified by the identification device 10 are m, φ, γ, d, and J. m is a coefficient representing the swelling of the hysteresis of rolling friction, φ is the rolling displacement area width, γ is the Coulomb friction force, d is the viscous friction coefficient, and J is the moment of inertia.

（実験１）
まず、ロボットアームの状態方程式と、隣接ステップ間のパラメータＰの関係式を統合して、ロボットアームの第１関数を求める。ロボットアームの状態方程式は、次の式７で示すロボットアームの運動方程式を以下の手順で離散化することにより求められる。
θはロボットアームの回転角度、τはロボットアームに入力されるトルク、ｖはクーロン摩擦を包含する転がり摩擦力である。ここで、ｖ（ｔ）は次の式８によって定義される。
ωはロボットアームの角速度（＝θ）、ν（ニュー）は速度反転後の転がり摩擦力、δは速度反転後の移動距離、ξ＝δ／φである。ここで、ｇ（ξ）は次の式９によって定義される。
(Experiment 1)
First, the state equation of the robot arm and the relational expression of the parameter P between adjacent steps are integrated to obtain the first function of the robot arm. The equation of state of the robot arm is obtained by discretizing the equation of motion of the robot arm shown by the following equation 7 in the following procedure.
θ is a rotation angle of the robot arm, τ is a torque input to the robot arm, and v is a rolling friction force including Coulomb friction. Here, v (t) is defined by the following Expression 8.
ω is the angular velocity (= θ) of the robot arm, ν (new) is the rolling friction force after the speed reversal, δ is the moving distance after the speed reversal, and ξ = δ / φ. Here, g (ξ) is defined by the following Equation 9.

ここで、上述したように、式７〜９をパーティクルフィルタに実装するためには、これらを離散系に変換する必要がある。離散系に変換するためには、まず、速度反転を判定して、次の式１０、１１に従って速度反転後のころがり摩擦力ν_ｋと速度反転後の移動距離δ_ｋを定める。
式１０、１１を式８に代入すると、離散系での転がり摩擦力ｖ_ｋが次の式１２の通りに導出できる。但し、ｇ（ξ）には式９、ξにはδ_ｋ／φ_ｋが用いられる。
Here, as described above, in order to implement Expressions 7 to 9 in the particle filter, it is necessary to convert them into a discrete system. In order to convert to a discrete system, first, the speed reversal is determined, and the rolling friction force ν _k after the speed reversal and the moving distance δ _k after the speed reversal are determined according to the following equations 10 and 11.
When Expressions 10 and 11 are substituted into Expression 8, the rolling friction force v _k in a discrete system can be derived as the following Expression 12. However, Equation 9 is used for g (ξ), and δ _k / φ _k is used for ξ.

一方、式７を状態方程式に書き換えると、次の式１３となる。
式１３を簡略化すると、次の式１４となる。
式１４を、入力τ（ｔ）にゼロ次ホールドを仮定して離散化すると、次の式１５が得られる。但し、ｖ（ｔ）の離散化は、式１２に従っている。
式１５を一般式に置き換えると次の式１６で示すロボットアームの状態方程式が求められる。
ここで、ｆは非線形な転がり摩擦を包含した式１５を表す既知の非線形関数、Ｘ_ｋ’はＸ_ｋと転がり摩擦の状態量ω_ｋ−１、θ_ｋ−１、δ_ｋ−１、ν_ｋ−１、ｖ_ｋ−１を含有した状態行列、η_ｋは平均τ_ｋ、分散α^２（＝２．５×１０^−３）の入力信号であり、いわゆるシステムノイズに相当する。 On the other hand, when Equation 7 is rewritten into the state equation, the following Equation 13 is obtained.
When Expression 13 is simplified, the following Expression 14 is obtained.
When Expression 14 is discretized assuming zero-order hold at the input τ (t), the following Expression 15 is obtained. However, the discretization of v (t) follows Formula 12.
When equation 15 is replaced with a general equation, the state equation of the robot arm expressed by the following equation 16 is obtained.
Here, f is a known nonlinear function representing Equation 15 including nonlinear rolling friction, X _k ′ is X _k and state quantities ω _k−1 , θ _k−1 , δ _k−1 , ν _{k of} rolling friction. ₋₁ , v _k−1 containing state matrix, η _k is an input signal of mean τ _k and variance α ² (= 2.5 × 10 ⁻³ ), and corresponds to so-called system noise.

一方、隣接ステップ間のパラメータＰの関係式は、式２で示した通りである。但し、Ｐ_ｋは、このロボットアームのシミュレーションモデルで同定すべき５個のパラメータからなる行列であり、次の式１７で表される。
On the other hand, the relational expression of the parameter P between adjacent steps is as shown in Expression 2. Here, P _k is a matrix composed of five parameters to be identified in the simulation model of the robot arm, and is expressed by the following Expression 17.

式１６と式２を統合することにより、式３で表される新たな状態方程式（第１関数）が定義される。但し、Ｚ_ｋ＝（Ｘ_ｋ’^Ｔ，Ｐ_ｋ ^Ｔ）^Ｔとして定義される。ｆ_ａは式１６、式２から導出される既知の非線形関数である。 By integrating Expression 16 and Expression 2, a new state equation (first function) represented by Expression 3 is defined. However, it is defined as Z _k = (X _k ′ ^T , P _k ^T ) ^T. f _a is a known nonlinear function derived from Equations 16 and 2.

一方、観測量は角度θ_ｋであるため、出力方程式（第２関数）は線形となり、次の式１８で表される。
本モデルでは、観測ノイズσ_ｋは、平均ゼロ、分散ρ^２＝４．０×１０^−６に設定されている。 On the other hand, since the observation amount is the angle θ _k , the output equation (second function) is linear and is expressed by the following Expression 18.
In this model, the observation noise σ _k is set to mean zero and variance ρ ² = 4.0 × 10 ⁻⁶ .

図３は、ロボットアームに入力するトルクτ（ｔ）の時系列データを示し、図４は、ロボットアームから出力される角度θ（ｔ）の時系列データを示す。前者が入力値データ３２に相当し、後者が観測値データ３４に相当する。時系列データの期間ＴはＴ＝４５秒であり、ステップ間隔Ｔ_ｓはＴ_ｓ＝０．０１秒である。同定装置１０の入力値取得部１２及び観測値取得部１４は、時系列データの全期間の入力値及び観測値を取得する。パーティクルは、ｎ＝２５０ステップ目に更新される。即ち、パーティクルの更新周期は２．５秒であり、パーティクルは計１８回更新される（ｊ＝１〜１８）。また、パーティクルの総数ＳはＳ＝５００００とする。また、パラメータの真値は、ｍ＝０．８、φ＝１．５、γ＝５．０、ｄ＝２．０、Ｊ＝４．０に設定されている。 FIG. 3 shows time-series data of torque τ (t) input to the robot arm, and FIG. 4 shows time-series data of angle θ (t) output from the robot arm. The former corresponds to the input value data 32 and the latter corresponds to the observation value data 34. The period T of the time series data is T = 45 seconds, and the step interval T _s is T _s = 0.01 seconds. The input value acquisition unit 12 and the observation value acquisition unit 14 of the identification device 10 acquire input values and observation values for all periods of time-series data. The particles are updated at the n = 250th step. That is, the particle update cycle is 2.5 seconds, and the particles are updated 18 times in total (j = 1 to 18). The total number S of particles is S = 50000. The true values of the parameters are set to m = 0.8, φ = 1.5, γ = 5.0, d = 2.0, and J = 4.0.

次に、分散計算式記憶部２６が記憶している分散計算式について説明する。本モデルでは、同定すべきパラメータが５個あるため、パラメータノイズζ_ｋは、各パラメータに対応する５個のパラメータノイズ及び分散を有する。以下では、パラメータｍ_ｋ、φ_ｋ、γ_ｋ、ｄ_ｋ、Ｊ_ｋに対応するパラメータノイズ及び分散を、それぞれζ_１ｋ及びβ_１ ^２、ζ_２ｋ及びβ_２ ^２、ζ_３ｋ及びβ_３ ^２、ζ_４ｋ及びβ_４ ^２、ζ_５ｋ及びβ_５ ^２とする。 Next, the variance calculation formula stored in the variance calculation formula storage unit 26 will be described. In this model, since there are five parameters to be identified, the parameter noise ζ _k has five parameter noises and variances corresponding to each parameter. In the following, parameter noise and variance corresponding to the parameters m _k , φ _k , γ _k , d _k , J _k are respectively expressed as ζ _1k and β ₁ ² , ζ _2k and β ₂ ² , ζ _3k and β ₃ ² , ζ. _{Let 4k} and β ₄ ² , ζ _5k and β ₅ ² .

（分散β_１ ^２、β_２ ^２の分散計算式）
転がり摩擦力ｖ_ｋは、速度反転後に転がり出し変位領域幅φ_ｋの距離を移動すると、クーロン摩擦力γと等価となる。このため、転がり摩擦のヒステリシスの膨らみを表す係数ｍ_ｋとφ_ｋは、速度反転後にのみ転がり摩擦力ｖ_ｋに影響する同定パラメータである。従って、パラメータｍ_ｋ、φ_ｋのパラメータノイズζ_１ｋ、ζ_２ｋの分散β_１ ^２、β_２ ^２の分散計算式は、それぞれ次の式１９、式２０で表される。
なお、速度反転の有無は、観測値データθ（ｔ）（図３参照）から判定される。 (Dispersion calculation formula of dispersion β ₁ ² and β ₂ ² )
Rolling friction force v _k, moving distance of the displacement region width phi _k out rolling after speed reversal, the Coulomb friction force γ equivalent. For this reason, the coefficients m _k and φ _k representing the bulge of rolling friction hysteresis are identification parameters that affect the rolling friction force v _k only after the speed reversal. Accordingly, the dispersion calculation formulas of the variances β ₁ ² and β ₂ ² of the parameter noises ζ _1k and ζ _2k of the parameters m _k and φ _k are expressed by the following formulas 19 and 20, respectively.
The presence or absence of speed reversal is determined from the observed value data θ (t) (see FIG. 3).

（分散β_３ ^２の分散計算式）
パラメータγ_ｋ（クーロン摩擦力）のパラメータノイズζ_３ｋの分散β_３ ^２の分散計算式は、式２１で表される（図５参照）。なお、β_３ ^２ _ｍａｘ＝２．５×１０^−３である。ｋ＝ｎ・ｊステップ目の分散β_３ ^２は、ｋ＝ｎ・ｊ＋１〜ｎ・ｊ＋ｎステップ目までの観測値データθ（ｔ）の最大角速度の絶対値ｍａｘ（｜ω｜）を式２１に入力することにより算出される。ロボットアームの高速回転時はクーロン摩擦力に対して粘性摩擦力が支配的になるため、クーロン摩擦力γ_ｋのパラメータノイズζ_３ｋの分散β_３ ^２が小さくなるように設定している。また、ロボットアームの極低速回転時は、速度反転直後である可能性が高いと判断し、この場合も分散β_３ ^２が小さくなるように設定している。
(Dispersion calculation formula of variance β ₃ ² )
The dispersion calculation formula of the dispersion β ₃ ² of the parameter noise ζ _3k of the parameter γ _k (Coulomb friction force) is expressed by Equation 21 (see FIG. 5). Note that β ₃ ² _max = 2.5 × 10 ⁻³ . The variance β ₃ ² of the k = n · j step is expressed by the equation 21 in which the absolute value max (| ω |) of the maximum angular velocity of the observation value data θ (t) from the k = n · j + 1 to n · j + n steps Calculated by inputting. Since the viscous friction force is dominant over the Coulomb friction force during high-speed rotation of the robot arm, the variance β ₃ ² of the parameter noise ζ _3k of the Coulomb friction force γ _k is set to be small. Further, when the robot arm rotates at a very low speed, it is determined that there is a high possibility that it is immediately after the speed reversal, and in this case, the variance β ₃ ² is set to be small.

（分散β_４ ^２の分散計算式）
粘性摩擦力は角速度に比例した摩擦力である。このため、角速度が小さいときは、ロボットアームの挙動に対する粘性摩擦力の影響が小さくなり、システム同定が困難となると考えられる。従って、パラメータｄ_ｋ（粘性摩擦係数）のパラメータノイズζ_４ｋの分散β_４ ^２が、最大角速度の絶対値ｍａｘ（｜ω｜）に比例するように、分散β_４ ^２の分散計算式を次の式２２のように定める（図６参照）。なお、β_４ ^２ _ｍａｘ＝２．５×１０^−３である。
(Dispersion calculation formula of variance β ₄ ² )
The viscous frictional force is a frictional force proportional to the angular velocity. For this reason, when the angular velocity is small, it is considered that the influence of the viscous friction force on the behavior of the robot arm becomes small and the system identification becomes difficult. Therefore, the parameter _{d k} parameter noise ζ dispersed beta ₄ ² of _4k of (viscous friction coefficient), the absolute value max of the maximum angular velocity (| omega |) in proportion to the variance calculation formula of the dispersing beta ₄ ² follows It is determined as shown in Equation 22 (see FIG. 6). Note that β ₄ ² _max = 2.5 × 10 ⁻³ .

（分散β_５ ^２の分散計算式）
パラメータＪ_ｋ（慣性モーメント）は、ロボットアームが十分な加減速動作をするときにシステム同定が可能である。別言すれば、ロボットアームの定速回転時や停止時には同定ができない。このため、慣性モーメントＪ_ｋのシステムノイズζ_５ｋの分散β_５ ^２がトルクτ（入力値）に比例するように、分散β_５ ^２の分散計算式を次の式２３のように定める（図７参照）。ｋ＝ｎ・ｊステップ目の分散β_５ ^２は、ｋ＝ｎ・ｊ＋１〜ｎ・ｊ＋ｎステップ目までの入力値データτ（ｔ）の最大トルクの絶対値ｍａｘ（｜τ｜）を式２３に入力することにより算出される。なお、β_５ ^２ _ｍａｘ＝１．０×１０^−２である。
(Dispersion equation of dispersing beta ₅ ²⁾
The parameter J _k (moment of inertia) can be identified by the system when the robot arm performs a sufficient acceleration / deceleration operation. In other words, it cannot be identified when the robot arm rotates at a constant speed or stops. Therefore, dispersion beta ₅ ² of system noise zeta _5k moment of inertia J _k is in proportion to the torque tau (input value) defines the dispersion equation of dispersing beta ₅ ² as the following equation 23 (FIG. 7 reference). The variance β ₅ ² at the k = n · j step is expressed in Equation 23 by the absolute value max (| τ |) of the maximum torque of the input value data τ (t) up to the k = n · j + 1 to n · j + n steps. Calculated by inputting. Note that β ₅ ² _max = 1.0 × 10 ⁻² .

図８（ａ）〜（ｅ）は、最初のステップ（ｋ＝０）と、パーティクル更新時のステップ（ｋ＝ｎ・ｊ（ｊ＝１〜１７））におけるパラメータノイズζの分散を表す。横軸Ｎは、最初のステップから数えたときの分散β_１ ^２〜β_５ ^２の個数を表し、縦軸β_１ ^２〜β_５ ^２は分散の値を表す。本実験では、最初のステップにおいてもパラメータノイズζの分散が算出されるため、計１８個の分散が算出される。図８（ａ）〜（ｅ）から明らかなように、分散β_１ ^２〜β_５ ^２は可変である。例えば、図８（ａ）において、Ｎ＝１の分散β_１ ^２は、ｋ＝０ステップ目のパラメータノイズζ_１・０の分散であり、Ｎ＝１８の分散β_１ ^２は、ｋ＝２５０・１７ステップ目のパラメータノイズζ_{１・２５０・１７}の分散である。図９（ａ）〜（ｅ）は、上述した条件で同定装置１０がロボットアームのパラメータ同定処理を実行したときの各パラメータのパーティクルの分布を示す。図１０において、濃色はパーティクルの数が少ない領域を示し、淡色はパーティクルの数が多い領域を示している。また、破線はパラメータの真値を示し、丸印は、パーティクル更新時のステップにおけるパーティクルの平均値を示す。 8A to 8E show the variance of the parameter noise ζ in the first step (k = 0) and the step (k = n · j (j = 1 to 17)) at the time of particle update. The horizontal axis N represents the number of variances β ₁ ^{2 to} β ₅ ² counted from the first step, and the vertical axis β ₁ ^{2 to} β ₅ ² represents the value of the variance. In this experiment, since the variance of the parameter noise ζ is also calculated in the first step, a total of 18 variances are calculated. As is apparent from FIGS. 8A to 8E, the dispersions β ₁ ^{2 to} β ₅ ² are variable. For example, in FIG. 8A, the variance β ₁ ^{2 with} N = 1 is the variance of the parameter noise ζ _{1 · 0 at} the k = 0 step, and the variance β ₁ ^{2 with} N = 18 is k = 250 · This is the variance of the parameter noise ζ _{1, 250, 17 at} the 17th step. FIGS. 9A to 9E show the particle distribution of each parameter when the identification device 10 executes the robot arm parameter identification process under the above-described conditions. In FIG. 10, the dark color indicates an area where the number of particles is small, and the light color indicates an area where the number of particles is large. The broken line indicates the true value of the parameter, and the circle indicates the average value of the particles at the step of updating the particles.

（実験２）
図１０（ａ）〜（ｅ）は、各パラメータのパラメータノイズζ_ｋの分散β^２を定数とした場合の各パラメータのパーティクルの分布を示す。なお、この場合、各パラメータｍ_ｋ、φ_ｋ、γ_ｋ、ｄ_ｋ、Ｊ_ｋのパラメータノイズζ_ｋの分散を、それぞれβ_６ ^２、β_７ ^２、β_８ ^２、β_９ ^２、β_１０ ^２とすると、各分散は、β_６ ^２＝β_１ｍａｘ ^２＝０．０２^２、β_７ ^２＝β_２ｍａｘ ^２＝０．０５^２、β_８ ^２＝β_３ｍａｘ ^２＝２．５×１０^−３、β_９ ^２＝β_４ｍａｘ ^２＝２．５×１０^−３、β_１０ ^２＝β_５ｍａｘ ^２＝１．０×１０^−２と設定されている。 (Experiment 2)
FIGS. 10A to 10E show the particle distribution of each parameter when the variance β ² of the parameter noise ζ _k of each parameter is a constant. In this case, the variances of the parameter noises ζ _k of the parameters m _k , φ _k , γ _k , d _k , and J _k are β ₆ ² , β ₇ ² , β ₈ ² , β ₉ ² , β ₁₀ ^{2, respectively.} Then, each variance is β ₆ ² = β _1max ² = 0.02 ² , β ₇ ² = β _2max ² = 0.05 ² , β ₈ ² = β _3max ² = 2.5 × 10 ⁻³ , β ₉ ² = β _4max ² = 2.5 × 10 ⁻³ and β ₁₀ ² = β _5max ² = 1.0 × 10 ⁻² .

表１は、実験１、２を実施したときの最後のパーティクル更新時のステップにおけるパーティクルの平均と分散を各パラメータについて示している。表２は、実験１、２を各１０回実施したときの最後のパーティクル更新時のステップにおけるパーティクルの平均の誤差率（％）と分散の平均を各パラメータについて示している。
図９、１０、表１、２によると、実験１、２共にパーティクルの平均はパーティクルを更新する度に徐々に真値に近い値に収束している。しかしながら、実験１のほうが、パラメータＪを除いてパーティクルの平均が真値により近く、かつ、平均の誤差率も小さい。パラメータＪに関しても、実験１と２の平均の差及び平均の誤差率の差はごく僅かである。また、パラメータφを除いて、実験１のほうが、パーティクルの分散の平均は大幅に小さくなっている。パラメータφに関しても、実験１と２の分散の平均の差はごく僅かである。このことから、パラメータノイズの分散を可変としたほうが、分散を定数とする場合よりも、各回のシミュレーションの結果がばらつくことを抑制しながら、パラメータの同定精度をより向上できることが分かる。 Table 1 shows the average and dispersion of the particles in each step at the time of the last particle update when Experiments 1 and 2 were performed for each parameter. Table 2 shows, for each parameter, the average error rate (%) of particles and the average of dispersion at the step of updating the last particle when Experiments 1 and 2 are performed 10 times.
According to FIGS. 9 and 10 and Tables 1 and 2, the average of the particles in Experiments 1 and 2 gradually converges to a value close to the true value every time the particles are updated. However, in Experiment 1, except for the parameter J, the average of the particles is closer to the true value, and the average error rate is smaller. Regarding parameter J, the average difference between experiments 1 and 2 and the difference in average error rate are negligible. Moreover, the average of particle dispersion is much smaller in Experiment 1 except for the parameter φ. Regarding the parameter φ, the average difference between the variances of Experiments 1 and 2 is very small. From this, it can be seen that the variable parameter noise variance can improve the parameter identification accuracy while suppressing the variation of the simulation results each time, compared to the case where the variance is a constant.

また、実験１に関して、表１、表２の結果によると、パラメータφ、γ、ｄ、Ｊの平均の誤差率はいずれも０．５％未満であるのに対し、パラメータｍの平均の誤差率は１．８２９％と他のパラメータと比べてやや大きい。これは、パラメータｍの変化に対してθ_ｋが不感であることが原因と考えられる。このため、同定結果をセンサレス力制御等の制御に応用する場合には、それほど大きな影響を与えないものと思われる。 Regarding Experiment 1, according to the results of Tables 1 and 2, the average error rate of the parameters φ, γ, d, and J are all less than 0.5%, whereas the average error rate of the parameter m is Is 1.829%, which is slightly larger than other parameters. This is considered to be because θ _k is insensitive to the change of the parameter m. For this reason, when the identification result is applied to control such as sensorless force control, it seems that the influence is not so great.

また、図９、１０を比較すると、実験２では、実験１よりも、パーティクルの更新を繰り返す過程でパーティクルの平均が真値から大きく逸脱していることが分かる。これは、パラメータを同定するための十分な情報がない場合であっても、一定の分散を有するパラメータノイズζが同定すべきパラメータに与えられるためである。特に、実験２では、パラメータＪのパーティクルの平均が、ｊ＝１３、１４で真値から大幅に逸脱している。この原因は、次のように推測される。即ち、図３、４によると、ｊ＝１３、１４のステップ（即ち、ｔ＝３０〜３２．５秒）では、その約７．５秒前から入力トルクτがほぼゼロであり、出力角度θの角加速度がほぼゼロである。このように、ロボットアームの運動が角加速度を殆ど発生しない場合に、実験２のように一定の分散を有するパラメータノイズζをパラメータに与えると、パーティクルが不要に拡散して、真値ではない値にパーティクルが集合してしまうことが原因であると推測される。実験１のように、パラメータを同定するための十分な情報がある場合とない場合でパラメータノイズζの分散β^２を可変とすることにより、パーティクルの更新の終了時だけではなく、パーティクルの更新を繰り返す過程においても、高い精度でパラメータを同定できることが分かる。 9 and 10, it can be seen that in Experiment 2, the average of the particles greatly deviates from the true value in the process of repeatedly updating the particles in Experiment 2. This is because even when there is not enough information for identifying a parameter, parameter noise ζ having a certain variance is given to the parameter to be identified. In particular, in Experiment 2, the average of the particles of parameter J deviates significantly from the true value at j = 13 and 14. This cause is presumed as follows. That is, according to FIGS. 3 and 4, in the step of j = 13, 14 (that is, t = 30 to 32.5 seconds), the input torque τ is almost zero from about 7.5 seconds before the output angle θ. The angular acceleration of is almost zero. In this way, when the motion of the robot arm hardly generates angular acceleration, if the parameter noise ζ having a certain variance is given to the parameter as in Experiment 2, the particles are unnecessarily diffused and are not true values. This is presumed to be caused by the aggregation of particles. As in Experiment 1, by changing the variance β ² of the parameter noise ζ depending on whether or not there is sufficient information for identifying the parameter, not only at the end of the update of the particle but also at the end of the update of the particle. It can be seen that the parameters can be identified with high accuracy even in the process of repetition.

次に、図１２を参照して実施例２について説明する。本実施例では、制御器４０が、制御対象となるシステム４２と通信可能に接続されている。制御器４０は、システムパラメータ同定装置４４と制御部４６を備える。制御部４６は、システム４２からの制御出力値を目標値に近づけるように、システム４２からの制御出力値を用いたフィードバック制御を行うことによってシステム４２への制御入力値を決定する。制御部４６は、システム４２への制御入力値を決定するための制御パラメータを有しており、制御入力値は、制御パラメータに基づいて決定される（後述）。 Next, Example 2 will be described with reference to FIG. In the present embodiment, the controller 40 is communicably connected to the system 42 to be controlled. The controller 40 includes a system parameter identification device 44 and a control unit 46. The control unit 46 determines the control input value to the system 42 by performing feedback control using the control output value from the system 42 so that the control output value from the system 42 approaches the target value. The control unit 46 has a control parameter for determining a control input value to the system 42, and the control input value is determined based on the control parameter (described later).

同定装置４４は、実施例１の同定装置１０と異なり、システムパラメータをオンラインで同定する。即ち、同定装置４４の入力値取得部及び観測値取得部は、システム４２の制御入力値及び制御出力値を、システム４２の動作中にリアルタイムで取得する。同定装置４４は、ノイズ決定処理を実施しない点を除いて、実施例１の同定装置１０と同様の処理を実施する。即ち、本実施例では、ｋ＝ｎ・ｊステップ目（ｊ＝１〜ｍ−１）におけるシステムパラメータのパラメータノイズζの分散β^２は、可変ではなく一定とされる。同定装置４４のパーティクルフィルタ実行部２４により、ｋ＝ｎ・ｊステップ目（ｊ＝１〜ｍ）毎にパーティクルの更新が行われると、更新後のパーティクルの平均及び分散が、制御部４６に出力される。 Unlike the identification device 10 of the first embodiment, the identification device 44 identifies system parameters online. That is, the input value acquisition unit and the observation value acquisition unit of the identification device 44 acquire the control input value and the control output value of the system 42 in real time during the operation of the system 42. The identification device 44 performs the same processing as that of the identification device 10 of the first embodiment except that the noise determination processing is not performed. That is, in the present embodiment, the variance β ² of the parameter noise ζ of the system parameter at the k = n · jth step (j = 1 to m−1) is not variable but constant. When the particle filter execution unit 24 of the identification device 44 updates the particles every k = n · j steps (j = 1 to m), the average and variance of the updated particles are output to the control unit 46. Is done.

本実施例では、制御パラメータを算出するための制御パラメータ計算式が、パーティクルの平均、パーティクルの分散、及び制御出力値と制御入力値との偏差を変数とする関数となるように設計されている。このため、制御部４６は、ｋ＝ｎ・ｊステップ目において同定装置４４から出力されたパーティクルの平均及び分散と、制御出力値と制御入力値との偏差とを制御パラメータ計算式に入力して、ｋ＝ｎ・ｊステップ目の制御パラメータを算出する。制御部４６は、この制御パラメータに基づいて、ｋ＝ｎ・ｊ＋１ステップ目の制御入力値を決定する。制御器４０は、上記の処理をｋ＝ｎ・ｊステップ目毎に繰り返すことにより、制御パラメータを同定する。 In the present embodiment, the control parameter calculation formula for calculating the control parameter is designed to be a function having the average of particles, the dispersion of particles, and the deviation between the control output value and the control input value as variables. . For this reason, the control unit 46 inputs the average and variance of the particles output from the identification device 44 at the k = n · j step, and the deviation between the control output value and the control input value to the control parameter calculation formula. K = n · jth step control parameters are calculated. Based on this control parameter, the control unit 46 determines the control input value of the k = n · j + 1 step. The controller 40 identifies the control parameter by repeating the above processing every k = n · j steps.

この構成によると、制御器４０は、システム４２のフィードバック制御とシステムパラメータの同定を同時に行う。制御器４０は、同定装置４４によるオンラインでのシステムパラメータの同定結果（具体的には、パーティクルの平均及び分散）を利用して、制御パラメータを決定する。この構成によると、制御部４６が試行錯誤的に制御パラメータを決定する構成と比較して、効率的に正確な制御パラメータを決定できる。 According to this configuration, the controller 40 simultaneously performs feedback control of the system 42 and identification of system parameters. The controller 40 determines a control parameter by using an online system parameter identification result (specifically, average and dispersion of particles) by the identification device 44. According to this configuration, it is possible to efficiently determine an accurate control parameter as compared with a configuration in which the control unit 46 determines the control parameter by trial and error.

以上、本発明の具体例を詳細に説明したが、これらは例示にすぎず、特許請求の範囲を限定するものではない。特許請求の範囲に記載の技術には、以上に例示した具体例を様々に変形、変更したものが含まれる。 Specific examples of the present invention have been described in detail above, but these are merely examples and do not limit the scope of the claims. The technology described in the claims includes various modifications and changes of the specific examples illustrated above.

例えば、同定装置は、各パーティクルの重みｗ^ｉを、１個の更新周期内に含まれるｎ個のステップのうちの少なくとも２個のステップにおいて取得された観測値Ｙ^ｒと推定値Ｙ(〜)^ｉとに基づいて算出してもよい。また、各パーティクルの重みｗ^ｉは、同時確率以外の方法（即ち、式５以外の重み計算式）で求めてもよい。また、分散α^２、ρ^２は、ステップ毎に変化するように予め設定してもよい。また、パラメータノイズζ及び観測ノイズσの平均はゼロでなくてもよい。また、ノイズ決定部２８は、パーティクルが更新される全てのステップにおいてパラメータノイズζを決定する構成でなくてもよい。例えば、ノイズ決定部２８は、パーティクルが更新される全てのステップのうち、１つおきのステップにおいて（即ち、周期的に）パラメータノイズζを決定する構成であってもよい。 For example, the identification apparatus sets the weight w ^{i of} each particle to the observed value Y ^r and the estimated value Y (˜) acquired in at least two of the n steps included in one update period. You may calculate based on ⁱ . Further, the weight w ^{i of} each particle may be obtained by a method other than the joint probability (that is, a weight calculation formula other than Equation 5). The variances α ² and ρ ² may be set in advance so as to change at each step. Further, the average of the parameter noise ζ and the observation noise σ may not be zero. Further, the noise determining unit 28 may not be configured to determine the parameter noise ζ in all steps in which particles are updated. For example, the noise determination unit 28 may be configured to determine the parameter noise ζ in every other step (that is, periodically) among all the steps in which particles are updated.

また、本明細書または図面に説明した技術要素は、単独であるいは各種の組合せによって技術的有用性を発揮するものであり、出願時請求項記載の組合せに限定されるものではない。また、本明細書または図面に例示した技術は複数目的を同時に達成するものであり、そのうちの一つの目的を達成すること自体で技術的有用性を持つものである。 The technical elements described in this specification or the drawings exhibit technical usefulness alone or in various combinations, and are not limited to the combinations described in the claims at the time of filing. In addition, the technology illustrated in the present specification or the drawings achieves a plurality of objects at the same time, and has technical utility by achieving one of the objects.

１０：システムパラメータ同定装置、１２：入力値取得部、１４：観測値取得部、１６：第１関数記憶部、１８：第２関数記憶部、２０：状態ベクトル算出部、２２：推定値算出部、２４：パーティクルフィルタ実行部、２６：分散計算式記憶部、２８：ノイズ決定部、３０：外部記憶媒体、３２：入力値データ、３４：観測値データ、４０：制御器、４４：同定装置 10: system parameter identification device, 12: input value acquisition unit, 14: observation value acquisition unit, 16: first function storage unit, 18: second function storage unit, 20: state vector calculation unit, 22: estimated value calculation unit , 24: particle filter execution unit, 26: dispersion calculation formula storage unit, 28: noise determination unit, 30: external storage medium, 32: input value data, 34: observation value data, 40: controller, 44: identification device

Claims

A system parameter identification device that identifies a parameter of a system to be controlled using a particle filter,
An input value acquisition unit, an observation value acquisition unit, a first function storage unit, a second function storage unit, a state vector calculation unit, an estimated value calculation unit, and a particle filter execution unit,
The input value acquisition unit can acquire an input value to the system for each step in an arbitrary period,
The observation value acquisition unit can acquire the observation value of the output from the system for each step in an arbitrary period,
The first function storage unit calculates a first function that calculates a state vector of the system at an arbitrary step within the arbitrary period based on a state vector of the system at a step immediately before the arbitrary step. The system state vector includes elements of the system state quantity, the system parameters, and system noise associated with the input value;
The second function storage unit stores a second function for calculating an estimated value of the output of the system in the arbitrary step based on the state vector of the system in the arbitrary step.
The state vector calculation unit inputs the state vector of the system in the step immediately before the arbitrary step to the first function stored in the first function storage unit, and the state vector in the arbitrary step Calculate the system state vector,
The estimated value calculation unit inputs the state vector of the system in the arbitrary step to the second function stored in the second function storage unit, and estimates the output of the system in the arbitrary step To calculate
The particle filter execution unit assigns a state vector of the system to each of a plurality of particles, and is based on an observation value acquired from the observation value acquisition unit and an estimation value acquired from the estimation value calculation unit for each particle. Calculating the weight of each particle, and updating the particles based on the weight to estimate the state vector of the system and identifying the parameters of the system,
In the particle filter execution unit,
The particles are updated every nth step from the first step in the arbitrary period (n: an integer of 2 or more),
The weight of each particle is calculated based on the observed value and the estimated value acquired in at least two of the n steps included in one update cycle in which the particle is updated. Parameter identification device.

It further includes a noise applying unit that adds parameter noise to the parameters of the system,
The system parameter identification device according to claim 1, wherein the noise adding unit adds the parameter noise to a parameter of the system in a step in which particles are updated.

The system parameter identification device according to claim 2, wherein the variance of the parameter noise is variable.

If the particle is updated m times (m: integer greater than or equal to 2) within the arbitrary period, it is given at the step when it is updated j times (j: any integer from 1 to m−1). The variance of the parameter noise is determined based on an input value or an observation value acquired in at least one of the n steps from the n · j + 1 step to the n · j + n step. The system parameter identification device according to claim 3.

The system parameter identification device according to any one of claims 1 to 4, wherein a weight of each particle is calculated from a joint probability in the at least two steps.

A controller for controlling the system, comprising the system parameter identification device according to claim 1,
The control parameter of the controller is determined based on a dispersion and an average of particles updated by the particle filter execution unit.

A system parameter identification method for identifying a parameter of a system to be controlled using a particle filter,
An input value acquisition step, an observation value acquisition step, a state vector calculation step, an estimated value calculation step, and a particle filter execution step,
In the input value acquisition step, an input value to the system is acquired for each step in an arbitrary period,
In the observation value acquisition step, the observation value of the output of the system is acquired for each step in an arbitrary period,
In the state vector calculation step, the state vector of the system, the system state vector including the system state quantity, the system parameter, and the system noise related to the input value as elements, in any step within the arbitrary period The system state vector in the previous step is input to a first function that calculates the system state vector based on the system state vector in the previous step of the arbitrary step. Calculating the state vector of the system at any step;
In the estimated value calculating step, a second function for calculating an estimated value of the output of the system at the arbitrary step based on the state vector of the system at the arbitrary step is used as a second state function of the system at the arbitrary step. To calculate an estimate of the output of the system at the arbitrary step,
In the particle filter execution step, a system state vector is assigned to each of a plurality of particles, and based on the observation value acquired in the observation value acquisition step and the estimation value acquired in the estimation value calculation step for each particle. Calculating the weight of each particle, and updating the particles based on the weight to estimate the state vector of the system and identifying the parameters of the system,
In the particle filter execution step,
The particles are updated every nth step from the first step in the arbitrary period (n: an integer of 2 or more),
The weight of each particle is calculated based on the observed value and the estimated value acquired in at least two of the n steps included in one update cycle in which the particle is updated. Parameter identification method.

A computer program for identifying parameters of a system to be controlled using a particle filter,
Let the computer execute an input value acquisition process, an observation value acquisition process, a state vector calculation process, an estimated value calculation process, and a particle filter execution process,
In the input value acquisition process, an input value to the system is acquired for each step in an arbitrary period,
In the observation value acquisition process, the observation value of the output of the system is acquired for each step in an arbitrary period,
In the state vector calculation process, the state vector of the system, the system state vector including the system state quantity, the system parameter, and the system noise related to the input value as elements, and in an arbitrary period within the arbitrary period The system state vector in the previous step is input to a first function that calculates the state vector of the system in the step based on the state vector of the system in the step immediately before the arbitrary step. And calculating the state vector of the system in the arbitrary step,
In the estimated value calculating process, a second function for calculating an estimated value of the output of the system at the arbitrary step based on the state vector of the system at the arbitrary step is used as a second state function of the system at the arbitrary step. To calculate an estimate of the output of the system at the arbitrary step,
In the particle filter execution process, a state vector of the system is assigned to each of a plurality of particles, and based on the observed value acquired in the observed value acquisition process and the estimated value acquired in the estimated value calculation process for each particle. Calculating the weight of each particle, and updating the particles based on the weight to estimate the state vector of the system and identifying the parameters of the system,
In the particle filter execution process,
The particles are updated every nth step from the first step in the arbitrary period (n: an integer of 2 or more),
The weight of each particle is calculated based on the observed value and the estimated value acquired in at least two of the n steps included in one update cycle in which the particle is updated. program.