JP7421719B2

JP7421719B2 - Control devices, control systems, and control programs

Info

Publication number: JP7421719B2
Application number: JP2019014886A
Authority: JP
Inventors: 将吾米倉; 康夫國吉
Original assignee: National Institute of Advanced Industrial Science and Technology AIST; University of Tokyo NUC
Current assignee: National Institute of Advanced Industrial Science and Technology AIST; University of Tokyo NUC
Priority date: 2019-01-30
Filing date: 2019-01-30
Publication date: 2024-01-25
Anticipated expiration: 2039-01-30
Also published as: WO2020158439A1; JP2020121376A

Description

本発明は、産業用と医療用および家庭用などのロボットや移動システムの運動・制御、製造プラントなど、複雑で動的な状態変化を伴うシステムにおける制御装置、制御システム、および制御プログラムに関する。 The present invention relates to a control device, a control system, and a control program for systems that involve complex and dynamic state changes, such as the movement and control of industrial, medical, and household robots and mobile systems, and manufacturing plants.

工業、商業、農業などの産業界、手術や看護・介護などの医療界、さらには清掃など家庭におけるロボットや産業機械の複雑化・高機能化が急激に進んでいる。これら、ロボットや産業機械などの装置の構成要素は一様で無く、また作業対象や動作環境は必ずしも一定では無い。 Robots and industrial machines are rapidly becoming more complex and sophisticated in industries such as industry, commerce, and agriculture, in the medical field such as surgery, nursing, and nursing care, and even in the home, such as cleaning. The components of these devices such as robots and industrial machines are not uniform, and the work objects and operating environments are not necessarily constant.

その様な中でニューロンネットワーク（ニューラルネットワーク）を適用した装置に繰り返し学習を行なうことで、個体毎の運動パターンを生成する二足歩行ロボットとして、特許文献１が提案されている。特許文献１では請求項１に記載の通り、ニューロンネットワークを備えており、図３や［０００４］に記載の通り、繰り返し学習により設計精度の向上や設計時間の短縮を図っている。 Under such circumstances, Patent Document 1 has been proposed as a bipedal walking robot that generates movement patterns for each individual by repeatedly performing learning on a device to which a neuron network is applied. As described in claim 1, Patent Document 1 includes a neuron network, and as described in FIG. 3 and [0004], it is intended to improve design accuracy and shorten design time through repeated learning.

特開２００６－８８３３１号公報JP2006-88331A

特許文献１で利用されているニューロンネットワークは図１１や段落［０００７］～［０００８］に記載されている通り、複数の入力に対して一意の重み付け係数Ｗ＿ｋ＿＊をかけて出力信号を生成している。この重み付け係数Ｗ＿ｋ＿＊を繰り返し学習によって最適化しているものである。そのため、装置個体の構成が固定され、さらに環境が一定の条件下では最適化が可能であるが、外乱などによる予期しない環境の変化には追随出来ない。 As described in FIG. 11 and paragraphs [0007] to [0008], the neuron network used in Patent Document 1 generates an output signal by applying a unique weighting coefficient W_k_* to multiple inputs. There is. This weighting coefficient W_k_* is optimized through repeated learning. Therefore, although the configuration of the individual device is fixed and optimization is possible under conditions where the environment is constant, it is not possible to follow unexpected changes in the environment due to disturbances or the like.

本発明は、かかる事情を鑑みてなされたものであり、予期しない外的環境の変化が発生しても、自律的に反応し、制御対象が所望の動作を安定的に行なうことを可能とする制御装置、制御システム、および制御プログラムを提供することを目的とする。 The present invention has been made in view of the above circumstances, and enables the controlled object to react autonomously and stably perform desired operations even when unexpected changes in the external environment occur. The purpose is to provide a control device, a control system, and a control program.

本発明によれば、制御装置であって、駆動信号を制御対象に供給することで前記制御対象を制御可能に構成されるもので、スパイク信号列生成部と駆動信号生成部とを備え、前記スパイク信号列生成部は、前記制御対象を制御するための基本制御信号と擾乱を含む内部状態、および内部状態に関するダイナミクスによって規定されるタイミングで、スパイク信号列を生成可能に構成され、前記駆動信号生成部は、前記スパイク信号列に基づいて時系列に連続変化する前記駆動信号を生成可能に構成される、制御装置が提供される。 According to the present invention, the control device is configured to be able to control the controlled object by supplying a drive signal to the controlled object, and includes a spike signal train generation section and a drive signal generation section, The spike signal train generation unit is configured to be able to generate a spike signal train at a timing defined by a basic control signal for controlling the controlled object, an internal state including disturbance, and dynamics regarding the internal state, and A control device is provided in which the generation unit is configured to be able to generate the drive signal that changes continuously in time series based on the spike signal train.

本発明に係る制御装置では、基本制御信号を、前記スパイク信号列生成部により一旦スパイク信号列に変換したのちに、前記駆動信号生成部にて生成した駆動信号を用いて制御対象に対する制御を行なう。このとき、予期しない外的環境の変化が発生しても、制御装置側が自律的に反応し、制御システム全体が所望の動作を行なうことが可能となるという有利な効果を奏する。 In the control device according to the present invention, the basic control signal is once converted into a spike signal train by the spike signal train generation section, and then the control target is controlled using the drive signal generated by the drive signal generation section. . At this time, even if an unexpected change in the external environment occurs, the control device side reacts autonomously, and the advantageous effect is that the entire control system can perform the desired operation.

本発明の実施形態に係る制御装置および制御対象からなる制御システムの機能ブロック図。FIG. 1 is a functional block diagram of a control system including a control device and a controlled object according to an embodiment of the present invention. 制御装置における最適化制御フロー図。The optimization control flow diagram in the control device. スパイク信号列を用いた制御例として水平軸上の粒子位置を制御する構成図。FIG. 2 is a configuration diagram for controlling particle positions on a horizontal axis as an example of control using a spike signal train. 水平軸上を移動する粒子に関する３重／２重／１重井戸ポテンシャルを示す状態図。FIG. 2 is a phase diagram showing triple/double/single well potentials for particles moving on a horizontal axis. 秩序創発機能のうち、エントロピー減少・パターン形成機能に関するシミュレーション結果図。Simulation result diagram for entropy reduction and pattern formation functions among order emergence functions. 秩序創発機能のうち、目標状態への引き込み領域拡大機能に関するシミュレーション結果図。A diagram showing simulation results regarding the function of enlarging the region of attraction to the target state among the order emergence functions. 秩序創発機能のうち、自然周波数へのバインディング機能に関するシミュレーション結果図。A simulation result diagram regarding the binding function to natural frequencies among the order emergence functions. 制御システムの一例である筋骨格ロボット制御システムの構成図。FIG. 1 is a configuration diagram of a musculoskeletal robot control system, which is an example of a control system. 筋骨格ロボット制御システムの低摩擦環境における移動速度シミュレーション結果図。Diagram of movement speed simulation results for a musculoskeletal robot control system in a low-friction environment. 筋骨格ロボット制御システムの低摩擦環境における協調運動能力シミュレーション結果図。Coordination ability simulation result diagram in a low-friction environment for a musculoskeletal robot control system. スパイク信号列生成部と駆動信号生成部を外付け制御装置とした制御システムの機能ブロック図。FIG. 2 is a functional block diagram of a control system in which a spike signal train generation section and a drive signal generation section are external control devices.

以下、図面を用いて本発明の実施形態について説明する。以下に示す実施形態中で示した各種特徴事項は、互いに組み合わせ可能である。特に、本明細書において「部」とは、例えば、広義の回路によって実施されるハードウェア資源と、これらのハードウェア資源によって具体的に実現されうるソフトウェアの情報処理とを合わせたものも含みうる。また、本実施形態においては様々な情報を取り扱うが、これら情報は、０または１で構成される２進数のビット集合体として信号値の高低によって表されるデジタル信号情報と、電圧・電流が連続的に変化するアナログ信号情報、および時間軸上で瞬間的に電圧・電流が発生するスパイク信号情報で、広義の回路上で通信・演算が実行されうる。 Embodiments of the present invention will be described below with reference to the drawings. Various features shown in the embodiments described below can be combined with each other. In particular, in this specification, the term "unit" may include, for example, a combination of hardware resources implemented by circuits in a broad sense and software information processing that can be specifically implemented by these hardware resources. . In addition, various types of information are handled in this embodiment, and these information include digital signal information expressed by the high and low signal values as a binary bit collection consisting of 0 or 1, and continuous voltage/current information. Communication and calculations can be performed on circuits in a broad sense using analog signal information that changes over time and spike signal information that instantaneously generates voltage and current on the time axis.

また、広義の回路とは、デジタル回路（ＤｉｇｉｔａｌＣｉｒｃｕｉｔ）、アナログ回路（ＡｎａｌｏｇＣｉｒｃｕｉｔ）、光回路（ＯｐｔｉｃａｌＣｉｒｃｕｉｔ）、回路類（Ｃｉｒｃｕｉｔｒｙ）、プロセッサ（Ｐｒｏｃｅｓｓｏｒ）、およびメモリ（Ｍｅｍｏｒｙ）等を少なくとも適当に組み合わせることによって実現される回路である。すなわち、デジタル回路としては、特定用途向け集積回路（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ：ＡＳＩＣ）、プログラマブル論理デバイス（例えば、単純プログラマブル論理デバイス（ＳｉｍｐｌｅＰｒｏｇｒａｍｍａｂｌｅＬｏｇｉｃＤｅｖｉｃｅ：ＳＰＬＤ）、複合プログラマブル論理デバイス（ＣｏｍｐｌｅｘＰｒｏｇｒａｍｍａｂｌｅＬｏｇｉｃＤｅｖｉｃｅ：ＣＰＬＤ）、およびフィールドプログラマブルゲートアレイ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ：ＦＰＧＡ））等を含むものである。アナログ回路としては、抵抗、コンデンサ（Ｃａｐａｓｉｔｏｒ）、インダクタ（Ｉｎｄｕｃｔｏｒ）などの受動素子（ＰａｓｓｉｖｅＣｏｍｐｏｎｅｎｔ）、ダイオード（Ｄｉｏｄｅ）、トランジスタ（Ｔｒａｎｓｉｓｔｏｒ）、サイリスタ（Ｔｈｙｒｉｓｔｏｒ）などのディスクリート半導体（ＤｉｓｃｒｅｔｅＳｅｍｉｃｏｎｄｕｃｔｏｒ）、およびコンパレータ（Ｃｏｍｐａｒａｔｏｒ）などのアナログ集積回路（ＡｎａｌｏｇＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）等を含むものである。また、デジタル回路とアナログ回路の境界部に、Ｄ／Ａコンバータ（Ｄｉｇｉｔａｌ－ｔｏ－ＡｎａｌｏｇＣｏｎｖｅｒｔｅｒ）もしくはＡ／Ｄコンバータ（Ａｎａｌｏｇ－ｔｏ－ＤｉｇｉｔａｌＣｏｎｖｅｒｔｅｒ）を使用する回路構成も可能である。さらに光回路としては、発光ダイオード（ＬｉｇｈｔＥｍｉｔｔｉｎｇＤｉｏｄｅ）、半導体レーザー（ＳｅｍｉｃｏｎｄｕｃｔｏｒＬａｓｅｒ）などの発光素子（ＬｉｇｈｔＥｍｉｔｔｅｒ）、フォトダイオード（Ｐｈｏｔｏｄｉｏｄｅ）などの受光素子（Ｐｈｏｔｏｄｅｔｅｃｔｏｒ）、光ファイバー（ＯｐｔｉｏｃａｌＦｉｂｅｒ）などの光導波路（ＯｐｔｉｃａｌＷａｖｅｇｕｉｄｅ）さらには光集積回路（ＯｐｔｉｃａｌＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）等を含むものである。 In addition, a circuit in a broad sense includes at least a digital circuit, an analog circuit, an optical circuit, a circuit, a processor, a memory, etc. This is a circuit realized by combining them. That is, digital circuits include application specific integrated circuits (ASICs), programmable logic devices (for example, simple programmable logic devices (SPLDs)), and complex programmable logic devices (COMPLDs). ex Programmable Logic Device: CPLD), field programmable gate array (FPGA), and the like. Analog circuits include passive components such as resistors, capacitors, and inductors, and discrete semiconductors such as diodes, transistors, and thyristors. tor), and comparator It includes analog integrated circuits such as (Comparators) and the like. Further, a circuit configuration using a D/A converter (Digital-to-Analog Converter) or an A/D converter (Analog-to-Digital Converter) at the boundary between the digital circuit and the analog circuit is also possible. Furthermore, optical circuits include light emitting elements such as light emitting diodes and semiconductor lasers, photodetectors such as photodiodes, and optical fibers. Al Fiber) and other light guides It includes an optical waveguide, an optical integrated circuit, and the like.

１．全体構成
第１節では、本発明に係る制御装置を含む制御システム１の全体構成について図面を用いて説明する。図１は、本実施形態に係る制御システム１の構成概要を示す図である。制御システム１は、制御装置２および制御対象３とを備え、これらが電気的に接続されたシステムである。制御対象３は二脚歩行などのロボット（後述）、移動体、ペースメーカー、電気回路系、化学反応系、通信ネットワーク、社会経済管理システム、金融システム、生体ネットワークおよび動植物など、運動・状態に関して周辺環境変化により、電気的、力学的もしくは化学的内部状態などが変動する特性を持ち、所望である動作を行なうために制御を必要とするものである。前記内部状態は、前記制御システム１の機能や動作に関わり、かつ検知可能なものであれば項目は限定されない。 1. Overall Configuration In Section 1, the overall configuration of a control system 1 including a control device according to the present invention will be explained using the drawings. FIG. 1 is a diagram showing an outline of the configuration of a control system 1 according to the present embodiment. The control system 1 is a system that includes a control device 2 and a controlled object 3, and these are electrically connected. Controlled objects 3 include robots such as bipedal walking (described later), mobile objects, pacemakers, electric circuit systems, chemical reaction systems, communication networks, socio-economic management systems, financial systems, biological networks, and the surrounding environment in terms of movement and state. It has the characteristic that its electrical, mechanical, or chemical internal state fluctuates due to changes, and requires control in order to perform the desired operation. The internal state is not limited to any item as long as it is related to the function or operation of the control system 1 and can be detected.

１．１制御装置２
図１に示す通り、制御装置２は通信部２１と、記憶部２２と、制御部２３とを有し、これらの構成要素が制御装置２内部において通信バス２０を電気的に接続されている。以下、各構成要素についてさらに説明する。 1.1 Control device 2
As shown in FIG. 1, the control device 2 includes a communication section 21, a storage section 22, and a control section 23, and these components are electrically connected through a communication bus 20 inside the control device 2. Each component will be further explained below.

＜通信部２１＞
通信部２１は、制御対象３との間で情報の授受を行なうものである。ＵＳＢ、ＩＥＥＥ１３９４、Ｔｈｕｎｄｅｒｂｏｌｔ、有線ＬＡＮネットワーク通信等といった有線型の通信手段が好ましいものの、無線ＬＡＮネットワーク通信、５Ｇ／ＬＴＥ／３Ｇ等のモバイル通信、Ｂｌｕｅｔｏｏｔｈ（登録商標）通信等を必要に応じて含めてもよい。これらは一例であり、専用の通信規格を採用してもよい。すなわち、これら複数の通信手段の集合として実施することがより好ましい。 <Communication Department 21>
The communication unit 21 is for exchanging information with the controlled object 3. Although wired communication means such as USB, IEEE1394, Thunderbolt, wired LAN network communication, etc. are preferable, wireless LAN network communication, mobile communication such as 5G/LTE/3G, Bluetooth (registered trademark) communication, etc. may be included as necessary. Good too. These are just examples, and a dedicated communication standard may be adopted. That is, it is more preferable to implement it as a set of these plurality of communication means.

図１においては、通信部２１から制御対象３内の状態検知部３１および駆動部３０それぞれ別に接続している様子を示しているが、物理的な接続はまとめて１つとし、制御対象３内部で論理的に分配する構成としても良い。 Although FIG. 1 shows that the communication unit 21 is connected to the state detection unit 31 and drive unit 30 in the controlled object 3 separately, there is only one physical connection. It is also possible to have a configuration in which the data is logically distributed.

＜記憶部２２＞
記憶部２２は、様々な情報を記憶する揮発性または不揮発性の記憶媒体である。これは、例えばソリッドステートドライブ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ：ＳＳＤ）等のストレージデバイスとして、あるいは、プログラムの演算に係る一時的に必要な情報（引数、配列等）を記憶するランダムアクセスメモリ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ：ＲＡＭ）等のメモリとして実施されうる。また、これらの組合せであってもよい。 <Storage unit 22>
The storage unit 22 is a volatile or nonvolatile storage medium that stores various information. This can be used as a storage device such as a solid state drive (SSD), or as a random access memory that stores temporarily necessary information (arguments, arrays, etc.) related to program operations. It can be implemented as a memory such as RAM). Alternatively, a combination of these may be used.

特に、記憶部２２は、制御実行内容に関する各種パラメータ、制御対象３に関する形状、寸法、材質、重量などの個別特徴情報、最適化途中を含む連続制御時における過去の設定情報を記憶している。 In particular, the storage unit 22 stores various parameters related to control execution contents, individual characteristic information such as shape, size, material, and weight regarding the controlled object 3, and past setting information during continuous control including during optimization.

また、記憶部２２は、制御部２３によって実行される制御装置２に係る種々のプログラム等を記憶している。具体的には例えば、二脚歩行ロボットの様に複数の筋・腱および関節など複数の駆動要素を有する制御対象３に関する動作手順や、制御部２３を構成する基本制御信号生成部２３１、スパイク信号列生成部２３２、駆動信号生成部２３３で用いるパラメータ群の初期値や更新手順である。 Furthermore, the storage unit 22 stores various programs related to the control device 2 that are executed by the control unit 23 . Specifically, for example, the operation procedure regarding the controlled object 3 having multiple driving elements such as multiple muscles/tendons and joints like a biped walking robot, the basic control signal generation unit 231 forming the control unit 23, and the spike signal These are the initial values and updating procedures of the parameter group used by the column generation unit 232 and the drive signal generation unit 233.

＜制御部２３＞
制御部２３は、制御装置２に関連する全体動作の処理・制御を行なう。制御部２３は、例えば不図示の中央処理装置（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ：ＣＰＵ）である。制御部２３は、記憶部２２に記憶された所定のプログラムを読み出すことによって、制御装置２に係る種々の機能を実現する。具体的には制御対象３毎に予め与えられた情報、制御対象３内状態検知部３１から通信部２１を介して受信した状態情報を元に、基本制御信号生成部２３１とスパイク信号列生成部２３２と駆動信号生成部２３３を通じて制御対象３への駆動信号ＡＳを生成し制御を実施する機能が該当する。 <Control unit 23>
The control unit 23 processes and controls the overall operation related to the control device 2. The control unit 23 is, for example, a central processing unit (CPU) not shown. The control unit 23 implements various functions related to the control device 2 by reading predetermined programs stored in the storage unit 22. Specifically, based on information given in advance for each controlled object 3 and state information received from the controlled object 3 internal state detection section 31 via the communication section 21, the basic control signal generation section 231 and the spike signal train generation section This corresponds to the function of generating the drive signal AS to the controlled object 3 through the drive signal generating section 232 and the drive signal generating section 233 and performing control.

すなわち、ソフトウェア（記憶部２２に記憶されている）による情報処理がハードウェア（制御部２３）によって具体的に実現されることで、基本制御信号生成部２３１、スパイク信号列生成部２３２、および駆動信号生成部２３３として実行されうる。なお、図１においては、単一の制御部２３として表記されているが、実際の構成はこれに限るものではなく、機能毎に複数の制御部２３を有するように実施してもよい。また、それらの組合せであっても良い。以下、基本制御信号生成部２３１、スパイク信号列生成部２３２、駆動信号生成部２３３についてさらに詳述する。 That is, information processing by software (stored in the storage unit 22) is concretely realized by the hardware (control unit 23), so that the basic control signal generation unit 231, the spike signal train generation unit 232, and the drive It can be executed as the signal generation unit 233. Note that although FIG. 1 shows a single control section 23, the actual configuration is not limited to this, and may be implemented so as to have a plurality of control sections 23 for each function. Alternatively, a combination thereof may be used. The basic control signal generation section 231, spike signal train generation section 232, and drive signal generation section 233 will be described in further detail below.

［基本制御信号生成部２３１］
基本制御信号生成部２３１はソフトウェア（記憶部２２に記憶されている）による情報処理がハードウェア（制御部２３）によって具体的に実現されているものである。基本制御信号生成部２３１は、通信部２１を介して制御対象３の状態検知部３１から得た状態情報、および制御対象３毎に予め与えられたパラメータを元に、非スパイク信号状である基本制御信号ＣＳを生成するものである。制御アルゴリズムは限定されるものではなく、フィードバック制御、フィードフォワード制御、モデル予測制御、深層学習を用いた制御など各種アルゴリズムが利用可能である。 [Basic control signal generation unit 231]
The basic control signal generation unit 231 is a unit in which information processing by software (stored in the storage unit 22) is concretely realized by hardware (control unit 23). The basic control signal generation unit 231 generates a basic control signal in the form of a non-spike signal based on state information obtained from the state detection unit 31 of the controlled object 3 via the communication unit 21 and parameters given in advance for each controlled object 3. It generates a control signal CS. The control algorithm is not limited, and various algorithms such as feedback control, feedforward control, model predictive control, and control using deep learning can be used.

なお、後述する秩序創発機能を最大限利用するには、基本制御信号生成部２３１単体の周波数特性として、強い自然周波数／固有周波数ピークを持たない制御アルゴリズムとパラメータ設定が望ましい。 Note that in order to make maximum use of the order generation function described below, it is desirable to set a control algorithm and parameters that do not have a strong natural frequency/natural frequency peak as the frequency characteristic of the basic control signal generation section 231 alone.

［スパイク信号列生成部２３２］
スパイク信号列生成部２３２は、ソフトウェア（記憶部２２に記憶されている）による情報処理がハードウェア（制御部２３）によって具体的に実現されているもので、ハードウェアは前述したデジタル回路およびアナログ回路の組合せで構成される。 [Spike signal train generation unit 232]
In the spike signal train generation section 232, information processing by software (stored in the storage section 22) is specifically realized by hardware (control section 23), and the hardware includes the digital circuit and analog circuit described above. Consists of a combination of circuits.

スパイク信号列生成部２３２は、基本制御信号生成部２３１で生成された基本制御信号ＣＳを入力とし、スパイク信号列ＳＴを生成する要素であるニューロン（図１中不図示）を内包するものである。このスパイク信号列生成部２３２は、生体において確率的にインパルス状の活動電位を発生するニューロンネットワークすなわち確率的スパイキングニューロンネットワーク（ＳｔｏｃｈｓａｓｔｉｃａｌｌｙＳｐｉｋｉｎｇＮｅｕｒｏｎＮｅｔｗｏｒｋ：ｓＳＮＮ）と同等の動作をするものである。スパイク信号列生成部２３２内ニューロンとしては、ＬＩＦ（ｌｅａｋｙｉｎｔｅｇｒａｔｅ－ａｎｄ－ｆｉｒｅ）ニューロンを始め、ポアソン（Ｐｏｉｓｓｏｎ）スパイクモデルやホジキン－ハクスレイ（Ｈｏｄｇｋｉｎ－Ｈｕｘｌｅｙ）モデル、バースト発火可能なモデルなど、入力となる基本制御信号と擾乱を含む内部状態、および内部状態に関するダイナミクスによって規定されるタイミングでスパイク信号列を生成するモデルが適用可能である。 The spike signal train generation unit 232 receives the basic control signal CS generated by the basic control signal generation unit 231 as input, and includes neurons (not shown in FIG. 1) which are elements that generate the spike signal train ST. . The spike signal train generation unit 232 operates in the same manner as a neuron network that stochastically generates impulse-like action potentials in a living body, that is, a stochastically spiking neuron network (sSNN). Neurons in the spike signal train generation unit 232 include LIF (leaky integrate-and-fire) neurons, Poisson spike models, Hodgkin-Huxley models, models capable of burst firing, etc. A model that generates a spike signal train at a timing defined by a basic control signal, an internal state including disturbances, and dynamics related to the internal state is applicable.

スパイク信号列生成部２３２（ｓＳＮＮ）内に複数のニューロンが存在する場合は、それらニューロン同士の間に任意のシナプス結合を有することも可能である。その際、全てのニューロンは同期発火しない様に設計される。具体的には、例えばニューロン毎に独立のノイズを受ける、あるいはニューロン毎に異なる発火閾値（後述）やリセット電位（後述）を設定する、などである。 When a plurality of neurons exist in the spike signal train generation unit 232 (sSNN), it is also possible to have arbitrary synaptic connections between these neurons. At that time, all neurons are designed so that they do not fire synchronously. Specifically, for example, each neuron receives independent noise, or sets a different firing threshold (described later) or reset potential (described later) for each neuron.

ここでは、ＬＩＦニューロンの場合に関して数式を用いてより詳しく説明する。ｉ番目のスパイク信号列生成部２３２（ｓＳＮＮ）におけるｊ番目のＬＩＦニューロンは、下に記述する［数１］［数２］［数３］として表すことができる。

ここに、ｖ＿ｉｊは電位、変数上のドットは時間微分、γは減衰係数、ｂ＿ｉｊはバイアス入力（電流）、Ｉは入力信号、Ｄ＿ｉはｉ番目のスパイク信号列生成部２３２におけるノイズ強度、ξ＿ｉｊは単位強度のガウシアン（正規分布）ノイズ、ｖ＾θは発火閾値、ｖ＿ｉｊ＾Ｒはリセット電位、τ＿ｒｅｆは不能期間、ｋはスパイク信号発生順番号である。 Here, the case of the LIF neuron will be explained in more detail using mathematical formulas. The j-th LIF neuron in the i-th spike signal train generation unit 232 (sSNN) can be expressed as [Equation 1], [Equation 2], and [Equation 3] described below.

Here, v_ij is the electric potential, dots on variables are time differentials, γ is the attenuation coefficient, b_ij is the bias input (current), I is the input signal, D_i is the noise intensity in the i-th spike signal train generation section 232, and ξ_ij is Gaussian (normal distribution) noise of unit strength, v^θ is a firing threshold, v_ij^R is a reset potential, τ_ref is a disabled period, and k is a spike signal generation order number.

対象であるＬＩＦニューロンの内部電位ｖ＿ｉｊがｖ＾θに到達すると発火し、数学的にはディラック（Ｄｉｒａｃ）のデルタ関数δで記述されるスパイク信号を生成すると共に、内部電位はｖ＿ｉｊ＾Ｒにリセットされ、τ＿ｒｅｆの不能期に入る。［数１］で示したＬＩＦニューロンではガウシアンノイズという形で擾乱を加える事により多数のＬＩＦが同期発火することを防いでいる。［数１］から明らかな通り、ノイズ成分印加以外に発火閾値ｖ＾θやリセット電位ｖ＿ｉｊ＾Ｒを個別に設定することでも同期発火を防ぐことが可能である。個々のＬＩＦニューロンにおけるスパイク信号列σ＿ｉｊは形式的に［数４］で記述することが出来る。

この際、ｉ番目のスパイク信号列生成部２３２に複数のＬＩＦニューロンが存在する場合は、各ＬＩＦニューロンから出力される全てのスパイク信号列を活用する。その際にはスパイク信号列の平均を用いる、あるいはＬＩＦニューロン毎に線形の重み付けを行なう方式が適用できる。そうして得られた新たなスパイク信号列をスパイク信号列生成部２３２の出力であるスパイク信号列ＳＴとする。 It fires when the internal potential v_ij of the target LIF neuron reaches v^θ, generates a spike signal that is mathematically described by Dirac's delta function δ, and resets the internal potential to v_ij^R. and enters the disabled period of τ_ref. In the LIF neuron shown in [Equation 1], disturbance in the form of Gaussian noise is added to prevent a large number of LIFs from firing synchronously. As is clear from [Equation 1], synchronous firing can be prevented by individually setting the firing threshold v^θ and the reset potential v_ij^R in addition to applying the noise component. The spike signal train σ_ij in each LIF neuron can be formally described by [Equation 4].

At this time, if a plurality of LIF neurons exist in the i-th spike signal train generating section 232, all the spike signal trains output from each LIF neuron are utilized. In this case, a method using the average of the spike signal train or a method of linearly weighting each LIF neuron can be applied. The new spike signal train thus obtained is defined as the spike signal train ST, which is the output of the spike signal train generating section 232.

すなわち、スパイク信号列生成部２３２は、ニューロンネットワークを構成する複数のニューロンを有し、複数のニューロンそれぞれによって出力される信号に基づいて、スパイク信号列ＳＴを生成可能に構成される。 That is, the spike signal train generating section 232 has a plurality of neurons forming a neuron network, and is configured to be able to generate a spike signal train ST based on signals output by each of the plurality of neurons.

［駆動信号生成部２３３］
駆動信号生成部２３３は、ソフトウェア（記憶部２２に記憶されている）による情報処理がハードウェア（制御部２３）によって具体的に実現されているもので、ハードウェアは前述したデジタル回路およびアナログ回路の組合せで構成される。 [Drive signal generation unit 233]
In the drive signal generation section 233, information processing by software (stored in the storage section 22) is concretely realized by hardware (control section 23), and the hardware includes the digital circuit and analog circuit described above. Consists of a combination of

駆動信号生成部２３３は前記スパイク信号列生成部２３２で生成されたスパイク信号列ＳＴを制御対象３の駆動部３０に供給する駆動信号に変換する機能を有する。この駆動信号は生体ニューロンネットワークにおけるシナプス後電位（ｐｏｓｔｓｙｎａｐｔｉｃｐｏｔｅｎｔｉａｌ：ＰＳＰ）に基づいて生成されるものである。 The drive signal generation section 233 has a function of converting the spike signal train ST generated by the spike signal train generation section 232 into a drive signal to be supplied to the drive section 30 of the controlled object 3. This drive signal is generated based on a postsynaptic potential (PSP) in a biological neuron network.

スパイク信号列ＳＴからシナプス後電位相当である駆動信号ＡＳを生成するにはシナプス類似の方式を用いることが出来る。具体的にはローパスフィルタ、古典的α関数状シナプスモデル、方形波シナプスモデル、ダイナミックシナプスモデルなどである。 A synapse-like system can be used to generate the drive signal AS, which is equivalent to a postsynaptic potential, from the spike signal train ST. Specifically, these include a low-pass filter, a classical α-function synapse model, a square wave synapse model, and a dynamic synapse model.

駆動信号生成部２３３における信号処理方式をローパスフィルタとした場合、出力の基となるシナプス後電位（ｐｏｓｔｓｙｎａｐｔｉｃｐｏｔｅｎｔｉａｌ：ＰＳＰ）ｙ＿ｉ、制御対象３駆動部３０の活性度Ａ＿ｉは、それぞれ［数５］［数６］で記述することが可能である。

ここに、τ＿ｓはシナプス時定数、Ｎは対象とするｉ番目のスパイク信号列生成部２３２内のＬＩＦニューロン数、ｇ＿ｉ＾Ａは増幅ゲインで、Ａ＿ｉ＾０はオフセットである。 When the signal processing method in the drive signal generation unit 233 is a low-pass filter, the postsynaptic potential (PSP) y_i, which is the basis of the output, and the activation level A_i of the controlled object 3 drive unit 30 are each expressed as [Equation 5] [ 6].

Here, τ_s is a synaptic time constant, N is the number of LIF neurons in the target i-th spike signal train generating section 232, g_i^A is an amplification gain, and A_i^0 is an offset.

なお、図１では駆動信号ＡＳを駆動信号生成部２３３から駆動部３０間の直結線で伝えているが、Ａ／Ｄ（Ａｎａｌｏｇ－ｔｏ－Ｄｉｇｉｔａｌ）変換を行った後、デジタル信号として通信部２１を介して駆動部３０に伝達する構成も可能である。 Note that in FIG. 1, the drive signal AS is transmitted through a direct connection line between the drive signal generation section 233 and the drive section 30, but after A/D (Analog-to-Digital) conversion, the drive signal AS is transmitted as a digital signal to the communication section 21. A configuration in which the signal is transmitted to the drive unit 30 via the transmission line is also possible.

１．２制御対象３
制御対象３は具体的には例えば、機械的なタスクを実行するロボットや移動システムなどである。なお本発明の制御装置２はタスクの技術分野を限定するものでは無く、動的に作用するもので、かつ環境変動に伴う動作状態変化を検出可能なものであれば、電気回路システムや化学反応システムも制御対象３とすることが可能である。 1.2 Controlled object 3
Specifically, the controlled object 3 is, for example, a robot or a movement system that performs a mechanical task. Note that the control device 2 of the present invention is not limited to the technical field of the task, but can be applied to electrical circuit systems or chemical reactions as long as it acts dynamically and can detect changes in operating state due to environmental changes. The system can also be the controlled object 3.

［駆動部３０］
駆動部３０は、タスクを実行する際に、外部からの制御信号に基づき制御対象３を動作させるものである。具体的には例えば、制御対象３がロボットの場合におけるモーター、空圧・油圧などのアクチュエータなどであるが、これらに限定されるものでは無い。 [Drive section 30]
The drive unit 30 operates the controlled object 3 based on an external control signal when executing a task. Specifically, for example, when the controlled object 3 is a robot, it may be a motor, a pneumatic/hydraulic actuator, etc., but is not limited to these.

［状態検知部３１］
状態検知部３１は、制御対象３が動作時に外乱などによる環境変動があった場合を含めて制御対象３の内部状態を検知するものである。内部状態としては、制御対象３における注目する箇所の位置、速度、加速度、回転、角速度、角加速度、力およびモーメントなどの機械的力学的情報、電圧、電流および抵抗などの電気的情報、音や光の物理的情報、温度、圧力、流速などの流体力学情報、濃度、ｐＨ、分子量などの化学的情報等であるが、これらに限定されるものでは無い。検知した内部状態は状態情報として通信部２１に対して送信できる構成となっている。すなわち換言すると、状態情報とは、前記制御対象における注目する箇所において、制御対象の挙動および環境変動により変化する内部状態を示す情報である。 [State detection unit 31]
The state detection unit 31 detects the internal state of the controlled object 3, including when there is an environmental change due to a disturbance or the like when the controlled object 3 is in operation. The internal state includes mechanical information such as the position, velocity, acceleration, rotation, angular velocity, angular acceleration, force and moment of the point of interest in the controlled object 3, electrical information such as voltage, current and resistance, sound and These include, but are not limited to, physical information of light, fluid dynamics information such as temperature, pressure, and flow rate, and chemical information such as concentration, pH, and molecular weight. The detected internal state is configured to be able to be transmitted to the communication unit 21 as state information. In other words, the state information is information indicating an internal state that changes due to the behavior of the controlled object and environmental fluctuations at a point of interest in the controlled object.

２．制御システム１の最適化方法
第２節では、制御システム１において、制御装置２のパラメータ最適化方法について説明する。ここでは、例えば制御対象３が二脚歩行ロボットの場合であれば、指定した地点へ移動するという作業を実行するなどであるが、本発明の最適化方法は制御対象３やその作業の種類により限定されるものでは無い。 2. Optimization Method for Control System 1 In Section 2, a parameter optimization method for the control device 2 in the control system 1 will be described. Here, for example, if the controlled object 3 is a bipedal walking robot, a task such as moving to a specified point is executed, but the optimization method of the present invention depends on the controlled object 3 and the type of its work. It is not limited.

外乱が少ない環境において、複数回の基本タスクを繰り返し実行することで、前記制御装置２のパラメータを最適化する際の最適化フローを図２に示す。 FIG. 2 shows an optimization flow when optimizing the parameters of the control device 2 by repeatedly executing a basic task a plurality of times in an environment with few disturbances.

［最適化開始］
（ステップＳ１）
基本制御信号生成部２３１、スパイク信号列生成部２３２、駆動信号生成部２３３内の各パラメータ群を初期化する。初期化に用いるパラメータ値は記憶部２２に記憶されている情報を用いることができる。記憶部２２に記憶されている情報とは不揮発的に継続して記憶されている情報だけでなく、ユーザーが作業開始時に制御対象３の個々の特徴および外部環境状況を鑑みて外部から入力した情報も含む。 [Start optimization]
(Step S1)
Each parameter group in the basic control signal generation section 231, spike signal train generation section 232, and drive signal generation section 233 is initialized. Information stored in the storage unit 22 can be used as the parameter values used for initialization. The information stored in the storage unit 22 includes not only information that is continuously stored in a non-volatile manner, but also information input from the outside by the user at the start of work, taking into account the individual characteristics of the controlled object 3 and the external environment situation. Also included.

基本制御信号生成部２３１では採用したアルゴリズムにて使用されるパラメータを初期設定する。具体的には、例えばフィードバック制御の一種であるＰＩＤ（Ｐｒｏｐｏｒｔｉｏｎａｌ－Ｉｎｔｅｇｒａｌ－Ｄｉｆｆｅｒｅｎｔｉａｌ）制御では、比例ゲインＫ＿Ｐ、積分ゲインＫ＿Ｉ、微分ゲインＫ＿Ｄなどである。 The basic control signal generation unit 231 initializes parameters used in the adopted algorithm. Specifically, for example, in PID (Proportional-Integral-Differential) control, which is a type of feedback control, there are proportional gain K_P, integral gain K_I, differential gain K_D, etc.

スパイク信号列生成部２３２では確率的スパイキングニューロンネットワーク（ｓＳＮＮ）としてのあらゆるパラメータを初期設定する。例えばスパイク信号列生成部２３２をＬＩＦニューロンで構成した場合、ニューロン数Ｎや前記［数１］［数２］に含まれる変数、具体的には減衰係数γ、バイアス入力ｂ＿ｉｊ、ノイズ強度Ｄ＿ｉ、発火閾値ｖ＾θ、リセット電位ｖ＿ｉｊ＾Ｒ、不能期間τ＿ｒｅｆなどである。 The spike signal train generation unit 232 initializes all parameters for the stochastic spiking neuron network (sSNN). For example, when the spike signal train generation unit 232 is configured with LIF neurons, the number of neurons N and the variables included in [Equation 1] and [Equation 2], specifically, the attenuation coefficient γ, bias input b_ij, noise intensity D_i, firing These include the threshold value v^θ, the reset potential v_ij^R, and the disabled period τ_ref.

駆動信号生成部２３３では、スパイク信号列ＳＴから駆動信号ＡＳを生成するのに採用した方式に関するパラメータを初期設定する。例えばローパスフィルタ方式を採用した場合、時定数τや通過域利得などをフィルタ特性値として設定を行なう。駆動信号生成部２３３の全てもしくは一部を電気的なアナログ回路として構成する場合は、抵抗の抵抗値、コンデンサの容量値などで固定、もしくは半固定的に予め設定しておくことも可能である。 The drive signal generation unit 233 initializes parameters related to the method adopted to generate the drive signal AS from the spike signal train ST. For example, when a low-pass filter method is adopted, the time constant τ, passband gain, etc. are set as filter characteristic values. When all or part of the drive signal generation section 233 is configured as an electrical analog circuit, it is also possible to set the resistance value of a resistor, the capacitance value of a capacitor, etc. in advance in a fixed or semi-fixed manner. .

（ステップＳ２）
基本制御信号生成部２３１、スパイク信号列生成部２３２、駆動信号生成部２３３内の各パラメータの更新を行なう。全てのパラメータは更新対象となりうるが、制御部２３全体としての概略の方向性は基本制御信号ＣＳに大きく依存するため、基本制御信号生成部２３１部の最適化を主たる対象とするのが望ましい。スパイク信号列生成部２３２および駆動信号生成部２３３に関しては、例えばニューロン数Ｎ、シナプス時定数τ＿ｓ、ノイズ強度Ｄを更新対象とし、その他シナプス結合強度に関するパラメータなどは更新しないという制御方法を取ることが可能である。後述のステップＳ４により収束していないと判定される毎に、前記各パラメータが更新される。 (Step S2)
Each parameter in the basic control signal generation section 231, spike signal train generation section 232, and drive signal generation section 233 is updated. All parameters can be updated, but since the general directionality of the control section 23 as a whole depends largely on the basic control signal CS, it is desirable to optimize the basic control signal generation section 231 as the main target. Regarding the spike signal train generation section 232 and the drive signal generation section 233, a control method may be adopted in which, for example, the number of neurons N, the synaptic time constant τ_s, and the noise intensity D are updated, and other parameters related to the synaptic connection strength are not updated. It is possible. Each time it is determined that convergence has not occurred in step S4, which will be described later, each of the parameters is updated.

（ステップＳ３）
制御対象３が制御装置２からの制御に従い基本タスクを実行する。状態検知部３１にて制御対象３における前述した各種内部状態を検知し、状態情報を制御装置２内通信部２１に送信する。基本制御信号生成部２３１にて制御対象３全体としての評価値を計算する。後述のステップＳ４により収束していないと判定される毎に、基本タスクの実行も１回目、２回目、３回目と回数が増えていく。 (Step S3)
The controlled object 3 executes basic tasks under control from the control device 2. The state detection unit 31 detects the various internal states described above in the controlled object 3 and transmits state information to the communication unit 21 within the control device 2 . The basic control signal generation unit 231 calculates an evaluation value for the controlled object 3 as a whole. Each time it is determined that convergence has not been achieved in step S4, which will be described later, the number of executions of the basic task increases, such as the first, second, and third times.

（ステップＳ４）
基本制御信号生成部２３１において、システム全体が収束しているかどうかを判定する。収束していない（ＮＯ）と判定された場合は、ステップＳ２に戻ってパラメータ更新作業から継続する。パラメータ更新時における学習アルゴリズムとしては、遺伝アルゴリズムなどの進化戦略を適用することが可能であるが、それに限定するものでは無い。収束している（ＹＥＳ）と判定された場合は作業を終了する。
［最適化終了］ (Step S4)
The basic control signal generation unit 231 determines whether the entire system has converged. If it is determined that the process has not converged (NO), the process returns to step S2 and continues from the parameter update work. As a learning algorithm when updating parameters, an evolutionary strategy such as a genetic algorithm can be applied, but the learning algorithm is not limited thereto. If it is determined that the process has converged (YES), the work is finished.
[Optimization finished]

３．秩序創発機能
第３節では、本発明の構成における制御装置２が有する秩序創発機能について詳述する。これは、基本制御信号生成部２３１にて生成した基本制御信号ＣＳから、一旦スパイク信号列生成部２３２（ｓＳＮＮ）を用いてスパイク信号列ＳＴを生成し、さらにその後駆動信号生成部２３３にて駆動信号ＡＳを生成する構成を有する、本発明における制御装置２固有の機能であり従来知られたものでは無い。 3. Order Emergence Function In Section 3, the order emergence function possessed by the control device 2 in the configuration of the present invention will be described in detail. This is done by first generating a spike signal sequence ST using a spike signal sequence generation unit 232 (sSNN) from the basic control signal CS generated by the basic control signal generation unit 231, and then driving it using the drive signal generation unit 233. This is a function unique to the control device 2 of the present invention, which has a configuration for generating the signal AS, and is not known in the past.

本節では、秩序創発機能を示す例として、２つの確率的スパイキングニューロンネットワーク（ｓＳＮＮ）を有する場合を図３に示す。ここでは図３の水平軸上に存在する粒子の位置を制御するものとする。図３中Ｓ＿０、Ｓ＿１がｓＳＮＮで、本発明におけるスパイク信号列生成部２３２と駆動信号生成部２３３を内包するものとする。２つのｓＳＮＮはそれぞれ入力信号としてＩ＿０（ｔ）、Ｉ＿１（ｔ）を受け取る。ここでは入力信号Ｉ＿＊（ｔ）は、粒子の現在位置ｘ（ｔ）と目標位置ｘ＿０＾ｇ、ｘ＿１＾ｇの差分量として定義している。スパイク信号列生成部２３２としてはＬＩＦニューロン（第１節参照）、駆動信号生成部２３３としてはローパスフィルタ（第１節参照）を使用することとする。 In this section, as an example showing the order emergence function, a case with two stochastic spiking neuron networks (sSNN) is shown in FIG. 3. Here, it is assumed that the position of particles existing on the horizontal axis in FIG. 3 is controlled. In FIG. 3, S_0 and S_1 are sSNNs, which include the spike signal train generation section 232 and drive signal generation section 233 of the present invention. The two sSNNs each receive I_0(t) and I_1(t) as input signals. Here, the input signal I_*(t) is defined as the amount of difference between the current position x(t) of the particle and the target positions x_0^g and x_1^g. As the spike signal train generation section 232, an LIF neuron (see Section 1) is used, and as the drive signal generation section 233, a low-pass filter (see Section 1) is used.

３．１エントロピー減少・パターン形成機能
図４Ａに、３重井戸ポテンシャル関数における質量を持った粒子の位置をｓＳＮＮによって制御する場合を示す。ここでは中心（ｘ＝０）はポテンシャルの極小値ではあるが最小値ではなく、中心の両側にポテンシャルが最小となる場所が存在する点に留意されたい。 3.1 Entropy reduction/pattern formation function FIG. 4A shows a case where the position of a particle with mass in a triple well potential function is controlled by sSNN. Note that although the center (x=0) is the minimum value of the potential, it is not the minimum value, and there are locations on both sides of the center where the potential is minimum.

図４Ａ環境下での粒子位置移動状態のシミュレーション結果を図５Ａ、図５Ｂに示す。横軸は時間ｔ、縦軸は粒子位置ｘを示す。また図５Ａはニューロン数Ｎ＝２、図５Ｂはニューロン数Ｎ＝１５０の場合である。ＡｐＥｎは移動状態から算出したエントロピー（ＡｐｐｒｏｘｉｍａｔｅＥｎｔｒｏｐｙ）である。ニューロン数が少ない図５Ａは図５Ｂに比して粒子の移動量の絶対値は大きいが、これは２箇所存在するポテンシャル最小位置を周期的に移動していることが理由であり、その規則的な周期性のためエントロピーＡｐＥｎとしては小さい値となっている。この様に確率的スパイキングニューロンネットワーク（ｓＳＮＮ）においてはスパイク性が高いほど、エントロピー減少機能、パターン形成機能が発現する。 The simulation results of the particle position movement state under the environment of FIG. 4A are shown in FIGS. 5A and 5B. The horizontal axis shows time t, and the vertical axis shows particle position x. Further, FIG. 5A shows the case where the number of neurons is N=2, and FIG. 5B shows the case where the number of neurons is N=150. ApEn is the entropy (approximate entropy) calculated from the moving state. In Figure 5A, where the number of neurons is small, the absolute value of the amount of movement of the particle is larger than in Figure 5B, but this is because the particle moves periodically between two potential minimum positions, and its regularity. Due to the periodicity, the entropy ApEn has a small value. In this way, in a stochastic spiking neuron network (sSNN), the higher the spiking property, the more entropy reduction function and pattern formation function are expressed.

３．２目標状態の引き込み領域拡大機能
図４Ｂに、２重井戸ポテンシャル関数における質量を持った粒子の位置をｓＳＮＮによって制御する場合を示す。ここでは中心（ｘ＝０）はポテンシャルの極大値となっており、車の山登り問題（ｍｏｕｎｔａｉｎｃａｒｔａｓｋ）と同様に、谷底からポテンシャルの極大値ｘ＝０に直接到達することは出来ず、反動や外力の助けを必要とする問題設定とする。 3.2 Target state attraction region expansion function Figure 4B shows a case where the position of a particle with mass in a double well potential function is controlled by sSNN. Here, the center (x = 0) is the maximum potential value, and like the mountain car task, it is not possible to directly reach the maximum potential value x = 0 from the bottom of the valley, and the reaction The problem should be set up in such a way that it requires the help of external forces.

粒子の初期位置ｘ＿０と初期速度ｖ＿０を様々に変更して、一定時間以上中心付近［－０．１，０．１］の範囲内にとどまることが出来た場合を引き込み領域と定義してシミュレーションした結果を図６Ａ、図６Ｂに示す。図６Ａはニューロン数Ｎ＝１、図６Ｂはニューロン数Ｎ＝１００の場合である。図６Ａ、図６Ｂ中白い領域が引き込み領域である。また、バイアス入力ｂをパラメータにしてニューロン数Ｎを変化させた場合における引き込み領域割合（ｂａｓｉｎｒａｔｅ）のシミュレーション結果を図６Ｃに示す。図６Ａ、図６Ｂ、図６Ｃから明らかな様に確率的スパイキングニューロンネットワーク（ｓＳＮＮ）は引き込み領域を拡大する機能を有しており、ｓＳＮＮに含まれるニューロン数は少ない方が引き込み領域拡大機能を強く発現する場合が多い。 A simulation was performed by varying the initial position x_0 and initial velocity v_0 of the particle and defining the case where the particle could stay within the range [-0.1, 0.1] near the center for a certain period of time as the attraction region. The results are shown in FIGS. 6A and 6B. 6A shows the case where the number of neurons is N=1, and FIG. 6B shows the case where the number of neurons is N=100. The white area in FIGS. 6A and 6B is the pull-in area. Further, FIG. 6C shows a simulation result of the draw-in area ratio (basin rate) when the number N of neurons is changed using the bias input b as a parameter. As is clear from FIGS. 6A, 6B, and 6C, the stochastic spiking neuron network (sSNN) has the ability to expand the attraction area, and the smaller the number of neurons included in the sSNN, the more the ability to expand the attraction area. It is often strongly expressed.

３．３自然周波数へのバインディング機能
図４Ｃに、バネマス系における質量を持った粒子の位置をｓＳＮＮによって制御する場合を示す。ここでバネマス系とは１重井戸ポテンシャル関数と等しい。何も制御を行わないバネマス系では、ばね定数ｋと粒子の質量ｍで定まる自然周波数ｆ＿０（固有周波数）を有している。そのバネマス系に通常のフィードバック制御を行なうと、フィードバック制御のゲインなどの影響により自然周波数ｆ＿０が変調されることが知られている。
中心位置（ｘ＾ｇ＝０）を目標としてｓＳＮＮによる制御を実施し、自然周波数ｆ＿０に対するＳＮＲ（ｓｉｇｎａｌ－ｔｏ－ｎｏｉｓｅｒａｔｉｏ）をシミュレーションした結果を図７に示す。図７中、横軸はシナプス時定数τ＿ｓ、縦軸は増幅ゲインｇ＾Ａであり、より白い領域がＳＮＲが高いことを示している。自然周波数ｆ＿０が１～１０Ｈｚという非常に広いパラメータ領域において自然周波数ｆ＿０への共鳴現象が確認できる。また、多くの領域で白い縞模様が垂直方向に伸びている事から、確率的スパイキングニューロンネットワーク（ｓＳＮＮ）を用いてバネマス系を駆動する場合、自然周波数ｆ＿０にほとんど影響を与えてない事が明白である。 3.3 Binding function to natural frequency Figure 4C shows the case where the position of a particle with mass in a spring-mass system is controlled by sSNN. Here, the spring mass system is equivalent to a single double well potential function. A spring-mass system that does not perform any control has a natural frequency f_0 (natural frequency) determined by the spring constant k and the particle mass m. It is known that when normal feedback control is performed on the spring mass system, the natural frequency f_0 is modulated due to the influence of the gain of the feedback control.
FIG. 7 shows the results of simulating the SNR (signal-to-noise ratio) for the natural frequency f_0 by performing control using sSNN with the center position (x^g=0) as the target. In FIG. 7, the horizontal axis is the synaptic time constant τ_s, and the vertical axis is the amplification gain g^A, indicating that the whiter region has a higher SNR. A resonance phenomenon to the natural frequency f_0 can be confirmed in a very wide parameter range where the natural frequency f_0 is 1 to 10 Hz. Furthermore, since the white striped pattern extends vertically in many areas, it is clear that when driving a spring-mass system using a stochastic spiking neuron network (sSNN), it has almost no effect on the natural frequency f_0. It's obvious.

４．ロボット制御システム
第４節では、実施形態として、制御対象３としてロボット、さらに具体的には筋骨格ロボットを用いた二脚歩行ロボット制御システムのシミュレーション結果を説明する。 4. Robot Control System In Section 4, as an embodiment, simulation results of a biped walking robot control system using a robot, more specifically a musculoskeletal robot, as the controlled object 3 will be described.

図８にシミュレーションに用いた筋骨格ロボット制御システムの機能概略図を示す。図８左側が骨格（リンク）、関節（ジョイント）、筋（図中Ｍｕｓｃｌｅの線、一部省略）の構成を示しており、ロボット駆動部３０として各脚毎に８本の筋および多関節筋を接続している。また、ロボット状態検知部３１（図８中ＳｅｎｓｏｒｙＩｎｐｕｔ）として、筋発生力、筋長、関節角、上体姿勢、重心位置、足裏反力、各骨（リンク）において９軸慣性計測装置（ＩｎｅｒｔｉａＭｅａｓｕｒｅｍｅｎｔＵｎｉｔ）によって得られる３軸姿勢、３軸加速度、３軸角速度を測定可能な構成である。 Figure 8 shows a functional schematic diagram of the musculoskeletal robot control system used in the simulation. The left side of FIG. 8 shows the structure of the skeleton (links), joints, and muscles (muscle lines in the figure, some of which are omitted). The robot drive unit 30 includes eight muscles and multi-joint muscles for each leg. are connected. In addition, as a robot state detection unit 31 (Sensory Input in Figure 8), a 9-axis inertial measurement device ( It is configured to be able to measure 3-axis attitude, 3-axis acceleration, and 3-axis angular velocity obtained by Inertia Measurement Unit).

左右各脚は静止、振り動作など複数の相を有しており、各相毎に異なる反射活性化ルールを持っている。反射活性化ルール（図８中ＲｅｆｌｅｘＳｙｓｔｅｍ：基本制御信号生成部２３１）は発生力のポジティブフィードバック制御ルール、筋長のフィードバック制御ルール、関節角あるいは上体姿勢のＰＤ（比例微分）制御ルールの組合せで構築される。 Each leg has multiple phases, such as rest and swinging, and each phase has different reflex activation rules. The reflex activation rule (Reflex System: basic control signal generation unit 231 in FIG. 8) is a combination of positive feedback control rules for generated force, feedback control rules for muscle length, and PD (proportional differential) control rules for joint angles or body posture. Constructed with.

図９Ａ、図９Ｂ、図９Ｃには、滑りやすい低摩擦環境での重心移動速度シミュレーション結果を示す。図９Ａ、図９Ｂの横軸が時間ｔ、縦軸が重心移動速度ｖ＾ｇである。通常（図中破線）の摩擦係数μは１０としているが、低摩擦環境（図中実線）では時間ｔ＝［１０，４０］にて摩擦係数μを０．０４と低く設定している。図９Ａ、図９Ｂより通常環境、低摩擦環境ともに安定した二脚歩行動作が行われている。そのとき通常環境よりも低摩擦環境の方が全体に低速度側にシフトしている。図９Ｃは歩容周波数と振幅の関係を示したもので、低摩擦環境では低周波数側に遷移している。 FIGS. 9A, 9B, and 9C show simulation results of center-of-gravity movement speed in a slippery, low-friction environment. The horizontal axis of FIGS. 9A and 9B is time t, and the vertical axis is the center of gravity moving speed v^g. Normally (broken line in the figure) the friction coefficient μ is set to 10, but in a low friction environment (solid line in the figure) the friction coefficient μ is set as low as 0.04 at time t=[10,40]. From FIGS. 9A and 9B, stable bipedal walking motion is performed in both the normal environment and the low-friction environment. At that time, the overall speed is shifted to the lower speed side in the low friction environment than in the normal environment. FIG. 9C shows the relationship between gait frequency and amplitude, which shifts to the lower frequency side in a low friction environment.

図１０には大きな滑りが発生した状況における協調運動能力シミュレーション結果を示す。図１０Ａ、図１０Ｃは横軸ｘが位置を示し、ｘ＝［４，１６］を低摩擦区間とし摩擦係数μ＝０．０４、それ以外は通常で摩擦係数μ＝１０である。なお、図１０Ａのサンプリング間隔は０．１ｓ、図１０Ｃのサンプリング間隔は０．２５ｓである。低摩擦区間（図中ｓｌｉｐｐｅｒｙ帯）上にある、黒色帯は右足の滑り、灰色帯は左足の滑りを示している。図１０Ｂ、図１０Ｄは、それぞれ図１０Ａ、図１０Ｃに対応する時間ｔと重心移動速度ｖ＾ｇの関係を示している。 Figure 10 shows the results of a simulation of coordination ability in a situation where a large slip occurred. In FIGS. 10A and 10C, the horizontal axis x indicates the position, where x=[4,16] is a low friction section and the friction coefficient μ=0.04, and the other areas are normal and the friction coefficient μ=10. Note that the sampling interval in FIG. 10A is 0.1 s, and the sampling interval in FIG. 10C is 0.25 s. The black band on the low-friction section (slippery band in the figure) indicates slipping of the right foot, and the gray band indicates slipping of the left foot. FIGS. 10B and 10D show the relationship between time t and center of gravity moving speed v^g corresponding to FIGS. 10A and 10C, respectively.

図１０Ａ、図１０Ｂにおいては、０．５ｓ以上の時間、０．５ｍ程度の滑りが発生しているが、それに適応して歩行が継続出来ている。この際、左右の足で滑る距離が非対称になっている点に留意されたい。また、図１０Ｃ、図１０Ｄでは低摩擦区間終了地点であるｘ＝１６ｍ、ｔ＝１３ｓ付近でｖ＾ｇが極端に下がっており、これは低摩擦区間終了直前におけるやや長めの右足滑りから通常区間に入るときに転倒寸前の状態となったことを示している。この状況でも、つま先などが通常区間（μ＝１０）である滑りづらい地面に接触していることを足がかりとして、正常歩行に復帰することが出来ている。この様に、従来知られている反射回路のみの制御では実現困難であった、非常に高い適応能力を、本発明による制御装置２を用いたロボット制御システムは有している。 In FIGS. 10A and 10B, although a slip of about 0.5 m occurs for a period of 0.5 s or more, the patient is able to adapt and continue walking. At this time, it should be noted that the distances slid by the left and right feet are asymmetrical. In addition, in Figures 10C and 10D, v^g drops extremely near the end point of the low-friction section at x = 16 m, t = 13 s, and this is due to a slightly longer right foot slip just before the end of the low-friction section. This indicates that the person was on the verge of falling when entering the vehicle. Even in this situation, the patient is able to return to normal walking using the fact that his toes are in contact with the hard-to-slip ground in the normal section (μ=10) as a foothold. In this way, the robot control system using the control device 2 according to the present invention has extremely high adaptability, which has been difficult to achieve with conventionally known control using only reflection circuits.

本節で説明した二脚歩行ロボット制御システムにおける協調動作には、運動系列のエントロピーを低減する必要がある。また、転倒回避にはＺＭＰ（ｚｅｒｏ－ｍｏｍｅｎｔｐｏｉｎｔ）をある範囲内に制御する必要もある。第３節にて説明した通り、本発明における制御装置２が有する秩序創発機能（３．１エントロピー減少・パターン形成機能、３．２目標状態の引き込み領域拡大機能を参照されたい）が有効に働くことで、即時的な転倒回避機能が実現されていると言える。 For cooperative motion in the biped walking robot control system described in this section, it is necessary to reduce the entropy of the motion sequence. Furthermore, to avoid falling, it is also necessary to control ZMP (zero-moment point) within a certain range. As explained in Section 3, the order generation function (see 3.1 Entropy reduction/pattern formation function, 3.2 Target state attraction area expansion function) of the control device 2 of the present invention works effectively. Therefore, it can be said that an immediate fall avoidance function is realized.

５．変形例
なお、次のような態様によって、本実施形態を更に創意工夫してもよい。 5. Modifications The present embodiment may be further modified by the following aspects.

第４節では、二脚歩行ロボット制御システムの実施形態について説明したが、一般に移動システムでは移動に伴い外部環境の変動を伴うものであり、秩序創発機能を有する本発明の制御装置２の特徴を活かすことが出来る。また、秩序創発機能は、無人による完全自律型制御システムとして、あるいは有人システムの補助的な制御システムどちらでも活用することが出来る。移動システムとしては、具体的には例えば、多足歩行ロボット、車輪・キャタピラ型ロボット、無人航空機（ＵｎｍａｎｎｅｄＡｅｒｉａｌＶｅｈｉｃｌｅ：ＵＡＶ、ドローン）、無人水上艇（ＵｎｍａｎｎｅｄＳｕｒｆａｃｅＶｅｈｉｃｌｅ：ＵＳＶ）、無人潜水艇（ＵｎｍａｎｎｅｄＵｎｄｅｒｗａｔｅｒＶｅｈｉｃｌｅ：ＵＵＶ）、自動運転を含む自動車、航空機、船舶などであるが、これらに限定されるものでは無い。 In Section 4, the embodiment of the bipedal walking robot control system was described. Generally speaking, a mobile system involves changes in the external environment as it moves. You can take advantage of it. Furthermore, the order generation function can be utilized either as an unmanned fully autonomous control system or as an auxiliary control system for a manned system. Specific examples of mobile systems include multilegged robots, wheeled and caterpillar robots, unmanned aerial vehicles (UAVs, drones), unmanned surface vehicles (USVs), and unmanned submersibles (unmanned submersibles). Underwater vehicles (UUV), automobiles including self-driving vehicles, aircraft, ships, etc., but are not limited to these.

さらには、運搬や加工に関して作業対象物が頻繁に変更される産業用・医療用・農業用・家庭用のロボットにも適用可能である。 Furthermore, the present invention can also be applied to industrial, medical, agricultural, and household robots in which work objects are frequently changed in terms of transportation and processing.

本発明による制御装置２が持つ秩序創発機能は、状態の変動に適応する必要がある制御システム１であれば技術分野を限定するものでは無い。すなわち機械的な運動に対するものだけでは無く、電気的変動あるいは化学反応的変動に対しても発揮することが可能である。さらには、金融システムの制御やインターネットなどのコミュニケーションネットワークにおける情報の流入・流出・伝播の制御、空調システム、などにも適用可能である事が期待できる。したがって、例えば秩序創発機能のうち自然周波数へのバインディング機能（３．３参照）を活用した心臓ペースメーカーや人工心肺などへ応用することも可能である。 The order generation function of the control device 2 according to the present invention is not limited to any technical field as long as the control system 1 needs to adapt to changes in state. In other words, it can be applied not only to mechanical motion, but also to electrical fluctuations or chemical reaction fluctuations. Furthermore, it can be expected to be applied to control of financial systems, control of inflow, outflow, and propagation of information in communication networks such as the Internet, and air conditioning systems. Therefore, it is also possible to apply it, for example, to a cardiac pacemaker or a heart-lung machine that utilizes the natural frequency binding function (see 3.3) among the order emergence functions.

本発明におけるスパイク信号列生成部２３２および駆動信号生成部２３３は、図１に示した様に、基本制御信号生成部２３１、通信部２１、記憶部２２などと共に１つの制御装置２として構成することも可能であるが、スパイク信号列生成部２３２と駆動信号生成部２３３を外付け制御装置とすることも可能である。 The spike signal train generation section 232 and the drive signal generation section 233 in the present invention can be configured as one control device 2 together with the basic control signal generation section 231, the communication section 21, the storage section 22, etc., as shown in FIG. However, it is also possible to use the spike signal train generation section 232 and the drive signal generation section 233 as external control devices.

図１１に機能ブロック図を示す。図１１中、２ｂが基本制御信号ＣＳを生成する基本制御装置、３が制御対象である。基本制御装置２ｂと制御対象３の組合せだけでも従来方式の制御は可能であるが、その従来制御を補足すべく外付け制御装置２ａを接続している。外付け制御装置２ａにはスパイク信号列生成部２３２および駆動信号生成部２３３が配備されている。既存の制御システムに外付け制御装置２ａを追加することで、第３節で説明した秩序創発機能を活用することが可能となり、制御システム１の機能・性能を向上することが出来る。 FIG. 11 shows a functional block diagram. In FIG. 11, 2b is a basic control device that generates a basic control signal CS, and 3 is a controlled object. Conventional control is possible with just the combination of the basic control device 2b and the controlled object 3, but the external control device 2a is connected to supplement the conventional control. A spike signal train generation section 232 and a drive signal generation section 233 are provided in the external control device 2a. By adding the external control device 2a to an existing control system, it becomes possible to utilize the order generation function described in Section 3, and the functions and performance of the control system 1 can be improved.

さらには、スパイク信号列は撹乱に起因して確率的に生成されるものに限らず、十分な複雑さと予測不能性を含むならばカオスなどを利用して決定論的に生成されたスパイク信号列であっても同等の機能を得る事が出来る。 Furthermore, spike signal trains are not limited to those generated stochastically due to disturbances, but can also be spike signal sequences generated deterministically using chaos etc. if they contain sufficient complexity and unpredictability. However, you can get the same functionality.

６．結言
以上のように、本実施形態によれば、予期しない外的環境の変化が発生しても、自律的に反応し、制御システム１全体が所望の動作を行なうことを可能とする制御装置２を実施することが出来る。 6. Conclusion As described above, according to the present embodiment, even if an unexpected change in the external environment occurs, the control device 2 reacts autonomously and enables the entire control system 1 to perform the desired operation. can be carried out.

かかる制御装置２は、駆動信号を制御対象３に供給することで前記制御対象３を制御可能に構成されるもので、スパイク信号列生成部２３２と駆動信号生成部２３３とを備え、前記スパイク信号列生成部２３２は、前記制御対象３を制御するための基本制御信号ＣＳおよび擾乱を含む内部状態によって規定されるタイミングで、スパイク信号列ＳＴを生成可能に構成され、前記駆動信号生成部は、前記スパイク信号列ＳＴに基づいて時系列に連続変化する前記駆動信号ＡＳを生成可能に構成される。 The control device 2 is configured to be able to control the controlled object 3 by supplying a drive signal to the controlled object 3, and includes a spike signal train generation section 232 and a drive signal generation section 233, and includes a spike signal train generation section 232 and a drive signal generation section 233. The train generation unit 232 is configured to be able to generate a spike signal train ST at a timing defined by the basic control signal CS for controlling the controlled object 3 and an internal state including disturbance, and the drive signal generation unit The drive signal AS is configured to be able to generate the drive signal AS that continuously changes in time series based on the spike signal train ST.

また、これにより以下の制御システム１を実施することが出来る。 Moreover, this makes it possible to implement the following control system 1.

かかる制御システム１は、制御対象３と、前記制御対象３を制御する制御装置２とを備え、
前記制御対象３は、ロボット、移動体、ペースメーカー、電気回路系、および化学反応系の少なくとも１つであり、前記制御装置２は、上に記載した制御装置２である。 Such a control system 1 includes a controlled object 3 and a control device 2 that controls the controlled object 3,
The controlled object 3 is at least one of a robot, a moving body, a pacemaker, an electric circuit system, and a chemical reaction system, and the control device 2 is the control device 2 described above.

制御装置２また制御システム１をハードウェアとして実施するためのソフトウェアを、プログラムとして実施することもできる。そして、このようなプログラムを、コンピュータが読み取り可能な非一時的な記録媒体として提供してもよいし、外部のサーバからダウンロード可能に提供してもよいし、外部のコンピュータで当該プログラムを起動させて、クライアント端末で各機能を実施可能な、いわゆるクラウド・コンピューティングを実施してもよい。 Software for implementing the control device 2 or the control system 1 as hardware can also be implemented as a program. Such a program may be provided as a computer-readable non-temporary recording medium, may be provided for download from an external server, or may be provided by running the program on an external computer. So-called cloud computing, in which each function can be performed on a client terminal, may be implemented.

かかる制御プログラムは、制御対象を制御するためのもので、コンピュータに、スパイク信号列生成機能と駆動信号生成機能とを実行させるもので、前記スパイク信号列生成機能によれば、前記制御対象３を制御するための基本制御信号ＣＳと擾乱を含む内部状態とによって規定されるタイミングで、スパイク信号列ＳＴを生成させ、前記駆動信号生成機能によれば、前記スパイク信号列ＳＴに基づいて時系列に連続変化する前記駆動信号ＡＳを生成させることとする。 This control program is for controlling a controlled object and causes a computer to execute a spike signal train generation function and a drive signal generation function. According to the spike signal train generation function, the control object 3 is A spike signal train ST is generated at a timing defined by a basic control signal CS for control and an internal state including disturbance, and according to the drive signal generation function, the spike signal train ST is chronologically generated based on the spike signal train ST. The drive signal AS that continuously changes is generated.

最後に、本発明に係る種々の実施形態を説明したが、これらは、例として提示したものであり、発明の範囲を限定することは意図していない。当該新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。当該実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれるものである。 Finally, although various embodiments according to the present invention have been described, these are presented as examples and are not intended to limit the scope of the invention. The new embodiment can be implemented in various other forms, and various omissions, substitutions, and changes can be made without departing from the gist of the invention. The embodiment and its modifications are included within the scope and gist of the invention, and are included within the scope of the invention described in the claims and its equivalents.

１：制御システム
２：制御装置
２ａ：外付け制御装置
２ｂ：基本制御装置
２０：通信バス
２１：通信部
２２：記憶部
２３：制御部
２３１：基本制御信号生成部
２３２：スパイク信号列生成部
２３３：駆動信号生成部
３：制御対象
３０：駆動部
３１：状態検知部
ＣＳ：制御信号
ＳＴ：スパイク信号列
ＡＳ：駆動信号
Ｉ：入力信号
ＡｐＥｎ：エントロピー
Ｎ：ニューロン数
ｘ＿０：初期位置
ｖ＿０：初期速度
ｂ：バイアス入力
ｆ＿０：自然周波数
τ＿ｓ：シナプス時定数
ｇ＾Ａ：増幅ゲイン
ｖ＾ｇ：重心移動速度
μ ：摩擦係数 1: Control system 2: Control device 2a: External control device 2b: Basic control device 20: Communication bus 21: Communication section 22: Storage section 23: Control section 231: Basic control signal generation section 232: Spike signal train generation section 233 : Drive signal generation unit 3 : Control object 30 : Drive unit 31 : State detection unit CS : Control signal ST : Spike signal sequence AS : Drive signal I : Input signal ApEn : Entropy N : Number of neurons x_0 : Initial position v_0 : Initial velocity b: Bias input f_0: Natural frequency τ_s: Synaptic time constant g^A: Amplification gain v^g: Center of gravity movement speed μ: Friction coefficient

Claims

A control device,
comprising a basic control signal generation section, a spike signal train generation section, and a drive signal generation section,
The basic control signal generation unit is configured to be able to generate a non-spike basic control signal for controlling the controlled object based on state information of the controlled object,
The state information is information indicating an internal state that changes due to the behavior of the controlled object and environmental changes at a point of interest in the controlled object,
The spike signal train generation unit has a plurality of neurons forming a neuron network, and is configured to be able to generate a spike signal train based on spike signals generated from each of the plurality of neurons,
Each of the neurons is configured to generate the spike signal based on a disturbance and a firing threshold of the neuron by inputting the basic control signal;
The drive signal generation unit is configured to be able to generate a drive signal that continuously changes in time series based on the spike signal train, wherein the drive signal corresponds to a postsynaptic potential in the neuron network, and the drive signal corresponds to a postsynaptic potential in the neuron network. configured to be able to control the control target by being supplied to the target,
At least the parameters included in the basic control signal generation unit are configured to update the parameters, cause the control target to execute a preset basic task based on the parameters, and execute the basic task. updated by repeatable optimization, including calculating an evaluation value based on the results ;
Control device.

The control device according to claim 1,
Additionally equipped with a communications department,
The communication unit is configured to be able to receive the status information.
Control device.

The control device according to claim 1 or 2,
The neuron is configured to be able to generate the spike signal using the generation of stochastic impulse-like action potentials in a living body as a model.
Control device.

A control system,
comprising a controlled object and a control device that controls the controlled object,
The controlled object is at least one of a robot, a mobile object, a pacemaker, an electric circuit system, a chemical reaction system, a communication network, a socio-economic management system, a financial system, a biological network, and an animal or plant,
The control device is the control device according to any one of claims 1 to 3,
control system.

The control system according to claim 4,
The controlled object is a musculoskeletal robot, and includes a robot drive unit and a robot state detection unit,
The robot driving unit includes a plurality of bones, a plurality of joints, a muscle that applies a pulling force between the adjacent bones, and/or a multi-joint muscle that applies a pulling force across the plurality of bones,
The robot state detection unit is configured to be able to detect at least one state of muscle generation force, muscle length, joint angle, body posture, center of gravity position, sole reaction force, 3-axis posture, 3-axis acceleration, and 3-axis angular velocity. be done,
control system.

A control program,
causing the computer to function as each part of the control device according to any one of claims 1 to 3;
control program.