JP2020121376A

JP2020121376A - Control device, control system and control program

Info

Publication number: JP2020121376A
Application number: JP2019014886A
Authority: JP
Inventors: 将吾米倉; Shogo Yonekura; 康夫國吉; Yasuo Kuniyoshi
Original assignee: National Institute of Advanced Industrial Science and Technology AIST; University of Tokyo NUC
Current assignee: National Institute of Advanced Industrial Science and Technology AIST; University of Tokyo NUC
Priority date: 2019-01-30
Filing date: 2019-01-30
Publication date: 2020-08-13
Anticipated expiration: 2039-01-30
Also published as: JP7421719B2; WO2020158439A1

Abstract

To provide a control device which autonomously reacts even if an unintentional change of an external environment occurs, and makes a control object stably perform a desired operation, a control system and a control program.SOLUTION: This control device can control a control object by supplying a drive signal to the control object, and comprises a spike signal row creation part and a drive signal creation part. The spike signal row creation part can create an internal state including a fundamental control signal for controlling the control object and a disturbance, and a spike signal row at timing which is defined by dynamics related to the internal state. The drive signal creation part can create the drive signal which is continuously changed in a time series on the basis of the spike signal row.SELECTED DRAWING: Figure 1

Description

本発明は、産業用と医療用および家庭用などのロボットや移動システムの運動・制御、製造プラントなど、複雑で動的な状態変化を伴うシステムにおける制御装置、制御システム、および制御プログラムに関する。 The present invention relates to a control device, a control system, and a control program in a system involving complicated and dynamic state changes such as motion/control of robots and moving systems for industrial use, medical use, and home use, a manufacturing plant, and the like.

工業、商業、農業などの産業界、手術や看護・介護などの医療界、さらには清掃など家庭におけるロボットや産業機械の複雑化・高機能化が急激に進んでいる。これら、ロボットや産業機械などの装置の構成要素は一様で無く、また作業対象や動作環境は必ずしも一定では無い。 Industrial and commercial industries such as agriculture, medical fields such as surgery and nursing/nursing care, and households such as cleaning robots and industrial machines are becoming more complex and highly functional. The components of these devices such as robots and industrial machines are not uniform, and the work target and operating environment are not always constant.

その様な中でニューロンネットワーク（ニューラルネットワーク）を適用した装置に繰り返し学習を行なうことで、個体毎の運動パターンを生成する二足歩行ロボットとして、特許文献１が提案されている。特許文献１では請求項１に記載の通り、ニューロンネットワークを備えており、図３や［０００４］に記載の通り、繰り返し学習により設計精度の向上や設計時間の短縮を図っている。 Patent Document 1 is proposed as a bipedal walking robot that generates a motion pattern for each individual by repeatedly performing learning in a device to which a neuron network (neural network) is applied. In Patent Document 1, a neuron network is provided as described in claim 1, and as described in FIG. 3 and [0004], it is attempted to improve design accuracy and shorten design time by iterative learning.

特開２００６−８８３３１号公報JP, 2006-88331, A

特許文献１で利用されているニューロンネットワークは図１１や段落［０００７］〜［０００８］に記載されている通り、複数の入力に対して一意の重み付け係数Ｗ＿ｋ＿＊をかけて出力信号を生成している。この重み付け係数Ｗ＿ｋ＿＊を繰り返し学習によって最適化しているものである。そのため、装置個体の構成が固定され、さらに環境が一定の条件下では最適化が可能であるが、外乱などによる予期しない環境の変化には追随出来ない。 The neuron network used in Patent Document 1 generates an output signal by applying a unique weighting coefficient W_k_* to a plurality of inputs, as described in FIG. 11 and paragraphs [0007] to [0008]. There is. This weighting coefficient W_k_* is optimized by iterative learning. Therefore, although the configuration of each device is fixed and the environment can be optimized under a constant environment, it cannot follow an unexpected change in the environment due to disturbance or the like.

本発明は、かかる事情を鑑みてなされたものであり、予期しない外的環境の変化が発生しても、自律的に反応し、制御対象が所望の動作を安定的に行なうことを可能とする制御装置、制御システム、および制御プログラムを提供することを目的とする。 The present invention has been made in view of such circumstances, and makes it possible for a controlled object to stably perform a desired operation by reacting autonomously even when an unexpected external environment change occurs. An object is to provide a control device, a control system, and a control program.

本発明によれば、制御装置であって、駆動信号を制御対象に供給することで前記制御対象を制御可能に構成されるもので、スパイク信号列生成部と駆動信号生成部とを備え、前記スパイク信号列生成部は、前記制御対象を制御するための基本制御信号と擾乱を含む内部状態、および内部状態に関するダイナミクスによって規定されるタイミングで、スパイク信号列を生成可能に構成され、前記駆動信号生成部は、前記スパイク信号列に基づいて時系列に連続変化する前記駆動信号を生成可能に構成される、制御装置が提供される。 According to the present invention, the control device is configured to control the control target by supplying a drive signal to the control target, and includes a spike signal train generation unit and a drive signal generation unit, The spike signal train generation unit is configured to be able to generate a spike signal train at an internal state including a basic control signal and a disturbance for controlling the controlled object, and at a timing defined by dynamics related to the internal state, and the drive signal A control device is provided in which the generation unit is configured to be capable of generating the drive signal that continuously changes in time series based on the spike signal train.

本発明に係る制御装置では、基本制御信号を、前記スパイク信号列生成部により一旦スパイク信号列に変換したのちに、前記駆動信号生成部にて生成した駆動信号を用いて制御対象に対する制御を行なう。このとき、予期しない外的環境の変化が発生しても、制御装置側が自律的に反応し、制御システム全体が所望の動作を行なうことが可能となるという有利な効果を奏する。 In the control device according to the present invention, the basic control signal is once converted into the spike signal train by the spike signal train generation unit, and then the control target is controlled using the drive signal generated by the drive signal generation unit. .. At this time, even if an unexpected change in the external environment occurs, the control device side reacts autonomously, and the entire control system can perform a desired operation, which is an advantageous effect.

本発明の実施形態に係る制御装置および制御対象からなる制御システムの機能ブロック図。1 is a functional block diagram of a control system including a control device and a control target according to an embodiment of the present invention. 制御装置における最適化制御フロー図。The optimization control flow chart in a control apparatus. スパイク信号列を用いた制御例として水平軸上の粒子位置を制御する構成図。The block diagram which controls the particle position on a horizontal axis as a control example using a spike signal train. 水平軸上を移動する粒子に関する３重／２重／１重井戸ポテンシャルを示す状態図。FIG. 3 is a state diagram showing a triple/double/single well potential for particles moving on a horizontal axis. 秩序創発機能のうち、エントロピー減少・パターン形成機能に関するシミュレーション結果図。Of the order emergence function, the simulation result diagram regarding the entropy reduction/pattern formation function. 秩序創発機能のうち、目標状態への引き込み領域拡大機能に関するシミュレーション結果図。Of the order emergence function, the simulation result diagram regarding the function of expanding the pull-in area to the target state. 秩序創発機能のうち、自然周波数へのバインディング機能に関するシミュレーション結果図。Of the order emergence function, the simulation result diagram regarding the binding function to the natural frequency. 制御システムの一例である筋骨格ロボット制御システムの構成図。The block diagram of the musculoskeletal robot control system which is an example of a control system. 筋骨格ロボット制御システムの低摩擦環境における移動速度シミュレーション結果図。FIG. 6 is a diagram showing a moving speed simulation result of a musculoskeletal robot control system in a low friction environment. 筋骨格ロボット制御システムの低摩擦環境における協調運動能力シミュレーション結果図。FIG. 6 is a diagram showing a result of a simulation of a cooperative movement ability of a musculoskeletal robot control system in a low friction environment. スパイク信号列生成部と駆動信号生成部を外付け制御装置とした制御システムの機能ブロック図。FIG. 3 is a functional block diagram of a control system in which the spike signal train generation unit and the drive signal generation unit are external control devices.

以下、図面を用いて本発明の実施形態について説明する。以下に示す実施形態中で示した各種特徴事項は、互いに組み合わせ可能である。特に、本明細書において「部」とは、例えば、広義の回路によって実施されるハードウェア資源と、これらのハードウェア資源によって具体的に実現されうるソフトウェアの情報処理とを合わせたものも含みうる。また、本実施形態においては様々な情報を取り扱うが、これら情報は、０または１で構成される２進数のビット集合体として信号値の高低によって表されるデジタル信号情報と、電圧・電流が連続的に変化するアナログ信号情報、および時間軸上で瞬間的に電圧・電流が発生するスパイク信号情報で、広義の回路上で通信・演算が実行されうる。 Embodiments of the present invention will be described below with reference to the drawings. The various features shown in the embodiments described below can be combined with each other. In particular, in the present specification, the “unit” may include, for example, a combination of hardware resources implemented by a circuit in a broad sense and information processing of software that can be specifically realized by these hardware resources. .. In addition, although various kinds of information are handled in the present embodiment, these pieces of information are digital signal information represented by high and low of a signal value as a binary bit aggregate composed of 0 or 1, and voltage and current are continuous. Communication and calculation can be performed on a circuit in a broad sense by analog signal information that changes dynamically and spike signal information that instantaneously generates voltage and current on the time axis.

また、広義の回路とは、デジタル回路（ＤｉｇｉｔａｌＣｉｒｃｕｉｔ）、アナログ回路（ＡｎａｌｏｇＣｉｒｃｕｉｔ）、光回路（ＯｐｔｉｃａｌＣｉｒｃｕｉｔ）、回路類（Ｃｉｒｃｕｉｔｒｙ）、プロセッサ（Ｐｒｏｃｅｓｓｏｒ）、およびメモリ（Ｍｅｍｏｒｙ）等を少なくとも適当に組み合わせることによって実現される回路である。すなわち、デジタル回路としては、特定用途向け集積回路（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ：ＡＳＩＣ）、プログラマブル論理デバイス（例えば、単純プログラマブル論理デバイス（ＳｉｍｐｌｅＰｒｏｇｒａｍｍａｂｌｅＬｏｇｉｃＤｅｖｉｃｅ：ＳＰＬＤ）、複合プログラマブル論理デバイス（ＣｏｍｐｌｅｘＰｒｏｇｒａｍｍａｂｌｅＬｏｇｉｃＤｅｖｉｃｅ：ＣＰＬＤ）、およびフィールドプログラマブルゲートアレイ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ：ＦＰＧＡ））等を含むものである。アナログ回路としては、抵抗、コンデンサ（Ｃａｐａｓｉｔｏｒ）、インダクタ（Ｉｎｄｕｃｔｏｒ）などの受動素子（ＰａｓｓｉｖｅＣｏｍｐｏｎｅｎｔ）、ダイオード（Ｄｉｏｄｅ）、トランジスタ（Ｔｒａｎｓｉｓｔｏｒ）、サイリスタ（Ｔｈｙｒｉｓｔｏｒ）などのディスクリート半導体（ＤｉｓｃｒｅｔｅＳｅｍｉｃｏｎｄｕｃｔｏｒ）、およびコンパレータ（Ｃｏｍｐａｒａｔｏｒ）などのアナログ集積回路（ＡｎａｌｏｇＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）等を含むものである。また、デジタル回路とアナログ回路の境界部に、Ｄ／Ａコンバータ（Ｄｉｇｉｔａｌ−ｔｏ−ＡｎａｌｏｇＣｏｎｖｅｒｔｅｒ）もしくはＡ／Ｄコンバータ（Ａｎａｌｏｇ−ｔｏ−ＤｉｇｉｔａｌＣｏｎｖｅｒｔｅｒ）を使用する回路構成も可能である。さらに光回路としては、発光ダイオード（ＬｉｇｈｔＥｍｉｔｔｉｎｇＤｉｏｄｅ）、半導体レーザー（ＳｅｍｉｃｏｎｄｕｃｔｏｒＬａｓｅｒ）などの発光素子（ＬｉｇｈｔＥｍｉｔｔｅｒ）、フォトダイオード（Ｐｈｏｔｏｄｉｏｄｅ）などの受光素子（Ｐｈｏｔｏｄｅｔｅｃｔｏｒ）、光ファイバー（ＯｐｔｉｏｃａｌＦｉｂｅｒ）などの光導波路（ＯｐｔｉｃａｌＷａｖｅｇｕｉｄｅ）さらには光集積回路（ＯｐｔｉｃａｌＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）等を含むものである。 Further, the circuit in a broad sense at least appropriately includes a digital circuit (Digital Circuit), an analog circuit (Analog Circuit), an optical circuit (Optical Circuit), circuits (Circuitry), a processor (Processor), a memory (Memory) and the like. It is a circuit realized by combining them. That is, as a digital circuit, an application specific integrated circuit (ASIC), a programmable logic device (for example, a simple programmable logic device (Simple Programmable Logic Device: SPLD), a complex programmable logic device (Complex Logic Program)). CPLD), a field programmable gate array (Field Programmable Gate Array: FPGA), and the like. Examples of the analog circuit include resistors, capacitors (capacitors), passive elements (passive components) such as inductors, diodes (diodes), transistors (discrete semiconductors) such as thyristors (discrete semiconductors), and comparators. (Comparator) and other analog integrated circuits (Analog Integrated Circuit) and the like. A circuit configuration using a D/A converter (Digital-to-Analog Converter) or an A/D converter (Analog-to-Digital Converter) is also possible at the boundary between the digital circuit and the analog circuit. Further, as an optical circuit, a light emitting diode (Light Emitting Diode), a light emitting element (Light Emitter) such as a semiconductor laser (Semiconductor Laser), a light receiving element (Photodetector) such as a photodiode (Photodiode), and an optical fiber (Optical Optical) such as an optical fiber. It includes a waveguide (Optical Waveguide), an optical integrated circuit (Optical Integrated Circuit), and the like.

１．全体構成
第１節では、本発明に係る制御装置を含む制御システム１の全体構成について図面を用いて説明する。図１は、本実施形態に係る制御システム１の構成概要を示す図である。制御システム１は、制御装置２および制御対象３とを備え、これらが電気的に接続されたシステムである。制御対象３は二脚歩行などのロボット（後述）、移動体、ペースメーカー、電気回路系、化学反応系、通信ネットワーク、社会経済管理システム、金融システム、生体ネットワークおよび動植物など、運動・状態に関して周辺環境変化により、電気的、力学的もしくは化学的内部状態などが変動する特性を持ち、所望である動作を行なうために制御を必要とするものである。前記内部状態は、前記制御システム１の機能や動作に関わり、かつ検知可能なものであれば項目は限定されない。 1. Overall Configuration In Section 1, the overall configuration of a control system 1 including a control device according to the present invention will be described with reference to the drawings. FIG. 1 is a diagram showing a schematic configuration of a control system 1 according to the present embodiment. The control system 1 is a system that includes a control device 2 and a controlled object 3 and that are electrically connected to each other. The controlled object 3 is a robot such as a bipedal locomotive (described later), a moving body, a pacemaker, an electric circuit system, a chemical reaction system, a communication network, a socioeconomic management system, a financial system, a biological network, an animal and plant environment, and the surrounding environment with respect to movement and condition It has a characteristic that the electrical, mechanical or chemical internal state changes due to the change, and requires control in order to perform a desired operation. Items are not limited as long as the internal state is related to the function or operation of the control system 1 and can be detected.

１．１制御装置２
図１に示す通り、制御装置２は通信部２１と、記憶部２２と、制御部２３とを有し、これらの構成要素が制御装置２内部において通信バス２０を電気的に接続されている。以下、各構成要素についてさらに説明する。 1.1 Control device 2
As shown in FIG. 1, the control device 2 has a communication unit 21, a storage unit 22, and a control unit 23, and these components are electrically connected to the communication bus 20 inside the control device 2. Hereinafter, each component will be further described.

＜通信部２１＞
通信部２１は、制御対象３との間で情報の授受を行なうものである。ＵＳＢ、ＩＥＥＥ１３９４、Ｔｈｕｎｄｅｒｂｏｌｔ、有線ＬＡＮネットワーク通信等といった有線型の通信手段が好ましいものの、無線ＬＡＮネットワーク通信、５Ｇ／ＬＴＥ／３Ｇ等のモバイル通信、Ｂｌｕｅｔｏｏｔｈ（登録商標）通信等を必要に応じて含めてもよい。これらは一例であり、専用の通信規格を採用してもよい。すなわち、これら複数の通信手段の集合として実施することがより好ましい。 <Communication unit 21>
The communication unit 21 exchanges information with the controlled object 3. Wired communication means such as USB, IEEE 1394, Thunderbolt, and wired LAN network communication are preferable, but wireless LAN network communication, mobile communication such as 5G/LTE/3G, and Bluetooth (registered trademark) communication are included as necessary. Good. These are examples, and a dedicated communication standard may be adopted. That is, it is more preferable to implement it as a set of a plurality of these communication means.

図１においては、通信部２１から制御対象３内の状態検知部３１および駆動部３０それぞれ別に接続している様子を示しているが、物理的な接続はまとめて１つとし、制御対象３内部で論理的に分配する構成としても良い。 Although FIG. 1 shows a state in which the communication unit 21 is separately connected to the state detection unit 31 and the drive unit 30 in the controlled object 3, the physical connection is one and the inside of the controlled object 3 is shown. May be logically distributed in.

＜記憶部２２＞
記憶部２２は、様々な情報を記憶する揮発性または不揮発性の記憶媒体である。これは、例えばソリッドステートドライブ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ：ＳＳＤ）等のストレージデバイスとして、あるいは、プログラムの演算に係る一時的に必要な情報（引数、配列等）を記憶するランダムアクセスメモリ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ：ＲＡＭ）等のメモリとして実施されうる。また、これらの組合せであってもよい。 <Memory unit 22>
The storage unit 22 is a volatile or non-volatile storage medium that stores various information. This is, for example, as a storage device such as a solid state drive (SSD), or a random access memory (Random Access Memory) for storing temporarily necessary information (arguments, arrays, etc.) related to the calculation of a program. It may be implemented as a memory such as RAM). Also, a combination of these may be used.

特に、記憶部２２は、制御実行内容に関する各種パラメータ、制御対象３に関する形状、寸法、材質、重量などの個別特徴情報、最適化途中を含む連続制御時における過去の設定情報を記憶している。 In particular, the storage unit 22 stores various parameters related to control execution contents, individual characteristic information such as shape, size, material, and weight related to the controlled object 3, and past setting information at the time of continuous control including in the middle of optimization.

また、記憶部２２は、制御部２３によって実行される制御装置２に係る種々のプログラム等を記憶している。具体的には例えば、二脚歩行ロボットの様に複数の筋・腱および関節など複数の駆動要素を有する制御対象３に関する動作手順や、制御部２３を構成する基本制御信号生成部２３１、スパイク信号列生成部２３２、駆動信号生成部２３３で用いるパラメータ群の初期値や更新手順である。 Further, the storage unit 22 stores various programs and the like related to the control device 2 executed by the control unit 23. Specifically, for example, an operation procedure regarding the controlled object 3 having a plurality of driving elements such as a plurality of muscles/tendons and joints like a bipedal walking robot, a basic control signal generation unit 231, a spike signal that constitutes the control unit 23, and the like. These are initial values and update procedures of parameter groups used in the column generation unit 232 and the drive signal generation unit 233.

＜制御部２３＞
制御部２３は、制御装置２に関連する全体動作の処理・制御を行なう。制御部２３は、例えば不図示の中央処理装置（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ：ＣＰＵ）である。制御部２３は、記憶部２２に記憶された所定のプログラムを読み出すことによって、制御装置２に係る種々の機能を実現する。具体的には制御対象３毎に予め与えられた情報、制御対象３内状態検知部３１から通信部２１を介して受信した状態情報を元に、基本制御信号生成部２３１とスパイク信号列生成部２３２と駆動信号生成部２３３を通じて制御対象３への駆動信号ＡＳを生成し制御を実施する機能が該当する。 <Control unit 23>
The control unit 23 processes and controls the overall operation related to the control device 2. The control unit 23 is, for example, a central processing unit (CPU) (not shown). The control unit 23 realizes various functions of the control device 2 by reading out a predetermined program stored in the storage unit 22. Specifically, based on the information given in advance for each controlled object 3 and the state information received from the controlled object 3 internal state detection unit 31 via the communication unit 21, the basic control signal generation unit 231 and the spike signal sequence generation unit. The function of generating the drive signal AS to the controlled object 3 through the drive signal generating unit 232 and the drive signal generating unit 233 and executing the control is applicable.

すなわち、ソフトウェア（記憶部２２に記憶されている）による情報処理がハードウェア（制御部２３）によって具体的に実現されることで、基本制御信号生成部２３１、スパイク信号列生成部２３２、および駆動信号生成部２３３として実行されうる。なお、図１においては、単一の制御部２３として表記されているが、実際の構成はこれに限るものではなく、機能毎に複数の制御部２３を有するように実施してもよい。また、それらの組合せであっても良い。以下、基本制御信号生成部２３１、スパイク信号列生成部２３２、駆動信号生成部２３３についてさらに詳述する。 That is, the information processing by the software (stored in the storage unit 22) is specifically realized by the hardware (control unit 23), so that the basic control signal generation unit 231, the spike signal sequence generation unit 232, and the drive. It may be executed as the signal generator 233. In addition, in FIG. 1, it is described as a single control unit 23, but the actual configuration is not limited to this, and a plurality of control units 23 may be provided for each function. Also, a combination thereof may be used. Hereinafter, the basic control signal generator 231, the spike signal string generator 232, and the drive signal generator 233 will be described in more detail.

［基本制御信号生成部２３１］
基本制御信号生成部２３１はソフトウェア（記憶部２２に記憶されている）による情報処理がハードウェア（制御部２３）によって具体的に実現されているものである。基本制御信号生成部２３１は、通信部２１を介して制御対象３の状態検知部３１から得た状態情報、および制御対象３毎に予め与えられたパラメータを元に、非スパイク信号状である基本制御信号ＣＳを生成するものである。制御アルゴリズムは限定されるものではなく、フィードバック制御、フィードフォワード制御、モデル予測制御、深層学習を用いた制御など各種アルゴリズムが利用可能である。 [Basic control signal generation unit 231]
The basic control signal generation unit 231 is one in which information processing by software (stored in the storage unit 22) is specifically realized by hardware (control unit 23). The basic control signal generation unit 231 is a non-spike signal based on the state information obtained from the state detection unit 31 of the controlled object 3 via the communication unit 21 and the parameters given in advance for each controlled object 3. The control signal CS is generated. The control algorithm is not limited, and various algorithms such as feedback control, feedforward control, model predictive control, and control using deep learning can be used.

なお、後述する秩序創発機能を最大限利用するには、基本制御信号生成部２３１単体の周波数特性として、強い自然周波数／固有周波数ピークを持たない制御アルゴリズムとパラメータ設定が望ましい。 In order to maximize the use of the ordered emergence function, which will be described later, it is desirable that the basic control signal generation unit 231 has a frequency characteristic of a control algorithm and parameter setting that does not have a strong natural frequency/natural frequency peak.

［スパイク信号列生成部２３２］
スパイク信号列生成部２３２は、ソフトウェア（記憶部２２に記憶されている）による情報処理がハードウェア（制御部２３）によって具体的に実現されているもので、ハードウェアは前述したデジタル回路およびアナログ回路の組合せで構成される。 [Spike signal sequence generation unit 232]
The spike signal sequence generation unit 232 is one in which information processing by software (stored in the storage unit 22) is specifically realized by hardware (control unit 23), and the hardware is the digital circuit and analog described above. Composed of a combination of circuits.

スパイク信号列生成部２３２は、基本制御信号生成部２３１で生成された基本制御信号ＣＳを入力とし、スパイク信号列ＳＴを生成する要素であるニューロン（図１中不図示）を内包するものである。このスパイク信号列生成部２３２は、生体において確率的にインパルス状の活動電位を発生するニューロンネットワークすなわち確率的スパイキングニューロンネットワーク（ＳｔｏｃｈｓａｓｔｉｃａｌｌｙＳｐｉｋｉｎｇＮｅｕｒｏｎＮｅｔｗｏｒｋ：ｓＳＮＮ）と同等の動作をするものである。スパイク信号列生成部２３２内ニューロンとしては、ＬＩＦ（ｌｅａｋｙｉｎｔｅｇｒａｔｅ−ａｎｄ−ｆｉｒｅ）ニューロンを始め、ポアソン（Ｐｏｉｓｓｏｎ）スパイクモデルやホジキン−ハクスレイ（Ｈｏｄｇｋｉｎ−Ｈｕｘｌｅｙ）モデル、バースト発火可能なモデルなど、入力となる基本制御信号と擾乱を含む内部状態、および内部状態に関するダイナミクスによって規定されるタイミングでスパイク信号列を生成するモデルが適用可能である。 The spike signal train generation unit 232 receives the basic control signal CS generated by the basic control signal generation unit 231, and includes a neuron (not shown in FIG. 1) that is an element that generates the spike signal train ST. .. The spike signal sequence generation unit 232 operates similarly to a neuron network that stochastically generates impulse-like action potentials in a living body, that is, a stochastic spiking neuron network (sSNN). As the neurons in the spike signal sequence generation unit 232, there are LIF (leaky integrate-and-fire) neurons, Poisson spike models, Hodgkin-Huxley models, burst ignitable models, and the like. It is possible to apply a model that generates a spike signal train at a timing defined by a basic control signal and an internal state including a disturbance, and dynamics related to the internal state.

スパイク信号列生成部２３２（ｓＳＮＮ）内に複数のニューロンが存在する場合は、それらニューロン同士の間に任意のシナプス結合を有することも可能である。その際、全てのニューロンは同期発火しない様に設計される。具体的には、例えばニューロン毎に独立のノイズを受ける、あるいはニューロン毎に異なる発火閾値（後述）やリセット電位（後述）を設定する、などである。 When a plurality of neurons exist in the spike signal sequence generation unit 232 (sSNN), it is possible to have arbitrary synaptic connections between the neurons. At that time, all neurons are designed not to fire synchronously. Specifically, for example, each neuron receives an independent noise, or a different firing threshold (described later) or reset potential (described later) is set for each neuron.

ここでは、ＬＩＦニューロンの場合に関して数式を用いてより詳しく説明する。ｉ番目のスパイク信号列生成部２３２（ｓＳＮＮ）におけるｊ番目のＬＩＦニューロンは、下に記述する［数１］［数２］［数３］として表すことができる。

ここに、ｖ＿ｉｊは電位、変数上のドットは時間微分、γは減衰係数、ｂ＿ｉｊはバイアス入力（電流）、Ｉは入力信号、Ｄ＿ｉはｉ番目のスパイク信号列生成部２３２におけるノイズ強度、ξ＿ｉｊは単位強度のガウシアン（正規分布）ノイズ、ｖ＾θは発火閾値、ｖ＿ｉｊ＾Ｒはリセット電位、τ＿ｒｅｆは不能期間、ｋはスパイク信号発生順番号である。 Here, the case of the LIF neuron will be described in more detail using mathematical expressions. The j-th LIF neuron in the i-th spike signal sequence generation unit 232 (sSNN) can be expressed as [Equation 1] [Equation 2] [Equation 3] described below.

Here, v_ij is the potential, dots on the variables are time-differentiated, γ is the attenuation coefficient, b_ij is the bias input (current), I is the input signal, D_i is the noise intensity in the i-th spike signal train generation unit 232, and ξ_ij is Gaussian (normal distribution) noise of unit intensity, v^θ is a firing threshold, v_ij^R is a reset potential, τ_ref is an impossible period, and k is a spike signal generation sequence number.

対象であるＬＩＦニューロンの内部電位ｖ＿ｉｊがｖ＾θに到達すると発火し、数学的にはディラック（Ｄｉｒａｃ）のデルタ関数δで記述されるスパイク信号を生成すると共に、内部電位はｖ＿ｉｊ＾Ｒにリセットされ、τ＿ｒｅｆの不能期に入る。［数１］で示したＬＩＦニューロンではガウシアンノイズという形で擾乱を加える事により多数のＬＩＦが同期発火することを防いでいる。［数１］から明らかな通り、ノイズ成分印加以外に発火閾値ｖ＾θやリセット電位ｖ＿ｉｊ＾Ｒを個別に設定することでも同期発火を防ぐことが可能である。個々のＬＩＦニューロンにおけるスパイク信号列σ＿ｉｊは形式的に［数４］で記述することが出来る。

この際、ｉ番目のスパイク信号列生成部２３２に複数のＬＩＦニューロンが存在する場合は、各ＬＩＦニューロンから出力される全てのスパイク信号列を活用する。その際にはスパイク信号列の平均を用いる、あるいはＬＩＦニューロン毎に線形の重み付けを行なう方式が適用できる。そうして得られた新たなスパイク信号列をスパイク信号列生成部２３２の出力であるスパイク信号列ＳＴとする。 When the internal potential v_ij of the target LIF neuron reaches v^θ, it fires and mathematically generates a spike signal described by the Dirac delta function δ, and the internal potential is reset to v_ij^R. Then, the τ_ref is disabled. In the LIF neuron shown in [Equation 1], a large number of LIFs are prevented from firing synchronously by adding a disturbance in the form of Gaussian noise. As is clear from [Equation 1], it is possible to prevent the synchronous firing by individually setting the firing threshold v^θ and the reset potential v_ij^R in addition to the noise component application. The spike signal train σ_ij in each LIF neuron can be formally described by [Equation 4].

At this time, if a plurality of LIF neurons are present in the i-th spike signal train generation unit 232, all spike signal trains output from each LIF neuron are utilized. At that time, a method of using the average of the spike signal train or a method of linearly weighting each LIF neuron can be applied. The new spike signal train thus obtained is used as the spike signal train ST which is the output of the spike signal train generator 232.

すなわち、スパイク信号列生成部２３２は、ニューロンネットワークを構成する複数のニューロンを有し、複数のニューロンそれぞれによって出力される信号に基づいて、スパイク信号列ＳＴを生成可能に構成される。 That is, the spike signal string generation unit 232 has a plurality of neurons that form a neuron network, and is configured to be able to generate the spike signal string ST based on the signals output by each of the plurality of neurons.

［駆動信号生成部２３３］
駆動信号生成部２３３は、ソフトウェア（記憶部２２に記憶されている）による情報処理がハードウェア（制御部２３）によって具体的に実現されているもので、ハードウェアは前述したデジタル回路およびアナログ回路の組合せで構成される。 [Drive signal generator 233]
The drive signal generation unit 233 is one in which information processing by software (stored in the storage unit 22) is specifically realized by hardware (control unit 23), and the hardware is the digital circuit and analog circuit described above. It is composed of a combination of.

駆動信号生成部２３３は前記スパイク信号列生成部２３２で生成されたスパイク信号列ＳＴを制御対象３の駆動部３０に供給する駆動信号に変換する機能を有する。この駆動信号は生体ニューロンネットワークにおけるシナプス後電位（ｐｏｓｔｓｙｎａｐｔｉｃｐｏｔｅｎｔｉａｌ：ＰＳＰ）に基づいて生成されるものである。 The drive signal generation unit 233 has a function of converting the spike signal sequence ST generated by the spike signal sequence generation unit 232 into a drive signal to be supplied to the drive unit 30 of the controlled object 3. This drive signal is generated based on the post-synaptic potential (PSP) in the biological neuron network.

スパイク信号列ＳＴからシナプス後電位相当である駆動信号ＡＳを生成するにはシナプス類似の方式を用いることが出来る。具体的にはローパスフィルタ、古典的α関数状シナプスモデル、方形波シナプスモデル、ダイナミックシナプスモデルなどである。 A method similar to the synapse can be used to generate the drive signal AS corresponding to the post-synaptic potential from the spike signal train ST. Specifically, it is a low-pass filter, a classical α-functional synapse model, a square wave synapse model, a dynamic synapse model, and the like.

駆動信号生成部２３３における信号処理方式をローパスフィルタとした場合、出力の基となるシナプス後電位（ｐｏｓｔｓｙｎａｐｔｉｃｐｏｔｅｎｔｉａｌ：ＰＳＰ）ｙ＿ｉ、制御対象３駆動部３０の活性度Ａ＿ｉは、それぞれ［数５］［数６］で記述することが可能である。

ここに、τ＿ｓはシナプス時定数、Ｎは対象とするｉ番目のスパイク信号列生成部２３２内のＬＩＦニューロン数、ｇ＿ｉ＾Ａは増幅ゲインで、Ａ＿ｉ＾０はオフセットである。 When the signal processing method in the drive signal generation unit 233 is a low-pass filter, the post-synaptic potential (PSP) y_i that is the basis of the output and the activity A_i of the controlled target 3 drive unit 30 are respectively [Equation 5][ Equation 6] can be used.

Here, τ_s is a synapse time constant, N is the number of LIF neurons in the target i-th spike signal sequence generation unit 232, g_i^A is an amplification gain, and A_i^0 is an offset.

なお、図１では駆動信号ＡＳを駆動信号生成部２３３から駆動部３０間の直結線で伝えているが、Ａ／Ｄ（Ａｎａｌｏｇ−ｔｏ−Ｄｉｇｉｔａｌ）変換を行った後、デジタル信号として通信部２１を介して駆動部３０に伝達する構成も可能である。 Although the drive signal AS is transmitted from the drive signal generation unit 233 to the drive unit 30 by a direct connection in FIG. 1, after the A/D (Analog-to-Digital) conversion is performed, the communication unit 21 outputs a digital signal. It is also possible to adopt a configuration in which the signal is transmitted to the drive unit 30 via.

１．２制御対象３
制御対象３は具体的には例えば、機械的なタスクを実行するロボットや移動システムなどである。なお本発明の制御装置２はタスクの技術分野を限定するものでは無く、動的に作用するもので、かつ環境変動に伴う動作状態変化を検出可能なものであれば、電気回路システムや化学反応システムも制御対象３とすることが可能である。 1.2 Control target 3
The controlled object 3 is specifically, for example, a robot or a moving system that executes a mechanical task. Note that the control device 2 of the present invention does not limit the technical field of the task, but can be an electric circuit system or a chemical reaction system as long as it is a dynamically acting one and can detect a change in operating state due to environmental changes. The system can also be the controlled object 3.

［駆動部３０］
駆動部３０は、タスクを実行する際に、外部からの制御信号に基づき制御対象３を動作させるものである。具体的には例えば、制御対象３がロボットの場合におけるモーター、空圧・油圧などのアクチュエータなどであるが、これらに限定されるものでは無い。 [Drive unit 30]
The drive unit 30 operates the controlled object 3 based on a control signal from the outside when executing a task. Specifically, for example, when the controlled object 3 is a robot, it is a motor, an actuator such as pneumatic/hydraulic, or the like, but is not limited to these.

［状態検知部３１］
状態検知部３１は、制御対象３が動作時に外乱などによる環境変動があった場合を含めて制御対象３の内部状態を検知するものである。内部状態としては、制御対象３における注目する箇所の位置、速度、加速度、回転、角速度、角加速度、力およびモーメントなどの機械的力学的情報、電圧、電流および抵抗などの電気的情報、音や光の物理的情報、温度、圧力、流速などの流体力学情報、濃度、ｐＨ、分子量などの化学的情報等であるが、これらに限定されるものでは無い。検知した内部状態は状態情報として通信部２１に対して送信できる構成となっている。すなわち換言すると、状態情報とは、前記制御対象における注目する箇所において、制御対象の挙動および環境変動により変化する内部状態を示す情報である。 [State detection unit 31]
The state detection unit 31 detects the internal state of the controlled object 3 including the case where the controlled object 3 changes in environment due to disturbance during operation. The internal state includes the position of a point of interest in the controlled object 3, velocity, acceleration, rotation, angular velocity, angular acceleration, mechanical mechanical information such as force and moment, electrical information such as voltage, current and resistance, and sound and It is, but not limited to, physical information of light, fluid dynamics information such as temperature, pressure and flow velocity, and chemical information such as concentration, pH and molecular weight. The detected internal state can be transmitted to the communication unit 21 as state information. That is, in other words, the state information is information indicating an internal state that changes due to the behavior of the control target and environmental changes at the point of interest in the control target.

２．制御システム１の最適化方法
第２節では、制御システム１において、制御装置２のパラメータ最適化方法について説明する。ここでは、例えば制御対象３が二脚歩行ロボットの場合であれば、指定した地点へ移動するという作業を実行するなどであるが、本発明の最適化方法は制御対象３やその作業の種類により限定されるものでは無い。 2. Optimization Method of Control System 1 Section 2 describes a parameter optimization method of the control device 2 in the control system 1. Here, for example, when the controlled object 3 is a bipedal walking robot, a task of moving to a designated point is executed, but the optimization method of the present invention depends on the controlled object 3 and the type of the task. It is not limited.

外乱が少ない環境において、複数回の基本タスクを繰り返し実行することで、前記制御装置２のパラメータを最適化する際の最適化フローを図２に示す。 FIG. 2 shows an optimization flow for optimizing the parameters of the control device 2 by repeatedly executing the basic tasks a plurality of times in an environment with little disturbance.

［最適化開始］
（ステップＳ１）
基本制御信号生成部２３１、スパイク信号列生成部２３２、駆動信号生成部２３３内の各パラメータ群を初期化する。初期化に用いるパラメータ値は記憶部２２に記憶されている情報を用いることができる。記憶部２２に記憶されている情報とは不揮発的に継続して記憶されている情報だけでなく、ユーザーが作業開始時に制御対象３の個々の特徴および外部環境状況を鑑みて外部から入力した情報も含む。 [Start optimization]
(Step S1)
Each parameter group in the basic control signal generation unit 231, the spike signal sequence generation unit 232, and the drive signal generation unit 233 is initialized. Information stored in the storage unit 22 can be used as the parameter value used for the initialization. The information stored in the storage unit 22 is not only the information continuously stored in a non-volatile manner, but also information input by the user from the outside in consideration of the individual characteristics of the controlled object 3 and the external environmental condition at the start of work. Including.

基本制御信号生成部２３１では採用したアルゴリズムにて使用されるパラメータを初期設定する。具体的には、例えばフィードバック制御の一種であるＰＩＤ（Ｐｒｏｐｏｒｔｉｏｎａｌ−Ｉｎｔｅｇｒａｌ−Ｄｉｆｆｅｒｅｎｔｉａｌ）制御では、比例ゲインＫ＿Ｐ、積分ゲインＫ＿Ｉ、微分ゲインＫ＿Ｄなどである。 The basic control signal generation unit 231 initializes the parameters used in the adopted algorithm. Specifically, for example, in PID (Proportional-Integral-Differential) control, which is one type of feedback control, a proportional gain K_P, an integral gain K_I, a differential gain K_D, and the like are used.

スパイク信号列生成部２３２では確率的スパイキングニューロンネットワーク（ｓＳＮＮ）としてのあらゆるパラメータを初期設定する。例えばスパイク信号列生成部２３２をＬＩＦニューロンで構成した場合、ニューロン数Ｎや前記［数１］［数２］に含まれる変数、具体的には減衰係数γ、バイアス入力ｂ＿ｉｊ、ノイズ強度Ｄ＿ｉ、発火閾値ｖ＾θ、リセット電位ｖ＿ｉｊ＾Ｒ、不能期間τ＿ｒｅｆなどである。 The spike signal sequence generation unit 232 initializes all parameters as a stochastic spiking neuron network (sSNN). For example, when the spike signal sequence generation unit 232 is configured by LIF neurons, the number of neurons N and variables included in the [Formula 1] and [Formula 2], specifically, the attenuation coefficient γ, the bias input b_ij, the noise intensity D_i, and the firing The threshold v^θ, the reset potential v_ij^R, and the disabled period τ_ref.

駆動信号生成部２３３では、スパイク信号列ＳＴから駆動信号ＡＳを生成するのに採用した方式に関するパラメータを初期設定する。例えばローパスフィルタ方式を採用した場合、時定数τや通過域利得などをフィルタ特性値として設定を行なう。駆動信号生成部２３３の全てもしくは一部を電気的なアナログ回路として構成する場合は、抵抗の抵抗値、コンデンサの容量値などで固定、もしくは半固定的に予め設定しておくことも可能である。 The drive signal generation unit 233 initializes parameters relating to the method adopted to generate the drive signal AS from the spike signal train ST. For example, when the low-pass filter method is adopted, the time constant τ, the passband gain, etc. are set as filter characteristic values. When all or part of the drive signal generation unit 233 is configured as an electrical analog circuit, it is possible to set the resistance value of the resistor, the capacitance value of the capacitor, or the like fixedly or semi-fixedly in advance. ..

（ステップＳ２）
基本制御信号生成部２３１、スパイク信号列生成部２３２、駆動信号生成部２３３内の各パラメータの更新を行なう。全てのパラメータは更新対象となりうるが、制御部２３全体としての概略の方向性は基本制御信号ＣＳに大きく依存するため、基本制御信号生成部２３１部の最適化を主たる対象とするのが望ましい。スパイク信号列生成部２３２および駆動信号生成部２３３に関しては、例えばニューロン数Ｎ、シナプス時定数τ＿ｓ、ノイズ強度Ｄを更新対象とし、その他シナプス結合強度に関するパラメータなどは更新しないという制御方法を取ることが可能である。後述のステップＳ４により収束していないと判定される毎に、前記各パラメータが更新される。 (Step S2)
The parameters in the basic control signal generation unit 231, the spike signal sequence generation unit 232, and the drive signal generation unit 233 are updated. Although all parameters can be updated, the general directionality of the control unit 23 as a whole largely depends on the basic control signal CS, and therefore it is desirable to mainly optimize the basic control signal generation unit 231. Regarding the spike signal sequence generation unit 232 and the drive signal generation unit 233, for example, a control method may be adopted in which the number of neurons N, the synapse time constant τ_s, and the noise intensity D are the update targets, and other parameters related to the synapse connection intensity are not updated. It is possible. Each time the parameter is determined not to converge in step S4 described below, the parameters are updated.

（ステップＳ３）
制御対象３が制御装置２からの制御に従い基本タスクを実行する。状態検知部３１にて制御対象３における前述した各種内部状態を検知し、状態情報を制御装置２内通信部２１に送信する。基本制御信号生成部２３１にて制御対象３全体としての評価値を計算する。後述のステップＳ４により収束していないと判定される毎に、基本タスクの実行も１回目、２回目、３回目と回数が増えていく。 (Step S3)
The controlled object 3 executes a basic task under the control of the control device 2. The state detection unit 31 detects the above-described various internal states of the control target 3 and transmits the state information to the communication unit 21 in the control device 2. The basic control signal generation unit 231 calculates the evaluation value of the controlled object 3 as a whole. Every time it is determined in step S4 described later that the basic task is not converged, the number of times of execution of the basic task is increased to the first time, the second time, and the third time.

（ステップＳ４）
基本制御信号生成部２３１において、システム全体が収束しているかどうかを判定する。収束していない（ＮＯ）と判定された場合は、ステップＳ２に戻ってパラメータ更新作業から継続する。パラメータ更新時における学習アルゴリズムとしては、遺伝アルゴリズムなどの進化戦略を適用することが可能であるが、それに限定するものでは無い。収束している（ＹＥＳ）と判定された場合は作業を終了する。
［最適化終了］ (Step S4)
The basic control signal generation unit 231 determines whether or not the entire system has converged. If it is determined that the values have not converged (NO), the process returns to step S2 to continue from the parameter updating work. As a learning algorithm at the time of updating parameters, an evolution strategy such as a genetic algorithm can be applied, but the learning algorithm is not limited thereto. If it is determined that they have converged (YES), the work ends.
[End of optimization]

３．秩序創発機能
第３節では、本発明の構成における制御装置２が有する秩序創発機能について詳述する。これは、基本制御信号生成部２３１にて生成した基本制御信号ＣＳから、一旦スパイク信号列生成部２３２（ｓＳＮＮ）を用いてスパイク信号列ＳＴを生成し、さらにその後駆動信号生成部２３３にて駆動信号ＡＳを生成する構成を有する、本発明における制御装置２固有の機能であり従来知られたものでは無い。 3. Order emergence function In Section 3, the order emergence function of the control device 2 in the configuration of the present invention will be described in detail. This is to generate a spike signal train ST using the spike signal train generator 232 (sSNN) from the basic control signal CS generated by the basic control signal generator 231, and then drive the spike signal train ST with the drive signal generator 233. This is a function unique to the control device 2 of the present invention having a configuration for generating the signal AS, and is not conventionally known.

本節では、秩序創発機能を示す例として、２つの確率的スパイキングニューロンネットワーク（ｓＳＮＮ）を有する場合を図３に示す。ここでは図３の水平軸上に存在する粒子の位置を制御するものとする。図３中Ｓ＿０、Ｓ＿１がｓＳＮＮで、本発明におけるスパイク信号列生成部２３２と駆動信号生成部２３３を内包するものとする。２つのｓＳＮＮはそれぞれ入力信号としてＩ＿０（ｔ）、Ｉ＿１（ｔ）を受け取る。ここでは入力信号Ｉ＿＊（ｔ）は、粒子の現在位置ｘ（ｔ）と目標位置ｘ＿０＾ｇ、ｘ＿１＾ｇの差分量として定義している。スパイク信号列生成部２３２としてはＬＩＦニューロン（第１節参照）、駆動信号生成部２３３としてはローパスフィルタ（第１節参照）を使用することとする。 In this section, as an example showing the order emergence function, a case having two stochastic spiking neuron networks (sSNN) is shown in FIG. Here, the position of particles existing on the horizontal axis in FIG. 3 is controlled. In FIG. 3, S_0 and S_1 are sSNN, and include the spike signal train generation unit 232 and the drive signal generation unit 233 of the present invention. The two sSNNs receive I_0(t) and I_1(t) as input signals, respectively. Here, the input signal I_*(t) is defined as a difference amount between the current position x(t) of the particle and the target positions x_0^g and x_1^g. A LIF neuron (see Section 1) is used as the spike signal sequence generation unit 232, and a low-pass filter (see Section 1) is used as the drive signal generation unit 233.

３．１エントロピー減少・パターン形成機能
図４Ａに、３重井戸ポテンシャル関数における質量を持った粒子の位置をｓＳＮＮによって制御する場合を示す。ここでは中心（ｘ＝０）はポテンシャルの極小値ではあるが最小値ではなく、中心の両側にポテンシャルが最小となる場所が存在する点に留意されたい。 3.1 Entropy reduction/pattern formation function Fig. 4A shows a case where the position of a particle having a mass in the triple well potential function is controlled by sSNN. It should be noted here that the center (x=0) is not the minimum value but the minimum value of the potential, and there are places where the potential is minimum on both sides of the center.

図４Ａ環境下での粒子位置移動状態のシミュレーション結果を図５Ａ、図５Ｂに示す。横軸は時間ｔ、縦軸は粒子位置ｘを示す。また図５Ａはニューロン数Ｎ＝２、図５Ｂはニューロン数Ｎ＝１５０の場合である。ＡｐＥｎは移動状態から算出したエントロピー（ＡｐｐｒｏｘｉｍａｔｅＥｎｔｒｏｐｙ）である。ニューロン数が少ない図５Ａは図５Ｂに比して粒子の移動量の絶対値は大きいが、これは２箇所存在するポテンシャル最小位置を周期的に移動していることが理由であり、その規則的な周期性のためエントロピーＡｐＥｎとしては小さい値となっている。この様に確率的スパイキングニューロンネットワーク（ｓＳＮＮ）においてはスパイク性が高いほど、エントロピー減少機能、パターン形成機能が発現する。 5A and 5B show the simulation results of the particle position movement state under the environment of FIG. 4A. The horizontal axis represents time t, and the vertical axis represents particle position x. Further, FIG. 5A shows the case where the number of neurons N=2, and FIG. 5B shows the case where the number of neurons N=150. ApEn is entropy (Approximate Entropy) calculated from the moving state. In FIG. 5A, in which the number of neurons is small, the absolute value of the amount of movement of particles is larger than that in FIG. 5B, but this is because the potential minimum position existing in two places is moved periodically, Due to such periodicity, the entropy ApEn has a small value. As described above, in the stochastic spiking neuron network (sSNN), the higher the spike property, the more the entropy reducing function and the pattern forming function are expressed.

３．２目標状態の引き込み領域拡大機能
図４Ｂに、２重井戸ポテンシャル関数における質量を持った粒子の位置をｓＳＮＮによって制御する場合を示す。ここでは中心（ｘ＝０）はポテンシャルの極大値となっており、車の山登り問題（ｍｏｕｎｔａｉｎｃａｒｔａｓｋ）と同様に、谷底からポテンシャルの極大値ｘ＝０に直接到達することは出来ず、反動や外力の助けを必要とする問題設定とする。 3.2 Function of Enlarging Entrainment Area in Target State FIG. 4B shows a case where the position of a particle having a mass in the double well potential function is controlled by sSNN. Here, the center (x=0) is the maximum value of the potential, and as with the mountain climbing problem (mountain car task), the maximum value of the potential x=0 cannot be reached directly from the bottom of the valley, and there is a reaction. The problem setting requires the help of external force.

粒子の初期位置ｘ＿０と初期速度ｖ＿０を様々に変更して、一定時間以上中心付近［−０．１，０．１］の範囲内にとどまることが出来た場合を引き込み領域と定義してシミュレーションした結果を図６Ａ、図６Ｂに示す。図６Ａはニューロン数Ｎ＝１、図６Ｂはニューロン数Ｎ＝１００の場合である。図６Ａ、図６Ｂ中白い領域が引き込み領域である。また、バイアス入力ｂをパラメータにしてニューロン数Ｎを変化させた場合における引き込み領域割合（ｂａｓｉｎｒａｔｅ）のシミュレーション結果を図６Ｃに示す。図６Ａ、図６Ｂ、図６Ｃから明らかな様に確率的スパイキングニューロンネットワーク（ｓＳＮＮ）は引き込み領域を拡大する機能を有しており、ｓＳＮＮに含まれるニューロン数は少ない方が引き込み領域拡大機能を強く発現する場合が多い。 The initial position x_0 and the initial velocity v_0 of the particle were changed variously, and the case where the particle could stay within the range of the center [-0.1, 0.1] for a certain time or longer was defined as the pull-in area for simulation. The results are shown in FIGS. 6A and 6B. 6A shows the case where the number of neurons N=1, and FIG. 6B shows the case where the number of neurons N=100. White areas in FIGS. 6A and 6B are pull-in areas. Further, FIG. 6C shows a simulation result of a pull-in area ratio (basin rate) when the number N of neurons is changed with the bias input b as a parameter. As is clear from FIGS. 6A, 6B, and 6C, the stochastic spiking neuron network (sSNN) has a function of enlarging the attraction region, and the smaller the number of neurons included in the sSNN, the greater the attraction region. Often expressed strongly.

３．３自然周波数へのバインディング機能
図４Ｃに、バネマス系における質量を持った粒子の位置をｓＳＮＮによって制御する場合を示す。ここでバネマス系とは１重井戸ポテンシャル関数と等しい。何も制御を行わないバネマス系では、ばね定数ｋと粒子の質量ｍで定まる自然周波数ｆ＿０（固有周波数）を有している。そのバネマス系に通常のフィードバック制御を行なうと、フィードバック制御のゲインなどの影響により自然周波数ｆ＿０が変調されることが知られている。
中心位置（ｘ＾ｇ＝０）を目標としてｓＳＮＮによる制御を実施し、自然周波数ｆ＿０に対するＳＮＲ（ｓｉｇｎａｌ−ｔｏ−ｎｏｉｓｅｒａｔｉｏ）をシミュレーションした結果を図７に示す。図７中、横軸はシナプス時定数τ＿ｓ、縦軸は増幅ゲインｇ＾Ａであり、より白い領域がＳＮＲが高いことを示している。自然周波数ｆ＿０が１〜１０Ｈｚという非常に広いパラメータ領域において自然周波数ｆ＿０への共鳴現象が確認できる。また、多くの領域で白い縞模様が垂直方向に伸びている事から、確率的スパイキングニューロンネットワーク（ｓＳＮＮ）を用いてバネマス系を駆動する場合、自然周波数ｆ＿０にほとんど影響を与えてない事が明白である。 3.3 Binding Function to Natural Frequency FIG. 4C shows a case where the position of a particle having a mass in the spring-mass system is controlled by sSNN. Here, the spring-mass system is equal to the single well potential function. A spring-mass system in which no control is performed has a natural frequency f_0 (natural frequency) determined by the spring constant k and the mass m of particles. It is known that when the normal feedback control is performed on the spring mass system, the natural frequency f_0 is modulated due to the influence of the gain of the feedback control.
FIG. 7 shows the result of simulating the SNR (signal-to-noise ratio) with respect to the natural frequency f_0 by performing control by sSNN with the center position (x^g=0) as the target. In FIG. 7, the horizontal axis represents the synaptic time constant τ_s, the vertical axis represents the amplification gain g^A, and the whiter region indicates that the SNR is high. A resonance phenomenon to the natural frequency f_0 can be confirmed in a very wide parameter range where the natural frequency f_0 is 1 to 10 Hz. In addition, since the white stripe pattern extends in the vertical direction in many regions, when the spring-mass system is driven using the stochastic spiking neuron network (sSNN), there is almost no effect on the natural frequency f_0. It's obvious.

４．ロボット制御システム
第４節では、実施形態として、制御対象３としてロボット、さらに具体的には筋骨格ロボットを用いた二脚歩行ロボット制御システムのシミュレーション結果を説明する。 4. Robot Control System In Section 4, a simulation result of a bipedal robot control system using a robot as a control target 3, more specifically, a musculoskeletal robot will be described as an embodiment.

図８にシミュレーションに用いた筋骨格ロボット制御システムの機能概略図を示す。図８左側が骨格（リンク）、関節（ジョイント）、筋（図中Ｍｕｓｃｌｅの線、一部省略）の構成を示しており、ロボット駆動部３０として各脚毎に８本の筋および多関節筋を接続している。また、ロボット状態検知部３１（図８中ＳｅｎｓｏｒｙＩｎｐｕｔ）として、筋発生力、筋長、関節角、上体姿勢、重心位置、足裏反力、各骨（リンク）において９軸慣性計測装置（ＩｎｅｒｔｉａＭｅａｓｕｒｅｍｅｎｔＵｎｉｔ）によって得られる３軸姿勢、３軸加速度、３軸角速度を測定可能な構成である。 FIG. 8 shows a functional schematic diagram of the musculoskeletal robot control system used in the simulation. The left side of FIG. 8 shows a structure of a skeleton (link), a joint (joint), and a muscle (muscle line in the drawing, a part of which is omitted). As the robot driving unit 30, eight muscles and multi-joint muscles are provided for each leg. Are connected. Further, as the robot state detection unit 31 (Sensory Input in FIG. 8), a muscle generating force, a muscle length, a joint angle, a body posture, a center of gravity position, a sole reaction force, a 9-axis inertial measurement device for each bone (link) ( This is a configuration capable of measuring the triaxial posture, triaxial acceleration, and triaxial angular velocity obtained by the Inertia Measurement Unit).

左右各脚は静止、振り動作など複数の相を有しており、各相毎に異なる反射活性化ルールを持っている。反射活性化ルール（図８中ＲｅｆｌｅｘＳｙｓｔｅｍ：基本制御信号生成部２３１）は発生力のポジティブフィードバック制御ルール、筋長のフィードバック制御ルール、関節角あるいは上体姿勢のＰＤ（比例微分）制御ルールの組合せで構築される。 Each of the left and right legs has a plurality of phases such as a stationary motion and a swing motion, and each phase has a different reflex activation rule. The reflex activation rule (Reflex System: basic control signal generation unit 231 in FIG. 8) is a combination of a positive feedback control rule for generated force, a feedback control rule for muscle length, and a PD (proportional derivative) control rule for joint angle or body posture. Built in.

図９Ａ、図９Ｂ、図９Ｃには、滑りやすい低摩擦環境での重心移動速度シミュレーション結果を示す。図９Ａ、図９Ｂの横軸が時間ｔ、縦軸が重心移動速度ｖ＾ｇである。通常（図中破線）の摩擦係数μは１０としているが、低摩擦環境（図中実線）では時間ｔ＝［１０，４０］にて摩擦係数μを０．０４と低く設定している。図９Ａ、図９Ｂより通常環境、低摩擦環境ともに安定した二脚歩行動作が行われている。そのとき通常環境よりも低摩擦環境の方が全体に低速度側にシフトしている。図９Ｃは歩容周波数と振幅の関係を示したもので、低摩擦環境では低周波数側に遷移している。 FIG. 9A, FIG. 9B, and FIG. 9C show the results of center-of-gravity movement speed simulation in a slippery low-friction environment. 9A and 9B, the horizontal axis represents time t, and the vertical axis represents the center-of-gravity moving speed v^g. The friction coefficient μ is normally 10 (broken line in the figure), but in a low friction environment (solid line in the figure), the friction coefficient μ is set as low as 0.04 at time t=[10, 40]. As shown in FIGS. 9A and 9B, stable bipedal walking is performed in both the normal environment and the low friction environment. At that time, the low friction environment shifts to the lower speed side as a whole than the normal environment. FIG. 9C shows the relationship between the gait frequency and the amplitude, and transitions to the low frequency side in a low friction environment.

図１０には大きな滑りが発生した状況における協調運動能力シミュレーション結果を示す。図１０Ａ、図１０Ｃは横軸ｘが位置を示し、ｘ＝［４，１６］を低摩擦区間とし摩擦係数μ＝０．０４、それ以外は通常で摩擦係数μ＝１０である。なお、図１０Ａのサンプリング間隔は０．１ｓ、図１０Ｃのサンプリング間隔は０．２５ｓである。低摩擦区間（図中ｓｌｉｐｐｅｒｙ帯）上にある、黒色帯は右足の滑り、灰色帯は左足の滑りを示している。図１０Ｂ、図１０Ｄは、それぞれ図１０Ａ、図１０Ｃに対応する時間ｔと重心移動速度ｖ＾ｇの関係を示している。 FIG. 10 shows the results of the cooperative motor performance simulation in the situation where a large slip has occurred. In FIGS. 10A and 10C, the horizontal axis x indicates the position, x=[4,16] is the low friction section, and the friction coefficient μ=0.04, and otherwise the friction coefficient μ=10. The sampling interval in FIG. 10A is 0.1 s, and the sampling interval in FIG. 10C is 0.25 s. On the low friction zone (slippery band in the figure), the black band indicates the slip of the right foot, and the gray band indicates the slip of the left foot. 10B and 10D show the relationship between the time t and the center-of-gravity moving speed v^g corresponding to FIGS. 10A and 10C, respectively.

図１０Ａ、図１０Ｂにおいては、０．５ｓ以上の時間、０．５ｍ程度の滑りが発生しているが、それに適応して歩行が継続出来ている。この際、左右の足で滑る距離が非対称になっている点に留意されたい。また、図１０Ｃ、図１０Ｄでは低摩擦区間終了地点であるｘ＝１６ｍ、ｔ＝１３ｓ付近でｖ＾ｇが極端に下がっており、これは低摩擦区間終了直前におけるやや長めの右足滑りから通常区間に入るときに転倒寸前の状態となったことを示している。この状況でも、つま先などが通常区間（μ＝１０）である滑りづらい地面に接触していることを足がかりとして、正常歩行に復帰することが出来ている。この様に、従来知られている反射回路のみの制御では実現困難であった、非常に高い適応能力を、本発明による制御装置２を用いたロボット制御システムは有している。 In FIG. 10A and FIG. 10B, a slip of about 0.5 m occurs for a time of 0.5 s or more, but the walking can be adapted to this. At this time, it should be noted that the sliding distance between the left and right feet is asymmetric. Further, in FIGS. 10C and 10D, v^g is extremely decreased near x=16 m, t=13 s, which is the end point of the low friction section, which is a little longer from the right foot slip just before the end of the low friction section to the normal section. It indicates that the vehicle was about to fall when entering. Even in this situation, it is possible to return to normal walking by using the fact that the toes and the like are in contact with the non-slip ground, which is the normal section (μ=10), as a foothold. As described above, the robot control system using the control device 2 according to the present invention has a very high adaptability, which has been difficult to realize by the conventionally known control of only the reflection circuit.

本節で説明した二脚歩行ロボット制御システムにおける協調動作には、運動系列のエントロピーを低減する必要がある。また、転倒回避にはＺＭＰ（ｚｅｒｏ−ｍｏｍｅｎｔｐｏｉｎｔ）をある範囲内に制御する必要もある。第３節にて説明した通り、本発明における制御装置２が有する秩序創発機能（３．１エントロピー減少・パターン形成機能、３．２目標状態の引き込み領域拡大機能を参照されたい）が有効に働くことで、即時的な転倒回避機能が実現されていると言える。 For the coordinated operation in the bipedal robot control system described in this section, it is necessary to reduce the entropy of the motion sequence. Further, in order to avoid falling, it is necessary to control ZMP (zero-moment point) within a certain range. As described in Section 3, the ordered emergence function (see 3.1 Entropy reduction/pattern formation function, 3.2 Target state expansion region expansion function) of the control device 2 of the present invention works effectively. Therefore, it can be said that the instant fall avoidance function is realized.

５．変形例
なお、次のような態様によって、本実施形態を更に創意工夫してもよい。 5. Modification Note that the present embodiment may be further devised in the following manner.

第４節では、二脚歩行ロボット制御システムの実施形態について説明したが、一般に移動システムでは移動に伴い外部環境の変動を伴うものであり、秩序創発機能を有する本発明の制御装置２の特徴を活かすことが出来る。また、秩序創発機能は、無人による完全自律型制御システムとして、あるいは有人システムの補助的な制御システムどちらでも活用することが出来る。移動システムとしては、具体的には例えば、多足歩行ロボット、車輪・キャタピラ型ロボット、無人航空機（ＵｎｍａｎｎｅｄＡｅｒｉａｌＶｅｈｉｃｌｅ：ＵＡＶ、ドローン）、無人水上艇（ＵｎｍａｎｎｅｄＳｕｒｆａｃｅＶｅｈｉｃｌｅ：ＵＳＶ）、無人潜水艇（ＵｎｍａｎｎｅｄＵｎｄｅｒｗａｔｅｒＶｅｈｉｃｌｅ：ＵＵＶ）、自動運転を含む自動車、航空機、船舶などであるが、これらに限定されるものでは無い。 In the fourth section, the embodiment of the bipedal walking robot control system has been described. However, in the mobile system, the external environment generally changes with the movement, and the characteristics of the control device 2 of the present invention having the order emergence function are described. You can take advantage of it. In addition, the order emergence function can be utilized either as an unmanned fully autonomous control system or as an auxiliary control system of a manned system. Specific examples of the moving system include a multi-legged walking robot, a wheel/caterpillar robot, an unmanned aircraft (Unmanned Aerial Vehicle: UAV, drone), an unmanned surface vehicle (USV), and an unmanned submersible (Unmanned). Underwater Vehicle (UUV), automobiles including autonomous driving, aircraft, ships, etc., but are not limited to these.

さらには、運搬や加工に関して作業対象物が頻繁に変更される産業用・医療用・農業用・家庭用のロボットにも適用可能である。 Furthermore, it is also applicable to industrial, medical, agricultural, and household robots whose work objects are frequently changed in transportation and processing.

本発明による制御装置２が持つ秩序創発機能は、状態の変動に適応する必要がある制御システム１であれば技術分野を限定するものでは無い。すなわち機械的な運動に対するものだけでは無く、電気的変動あるいは化学反応的変動に対しても発揮することが可能である。さらには、金融システムの制御やインターネットなどのコミュニケーションネットワークにおける情報の流入・流出・伝播の制御、空調システム、などにも適用可能である事が期待できる。したがって、例えば秩序創発機能のうち自然周波数へのバインディング機能（３．３参照）を活用した心臓ペースメーカーや人工心肺などへ応用することも可能である。 The order emergence function of the control device 2 according to the present invention is not limited to the technical field as long as it is the control system 1 that needs to adapt to changes in the state. That is, it can be exerted not only for mechanical movement but also for electrical fluctuation or chemical reaction fluctuation. Furthermore, it can be expected to be applicable to control of financial systems, control of inflow/outflow/propagation of information in communication networks such as the Internet, and air conditioning systems. Therefore, for example, it can be applied to a cardiac pacemaker or an artificial heart-lung machine that utilizes the function of binding to natural frequencies (see 3.3) among the function of emergence of order.

本発明におけるスパイク信号列生成部２３２および駆動信号生成部２３３は、図１に示した様に、基本制御信号生成部２３１、通信部２１、記憶部２２などと共に１つの制御装置２として構成することも可能であるが、スパイク信号列生成部２３２と駆動信号生成部２３３を外付け制御装置とすることも可能である。 As shown in FIG. 1, the spike signal train generation unit 232 and the drive signal generation unit 233 according to the present invention should be configured as one control device 2 together with the basic control signal generation unit 231, the communication unit 21, the storage unit 22, and the like. However, the spike signal train generation unit 232 and the drive signal generation unit 233 can be used as an external control device.

図１１に機能ブロック図を示す。図１１中、２ｂが基本制御信号ＣＳを生成する基本制御装置、３が制御対象である。基本制御装置２ｂと制御対象３の組合せだけでも従来方式の制御は可能であるが、その従来制御を補足すべく外付け制御装置２ａを接続している。外付け制御装置２ａにはスパイク信号列生成部２３２および駆動信号生成部２３３が配備されている。既存の制御システムに外付け制御装置２ａを追加することで、第３節で説明した秩序創発機能を活用することが可能となり、制御システム１の機能・性能を向上することが出来る。 FIG. 11 shows a functional block diagram. In FIG. 11, 2b is a basic control device that generates a basic control signal CS, and 3 is a control target. Although the conventional control is possible only by combining the basic control device 2b and the controlled object 3, the external control device 2a is connected to supplement the conventional control. The external control device 2a is provided with a spike signal train generation unit 232 and a drive signal generation unit 233. By adding the external control device 2a to the existing control system, the order emergence function described in Section 3 can be utilized and the function/performance of the control system 1 can be improved.

さらには、スパイク信号列は撹乱に起因して確率的に生成されるものに限らず、十分な複雑さと予測不能性を含むならばカオスなどを利用して決定論的に生成されたスパイク信号列であっても同等の機能を得る事が出来る。 Furthermore, spike signal sequences are not limited to those generated stochastically due to disturbance, and spike signal sequences generated deterministically using chaos, etc. if they have sufficient complexity and unpredictability. However, the same function can be obtained.

６．結言
以上のように、本実施形態によれば、予期しない外的環境の変化が発生しても、自律的に反応し、制御システム１全体が所望の動作を行なうことを可能とする制御装置２を実施することが出来る。 6. Conclusion As described above, according to the present embodiment, even if an unexpected external environment change occurs, the control device 2 that reacts autonomously and enables the entire control system 1 to perform a desired operation. Can be implemented.

かかる制御装置２は、駆動信号を制御対象３に供給することで前記制御対象３を制御可能に構成されるもので、スパイク信号列生成部２３２と駆動信号生成部２３３とを備え、前記スパイク信号列生成部２３２は、前記制御対象３を制御するための基本制御信号ＣＳおよび擾乱を含む内部状態によって規定されるタイミングで、スパイク信号列ＳＴを生成可能に構成され、前記駆動信号生成部は、前記スパイク信号列ＳＴに基づいて時系列に連続変化する前記駆動信号ＡＳを生成可能に構成される。 The control device 2 is configured to control the control target 3 by supplying a drive signal to the control target 3, and includes a spike signal train generation unit 232 and a drive signal generation unit 233. The column generation unit 232 is configured to be able to generate the spike signal sequence ST at a timing defined by an internal state including a basic control signal CS for controlling the controlled object 3 and the disturbance, and the drive signal generation unit is The drive signal AS that continuously changes in time series is generated based on the spike signal train ST.

また、これにより以下の制御システム１を実施することが出来る。 Moreover, the following control system 1 can be implemented by this.

かかる制御システム１は、制御対象３と、前記制御対象３を制御する制御装置２とを備え、
前記制御対象３は、ロボット、移動体、ペースメーカー、電気回路系、および化学反応系の少なくとも１つであり、前記制御装置２は、上に記載した制御装置２である。 The control system 1 includes a control target 3 and a control device 2 that controls the control target 3.
The control target 3 is at least one of a robot, a mobile body, a pacemaker, an electric circuit system, and a chemical reaction system, and the control device 2 is the control device 2 described above.

制御装置２また制御システム１をハードウェアとして実施するためのソフトウェアを、プログラムとして実施することもできる。そして、このようなプログラムを、コンピュータが読み取り可能な非一時的な記録媒体として提供してもよいし、外部のサーバからダウンロード可能に提供してもよいし、外部のコンピュータで当該プログラムを起動させて、クライアント端末で各機能を実施可能な、いわゆるクラウド・コンピューティングを実施してもよい。 Software for implementing the control device 2 or the control system 1 as hardware can also be implemented as a program. Then, such a program may be provided as a computer-readable non-transitory recording medium, or may be provided so as to be downloadable from an external server, or the program may be activated by an external computer. Then, so-called cloud computing, in which each function can be performed by the client terminal, may be performed.

かかる制御プログラムは、制御対象を制御するためのもので、コンピュータに、スパイク信号列生成機能と駆動信号生成機能とを実行させるもので、前記スパイク信号列生成機能によれば、前記制御対象３を制御するための基本制御信号ＣＳと擾乱を含む内部状態とによって規定されるタイミングで、スパイク信号列ＳＴを生成させ、前記駆動信号生成機能によれば、前記スパイク信号列ＳＴに基づいて時系列に連続変化する前記駆動信号ＡＳを生成させることとする。 Such a control program is for controlling a controlled object, and causes a computer to execute a spike signal sequence generation function and a drive signal generation function. According to the spike signal sequence generation function, the control target 3 is controlled. The spike signal train ST is generated at a timing defined by the basic control signal CS for controlling and the internal state including the disturbance, and according to the drive signal generation function, the spike signal train ST is time-series based on the spike signal train ST. The drive signal AS that continuously changes is generated.

最後に、本発明に係る種々の実施形態を説明したが、これらは、例として提示したものであり、発明の範囲を限定することは意図していない。当該新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。当該実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれるものである。 Lastly, various embodiments according to the present invention have been described, but these are presented as examples and are not intended to limit the scope of the invention. The novel embodiment can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the spirit of the invention. The embodiment and its modifications are included in the scope and gist of the invention, and are also included in the invention described in the claims and the scope equivalent thereto.

１：制御システム
２：制御装置
２ａ：外付け制御装置
２ｂ：基本制御装置
２０：通信バス
２１：通信部
２２：記憶部
２３：制御部
２３１：基本制御信号生成部
２３２：スパイク信号列生成部
２３３：駆動信号生成部
３：制御対象
３０：駆動部
３１：状態検知部
ＣＳ：制御信号
ＳＴ：スパイク信号列
ＡＳ：駆動信号
Ｉ：入力信号
ＡｐＥｎ：エントロピー
Ｎ：ニューロン数
ｘ＿０：初期位置
ｖ＿０：初期速度
ｂ：バイアス入力
ｆ＿０：自然周波数
τ＿ｓ：シナプス時定数
ｇ＾Ａ：増幅ゲイン
ｖ＾ｇ：重心移動速度
μ ：摩擦係数 1: control system 2: control device 2a: external control device 2b: basic control device 20: communication bus 21: communication unit 22: storage unit 23: control unit 231: basic control signal generation unit 232: spike signal sequence generation unit 233 : Drive signal generation unit 3: Control object 30: Drive unit 31: State detection unit CS: Control signal ST: Spike signal train AS: Drive signal I: Input signal ApEn: Entropy N: Number of neurons x_0: Initial position v_0: Initial velocity b: Bias input f_0: Natural frequency τ_s: Synapse time constant g^A: Amplification gain v^g: Center-of-gravity moving speed μ: Friction coefficient

Claims

A control device, which is configured to control the control target by supplying a drive signal to the control target,
A spike signal train generation unit and a drive signal generation unit,
The spike signal string generation unit is configured to generate a spike signal string at a timing defined by a basic control signal for controlling the controlled object and an internal state including a disturbance, and dynamics related to the internal state,
The drive signal generation unit is configured to be capable of generating the drive signal that continuously changes in time series based on the spike signal train.
Control device.

The control device according to claim 1,
Further comprising a communication unit and a basic control signal generation unit,
The communication unit is configured to be able to receive the state information of the control target, where:
The state information, at a point of interest in the control target, is information indicating an internal state that changes due to the behavior of the control target and environmental changes,
The basic control signal generation unit is configured to generate the basic control signal based on the state information,
Control device.

In the control device according to claim 1 or 2,
The spike signal train generation unit,
Having multiple neurons that make up a neuron network,
It is configured such that the spike signal train can be generated by one neuron network based on the signals output by each of the plurality of neurons.
Control device.

The control device according to any one of claims 1 to 3,
The neuron of the spike signal sequence generation unit is configured to be able to generate the spike signal by using generation of a stochastic impulse-like action potential in a living body as a model.
Control device.

A control system,
A control target, and a control device for controlling the control target,
The control target is at least one of a robot, a mobile body, a pacemaker, an electric circuit system, a chemical reaction system, a communication network, a socioeconomic management system, a financial system, a biological network, and animals and plants,
The control device is the control device according to any one of claims 1 to 4.
Control system.

The control system according to claim 5,
The control target is a musculoskeletal robot, and includes a robot drive unit and a robot state detection unit,
The robot driving unit includes a plurality of bones, a plurality of joints, a muscle that applies a tensile force between adjacent bones, and/or a multi-joint muscle that applies a tensile force across the plurality of bones,
The robot state detection unit is configured to detect at least one state of muscle force, muscle length, joint angle, body posture, center of gravity position, sole reaction force, 3-axis posture, 3-axis acceleration, and 3-axis angular velocity. Will be
Control system.

A control program for controlling a controlled object,
It causes the computer to execute the spike signal train generation function and the drive signal generation function.
According to the spike signal train generation function, a spike signal train is generated at a timing defined by a basic control signal for controlling the controlled object and an internal state including a disturbance,
According to the drive signal generation function, the drive signal that continuously changes in time series is generated based on the spike signal train,
Control program.