JPH10154002A - Synthetic control system

Info

Publication number: JPH10154002A
Application number: JP9145610A
Authority: JP
Inventors: Kazusuke Kamihira; 一介上平; Masashi Yamaguchi; 昌志山口
Original assignee: Yamaha Motor Co Ltd
Current assignee: Yamaha Motor Co Ltd
Priority date: 1996-09-26
Filing date: 1997-06-03
Publication date: 1998-06-09

Abstract

PROBLEM TO BE SOLVED: To realize characteristics that can satisfy every user by changing the control characteristics of a control system that controls a controlled object so that they match the characteristics of the user and of the use situation. SOLUTION: The evaluation system of an evolutionary adaptation layer receives external information and/or information on the characteristics of the user and/or information on the characteristics of the use situation, estimates the characteristics of the user of the controlled object and/or of the use situation, and, based on those characteristics, evaluates the evolution processing carried out in an evolutionary adaptation system. The control modules of the evolutionary adaptation system are evolved genetically according to the judgment of the evaluation system, yielding at least one control module that is optimal at that time. A learning layer has two control systems: while one executes control, the other learns both the input/output relation of the optimal control module evolved in the evolutionary adaptation layer and the input/output relation of the learning layer's executing control system. As a result, the characteristics of the controlled object change from moment to moment into behavior adapted to the characteristics of the user and the use situation.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an integrated control method for comprehensively controlling a controlled object.

[0002]

[Prior Art] Conventionally, when the characteristics of a product such as a vehicle or a home appliance are to be controlled, those characteristics are fixed at the development and design stage. The designers assume the users who are expected to use the product, take the preferences and usage conditions of these hypothetical users into account, and set the characteristics so that they suit as wide a range of users as possible.

[0003]

[Problem to Be Solved by the Invention] However, the users of such products each have their own individuality, and their preferences vary widely. Consequently, even if a product is developed and designed around the assumed preferences of its expected users, as described above, it is impossible to provide characteristics that satisfy every user. To address this problem, users currently check the characteristics of a product before buying it and judge whether those characteristics satisfy them, but this pre-purchase check is a burden on the user. Moreover, because units of the same product are generally controlled with the same characteristics, the characteristics also narrow the range of products a user can choose from; for example, a user who likes a product's design but dislikes its characteristics may have to give up buying it. An object of the present invention is to solve these conventional problems and to provide an integrated control method that can realize characteristics with which every user can be satisfied.

[0004]

[Means for Solving the Problem] To solve the above problem, the integrated control method according to the present invention is characterized in that it judges the characteristics of the user and/or the use situation and, based on the result of that judgment, changes the control characteristics of the control system that controls the controlled object so that they match the characteristics of the user and/or the use situation.

[0005]

[Embodiments of the Invention] An embodiment of the integrated control method according to the present invention is described below with reference to an example shown in the accompanying drawings. FIG. 1 is a block diagram showing the basic concept of the integrated control method according to the present invention. As shown in the figure, the method comprises three control layers: a reflection layer, a learning layer, and an evolutionary adaptation layer. Information on the controlled object (for example, information on its operating state) is input from outside. Based on this input, the reflection layer determines a basic operation amount, the learning layer and the evolutionary adaptation layer determine correction amounts for the basic operation amount, and the final control output is determined from the basic operation amount and the correction amounts. The roles of the three layers are described below.

The reflection layer holds in advance the relationship between information on the controlled object (hereinafter, "external information") and the basic operation amount for that information, in the form of a control system such as a mathematical formula, a map, fuzzy rules, a neural network, or a subsumption architecture. When external information is input, the reflection layer uses this control system to determine and output the corresponding basic operation amount. The subsumption architecture is a known behavior-based artificial intelligence that performs parallel processing.

The evolutionary adaptation layer consists of two control systems: an evaluation system and an evolutionary adaptation system. The evaluation system receives external information and/or information on the characteristics of the user (for example, preferences, skill, or condition) and/or information on the characteristics of the use situation (for example, changes in the use environment). From this information it estimates the characteristics of the user of the controlled object and/or of the use situation, and based on those characteristics it performs the evaluation used during evolution processing in the evolutionary adaptation system. The evolutionary adaptation system has at least one control module for correcting the basic operation amount determined by the reflection layer so that it matches the characteristics of the user and/or the use situation. It evolves this control module genetically according to the judgment of the evaluation system and thereby acquires at least one control module that is optimal at that time. Once the optimal control module has been acquired, the evolutionary adaptation system fixes its control module to the optimal one and outputs an evolution correction value that corrects the basic operation amount output from the reflection layer.

The learning layer has two control systems that can be swapped between a learning role and an execution role. While one control system (execution) performs control, the other (learning) learns the input/output relation of the evolved optimal control module of the evolutionary adaptation layer together with the input/output relation of the learning layer's executing control system. When learning is complete, the two control systems swap roles: the newly trained control system starts control using the control module obtained from the learning result, and the control system that had been executing control starts to function as the learning system. The control systems of the learning layer are set to output zero in the initial state, so that control is initially performed by the reflection layer and the evolutionary adaptation layer.

After it has had the learning layer learn the information on the optimal control module, the evolutionary adaptation layer returns its output to zero. It then operates at fixed time intervals to evaluate the user's preferences and/or the use environment and to evolve the control module. If the evaluation obtained with the output of the evolutionary adaptation layer added is better than the evaluation obtained without it, the evolutionary adaptation layer again has the learning layer learn the information on the optimal control module.

Information on the control modules already learned by the learning layer can be saved to and read from external storage means such as an IC card or a floppy disk. The user can therefore read information on a past optimal control module from the external storage means as needed and have the learning layer output the basic correction amount based on that information. While the learning layer is operating on a control module read out in this way, the output of the evolutionary adaptation layer is fixed at zero and the evolution processing of the control module is stopped.

Through the operation of these layers, the control output of the integrated control method changes from moment to moment to match characteristics such as the preferences of the user of the controlled object and changes in the use environment. As a result, the characteristics of the controlled object change continuously into behavior adapted to the characteristics of the user and/or the use situation. In this specification, the state in which the characteristics of the controlled object evolve in adaptation to the characteristics of the user and/or the use situation under this integrated control method is referred to as "training".
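
The way the three layers combine can be illustrated with a minimal Python sketch. It assumes, following the flowchart description of this specification, that the final control output is the plain sum of the basic operation amount and the two correction amounts. The names `LearningLayer`, `control_output`, and `zero_output`, and the constant stand-in controllers, are hypothetical and do not appear in the specification.

```python
from typing import Callable, Sequence

Controller = Callable[[Sequence[float]], float]


def zero_output(_inputs: Sequence[float]) -> float:
    """Initial state of a learning-layer control system: always output zero."""
    return 0.0


class LearningLayer:
    """Two interchangeable control systems, one executing and one learning."""

    def __init__(self) -> None:
        self.executing: Controller = zero_output
        self.learning: Controller = zero_output

    def swap(self) -> None:
        """Exchange roles once the learning control system has finished."""
        self.executing, self.learning = self.learning, self.executing


def control_output(reflection: Controller, layer: LearningLayer,
                   evolution: Controller, inputs: Sequence[float]) -> float:
    """Basic operation amount + basic correction + evolution correction."""
    return reflection(inputs) + layer.executing(inputs) + evolution(inputs)


# Hypothetical constant stand-ins for the layers (illustration only).
reflection = lambda inputs: 10.0      # basic operation amount
evolved_module = lambda inputs: 0.5   # evolution correction value
layer = LearningLayer()

# Initial state: the learning layer contributes zero.
before = control_output(reflection, layer, evolved_module, [0.3, 0.7])

# Assume the learning system has absorbed the evolved correction; the roles
# are swapped and the evolutionary adaptation layer's output returns to zero.
layer.learning = lambda inputs: 0.5
layer.swap()
after = control_output(reflection, layer, zero_output, [0.3, 0.7])
```

In this sketch `before` and `after` are equal, which reflects the intent of the handover: the learning layer takes over the correction so that the evolutionary adaptation layer can be reset without changing the control output.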

[0006] FIG. 2 is a flowchart showing the above integrated control method over time. In the initial state the output of the learning layer is zero (step a), so immediately after use of the controlled object begins, the object is controlled by the basic operation amount from the reflection layer alone. Once use has begun, the evolutionary adaptation layer evaluates the characteristics of the user and/or the use situation and evolves its control modules according to the evaluation value (step b). By evolving each control module genetically, the evolutionary adaptation layer acquires at least one control module that is the most desirable at that time (step c). It then fixes its control module to the most desirable control module obtained in step c and, based on that module, outputs an evolution correction value that corrects the basic operation amount output from the reflection layer. In the learning layer, the learning control system learns the input/output relation of the evolutionary adaptation layer as fixed to the optimal control module together with the input/output relation of the learning layer's executing control system. In the initial state the output of the executing control system is zero, but after learning, the basic operation amount from the reflection layer is corrected by both the basic correction amount from the learning layer and the evolution correction amount from the evolutionary adaptation layer (step d). The learning control system finishes learning when the difference between (i) the basic operation amount from the reflection layer plus the output of the learning control system and (ii) the actual control output (basic operation amount + basic correction amount + evolution correction amount) falls below a threshold. The learning and executing control systems then swap roles: the trained control system functions as the executing system and the system that had been executing control functions as the learning system (step e), and control is performed by the reflection layer and the learning layer (step f). After it has had the learning layer learn the information on the optimal control module, the evolutionary adaptation layer operates at fixed time intervals and evaluates the drift of the learning layer's control law over time (step g). Specifically, if the maximum fitness shows no improvement in the initial generation when the control modules of the evolutionary adaptation layer are evolved genetically, the control law is judged not to have drifted, and processing returns to step f to continue control by the reflection layer and the learning layer. If the maximum fitness does improve, processing returns to step b and the evolutionary adaptation layer acquires a new optimal control module.
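
The two decisions in this flow, namely when learning ends and which step follows the periodic check, can be sketched in Python as follows. The function names, argument names, and the numeric values in the usage lines are hypothetical. Measuring "improvement" by comparing the best fitness of the initial generation with the fitness obtained without the evolutionary adaptation layer's output is an assumption made for this sketch.

```python
def learning_converged(base: float, learner_output: float,
                       basic_correction: float, evolution_correction: float,
                       threshold: float) -> bool:
    """End-of-learning test: compare the learner's result with the actual output.

    learner result = basic operation amount + learning control system output
    actual output  = basic operation amount + basic correction amount
                     + evolution correction amount
    """
    learner_result = base + learner_output
    actual_output = base + basic_correction + evolution_correction
    return abs(learner_result - actual_output) < threshold


def step_after_periodic_check(initial_generation_best: float,
                              fitness_without_evolution: float) -> str:
    """Step g: return 'b' to evolve again, or 'f' to keep the learned control."""
    if initial_generation_best > fitness_without_evolution:
        return "b"  # maximum fitness improved: acquire a new optimal module
    return "f"      # no improvement: continue with reflection + learning layers


# Hypothetical values for illustration.
done = learning_converged(10.0, 0.48, 0.0, 0.5, threshold=0.05)
next_step = step_after_periodic_check(0.9, 0.8)
```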

[0007] Next, the integrated control method is described more concretely, taking a vehicle engine as an example of the controlled object. FIG. 3 is a schematic diagram showing the relationship between an engine 1 and a control device 10 that executes the integrated control method. As shown in the figure, the control device 10 receives information such as engine speed, intake negative pressure, throttle opening, rate of change of throttle opening, atmospheric pressure, intake air temperature, cooling water temperature, and gear position, and, based on this input, performs engine control that aims to achieve both fuel economy and acceleration performance. FIG. 4 is a schematic block diagram of the control device 10. As described above, the control device 10 consists of a reflection layer, a learning layer, and an evolutionary adaptation layer.

[0008] (Reflection layer) The reflection layer receives the engine speed, intake negative pressure, throttle opening, rate of change of throttle opening, atmospheric pressure, intake air temperature, cooling water temperature, and the like. Based on these input signals, it determines and outputs a basic value of the fuel injection amount (that is, the basic operation amount of the fuel injection device) from an equation that models a predetermined mathematical formula.

[0009] (Evolutionary adaptation layer) The evolutionary adaptation layer consists of an evaluation system and an evolutionary adaptation system. FIG. 5 is a flowchart of its basic operation, which is described below with reference to that flowchart.

The evaluation system has a neural network (see FIG. 8) that has learned the relationship between the distribution pattern of the maximum engine speed reached in each gear position within a fixed period (see FIGS. 6 and 7) and a running state index P. It receives the gear position signal and the engine speed signal (step 1) and uses the neural network to determine the running state index P from this input. For example, a user who prefers sporty driving tends to rev the engine to high speeds in low gears, giving a distribution pattern like that of FIG. 6(a), whereas a user who prefers mild driving tends to shift up early, giving a pattern like that of FIG. 6(b). Therefore, if the neural network of FIG. 8 is trained in advance so that P is large for the pattern of FIG. 6(a) and small for the pattern of FIG. 6(b), the index P obtained from the network becomes larger the sportier the user's driving preference and smaller the milder it is, and so reflects the user's preference.

Likewise, the network can be trained so that P is large when all gears from first to sixth are used within the fixed period, as in FIG. 7(a), somewhat large when only low gears are used, as in FIG. 7(b), and small when only high gears are used, as in FIG. 7(c). A typical vehicle uses all gears in normal driving, only low gears in congested traffic, and only high gears on a highway. Based on the index P obtained from the network, it can therefore be estimated that, for example, FIG. 7(a) corresponds to normal driving, FIG. 7(b) to driving in congestion, and FIG. 7(c) to highway driving.

From the running state index P, the evaluation system estimates the current driving situation, judges whether the user's preference emphasizes fuel economy or acceleration, and determines an acceleration-emphasis ratio α. As shown in FIG. 9, α is obtained from a predetermined function relating the acceleration-emphasis ratio α to the running state index P. For example, when P is small, as in highway driving (see FIG. 7(c)), α is small and emphasizes fuel economy; when P is large, as in sporty driving (see FIG. 6(b)), α is large and emphasizes acceleration.

The evolutionary adaptation system has a fuel-economy module and an acceleration module, and obtains adaptive change by having these modules cooperate and compete with each other. As shown in FIG. 10, each module is a hierarchical neural network with two inputs and one output; the fuel-economy module aims to improve fuel economy and the acceleration module aims to improve acceleration performance. The inputs to each module are the engine speed signal and the throttle opening, and from these each module outputs a correction amount for the fuel injection amount (that is, a correction amount for the basic operation amount from the reflection layer).

To realize cooperation and competition between these two kinds of control module, the evolutionary adaptation layer uses a genetic algorithm to evolve the coupling coefficients of the neural networks of the fuel-economy module and the acceleration module alternately, in accordance with the evaluation by the evaluation system, that is, with the user's preferences and the use environment (step 4). After both modules have finished evolving, the coupling coefficients of both networks are fixed at their evolved values, and the engine is controlled using an evolution correction value Y based on the outputs of both modules (step 5). The genetic evolution in step 4 is carried out for one module at a time in alternation: while the fuel-economy module is evolving, the coupling coefficients of the acceleration module's network are fixed, and while the acceleration module is evolving, those of the fuel-economy module's network are fixed. Engine control using the evolution correction value Y continues until learning by the learning layer, described later, is complete (step 6). When that learning is complete, the fuel-economy module and the acceleration module are reset and the output of the evolutionary adaptation layer is set to zero (step 7).
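
The inputs and output of the evaluation system can be sketched in Python. The first function builds the per-gear maximum engine speed pattern that the neural network of FIG. 8 takes as input. The second is only a hypothetical stand-in for the function of FIG. 9: the specification does not give its shape, so a clamped linear map from P to α is assumed here. The trained network that maps the pattern to P is not reproduced, and all names and sample values are illustrative.

```python
from typing import Iterable, List, Tuple


def max_rpm_per_gear(samples: Iterable[Tuple[int, float]],
                     n_gears: int = 6) -> List[float]:
    """Distribution pattern: highest engine speed seen in each gear (1-based)."""
    peaks = [0.0] * n_gears
    for gear, rpm in samples:
        peaks[gear - 1] = max(peaks[gear - 1], rpm)
    return peaks


def acceleration_emphasis(p: float, p_min: float = 0.0,
                          p_max: float = 1.0) -> float:
    """Assumed stand-in for the FIG. 9 function: clamp a linear map of P to [0, 1].

    Small P (for example highway driving) gives a small alpha (fuel economy);
    large P (for example sporty driving) gives a large alpha (acceleration).
    """
    if p_max <= p_min:
        raise ValueError("p_max must exceed p_min")
    return min(1.0, max(0.0, (p - p_min) / (p_max - p_min)))


# Hypothetical (gear, rpm) samples gathered over the fixed period.
pattern = max_rpm_per_gear([(1, 3000.0), (1, 7000.0), (2, 5000.0)])
alpha = acceleration_emphasis(0.5)
```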

【００１０】以下に遺伝的アルゴリズムによるモジュー
ルの進化を、図１１のフローチャートを参照して燃費モ
ジュールの進化を例に挙げて説明する。始めに、図１２
に示すように、燃費モジュールに対して、それを構成す
るニューラル回路網の結合係数を遺伝子としてコーディ
ングして複数の個体ａｎ（本実施例では９個の個体）か
らなる第１世代を生成する（ステップ１）。各個体の遺
伝子の値（即ち、ニューラル回路網の結合係数の値）の
初期値は予め決められた範囲内（ほぼ−１０〜１０の
間）でランダムに決定する。またこの時、既に学習層が
学習を行い出力をしている場合には、進化適応層の出力
をゼロにできる個体（図１２における個体ａ（１））を
一つ含ませることで、個体数に制限がある場合でもその
時点の性能を損なうことなく進化処理中の個体群の多様
性を保つことができる。次に、ステップ１で生成された
個体ａｎの中の一つ、例えば、個体ａ（１）に対して、
燃費モジュールのニューラル回路網を用いて実際の入力
情報（エンジン回転数及びスロットル開度）に対するニ
ューラル回路網の出力ｘを決定し（ステップ４）、さら
にこの出力を式（１）を用いて線形変換して燃費モジュ
ールの出力ｙｆを決定する（ステップ５）。尚、入力情
報のエンジン回転数及びスロットル開度はそれぞれ正規
化したものを用い、燃費モジュールからの出力は次式
（１）を用いてニューラル回路網の出力を線形変換して
用いる。ｙｆ＝２×Ｇｘ−Ｇ（１）ここで、ｙｆは燃費モジュールの出力、ｘは燃費モジュ
ールにおけるニューラル回路網の出力、Ｇは進化適応層
出力ゲインである。このように、ニューラル回路網の出
力ｘを線形変換して用いることにより、燃費モジュール
からの出力ｙｆが極端に大きな値になることがなく、全
体として進化がすこしづつ進むようになり、エンジンの
挙動が評価や進化のために極端に変動することがなくな
る。個体ａ（１）に対する燃費モジュールの出力ｙｆを
決定した後、この出力ｙｆと結合係数が固定された加速
モジュールの出力ｙａの平均加重をとって進化適応層の
出力（仮補正量Ｙｎ（１））を算出する（ステップ
６）。この加重は、前記評価系で求めた加速重視割合α
から決める。燃費モジュールの出力をｙｆ、加速制御モ
ジュールの出力をｙａとすると、進化適応層から出力さ
れる仮補正量Ｙｎは次式（２）のようになる。Ｙｎ＝ αｙａ＋（１−α）ｙｆ（２）つまり、加速重視割合が１の場合、仮補正量Ｙｎは加速
モジュールのみの出力となり、加速重視割合が０の場
合、仮補正量Ｙｎは燃費モジュールのみの出力となる。
個体ａ（１）に対する進化適応層の出力Ｙｎ（１）が決
定した後、この仮補正値Ｙｎ（１）を実際に進化適応層
から出力して、反射層からの基本操作量に加算し、仮補
正値Ｙｎ（１）で補正した制御出力でエンジンを作動さ
せる（ステップ７）。進化適応層における評価系は、個
体ａ（１）から得られる仮補正量Ｙｎ（１）で補正した
制御出力により動作されたエンジンから燃費に関するフ
ィードバック情報を入力して燃費を算出し（ステップ
８）、その結果に基づいて個体ａ（１）に対する評価を
行い、個体ａ（１）に対する適応度を求める（ステップ
９）。尚、燃費は、走行距離と消費燃料から算出する。
上記したステップ４からステップ９までの処理は、ステ
ップ１で生成された９個の個体ａ（１）〜ａ（９）に対
する適応度を算出するまで行われ、全ての個体に対する
適応度を算出した後に次の処理に進む（ステップ１
０）。尚、２個目の個体ａ（２）からの前記ステップ４
〜ステップ９までの処理は、各個体の適応度を同じ条件
で評価するために、処理に入る前に走行状況を確認し
（ステップ２）、走行状況が最初の個体ａ（１）の時の
走行状況と同じかどうかの判断を行う（ステップ３）。
全ての個体に対する適応度を算出した後、その個体の属
する世代が最終世代か否かを判断し（ステップ１１）、
最終世代でなければ親個体の選択を行う（ステップ１
２）。この選択にはルーレット式選択方式を用い、各個
体の適応度に比例した確率で、確率的に幾つかの親個体
を選択する。尚、この時、厳密に世代交代を適用しすぎ
ると、評価の高い個体を破壊してしまう恐れがあるた
め、エリート（評価の最も高い個体）を無条件に次世代
に残すエリート保存戦略も合わせて用いる。また、複数
の個体から成る集団内の最大適応度と平均適応度の比が
一定となるように、適応度の線形変換を行う。親個体の
選択が終わると、選択された個体を親個体として、交叉
を行い、９個の子個体から成る第二世代を生成する（ス
テップ１３）。個体間の交叉には、１点交叉、２点交
叉、又は正規分布交叉等の手法を用いる。正規分布交叉
とは、実数値表現の染色体（個体）について、両親を結
ぶ軸に対して回転対称な正規分布にしたがって子を生成
する方法である。正規分布の標準偏差は、両親を結ぶ主
軸方向の成分については両親間の距離に比例させ、その
他の軸の成分については両親を結ぶ直線と集団からサン
プルした第３の親との距離に比例させる。この交叉方法
は、親の特質が子に引き継がれやすいという利点があ
る。また、生成された９個の子個体に対して一定の確率
で、ランダムに遺伝子（結合度）の値を変更し、遺伝子
の突然変異を発生させる。上記した処理により、第２世
代が生成した後、燃費モジュールのニューラル回路網の
結合係数を第２世代の何れかの個体（エリート）で固定
して、加速モジュールの進化処理に移行し、進化モジュ
ールの第１世代の進化処理が終了した後、再度ステップ
１からの処理を行い、第２世代の各個体の評価・選択を
行う（図１４参照）。尚、この時、ステップ１におい
て、既にコーディングされた個体があるか否か、即ち、
第２世代以降か否かを判断し、コーディングされた個体
がある場合には、コーディングの処理を行わずにステッ
プ２に進む。この処理は、生成する世代が予め決められ
た最終世代に達するまで繰り返し行われる。これによ
り、各世代を構成する子個体は評価系の評価に沿って、
即ち、使用者の好みに合わせて進化していく。最終世代
に達したか否かはステップ１１で判断され、ステップ１
１で最終世代であると判断すると、その世代の９個の子
個体の中から適応度の最も高い個体（最適個体）、即
ち、エリートを一つ選び出し（ステップ１４）、燃費モ
ジュールのニューラル回路網の結合係数を、前記した最
適個体を構成する遺伝子で固定し（ステップ１５）、燃
費モジュールの進化を終了する。加速モジュールにおい
ても、上記した燃費モジュールと同様の処理が最終世代
に達するまで各世代に対して行われる。尚、加速モジュ
ールにおけるステップ８，９に対応する個体に対する評
価は、加速評価指数により行われる。尚、加速評価指数
は加速度をスロットル開度変化率で割って算出する。図
１３は、同一スロットル開度における車速の変化と加速
評価指数の関係を示すグラフである。Hereinafter, the evolution of the module by the genetic algorithm will be described with reference to the flowchart of FIG. First, FIG.
As shown in (1), a first generation consisting of a plurality of individuals an (in this embodiment, nine individuals) is generated by coding the coupling coefficient of a neural network constituting the fuel efficiency module as a gene (in the present embodiment, 9). Step 1). The initial value of the gene value of each individual (that is, the value of the coupling coefficient of the neural network) is randomly determined within a predetermined range (between approximately -10 and 10). At this time, if the learning layer has already learned and output, the number of individuals can be reduced by including one individual (individual a (1) in FIG. 12) that can make the output of the evolution adaptive layer zero. Even when there is a restriction on, diversity of the population during the evolution process can be maintained without impairing the performance at that time. Next, for one of the individuals an generated in step 1, for example, the individual a (1),
Using the neural network of the fuel economy module, the output x of the neural network with respect to the actual input information (engine speed and throttle opening) is determined (step 4), and this output is linearly transformed using equation (1). Then, the output yf of the fuel efficiency module is determined (step 5). The engine speed and the throttle opening of the input information are each normalized, and the output from the fuel consumption module is obtained by linearly converting the output of the neural network using the following equation (1). yf = 2 × Gx−G (1) where yf is the output of the fuel economy module, x is the output of the neural network in the fuel economy module, and G is the evolution adaptive layer output gain. As described above, by using the output x of the neural network in a linear transformation, the output yf from the fuel efficiency module does not become an extremely large value, and the evolution proceeds little by little as a whole, and the behavior of the engine is improved. Will not fluctuate extremely due to evaluation or evolution. After determining the output yf of the fuel economy module for the individual a (1), the output yf and the output ya of the acceleration module having a fixed coupling coefficient are averaged to obtain the output of the evolution adaptive layer (the provisional correction amount Yn (1) ) Is calculated (step 6). This weight is the acceleration-oriented ratio α obtained by the evaluation system.
Decide from. Assuming that the output of the fuel efficiency module is yf and the output of the acceleration control module is ya, the provisional correction amount Yn output from the evolution adaptation layer is represented by the following equation (2). Yn = αya + (1−α) yf (2) That is, when the acceleration-oriented ratio is 1, the provisional correction amount Yn is an output of only the acceleration module, and when the acceleration-oriented ratio is 0, the provisional correction amount Yn is only the fuel consumption module. Output.
After the output Yn (1) of the evolution adaptive layer for the individual a (1) is determined, the provisional correction value Yn (1) is actually output from the evolution adaptive layer and added to the basic operation amount from the reflection layer, The engine is operated with the control output corrected by the provisional correction value Yn (1) (step 7). The evaluation system in the evolution adaptation layer calculates the fuel efficiency by inputting the feedback information on the fuel efficiency from the engine operated by the control output corrected by the provisional correction amount Yn (1) obtained from the individual a (1) (step 8). Based on the result, the individual a (1) is evaluated, and the fitness for the individual a (1) is obtained (step 9). The fuel efficiency is calculated from the traveling distance and the fuel consumption.
The processing from step 4 to step 9 is repeated until the fitness has been calculated for all nine individuals a(1) to a(9) generated in step 1, after which the process proceeds to the next stage (step 10). From the second individual a(2) onward, so that the fitness of every individual is evaluated under the same conditions, the running condition is checked before the processing of steps 4 to 9 starts (step 2), and it is determined whether that running condition is the same as the one under which the first individual a(1) was evaluated (step 3).
After the fitness has been calculated for all individuals, it is determined whether the generation to which the individuals belong is the last generation (step 11). If it is not, parent individuals are selected (step 12). Roulette-wheel selection is used: several parent individuals are chosen stochastically, each with a probability proportional to its fitness. If generational replacement is applied too strictly, highly evaluated individuals may be destroyed, so an elite preservation strategy that unconditionally carries the elite (the individual with the highest evaluation) over to the next generation is used as well. In addition, the fitness is linearly transformed so that the ratio between the maximum fitness and the average fitness in the population stays constant. When the parent individuals have been selected, they are crossed over to generate a second generation consisting of nine child individuals (step 13). For the crossover, a technique such as one-point crossover, two-point crossover, or normal distribution crossover is used. Normal distribution crossover applies to chromosomes (individuals) represented by real values and generates children according to a normal distribution that is rotationally symmetric about the axis connecting the two parents. The standard deviation of the normal distribution is proportional to the distance between the parents for the component along the main axis connecting the parents and, for the other axis components, proportional to the distance between the line connecting the parents and a third parent sampled from the population. This crossover method has the advantage that the characteristics of the parents are easily inherited by the children. Furthermore, the values of the genes (coupling coefficients) of the nine generated child individuals are changed at random with a certain probability, which produces gene mutations.
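The selection, crossover, and mutation of steps 12 and 13 can be sketched as follows. This is only an illustration under stated assumptions: one-point crossover is picked from the techniques listed, the mutation is taken to be a Gaussian perturbation, the scaling ratio of 2.0 and the mutation parameters are arbitrary, and the elite is assumed to occupy one of the nine places in the next generation. All function names are hypothetical.

```python
import random


def scale_fitness(fitness, ratio=2.0):
    """Linear fitness scaling: keep the mean and make max/mean equal to `ratio` (assumed value)."""
    mean = sum(fitness) / len(fitness)
    best = max(fitness)
    if best == mean:                      # all individuals equal: nothing to scale
        return list(fitness)
    a = (ratio - 1.0) * mean / (best - mean)
    b = mean * (1.0 - a)
    # Negative scaled values are clipped to zero so they can serve as roulette weights.
    return [max(a * f + b, 0.0) for f in fitness]


def roulette_select(population, weights, k, rng):
    """Pick k parents, each with probability proportional to its weight."""
    return rng.choices(population, weights=weights, k=k)


def one_point_crossover(p1, p2, rng):
    """Child takes the genes of p1 up to a random cut point and those of p2 after it."""
    cut = rng.randrange(1, len(p1))
    return p1[:cut] + p2[cut:]


def mutate(genes, rate, sigma, rng):
    """Change each gene with probability `rate` (Gaussian change is an assumption)."""
    return [g + rng.gauss(0.0, sigma) if rng.random() < rate else g for g in genes]


def next_generation(population, fitness, rng, rate=0.05, sigma=0.1):
    """Build a generation of the same size, with the elite carried over unchanged."""
    elite_index = max(range(len(population)), key=lambda i: fitness[i])
    weights = scale_fitness(fitness)
    children = [list(population[elite_index])]
    while len(children) < len(population):
        p1, p2 = roulette_select(population, weights, 2, rng)
        children.append(mutate(one_point_crossover(p1, p2, rng), rate, sigma, rng))
    return children
```

Passing a seeded `random.Random` instance as `rng` makes a run reproducible, which is convenient when comparing parameter settings offline.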
After the second generation has been generated by the above processing, the coupling coefficients of the neural network of the fuel efficiency module are fixed to those of one individual (the elite) of the second generation, and the process moves on to the evolution processing of the acceleration module. After the evolution processing for the first generation is completed, the processing from step 1 is performed again to evaluate and select the individuals of the second generation (see FIG. 14). In step 1, it is determined whether coded individuals already exist, that is, whether the current generation is the second or a later one; if coded individuals exist, the process proceeds to step 2 without performing the coding. This processing is repeated until the generation being produced reaches a predetermined final generation. In this way, the child individuals that make up each generation evolve in line with the evaluation of the evaluation system, that is, in line with the user's preference. Whether the final generation has been reached is determined in step 11. When step 11 determines that the current generation is the final one, the individual with the highest fitness (the optimal individual), that is, one elite, is selected from the nine child individuals of that generation (step 14), the coupling coefficients of the neural network of the fuel efficiency module are fixed with the genes of this optimal individual (step 15), and the evolution of the fuel efficiency module ends. For the acceleration module, the same processing as for the fuel efficiency module is carried out generation by generation until the final generation is reached. For the acceleration module, the evaluation of individuals corresponding to steps 8 and 9 is performed using an acceleration evaluation index, which is calculated by dividing the acceleration by the rate of change of the throttle opening. FIG. 13 is a graph showing the relationship between the change in vehicle speed and the acceleration evaluation index at the same throttle opening.

【００１１】[0011]

In the genetic algorithm described above, the following techniques (1) to (3) may also be considered.

(1) Mutation of duplicate individuals: When different individuals have been selected as crossover parents but turn out to be genetically identical, both parents are mutated with a higher probability than usual. The mutation in this case adds a change based on a normal distribution to the value of each selected gene.

(2) Avoidance of crossover of the same individual: The same individual may be selected as both crossover parents, and if this is left unchecked the diversity of the population is expected to be lost. Therefore, when the parents selected for crossover are the same individual, one of them is replaced with another selected individual so that crossover of an individual with itself is avoided as far as possible.

(3) Reproduction method: Instead of crossover, a reproduction method, which is a generational replacement technique that replaces all individuals in the population at once, is used.
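Techniques (1) and (2) can be sketched in Python as below. The function names, the boosted mutation rate of 0.5, and the standard deviation of 0.1 are hypothetical values chosen for illustration; the specification only states that the probability is higher than usual and that the change follows a normal distribution.

```python
import random


def resolve_parents(i, j, selected, rng):
    """Technique (2): if both parent indices point to the same individual,
    replace the second one with another selected individual when one exists."""
    if i == j:
        others = [k for k in selected if k != i]
        if others:
            j = rng.choice(others)
    return i, j


def mutate_if_duplicates(p1, p2, rng, boosted_rate=0.5, sigma=0.1):
    """Technique (1): when two different parents carry identical genes,
    mutate both with a boosted probability using normally distributed changes."""
    if p1 != p2:
        return p1, p2

    def perturb(genes):
        return [g + rng.gauss(0.0, sigma) if rng.random() < boosted_rate else g
                for g in genes]

    return perturb(p1), perturb(p2)
```

Both helpers would be applied after parent selection and before crossover, so that the crossover itself needs no special cases.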

【００１２】[0012]

When the evolution of the fuel efficiency module and the acceleration module has been completed by the genetic algorithm described above and the neural networks of both modules have been fixed with the coupling coefficients of their optimal individuals, the evolution adaptation layer outputs the evolution correction value Y using the fixed fuel efficiency module and acceleration module, as described above. The evolution correction value Y is obtained as follows: the outputs of the neural networks of the two modules are determined from the input signals (engine speed and throttle opening), these outputs are linearly transformed by equation (1) to give the module outputs yf and ya, and the weighted average of yf and ya is then taken using equation (2).

【００１３】[0013]

As described above, generating a plurality of individuals for each module and crossing them over makes the individuals of the same kind of module compete with one another, so that the module evolves into a better one. In addition, because the fuel efficiency module and the acceleration module are evolved separately and alternately, the module that is changing evolves adaptively to match the output of the module that is not changing, which achieves cooperation between the different modules. However, when the weight of one control module is small, a genetic change applied to that control module hardly shows up in the result, so it is also conceivable to apply genetic changes only to control modules whose weight is reasonably large.

【００１４】[0014]

(Learning layer) The learning layer consists of two neural networks A and B; while one of them functions as the learning network, the other functions as the execution network. When the evolution of each module in the evolution adaptation layer is finished and the neural networks of the fuel efficiency module and the acceleration module are each fixed with the coupling coefficients of the optimal individual, the learning network learns the input/output relationship of the evolution adaptation layer combined with the input/output relationship of the neural network currently functioning as the execution network of the learning layer. During this time, the output of the evolution adaptation layer is produced by the fuel efficiency module and acceleration module that maximized the preceding evaluation function, so the control law does not change over time.

In this learning, the inputs and outputs of the evolution adaptation layer and of the execution neural network of the learning layer are averaged over a certain step width, and the averages are used as input/output data for updating the teacher data set. For example, if the average engine speed over one second is 5000 rpm and the average throttle opening is 20, these values, together with the fuel injection correction amounts of the evolution adaptation layer and of the execution neural network of the learning layer at that time (that is, the evolution correction amount and the reference correction amount), are used as input/output data (see FIG. 15). This input/output data is added to the previous teacher data to obtain a new teacher data set. At this point, old teacher data whose Euclidean distance from the new data is within a certain value are deleted from the teacher data set, as shown in FIG. 16. As its initial value, the teacher data set has an output of zero for all input data.

Based on the updated teacher data set, the learning layer trains the coupling coefficients of the learning neural network. The training continues until the error between the virtual control output, which is obtained from the output of the learning neural network under training (that is, the virtual correction value) and the basic operation amount from the reflection layer, and the actual control output falls to or below a threshold. When this learning ends, the learning neural network becomes the execution network, and the network that had been used for control becomes the learning network. The learning layer then determines the basic correction amount with the newly obtained execution neural network and actually outputs it; at the same time, the output of the evolution adaptation layer becomes zero, and control is performed by the learning layer and the reflection layer.

The initial value of the execution neural network of the learning layer is set so that its output is always zero. In this way, in the initial state, control is performed by the reflection layer and the evolution adaptation layer alone. The coupling coefficients of a trained execution neural network can be saved to and read from external storage means such as a floppy disk or an IC card.
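The update of the teacher data set can be sketched as follows. Several points are assumptions made for illustration: the teacher output is taken to be the sum of the evolution correction amount and the reference correction amount, the Euclidean distance is measured in the input space only, and the inputs are assumed to be normalized before the distance is compared with the radius. The function names and the radius value are hypothetical.

```python
import math


def average_window(samples):
    """Average one step width of samples.

    Each sample is (engine_speed, throttle, evolution_correction, reference_correction).
    Returns ((mean engine speed, mean throttle), mean total correction).
    """
    n = len(samples)
    speed = sum(s[0] for s in samples) / n
    throttle = sum(s[1] for s in samples) / n
    total = sum(s[2] + s[3] for s in samples) / n   # assumed: corrections are summed
    return (speed, throttle), total


def update_teacher_set(teacher, new_item, radius):
    """Add new_item = (inputs, output) and delete older items whose inputs lie
    within `radius` (Euclidean distance) of the new inputs."""
    new_inputs, _ = new_item
    kept = [(x, y) for (x, y) in teacher if math.dist(x, new_inputs) > radius]
    kept.append(new_item)
    return kept
```

Deleting nearby old items keeps the teacher data set from accumulating contradictory targets for almost identical operating points, so the learning network is trained toward the most recent behavior.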

【００１５】[0015]

After the evolution in the evolution adaptation layer has finished and the learning layer has learned its result as described above, the evolution adaptation layer operates at fixed time intervals, as explained with reference to FIG. 2, to check the control law of the learning layer for deviation; if the control law has deviated, the fuel efficiency module and the acceleration module are evolved again. When the user has read out coupling coefficients stored in the external storage means and the learning layer is operating on the basis of those coefficients, the evolution adaptation layer may be configured to skip the check for deviation of the control law, keep its output fixed at zero and stop its processing, and then resume processing in response to a start instruction from the user. By evaluating the driver's preference in the evolution adaptation layer and evolving the fuel efficiency module and the acceleration module in line with that evaluation, the engine 1 is trained toward a fuel-efficiency-oriented type or a drivability-oriented type according to the user's preference. Moreover, because the evolution adaptation layer operates at fixed time intervals, this training follows changes in the user's preference and changes of the engine and the vehicle over time.

【００１６】[0016]

Adapting in the evolution adaptation layer while limiting the correction amount, and then learning the results obtained there, as in the embodiment described above, has the following two advantages.

・The diversity of the control modules in the evolution adaptation layer is secured, which makes possible the global search that characterizes genetic algorithms.

・From the trial-and-error information processing of the evolution adaptation layer, the faster and more intelligent information processing of the learning layer can be acquired. This is not conspicuous in the engine control described above, but it is a major advantage in applications such as path control of a mobile robot.

【００１７】[0017]

Next, another embodiment of a control device that carries out the overall control system according to the present invention, with the air-fuel ratio of a vehicle engine as the controlled object, is described briefly. FIG. 17 is a schematic block diagram of the control device 20. Like the control device 10 of the second embodiment described above, the control device 20 consists of a reflection layer, a learning layer, and an evolution adaptation layer. Everything other than the evolution adaptation layer and the learning layer is the same as in the second embodiment, so a detailed description is omitted here.

The evolution adaptation layer consists of an evaluation system and an evolution adaptation system. As in the second embodiment, the evaluation system estimates the current running condition from the running state index P and determines the acceleration-oriented ratio α, which represents the user's preference. The evolution adaptation system comprises lower modules, namely a fuel efficiency module that aims to acquire a fuel injection control law giving the best fuel efficiency and a power module that aims to acquire a fuel injection control law giving the highest output, and an upper module, namely a control module that aims to acquire the output ratio between the fuel efficiency module and the power module that suits the user's preference for each operating state (see FIG. 17). Like the control modules of the second embodiment, each module is a hierarchical neural network with two inputs and one output. The fuel efficiency module and the power module each take the normalized throttle opening and the normalized engine speed as inputs and output a primary output value of the evolution correction amount. The control module takes the normalized throttle opening and the normalized rate of change of the throttle opening as inputs and outputs the output ratio of the lower modules (that is, the fuel efficiency module and the power module), which determines the final output value of the evolution correction amount.

As shown in FIG. 18, the fuel efficiency module and the power module evolve the coupling coefficients of their neural networks autonomously, one after the other, using a genetic algorithm, so as to acquire air-fuel ratio correction amounts that give, respectively, the air-fuel ratio with the best fuel efficiency and the air-fuel ratio with the best output, regardless of the operating state or the user's preference. The control module, in contrast, evolves the coupling coefficients of its neural network autonomously using a genetic algorithm so as to acquire an output ratio of the fuel efficiency module and the power module that suits the user's preference, between the best-fuel-efficiency air-fuel ratio and the best-output air-fuel ratio.

Each control module is evolved by repeating the following: the coupling coefficients of its neural network are coded as genes to generate a plurality of individuals, the engine is actually operated with every individual and the evaluation system evaluates the result for each one, several parent individuals including the individual with the highest evaluation (the elite individual) are selected and crossed over to generate the child individuals of the next generation, and each individual is evaluated again. This evolution processing is repeated until a predetermined number of generations has been completed. After the individuals of the final generation have been evaluated, the coupling coefficients of the neural network are fixed with the elite individual of that generation, and the learning control module of the learning layer is made to learn it. When a population is generated during the evolution processing and the learning layer has already learned and is producing output, one individual that makes the output of the evolution adaptation layer zero is included, so that the diversity of the evolving population is preserved even though the number of individuals is limited.

The evolution processing described above is applied repeatedly and consecutively to the same control module, with the learning layer learning every predetermined number of generations, until the evolution converges. During the evolution processing, the control modules other than the one being evolved are kept fixed, and when the evolution of that control module has finished, the evolution processing of the next control module is carried out. At each learning step, the coupling coefficients of the neural network are fixed with the elite individual of that generation and the learning control module of the learning layer is made to learn it. When the learning is finished, a new population is generated that includes the elite individual used for the learning and an individual that makes the output of the evolution adaptation layer zero. In this way, the next round of evolution processing can preserve the diversity of the evolving population without losing the performance achieved by the previous round.

【００１８】[0018]

The evolution of a module by the genetic algorithm described above is now explained in more detail with reference to the flowchart of FIG. 19, taking the evolution of the fuel efficiency module as an example. First, the coupling coefficients of the neural network constituting the fuel efficiency module are coded as genes to generate a first generation consisting of a plurality of individuals a(n) (nine individuals in this embodiment) (step 1). Including in the population one individual that makes the output of the evolution adaptation layer zero maintains the pre-evolution performance and preserves the diversity of the population during evolution even when the number of individuals is limited. Next, for one of the individuals a(n) generated in step 1, for example the individual a(1), the output x(1) of the neural network of the fuel efficiency module for the actual input information (engine speed and throttle opening) is determined (step 2), and this output is linearly transformed using equation (1) to determine the output yf(1) of the fuel efficiency module for the individual a(1) (step 3).

yf(n) = 2 × G × x(n) − G    (1)

Here, yf(n) is the output of the fuel efficiency module, x(n) is the output of the neural network in the fuel efficiency module, G is the output gain of the evolution adaptation layer, and n denotes the individual. After the output yf(1) of the fuel efficiency module for the individual a(1) has been determined, the output of the evolution adaptation layer (the evaluation correction amount Ya(n)) is determined from the output ya of the power module and the output OR of the control module according to the following equation (2) (step 4).

Ya(n) = OR × ya + (1 − OR) × yf(n)    (2)

During the evolution processing of the fuel efficiency module, the coupling coefficients of the power module and the control module are fixed. As described above, the output OR of the control module is the ratio between the output yf of the fuel efficiency module and the output ya of the power module. Therefore, when OR is "1", the output of the evolution adaptation layer is the output of the power module, and when OR is "0", it is the output of the fuel efficiency module.

After the output Ya(1) of the evolution adaptation layer for the individual a(1) has been determined, this provisional correction value Ya(1) is actually output from the evolution adaptation layer, Ya(1) and the basic correction value Yb from the learning layer are added to the basic operation amount from the reflection layer, and control based on the operation amount corrected by Ya(1) + Yb is performed for a predetermined time (step 5). While control is actually being performed with the individual a(1), the evaluation system of the evolution adaptation layer receives feedback on the result of this control and determines an evaluation value (for example, the fuel consumption) for the individual a(1) (step 6).

Taking the calculation of the evaluation values for all nine individuals a(1) to a(9) generated in step 1 as one cycle of preliminary evaluation, the processing of steps 1 to 6 described above is repeated until a predetermined number of cycles has been performed (step 7). After the evaluation values for the predetermined number of cycles have been calculated for all individuals, the process proceeds to the overall evaluation of each individual (step 8). In the overall evaluation, an overall evaluation value is calculated for each individual by dividing the total distance traveled during the evaluation value calculation cycles by the sum of all its evaluation values (here, the total fuel consumption), and the fitness of each individual is evaluated on the basis of this overall evaluation value (step 8). By performing control with each individual in turn for the predetermined number of times, so that the individuals operate in a pseudo-parallel manner by time division (see FIG. 20), every individual can be evaluated fairly under almost the same conditions even though the running condition changes from moment to moment. This makes online evaluation, that is, evaluation while the vehicle is running, possible.

After the overall evaluation is finished, it is determined whether the generation to which the individuals belong is the final generation (step 9); if not, parent individuals are selected (step 10). Roulette-wheel selection is used: several parent individuals are chosen stochastically, each with a probability proportional to its fitness. The elite individual is unconditionally retained as a parent. When the parents have been selected, they are crossed over to generate a second generation, again consisting of nine child individuals (step 11). The values of the genes (coupling coefficients) of the nine generated child individuals are then changed at random with a certain probability to produce gene mutations. These nine child individuals are made to include one individual that makes the output of the evolution adaptation layer zero. After the second generation has been generated in this way, the preliminary evaluation from step 2 is repeated.

The evolution processing described above is repeated until a predetermined number of generations has elapsed. As a result, the child individuals that make up each generation evolve in line with the evaluation of the evaluation system, that is, in the case of the fuel efficiency module, so as to acquire a fuel injection control law that gives the best fuel efficiency. Whether the predetermined number of generations has elapsed is determined in step 9. When step 9 determines that the final generation has been reached, the individual with the highest fitness (the optimal individual), that is, one elite, is selected from the nine child individuals of that generation (step 12), the coupling coefficients of the neural network of the fuel efficiency module are fixed with the genes of this optimal individual (step 13), and the process moves on to the learning processing for the learning control module of the learning layer. The evolution processing for one control module (here, the fuel efficiency module) is repeated consecutively as long as the evaluation of the control by the learning layer and the reflection layer after learning is higher than the evaluation of the control before the evolution. When the evaluation of the control after the evolution processing no longer improves on that before it, the evolution of that control module is judged to have converged, and the process moves on to the evolution processing of the next control module (the power module in this embodiment).
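The time-division evaluation of steps 5 to 8 can be sketched as follows. The callback `run_trial` is a hypothetical stand-in for operating the engine with one individual for the predetermined time and measuring the distance traveled and the fuel consumed; its name and signature are assumptions and are not taken from the specification.

```python
def evaluate_time_division(individuals, run_trial, cycles):
    """Interleave short trials of all individuals over several cycles and
    return one overall evaluation value (total distance / total fuel) each."""
    distance = [0.0] * len(individuals)
    fuel = [0.0] * len(individuals)
    for _ in range(cycles):                          # one cycle = one preliminary evaluation
        for i, individual in enumerate(individuals):
            d, f = run_trial(individual)             # steps 5 and 6 for this individual
            distance[i] += d
            fuel[i] += f
    # Step 8: overall evaluation value; guard against zero fuel consumption.
    return [d / f if f > 0 else 0.0 for d, f in zip(distance, fuel)]
```

Because every individual gets one short trial in each cycle, a change in running condition during the evaluation affects all individuals to a similar degree instead of penalizing only the one that happened to be active.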

【００１９】[0019]

The evolution processing of the power module and the control module is carried out in the same way as that of the fuel efficiency module described above, except that the overall evaluation value for each individual in the power module is calculated by dividing the average engine speed during the evaluation value calculation cycle by the average throttle opening.
As a result, the child individuals constituting each generation during the evolution process of the power module evolve in accordance with the evaluation of the evaluation system, that is, to acquire a fuel injection control law that maximizes the output. The overall evaluation value for each individual in the control module is a fuel efficiency evaluation value and a response evaluation value during an evaluation value calculation cycle, an acceleration importance ratio α,
The determination is made using the reference fuel efficiency evaluation value and response evaluation value. That is, the fuel efficiency evaluation value during the value calculation cycle is FC, the response evaluation value is RP, and the reference fuel efficiency evaluation value is FC.
Assuming that the base and the reference response evaluation value are RPbase, the total evaluation value CNT in the control module is given by the following equation. CNT = (1−α) × Scale × (FC−FCbase) / F
Cbase + α × (RP−RPbase) / RPbase Here, Scale is a coefficient for balancing the fuel efficiency evaluation value and the response evaluation value. The fuel efficiency evaluation value is calculated in the same manner as the overall evaluation value in the fuel efficiency module described above, and the response evaluation value RP is regarded as acceleration if a change in throttle opening larger than a predetermined threshold value continues for a certain period of time. The change in speed is divided by the rate of change in throttle opening, and this is taken as the average of the values calculated for each acceleration during the time of the predetermined evaluation value calculation cycle. As described above, by evaluating the fuel efficiency evaluation value and the response evaluation value in accordance with the ratio of the acceleration-oriented ratio α determined based on the user's preference, the control module can adjust the driving state according to the user's preference. It will evolve to obtain the output ratio of the fuel efficiency module and the power module according to As described above, the fuel economy module and the power module that constitute the lower control module group have evolved to give priority to fuel economy and output, respectively, and the control module that constitutes the upper control module has the output ratio of the lower control module according to the user's preference. , Control can be performed using lower-level modules capable of optimal output for each function (ie, fuel efficiency and engine output) at an output ratio that matches the user's preference.
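
As a worked illustration of the evaluation values defined above, the following sketch computes the power module value, the response evaluation value RP, and the control module value CNT. The function and parameter names are hypothetical, and the representation of each acceleration as a (speed change, throttle change rate) pair is an assumption made for the example.

```python
def power_module_value(avg_engine_speed, avg_throttle_opening):
    """Power module: average engine speed divided by average throttle opening."""
    return avg_engine_speed / avg_throttle_opening


def response_value(accelerations):
    """RP: mean of (speed change / throttle opening change rate) over accelerations.

    accelerations: list of (speed_change, throttle_change_rate) pairs,
    one per detected acceleration in the evaluation cycle (assumed format).
    """
    ratios = [dv / rate for dv, rate in accelerations]
    return sum(ratios) / len(ratios)


def control_module_value(fc, rp, fc_base, rp_base, alpha, scale=1.0):
    """CNT = (1 - alpha) * Scale * (FC - FCbase) / FCbase
             + alpha * (RP - RPbase) / RPbase"""
    fuel_term = scale * (fc - fc_base) / fc_base
    response_term = (rp - rp_base) / rp_base
    return (1.0 - alpha) * fuel_term + alpha * response_term
```

With α = 0 the value depends only on the relative fuel efficiency improvement, and with α = 1 only on the relative response improvement, which is how the acceleration importance ratio steers the evolution of the control module.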

[0020] (Learning layer) Next, the learning layer is briefly described. As shown in FIG. 17, the learning layer includes control module groups of types corresponding to the control modules of the evolution adaptive layer, and each control module group consists of two neural networks A and B. While one of these two neural networks functions for learning, the other functions for execution. Each time the corresponding control module of the evolution adaptive layer has evolved for a predetermined number of generations, the control module in the learning layer uses its learning neural network to learn the input-output relationship of the evolution adaptive layer together with the input-output relationship of the neural network currently functioning for execution in the learning layer. This learning ends when the error between the output of the learning neural network and the control output falls below a threshold; the learning neural network then becomes the execution network, and the former execution network becomes the learning network. As described above, this learning is performed each time the evolution processing of the control module in the evolution adaptive layer has run for the predetermined number of generations. Consequently, from the start of the evolution processing for one control module until its evolution converges, learning is performed every time the predetermined number of generations has been processed. When evolution and learning are alternated in this way for each control module, the evaluation value improves faster, and evolution proceeds faster, than when evolution processing alone is used without learning (see FIG. 21).
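
The role swap between the two networks can be sketched as below. This is a minimal sketch under stated assumptions: a one-input linear model (`LinearNet`) stands in for each neural network, training is plain per-sample gradient descent, and the class and method names are hypothetical. Only the overall scheme (learn the sum of the evolved correction and the executing network's output, stop below an error threshold, then swap roles) follows the text.

```python
class LinearNet:
    """Stand-in for a neural network: y = w * x + b (an assumption for brevity)."""

    def __init__(self):
        self.w = 0.0
        self.b = 0.0

    def __call__(self, x):
        return self.w * x + self.b

    def train_step(self, x, target, lr=0.1):
        """One gradient step on squared error; returns the error before the update."""
        err = self(x) - target
        self.w -= lr * err * x
        self.b -= lr * err
        return abs(err)


class LearningModule:
    """Pair of networks: one executes while the other learns."""

    def __init__(self):
        self.executing = LinearNet()
        self.learning = LinearNet()

    def learn(self, inputs, evolved_output, threshold=1e-3, max_epochs=10000):
        """Train the learning net on (executing net + evolved correction), then swap.

        Returns True if the error fell below the threshold and the roles were swapped.
        """
        targets = [self.executing(x) + evolved_output(x) for x in inputs]
        for _ in range(max_epochs):
            worst = max(self.learning.train_step(x, t)
                        for x, t in zip(inputs, targets))
            if worst < threshold:
                self.executing, self.learning = self.learning, self.executing
                return True
        return False
```

After a successful swap the executing network already contains the evolved correction, so the evolution adaptive layer can restart from a zero output for the next round of generations.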

[0021] As described above, in the evolution adaptive layer, the fuel efficiency module and the power module constituting the lower module group evolve with priority on fuel efficiency and engine output, respectively, and the control module constituting the upper module evolves so as to obtain an output ratio of the lower modules that matches the user's preference. An optimum target air-fuel ratio pattern matching the user's preference is thereby obtained, and the engine 1 is trained, in accordance with the user's preference, into a type that emphasizes fuel efficiency, a type that emphasizes drivability, or a balanced type. In addition, by operating the evolution adaptive layer at fixed time intervals, the training follows changes in the user's preference and changes in the engine and the vehicle over time.

[0022] (Others) In the embodiments described above, the control modules are divided by function, namely acceleration and fuel efficiency, but they may instead be divided by control output, such as fuel injection amount and ignition timing. In that case, when control of the intake pipe length is newly made possible, for example, there is no need to change the existing control modules, and comprehensive control can be realized simply by adding an intake pipe length control module. Besides the above, the control outputs for engine control may include, for example, the electronic throttle opening, the intake and exhaust valve timing, the valve lift amount, and the timing of the intake and exhaust control valves (see FIG. 3). Here, the intake control valve is a valve provided in the intake pipe to control tumble and swirl, and the exhaust control valve is a valve provided in the exhaust pipe to control exhaust pulsation.
In this embodiment, the evolution processing in the genetic algorithm evolves the fuel efficiency module and the acceleration module alternately, one generation at a time, but the evolution processing is not limited to this. For example, the acceleration module may first be held fixed while the fuel efficiency module is evolved to the final generation to obtain an optimal control module; the fuel efficiency module may then be fixed to that optimal control module while the acceleration module is evolved.
In this embodiment, the control module is divided into two, a fuel efficiency module and an acceleration module, and evolution processing by the genetic algorithm is performed on each. However, the number of control modules in the evolution adaptive layer is not limited to this; one type may be used, or three or more types may be used.
In this embodiment, evolution in the genetic algorithm ends when the predetermined final generation is reached, but the timing for ending evolution is not limited to this. For example, evolution may be ended when the evaluation value of the evolving individuals has changed by a predetermined amount or more.
In this embodiment, the driving state index P for estimating the driver's preference and the driving situation is estimated with a neural network from the distribution pattern of gear position and maximum engine speed, but the means for estimating the driving state index P is not limited to this and may be any method; for example, fuzzy inference may be used. When the driving state index P is determined by fuzzy inference, IF-THEN rules such as "if the maximum engine speed in first gear is large and the maximum engine speed in second gear is large, then the driving state index P is very large" are written, and fuzzy inference is performed (see FIG. 22). With a neural network, the coupling coefficients are determined by learning from teacher signals, so the result tends to be a black box; the fuzzy inference method, by contrast, allows a knowledge-based approach at design time.
In this embodiment, the user's preference is evaluated from the distribution pattern of gear position and maximum engine speed, but the parameters for evaluating the user's preference are not limited to this and may be any parameters. For example, means for detecting physiological indices of the user, such as pulse, blood pressure, body temperature, and brain waves, may be provided in the user's equipment (in the case of a motorcycle, for example, the helmet, gloves, or boots), and the evaluation may be based on the detected physiological indices. These physiological indices can also be used to evaluate the state of the user (the driving state of the driver).
In this embodiment, the user's preference is evaluated and training is performed in accordance with that evaluation, but the parameters that determine the training policy are not limited to this. For example, the user's skill may be evaluated and training performed in accordance with that skill. In the case of a vehicle, parameters for evaluating the user's skill may include, for example, the lean angle of the vehicle, the vertical acceleration of the vehicle, the brake operation amount, and the usage ratio of the front and rear brakes.
In this embodiment, the learning layer is configured with a hierarchical neural network, but the configuration of the control system of the learning layer is not limited to this; for example, a CMAC (Cerebellar Model Arithmetic Computer) may be used. The advantages of a CMAC over a hierarchical neural network include its superior capacity for additional learning and its high learning speed.
In the third embodiment of the present invention, the overall evaluation processing for each individual during evolution is time-divided: control using each individual is performed in succession for a predetermined number of cycles, so that the individuals operate in a pseudo-parallel manner and can be evaluated fairly without being affected by changes in the driving state and the like. However, the evaluation method for each individual is not limited to this. For example, the evaluation region may be subdivided and local evaluation values calculated, and the overall evaluation may be made using the local evaluation values common to all individuals; alternatively, each subdivided local evaluation region may be weighted, and the overall evaluation made taking the weights into account. Specifically, in the case of fuel efficiency evaluation, for example, as shown in FIG. 23, the relationship between the engine torque and engine speed during the evaluation processing and the fuel consumption per unit time is examined for all individuals (two in the case of FIG. 23; see FIG. 23(a) and (b)). With this fuel consumption as the local evaluation value, each individual can be evaluated by the average of the local evaluation values obtained under common conditions (see FIG. 23(c)). As a method different from that of FIG. 23, as shown in FIG. 24, for example, a weight map in which each local evaluation region is weighted in advance may be prepared (see FIG. 24(c)); the relationship between the engine torque and engine speed during the evaluation processing and the fuel consumption per unit time is examined for all individuals (two in the case of FIG. 24; see FIG. 24(a) and (b)), and the overall evaluation value for each individual is determined taking the weight map into account.
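
The two alternative evaluation methods (FIG. 23 and FIG. 24) can be sketched as follows. The data layout is an assumption: each individual's measurements are held in a dictionary keyed by a (torque bin, engine speed bin) cell, with the fuel consumption per unit time as the value. The text does not say how the weight map is combined with the local values, so the weighted average used here is also an assumption, and the function names are hypothetical.

```python
def common_cell_scores(maps):
    """FIG. 23 style: average fuel consumption over cells visited by every individual.

    maps: one dict per individual, {(torque_bin, speed_bin): fuel consumption}.
    Returns one score per individual (lower consumption is better).
    """
    common = set(maps[0]).intersection(*maps[1:])
    if not common:
        raise ValueError("no operating cell was visited by every individual")
    return [sum(m[c] for c in common) / len(common) for m in maps]


def weighted_scores(maps, weights):
    """FIG. 24 style: weighted average of local values using a prepared weight map.

    weights: {(torque_bin, speed_bin): weight}; cells absent from the map count as 0.
    """
    scores = []
    for m in maps:
        total = sum(weights.get(c, 0.0) for c in m)
        if total == 0:
            raise ValueError("individual visited no weighted cell")
        scores.append(sum(weights.get(c, 0.0) * v for c, v in m.items()) / total)
    return scores
```

Restricting the comparison to common cells means that two individuals are never compared on operating conditions that only one of them experienced, which is the fairness concern the time-division method also addresses.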

【００２３】[0023]

[Effects of the Invention] According to the comprehensive control method of the present invention described above, the control characteristics of the control system that controls the controlled object can be changed in accordance with the characteristics of the user and/or the use situation. The controlled object is therefore "trained" to characteristics suited to the user and/or the use situation and becomes easier to use, and the user is given the conscious enjoyment of training the characteristics of the controlled object into characteristics that are uniquely his or her own. In addition, because the control law is changed automatically in response to changes in the use environment and deterioration over time, an optimum operating state can be realized in any case. Furthermore, setting work for obtaining the control parameters of the controlled object becomes unnecessary, which makes cost reduction possible.
When the controlled object is an engine mounted on a vehicle, control matched to the driver's preference makes it possible to match the operating characteristics of the engine to the user's preference, and control matched to the driver's skill makes it possible to obtain running performance suited to the driver's level of proficiency, so that comfortable driving can be provided to the driver. Moreover, when the controlled object is an engine mounted on a vehicle, the operating characteristics of the engine are trained to the purchaser's preferences and the like after purchase, so the purchaser's range of choice when buying a vehicle is no longer limited by engine characteristics and the like.
If the controlled object is the auxiliary power of a power-assisted bicycle or wheelchair, the assist characteristics of the auxiliary power can be matched to the user's preference, so that optimum assist characteristics are obtained for each user. When the controlled object is a robot, the operating characteristics of the robot can be matched to the user's preference, so that the robot operates optimally for each user. When the controlled object is a suspension or a seat, the damper characteristics of the suspension or seat can be matched to the user's preference, so that optimum damper characteristics are obtained for each user. Finally, if the controlled object is the steering system of a vehicle, the steering control characteristics of the steering system can be matched to the user's preference, so that optimum steering control characteristics are obtained for each user.

[Brief description of the drawings]

FIG. 1 is a block diagram showing the basic concept of the comprehensive control method according to the present invention.

FIG. 2 is a flowchart showing the comprehensive control method of FIG. 1 over time.

FIG. 3 is a schematic diagram showing the relationship between the engine 1 and the control device 10 that executes the comprehensive control method.

FIG. 4 is a schematic block diagram of the control device 10.

FIG. 5 is a flowchart of the basic operation of the evolution adaptive layer in FIG. 4.

FIG. 6(a) and (b) are graphs each showing the distribution pattern of the maximum engine speed at each gear position within a fixed time; (a) is for sporty driving and (b) is for mild driving.

FIG. 7(a) to (c) are graphs each showing the distribution pattern of the maximum engine speed at each gear position within a fixed time; (a) is for normal driving, (b) is for driving on a congested road, and (c) is for driving on a highway.

FIG. 8 is a schematic diagram of the neural network for estimating the driving state index.

FIG. 9(a) to (c) are graphs each showing the relationship between the driving state index and the evaluation function; taking (a) as the reference, (b) shows the case where the time to reach the maximum engine speed is short, and (c) shows the case where it is long.

FIG. 10 is a schematic diagram of the neural networks constituting the fuel efficiency module and the acceleration module.

FIG. 11 is a flowchart of the evolution of the fuel efficiency module by the genetic algorithm.

FIG. 12 is a diagram conceptually showing the coding of the neural network.

FIG. 13(a) to (c) are graphs each showing the relationship between the change in vehicle speed and the acceleration evaluation index at the same throttle opening.

FIG. 14 is a diagram conceptually showing the relationship between the evolution of the fuel efficiency module and the evolution of the acceleration module.

FIG. 15 is a diagram conceptually showing how the teacher data set acquires new teacher data.

FIG. 16 is a diagram conceptually showing the updating of the teacher data set.

FIG. 17 is a schematic block diagram of another embodiment of the control device that executes the comprehensive control method according to the present invention.

FIG. 18 is a diagram showing the evolution tendency of each control module.

FIG. 19 is a flowchart of the evolution of the fuel efficiency module by the genetic algorithm.

FIG. 20 is a diagram showing the preliminary evaluation processing of each individual by the time-division method.

FIG. 21 is a graph comparing the evolution performance of evolution by evolution processing alone with that of evolution in which evolution processing and learning processing are alternated.

FIG. 22 is a diagram conceptually showing the estimation of the driving state index by fuzzy inference.

FIG. 23 is a diagram conceptually showing another method of fuel efficiency evaluation in the fuel efficiency module.

FIG. 24 is a diagram conceptually showing yet another method of fuel efficiency evaluation in the fuel efficiency module.

[Explanation of symbols]

1: engine
10: control device (second embodiment)
20: control device (third embodiment)

Continued from the front page
(51) Int. Cl.⁶ | Identification code | FI
F02D 45/00 | 340 | F02D 45/00 340H, 340Z
G06F 15/18 | 550 | G06F 15/18 550E, 550C

Claims

1. A comprehensive control method characterized in that the characteristics of a user and/or a use situation are determined and, based on the result of the determination, the control characteristics of a control system that controls a controlled object are changed in accordance with the characteristics of the user and/or the use situation.

2. The comprehensive control method according to claim 1, wherein the characteristics of the user and / or the use situation are estimated using a neural network or a fuzzy rule.

3. The comprehensive control method according to claim 1, wherein the user's characteristics are at least one of user's preference, skill, operation pattern, and state.

4. The comprehensive control method according to claim 1, wherein the control characteristics are adaptively changed in accordance with a change in the use environment and/or deterioration over time of the controlled object.

5. The comprehensive control method according to any one of claims 1 to 4, wherein the control system has a hierarchical structure with a reflection layer as the lowermost layer, a learning layer above it, and an evolution adaptive layer as the uppermost layer.

6. The comprehensive control method according to claim 5, wherein a basic amount of the control output is output from the reflection layer, and the outputs of the learning layer and the evolution adaptive layer are correction amounts for the basic amount.

7. The comprehensive control method according to claim 5 or 6, wherein the evolution adaptive layer includes at least one control module that behaves autonomously, and the control module is adaptively evolved through competition and/or cooperation.

8. The comprehensive control method according to claim 7, wherein there are a plurality of the control modules, and the plurality of control modules cooperate and compete with each other to adaptively change the control system.

9. The comprehensive control method according to claim 8, wherein the control modules comprise a lower module group that determines primary outputs based on characteristics related to the controlled object, and an upper module group that determines a secondary output from the primary outputs of the lower module group based on the user and/or the use situation.

10. The comprehensive control method according to claim 9, wherein the lower module group is evolved while being evaluated based on characteristics related to the controlled object, and the upper module group is evolved while being evaluated based on the user and/or the use situation.

11. The integrated control method according to claim 8, wherein a ratio of outputs of a plurality of control modules of the evolution adaptive layer is changed according to characteristics of a user and / or a use situation.

12. The control module according to claim 7, wherein the adaptive evolution of the control module is performed using a genetic algorithm and / or a multi-agent technique.
The comprehensive control method according to any one of the above.

13. The comprehensive control method according to claim 12, wherein the control module of the evolution adaptive layer is evolved by a genetic algorithm for a predetermined number of generations, and each time the result is learned by the learning layer, a new population is generated that includes the most highly evaluated individual among the individuals of the generation learned by the learning layer and an individual that makes the output of the evolution adaptive layer zero.

14. The comprehensive control method according to claim 12, wherein an evaluation function in the genetic algorithm is automatically changed according to characteristics of a user and / or a use situation.

15. The comprehensive control method according to claim 14, wherein a relationship between the evaluation function and characteristics of a user and / or a use situation can be changed by a user's instruction.

16. The comprehensive control method according to claim 5, wherein a limit is provided for an output gain of the evolution adaptive layer.

17. The comprehensive control method according to any one of claims 5 to 16, wherein the learning layer has two neural networks, one for execution and one for learning.

18. The comprehensive control method according to claim 17, wherein the control characteristic obtained by the evolution of the control module in the evolution adaptive layer is learned by a learning neural network in the learning layer.

19. The comprehensive control method according to claim 18, wherein the learning layer has a teacher data set for learning; only the sums of the outputs of the evolution adaptive layer and of the execution neural network of the learning layer that were output within a certain past period are used as new teacher data for learning by the learning neural network of the learning layer; and the teacher data in other regions remain the same as before.

20. The comprehensive control method according to any one of claims 15 to 19, wherein, when learning by the learning neural network is completed, the learning neural network functions for execution and the former execution neural network functions for learning.

21. The information according to claim 15, wherein information on the learned neural network in the learning layer is stored in an external storage medium such as a floppy disk or an IC card, and can be stored and read. The comprehensive control method according to any one of claims.

22. The reflection layer according to claim 5, wherein the reflection layer performs processing using any one of a mathematical model, a fuzzy rule, a neural network, a map, and a subsumption architecture. The comprehensive control method according to any one of claims.

23. The comprehensive control method according to claim 3, wherein the skill of the user is estimated from an external state quantity.

24. The comprehensive control method according to claim 2, wherein the state of the user is estimated using a physiological index.

25. The comprehensive control method according to claim 24, wherein the physiological index is at least one of a user's pulse, blood pressure, body temperature, and brain wave.

26. The comprehensive control method according to claim 1, wherein the control target operates actively.

27. The comprehensive control method according to claim 26, wherein the control target comprises an engine.

28. The comprehensive control method according to claim 27, wherein the engine is a vehicle engine, the characteristics of the user are determined based on the driver's preference, skill, and/or state, the characteristics of the use situation are determined based on the running condition of the vehicle, and the operating characteristics of the engine are changed based on this information.

29. The comprehensive control method according to claim 28, wherein means for detecting the driving state of the vehicle is provided; based on at least part of the detection result, a state index suited to the characteristics of one or both of the driver (preference, skill, and/or state) and the driving situation is estimated; and the operating characteristics of the engine are changed based on the state index.

30. The comprehensive control method according to claim 29, wherein the state index is estimated using one or both of a neural network and fuzzy rules.

31. The comprehensive control method according to claim 29, wherein an evaluation function in a genetic algorithm that makes the control modules of the evolutionary adaptive layer cooperate and/or compete is changed based on the state index.

32. The comprehensive control method according to claim 31, wherein the relationship between the evaluation function and the state index is determined based on the time required to reach the maximum rotational speed in each gear position, the rate of change of the engine rotational speed, or the driver's input via an instruction input button.

33. The comprehensive control method according to claim 29, wherein the means for detecting the driving state of the vehicle is means for detecting an engine speed and a gear position.

34. The comprehensive control method according to any one of claims 27 to 33, wherein the control outputs used to change the operating characteristics of the engine include a fuel injection amount, an ignition timing, an electronic throttle opening, intake/exhaust valve timing, a valve lift amount, intake/exhaust control valve timing, and the like.

35. The comprehensive control method according to any one of claims 28 to 34, wherein the driver's skill is estimated from at least one of the driver's clutch operation speed, the vehicle inclination angle, the vertical acceleration of the vehicle, the brake operation amount, and the usage ratio of the front and rear brakes.

36. The comprehensive control method according to any one of claims 28 to 35, wherein means for detecting a physiological index of the driver is provided in an accessory worn by the driver while driving, and the state of the driver is estimated based on the physiological index obtained by that means.

37. The comprehensive control method according to claim 36, wherein the accessory is at least one of a helmet, a glove, and a boot.

38. The comprehensive control method according to claim 26, wherein the control target is the auxiliary power of a bicycle or a wheelchair that uses an electric motor or an engine as auxiliary power, and the control characteristic of the control system is a characteristic relating to the assist characteristic of the auxiliary power.

39. The comprehensive control method according to claim 26, wherein the control target is a robot, and the control characteristics of the control system are characteristics relating to the operation characteristics of the robot.

40. The comprehensive control method according to claim 39, wherein the operation characteristic is at least one of the robot's path selection, arm movement method, moving speed, and manner of speaking.

41. The comprehensive control method according to claim 39, wherein the robot is a personal robot.

42. The comprehensive control method according to claim 1, wherein the control target operates passively.

43. The comprehensive control method according to claim 42, wherein the control target is a vehicle steering system, and the control characteristic of the control system is a characteristic relating to a steering control characteristic of the steering system.

44. The comprehensive control method according to claim 42, wherein the control target is a suspension or a seat of a vehicle body, and the control characteristic of the control system is a characteristic relating to a damper characteristic of the suspension or the seat.

45. The comprehensive control method described in 4, wherein the characteristics of the user are determined from the driver's preference, skill, and/or state, and the control characteristic of the control system is changed based on a result of the determination.

46. The comprehensive control method according to claim 45, wherein means for detecting the driving state of the vehicle is provided; based on at least part of the detection result, a state index suited to the characteristics of one or both of the driver (preference, skill, and/or state) and the driving situation is estimated; and the operating characteristics of the engine are changed based on the state index.

47. The comprehensive control method according to claim 46, wherein the state index is estimated using one or both of a neural network and fuzzy rules.

48. The comprehensive control method according to claim 46, wherein an evaluation function in a genetic algorithm that makes the control modules of the evolutionary adaptive layer cooperate and/or compete is changed based on the state index.

49. The comprehensive control method according to any one of claims 45 to 48, wherein the driver's skill is estimated from at least one of the driver's clutch operation speed, the vehicle inclination angle, the vertical acceleration of the vehicle, the brake operation amount, and the usage ratio of the front and rear brakes.

50. The comprehensive control method according to any one of claims 45 to 48, wherein means for detecting a physiological index of the driver is provided in an accessory worn by the driver while driving, and the state of the driver is estimated based on the physiological index obtained by that means.

51. The comprehensive control method according to claim 50, wherein the accessory is at least one of a helmet, a glove, and a boot.
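
The interplay of the teacher data update in claim 19 and the role exchange in claim 20 may be easier to follow with a small illustration. The following Python sketch is not taken from the specification and makes several assumptions: the neural networks are replaced by a hypothetical lookup table (`TableNet`) with one output per discretized input region, "learning" is reduced to copying the teacher table, and all class and method names are invented for illustration only.

```python
from dataclasses import dataclass, field


@dataclass
class TableNet:
    """Hypothetical stand-in for a neural network: one output per input region."""
    outputs: dict = field(default_factory=dict)

    def predict(self, region):
        # Regions that were never learned produce a zero output.
        return self.outputs.get(region, 0.0)

    def fit(self, teacher):
        # Assumption: "learning" simply memorises the teacher table.
        self.outputs = dict(teacher)


class LearningLayer:
    def __init__(self, regions):
        self.execution = TableNet()  # network currently used for control
        self.learning = TableNet()   # network being trained in the background
        self.teacher = {r: 0.0 for r in regions}

    def update_teacher(self, recent_evolution_outputs):
        """Claim 19 idea: refresh only the regions visited in the recent period.

        New teacher value = evolutionary-layer output + execution-network
        output; teacher data for all other regions is left unchanged.
        """
        for region, evo_out in recent_evolution_outputs.items():
            self.teacher[region] = evo_out + self.execution.predict(region)

    def learn_and_swap(self):
        """Claim 20 idea: once learning finishes, the two networks swap roles."""
        self.learning.fit(self.teacher)
        self.execution, self.learning = self.learning, self.execution


layer = LearningLayer(regions=range(4))
layer.update_teacher({1: 0.5})             # only region 1 was visited
layer.learn_and_swap()
layer.update_teacher({1: 0.25, 2: -0.1})   # regions 1 and 2 visited next
layer.learn_and_swap()
print(layer.execution.outputs)
```

In this sketch the control path always reads from `execution`, so training never disturbs the running controller, and regions that were not visited recently keep their earlier teacher values rather than being overwritten.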