JPH08315141A

JPH08315141A - Image recognition device

Info

Publication number: JPH08315141A
Application number: JP7115756A
Authority: JP
Inventors: Yuichi Ishizuka; 裕一石塚
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1995-05-15
Filing date: 1995-05-15
Publication date: 1996-11-29

Abstract

PURPOSE: To obtain an image recognition device which recognizes an image or confirms a frequency by feeding back and correcting the weight and threshold, and making multi-stage connections and processing and compressing two-dimensional input information at a time. CONSTITUTION: This device is equipped with neuron type threshold elements having weight coefficients for plural inputs respectively and a storage part which is constituted by hierarchically connecting the neuron type threshold elements in a pyramid shape so that the output of the top layer neuron is regarded as a final output and also variably sets the values of the weight coefficients and the threshold. In addition to the basic constitution, compressed information obtained in an intermediate layer of the hierachy is also utilized. In the basic constitution, plural inputs are regarded as two inputs and 2-input 1-output neuro type threshold elements 21 are used.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は主に画像の認識に係わ
り、画像を特殊な構造および性質を持ったニューロン型
素子を利用して、圧縮・周波数変換等の画像処理を行
い、認識率および処理速度を改善した画像の認識装置に
関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention mainly relates to image recognition, in which an image is subjected to image processing such as compression and frequency conversion by using a neuron-type element having a special structure and property, and the recognition rate and The present invention relates to an image recognition device with improved processing speed.

【０００２】[0002]

【従来の技術】画像を認識する際、冗長な情報をできる
だけ省いて必要最小限の（圧縮）画像情報だけで認識が
行えれば、処理時間の観点から見てその方が望ましい。
従来より画像の圧縮方式は様々提案されてきたが、それ
らの多くは主に通信に用いるための圧縮技術であった。
例えば図１８は、特開昭６２−２１９７６６で示された
従来の画像圧縮装置の回路図である。図に示されたよう
に、走査するライン上で類似した情報をカウントしてま
とめあげてゆくランレングス符号化方式を改良してい
る。具体的に言うと、圧縮するデータの開始点にマーク
を付け、類似情報がある限り、その長さがランレングス
カウンタにインクリメンタルに加算され、最後に終了点
のマークをつけて圧縮コードとする方式である。しかし
この方法はある特定の方向に向かって走査した情報を、
１次元の路線を通じて高効率で伝達する場合には有効で
あるが、２次元には異なる情報を持つので、２次元の画
像全体を一定の解像度、すなわち同じ圧縮率で変換し、
解析／認識したい目的にはかえって扱いにくい方法、装
置である。2. Description of the Related Art When recognizing an image, it is desirable from the viewpoint of processing time if redundant information can be eliminated as much as possible and recognition can be performed using only the minimum necessary (compressed) image information.
Conventionally, various image compression methods have been proposed, but most of them have been compression techniques mainly used for communication.
For example, FIG. 18 is a circuit diagram of a conventional image compression apparatus shown in Japanese Patent Laid-Open No. 62-219766. As shown in the figure, the run-length coding method is improved in which similar information is counted and summarized on the scanning line. Specifically, the start point of the data to be compressed is marked, and as long as there is similar information, its length is incrementally added to the run length counter, and the end point is marked at the end to make a compressed code. Is. However, this method uses the information scanned in a certain direction,
It is effective for high-efficiency transmission through a one-dimensional route, but since two-dimensional information has different information, the entire two-dimensional image is converted with a constant resolution, that is, the same compression ratio,
It is a method and device that is rather cumbersome for the purpose of analysis / recognition.

【０００３】このような問題点を解決する１つの手法と
して、認識処理用に画像の全点で解像度のレベルを合わ
せた圧縮方式がある。例えばこれまでに、画像をいくつ
かのブロックに分け、ブロック毎に平均値を取って圧縮
データとする一般的な処理方法があった。これに対して
図１９は、特開平５−６１９７６で示された他の従来の
画像処理方法である。図の方法では、画像の輝度の分散
率を算出し補正することで圧縮の精度を高め、これによ
り認識率の向上を目指している。しかしこの方法も画像
をある決まった割合で圧縮する場合は有効であるかもし
れないが、そうでない場合、つまり中途半端な縮尺倍率
による圧縮や、複数解像度で表現された画像を何枚か用
いて、統合処理したい場合には、その都度ブロック分割
の方法から始まって各解像度表現を導かなければならな
い。このためかなり複雑な手順と時間が必要である。As one method for solving such a problem, there is a compression method in which the resolution level is matched at all points of the image for recognition processing. For example, heretofore, there has been a general processing method in which an image is divided into several blocks and an average value is taken for each block to obtain compressed data. On the other hand, FIG. 19 shows another conventional image processing method shown in JP-A-5-61976. In the method shown in the figure, the accuracy of compression is improved by calculating and correcting the dispersion ratio of the luminance of the image, and thereby the recognition rate is improved. However, this method may be effective when compressing the image at a certain ratio, but if it is not, that is, compression with a halfway scale factor or using several images expressed in multiple resolutions. Whenever it is desired to perform integrated processing, each resolution expression must be derived starting from the block division method. This requires a fairly complicated procedure and time.

【０００４】また画像をニューラルネットワークを使っ
て認識するものとしては、図２０に示された特開平２−
３００８６８の方法がある。これは画像データを直接ニ
ューラルネットワークに入力するのではなくて、あらか
じめ画像処理内容を解析し、その処理に特化した最適な
ニューラルネットワークの構造を構築し処理するもので
ある。図では論理演算（（Ａ and Ｂ） and （Ｃ
and Ｄ））を実行する（ａ）（ｂ）２種類のニュー
ラルネットワークを示している。記号 and は排他的論
理和を示し、またＡ，Ｂ，Ｃ，Ｄは『１』または『０』
のデジタル値とする。例えばＡ，Ｂ，Ｃ，Ｄとして『０
００１』が入力されると、両方のネットワークとも最左
翼にある入力層において上から３番目のニューロンだけ
が出力を出す。各ニューロンの上に記した『ｘｘ０１』
という数字は下位２ビットが０１であれば出力を出せる
ことを表している。ここで『ｘ』は０、１どちらでもよ
い。以下同様にネットワークをたどっていくと、この２
つのニューラルネットワークは、結局上述した論理演算
を行える２種類の異なる構造のニューラルネットワーク
であることが判る。（ｂ）がこの公開特許公報における
手法を使い最適化処理が施されたニューラルネットワー
クであり、（ａ）に比べてニューロン数は同数である
が、リンクの本数で６本少なく、その分だけ計算量を減
らすことができる。この構成は、まず入力と期待される
出力が判っている時に最適なニューラルネットワークの
構造を求めるのもである。しかし、ここで示されている
方法はニューロン数をいかに減らすかを目的としたもの
であり、認識処理用にその構造が工夫されたものではな
い。また中間層の出力に対しても、“認識のためのデー
タ”として用いることを考慮したシステムではなく、中
間層の出力に以降の処理に有効な情報が現れるとは考え
られない。A technique for recognizing an image using a neural network is shown in FIG.
There are 300868 methods. This is not to directly input image data to a neural network, but to analyze the image processing contents in advance and construct and process an optimal neural network structure specialized for the processing. In the figure, logical operations ((A and B) and (C
(a) and (b) two types of neural networks that perform (and D)). The symbol and indicates exclusive OR, and A, B, C, D are "1" or "0".
The digital value of. For example, if A, B, C, D is "0
When "001" is input, only the third neuron from the top in the leftmost input layer of both networks outputs. "Xx01" written above each neuron
The number indicates that if the lower 2 bits are 01, an output can be output. Here, "x" may be 0 or 1. If you follow the network in the same way,
It can be seen that the two neural networks are two types of neural networks having different structures that can perform the above-described logical operation. (B) is a neural network that has been subjected to an optimization process using the method disclosed in this patent, and the number of neurons is the same as that in (a), but the number of links is 6 less, and only that much is calculated. The amount can be reduced. With this configuration, first, when the input and expected output are known, the optimum neural network structure is obtained. However, the method shown here is aimed at reducing the number of neurons, and its structure is not devised for recognition processing. Further, even for the output of the intermediate layer, it is not a system considering the use as "data for recognition", and it is unlikely that effective information will appear in the output of the intermediate layer for the subsequent processing.

【０００５】[0005]

【発明が解決しようとする課題】従来の画像認識システ
ムは上記のように構成されており、バランスがとれた実
用的な全体システムが得られない。対象の性質の正確な
把握と同様に、分析側の特性とそれが対象に及ぼす影響
も熟知されていなければならず、情報統合の方式が判っ
ていない今の段階ではこれは難しい。現在、ある程度の
大きさを持った空間フィルター並びにコンボリューショ
ンなどの処理を高速に実行するマイクロプロセッサも登
場してきてはいるが、このような確定的な処理では対象
の多目的な特徴抽出はできないことが多い。Since the conventional image recognition system is constructed as described above, a balanced and practical whole system cannot be obtained. As well as accurate understanding of the nature of the object, the characteristics of the analysis side and its effect on the object must be well known, which is difficult at this stage when the method of information integration is unknown. At present, a spatial filter with a certain size and a microprocessor that executes processing such as convolution at high speed have been introduced, but it is not possible to extract the target multipurpose features by such deterministic processing. Many.

【０００６】本発明は上記の課題を解決するためになさ
れたもので、ニュートラルネットワークを用いて２次元
の入力情報の加工・圧縮周波数解析から認識までを一度
に帰還修正し、最適な認識システムを実現することを目
的としている。The present invention has been made in order to solve the above-mentioned problems, and an optimum recognition system is provided by performing feedback correction at once in processing and compression frequency analysis of two-dimensional input information using a neutral network to recognition. It is intended to be realized.

【０００７】[0007]

【課題を解決するための手段】本発明に係る画像認識装
置は、複数の入力に対してそれぞれ重み係数を持ったニ
ューロン型閾値素子と、このニューロン型閾値素子をピ
ラミッド型に階層接続してその最上層ニューロンの出力
を最終出力とする構成とし、かつ上記重み係数の値と閾
値を可変設定する記憶部を備えた。An image recognition apparatus according to the present invention comprises a neuron type threshold element having weighting coefficients for a plurality of inputs, and a pyramid type hierarchical connection of the neuron type threshold elements. The configuration is such that the output of the uppermost layer neuron is the final output, and a storage unit for variably setting the value of the weighting factor and the threshold value is provided.

【０００８】また更に基本構成に加えて、切り換えて階
層の任意の中間層の、任意の座標位置の中間層出力を最
終出力として出力するようにした。Further, in addition to the basic structure, the output of the intermediate layer at any coordinate position of any intermediate layer of the hierarchy is switched and output as the final output.

【０００９】また更に基本構成において、複数入力は２
入力とし、２入力１出力のニューロ型閾値素子を用いる
ようにした。Further, in the basic structure, the number of inputs is 2
A neuro-type threshold element having two inputs and one output is used as an input.

【００１０】また更に基本構成において、重み係数の値
と閾値は他のニューロン型閾値素子の階層接続の一部の
認識結果に基づいて変更するようにした。Furthermore, in the basic configuration, the value of the weighting factor and the threshold value are changed based on the recognition result of a part of the hierarchical connection of other neuron type threshold value elements.

【００１１】また更に、重み係数の値または閾値は他の
ニューロン型閾値素子の階層接続の一部の距離情報の認
識結果に基づいて変更するようにした。Furthermore, the value of the weighting factor or the threshold value is changed based on the recognition result of the distance information of a part of the hierarchical connection of other neuron type threshold value elements.

【００１２】[0012]

【作用】この発明による画像認識装置は、複数の入力に
対してそれぞれ重み係数と閾値で定まる出力がピラミッ
ド型に階層伝達され、最終出力は最下層の情報が圧縮さ
れた情報となって得られる。各ニューロンの出力は、そ
の入力の重みと閾値関数により入力が強調、平滑、また
はフィルタリングされて得られる。この作用は重み係数
と閾値とを必要により変えて何度でも繰り返されて、例
えば標準パターンと比較されて画像認識される。In the image recognition apparatus according to the present invention, the outputs determined by the weighting factor and the threshold value are hierarchically transmitted to a plurality of inputs in a pyramid type, and the final output is obtained by compressing the information of the lowermost layer. . The output of each neuron is obtained by emphasizing, smoothing, or filtering the input by the weight of the input and a threshold function. This action is repeated as many times as necessary by changing the weighting factor and the threshold value, and image recognition is performed by comparing with a standard pattern, for example.

【００１３】また更に、任意の階層の中間層の認識結果
が最終出力とされて、少ない標準パターンと比較されて
画像認識される。Furthermore, the recognition result of the intermediate layer of an arbitrary hierarchy is used as the final output and compared with a small number of standard patterns for image recognition.

【００１４】また更に、各ニューロンの入力は２入力で
あり、各入力が加算、差分演算されて出力される。Furthermore, each neuron has two inputs, and each input is subjected to addition and difference calculation and output.

【００１５】また更に、重み係数の値と閾値が変更さ
れ、画像認識結果に基づいて強調、フィルタリングされ
て得られる。Furthermore, the value of the weighting factor and the threshold value are changed, and the image is obtained by being emphasized and filtered based on the image recognition result.

【００１６】また更に、重み係数の値または閾値が階層
接続の一部の距離情報の認識結果に基づいて変更され、
周波数特性が変更される。Furthermore, the value of the weighting factor or the threshold value is changed based on the recognition result of the distance information of a part of the hierarchical connection,
The frequency characteristic is changed.

【００１７】[0017]

【Example】

実施例１．本発明の実施例を説明する前に、まず一般的
な画像を取り込む入力部分から説明する。通常画像情報
は膨大な情報量を含んでいる。例えば５１２ｘ５１２画
素で各画素８ビットの濃淡画像なら、約２１０万ビット
になる。さらに各画素Ｒ，Ｇ，Ｂそれぞれ８ビットで表
されているとするなら、その６４倍の情報量が１枚の画
像に含まれることになる。この膨大な情報量を持つ画像
を高速に分析し、認識を行うためにもこうしたデータ量
の多さはボトルネックとなっている。しかし当然のこと
ながら、認識においては全ての画像情報が必要なわけで
はない。対象によっては部分的な情報だけで、十分認識
が行えることも多い。またその方が扱うデータ量も少な
く、処理時間も短縮できるので好都合である。別の言葉
で言えば認識にあまり関係のない余計な情報がたくさん
含まれているとも言える。主に画像データの伝送分野で
情報削減方法が多く研究されてきた。しかしその方法は
伝送上の効率を追求したもので、例えば一次元データの
圧縮が必ずしも２次元データの情報を正しく与えず、圧
縮した画像を加工・処理する認識用途には逆に不便にな
ることが多い。また画像をいくつかのブロックに分け、
そのブロックの中の平均値及び分散などを算出し圧縮す
る別の画像圧縮方法があるが、これは結局のところ画像
認識よりは圧縮の方を強く考慮したシステムであり、柔
軟性や拡張性に欠ける。Example 1. Before describing an embodiment of the present invention, first, an input portion for capturing a general image will be described. Usually, the image information contains a huge amount of information. For example, a grayscale image of 512 × 512 pixels and 8 bits for each pixel has approximately 2.1 million bits. Further, if each pixel R, G, and B is represented by 8 bits, the amount of information that is 64 times that is included in one image. Such a large amount of data is a bottleneck in order to analyze and recognize images with this enormous amount of information at high speed. However, it goes without saying that not all image information is necessary for recognition. Depending on the object, it is often possible to fully recognize it with only partial information. Further, it is convenient because the amount of data handled by that person is small and the processing time can be shortened. In other words, it can be said that it contains a lot of extra information that is not related to recognition. A lot of researches have been conducted on information reduction methods mainly in the field of image data transmission. However, this method pursues transmission efficiency. For example, the compression of one-dimensional data does not always give the information of two-dimensional data correctly, which makes it inconvenient for the recognition application to process and process the compressed image. There are many. Also divide the image into several blocks,
There is another image compression method that calculates the average value and variance etc. in the block and compresses it, but in the end, this is a system that strongly considers compression rather than image recognition, and it has flexibility and expandability. Lack.

【００１８】以下、本発明の概念を説明する。図１はそ
の構成概念を示す図であり、対称形の特殊なニューラル
ネットワークを利用している。図において、１１は入力
の対象画像、１は本発明のニューラルネットワーク、２
１はその最下層のニューロン、３１は最上層のニューロ
ンを表す。また図２は本発明のニューラルネットワーク
の出力を利用してパターン解析を行い総合判断する画像
認識システムを示した図である。図において、２は後に
説明する各ニューロンのシナプスの重み、３は同じくニ
ューロンの閾値である。４はニューラルネットワークの
出力の圧縮画像情報、８は内部にプロセッサ７と、特徴
抽出部５と、パターンマッチング部６を含む総合判断部
である。図３は、図２に示す画像認識システムの動作を
示す動作フローチャート図である。本発明はニューラル
ネットワークに関する部分であるので、この総合判断部
の動作は以下に簡単に説明をしておく。即ち、総合判断
部８は、ステップＳ３でシナプス重み２と、ニューロン
閾値３の値を設定し、ステップＳ４で圧縮画像４を得
る。ステップＳ５で必要な周波数間隔が得られれば、ス
テップＳ６で特徴を抽出する。ステップＳ７でフィルタ
処理をし、ステップＳ８で所定の記憶している基準の画
像と比較して、ステップＳ９で目標を定める。ステップ
Ｓ１１で確かであれば、画像認識動作は終了する。The concept of the present invention will be described below. FIG. 1 is a diagram showing the configuration concept, and uses a special symmetric neural network. In the figure, 11 is an input target image, 1 is a neural network of the present invention, 2
Reference numeral 1 denotes a lowermost neuron, and 31 denotes a topmost neuron. FIG. 2 is a diagram showing an image recognition system for making a comprehensive judgment by performing pattern analysis using the output of the neural network of the present invention. In the figure, 2 is a synapse weight of each neuron, which will be described later, and 3 is a threshold value of the neuron. Reference numeral 4 is a compressed image information output from the neural network, and reference numeral 8 is a comprehensive determination unit including a processor 7, a feature extraction unit 5, and a pattern matching unit 6 therein. FIG. 3 is an operation flowchart showing the operation of the image recognition system shown in FIG. Since the present invention relates to a neural network, the operation of this comprehensive judgment unit will be briefly described below. That is, the comprehensive judgment unit 8 sets the values of the synapse weight 2 and the neuron threshold value 3 in step S3, and obtains the compressed image 4 in step S4. If the required frequency interval is obtained in step S5, the feature is extracted in step S6. Filtering is performed in step S7, comparison is made with a predetermined stored reference image in step S8, and a target is set in step S9. If the result is correct in step S11, the image recognition operation ends.

【００１９】さてニューラルネットワークの部分に戻っ
て説明をする。圧縮という意味において最も簡単な２入
力１出力のニューロンを使ったネットワークを考える
と、図４のような２分木の構造となる。入力画像に一番
近い又は低いレベルのニューロンへは単純に、入力画像
上である方向に１〜数画素離れた画像の輝度値が入力さ
れるものとする。ここで、わかりやすくするために、入
力はカラー画像ではなく、濃淡画像、それも０〜２５５
階調であるとして説明する。更に、輝度レベルは０が最
も暗く、２５５が最も明るいものとする。１つ１つのニ
ューロンとその閾値は具体的には図５の（ａ）、
（ｃ）、（ｄ）のようなものとする。Now, returning to the neural network portion, description will be made. Considering the simplest 2-input 1-output neuron network in terms of compression, a binary tree structure as shown in FIG. 4 is obtained. It is assumed that the brightness value of an image that is one to several pixels apart in a certain direction on the input image is simply input to the neuron closest to or lower than the input image. Here, for the sake of clarity, the input is not a color image, but a grayscale image, which is also 0-255.
It will be described as gradation. Further, regarding the brightness level, 0 is the darkest and 255 is the brightest. Each neuron and its threshold value are specifically shown in FIG.
(C) and (d).

【００２０】ニューロンを使った画像処理の詳細を画像
の強調処理を示す図６を用いて説明する。図において、
ニューロンは入力に重みを掛けて画像を処理する。２本
の入力線の末端にはシナプス結合重みがあり、図では
今、仮に上側がα（０．５）下側がβ（２）であるもの
とする。このような場合、ニューロンは入力線の上側の
データを縮小し、下側のデータを拡大する役割を持つ。The details of the image processing using neurons will be described with reference to FIG. 6 showing the image enhancement processing. In the figure,
The neuron processes the image by weighting the input. There are synaptic connection weights at the ends of the two input lines, and in the figure, it is assumed that the upper side is α (0.5) and the lower side is β (2). In such a case, the neuron has a role of reducing the data on the upper side of the input line and expanding the data of the lower side.

【００２１】また図７の（ｃ）のように入力に正負対象
の重みを掛けた場合、閾値関数が（ａ）の形状である
と、画素の差分情報がそのまま出力される。また（ｂ）
の閾値関数を用いると、（ｄ）のように差分情報が強調
されて出力されることになる。この場合、画像を上の方
から順に見た時、その輝度が暗から明に変わるところと
その逆に明から暗に変化するところが強調され検出され
ることになる。また強調の反対の“ぼかし”または平滑
化は閾値関数の傾きを緩やかにすることで実現できる。
図８は、ニューロンの入力画像に対する重みを変えて設
定した、他の回路を表す図である。図８（ｃ）の接続
は、入力画像の平均値を得る回路となり、図８（ｄ）
は、入力の平均値を強調する回路となる。ところで閾値
関数を図７（ｂ）に示すような非線形にするのは、強調
以外に耐ノイズ性を向上することも重要な目的である。
図４において、１１は画像、２１はニューロンであり、
２２はニューロン間を結ぶネットワークである。また図
５は、図４に示す構成中の各ニューロンの入出力の関係
と閾値関数を示す図である。図において、４１は入力線
路、４２はシナプス、４３はニューロンの軸索、つまり
出力線路である。４４は線形出力関数であり、４５が強
調処理、４６が平滑化用の出力関数である。４７は非線
形出力関数であり、この場合はノイズ除去能力がある。Further, when the input is multiplied by the weight of positive / negative object as shown in FIG. 7C, if the threshold function has the shape of FIG. 7A, the pixel difference information is output as it is. (B)
When the threshold function of is used, the difference information is emphasized and output as in (d). In this case, when the images are viewed in order from the top, the place where the brightness changes from dark to bright and the place where the brightness changes from light to dark are emphasized and detected. Also, "blur" or smoothing, which is the opposite of emphasis, can be realized by making the slope of the threshold function gentle.
FIG. 8 is a diagram showing another circuit set by changing the weight of the input image of the neuron. The connection shown in FIG. 8C is a circuit for obtaining the average value of the input image, and the connection shown in FIG.
Is a circuit that emphasizes the average value of the input. By the way, making the threshold function non-linear as shown in FIG. 7B is also an important purpose to improve noise resistance in addition to enhancement.
In FIG. 4, 11 is an image, 21 is a neuron,
22 is a network connecting the neurons. Further, FIG. 5 is a diagram showing the input / output relationship and the threshold function of each neuron in the configuration shown in FIG. In the figure, 41 is an input line, 42 is a synapse, and 43 is an axon of a neuron, that is, an output line. 44 is a linear output function, 45 is an enhancement process, and 46 is an output function for smoothing. 47 is a non-linear output function, and in this case, it has a noise removing capability.

【００２２】次に上記構成のニューラルネットワークの
動作を説明する。図９は、図４の基本構成でニューロン
を７層構成とし、入力画像の濃淡度を図示の階調とした
場合のシステム説明図である。図において、画像に最も
近いニューロンの層は縦方向最左翼の第「１」層であ
り、最も遠い層のニューロンは第「７」層であるとす
る。また第１層のニューロンへの入力は画像の輝度値に
対応しているものと仮定する。図９（ａ）では、２画素
離れた情報を第３層で検出しようとするものであり、ま
た図９（ｂ）では、４画素分離れた画像情報を第５層で
検出しようとするものである。図ではこれからの説明に
関連するニューロンにパターンを付加して区別してあ
る。また各ニューロンは図のように（７、４）などとい
った斜交した座標系を使っても表現できるものとする。Next, the operation of the neural network having the above configuration will be described. FIG. 9 is a system explanatory diagram in the case where the neuron has a seven-layered structure in the basic structure of FIG. 4 and the input image has the gradation level shown in the drawing. In the figure, it is assumed that the neuron layer closest to the image is the "1" layer of the leftmost wing in the vertical direction, and the neuron of the farthest layer is the "7" layer. It is also assumed that the input to the neurons in the first layer corresponds to the brightness value of the image. In FIG. 9 (a), information of two pixels apart is detected in the third layer, and in FIG. 9 (b), image information of four pixels separated is detected in the fifth layer. Is. In the figure, patterns are added to the neurons relevant to the following description to distinguish them. Each neuron can also be expressed by using a diagonal coordinate system such as (7, 4) as shown in the figure.

【００２３】まず、上記座標系の記号を使って、本構成
によるニューラルネットワークが周波数解析を行えるこ
とを説明する。例えば図９（ａ）において、ニューロン
（３、３）の出力はニューロン（１、１）とニューロン
（３、１）のデータから決まるようにするためにはま
ず、ニューロン（２、２）の入力の下側の重みをゼロと
おき、さらにニューロン（３、２）の上側の重みをゼロ
と置くことにより求めることができる。これ以外にニュ
ーロン（２、１）の出力が常にゼロとおいても、このよ
うな１画素おきのデータをアクセスすることができる。
このようにネットワークの構造そのものを、その都度変
更して必要なデータにアクセスすることができる。次は
データの加工である。ニューロン（３、３）の２本の入
力の重みとして、正負対の絶対値が１以上の実数を与え
ると、既にアクセス済みのニューロン（１、１）と
（３、１）のデータの差分を増幅することができる。こ
こに図９の各ニューロンが図１０のような閾値関数を取
るとすると、ある一定距離離れた２点間、又はフィルタ
ーが２次元なら２線分間の輝度が同じなら、強く出力さ
れ、輝度が違うと出力は小さくなることになる。つまり
ある一定距離離れた画素に周期性があれば、その近傍は
強く出力されるし、周期性がなければ弱く出力されるこ
とになる。このようにして画像の周波数分解を行うこと
ができる。First, it will be described that the neural network according to this configuration can perform frequency analysis by using the symbols of the coordinate system. For example, in FIG. 9A, in order to determine the output of the neuron (3, 3) from the data of the neuron (1, 1) and the data of the neuron (3, 1), first, the input of the neuron (2, 2) is input. It can be determined by setting the lower weight to zero and further setting the upper weight of the neuron (3, 2) to zero. Besides this, even if the output of the neuron (2, 1) is always zero, such data every other pixel can be accessed.
In this way, it is possible to access the necessary data by changing the network structure itself each time. Next is data processing. When a real number whose absolute value of positive / negative pair is 1 or more is given as the weight of two inputs of the neuron (3, 3), the difference between the data of the already accessed neurons (1, 1) and (3, 1) is calculated. Can be amplified. If each neuron of FIG. 9 takes a threshold function as shown in FIG. 10, if two points separated by a certain distance, or if the brightness of two lines is the same if the filter is two-dimensional, it is strongly output and the brightness is If it is different, the output will be smaller. That is, if the pixels separated by a certain distance have periodicity, the vicinity thereof is output strongly, and if there is no periodicity, it is output weakly. In this way, frequency decomposition of the image can be performed.

【００２４】上述の説明では中間ニューロンの各入力に
対する重みを０又は１にしてデータがそのまま上位の層
のニューロンに伝達するように説明したが、これにも様
々な値を持たせることにより、もう少し木目の細かな分
析も可能である。たとえばウェーブレット変換基底やガ
ボール関数のような複雑な関数との畳み込み積分も、こ
の単純な２分木構造のネットワークの重みと閾値関数を
変更することにより計算することができる。上記では２
入力１出力素子の例を説明したが図１に示す基本構成に
おいても同様な効果が得られる。In the above description, the weight for each input of the intermediate neuron is set to 0 or 1, and the data is transmitted as it is to the neurons in the upper layer. However, by giving various values to this as well, a little more can be obtained. A detailed analysis of the grain is also possible. For example, a convolution integral with a complex function such as a wavelet transform basis or a Gabor function can also be calculated by changing the weight and threshold function of this simple binary tree structure network. 2 in the above
Although the example of the input / output device has been described, the same effect can be obtained in the basic configuration shown in FIG.

【００２５】実施例２．本実施例では、多重解像度を利
用した画像認識装置を説明する。実施例１での図７や図
８を用いた装置で説明したように、各ニューロンに適当
な重みと閾値関数を選ぶことにより、入力画像の強調、
ぼかし、エッジ検出等が行える。このような平均値また
はそれに準じる値を取りながら２画素を１画素にまとめ
てゆく手法を述べる。この方法では入力画像上の目標物
の対照的な位置情報は、入力画像から遠い階層のニュー
ロンでも階層内での位置関係はそのまま相似的に保つよ
うにすることができる。図１１はこのことを示す各層の
ニューロンの出力信号図である。図から判るように、画
像から遠い階層のニューロンの出力が、画像の圧縮情報
そのものとして得られる。更に、画像の絶対輝度値と、
重み係数や閾値関数の非線形部の傾斜を指定して精度を
向上することができる。次に上述したニューラルネット
ワークを使って得られた情報の利用法について言及す
る。説明の都合上、次の言葉を定義しておく。図中画像
に近い方をニューラルネットワークの階層では『低い』
あるいは『下位』の層であるとする。逆にネットワーク
が先細りしてゆく方を『高い』、『上位』の層であると
する。このように画像から遠い階層のニューロンの出力
は画像の圧縮情報そのものにすることができる。勿論正
確には、画像の絶対輝度値と、増幅を行う重み係数や、
非線形関数の傾き等を細かく指定する必要があるが、こ
れにより圧縮画像を使い、認識処理を効率良く行うこと
ができる。Example 2. In this embodiment, an image recognition device using multi-resolution will be described. As described in the apparatus using FIG. 7 and FIG. 8 in the first embodiment, by emphasizing the input image by selecting appropriate weights and threshold functions for each neuron,
Blurring, edge detection, etc. can be performed. A method of collecting two pixels into one pixel while taking such an average value or a value similar thereto will be described. With this method, the positional information of the target on the input image, which is in contrast to the target image, can be kept similar to the positional relationship in the hierarchy even for neurons in the hierarchy far from the input image. FIG. 11 is an output signal diagram of neurons of each layer showing this. As can be seen from the figure, the output of the neurons in the hierarchy far from the image is obtained as the compressed information itself of the image. Furthermore, the absolute brightness value of the image,
The accuracy can be improved by specifying the weighting coefficient or the slope of the non-linear portion of the threshold function. Next, a method of using the information obtained by using the above-mentioned neural network will be mentioned. For convenience of explanation, the following words are defined. The one closer to the image in the figure is "low" in the hierarchy of the neural network
Alternatively, it is assumed to be the “lower” layer. Conversely, the direction in which the network tapers is considered to be the “high” or “upper” layer. In this way, the output of the neuron in the layer far from the image can be the compressed information itself of the image. Of course, to be exact, the absolute brightness value of the image, the weighting coefficient for amplification,
Although it is necessary to specify the slope of the non-linear function in detail, this makes it possible to use the compressed image and efficiently perform the recognition process.

【００２６】図１２は、本実施例の解像度を切換える装
置の概念を説明するための認識ニューラルネットと画像
圧縮部を示す図である。本装置では、入力画像の認識を
以下の順序で行う。即ち、最初粗い解像度で画面全体を
見渡し、画面内の状況、つまりテクスチャーならびに注
目箇所を抽出する。次に注目箇所に対してより解像度の
高い、つまり情報量の多いデータを用いて、注目箇所の
細部の認識を行う。図の右側Ａで示すように、低解像度
でしかもデータ量の少ない側で、物体の大まかな見当を
つける。このとき、微調整のための閾値とシナプスの重
みの変更要求をＣで示すようにフィードバックする。こ
れにより物体の解像度の微調整を行い、確度を上げる。
なお、位置や方向や大きさの大きな変更は図のＢで示す
ように認識用ニューラルネットワークそのものをシフト
させる必要がある。FIG. 12 is a diagram showing a recognition neural network and an image compression unit for explaining the concept of the resolution switching apparatus of this embodiment. This apparatus recognizes the input image in the following order. That is, first, the entire screen is viewed with a coarse resolution, and the situation in the screen, that is, the texture and the point of interest are extracted. Next, the details of the point of interest are recognized using the data with higher resolution, that is, the amount of information for the point of interest. As shown on the right side A of the figure, a rough estimation of the object is made on the side having a low resolution and a small amount of data. At this time, a threshold change request for fine adjustment and a synapse weight change request are fed back as indicated by C. This fine-tunes the resolution of the object and increases the accuracy.
It should be noted that when the position, direction, or size is greatly changed, it is necessary to shift the recognition neural network itself as shown by B in the figure.

【００２７】図１３は、このことを判り易く説明する図
である。図１３（ｂ）は図１３（ａ）で各層が平面的に
構成されているものを、１次元で等価的に表現した構成
である。この画像認識は、画像中にある大きさで描かれ
た識別目標を検出するためのものとする。例えば入力
層、中間層、出力層の３層構成で、入力層が１６×１６
＝２５６個のニューロンからなるとする。出力層は入力
パターンを分類しようとする数だけ必要となり、仮に今
入力パターンを３つに分類するとし、どれか一つの出力
ニューロンの発火がその選択されたパターンを表すとす
れば出力ニューロンは３個になる。もし、従来のように
出力層が固定であると、方向や大きさの変化も含めて比
較のための標準パターン数は極端に多い数が必要とな
る。認識開始時は、対象が画像全体の中にどの位の大き
さで描かれているか、あるいはどこに描かれているかさ
え判らない状態である。両方とも一度に決定することは
難しいので、今の場合、簡単のために大きさは判ってい
るとする。このような場合にまず画像中の目標物の位置
ずれ、つまりどこかに描かれているか判らない場合、図
１３（ｃ）のように、この認識ネットワークをある解像
度の画像中にシフトさせながら、順に調べて行けば見つ
かる。即ち、図１２のＢの座標位置を切り換えていって
順に走査することを意味する。FIG. 13 is a diagram for explaining this easily. FIG. 13B shows a one-dimensional equivalent representation of the planar structure of each layer in FIG. 13A. This image recognition is for detecting an identification target drawn in a certain size in the image. For example, the input layer has a three-layer structure including an input layer, an intermediate layer, and an output layer, and the input layer is 16 × 16
= 256 neurons. As many output layers as necessary to classify the input patterns are needed, and if the input patterns are classified into three now, and the firing of any one of the output neurons represents the selected pattern, the output neurons will be three. Become individual. If the output layer is fixed as in the conventional case, an extremely large number of standard patterns is required for comparison, including changes in direction and size. At the start of recognition, the size of the object in the entire image, or even where it is drawn, is unknown. Since it is difficult to determine both at once, it is assumed that the size is known for simplicity in this case. In such a case, if the position of the target in the image is misaligned, that is, if it is not known where it is drawn, while shifting the recognition network into the image of a certain resolution as shown in FIG. You can find out by examining them in order. That is, it means that the coordinate position of B in FIG. 12 is switched and scanning is performed in order.

【００２８】次に大きさが判らない場合を考える。但し
やはり簡単のため画像中のどこにそれが描かれているか
は判っているものとする。この場合は図１３（ｄ）で示
す点線の矢印方向のシフトは、図１２のＡ方向に中間層
出力の選択を切り換えて行くことを意味する。即ち、遠
くから眺めることで画像全体中では小さく示されている
認識対象を容易に見つけ出すことができる。こうして各
種の大きさの標準パターンを用意しなくてもよく、同様
に図１２のＢの座標位置を回転させれば、回転した標準
パターンを省くこともできる。このような多段階の処理
が、ニューロンの重み、閾値、閾値関数を変更するだけ
で簡単に行える。またこのような注目する局所情報の抽
出、あるいは特微抽出により、画像全体に渡り高解像度
で処理する必要がなく、これ以降の処理の高速化にもつ
ながる。さらに認識の際、大局的に画面全体から得られ
た情報を、局所的な処理を行う空間フィルターの大きさ
や傾きなどにフィードバックし、逆に局所的な情報を全
体にフィードバックさせることにより、背景と注目箇所
の分離等に大きな効果が期待できる。Next, consider the case where the size is unknown. However, for simplicity, it is assumed that it is known where it is drawn in the image. In this case, shifting in the direction of the dotted arrow shown in FIG. 13D means switching the selection of the intermediate layer output in the direction A of FIG. That is, by observing from a distance, it is possible to easily find a small recognition target in the entire image. Thus, it is not necessary to prepare standard patterns of various sizes, and similarly, if the coordinate position of B in FIG. 12 is rotated, the rotated standard pattern can be omitted. Such multi-step processing can be easily performed only by changing the weight of the neuron, the threshold value, and the threshold function. Further, by extracting the local information of interest or the feature extraction, it is not necessary to process the entire image with high resolution, which leads to speeding up of the subsequent processes. In addition, when recognizing, the information obtained from the entire screen is fed back to the size and inclination of the spatial filter that performs local processing, and conversely, the local information is fed back to the entire background, A great effect can be expected in separating the points of interest.

【００２９】実施例３．上記実施例では、入力画面の取
り組みに関しては図４のように縦方向の画素に対して処
理を行うニューラルネットワークを、横方向に並べる例
を説明した。これに対し、図１４に示すように９０度回
転した入力接続としてもよい。また図１５に示すよう
に、従来の平面状のブロックの平均を取る方法と同じ構
成のネットワーク構成を用いることもできる。これらの
ニューラルネットワークのシナプスの重みと出力関数を
適宜変化させることにより、画像の回転に強いネットワ
ークの構成や、エッジ検出など多くの空間フィルターを
構成することができる。Example 3. In the above embodiment, regarding the approach of the input screen, the example in which the neural networks for processing the pixels in the vertical direction are arranged in the horizontal direction as shown in FIG. 4 has been described. On the other hand, the input connection may be rotated by 90 degrees as shown in FIG. Further, as shown in FIG. 15, it is also possible to use a network configuration having the same configuration as the conventional method of averaging planar blocks. By appropriately changing the synaptic weights and output functions of these neural networks, it is possible to configure a network that is resistant to image rotation and many spatial filters such as edge detection.

【００３０】実施例４．認識学習結果の丸め込みについ
て説明する。差分情報がまとめ上げられてゆく結果、ネ
ットワークが先細りしてゆく上位の層になればなるほ
ど、画像情報は圧縮されて保持されることになる。この
ことは実施例２での図１１の説明でも述べた。また、図
１０のシナプス重みの形状を左右に伸縮させて用いるこ
とにより、高帯域から低帯域までの各周波数情報も、全
く同じ構成のネットワークで得られるが、こちらの方は
上位の層になればなるほど、低周波数の情報が得られる
ことになる。さらに図１０のシナプス重みの形状を上下
方向に変化させることにより、局所的な周波数解析であ
る、ウェーブレット変換も実現できる。更に詳しくは、
例えば図１４（ａ）のような８個のシナプスを持つニュ
ーロンがあれば、そのシナプスの値を図１４（ｂ）のよ
うに取ると、１番と８番の入力線に仮に値が入ってきた
場合それはこのシナプスの形状から、両方の入力が強調
されることになる。つまり入力中にある間隔離れて入っ
てくる入力を強調する作用がある。これはそうした間隔
（波長）でパターンを持つ画像データを拾うことになる
ので、結局周波数情報を拾うフィルターになる。Example 4. The rounding of the recognition learning result will be described. As a result of the difference information being collected, the higher the layer in the network becomes, the more the image information is compressed and held. This is also described in the description of FIG. 11 in the second embodiment. Further, by expanding and contracting the shape of the synapse weight in FIG. 10 to the left and right, the frequency information from the high band to the low band can be obtained with the network of the same configuration, but this one should be a higher layer. The higher the frequency, the lower frequency information will be obtained. Furthermore, by changing the shape of the synapse weight in FIG. 10 in the vertical direction, it is possible to realize a wavelet transform, which is a local frequency analysis. For more details,
For example, if there is a neuron with 8 synapses as shown in FIG. 14 (a), if the values of the synapses are taken as shown in FIG. 14 (b), values will be tentatively input to the 1st and 8th input lines. If this is the case, both inputs will be emphasized due to the shape of this synapse. In other words, it has the effect of emphasizing the input coming in at a certain interval during the input. Since this picks up image data having a pattern at such intervals (wavelengths), it eventually becomes a filter that picks up frequency information.

【００３１】これに対して、同様な周波数を拾う処理が
２入力のニューロンでも構造が動的に変えられれば実現
することができ、図１５のようにニューロンの閾値、シ
ナプスをいじれば、同様な処理を行わせることができ
る。この場合はニューロンの選択、閾値、シナプスをす
べて変更することになる。実施例２では画像の輝度（強
度）情報を入力して様々な解像度で処理していたものに
対して、図１６、図１７の構成と方法によれば、閾値関
数を距離に応じて変更することにより、輝度情報の処理
と全く同じ構造を用いて、様々な周波数レベルで処理で
きる。従って構造上の変化はなくて、閾値関数の変更の
みでよく、この変更で階層の上方ではより間隔の広い画
素の濃淡を監視することになり、２、４、８、１６と波
長が倍々に増え、周波数が半分半分に変化する。従来の
方法では小さいサイズから大きなサイズまで別々の空間
周波数フィルターを使って畳み込み積分を行わなくては
得られなかった情報が、ニューラルネットワークの構造
の中に、高周波から低周波成分まで、自動的に抽出でき
るようになり、計算量の大幅な削減が期待できる。On the other hand, a process of picking up a similar frequency can be realized even in a 2-input neuron if the structure is dynamically changed. If the threshold value of the neuron and the synapse are tampered with as shown in FIG. Processing can be performed. In this case, selection of neurons, threshold values, and synapses are all changed. In the second embodiment, the luminance (intensity) information of the image is input and processed at various resolutions, but according to the configurations and methods of FIGS. 16 and 17, the threshold function is changed according to the distance. This allows processing at various frequency levels, using exactly the same structure as the processing of luminance information. Therefore, there is no structural change and only the threshold function needs to be changed. With this change, the shades of pixels with wider intervals are monitored above the hierarchy, and the wavelength is doubled as 2, 4, 8, and 16. It increases and the frequency changes by half. Information that could not be obtained by convolution integration using separate spatial frequency filters from small size to large size in the conventional method is automatically detected in the structure of the neural network from high frequency to low frequency components. It will be possible to extract, and it is expected that the calculation amount will be greatly reduced.

【００３２】図１２は、また以下のことをも意味してい
る。即ち、通常の画像の圧縮として用いる場合でも、最
初、粗い解像度で画面全体を見渡し、そのものの描かれ
ている状況、つまりテクスチャー及び注目箇所を少ない
計算量で抽出することができる。この処理は中間層に現
れる低解像度の画像情報を、別の認識用のニューラルネ
ットワークに入力し、まず大まかな見当をつけることに
よる。この認識用のニューラルネットワークを予め、あ
る大きさの物体を識別するように学習してあるとする
と、同じ認識ネットワークを低解像度から高解像度まで
並べるか、もしくはその認識ネットワークを低解像度か
ら高解像度へとシフトしながらパターンマッチングを行
うことにより、大きさの変化した物体でも見つけること
ができる。これにいち早く低解像度の情報により注目箇
所を抽出し、次に解像度を上げ、相対的に同じ位置にあ
る注目箇所の情報を取り出し、それをさらに高解像度用
のニューラルネットワークに入力し、認識する。このよ
うな操作により、徐々に細部の認識を行うといった段階
的な処理が行え、認識ハードウェアシステムの開発、計
算コストの面から優れたシステムとなりえる。FIG. 12 also means the following: That is, even when it is used as a normal image compression, it is possible to first look over the entire screen at a coarse resolution and extract the situation in which the image is drawn, that is, the texture and the point of interest with a small amount of calculation. This processing is performed by inputting the low-resolution image information appearing in the intermediate layer to another neural network for recognition, and first making a rough estimate. If this recognition neural network is learned in advance to identify an object of a certain size, the same recognition networks are arranged from low resolution to high resolution, or the recognition network is changed from low resolution to high resolution. By performing pattern matching while shifting, it is possible to find an object with a changed size. The attention point is first extracted by the low resolution information, then the resolution is increased, the information of the attention point at the same relative position is taken out, and the information is input to the higher resolution neural network for recognition. By such an operation, stepwise processing such as gradually recognizing details can be performed, and the system can be an excellent system in terms of development of a recognition hardware system and calculation cost.

【００３３】[0033]

【発明の効果】以上のようにこの発明によれば、その値
を可変にできる重み係数を持つニューロン型閾値素子を
階層接続したので、同構造の素子を接続して画像の圧
縮、強調、平滑処理ができる効果がある。As described above, according to the present invention, since the neuron type threshold value element having the weighting coefficient whose value can be changed is hierarchically connected, the elements having the same structure are connected to compress, emphasize and smooth the image. There is an effect that can be processed.

【００３４】また更に、中間層の認識結果を有効に利用
できる効果もある。Furthermore, there is an effect that the recognition result of the intermediate layer can be effectively used.

【００３５】また更に、２入力としたので、接続が更に
簡単であり直前の入力の加算、差分により結果が強調さ
れる効果がある。Furthermore, since there are two inputs, the connection is simpler and there is the effect that the result is emphasized by the addition and difference of the immediately preceding inputs.

【００３６】また更に、重み係数と閾値を一部の結果に
より変更するようにしたので、場合により画像全体の精
密処理が省けて高速化できる効果がある。Furthermore, since the weighting factor and the threshold value are changed according to a part of the results, there is an effect that precision processing of the entire image can be omitted and the speed can be increased.

【００３７】また更に、重み係数または閾値を一部の認
識結果により変更するようにしたので、画像認識の周波
数特性を変更できる効果がある。Furthermore, since the weighting coefficient or the threshold value is changed according to a part of the recognition result, there is an effect that the frequency characteristic of the image recognition can be changed.

[Brief description of drawings]

【図１】本発明の画像圧縮部の一般的な構成図であ
る。FIG. 1 is a general configuration diagram of an image compression unit of the present invention.

【図２】本発明のニューロネットワークを使用した画
像認識システムの構成図である。FIG. 2 is a configuration diagram of an image recognition system using a neuro network of the present invention.

【図３】画像認識システム全体の動作フローチャート
図である。FIG. 3 is an operation flowchart of the entire image recognition system.

【図４】本発明の画像圧縮部の２入力１出力要素によ
る構成図である。FIG. 4 is a configuration diagram of a 2-input 1-output element of the image compression unit of the present invention.

【図５】本発明における基本構成要素であるニューロ
ンの入出力と閾値関数を示す図である。FIG. 5 is a diagram showing an input / output and a threshold function of a neuron which is a basic constituent element in the present invention.

【図６】方向性のある差分情報の強調処理を説明する
図である。FIG. 6 is a diagram illustrating an emphasis process of directional difference information.

【図７】対称な重みと２種類の閾値関数を用いた方向
差分情報の強調処理を説明する図である。FIG. 7 is a diagram illustrating an emphasis process of direction difference information using symmetric weights and two types of threshold functions.

【図８】入力画像の強調処理を説明する図である。FIG. 8 is a diagram illustrating an emphasis process of an input image.

【図９】方向を持った差分情報の強調処理を説明する
図である。FIG. 9 is a diagram illustrating a process of emphasizing difference information having directions.

【図１０】周波数解析用閾値関数を示す図である。FIG. 10 is a diagram showing a frequency analysis threshold function.

【図１１】多重解像度表現の例を示す図である。FIG. 11 is a diagram showing an example of multi-resolution representation.

【図１２】認識ニューラルネットと画像圧縮部の組合
せを示す図である。FIG. 12 is a diagram showing a combination of a recognition neural network and an image compression unit.

【図１３】本発明の中間層出力を画像認識に利用する
ことの説明図である。FIG. 13 is an explanatory diagram of using the intermediate layer output of the present invention for image recognition.

【図１４】本発明の水平方向圧縮用画像圧縮部の構成
図である。FIG. 14 is a block diagram of an image compression unit for horizontal compression of the present invention.

【図１５】本発明の平面圧縮用画像圧縮部の構成図で
ある。FIG. 15 is a configuration diagram of an image compression unit for plane compression of the present invention.

【図１６】周波数解析用のニューロンとシナプスを示
す図である。FIG. 16 is a diagram showing neurons and synapses for frequency analysis.

【図１７】周波数解析用の閾値の変化とニューロン出
力の変化を示す図である。FIG. 17 is a diagram showing changes in the threshold for frequency analysis and changes in neuron output.

【図１８】従来の第１の画像圧縮方法を示す図であ
る。FIG. 18 is a diagram showing a first conventional image compression method.

【図１９】従来の第２の画像処理方法を示す図であ
る。FIG. 19 is a diagram showing a second conventional image processing method.

【図２０】従来の第３の画像処理方法を示す図であ
る。FIG. 20 is a diagram showing a third conventional image processing method.

[Explanation of symbols]

１１画像、２１ニューロン、２２ニューロン間の
接続、４１入力線路、４２シナプス（重み）、４３
出力線路、４４ニューロンの入出力特性ｆ（ｘ）＝
ｘ、４５ニューロンの入出力特性利得＞１、４６
ニューロンの入出力特性利得＜１、４７ニューロン
の入出力特性ｆ（ｘ）が非線形。11 images, 21 neurons, 22 connections between neurons, 41 input lines, 42 synapses (weights), 43
Output line, input / output characteristics of 44 neurons f (x) =
Input / output characteristics of x, 45 neurons Gain> 1,46
Input / output characteristics of neuron Gain <1,47 Input / output characteristics of neuron f (x) is non-linear.

Claims

[Claims]

1. A structure in which a neuron-type threshold value element having a weighting coefficient for each of a plurality of inputs and the neuron-type threshold value element are hierarchically connected in a pyramid type, and the output of the uppermost neuron is the final output, An image recognition apparatus including a storage unit that variably sets the value of the weighting factor and the threshold value.

2. The image recognition apparatus according to claim 1, wherein in image recognition, the final output is the output of the intermediate layer at an arbitrary coordinate position of an arbitrary intermediate layer of the hierarchy.

3. A neuro-type threshold element having two inputs and two inputs and one output is used as a plurality of inputs.
Alternatively, the image recognition device according to claim 2.

4. The image recognition apparatus according to claim 1, wherein the value of the weighting factor and the threshold value are changed based on the recognition result of a part of the hierarchical connection of other neuron type threshold value elements.

5. The image recognition apparatus according to claim 4, wherein the value of the weighting factor or the threshold value is changed based on the recognition result of the distance information of a part of the hierarchical connection of other neuron type threshold value elements.