JPH07271916A

JPH07271916A - Learning pattern generating device and character recognizing device using the same

Info

Publication number: JPH07271916A
Application number: JP6085520A
Authority: JP
Inventors: Hisashi Chiba; 久千葉; Hitoshi Kubota; 整久保田; Katsuichi Ono; 勝一小野
Original assignee: Suzuki Motor Corp
Current assignee: Suzuki Motor Corp
Priority date: 1994-03-31
Filing date: 1994-03-31
Publication date: 1995-10-20

Abstract

PURPOSE: To enable appropriate learning patterns to be selected automatically and in a short time, even by an unskilled operator. CONSTITUTION: A learning pattern generating device 10 is provided with a recognition target pattern storage means 12 which stores all recognition target patterns Pr1-Prn, a similarity calculation means 14 which calculates the similarities R1-Rn between each of the patterns Pr1-Prn and an input pattern Pea that could not be recognized or an input pattern Peb that was wrongly recognized, and a learning pattern selection means 16 which decides, based on the similarities R1-Rn calculated by the means 14, whether the patterns Pea and Peb should be used as learning patterns Ps.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a learning pattern generation device for use when characters and the like are recognized with a neural network, and to a character recognition device using the same.

[0002]

[Prior Art] When characters and the like are recognized using a neural network, the neural network is made to learn the recognition target characters in advance. Learning means determining parameters, such as the connection weights between the units constituting the neural network, so that, for example, when the recognition target character "C" is input to the neural network, "C" is output from it. A pattern such as a character used for learning is called a "learning pattern", and a set of learning patterns is called "learning data". Examples of learning patterns are shown in FIG. 8. Learning patterns include a learning pattern 80 that is the recognition target character itself, learning patterns 81, 82, 83 in which noise is added to the recognition target character, and learning patterns 84, 85, 86 in which the recognition target character is deformed.

[0003]

[Problems to Be Solved by the Invention] However, the conventional technique has the following problems.

[0004] If the number of learning patterns 81, ..., 86 is increased in order to raise the recognition rate, learning no longer converges, and optimum parameters cannot be determined.

[0005] If learning patterns 87, 88, in which specific noise is added to the recognition target character, are learned, confusion arises with recognition target characters 89, 90, 91 that resemble the learning patterns 87, 88, and the recognition rate falls. That is, the recognition target characters 89, 90, 91, namely "G", "O", and "D", may be erroneously recognized as the recognition target character "C".

[0006] To avoid the above problems, the operator must select the patterns to be learned. That is, unnecessary patterns that hinder the convergence of learning are excluded, and patterns that lower the recognition rate are excluded as well, so that the neural network learns only the desirable patterns. Judging accurately which patterns should be learned requires skill as well as a great deal of time and trial and error, so the operation was very troublesome.

[0007]

[Object of the Invention] Accordingly, a main object of the present invention is to provide a learning pattern generation device that can select appropriate learning patterns automatically and in a short time without relying on a skilled operator, and a character recognition device using the same.

[0008]

[Means for Solving the Problems] The learning pattern generation device according to the present invention and the character recognition device using the same have been made to achieve the above object and have the following configurations.

[0009] The learning pattern generation device according to the present invention comprises: recognition target pattern storage means that stores all recognition target patterns; similarity calculation means that calculates the similarity between each of the recognition target patterns and an input pattern that could not be recognized or was erroneously recognized; and learning pattern selection means that determines, based on the similarity calculated by the similarity calculation means, whether or not to use the input pattern as a learning pattern.

[0010] The character recognition device according to the present invention is an improvement on a device comprising: an image input unit that inputs image data including an input character; an image storage unit that stores the image data input by the image input unit; an input data creation unit that creates input data on the input character from the image data stored in the image storage unit; a neural network processing unit that performs neural network processing on the input data created by the input data creation unit; and a character output unit that outputs a recognition target character corresponding to the input character based on the processing result of the neural network processing unit.

[0011] Specifically, the neural network processing unit is characterized by having a neural network trained with the learning patterns obtained by the learning pattern generation device according to the present invention.

[0012] The term "pixel" as used herein covers everything from a pixel at the minimum resolution to a group of such pixels treated as one unit.

[0013]

[Operation] The learning pattern generation device according to the present invention operates as follows. The recognition target pattern storage means stores all recognition target patterns. The similarity calculation means calculates the similarity between each of the recognition target patterns and an input pattern that could not be recognized, or between each of the recognition target patterns and an input pattern that was erroneously recognized. The learning pattern selection means determines, based on the similarity calculated by the similarity calculation means, whether or not to use the input pattern as a learning pattern. For example, when the similarity is large the input pattern is used as a learning pattern, and when the similarity is small it is not.

[0014] The character recognition device according to the present invention operates as follows. Image data including an input character is input from the image input unit and stored in the image storage unit. The input data creation unit creates input data on the input character from the image data stored in the image storage unit. The input data is processed by the neural network processing unit, and based on the processing result the character output unit outputs the recognition target character corresponding to the input character. The neural network of the neural network processing unit has learned the learning patterns obtained by the learning pattern generation device according to claim 1. These learning patterns are appropriate and have been selected automatically and in a short time.

[0015]

[Embodiments] FIG. 1 is a block diagram showing an embodiment of the learning pattern generation device according to the present invention. The following description is based on this figure.

[0016] The learning pattern generation device 10 comprises: recognition target pattern storage means 12 that stores all recognition target patterns Pr1 to Prn; similarity calculation means 14 that calculates the similarities R1 to Rn between each of the recognition target patterns Pr1 to Prn and an input pattern Pea that could not be recognized or an input pattern Peb that was erroneously recognized; and learning pattern selection means 16 that determines, based on the similarities R1 to Rn calculated by the similarity calculation means 14, whether or not to use the input patterns Pea, Peb as learning patterns Ps. The similarity calculation means 14 and the learning pattern selection means 16 can be realized by, for example, a computer and a program that runs on it. The recognition target pattern storage means 12 can be realized by a ROM, a RAM, a magnetic recording device, or the like. The recognition target patterns Pr1 to Prn may be of any kind, such as character patterns, voice patterns, or blood agglutination patterns.

[0017] An external storage device 18, which stores the input patterns Pea that could not be recognized and the input patterns Peb that were erroneously recognized, is connected to the similarity calculation means 14. A neural network processing device 20, which learns from the learning patterns Ps, is connected to the learning pattern selection means 16. Information indicating the recognition target pattern Po that should be output is attached to each of the input patterns Pea, Peb.

[0018] FIG. 2 is a flowchart showing the operation of the learning pattern generation device 10 for the case where a learning pattern Ps is generated from an input pattern Pea that could not be recognized. The following description is based on FIGS. 1 and 2.

[0019] Beforehand, an input pattern Pea that could not be recognized and the information on the recognition target pattern Po that should be output, which is attached to the input pattern Pea, are input from the external storage device 18 or the like. The recognition target patterns Pr1 to Prn naturally include the recognition target pattern Po that should be output.

[0020] First, the initial value m = 1 is set (step 101), and the input pattern Pea and the recognition target pattern Pr1 are superimposed (step 102). For example, if the input pattern Pea is "E + noise" and the recognition target pattern Pr1 is "E", the result is as shown in FIG. 3. In the "after superimposition" panel at the center of FIG. 3, the matching area (number of pixels) R1a is shown cross-hatched and the non-matching area (number of pixels) R1b is shown with diagonal lines. That is, for pixels at the same coordinates in the input pattern Pea and the recognition target pattern Pr1, a pixel is regarded as matching if both are "1", and as non-matching if only one of them is "1". Letting R1c be the total area (number of pixels) of the recognition target pattern Pr1, the similarity R1 is given by R1 = R1a / R1c (step 103). The similarity R1 is therefore distributed between 0.0 and 1.0 and approaches 0.0 as the matching area R1a becomes smaller (that is, as the patterns become less similar). However, when R1b / R1c exceeds a predetermined threshold, R1 is set to 0.0 and the pattern is excluded from learning pattern selection, because the larger R1b / R1c is, the less the input pattern Pea resembles the recognition target pattern Pr1. Next, the similarity R1 is compared with the threshold Tr, and the result is stored (step 104). The threshold Tr is determined experimentally. Then m is set to m + 1 (step 105), and steps 102 to 105 are repeated until m > n (step 106).
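The similarity calculation of steps 102 and 103 can be sketched as follows. This is only an illustration under stated assumptions: patterns are taken to be equally sized grids of 0/1 pixels, and the function name similarity and the default value of mismatch_limit are hypothetical, since the text says only that the threshold on R1b / R1c is predetermined.

```python
def similarity(input_pattern, target_pattern, mismatch_limit=0.5):
    """Return R = Ra / Rc for two equally sized grids of 0/1 pixels.

    mismatch_limit is a hypothetical value; the description only states
    that the threshold on Rb / Rc is predetermined.
    """
    matched = 0      # Ra: pixels that are 1 in both patterns
    mismatched = 0   # Rb: pixels that are 1 in exactly one pattern
    target_area = 0  # Rc: pixels that are 1 in the recognition target pattern
    for input_row, target_row in zip(input_pattern, target_pattern):
        for p, q in zip(input_row, target_row):
            if p == 1 and q == 1:
                matched += 1
            elif p != q:
                mismatched += 1
            target_area += q
    if target_area == 0:
        return 0.0  # assumption: an empty target pattern is never similar
    if mismatched / target_area > mismatch_limit:
        return 0.0  # too much disagreement: excluded from selection
    return matched / target_area


# A 3x3 stand-in for a character, and an input with one stroke pixel missing.
target = [[1, 1, 1], [1, 1, 0], [1, 1, 1]]
damaged = [[1, 1, 1], [1, 0, 0], [1, 1, 1]]
print(similarity(damaged, target))  # 7 matching pixels out of 8 -> 0.875
```

Running the function once per recognition target pattern Pr1 to Prn yields the list R1 to Rn used by the selection steps.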

[0021] When the calculation of the similarities R1 to Rn is finished, it is determined from the results of the comparison between each similarity Rm and the threshold Tr obtained in step 104 whether there is exactly one similarity satisfying Rm > Tr (step 107). If there is exactly one, it is determined whether the recognition target pattern Prm corresponding to that Rm matches the recognition target pattern Po that should be output (step 108). If Prm matches Po, the input pattern Pea is adopted as a learning pattern Ps (step 109). If the number of similarities satisfying Rm > Tr is not exactly one, the input pattern Pea is not adopted as a learning pattern Ps, because such an input pattern Pea also resembles other recognition target patterns; using it as a learning pattern Ps would cause confusion between Po and those other patterns. Likewise, if Prm does not match Po, the input pattern Pea is not adopted, because such an input pattern is not the one most similar to the pattern Po that should be output.
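The decision of steps 107 to 109 can be sketched as a single predicate. The function name select_unrecognized and the list-based representation (similarities[m] holding the similarity to the m-th recognition target pattern, expected_index identifying Po) are assumptions made for illustration and do not come from the patent.

```python
def select_unrecognized(similarities, expected_index, tr):
    """Decide whether an unrecognized input pattern Pea becomes a learning pattern.

    similarities[m] is the similarity to recognition target pattern m,
    expected_index identifies the pattern Po that should be output,
    and tr is the experimentally determined threshold Tr.
    """
    # Step 107: collect every target whose similarity exceeds Tr.
    above = [m for m, r in enumerate(similarities) if r > tr]
    # Steps 108 and 109: adopt only if exactly one target qualifies
    # and that target is the one that should have been output.
    return len(above) == 1 and above[0] == expected_index


print(select_unrecognized([0.9, 0.3, 0.2], expected_index=0, tr=0.6))  # True
print(select_unrecognized([0.9, 0.8, 0.2], expected_index=0, tr=0.6))  # False
```

The second call is rejected because two recognition target patterns exceed Tr, which is the confusion case described above.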

[0022] FIG. 4 is a flowchart showing the operation of the learning pattern generation device 10 for the case where a learning pattern Ps is generated from an input pattern Peb that was erroneously recognized. The following description is based on FIGS. 1 and 4.

[0023] Beforehand, an input pattern Peb that was erroneously recognized and the information on the recognition target pattern Po that should be output, which is attached to the input pattern Peb, are input from the external storage device 18 or the like. The recognition target patterns Pr1 to Prn naturally include both the recognition target pattern Po that should be output and the erroneously recognized recognition target pattern Pe.

[0024] First, the initial value m = 1 is set (step 201), and the input pattern Peb and the recognition target pattern Pr1 are superimposed (step 202). Steps 203 to 206 that follow are the same as steps 103 to 106 of FIG. 2, except that the input pattern is Peb and the threshold is Te, so their description is omitted.

[0025] When the calculation of the similarities R1 to Rn is finished, it is determined whether the similarity Ro for the recognition target pattern Po is larger than the similarity Re for the recognition target pattern Pe (step 207). If Ro > Re, it is determined from the results of the comparison between the similarities Rm and the threshold Te obtained in step 204 whether Ro > Te and Re < Te (step 208). If Ro > Te and Re < Te, the input pattern Peb is adopted as a learning pattern Ps (step 209). If, on the other hand, Ro ≤ Re, the input pattern Peb is not adopted as a learning pattern Ps, because such an input pattern is not the one most similar to the pattern Po that should be output. Even if Ro > Re, the input pattern Peb is not adopted when Ro ≤ Te, because such an input pattern has only a low degree of similarity to Po. Furthermore, even if Ro > Re, the input pattern Peb is not adopted when Re ≥ Te, because such an input pattern also resembles the erroneously output recognition target pattern Pe; using it as a learning pattern Ps would cause confusion between Po and Pe.
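The three conditions of steps 207 to 209 can be condensed into one predicate. The sketch below uses hypothetical names (select_misrecognized, r_out for Ro, r_err for Re) that are not taken from the patent.

```python
def select_misrecognized(r_out, r_err, te):
    """Decide whether a misrecognized input pattern Peb becomes a learning pattern.

    r_out is Ro, the similarity to the pattern Po that should be output;
    r_err is Re, the similarity to the pattern Pe that was wrongly output;
    te is the threshold Te.
    """
    # Step 207: Po must be more similar to the input than Pe is.
    if r_out <= r_err:
        return False
    # Step 208: Po must be clearly similar and Pe clearly dissimilar.
    return r_out > te and r_err < te


print(select_misrecognized(0.9, 0.4, te=0.6))  # True
print(select_misrecognized(0.9, 0.7, te=0.6))  # False, Pe is also similar
```

Because Re < Te < Ro already implies Ro > Re, the step 207 check is logically redundant here; it is kept only to mirror the order of the flowchart.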

[0026] Needless to say, the method of calculating the similarities R1 to Rn in this embodiment is merely one example. For instance, the similarity may instead be obtained as R1 = (R1a - R1b) / R1c.
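To make the difference between the two formulas concrete, the following sketch evaluates both on pixel counts. The function names are hypothetical, and the counts are made-up illustrative values rather than data from the patent.

```python
def similarity_ratio(ra, rc):
    """R = Ra / Rc, the formula used in the embodiment."""
    return ra / rc


def similarity_penalized(ra, rb, rc):
    """R = (Ra - Rb) / Rc, the alternative that subtracts mismatched pixels."""
    return (ra - rb) / rc


# 7 matching pixels, 1 mismatching pixel, target area of 8 pixels.
print(similarity_ratio(7, 8))          # 0.875
print(similarity_penalized(7, 1, 8))   # 0.75
# With many mismatches the alternative can fall below zero.
print(similarity_penalized(1, 5, 8))   # -0.5
```

Note that the alternative formula is no longer confined to the range 0.0 to 1.0, so a threshold chosen for one formula would presumably need to be re-tuned for the other.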

[0027] FIG. 5 is a block diagram showing an embodiment of the character recognition device according to the present invention. The following description is based on this figure.

[0028] The character recognition device 30 comprises: an image input unit 32 that inputs image data b including an input character a; an image storage unit 34 that stores the image data b input by the image input unit 32; an input data creation unit 36 that creates input data c on the input character a from the image data b stored in the image storage unit 34; a neural network processing unit 38 that performs neural network processing on the input data c created by the input data creation unit 36; and a character output unit 40 that outputs a recognition target character d corresponding to the input character a based on the processing result of the neural network processing unit 38.

[0029] The neural network processing unit 38 has a neural network 46 (FIG. 6) trained with the learning patterns Ps obtained by the learning pattern generation device 10 of FIG. 1.

[0030] The image input unit 32 is composed of, for example, an image scanner or a CCD camera, and obtains a digital image. The image storage unit 34 is composed of an external storage device such as a floppy disk or hard disk, or of a RAM or the like.

[0031] The character output unit 40 is composed of a CRT, a printer, or the like that displays the recognition target character d output from the neural network processing unit 38.

[0032] The input data creation unit 36 and the neural network processing unit 38 can be realized by, for example, a computer and a program that runs on it. In that case, the computer may also be configured to control the image input unit 32, the character output unit 40, and so on. The neural network processing unit 38 can also be realized by a neurochip.

[0033] The neural network processing unit 38 has a neural network 46 as shown in FIG. 6. The neural network 46 consists of an input layer 46a, to which the input data c created by the input data creation unit 36 is input, an intermediate layer 46b, and an output layer 46c. Each layer is made up of constituent elements called units (shown as circles in FIG. 6), and the neural network 46 is formed by the connections between these units.

[0034] Each unit of the input layer 46a is connected to every unit of the intermediate layer 46b. The number of units in the input layer 46a corresponds to the number of pixels in the input data c and can be set arbitrarily. Likewise, each unit of the intermediate layer 46b is connected to every unit of the output layer 46c. The number of layers and the number of units of the intermediate layer 46b can be set arbitrarily. The output layer 46c has at least as many units as there are recognition target characters. In this embodiment it has n units, as shown in FIG. 6. For example, when the letters of the alphabet are the recognition target characters, n = 26.
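The fully connected three-layer structure can be illustrated with a minimal forward pass. The patent does not specify an activation function, bias terms, or weight layout, so the sigmoid, the biases, and the names dense_layer and forward below are all assumptions made purely for illustration.

```python
import math


def dense_layer(inputs, weights, biases):
    """Fully connected layer; weights[j][i] links input unit i to output unit j.

    The sigmoid activation and the bias terms are assumptions; the patent
    only states that every unit is connected to every unit of the next layer.
    """
    outputs = []
    for row, bias in zip(weights, biases):
        total = sum(w * x for w, x in zip(row, inputs)) + bias
        outputs.append(1.0 / (1.0 + math.exp(-total)))
    return outputs


def forward(pixels, hidden_w, hidden_b, output_w, output_b):
    """Input layer 46a -> intermediate layer 46b -> output layer 46c."""
    hidden = dense_layer(pixels, hidden_w, hidden_b)
    return dense_layer(hidden, output_w, output_b)


# Two input pixels, two intermediate units, one output unit, all weights zero.
print(forward([1, 0], [[0, 0], [0, 0]], [0, 0], [[0, 0]], [0]))  # [0.5]
```

For alphabet recognition the output weight matrix would have 26 rows, one per output unit, and the input vector would have one entry per pixel of the input data c.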

[0035] In this configuration, the connection strengths between the units and other parameters are determined in advance by learning, so as to obtain the relationship between the input data c input to the input layer 46a and the output from the output layer 46c, that is, the recognition target character d. Specifically, the network is trained so that, when given input data c is input to the input layer 46a, unit 46c1 of the output layer 46c outputs "1" if the input data c is the letter "A", and unit 46c2 of the output layer 46c outputs "1" if it is the letter "B". The letters "C" through "Z" are learned in the same way. In addition to the patterns for the letters "A" through "Z", the learning data also includes the learning patterns Ps obtained by the learning pattern generation device 10.
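The training targets described above amount to one output unit per letter. A minimal sketch of building such a target vector follows; the function name one_hot_target is hypothetical, and setting every other unit's target to 0 is an assumption, since the text states only that the unit for the correct letter should output "1".

```python
import string


def one_hot_target(letter, alphabet=string.ascii_uppercase):
    """Target output vector: 1 for the unit of `letter`, 0 elsewhere.

    The 0 targets for the remaining units are an assumption; the patent
    only says the matching unit is trained to output 1.
    """
    index = alphabet.index(letter)
    return [1 if k == index else 0 for k in range(len(alphabet))]


target_for_b = one_hot_target("B")
print(len(target_for_b))   # 26
print(target_for_b[:3])    # [0, 1, 0]
```

A selected learning pattern Ps would be paired with the target vector of its attached recognition target pattern Po in the same way.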

[0036] With a neural network 46 trained in this way, when unknown input data c is given, the recognition target character d is obtained from the output values O1 to On of the units 46c1, 46c2, ..., 46cn. Because the neural network 46 has also learned the learning patterns Ps, correct pattern recognition is possible even when noise is mixed into the input pattern (the "input character" in this embodiment).

[0037] The character output unit 40 outputs a recognition target character based on the processing result of the neural network processing unit 38. Let the output value of unit 46c1 be O1, that of unit 46c2 be O2, ..., and that of unit 46c26 be O26. When the output value of some unit 46cm is the maximum, Om.MAX, and Om.MAX is larger than a threshold T, the character output unit 40 outputs the recognition target character corresponding to Om. If all the output values O1 to O26 are smaller than the threshold T, the character output unit 40 outputs "unrecognizable".
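The output rule of the character output unit 40 can be sketched as a maximum search followed by a threshold test. The function name decide_character and the use of None to stand for "unrecognizable" are assumptions. The text does not say what happens when the maximum equals T exactly; this sketch treats that case as unrecognizable.

```python
def decide_character(outputs, threshold, labels):
    """Return the label of the largest output if it exceeds T, else None.

    None stands for the "unrecognizable" result. Treating a maximum equal
    to the threshold as unrecognizable is an assumption.
    """
    best = max(range(len(outputs)), key=outputs.__getitem__)
    if outputs[best] > threshold:
        return labels[best]
    return None


print(decide_character([0.1, 0.9, 0.2], 0.5, "ABC"))  # B
print(decide_character([0.1, 0.4, 0.2], 0.5, "ABC"))  # None
```

Inputs that yield None here correspond to the unrecognized input patterns Pea that the learning pattern generation device 10 later examines.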

[0038] FIG. 7 is a flowchart showing the operation of the character recognition device 30. The following description is based on FIGS. 5, 6, and 7.

[0039] First, image data b including an input character a is input from the image input unit 32 (step 301). The image data b is then stored in the image storage unit 34 (step 302). The input data creation unit 36 applies preprocessing such as binarization and segmentation to the image data b stored in the image storage unit 34, and creates input data c for the input character a (step 303). The input data c is a set of per-pixel density values, each "0" or "1". The neural network processing unit 38 processes the input data c to obtain the output values O1 to On (step 304). The character output unit 40 outputs the recognition target character d corresponding to the input character a based on the output values O1 to On (step 305).
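The binarization part of step 303 can be sketched as follows. The patent does not describe how binarization is performed, so the fixed cutoff of 128, the convention that dark pixels become "1", and the names binarize and flatten are all assumptions for illustration; segmentation of the character from the image is omitted.

```python
def binarize(gray_rows, cutoff=128):
    """Map 8-bit gray levels to 0/1; dark pixels (below cutoff) become 1.

    The cutoff value and the dark-is-1 convention are assumptions.
    """
    return [[1 if value < cutoff else 0 for value in row] for row in gray_rows]


def flatten(rows):
    """Turn the pixel grid into the flat vector fed to the input layer."""
    return [pixel for row in rows for pixel in row]


gray = [[0, 255], [200, 10]]
input_data = flatten(binarize(gray))
print(input_data)  # [1, 0, 0, 1]
```

The length of the resulting vector is what determines the number of input layer units.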

[0040] The character recognition device 30 cannot always output the recognition target character d that correctly corresponds to the input character a. That is, there will be input patterns Pea that could not be recognized and input patterns Peb that were erroneously recognized. In such cases, the operator need only attach to the input patterns Pea, Peb information indicating the recognition target pattern Po that should be output. The input patterns Pea, Peb with this information attached are stored from the neural network processing unit 38 into the image storage unit 34. The learning pattern generation device 10 then selects learning patterns Ps from among the many input patterns Pea, Peb as described above. The learning patterns Ps are subsequently learned by the neural network 46 of the neural network processing unit 38. Since the character recognition device 30 can thus select appropriate learning patterns Ps automatically and in a short time without relying on a skilled operator, it offers a high recognition rate and good operability.

[0041]

[Effects of the Invention] According to the learning pattern generation device of the present invention, whether or not to use an input pattern as a learning pattern is determined based on the similarity between each of the recognition target patterns and the input pattern that could not be recognized or was erroneously recognized. Appropriate learning patterns, which operators conventionally selected by experience, can therefore be selected automatically and in a short time.

[0042] According to the character recognition device of the present invention, the use of the learning pattern generation device of the present invention allows appropriate learning patterns to be selected automatically and in a short time, so the recognition rate and operability can be improved.

[Brief description of drawings]

[FIG. 1] A block diagram showing an embodiment of the learning pattern generation device according to the present invention.

[FIG. 2] A flowchart showing the operation of an embodiment of the learning pattern generation device according to the present invention.

[FIG. 3] A conceptual diagram showing the operation of the similarity calculation means in the embodiment of FIG. 1.

[FIG. 4] A flowchart showing the operation of an embodiment of the learning pattern generation device according to the present invention.

[FIG. 5] A block diagram showing an embodiment of the character recognition device according to the present invention.

[FIG. 6] A conceptual diagram showing the neural network of the neural network processing unit in the embodiment of FIG. 5.

[FIG. 7] A flowchart showing the operation of an embodiment of the character recognition device according to the present invention.

[FIG. 8] A plan view showing examples of conventional learning patterns.

[Explanation of symbols]

10: learning pattern generation device
12: recognition target pattern storage means
14: similarity calculation means
16: learning pattern selection means
Pr1 to Prn: recognition target patterns
Pea: input pattern that could not be recognized
Peb: input pattern that was erroneously recognized
R1 to Rn: similarities
Ps: learning pattern
30: character recognition device
32: image input unit
34: image storage unit
36: input data creation unit
38: neural network processing unit
40: character output unit
a: input character
b: image data
c: input data
d: recognition target character

Claims

[Claims]

1. A learning pattern generation device comprising: recognition target pattern storage means that stores all recognition target patterns; similarity calculation means that calculates the similarity between each of the recognition target patterns and an input pattern that could not be recognized or was erroneously recognized; and learning pattern selection means that determines, based on the similarity calculated by the similarity calculation means, whether or not to use the input pattern as a learning pattern.

2. A character recognition device comprising: an image input unit that inputs image data including an input character; an image storage unit that stores the image data input by the image input unit; an input data creation unit that creates input data on the input character from the image data stored in the image storage unit; a neural network processing unit that performs neural network processing on the input data created by the input data creation unit; and a character output unit that outputs a recognition target character corresponding to the input character based on the processing result of the neural network processing unit, wherein the neural network processing unit has a neural network trained with the learning patterns obtained by the learning pattern generation device according to claim 1.