JPH06251158A

JPH06251158A - Pattern recognition dictionary generating method

Info

Publication number: JPH06251158A
Application number: JP5038319A
Authority: JP
Inventors: Ei Sakano; 鋭坂野; Hiromi Kida; 博巳木田
Original assignee: N T T DATA TSUSHIN KK; NTT Data Communications Systems Corp
Current assignee: N T T DATA TSUSHIN KK; NTT Data Corp
Priority date: 1993-02-26
Filing date: 1993-02-26
Publication date: 1994-09-09

Abstract

PURPOSE:To efficiently generate a pattern recognition dictionary whose performance is satisfactory by using a shielding means for shielding a specific part of the pattern recognition dictionary from other category in its periphery. CONSTITUTION:This method is constituted of a phase 1 for selecting a learning category and a phase 2 for sequential learning. The phase 1 consists of an initial dictionary generating part 01, a pattern recognizing part 02, a learning category set selecting part 03 and a shielding category set selecting part 04. The phase 2 consists of a learning pattern selecting part 05, a pattern recognizing part 06, a dictionary correction deciding part 07, a dictionary correction executing part 08, and an end deciding part 09. Also, in the recognition dictionary, only a necessary part of learning becomes an object of learning, and a necessary part of learning is shielded by a category having possibility of erroneous recognition, or a necessary part of learning is shielded by an artificial pattern, therefore, even if sequential learning is executed in a part of the dictionary, it is suppressed to the minimum that adverse influence is exerted on the whole dictionary.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、ＯＣＲ（光学文字読取
装置）等のパタ−ン認識装置で使用される辞書を作成す
る方法に関し、特に認識精度の高い認識辞書を効率よく
作成することができる作成方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method for creating a dictionary used in a pattern recognition device such as an OCR (optical character reading device), and more particularly, to efficiently create a recognition dictionary with high recognition accuracy. Regarding how to create.

【０００２】[0002]

【従来の技術】従来より、文字や音声のパタ−ン認識を
行う場合には、各カテゴリにおける標準的なパタ−ンで
ある参照パタ−ンと、カテゴリの分布形状に従ってこの
参照パタ−ンと認識対象パタ−ンとの位置関係を補正す
る重みベクトルを認識辞書として予め用意しておき、認
識させたいパタ−ンと各カテゴリの参照パタ−ンとの類
似性を何等かの評価関数（例えば、ユ−クリッド二乗距
離）を用いて計算した上で、最も類似性の高いカテゴリ
を認識結果として出力している。この参照パタ−ンは、
通常、学習させるパタ−ン平均値として与えられ、また
重みベクトルは、標準偏差の逆数として与えられる。さ
らに、逐次学習的な方法としては、ポテンシャル関数法
やＬＶＱ（学習ベクトル量子化）のように学習時に誤り
を訂正する形で参照パタ−ンや重みを決定する方法が提
案されている。なお、この種の従来手法に関する文献と
しては、『文字認識概論』橋本新一郎、オ−ム社（198
2)、（これには、通常用いられている文字認識手法、相
加平均による標準パタ−ン作成法等が記載されてい
る）、『Self-Organization andassociative Memory
′3rd.edition』T.Kohohonen,Springer-Verlag(198
9)、（これには、標準パタ−ンを逐次的に動かす方法論
ＬＶＱ（学習ベクトル量子化）が記載されている）なら
びに『動力学モデルによる識別超平面の最適化』坂野
鋭、電子情報通信学会技術報告、ＰＲＵ-91-149(1992)
（これには、分布の形状を補正する重みを逐次的に動か
して最適化する手法が記載されている）。2. Description of the Related Art Conventionally, when recognizing a pattern of characters or voices, a reference pattern which is a standard pattern in each category and this reference pattern according to the distribution shape of the category are used. A weight vector for correcting the positional relationship with the recognition target pattern is prepared in advance as a recognition dictionary, and the similarity between the pattern to be recognized and the reference pattern of each category is evaluated by some evaluation function (for example, , Euclidean square distance), and the category with the highest similarity is output as the recognition result. This reference pattern
Usually, it is given as a pattern average value to be learned, and the weight vector is given as an inverse number of standard deviation. Further, as a sequential learning method, a method of deciding the reference pattern and weight by correcting an error at the time of learning, such as a potential function method and LVQ (learning vector quantization), has been proposed. The literature on this type of conventional method is "Introduction to Character Recognition" by Shinichiro Hashimoto and Ohmsha (198
2), (This section describes commonly used character recognition methods, standard pattern creation methods using arithmetic mean, etc.), “Self-Organization and associative Memory
′ 3rd.edition 』T. Kohohonen, Springer-Verlag (198
9), (This describes the methodology LVQ (learning vector quantization) that moves standard patterns sequentially) and "Optimization of discriminating hyperplanes by dynamic model" Takeshi Sakano, Electronic Information Communication Academic Technical Report, PRU-91-149 (1992)
(This describes a method of sequentially moving and optimizing the weights for correcting the shape of the distribution).

【０００３】[0003]

【発明が解決しようとする課題】このように、従来、カ
テゴリ毎の参照パタ−ン、およびカテゴリの分布により
類似尺度を補正する重み等を持ったパタ−ン認識用辞書
を作成する場合、この辞書を構成するデ−タは、多くの
学習パタ−ンによる逐次学習的な最適化手法により計算
されて、追加されている。しかしながら、カテゴリが多
くなった場合には全てのカテゴリについての認識実験を
行うための計算量、収束可能性等の技術が困難であり、
さらにカテゴリを追加した場合にも、全てのカテゴリに
ついて最適化を施す必要があるので、運用上に困難性が
ある。また、従来のパタ−ン認識方法は、アルゴリズム
が簡単であり、かつ多くの場合には効果が期待できるた
めに、かなり広範囲で使用されていた。しかしながら、
ポテンシャル関数法やＬＶＱ（学習ベクトル量子化）等
の逐次的学習的な辞書作成法では、認識に使用される全
てのカテゴリについて学習を逐行する必要があるため、
前述のように計算量や収束可能性等の技術で困難性があ
った。すなわち、認識に使用するカテゴリが例えば文字
の場合、ＪＩＳ第１水準３２０１カテゴリのようにその
数が多いときには、収束可能性、処理量の両面で実用的
とは言えない面がある。そこで、単に一部だけのカテゴ
リを用いて逐次学習を行う場合には、学習に分いられな
かったカテゴリにおいて誤認識が増加するため、辞書全
体の性能が低下するという問題点がある。本発明の目的
は、このような従来の課題を解決し、周辺への影響を遮
蔽することにより、性能のよいパタ−ン認識辞書を効率
よく作成することができるパタ−ン認識辞書作成方法を
提供することにある。As described above, conventionally, when a pattern recognition dictionary having a reference pattern for each category and a weight for correcting the similarity measure according to the distribution of the category is created, The data constituting the dictionary is calculated and added by the sequential learning optimization method using many learning patterns. However, when the number of categories increases, it is difficult to use techniques such as the amount of calculation and the possibility of convergence for performing recognition experiments for all categories.
Even if categories are added, it is necessary to perform optimization on all categories, which is difficult in operation. Further, the conventional pattern recognition method has been used in a fairly wide range because the algorithm is simple and the effect can be expected in many cases. However,
In sequential function dictionary creation methods such as the potential function method and LVQ (learning vector quantization), it is necessary to perform learning for all categories used for recognition.
As described above, there are difficulties in the technique such as the amount of calculation and the possibility of convergence. That is, when the category used for recognition is, for example, a character, and the number thereof is large as in the JIS first level 3201 category, there are aspects that are not practical in terms of both the convergence possibility and the processing amount. Therefore, when the learning is sequentially performed using only a part of the categories, there is a problem that the performance of the entire dictionary is deteriorated because erroneous recognition increases in the categories that are not learned. An object of the present invention is to solve the conventional problems as described above and to provide a pattern recognition dictionary creating method capable of efficiently creating a high-performance pattern recognition dictionary by shielding the influence on the surroundings. To provide.

【０００４】[0004]

【課題を解決するための手段】上記目的を達成するた
め、本発明のパタ−ン認識辞書作成方法は、（イ）文字
や音声等のパタ−ン認識のために用いられ、かつ各カテ
ゴリの標準的な参照パタ−ンと重みの集合とからなり、
複数の学習パタ−ンに対してカテゴリ毎、特徴軸毎に特
徴の相加平均、標準偏差を求め、相加平均を参照パタ−
ン、標準偏差の逆数を重みとする初期の認識辞書を予め
作成しておき、学習パタ−ンを反復的に与えながら誤認
識をできる限り少なくなるように参照パタ−ンまたは重
みの変更を行うことにより完成させるパタ−ン認識辞書
の作成方法において、パタ−ン認識辞書の特定の部分
を、その周囲の他のカテゴリから遮蔽する遮蔽手段を用
いることにより、部分的に辞書を高性能化することを特
徴としている。また、（ロ）遮蔽手段として、特定のカ
テゴリの集合に対して誤読の可能性の高いカテゴリのパ
タ−ンを用いることも特徴としている。さらに、（ハ）
遮蔽手段として、特定のカテゴリの集合に対して周辺に
疑似パタ−ンを発生させることにより遮蔽することも特
徴としている。In order to achieve the above object, the method for creating a pattern recognition dictionary of the present invention is used for (a) pattern recognition of characters, voices, etc. It consists of a standard reference pattern and a set of weights,
For each learning pattern, the arithmetic mean and standard deviation of the features are calculated for each category and each feature axis, and the arithmetic mean is referred to.
, An initial recognition dictionary with the reciprocal of the standard deviation as a weight is created in advance, and the reference pattern or the weight is changed so as to minimize misrecognition while repeatedly giving the learning pattern. In the method of creating a pattern recognition dictionary completed by doing so, the dictionary is partially improved in performance by using a shielding means that shields a specific part of the pattern recognition dictionary from other categories around it. It is characterized by that. Further, (b) as a shielding means, a pattern of a category having a high possibility of misreading a specific set of categories is used. Furthermore, (C)
As a shielding means, it is also characterized in that a pseudo pattern is generated in the periphery of a set of specific categories so as to shield it.

【０００５】[0005]

【作用】本発明においては、先ず認識に用いる全ての
カテゴリについて、カテゴリ毎にいくつかのパタ−ンの
相加平均である参照パタ−ンと標準偏差の逆数である重
みを作成し、これらを初期辞書として登録し、次に学
習に用いたパタ−ンを初期辞書により認識させることに
より、さらに逐次的学習の必要なカテゴリ集合（学習カ
テゴリ集合と呼ぶ）を選択し、次に、選択されたカテ
ゴリ集合の認識結果に着目して、そのカテゴリ集合以外
に誤認識の対象となるカテゴリまたは認識に用いた評価
関数の値が近接したものを学習カテゴリ集合を他のカテ
ゴリから遮蔽するカテゴリとして選択し、最後に、選
択された学習カテゴリ集合、遮蔽カテゴリ集合を用いて
逐次更新学習を行う。なお、上記の処理過程の代り
に、′選択されたカテゴリ集合の認識結果に着目し
て、認識に用いた評価関数の意味で学習カテゴリ集合の
うち最も学習カテゴリ集合の中心から離れたパタ−ンの
距離Ｒを計測し、その評価関数を取るような疑似パタ−
ンを大量に発生させ、これを用いて遮蔽を形成する（疑
似パタ−ンを遮蔽パタ−ンと呼ぶ）処理過程を用いても
よい。このように、本発明では、誤認識を減少させる逐
次変更型のパタ−ン認識辞書作成法を用いて、認識辞書
の一部だけを高性能化する。すなわち、最適化すべき少
数カテゴリに対して、その周辺を疑似パタ−ンでシ−ル
ドすることにより、少数のカテゴリを系全体に影響を及
ぼさずに最適化することができる。その結果、認識辞書
中の特に認識率の悪い部分や、新しく追加されたカテゴ
リとその周辺でのみ最適化操作を行うことができ、効率
のよい認識辞書を作成することができる。In the present invention, first, for all categories used for recognition, reference patterns, which are the arithmetic mean of several patterns, and weights, which are the reciprocal of the standard deviation, are created for each category, and these are calculated. By registering as an initial dictionary and then recognizing the pattern used for learning by the initial dictionary, a category set that requires further sequential learning (called a learning category set) is selected, and then the selected category set is selected. Focusing on the recognition result of the category set, other than that category set, the category that is the target of misrecognition or the one whose evaluation function value used for recognition is close is selected as the category that shields the learning category set from other categories. Finally, iterative learning is performed using the selected learning category set and occlusion category set. Instead of the above process, paying attention to the recognition result of the selected category set, the pattern farthest from the center of the learning category set among the learning category sets in terms of the evaluation function used for recognition. The pseudo pattern that measures the distance R of the
It is also possible to use a process in which a large amount of the generated pattern is generated and a shield is formed by using the generated pattern (the pseudo pattern is called a shield pattern). As described above, in the present invention, the performance of only a part of the recognition dictionary is improved by using the method of creating a pattern recognition dictionary of the sequentially changing type which reduces erroneous recognition. That is, by shielding the periphery of a minority category to be optimized by a pseudo pattern, the minority category can be optimized without affecting the entire system. As a result, the optimizing operation can be performed only in the part having a particularly low recognition rate in the recognition dictionary, the newly added category, and the periphery thereof, and an efficient recognition dictionary can be created.

【０００６】[0006]

【実施例】以下、本発明の実施例を、図面により詳細に
説明する。図１は、本発明の第１の実施例を示すパタ−
ン認識辞書作成システムの機能ブロック図である。図１
に示すように、本発明の辞書作成方法は、学習カテゴリ
選択のフェ−ズ１と逐次学習のフェ−ズ２から構成され
る。先ず、フェ−ズ１（学習カテゴリ選択部）は、初期
辞書作成部０１、パタ−ン認識部０２、学習カテゴリ集
合選択部０３、および遮蔽カテゴリ集合選択部０４から
なり、初期辞書の作成、学習対象カテゴリ集合の選択、
および遮蔽カテゴリ集合の選択の各処理を行う。フェ−
ズ２（逐次学習部）は、学習パタ−ン選択部０５、パタ
−ン認識部０６、辞書修正判定部０７、辞書修正実行部
０８、および終了判定部０９からなり、フェ−ズ１で選
択した学習カテゴリセットおよび遮蔽カテゴリセットを
構成する各パタ−ンに対して認識を行い、任意の逐次学
習アルゴリズムにより両カテゴリセットで誤認識が少な
くなるように辞書を作成する。以下、各ブロック毎にそ
の処理動作を述べる。Embodiments of the present invention will now be described in detail with reference to the drawings. FIG. 1 is a pattern diagram showing a first embodiment of the present invention.
FIG. 3 is a functional block diagram of a recognition dictionary creating system. Figure 1
As shown in FIG. 5, the dictionary creating method of the present invention comprises Phase 1 of learning category selection and Phase 2 of sequential learning. First, Phase 1 (learning category selection unit) includes an initial dictionary creation unit 01, a pattern recognition unit 02, a learning category set selection unit 03, and a masked category set selection unit 04. Select target category set,
And each process of selection of an occlusion category set is performed. Fee
Step 2 (sequential learning section) comprises a learning pattern selection section 05, a pattern recognition section 06, a dictionary correction determination section 07, a dictionary correction execution section 08, and an end determination section 09, and is selected by Phase 1. Recognition is performed for each pattern that constitutes the learning category set and the masked category set, and a dictionary is created by an arbitrary sequential learning algorithm so that erroneous recognition is reduced in both category sets. The processing operation of each block will be described below.

【０００７】先ず、フェ−ズ１の初期辞書作成部０１で
は、カテゴリ毎に相加平均として参照パタ−ンを求め、
標準偏差の逆数として重みを求め、これらを初期辞書と
する。次に、パタ−ン認識部０２では、初期辞書作成部
０１で作成された初期の認識辞書を用いて、学習に用い
たパタ−ンの認識を行う。次に、学習カテゴリ集合選択
部０３では、パタ−ン認識部０２で認識された学習パタ
−ンのうち、特に認識率の低いカテゴリを選択する。そ
の上で、そのカテゴリの誤認識が多いカテゴリを用いて
学習カテゴリ集合を構成する。次に、遮蔽カテゴリ集合
選択部０４では、学習カテゴリ集合を構成する各パタ−
ンについて認識結果を調べて、誤認識となっているカテ
ゴリの全てを遮蔽カテゴリセットとして登録する。フェ
−ズ２では、任意の逐次学習アルゴリズムにより誤認識
率が少なくなるように辞書を作成するが、その際に、選
択されたアルゴリズムでは、学習カテゴリセットに属す
るカテゴリのみを学習の対象とし、遮蔽カテゴリセット
に属するカテゴリについては変更しない。First, the initial dictionary creating section 01 of Phase 1 finds a reference pattern as an arithmetic mean for each category,
Weights are obtained as the reciprocal of the standard deviation, and these are used as the initial dictionary. Next, the pattern recognition unit 02 uses the initial recognition dictionary created by the initial dictionary creation unit 01 to recognize the pattern used for learning. Next, the learning category set selection unit 03 selects a category having a particularly low recognition rate from the learning patterns recognized by the pattern recognition unit 02. Then, a learning category set is constructed by using the category in which the category is often misrecognized. Next, the occlusion category set selection unit 04 selects each pattern forming the learning category set.
The recognition result is checked for all the registered information, and all the categories that are erroneously recognized are registered as a masked category set. In Phase 2, a dictionary is created by an arbitrary sequential learning algorithm so that the misrecognition rate is reduced. At that time, the selected algorithm targets only the categories belonging to the learning category set for learning and masks them. Do not change the categories belonging to the category set.

【０００８】フェ−ズ２の学習パタ−ン選択部０５で
は、認識辞書を学習するためのパタ−ンを１個だけ選択
し、次にパタ−ン認識部０６では、選択したパタ−ンの
認識を行う。次に辞書修正判定部０７では、選択したパ
タ−ンの認識結果により採用した逐次更新アルゴリズム
の条件に従って、辞書を更新するか否かを判定する。判
定の結果、条件を満たしたものについては、辞書修正実
行部０８においてその辞書の更新を実行する。ただし、
選択したパタ−ンが遮蔽カテゴリ集合選択部０４で登録
された遮蔽カテゴリセットに属するパタ−ンであった場
合には、辞書の更新は行わずに学習パタ−ン選択部０５
に戻り、新たな学習パタ−ンを選択する。また、辞書修
正実行部０８においては、採用した逐次更新アルゴリズ
ムの辞書変更規則により辞書の更新を行う。次に、終了
判定部０９では、採用した逐次更新アルゴリズムの終了
判定条件に従って、学習の終了を判定する。学習が終了
していないと判定された場合には、学習パタ−ン選択部
０５に戻り、新たな学習パタ−ンを選択して学習を続行
する。The learning pattern selecting section 05 of Phase 2 selects only one pattern for learning the recognition dictionary, and then the pattern recognizing section 06 selects the selected pattern. To recognize. Next, the dictionary correction determination unit 07 determines whether to update the dictionary according to the conditions of the sequential update algorithm adopted based on the recognition result of the selected pattern. If the result of determination is that the conditions are satisfied, the dictionary correction execution unit 08 updates the dictionary. However,
If the selected pattern is a pattern belonging to the masking category set registered by the masking category set selecting unit 04, the learning pattern selecting unit 05 does not update the dictionary.
Return to and select a new learning pattern. Further, the dictionary correction execution unit 08 updates the dictionary according to the dictionary change rule of the adopted sequential update algorithm. Next, the end determination unit 09 determines the end of learning according to the end determination condition of the adopted sequential update algorithm. If it is determined that the learning has not ended, the process returns to the learning pattern selection unit 05 to select a new learning pattern and continue the learning.

【０００９】次に、本発明の具体的処理動作を説明す
る。ここでは、Ｔ．Ｋｏｈｏｎｅｎにより提案されてい
るＬＶＱ（学習ベクトル量子化）を用いた例で説明する
が、説明を簡単に行うためであって、本発明はこれに限
定されることなく、他の逐次学習アルゴリズムを用いて
もよいのは勿論である。ＬＶＱ法は、パタ−ン認識にお
けるマルチテンプレ−ト辞書を作成するために用いられ
る方法であって、クラスタリング手法で初期辞書を作成
した後に学習パタ−ンを認識させたが、正しく認識でき
ず、かつその学習パタ−ンが正読側の参照パタ−ンと誤
読側の参照パタ−ンとの識別境界付近に存在する場合
に、正しく認識されるべきカテゴリの参照パタ−ン中で
最も距離が近かった参照パタ−ンを近付け、誤って認識
されたカテゴリの参照パタ−ンを遠ざけるという過程を
繰り返し行うことにより、認識辞書の位置の最適化を行
う。参照パタ−ンの移動処理の具体例を、下式（１）お
よび（２）により説明する。なお、この式（１）（２）
はパタ−ンを表現している特徴ベクトルの１次元での移
動式であるから、各特徴軸毎にこの移動式を実行する必
要がある。〔正読側〕ｍ(ｔ)＝ｍ(ｔ−１)＋α(ｔ)(ｘ−ｍ(ｔ−１)）・・・・・・・・（１）〔誤読側〕ｍ(ｔ)＝ｍ(ｔ−１)−α(ｔ)(ｘ−ｍ(ｔ−１)）・・・・・・・・（２）ここで、ｍ（ｔ）は時刻ｔの参照パタ−ン、ｍ（ｔ−
１）は時刻ｔ−１の参照パタ−ン、ｘは学習パタ−ン α（ｔ）は減少関数（０＜α＜１）である。Next, a specific processing operation of the present invention will be described. Here, T. An example using LVQ (Learning Vector Quantization) proposed by Kohonen will be described. However, the present invention is not limited to this, and other sequential learning algorithms are used for the purpose of simplifying the description. Of course, it is okay. The LVQ method is a method used to create a multi-template dictionary in pattern recognition, and the learning pattern is recognized after the initial dictionary is created by the clustering method, but it cannot be recognized correctly. If the learning pattern exists near the discrimination boundary between the reference pattern on the correct reading side and the reference pattern on the misreading side, the most distance is found among the reference patterns of the category that should be recognized correctly. The position of the recognition dictionary is optimized by repeating the process of bringing the reference pattern which is close to the reference pattern closer to the reference pattern and moving the reference pattern of the incorrectly recognized category away from the reference pattern. A specific example of the process of moving the reference pattern will be described by the following equations (1) and (2). In addition, this formula (1) (2)
Is a one-dimensional movement formula of the feature vector expressing the pattern, it is necessary to execute this movement formula for each feature axis. [Correct reading side] m (t) = m (t-1) + α (t) (x−m (t-1)) ... (1) [Error reading side] m (t) = m (t-1) -α (t) (x-m (t-1)) (2) where m (t) is the reference pattern at time t, m (t) t-
1) is a reference pattern at time t-1, x is a learning pattern α (t) is a decreasing function (0 <α <1).

【００１０】上記条件の下で、本発明の動作フロ−を説
明する。図２は、図１に示すパタ−ン認識辞書作成シス
テムの動作フロ−チャ−トである。先ず、フェ−ズ１に
おいては、初期辞書作成部０１で認識に用いられる全て
のカテゴリに対して、カテゴリ毎にいくつかのパタ−ン
の相加平均である参照パタ−ンと標準偏差の逆数である
重みベクトルを１個ずつ持った認識辞書Ｍ０を作成する
（ステップ１０１）。次に、パタ−ン認識部０２で、こ
の辞書Ｍ０を用いて全ての学習パタ−ンの認識を行う
（ステップ１０２）。次に、学習カテゴリ集合選択部０
３で、認識結果を解析し、認識率の悪いカテゴリ、特に
相互に誤読し合うカテゴリを学習カテゴリ集合として選
択する（ステップ１０３）。次に、遮蔽カテゴリ集合選
択部０４では、学習カテゴリに属する全てのパタ−ンに
ついて認識結果を分析し、全ての誤読カテゴリのうち学
習カテゴリに属さないものを遮蔽カテゴリセットとして
選択する（ステップ１０４）。選択された学習カテゴリ
セット、遮蔽カテゴリセットの辞書を、初期の認識辞書
Ｍとして登録する（ステップ１０５）。Under the above conditions, the operation flow of the present invention will be described. FIG. 2 is an operation flowchart of the pattern recognition dictionary creating system shown in FIG. First, in Phase 1, for all categories used for recognition in the initial dictionary creation unit 01, the reference pattern, which is the arithmetic mean of several patterns for each category, and the reciprocal of the standard deviation. A recognition dictionary M0 having one weight vector is created (step 101). Next, the pattern recognition unit 02 recognizes all learning patterns using this dictionary M0 (step 102). Next, the learning category set selection unit 0
In step 3, the recognition result is analyzed, and a category having a poor recognition rate, in particular, a category misreading each other is selected as a learning category set (step 103). Next, the occlusion category set selection unit 04 analyzes the recognition results for all the patterns belonging to the learning category, and selects, as the occlusion category set, all misreading categories that do not belong to the learning category (step 104). . The dictionaries of the selected learning category set and occlusion category set are registered as the initial recognition dictionary M (step 105).

【００１１】次に、フェ−ズ２においては、学習パタ−
ン選択部０５で学習パタ−ンを１個だけ選択し（ステッ
プ１０６）、パタ−ン認識部０６でこの学習パタ−ンを
認識辞書Ｍを用いて認識する（ステップ１０７）。次
に、辞書修正判定部０７で認識結果が誤読か正読かを判
定し（ステップ１０８）、正読であった場合には終了判
定部０９に移行する。また、誤読であった場合には、学
習パタ−ンとの距離が最も近い正解カテゴリの参照パタ
−ンとその距離値よりも近い距離にある全ての参照パタ
−ンの各々に対して、学習カテゴリセットに属するの
か、遮蔽カテゴリ集合に属するのかを判定する（ステッ
プ１０９）。学習カテゴリセットに属する参照パタ−ン
であれば、その参照パタ−ンは移動の対象となる。すな
わち、辞書を更新するか否かの判定の結果、条件が満た
されているものについては辞書修正実行部０８でパタ−
ンを移動させる（ステップ１１０）。また、終了判定部
０９で終了条件を満たしているか否かを判定し（ステッ
プ１１１）、条件を満たせば学習を終了し、その時点で
の認識辞書を登録して、処理を終了する。Next, in Phase 2, the learning pattern
The pattern selection unit 05 selects only one learning pattern (step 106), and the pattern recognition unit 06 recognizes this learning pattern using the recognition dictionary M (step 107). Next, the dictionary correction determining unit 07 determines whether the recognition result is a misread or a correct reading (step 108). If it is a misread, the learning pattern is learned for each of the reference pattern of the correct category whose distance from the learning pattern is the closest and each reference pattern which is closer than the distance value. It is determined whether it belongs to the category set or the occlusion category set (step 109). If the reference pattern belongs to the learning category set, the reference pattern is the target of movement. That is, as a result of the determination as to whether or not to update the dictionary, if the condition is satisfied, the dictionary correction execution unit 08 performs the pattern.
(Step 110). Further, the end determination unit 09 determines whether or not the end condition is satisfied (step 111). If the condition is satisfied, the learning ends, the recognition dictionary at that time is registered, and the process ends.

【００１２】図３は、本発明の第２の実施例を示すパタ
−ン認識辞書作成システムの機能ブロック図であり、図
４は、図３のシステムによる処理フロ−チャ−トであ
る。図３の機能ブロックにおいて、図１の機能ブロック
構成と異なる点は、遮蔽カテゴリ集合選択部０４の代り
に、遮蔽パタ−ン発生部１０を設けた点であって、それ
以外の両者の構成は全く同じである。図１の遮蔽カテゴ
リ集合選択部０４では、誤認識の可能性のあるカテゴリ
で学習の必要な部分を遮蔽してしまうのに対して、図３
の遮蔽パタ−ン発生部１０では、疑似パタ−ンで学習の
必要な部分を遮蔽してしまうのである。いずれの場合に
も、遮蔽することにより、認識辞書の一部のみを高性能
化することができる。図４においては、先ず、フェ−ズ
１の初期辞書作成部０１が、認識に用いられる全てのカ
テゴリについて、カテゴリ毎にいくつかのパタ−ンの相
加平均である参照パタ−ンと標準偏差の逆数である重み
ベクトルを１個ずつ有する認識辞書Ｍ０を作成する（ス
テップ２０１）。次に、パタ−ン認識部０２では、この
辞書を用いて全ての学習パタ−ンの認識を行う（ステッ
プ２０２）。次に、学習カテゴリ集合選択部０３では、
認識結果を解析して、認識率の悪いカテゴリ、特に相互
に誤読し合うカテゴリを学習カテゴリ集合として選択す
る（ステップ２０３）。次に、遮蔽カテゴリ集合選択部
０４では、学習カテゴリに属する全てのパタ−ンについ
て、認識結果を分析し、学習カテゴリセットの中心から
最も離れたパタ−ンと同じ距離に遮蔽パタ−ンを形成す
る（ステップ２０４）。選択された学習カテゴリセット
の辞書を初期の認識辞書Ｍとして登録する（ステップ２
０５）。この２つの処理（ステップ２０４，２０５）が
図２のフロ−と異なる点である。FIG. 3 is a functional block diagram of a pattern recognition dictionary creating system showing a second embodiment of the present invention, and FIG. 4 is a processing flowchart by the system of FIG. The functional block of FIG. 3 is different from the functional block configuration of FIG. 1 in that a masking pattern generation unit 10 is provided in place of the masking category set selection unit 04, and other configurations are the same. Exactly the same. The occluding category set selecting unit 04 in FIG. 1 occludes a portion that needs learning in a category that may be erroneously recognized.
In the shielding pattern generation unit 10 of (1), the pseudo pattern blocks the portion that needs learning. In any case, by shielding, only a part of the recognition dictionary can be improved in performance. In FIG. 4, first, the initial dictionary creating unit 01 of the phase 1 refers to the reference pattern which is an arithmetic mean of several patterns and the standard deviation for all categories used for recognition. A recognition dictionary M0 having one weight vector which is the reciprocal of 1 is created (step 201). Next, the pattern recognition unit 02 recognizes all learning patterns using this dictionary (step 202). Next, in the learning category set selection unit 03,
The recognition result is analyzed to select a category having a poor recognition rate, particularly a category misreading each other as a learning category set (step 203). Next, the occlusion category set selection unit 04 analyzes the recognition results for all the patterns belonging to the learning category, and forms occlusion patterns at the same distance as the pattern farthest from the center of the learning category set. (Step 204). The dictionary of the selected learning category set is registered as the initial recognition dictionary M (step 2
05). These two processes (steps 204 and 205) are different from the flow of FIG.

【００１３】次に、フェ−ズ２においては、学習パタ−
ン選択部０５で、学習パタ−ンを１個選択し（ステップ
２０６）、パタ−ン認識部０６において、この学習パタ
−ンを認識辞書Ｍを用いて認識する（ステップ２０
７）。次に、辞書修正判定部０７では、認識結果が誤読
であるか正読であるかを判定し（ステップ２０８）、正
読であれば終了判定部０９に移行する。また、誤読であ
れば、学習パタ−ンとの距離が最も近い正解カテゴリの
参照パタ−ンとその距離値よりも近い距離にある全ての
参照パタ−ンの各々に対して、学習カテゴリセットに属
するか、遮蔽パタ−ンに属するかを判定する（ステップ
２０９）。遮蔽パタ−ンに属する場合には、学習パタ−
ン選択部０５に戻って再度、学習パタ−ンを選択し直
す。また、学習カテゴリセットに属する参照パタ−ンで
あれば、その参照パタ−ンは移動の対象となる。すなわ
ち、辞書修正実行部０８において、参照パタ−ンの移動
を実行し（ステップ２１０）、その後、終了判定部０９
において、学習パタ−ンの誤読がなくなったか、あるい
は設定した学習回数に達したかの終了判定を行い（ステ
ップ２１１）、条件を満たせば学習を終了し、その時点
での認識辞書を登録して処理を終了する。Next, in Phase 2, a learning pattern
The learning pattern selection unit 05 selects one learning pattern (step 206), and the pattern recognition unit 06 recognizes this learning pattern using the recognition dictionary M (step 20).
7). Next, the dictionary correction determination unit 07 determines whether the recognition result is erroneous reading or correct reading (step 208), and if it is correct reading, the process proceeds to the end determination unit 09. If it is a misread, the learning pattern is set as the learning category set for each of the reference pattern of the correct category whose distance to the learning pattern is the closest and all the reference patterns which are closer than the distance value. It is determined whether or not it belongs to the shield pattern (step 209). If it belongs to the shield pattern, the learning pattern
Then, the process returns to the selection unit 05 and the learning pattern is selected again. If the reference pattern belongs to the learning category set, the reference pattern is the target of movement. That is, the dictionary correction execution unit 08 executes the movement of the reference pattern (step 210), and then the end determination unit 09.
At step 211, it is determined whether the learning pattern has been erroneously read or the set number of learnings has been reached (step 211). If the conditions are satisfied, the learning is terminated, and the recognition dictionary at that time is registered and processing is performed. finish.

【００１４】[0014]

【発明の効果】以上説明したように、本発明によれば、
認識辞書において学習の必要な部分だけを学習の対象と
し、誤認識の可能性のあるカテゴリで学習の必要な部分
を遮蔽してしまうか、あるいは疑似パタ−ンで学習の必
要な部分を遮蔽してしまうので、逐次学習を辞書の一部
分で行ったとしても辞書全体に悪影響を及ぼすことが最
小限に抑えられる。その結果、パタ−ン認識辞書の設
計、カテゴリ追加等の際に、比較的少ない処理量と単純
なアルゴリズムで高い認識率を有する認識辞書を作成す
ることができる。As described above, according to the present invention,
Only the parts that need to be learned in the recognition dictionary are targeted for learning, and the parts that need to be learned are blocked by categories that have the possibility of misrecognition, or the parts that need to be learned are blocked by pseudo patterns. Therefore, even if the sequential learning is performed in a part of the dictionary, it is possible to minimize the adverse effect on the entire dictionary. As a result, when designing a pattern recognition dictionary, adding categories, etc., a recognition dictionary having a high recognition rate can be created with a relatively small amount of processing and a simple algorithm.

[Brief description of drawings]

【図１】本発明の第１の実施例を示すパタ−ン認識辞書
作成システムの機能ブロック図である。FIG. 1 is a functional block diagram of a pattern recognition dictionary creating system showing a first embodiment of the present invention.

【図２】図１におけるパタ−ン認識辞書作成方法の動作
フロ−チャ−トである。2 is an operation flowchart of the method for creating a pattern recognition dictionary in FIG.

【図３】本発明の第２の実施例を示すパタ−ン認識辞書
作成システムの機能ブロック図である。FIG. 3 is a functional block diagram of a pattern recognition dictionary creating system showing a second embodiment of the present invention.

【図４】図３におけるパタ−ン認識辞書作成方法の動作
フロ−チャ−トである。4 is an operation flowchart of the method for creating a pattern recognition dictionary in FIG.

[Explanation of symbols]

０１初期辞書作成部０２パタ−ン認識部０３学習カテゴリ集合選択部０４遮蔽カテゴリ集合選択部０５学習パタ−ン選択部０６パタ−ン認識部０７辞書修正判定部０８辞書修正実行部０９終了判定部１０遮蔽パタ−ン発生部 01 initial dictionary creation unit 02 pattern recognition unit 03 learning category set selection unit 04 masked category set selection unit 05 learning pattern selection unit 06 pattern recognition unit 07 dictionary correction determination unit 08 dictionary correction execution unit 09 end determination unit 10 Shielding pattern generator

Claims

[Claims]

1. A standard reference pattern for each category, which is used for pattern recognition of characters, voices, etc., and a set of weights, and a plurality of learning patterns for each category. , The arithmetic mean and standard deviation of the features are calculated for each characteristic axis, the arithmetic mean is used as a reference pattern, and an initial recognition dictionary in which the reciprocal of the standard deviation is used as a weight is prepared in advance, and the learning pattern is prepared. In the method for creating a pattern recognition dictionary, which is completed by changing reference patterns or weights so as to minimize misrecognition while repeatedly giving
A pattern recognition dictionary creating method characterized in that the performance of the dictionary is partially improved by using a shielding means that shields a specific part of the computer recognition dictionary from other categories around it.

2. The pattern recognition dictionary creating method according to claim 1, wherein a pattern of a category having a high possibility of misreading a set of specific categories is used as the shielding means. A method for creating a pattern recognition dictionary.

3. The pattern recognition dictionary creating method according to claim 1, wherein the shielding means shields a set of specific categories by generating a pseudo pattern around the set. A method for creating a pattern recognition dictionary.