JP4350228B2

JP4350228B2 - Pattern recognition method and apparatus

Info

Publication number: JP4350228B2
Application number: JP27058699A
Authority: JP
Inventors: 直樹 ▲高▼田; 博史亀山
Original assignee: Glory Ltd
Current assignee: Glory Ltd
Priority date: 1999-09-24
Filing date: 1999-09-24
Publication date: 2009-10-21
Anticipated expiration: 2019-09-24
Also published as: JP2001092924A

Description

【０００１】
【発明の属する技術分野】
この発明は、パターン認識方法および装置に関し、特に、手書き文字等を高い認識率で認識することのできるパターン認識方法および装置に関する。
【０００２】
【従来の技術】
近年、パターン認識技術を利用した文字認識装置は、技術の向上とともにその対象が活字文字から手書き文字にまで広がり、需要も増大している。
【０００３】
文字認識装置は、入力された画像に対し、前処理として２値化と文字の切り出しを行った後、当該文字の特徴抽出を行う。特徴抽出は、例えば、文字の輪郭方向に関する特徴量を抽出するもので、図１１（ａ）に示すように対象となる文字を５×５に分割し、分割したそれぞれの領域に対して図１１（ｂ）に示すような８方向の特徴量を抽出する。つまり、ここで抽出した特徴量は、５×５×８の２００次元のベクトルとして表すことができる。なお、この２００次元の空間を特徴空間と呼ぶ。
【０００４】
認識対象文字の特徴量を抽出すると、この特徴量に基づいて候補パターンの選出を行う。候補パターンの選出は、抽出した特徴量を標本辞書を用いて全パターンマッチングを行い、ユークリッド距離の近いものを１５個とり、これを入力に対する近傍パターンとすることにより行う。
【０００５】
続いて、近傍パターンから上位の２カテゴリー（クラス、文字の種類）を選択し、これを候補カテゴリーとして、詳細判定を行う。
【０００６】
詳細判定は、フィッシャーの方法（石井健一郎・上田修功・前田英作・村瀬洋：パターン認識，オーム社，１９９８参照）により行う。フィッシャーの方法とは、特徴空間をより次元の小さい部分空間に変換する方法の一つであり、特徴空間上のクラス（カテゴリー）のパターンの分布から、このクラスを識別するのに最適な１次元軸を求める手法である。
【０００７】
例えば、図１２に示すクラスＣ１とＣ２のパターンを判別する場合、フィッシャー比と呼ばれる（クラス間分散／クラス内分散）を最大とするような座標変換を行って最適なＹ１軸を求め、このＹ１軸に射影を行うと（Ｘ１、Ｘ２は、変換前の軸）、クラスＣ１、Ｃ２を最適に分離することのできる境界（図中、破線で示す）を定めることができる。これに対して、フィッシャー比を最大としない座標変換を行った場合には、Ｙ２軸への射影のようにクラスＣ１、Ｃ２を分離する境界を定めることはできない。つまり、フィッシャーの方法では、異なるクラスのパターンを離し、同一クラスのパターンが固まって分布するように射影することのできる射影軸を決定している。
【０００８】
ところが、上述した方法では、図１３（ａ）に示す「フ」という文字が入力された場合、図１３（ｂ）に示す「ヲ」と図１３（ｃ）に示す「フ」が候補パターンとして選出され、詳細判定により図１３（ｂ）に示す「ヲ」と認識されてしまう場合がある。同様に、図１３（ｄ）に示す「ハ」という文字が入力された場合、図１３（ｅ）に示す「ヘ」と図１３（ｆ）に示す「ハ」が候補パターンとして選出され、詳細判定により図１３（ｅ）に示す「ヘ」と認識されてしまう場合がある。
【０００９】
これは、候補として選択されたパターンが類似している場合に起こり得る。例えば、図１３（ｄ）に示した「ハ」の場合、図１３（ｅ）に示した「ヘ」と図１３（ｆ）に示した「ハ」のそれぞれと重ね合わせると、図１４（ａ）、（ｂ）に示すような結果が得られる。この重ね合わせの結果、つまり、距離によるパターンマッチングの結果は、図１３（ｅ）に示した「ヘ」が図１３（ｄ）に示した「ハ」に最も近いことを示している。このような場合、フィッシャーの方法により詳細判定を行っても、図１５に示したようにクラスＡとクラスＢとを完全に分離することはできず、結果として誤認識が生じてしまうことになる。
【００１０】
また、上述した方法の他にも、特開平６−２８５２５号公報に記載されている「文字認識方法」のようなものも提案されている。この方法では、文字見本の原始特徴ベクトルの集合分布が最適となる特徴空間を求め、特徴抽出対象文字の原始特徴ベクトルをその特徴空間に投影して二次特徴ベクトルを得ることにより文字認識を行うものである。
【００１１】
しかし、この方法では、求めた特徴空間は、文字見本において原始特徴ベクトルの集合分布が最適となる特徴空間であり、入力データに対してこの特徴空間が最適であるかどうかは分からない。このため、変化の小さい活字では高い認識能力を有するが、手書き文字に対しては、高い認識能力を持つとは限らない。
【００１２】
【発明が解決しようとする課題】
上述したように、従来の文字認識方法では、活字文字に対しては高い認識能力を有するが、手書き文字に対しては高い認識能力を有するとは言い難かった。
【００１３】
そこで、この発明は、手書き文字に対しても高い認識能力を有するパターン認識方法および装置を提供することを目的とする。
【００１４】
【課題を解決するための手段】
上述した目的を達成するため、請求項１の発明は、所定の識別関数を用いて認識対象の特徴量を評価することにより該識別対象を２つのクラスのいずれに属するかを識別するパターン認識方法において、前記認識対象の特徴量を抽出し、該抽出した特徴量に基づいて、前記クラス毎に複数の標本パターンを登録可能な辞書手段に格納された全標本パターンのうち、前記認識対象とのマッチング結果が上位の標本パターンを近傍パターンとして複数選出し、該選出した近傍パターンに基づいて２つの候補クラスを決定し、該決定した２つの候補クラスごとに求めた該候補クラスに属する前記近傍パターンの平均をもとに、該候補クラスの標本パターンの特徴量の中から注目すべき成分を選択し、該選択した成分に基づいて識別関数の係数を決定し、該係数が決定した識別関数に基づいて前記認識対象の識別を行うことを特徴とする。
【００１５】
また、請求項２の発明は、請求項１の発明において、前記２つの候補クラスのそれぞれについて、前記辞書手段に格納された全標本パターンの平均と、前記近傍パターンの平均を求め、該２つの標本パターンの平均および該２つの近傍パターンの平均をもとに、各成分毎に差を求めて各成分の寄与度を算出し、該寄与度の所定個数の和である局所累積寄与度を最大化する成分を注目すべき成分として選択することを特徴とする。
【００１６】
また、請求項３の発明は、請求項１の発明において、前記２つの候補クラスのそれぞれについて、前記近傍パターンの平均を求め、該２つの近傍パターンの平均をもとに、各成分毎に差を求めて各成分の寄与度を算出し、該寄与度の所定個数の和である局所累積寄与度を最大化する成分を注目すべき成分として選択することを特徴とする。
【００１７】
また、請求項４の発明は、請求項１の発明において、前記識別関数は、フィッシャーの線形判別法を適用して決定されることを特徴とする。
【００１８】
また、請求項５の発明は、請求項１の発明において、前記認識対象は、文字であることを特徴とする。
【００１９】
また、請求項６の発明は、所定の識別関数を用いて認識対象の特徴量を評価することにより該識別対象を２つのクラスのいずれに属するかを識別するパターン認識装置において、前記認識対象の特徴量を抽出する特徴量抽出手段と、前記クラス毎に複数の標本パターンを登録可能で、前記クラスに属する標本パターンの特徴量を格納する辞書手段と、前記特徴量抽出手段が抽出した前記認識対象の特徴量に基づいて、前記辞書手段に格納された全標本パターンのうち、前記認識対象とのマッチング結果が上位の標本パターンを近傍パターンとして複数選出し、該選出した近傍パターンに基づいて２つの候補クラスを決定する候補選出手段と、前記２つの候補クラスごとに求めた該候補クラスに属する前記近傍パターンの平均をもとに、該候補クラスの標本パターンの特徴量の中から注目すべき成分を選択する特徴選択手段と、前記特徴選択手段が選択した成分に基づいて識別関数の係数を決定し、該係数が決定した識別関数に基づいて前記認識対象の識別を行う識別処理手段とを具備することを特徴とする。
【００２２】
【発明の実施の形態】
以下、この発明に係るパターン認識方法および装置の一実施例について、添付図面を参照して詳細に説明する。
【００２３】
図１は、この発明を適用した文字認識装置の概略構成を示す機能ブロック図である。
同図に示すように、文字認識装置１０は、画像入力部１と前処理部２、特徴抽出部３、候補選出部４、特徴選択部５、判定処理部６、辞書部７、結果出力部８を具備して構成される。
【００２４】
画像入力部１は、認識対象となる文字（列）の画像を入力し、前処理部２は、認識の前処理として画像入力部１より入力された画像の２値化と文字の切り出しなどを行う。特徴抽出部３は、前処理部２で前処理が施された文字画像から特徴ベクトルを生成する。特徴ベクトルの生成は、従来と同様に行い、例えば、文字画像を５×５に分割し、分割した各々で８方向の特徴量を抽出する。候補選出部４は、特徴抽出部３で生成された特徴ベクトルと辞書部７に格納されている全標本パターンとのユークリッド距離を求め、候補カテゴリー（クラス）を選出する。特徴選択部５は、特徴抽出部３が生成した特徴ベクトルと候補選出部４が選出した候補カテゴリーに基づいて、注目すべき成分を選択し、判定処理部６は、特徴選択部５が選択した成分を参酌してフィッシャーの方法により識別判定を行う。結果出力部６は、判定処理部６が識別判定したカテゴリーを認識結果として出力する。
【００２５】
次に、図２を参照して文字認識装置１０の動作を説明する。
図２は、文字認識装置１０の動作の流れを示すフローチャートである。
【００２６】
文字認識装置１０は、動作を開始すると、まず、画像入力部１から認識対象の文字画像を入力し（ステップ１０１）、前処理部２が画像入力部１から入力された文字画像に対する前処理を行う（ステップ１０２）。続いて、特徴抽出部３が前処理が施された文字画像から特徴の抽出、つまり、特徴ベクトルを生成し（ステップ１０３）、候補選出部４が、特徴ベクトルと辞書部７に格納されている全標本パターンとのパターンマッチング（距離計算）を行い、その結果の上位の所定数（例えば１５個）を近傍パターンとし、この近傍パターンに基づいて候補カテゴリーを２つ決定する（ステップ１０４）。
【００２７】
続いて、特徴選択部５が後述する特徴選択処理を行い（ステップ１０５）、判定処理部６が後述する詳細判定処理を行い（ステップ１０６）、結果出力部８が認識結果を出力して（ステップ１０７）、処理を終了する。
【００２８】
次に、特徴選択部５が行う特徴選択処理について説明する。
特徴選択処理は、２つの候補カテゴリーから認識結果としてのカテゴリーを決定する際に、注目すべき成分を決定する処理である。例えば、図３（ａ）に示す代表的な「ユ」と図３（ｂ）に示す代表的な「コ」を識別する際には、図３（ｃ）に示すように、文字の左上、右上、右下の順に重点を置くことで、その識別が容易となる。しかしながら、図３（ｄ）に示すような「コ」が入力された場合、文字の左上や右上に注目しても「ユ」との違いは少なく（図３（ｅ）参照）、図３（ｆ）に示すように文字の右下にのみ注目すれば、「ユ」との識別が容易となる。
【００２９】
ここで、図４を参照して特徴選択処理の流れを説明する。
図４は、特徴選択処理の流れを示すフローチャートである。
【００３０】
特徴選択処理では、まず、全特徴成分に対してフィッシャーの方法を適用して射影を行う行列Ａを求める（ステップ１５１）。続いて、式１に示す候補カテゴリーの全標本辞書平均と、式２に示す近傍パターンの平均を求める（ステップ１５２）。
【００３１】
【式１】

【式２】

次に、フィッシャーの方法適用時に安定して射影軸が求まるようにする処理を行うが、これは候補カテゴリーの近傍平均の成分u'_i、v'_iが０または１であるようなものを除く処理であり、具体的には、式３を満たす成分ｉを抽出する（ステップ１５３）。
【００３２】
【式３】

続いて、以下に定義する局所累積寄与度を算出する（ステップ１５４）。ここで、局所累積寄与度について説明する。まず、式４に示すようにｉ成分の寄与度κⁱを定義する。次に、入力データの近傍パターンを考慮すると、式４から各成分の寄与度は式５に示すように定義できる。
【００３３】
【式４】

【式５】

なお、式５において、αは近傍パターンの影響度を示しており、αが大きい程近傍情報を重視していることになる。
【００３４】
局所累積寄与度は、(κⁱ _local)^N _i=1から任意にｍ個取り出したものの総和として定義し、Κ_local(m)で表す。
【００３５】
次に、局所累積寄与度Κ_local(m)を最大にする最大局所累積寄与度Κ^* _local(m)を用いて最大化する成分を選択し（ステップ１５５）、特徴選択処理を終了する。
【００３６】
なお、ここで説明した特徴選択処理を図３（ｄ）に示した「コ」を入力パターンとした場合に適用すると、各方向（方向については、図１１（ｂ）を参照）、各成分における寄与度κⁱ _localは、図５に示すようになり、文字の下部の値が大きいことがわかる。
【００３７】
次に、判定処理部６における詳細判定処理（図２のステップ１０６）について説明する。
図６は、詳細判定処理の流れを示すフローチャートである。
【００３８】
判定処理部６は、詳細判定処理を開始すると、特徴選択部５により特徴選択された空間で候補カテゴリーにおける近傍平均、標本辞書平均、クラス内変動行列を求め（ステップ１６１）、フィッシャーの線形判別法を適用して一次元への射影行列A_sを得て（ステップ１６２）、カテゴリーを決定し（ステップ１６３）、詳細判定処理を終了する。
【００３９】
なお、入力データの特徴ベクトルをx、特徴選択された特徴ベクトルをx_s、候補カテゴリーの全標本全体の平均において特徴選択されたものをu_s、v_s、近傍パターンのうち候補カテゴリーであるものの平均において特徴選択されたものをu'_s、v'_sとすれば、識別関数は、式６で表され、その識別規則は式７で表せる。
【００４０】
【式６】

【式７】

また、図７は、図３（ｄ）に示した「コ」を入力パターンとした場合にステップ１６２で得られる射影行列A_sの各成分を示したものであり、０となっている成分は、選択されなかった成分である。この各成分を図８に示す特徴選択を行わなかった場合の射影行列A（従来と同様の方法で得たもの）と比較すると、射影行列A_sは、文字の下部の値が大きく、重み付けされていることがわかる。
【００４１】
このように特徴選択をして射影軸への射影を行うと、例えば、図９に示すようなデータＸ１（カテゴリーＢに属する）が入力されると、従来の方法で得られる射影軸１へのＸ１の射影Ｙ１はカテゴリーＡに属することになるが（Ｙ１ＹＡ１＜Ｙ１ＹＢ１）、特徴選択により得られる射影軸２へのＸ１の射影Ｙ２は、カテゴリーＢに属することになる（Ｙ２ＹＡ２＞Ｙ２ＹＢ２）。ただし、同図中の実線の楕円は、各カテゴリー（Ａ、Ｂ）の平均パターンm₁、m₂から等密度となる線であり、破線の楕円は、入力データＸ１の近傍パターンのうちカテゴリーＡであるものの領域（領域ａ）と、カテゴリーＢであるものの領域（領域ｂ）を示している。
【００４２】
また、図１０は、ある入力データに対して特徴選択を行わなかった場合（従来法）と、特徴選択を行った場合（本手法）に決定されたカテゴリーの例を示したもので（候補カテゴリーは、従来法と本手法で同一）、特徴選択を行うことにより、高い認識能力を有するパターン認識を行えることが分かる。
【００４３】
なお、この実施例では、文字認識を例として説明したが、音声認識などの他のパターン認識であっても、フィッシャーの方法を適用するパターン認識であれば特徴選択を行って認識率を向上させることができる。
【００４４】
【発明の効果】
以上説明したように、この発明によれば、入力データの近傍を考慮して特徴選択を行い、特徴選択された空間でフィッシャーの線形判別法を適用するように構成したので、手書き文字などの変化の大きい入力に対しても高い認識能力を得ることができる。
【図面の簡単な説明】
【図１】この発明を適用した文字認識装置の概略構成を示す機能ブロック図である。
【図２】文字認識装置１０の動作の流れを示すフローチャートである。
【図３】特徴選択処理を説明するための図である。
【図４】特徴選択処理の流れを示すフローチャートである。
【図５】寄与度の算出結果を示した図である。
【図６】詳細判定処理の流れを示すフローチャートである。
【図７】射影行列A_sの各成分を示した図である。
【図８】射影行列Aの各成分を示した図である。
【図９】特徴選択をした射影軸への射影を説明するための図である。
【図１０】識別結果例を示した図である。
【図１１】特徴量の抽出を説明するための図である。
【図１２】フィッシャーの方法による射影を説明するための図である。
【図１３】誤認識を説明するための図である。
【図１４】重ね合わせの結果を示した図である。
【図１５】線形分離できないカテゴリーの特徴空間での分布を示した図である。
【符号の説明】
１画像入力部
２前処理部
３特徴抽出部
４候補選出部
５特徴選択部
６判定処理部
７辞書部
８結果出力部[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a pattern recognition method and apparatus, and more particularly to a pattern recognition method and apparatus capable of recognizing handwritten characters and the like with a high recognition rate.
[0002]
[Prior art]
In recent years, a character recognition apparatus using a pattern recognition technology has expanded from printed characters to handwritten characters as the technology has been improved, and the demand has been increasing.
[0003]
The character recognition device performs binarization and character segmentation as preprocessing on the input image, and then performs feature extraction of the character. The feature extraction is, for example, extracting a feature amount related to the outline direction of a character. As shown in FIG. 11A, the target character is divided into 5 × 5, and each divided region is divided into FIG. Features in eight directions as shown in (b) are extracted. That is, the feature quantity extracted here can be expressed as a 5 × 5 × 8 200-dimensional vector. This 200-dimensional space is called a feature space.
[0004]
When the feature amount of the recognition target character is extracted, a candidate pattern is selected based on the feature amount. Selection of candidate patterns is performed by performing all pattern matching on the extracted feature values using a sample dictionary, and taking 15 patterns with short Euclidean distances and setting them as neighboring patterns for the input.
[0005]
Subsequently, the upper two categories (class, character type) are selected from the neighboring patterns, and detailed determination is performed using these categories as candidate categories.
[0006]
Detailed determination is performed by the Fisher method (see Kenichiro Ishii, Nobuyoshi Ueda, Eisaku Maeda, Hiroshi Murase: Pattern Recognition, Ohmsha, 1998). Fischer's method is one of the methods to convert the feature space into a subspace with a smaller dimension. One-dimensional optimum for identifying this class from the class (category) pattern distribution on the feature space. This is a method for obtaining an axis.
[0007]
For example, when the patterns of classes C1 and C2 shown in FIG. 12 are discriminated, an optimal Y1 axis is obtained by performing coordinate transformation that maximizes (inter-class variance / intra-class variance) called the Fisher ratio. When projection is performed on the axes (X1 and X2 are axes before conversion), boundaries (indicated by broken lines in the figure) that can optimally separate the classes C1 and C2 can be determined. On the other hand, when coordinate transformation that does not maximize the Fisher ratio is performed, it is not possible to determine a boundary that separates the classes C1 and C2 as in the projection onto the Y2 axis. That is, according to the Fisher method, projection axes that can be projected so that patterns of different classes are separated and patterns of the same class are fixedly distributed are determined.
[0008]
However, in the method described above, when the character “F” shown in FIG. 13A is input, “W” shown in FIG. 13B and “F” shown in FIG. In some cases, it is selected and recognized as “wo” shown in FIG. Similarly, when the character “C” shown in FIG. 13D is input, “F” shown in FIG. 13E and “C” shown in FIG. 13F are selected as candidate patterns. The determination may be recognized as “f” shown in FIG.
[0009]
This can occur when the patterns selected as candidates are similar. For example, in the case of “C” shown in FIG. 13D, when “F” shown in FIG. 13E and “C” shown in FIG. ), Results as shown in (b) are obtained. The result of superposition, that is, the result of pattern matching by distance, indicates that “f” shown in FIG. 13E is closest to “c” shown in FIG. In such a case, even if the detailed determination is made by the Fisher method, class A and class B cannot be completely separated as shown in FIG. 15, resulting in erroneous recognition. .
[0010]
In addition to the method described above, a “character recognition method” described in Japanese Patent Laid-Open No. 6-28525 has also been proposed. In this method, character recognition is performed by obtaining a feature space in which the set distribution of primitive feature vectors of a character sample is optimal, and projecting the original feature vectors of feature extraction target characters onto the feature space to obtain secondary feature vectors. Is.
[0011]
However, in this method, the obtained feature space is a feature space in which the set distribution of primitive feature vectors is optimal in a character sample, and it is not known whether this feature space is optimal for input data. For this reason, although the type | mold with a small change has a high recognition capability, it does not necessarily have a high recognition capability with respect to a handwritten character.
[0012]
[Problems to be solved by the invention]
As described above, the conventional character recognition method has high recognition ability for printed characters, but it is difficult to say that it has high recognition ability for handwritten characters.
[0013]
SUMMARY OF THE INVENTION An object of the present invention is to provide a pattern recognition method and apparatus having a high recognition ability for handwritten characters.
[0014]
[Means for Solving the Problems]
In order to achieve the above-described object, the invention according to claim 1 is a pattern recognition method for identifying which of two classes the identification object belongs by evaluating the feature quantity of the recognition object using a predetermined identification function. The feature quantity of the recognition target is extracted, and based on the extracted feature quantity, among all the sample patterns stored in the dictionary means capable of registering a plurality of sample patterns for each class, the recognition target and A plurality of sample patterns with higher matching results are selected as neighboring patterns, two candidate classes are determined based on the selected neighboring patterns, and the neighboring patterns belonging to the candidate class determined for each of the two candidate classes thus determined Based on the average of the candidate class, select a notable component from the feature quantity of the sample pattern of the candidate class, and determine the coefficient of the discriminant function based on the selected component , And performs identification of the recognition object on the basis of the discriminant function the coefficient is determined.
[0015]
According to a second aspect of the present invention, in the first aspect of the present invention, for each of the two candidate classes, an average of all sample patterns stored in the dictionary means and an average of the neighboring patterns are obtained, and the two Based on the average of the sample pattern and the average of the two neighboring patterns, a difference is calculated for each component to calculate the contribution of each component, and the local cumulative contribution that is the sum of a predetermined number of contributions is maximized. The component to be converted is selected as a notable component.
[0016]
According to a third aspect of the present invention, in the first aspect of the invention, an average of the neighboring patterns is obtained for each of the two candidate classes, and a difference is determined for each component based on the average of the two neighboring patterns. And calculating the contribution of each component, and selecting the component that maximizes the local cumulative contribution, which is the sum of a predetermined number of contributions, as a component to be noted.
[0017]
The invention of claim 4 is characterized in that, in the invention of claim 1, the discriminant function is determined by applying Fisher's linear discriminant method.
[0018]
The invention of claim 5 is characterized in that, in the invention of claim 1, the recognition object is a character.
[0019]
According to a sixth aspect of the present invention, there is provided a pattern recognition apparatus for identifying which of the two classes the identification object belongs by evaluating a feature quantity of the recognition object using a predetermined identification function. Feature quantity extraction means for extracting feature quantities, dictionary means for storing a plurality of specimen patterns for each class, storing feature quantities of specimen patterns belonging to the class, and the recognition extracted by the feature quantity extraction means Based on the feature quantity of the object, among the all sample patterns stored in the dictionary means, a plurality of sample patterns having higher matching results with the recognition object are selected as neighboring patterns, and 2 based on the selected neighboring pattern. Candidate selection means for determining one candidate class, and the candidate based on an average of the neighboring patterns belonging to the candidate class obtained for each of the two candidate classes A feature selection unit that selects a component of interest from the feature quantity of the sample pattern of the lath, a coefficient of the discrimination function is determined based on the component selected by the feature selection unit, and the coefficient is based on the discrimination function determined And an identification processing means for identifying the recognition target.
[0022]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, an embodiment of a pattern recognition method and apparatus according to the present invention will be described in detail with reference to the accompanying drawings.
[0023]
FIG. 1 is a functional block diagram showing a schematic configuration of a character recognition apparatus to which the present invention is applied.
As shown in the figure, the character recognition device 10 includes an image input unit 1 and a preprocessing unit 2, a feature extraction unit 3, a candidate selection unit 4, a feature selection unit 5, a determination processing unit 6, a dictionary unit 7, and a result output unit. 8 is configured.
[0024]
The image input unit 1 inputs an image of a character (string) to be recognized, and the preprocessing unit 2 performs binarization of the image input from the image input unit 1 and character segmentation as preprocessing for recognition. Do. The feature extraction unit 3 generates a feature vector from the character image that has been preprocessed by the preprocessing unit 2. The generation of feature vectors is performed in the same manner as in the past. For example, a character image is divided into 5 × 5, and feature amounts in eight directions are extracted for each divided portion. The candidate selection unit 4 obtains the Euclidean distance between the feature vector generated by the feature extraction unit 3 and all sample patterns stored in the dictionary unit 7, and selects a candidate category (class). The feature selection unit 5 selects a notable component based on the feature vector generated by the feature extraction unit 3 and the candidate category selected by the candidate selection unit 4, and the determination processing unit 6 selects the feature selection unit 5. Discrimination / determination is performed by Fisher's method in consideration of ingredients. The result output unit 6 outputs the category identified and determined by the determination processing unit 6 as a recognition result.
[0025]
Next, the operation of the character recognition device 10 will be described with reference to FIG.
FIG. 2 is a flowchart showing a flow of operations of the character recognition device 10.
[0026]
When the character recognition device 10 starts operating, first, a character image to be recognized is input from the image input unit 1 (step 101), and the preprocessing unit 2 performs preprocessing on the character image input from the image input unit 1. Perform (step 102). Subsequently, the feature extraction unit 3 extracts features from the pre-processed character image, that is, generates a feature vector (step 103), and the candidate selection unit 4 is stored in the feature vector and dictionary unit 7. Pattern matching (distance calculation) with all the sample patterns is performed, and a predetermined upper number (for example, 15) as a result is set as a neighborhood pattern, and two candidate categories are determined based on this neighborhood pattern (step 104).
[0027]
Subsequently, the feature selection unit 5 performs feature selection processing described later (step 105), the determination processing unit 6 performs detailed determination processing described later (step 106), and the result output unit 8 outputs the recognition result (step step). 107) and the process is terminated.
[0028]
Next, the feature selection process performed by the feature selection unit 5 will be described.
The feature selection process is a process of determining a component to be noted when determining a category as a recognition result from two candidate categories. For example, when identifying the representative “yu” shown in FIG. 3A and the representative “co” shown in FIG. 3B, as shown in FIG. Identification is facilitated by placing emphasis in the order of upper right and lower right. However, when “ko” as shown in FIG. 3D is input, even if attention is paid to the upper left or upper right of the character, there is little difference from “Yu” (see FIG. 3E), and FIG. If attention is paid only to the lower right of the character as shown in f), it becomes easy to identify “Yu”.
[0029]
Here, the flow of the feature selection process will be described with reference to FIG.
FIG. 4 is a flowchart showing the flow of feature selection processing.
[0030]
In the feature selection processing, first, a matrix A for performing projection by applying the Fisher method to all feature components is obtained (step 151). Subsequently, the average of all sample dictionaries of the candidate category shown in Expression 1 and the average of neighboring patterns shown in Expression 2 are obtained (Step 152).
[0031]
[Formula 1]

[Formula 2]

Next, processing is performed so that the projection axis can be obtained stably when the Fisher method is applied, but this excludes those whose neighborhood average components u ′ _i and v ′ _i are 0 or 1 in the candidate category. Specifically, the component i that satisfies Expression 3 is extracted (step 153).
[0032]
[Formula 3]

Subsequently, the local cumulative contribution defined below is calculated (step 154). Here, the local cumulative contribution will be described. First, as shown in Expression 4, the contribution κ ⁱ of the i component is defined. Next, considering the neighborhood pattern of the input data, the contribution of each component can be defined as shown in Equation 5 from Equation 4.
[0033]
[Formula 4]

[Formula 5]

In Expression 5, α indicates the influence degree of the neighboring pattern, and the larger α is, the more important the neighboring information is.
[0034]
The local cumulative contribution is defined as the sum of arbitrarily extracted m from (κ ⁱ _local ) ^N _{i = 1} , and is expressed as Κ _local (m).
[0035]
Then select the components to maximize using the maximum local cumulative contribution kappa ^* _local (m) that maximizes the local cumulative contribution kappa _local (m) (step 155), and terminates the feature selection process.
[0036]
When the feature selection process described here is applied to the case where “co” shown in FIG. 3D is used as an input pattern, each direction (see FIG. 11B for directions) and each component The contribution degree κ ⁱ _local is as shown in FIG. 5, and it can be seen that the value at the bottom of the character is large.
[0037]
Next, the detailed determination process (step 106 in FIG. 2) in the determination processing unit 6 will be described.
FIG. 6 is a flowchart showing the flow of the detailed determination process.
[0038]
When the detailed determination process is started, the determination processing unit 6 obtains a neighborhood average, a sample dictionary average, and an intra-class variation matrix in the candidate category in the space selected by the feature selection unit 5 (step 161), and Fisher's linear determination method Is applied to obtain a one-dimensional projection matrix A _s (step 162), a category is determined (step 163), and the detailed determination process ends.
[0039]
Note that x is the feature vector of the input data, x _s is the feature vector selected for the feature, u _s , v _s is the feature selected in the average of all samples in the candidate category, If u ′ _s and v ′ _s are features selected in the average, the discriminant function is expressed by Equation 6 and the discriminant rule can be expressed by Equation 7.
[0040]
[Formula 6]

[Formula 7]

Further, FIG. 7 is an illustration of the components of the projection matrix A _s obtained in step 162 when an input pattern "U" as shown in FIG. 3 (d), the component has a zero Ingredients not selected. When comparing each component with the projection matrix A (obtained by the same method as before) when the feature selection is not performed as shown in FIG. 8, the projection matrix A _s has a large value at the bottom of the character and is weighted. You can see that
[0041]
When the feature is selected in this way and projected onto the projection axis, for example, when data X1 (belonging to category B) as shown in FIG. 9 is input, the projection axis 1 obtained by the conventional method is applied. The projection Y1 of X1 belongs to category A (Y1YA1 <Y1YB1), but the projection Y2 of X1 onto the projection axis 2 obtained by feature selection belongs to category B (Y2YA2> Y2YB2). However, the solid-line ellipse in the figure is an equal density line from the average patterns m ₁ and m _{2 of} each category (A, B), and the broken-line ellipse is the category A of the neighboring patterns of the input data X1. The area (area a) of what is and the area (area b) of category B are shown.
[0042]
FIG. 10 shows examples of categories determined when feature selection is not performed on certain input data (conventional method) and when feature selection is performed (this method) (candidate category). Is the same as the conventional method and this method), it can be seen that pattern recognition with high recognition ability can be performed by selecting features.
[0043]
In this embodiment, the character recognition is described as an example. However, even in other pattern recognition such as voice recognition, if the pattern recognition to which the Fisher method is applied, the feature selection is performed to improve the recognition rate. be able to.
[0044]
【The invention's effect】
As described above, according to the present invention, the feature selection is performed in consideration of the vicinity of the input data, and the Fisher's linear discriminant method is applied in the feature-selected space. High recognition ability can be obtained even for large inputs.
[Brief description of the drawings]
FIG. 1 is a functional block diagram showing a schematic configuration of a character recognition device to which the present invention is applied.
FIG. 2 is a flowchart showing an operation flow of the character recognition apparatus 10;
FIG. 3 is a diagram for explaining feature selection processing;
FIG. 4 is a flowchart showing a flow of feature selection processing.
FIG. 5 is a diagram showing a calculation result of a contribution degree.
FIG. 6 is a flowchart showing a flow of detailed determination processing.
7 is a diagram showing the components of the projection matrix A _s.
FIG. 8 is a diagram showing each component of a projection matrix A.
FIG. 9 is a diagram for explaining projection onto a projection axis for which a feature has been selected.
FIG. 10 is a diagram showing an example of identification results.
FIG. 11 is a diagram for explaining feature amount extraction;
FIG. 12 is a diagram for explaining projection by the Fisher method.
FIG. 13 is a diagram for explaining misrecognition.
FIG. 14 is a diagram showing a result of superposition.
FIG. 15 is a diagram showing a distribution in a feature space of categories that cannot be linearly separated.
[Explanation of symbols]
DESCRIPTION OF SYMBOLS 1 Image input part 2 Preprocessing part 3 Feature extraction part 4 Candidate selection part 5 Feature selection part 6 Determination processing part 7 Dictionary part 8 Result output part

Claims

In the pattern recognition method for identifying which of the two classes the identification object belongs by evaluating the feature quantity of the recognition object using a predetermined identification function,
The feature value of the recognition target is extracted, and based on the extracted feature value, a matching result with the recognition target among all the sample patterns stored in the dictionary unit that can register a plurality of sample patterns for each class Selects a plurality of upper sample patterns as neighborhood patterns, determines two candidate classes based on the selected neighborhood patterns, and averages the neighborhood patterns belonging to the candidate classes determined for each of the two candidate classes thus determined And selecting a notable component from the feature values of the sample pattern of the candidate class, determining a coefficient of the discriminant function based on the selected component, and based on the discriminant function determined by the coefficient A pattern recognition method for identifying the recognition target.

For each of the two candidate classes, find the average of all sample patterns stored in the dictionary means and the average of the neighboring patterns, and based on the average of the two sample patterns and the average of the two neighboring patterns Calculating a contribution for each component by calculating a difference for each component, and selecting a component that maximizes a local cumulative contribution that is a sum of a predetermined number of contributions as a component to be noted. The pattern recognition method according to claim 1.

For each of the two candidate classes, find the average of the neighboring patterns, calculate the difference for each component based on the average of the two neighboring patterns, calculate the contribution of each component, The pattern recognition method according to claim 1, wherein a component that maximizes a local cumulative contribution that is a predetermined number of sums is selected as a notable component.

The discriminant function is
The pattern recognition method according to claim 1, wherein the pattern recognition method is determined by applying Fisher's linear discriminant method.

The recognition target is
The pattern recognition method according to claim 1, wherein the pattern recognition method is a character.

In a pattern recognition apparatus for identifying which of the two classes the identification object belongs by evaluating the feature quantity of the recognition object using a predetermined identification function,
Feature quantity extraction means for extracting the feature quantity of the recognition target;
A plurality of specimen patterns can be registered for each class, and dictionary means for storing feature quantities of specimen patterns belonging to the class;
Based on the feature quantity of the recognition target extracted by the feature quantity extraction unit, a plurality of sample patterns having a higher matching result with the recognition target among all the sample patterns stored in the dictionary unit are selected as neighboring patterns. Candidate selection means for determining two candidate classes based on the selected neighborhood pattern;
Feature selection means for selecting a notable component from the feature values of the sample pattern of the candidate class based on the average of the neighboring patterns belonging to the candidate class determined for each of the two candidate classes;
A pattern recognition unit comprising: an identification processing unit configured to determine a coefficient of an identification function based on the component selected by the feature selection unit, and to identify the recognition target based on the identification function determined by the coefficient. apparatus.