JPH033089A

JPH033089A - Method for recognizing pattern

Info

Publication number: JPH033089A
Application number: JP1138746A
Authority: JP
Inventors: Keisuke Goto; 啓介後藤; Tetsuya Yasuda; 哲也安田
Original assignee: Meidensha Corp; Meidensha Electric Manufacturing Co Ltd
Current assignee: Meidensha Corp; Meidensha Electric Manufacturing Co Ltd
Priority date: 1989-05-31
Filing date: 1989-05-31
Publication date: 1991-01-09
Anticipated expiration: 2014-02-03
Also published as: JP2853168B2

Abstract

PURPOSE:To automatically form a dictionary by forming the appearance frequency distribution of pattern feature points by preparing many samples, normalizing respective frequency values of the distribution and registering the normalized values as membership values. CONSTITUTION:Many samples for a certain pattern inputted from a pattern input part 2 are prepared and the pattern feature points of respective samples are extracted. A mesh area is allocated to a plane area on which the pattern is to be plotted and a mesh area is set up also in a dictionary storage part 1 corresponding to the allocated mesh area. Then, the appearance frequency which is the number of generations of each feature point is found out to form the appearance frequency distribution corresponding to the mesh area, the distribution is normalized, the normalized value is used as a membership value, a distance corresponding to the degree of separation between the pattern to be recognized and the pattern to be collated is calculated by a distance computing part 5, and the proper number of patterns is selected in the ascending order to obtain a recognized result. Consequently, a recognition method with high recognition accuracy, a dictionary constituting method for reducing storage capacity and an automatic dictionary forming method can be obtained.

Description

【発明の詳細な説明】Ａ、産業上の利用分野本発明は文字や図形等のパターンを認識する方法に関す
るものである。DETAILED DESCRIPTION OF THE INVENTION A. Field of Industrial Application The present invention relates to a method for recognizing patterns such as characters and figures.

Ｂ６発明の概要本発明は、文字や図形等のパターンを、照合対象となる
辞書を用いて認識する方法において、パターンの特徴点
の位置的特徴をあいまい集合（ファノイ集合）で表した
辞書を用い、認識対象のパターンの特徴点の位置に対応
するメンバーシップ値について演算を行い、演算結果に
もとづいて認識結果を得ることによって、パターンの認識を高い精度でかつ高速に実行できるよう
にしたものである。B6 Summary of the Invention The present invention is a method for recognizing patterns such as characters and figures using a dictionary to be checked, which uses a dictionary in which positional features of feature points of a pattern are expressed as fuzzy sets (Fanoy sets). , which performs calculations on the membership values corresponding to the positions of the feature points of the pattern to be recognized, and obtains recognition results based on the calculation results, making it possible to recognize patterns with high accuracy and at high speed. be.

Ｃ９従来の技術文字や図形等のパターンを認識する場合、例えば、書類
や図面等をイメージスキャナ等の入力装置で操作して得
られる黒と白の２値画像データ（入カバターン）に基づ
いて、予め記憶されている文字や図形の標準パターン（
辞書パターン）と位置等を照合し、辞書パターンの中か
ら最も重なり合うものを最終的に認識結果として求めて
いる。C9 Conventional technology When recognizing patterns such as characters and figures, for example, based on black and white binary image data (input pattern) obtained by operating documents, drawings, etc. with an input device such as an image scanner, Standard patterns of characters and figures stored in advance (
Dictionary patterns) and positions are compared, and the one that most overlaps among the dictionary patterns is finally determined as the recognition result.

認識方法の具体例として、例えば、人カバターンと辞書
パターンをそれぞれ小さな正方格子状（メツシュ）に区
切り、格子点が文字や図形である場合には黒（１）、そ
うでないときは（０）と表す場合において、人力と辞書
の２つのパターンの重なりの程度をハミング距離を用い
て求めることができる。これは、人力と辞書が黒と黒ま
たは白と白の場合には０、黒と白または白と黒の場合に
はｌとおき、その総和を求めるものである。つまり、黒
と黒、白と白のように重なり合っていればいるほど、そ
の総和（ハミング距離）が小さくなる。この性質により
、多数の辞書パターンの中から入カバターンと最も重な
り合っているもの、すなわち、ハミング距離が最も小さ
いものを認識結果とするものである。As a specific example of the recognition method, for example, the human cover pattern and the dictionary pattern are each divided into small square grids (mesh), and if the grid points are letters or figures, they are marked black (1), otherwise they are marked (0). In the case of expression, the degree of overlap between the two patterns of human input and dictionary can be determined using the Hamming distance. This is done by setting 0 if the human power and dictionary are black and black or white and white, and setting it as l if they are black and white or white and black, and finding the sum. In other words, the more they overlap, such as black on black or white on white, the smaller their sum (Hamming distance) becomes. Due to this property, from among a large number of dictionary patterns, the one that most overlaps with the input pattern, that is, the one with the smallest Hamming distance, is selected as the recognition result.

Ｄ１発明が解決しようとする課題この方法だと入カバターンの位置が辞書パターンの位置
とずれていたり、大きさが異なっていたり、傾きが異な
っていたりすると、重なりの程度が変わってしまい、認
識の誤りが生じやすい欠点がある。D1 Problem to be solved by the invention With this method, if the input pattern is misaligned with the dictionary pattern, has a different size, or has a different inclination, the degree of overlap will change, making recognition difficult. It has the disadvantage of being prone to errors.

また記憶容量の点でも問題がある。１文字あるいは１図
形パターンの必要空間は表示のための文字フォントのよ
うに８ｘ８画素（英数字）や２４×２４画素（漢字）の
空間では済まず、線の切れやつぶれを極力避けるため少
なくともその３〜５倍の空間を特徴とする特に、漢字や
複雑な図形の場合１２８Ｘ１２８画素程度以上確保しな
いと安定な処理が望めないことになる。There is also a problem in terms of storage capacity. The space required for one character or one graphic pattern is not limited to 8 x 8 pixels (alphanumeric characters) or 24 x 24 pixels (kanji) like character fonts for display, but at least that space is required to avoid line breaks and collapse as much as possible. Particularly in the case of Chinese characters or complex figures, which are characterized by a space 3 to 5 times larger, stable processing cannot be expected unless at least 128 x 128 pixels are secured.

文字や図形パターンの辞書種別は漢字の場合第二水準を
含めると６０００字種以上に及び、図形の場合でも多い
ときには１０００字種以上になる。In the case of kanji, there are more than 6,000 types of dictionaries for characters and graphic patterns, including the second level, and in the case of figures, there are often more than 1,000 types.

このような膨大な辞書種別に対して、ｌ辞書毎に１２８
ｘ１２ｇ画素を確保するとすれば、メモリとして２にバ
イトが必要となる。つまり、先の漢字では１２Ｍバイト
以上、図形で２Ｍバイト以上もの記憶領域が必要となる
。For such a huge number of dictionary types, 128
If x12g pixels are secured, 2 bytes are required as memory. In other words, the above-mentioned kanji require a storage area of 12M bytes or more, and graphics require a storage area of 2M bytes or more.

ハードウェア構成上この容量は無視できないほど大きく
、また、辞書の探索や整合処理にも処理時間などで影響
を与えることになる。This capacity is too large to be ignored due to the hardware configuration, and it also affects dictionary search and matching processing in terms of processing time.

更に、辞書種別の点からいえば、上述の膨大な種別につ
いてひとつひとつ辞書をマニュアルで登録する作業が必
要とされており、そのための作業時間も膨大となる問題
点がある。Furthermore, in terms of dictionary types, it is necessary to manually register each dictionary for the huge number of types mentioned above, which poses the problem of requiring an enormous amount of time.

以上まとめると、従来技術には以下の大きな３つの課題
が残されている。In summary, the following three major issues remain in the prior art.

■認識精度の高い認識手法の確立 ■記憶容量低減のための辞書構成方法の確立■辞書の自
動生成方法の確立本発明は、このような課題を解決することを目的とする
。■ Establishment of a recognition method with high recognition accuracy ■ Establishment of dictionary construction method for reducing storage capacity ■ Establishment of automatic dictionary generation method The present invention aims to solve these problems.

Ｅ９課題を解決するだめの手段文字や図形パターンは人間の歴史的または経験的規則に
従って記述された位置関係にあるが、パターンを特徴づ
ける点（特徴点）そのものはあいまい性を有している。E9 Means to solve the problem Letters and graphic patterns have positional relationships that are described according to human historical or empirical rules, but the points that characterize the patterns (feature points) themselves have ambiguity.

例えば、第２図λの○印で示す文字“Ａｏの代表的な特
徴点の場合（これを頂点と呼ぶものとする）、頂点の位
置そのものは他の特徴点からみて左に振れたり（同図ｂ
）、右に振れたり（同図Ｃ）する。また、頂点のところ
で切れたり（同図ｄ）もする。For example, in the case of a typical feature point of the character "Ao" indicated by a circle in λ in Figure 2 (this will be referred to as a vertex), the position of the vertex itself may be shifted to the left (same as Diagram b
) or swing to the right (C in the same figure). It may also break at the apex (d in the same figure).

しかし、同図ｅのように、頂点の部分だけが他の特徴点
より下の方に位置することはありえない。However, it is impossible for only the vertex portion to be located below the other feature points, as shown in e of the figure.

結局、文字“Ａ“の頂点というものは、他の特徴点より
位置関係でいうと上の方でかつまん中付近にある、とい
う極めてあいまいな尺度で規則性を持っているというこ
とになる。In the end, the apex of the letter "A" has regularity on an extremely ambiguous scale, in that it is located above other feature points in terms of positional relationship and near the middle.

本発明は、このように文字や図形パターンの特徴点の位
置関係にあいまい性があることを考慮して、その位置関
係をあいまい集合（ファジィ集合）の概念を取り入れて
表現することによりパターン毎に２次元ファジィ辞書を
登録し、認識対象のパターンと各パターンの２次元ファ
ジィ辞書とを比較してその比較結果にもとづいて認識結
果を得ようとするものである。The present invention takes into consideration that there is ambiguity in the positional relationships of feature points in character and graphic patterns, and expresses the positional relationships by incorporating the concept of fuzzy sets to express the positional relationships for each pattern. A two-dimensional fuzzy dictionary is registered, a pattern to be recognized is compared with the two-dimensional fuzzy dictionary of each pattern, and a recognition result is obtained based on the comparison result.

具体的には本発明は、あるパターンに対して多数のサン
プルを作成し、各サンプルにおけるパターンの特徴点を
抽出し、パターンが描かれる平面領域にａ×ｂ個（ａ。Specifically, the present invention creates a large number of samples for a certain pattern, extracts feature points of the pattern in each sample, and places a×b (a.

ｂは整数）のメツシュ領域を割り当てると共に、これに
対応して辞書記憶部内にメツシュ領域を設定し、このメ
ツシュ領域内の各メツシュ毎に前記特徴点の出現回数で
ある出現頻度を求めて、メツシュ領域に対応する出現頻
度分布を作成し、前記出現頻度分布上の各頻度を基準頻
度を用いることにより正規化してこの正規化した値をメ
ンバーシップ値とし、このメンバーシップ値にもとづい
て、辞書記憶部内のメツシュ領域の各メツシュにメンバ
ーシップ値を割り当てて２次元ファジィ辞書を作成し、
こうして得られる２次元ファジィ辞書を特徴点の基準個
数ｊと共に予めパターン毎に辞書記憶部内に登録してお
き、認識対象であるパターンについて特徴点を抽出し、その
特徴点の総数ｍと各特徴点のメツシュ領域におけるメツ
シュ位置（ｘｌ、ｙ、）（１≦ｉ≦ｍ）とを求め、辞書記憶部内のあるパターンＰ　Ｔ　ｈを照合対象とし
て、これの２次元ファジィ辞書の前記メツシュ位置（ｘ
ｌ、ｙＩ）に対応するメンバーシップ値ｒ　ｋ（ｘ　ｒ
、　ｙ　１）を求め、各メンバーシップ値ｆ　ｋ（ｘ　ｒ、　ｙ　１）を確信
度ｒｋｃｘ、、　ｙｔ）　）と前記特徴点の基準個数」
と前記特徴点の総数ｍとにもとづいて、認識対象のパタ
ーンと照合対象のパターンＰＴｋとの離れている度合い
に相当する距離Ｄｋを演算し、この距離Ｄｋを辞書記憶
部内の総てのパターンについて計算し、そのうちの小さ
い値から順にパターンを適数選んで認識結果とすること
を特徴とする。b is an integer), a mesh area is set in the dictionary storage unit correspondingly, and the appearance frequency, which is the number of times the feature point appears, is determined for each mesh in this mesh area, and the mesh area is Create an appearance frequency distribution corresponding to the area, normalize each frequency on the appearance frequency distribution using a reference frequency, use this normalized value as a membership value, and store the dictionary based on this membership value. Create a two-dimensional fuzzy dictionary by assigning membership values to each mesh in the mesh area within the department,
The two-dimensional fuzzy dictionary obtained in this way is registered in advance in the dictionary storage unit for each pattern along with the reference number j of feature points, the feature points are extracted for the pattern to be recognized, and the total number m of feature points and each feature point are extracted. Find the mesh position (xl, y,) (1≦i≦m) in the mesh area of
The membership value r k(x r
, y 1), and calculate each membership value f k (x r, y 1) using the certainty factor rkcx,, yt) ) and the standard number of feature points.
and the total number m of feature points, calculate a distance Dk corresponding to the degree of separation between the pattern to be recognized and the pattern to be matched PTk, and calculate this distance Dk for all patterns in the dictionary storage unit. It is characterized by calculating and selecting an appropriate number of patterns in order from the smallest value to the recognition result.

ここで前記距離Ｄｋの意味について説明する。Here, the meaning of the distance Dk will be explained.

今認識対象であるパターン（人カバターン）の特徴点の
総数がｍ、各特徴点のメツシュ位置が（ｘｌ、　ｙＩ）
　、　Ｃｘｔ、　ｙｔ）・・・・・・（ｘａ、ｙａ）で
あったとする。先ず辞書記憶部内に登録されているある
パターンＰＴｋを照合対象として、その２次元ファジィ
辞書について前記メツシュ位置（ｘｔ、ｙａ　（ただし
■≦ｉ≦ｍ）に対応するメンバーシップ値ｆｈ（ｘｔ、
ｙＩ）を拾い出す。このｆｈ（ｘｌ、ｙｉ）は、（ｘｔ
、ｙＩ）に位置する人カバターンの特徴点がパターンＰ
Ｔｋのひとつの特徴点であることの確信度を示すもので
ある。そこで本発明では、例えば確信度！、０からｆｈ
（ｘｉ。The total number of feature points of the pattern (human cover pattern) currently being recognized is m, and the mesh position of each feature point is (xl, yI).
, Cxt, yt)... (xa, ya). First, with a certain pattern PTk registered in the dictionary storage unit as a matching target, the membership value fh(xt,
yI). This fh(xl, yi) is (xt
, yI) is the pattern P
This indicates the degree of certainty that it is one feature point of Tk. Therefore, in the present invention, for example, the certainty factor! , 0 to fh
(xi.

ｙｌ）を差し引いた値を人カバターンの全特徴点数ｍあ
るいはパターンＰＴ、に係る特徴点の基準個数ｊで割っ
た値を、（ｘ　ｌ、　ｙ　ｔ）に位置する特徴点とパタ
ーンＰＴ、との距離という概念で捉え、そして人カバタ
ーンの全特徴点についての前記距離の合計値を当該パタ
ーンと照合対象であるパターンＰＴ、との距離Ｄｋとし
て捉える。このＤｋの演算式の一例を示すと、ｊとｍが
等しいときにはＤｈ＝Σ（１，Ｏｆｋ（ｘｔ、　ｙ□）　）／ｍ１＝１となる。また」と【ｎとが５′４なるときはｊ＞ｍで、１）ｈ　＝　（Σ（１，Ｏｆｋ（ｘｌ、ｙＩ））　＋（
ｊ　　ｍ））／ｊｒｎ　＞　ｊで１）ｋ−Σ（１，０−ｆｋ（ｘｌ、ｙＩ））／ｍとなる
。これらの式をまとめると次式となる。yl) divided by the total number of feature points m of the human cover turn or the reference number j of feature points related to the pattern PT, the value obtained by subtracting the value is the difference between the feature points located at (x l, y t) and the pattern PT. The concept of distance is used, and the total value of the distances for all feature points of a human cover pattern is taken as the distance Dk between the pattern and the pattern PT to be matched. An example of the calculation formula for Dk is as follows: Dh=Σ(1, Ofk(xt, y□))/m1=1 when j and m are equal. Also, when " and [n are 5'4, j>m, 1) h = (Σ(1, Ofk(xl, yI)) + (
j m))/jrn>j, 1)k-Σ(1,0-fk(xl,yI))/m. These equations can be summarized as follows.

Ｄｈ−（ｗａｘ（ａ、ｊ）−Σ　ｆｈ（Ｘ＋、ｙａ）　
／ｍａｘ（ａ＋、ｊ）　　−（１）ｉ−まただしＩｌａｘ（ｍ、Ｄはｍ、ｊの大きい方の値である
。Dh-(wax(a,j)-Σ fh(X+,ya)
/max (a+, j) - (1) i - where I lax (m, D are the larger values of m, j).

Ｆ　、実施例本発明方法の具体的手順を以下に説明する。F. Example The specific procedure of the method of the present invention will be explained below.

（１）辞書作成部端辞書を作成するため、同一の文字、または、図杉に対し
て複数のサンプルを檗備する。(1) Dictionary creation unit: To create a dictionary, prepare multiple samples for the same character or map.

それらはあるひとつの辞書を作成するときの辞書作成用
パターンとなる。These become dictionary creation patterns when creating a certain dictionary.

（２）特徴点出現頻度分布（２次元ヒストグラム）の作
成サンプルを繰り返し人力して、それぞれ特徴点を抽出す
る。一方、パターンが描かれる平面領域にａ×ｂ個（ａ
、ｂは整数）のメツシュ領域を割り当てると共に、これ
に対応して辞書記憶部内にメツシュ領域を設定する。そ
してこのメツシュ領域内の各メツシュ毎に前記特徴点の
出現回数である出現頻度を求めて、第１図に示すように
メツシュ領域に対応する出現頻度分布を作成する。(2) Creation of feature point appearance frequency distribution (two-dimensional histogram) Samples are repeatedly manually extracted to extract feature points from each sample. On the other hand, a×b pieces (a
, b is an integer), and corresponding mesh areas are set in the dictionary storage unit. Then, the appearance frequency, which is the number of times the feature point appears, is determined for each mesh in this mesh area, and an appearance frequency distribution corresponding to the mesh area is created as shown in FIG.

例えば、パターンが描かれる平面領域にｌ０Ｘ１０のメ
ツシュ領域を割り当てたとき、辞書記憶部内にも１０ｘ
ｌＯのメツシュ領域を設定する。For example, when a mesh area of 10x10 is allocated to the plane area where the pattern is drawn, the mesh area of 10x10 is also allocated in the dictionary storage unit.
Set the mesh area of lO.

このとき、ある特徴点の位置がｚ（ｘ、ｙ）−（５゜３
）と得られた場合には、ヒストグラム上の位置Ｚ（５，
３）の頻度に１を加えたものとなる。At this time, the position of a certain feature point is z (x, y) - (5°3
), the position Z(5,
It is the frequency of 3) plus 1.

特徴点位置をｚ　（ｘ、ｙ）とすれば、２次元ヒストグ
ラム上で対応する位置Ｚ（ｘ、ｙ）における頻度Ｐ（ｘ
、ｙ）は、次式で示される。If the feature point position is z (x, y), the frequency P(x, y) at the corresponding position Z (x, y) on the two-dimensional histogram is
, y) are expressed by the following equation.

Ｐ　（ｘ、ｙ）　−Ｐ　（ｘ、ｙ）＋　１（Ｐ（ｘ、ｙ
）の初期値０）（３）２次元ファジィ集合におけるメンバーシップ値の
定義と辞書の作成上で求めた特徴点出現頻度分布は、特徴点がどの様な位
置に出現しゃすいかという傾向を示している。つまり、
同一の文字、または、図形に対して、複数のサンプルに
よる頻度分布を取ると、似通った位置に特徴点が出現し
ゃすく、その付近にピークが存在することが多いことを
示している。P (x, y) −P (x, y) + 1(P(x, y
) initial value 0) (3) The feature point appearance frequency distribution obtained by defining the membership value in a two-dimensional fuzzy set and creating a dictionary shows the tendency of where feature points tend to appear. There is. In other words,
When the frequency distribution of multiple samples is taken for the same character or figure, feature points tend to appear in similar positions, indicating that a peak often exists near these points.

このことから逆に、ピーク位置を抽出することで特徴点
位置を推定することができる。From this, conversely, the feature point position can be estimated by extracting the peak position.

本発明は、この考え方をファジィ集合におけるメンバー
シップ値の設定に応用したものである。The present invention applies this idea to setting membership values in fuzzy sets.

さて、頻度分布は、ひとつのパターンに対するサンプル
数が多ければ多いほど、全体の頻度が高くなり、客観的
な判断をすることがむずかしくなるため、頻度分布の正
規化が必要である。Now, in the frequency distribution, the larger the number of samples for one pattern, the higher the overall frequency becomes, making it difficult to make objective judgments, so it is necessary to normalize the frequency distribution.

一方、ファジィ集合におけるメンバーシップ値は、あい
まいな尺度のものを［０，Ｉ］区間の実数領域における
主観的な量として表現したものである。On the other hand, the membership value in a fuzzy set is an ambiguous scale expressed as a subjective quantity in the real number domain of the [0, I] interval.

メンバーシップ値の定義は、［０，Ｉ］区間の実数領域
をヒントに、また、特徴点が頻度分布のピークに対応す
る場合が多いことをヒントにしている。The definition of the membership value is based on the real number area in the [0, I] interval and the fact that feature points often correspond to peaks of the frequency distribution.

つまり、頻度分布のピーク値は特徴点位置というあいま
いな尺度を示す指標となりうるため、０〜１の実数区間
で正規化すればそのまま２次元のファジィ集合に関する
メンバーシップ値として用いることができることになる
。In other words, the peak value of the frequency distribution can be an indicator of an ambiguous measure of feature point position, so if it is normalized in the real number interval from 0 to 1, it can be used as it is as a membership value for a two-dimensional fuzzy set. .

そこで、特徴点出現頻度の正規化と辞書記憶部への登録
を次のように行う。Therefore, the frequency of appearance of feature points is normalized and registered in the dictionary storage unit as follows.

全サンプルの特徴点数をＫとしたとき、ある−定の割合
Ｃｋを乗じたものを基準点までのピーク数にとして求め
る。When the number of feature points of all samples is K, the number of peaks up to the reference point is obtained by multiplying by a certain - constant ratio Ck.

ｋ　　＝　Ｃｈ・Ｋ次に、特徴点出現頻度分布において、頻度の大きいもの
から順に捜していき、ｋ番目の頻度の値を取るものを求
め、これを基準点の頻度Ｐ、とする。k = Ch.K Next, in the feature point appearance frequency distribution, search is performed in descending order of frequency, and the feature point having the k-th frequency value is determined, and this is set as the frequency P of the reference point.

Ｐｋをもとに、特徴点頻度分布を次式により正規化を行
う。Based on Pk, the feature point frequency distribution is normalized using the following equation.

分布上の頻度をＰ１正規化後の頻度をＭとすれば、Ｍ＝１．０　　　　・・・・・・Ｐ≧Ｐ、のときＰ／Ｐ
ｋ　　　・・・・−・Ｐ＜Ｐｋのとき分布上の全ての頻
度について行うことで、正規化した結果が得られる。こ
の結果は、０〜１区間にあり、２次元ファジィ辞書とし
て辞書記憶部に登録する。If the frequency on the distribution is P1 and the frequency after normalization is M, then M=1.0...When P≧P, P/P
When P<Pk, normalized results can be obtained by performing this on all frequencies on the distribution. This result is in the range 0 to 1 and is registered in the dictionary storage unit as a two-dimensional fuzzy dictionary.

以上で、ひとつのパターンに対する辞書作成手続きが終
了する。This completes the dictionary creation procedure for one pattern.

このようにして得られた２次元ファジィ辞書の一例を図
示すると、第３図は文字ｒＡＪの特徴点頻度分布を示す
図、第４図は第３図に示す分布を正規化したものを示す
図であり、この例ではある領域に９×９のメツシュ領域
をとっている。ここで特徴点とは、例えば黒画素に接す
る白画素を輪郭画素として抜き出し、これらを直線近似
処理等を行って情報圧縮化し、こうして得られた輪郭画
素に相当するものである。To illustrate an example of the two-dimensional fuzzy dictionary obtained in this way, Fig. 3 shows the feature point frequency distribution of the character rAJ, and Fig. 4 shows a normalized version of the distribution shown in Fig. 3. In this example, a 9×9 mesh area is set in a certain area. Here, the feature point corresponds to a contour pixel obtained by extracting, for example, a white pixel adjacent to a black pixel as a contour pixel, and compressing information by performing linear approximation processing or the like on these pixels.

次に認識対象であるパターンを入力して、辞書記憶部内
のパターン群と照合する方法の一例について述べる。第
５図中辞書記憶部ｌ内にはパターン毎にファジィ辞書と
特徴点の基準個数ｊとが登録されている。特徴点の基Ｑ
、ｇＩ数Ｊは、例えばファジィ辞書を作成するときに用
いたパターンのサンプルの特徴点の平均個数である。Next, an example of a method of inputting a pattern to be recognized and comparing it with a group of patterns in the dictionary storage section will be described. In the dictionary storage unit l in FIG. 5, a fuzzy dictionary and a reference number j of feature points are registered for each pattern. Base Q of feature points
, gI number J is, for example, the average number of feature points of the pattern samples used when creating the fuzzy dictionary.

先ずパターン人力部２にてパターンを人力し、特徴点個
数加算部３により当該パターンの特徴点の総数（全特徴
点数）ｍを求める。続いてメンバーシップ値加算部４に
より辞書記憶部ｌ内のあるパターンＰＴ、について、パ
ターン入力部２に入力したパターン（認識対象のパター
ン）の特徴点のメッンユ位置（ｘ、、ｙＩ）（１５１６
ｍ）に対応するメンバーシップ値ｆ　−（ｘ　ｉ、　ｙ
　１）　’ｆｔ求メ、これらｆｍ（ｘ＋、Ｙｉ）の総て
を加算し、そのる。そして最大値検出部６によりｊｌｍ
のうちの大きい方５ａｘ（ｊ、ｓ）を出力し、距＃１演
算部５にて先述した（１）式の演算を実行して距＃Ｄ、
を求める。First, a pattern is created manually in the pattern manual section 2, and the total number of feature points (total number of feature points) m of the pattern is determined by the feature point number adding section 3. Next, the membership value adding unit 4 calculates the main positions (x,, yI) (1516
membership value f − (x i, y
1) Find 'ft, add all of these fm(x+, Yi), and do that. Then, the maximum value detection unit 6 determines jlm
The larger one of these 5ax(j, s) is output, and the distance #1 calculation unit 5 executes the calculation of equation (1) described above to obtain the distance #D,
seek.

こうした試行を登録されているすべてのパターンについ
て行い、近距離判定部７にてＤｋの小さい順に２個のパ
ターンを取り出し、これを認識結果として認識結果出力
部８より出力する。This trial is performed for all the registered patterns, and the short distance determining section 7 picks out two patterns in descending order of Dk, which are output as recognition results from the recognition result output section 8.

以上において、２次元ファジィ辞書としては、第３図に
示す出現頻度分布を正規化して得られた第４図に示すメ
ンバーシップ値の配列をそのまま用いてもよいが、次の
ような処理を施して得られたものを用いてらよい。In the above, as a two-dimensional fuzzy dictionary, the array of membership values shown in Figure 4 obtained by normalizing the appearance frequency distribution shown in Figure 3 may be used as is, but the following processing may be applied. You can use the one obtained by

先ず各メツシュの出現頻度のうち最大値をとるメツシュ
を第１の領域の探索開始点として予め定めた規則に従い
第１の領域に含まれるメッ７ュを探索し、次いで第１の
領域以外のメツシュの中から出現頻度の最大値をとるメ
ツシュを第２の領域の探索開始点として、同様に第２の
領域に含まれるメツシュを探索し、こうした処理を繰り
返すことにより前記メツシュ領域を１以上の領域に分離
する。一方出現頻度を正規化するために例えば出現頻度
の総数にもとづいて基準頻度を求める。First, the mesh that has the maximum value among the appearance frequencies of each mesh is used as the search start point for the first area, and meshes included in the first area are searched according to predetermined rules. Using the mesh with the maximum appearance frequency as the starting point for searching the second region, the meshes included in the second region are similarly searched, and by repeating this process, the mesh region can be divided into one or more regions. Separate into On the other hand, in order to normalize the frequency of appearance, a reference frequency is determined based on, for example, the total number of frequencies of appearance.

続いて分離領域毎にその分離領域のみを含む前記メツシ
ュ領域について各出現頻度を基準頻度を用いて正規化し
てメンバーシップ値を求め、その後当該メツシュ領域の
各メッツュの１行づつについてその行に含まれるメンバ
ーシップ値の中から最大値を求めると共に、１列づつに
ついてその列に含まれるメンバーシップ値の中から最大
値を求め、これら最大値を分離領域毎に組にして第６図
に示すような！次元ファジィ辞書を作成する。Next, for each separated area, membership values are determined by normalizing each appearance frequency for the mesh area that includes only that separated area using the reference frequency, and then for each row of each mesh in the mesh area, the membership value is determined. At the same time, the maximum value is determined from among the membership values included in each column, and these maximum values are grouped for each separation area as shown in Figure 6. What! Create a dimensional fuzzy dictionary.

次いでメツシュ領域の端からｉ番目のメツシュ列にＸ座
標ｘｌを、端からｊ番目の行にＹ座標ｙ。Next, set the X coordinate xl to the i-th mesh column from the end of the mesh area, and set the Y coordinate y to the j-th row from the end.

を夫々割り当てると共に、分離領域毎に作成した１次元
ファジィ辞書について、Ｘ　ｈ　＋　３’　Ｊに夫々対
応するメンバーシップ値をＸ（ｉ）、Ｙ（ｊ）と表示し
たとき、分離領域毎に（ｘ　ｉ、　ｙ　Ｊ）のメツシュ
位置における２次元のメンバーシップ値ｒ（ｉ、ｊ）に
対してＸ（ｉ）、Ｙ（ｊ）のうちの小さい方の値を割り
当てて、分離領域に対する２次元のメンバーシップ値の
集合を得る。are assigned respectively, and the membership values corresponding to The smaller value of X(i) and Y(j) is assigned to the two-dimensional membership value r(i, j) at the mesh position of x i, y J), and the two-dimensional membership value for the separation region is Obtain the set of membership values for .

次いでメツシュ位置（ｘ　ｉ、　ｙ　、１）におけるメ
ンバーシップ値Ｆ（ｉ、ｊ）に対して、各分離領域のｆ
（ｉ、ｊ）のうちの最大値を割り当て、このＦ（ｉ、ｊ
）の集合によって第７図に示すように各分離領域の全領
域に対する２次元ファジィ辞書を得る。Then, for the membership value F(i, j) at the mesh position (x i, y , 1), f of each separation region
(i, j), and assign this F(i, j
), a two-dimensional fuzzy dictionary for all regions of each separated region is obtained as shown in FIG.

Ｇ　発明の効果本発明によれば、文字や図形等のパターンの特徴点の出
現頻度分布を、多数のサンプルを用意することにより作
成し、この出現頻度分布の各頻度を正規化して、その値
をメンバーシップ値とじて登録するようにしているため
、辞書を自動的に作成することができると共に辞書登録
のためのマニュアル作業が大幅に低減し、また人カバタ
ーンの位置や傾きにばらつきがあっても認識の誤りが生
じにくい。G. Effects of the Invention According to the present invention, the appearance frequency distribution of feature points of patterns such as characters and figures is created by preparing a large number of samples, each frequency of this appearance frequency distribution is normalized, and its value is calculated. Since the information is registered together with the membership value, it is possible to automatically create a dictionary, and the manual work required for dictionary registration is greatly reduced. Errors in recognition are also less likely to occur.

更に辞書空間としては、表示のための文字フォントの１
．２倍〜１．５倍程度のメツシュ空間があれば十分認識
できる。例えば英数字の場合９×９程度、複雑な漢字の
場合３２Ｘ３２程度のメツシュ空間で済む。そして例え
ばメンバーシップ値を、０〜ｌまでを０．！で刻んだ値
により表現した場合、データ表現としてはθ〜ｌＯの整
数で持てばよいから４ビツトで構成することができる。Furthermore, as a dictionary space, one of the character fonts for display
．． A mesh space of about 2 to 1.5 times is sufficient for recognition. For example, a mesh space of about 9 x 9 is sufficient for alphanumeric characters, and about 32 x 32 for complex kanji. For example, set the membership value from 0 to l to 0. ! When expressed as a value carved in , the data can be expressed as an integer from θ to IO, so it can be composed of 4 bits.

従ってメモリ量としては漢字ｉ文字当たり３２Ｘ３２×
４ビット−５１２バイトとなり、従来必要であった２に
バイトと比較してＩ／４程度で済み、記憶容量の低減化
を図ることができる。Therefore, the amount of memory is 32×32× per i kanji character.
It is 4 bits - 512 bytes, which requires about I/4 of the conventionally required 2 bytes, and it is possible to reduce the storage capacity.

また入カバターンを認識するにあたっては、人カバター
ンの特徴点のメツシュ位置に対応するメンバーシップ値
を辞書より求めるだけで確信度が得られ、その確信度を
確信度１．０　から差し引いた値を特徴点側々の照合対
象であるパターンとの距離として捉え、この距離を用い
て例えば（１）式により２つのパターンの距離Ｄｋを演
算し、その値が小さい程パターンのマツチングの程度が
高いとして認識しているため、演算が簡単であって、高
速なパターン認識を実現できる。そして辞書そのものに
あいまい性があるので、位置ずれに極めて強い特性があ
る。In addition, when recognizing an incoming Kabataan, the confidence can be obtained by simply finding the membership value corresponding to the mesh position of the feature point of the Kabataan from a dictionary, and the value obtained by subtracting that confidence from the confidence of 1.0 is the feature This distance is taken as the distance between each point to the pattern to be matched, and this distance is used to calculate the distance Dk between the two patterns using equation (1), and the smaller the value, the higher the degree of pattern matching is recognized. Therefore, calculations are simple and high-speed pattern recognition can be realized. Since the dictionary itself has ambiguity, it is extremely resistant to misalignment.

[Brief explanation of the drawing]

第１図は特徴点出現頻度を示す概念図、第２図は特徴点
のあいまい性を示す説明図、第３図は特徴点出現頻度分
布を示すデータ図、第４図は正規化後の特徴点出現頻度
分布を示すデータ図、第５図は本発明の実施例を実行す
る回路のブロック図、第６図は１次元ファジィ辞書のデ
ータ図、第７図は２次元ファジィ辞書のデータ図である
。１・・・辞書記憶部、３・・・特徴点個数加算部、４・
・・メンバーシップ値加算部、５・・・距離演算部。第２図特徴点のあいまい性の説明図第１図ｅｆ散点出現頑度今布印第３図特徴点出現頻度分布図第４図２次元ファジィ辞書のデータ×Figure 1 is a conceptual diagram showing the frequency of feature point appearance, Figure 2 is an explanatory diagram showing the ambiguity of feature points, Figure 3 is a data diagram showing the frequency distribution of feature points, and Figure 4 is the features after normalization. FIG. 5 is a block diagram of a circuit for implementing an embodiment of the present invention; FIG. 6 is a data diagram of a one-dimensional fuzzy dictionary; and FIG. 7 is a data diagram of a two-dimensional fuzzy dictionary. be. 1... Dictionary storage unit, 3... Feature point number addition unit, 4.
...Membership value addition section, 5...Distance calculation section. Fig. 2: Explanation of ambiguity of feature points Fig. 1: ef scattered point appearance robustness Fig. 3: minutiae appearance frequency distribution diagram Fig. 4: data of two-dimensional fuzzy dictionary ×

Claims

[Claims]

(1) In a method of recognizing patterns such as characters and figures using a dictionary to be matched, a large number of samples are created for a certain pattern, feature points of the pattern in each sample are extracted, and the pattern is drawn. ax in the plane area
At the same time as assigning b mesh regions (a and b are integers), a corresponding mesh region is set in the dictionary storage unit, and the appearance frequency, which is the number of times the feature point appears, is determined for each mesh in this mesh region. , create an appearance frequency distribution corresponding to the mesh area, normalize each frequency on the appearance frequency distribution using a reference frequency, set this normalized value as a membership value, and use this membership value as Based on this, a two-dimensional fuzzy dictionary is created by assigning a membership value to each mesh in the mesh area in the dictionary storage unit, and the two-dimensional fuzzy dictionary thus obtained is stored in advance in the dictionary storage unit for each pattern along with the reference number j of feature points. After registering, extract the feature points for the pattern to be recognized, find the total number of feature points m and the mesh position (x_1, y_1) (1≦i≦m) of each feature point in the mesh area, and create a dictionary. With a certain pattern PT_k in the storage unit as a matching target, the mesh position (x_
1, y_1), the membership value f_k(x_
1, y_1), and each membership value f_k(x_1, y_1) is subtracted from the confidence level 1.0 to obtain the total value Σ(1.0-i=1 f_k(x_1, y_1) and the feature point Based on the reference number j and the total number m of feature points, a distance D_k corresponding to the degree of separation between the pattern to be recognized and the pattern PT_k to be matched is calculated, and this distance D_k is calculated from all the feature points in the dictionary storage unit. A pattern recognition method characterized by calculating the patterns, and selecting an appropriate number of patterns from among them in order of the smallest value as a recognition result.