JPH0488591A

JPH0488591A - Dictionary preparing method

Info

Publication number: JPH0488591A
Application number: JP2204838A
Authority: JP
Inventors: Kenji Fukumizu; 健次福水
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1990-08-01
Filing date: 1990-08-01
Publication date: 1992-03-23
Anticipated expiration: 2014-12-20
Also published as: JP2991750B2

Abstract

PURPOSE:To change the number of dictionary templates corresponding to a feature pattern by providing a step for generating a standard pattern, step for evaluating the distributing condition of feature patterns and step for deciding the number of dictionary templates in the standard pattern, etc. CONSTITUTION:A clustering part 2 is provided to execute clustering into one or two clusters, a standard pattern preparation part 3 an evaluated value calculation part 5 to calculate an evaluated value, a dictionary template number selection part 5 to select the number of dictionary templates in the standard pattern are provided. The feature pattern of an input character is classified into plural clusters, and the standard pattern is generated from the representative vector of each classified cluster. On the other hand, the condition of distributing the feature patterns of the input character is evaluated and according to the evaluated value, the number of dictionary templates in the standard pattern is decided to the input character. Thus, within the range of a prescribed dictionary size, the number of dictionary templates can be changed corresponding to the feature pattern.

Description

【発明の詳細な説明】［産業上の利用分野］本発明は、パターン辞書とのマツチング距離によって文
字認識あるいは音声認識を行うパターン認識装置におい
て、所定の辞書サイズの範囲内で効果的な辞書テンプレ
ート数で構成される辞書作成方法に関する。Detailed Description of the Invention [Field of Industrial Application] The present invention provides a dictionary template that is effective within a predetermined dictionary size range in a pattern recognition device that performs character recognition or speech recognition based on matching distance with a pattern dictionary. Concerning a method for creating a dictionary composed of numbers.

[Prior art]

従来のパターン辞書は、認識すべき対象パターンが同一
カテゴリ内にあっても、雑音等の影響によってパターン
にバラツキが生ずるため、通常パターン辞書のテンプレ
ートを複数個備えている。Conventional pattern dictionaries usually include a plurality of pattern dictionary templates because even if target patterns to be recognized are in the same category, variations occur in the patterns due to the influence of noise and the like.

特に、活字文字認識においては、文字のフォントによる
形状の違いがはっきりしているので、各フォント毎に辞
書テンプレートを設けている。Particularly in printed character recognition, since there are clear differences in the shapes of characters depending on the font, a dictionary template is provided for each font.

なお、この種の関連する技術として、多数の特徴パター
ンをグループ化してグループを代表する特徴パターンを
標準パターンとする標準パターン作成装置（特開昭５９
−１６３６８０号公報）がある。As a related technology of this type, there is a standard pattern creation device (Japanese Patent Laid-Open No. 59-1992) which groups a large number of characteristic patterns and uses a characteristic pattern representing the group as a standard pattern.
-163680).

［発明が解決しようとする課題］しかしながら、上記した技術では辞書サイズが大きくな
り、認識処理速度が遅くなるという問題があり、特に辞
書テンプレートの総数を所定の範囲に抑えつつ最も少な
いテンプレート数で種々のフォントを！８識するための
辞書の構成について、考慮されていなかった。[Problems to be Solved by the Invention] However, the above technology has the problem that the dictionary size becomes large and the recognition processing speed becomes slow.In particular, it is possible to keep the total number of dictionary templates within a predetermined range and to use the smallest number of templates. font! 8. No consideration was given to the structure of the dictionary for understanding.

本発明の目的は、所定の辞書サイズの範囲内で、特徴パ
ターンに対応して辞書テンプレート数を変化させること
ができる辞書作成方法を提供することにある。An object of the present invention is to provide a dictionary creation method that can change the number of dictionary templates according to characteristic patterns within a predetermined dictionary size range.

[Means to solve the problem]

前記目的を達成するために、本発明の辞書作成方法は、
入力文字の特徴パターンを複数のクラスタに分類するス
テップと、前記分類された各クラスタの代表ベクトルか
ら標準パターンを生成するステップと、前記入力文字の
特徴パターンの分布状況を評価するステップと、前記評
価ステップによって評価された値によって、前記入力文
字に対する標準パターンの辞書テンプレート数を決定す
るステップからなる。本発明の他の実施例では、前記評
価ステップで更に文字の出現頻度を評価している。In order to achieve the above object, the dictionary creation method of the present invention includes:
a step of classifying the characteristic pattern of the input character into a plurality of clusters, a step of generating a standard pattern from a representative vector of each of the classified clusters, a step of evaluating the distribution status of the characteristic pattern of the input character, and the evaluation. The method comprises the step of determining the number of dictionary templates of standard patterns for the input characters according to the values evaluated in the step. In another embodiment of the present invention, the frequency of appearance of characters is further evaluated in the evaluation step.

［作　用］複数の文字画像データから特徴量を抽出し、Ｎ次元の特
徴ベクトルを得、クラスタリング部において、該複数の
特徴ベクトルをクラスタリングする。このようにクラス
タリングされた１個のクラスタをＪ、２個のクラスタを
Ｋ　、　Ｋ、とする。標準パターン作成部では、Ｊ、に
、、Ｋ、の各クラスタについて、そのクラスタを代表す
るベクトルを計算し、この代表するベクトル、例えばク
ラスタ内の平均ベクトルを標準パターンとして標準パタ
ーン保持部に登録する。そして、評価値計算部では、任
意の文字の特徴ベクトルがどの程度２つのクラスタに分
かれているかの尺度を表す評価値を算出し、その結果を
評価値保持部に登録し、辞書テンプレート数選択部は評
価値保持部に登録されている全ての文字の評価値を読み
だして、その評価値の小さい文字から２１種類までの文
字に対しては辞書テンプレート数を１個にし、残りのＰ
。[Operation] Feature quantities are extracted from a plurality of character image data to obtain N-dimensional feature vectors, and the plurality of feature vectors are clustered in a clustering section. Let J be one cluster clustered in this way, and let K be two clusters. The standard pattern creation unit calculates a representative vector for each cluster of J, , , K, and registers this representative vector, for example, the average vector within the cluster, as a standard pattern in the standard pattern holding unit. . Then, the evaluation value calculation section calculates an evaluation value representing the degree to which the feature vector of an arbitrary character is divided into two clusters, registers the result in the evaluation value storage section, and registers the result in the dictionary template number selection section. reads the evaluation values of all the characters registered in the evaluation value holding section, sets the number of dictionary templates to one for the characters with the lowest evaluation value up to 21 types, and uses the remaining P
.

種類の文字については、辞書テンプレート数を２個にす
る。For type characters, the number of dictionary templates is set to two.

［実施例］以下、本発明の一実施例を図面を用いて具体的に説明す
る。[Example] Hereinafter, an example of the present invention will be specifically described using the drawings.

第１図は、本発明の辞書作成装置のブロック構成図を示
す。図において、１は辞書作成用の画像データから特徴
を抽出する特徴抽出部で、２は抽出された複数の特徴ベ
クトルを１個あるいは２個のクラスタにクラスタリング
するクラスタリング部である。３は各クラスタ内の平均
ベクトルを算出することによって標準パターンを作成す
る標準パターン作成部であり、４は該標準パターン作成
部で作成された標準パターンを保持する標準パターン保
持部、５は任意の文字の特徴ベクトルがどの程度２つの
クラスタに分かれているかの尺度を表す評価値を算出す
る評価値計算部、６はその評価値を保持する評価値保持
部、７は評価値保持部６から読み出された評価値に従っ
て、標準パターン保持部に保持された標準パターンの辞
書テンプレート数を選択する辞書テンプレート数選択部
、８は該選択部で選択されたテンプレート数で構成され
たパターン辞書である。FIG. 1 shows a block diagram of a dictionary creation device according to the present invention. In the figure, 1 is a feature extraction unit that extracts features from image data for dictionary creation, and 2 is a clustering unit that clusters a plurality of extracted feature vectors into one or two clusters. 3 is a standard pattern creation unit that creates a standard pattern by calculating the average vector within each cluster; 4 is a standard pattern holding unit that holds the standard pattern created by the standard pattern creation unit; and 5 is an arbitrary An evaluation value calculation unit that calculates an evaluation value representing a measure of how much the character feature vector is divided into two clusters, 6 an evaluation value holding unit that holds the evaluation value, and 7 reading from the evaluation value holding unit 6. A dictionary template number selection section 8 selects the number of dictionary templates of the standard pattern held in the standard pattern holding section according to the evaluation value provided, and reference numeral 8 represents a pattern dictionary constituted by the number of templates selected by the selection section.

第２図は、本発明の辞書作成動作を説明するためのフロ
ーチャートである。このフローチャートを参照しつつ、
以下本発明の動作を詳細に説明すると、今、各文字の辞
書テンプレート（辞書の特徴パターン）数が１個の辞書
テンプレートからなる文字と、２個の辞書テンプレート
とからなる文字によってパターン辞書を作成すると仮定
し、１個の辞書テンプレートからなる文字を２１種類、
２個の辞書テンプレートとからなる文字をＰ１種類作成
するとき、処理速度の関係から決まるパターン辞書の総
テンプレート数の上限数をＴとする。FIG. 2 is a flowchart for explaining the dictionary creation operation of the present invention. While referring to this flowchart,
To explain the operation of the present invention in detail below, a pattern dictionary is now created using a character whose dictionary template (characteristic pattern of the dictionary) of each character is one dictionary template and a character which is composed of two dictionary templates. Assuming that, there are 21 types of characters in one dictionary template,
When creating P1 types of characters consisting of two dictionary templates, let T be the upper limit of the total number of templates in the pattern dictionary determined from the relationship of processing speed.

また、辞書に登録する字種の数を０個（Ｐ、　＋　Ｐ、
）とすると、Ｔ＝Ｐ、＋２Ｐ、、Ｐ、＝２Ｃ−Ｔ。Also, set the number of character types to be registered in the dictionary to 0 (P, + P,
), then T=P,+2P,,P,=2C-T.

Ｐ、＝Ｔ−Ｃ，（ＣＡＴ＜２Ｃ）が成立するが、０個の
字種をどのようにして効果的に２３種と２３種に割り当
てるかを、以下に説明する。P,=TC,(CAT<2C) holds, but how to effectively allocate 0 character types to 23 types and 23 types will be explained below.

各文字毎に複数の画像データを用意し、特徴抽出部１に
おいて特徴量を抽出し、Ｎ次元の特徴ベクトルを得る（
ステップ２０）、クラスタリング部２は、複数の特徴ベ
クトルからパターンの塊すなわちクラスタを見つけるこ
とにより、類似した特徴を持つパターンを１つのクラス
として認識し、１個のクラスタと２個のクラスタとに分
類（クラスタリング）する（ステップ２１）。このクラ
スタリングの方法としては、例えばに−ｍｅａｎクラス
タリング等の非階層的クラスタリング、あるいは重心法
、Ｗａｒｄ法等の階層的クラスタリング法がある。この
ようにクラスタリングされた１個のクラスタをＪ　（デ
ータ全体）、２個のクラスタをに１、Ｋｌとする。第３
図（ａ）は、１個のクラスタＪを表現した図で、各点が
特徴空間における特徴ベクトルを表し、類似した特徴量
を持った点が特徴空間上で塊を形成している。第３図（
ｂ）は、２個のクラスタに１、Ｋ、を表現した図で、特
徴空間が２つのカテゴリに分類されている。Prepare a plurality of image data for each character, extract the feature amount in the feature extraction unit 1, and obtain an N-dimensional feature vector (
Step 20) The clustering unit 2 recognizes patterns with similar features as one class by finding clusters of patterns from a plurality of feature vectors, and classifies them into one cluster and two clusters. (clustering) (step 21). Examples of this clustering method include non-hierarchical clustering such as -mean clustering, and hierarchical clustering methods such as centroid method and Ward method. Let one cluster thus clustered be J (the entire data), and two clusters be 1 and Kl. Third
Figure (a) is a diagram expressing one cluster J, where each point represents a feature vector in the feature space, and points with similar feature amounts form a cluster in the feature space. Figure 3 (
b) is a diagram expressing 1 and K in two clusters, and the feature space is classified into two categories.

標準パターン作成部３では、Ｊ、に、、Ｋ、の各クラス
タについて、そのクラスタを代表するベクトルを計算す
る。この代表するベクトルとしては、例えばクラスタ内
の平均ベクトルを用いる（ステップ２２）。これらの平
均ベクトルを標準パターンとして標準パターン保持部４
に登録する（ステップ２３）。The standard pattern creation unit 3 calculates a vector representing each cluster of J, , , K. As this representative vector, for example, the average vector within the cluster is used (step 22). The standard pattern holding unit 4 uses these average vectors as a standard pattern.
(Step 23).

次いで、評価値計算部５は、任意の文字の特徴ベクトル
がどの程度２つのクラスタに分かれているかの尺度を表
す評価値δを次式によって算出する。Next, the evaluation value calculation unit 5 calculates an evaluation value δ representing a measure of how much the feature vector of an arbitrary character is divided into two clusters using the following equation.

δ＝ｂ−（ａ、＋ａ、）ここで、ａｌ、ａ、はクラスタが２個の場合、各クラス
タにおける分散を表し、ｂはクラスタが１個の場合の分
散を表す。すなわち、第４図において、２つのクラスタ
に１、Ｋ、に分かれている場合は、クラスタに１の平均
を４２の点とすると分散８１は、クラスタに、の平均４
２からクラスタＫ。δ=b−(a,+a,) Here, al and a represent the variance in each cluster when there are two clusters, and b represents the variance when there is one cluster. That is, in Fig. 4, if the cluster is divided into two clusters, 1, K, and the average of 1 in the cluster is 42 points, then the variance 81 is the average of 4 in the cluster.
2 to cluster K.

内のある特徴ベクトルまでの距離の２乗を、クラスタに
内の全ての特徴ベクトルについて加算したものであり、
クラスタに、についても同様にその分散ａ、が算出され
る。分散すは、クラスタに１、Ｋ、の平均を４１の点と
すると、平均４１から任意の特徴ベクトルまでの距離の
２乗を、クラスタに３、Ｋ、内の全ての特徴ベクトルに
ついて加算することによって算出される。このことから
、２つのクラスタが離れているほどｂの値が大きく、つ
まり評価値δが大きくなり、他方２つのクラスタが接近
すればするほど評価値δが小さくなる。It is the sum of the square of the distance to a certain feature vector in the cluster for all feature vectors in the cluster,
Similarly, the variance a is calculated for the cluster. If the average of 1,K,in a cluster is 41 points, the distribution is,add the square of the distance from the average 41 to any,feature vector for all feature vectors in 3,K,in a cluster. Calculated by From this, the farther two clusters are apart, the larger the value of b, that is, the larger the evaluation value δ, and the closer the two clusters are, the smaller the evaluation value δ becomes.

各文字毎に評価値を計算しくステップ２４）、その結果
を評価値保持部６に登録する。以上の処理を全ての文字
に対して行う（ステップ２５）。An evaluation value is calculated for each character (step 24), and the result is registered in the evaluation value holding section 6. The above processing is performed for all characters (step 25).

次いで、辞書テンプレート数選択部７は評価値保持部６
に登録されている全ての文字の評価値を読みだして、そ
の評価値の小さいものから順にソートする（ステップ２
６）。そして、評価値の小さい文字から２１種類までの
文字に対しては辞書テンプレート数を１個にし、該文字
に対応する標準パターンを標準パターン保持部４のクラ
スタＪから読みだして、辞書テンプレートとして登録す
る（ステップ２７．２８）。残りの２３種類の文字につ
いては、辞書テンプレート数を２個にし、標準パターン
保持部４のクラスタに１、Ｋ、から標準パターンを読み
だして、辞書テンプレートとして登録する（ステップ２
７．２９）。Next, the dictionary template number selection section 7 selects the number of dictionary templates from the evaluation value holding section 6.
Read the evaluation values of all characters registered in , and sort them in descending order of evaluation value (Step 2
6). Then, the number of dictionary templates is set to one for characters with the lowest evaluation value to 21 types, and the standard pattern corresponding to the character is read from the cluster J of the standard pattern holding unit 4 and registered as a dictionary template. (steps 27.28). For the remaining 23 types of characters, the number of dictionary templates is set to 2, standard patterns are read from clusters 1 and K in the standard pattern holding unit 4, and registered as dictionary templates (step 2).
7.29).

第５図は、本発明の他の実施例のブロック構成図で、先
の実施例の構成に出現頻度保持部９を付加したものであ
る。他の構成要素は前述したものと同様であるので、説
明を省略する。FIG. 5 is a block diagram of another embodiment of the present invention, in which an appearance frequency holding section 9 is added to the structure of the previous embodiment. The other components are the same as those described above, so their explanation will be omitted.

この実施例では、評価値保持部６から読みだした値と出
現頻度保持部９から読みだされた出現頻度とを用いて、
その文字についての新たな評価値を評価値計算部５で計
算するようにしたものである。ここでいう出現頻度とは
、実際の文書等においてその文字が統計的にどの程度の
割合で出現するかを表した量であって、例えば出現頻度
Ｈが０から１０までの整数値で与えられ、その値が大き
いほど出現頻度が大きいものとすると、新しい評価値μ
として次の式を用いる。In this embodiment, using the value read from the evaluation value holding unit 6 and the appearance frequency read from the appearance frequency holding unit 9,
The new evaluation value for that character is calculated by the evaluation value calculation section 5. The frequency of appearance here refers to the amount that represents the statistical rate at which a character appears in an actual document, etc. For example, the frequency of appearance H is given as an integer value from 0 to 10. , the larger the value, the greater the appearance frequency, then the new evaluation value μ
The following formula is used as

μ＝α・δ＋β・Ｈただし、α、βはパラメータである。μ=α・δ+β・H However, α and β are parameters.

第６図は、第２の実施例の動作を説明するためのフロー
チャートである。先の実施例の動作と相違する点は、ス
テップ２５と２６の間に、出現頻度Ｈと評価値δから新
しい評価値μを算出するステップが加わった点であり、
他のステップは全く同様であるので、その動作説明を省
略する。FIG. 6 is a flowchart for explaining the operation of the second embodiment. The difference from the operation of the previous embodiment is that a step of calculating a new evaluation value μ from the appearance frequency H and the evaluation value δ is added between steps 25 and 26.
Since the other steps are exactly the same, the explanation of their operation will be omitted.

以上の処理では、各文字の辞書テンプレートが２種類（
１個の辞書テンプレートと、２個の辞書テンプレート）
の場合について説明したが、これを３種類以上にした場
合でも同様にして処理できる。各文字が３種類の辞書テ
ンプレート（１個の辞書テンプレートと、２個の辞書テ
ンプレートと、３個の辞書テンプレート）で構成される
場合は以下のようになる。In the above process, there are two types of dictionary templates for each character (
1 dictionary template and 2 dictionary templates)
Although the above case has been described, even if there are three or more types, the same process can be performed. If each character is composed of three types of dictionary templates (one dictionary template, two dictionary templates, and three dictionary templates), the following will occur.

辞書テンプレート数が１個である文字が２１種類、辞書
テンプレート数が２個である文字が２３種類、辞書テン
プレート数が３個である文字がＰ、種類となるようにパ
ターン辞書を作成すると仮定する。ここで、Ｐｌ、Ｐｌ
、Ｐ５は、Ｐ、＋２Ｐ、＋３Ｐ、＝Ｔ、Ｐ、＋Ｐ、＋Ｐ
、＝Ｃを満たすものとする。Assume that a pattern dictionary is created such that there are 21 types of characters with 1 dictionary template, 23 types of characters with 2 dictionary templates, and P types of characters with 3 dictionary templates. . Here, Pl, Pl
, P5 is P, +2P, +3P, =T, P, +P, +P
, =C.

各文字について、３個のクラスタにクラスタリングした
結果をＬｌ、Ｌｌ、Ｌｌ、２個のクラスタにクラスタリ
ングした結果をに１、Ｋ８．１個のクラスタにクラスタ
リングした結果をＪとする。For each character, the result of clustering into three clusters is Ll, Ll, Ll, the result of clustering into two clusters is 1, and the result of clustering into K8.1 clusters is J.

Ｌｉ　　（ｉ＝１，２．３）のクラスタ内分散をａｉ、
Ｋ１のクラスタ内分散をｂｉ、Ｊのクラスタ内分散をＣ
とすると、評価値δ１、δ、は次のようになる。Let the intra-cluster variance of Li (i=1, 2.3) be ai,
Bi is the intra-cluster variance of K1, and C is the intra-cluster variance of J.
Then, the evaluation values δ1, δ are as follows.

δ、＝Σｂｉ−Σａ　１１ δ、＝Ｃ−Σｂｉこれらの計算結果を前述したと同様にして評価値保持部
に登録する。辞書テンプレート数選択部は、まず評価値
δ１の小さいものから順に２１種類までの文字に対して
は辞書テンプレート数を１個にし、次に評価値δ１の小
さいものから順に２８種類までの文字に対しては辞書テ
ンプレート数を２個にし、残りの２１種類の文字を辞書
テンプレート数３個にする。δ,=Σbi−Σa 11 δ,=C−Σbi These calculation results are registered in the evaluation value holding unit in the same manner as described above. The dictionary template number selection section first sets the number of dictionary templates to one for up to 21 types of characters in order from the smallest evaluation value δ1, and then sets the number of dictionary templates to one for up to 28 types of characters in order from the smallest evaluation value δ1. In this case, the number of dictionary templates is set to two, and the number of dictionary templates for the remaining 21 types of characters is set to three.

［発明の効果］以上、説明したように、本発明によれば、特徴パターン
の分布状況あるいはカテゴリの出現頻度に応じて辞書テ
ンプレート数を変えることができるので、外界のパター
ン分布をより忠実にパターン辞書に反映することが可能
となり、認識率を格段に向上させることができる。[Effects of the Invention] As explained above, according to the present invention, the number of dictionary templates can be changed depending on the distribution of feature patterns or the appearance frequency of categories, so pattern distribution in the external world can be more faithfully reproduced. This can be reflected in the dictionary, and the recognition rate can be significantly improved.

[Brief explanation of the drawing]

第１図は本発明の辞書作成装置のブロック構成図、第２
図は本発明の辞書作成動作を説明するためのフローチャ
ート、第３図（ａ）は１個のクラスタＪを表現した図、
第３図（ｂ）は２個のクラスタに、、　Ｋ、を表現した
図、第４図は２つのクラスタに１、Ｋ、に分かれている
場合の分散を説明するための図、第５図は本発明の他の
実施例のブロック構成図、第６図は第２の実施例の動作
を説明するためのフローチャートである。１・・・特徴抽出部、　　２・・・クラスタリング部、
３・・・標準パターン作成部、４・・・標準パターン保持部、５・・・評価値計算部、　　６・・・評価値保持部、７
・・・辞書テンプレート数選択部、８・・・パターン辞書、９・・・出現頻度保持部。FIG. 1 is a block diagram of the dictionary creation device of the present invention, and FIG.
The figure is a flowchart for explaining the dictionary creation operation of the present invention, FIG. 3(a) is a diagram expressing one cluster J,
Figure 3(b) is a diagram expressing K in two clusters, Figure 4 is a diagram to explain the variance when divided into two clusters, 1, K, and Figure 5. is a block diagram of another embodiment of the present invention, and FIG. 6 is a flowchart for explaining the operation of the second embodiment. 1...Feature extraction unit, 2...Clustering unit,
3... Standard pattern creation section, 4... Standard pattern holding section, 5... Evaluation value calculation section, 6... Evaluation value holding section, 7
... Dictionary template number selection section, 8. Pattern dictionary, 9. Appearance frequency holding section.

Claims

[Claims]

(1) A pattern dictionary creation method for creating a pattern dictionary, which includes the steps of classifying characteristic patterns of input characters into a plurality of clusters, generating a standard pattern from representative vectors of each of the classified clusters, and 1. A dictionary creation method comprising: evaluating the distribution of feature patterns; and determining the number of standard pattern dictionary templates for the input characters based on the values evaluated in the evaluation step.

(2) The dictionary creation method according to claim 1, wherein in the evaluation step, the frequency of appearance of characters is further evaluated.

(3) The dictionary creation method according to claim 1, wherein the total number of dictionary templates determined is within a predetermined upper limit number.