JPH11203414A

JPH11203414A - Broadly classified dictionary preparing device

Info

Publication number: JPH11203414A
Application number: JP10002363A
Authority: JP
Inventors: Katsuhiko Itonori; 勝彦糸乘
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1998-01-08
Filing date: 1998-01-08
Publication date: 1999-07-30

Abstract

PROBLEM TO BE SOLVED: To provide a broadly classified dictionary preparing device capable of preparing a broadly classified dictionary having high recognition ratios. SOLUTION: A feature value extraction part 12 extracts the feature values of all sample images in a character image data file 11, calculates the mean value of these feature values and stores the mean value in a feature value storing part 13 in each character sort. A 1st broadly classified dictionary preparing part 14 prepares a primary broadly classfied dictionary by classifying the feature values of respective character sorts stored in the storing part 13 into categories by a 1st distance calculation method. A 2nd broadly classified dictionary preparing part 15 prepares a secondary broadly classified dictionary by calculating the sum of distances between the mean of categories at the time of changing the categories of character species and respective character species in the categories by a total distance calculation part 16 by using a 2nd distance calculation method, moving the character species to the minimum category and reconstituting the primary broadly classified dictionary.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文字認識装置など
において、詳細認識処理の前処理として類似文字のカテ
ゴリに分類する大分類処理に使用する大分類辞書を作成
する大分類辞書作成装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a large classification dictionary creating apparatus for creating a large classification dictionary used in a large classification process for classifying into similar character categories as preprocessing of a detailed recognition process in a character recognition device or the like. It is.

【０００２】[0002]

【従来の技術】文字認識装置等では、認識に要する時間
を短縮するために、入力された複数の文字から多次元の
特徴量を抽出し、この特徴量上で類似している文字から
なるカテゴリに分類する大分類処理を用いる手法があ
る。もちろん、入力された文字は大分類された後、さら
に詳しい特徴量を用いて詳細に認識され、１つの文字種
に確定される。2. Description of the Related Art In a character recognition device or the like, in order to reduce the time required for recognition, a multidimensional feature is extracted from a plurality of input characters, and a category of characters similar in the feature is extracted. There is a method that uses a large classification process that classifies the data into two types. Of course, after the input characters are roughly classified, they are recognized in detail using more detailed feature amounts, and are determined as one character type.

【０００３】このような従来の手法を用いたものとし
て、例えば特開平１−１６１５９２号公報に記載されて
いる技術がある。すなわち、認識辞書内の特徴量の似通
っている文字群をカテゴリ化し、これらの平均の特徴量
により標準特徴量を求め、カテゴリの特徴量として大分
類辞書を作成する。そして、入力された文字画像データ
の特徴量を抽出し、大分類辞書とマッチングを行ない、
最も距離の近いカテゴリを選択する。さらに、選択した
カテゴリ内の各文字との特徴量のマッチングにより、最
終的な文字候補を決定する。As a technique using such a conventional technique, there is a technique described in, for example, Japanese Patent Application Laid-Open No. 1-161592. That is, a group of characters having similar feature amounts in the recognition dictionary is categorized, a standard feature amount is obtained from these average feature amounts, and a large classification dictionary is created as a category feature amount. Then, the feature amount of the input character image data is extracted and matched with the large classification dictionary,
Select the closest category. Further, a final character candidate is determined by matching the feature amount with each character in the selected category.

【０００４】また、特開平５−１２８３１２号公報に
は、大分類辞書を作成する時に用いる特徴量を、よりグ
ローバルな文字の特徴量を用いることで大分類処理を正
確に行なえる技術が開示されている。Japanese Patent Application Laid-Open No. 5-128312 discloses a technique for performing a large classification process accurately by using a more global character feature amount as a feature amount used when creating a large classification dictionary. ing.

【０００５】さらに特開平４−２６６１８８号公報に
は、辞書を作成するための全サンプル文字の平均を用い
て、初期シード点を決定して、順次全てのシード点から
最も離れているサンプルを次のシード点とすることで、
効率よく大分類辞書を作成する辞書作成装置が開示され
ていている。Further, Japanese Patent Laid-Open No. 4-266188 discloses that an initial seed point is determined by using an average of all sample characters for creating a dictionary, and a sample farthest from all the seed points is successively determined. By setting the seed point of
A dictionary creation device that creates a large classification dictionary efficiently has been disclosed.

【０００６】上述の従来技術は、いずれも文字種内のサ
ンプルの平均と、１つの距離計算方法を用いた各文字種
の平均間の距離を用いてカテゴリ化し、大分類辞書を作
成している。作成された大分類辞書において、カテゴリ
のまとまりの良さを表わす尺度として、一般にカテゴリ
の平均とカテゴリ内のサンプルとの自乗誤差が用いられ
る。この自乗誤差が小さいほど、カテゴリ内のサンプル
のまとまりが良いことになり、自乗誤差を最小とするよ
うにカテゴリを再構成している。In each of the above-mentioned prior arts, a large classification dictionary is created by categorizing using the average between samples within a character type and the distance between the averages of each character type using one distance calculation method. In the created large classification dictionary, a square error between an average of the categories and a sample in the category is generally used as a scale indicating the goodness of unity of the categories. The smaller the square error is, the better the sample in the category is. The category is reconfigured so as to minimize the square error.

【０００７】このときの自乗誤差を計算する際の距離計
算方法としては、一般にユークリッド距離が用いられ
る。しかし、実際の認識処理では、カテゴリ内の文字種
の分布を用いた、より計算コストが高く、認識率が高く
なるような距離計算法を用いることが多い。この場合、
ユークリッド距離で自乗誤差を最小化した大分類辞書を
用いたのでは、距離の計算尺度が異なるため、認識率は
悪くなるという問題があった。As a method of calculating the square error at this time, the Euclidean distance is generally used. However, in actual recognition processing, a distance calculation method that uses a distribution of character types within a category and has a higher calculation cost and a higher recognition rate is often used. in this case,
If a large classification dictionary that minimizes the square error with the Euclidean distance is used, there is a problem that the recognition rate is deteriorated because the calculation scale of the distance is different.

【０００８】カテゴリ化する段階から、主成分分析など
計算コストの高い距離計算方法を用いることも考えられ
るが、カテゴリ化の初期の段階では、カテゴリ内のサン
プル数が少なく、計算の精度が悪くなり、識別率のよい
大分類辞書を作成することはできない。From the categorization stage, it is conceivable to use a distance calculation method having a high calculation cost such as principal component analysis. However, in the initial stage of categorization, the number of samples in a category is small, and the accuracy of calculation becomes poor. However, a large classification dictionary with a high identification rate cannot be created.

【０００９】また、上述の特開平４−２６６１８８号公
報に記載されている方法では、全てのサンプルを用いて
平均を計算し、大分類辞書を作成しているので、文字種
内のすべてのサンプルが同じカテゴリに分類される保証
はなく、１つの文字種が複数のカテゴリに含まれ、大分
類辞書のサイズを大きくする要因となるといった問題も
あった。In the method described in Japanese Patent Application Laid-Open No. Hei 4-266188, an average is calculated using all samples and a large classification dictionary is created. There is no assurance that they will be classified into the same category, and there is also a problem that one character type is included in a plurality of categories, which may increase the size of the large classification dictionary.

【００１０】[0010]

【発明が解決しようとする課題】本発明は、上述した事
情に鑑みてなされたもので、上述のような距離計算方
法、あるいは計算精度の違いによる認識率への悪影響を
抑え、認識率の高い大分類辞書を作成することのできる
大分類辞書作成装置を提供することを目的とするもので
ある。SUMMARY OF THE INVENTION The present invention has been made in view of the above circumstances, and suppresses an adverse effect on the recognition rate due to the above-described distance calculation method or a difference in calculation accuracy, and has a high recognition rate. It is an object of the present invention to provide a large classification dictionary creation device capable of creating a large classification dictionary.

【００１１】[0011]

【課題を解決するための手段】本発明は、各文字種の平
均を用いて第１の距離計算方法で１次大分類辞書を作成
し、次に第２の距離計算方法によって１次大分類辞書内
のカテゴリの平均と各文字種の特徴との距離の総和を計
算し、各カテゴリにおいて距離の総和が最小となるよう
に文字種をカテゴリ間で移動させて２次大分類辞書を作
成するものである。２次大分類辞書を作成する際には、
移動対象の文字種の特徴と移動先のカテゴリの距離が、
第１の距離計算方法において閾値以内である文字種につ
いてのみ、カテゴリの移動を行なうように制限して２次
大分類辞書を作成することができる。According to the present invention, a primary large classification dictionary is created by a first distance calculation method using an average of each character type, and then a primary large classification dictionary is created by a second distance calculation method. Calculates the sum of the distances between the average of the categories within and the characteristics of each character type, and moves the character type between the categories so that the total sum of the distances in each category is minimized to create a secondary large classification dictionary . When creating a secondary large classification dictionary,
If the characteristics of the character type to be moved and the distance between the destination category are
In the first distance calculation method, the secondary large classification dictionary can be created by restricting the movement of the category only for the character type within the threshold value.

【００１２】例えば第１の距離計算方法として計算コス
トの低い距離計算方法を用いて初期シード点を決定して
１次大分類辞書を構成し、それをもとに、第２の距離計
算方法として実際に識別処理で用いる計算コストの高い
距離計算方法を用いて１次大分類辞書を構成し直し、実
際に大分類処理の際に用いる２次大分類辞書を作成する
ことができる。作成された２次大分類辞書は、実際に識
別処理で用いる距離計算方法によって再構成されている
ので、大分類処理および詳細認識処理ともに距離の計算
尺度を一致させることができ、最終的な認識率を向上さ
せることができる。また、初期シード点の決定を例えば
従来のようなユークリッド距離などを用いて行なうこと
によって、計算コストを低減するとともに、最初から計
算コストの高い距離計算方法を用いた場合における精度
低下を防ぐことが可能となる。For example, an initial seed point is determined using a distance calculation method having a low calculation cost as a first distance calculation method, and a first-order large classification dictionary is constructed. The primary large classification dictionary can be reconfigured by using a distance calculation method having a high calculation cost actually used in the identification processing, and a secondary large classification dictionary actually used in the large classification processing can be created. Since the created secondary large classification dictionary is reconfigured by the distance calculation method actually used in the identification processing, the calculation scale of the distance can be matched in both the large classification processing and the detailed recognition processing. Rate can be improved. Also, by determining the initial seed point using, for example, a conventional Euclidean distance, the calculation cost can be reduced, and the accuracy can be prevented from being reduced when a distance calculation method having a high calculation cost is used from the beginning. It becomes possible.

【００１３】[0013]

【発明の実施の形態】図１は、本発明の大分類辞書作成
装置の実施の一形態を示す基本構成図である。図中、１
１は文字画像データファイル、１２は特徴量抽出部、１
３は特徴量記憶部、１４は第１の大分類辞書作成部、１
５は第２の大分類辞書作成部、１６は総距離計算部、１
７は中央制御装置、１８は記憶装置である。DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a basic configuration diagram showing an embodiment of a large classification dictionary creating apparatus according to the present invention. In the figure, 1
1 is a character image data file, 12 is a feature amount extraction unit, 1
3 is a feature amount storage unit, 14 is a first large classification dictionary creation unit, 1
5 is a second large classification dictionary creation unit, 16 is a total distance calculation unit, 1
7 is a central control unit, and 18 is a storage device.

【００１４】文字画像データファイル１１は、文字種毎
に分類されたサンプル画像を２値のディジタル画像とし
て記憶している。特徴量抽出部１２は、文字画像データ
ファイル１１内のすべてのサンプル画像に対して特徴量
を抽出し、その平均を計算する。特徴量記憶部１３は、
特徴量抽出部１２で抽出された文字種毎の特徴量を記憶
する。第１の大分類辞書作成部１４は、第１の距離計算
方法で１次大分類辞書を作成する。第２の大分類辞書作
成部１５は、第２の距離計算方法を用いて第１の大分類
辞書作成部１４で作成された１次大分類辞書を再構成
し、２次大分類辞書を作成する。総距離計算部１６は、
カテゴリの平均とカテゴリ内の文字種との距離の総和を
計算する。中央制御装置１７は、装置全体を制御する。
記憶装置１８は、中央制御装置１７が装置全体を制御す
るためのプログラム等を記憶する。The character image data file 11 stores sample images classified for each character type as binary digital images. The feature amount extraction unit 12 extracts feature amounts from all sample images in the character image data file 11 and calculates an average thereof. The feature amount storage unit 13
The feature amount for each character type extracted by the feature amount extraction unit 12 is stored. The first large classification dictionary creating unit 14 creates a first large classification dictionary by a first distance calculation method. The second large classification dictionary creating unit 15 reconstructs the first large classification dictionary created by the first large classification dictionary creating unit 14 using the second distance calculation method, and creates a second large classification dictionary I do. The total distance calculation unit 16
Calculate the sum of the distance between the average of the category and the character type in the category. The central control device 17 controls the entire device.
The storage device 18 stores a program or the like for the central control device 17 to control the entire device.

【００１５】次に本発明の大分類辞書作成装置の実施の
一形態における第１の動作例について、具体例を用いな
がら説明する。文字画像データファイル１１には、予め
収集された文字毎のサンプル画像が、ある一定の大きさ
の２値画像として格納されている。ここでは、一例とし
て各文字種は６４×６４の大きさに揃えられているもの
とする。各文字種に対して複数のサンプル画像が格納さ
れており、各文字種のサンプル画像数は、記憶装置１７
に格納されている。このとき、各文字種内のサンプル画
像数は一定である必要はないが、認識処理の精度を上げ
ることを考えると、１文字種に対して複数のフォントで
構成された数十程度のサンプル画像があることが望まし
い。Next, a first operation example of the embodiment of the large classification dictionary creating apparatus according to the present invention will be described using a specific example. The character image data file 11 stores a sample image of each character collected in advance as a binary image having a certain size. Here, as an example, it is assumed that each character type is set to a size of 64 × 64. A plurality of sample images are stored for each character type, and the number of sample images for each character type is stored in the storage device 17.
Is stored in At this time, the number of sample images in each character type does not need to be constant, but there are several tens of sample images composed of a plurality of fonts for one character type in consideration of improving the accuracy of recognition processing. It is desirable.

【００１６】次に、特徴量抽出部１２で文字画像データ
ファイル１１内の全ての文字画像から特徴を抽出し、文
字種ごとの平均値を計算して特徴量記憶部１３に文字種
コードと対応させて記憶させる。特徴量については、こ
れまでさまざまなものが提案されてきているが、ここで
は一例として１次ペリフェラル特徴を用いて説明する。
１次ペリフェラル特徴は、文字画像の外接矩形のそれぞ
れの辺から走査し、白画素（背景）から黒画素（文字
部）に変化する点までの距離を特徴とするものである。
なお、２番目に白画素から黒画素に変化する点も合わせ
て特徴とする場合は、２次ペリフェラル特徴と呼ばれ
る。Next, the feature extracting unit 12 extracts features from all the character images in the character image data file 11, calculates an average value for each character type, and stores it in the feature storing unit 13 in correspondence with the character type code. Remember. Various features have been proposed so far. Here, as an example, description will be made using a primary peripheral feature.
The primary peripheral feature is characterized by scanning from each side of a circumscribed rectangle of a character image and measuring a distance from a white pixel (background) to a black pixel (character portion).
In addition, when the second point is also characterized by a point that changes from a white pixel to a black pixel, it is called a secondary peripheral characteristic.

【００１７】図２は、１次ペリフェラル特徴の説明図で
ある。図２では文字画像を水平方向のストライプ状の８
つの領域に分割し、各領域ごとに文字の左側の辺から文
字部までの破線の矢印で示した距離を求めている。同様
にして右辺からも文字部までの距離を求めることができ
る。さらに、垂直方向のストライプ状の領域に分割し、
上辺、下辺から文字部までの距離をそれぞれの領域につ
いて求める。領域の分割数は任意であるが、ここでは水
平・垂直方向とも８つの領域に分割することとし、８×
４の合計３２次元の特徴ベクトルを得ることができる。FIG. 2 is an explanatory diagram of the primary peripheral features. In FIG. 2, the character image is represented by 8 stripes in the horizontal direction.
The area is divided into two areas, and the distance indicated by the broken arrow from the left side of the character to the character portion is obtained for each area. Similarly, the distance from the right side to the character portion can be obtained. Furthermore, it is divided into vertical stripes,
The distance from the upper side and the lower side to the character portion is obtained for each area. Although the number of divisions of the region is arbitrary, here, it is assumed that the region is divided into eight regions in both the horizontal and vertical directions, and 8 ×
Thus, a total of 32 feature vectors can be obtained.

【００１８】この特徴ベクトルは、各文字種の各サンプ
ル画像からそれぞれ抽出され、その平均ベクトルが各文
字種の特徴量として特徴量記憶部１３に文字種コードと
対応つけて記憶される。図３は、特徴量記憶部に記憶さ
れた特徴量の一例の説明図である。図３に示す例では、
例えば文字種「亜」には、その複数のサンプル画像から
抽出された３２次元の特徴ベクトルの平均ベクトル
（０．４９，０．８８，・・・）がその文字種の特徴量
として対応づけて記憶される。The feature vector is extracted from each sample image of each character type, and the average vector is stored in the feature amount storage unit 13 as the characteristic amount of each character type in association with the character type code. FIG. 3 is an explanatory diagram of an example of a feature amount stored in the feature amount storage unit. In the example shown in FIG.
For example, for the character type “A”, the average vector (0.49, 0.88,...) Of the 32-dimensional feature vectors extracted from the plurality of sample images is stored in association with the characteristic amount of the character type. You.

【００１９】次に、特徴量記憶部１３に記憶された各文
字種の平均の特徴量を用いて、第１の大分類辞書作成部
１４にて、１次大分類辞書が作成される。第１の大分類
辞書作成部１４では、例えばＤｕｄａ，Ｈａｒｒ著、
“ＰＡＴＴＥＲＮＣＬＡＳＳＩＦＩＣＡＴＩＯＮＡ
ＮＤＳＣＥＮＥＡＮＡＬＹＳＩＳ”、１９７３、Ｗ
ｉｌｅｙ−Ｉｎｔｅｒｓｃｉｅｎｃｅ社刊に記載されて
いる方法を用い、文字種の平均特徴量を用いて階層的な
クラスタリングを行なうことができる。Next, using the average feature amount of each character type stored in the feature amount storage unit 13, a first large classification dictionary creation unit 14 creates a primary large classification dictionary. In the first large classification dictionary creating unit 14, for example, by Duda, Harr,
“PATTERN CLASSIFICATION A
ND SCENE ANALYSIS ", 1973, W
Hierarchical clustering can be performed using the average feature amount of the character type using a method described in iley-Interscience.

【００２０】図４は、階層的クラスタリングの処理の一
例を示すフローチャートである。まずＳ２１において、
所望のクラスタ数をｍ、文字種の総数をｎ、初期クラス
タをＸ＝｛ｃ_i｜ｉ＝１，．．．，ｎ｝とし、ｃ_iは類
似している文字種の代表特徴ベクトルが保持される。ｃ
_iの初期値として、各文字種の代表特徴ベクトルを１つ
ずつ入れる。Ｓ２２において、現在のクラスタ数と所望
のクラスタ数ｍとを比較し、もし現在のクラスタの数が
ｍに等しければ、その時点のＸをクラスタリングの結果
として処理を終わる。そうでない場合はＳ２３へ進む。
Ｓ２３において、特徴空間におけるクラスタの距離ｄが
最も小さい２つのクラスタの組を見つけ出し、これを一
つのクラスタに統合する。そしてＳ２２へ戻る。FIG. 4 is a flowchart showing an example of the hierarchical clustering process. First, in S21,
Let m be the desired number of clusters, n be the total number of character types, and X = {c _i | i = 1,. . . , And n}, c _i is the representative feature vectors of character types that are similar is held. c
_As the initial value of _i , one representative feature vector of each character type is entered. In S22, the current number of clusters is compared with the desired number of clusters m, and if the current number of clusters is equal to m, the process ends with X at that time as a result of clustering. If not, the process proceeds to S23.
In S23, a set of two clusters in which the distance d of the cluster in the feature space is the smallest is found, and this is integrated into one cluster. Then, the process returns to S22.

【００２１】なお、所望のクラスタ数ｍは任意に与える
ことができる。また、この処理の中でクラスタ間の距離
ｄの計算方法には種々のものが考えられるが、ここでは
２つのクラスタ内の特徴ベクトルを１つずつ取り出して
組を作ってユークリッド距離を計算し、その中で最も近
い位置にあるベクトルの組の距離を２つのクラスタの距
離とする。この例では、階層的なクラスタリング手法を
用いているが、良く知られているＫ−ｍｅａｎ法など、
他の種々の方法を用いて、１次大分類辞書を作成するこ
ともできる。The desired number m of clusters can be arbitrarily given. In this process, various methods of calculating the distance d between the clusters are conceivable. Here, the Euclidean distance is calculated by taking out the feature vectors in the two clusters one by one and forming a set. The distance between the closest vector set among them is defined as the distance between two clusters. In this example, a hierarchical clustering method is used, but a well-known K-mean method or the like is used.
Various other methods may be used to create the primary large classification dictionary.

【００２２】このようにして作成されたカテゴリのまと
まりの良さを表わす尺度として、上述のように一般にカ
テゴリの平均とカテゴリ内のサンプルとの自乗誤差が用
いられる。この自乗誤差が小さいほど、カテゴリ内のサ
ンプルのまとまりが良いことになるが、このような階層
的なクラスタリング処理では、各カテゴリについて自乗
誤差が最小となっているとは言えない。そこで、前述の
文献（Ｄｕｄａ，Ｈａｒｒ著、“ＰＡＴＴＥＲＮＣＬ
ＡＳＳＩＦＩＣＡＴＩＯＮＡＮＤＳＣＥＮＥＡＮ
ＡＬＹＳＩＳ”、１９７３、Ｗｉｌｅｙ−Ｉｎｔｅｒｓ
ｃｉｅｎｃｅ社刊）では、カテゴリ内のサンプルを別の
カテゴリに移動させた時の自乗誤差の変化量を調べ、そ
の自乗誤差が最も小さくなるようにカテゴリを再構成す
ることで、擬似的に最適なカテゴリを得る方法を示して
いる。As described above, the square error between the average of the categories and the samples in the categories is generally used as a scale indicating the goodness of unity of the categories created as described above. The smaller the square error is, the better the unit of samples in the category is. However, in such a hierarchical clustering process, it cannot be said that the square error is the minimum for each category. Therefore, the above-mentioned document (Duda, Harr, “PATTERN CL”
ASSIFICATION AND SCENE AN
ALYSIS ", 1973, Wiley-Inters
(Science Co., Ltd.) examines the amount of change in the square error when a sample in a category is moved to another category, and reconfigures the category so as to minimize the square error. Shows how to get categories.

【００２３】この手法では、一般にユークリッド距離を
用いてカテゴリ内の自乗誤差を計算しているので、上述
のように実際の認識処理で用いる距離計算方法と異な
り、認識率が低下する。このような問題を回避し、さら
に認識率の高くなるような距離計算方法によって、大分
類辞書を作成するために、第２の大分類辞書作成部１５
と総距離計算部１６により２次大分類辞書を作成する。In this method, since the square error within a category is generally calculated using the Euclidean distance, the recognition rate is reduced unlike the distance calculation method used in the actual recognition processing as described above. In order to avoid such a problem and create a large classification dictionary by a distance calculation method that further increases the recognition rate, the second large classification dictionary creation unit 15
And the total distance calculation unit 16 creates a secondary large classification dictionary.

【００２４】図５は、本発明の大分類辞書作成装置の実
施の一形態における第１の動作例における第２の大分類
辞書作成部１５の処理の一例を示すフローチャートであ
る。ここでは、１文字種毎の処理だけを述べるが、実際
には文字種全体に対して図５に示す処理を行ない、カテ
ゴリ間での文字種の移動が無くなるまで処理を繰り返
す。また、以下の処理で用いる距離は、第２の距離計算
方法で求められる距離を用いる。FIG. 5 is a flow chart showing an example of the processing of the second large classification dictionary creation unit 15 in the first operation example in the embodiment of the large classification dictionary creation device of the present invention. Here, only the processing for each character type will be described. However, the processing shown in FIG. 5 is actually performed for the entire character type, and the processing is repeated until the movement of the character type between categories stops. The distance used in the following processing uses the distance obtained by the second distance calculation method.

【００２５】まず、Ｓ３１において対象文字種を更新
し、Ｓ３２において対象文字種の属しているカテゴリを
変数Ｊに格納する。次に、Ｓ３３においてカテゴリ内の
全ての文字種の平均特徴量を用いて、カテゴリの平均と
カテゴリ内の全ての文字種との距離を計算し、その総和
を求める。この処理は、総距離計算部１６によって行な
われる。このとき、対象文字種の属しているカテゴリで
は、その対象文字種をカテゴリから除いたことを想定し
てカテゴリの平均とカテゴリ内の全ての文字種との距離
の総和を計算する。また、対象文字種の属していないカ
テゴリについては、その対象文字種をカテゴリに加えた
場合を想定し、カテゴリの平均とカテゴリ内の全ての文
字種との距離の総和を計算する。このようにして計算さ
れた総和をＡ_iとする。First, the target character type is updated in S31, and the category to which the target character type belongs is stored in the variable J in S32. Next, in S33, the average feature amount of all the character types in the category is used to calculate the distance between the average of the category and all the character types in the category, and the sum is calculated. This process is performed by the total distance calculation unit 16. At this time, in the category to which the target character type belongs, the sum of the distance between the average of the category and all the character types in the category is calculated on the assumption that the target character type is excluded from the category. For a category to which the target character type does not belong, the case where the target character type is added to the category is assumed, and the sum of the distance between the average of the categories and all the character types in the category is calculated. The sum thus calculated is defined as A _i .

【００２６】Ｓ３４において、Ｓ３３で計算された総和
Ａ_iを比較し、カテゴリ内の総距離が最も小さいクラス
タＳを選択する。Ｓ３５において、選択したクラスタＳ
が対象文字種の属しているクラスタか否かを判定し、ク
ラスタＳが対象文字種の属しているクラスタでなけれ
ば、Ｓ３６において対象文字種を選択したクラスタＳに
移動し、Ｓ３７において全カテゴリの平均を更新する。In S34, the total sum A _i calculated in S33 is compared, and the cluster S having the smallest total distance in the category is selected. In S35, the selected cluster S
Is determined to be a cluster to which the target character type belongs. If the cluster S is not a cluster to which the target character type belongs, the process moves to the cluster S in which the target character type is selected in S36, and the average of all categories is updated in S37. I do.

【００２７】このような処理を文字種全体について行な
うことによって、第２の距離計算方法を用いて計算され
た距離にもとづいてカテゴリが最適化された２次大分類
辞書が構成される。以上のようにして構成される２次大
分離辞書は、第２の距離計算方法として実際に認識処理
で用いる距離計算方法を用いることにより、認識処理に
おいて認識率の高い大分類辞書を構成することができ
る。By performing such processing for the entire character type, a secondary large classification dictionary in which the category is optimized based on the distance calculated by using the second distance calculation method is constructed. The second-order large separation dictionary configured as described above constitutes a large classification dictionary having a high recognition rate in the recognition processing by using the distance calculation method actually used in the recognition processing as the second distance calculation method. Can be.

【００２８】次に本発明の大分類辞書作成装置の実施の
一形態における第２の動作例について説明する。この第
２の動作例では、第２の距離計算方法として投影距離法
を用いる場合について述べる。Next, a second operation example in the embodiment of the large classification dictionary creating apparatus of the present invention will be described. In the second operation example, a case where the projection distance method is used as the second distance calculation method will be described.

【００２９】投影距離は、文字種の特徴ベクトルと、文
字種の分布から計算される上位の主成分軸で構成される
部分空間との残差で定義される。図６は、投影距離の説
明図である。図中、４１は部分空間、４２は特徴ベクト
ル、４３は平均ベクトル、４４は残差である。ここで、
文字種の分布から計算される上位の主成分軸で構成され
る部分空間を部分空間４１とする。また、そのカテゴリ
における平均ベクトルを平均ベクトル４３で示してい
る。いま、文字種の特徴ベクトル４２が与えられたと
き、点Ｃから部分空間４１への最短距離である残差４４
が投影距離となる。The projection distance is defined by a residual between a feature vector of a character type and a subspace composed of higher-order principal component axes calculated from the distribution of the character type. FIG. 6 is an explanatory diagram of the projection distance. In the figure, 41 is a subspace, 42 is a feature vector, 43 is an average vector, and 44 is a residual. here,
A subspace composed of higher-order principal component axes calculated from the distribution of character types is referred to as a subspace 41. The average vector in that category is indicated by an average vector 43. Now, when a character type feature vector 42 is given, a residual 44 which is the shortest distance from the point C to the subspace 41 is obtained.
Is the projection distance.

【００３０】図６に示す例におけるユークリッド距離は
ベクトルＭＣの長さであり、投影距離である残差４４に
比べて大きな値となる。そのため、ユークリッド距離で
は離れていると判断されるようなカテゴリでも、投影距
離では非常に近くに位置していると判断される場合があ
る。The Euclidean distance in the example shown in FIG. 6 is the length of the vector MC, and is a value larger than the residual 44 which is the projection distance. For this reason, a category that is determined to be far from the Euclidean distance may be determined to be located very close to the projection distance.

【００３１】図７は、ユークリッド距離による２つのカ
テゴリの分布の説明図、図８は、投影距離による誤認の
発生の説明図である。図中、５１，５２はカテゴリ、５
３，５４は部分空間、５５は誤認発生箇所である。例え
ば図７に示すように、２次元の特徴量を用いてユークリ
ッド距離で分類された２つのカテゴリ５１，５２が存在
する場合を考える。この場合、投影距離で用いる部分空
間は１次元となり、カテゴリ５１とカテゴリ５２に対す
る部分空間５３，５４はそれぞれ直線として表わされ
る。投影距離によって、この２つのカテゴリを再構成す
ると、２つの部分空間５３，５４が交わる部分におい
て、図８に誤認発生箇所５５として示すようにカテゴリ
５２の一部がカテゴリ５１へと分類され、誤認を生ず
る。FIG. 7 is an explanatory diagram of the distribution of two categories according to the Euclidean distance, and FIG. 8 is an explanatory diagram of the occurrence of erroneous recognition due to the projection distance. In the figure, 51 and 52 are categories, 5
Numerals 3 and 54 are subspaces, and 55 is an erroneous recognition location. For example, as shown in FIG. 7, consider a case where there are two categories 51 and 52 classified by Euclidean distance using a two-dimensional feature amount. In this case, the subspace used for the projection distance is one-dimensional, and the subspaces 53 and 54 for the categories 51 and 52 are each represented as a straight line. When these two categories are reconstructed by the projection distance, a part of the category 52 is classified into the category 51 at the intersection of the two subspaces 53 and 54 as shown in FIG. Is generated.

【００３２】すなわち、カテゴリ５１の部分空間５３の
近くに存在する、カテゴリ５２に属する文字種の分布
は、カテゴリ５１の部分空間５３との残差が非常に小さ
くなり、カテゴリ５２に属する文字種の分布の一部はカ
テゴリ５１へと分類されることになる。実際の文字種の
分布は、特徴量の変動によりある程度の広がりを持って
いるので、誤認発生箇所５５の近傍に位置している文字
種では、誤認が発生する原因となる。That is, the distribution of the character types belonging to the category 52 that exists near the subspace 53 of the category 51 has a very small residual from the subspace 53 of the category 51, and the distribution of the character types belonging to the category 52 is very small. Some will be classified into category 51. Since the actual distribution of character types has a certain degree of spread due to the variation in the feature amount, a character type located near the misidentification occurrence position 55 may cause misidentification.

【００３３】特徴量の変動に対してより安定なカテゴリ
を作成し、上述のような誤認を減らすために、２次大分
類辞書を再構成する際の文字種のカテゴリ間の移動を制
限する。具体的には、第１の距離計算方法によって得ら
れる距離に閾値を設け、文字種の特徴量とカテゴリの平
均との第１の距離計算方法による距離がこの閾値以内で
ある文字種についてのみ、２次大分類辞書を再構成する
際の文字種のカテゴリ間の移動を行なうように構成する
ことができる。In order to create a category that is more stable with respect to fluctuations in the feature amount and reduce the above-described misidentification, movement of character types between categories when reconstructing the secondary large classification dictionary is limited. Specifically, a threshold value is provided for the distance obtained by the first distance calculation method, and only for the character type whose distance between the feature amount of the character type and the average of the category by the first distance calculation method is within this threshold value, the secondary It can be configured to move between character category categories when reconstructing the large classification dictionary.

【００３４】図９は、本発明の大分類辞書作成装置の実
施の一形態における第２の動作例における第２の大分類
辞書作成部１５の処理の一例を示すフローチャートであ
る。ここでは、上述の第１の動作例と同様に、１文字種
毎の処理だけについて述べるが、実際には文字種全体に
対して図９に示す処理を行ない、カテゴリ間での文字種
の移動が無くなるまで処理を繰り返す。FIG. 9 is a flowchart showing an example of the processing of the second large dictionary creation unit 15 in the second operation example in the embodiment of the large dictionary creation device of the present invention. Here, as in the above-described first operation example, only the processing for each character type will be described. However, in practice, the processing shown in FIG. 9 is performed on the entire character type until the character type does not move between categories. Repeat the process.

【００３５】まずＳ６１において、対象とする文字種を
更新して、Ｓ６２においてその文字種の属しているカテ
ゴリを変数Ｊに格納する。次にＳ６３において、対象文
字種の特徴ベクトルと全カテゴリの平均との距離を第１
の距離計算方法で計算し、Ｓ６４において、第１の距離
計算方法による距離が閾値以内であるカテゴリを選択す
る。Ｓ６５において、Ｓ６４で選択したカテゴリに対し
て、第２の距離計算方法を用いて図５に示した上述の第
１の動作例における２次大分類辞書の作成処理を実施す
る。First, in S61, the target character type is updated, and in S62, the category to which the character type belongs is stored in the variable J. Next, in S63, the distance between the feature vector of the target character type and the average of all categories is set to the first distance.
In step S64, a category whose distance by the first distance calculation method is within a threshold value is selected. In S65, for the category selected in S64, the process of creating the secondary large classification dictionary in the above-described first operation example shown in FIG. 5 is performed using the second distance calculation method.

【００３６】このような処理により、図８における誤認
発生箇所５５の文字種の一部または全部は、２次大分類
辞書を再構成する際にもカテゴリ５１に移動せず、カテ
ゴリ５２に属するように大分類辞書が作成される。これ
によって、特徴量の変動に強い大分類辞書を作成するこ
とができる。もちろん、上述の第１の動作例と同様に、
第２の距離計算方法として実際に認識処理で用いる距離
計算方法を用いることにより、認識処理において認識率
の高い大分類辞書を構成することができる。By such processing, some or all of the character types of the misrecognition occurrence position 55 in FIG. 8 do not move to the category 51 even when reconstructing the secondary large classification dictionary so that they belong to the category 52. A large classification dictionary is created. This makes it possible to create a large classification dictionary that is resistant to variations in the feature amount. Of course, similar to the first operation example described above,
By using the distance calculation method actually used in the recognition processing as the second distance calculation method, a large classification dictionary having a high recognition rate in the recognition processing can be configured.

【００３７】なお、このように構成した２次大分類辞書
を用いた認識処理では、まず、第１の距離計算方法によ
り文字種とカテゴリ間の距離を計算し、その距離が閾値
以内であるカテゴリに対して第２の距離計算方法によっ
てさらに認識を行なうことになる。In the recognition process using the secondary large classification dictionary configured as described above, first, the distance between the character type and the category is calculated by the first distance calculation method, and the distance between the character type and the category is within the threshold value. On the other hand, further recognition is performed by the second distance calculation method.

【００３８】[0038]

【発明の効果】以上の説明から明らかなように、本発明
によれば、文字種の平均を用いて第１の距離計算方法で
１次大分類辞書を作成し、１次大分類辞書内の各カテゴ
リを調べ、第２の距離計算方法によって、カテゴリの平
均と各文字種の特徴の平均との距離の総和を計算し、各
カテゴリにおいて距離の総和が最小となるように文字種
をカテゴリ間で移動させて２次大分類辞書を作成するこ
とにより、実際に識別処理で用いる距離計算方法と計算
尺度を一致させた大分類辞書が得られ、最終的な認識率
の高い大分類辞書を作成することができるという効果が
ある。As is apparent from the above description, according to the present invention, a primary large classification dictionary is created by the first distance calculation method using the average of character types, and each primary classification dictionary in the primary large classification dictionary is created. The categories are checked, and the sum of the distances between the average of the categories and the average of the features of each character type is calculated by the second distance calculation method. By creating a secondary large classification dictionary, a large classification dictionary having the same calculation scale as the distance calculation method actually used in the identification processing can be obtained, and a large classification dictionary with a high final recognition rate can be created. There is an effect that can be.

[Brief description of the drawings]

【図１】本発明の大分類辞書作成装置の実施の一形態
を示す基本構成図である。FIG. 1 is a basic configuration diagram showing an embodiment of a large classification dictionary creation device of the present invention.

【図２】１次ペリフェラル特徴の説明図である。FIG. 2 is an explanatory diagram of a primary peripheral feature.

【図３】特徴量記憶部に記憶された特徴量の一例の説
明図である。FIG. 3 is an explanatory diagram of an example of a feature amount stored in a feature amount storage unit.

【図４】階層的クラスタリングの処理の一例を示すフ
ローチャートである。FIG. 4 is a flowchart illustrating an example of a hierarchical clustering process.

【図５】本発明の大分類辞書作成装置の実施の一形態
における第１の動作例における第２の大分類辞書作成部
の処理の一例を示すフローチャートである。FIG. 5 is a flowchart illustrating an example of a process of a second large classification dictionary creation unit in a first operation example in the embodiment of the large classification dictionary creation device of the present invention.

【図６】投影距離の説明図である。FIG. 6 is an explanatory diagram of a projection distance.

【図７】ユークリッド距離による２つのカテゴリの分
布の説明図である。FIG. 7 is an explanatory diagram of distribution of two categories according to Euclidean distance.

【図８】投影距離による誤認の発生の説明図である。FIG. 8 is an explanatory diagram of occurrence of erroneous recognition due to a projection distance.

【図９】本発明の大分類辞書作成装置の実施の一形態
における第２の動作例における第２の大分類辞書作成部
の処理の一例を示すフローチャートである。FIG. 9 is a flowchart illustrating an example of processing of a second large classification dictionary creation unit in a second operation example according to the embodiment of the large classification dictionary creation device of the present invention.

[Explanation of symbols]

１１…文字画像データファイル、１２…特徴量抽出部、
１３…特徴量記憶部、１４…第１の大分類辞書作成部、
１５…第２の大分類辞書作成部、１６…総距離計算部、
１７…中央制御装置、１８…記憶装置、４１…部分空
間、４２…特徴ベクトル、４３…平均ベクトル、４４…
残差、５１，５２…カテゴリ、５３，５４…部分空間、
５５…誤認発生箇所。11: character image data file, 12: feature amount extraction unit,
13: feature amount storage unit; 14: first large classification dictionary creation unit;
15: second large classification dictionary creation unit, 16: total distance calculation unit,
17 central control device, 18 storage device, 41 partial space, 42 feature vector, 43 average vector, 44
Residual, 51, 52 ... category, 53, 54 ... subspace,
55: Misidentification occurrence location.

Claims

[Claims]

1. A feature extracting means for extracting a multidimensional feature amount from a character image, and a first distance calculation method using the average of the features of each character type to classify the character image into a category composed of a plurality of character types. A first-class classification dictionary
A second major classification dictionary creating means and a total calculating means for calculating the sum of the distances between the average of the categories in the first major classification dictionary created by the first major classification dictionary creating means and the character types in the categories by the second distance calculating method. A distance calculating means, and a second category in each category in the large classification dictionary created by the primary large classification dictionary creating means.
A secondary classification dictionary creating means for creating a secondary classification dictionary by moving each character type between categories so as to minimize the sum of the distances according to the distance calculation method. Dictionary creation device.

2. The method according to claim 1, wherein the second large classification dictionary creating unit determines a category of the character type whose distance between the characteristic of the character type to be moved and the average of the destination category is within a threshold value in the first distance calculation method. 2. The apparatus for creating a large classification dictionary according to claim 1, wherein the dictionary is moved.