JP2002183667A

JP2002183667A - Character-recognizing device and recording medium

Info

Publication number: JP2002183667A
Application number: JP2000377666A
Authority: JP
Inventors: Yoshihisa Oguro; 慶久大黒
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2000-12-12
Filing date: 2000-12-12
Publication date: 2002-06-28

Abstract

PROBLEM TO BE SOLVED: To provide a character recognizing device for quickly executing recognizing processing at a high recognizing rate by setting an optional character set as a recognizing object character set, and a recording medium for recording data and a program for executing the character-recognizing processing. SOLUTION: This character-recognizing device investigates a document of a character-recognizing object, designates a character set proper as a recognizing result as the recognizing object character set, and inputs a pattern intended for reading from a scanner. A characteristic of an image of the recognizing object character set is extracted, and the extracting result is dimensionally compressed. A made characteristic vector is checked with a standard pattern stored in a pattern dictionary, the standard pattern, having a distance near an input pattern in a characteristic quantity space is selected, only a character corresponding to the designated recognizing object character set among the selected standard pattern is selected, and is used as a recognizing result. Thus, an easily mistaking character set, such as a similar character is designated as the recognizing object character set, and the character can be highly accurately recognized at a high speed.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文字認識装置及び
記録媒体に関し、詳細には、任意の文字集合を認識対象
文字集合として設定して言語情報に基づいて高認識率で
速やかに認識処理を行う文字認識装置及び当該文字認識
処理を行うプログラムとデータを記録した記録媒体に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device and a recording medium, and more particularly, to an arbitrary character set as a character set to be recognized, and to quickly perform a recognition process at a high recognition rate based on language information. 1. Field of the Invention The present invention relates to a character recognition device that performs the above-described processing, and a recording medium that stores a program and data that perform the character recognition processing.

【０００２】[0002]

【従来の技術】一般的に、文字認識装置においては、未
知の文字パターンを、辞書に格納されている標準パター
ンと照合して、最も類似度の高い標準パターンの文字コ
ードを、未知の文字パターンの文字コードとしており、
この類似度の算出方法としては、パターンの特徴を、多
次元の特徴量ベクトルに変換し、特徴量空間において、
パターン間の最小距離を求める方法が広く用いられてい
る。2. Description of the Related Art Generally, in a character recognition apparatus, an unknown character pattern is compared with a standard pattern stored in a dictionary, and the character code of the standard pattern having the highest similarity is determined. Character code,
As a method of calculating the similarity, a pattern feature is converted into a multidimensional feature vector, and in a feature space,
A method for obtaining the minimum distance between patterns is widely used.

【０００３】そして、従来の文字認識装置においては、
パターン間の最小距離を求める計算コストを削減するた
めに、次元数の少ない特徴量で大まかに識別(大分類)
し、大分類の結果、類似度が高いと判定したものだけ、
さらに多次元の特徴量を用いて識別(詳細分類)を行なう
という方法が一般的に行われている。In a conventional character recognition device,
In order to reduce the calculation cost for finding the minimum distance between patterns, roughly identify features with a small number of dimensions (large classification)
And only those that have been determined to have a high degree of similarity as a result of the
Further, a method of performing identification (detailed classification) using a multidimensional feature amount is generally performed.

【０００４】この方法は、詳細分類を行なう文字数を減
らすことによって、全体の照合コストを削減するもので
あり、大分類に関しては、全ての文字数の回数だけ行な
う必要がある。[0004] This method is to reduce the total collation cost by reducing the number of characters to be subjected to the detailed classification, and it is necessary to perform the large classification only the number of times of all the characters.

【０００５】また、一回あたりの照合コストを減じる方
法としては、判別分析あるいはＫ−Ｌ展開などによっ
て、識別能力を落すことなく、特徴ベクトルの次元数を
減少させるアルゴリズム(次元圧縮)が提案されている
（中川聖一著「確率モデルによる音声認識」ｐｐ．１５
−１６，電子情報通信学会）。この方法を用いると、特
徴ベクトルの次元数が少なくなれば、特徴ベクトル間の
最小距離を求める計算量を減じることができる。Further, as a method of reducing the collation cost per one time, an algorithm (dimensional compression) for reducing the number of dimensions of a feature vector by discriminant analysis or KL expansion without reducing the discriminating ability has been proposed. (Seiichi Nakagawa, Speech Recognition by Probability Model, pp.15)
-16, IEICE). With this method, if the number of dimensions of the feature vector is reduced, the amount of calculation for finding the minimum distance between the feature vectors can be reduced.

【０００６】そして、一般的な文字認識は、図１１に示
すように行われる。すなわち、スキャナ等のイメージ読
取装置で、読み取り（文字認識）対象のパターンを入力
し（ステップＳ１）、方向コードや輪郭特徴等のイメー
ジの特徴を抽出する（ステップＳ２）。次に、特徴抽出
結果は多次元のベクトルとして表現されているため、特
徴抽出結果を識別能力が落ちないように次元数を減じる
次元圧縮を行って特徴ベクトルを作成し（ステップＳ
３）、作成した特徴ベクトルを、辞書に格納されている
標準パターン（特徴ベクトル）と照合する（ステップＳ
４）。そして、照合結果から、特徴量空間において、入
力パターンと近い距離を持つ標準パターンを選択し（ス
テップＳ５）、選択した標準パターンに相当する文字コ
ードを認識結果として保持して、管理する（ステップＳ
６）。[0006] General character recognition is performed as shown in FIG. That is, an image reading device such as a scanner inputs a pattern to be read (character recognition) (step S1), and extracts image features such as direction codes and contour features (step S2). Next, since the feature extraction result is represented as a multi-dimensional vector, the feature extraction result is subjected to dimensional compression to reduce the number of dimensions so that the discrimination ability does not decrease, and a feature vector is created (step S).
3) Compare the created feature vector with a standard pattern (feature vector) stored in the dictionary (step S)
4). Then, from the collation result, a standard pattern having a distance close to the input pattern is selected in the feature amount space (step S5), and a character code corresponding to the selected standard pattern is stored and managed as a recognition result (step S5).
6).

【０００７】ところが、一般的な日本語活字文書の読み
取りを目的とした文字認識装置では、認識対象文字の集
合としては、ＪＩＳ第一水準文字コードなど、数千種類
を越える、多くの種類を扱っている。このため、未知パ
ターンと標準パターンとの照合回数も数千回にもおよ
び、照合処理にかかるコストは多大なものになる。However, in a character recognition apparatus for reading a general Japanese printed document, as a set of characters to be recognized, there are many types exceeding tens of thousands, such as JIS first-level character codes. ing. For this reason, the number of times of matching between the unknown pattern and the standard pattern is thousands of times, and the cost required for the matching process is enormous.

【０００８】そして、従来、二つの類似文字の差分部分
に注目して、差分部分の特徴を強調するパターン認識装
置が提案されている（特公平７−８９３６４号公報参
照）。Conventionally, a pattern recognition device has been proposed which focuses on the difference between two similar characters and emphasizes the features of the difference (see Japanese Patent Publication No. 7-89364).

【０００９】また、本出願人は、先に、誤認識する毎
に、当該文字パターンをパターン辞書に追加、するか、
あるいは、コンフュージョンテーブルに追加するか、の
判断を行なう文字認識装置を提案している（特開平５−
６２０２０号公報参照）。In addition, the present applicant first adds the character pattern to the pattern dictionary every time the character is erroneously recognized,
Alternatively, there has been proposed a character recognition device for determining whether to add to a confusion table (Japanese Unexamined Patent Application Publication No. Hei.
No. 62020).

【００１０】さらに、従来、最初に大まかに照合処理を
行い、その後、さらに詳細に照合処理を行うことで、処
理量を削減する文字種選択方法、文字認識装置及び記録
媒体が提案されている（特開平１０−２２２６１１号公
報参照）。[0010] Conventionally, there has been proposed a character type selection method, a character recognition device, and a recording medium for reducing a processing amount by first performing a rough matching process and then performing a more detailed matching process (particularly). See JP-A-10-222611).

【００１１】また、従来、画像から得られたパターンご
とに及び二つのパターンの組み合わせを仮に統合した場
合の組み合わせ毎に誤認の可能性の大きさを示す指標を
算出し、二つのパターンの組み合わせに対し、それらを
統合する前後の誤認可能性の各指標を比較して統合した
場合の誤認可能性の大小を判定して、最も類似している
パターンの組み合わせから順に並べて、誤認可能性が少
なくなると判定された組み合わせを統合し、統合後のパ
ターンで再度指標の算出から始め、コリメートレンズを
統合できるパターンの組み合わせがなくなるまで繰り返
すことで識別精度の良好な類似カテゴリ辞書を作成する
類似カテゴリ識別辞書作成装置及び方法が提案されてい
る（特開平１１−２０３４１３号公報参照）。Conventionally, for each pattern obtained from an image and for each combination obtained when a combination of two patterns is temporarily integrated, an index indicating the magnitude of the possibility of misidentification is calculated, and the combination of the two patterns is calculated. On the other hand, comparing the indices of misrecognizability before and after integrating them and judging the magnitude of misrecognition when combining them, sorting them in order from the most similar pattern combination, and reducing the possibility of misrecognition Create a similar category identification dictionary that integrates the determined combinations, starts calculating the index again with the combined pattern, and repeats until there are no more pattern combinations that can integrate the collimating lens, thereby creating a similar category dictionary with good identification accuracy. An apparatus and a method have been proposed (see JP-A-11-203413).

【００１２】[0012]

【発明が解決しようとする課題】しかしながら、このよ
うな従来の文字認識装置にあっては、通常の文字認識を
高速かつ高精度に行うとともに、類似文字に対する認識
性能を向上させる上で、改良の必要があった。However, in such a conventional character recognition device, the conventional character recognition is performed at high speed and with high accuracy, and the improvement of the recognition performance for similar characters is improved. Needed.

【００１３】すなわち、特公平７−８９３６４号公報記
載のパターン認識装置にあっては、二つの類似文字の差
分部分に注目して、差分部分の特徴を強調するため、類
似文字が多数ある場合には、全ての組合せの対による検
査が必要であり、類似文字が増えるに従って、組み合わ
される対の数が指数関数的に増大し、検査に要する処理
時間が長くなるいう問題があった。That is, in the pattern recognition device described in Japanese Patent Publication No. Hei 7-89364, the feature of the difference portion is emphasized by focusing on the difference portion between two similar characters. However, there is a problem in that the inspection by all pairs of combinations is required, and as the number of similar characters increases, the number of pairs to be combined increases exponentially, and the processing time required for the inspection becomes longer.

【００１４】また、特開平５−６２０２０号公報記載の
文字認識装置にあっては、誤認識する毎に、当該文字パ
ターンをパターン辞書に追加、するか、あるいは、コン
フュージョンテーブルに追加するか、の判断を行なうた
め、徐々に性能向上を図ることはできるが、多数の類似
文字に対して、一括して認識性能を向上させることがで
きず、改良の必要があった。In the character recognition device described in Japanese Patent Laid-Open No. 5-62020, every time a character is erroneously recognized, the character pattern is added to a pattern dictionary or added to a confusion table. Therefore, the performance can be gradually improved, but the recognition performance of a large number of similar characters cannot be collectively improved, and thus it is necessary to improve the recognition performance.

【００１５】さらに、特開平１０−２２２６１１号公報
記載の文字種選択方法、文字認識装置及び記録媒体にあ
っては、最初に大まかに照合処理を行い、その後、さら
に詳細に照合処理を行うことで、処理量を削減すること
を主目的としており、詳細処理においても誤認識してし
まうような類似文字に対しては、認識精度を向上させる
ことができないという問題があった。Further, in the character type selection method, character recognition device and recording medium described in Japanese Patent Application Laid-Open No. 10-222611, a collation process is first performed roughly, and then a more detailed collation process is performed. The main purpose is to reduce the amount of processing, and there is a problem that recognition accuracy cannot be improved for similar characters that are erroneously recognized even in detailed processing.

【００１６】また、特開平１１−２０３４１３号公報記
載の類似カテゴリ辞書作成装置及び方法にあっては、類
似文字に対する一般的な対策についての技術であり、類
似文字組のいずれかの文字であることさえわかれば、正
しい認識結果を選択できるような類似文字についても、
詳細識別するため、不必要な処理時間の増大をまねき、
処理速度が遅くなるという問題があった。Further, the similar category dictionary creating apparatus and method described in Japanese Patent Application Laid-Open No. 11-203413 is a technique for general countermeasures for similar characters, and is a character of a similar character set. For similar characters that can select the correct recognition result,
Unnecessary increase in processing time for detailed identification,
There is a problem that the processing speed becomes slow.

【００１７】すなわち、通常の文書においては、類似文
字によって混乱が生じるような単語は避けられる傾向が
ある。例えば、学-字は、類似文字であるため、「学
校」という単語は存在するが、「字校」という単語は存
在しにくい。すなわち、単語の認識結果が「学校」であ
っても「字校」であっても「学校」という単語しか存在
しないのであれば、類似文字組（学-字）を詳細識別す
る必要はなく、類似文字組のいずれかの文字であること
さえわかれば、正しい認識結果「学校」を選択すること
ができる。したがって、このような場合にまで、詳細識
別する従来技術では、不必要な処理時間の増大をまね
き、処理速度が遅くなるという問題があった。That is, in a normal document, a word that causes confusion due to similar characters tends to be avoided. For example, the scholarly character is a similar character, so the word “school” exists, but the word “character school” hardly exists. That is, if the word recognition result is “school” or “character school” and there is only the word “school”, there is no need to identify the similar character set (study-character) in detail. If it is known that the character is any one of the similar character sets, the correct recognition result “school” can be selected. Therefore, even in such a case, the conventional technology for detailed identification has a problem that unnecessary processing time is increased and the processing speed is reduced.

【００１８】そして、上述のように、一般的な日本語活
字文書の読み取りを目的とした文字認識装置では、認識
対象文字の集合としては、ＪＩＳ第一水準文字コードな
ど、数千種類を越える、多くの種類を扱っている。この
ため、未知パターンと標準パターンとの照合回数も数千
回にもおよび、照合処理にかかるコストは多大なものに
なる。As described above, in a character recognition apparatus intended to read a general Japanese type print document, a set of characters to be recognized includes more than several thousand types such as JIS first-level character codes. Handles many types. For this reason, the number of times of matching between the unknown pattern and the standard pattern is thousands of times, and the cost required for the matching process is enormous.

【００１９】ところが、特定の原稿においては、認識対
象文字が限定される場合がある。例えば、ある商品伝票
において、価格欄では、算用数字しか出現しないであろ
うし、製品コード欄には、アルファベットと算用数字と
特定の記号の組み合わせしか出現しない。However, in a specific document, the characters to be recognized may be limited. For example, in a certain product slip, only arithmetic numerals will appear in the price column, and only combinations of alphabets, arithmetic numerals and specific symbols will appear in the product code column.

【００２０】このように、認識対象原稿中に、特定の字
種（たとえば漢字、ひらがな、カタカナ、記号、漢数
字、算用数字など）しか出現しないことがある。As described above, only a specific character type (for example, kanji, hiragana, katakana, a symbol, a kanji numeral, an arithmetic numeral, etc.) may appear in a document to be recognized.

【００２１】また、以下に示すような人間が見て紛らわ
しい、形の似通った文字は、読みの混同を避けるため
に、一方の文字が使われないというルールが設けられる
場合もある。In addition, there is a case in which a rule that one character is not used for a character having a similar shape that is confusing to human eyes as shown below is used in order to avoid confusion of reading.

【００２２】Ｉ（アルファベットｉの大文字）ｌ（アルファベットＬの小文字）１（数字）０（数字）Ｏ（アルファベットｏの大文字）ｏ（アルファベットＯの小文字）２（数字）ｚ（アルファベットＺの大文字）Ｚ（アルファベットｚの小文字）人間が読み間違いしやすい文字列の例「１２３４５」「ｌ２３４５」「１ｚ３４５」このように、認識対象文字の集合は、パターン辞書（画
像特徴格納辞書）を作成した時点、あるいは文字認識装
置が作成された時点において固定されるものではなく、
文字認識装置が使用される度に変化する場合がある。I (uppercase letter of alphabet i) l (lowercase letter of alphabet L) 1 (number) 0 (number) O (uppercase letter of alphabet o) o (lowercase letter of alphabet O) 2 (number) z (uppercase letter of alphabet Z) Z (lowercase letter of the alphabet z) Examples of character strings that are easy for humans to misread "12345""12345""1z345" Or it is not fixed when the character recognition device is created,
It may change each time the character recognition device is used.

【００２３】すなわち、従来のように、認識対象文字が
固定であることを前提に、パターン辞書を作成し、文字
認識装置を作成するよりは、認識対象文字集合が可変で
あることに対応した方が、認識対象文字集合の情報を利
用できる分、高い認識精度を得ることが期待できる。That is, as compared with the case where the pattern dictionary is created and the character recognition device is created on the assumption that the character to be recognized is fixed as in the related art, a method that supports the variable character set to be recognized is used. However, since the information of the character set to be recognized can be used, high recognition accuracy can be expected.

【００２４】そこで、本発明は、類似文字のような誤認
識しやすい文字を予め一括して求めておき、通常の認識
に加えて、類似文字に特化した、詳細な認識を実施する
ことによって、類似文字に対する認識性能を高めること
を目的としている。Therefore, the present invention is to collectively obtain characters which are easily misrecognized such as similar characters in advance and perform detailed recognition specialized for similar characters in addition to ordinary recognition. It is intended to improve recognition performance for similar characters.

【００２５】また、文字認識装置が使用される分野が、
予め、ある程度限定される場合には、当該分野の言語知
識を用いることによって、識別すべき類似文字と識別す
る必要のない類似文字とを求め、類似文字認識にかかる
処理時間を削減しつつ、類似文字の認識性能を向上させ
ることを目的としている。The field in which the character recognition device is used is as follows:
In advance, if it is limited to some extent, similarity characters to be identified and similar characters that do not need to be identified are obtained by using linguistic knowledge in the relevant field, and similarity characters are reduced while reducing the processing time required for similar character recognition. The purpose is to improve the character recognition performance.

【００２６】具体的には、請求項１記載の発明は、文字
画像に対して所定の画像処理を行って画像の特徴を抽出
し、画像特徴格納辞書と照合して、当該画像特徴格納辞
書に格納されている言語情報に基づいて誤り訂正を行っ
て文字認識を行なうに際して、文字画像のうち認識を行
う文字集合を認識対象文字集合として任意に設定可能と
し、当該設定された認識対象文字集合に対してのみ画像
特徴格納辞書を使用して文字認識処理を行うことによ
り、誤認識しやすい文字である類似文字等を、予め調査
して求めて、類似文字間の識別に特化した画像特徴格納
辞書を作成し、認識を行う文字画像のうち当該類似文字
等の誤認しやすい文字集合を認識対象文字集合として指
定して、当該認識対象文字集合についてのみ作成した画
像特徴格納辞書を使用して文字認識を行い、誤認識しや
すい類似文字等の認識精度を向上させて高精度かつ高速
に文字認識を行うことのできる文字認識装置を提供する
ことを目的としている。Specifically, the invention according to claim 1 performs predetermined image processing on a character image to extract image features, compares the extracted features with an image feature storage dictionary, and stores the image feature in the image feature storage dictionary. When performing character correction by performing error correction based on the stored linguistic information, a character set to be recognized among character images can be arbitrarily set as a recognition target character set, and the set recognition target character set By performing character recognition processing using only the image feature storage dictionary for similar characters, similar characters that are easily misrecognized, etc., are investigated in advance and found, and image feature storage specialized for discrimination between similar characters is performed. A dictionary is created, and a character set that is likely to be misidentified, such as a similar character, among character images to be recognized is designated as a character set to be recognized, and an image feature storage dictionary created only for the character set to be recognized is used. It is intended to perform character recognition, to provide a character recognition device capable of to improve the recognition accuracy of such erroneous recognition easily similar character performs character recognition with high accuracy and high speed.

【００２７】請求項２記載の発明は、認識対象文字集合
を適宜変更可能とし、当該変更された認識文字集合に対
して順次文字認識処理を行うことにより、誤認識しやす
い類似文字等の認識精度をより一層向上させ、高精度か
つ高速に文字認識を行うことのできる文字認識装置を提
供することを目的としている。According to the second aspect of the present invention, the character set to be recognized can be appropriately changed, and character recognition processing is sequentially performed on the changed character set to be recognized, so that the recognition accuracy of similar characters and the like which are easily erroneously recognized can be improved. It is an object of the present invention to provide a character recognition device capable of performing character recognition with high accuracy and high speed.

【００２８】請求項３記載の発明は、認識すべき文字毎
に、認識対象文字集合を設定可能とし、当該設定された
認識対象文字集合に対して文字認識処理を行うことによ
り、誤認識しやすい類似文字等の認識精度をより一層速
やかに認識し、高精度かつより一層高速に文字認識を行
うことのできる文字認識装置を提供することを目的とし
ている。According to the third aspect of the present invention, a set of characters to be recognized can be set for each character to be recognized, and a character recognition process is performed on the set of characters to be recognized. It is an object of the present invention to provide a character recognition device capable of recognizing recognition accuracy of similar characters and the like more quickly and performing high-accuracy and higher-speed character recognition.

【００２９】請求項４記載の発明は、認識対象文字集合
の設定を、画像特徴格納辞書に格納されている言語情報
に基づいて行えるようにすることにより、文字認識の対
象となる原稿の分野が限定されている場合に、その分野
に特有な言語的知識を用いて、類似文字の詳細識別の必
要性の有無を決定し、類似文字の認識精度を落すことな
く、詳細識別にかかる処理量を低減し、より高精度かつ
より一層高速に文字認識を行うことのできる文字認識装
置を提供することを目的としている。According to the fourth aspect of the present invention, the character set to be recognized can be set based on the linguistic information stored in the image feature storage dictionary. When limited, it is possible to determine the necessity of detailed identification of similar characters using linguistic knowledge specific to the field, and to reduce the amount of processing required for detailed identification without reducing the recognition accuracy of similar characters. It is an object of the present invention to provide a character recognition device that can reduce the number of characters and perform character recognition with higher accuracy and higher speed.

【００３０】請求項５記載の発明は、画像特徴格納辞書
に格納されている言語情報に基づいて、複数の文字をま
とめて１つのグループとして文字データと同様に認識対
象文字として設定できるようにすることにより、文字認
識の対象となる原稿に出現する単語が固定されていた
り、出現頻度が明確な場合に、言語的知識を用いてその
出現有無や頻度等を判別して、類似文字の詳細識別の必
要性の有無を決定し、類似文字の認識精度を落すことな
く、詳細識別にかかる処理量を低減し、より高精度かつ
より一層高速に文字認識を行うことのできる文字認識装
置を提供することを目的としている。According to a fifth aspect of the present invention, a plurality of characters can be collectively set as one group as a recognition target character in the same manner as character data, based on language information stored in an image feature storage dictionary. If words appearing in a document subject to character recognition are fixed or the frequency of occurrence is clear, the presence / absence and frequency of such words are determined using linguistic knowledge to identify similar characters in detail. A character recognition device capable of determining whether or not the character recognition is necessary, reducing the amount of processing for detailed identification without lowering the recognition accuracy of similar characters, and performing more accurate and faster character recognition. It is intended to be.

【００３１】請求項６記載の発明は、請求項１から請求
項５のいずれかに記載の文字認識処理を実行するのに必
要な文字認識処理プログラム及び前記画像特徴格納辞書
等の前記文字認識処理プログラムを実行するのに必要な
データを記録することにより、既存のハードウェア資源
を用いて文字認識処理プログラムを実行して、認識を行
う文字画像のうち当該類似文字等の誤認しやすい文字集
合を認識対象文字集合として指定して、当該認識対象文
字集合についてのみ作成した画像特徴格納辞書を使用し
て文字認識を行い、誤認識しやすい類似文字等の認識精
度を向上させて高精度かつ高速に文字認識を行うことの
できる記録媒体を提供することを目的としている。According to a sixth aspect of the present invention, there is provided a character recognition processing program for executing the character recognition processing according to any one of the first to fifth aspects, and the character recognition processing such as the image feature storage dictionary. By recording the data necessary to execute the program, the character recognition processing program is executed using the existing hardware resources, and a character set that is likely to be misidentified such as the similar character is recognized from among the character images to be recognized. Designate as a character set to be recognized, perform character recognition using the image feature storage dictionary created only for the character set to be recognized, and improve the recognition accuracy of similar characters that are easy to misrecognize to achieve high accuracy and high speed. It is an object of the present invention to provide a recording medium capable of performing character recognition.

【００３２】[0032]

【課題を解決するための手段】請求項１記載の発明の文
字認識装置は、文字画像に対して所定の画像処理を行っ
て画像の特徴を抽出し、画像特徴格納辞書と照合して、
当該画像特徴格納辞書に格納されている言語情報に基づ
いて誤り訂正を行って文字認識を行なう文字認識装置に
おいて、前記文字画像のうち認識を行う文字集合を認識
対象文字集合として任意に設定可能とし、当該設定され
た認識対象文字集合に対してのみ前記画像特徴格納辞書
を使用して文字認識処理を行うことにより、上記目的を
達成している。According to a first aspect of the present invention, there is provided a character recognition apparatus which performs predetermined image processing on a character image to extract image features, and compares the extracted image features with an image feature storage dictionary.
In a character recognition device that performs character correction by performing error correction based on linguistic information stored in the image feature storage dictionary, a character set to be recognized among the character images can be arbitrarily set as a character set to be recognized. The above object is achieved by performing a character recognition process using only the set of recognition target character sets using the image feature storage dictionary.

【００３３】上記構成によれば、文字画像に対して所定
の画像処理を行って画像の特徴を抽出し、画像特徴格納
辞書と照合して、当該画像特徴格納辞書に格納されてい
る言語情報に基づいて誤り訂正を行って文字認識を行な
うに際して、文字画像のうち認識を行う文字集合を認識
対象文字集合として任意に設定可能とし、当該設定され
た認識対象文字集合に対してのみ画像特徴格納辞書を使
用して文字認識処理を行うので、誤認識しやすい文字で
ある類似文字等を、予め調査して求めて、類似文字間の
識別に特化した画像特徴格納辞書を作成し、認識を行う
文字画像のうち当該類似文字等の誤認しやすい文字集合
を認識対象文字集合として指定して、当該認識対象文字
集合についてのみ作成した画像特徴格納辞書を使用して
文字認識を行うことができ、誤認識しやすい類似文字等
の認識精度を向上させて高精度かつ高速に文字認識を行
うことができる。According to the above arrangement, a predetermined image processing is performed on the character image to extract the image feature, collate the image feature with the image feature storage dictionary, and extract the language information stored in the image feature storage dictionary. When performing character recognition by performing error correction based on a character set to be recognized among character images, the character set to be recognized can be arbitrarily set as a character set to be recognized, and an image feature storage dictionary is provided only for the set character set to be recognized. Is used to perform character recognition processing. Therefore, a similar character or the like that is easily misrecognized is searched and found in advance, and an image feature storage dictionary specialized for discriminating between similar characters is created and recognition is performed. A character set that is easily misidentified such as similar characters in character images is specified as a character set to be recognized, and character recognition is performed using an image feature storage dictionary created only for the character set to be recognized. It can be, to improve the recognition accuracy of such erroneous recognition easily similar character can perform character recognition with high accuracy and high speed.

【００３４】この場合、例えば、請求項２に記載するよ
うに、前記文字認識装置は、前記認識対象文字集合を適
宜変更可能とし、当該変更された認識文字集合に対して
順次前記文字認識処理を行うものであってもよい。In this case, for example, as described in claim 2, the character recognition device enables the character set to be recognized to be appropriately changed, and sequentially performs the character recognition process on the changed character set to be recognized. It may be performed.

【００３５】上記構成によれば、認識対象文字集合を適
宜変更可能とし、当該変更された認識文字集合に対して
順次文字認識処理を行うので、誤認識しやすい類似文字
等の認識精度をより一層向上させることができ、高精度
かつ高速に文字認識を行うことができる。According to the above configuration, the character set to be recognized can be changed as appropriate, and the character recognition process is sequentially performed on the changed character set to be recognized. The character recognition can be performed with high accuracy and high speed.

【００３６】また、例えば、請求項３に記載するよう
に、前記文字認識装置は、認識すべき文字毎に、前記認
識対象文字集合を設定可能とし、当該設定された認識対
象文字集合に対して前記文字認識処理を行うものであっ
てもよい。In addition, for example, the character recognition device can set the character set to be recognized for each character to be recognized, and can set the character set to be recognized. The character recognition processing may be performed.

【００３７】上記構成によれば、認識すべき文字毎に、
認識対象文字集合を設定可能とし、当該設定された認識
対象文字集合に対して文字認識処理を行うので、誤認識
しやすい類似文字等の認識精度をより一層速やかに認識
することができ、高精度かつより一層高速に文字認識を
行うことができる。According to the above arrangement, for each character to be recognized,
The recognition target character set can be set, and character recognition processing is performed on the set recognition target character set. Therefore, the recognition accuracy of similar characters and the like that are likely to be erroneously recognized can be more quickly recognized, and high accuracy can be achieved. In addition, character recognition can be performed even faster.

【００３８】さらに、例えば、請求項４に記載するよう
に、前記文字認識装置は、前記画像特徴格納辞書に格納
されている前記言語情報に基づいて前記認識対象文字集
合の設定可能とし、当該設定された認識対象文字集合に
対して前記文字認識処理を行うものであってもよい。Further, for example, as set forth in claim 4, the character recognition device enables the character set to be recognized to be set based on the language information stored in the image feature storage dictionary. The character recognition processing may be performed on the set of recognition target characters.

【００３９】上記構成によれば、認識対象文字集合の設
定を、画像特徴格納辞書に格納されている言語情報に基
づいて行えるようにしているので、文字認識の対象とな
る原稿の分野が限定されている場合に、その分野に特有
な言語的知識を用いて、類似文字の詳細識別の必要性の
有無を決定し、類似文字の認識精度を落すことなく、詳
細識別にかかる処理量を低減することができ、より高精
度かつより一層高速に文字認識を行うことができる。According to the above configuration, the character set to be recognized can be set based on the linguistic information stored in the image feature storage dictionary, so that the field of the document to be subjected to character recognition is limited. Linguistic knowledge specific to the field, determine whether or not detailed identification of similar characters is necessary, and reduce the amount of processing required for detailed identification without reducing the recognition accuracy of similar characters. Thus, character recognition can be performed with higher precision and at higher speed.

【００４０】また、例えば、請求項５に記載するよう
に、前記文字認識装置は、前記画像特徴格納辞書に格納
されている前記言語情報に基づいて、複数の文字をまと
めて１つのグループとして文字データと同様に前記認識
対象文字として設定可能とし、当該設定された認識対象
文字集合に対して前記文字認識処理を行うものであって
もよい。Further, for example, as described in claim 5, the character recognition device collects a plurality of characters into one group based on the language information stored in the image feature storage dictionary. The data may be set as the recognition target character similarly to the data, and the character recognition processing may be performed on the set recognition target character set.

【００４１】上記構成によれば、画像特徴格納辞書に格
納されている言語情報に基づいて、複数の文字をまとめ
て１つのグループとして文字データと同様に認識対象文
字として設定できるようにしているので、文字認識の対
象となる原稿に出現する単語が固定されていたり、出現
頻度が明確な場合に、言語的知識を用いてその出現有無
や頻度等を判別して、類似文字の詳細識別の必要性の有
無を決定し、類似文字の認識精度を落すことなく、詳細
識別にかかる処理量を低減することができ、より高精度
かつより一層高速に文字認識を行うことができる。According to the above configuration, based on the language information stored in the image feature storage dictionary, a plurality of characters can be collectively set as one group as a recognition target character similarly to character data. If the words appearing in the document subject to character recognition are fixed or the frequency of occurrence is clear, it is necessary to determine the occurrence and frequency of such words using linguistic knowledge to identify similar characters in detail. It is possible to determine the presence or absence of the character and reduce the processing amount for the detailed identification without lowering the recognition accuracy of the similar character, and perform the character recognition with higher accuracy and higher speed.

【００４２】請求項６記載の発明の記録媒体は、請求項
１から請求項５のいずれかに記載の文字認識処理を実行
するのに必要な文字認識処理プログラム及び前記画像特
徴格納辞書等の前記文字認識処理プログラムを実行する
のに必要なデータを記録することにより、上記目的を達
成している。According to a sixth aspect of the present invention, there is provided a recording medium including a character recognition processing program necessary for executing the character recognition processing according to any one of the first to fifth aspects and the image feature storage dictionary. The object is achieved by recording data necessary for executing the character recognition processing program.

【００４３】上記構成によれば、請求項１から請求項５
のいずれかに記載の文字認識処理を実行するのに必要な
文字認識処理プログラム及び前記画像特徴格納辞書等の
前記文字認識処理プログラムを実行するのに必要なデー
タを記録しているので、既存のハードウェア資源を用い
て文字認識処理プログラムを実行して、認識を行う文字
画像のうち当該類似文字等の誤認しやすい文字集合を認
識対象文字集合として指定して、当該認識対象文字集合
についてのみ作成した画像特徴格納辞書を使用して文字
認識を行うことができ、誤認識しやすい類似文字等の認
識精度を向上させて高精度かつ高速に文字認識を行うこ
とができる。According to the above configuration, the present invention is characterized in that:
Since the data necessary for executing the character recognition processing program such as the character recognition processing program required to execute the character recognition processing described in any one of the above and the image feature storage dictionary or the like is recorded, the existing data is recorded. Executes a character recognition processing program using hardware resources, designates a character set that is likely to be misidentified such as similar characters among character images to be recognized as a character set to be recognized, and creates only the character set to be recognized. Character recognition can be performed using the image feature storage dictionary thus obtained, and the accuracy of recognition of similar characters or the like which are likely to be erroneously recognized can be improved to perform high-accuracy and high-speed character recognition.

【００４４】[0044]

【発明の実施の形態】以下、本発明の好適な実施の形態
を添付図面に基づいて詳細に説明する。なお、以下に述
べる実施の形態は、本発明の好適な実施の形態であるか
ら、技術的に好ましい種々の限定が付されているが、本
発明の範囲は、以下の説明において特に本発明を限定す
る旨の記載がない限り、これらの態様に限られるもので
はない。特に、認識対象文書を予め調査することによっ
て、認識対象の文字集合を限定することが可能な場合の
文字認識装置に、広く適用することができる。Preferred embodiments of the present invention will be described below in detail with reference to the accompanying drawings. It should be noted that the embodiments described below are preferred embodiments of the present invention, and therefore, various technically preferable limitations are added. However, the scope of the present invention is not limited to the following description. The embodiments are not limited to these embodiments unless otherwise specified. In particular, the present invention can be widely applied to a character recognition device in which a character set to be recognized can be limited by checking a document to be recognized in advance.

【００４５】図１〜図４は、本発明の文字認識装置の第
１の実施の形態を示す図であり、本実施の形態は、請求
項１に対応するものである。図１は、本発明の文字認識
装置の第１の実施の形態を適用した文字認識装置１の概
念図である。FIGS. 1 to 4 show a first embodiment of a character recognition apparatus according to the present invention. This embodiment corresponds to claim 1 of the present invention. FIG. 1 is a conceptual diagram of a character recognition device 1 to which a first embodiment of the character recognition device of the present invention is applied.

【００４６】図１において、文字認識装置１は、パター
ン辞書２、パターン入力手段３、特徴抽出手段４、次元
圧縮手段５、パターン照合手段６、最小距離パターン選
択手段７、最小パターン保持・管理手段８、文字選択手
段９、認識対象文字指定手段１０及び認識対象文字集合
保持手段１１等を備えている。In FIG. 1, a character recognition device 1 includes a pattern dictionary 2, a pattern input unit 3, a feature extraction unit 4, a dimension compression unit 5, a pattern comparison unit 6, a minimum distance pattern selection unit 7, a minimum pattern holding / managing unit. 8, a character selection unit 9, a recognition target character designation unit 10, a recognition target character set holding unit 11, and the like.

【００４７】パターン辞書（画像特徴格納辞書）２は、
学習用サンプルの特徴抽出結果を次元圧縮されたもの
を、特徴量ベクトル（標準パターン）として保持してい
る。The pattern dictionary (image feature storage dictionary) 2
A dimensionally compressed result of the feature extraction of the learning sample is stored as a feature amount vector (standard pattern).

【００４８】パターン入力手段３は、スキャナ等の読み
取りを意図するイメージを入力する手段であり、特徴抽
出手段４は、パターン入力手段３によって入力されたイ
メージから画像の特徴を抽出する。この特徴抽出手段４
の抽出結果は、多次元のベクトルとして表現されてい
る。The pattern input means 3 is a means for inputting an image intended to be read by a scanner or the like, and the feature extracting means 4 extracts image features from the image input by the pattern input means 3. This feature extraction means 4
Is expressed as a multidimensional vector.

【００４９】次元圧縮手段５は、特徴抽出手段４の抽出
結果を、識別能力が落ちないように次元数を減じ、パタ
ーン照合手段６は、次元圧縮手段５で作成した特徴量ベ
クトルと、パターン辞書２に保持されている標準パター
ンとを照合して、特徴量空間における両者の距離を求め
る。The dimension compressing means 5 reduces the number of dimensions of the extraction result of the feature extracting means 4 so that the discriminating ability does not deteriorate, and the pattern matching means 6 compares the feature vector created by the dimension compressing means 5 with the pattern dictionary. Then, the distance between the two in the feature amount space is obtained by collating with the standard pattern held in 2.

【００５０】最小距離パターン選択手段７は、パターン
照合手段６の照合結果に基づいて、入力パターンと近い
距離にある標準パターンを選択し、この場合、複数の標
準パターンを選択してもよい。The minimum distance pattern selecting means 7 selects a standard pattern close to the input pattern based on the result of the comparison by the pattern comparing means 6, and in this case, a plurality of standard patterns may be selected.

【００５１】最小距離パターン保持・管理手段８は、最
小距離パターン選択手段７で選択された標準パターンを
保持・管理し、文字選択手段９は、最小距離パターン保
持・管理手段８に保持・管理されている認識結果候補の
うち、指定された文字集合に該当する結果を選択する。The minimum distance pattern holding / managing means 8 holds / manages the standard pattern selected by the minimum distance pattern selecting means 7, and the character selecting means 9 is held / managed by the minimum distance pattern holding / managing means 8. Of the candidate recognition results, the result corresponding to the specified character set is selected.

【００５２】認識対象文字指定手段１０は、認識結果と
して必要な文字の集合を予め指定する。例えば、認識対
象文字指定手段１０は、｛数字｝、｛アルファベッ
ト｝、｛ひらがな｝、｛カタカナ｝等の字種や｛Ａ，
Ｂ，Ｃ｝、｛１，２，３｝等、実際に該当する文字を列
挙して指定する。このような指定は、正規表現を利用す
ることで、容易に実現することができる。The recognition target character specification means 10 specifies in advance a set of characters required as a recognition result. For example, the recognition target character designating means 10 may use a character type such as {number}, {alphabet}, {hiragana}, {katakana}, or {A,
B, C}, {1, 2, 3}, etc., are enumerated and specified. Such designation can be easily realized by using a regular expression.

【００５３】認識対象文字集合保持手段１１は、認識対
象文字指定手段１０によって指定された文字集合を保持
する。The recognition target character set holding means 11 holds the character set specified by the recognition target character specification means 10.

【００５４】上記文字認識装置１は、具体的には、例え
ば、図２に示すようなコンピュータ２０のハードウェア
を利用して、ソフトウェアで実現される。すなわち、コ
ンピュータ２０は、ＣＰＵ（Central Processing Unit
）２１、メモリ２２、表示装置２３、ハードディスク
２４、キーボード２５、ＣＤ（Compact Disc）−ＲＯＭ
（Read Only Memory）ドライブ２６、フロッピー（登録
商標）ディスク（ＦＤ）ドライブ２７及び通信装置２８
等を備えており、主要各部は、バス２９で接続されてい
る。The character recognition device 1 is specifically realized by software using the hardware of a computer 20 as shown in FIG. 2, for example. That is, the computer 20 has a CPU (Central Processing Unit).
) 21, memory 22, display device 23, hard disk 24, keyboard 25, CD (Compact Disc) -ROM
(Read Only Memory) drive 26, floppy (registered trademark) disk (FD) drive 27, and communication device 28
The main parts are connected by a bus 29.

【００５５】本実施の形態の文書認識ソフトウェアプロ
グラム及びパターン辞書２の辞書データ等の必要なデー
タを記憶した記録媒体としてのＣＤ−ＲＯＭあるいはフ
ロッピーディスクをＣＤ−ＲＯＭドライブ２６あるいは
ＦＤドライブ２７に挿入して、当該文字認識ソフトウェ
アプログラム及び必要なデータを、例えば、ハードディ
スク２４に読み込み、ＣＰＵ２１がハードディスク２４
内の当該文字認識ソフトウェアプログラムをメモリ２２
をワークメモリとして利用しつつ、実行することで、上
記図１に示した文字認識装置１を実現することができ
る。A CD-ROM or floppy disk as a recording medium storing the document recognition software program of this embodiment and necessary data such as dictionary data of the pattern dictionary 2 is inserted into the CD-ROM drive 26 or the FD drive 27. The character recognition software program and necessary data are read into, for example, the hard disk 24, and the CPU 21
The character recognition software program in the memory 22
The character recognition device 1 shown in FIG. 1 described above can be realized by executing while using as a work memory.

【００５６】そして、コンピュータ２０は、ＣＰＵ２１
の制御下で、例えば、キーボード２５あるいは図示しな
いマウス等のポインティングディバイス等から各種命令
等が入力操作され、必要な情報を表示装置２３に表示出
力させる。The computer 20 has a CPU 21
Under the control of, various commands are input from the keyboard 25 or a pointing device such as a mouse (not shown), and necessary information is displayed and output on the display device 23.

【００５７】通信装置２８には、電話回線やＬＡＮ（Lo
cal Area Network）等の各種通信回線が接続され、コン
ピュータ２０は、通信装置２８を介して通信装置２８に
接続された通信回線を介して他のコンピュータ等の各種
情報処理装置や記憶媒体等との間で情報の授受を行う。The communication device 28 includes a telephone line and a LAN (Lo
The computer 20 is connected to various information processing devices such as other computers and storage media via a communication line connected to the communication device 28 via the communication device 28. Transfer information between the two.

【００５８】また、文字認識装置１は、図３に示すよう
に、インターネット等の通信回線を介して接続された複
数のコンピュータＰＣ１〜ＰＣｎ（図３では、ＰＣ１〜
ＰＣ３のみ記載）で構築することもできる。すなわち、
文字認識装置１を構成する各手段、特に、パターン辞書
２を通信回線を介して接続された１つあるいは複数のコ
ンピュータＰＣ１〜ＰＣｎに蓄積させ、通信回線で接続
されたそれぞれのコンピュータＰＣ１〜ＰＣｎ等がその
ハードディスク等に搭載されている文字認識ソフトウェ
アプログラムで文字認識処理を実行するに際して、他の
コンピュータＰＣ１〜ＰＣｎのパターン辞書２を参照す
る等のように、文字認識装置１の各手段を複数のコンピ
ュータＰＣ１〜ＰＣｎに分散させたり、あるいは、相互
に保管する形態で文字認識装置１を実現することもでき
る。As shown in FIG. 3, the character recognition apparatus 1 includes a plurality of computers PC1 to PCn (PC1 to PCn in FIG. 3) connected via a communication line such as the Internet.
(Only PC3 is described). That is,
Each means constituting the character recognition device 1, particularly, the pattern dictionary 2 is stored in one or a plurality of computers PC1 to PCn connected via a communication line, and the respective computers PC1 to PCn connected via the communication line When executing character recognition processing with a character recognition software program mounted on the hard disk or the like, each unit of the character recognition device 1 is referred to as a plurality of units such as referring to the pattern dictionary 2 of the other computers PC1 to PCn. The character recognition device 1 can be realized in a form of being distributed to the computers PC1 to PCn or stored in each other.

【００５９】次に、本実施の形態の作用を説明する。本
実施の形態の文字認識装置１は、図４に示すように、認
識対象文字指定手段１０により、まず、文字認識対象の
原稿を調査して、認識結果として妥当な文字集合を認識
対象文字集合として指定して、認識対象文字指定手段１
０の指定した認識対象文字集合を認識対象文字集合保持
手段１１で保持し（ステップＳ１０１）、スキャナ等の
イメージ読取装置であるパターン入力手段３から読み取
りを意図するパターンを入力する（ステップＳ１０
２）。Next, the operation of the present embodiment will be described. As shown in FIG. 4, the character recognition apparatus 1 according to the present embodiment first examines a document to be subjected to character recognition by the recognition target character designating unit 10 and determines a character set appropriate as a recognition result. As a recognition target character specifying means 1
The recognition target character set designated as 0 is held in the recognition target character set holding unit 11 (step S101), and a pattern intended to be read is input from the pattern input unit 3 which is an image reading device such as a scanner (step S10).
2).

【００６０】次に、文字認識装置１は、特徴抽出手段４
でイメージの特徴を抽出する（ステップＳ１０３）。特
徴抽出手段４は、このイメージの特徴の抽出を、方向コ
ードや輪郭特徴など各種を利用して行い、特徴抽出結果
は、多次元のベクトルとして表現される。特徴抽出手段
４の抽出結果を、次元圧縮手段５が識別能力が落ちない
ように次元圧縮して次元数を減じる（ステップＳ１０
４）。Next, the character recognition device 1 includes a feature extracting means 4
To extract image features (step S103). The feature extracting means 4 performs the feature extraction of the image using various types such as a direction code and a contour feature, and the feature extraction result is expressed as a multidimensional vector. The number of dimensions is reduced by dimensionally compressing the extraction result of the feature extracting means 4 so that the discriminating ability is not reduced by the dimension compressing means 5 (step S10).
4).

【００６１】次に、パターン照合手段６が、上記作成さ
れた特徴ベクトルを、パターン辞書２に格納されている
標準パターン（特徴ベクトル）と照合し（ステップＳ１
０５）、最小距離パターン選択手段７が、特徴量空間に
おいて、入力パターンと近い距離を持つ標準パターンを
選択して、最小距離パターン保持・管理手段８がこの選
択された標準パターンを保持・管理する（ステップＳ１
０６）。Next, the pattern matching means 6 compares the created feature vector with a standard pattern (feature vector) stored in the pattern dictionary 2 (step S1).
05), the minimum distance pattern selecting means 7 selects a standard pattern having a distance close to the input pattern in the feature amount space, and the minimum distance pattern holding / managing means 8 holds / manages the selected standard pattern. (Step S1
06).

【００６２】そして、文字選択手段９が、最小距離パタ
ーン保持・管理手段８の保持・管理している標準パター
ンのうち、認識対象文字指定手段１０で指定されて認識
対象文字集合保持手段１１の保持している認識対象文字
集合に該当する文字のみを選択して、認識結果１２（図
１参照）とする（ステップＳ１０７）。Then, the character selection means 9 is specified by the recognition target character specification means 10 and stored in the recognition target character set storage means 11 among the standard patterns stored and managed by the minimum distance pattern storage and management means 8. Only the characters corresponding to the set of characters to be recognized are selected, and are set as a recognition result 12 (see FIG. 1) (step S107).

【００６３】このように、本実施の形態の文字認識装置
１は、文字画像に対して所定の画像処理を行って画像の
特徴を抽出し、パターン辞書２と照合して、当パターン
辞書２に格納されている言語情報に基づいて誤り訂正を
行って文字認識を行なうに際して、文字画像のうち認識
を行う文字集合を認識対象文字集合として任意に設定可
能とし、当該設定された認識対象文字集合に対してのみ
パターン辞書２を使用して文字認識処理を行っている。As described above, the character recognition device 1 of the present embodiment performs predetermined image processing on a character image to extract the features of the image, compares it with the pattern dictionary 2, When performing character correction by performing error correction based on the stored linguistic information, a character set to be recognized among character images can be arbitrarily set as a recognition target character set, and the set recognition target character set The character recognition process is performed by using the pattern dictionary 2 only for this.

【００６４】したがって、誤認識しやすい文字である類
似文字等を、予め調査して求めて、類似文字間の識別に
特化したパターン辞書２を作成し、認識を行う文字画像
のうち当該類似文字等の誤認しやすい文字集合を認識対
象文字集合として指定して、当該認識対象文字集合につ
いてのみ作成したパターン辞書２を使用して文字認識を
行うことができ、誤認識しやすい類似文字等の認識精度
を向上させて高精度かつ高速に文字認識を行うことがで
きる。Therefore, a similar character or the like which is easily misrecognized is obtained by examining in advance, and a pattern dictionary 2 specialized for discriminating between similar characters is created. A character set that is easily misidentified is designated as a character set to be recognized, and character recognition can be performed using the pattern dictionary 2 created only for the character set to be recognized. Character recognition can be performed with high accuracy and high speed by improving the accuracy.

【００６５】図５及び図６は、本発明の文字認識装置の
第２の実施の形態を示す図であり、本実施の形態は、請
求項２に対応するものである。FIGS. 5 and 6 show a character recognition apparatus according to a second embodiment of the present invention. This embodiment corresponds to claim 2 of the present invention.

【００６６】なお、本実施の形態は、上記第１の実施の
形態の文字認識装置１と同様の文字認識装置に適用した
ものであり、本実施の形態の説明においては、上記第１
の実施の形態と同様の構成部分には、同一の符号を付し
て、その詳細な説明を省略する。This embodiment is applied to a character recognition device similar to the character recognition device 1 of the first embodiment. In the description of the present embodiment, the first embodiment will be described.
The same components as those of the embodiment are denoted by the same reference numerals, and detailed description thereof will be omitted.

【００６７】図５は、本発明の文字認識装置の第２の実
施の形態を適用した文字認識装置３０の概念図であり、
本実施の形態の文字認識装置３０も、上記第１の実施の
形態の文字認識装置１と同様に、図２に示したコンピュ
ータ２０あるいは図３に示した通信回線で接続された複
数のコンピュータＰＣ１〜ＰＣｎ等で実現される。FIG. 5 is a conceptual diagram of a character recognition device 30 to which the second embodiment of the character recognition device of the present invention is applied.
Like the character recognition device 1 of the first embodiment, the character recognition device 30 of the present embodiment also includes a computer 20 shown in FIG. 2 or a plurality of computers PC1 connected by a communication line shown in FIG. ~ PCn and the like.

【００６８】図５において、文字認識装置３０は、上記
第１の実施の形態の文字認識装置１と同様のパターン辞
書２、パターン入力手段３、特徴抽出手段４、次元圧縮
手段５、パターン照合手段６、最小距離パターン選択手
段７、最小パターン保持・管理手段８、文字選択手段
９、認識対象文字指定手段１０及び認識対象文字集合保
持手段１１等を備えているとともに、次元圧縮手段３１
と学習サンプル特徴ベクトル保持手段３２等を備えてい
る。In FIG. 5, a character recognition device 30 includes a pattern dictionary 2, a pattern input device 3, a feature extraction device 4, a dimension compression device 5, a pattern collation device similar to the character recognition device 1 of the first embodiment. 6, a minimum distance pattern selecting unit 7, a minimum pattern holding / managing unit 8, a character selecting unit 9, a recognition target character designating unit 10, a recognition target character set holding unit 11, and the like.
And learning sample feature vector holding means 32 and the like.

【００６９】上記パターン辞書２、パターン入力手段
３、特徴抽出手段４、次元圧縮手段５、パターン照合手
段６、最小距離パターン選択手段７、最小パターン保持
・管理手段８、文字選択手段９、認識対象文字指定手段
１０及び認識対象文字集合保持手段１１は、上記第１の
実施の形態と同様であるため、その説明を省略する。The pattern dictionary 2, pattern input means 3, feature extraction means 4, dimension compression means 5, pattern collation means 6, minimum distance pattern selection means 7, minimum pattern holding / management means 8, character selection means 9, recognition target The character designating unit 10 and the recognition target character set holding unit 11 are the same as those in the first embodiment, and a description thereof will be omitted.

【００７０】次元圧縮手段３１は、認識対象文字指定手
段１０で指定されて認識対象文字集合保持手段１１に保
持されている文字集合に基づいて、学習サンプルの特徴
ベクトル、すなわち、次元圧縮前の特徴ベクトルを次元
圧縮する。The dimension compressing means 31 is based on the character set specified by the recognition target character designating means 10 and held in the recognition target character set holding means 11, ie, the feature vector of the learning sample, that is, the feature vector before the dimension compression. Dimensionally compress a vector.

【００７１】学習作特徴ベクトル保持手段３２は、学習
サンプルのイメージを特徴抽出した結果を多次元のベク
トルとして保持する。The learning work feature vector holding means 32 holds the result of feature extraction of the image of the learning sample as a multidimensional vector.

【００７２】次に、本実施の形態の作用を説明する。本
実施の形態の文字認識装置３０は、全ての文字の、次元
圧縮前の特徴ベクトルを学習サンプル特徴ベクトル３２
に保持し、認識対象文字集合が変わる度に、次元圧縮処
理を行なった後、実際の認識処理を行なうことで、精度
よく次元圧縮を行って、認識率をより一層向上させてい
る。Next, the operation of the present embodiment will be described. The character recognition device 30 according to the present embodiment converts feature vectors of all characters before dimension compression into learning sample feature vectors 32.
Each time the recognition target character set is changed, a dimension compression process is performed, and then an actual recognition process is performed, whereby the dimension compression is performed with high accuracy, and the recognition rate is further improved.

【００７３】すなわち、上記請求項１記載の文字認識装
置１においては、認識結果の中から、認識対象文字指定
手段１０で指定された文字だけを、選択しているが、パ
ターン辞書２が次元圧縮されている場合には、次元圧縮
の精度が悪くなることがある。例えば、次元圧縮は、全
ての文字の特徴ベクトルに対して、次元圧縮前と次元圧
縮後とにおいて平均誤差が最小となること、あるいは、
カテゴリ間の分離が最大になること、のような基準を満
たす変換軸を求めるものであるから、認識対象文字集合
が変化すれば、変換軸を改めて求めなければ、精度良く
次元圧縮することができない。That is, in the character recognition device 1 according to the first aspect, only the characters specified by the recognition target character specifying means 10 are selected from the recognition results, but the pattern dictionary 2 is subjected to the dimensional compression. In such a case, the precision of the dimensional compression may deteriorate. For example, dimensional compression is to minimize the average error before and after dimensional compression for the feature vectors of all characters, or
Since the conversion axis that satisfies the criteria such as the maximum separation between categories is obtained, if the character set to be recognized changes, the conversion axis cannot be accurately compressed unless the conversion axis is obtained again. .

【００７４】そこで、本実施の形態で、全ての文字の、
次元圧縮前の特徴ベクトルを学習サンプル特徴ベクトル
保持手段３２に保持しておき、認識対象文字集合が変わ
る度に、次元圧縮処理を行なった後、実際の認識処理を
行なうことで、精度よく次元圧縮して、認識率を向上さ
せている。Therefore, in this embodiment, all characters
The feature vector before the dimensional compression is held in the learning sample feature vector holding means 32, and every time the character set to be recognized changes, the dimensional compression processing is performed, and then the actual recognition processing is performed, so that the dimensional compression can be performed accurately. Then, the recognition rate is improved.

【００７５】すなわち、本実施の形態の文字認識装置３
０は、図６に示すように、まず、認識対象文字集合を変
えるかどうかチェックし（ステップＳ２０１）、認識対
象文字集合を変える場合には、新たに認識文字集合を指
定するために、文字認識対象の原稿を調査して、認識結
果として妥当な文字集合を認識対象文字集合として指定
して、認識対象文字指定手段１０の指定した認識対象文
字集合を認識対象文字集合保持手段１１で保持する（ス
テップＳ２０２）。文字認識装置３０は、この指定され
た認識対象文字集合に基づいて、次元圧縮手段３１で、
学習サンプルベクトル保持手段３２の保持する特徴ベク
トルを次元圧縮し、パターン辞書２を作成し直す（ステ
ップＳ２０３）。That is, the character recognition device 3 of this embodiment
0, as shown in FIG. 6, first, it is checked whether or not the character set to be recognized is changed (step S201). When the character set to be recognized is changed, character recognition is performed in order to newly specify the character set to be recognized. The target document is examined, a character set appropriate as a recognition result is specified as a character set to be recognized, and the character set to be recognized specified by the character to be recognized specifying means 10 is held by the character set to be held holding means 11 ( Step S202). The character recognition device 30 uses the dimension compression means 31 based on the specified character set to be recognized.
The feature vector held by the learning sample vector holding unit 32 is dimensionally compressed, and the pattern dictionary 2 is created again (step S203).

【００７６】文字認識装置３０は、パターン辞書２を作
成し直すと、スキャナ等のイメージ読取装置であるパタ
ーン入力手段３から読み取りを意図するパターンを入力
する（ステップＳ２０４）。After re-creating the pattern dictionary 2, the character recognition device 30 inputs a pattern intended to be read from the pattern input means 3 which is an image reading device such as a scanner (step S204).

【００７７】一方、ステップＳ２０１で、認識対象文字
集合を変えないときには、文字認識装置３０は、パター
ン辞書２を新たに次元圧縮し直す必要がないため、ステ
ップＳ４に移行して、パターン入力手段３から読み取り
を意図するパターンを入力する（ステップＳ２０４）。On the other hand, if the character set to be recognized is not changed in step S201, the character recognition device 30 does not need to newly re-dimensionally compress the pattern dictionary 2, so the process proceeds to step S4 and the pattern input means 3 , A pattern intended to be read is input (step S204).

【００７８】文字認識装置３０は、パターン入力を行う
と、特徴抽出手段４でイメージの特徴を抽出する（ステ
ップＳ２０５）。特徴抽出手段４は、このイメージの特
徴の抽出を、方向コードや輪郭特徴など各種を利用して
行い、特徴抽出結果は、多次元のベクトルとして表現さ
れる。文字認識装置３０は、特徴抽出を行うと、特徴抽
出手段４の抽出結果を、次元圧縮手段５で、上記ステッ
プＳ２０２で指定された文字集合に基づいて、識別能力
が落ちないように次元圧縮して次元数を減じる（ステッ
プＳ２０６）。When the character recognition device 30 performs pattern input, the feature extracting means 4 extracts features of the image (step S205). The feature extracting means 4 performs the feature extraction of the image using various types such as a direction code and a contour feature, and the feature extraction result is expressed as a multidimensional vector. After performing the feature extraction, the character recognizing device 30 dimensionally compresses the extraction result of the feature extracting means 4 by the dimension compressing means 5 based on the character set designated in step S202 so that the discriminating ability does not decrease. To reduce the number of dimensions (step S206).

【００７９】次に、パターン照合手段６が、上記作成さ
れた特徴ベクトルを、パターン辞書２に格納されている
標準パターン（特徴ベクトル）と照合し（ステップＳ２
０７）、最小距離パターン選択手段７が、特徴量空間に
おいて、入力パターンと近い距離を持つ標準パターンを
選択して、最小距離パターン保持・管理手段８が、この
選択された標準パターンを保持・管理する（ステップＳ
２０８）。Next, the pattern matching means 6 matches the created feature vector with a standard pattern (feature vector) stored in the pattern dictionary 2 (step S2).
07), the minimum distance pattern selecting means 7 selects a standard pattern having a distance close to the input pattern in the feature space, and the minimum distance pattern holding / managing means 8 holds / manages the selected standard pattern. (Step S
208).

【００８０】そして、文字選択手段９が、最小距離パタ
ーン保持・管理手段８の保持・管理している標準パター
ンのうち、認識対象文字指定手段１０で指定されて認識
対象文字集合保持手段１１の保持している認識対象文字
集合に該当する文字のみを選択して、認識結果１２とす
る（ステップＳ２０９）。Then, the character selection means 9 is designated by the recognition target character designation means 10 and held in the recognition target character set holding means 11 among the standard patterns held and managed by the minimum distance pattern holding / management means 8. Only the characters corresponding to the set of characters to be recognized are selected and set as a recognition result 12 (step S209).

【００８１】このように、本実施の形態の文字認識装置
１は、認識対象文字集合を適宜変更可能とし、当該変更
された認識文字集合に対して順次文字認識処理を行って
いる。As described above, the character recognition apparatus 1 of the present embodiment enables the character set to be recognized to be changed as appropriate, and sequentially performs character recognition processing on the changed recognized character set.

【００８２】したがって、誤認識しやすい類似文字等の
認識精度をより一層向上させることができ、高精度かつ
高速に文字認識を行うことができる。Therefore, it is possible to further improve the recognition accuracy of a similar character or the like which is likely to be erroneously recognized, and it is possible to perform the character recognition with high accuracy and high speed.

【００８３】図７は、本発明の文字認識装置の第３の実
施の形態を示す図であり、本実施の形態は、請求項３に
対応するものである。FIG. 7 is a diagram showing a third embodiment of the character recognition device of the present invention. This embodiment corresponds to claim 3.

【００８４】なお、本実施の形態は、上記第１の実施の
形態の文字認識装置１と同様の文字認識装置に適用した
ものであり、本実施の形態の説明においては、必要に応
じて、上記第１の実施の形態で用いた符号をそのまま用
いて説明する。The present embodiment is applied to a character recognition device similar to the character recognition device 1 of the first embodiment, and in the description of this embodiment, if necessary, Description will be made using the reference numerals used in the first embodiment as they are.

【００８５】本実施の形態の文字認識装置１は、全ての
文字の、次元圧縮前の特徴ベクトルを学習サンプル特徴
ベクトル３２に保持し、認識対象文字集合が変わる度
に、次元圧縮処理を行なった後、実際の認識処理を行な
うことで、精度よく次元圧縮を行って、認識率をより一
層向上させている。The character recognition apparatus 1 of this embodiment holds the feature vectors of all the characters before the dimension compression in the learning sample feature vector 32, and performs the dimension compression processing every time the character set to be recognized changes. Thereafter, by performing actual recognition processing, dimension compression is performed with high accuracy, and the recognition rate is further improved.

【００８６】すなわち、上記第１の実施の形態の文字認
識装置１においては、上述のように、認識結果の中か
ら、認識対象文字指定手段１０で指定された文字だけ
を、選択しているが、パターン辞書２が次元圧縮されて
いる場合には、次元圧縮の精度が悪くなることがある。
例えば、次元圧縮は、全ての文字の特徴ベクトルに対し
て、次元圧縮前と次元圧縮後とにおいて平均誤差が最小
となること、あるいは、カテゴリ間の分離が最大になる
こと、のような基準を満たす変換軸を求めるものである
から、認識対象文字集合が変化すれば、変換軸を改めて
求めなければ、精度良く次元圧縮できない。That is, in the character recognition device 1 of the first embodiment, as described above, only the character specified by the recognition target character specifying means 10 is selected from the recognition results. When the pattern dictionary 2 is dimensionally compressed, the precision of the dimension compression may be reduced.
For example, dimensional compression is based on a criterion such as minimizing the average error before and after dimensional compression, or maximizing the separation between categories for feature vectors of all characters. Since the conversion axis that satisfies is determined, if the character set to be recognized changes, the dimension cannot be accurately compressed unless the conversion axis is determined again.

【００８７】そこで、本実施の形態の文字認識装置１
は、パターン辞書２として、よく使用される文字集合に
対して、予め文字集合毎に次元圧縮したパターン辞書２
を複数用意し、文字認識の際に、用意されている文字集
合を選択することで、当該文字集合に最適な次元圧縮を
行ったパターン辞書２を選択して、使用する。Therefore, the character recognition device 1 of the present embodiment
Is a pattern dictionary 2 obtained by dimensionally compressing a frequently used character set for each character set in advance.
Are prepared, and at the time of character recognition, a prepared character set is selected to select and use the pattern dictionary 2 which has been subjected to optimal dimensional compression for the character set.

【００８８】すなわち、文字認識処理装置１は、図７に
示すように、まず、認識対象文字指定手段１０で、認識
対象文字集合をＡ→Ｂの順に絞ることを指定されると
（ステップＳ３０１）、スキャナ等のイメージ読取装置
であるパターン入力手段３から読み取りを意図するパタ
ーンを入力する（ステップＳ３０２）。That is, as shown in FIG. 7, when the character recognition processing device 1 is specified by the recognition target character specification means 10 to narrow down the set of characters to be recognized in the order of A → B (step S301). Then, a pattern intended to be read is inputted from the pattern input means 3 which is an image reading device such as a scanner (step S302).

【００８９】パターン入力を行うと、文字認識装置１
は、特徴抽出手段４でイメージの特徴を抽出し（ステッ
プＳ３０３）、特徴抽出手段４の抽出結果を、認識対象
文字集合Ａとして設定する（ステップＳ３０４）。When a pattern is input, the character recognition device 1
Extracts the features of the image by the feature extraction unit 4 (step S303), and sets the extraction result of the feature extraction unit 4 as a recognition target character set A (step S304).

【００９０】次に、文字認識装置１は、認識対象文字集
合Ａ用の辞書を、現在のパターン辞書２として設定し
（ステップＳ３０５）、次元圧縮手段５で、指定された
認識対象文字集合Ａに基づいて、識別能力が落ちないよ
うに特徴抽出手段４の抽出結果を次元圧縮する（ステッ
プＳ３０６）。Next, the character recognition apparatus 1 sets the dictionary for the character set A to be recognized as the current pattern dictionary 2 (step S305), and the dimension compression means 5 sets the dictionary to the specified character set A for recognition. Based on this, the extraction result of the feature extraction means 4 is dimensionally compressed so that the discrimination ability does not decrease (step S306).

【００９１】次に、パターン照合手段６が、上記作成さ
れた特徴ベクトルを、パターン辞書２に格納されている
標準パターンのうち上記設定した現在の標準パターン
（特徴ベクトル）と照合し（ステップＳ３０７）、最小
距離パターン選択手段７が、特徴量空間において、入力
パターンと近い距離を持つ標準パターンを複数個選択し
て、最小距離パターン保持・管理手段８が、この選択さ
れた複数の標準パターンを保持し、認識結果候補として
作成して管理する（ステップＳ３０８）。Next, the pattern matching means 6 compares the created feature vector with the set current standard pattern (feature vector) among the standard patterns stored in the pattern dictionary 2 (step S307). The minimum distance pattern selecting means 7 selects a plurality of standard patterns having a distance close to the input pattern in the feature space, and the minimum distance pattern holding / managing means 8 holds the selected plurality of standard patterns. Then, it is created and managed as a recognition result candidate (step S308).

【００９２】次に、文字認識装置１は、認識対象文字指
定手段１０で、認識対象文字集合をＢに設定し（ステッ
プＳ３０９）、認識対象文字集合Ｂ用の辞書を、現在の
パターン辞書２として設定する（ステップＳ３１０）。Next, the character recognition device 1 sets the recognition target character set to B by the recognition target character designation means 10 (step S309), and sets the dictionary for the recognition target character set B as the current pattern dictionary 2. It is set (step S310).

【００９３】認識対象文字集合Ｂ用の辞書を現在のパタ
ーン辞書２として設定すると、文字認識装置１は、ステ
ップＳ３０３で抽出された結果を、次元圧縮手段５で、
認識対象文字集合Ｂに基づいて、識別能力が落ちないよ
うに、次元圧縮する（ステップＳ３１１）。When the dictionary for the character set B to be recognized is set as the current pattern dictionary 2, the character recognizing device 1 outputs the result extracted in step S303 to the dimension compressing means 5,
Based on the recognition target character set B, the dimension is compressed so that the discrimination ability does not decrease (step S311).

【００９４】次に、パターン照合手段６が、上記ステッ
プＳ３０８で選択されている認識候補のうち、認識対象
文字集合Ｂに属する結果についてのみ、ステップＳ３１
１で作成された特徴ベクトルと、パターン辞書２に格納
されている標準パターンのうち上記設定した現在の標準
パターン（特徴ベクトル）と照合し（ステップＳ３１
２）、最小距離パターン選択手段７が、特徴量空間にお
いて、入力パターンと近い距離を持つ標準パターンを複
数個選択して、最小距離パターン保持・管理手段８が、
この選択された複数の標準パターンを保持し、認識結果
候補として作成して管理する（ステップＳ３１３）。Next, the pattern matching means 6 determines in step S31 only the results belonging to the character set B to be recognized among the recognition candidates selected in step S308.
1 is compared with the current standard pattern (feature vector) set above among the standard patterns stored in the pattern dictionary 2 (step S31).
2) The minimum distance pattern selecting means 7 selects a plurality of standard patterns having a distance close to the input pattern in the feature amount space, and the minimum distance pattern holding / managing means 8
The plurality of selected standard patterns are held, created and managed as recognition result candidates (step S313).

【００９５】このように、本実施の形態の文字認識装置
１は、認識すべき文字毎に、認識対象文字集合を設定可
能とし、当該設定された認識対象文字集合に対して文字
認識処理を行っている。As described above, the character recognition apparatus 1 according to the present embodiment enables a character set to be recognized to be set for each character to be recognized, and performs a character recognition process on the set character set to be recognized. ing.

【００９６】したがって、次元圧縮前の特徴ベクトルを
全文字分、保持することなく、また、文字集合が変化す
る度にパターン辞書を作成し直すための次元圧縮処理を
行うことなく、記憶容量の削減及び処理時間の短縮化を
図りつつ、文字集合に最適な次元圧縮を行なったパター
ン辞書を使用して、より一層認識率を向上させることが
できる。Therefore, the storage capacity can be reduced without retaining the feature vectors before the dimension compression for all the characters and without performing the dimension compression processing for recreating the pattern dictionary every time the character set changes. In addition, it is possible to further improve the recognition rate by using a pattern dictionary that has been subjected to optimal dimensional compression for a character set while reducing processing time.

【００９７】なお、本実施の形態では、文字集合Ａの部
分集合である文字集合Ｂを、最終的な認識文字集合とし
て指定して、２段階に選択を行っているが、２段階に限
るものではなく、より多くの段階に分けて選択を行って
もよい。In this embodiment, the character set B, which is a subset of the character set A, is designated as the final recognized character set, and the selection is made in two stages. Instead, the selection may be made in more stages.

【００９８】図８は、本発明の文字認識装置の第４の実
施の形態を示す図であり、本実施の形態は、請求項４に
対応するものである。FIG. 8 is a diagram showing a fourth embodiment of the character recognition apparatus according to the present invention. This embodiment corresponds to claim 4 of the present invention.

【００９９】なお、本実施の形態は、上記第１の実施の
形態の文字認識装置１と同様の文字認識装置に適用した
ものであり、本実施の形態の説明においては、必要に応
じて、上記第１の実施の形態で用いた符号をそのまま用
いて説明する。Note that the present embodiment is applied to a character recognition device similar to the character recognition device 1 of the first embodiment, and in the description of this embodiment, if necessary, Description will be made using the reference numerals used in the first embodiment as they are.

【０１００】本実施の形態の文字認識装置１は、安定し
て認識可能な文字に対しては、文字集合を変えて多段階
に認識する必要がないことに着目して、文字毎に認識対
象文字集合を指定することで、不必要に多段階に認識処
理を行なうことなく、より細かく選択処理を設定して、
認識精度を向上させている。The character recognition apparatus 1 according to the present embodiment focuses on the fact that it is not necessary to change the character set and recognize it in multiple stages for characters that can be stably recognized. By specifying the character set, the selection process can be set more finely without performing the recognition process unnecessarily in multiple stages.
The recognition accuracy has been improved.

【０１０１】すなわち、上記第３の実施の形態の文字認
識装置１では、一般的によく使われる文字集合を予め想
定して準備して、多段階認識処理を行うことで、認識率
を向上させているが、安定して認識可能な文字に対して
は、文字集合を変えて多段階に認識することなく、高認
識率で認識することができる。That is, in the character recognition device 1 of the third embodiment, a character set which is generally used is assumed and prepared in advance, and a multi-stage recognition process is performed to improve the recognition rate. However, characters that can be stably recognized can be recognized at a high recognition rate without performing multi-stage recognition by changing the character set.

【０１０２】そこで、本実施の形態の文字認識装置１
は、文字毎に認識対象文字集合を指定することで、不必
要に多段階に認識処理を行なうことなく、より一層細か
く選択処理を設定して、認識精度を向上させている。Therefore, the character recognition device 1 of the present embodiment
By specifying a character set to be recognized for each character, the selection process is set more finely without performing the recognition process unnecessarily in multiple stages, thereby improving the recognition accuracy.

【０１０３】すなわち、本実施の形態の文字認識装置１
は、図８に示すように、まず、認識対象文字指定手段１
０で、認識対象文字集合Ａを文字毎（Ａ→文字毎）に絞
ることを指定し（ステップＳ４０１）、スキャナ等のイ
メージ読取装置であるパターン入力手段３から読み取り
を意図するパターンを入力する（ステップＳ４０２）。That is, the character recognition device 1 of the present embodiment
First, as shown in FIG.
In step S401, it is specified that the character set A to be recognized is narrowed down to each character (A → each character) (step S401), and a pattern intended to be read is input from the pattern input unit 3 which is an image reading device such as a scanner (step S401). Step S402).

【０１０４】パターン入力を行うと、文字認識装置１
は、特徴抽出手段４でイメージの特徴を抽出し（ステッ
プＳ４０３）、特徴抽出手段４の抽出結果を、認識対象
文字集合Ａとして設定する（ステップＳ４０４）。When a pattern is input, the character recognition device 1
Extracts the features of the image by the feature extraction unit 4 (step S403), and sets the extraction result of the feature extraction unit 4 as a recognition target character set A (step S404).

【０１０５】次に、文字認識装置１は、認識対象文字集
合Ａ用の辞書を、現在のパターン辞書２として設定し
（ステップＳ４０５）、次元圧縮手段５で、指定された
認識対象文字集合Ａに基づいて、識別能力が落ちないよ
うに特徴抽出手段４の抽出結果を次元圧縮する（ステッ
プＳ４０６）。Next, the character recognition device 1 sets the dictionary for the character set A to be recognized as the current pattern dictionary 2 (step S405), and the dimension compression means 5 sets the dictionary to the specified character set A to be recognized. Based on this, the extraction result of the feature extraction means 4 is dimensionally compressed so that the discrimination ability does not decrease (step S406).

【０１０６】次に、パターン照合手段６が、上記作成さ
れた特徴ベクトルを、パターン辞書２に格納されている
標準パターンのうち上記設定した現在の標準パターン
（特徴ベクトル）と照合し（ステップＳ４０７）、最小
距離パターン選択手段７が、特徴量空間において、入力
パターンと近い距離を持つ標準パターンを複数個選択し
て、最小距離パターン保持・管理手段８が、この選択さ
れた複数の標準パターンを保持し、認識結果候補として
作成して管理する（ステップＳ４０８）。Next, the pattern matching means 6 compares the created feature vector with the current standard pattern (feature vector) set above among the standard patterns stored in the pattern dictionary 2 (step S407). The minimum distance pattern selecting means 7 selects a plurality of standard patterns having a distance close to the input pattern in the feature space, and the minimum distance pattern holding / managing means 8 holds the selected plurality of standard patterns. Then, it is created and managed as a recognition result candidate (step S408).

【０１０７】次に、文字認識装置１は、認識対象文字指
定手段１０で、ステップＳ４０８の認識結果に基づい
て、認識対象文字毎に認識対象文字集合Ｘを設定し（ス
テップＳ４０９）、認識対象文字集合Ｘ用の辞書を、現
在のパターン辞書２として設定する（ステップＳ４１
０）。Next, the character recognition device 1 sets the recognition target character set X for each recognition target character based on the recognition result of step S408 by the recognition target character designation means 10 (step S409). The dictionary for the set X is set as the current pattern dictionary 2 (step S41).
0).

【０１０８】認識対象文字集合Ｘ用の辞書を現在のパタ
ーン辞書２として設定すると、文字認識装置１は、ステ
ップＳ４０３で抽出された結果を、次元圧縮手段５で、
認識対象文字集合Ｘに基づいて、識別能力が落ちないよ
うに、次元圧縮する（ステップＳ４１１）。When the dictionary for the character set X to be recognized is set as the current pattern dictionary 2, the character recognizing device 1 outputs the result extracted in step S403 to the dimension compressing means 5,
Based on the recognition target character set X, the dimension is compressed so that the discrimination ability does not decrease (step S411).

【０１０９】次に、パターン照合手段６が、上記ステッ
プＳ４０８で選択している認識候補のうち、認識対象文
字集合Ｘに属する結果についてのみ、ステップＳ４１１
で作成された特徴ベクトルと、パターン辞書２に格納さ
れている標準パターンのうち上記設定した現在の標準パ
ターン（特徴ベクトル）と照合し（ステップＳ４１
２）、最小距離パターン選択手段７が、特徴量空間にお
いて、入力パターンと近い距離を持つ標準パターンを複
数個選択して、最小距離パターン保持・管理手段８が、
この選択された複数の標準パターンを保持し、認識結果
候補として作成して管理する（ステップＳ４１３）。Next, the pattern matching means 6 determines only the result belonging to the character set X to be recognized among the recognition candidates selected in step S408, in step S411.
Is compared with the set current standard pattern (feature vector) among the standard patterns stored in the pattern dictionary 2 (step S41).
2) The minimum distance pattern selecting means 7 selects a plurality of standard patterns having a distance close to the input pattern in the feature amount space, and the minimum distance pattern holding / managing means 8
The selected plurality of standard patterns are held, created and managed as recognition result candidates (step S413).

【０１１０】このように、本実施の形態の文字認識装置
１は、パターン辞書２に格納されている言語情報に基づ
いて、複数の文字をまとめて１つのグループとして文字
データと同様に認識対象文字として設定できるようにし
ている。As described above, the character recognition device 1 according to the present embodiment collects a plurality of characters into one group based on the language information stored in the pattern dictionary 2 in the same manner as the character data. It is possible to set as.

【０１１１】したがって、文字認識の対象となる原稿に
出現する単語が固定されていたり、出現頻度が明確な場
合に、言語的知識を用いてその出現有無や頻度等を判別
して、類似文字の詳細識別の必要性の有無を決定し、類
似文字の認識精度を落すことなく、詳細識別にかかる処
理量を低減することができ、より高精度かつより一層高
速に文字認識を行うことができる。Therefore, if a word that appears in a document to be subjected to character recognition is fixed or the frequency of occurrence is clear, the presence / absence and frequency of the word are determined using linguistic knowledge to determine the similar character. It is possible to determine the necessity of the detailed identification, reduce the processing amount for the detailed identification without lowering the recognition accuracy of the similar character, and perform the character recognition with higher accuracy and higher speed.

【０１１２】なお、本実施の形態の場合、文字認識装置
の使用される原稿において、出現する単語がある程度わ
かっている場合、例えば、国語辞典の見出し語に選ばれ
ているような単語である場合、出現することの分かって
いる単語を調査することによって、認識精度を向上させ
ることのできる重要な類似文字集合を求めることができ
る。In the case of the present embodiment, when a word that appears in a manuscript used by the character recognition device is known to some extent, for example, when the word is selected as a headword in a Japanese language dictionary By investigating words that are known to appear, an important similar character set that can improve recognition accuracy can be obtained.

【０１１３】この類似文字集合を求める方法としては、
例えば、特徴量ベクトルをクラスタリングによってまと
める方法や認識結果の誤りを集計してコンフュージョン
マトリクスを作成する方法を用いることができる。な
お、クラスタリングとしては、ＬＢＧアルゴリズム(上
坂・尾関著「パターン認識と学習のアルゴリズム」，第
７章，文一総合出版等参照）等を用いることができる。As a method for obtaining the similar character set,
For example, a method of collecting feature amount vectors by clustering or a method of totaling errors in recognition results to create a confusion matrix can be used. As the clustering, an LBG algorithm (see "Algorithm of Pattern Recognition and Learning" written by Uesaka and Ozeki, Chapter 7, Bunichi Sogo Publishing, etc.) and the like can be used.

【０１１４】例えば、類似文字集合｛右，戸，石｝があ
り、原稿中に出現が予想される単語に「右田」、「戸
田」、「石田」があるとすると、たとえ言語知識によ
る、誤り訂正の機会があったとしても、｛右，戸，石｝
を正しく識別できない限り、正解文字が得られないこと
が分かる。すなわち、類似文字集合｛右，戸，石｝は、
識別にとって重要な類似文字集合なのである。For example, if there is a similar character set {right, door, stone}, and words that are expected to appear in the manuscript include “Migita”, “Toda”, and “Ishida”, even if the word Even if there is an opportunity for correction, {right, door, stone}
It can be understood that the correct character cannot be obtained unless the character is correctly identified. That is, the similar character set {right, door, stone}
It is a similar character set important for identification.

【０１１５】このような集合は、単語中に含まれる文字
を、その類似文字集合に置き換え、他の単語と同じにな
るものがあるかを調査することにより得られる。Such a set is obtained by replacing a character included in a word with a similar character set and examining whether there is a character that is the same as another word.

【０１１６】例えば、上記例では、以下のようになっ
て、三者同一になる。For example, in the above example, the three become identical as follows.

【０１１７】「右田」→｛右，戸，石｝＋田「戸田」→｛右，戸，石｝＋田「石田」→｛右，戸，石｝＋田この結果から、認識処理を行なう以前に、識別にとって
重要な類似文字集合を、予め求めておくことができる。"Mita" → {right, door, stone} + field "Toda" → {right, door, stone} + field "Ishida" → {right, door, stone} + field From this result, recognition processing is performed. Previously, a similar character set important for identification can be determined in advance.

【０１１８】したがって、これらの重要な類似文字集合
を識別するための辞書を作成しておき、通常のパターン
辞書から参照できるようにすることで、通常の認識に続
いて詳細識別を行なうことができ、認識精度を向上させ
ることができる。Therefore, by creating a dictionary for identifying these important similar character sets and referencing them from a normal pattern dictionary, it is possible to perform detailed identification following normal recognition. , The recognition accuracy can be improved.

【０１１９】このような類似文字集合辞書は、例えば、
図９に「鯉」、「鰹」、「鮭」の辞書レコードの例を示
すように、詳細識別の不要な類似文字集合の辞書を作成
することにより対応することができる。Such a similar character set dictionary is, for example,
FIG. 9 shows an example of dictionary records of “koi”, “bonito”, and “salmon”, which can be dealt with by creating a dictionary of a similar character set that does not require detailed identification.

【０１２０】また、本実施の形態の場合、上記の場合と
は逆に、文字認識装置１の使用される原稿において、出
現する単語が固定されている場合には、類似文字を詳細
に識別する必要がない場合がある。In the case of the present embodiment, on the contrary to the above case, if the words appearing in the original used by the character recognition device 1 are fixed, similar characters are identified in detail. May not be necessary.

【０１２１】例えば、いま、類似文字集合｛鰹，鮭，
鯉｝があり、この類似文字集合内の文字を含む単語とし
ては、「初鰹」「塩鮭」「鯉口」だけが、原稿中には出
現しないことが分かっているものとする。For example, now, a similar character set {bonito, salmon,
It is assumed that there is a carp 単語 and it is known that only “first bonito”, “salt salmon”, and “koiguchi” do not appear in the manuscript as words including characters in the similar character set.

【０１２２】この場合、入力パターンが、｛鰹，鮭，
鯉｝の類似文字集合のうちのどれかを詳細に識別しなく
ても、隣接する文字と単語情報によって、以下のよう
に、あいまいさが解消される。In this case, the input pattern is bonito, salmon,
Even if any of the similar character sets of Koi is not identified in detail, the ambiguity is resolved by the adjacent character and word information as follows.

【０１２３】初＋｛鰹，鮭，鯉｝ ○ 初鰹 × 初鮭 × 初鯉塩＋｛鰹，鮭，鯉｝ × 塩鰹 ○ 塩鮭 × 塩鯉｛鰹，鮭，鯉｝＋口 × 鰹口 × 鮭口 ○ 鯉口したがって、｛鰹，鮭，鯉｝のように識別する必要のな
い類似文字集合は、パターン辞書２中において、まとめ
て一つの標準パターン(特徴量ベクトル)とすることで、
標準パターンの数を減少させることができ、入力パター
ンとの照合回数を削減することができる。その結果、認
識精度を下げることなく、処理時間を短縮させることが
できる。First + bonito, salmon, carp ○ First bonito × first salmon × first carp Salt + bonito, salmon, carp × salted bonito ○ Salted salmon × salted carp bonito, salmon, carp + mouth × bonito × Therefore, similar character sets that do not need to be distinguished, such as {bonito, salmon, carp}, are collectively defined as one standard pattern (feature vector) in the pattern dictionary 2.
The number of standard patterns can be reduced, and the number of times of matching with the input pattern can be reduced. As a result, the processing time can be reduced without lowering the recognition accuracy.

【０１２４】このような詳細識別の不要な類似文字集合
の標準パターンとしては、例えば、図１０に、その辞書
レコードを示すようなものを用いることができる。As a standard pattern of a similar character set that does not require detailed identification, for example, a pattern shown in FIG. 10 showing the dictionary record can be used.

【０１２５】以上、本発明者によってなされた発明を好
適な実施の形態に基づき具体的に説明したが、本発明は
上記のものに限定されるものではなく、その要旨を逸脱
しない範囲で種々変更可能であることはいうまでもな
い。The invention made by the present inventor has been specifically described based on the preferred embodiments. However, the present invention is not limited to the above, and various modifications can be made without departing from the gist of the invention. It goes without saying that it is possible.

【０１２６】[0126]

【発明の効果】請求項１記載の発明の文字認識装置によ
れば、文字画像に対して所定の画像処理を行って画像の
特徴を抽出し、画像特徴格納辞書と照合して、当該画像
特徴格納辞書に格納されている言語情報に基づいて誤り
訂正を行って文字認識を行なうに際して、文字画像のう
ち認識を行う文字集合を認識対象文字集合として任意に
設定可能とし、当該設定された認識対象文字集合に対し
てのみ画像特徴格納辞書を使用して文字認識処理を行う
ので、誤認識しやすい文字である類似文字等を、予め調
査して求めて、類似文字間の識別に特化した画像特徴格
納辞書を作成し、認識を行う文字画像のうち当該類似文
字等の誤認しやすい文字集合を認識対象文字集合として
指定して、当該認識対象文字集合についてのみ作成した
画像特徴格納辞書を使用して文字認識を行うことがで
き、誤認識しやすい類似文字等の認識精度を向上させて
高精度かつ高速に文字認識を行うことができる。According to the character recognition apparatus of the first aspect of the present invention, a predetermined image processing is performed on the character image to extract the image feature, and the extracted image feature is compared with the image feature storage dictionary. When performing character correction by performing error correction based on linguistic information stored in a storage dictionary, a character set to be recognized among character images can be arbitrarily set as a character set to be recognized, and the set recognition target is set. Since the character recognition process is performed only for the character set by using the image feature storage dictionary, similar characters that are easily misrecognized are determined by examining in advance, and images specialized for distinguishing between similar characters An image feature storage dictionary that creates a feature storage dictionary and designates a character set that is likely to be misidentified such as similar characters among character images to be recognized as a character set to be recognized, and creates only the character set to be recognized. It is possible to perform character recognition can be performed character recognition, to improve the recognition accuracy of such erroneous recognition easily similar character with high accuracy and fast in use.

【０１２７】請求項２記載の発明の文字認識装置によれ
ば、認識対象文字集合を適宜変更可能とし、当該変更さ
れた認識文字集合に対して順次文字認識処理を行うの
で、誤認識しやすい類似文字等の認識精度をより一層向
上させることができ、高精度かつ高速に文字認識を行う
ことができる。According to the character recognition apparatus of the second aspect of the invention, the character set to be recognized can be changed as appropriate, and the changed character set is sequentially subjected to character recognition processing. The recognition accuracy of characters and the like can be further improved, and character recognition can be performed with high accuracy and at high speed.

【０１２８】請求項３記載の発明の文字認識装置によれ
ば、認識すべき文字毎に、認識対象文字集合を設定可能
とし、当該設定された認識対象文字集合に対して文字認
識処理を行うので、誤認識しやすい類似文字等の認識精
度をより一層速やかに認識することができ、高精度かつ
より一層高速に文字認識を行うことができる。According to the character recognition device of the third aspect of the present invention, a set of characters to be recognized can be set for each character to be recognized, and the character recognition process is performed on the set of characters to be recognized. In addition, the recognition accuracy of similar characters or the like which are easily misrecognized can be more quickly recognized, and character recognition can be performed with higher accuracy and higher speed.

【０１２９】請求項４記載の発明の文字認識装置によれ
ば、認識対象文字集合の設定を、画像特徴格納辞書に格
納されている言語情報に基づいて行えるようにしている
ので、文字認識の対象となる原稿の分野が限定されてい
る場合に、その分野に特有な言語的知識を用いて、類似
文字の詳細識別の必要性の有無を決定し、類似文字の認
識精度を落すことなく、詳細識別にかかる処理量を低減
することができ、より高精度かつより一層高速に文字認
識を行うことができる。According to the character recognition apparatus of the present invention, the character set to be recognized can be set based on the language information stored in the image feature storage dictionary. When the field of the original document is limited, linguistic knowledge unique to the field is used to determine whether detailed identification of similar characters is necessary, and to reduce the accuracy of similar character recognition without lowering the accuracy. The amount of processing required for identification can be reduced, and character recognition can be performed with higher accuracy and at higher speed.

【０１３０】請求項５記載の発明の文字認識装置によれ
ば、画像特徴格納辞書に格納されている言語情報に基づ
いて、複数の文字をまとめて１つのグループとして文字
データと同様に認識対象文字として設定できるようにし
ているので、文字認識の対象となる原稿に出現する単語
が固定されていたり、出現頻度が明確な場合に、言語的
知識を用いてその出現有無や頻度等を判別して、類似文
字の詳細識別の必要性の有無を決定し、類似文字の認識
精度を落すことなく、詳細識別にかかる処理量を低減す
ることができ、より高精度かつより一層高速に文字認識
を行うことができる。According to the character recognition device of the present invention, based on the linguistic information stored in the image feature storage dictionary, a plurality of characters are grouped into one group in the same manner as the character data. When words appearing in the document to be subjected to character recognition are fixed or when the frequency of occurrence is clear, the presence or absence and frequency of the occurrence are determined by using linguistic knowledge. , It is possible to determine the necessity of detailed identification of similar characters, and to reduce the processing amount required for detailed identification without lowering the recognition accuracy of similar characters, and perform character recognition with higher accuracy and at higher speed. be able to.

【０１３１】請求項６記載の発明の記録媒体によれば、
請求項１から請求項５のいずれかに記載の文字認識処理
を実行するのに必要な文字認識処理プログラム及び前記
画像特徴格納辞書等の前記文字認識処理プログラムを実
行するのに必要なデータを記録しているので、既存のハ
ードウェア資源を用いて文字認識処理プログラムを実行
して、認識を行う文字画像のうち当該類似文字等の誤認
しやすい文字集合を認識対象文字集合として指定して、
当該認識対象文字集合についてのみ作成した画像特徴格
納辞書を使用して文字認識を行うことができ、誤認識し
やすい類似文字等の認識精度を向上させて高精度かつ高
速に文字認識を行うことができる。According to the recording medium of the invention described in claim 6,
A character recognition processing program required to execute the character recognition processing according to any one of claims 1 to 5, and data necessary to execute the character recognition processing program such as the image feature storage dictionary are recorded. Since the character recognition processing program is executed by using the existing hardware resources, a character set that is easily misidentified such as the similar character in the character image to be recognized is designated as a character set to be recognized.
Character recognition can be performed using the image feature storage dictionary created only for the character set to be recognized. it can.

[Brief description of the drawings]

【図１】本発明の文字認識装置の第１の実施の形態を適
用した文字認識装置の概念図。FIG. 1 is a conceptual diagram of a character recognition device to which a first embodiment of the character recognition device of the present invention is applied.

【図２】図１の文字認識装置を実現するコンピュータの
ブロック構成図。FIG. 2 is a block diagram of a computer that realizes the character recognition device of FIG. 1;

【図３】図１の文字認識装置をネットワークに接続され
た複数のコンピュータで実現する際のシステム構成図。FIG. 3 is a system configuration diagram when the character recognition device of FIG. 1 is implemented by a plurality of computers connected to a network.

【図４】図１の文字認識装置による文字認識処理を示す
フローチャート。FIG. 4 is a flowchart illustrating a character recognition process performed by the character recognition device of FIG. 1;

【図５】本発明の文字認識装置の第２の実施の形態を適
用した文字認識装置の概念図。FIG. 5 is a conceptual diagram of a character recognition device to which a second embodiment of the character recognition device of the present invention is applied.

【図６】図５の文字認識装置による文字認識処理を示す
フローチャート。FIG. 6 is a flowchart showing a character recognition process performed by the character recognition device of FIG. 5;

【図７】本発明の文字認識装置の第３の実施の形態の文
字認識装置による文字認識処理を示すフローチャート。FIG. 7 is a flowchart illustrating a character recognition process performed by the character recognition device according to the third embodiment of the present invention.

【図８】本発明の文字認識装置の第４の実施の形態の文
字認識装置による文字認識処理を示すフローチャート。FIG. 8 is a flowchart illustrating a character recognition process performed by a character recognition device according to a fourth embodiment of the present invention.

【図９】類似文字集合辞書の辞書レコードの一例を示す
図。FIG. 9 is a diagram showing an example of a dictionary record of a similar character set dictionary.

【図１０】類似文字集合の標準パターン辞書の辞書レコ
ードの一例を示す図。FIG. 10 is a diagram showing an example of a dictionary record of a standard pattern dictionary of a similar character set.

【図１１】従来の一般的な文字認識処理を示すフローチ
ャート。FIG. 11 is a flowchart showing a conventional general character recognition process.

[Explanation of symbols]

１文字認識装置２パターン辞書３パターン入力手段４特徴抽出手段５次元圧縮手段６パターン照合手段７最小距離パターン選択手段８最小パターン保持・管理手段９文字選択手段１０認識対象文字指定手段１１認識対象文字集合保持手段２０コンピュータ２１ＣＰＵ２２メモリ２３表示装置２４ハードディスク２５キーボード２６ＣＤ−ＲＯＭドライブ２７フロッピーディスクドライブ２８通信装置２９バスＰＣ１〜ＰＣｎコンピュータ３０文字認識装置３１次元圧縮手段３２学習サンプル特徴ベクトル保持手段 DESCRIPTION OF SYMBOLS 1 Character recognition apparatus 2 Pattern dictionary 3 Pattern input means 4 Feature extraction means 5 dimensional compression means 6 Pattern collation means 7 Minimum distance pattern selection means 8 Minimum pattern holding / management means 9 Character selection means 10 Character recognition target designation means 11 Characters to be recognized Set holding means 20 Computer 21 CPU 22 Memory 23 Display device 24 Hard disk 25 Keyboard 26 CD-ROM drive 27 Floppy disk drive 28 Communication device 29 Bus PC1-PCn Computer 30 Character recognition device 31 Dimensional compression means 32 Learning sample feature vector holding means

Claims

[Claims]

An image feature is extracted by performing predetermined image processing on a character image, collated with an image feature storage dictionary, and an error correction is performed based on language information stored in the image feature storage dictionary. A character set to be recognized among the character images can be arbitrarily set as a character set to be recognized, and the image feature storage is performed only for the set character set to be recognized. A character recognition device for performing character recognition processing using a dictionary.

2. The character according to claim 1, wherein the character recognition device enables the character set to be recognized to be appropriately changed, and performs the character recognition processing on the changed character set to be recognized sequentially. Recognition device.

3. The character recognition device according to claim 1, wherein the character set to be recognized can be set for each character to be recognized, and the character recognition processing is performed on the set of character set to be recognized. The character recognition device according to claim 1 or 2.

4. The character recognition device according to claim 1, wherein the character set to be recognized is set based on the language information stored in the image feature storage dictionary. 4. The character recognition device according to claim 1, wherein a recognition process is performed.

5. The character recognition apparatus according to claim 1, wherein a plurality of characters are collectively set as one group as the character to be recognized in the same manner as character data, based on the language information stored in the image feature storage dictionary. The character recognition device according to claim 4, wherein the character recognition processing is performed on the set of the recognition target character set.

6. A program for executing the character recognition processing program required for executing the character recognition processing according to claim 1 and the character recognition processing program such as the image feature storage dictionary. A recording medium for recording necessary data.