JP2000181995A

JP2000181995A - Character recognizing device

Info

Publication number: JP2000181995A
Application number: JP10359157A
Authority: JP
Inventors: Akihiro Uda; 明弘宇田; Yasuto Ishitani; 康人石谷
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1998-12-17
Filing date: 1998-12-17
Publication date: 2000-06-30

Abstract

PROBLEM TO BE SOLVED: To enable the recognition of high accuracy with a suitable calculation quantity to the state of a document by measuring the suitability of respective dictionaries based on reliability in any one of the plural dictionaries and the character recognized result and performing the character recognition while using the dictionary selected out of the respective dictionaries corresponding to the measured suitability. SOLUTION: A suitability measuring part 13 holds correlative information defined between the state of the document and an output from a similarity calculating part 11 and estimates the scratch, crush or unknown font of a certain document at the time of operating. The estimated information is held as a data base, this data base is collated after pattern recognition and quality or suitability between a recognition object character and a dictionary is discriminated. Corresponding to this suitability, a dictionary selecting part 14 selects a dictionary 12, the character recognition is performed while using the selected dictionary 12, and the result is synthesized with the similarity calculating part 11. Thus, recognizing processing can be highly accurately performed.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、他の行の文字認識
などの情報から文字の品質を推定する文字認識装置に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device for estimating the quality of a character from information such as character recognition of another line.

【０００２】[0002]

【従来の技術】紙文書には多様なフォントで印字されて
いたり、コピーを繰り返したために画像が劣化する場合
がある。これに対し、従来の文字認識装置ではさまざま
な品質やフォントに対して固定的な認識方式を用いてい
た。そのため品質の良い文書にも多くの計算量を使った
り、低品質文書に対しても文書を一定の劣化と想定して
いて認識精度が向上しないという欠点があった。2. Description of the Related Art Paper documents are printed in various fonts, and images are sometimes deteriorated due to repeated copying. On the other hand, the conventional character recognition device uses a fixed recognition method for various qualities and fonts. Therefore, there are drawbacks that a large amount of calculation is used for a high-quality document, and that the recognition accuracy is not improved even if a low-quality document is assumed to be a certain degree of deterioration of the document.

【０００３】加えて、あらゆる品質に対して精度の高い
認識率を実現しようとして、必要以上に多様なパタンを
学習してしまった場合、パタンの追加により認識精度が
低下する過学習と呼ばれる現象があることが知られてい
る。そのため従来方法では多様な品質や未知のフォント
の文書に対して認識精度を向上することが難しかった。[0003] In addition, if an attempt is made to realize a highly accurate recognition rate for all qualities, and if a variety of patterns are learned more than necessary, a phenomenon called over-learning, in which the recognition accuracy is reduced by the addition of the patterns, occurs. It is known that there is. Therefore, it has been difficult for the conventional method to improve the recognition accuracy for documents of various qualities and unknown fonts.

【０００４】大町真一郎、阿曽弘具，“つぶれを考慮し
た低品質印刷文字の高精度認識”，電子情報通信学会論
文誌，(D-II) vol.J79-D-II，No.9，pp.1535-1542(199
6)。これはつぶれを考慮した低品質辞書の生成とそれに
よる劣化文字の認識率向上のみがかかれており、様々な
劣化に適応的に作用するものではない。[0004] Shinichiro Omachi and Hiroki Aso, "High-accuracy recognition of low-quality printed characters considering crushing," IEICE Transactions, (D-II) vol.J79-D-II, No.9, pp. .1535-1542 (199
6). This is only to generate a low-quality dictionary in consideration of collapse and to improve the recognition rate of degraded characters, and does not act adaptively on various types of degradation.

【０００５】大町真一郎、阿曽弘具, “品質を考慮した
印刷文字認識手法”, 情報処理学会論文誌, vol.38, N
o.12, pp.2490-2498(1997).これは予めつぶれ度で示し
た品質判定を前処理として行い,品質に応じて認識方式
を選択する。結果としてパタンに対して2度のパタン認
識を行う事に相当する処理量が必要となる。[0005] Shinichiro Omachi and Hiroki Aso, "Print Character Recognition Method Considering Quality", Information Processing Society of Japan, vol.38, N
o.12, pp.2490-2498 (1997). In this method, the quality judgment indicated in advance by the degree of crush is performed as preprocessing, and the recognition method is selected according to the quality. As a result, a processing amount equivalent to performing pattern recognition twice for the pattern is required.

【０００６】澤木美奈子,村瀬洋,萩田紀博, “変形推定
に基づいた参照パターンの自動選択による文字認識”，
信学技報,PRMU97-187(1997-12),pp.15-22(特開平7-2491
03)これらは上記に同じく,初期辞書,劣化対応辞書と2度
の認識を行う。Minako Sawaki, Hiroshi Murase, Norihiro Hagita, “Character Recognition by Automatic Selection of Reference Pattern Based on Deformation Estimation”,
IEICE Technical Report, PRMU97-187 (1997-12), pp.15-22 (Japanese Unexamined Patent Publication No. 7-2491
03) These perform the recognition twice as the initial dictionary and the degradation correspondence dictionary as above.

【０００７】[0007]

【発明が解決しようとする課題】従来技術によれば固定
的に高品質文書を認識する際にも低品質文書と同じ処理
を用いていたため認識率と認識速度を両立することが難
しかった。According to the prior art, it is difficult to achieve both the recognition rate and the recognition speed because the same processing as that for a low-quality document is used when a high-quality document is fixedly recognized.

【０００８】また、あらゆる文書で高い認識率を実現し
ようとして必要以上に多様な学習パタンを学習させてし
まうと学習前に認識できていたパタンが認識できなくな
るという現象がおきる場合があり、十分な正読率を得る
ことが難しかった。[0008] Further, if an attempt is made to learn a variety of learning patterns more than necessary in order to realize a high recognition rate for all documents, a phenomenon may occur in which patterns that could be recognized before learning cannot be recognized. It was difficult to get a correct reading rate.

【０００９】本発明はこの問題を鑑みてなされたもの
で、既に認識済みの領域の認識結果から文書の特定領域
における認識辞書との適合度をあらかじめ推定し、もっ
とも適切であると推測されるパラメタや辞書あるいは識
別器または識別器の組合せ方をすることによって、文書
の状態に対して適切な計算量で高い精度の認識を行うこ
とを目的とする。SUMMARY OF THE INVENTION The present invention has been made in view of the above problem, and it has been presumed from the recognition result of an already recognized area that the degree of matching with a recognition dictionary in a specific area of a document has been estimated. It is an object of the present invention to perform high-precision recognition with an appropriate amount of calculation for the state of a document by using a dictionary, a classifier, or a classifier or a combination of classifiers.

【００１０】[0010]

【課題を解決するための手段】これらの課題を解決する
ために、本願発明は、文字認識の際に参照される複数の
辞書と、認識済みの文字認識結果とその信頼度の少なく
とも一方をもとに、前記辞書の適合度を測定する測定手
段と、この測定手段によって測定された適合度に応じて
前記辞書から選択された辞書を用いて文字認識を行う識
別手段とを具備することを特徴とする。In order to solve these problems, the present invention provides a plurality of dictionaries to be referred to in character recognition, a recognized character recognition result and at least one of its reliability. And measuring means for measuring the fitness of the dictionary, and identification means for performing character recognition using a dictionary selected from the dictionary according to the fitness measured by the measuring means. And

【００１１】また、文字パタンを認識するための複数の
識別手段と、認識済みの文字認識結果とその信頼度の少
なくとも一方をもとに、前記識別手段と文字との適合度
を測定する測定手段手段と、この測定手段によって測定
された適合度によって前記識別手段を選択あるいは合成
する手段とを具備することを特徴とする。A plurality of identification means for recognizing a character pattern; and a measuring means for measuring a degree of conformity between the identification means and a character based on at least one of the recognized character recognition result and its reliability. Means, and means for selecting or synthesizing the identification means according to the fitness measured by the measurement means.

【００１２】また、辞書あるいは識別手段と文字との適
合度を測定した領域を包含する領域あるいは近接領域の
適合度測定結果をもとに、適合度を再設定する適合度再
設定手段と、この適合度再設定手段によって、再設定さ
れた適合度を用いて再度文字認識を行う手段とを具備す
ることを特徴とする。[0012] Further, a fitness resetting means for resetting the fitness based on the fitness measurement result of an area including a region where the fitness between the dictionary or the identification means and the character and the character is measured or a proximity area, Means for performing character recognition again using the re-set fitness by the fitness re-setting means.

【００１３】また、認識の際に参照される複数の辞書
と、認識済みの文字認識結果とその信頼度の少なくとも
一方をもとに、前記辞書の適合度を測定する測定手段
と、この測定手段によって測定された適合度に応じて合
成または変形された辞書により文字認識を行う識別手段
とを具備することを特徴とする。[0013] Further, a measuring means for measuring the fitness of the dictionary based on a plurality of dictionaries to be referred to at the time of recognition, at least one of the recognized character recognition result and its reliability, and this measuring means And a recognition unit for performing character recognition using a dictionary synthesized or transformed in accordance with the degree of conformity measured by the method.

【００１４】また、認識済みの文字認識結果とその信頼
度の少なくとも一方をもとに、文字の変形度あるいは品
質を測定する測定手段と、文字認識の際に候補を絞り込
む大分類手段と、前記測定手段によって測定された変形
度あるいは品質をもとに、前記大分類手段による文字候
補の数を増減させる手段と、この手段による文字候補を
もとに、文字認識を行う詳細識別手段とを具備すること
を特徴とする。A measuring means for measuring the degree of deformation or quality of the character based on at least one of the recognized character recognition result and its reliability; a large classification means for narrowing down candidates in character recognition; A means for increasing or decreasing the number of character candidates by the large classification means based on the degree of deformation or quality measured by the measurement means; and a detailed identification means for performing character recognition based on the character candidates by the means. It is characterized by doing.

【００１５】また、認識済みの文字認識結果とその信頼
度の少なくとも一方をもとに、文字の変形度あるいは品
質を測定する測定手段と、この測定手段によって測定さ
れた変形度あるいは品質をもとに、類似度計算のパラメ
ータを算出する手段と、この手段で算出されたパラメー
タに基づき類似度計算を行う手段とを具備することを特
徴とする。A measuring means for measuring the degree of deformation or quality of a character based on at least one of the recognized character recognition result and its reliability, and a measuring means for measuring the degree of deformation or quality measured by the measuring means. A means for calculating a parameter of similarity calculation; and a means for calculating similarity based on the parameter calculated by this means.

【００１６】さらに、認識済みの文字認識結果とその信
頼度の少なくとも一方をもとに、文字の変形度あるいは
品質を測定する測定手段と、文字認識の際に第１の文字
候補を絞り込む複数の大分類手段と、複数の大分類手段
から出力された第１の文字候補に対して、前記測定手段
によって測定された変形度あるいは品質をもとに、第２
の文字候補を決定する手段とを具備することを特徴とす
る。Further, a measuring means for measuring the degree of deformation or quality of a character based on at least one of the recognized character recognition result and its reliability, and a plurality of first character candidates for narrowing down the first character candidates at the time of character recognition. The first character candidate output from the large classifying unit and the second character classifying unit outputs a second character candidate based on the degree of deformation or quality measured by the measuring unit.
Means for determining a character candidate.

【００１７】このように構成された文字認識装置におい
ては、文字の変形度あるいは品質に応じて複数ある大分
類手段の統合方式を変更する。このため、変形も少な
く、文書画像の画質も良質である場合には確実に読める
ため、大分類手段いずれからも出力されている候補カテ
ゴリを後段の詳細識別で識別すれば候補数を削減でき高
速に認識できる。一方、文字の変形が大きかったり文書
画像の画質が劣悪な場合には大分類手段のいずれか一つ
に出力されれば詳細識別を行い、認識処理を高精度に行
う事が出来る。In the thus configured character recognition device, the integration method of a plurality of large classification means is changed according to the degree of deformation or quality of the character. For this reason, when there is little deformation and the image quality of the document image is good, it can be read reliably, and if the candidate category output from any of the major classification means is identified by detailed identification at the subsequent stage, the number of candidates can be reduced and high speed can be achieved. Can be recognized. On the other hand, when the character deformation is large or the image quality of the document image is inferior, if it is output to any one of the large classification means, the detailed identification can be performed, and the recognition processing can be performed with high accuracy.

【００１８】[0018]

【発明の実施の形態】以下、図面に基づいて本発明の実
施例にかかる文字認識装置について説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A character recognition apparatus according to an embodiment of the present invention will be described below with reference to the drawings.

【００１９】本発明の請求項１に対応する一実施例とし
て図１に示す通り１行毎に認識し、その認識結果から現
在認識に使用している辞書と認識対象画像の適合度を推
定し、次に認識すべき認識対象を最適な辞書を用いて認
識する装置について述べる。これは従来技術と異なり、
すでに読み終えた文書領域における文字変形(あるいは
品質)の評価に基づいている。As an embodiment corresponding to claim 1 of the present invention, recognition is performed line by line as shown in FIG. 1, and the relevance between the dictionary currently used for recognition and the image to be recognized is estimated from the recognition result. Next, an apparatus for recognizing a recognition target to be recognized next using an optimal dictionary will be described. This is different from the prior art,
It is based on the evaluation of character deformation (or quality) in the already read document area.

【００２０】図２は本発明の請求項１に対応する例の文
字認識装置の構成を示すブロック図であり、文字パタン
の類似度を求める類似度計算部１１と識別の際に参照さ
れる複数の辞書１２と認識済みの認識結果を用いて辞書
との適合度を測定する適合度測定部１３と推定された適
合度に応じて辞書を選択する辞書選択部１４とによって
構成されている。FIG. 2 is a block diagram showing the configuration of a character recognition apparatus according to an embodiment of the present invention. The similarity calculation unit 11 for obtaining the similarity of a character pattern and a plurality of units referred to in identification. And a dictionary selecting unit 14 for selecting a dictionary according to the estimated degree of matching using the dictionary 12 and the recognized recognition result to measure the degree of matching with the dictionary.

【００２１】類似度計算部１１は、入力パタンに対する
認識結果と類似度を出力する。この類似度計算部１１は
辞書１２が切り替え可能な構成になっている。類似度計
算部１１と辞書１２を組み合わせたものを識別器１５と
呼ぶ。次の式は複合類似度の定義を示している。The similarity calculator 11 outputs a recognition result and a similarity for the input pattern. The similarity calculation unit 11 has a configuration in which the dictionary 12 can be switched. A combination of the similarity calculation unit 11 and the dictionary 12 is called a discriminator 15. The following equation shows the definition of the composite similarity.

【数１】辞書や重みの部分は主成分分析で求めたものである。(Equation 1) The dictionaries and weights are obtained by principal component analysis.

【００２２】適合度測定部１３は、文書の状態と類似度
計算部１１からの出力との間で定義される相関情報を保
持し、動作時にはある文書を認識した際の類似度の傾向
から文書の状態（例えばかすれ、つぶれ、未知フォント
など）を推定する手段である。The goodness-of-fit measuring unit 13 holds correlation information defined between the state of the document and the output from the similarity calculating unit 11, and operates based on the similarity tendency when a certain document is recognized during operation. (For example, blurring, crushing, unknown font, etc.).

【００２３】一般に正常なパタンによって学習された辞
書に対して、かすれたパタンやつぶれたパタンを認識し
た場合は類似度が低くなる。また認識結果となる候補文
字にもある特定の傾向が出現する場合がある。これらの
情報により、認識対象文字と辞書や各種閾値の適合度を
調べることができる。Generally, when a blurred pattern or a crushed pattern is recognized with respect to a dictionary learned by a normal pattern, the similarity is low. In addition, there is a case where a certain tendency appears in a candidate character as a recognition result. Based on these pieces of information, it is possible to check the matching degree between the recognition target character and the dictionary or various thresholds.

【００２４】また他の方法として、ストロークの太さな
どを構造解析的に調べたり、黒画素の密度あるいは黒連
結成分の個数などから、文字のかすれやつぶれを調べる
ことができる。これらの情報をデータベースとして保持
し、認識後にこのデータベースを照合することにより品
質や認識対象文字と辞書との適合性を判定することもで
きる。As another method, the thickness of a stroke or the like can be examined structurally, or the blurring or crushing of a character can be examined from the density of black pixels or the number of black connected components. It is also possible to hold such information as a database and collate the database after recognition to determine the quality and compatibility between the recognition target character and the dictionary.

【００２５】辞書選択部１４は、適合度測定部１３によ
る出力（辞書に対する適合度）に応じて辞書１２を選択
し、類似度計算部１１と合成する。The dictionary selecting section 14 selects the dictionary 12 according to the output from the fitness measuring section 13 (fitness to the dictionary), and synthesizes it with the similarity calculating section 11.

【００２６】次に請求項１に対応する例の全体的な処理
の流れを図３を用いて説明する。請求項１の本実施例の
文字認識装置では、パタンを認識するフェーズ（Ｓ１
１）、辞書に対する適合度を測定するフェーズ（Ｓ１
２）、識別部を合成するフェーズ（Ｓ１３）の３つのフ
ェーズを持つ。Next, an overall processing flow of an example corresponding to claim 1 will be described with reference to FIG. In the character recognition apparatus according to the first embodiment, a phase of recognizing a pattern (S1
1), a phase for measuring the degree of matching with the dictionary (S1)
2) and three phases of combining the identification units (S13).

【００２７】パタンを認識するフェーズＳ１１では識別
器に認識対象パタンを入力し、類似度および認識結果を
出力する。辞書に対する適合度を測定するフェーズＳ１
２では認識時の類似度に基づいて認識された領域と現在
使用している辞書との適合度を測定する。認識部を合成
するフェーズＳ１３では最も適合すると思われる辞書を
選択し、類似度計算部と合成する事により識別器を構成
する。In the phase S11 of recognizing a pattern, a pattern to be recognized is input to a classifier, and a similarity and a recognition result are output. Phase S1 for measuring the relevance to the dictionary
In step 2, the relevance between the recognized area and the dictionary currently used is measured based on the similarity at the time of recognition. In phase S13 of combining the recognition units, a dictionary that is considered to be most suitable is selected and combined with the similarity calculation unit to form a classifier.

【００２８】このフェーズＳ１１〜Ｓ１３を繰り返す事
により動的に辞書を切り替え、品質や文字の変形に適応
的な認識を行う事ができる。By repeating the phases S11 to S13, dictionaries are dynamically switched, and recognition adaptive to quality and character deformation can be performed.

【００２９】また、上述では図１のように１行単位で適
合度を求めたが文字単位や部分画像単位で適合度を求
め、辞書選択を行う事もできる。Further, in the above description, the degree of conformity is obtained in units of one line as shown in FIG. 1, but the degree of conformity may be obtained in units of characters or partial images, and a dictionary may be selected.

【００３０】図４は本発明の請求項２に対応する例の文
字認識装置の構成を示すブロック図であり、文字パタン
を識別する複数の識別器２１と他パタンの認識結果を用
いて辞書との適合度を測定する適合度測定部２２と適合
度と複数の識別結果から識別器の組合せ方法を決定する
認識部統合部２３とを具備している。FIG. 4 is a block diagram showing the configuration of a character recognition apparatus according to a second embodiment of the present invention. A plurality of discriminators 21 for identifying a character pattern and a dictionary using a recognition result of another pattern are used. And a recognizing unit integrating unit 23 that determines a combination method of classifiers from the degree of conformity and a plurality of identification results.

【００３１】識別器２１および適合度測定部２２は図２
に同じであり、図４においては識別器２１が複数存在す
る。各々の識別器２１は同じ特徴を扱っても良いが異な
る特徴を用いる事により各特徴の特性を有効に活用する
事もできる。The discriminator 21 and the fitness measurer 22 are shown in FIG.
In FIG. 4, there are a plurality of discriminators 21. Each discriminator 21 may handle the same feature, but by using different features, the characteristics of each feature can be effectively utilized.

【００３２】識別器統合部２３は適合度測定部２２によ
り測定された適合度に応じて複数存在する識別器の組合
せを決定し、その組合せに基づいて認識を行う。図５で
示すように組合せに関しては認識部を直列に結合するＣ
１だけでなく、並列に繋ぎ、特定の重み付けで結果を統
合したりＣ２、あるいは特定の条件で分岐Ｃ３やフィー
ドバックを行う事もできる。The discriminator integrating unit 23 determines a combination of a plurality of discriminators according to the fitness measured by the fitness measuring unit 22, and performs recognition based on the combination. As shown in FIG. 5, for the combination, C that couples the recognition units in series
Not only 1 but also connecting in parallel, the results can be integrated with specific weighting, C2, or branch C3 or feedback can be performed under specific conditions.

【００３３】図６は請求項２に対応する実施例の流れ図
である。図３と異なる点は辞書を選択するのではなく識
別器全体を選択する所である。もちろん請求項１のよう
に取り扱う辞書を辞書適合度等の状況に応じて選択する
機構を備えていても良い。FIG. 6 is a flow chart of an embodiment according to the second aspect. The difference from FIG. 3 is that the entire discriminator is selected instead of selecting a dictionary. As a matter of course, a mechanism for selecting a dictionary to be handled according to a situation such as a dictionary matching degree may be provided.

【００３４】図７は請求項３に対応する実施例の概念図
である。請求項１や請求項２に対応する実施例において
辞書の適合度を測定するためには一般により広い範囲の
測定結果を適用した方が精度の良い結果が得られる。例
えば大局的に低品質であると判定され、一部の局所領域
のみが高品質と推定された場合、認識の性質として誤認
識が多くなる傾向がある。その場合にも再度、大局的な
品質測定結果を用いて再度認識を行う事により、文字認
識の精度を向上する事ができる。FIG. 7 is a conceptual diagram of an embodiment according to claim 3. In the embodiments corresponding to the first and second aspects, in order to measure the degree of matching of the dictionary, a more accurate result is generally obtained by applying a wider range of measurement results. For example, if it is determined that the quality is globally low and only a part of the local area is estimated to be high quality, the recognition tends to increase in the number of erroneous recognitions. In this case, the accuracy of character recognition can be improved by performing recognition again using the global quality measurement result.

【００３５】本発明請求項４に対応する一実施例として
複数の変形に対応した辞書合成を用いた文字認識装置に
ついて述べる。図８は認識対象と辞書との適合度測定に
関する概念的説明図である。As one embodiment corresponding to claim 4 of the present invention, a character recognition apparatus using dictionary synthesis corresponding to a plurality of transformations will be described. FIG. 8 is a conceptual explanatory diagram relating to the measurement of the degree of matching between the recognition target and the dictionary.

【００３６】認識辞書を生成する際にある標準的なパタ
ン集合から生成した標準辞書とそれを例えば太め／細め
処理をして生成した２つの辞書および傾きを左右に施し
た２つの辞書の計５辞書を予め準備しておく。入力パタ
ン対して複数の辞書で認識を行う。認識結果が同一であ
る場合にはその類似度に応じて文字の変形情報を得て、
蓄積しておく。When a recognition dictionary is generated, a standard dictionary generated from a standard pattern set, two dictionaries generated by subjecting the standard dictionary to thickening / thinning processing, and two dictionaries tilted to the left and right are used. Prepare a dictionary in advance. Recognize input patterns with multiple dictionaries. If the recognition results are the same, character deformation information is obtained according to the similarity,
Accumulate.

【００３７】また５辞書による認識結果に相違がある場
合には再度認識する印として対象文字パタンを登録して
おく。一定の領域について認識が終わったところで領域
内の変形の傾向を蓄積された変形情報から得て、それに
基づいて辞書を合成する。If there is a difference in the recognition result by the five dictionaries, the target character pattern is registered as a mark to be recognized again. When the recognition of a certain area is completed, the tendency of deformation in the area is obtained from the stored deformation information, and a dictionary is synthesized based on the obtained information.

【００３８】辞書の合成方法は変形辞書の標準辞書に対
する差分を変形情報の重みで加える。以上の方法で得ら
れた辞書は領域内の変形の傾向に最も近い辞書であるの
で精度の高い認識を行うことができる。In the dictionary synthesizing method, a difference between the transformed dictionary and the standard dictionary is added by the weight of the transformed information. Since the dictionary obtained by the above method is the dictionary closest to the tendency of deformation in the area, highly accurate recognition can be performed.

【００３９】図９は請求項４に対応する例の実行手順を
示している。文字認識の結果の他に平均パタンとの差分
を出力し、その傾向から辞書を変形する。このようにし
て変形された辞書を用いれば文書の劣化傾向やフォント
の傾向に追従でき、ひいては精度良く認識を行うことが
できる。FIG. 9 shows an execution procedure of an example corresponding to claim 4. The difference from the average pattern is output in addition to the result of character recognition, and the dictionary is transformed based on the tendency. By using the dictionary modified in this way, it is possible to follow the tendency of deterioration of the document and the tendency of the font, and it is possible to perform recognition with high accuracy.

【００４０】図１０は請求項４の一例であり、文字単位
でつぶれ度を測定し、次のパタンに対して算出されたつ
ぶれ度で次パタンの辞書を変形し、認識を行うことを表
している。FIG. 10 shows an example of claim 4, in which the degree of crushing is measured in units of characters, and the dictionary of the next pattern is transformed with the degree of crushing calculated for the next pattern to perform recognition. I have.

【００４１】図１１はつぶれ度を算出する際の概念を示
している。カテゴリ毎の辞書を定義している空間では、
空間中にある各特徴ベクトルの平均ベクトルを標準パタ
ンとして定義できる。標準パタンからの位置を検出し、
様々な位置での劣化の傾向（例えば傾きやかすれ、つぶ
れなど）を予め調査しておく事により、入力パタンの劣
化の傾向を知る事ができる。文書中の文字あるいは局所
領域毎に劣化の傾向が著しく異なる事は少ない。したが
って同じ劣化傾向を想定して隣り合う認識対象領域を認
識する事により文書の劣化傾向に対して適応的な認識辞
書を構成でき、認識精度を向上することができる。FIG. 11 shows the concept of calculating the degree of crush. In a space that defines a dictionary for each category,
The average vector of each feature vector in the space can be defined as a standard pattern. Detects the position from the standard pattern,
By investigating the tendency of deterioration at various positions (for example, inclination, blurring, crushing, etc.) in advance, it is possible to know the tendency of deterioration of the input pattern. It is unlikely that the tendency of deterioration will differ significantly for each character or local area in a document. Therefore, by recognizing adjacent recognition target areas assuming the same deterioration tendency, a recognition dictionary adaptive to the deterioration tendency of the document can be formed, and the recognition accuracy can be improved.

【００４２】本発明の請求項５に対応する一実施例とし
て文字の変形度（あるいは品質）に応じて大分類による
文字候補数を決定する文字認識装置について述べる。本
発明の構成としては例えば図１２のように大分類／中分
類／詳細識別といった多段構成の認識部を考える。ま
た、各分類部および詳細識別部は文字変形度（あるいは
品質）を入力として内部パラメタを変更することができ
る。As one embodiment corresponding to the fifth aspect of the present invention, a character recognition apparatus for determining the number of character candidates according to a large classification according to the degree of deformation (or quality) of a character will be described. As a configuration of the present invention, for example, a recognition unit having a multi-stage configuration such as large classification / intermediate classification / detailed identification as shown in FIG. 12 is considered. In addition, each classification unit and the detailed identification unit can change internal parameters by using the degree of character deformation (or quality) as input.

【００４３】図１３は文字変形度に応じて候補数を増減
する場合の概念を説明した図である。大分類部から詳細
識別部への候補リストを推定された文字の変形度に応じ
て増減する事により詳細識別部での計算量を増減する。FIG. 13 is a diagram for explaining the concept of increasing or decreasing the number of candidates according to the degree of character deformation. By increasing or decreasing the candidate list from the large classification unit to the detailed identification unit according to the estimated degree of deformation of the character, the amount of calculation in the detailed identification unit is increased or decreased.

【００４４】例えば文字の変形度の測定に類似度を用い
る場合、１位類似度と（１位類似度−２位類似度）によ
り文字の変形度を推定する(図１４参照)。推定された変
形度を用いて大分類の候補数を決定する(図１５参照)。
変形度が大きいと推定された場合には詳細識別で認識す
べき文字候補数を多くし、変形度が小さいと推定された
場合には文字候補数を少なくする。For example, when the similarity is used for measuring the degree of deformation of a character, the degree of deformation of the character is estimated from the first-place similarity and (first-place similarity-second-place similarity) (see FIG. 14). The number of large classification candidates is determined using the estimated degree of deformation (see FIG. 15).
When the degree of deformation is estimated to be large, the number of character candidates to be recognized in the detailed identification is increased, and when the degree of deformation is estimated to be small, the number of character candidates is reduced.

【００４５】本発明の請求項６に対応する一実施例とし
て文字の変形度に応じて認識部の計算精度(あるいは計
算量)を決定するパラメタを変更する文字認識装置につ
いて述べる。本発明では例えば複合類似度法を用いた場
合、以下の式におけるパラメタAs one embodiment corresponding to claim 6 of the present invention, a character recognition apparatus for changing a parameter for determining the calculation accuracy (or calculation amount) of a recognition unit according to the degree of deformation of a character will be described. In the present invention, for example, when the compound similarity method is used, the parameters in the following equation

【数２】は一般に変形度が大きくなると大きな値が必要となる。
変形度が小さいと判断された場合には一旦類似度を途中
までで求めておき、その中の上位候補についてのみ残り
を計算する。(Equation 2) Generally, a larger value is required as the degree of deformation increases.
When it is determined that the degree of deformation is small, the degree of similarity is once obtained in the middle, and the remainder is calculated only for the top candidates among them.

【数３】この式において(Equation 3) In this equation

【数４】の途中、即ちＸで一旦、計算を中断して途中段階の類似
度を求める。結果をソートし、文字の変形度が小さい場
合、残りは計算しないか、あるいは非常に少ない上位候
補のみ計算を続行する。変形度が大きい場合には途中段
階では計算は中断せず最後まで計算する。これにより、
変形の少ないパタンに対しては計算量を大幅に削減でき
る。(Equation 4) , The calculation is temporarily stopped at X, and the similarity at the intermediate stage is obtained. The result is sorted, and if the degree of deformation of the character is small, the rest is not calculated, or the calculation is continued with only a very small number of upper candidates. When the degree of deformation is large, the calculation is not interrupted in the middle stage but is calculated to the end. This allows
For patterns with little deformation, the amount of calculation can be greatly reduced.

【００４６】本発明の請求項７に対応する一実施例とし
て文字の変形度に応じて複数の大分類部文字候補リスト
の統合方法を変更する文字認識装置について述べる。As one embodiment corresponding to claim 7 of the present invention, a character recognition apparatus for changing the method of integrating a plurality of large classification part character candidate lists in accordance with the degree of character deformation will be described.

【００４７】本発明は例えば図１６のように複数の様々
な特徴に対する大分類部から出力される文字候補リスト
（第１の文字候補）において入力パタンの文字変形度が
大きいと推定された場合には候補のＯＲを取り、文字変
形度が小さい場合には候補のＡＮＤを取る（第２の文字
候補）。The present invention is applicable to a case where the character deformation degree of the input pattern is estimated to be large in the character candidate list (first character candidate) output from the large classification unit for a plurality of various features as shown in FIG. Takes the OR of the candidates and takes the AND of the candidates if the degree of character deformation is small (second character candidate).

【００４８】この方法によれば異なる特徴を用いる相乗
効果によってＯＲの場合にはさらに分類時の候補洩れを
少なくし、ＡＮＤの場合には候補数を大幅に削減でき
る。According to this method, due to a synergistic effect using different features, in the case of OR, the leakage of candidates at the time of classification can be further reduced, and in the case of AND, the number of candidates can be greatly reduced.

【００４９】また、本願発明の実施例における処理をコ
ンピュータで実行可能なプログラムで実現し、本願発明
は、このプログラムをコンピュータで読み取り可能な記
憶媒体として実現することも可能である。The processing in the embodiment of the present invention is realized by a computer-executable program, and the present invention can be realized by a computer-readable storage medium.

【００５０】なお、本願発明における記憶媒体として
は、磁気ディスク、フロッピーディスク、ハードディス
ク、光ディスク（ＣＤ−ＲＯＭ，ＣＤ−Ｒ，ＤＶＤ
等）、光磁気ディスク（ＭＯ等）、半導体メモリ等、プ
ログラムを記憶でき、かつコンピュータが読み取り可能
な記憶媒体であれば、その記憶形式は何れの形態であっ
てもよい。The storage medium in the present invention includes a magnetic disk, a floppy disk, a hard disk, and an optical disk (CD-ROM, CD-R, DVD).
Etc.), a magneto-optical disk (MO or the like), a semiconductor memory, or any other storage medium that can store a program and that can be read by a computer.

【００５１】また、記憶媒体からコンピュータにインス
トールされたプログラムの指示に基づきコンピュータ上
で稼動しているＯＳ（オペレーションシステム）や、デ
ータベース管理ソフト、ネットワーク等のＭＷ（ミドル
ウェア）等が本実施形態を実現するための各処理の一部
を実行してもよい。An OS (operation system) running on the computer, database management software, MW (middleware) such as a network, etc., based on instructions of a program installed in the computer from the storage medium realizes this embodiment. May be executed.

【００５２】さらに、本願発明における記憶媒体は、コ
ンピュータと独立した媒体に限らず、ＬＡＮやインター
ネット等により伝送されたプログラムをダウンロードし
て記憶または一時記憶した記憶媒体も含まれる。Further, the storage medium in the present invention is not limited to a medium independent of a computer, but also includes a storage medium in which a program transmitted via a LAN, the Internet, or the like is downloaded and stored or temporarily stored.

【００５３】また、記憶媒体は１つに限らず、複数の媒
体から本実施形態における処理が実行される場合も、本
発明における記憶媒体に含まれ、媒体の構成は何れの構
成であってもよい。Further, the number of storage media is not limited to one, and a case where the processing in the present embodiment is executed from a plurality of media is also included in the storage medium of the present invention, and the configuration of the medium is any configuration. Good.

【００５４】なお、本願発明におけるコンピュータは、
記憶媒体に記憶されたプログラムに基づき、本実施形態
における各処理を実行するものであって、パソコン等の
１つからなる装置、複数の装置がネットワーク接続され
たシステム等の何れの構成であってもよい。The computer according to the present invention is:
The computer executes each process in the present embodiment based on a program stored in a storage medium, and may have any configuration such as an apparatus such as a personal computer or a system in which a plurality of apparatuses are connected to a network. Is also good.

【００５５】また、本願発明におけるコンピュータと
は、パソコンに限らず、情報処理機器に含まれる演算処
理装置、マイコン等も含み、プログラムによって本願発
明の機能を実現することが可能な機器、装置を総称して
いる。Further, the computer in the present invention is not limited to a personal computer, but also includes an arithmetic processing unit, a microcomputer, and the like included in an information processing device, and generically refers to devices and devices capable of realizing the functions of the present invention by a program. are doing.

【００５６】[0056]

【発明の効果】以上に述べてきたように本発明によれば
パタン認識時に予め品質を推定し、その結果に応じて次
のパタンを認識することにより必要十分な処理量で高い
精度で文字パタンを認識できる。As described above, according to the present invention, the quality is pre-estimated at the time of pattern recognition, and the next pattern is recognized according to the result, whereby the character pattern can be processed with a necessary and sufficient processing amount and with high accuracy. Can be recognized.

[Brief description of the drawings]

【図１】本願発明の請求項１に対応する例の概念説明図
である。FIG. 1 is a conceptual explanatory diagram of an example corresponding to claim 1 of the present invention.

【図２】本願発明の請求項１に対応する文字認識装置の
ブロック図である。FIG. 2 is a block diagram of a character recognition device according to claim 1 of the present invention.

【図３】本願発明の請求項１に対応する文字認識装置の
処理の流れ図である。FIG. 3 is a flowchart of processing of a character recognition device according to claim 1 of the present invention.

【図４】本願発明の請求項２に対応する文字認識装置
のブロック図である。FIG. 4 is a block diagram of a character recognition device according to claim 2 of the present invention.

【図５】本願発明の請求項２に対応する文字認識装置の
統合処理の概念を説明するための図である。FIG. 5 is a diagram for explaining the concept of integration processing of a character recognition device according to claim 2 of the present invention.

【図６】本願発明の請求項２に対応する文字認識装置の
処理の流れ図である。FIG. 6 is a flowchart of a process performed by the character recognition device according to claim 2 of the present invention;

【図７】本願発明の請求項３に対応する例の概念説明図
である。FIG. 7 is a conceptual explanatory view of an example corresponding to claim 3 of the present invention.

【図８】本願発明の請求項４に対応する例の辞書変形の
場合の概念図である。FIG. 8 is a conceptual diagram in the case of dictionary modification of an example corresponding to claim 4 of the present invention.

【図９】本願発明の請求項４に対応する例の辞書変形の
場合の流れ図である。FIG. 9 is a flowchart in the case of dictionary modification of an example corresponding to claim 4 of the present invention.

【図１０】本願発明の請求項４に対応する例の概念説明
図である。FIG. 10 is a conceptual explanatory diagram of an example corresponding to claim 4 of the present invention.

【図１１】本願発明の請求項４に対応する例の概念説明
図である。FIG. 11 is a conceptual explanatory diagram of an example corresponding to claim 4 of the present invention.

【図１２】本願発明の請求項５に対応する例の全体構成
図である。FIG. 12 is an overall configuration diagram of an example corresponding to claim 5 of the present invention.

【図１３】本願発明の請求項５に対応する例の概念説明
図である。FIG. 13 is a conceptual explanatory view of an example corresponding to claim 5 of the present invention.

【図１４】本願発明の請求項５に対応する例の類似度と
変形度との相関図である。FIG. 14 is a correlation diagram between the similarity and the degree of deformation of the example corresponding to claim 5 of the present invention.

【図１５】本願発明の請求項５に対応する例の変形度と
候補文字数である。FIG. 15 shows the degree of deformation and the number of candidate characters in an example corresponding to claim 5 of the present invention.

【図１６】本願発明の請求項７に対応する例の候補文字
リストの統合を示す図である。FIG. 16 is a diagram showing integration of a candidate character list in an example corresponding to claim 7 of the present invention.

[Explanation of symbols]

１１…類似度計算部１２、２１…辞書１３、２２…適合度測定部１４…辞書選択部１５…識別器２３…識別器統合部 11: Similarity calculation unit 12, 21: Dictionary 13, 22 ... Fitness measurement unit 14: Dictionary selection unit 15: Classifier 23: Classifier integration unit

Claims

[Claims]

1. A plurality of dictionaries to be referred to in character recognition, a measuring means for measuring the fitness of the dictionary based on at least one of the recognized character recognition result and its reliability, A character recognition unit that performs character recognition using a dictionary selected from the dictionaries in accordance with the degree of conformity measured by the means.

2. A plurality of identification means for recognizing a character pattern, and measurement means for measuring a degree of conformity between the identification means and a character based on at least one of a recognized character recognition result and its reliability. And a means for selecting or synthesizing the identification means according to the degree of conformity measured by the measurement means.

3. A fitness resetting means for resetting fitness based on a fitness measurement result of a region including a region where the fitness of the dictionary or the identification means and the character is measured or a proximity area, 3. The character recognition apparatus according to claim 1, further comprising: means for performing character recognition again by using the re-set relevance by the re-set relevance unit.

4. A plurality of dictionaries to be referred to at the time of recognition, a measuring means for measuring the fitness of the dictionary based on at least one of a recognized character recognition result and its reliability, and a measuring means. A character recognition unit that performs character recognition using a dictionary synthesized or modified in accordance with the degree of conformity measured by the character recognition apparatus.

5. A measuring means for measuring the degree of deformation or quality of a character based on at least one of a recognized character recognition result and its reliability, a large classification means for narrowing down candidates in character recognition, A means for increasing or decreasing the number of character candidates by the large classification means based on the degree of deformation or quality measured by the measurement means; and a detailed identification means for performing character recognition based on the character candidates by the means. A character recognition device.

6. A measuring means for measuring the degree of deformation or quality of a character based on at least one of a recognized character recognition result and its reliability, and a measuring means for measuring the degree of deformation or quality measured by the measuring means. A character recognition device comprising: means for calculating a parameter for similarity calculation; and means for performing similarity calculation based on the parameter calculated by this means.

7. A measuring means for measuring the degree of deformation or quality of a character based on at least one of a recognized character recognition result and its reliability, and a plurality of first character candidates for narrowing down first character candidates in character recognition. Major classification means; and means for determining a second character candidate for the first character candidates output from the plurality of major classification means based on the degree of deformation or quality measured by the measurement means. A character recognition device comprising: