JPH08263587A

JPH08263587A - Method and device for document input

Info

Publication number: JPH08263587A
Application number: JP7061404A
Authority: JP
Inventors: Junichi Kubota; 淳市久保田; Yoshimoto Yamamoto; 喜大山本; Yukie Shoda; 幸恵庄田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1995-03-20
Filing date: 1995-03-20
Publication date: 1996-10-11

Abstract

PURPOSE: To provide the document input device which can correctly recognize an inputted document image in a short time and easily correct the recognition result, and print the result with character size well balanced to a ruled-line frame. CONSTITUTION: A CPU 120 generates a correction result which can grammatically be interpreted from a combination of candidate codes having likelihood above a likelihood level in a likelihood level temporary storage part and displays a 1st candidate for the correction result on a display 106 each time the correction result is generated for a specific number of recognition sequences. The CPU 120 compares the correction result after corrections by an operator with an identical recognition sequence consisting of candidate character codes having the maximum likelihood and outputs the precision of character recognition. When the calculated precision is less than a precision level in a precision level temporary storage part, the reference value of likelihood in the likelihood level temporary storage part is varied.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、画像として読み取った
文書を、ワードプロセッサ、パーソナルコンピュータ等
のコンピュータ制御による文字認識を用いて文字コード
として入力する文書入力方法および文書入力装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document input method and a document input device for inputting a document read as an image as a character code by using character recognition under computer control of a word processor, a personal computer or the like.

【０００２】[0002]

【従来の技術】文書を効率的に入力するためＯＣＲ（Ｏ
ｐｔｉｃａｌＣｈａｒａｃｔｅｒＲｅａｄｅｒ）や手
書き文字認識装置が開発、利用されている。このような
ＯＣＲあるいは手書き文字認識装置では、紙面などに書
かれた文字を画像として読み取り、その画像を１文字単
位に切り出して文字特徴を抽出し、抽出した文字特徴か
らその画像が表す文字を認識する。このような文字認識
において、例えば、漢字などの文字を認識する場合、漢
字などは類似文字が極めて多いために、文字単位の認識
処理では正解率に限界があり、文字認識過程で認識誤り
が発生する。この認識誤りを補正するため、認識結果と
単語辞書とを照合し、認識結果として得られた文字の並
びが日本語として適切か否かを判断することによって、
認識誤り補正処理が行われる。このような認識誤り補正
処理の例が、丹羽、萱島による学会報告「電子情報通信
学会技術報告ＰＲＵ９１−１３５文字認識後処理法と
後処理による効果の分析」などに開示されている。前記
学会報告では、文字認識過程で発生する認識誤りを、７
０％の誤り訂正率で補正することができた例について言
及されており、このような方法による認識誤り補正処理
が、認識結果として入力される日本語文書の正解率を高
めるのに有用であることが判る。2. Description of the Related Art OCR (O
A physical character reader) and a handwritten character recognition device have been developed and used. In such an OCR or handwritten character recognition device, a character written on a paper surface is read as an image, the image is cut out in character units to extract character characteristics, and the character represented by the image is recognized from the extracted character characteristics. To do. In such character recognition, for example, when recognizing characters such as Kanji, there are a lot of similar characters in Kanji, so there is a limit to the correct answer rate in character recognition processing, and recognition errors occur in the character recognition process. To do. In order to correct this recognition error, the recognition result is compared with the word dictionary, and it is determined whether the sequence of characters obtained as the recognition result is appropriate as Japanese.
Recognition error correction processing is performed. An example of such recognition error correction processing is disclosed in the academic report by Niwa and Kayashima, "Technical Report of the Institute of Electronics, Information and Communication Engineers PRU91-135 character recognition post-processing method and analysis of effect of post-processing". According to the academic conference report, the recognition error generated in the character recognition process is
Reference is made to an example in which correction can be performed with an error correction rate of 0%, and the recognition error correction processing by such a method is useful for increasing the correct answer rate of a Japanese document input as a recognition result. I understand.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、日本語
の帳票（ここで、「帳票」とは、帳簿、伝票、届け出用
紙など、必要事項を記入するための空欄を設けて印刷し
た事務用紙をいう。）、レポート、パンフレット等の文
書に多く見られるように、文字が所定の枠内あるいは所
定の区間内に一定間隔で配置された、すなわち、文字が
均等割りつけされた日本語文書を画像から文字コードに
変換して入力する場合、従来の認識誤り補正処理では、
文字と文字との間の空白を単語区切りと見なしてしま
い、単語辞書との照合がうまくいかず、補正処理の効果
を十分に発揮することができなかった。However, a Japanese form (here, "form" means an office paper, such as a book, a slip, and a notification form, printed with blanks for entering necessary information. .), Reports, pamphlets, etc., are often seen in documents, the characters are arranged at regular intervals in a predetermined frame or in a predetermined interval, that is, Japanese documents in which characters are evenly distributed from an image are displayed. When inputting after converting to a character code, conventional recognition error correction processing
The white space between the characters was regarded as a word delimiter, the collation with the word dictionary was not successful, and the effect of the correction process could not be fully exerted.

【０００４】さらに、前述のような均等割付された文字
列を、各文字が紙面上のどの位置に配置されていたかを
示すレイアウト情報を残したまま文字認識する場合、認
識結果として得られた文字コードを前記レイアウト情報
とは別に管理していたため、入力結果の文字列を修正す
る場合、修正の手間がかかるという問題があった。具体
的には、例えば、「備考」が均等割付されて「備」と
「考」との間に空白が入っていたとき、従来の文書入力
装置では、「備」と「考」とに分割された文字認識をし
てしまう。この場合、「備考」という文字列を「その
他」に変更したいとき、「備」を「そ」に、「考」を
「他」にと認識結果である個別の文字毎に修正し、さら
にその間の空白に「の」を一文字追加するという手間が
かかってしまう。Further, when character recognition is performed on the above-mentioned uniformly-allocated character string while leaving layout information indicating at which position on the paper each character is arranged, the character obtained as a recognition result. Since the code is managed separately from the layout information, there is a problem that it takes time and effort to correct the character string of the input result. Specifically, for example, when "notes" are evenly allocated and there is a space between "notes" and "notes", the conventional document input device divides them into "notes" and "notes". The recognized characters will be recognized. In this case, when you want to change the character string "Remarks" to "Other", modify "Bi" to "So" and "Remark" to "Other" for each individual character that is the recognition result, and further It takes a lot of trouble to add a single "no" to the blank space.

【０００５】また、帳票枠内文字のように、レイアウト
上は縦書きと横書きの両方の解釈が成り立つ場合、認識
しようとする文字列の方向が判別できない上、前記文字
列方向について間違った解釈をすると、補正処理の効果
が発揮できないという問題があった。また、住所表示を
画像から文字コードとして入力する場合、住所表示には
類似字形の地名が多くあることから、そのような地名を
認識誤りし易いという問題があった。Further, when the layout has both vertical and horizontal interpretations, such as characters in the form frame, the direction of the character string to be recognized cannot be determined, and the character string direction cannot be correctly interpreted. Then, there is a problem that the effect of the correction process cannot be exhibited. In addition, when inputting an address display as a character code from an image, there are many place names having similar glyphs in the address display, and thus there is a problem that such place names are likely to be erroneously recognized.

【０００６】また、コピーされた原稿などが入力される
場合、コピーむら等のため原稿の一部が薄くなり、文字
認識率が大幅に低下することがある。従来の認識誤り補
正処理では、このような場合にも画一的な処理をしてい
たため、画質の悪化などのため文字認識性能の大幅な低
下が生じた場合に、必ずしも最適な候補を提示すること
ができなかった。逆に、原稿の画質が良く、はじめから
認識率が高い場合にも、最悪の場合を想定した認識誤り
補正処理を行っていたので、後処理に長い時間を要して
いた。Further, when a copied original or the like is input, a part of the original may be thinned due to uneven copy and the character recognition rate may be significantly lowered. In the conventional recognition error correction processing, uniform processing is performed even in such a case, and therefore, when the character recognition performance is significantly deteriorated due to deterioration of image quality or the like, the optimum candidate is always presented. I couldn't. On the other hand, even when the image quality of the original is good and the recognition rate is high from the beginning, the recognition error correction processing is performed assuming the worst case, and therefore the post-processing requires a long time.

【０００７】また、従来の文書入力装置では、文書入力
装置に予め設定されていた印字文字サイズでそのまま印
刷してしまうので、認識結果として得られた文字を、設
定された罫線枠に比べて釣り合いの悪い文字サイズで印
刷することがあった。上記課題に鑑みて、本発明は、均
等割り付けされた文字、文字列方向がどちらともとれる
帳票枠内文字、住所表示中の地名および印字画質の悪い
原稿中の文字など、文書の種別や原稿の状態にかかわら
ず、文書を正しく認識し、認識後の文字の修正を容易に
行うことができるとともに、認識誤り補正処理にかかる
時間を短縮し、設定された罫線枠に対して釣り合いの取
れたサイズの文字を印刷することができる文書入力装置
を提供することを目的とする。Further, in the conventional document input device, since the characters are printed as they are in the print character size preset in the document input device, the characters obtained as a recognition result are balanced as compared with the set ruled line frame. There was a case that it was printed with a bad character size. In view of the above-mentioned problems, the present invention provides a document type and a document such as characters evenly distributed, characters in a form frame that can be taken in both character string directions, place names in an address display, and characters in a document with poor print quality. Regardless of the status, the document can be recognized correctly and the characters after recognition can be easily corrected, the time required for recognition error correction processing can be shortened, and the size can be balanced with the set ruled line frame. It is an object of the present invention to provide a document input device capable of printing the characters.

【０００８】[0008]

【課題を解決するための手段】上記課題に鑑みて、本発
明の請求項１記載の文書入力装置は、１つの文字の画像
の文字認識結果として、文字コードの複数の候補を、そ
の確からしさを示す尤度と対応付けて記憶する文字コー
ド候補記憶手段と、前記尤度のあらかじめ定める尤度基
準値を記憶する尤度基準記憶手段と、取り込まれた文書
の画像中の文字列を表す同一の文字列画像について、前
記尤度基準記憶手段内の前記尤度基準値以上の尤度を有
する前記文字コードの候補の組み合わせから、文法的に
正しく解釈することができる文字列の複数の候補を生成
する文字列候補生成手段と、前記文字列画像に対応して
生成された前記文字列の候補に対して、同一の前記文字
列画像について生成された文字列の候補のうち、所定の
候補を表示する表示手段と、前記文字列画像に対応して
生成された前記文字列の候補に対して、オペレータから
の修正指示を入力する入力手段と、前記文字列画像に対
応して生成された前記文字列の候補に対して、前記修正
指示に従って、誤って認識されている文字コードを、正
しい文字コードに修正する修正手段と、前記修正手段に
よる修正の後、前記文字列画像に対応する修正後の文字
列と、同一文字列画像に対応する最も尤度の高い文字コ
ードの候補からなる文字列とを比較する比較手段と、前
記比較手段による比較の後、前記比較結果に基づいて、
前記両者の間で相違する文字数と前記一方の文字列に含
まれる文字数との割合から、文字認識の正解率を算出す
る正解率算出手段と、前記正解率のあらかじめ定める正
解率基準値を記憶する正解率基準記憶手段と、前記正解
率が算出された後、算出された正解率が、前記正解率基
準値と比較して所定の条件を満足する場合には、算出さ
れた正解率に応じて尤度基準記憶手段内の前記尤度基準
値を変更する尤度基準値変更手段とを備える。In view of the above problems, the document input device according to the first aspect of the present invention proposes a plurality of character code candidates as the character recognition result of an image of one character. Is stored in association with the likelihood, the likelihood reference storage means for storing a predetermined likelihood reference value of the likelihood, and the same character string representing the character string in the image of the captured document. A plurality of character string candidates that can be grammatically correctly interpreted from a combination of the character code candidates having a likelihood equal to or higher than the likelihood reference value in the likelihood reference storage means. With respect to the character string candidate generating means to generate and the character string candidates generated corresponding to the character string image, a predetermined candidate is selected from the character string candidates generated for the same character string image. Table to display Means, input means for inputting a correction instruction from the operator to the character string candidates generated corresponding to the character string image, and the character string generated corresponding to the character string image. In accordance with the correction instruction, the correction means corrects the character code that is erroneously recognized to the correct character code, and the corrected character string corresponding to the character string image after the correction by the correction means. And a comparing means for comparing a character string composed of a character code candidate with the highest likelihood corresponding to the same character string image, and after the comparison by the comparing means, based on the comparison result,
A correct answer rate calculating means for calculating a correct answer rate of character recognition from a ratio of the number of different characters between the two and the number of characters included in the one character string, and a predetermined correct answer rate reference value of the correct answer rate is stored. Correctness rate reference storage means, after the correctness rate is calculated, the calculated correctness rate, in comparison with the correctness rate reference value, if a predetermined condition is satisfied, depending on the calculated correctness rate Likelihood reference value changing means for changing the likelihood reference value in the likelihood reference storage means.

【０００９】請求項２記載の文書入力装置は、１つの文
字の画像の文字認識結果として、文字コードの複数の候
補を、その確からしさを示す尤度と対応付けて記憶する
文字コード候補記憶手段と、前記尤度のあらかじめ定め
る尤度基準値を記憶する尤度基準記憶手段と、取り込ま
れた文書の画像中の文字列を表す同一の文字列画像につ
いて、前記尤度基準記憶手段内の前記尤度基準値以上の
尤度を有する前記文字コードの候補の組み合わせから、
文法的に正しく解釈することができる文字列の複数の候
補を生成する文字列候補生成手段と、前記文字列画像に
対応して生成された前記文字列の候補に対して、同一の
前記文字列画像について生成された文字列の候補のう
ち、所定の候補を表示する表示手段と、前記文字列画像
に対応して生成された前記文字列の候補に対して、オペ
レータからの修正指示を入力する入力手段と、前記入力
手段から入力された修正指示を計数する修正指示計数手
段と、前記修正指示数のあらかじめ定める修正指示数基
準値を記憶する修正指示基準記憶手段と、前記文字列画
像に対応して生成された前記文字列の候補に対して、前
記修正指示に従って、誤って認識されている文字コード
を、正しい文字コードに修正する修正手段と、前記修正
手段による修正の後、計数された修正指示数が、前記修
正指示基準記憶手段に記憶されている修正指示数基準値
と比較して所定の条件を満足する場合には、計数された
修正指示数に応じて尤度基準記憶手段内の前記尤度基準
値を変更する尤度基準値変更手段とを備える。According to another aspect of the document input device of the present invention, as a character recognition result of an image of one character, a plurality of character code candidates are stored in association with a likelihood indicating the likelihood of the character code. And a likelihood reference storage unit that stores a predetermined likelihood reference value of the likelihood, and the same character string image that represents a character string in an image of a captured document, in the likelihood reference storage unit. From a combination of the character code candidates having a likelihood equal to or higher than the likelihood reference value,
Character string candidate generating means for generating a plurality of character string candidates that can be grammatically correctly interpreted, and the same character string for the character string candidates generated corresponding to the character string image. A display unit for displaying a predetermined candidate of the character string candidates generated for the image and a correction instruction from the operator for the character string candidate generated corresponding to the character string image. Input means, correction instruction counting means for counting the correction instructions input from the input means, correction instruction reference storage means for storing a predetermined reference value of correction instruction number of the correction instruction number, and corresponding to the character string image Correction means for correcting a character code that is erroneously recognized to a correct character code in accordance with the correction instruction for the candidate of the character string generated by When the counted number of correction instructions is compared with the correction instruction number reference value stored in the correction instruction reference storage means and satisfies a predetermined condition, the likelihood criterion is calculated according to the counted number of correction instructions. Likelihood reference value changing means for changing the likelihood reference value in the storage means.

【００１０】請求項３記載の文書入力装置は、一行の文
字列を表す文字列画像から切り出された１文字の画像
と、それに隣接する１文字の画像との間の空白を検出す
る空白検出手段と、検出された空白が所定の大きさより
大きい場合、前記１文字の画像とそれに隣接する１文字
の画像とは、それぞれ、異なる単語に属する文字である
と識別する単語識別手段と、取り込まれた文書の画像中
の所定の領域内の文字列画像について、前記単語識別手
段の識別結果を無効とする単語識別限定手段とを備え
る。A document input device according to a third aspect of the present invention is a space detecting means for detecting a space between an image of one character cut out from a character string image representing one line of a character string and an image of one character adjacent thereto. When the detected blank is larger than a predetermined size, the one-character image and the one-character image adjacent thereto are each identified by a word identifying unit that identifies them as characters belonging to different words. A word identification limiting unit that invalidates the identification result of the word identification unit for a character string image in a predetermined area in the image of the document.

【００１１】請求項４記載の文書入力装置は、取り込ま
れた文書の画像から切り出された１文字の画像の文字認
識結果として文字コードの複数の候補を記憶する文字コ
ード候補記憶手段と、取り込まれた文書の画像中の文字
列を表す同一の文字列画像について、前記文字コードの
候補を組み合わせ、文法的に正しく解釈することができ
る文字列の複数の候補を生成する文字列候補生成手段
と、生成された文字列の候補を格納する文字列候補記憶
手段と、１つの文字列画像に対応する所定の文字列の候
補を、前記文字列候補記憶手段から読み出して、識別さ
れた罫線枠内に均等割り付けして表示する表示手段とを
備える。According to another aspect of the present invention, there is provided a document input device, which stores character code candidate storage means for storing a plurality of character code candidates as a character recognition result of an image of one character cut out from an image of a captured document. For the same character string image representing a character string in the image of the document, a character string candidate generating unit that combines the character code candidates and generates a plurality of character string candidates that can be grammatically correctly interpreted, A character string candidate storage unit that stores the generated character string candidates and a predetermined character string candidate that corresponds to one character string image are read from the character string candidate storage unit and are stored in the identified ruled line frame. Display means for displaying by equally allocating.

【００１２】請求項５記載の文書入力装置は、取り込ま
れた文書の画像から切り出された１文字の画像の文字認
識結果として文字コードの複数の候補を記憶する文字コ
ード候補記憶手段と、取り込まれた文書の画像内の画像
領域と空白領域との分布に基づいて、取り込まれた文書
の画像内の文字列の方向が縦横いずれの方向であるかを
識別する文字列方向識別手段と、取り込まれた文書の画
像内の画像領域と空白領域との分布からは、文字列の方
向が縦横いずれの方向であるとも解釈できる場合、前記
両方向について、それぞれ、文字列を表す同一の文字列
画像について、前記文字コードの候補を組み合わせ、文
法的に正しく解釈することができる文字列の複数の候補
を生成する文字列候補生成手段と、前記縦方向の文字列
画像について生成された文字列の候補のうちの所定の候
補と、前記横方向の文字列画像について生成された文字
列の候補のうちの所定の候補とを比較して、前記所定の
候補が予め定める条件を満足する方の文字列方向および
その文字列の候補を有効と判定する文字列方向判定手段
とを備える。According to a fifth aspect of the present invention, there is provided a document input device, which stores character code candidate storage means for storing a plurality of character code candidates as a character recognition result of an image of one character cut out from an image of a captured document. A character string direction identifying means for identifying whether the direction of the character string in the captured image of the document is vertical or horizontal, based on the distribution of the image area and the blank area in the image of the captured document; From the distribution of the image area and the blank area in the image of the document, when it can be interpreted that the direction of the character string is either vertical or horizontal, in both directions, the same character string image representing the character string, Character string candidate generation means for combining the character code candidates to generate a plurality of character string candidates that can be grammatically correctly interpreted, and for the vertical character string image A predetermined candidate among the character string candidates that have been generated and a predetermined candidate among the character string candidates that have been generated for the horizontal character string image, and compare the predetermined conditions for the predetermined candidate. The character string direction determining means that determines that the character string direction that is more satisfied and the candidate of the character string are valid.

【００１３】請求項６記載の文書入力装置は、取り込ま
れた文書の画像内の罫線枠を識別する罫線枠識別手段
と、取り込まれた文書の画像から切り出された１文字の
画像の文字認識結果として、文字コードの複数の候補
を、当該文字の画像の前記文書の画像内における位置お
よび文字サイズとともに記憶する文字コード候補記憶手
段と、取り込まれた文書の画像中の文字列を表す同一の
文字列画像について、前記文字コードの候補の組み合わ
せから、文法的に正しく解釈することができる文字列の
複数の候補を生成する文字列候補生成手段と、生成され
た文字列の候補のうちの所定の候補を、識別された前記
罫線枠内に印字する際、当該文字列の候補内の前記文字
コードの候補が対応する画像の文字サイズを調べ、印字
することができる印字フォントサイズの中から、調べた
文字サイズに最も近い印字フォントサイズを選択する印
字文字サイズ選択手段と、識別された罫線枠を印字する
とともに、生成された文字列の候補のうちの所定の候補
を、印字された罫線枠内に、選択された印字文字サイズ
で印字する印字手段とを備える。According to another aspect of the document input device of the present invention, a ruled line frame identifying means for identifying a ruled line frame in an image of a captured document and a character recognition result of an image of one character cut out from the image of the captured document. As a character code candidate storage unit that stores a plurality of character code candidates together with the position and character size of the image of the character in the image of the document, the same character representing the character string in the image of the captured document. With respect to the column image, a character string candidate generating unit that generates a plurality of character string candidates that can be grammatically correctly interpreted from the combination of the character code candidates, and a predetermined character string candidate among the generated character string candidates. When printing a candidate in the identified ruled line frame, the character size of the image corresponding to the candidate of the character code in the candidate of the character string can be checked and printed. From the font sizes, a printing character size selecting means for selecting the printing font size closest to the examined character size, the identified ruled line frame is printed, and a predetermined candidate among the generated character string candidates is displayed. A printing unit for printing in a selected print character size is provided in the printed ruled line frame.

【００１４】請求項７記載の文書入力装置は、請求項６
記載の文書入力装置において、前記文字コード候補記憶
手段を参照して、前記文字列の候補内の前記文字コード
の候補が対応する画像の前記文書の画像内における位置
を調べ、当該文字の画像を挟む上下の空白領域の大きさ
を算出する上下間隔算出手段と、前記文字コード候補記
憶手段を参照して、前記文字列の候補内の前記文字コー
ドの候補が対応する画像の前記文書の画像内における位
置を調べ、当該文字の画像を挟む左右の空白領域の大き
さを算出する左右間隔算出手段と、選択される印字フォ
ントサイズの上下方向の大きさが、算出された上下の空
白領域の大きさと比べて所定の大きさを越えないよう、
選択される印字フォントサイズの上下方向の大きさを限
定する上下サイズ限定手段と、選択される印字フォント
サイズの左右方向の大きさが、算出された左右の空白領
域の大きさと比べて所定の大きさを越えないよう、選択
される印字フォントサイズの左右方向の大きさを限定す
る左右サイズ限定手段とを備え、前記印字文字サイズ選
択手段は、前記上下サイズ限定手段と前記左右サイズ限
定手段とによって限定された印字フォントサイズの中か
ら、調べた文字サイズに最も近い印字フォントサイズを
選択する。A document input device according to claim 7 is the document input device according to claim 6.
In the described document input device, referring to the character code candidate storage means, the position of the image corresponding to the character code candidate in the character string candidates in the image of the document is checked, and the image of the character is displayed. In the image of the document of the image to which the character code candidate in the character string candidate corresponds, by referring to the upper and lower space calculation means for calculating the size of the upper and lower blank areas to be sandwiched and the character code candidate storage means. And the horizontal spacing calculation means for calculating the size of the left and right blank areas sandwiching the image of the character, and the vertical size of the selected print font size is the calculated size of the upper and lower blank areas. In order not to exceed the prescribed size,
A vertical size limiting means for limiting the vertical size of the selected print font size, and a horizontal size of the selected print font size is a predetermined size compared with the calculated size of the left and right blank areas. So as not to exceed the left and right size limiting means for limiting the size of the selected print font size in the left and right direction, and the print character size selecting means includes the upper and lower size limiting means and the left and right size limiting means. From the limited print font size, select the print font size that is closest to the examined character size.

【００１５】請求項８記載の文書入力装置は、取り込ま
れた文書の画像から切り出された１文字の画像の文字認
識結果として文字コードの複数の候補を記憶する文字コ
ード候補記憶手段と、文字列を表す同一の文字列画像に
ついて、前記文字コードの候補を組み合わせ、用語とし
て正しく解釈することができる文字列の複数の候補を、
所定の優先順位とともに生成する文字列候補生成手段
と、生成された文字列の候補の中から、特定の文字コー
ドを識別し、その文字コードを含む文字列の候補および
当該文字列の候補と同一文字列画像について生成された
文字列の候補が表す用語の属性を判定する属性判定手段
と、相互に関連した属性を有する用語同士を対応付けて
格納する属性間対応用語記憶手段と、あらかじめ定める
属性別に、属性が判定された前記文字列の候補を前記優
先順位とともに格納する属性別文字列候補記憶手段と、
前記属性別文字列候補記憶手段に格納されている文字列
の候補であって、前記属性間対応用語記憶手段に格納さ
れている用語の属性を表すものについて、前記各文字列
の候補をキーとして、前記前記属性間対応用語記憶手段
から対応する用語を検索する用語検索手段と、前記属性
別文字列候補記憶手段に格納されている文字列の候補の
うち、相互に関連した他の属性の文字列の候補をキーと
する前記用語検索手段の検索結果と共通する文字列の候
補の優先順位を高める優先順位変更手段とを備える。According to another aspect of the document input device of the present invention, a character code candidate storage means for storing a plurality of character code candidates as a character recognition result of an image of one character cut out from the captured image of the document, and a character string. For the same character string image representing, a plurality of character string candidates that can be correctly interpreted as a term by combining the character code candidates,
A character string candidate generating unit that generates with a predetermined priority, and a specific character code is identified from the generated character string candidates, and the character string candidate including the character code and the same character string candidate are identified. Attribute determination means for determining attributes of terms represented by character string candidates generated for a character string image, inter-attribute correspondence term storage means for storing terms having mutually related attributes in association with each other, and predetermined attributes Separately, attribute-based character string candidate storage means for storing the character string candidates whose attributes have been determined together with the priority order,
Regarding the character string candidates stored in the attribute-specific character string candidate storage means, which represent the attribute of the term stored in the inter-attribute correspondence term storage means, the character string candidates are used as keys. A term search means for searching a corresponding term from the attribute correspondence term storage means, and a character string candidate stored in the attribute-specific character string candidate storage means, which is a character of another attribute that is mutually related Priority changing means for increasing the priority of the character string candidates that are common to the search result of the term searching means using the column candidates as keys.

【００１６】請求項９記載の文書入力方法は、取り込ま
れた文書の画像中の文字列を表す同一の文字列画像につ
いて、前記尤度基準記憶部に格納されている尤度基準値
以上の尤度を有する前記文字コードの候補の組み合わせ
から、文法的に正しく解釈することができる文字列の複
数の候補を生成する文字列候補生成ステップと、前記文
字列画像に対応して生成された前記文字列の候補に対し
て、同一の前記文字列画像について生成された文字列の
候補のうち、所定の候補を表示する表示ステップと、前
記文字列画像に対応して生成された前記文字列の候補に
対して、オペレータからの修正指示を入力する入力ステ
ップと、前記文字列画像に対応して生成された前記文字
列の候補に対して、前記修正指示に従って、誤って認識
されている文字コードを、正しい文字コードに修正する
修正ステップと、前記修正の後、前記文字列画像に対応
する修正後の文字列と、同一文字列画像に対応する最も
尤度の高い文字コードの候補からなる文字列とを比較す
る比較ステップと、前記比較の後、前記比較結果に基づ
いて、前記両者の間で相違する文字数と前記一方の文字
列に含まれる文字数との割合から、文字認識の正解率を
算出する正解率算出ステップと、前記正解率が算出され
た後、算出された正解率が、前記正解率基準記憶部に格
納されている正解率基準値と比較して所定の条件を満足
する場合には、算出された正解率に応じて尤度基準記憶
部内の前記尤度基準値を変更し、前記文字列候補生成ス
テップに戻る尤度基準値変更ステップとを備える。According to a ninth aspect of the present invention, in the document input method, the same character string image representing the character string in the captured image of the document is more than the likelihood reference value stored in the likelihood reference storage unit. A character string candidate generating step of generating a plurality of character string candidates that can be grammatically correctly interpreted from a combination of the character code candidates having a degree; and the character generated corresponding to the character string image. A display step of displaying a predetermined candidate among the character string candidates generated for the same character string image for the column candidate, and the character string candidate generated corresponding to the character string image. , An input step of inputting a correction instruction from the operator, and a character string that is erroneously recognized in accordance with the correction instruction for the character string candidate generated corresponding to the character string image. Correction code for correcting the character code to a correct character code, and after the correction, a corrected character string corresponding to the character string image and a candidate for the most likely character code corresponding to the same character string image. A comparison step of comparing with a character string, and after the comparison, based on the comparison result, from the ratio of the number of characters different between the two and the number of characters included in the one character string, the correct rate of character recognition And a correct answer rate calculation step for calculating the correct answer rate, and the calculated correct answer rate satisfies a predetermined condition by comparing with a correct answer rate reference value stored in the correct answer rate reference storage unit. In this case, a likelihood reference value changing step of changing the likelihood reference value in the likelihood reference storage unit according to the calculated correct answer rate and returning to the character string candidate generating step is provided.

【００１７】請求項１０記載の文書入力方法は、取り込
まれた文書の画像中の文字列を表す同一の文字列画像に
ついて、前記尤度基準記憶部内の前記尤度基準値以上の
尤度を有する前記文字コードの候補の組み合わせから、
文法的に正しく解釈することができる文字列の複数の候
補を生成する文字列候補生成ステップと、前記文字列画
像に対応して生成された前記文字列の候補に対して、同
一の前記文字列画像について生成された文字列の候補の
うち、所定の候補を表示する表示ステップと、前記文字
列画像に対応して生成された前記文字列の候補に対し
て、オペレータからの修正指示を入力する入力ステップ
と、入力された修正指示を計数する修正指示計数ステッ
プと、前記文字列画像に対応して生成された前記文字列
の候補に対して、前記修正指示に従って、誤って認識さ
れている文字コードを、正しい文字コードに修正する修
正ステップと、前記修正の後、計数された修正指示数
が、前記修正指示基準記憶部に記憶されている修正指示
数基準値と比較して所定の条件を満足する場合には、計
数された修正指示数に応じて尤度基準記憶部内の前記尤
度基準値を変更する尤度基準値変更ステップとを備え
る。In the document input method according to the tenth aspect, the same character string image representing the character string in the image of the captured document has a likelihood equal to or higher than the likelihood reference value in the likelihood reference storage unit. From the combination of the character code candidates,
A character string candidate generating step of generating a plurality of character string candidates that can be grammatically correctly interpreted; and the same character string for the character string candidates generated corresponding to the character string image. A display step of displaying a predetermined candidate among the character string candidates generated for the image, and a correction instruction from an operator is input to the character string candidate generated corresponding to the character string image. An input step, a correction instruction counting step of counting input correction instructions, and a character that is erroneously recognized according to the correction instruction with respect to the character string candidates generated corresponding to the character string image. A correction step of correcting the code to a correct character code, and after the correction, the counted correction instruction number is compared with a correction instruction number reference value stored in the correction instruction reference storage unit. When satisfying the condition, and a likelihood reference value changing step of changing the likelihood reference value in the likelihood reference storage unit in accordance with the counted number of correction instructions.

【００１８】請求項１１記載の文書入力方法は、一行の
文字列を表す文字列画像から切り出された１文字の画像
と、それに隣接する１文字の画像との間の空白を検出す
る空白検出ステップと、検出された空白が所定の大きさ
より大きい場合、前記１文字の画像とそれに隣接する１
文字の画像とは、それぞれ、異なる単語に属する文字で
あると識別する単語識別ステップと、取り込まれた文書
の画像中の所定の領域内の文字列画像について、前記識
別結果を無効とする単語識別限定ステップとを備える。In the document input method according to the eleventh aspect of the present invention, a blank detecting step for detecting a blank between an image of one character cut out from a character string image representing one line of a character string and an image of one character adjacent thereto. If the detected space is larger than a predetermined size, the image of the one character and the adjacent 1
The image of characters is a word identification step for identifying characters belonging to different words, and a word identification for invalidating the identification result for a character string image in a predetermined area in an image of a captured document. And a limiting step.

【００１９】請求項１２記載の文書入力方法は、取り込
まれた文書の画像中の文字列を表す同一の文字列画像に
ついて、前記文字コードの候補を組み合わせ、文法的に
正しく解釈することができる文字列の複数の候補を生成
する文字列候補生成ステップと、生成された文字列の候
補を文字列候補記憶部に格納する文字列候補格納ステッ
プと、１つの文字列画像に対応する所定の文字列の候補
を、前記文字列候補記憶部から読み出して、識別された
罫線枠内に均等割り付けして表示する表示ステップとを
備える。According to a twelfth aspect of the present invention, in the document inputting method, the same character string image representing the character string in the image of the captured document can be combined with the character code candidates and grammatically correctly interpreted. A character string candidate generating step of generating a plurality of column candidates, a character string candidate storing step of storing the generated character string candidates in a character string candidate storage unit, and a predetermined character string corresponding to one character string image. And a display step of displaying the candidates of No. 1 from the character string candidate storage unit and evenly allocating the candidates within the identified ruled line frame for display.

【００２０】請求項１３記載の文書入力方法は、取り込
まれた文書の画像内の画像領域と空白領域との分布に基
づいて、取り込まれた文書の画像内の文字列の方向が縦
横いずれの方向であるかを識別する文字列方向識別ステ
ップと、取り込まれた文書の画像内の画像領域と空白領
域との分布からは、文字列の方向が縦横いずれの方向で
あるとも解釈できる場合、前記両方向についてそれぞ
れ、文字列を表す同一の文字列画像について、前記文字
コードの候補の組み合わせから、文法的に正しく解釈す
ることができる文字列の複数の候補を生成する文字列候
補生成ステップと、前記縦方向の文字列画像について生
成された文字列の候補のうちの所定の候補と、前記横方
向の文字列画像について生成された文字列の候補のうち
の所定の候補とを比較して、前記所定の候補が予め定め
る条件を満足する方の文字列方向およびその文字列の候
補を有効と判定する文字列方向判定ステップとを備え
る。In the document input method according to the thirteenth aspect, the direction of the character string in the image of the captured document is either vertical or horizontal based on the distribution of the image area and the blank area in the image of the captured document. If the direction of the character string can be interpreted as either the vertical or horizontal direction based on the character string direction identification step for identifying whether the character string is in the two directions, and the distribution of the image area and the blank area in the image of the captured document, With respect to the same character string image representing a character string, a character string candidate generating step of generating a plurality of character string candidates that can be grammatically correctly interpreted from the combination of the character code candidates, and the vertical character A predetermined candidate among the character string candidates generated for the directional character string image and a predetermined candidate among the character string candidates generated for the horizontal character string image are compared. To, and a character string direction determining step of determining an effective candidate character string direction and the string towards said given candidate satisfies the conditions set forth previously.

【００２１】請求項１４記載の文書入力方法は、取り込
まれた文書の画像内の罫線枠を識別する罫線枠識別ステ
ップと、取り込まれた文書の画像中の文字列を表す同一
の文字列画像について、前記文字コードの候補の組み合
わせから、文法的に正しく解釈することができる文字列
の複数の候補を生成する文字列候補生成ステップと、生
成された文字列の候補のうちの所定の候補を、識別され
た前記罫線枠内に印字する際、当該文字列の候補内の前
記文字コードの候補が対応する画像の文字サイズを調
べ、印字することができる印字フォントサイズの中か
ら、調べた文字サイズに最も近い印字フォントサイズを
選択する印字文字サイズ選択ステップと、識別された罫
線枠を印字するとともに、生成された文字列の候補のう
ちの所定の候補を、印字された罫線枠内に、選択された
印字文字サイズで印字する印字ステップとを備える。According to a fourteenth aspect of the present invention, there is provided a document input method for a ruled line frame identifying step for identifying a ruled line frame in an image of a captured document and for the same character string image representing a character string in the image of the captured document. A character string candidate generating step of generating a plurality of character string candidates that can be grammatically correctly interpreted from a combination of the character code candidates, and a predetermined candidate among the generated character string candidates, When printing in the identified ruled line frame, the character size of the image corresponding to the character code candidate in the character string candidate is checked, and the character size checked from the printable font sizes that can be printed The step of selecting the print character size that selects the print font size closest to the above, printing the identified ruled line frame, and marking the predetermined candidates of the generated character string candidates. A ruled line frame, and a printing step of printing by the print character size selected.

【００２２】請求項１５記載の文書入力方法は、請求項
１４記載の文書入力方法において、前記罫線枠識別ステ
ップの後、前記文字コード候補記憶部を参照して、前記
文字列の候補内の前記文字コードの候補が対応する画像
の前記文書の画像内における位置を調べ、当該文字の画
像を挟む上下の空白領域の大きさを算出する上下間隔算
出ステップと、前記文字コード候補記憶部を参照して、
前記文字列の候補内の前記文字コードの候補が対応する
画像の前記文書の画像内における位置を調べ、当該文字
の画像を挟む左右の空白領域の大きさを算出する左右間
隔算出ステップと、選択される印字フォントサイズの上
下方向の大きさが、算出された上下の空白領域の大きさ
と比べて所定の大きさを越えないよう、選択される印字
フォントサイズの上下方向の大きさを限定する上下サイ
ズ限定ステップと、選択される印字フォントサイズの左
右方向の大きさが、算出された左右の空白領域の大きさ
と比べて所定の大きさを越えないよう、選択される印字
フォントサイズの左右方向の大きさを限定する左右サイ
ズ限定ステップとを備える。According to a fifteenth aspect of the present invention, in the document input method according to the fourteenth aspect, after the ruled line frame identifying step, the character code candidate storage section is referred to, and By referring to the character code candidate storage unit, a vertical interval calculation step of checking the position of the image corresponding to the character code candidate in the image of the document and calculating the size of the upper and lower blank areas sandwiching the image of the character. hand,
A left-right interval calculation step of calculating the size of the left and right blank areas sandwiching the image of the character by checking the position in the image of the document of the image corresponding to the character code candidate in the character string candidate, and selecting The vertical size of the selected print font size is limited so that the vertical size of the selected print font size does not exceed the calculated size of the upper and lower blank areas. The size limit step and the size of the selected print font size in the horizontal direction do not exceed the specified size compared with the calculated size of the left and right blank areas. And a left and right size limiting step for limiting the size.

【００２３】請求項１６記載の文書入力方法は、文字列
を表す同一の文字列画像について、前記文字コードの候
補を組み合わせ、用語として正しく解釈することができ
る文字列の複数の候補を、所定の優先順位とともに生成
する文字列候補生成ステップと、生成された文字列の候
補の中から、特定の文字コードを識別し、その文字コー
ドを含む文字列の候補および当該文字列の候補と同一文
字列画像について生成された文字列の候補が表す用語の
属性を判定する属性判定ステップと、あらかじめ定める
属性別に、属性が判定された前記文字列の候補を前記優
先順位とともに属性別文字列候補記憶部に格納する属性
別文字列候補格納ステップと、前記属性別文字列候補記
憶部に格納されている文字列の候補であって、前記属性
間対応用語記憶部に格納されている用語の属性を表すも
のについて、前記各文字列の候補をキーとして、前記前
記属性間対応用語記憶部から対応する用語を検索する用
語検索ステップと、前記属性別文字列候補記憶部に格納
されている文字列の候補のうち、相互に関連した他の属
性の文字列の候補をキーとする前記検索結果と共通する
文字列の候補の優先順位を高める優先順位変更ステップ
とを備える。According to a sixteenth aspect of the present invention, in the document inputting method, the same character string image representing a character string is combined with the character code candidates, and a plurality of character string candidates that can be correctly interpreted as a term are determined. A character string candidate generating step that generates together with the priority order, and a specific character code is identified from the generated character string candidates, and a character string candidate including the character code and the same character string as the character string candidate An attribute determination step of determining an attribute of a term represented by a character string candidate generated for an image, and the attribute-determined character string candidates are stored in the attribute-based character string candidate storage unit together with the priority according to a predetermined attribute. An attribute-based character string candidate storage step of storing the character string candidates stored in the attribute-based character string candidate storage unit, wherein A term search step of searching for a corresponding term from the inter-attribute correspondence term storage section using the candidate of each of the character strings as a key for a stored term representing an attribute, and the attribute-based character string candidate storage section Among the character string candidates stored in, the priority change step for increasing the priority of the character string candidates that are common to the search result using the character string candidates of other attributes that are mutually related as a key. ..

【００２４】[0024]

【作用】本発明に従えば、請求項１記載の文書入力装置
において、文字コード候補記憶手段は、１つの文字の画
像の文字認識結果として、文字コードの複数の候補を、
その確からしさを示す尤度と対応付けて記憶する。尤度
基準記憶手段は、前記尤度のあらかじめ定める尤度基準
値を記憶する。文字列候補生成手段は、取り込まれた文
書の画像中の文字列を表す同一の文字列画像について、
前記尤度基準記憶手段内の前記尤度基準値以上の尤度を
有する前記文字コードの候補の組み合わせから、文法的
に正しく解釈することができる文字列の複数の候補を生
成する。表示手段は、前記文字列画像に対応して生成さ
れた前記文字列の候補に対して、同一の前記文字列画像
について生成された文字列の候補のうち、所定の候補を
表示する。入力手段は、前記文字列画像に対応して生成
された前記文字列の候補に対して、オペレータからの修
正指示を入力する。修正手段は、前記文字列画像に対応
して生成された前記文字列の候補に対して、前記修正指
示に従って、誤って認識されている文字コードを、正し
い文字コードに修正する。比較手段は、前記修正手段に
よる修正の後、前記文字列画像に対応する修正後の文字
列と、同一文字列画像に対応する最も尤度の高い文字コ
ードの候補からなる文字列とを比較する。正解率算出手
段は、前記比較手段による比較の後、前記比較結果に基
づいて、前記両者の間で相違する文字数と前記一方の文
字列に含まれる文字数との割合から、文字認識の正解率
を算出する。正解率基準記憶手段は、前記正解率のあら
かじめ定める正解率基準値を記憶する。尤度基準値変更
手段は、前記正解率が算出された後、算出された正解率
が、前記正解率基準値と比較して所定の条件を満足する
場合には、算出された正解率に応じて尤度基準記憶手段
内の前記尤度基準値を変更する。According to the present invention, in the document input device according to the first aspect, the character code candidate storage means selects a plurality of character code candidates as a character recognition result of an image of one character.
The likelihood is stored in association with the likelihood indicating the certainty. The likelihood reference storage means stores a predetermined likelihood reference value of the likelihood. The character string candidate generation means, for the same character string image representing the character string in the image of the captured document,
A plurality of character string candidates that can be grammatically correctly interpreted are generated from a combination of the character code candidates having a likelihood equal to or higher than the likelihood reference value in the likelihood reference storage unit. The display means displays a predetermined candidate among the character string candidates generated for the same character string image with respect to the character string candidates generated corresponding to the character string image. The input means inputs a correction instruction from an operator to the character string candidates generated corresponding to the character string image. The correction means corrects a character code, which is erroneously recognized, into a correct character code in accordance with the correction instruction with respect to the character string candidates generated corresponding to the character string image. After the correction by the correction unit, the comparison unit compares the corrected character string corresponding to the character string image with a character string including a character code candidate with the highest likelihood corresponding to the same character string image. . The correct answer rate calculation means, after the comparison by the comparison means, based on the comparison result, from the ratio of the number of characters different between the two and the number of characters included in the one character string, the correct answer rate of character recognition, calculate. The correct answer rate reference storage means stores a predetermined correct answer rate reference value of the correct answer rate. Likelihood reference value changing means, after the correct answer rate is calculated, when the calculated correct answer rate satisfies a predetermined condition compared with the correct answer rate reference value, depending on the calculated correct answer rate. Then, the likelihood reference value in the likelihood reference storage means is changed.

【００２５】従って、本発明によれば、前記文字列画像
に対応して生成された前記文字列の候補に対して、前記
正解率で表される文字認識の精度を算出し、算出された
文字認識の精度に応じて、前記文字列候補生成手段が組
み合わせに用いる文字コードの候補の数を限定するの
で、文字認識精度の変動に適応した文字列候補生成処理
を行うことができる。すなわち、認識対象となる文書の
原稿が、何度もコピーされて文字のかすれや汚れを生じ
た原稿であって、文字認識精度が低いときには、前記文
字列候補生成手段が組み合わせに用いる文字コードの候
補の数を多くすることによって、生成される文字列の候
補の正解率を高めることができる。また、認識対象とな
る文書の原稿が鮮明な画像で得られた場合のように、文
字認識精度が高いときには、前記文字列候補生成手段が
組み合わせに用いる文字コードの候補の数を少なくする
ことによって、前記文字列候補生成手段の文字列候補生
成速度を短縮することができ、結果的に、文書入力装置
全体の処理時間を短縮することができる。Therefore, according to the present invention, the accuracy of character recognition represented by the accuracy rate is calculated for the character string candidates generated corresponding to the character string image, and the calculated character is calculated. Since the number of character code candidates used for the combination by the character string candidate generation means is limited according to the recognition accuracy, it is possible to perform the character string candidate generation processing adapted to the fluctuation of the character recognition accuracy. That is, when the original of the document to be recognized is an original in which the characters are blurred or smeared by being copied many times, and the character recognition accuracy is low, the character code candidates used by the character string candidate generating means are By increasing the number of candidates, the correct answer rate of the generated character string candidates can be increased. Further, when the character recognition accuracy is high, such as when the document of the document to be recognized is obtained as a clear image, the number of character code candidates used for combination by the character string candidate generation means is reduced. The character string candidate generating speed of the character string candidate generating means can be shortened, and as a result, the processing time of the entire document input device can be shortened.

【００２６】請求項２記載の文書入力装置において、文
字コード候補記憶手段は、１つの文字の画像の文字認識
結果として、文字コードの複数の候補を、その確からし
さを示す尤度と対応付けて記憶する。尤度基準記憶手段
は、前記尤度のあらかじめ定める尤度基準値を記憶す
る。文字列候補生成手段は、取り込まれた文書の画像中
の文字列を表す同一の文字列画像について、前記尤度基
準記憶手段内の前記尤度基準値以上の尤度を有する前記
文字コードの候補の組み合わせから、文法的に正しく解
釈することができる文字列の複数の候補を生成する。表
示手段は、前記文字列画像に対応して生成された前記文
字列の候補に対して、同一の前記文字列画像について生
成された文字列の候補のうち、所定の候補を表示する。
入力手段は、前記文字列画像に対応して生成された前記
文字列の候補に対して、オペレータからの修正指示を入
力する。修正指示計数手段は、前記入力手段から入力さ
れた修正指示を計数する。修正指示基準記憶手段は、前
記修正指示数のあらかじめ定める修正指示数基準値を記
憶する。修正手段は、前記文字列画像に対応して生成さ
れた前記文字列の候補に対して、前記修正指示に従っ
て、誤って認識されている文字コードを、正しい文字コ
ードに修正する。尤度基準値変更手段は、前記修正手段
による修正の後、計数された修正指示数が、前記修正指
示基準記憶手段に記憶されている修正指示数基準値と比
較して所定の条件を満足する場合には、計数された修正
指示数に応じて尤度基準記憶手段内の前記尤度基準値を
変更する。In the document input device according to the present invention, the character code candidate storage means associates a plurality of character code candidates with a likelihood indicating the certainty thereof as a character recognition result of an image of one character. Remember. The likelihood reference storage means stores a predetermined likelihood reference value of the likelihood. The character string candidate generation means is a candidate for the character code having a likelihood equal to or higher than the likelihood reference value in the likelihood reference storage means for the same character string image representing a character string in the image of the captured document. A plurality of candidates for a character string that can be grammatically interpreted are generated from the combination of. The display means displays a predetermined candidate among the character string candidates generated for the same character string image with respect to the character string candidates generated corresponding to the character string image.
The input means inputs a correction instruction from an operator to the character string candidates generated corresponding to the character string image. The correction instruction counting means counts the correction instruction input from the input means. The correction instruction reference storage means stores a predetermined correction instruction number reference value of the correction instruction number. The correction means corrects a character code, which is erroneously recognized, into a correct character code in accordance with the correction instruction with respect to the character string candidates generated corresponding to the character string image. The likelihood reference value changing means, after the correction by the correcting means, compares the counted correction instruction number with a correction instruction number reference value stored in the correction instruction reference storage means and satisfies a predetermined condition. In this case, the likelihood reference value in the likelihood reference storage means is changed according to the counted number of correction instructions.

【００２７】従って、本発明によれば、計数された修正
指示数に応じて尤度基準記憶手段内の前記尤度基準値を
変更するので、請求項１における正解率を算出する手間
が省け、より容易に上記効果を得ることができる。請求
項３記載の文書入力装置において、空白検出手段は、一
行の文字列を表す文字列画像から切り出された１文字の
画像と、それに隣接する１文字の画像との間の空白を検
出する。単語識別手段は、検出された空白が所定の大き
さより大きい場合、前記１文字の画像とそれに隣接する
１文字の画像とは、それぞれ、異なる単語に属する文字
であると識別する。単語識別限定手段は、取り込まれた
文書の画像中の所定の領域内の文字列画像について、前
記単語識別手段の識別結果を無効とする。Therefore, according to the present invention, since the likelihood reference value in the likelihood reference storage means is changed according to the counted number of correction instructions, the trouble of calculating the correct answer rate in claim 1 can be saved, The above effect can be obtained more easily. In the document input device according to the third aspect, the space detecting unit detects a space between the image of one character cut out from the character string image representing the character string of one line and the image of one character adjacent thereto. When the detected blank is larger than a predetermined size, the word identifying means identifies that the one-character image and the one-character image adjacent thereto are characters belonging to different words. The word identification limiting means invalidates the identification result of the word identification means for a character string image in a predetermined area in the image of the captured document.

【００２８】従って、本発明によれば、罫線枠内の文字
間の空白による単語区切りを無視することにより、均等
割付された文字も、単語の途中で分割されることなく正
しく認識することができ、認識誤り補正効果を十分発揮
することができる。請求項４記載の文書入力装置におい
て、文字コード候補記憶手段は、取り込まれた文書の画
像から切り出された１文字の画像の文字認識結果として
文字コードの複数の候補を記憶する。罫線枠識別手段
は、取り込まれた文書の画像内の罫線枠を識別する。文
字列候補生成手段は、取り込まれた文書の画像中の文字
列を表す同一の文字列画像について、前記文字コードの
候補を組み合わせ、文法的に正しく解釈することができ
る文字列の複数の候補を生成する。文字列候補記憶手段
は、生成された文字列の候補を格納する。表示手段は、
１つの文字列画像に対応する所定の文字列の候補を、前
記文字列候補記憶手段から読み出して、識別された罫線
枠内に均等割り付けして表示する。Therefore, according to the present invention, evenly-assigned characters can be correctly recognized without being divided in the middle of the word by ignoring the word division due to the space between the characters in the ruled line frame. , The recognition error correction effect can be sufficiently exerted. In the document input device according to the fourth aspect, the character code candidate storage means stores a plurality of character code candidates as a character recognition result of an image of one character cut out from the captured image of the document. The ruled line frame identifying means identifies the ruled line frame in the image of the captured document. The character string candidate generation means combines the character code candidates for the same character string image representing the character string in the captured image of the document, and creates a plurality of character string candidates that can be grammatically interpreted. To generate. The character string candidate storage means stores the generated character string candidates. The display means
A predetermined character string candidate corresponding to one character string image is read out from the character string candidate storage means, and is evenly allocated and displayed in the identified ruled line frame.

【００２９】従って、本発明によれば、識別された罫線
枠内に均等割り付けして表示されている所定の文字列の
候補は、前記文字列候補記憶手段内に文字列として格納
されているので、均等割り付けされて表示された文字列
を修正する際に、文字列を単位として修正することがで
きる。このため、オペレータは、従来のように１文字毎
に修正指示を入力する必要がなく、均等割り付け表示さ
れた文字列を容易に修正することができる。Therefore, according to the present invention, the predetermined character string candidates that are evenly displayed in the identified ruled line frame are stored as character strings in the character string candidate storage means. When correcting a character string that is evenly allocated and displayed, the character string can be corrected as a unit. Therefore, the operator does not need to input a correction instruction for each character as in the conventional case, and can easily correct the evenly displayed character strings.

【００３０】請求項５記載の文書入力装置において、文
字コード候補記憶手段は、取り込まれた文書の画像から
切り出された１文字の画像の文字認識結果として文字コ
ードの複数の候補を記憶する。文字列方向識別手段は、
取り込まれた文書の画像内の画像領域と空白領域との分
布に基づいて、取り込まれた文書の画像内の文字列の方
向が縦横いずれの方向であるかを識別する。文字列候補
生成手段は、取り込まれた文書の画像内の画像領域と空
白領域との分布からは、文字列の方向が縦横いずれの方
向であるとも解釈できる場合、前記両方向について、そ
れぞれ、文字列を表す同一の文字列画像について、前記
文字コードの候補を組み合わせ、文法的に正しく解釈す
ることができる文字列の複数の候補を生成する。文字列
方向判定手段は、前記縦方向の文字列画像について生成
された文字列の候補のうちの所定の候補と、前記横方向
の文字列画像について生成された文字列の候補のうちの
所定の候補とを比較して、前記所定の候補が予め定める
条件を満足する方の文字列方向およびその文字列の候補
を有効と判定する。In the document input device according to the fifth aspect, the character code candidate storage means stores a plurality of character code candidates as a character recognition result of an image of one character cut out from the image of the captured document. The character string direction identification means is
Based on the distribution of the image area and the blank area in the image of the captured document, it is identified whether the direction of the character string in the image of the captured document is vertical or horizontal. If the character string candidate generation means can interpret that the direction of the character string is either vertical or horizontal from the distribution of the image area and the blank area in the captured image of the document, the character string is generated in each of the two directions. For the same character string image representing, the character code candidates are combined to generate a plurality of character string candidates that can be grammatically interpreted correctly. The character string direction determination means determines a predetermined candidate of the character string candidates generated for the vertical character string image and a predetermined candidate of the character string candidates generated for the horizontal character string image. By comparing with the candidates, it is determined that the character string direction and the candidate of the character string in which the predetermined candidate satisfies a predetermined condition are valid.

【００３１】従って、取り込まれた文書の画像内の画像
領域と空白領域との分布からは、文字列の方向が縦横い
ずれの方向であるとも解釈できる場合には、一旦、両方
向について文字列の候補を生成した上、前記文字列方向
判定手段が、他の判断基準に従って、文字列の方向を判
定するので、より正確に文字列の方向を判定することが
できる。従って、認識誤り補正処理の効果を、十分得る
ことができる。Therefore, if it is possible to interpret from the distribution of the image area and the blank area in the image of the captured document that the direction of the character string is either vertical or horizontal, once the character string candidates are set in both directions. In addition, since the character string direction determining means determines the direction of the character string according to another criterion, the direction of the character string can be determined more accurately. Therefore, the effect of the recognition error correction processing can be sufficiently obtained.

【００３２】請求項６記載の文書入力装置において、罫
線枠識別手段は、取り込まれた文書の画像内の罫線枠を
識別する。文字コード候補記憶手段は、取り込まれた文
書の画像から切り出された１文字の画像の文字認識結果
として、文字コードの複数の候補を、当該文字の画像の
前記文書の画像内における位置および文字サイズととも
に記憶する。文字列候補生成手段は、取り込まれた文書
の画像中の文字列を表す同一の文字列画像について、前
記文字コードの候補の組み合わせから、文法的に正しく
解釈することができる文字列の複数の候補を生成する。
印字文字サイズ選択手段は、生成された文字列の候補の
うちの所定の候補を、識別された前記罫線枠内に印字す
る際、当該文字列の候補内の前記文字コードの候補が対
応する画像の文字サイズを調べ、印字することができる
印字フォントサイズの中から、調べた文字サイズに最も
近い印字フォントサイズを選択する。印字手段は、識別
された罫線枠を印字するとともに、生成された文字列の
候補のうちの所定の候補を、印字された罫線枠内に、選
択された印字文字サイズで印字する。In the document input device according to the sixth aspect, the ruled line frame identifying means identifies the ruled line frame in the image of the captured document. The character code candidate storage means selects a plurality of character code candidates as the character recognition result of the image of one character cut out from the captured image of the document, the position of the character image in the image of the document, and the character size. Memorize with. The character string candidate generation means is a plurality of character string candidates that can be grammatically correctly interpreted from the combination of the character code candidates for the same character string image representing the character string in the captured document image. To generate.
The print character size selection means, when printing a predetermined candidate of the generated character string candidates in the identified ruled line frame, an image corresponding to the character code candidate in the character string candidate. Check the character size of and select the print font size that is closest to the checked character size from the print font sizes that can be printed. The printing means prints the identified ruled line frame, and also prints a predetermined candidate of the generated character string candidates in the printed ruled line frame in the selected print character size.

【００３３】従って、本発明によれば、印字された罫線
枠内に、生成された文字列の候補のうちの所定の候補
を、取り込まれた文書の画像から切り出された文字の画
像の文字サイズに最も近い印字フォントサイズで印字す
ることができる。すなわち、本発明によれば、印字され
た罫線枠内に、生成された文字列の候補のうちの所定の
候補を、認識対象の文書中に印字されていた文字サイズ
に応じた印字フォントサイズで印字することができる。Therefore, according to the present invention, the character size of the image of the character cut out from the image of the captured document is a predetermined candidate of the generated character string candidates in the printed ruled line frame. It is possible to print with the print font size closest to. That is, according to the present invention, a predetermined candidate of the generated character string candidates is printed in the printed ruled line frame with a print font size corresponding to the character size printed in the document to be recognized. Can be printed.

【００３４】請求項７記載の文書入力装置では、請求項
６記載の文書入力装置において、上下間隔算出手段は、
前記文字コード候補記憶手段を参照して、前記文字列の
候補内の前記文字コードの候補が対応する画像の前記文
書の画像内における位置を調べ、当該文字の画像を挟む
上下の空白領域の大きさを算出する。左右間隔算出手段
は、前記文字コード候補記憶手段を参照して、前記文字
列の候補内の前記文字コードの候補が対応する画像の前
記文書の画像内における位置を調べ、当該文字の画像を
挟む左右の空白領域の大きさを算出する。上下サイズ限
定手段は、選択される印字フォントサイズの上下方向の
大きさが、算出された上下の空白領域の大きさと比べて
所定の大きさを越えないよう、選択される印字フォント
サイズの上下方向の大きさを限定する。左右サイズ限定
手段は、選択される印字フォントサイズの左右方向の大
きさが、算出された左右の空白領域の大きさと比べて所
定の大きさを越えないよう、選択される印字フォントサ
イズの左右方向の大きさを限定する。前記印字文字サイ
ズ選択手段は、前記上下サイズ限定手段と前記左右サイ
ズ限定手段とによって限定された印字フォントサイズの
中から、調べた文字サイズに最も近い印字フォントサイ
ズを選択する。In the document input device according to claim 7, in the document input device according to claim 6, the vertical gap calculating means is
By referring to the character code candidate storage means, the position of the image corresponding to the character code candidate in the character string candidates in the image of the document is checked, and the size of the upper and lower blank areas sandwiching the image of the character is checked. Calculate the The left-right interval calculation means refers to the character code candidate storage means, checks the position in the image of the document of the image corresponding to the character code candidate in the character string candidates, and sandwiches the image of the character. Calculate the size of the left and right blank areas. The vertical size limiting means sets the vertical direction of the selected print font size so that the vertical size of the selected print font size does not exceed a predetermined size compared with the size of the calculated upper and lower blank areas. Limit the size of. The horizontal size limiting means sets the horizontal direction of the selected print font size so that the horizontal size of the selected print font size does not exceed the predetermined size compared with the calculated size of the left and right blank areas. Limit the size of. The print character size selection unit selects a print font size closest to the examined character size from the print font sizes limited by the vertical size limiting unit and the horizontal size limiting unit.

【００３５】従って、本発明によれば、上記効果に加え
て、印字された罫線枠内に、生成された文字列の候補の
うちの所定の候補を、罫線枠に対して釣り合いがとれた
文字サイズで印刷することができる。請求項８記載の文
書入力装置において、文字コード候補記憶手段は、取り
込まれた文書の画像から切り出された１文字の画像の文
字認識結果として文字コードの複数の候補を記憶する。
文字列候補生成手段は、文字列を表す同一の文字列画像
について、前記文字コードの候補を組み合わせ、用語と
して正しく解釈することができる文字列の複数の候補
を、所定の優先順位とともに生成する。属性判定手段
は、生成された文字列の候補の中から、特定の文字コー
ドを識別し、その文字コードを含む文字列の候補および
当該文字列の候補と同一文字列画像について生成された
文字列の候補が表す用語の属性を判定する。属性間対応
用語記憶手段は、相互に関連した属性を有する用語同士
（例えば、住所名と郵便番号、社員名と社員番号など）
を対応付けて格納する。属性別文字列候補記憶手段は、
あらかじめ定める属性別に、属性が判定された前記文字
列の候補を前記優先順位とともに格納する。用語検索手
段は、前記属性別文字列候補記憶手段に格納されている
文字列の候補であって、前記属性間対応用語記憶手段に
格納されている用語の属性を表すものについて、前記各
文字列の候補（例えば、郵便番号あるいは社員番号の候
補）をキーとして、前記前記属性間対応用語記憶手段か
ら対応する用語（例えば、住所名あるいは社員名）を検
索する。優先順位変更手段は、前記属性別文字列候補記
憶手段に格納されている文字列の候補のうち、相互に関
連した他の属性の文字列の候補（例えば、郵便番号ある
いは社員番号の候補）をキーとする前記用語検索手段の
検索結果（例えば、住所名あるいは社員名）と共通する
文字列の候補（例えば、住所名あるいは社員名の候補）
の優先順位を高める。Therefore, according to the present invention, in addition to the above effects, a predetermined candidate of the generated character string candidates is displayed in the printed ruled line frame as a character balanced with the ruled line frame. Can be printed in size. In the document input device according to the eighth aspect, the character code candidate storage means stores a plurality of character code candidates as a character recognition result of an image of one character cut out from the captured image of the document.
The character string candidate generation means combines the character code candidates for the same character string image representing the character string, and generates a plurality of character string candidates that can be correctly interpreted as a term together with a predetermined priority. The attribute determination means identifies a specific character code from the generated character string candidates, and a character string candidate including the character code and a character string generated for the same character string image as the candidate character string. The attribute of the term represented by the candidate of is determined. The attribute correspondence term storage means includes terms having attributes that are related to each other (for example, address name and zip code, employee name and employee number, etc.).
Are stored in association with each other. The attribute-based character string candidate storage means is
The character string candidates whose attributes have been determined are stored together with the priority for each predetermined attribute. The term search means is a character string candidate stored in the attribute-based character string candidate storage means, and represents the attribute of a term stored in the inter-attribute correspondence term storage means Using the candidate (for example, zip code or employee number candidate) as a key, the corresponding term (for example, address name or employee name) is searched from the attribute correspondence term storage means. The priority changing means selects character string candidates (for example, zip code or employee number candidates) of other attributes that are mutually related from the character string candidates stored in the attribute-specific character string candidate storage means. Candidates for character strings that are common to the search results (for example, address names or employee names) of the term search means used as keys (for example, address name or employee name candidates)
Raise the priority of.

【００３６】従って、本発明によれば、同一の属性を表
す用語の中で、例えば、「大田」と「太田」のように、
それぞれに正しい用語であるが類似字形を多く含むため
に認識誤りを生じやすい文字列を多く含んでいる住所表
示、氏名などの用語を認識する場合、上記検索結果との
照合を行うことにより、より正確な認識誤り補正結果を
得ることができる。Therefore, according to the present invention, among terms expressing the same attribute, for example, "Ota" and "Ota",
When recognizing a term such as an address display or a name, which is a correct term, but contains many character strings that tend to cause recognition errors because it contains many similar glyphs, by comparing with the above search results, An accurate recognition error correction result can be obtained.

【００３７】請求項９記載の文書入力方法において、予
め定める尤度基準値と予め定める正解率基準値とを、そ
れぞれ尤度基準記憶部と正解率基準記憶部とに予め格納
しておき、文字列候補生成ステップは、取り込まれた文
書の画像中の文字列を表す同一の文字列画像について、
前記尤度基準記憶部に格納されている尤度基準値以上の
尤度を有する前記文字コードの候補の組み合わせから、
文法的に正しく解釈することができる文字列の複数の候
補を生成する。表示ステップは、前記文字列画像に対応
して生成された前記文字列の候補に対して、同一の前記
文字列画像について生成された文字列の候補のうち、所
定の候補を表示する。入力ステップは、前記文字列画像
に対応して生成された前記文字列の候補に対して、オペ
レータからの修正指示を入力する。修正ステップは、前
記文字列画像に対応して生成された前記文字列の候補に
対して、前記修正指示に従って、誤って認識されている
文字コードを、正しい文字コードに修正する。比較ステ
ップは、前記修正の後、前記文字列画像に対応する修正
後の文字列と、同一文字列画像に対応する最も尤度の高
い文字コードの候補からなる文字列とを比較する。正解
率算出ステップは、前記比較の後、前記比較結果に基づ
いて、前記両者の間で相違する文字数と前記一方の文字
列に含まれる文字数との割合から、文字認識の正解率を
算出する。尤度基準値変更ステップは、前記正解率が算
出された後、算出された正解率が、前記正解率基準記憶
部に格納されている正解率基準値と比較して所定の条件
を満足する場合には、算出された正解率に応じて尤度基
準記憶部内の前記尤度基準値を変更し、前記文字列候補
生成ステップに戻る。In the document input method according to the ninth aspect, a predetermined likelihood reference value and a predetermined correct answer rate reference value are stored in advance in the likelihood reference storage section and the correct answer rate reference storage section, respectively, and characters are stored. The column candidate generation step is performed on the same character string image representing the character string in the image of the captured document,
From a combination of the character code candidates having a likelihood equal to or greater than the likelihood reference value stored in the likelihood reference storage unit,
Generate multiple candidates for a string that can be grammatically interpreted. The displaying step displays a predetermined candidate among the character string candidates generated for the same character string image with respect to the character string candidates generated corresponding to the character string image. The input step inputs a correction instruction from an operator for the character string candidates generated corresponding to the character string image. The correction step corrects a character code that is erroneously recognized to a correct character code for the character string candidate generated corresponding to the character string image according to the correction instruction. In the comparing step, after the correction, the corrected character string corresponding to the character string image is compared with the character string including the character code candidate having the highest likelihood corresponding to the same character string image. In the correct answer rate calculating step, after the comparison, based on the comparison result, the correct answer rate of character recognition is calculated from the ratio of the number of different characters between the two and the number of characters included in the one character string. In the likelihood reference value changing step, after the correct answer rate is calculated, the calculated correct answer rate is compared with the correct answer rate reference value stored in the correct answer rate reference storage unit and satisfies a predetermined condition. To change the likelihood reference value in the likelihood reference storage unit according to the calculated correct answer rate, and return to the character string candidate generating step.

【００３８】従って、本発明によれば、請求項１記載の
本発明と同様の効果を得ることができる。請求項１０記
載の文書入力方法において、予め定める尤度基準値と予
め定める修正指示数基準値とを、それぞれ尤度基準記憶
部と修正指示基準記憶部とに格納しておき、文字列候補
生成ステップは、取り込まれた文書の画像中の文字列を
表す同一の文字列画像について、前記尤度基準記憶部内
の前記尤度基準値以上の尤度を有する前記文字コードの
候補の組み合わせから、文法的に正しく解釈することが
できる文字列の複数の候補を生成する。表示ステップ
は、前記文字列画像に対応して生成された前記文字列の
候補に対して、同一の前記文字列画像について生成され
た文字列の候補のうち、所定の候補を表示する。入力ス
テップは、前記文字列画像に対応して生成された前記文
字列の候補に対して、オペレータからの修正指示を入力
する。修正指示計数ステップは、入力された修正指示を
計数する。修正ステップは、前記文字列画像に対応して
生成された前記文字列の候補に対して、前記修正指示に
従って、誤って認識されている文字コードを、正しい文
字コードに修正する。尤度基準値変更ステップは、前記
修正の後、計数された修正指示数が、前記修正指示基準
記憶部に記憶されている修正指示数基準値と比較して所
定の条件を満足する場合には、計数された修正指示数に
応じて尤度基準記憶部内の前記尤度基準値を変更する。Therefore, according to the present invention, it is possible to obtain the same effect as that of the present invention according to claim 1. The document input method according to claim 10, wherein a predetermined likelihood reference value and a predetermined correction instruction number reference value are stored in a likelihood reference storage unit and a correction instruction reference storage unit, respectively, to generate a character string candidate. In the step, for the same character string image representing the character string in the image of the captured document, from the combination of the candidate of the character code having the likelihood equal to or more than the likelihood reference value in the likelihood reference storage unit, the grammar Generate multiple candidate strings that can be correctly interpreted. The displaying step displays a predetermined candidate among the character string candidates generated for the same character string image with respect to the character string candidates generated corresponding to the character string image. The input step inputs a correction instruction from an operator for the character string candidates generated corresponding to the character string image. The correction instruction counting step counts the input correction instruction. The correction step corrects a character code that is erroneously recognized to a correct character code for the character string candidate generated corresponding to the character string image according to the correction instruction. In the likelihood reference value changing step, after the correction, if the counted correction instruction number is compared with the correction instruction number reference value stored in the correction instruction reference storage unit and satisfies a predetermined condition, The likelihood reference value in the likelihood reference storage unit is changed according to the counted number of correction instructions.

【００３９】従って、本発明によれば、請求項２記載の
本発明と同様の効果を得ることができる。請求項１１記
載の文書入力方法において、空白検出ステップは、一行
の文字列を表す文字列画像から切り出された１文字の画
像と、それに隣接する１文字の画像との間の空白を検出
する。単語識別ステップは、検出された空白が所定の大
きさより大きい場合、前記１文字の画像とそれに隣接す
る１文字の画像とは、それぞれ、異なる単語に属する文
字であると識別する。単語識別限定ステップは、取り込
まれた文書の画像中の所定の領域内の文字列画像につい
て、前記識別結果を無効とする。Therefore, according to the present invention, it is possible to obtain the same effect as that of the present invention according to claim 2. In the document input method according to the eleventh aspect, in the white space detecting step, a white space is detected between an image of one character cut out from a character string image representing a character string of one line and an image of one character adjacent thereto. In the word identifying step, when the detected blank is larger than a predetermined size, the one-character image and the adjacent one-character image are identified as characters belonging to different words. The word identification limiting step invalidates the identification result for the character string image in a predetermined area in the captured image of the document.

【００４０】従って、本発明によれば、請求項３記載の
本発明と同様の効果を得ることができる。請求項１２記
載の文書入力方法において、罫線枠識別ステップは、取
り込まれた文書の画像内の罫線枠を識別する。文字列候
補生成ステップは、取り込まれた文書の画像中の文字列
を表す同一の文字列画像について、前記文字コードの候
補を組み合わせ、文法的に正しく解釈することができる
文字列の複数の候補を生成する。文字列候補格納ステッ
プは、生成された文字列の候補を文字列候補記憶部に格
納する。表示ステップは、１つの文字列画像に対応する
所定の文字列の候補を、前記文字列候補記憶部から読み
出して、識別された罫線枠内に均等割り付けして表示す
る。Therefore, according to the present invention, it is possible to obtain the same effect as that of the present invention according to claim 3. In the document input method according to claim 12, the ruled line frame identifying step identifies a ruled line frame in the image of the captured document. The character string candidate generating step combines a plurality of character string candidates that can be grammatically correctly interpreted by combining the character code candidates for the same character string image representing the character string in the captured document image. To generate. The character string candidate storing step stores the generated character string candidates in the character string candidate storage unit. In the display step, a predetermined character string candidate corresponding to one character string image is read from the character string candidate storage unit, and is equally distributed and displayed in the identified ruled line frame.

【００４１】従って、本発明によれば、請求項４記載の
本発明と同様の効果を得ることができる。請求項１３記
載の文書入力方法において、文字列方向識別ステップ
は、取り込まれた文書の画像内の画像領域と空白領域と
の分布に基づいて、取り込まれた文書の画像内の文字列
の方向が縦横いずれの方向であるかを識別する。文字列
候補生成ステップは、取り込まれた文書の画像内の画像
領域と空白領域との分布からは、文字列の方向が縦横い
ずれの方向であるとも解釈できる場合、前記両方向につ
いてそれぞれ、文字列を表す同一の文字列画像につい
て、前記文字コードの候補の組み合わせから、文法的に
正しく解釈することができる文字列の複数の候補を生成
する。文字列方向判定ステップは、前記縦方向の文字列
画像について生成された文字列の候補のうちの所定の候
補と、前記横方向の文字列画像について生成された文字
列の候補のうちの所定の候補とを比較して、前記所定の
候補が予め定める条件を満足する方の文字列方向および
その文字列の候補を有効と判定する。Therefore, according to the present invention, it is possible to obtain the same effect as that of the present invention according to claim 4. The document input method according to claim 13, wherein in the character string direction identifying step, the direction of the character string in the image of the captured document is determined based on the distribution of the image area and the blank area in the image of the captured document. Identify which direction is vertical or horizontal. In the character string candidate generating step, when it is possible to interpret that the direction of the character string is either vertical or horizontal from the distribution of the image area and the blank area in the captured image of the document, the character string is extracted in both directions. For the same character string image shown, a plurality of character string candidates that can be grammatically correctly interpreted are generated from the combination of the character code candidates. In the character string direction determination step, a predetermined candidate of the character string candidates generated for the vertical character string image and a predetermined candidate of the character string candidates generated for the horizontal character string image are determined. By comparing with the candidates, it is determined that the character string direction and the candidate of the character string in which the predetermined candidate satisfies a predetermined condition are valid.

【００４２】従って、本発明によれば、請求項５記載の
本発明と同様の効果を得ることができる。請求項１４記
載の文書入力方法において、文字の画像を文字認識しな
がら、文字コードの複数の候補を当該文字の画像の前記
文書の画像内における位置および文字サイズとともに文
字コード候補記憶部に予め格納し、罫線枠識別ステップ
は、取り込まれた文書の画像内の罫線枠を識別する。文
字列候補生成ステップは、取り込まれた文書の画像中の
文字列を表す同一の文字列画像について、前記文字コー
ドの候補の組み合わせから、文法的に正しく解釈するこ
とができる文字列の複数の候補を生成する。印字文字サ
イズ選択ステップは、生成された文字列の候補のうちの
所定の候補を、識別された前記罫線枠内に印字する際、
当該文字列の候補内の前記文字コードの候補が対応する
画像の文字サイズを調べ、印字することができる印字フ
ォントサイズの中から、調べた文字サイズに最も近い印
字フォントサイズを選択する。印字ステップは、識別さ
れた罫線枠を印字するとともに、生成された文字列の候
補のうちの所定の候補を、印字された罫線枠内に、選択
された印字文字サイズで印字する。Therefore, according to the present invention, it is possible to obtain the same effect as that of the present invention according to claim 5. 15. The document input method according to claim 14, wherein while recognizing a character image, a plurality of character code candidates are stored in advance in a character code candidate storage unit together with a position of the character image in the image of the document and a character size. Then, the ruled line frame identifying step identifies the ruled line frame in the image of the captured document. In the character string candidate generating step, a plurality of character string candidates that can be grammatically correctly interpreted from the combination of the character code candidates for the same character string image representing the character string in the captured image of the document. To generate. The printing character size selecting step, when printing a predetermined candidate of the generated character string candidates in the identified ruled line frame,
The character size of the image corresponding to the character code candidate in the character string candidates is checked, and the print font size closest to the checked character size is selected from the print font sizes that can be printed. In the printing step, the identified ruled line frame is printed, and a predetermined candidate of the generated character string candidates is printed in the printed ruled line frame with the selected print character size.

【００４３】従って、本発明によれば、請求項６記載の
本発明と同様の効果を得ることができる。請求項１５記
載の文書入力方法では、請求項１４記載の文書入力方法
において、上下間隔算出ステップは、前記罫線枠識別ス
テップの後、前記文字コード候補記憶部を参照して、前
記文字列の候補内の前記文字コードの候補が対応する画
像の前記文書の画像内における位置を調べ、当該文字の
画像を挟む上下の空白領域の大きさを算出する。左右間
隔算出ステップは、前記文字コード候補記憶部を参照し
て、前記文字列の候補内の前記文字コードの候補が対応
する画像の前記文書の画像内における位置を調べ、当該
文字の画像を挟む左右の空白領域の大きさを算出する。
上下サイズ限定ステップは、選択される印字フォントサ
イズの上下方向の大きさが、算出された上下の空白領域
の大きさと比べて所定の大きさを越えないよう、選択さ
れる印字フォントサイズの上下方向の大きさを限定す
る。左右サイズ限定ステップは、選択される印字フォン
トサイズの左右方向の大きさが、算出された左右の空白
領域の大きさと比べて所定の大きさを越えないよう、選
択される印字フォントサイズの左右方向の大きさを限定
する。Therefore, according to the present invention, it is possible to obtain the same effect as that of the present invention described in claim 6. In the document input method according to claim 15, in the document input method according to claim 14, the vertical spacing calculation step refers to the character code candidate storage unit after the ruled line frame identification step, and the character string candidates are referred to. The position of the image corresponding to the character code candidate in the inside of the image of the document is checked, and the sizes of the upper and lower blank areas sandwiching the image of the character are calculated. The left-right interval calculating step refers to the character code candidate storage unit to check the position in the image of the document of the image corresponding to the character code candidate in the character string candidates, and sandwiches the image of the character. Calculate the size of the left and right blank areas.
The vertical size limiting step is performed in the vertical direction of the selected print font size so that the vertical size of the selected print font size does not exceed the predetermined size compared with the size of the calculated upper and lower blank areas. Limit the size of. The right / left size limiting step is performed in the left / right direction of the selected print font size so that the size of the selected print font size in the left / right direction does not exceed the calculated size of the left and right blank areas. Limit the size of.

【００４４】従って、本発明によれば、請求項７記載の
本発明と同様の効果を得ることができる。請求項１６記
載の文書入力方法において、相互に関連した属性を有す
る用語同士を対応付けて属性間対応用語記憶部に予め格
納し、文字列候補生成ステップは、文字列を表す同一の
文字列画像について、前記文字コードの候補を組み合わ
せ、用語として正しく解釈することができる文字列の複
数の候補を、所定の優先順位とともに生成する。属性判
定ステップは、生成された文字列の候補の中から、特定
の文字コードを識別し、その文字コードを含む文字列の
候補および当該文字列の候補と同一文字列画像について
生成された文字列の候補が表す用語の属性を判定する。
属性別文字列候補格納ステップは、あらかじめ定める属
性別に、属性が判定された前記文字列の候補を前記優先
順位とともに属性別文字列候補記憶部に格納する。用語
検索ステップは、前記属性別文字列候補記憶部に格納さ
れている文字列の候補であって、前記属性間対応用語記
憶部に格納されている用語の属性を表すものについて、
前記各文字列の候補をキーとして、前記前記属性間対応
用語記憶部から対応する用語を検索する。優先順位変更
ステップは、前記属性別文字列候補記憶部に格納されて
いる文字列の候補のうち、相互に関連した他の属性の文
字列の候補をキーとする前記検索結果と共通する文字列
の候補の優先順位を高める。Therefore, according to the present invention, it is possible to obtain the same effect as that of the present invention described in claim 7. 17. The document input method according to claim 16, wherein terms having mutually related attributes are associated with each other and are stored in advance in the inter-attribute corresponding term storage unit, and the character string candidate generation step is the same character string image representing a character string. With respect to, a combination of the character code candidates is generated, and a plurality of character string candidates that can be correctly interpreted as terms are generated with a predetermined priority. The attribute determining step identifies a specific character code from the generated character string candidates, and detects a character string candidate including the character code and a character string generated for the same character string image as the candidate character string. The attribute of the term represented by the candidate of is determined.
The attribute-specific character string candidate storage step stores, for each predetermined attribute, the character string candidates whose attributes have been determined together with the priority in the attribute-specific character string candidate storage unit. The term search step is a candidate for a character string stored in the attribute-based character string candidate storage unit, and represents the attribute of a term stored in the inter-attribute correspondence term storage unit,
Using the candidate of each character string as a key, the corresponding term is retrieved from the inter-attribute corresponding term storage unit. In the priority changing step, among the character string candidates stored in the attribute-specific character string candidate storage unit, a character string common to the search result is keyed to a character string candidate of another attribute associated with each other. Increase the priority of candidates.

【００４５】従って、本発明によれば、請求項８記載の
本発明と同様の効果を得ることができる。Therefore, according to the present invention, it is possible to obtain the same effect as that of the present invention according to claim 8.

【００４６】[0046]

【実施例】図１は、本発明の第１実施例である文書入力
装置１００のハードウェア構成を示すブロック図であ
る。文書入力装置１００は、ＣＰＵ１２０、メモリ制御
ユニット兼用バス制御ユニット１０２、主記憶１０１、
ローカルバス１０４、グラフィクス制御ユニット１０
５、ディスプレイ１０６、ＳＣＳＩ（ＳｍａｌｌＣｏ
ｍｐｕｔｅｒＳｙｓｔｅｍＩｎｔｅｒｆａｃｅ）制
御ユニット１０７、ハードディスク装置１０８、スキャ
ナー１０９、バスブリッジ回路１１０、外部バス１１
１、入出力制御ユニット１１２、キーボード１１３、プ
リンタ１１４およびマウス１１５を備える。なお、文書
入力装置１００の主要部は、例えば、パーソナルコンピ
ュータＰａｎａｃｏｍＶ２４（ＣＦ−Ｖ２４１１ＲＷＥ
松下電器）などの汎用パーソナルコンピュータによっ
て実現され、その構成要素である個々の入出力装置等
は、各々、市販されている標準品でよい。演算装置（Ｃ
ＰＵ）１０１は、メモリ制御ユニット兼用バス制御ユニ
ット１０２を介して、主記憶１０１およびローカルバス
１０４と結合されている。1 is a block diagram showing a hardware configuration of a document input device 100 according to a first embodiment of the present invention. The document input device 100 includes a CPU 120, a memory control unit / bus control unit 102, a main memory 101,
Local bus 104, graphics control unit 10
5, display 106, SCSI (Small Co
computer system interface) control unit 107, hard disk device 108, scanner 109, bus bridge circuit 110, external bus 11
1, an input / output control unit 112, a keyboard 113, a printer 114, and a mouse 115. The main part of the document input device 100 is, for example, a personal computer Panacom V24 (CF-V2411RWE).
A general-purpose personal computer such as Matsushita Electric Industrial Co., Ltd., etc., and the individual input / output devices and the like that are the constituents thereof may be standard products that are commercially available. Computing device (C
The PU) 101 is connected to the main memory 101 and the local bus 104 via the memory control unit / bus control unit 102.

【００４７】主記憶１０１は、ＣＰＵ１２０が実行する
ためのプログラムおよび各入出力ディスク装置から入力
されたデータを格納するとともに、処理途中のデータを
一時記憶することによってＣＰＵ１２０の作業場所を提
供する。図２は、文書入力装置１００の認識結果を格納
するために主記憶１０１上に設けられる記憶領域の区分
と、その記憶内容とを示す説明図である。図２の左側に
示すように、主記憶１０１上には、画像情報記憶部２０
１、ブロック管理情報記憶部２０２、表示データ記憶部
２０３および認識列管理部２０４の各記憶領域が設けら
れる。The main memory 101 stores a program to be executed by the CPU 120 and data input from each of the input / output disk devices, and provides a work place for the CPU 120 by temporarily storing data during processing. FIG. 2 is an explanatory diagram showing the division of the storage area provided on the main memory 101 for storing the recognition result of the document input device 100 and the storage contents. As shown on the left side of FIG. 2, the image information storage unit 20 is provided on the main memory 101.
1, a block management information storage unit 202, a display data storage unit 203, and a recognition column management unit 204 are provided.

【００４８】画像情報記憶部２０１は、例えば、スキャ
ナー１０９などから読み込まれた、入力対象となる文書
の画像を記憶する。ブロック管理情報記憶部２０２は、
文書入力処理における各段階で作成される、図２の右側
に示すような後述のブロック管理情報２１０を記憶す
る。表示データ記憶部２０３は、例えば、認識対象とし
てスキャナー１０９から入力された画像およびその認識
結果などの表示データを記憶する。The image information storage unit 201 stores the image of the document to be input, which is read by the scanner 109 or the like, for example. The block management information storage unit 202
The block management information 210 described below, which is created at each stage in the document input process and is shown on the right side of FIG. 2, is stored. The display data storage unit 203 stores, for example, display data such as an image input from the scanner 109 as a recognition target and a recognition result thereof.

【００４９】認識列管理部２０４は、認識列ごとに作成
される後述の各認識列管理情報３０１を記憶する。前記
認識対象ブロックは、スキャナー１０９から入力された
画像中において、認識対象として指定された矩形領域を
いい、前記認識列は、指定された認識対象ブロックに含
まれる、例えば、一行の文字列に相当する画像をいう。The recognition sequence management unit 204 stores each recognition sequence management information 301, which will be described later, created for each recognition sequence. The recognition target block refers to a rectangular area designated as a recognition target in an image input from the scanner 109, and the recognition column is included in the designated recognition target block, for example, corresponds to a single-line character string. Image

【００５０】ブロック管理情報２１０は、スキャナー１
０９から入力された画像中において、認識対象テキスト
として指定された、それぞれの認識対象ブロックに対応
して作成され、各ブロック管理情報２１０は、画像位置
情報２１１、罫線枠情報２１２、文字列方向２１３およ
び認識列管理部アドレス２１４からなる。画像位置情報
２１１は、そのブロック管理情報２１０によって管理さ
れる認識対象ブロックのブロック識別番号と、当該認識
対象ブロックが画像情報記憶部２０１に記憶されている
画像情報、すなわち、スキャナー１０９から入力された
前記画像情報中で位置する座標を示す。The block management information 210 is stored in the scanner 1
In the image input from 09, it is created corresponding to each recognition target block designated as the recognition target text, and each block management information 210 includes image position information 211, ruled line frame information 212, and a character string direction 213. And a recognition column management unit address 214. The image position information 211 is the block identification number of the recognition target block managed by the block management information 210 and the image information in which the recognition target block is stored in the image information storage unit 201, that is, input from the scanner 109. The coordinates located in the image information are shown.

【００５１】罫線枠情報２１２は、当該認識対象ブロッ
ク内の罫線枠の有無を示す。文字列方向２１３は、当該
認識対象ブロック内において、文字列の方向が縦である
か横であるかを示す。ただし、ある認識対象ブロックに
ついて、文字列方向が縦方向と横方向との両方に解釈が
成り立つ場合には、それぞれの場合に対応して、２つの
後述する認識列管理情報３０１が生成される。The ruled line frame information 212 indicates the presence or absence of a ruled line frame in the recognition target block. The character string direction 213 indicates whether the direction of the character string is vertical or horizontal in the recognition target block. However, when a character string direction is interpreted in both the vertical direction and the horizontal direction for a certain recognition target block, two pieces of recognition string management information 301, which will be described later, are generated corresponding to each case.

【００５２】認識列管理部アドレス２１４は、認識列管
理部２０４内において、当該認識対象ブロック内に含ま
れる認識列分の認識列管理情報３０１が記憶されている
記憶領域の先頭アドレスを示す。図３は、認識列管理部
２０４に記憶される認識列管理情報３０１のデータフォ
ーマットを示す説明図である。The recognition column management unit address 214 indicates the start address of the storage area in the recognition column management unit 204 in which the recognition column management information 301 for the recognition column included in the recognition target block is stored. FIG. 3 is an explanatory diagram showing a data format of the recognition sequence management information 301 stored in the recognition sequence management unit 204.

【００５３】認識列管理部２０４には、１つの認識対象
ブロックに含まれるそれぞれの認識列にたいして作成さ
れた認識列管理情報３０１が、各認識対象ブロック毎に
記憶される。各認識列管理情報３０１は、図３に示すよ
うに、当該認識列に含まれる各文字毎に作成された認識
文字情報３０２と当該認識列に対して得られた補正結果
３０８とからなる。The recognition sequence management unit 204 stores the recognition sequence management information 301 created for each recognition sequence included in one recognition target block for each recognition target block. As shown in FIG. 3, each recognition string management information 301 includes recognition character information 302 created for each character included in the recognition string and a correction result 308 obtained for the recognition string.

【００５４】認識文字情報３０２は、当該認識列におけ
る１文字毎の認識結果を示し、文字位置情報３０３、区
切り情報３０４および候補文字情報３０７からなる。候
補文字情報３０７は、候補文字コード３０５とその尤度
３０６とからなる。文字位置情報３０３は、当該認識文
字情報３０２に対応する画像の当該認識対象ブロック内
における位置を、例えば、認識対象ブロックの左上隅を
座標（０，０）とするビットマップ上の矩形領域を表す
位置座標で示す。The recognized character information 302 indicates a recognition result for each character in the recognition string, and includes character position information 303, delimiter information 304, and candidate character information 307. The candidate character information 307 includes a candidate character code 305 and its likelihood 306. The character position information 303 represents the position of the image corresponding to the recognized character information 302 in the recognition target block, for example, a rectangular area on the bitmap with the upper left corner of the recognition target block as coordinates (0, 0). Indicated by position coordinates.

【００５５】区切り情報３０４は、文字位置情報３０３
で示された位置のテキスト画像が単語あるいは文節の末
尾であるか否かを示す。例えば、当該テキスト画像の直
後に全角文字以上の大きさの空白があれば「１」、なけ
れば「０」が書き込まれる。これにより、例えば、一般
文書においては、空白で大きく間の離れた段組レイアウ
トの行を一つの行として解釈する誤りを取り除くことが
できる。なお、ここでは、単語区切りを有効とする空白
の大きさは、全角文字以上としたが、倍角文字以上とし
てもよいし、３倍角文字以上としてもよいし、４倍角以
上としてもよい。The delimiter information 304 is the character position information 303.
Indicates whether the text image at the position indicated by is the end of a word or phrase. For example, "1" is written if there is a blank of a size equal to or larger than the full-width character immediately after the text image, and "0" is written otherwise. Thus, for example, in a general document, it is possible to eliminate the error of interpreting blank and widely separated columns in a column layout as one line. In addition, here, the size of the space in which the word division is valid is set to be a double-byte character or more, but may be a double-byte character or more, a triple-width character or more, or a quadruple-width character or more.

【００５６】候補文字コード３０５は、認識結果として
得られた候補文字の文字コードを示す。尤度３０６は、
文字位置情報３０３で示された位置の画像から抽出され
た文字認識用特徴と、文書入力装置１００が別途保有す
る活字認識用辞書との照合の結果得られた候補文字コー
ド３０５の確からしさを示す。尤度３０６の算出方法
は、従来の文字認識における方法と同様であるので、説
明を省略する。The candidate character code 305 indicates the character code of the candidate character obtained as a recognition result. Likelihood 306 is
The certainty of the candidate character code 305 obtained as a result of the collation between the character recognition feature extracted from the image at the position indicated by the character position information 303 and the type recognition dictionary separately held by the document input device 100 is shown. . Since the method of calculating the likelihood 306 is the same as the method in the conventional character recognition, description thereof will be omitted.

【００５７】補正結果３０８は、認識列毎に作成され、
複数の候補からなる。補正結果３０８は、１文字毎の認
識結果である候補文字コード３０５を、単語辞書を参照
しながら組み合わせて生成された、文節として解釈可能
な複数の文節解釈に対して、従来の形態素解析、文法検
定などに従って、補正が加えられた結果である補正結果
を示す。また、補正結果確定部２１３によって確定され
た認識列は、補正結果３０８の第１候補として記憶され
る。The correction result 308 is created for each recognition column,
It consists of multiple candidates. The correction result 308 is obtained by combining a candidate character code 305, which is a recognition result for each character, with reference to a word dictionary and generating a plurality of bunsetsu interpretations that can be interpreted as a bunsetsu. The correction result, which is the result of the correction according to the test or the like, is shown. The recognition sequence confirmed by the correction result confirmation unit 213 is stored as the first candidate for the correction result 308.

【００５８】図４は、主記憶１０１上に設けられる単語
辞書記憶部４１２の記憶領域区分を示す説明図である。
単語辞書記憶部４１２には、認識誤り補正に用いる各種
照合用辞書およびテーブルが格納される。単語辞書記憶
部４１２には、辞書管理情報記憶部４０１、一般辞書部
４０２、固有名詞辞書部４０３、帳票用語辞書部４０
４、郵便番号−住所照合テーブル部４０５および電話番
号−住所照合テーブル部４０６が設けられる。FIG. 4 is an explanatory diagram showing storage area divisions of the word dictionary storage unit 412 provided on the main memory 101.
The word dictionary storage unit 412 stores various collation dictionaries and tables used for recognition error correction. The word dictionary storage unit 412 includes a dictionary management information storage unit 401, a general dictionary unit 402, a proper noun dictionary unit 403, and a form term dictionary unit 40.
4. A postal code-address matching table unit 405 and a telephone number-address matching table unit 406 are provided.

【００５９】辞書管理情報記憶部４０１には、一般辞書
アドレス記憶部４０７、固有名詞辞書アドレス記憶部４
０８、帳票用語辞書アドレス記憶部４０９、郵便番号テ
ーブルアドレス記憶部４１０および電話番号テーブルア
ドレス記憶部４１１が設けられ、認識誤り補正に用いる
後述の各種照合用辞書およびテーブルが格納されている
単語辞書記憶部４１２内のアドレスを一括して保持す
る。The dictionary management information storage unit 401 includes a general dictionary address storage unit 407 and a proper noun dictionary address storage unit 4
08, a form term dictionary address storage unit 409, a postal code table address storage unit 410, and a telephone number table address storage unit 411 are provided, and a word dictionary storage in which various collating dictionaries and tables to be described later used for recognition error correction are stored. The addresses in the unit 412 are collectively held.

【００６０】一般辞書部４０２は、自立語および付属語
などをトライ構造で収録した単語辞書など、一般文書の
補正に用いられる一般辞書を記憶する。固有名詞辞書部
４０３は、会社名、部署名および個人名など、固有名詞
を収録し、名刺の補正に用いられる後述の固有名詞辞書
を記憶する。帳票用語辞書部４０４は、帳票に高頻度で
出現する帳票用語を収録し、帳票形式入力の際の補正に
用いられる後述の帳票用語辞書を記憶する。The general dictionary section 402 stores a general dictionary used for correcting a general document, such as a word dictionary in which independent words and adjunct words are recorded in a trie structure. The proper noun dictionary unit 403 stores proper nouns such as a company name, a department name, and a personal name, and stores a later-described proper noun dictionary used for correcting a business card. The form term dictionary unit 404 stores the form terms that appear frequently in the form and stores a form term dictionary (to be described later) used for correction when inputting the form format.

【００６１】郵便番号−住所照合テーブル部４０５は、
郵便番号に対応して、その地域の住所表示に用いられる
住所名データを収録した、郵便番号−住所照合テーブル
を記憶する。電話番号−住所照合テーブル部４０６は、
電話番号の市外番号、および市内局番号にその地域の住
所表示に用いられる住所名データを収録した、電話番号
−住所照合テーブルを記憶する。The postal code / address matching table section 405 is
A zip code-address matching table that stores address name data used for displaying addresses in the area is stored in correspondence with the zip code. The telephone number-address matching table unit 406 is
A telephone number-address matching table in which address name data used for displaying an address in the area is recorded in the area code of the telephone number and the local office number is stored.

【００６２】一般辞書アドレス記憶部４０７は、単語辞
書記憶部４１２内の一般辞書部４０２のアドレスを記憶
する。固有名詞辞書アドレス記憶部４０８は、単語辞書
記憶部４１２内の固有名詞辞書部４０３のアドレスを記
憶する。帳票用語辞書アドレス記憶部４０９は、単語辞
書記憶部４１２内の帳票用語辞書部４０４のアドレスを
記憶する。The general dictionary address storage unit 407 stores the address of the general dictionary unit 402 in the word dictionary storage unit 412. The proper noun dictionary address storage unit 408 stores the address of the proper noun dictionary unit 403 in the word dictionary storage unit 412. The form term dictionary address storage unit 409 stores the address of the form term dictionary unit 404 in the word dictionary storage unit 412.

【００６３】郵便番号テーブルアドレス記憶部４１０
は、単語辞書記憶部４１２内の郵便番号−住所照合テー
ブル部４０５のアドレスを記憶する。電話番号テーブル
アドレス記憶部４１１は、単語辞書記憶部４１２内の電
話番号−住所照合テーブル部４０６のアドレスを記憶す
る。図５は、主記憶１０１上に設けられる名刺データ認
識結果一時記憶部５０１の記憶領域区分を示す説明図で
ある。Postal code table address storage unit 410
Stores the address of the postal code / address matching table unit 405 in the word dictionary storage unit 412. The telephone number table address storage unit 411 stores the address of the telephone number / address matching table unit 406 in the word dictionary storage unit 412. FIG. 5 is an explanatory diagram showing storage area divisions of the business card data recognition result temporary storage unit 501 provided on the main memory 101.

【００６４】名刺データ認識結果一時記憶部５０１は、
氏名記憶部５０２、所属団体名記憶部５０３、所属部署
名記憶部５０４、住所記憶部５０５、電話番号記憶部５
０６および郵便番号記憶部５０７を備え、名刺入力処理
における認識結果を一時記憶する。氏名記憶部５０２
は、名刺に印字された画像から読み取られた、第１候補
から第５候補までの氏名の補正結果３０８を記憶する。The business card data recognition result temporary storage unit 501 is
Name storage unit 502, organization name storage unit 503, department name storage unit 504, address storage unit 505, telephone number storage unit 5
06 and a postal code storage unit 507 are provided to temporarily store the recognition result in the business card input process. Name storage unit 502
Stores the correction results 308 of the names of the first to fifth candidates read from the image printed on the business card.

【００６５】所属団体名記憶部５０３、名刺に印字され
た画像から読み取られた、第１候補から第５候補までの
所属団体名の補正結果３０８を記憶する。所属部署名記
憶部５０４、名刺に印字された画像から読み取られた、
第１候補から第５候補までの所属部署名の補正結果３０
８を記憶する。住所記憶部５０５、名刺に印字された画
像から読み取られた、第１候補から第５候補までの住所
の補正結果３０８を記憶する。The organization name storage unit 503 stores the correction results 308 of the organization names of the first to fifth candidates read from the image printed on the business card. The department name storage unit 504, read from the image printed on the business card,
Correction results of the department names of the first to fifth candidates 30
Remember 8. The address storage unit 505 stores the correction result 308 of the addresses from the first candidate to the fifth candidate read from the image printed on the business card.

【００６６】電話番号記憶部５０６、名刺に印字された
画像から読み取られた、第１候補から第５候補までの電
話番号の補正結果３０８を記憶する。郵便番号記憶部５
０７、名刺に印字された画像から読み取られた、第１候
補から第５候補までの郵便番号の補正結果３０８を記憶
する。図６は、主記憶１０１上に設けられる名簿情報記
憶部６０１の記憶領域区分を示す説明図である。The telephone number storage unit 506 stores the correction results 308 of the telephone numbers of the first to fifth candidates read from the image printed on the business card. Zip code storage unit 5
07, the postcode correction results 308 of the first to fifth candidates read from the image printed on the business card are stored. FIG. 6 is an explanatory diagram showing storage area divisions of the name list information storage unit 601 provided on the main memory 101.

【００６７】名簿情報記憶部６０１は、個人別に登録さ
れた名簿情報６０８を記憶する。各名簿情報６０８は、
氏名領域６０２、所属団体名領域６０３、所属部署名領
域６０４、住所領域６０５、電話番号領域６０６および
郵便番号領域６０７を備え、文書入力装置１００に登録
された名簿情報を記憶する。氏名領域６０２は、登録さ
れた個人名を記憶する。The name list information storage unit 601 stores name list information 608 registered for each individual. Each list information 608 is
A name area 602, an affiliated organization name area 603, an affiliated department name area 604, an address area 605, a telephone number area 606, and a postal code area 607 are provided, and the name list information registered in the document input device 100 is stored. The name area 602 stores the registered personal name.

【００６８】所属団体名領域６０３は、当該個人が所属
する団体名を記憶する。所属部署名領域６０４は、当該
個人が所属する部署名を記憶する。住所領域６０５は、
当該個人の住所を記憶する。電話番号領域６０６は、当
該個人の電話番号を記憶する。郵便番号領域６０７は、
前記住所の郵便番号を記憶する。The affiliated organization name area 603 stores the organization name to which the individual belongs. The department name area 604 stores the department name to which the individual belongs. The address area 605 is
Remember the address of the individual. The telephone number area 606 stores the telephone number of the individual. The postal code area 607 is
Store the postal code of the address.

【００６９】図７は、主記憶１０１上に設けられる作業
領域７００の記憶領域区分を示す説明図である。作業領
域７００は、文書種別一時記憶部７０１、ブロック番号
一時記憶部７０２、精度水準記憶部７０３、操作水準記
憶部７０４、尤度水準記憶部７０５、補正用辞書アドレ
ス記憶部７０６、修正キー頻度記憶部７０７および処理
用バッファ７０８を備え、認識処理および認識補正処理
の各段階で生成された各種変数、中間処理結果を記憶す
る。FIG. 7 is an explanatory diagram showing the storage area division of the work area 700 provided on the main memory 101. The work area 700 includes a document type temporary storage unit 701, a block number temporary storage unit 702, a precision level storage unit 703, an operation level storage unit 704, a likelihood level storage unit 705, a correction dictionary address storage unit 706, and a correction key frequency storage. A unit 707 and a processing buffer 708 are provided to store various variables and intermediate processing results generated at each stage of the recognition processing and the recognition correction processing.

【００７０】文書種別一時記憶部７０１は、現在認識対
象となっている文書の種別を記憶する。ブロック番号一
時記憶部７０２は、現在認識対象となっている認識対象
ブロックの識別番号を記憶する。精度水準記憶部７０３
は、文字認識結果の精度の基準値として予め定められた
精度水準を記憶する。前記文字認識結果の精度は、文字
認識結果である候補文字コード３０５の第１候補を並べ
て得られる文字列と、最終確定した文字列との比較照合
の結果、一致した文字数と照合された文字数との割合で
表される。The document type temporary storage unit 701 stores the type of the document which is the current recognition target. The block number temporary storage unit 702 stores the identification number of the recognition target block that is currently the recognition target. Accuracy level storage unit 703
Stores a predetermined accuracy level as a reference value of the accuracy of the character recognition result. The accuracy of the character recognition result is obtained by comparing and collating the character string obtained by arranging the first candidates of the candidate character code 305, which is the character recognition result, with the finally confirmed character string, and the number of matched characters and the number of matched characters. Expressed as a ratio of.

【００７１】操作水準記憶部７０４は、例えば、オペレ
ータが補正結果３０８を修正する際に次の候補の表示を
指示するために用いる後退キーや、間違って認識された
文字を書き直すために用いる削除キーなど、キーボード
１１３の中の修正指示用のキーについて、その打鍵数の
基準値として予め定められた操作水準を記憶する。尤度
水準記憶部７０５は、文字認識結果である候補文字コー
ド３０５の尤度３０６の下限値として予め定められた尤
度水準を記憶する。The operation level storage unit 704 is, for example, a back key used for instructing the display of the next candidate when the operator corrects the correction result 308, or a delete key used for rewriting a character recognized by mistake. For the correction instruction keys in the keyboard 113, a predetermined operation level is stored as a reference value of the number of keystrokes. The likelihood level storage unit 705 stores a likelihood level that is predetermined as a lower limit value of the likelihood 306 of the candidate character code 305 that is the character recognition result.

【００７２】補正用辞書アドレス記憶部７０６は、文書
入力装置１００が現在使用している補正用辞書のアドレ
スを３つまで記憶する。修正キー頻度記憶部７０７は、
表示された補正結果３０８を修正するために用いられた
後退キー、削除キーなどの打鍵回数を逐次記録する。処
理用バッファ７０８は、文書入力装置１００が認識処理
および認識補正処理の各段階で生成した中間的な処理結
果を記憶する。The correction dictionary address storage unit 706 stores up to three addresses of the correction dictionary currently used by the document input device 100. The correction key frequency storage unit 707 is
The number of times of hitting the backward key, the delete key, etc. used for correcting the displayed correction result 308 is sequentially recorded. The processing buffer 708 stores the intermediate processing result generated by the document input device 100 at each stage of the recognition processing and the recognition correction processing.

【００７３】メモリ制御ユニット兼用バス制御ユニット
１０２は、ＣＰＵ１２０と主記憶１０１およびローカル
バス１０４との間のデータの伝送を制御する。ローカル
バス１０４は、データ伝送路である。ローカルバス１０
４には、グラフィクス制御ユニット１０５を介してディ
スプレイ１０６が結合され、ＳＣＳＩ制御ユニット１０
７を介してハードディスク装置１０８、スキャナー１０
９が結合されている。The memory control unit / bus control unit 102 controls data transmission between the CPU 120 and the main memory 101 and the local bus 104. The local bus 104 is a data transmission path. Local bus 10
4, a display 106 is coupled to the SCSI control unit 10 via a graphics control unit 105.
Via the hard disk device 108, the scanner 10
9 are connected.

【００７４】グラフィクス制御ユニット１０５は、ＣＰ
Ｕ１２０による処理結果などのデータを、例えば、ビッ
トマップデータで表される表示データに変換するととも
に、ディスプレイ１０６を制御して前記表示データを表
示面に表示させる。ディスプレイ１０６は、例えば、ビ
ットマップデータで表される表示データを表示する。The graphics control unit 105 uses the CP
The data such as the processing result by the U120 is converted into display data represented by, for example, bitmap data, and the display 106 is controlled to display the display data on the display surface. The display 106 displays display data represented by bitmap data, for example.

【００７５】ＳＣＳＩ制御ユニット１０７は、多種の入
出力装置を接続することが可能で、複数の計算機から入
出力装置を共有することができる小型機用汎用インター
フェースである。ハードディスク装置１０８は、内部に
備える磁気ディスクにデータを格納する。The SCSI control unit 107 is a general-purpose interface for small machines, to which various input / output devices can be connected and which can be shared by a plurality of computers. The hard disk device 108 stores data on a magnetic disk provided inside.

【００７６】スキャナー１０９は、例えば、印刷物等に
光を走査しながら照射し、その反射光の光量を検出する
ことによって紙面等に印刷されたデータを読み取る。さ
らに、バスブリッジ回路１１０によってローカルバス１
０４に結合された外部バス１１１には、入出力制御ユニ
ット１１２を介して、キーボード１１３、プリンタ１１
４、マウス１１５などが結合されている。バスブリッジ
回路１１０は、ローカルバス１０４と外部バス１１１と
を接続して相互にデータの伝送を行わせる接続部であ
る。The scanner 109 reads data printed on a paper surface or the like by irradiating a printed matter or the like with light while scanning and detecting the amount of the reflected light. Furthermore, the bus bridge circuit 110 allows the local bus 1
A keyboard 113 and a printer 11 are connected to an external bus 111 connected to the printer 04 via an input / output control unit 112.
4, a mouse 115 and the like are connected. The bus bridge circuit 110 is a connection unit that connects the local bus 104 and the external bus 111 to mutually transmit data.

【００７７】外部バス１１１は、キーボード１１３、プ
リンタ１１４、マウス１１５などの外部入出力装置から
のデータを伝送する伝送路である。入出力制御ユニット
１１２は、キーボード１１３、プリンタ１１４、マウス
１１５などの低速入出力装置を制御して前記各入出力装
置を外部バス１１１に接続する。The external bus 111 is a transmission path for transmitting data from external input / output devices such as the keyboard 113, printer 114, mouse 115 and the like. The input / output control unit 112 controls low-speed input / output devices such as the keyboard 113, the printer 114, and the mouse 115 to connect the input / output devices to the external bus 111.

【００７８】キーボード１１３は、複数のキーを備え、
押下されたキーに対応するキー信号を出力する。プリン
タ１１４は、ＣＰＵ１２０の処理結果などを紙面に印字
することによって出力する。マウス１１５は、ローラお
よびクリックキーを備え、前記ローラの回転に対応した
カーソル制御信号を出力し、クリックキーが押下される
とクリック信号を出力する。The keyboard 113 has a plurality of keys,
The key signal corresponding to the pressed key is output. The printer 114 outputs the processing result of the CPU 120 and the like by printing it on a paper surface. The mouse 115 includes a roller and a click key, outputs a cursor control signal corresponding to the rotation of the roller, and outputs a click signal when the click key is pressed.

【００７９】ＣＰＵ１２０は、メモリ制御ユニット兼用
バス制御ユニット１０２を介して、主記憶１０１とデー
タのやり取りを行う。ＣＰＵ１２０は、主記憶１０１に
格納されている手順プログラムを実行し、前記各入出力
装置を制御することによって、後述する本実施例の文書
入力装置１００を実現する。以下、文書入力装置１０
０、すなわち、ＣＰＵ１２０の処理手順を、フローチャ
ートおよび具体例を参照しながら説明する。以下の説明
では、本実施例の文書入力装置１００において使用され
る各種変数は、予め初期化されているものとして説明す
る。The CPU 120 exchanges data with the main memory 101 via the memory control unit / bus control unit 102. The CPU 120 executes the procedure program stored in the main memory 101 and controls each of the input / output devices to implement the document input device 100 of this embodiment described later. Hereinafter, the document input device 10
0, that is, the processing procedure of the CPU 120 will be described with reference to flowcharts and specific examples. In the following description, various variables used in the document input device 100 of this embodiment will be described as being initialized in advance.

【００８０】図８は、ＣＰＵ１２０の処理手順の全体の
概要を示すフローチャートである。図９は、文書入力処
理の初期段階における表示画面およびデータの具体例を
示す説明図である。オペレータが、スキャナー１０９か
ら入力を意図した文書を読み込ませると（ステップＳ８
０１）、ＣＰＵ１２０は、図９（ａ）に示すメニュー画
面をディスプレイ１０６に表示させ、スキャナー１０９
から入力された文書が「帳票形式」か「名刺」か「一般
文書」かをオペレータに尋ねる（ステップＳ８０２）。FIG. 8 is a flowchart showing an outline of the entire processing procedure of the CPU 120. FIG. 9 is an explanatory diagram showing a specific example of the display screen and data in the initial stage of the document input process. When the operator reads the document intended for input from the scanner 109 (step S8)
01), the CPU 120 causes the display 106 to display the menu screen shown in FIG.
The operator is asked whether the document input from is "form form", "business card", or "general document" (step S802).

【００８１】ＣＰＵ１２０は、オペレータからの応答に
従い、後の処理を分岐する（ステップＳ８０３）。な
お、ここでは、オペレータがメニュー画面から入力され
た文書の種別を入力することとしたが、スキャナー１０
９から読み込まれた文書の大きさ、罫線枠の有無などか
ら、ＣＰＵ１２０が前記文書種別を認識するようにして
もよい。The CPU 120 branches the subsequent processing according to the response from the operator (step S803). Note that here, the operator inputs the type of the document input from the menu screen, but the scanner 10
The CPU 120 may recognize the document type based on the size of the document read from the document 9, the presence / absence of a ruled line frame, and the like.

【００８２】ステップＳ８０３において、『一般文書』
が前記メニュー画面で選択されると、ＣＰＵ１２０は、
一般的な文章を対象とした一般辞書を格納している一般
辞書部４０２のアドレスを、辞書管理情報記憶部４０１
内の一般辞書アドレス記憶部４０７から読み出して、補
正用辞書アドレス記憶部７０６に記憶させる（ステップ
Ｓ８０４）。In step S803, "general document"
When is selected on the menu screen, the CPU 120
The address of the general dictionary unit 402 that stores the general dictionary for general sentences is set to the dictionary management information storage unit 401.
It is read out from the general dictionary address storage unit 407 in the memory and stored in the correction dictionary address storage unit 706 (step S804).

【００８３】ＣＰＵ１２０は、後述の、一般文書向けの
テキスト認識入力処理および認識誤り補正処理を行う。
これにおいて、ＣＰＵ１２０は、所定数、例えば、４つ
の認識列について補正結果３０８が得られるごとに、前
記補正結果３０８の第１候補を認識対象となったテキス
ト画像とともに、ディスプレイ１０６に表示させ、文書
入力装置１００が別途保有するテキスト編集プログラム
を起動する。オペレータは、表示された補正結果３０８
について、キーボード１１３などを用いて認識誤り箇所
を正しい文字に修正し、修正が完了すれば、確認の入力
を行う。ＣＰＵ１２０は、前記修正結果に基づいて、次
の文字列について認識誤り補正処理を行う。このように
して、ＣＰＵ１２０は、所定数の認識列ごとに前記認識
誤り補正処理と補正結果の修正処理とを繰り返し、すべ
ての文字について前記処理を終了すると、認識入力され
た文書を、必要があれば印刷して、文書入力を完了する
（ステップＳ８０５）。この際、ＣＰＵ１２０は、オペ
レータが補正結果３０８の修正のため使用した、キーボ
ード１１３の中の後退キーと削除キーとの打鍵を監視
し、打鍵の度に修正キー頻度記憶部７０７の値をインク
リメントする。The CPU 120 performs text recognition input processing and recognition error correction processing for general documents, which will be described later.
In this case, the CPU 120 causes the display 106 to display the first candidate of the correction result 308 together with the text image as the recognition target on the display 106 every time the correction result 308 is obtained for a predetermined number, for example, four recognition columns. The input device 100 activates a text editing program separately held. The operator uses the displayed correction result 308.
With respect to, the recognition error portion is corrected to a correct character using the keyboard 113 or the like, and when the correction is completed, confirmation input is performed. The CPU 120 performs recognition error correction processing for the next character string based on the correction result. In this way, the CPU 120 repeats the recognition error correction processing and the correction result correction processing for each predetermined number of recognition columns, and when the processing is completed for all characters, the recognized and input document is required. If it is printed, the document input is completed (step S805). At this time, the CPU 120 monitors the keystrokes of the backward key and the delete key in the keyboard 113 used by the operator to correct the correction result 308, and increments the value of the corrected key frequency storage unit 707 each time the keystroke is made. .

【００８４】ステップＳ８０３において、『名刺』が前
記メニュー画面で選択されると、ＣＰＵ１２０は、名
刺、住所録など固有名詞を対象とした、補正用の固有名
詞辞書を格納している固有名詞辞書部４０３のアドレス
を、辞書管理情報記憶部４０１内の固有名詞辞書アドレ
ス記憶部４０８から読み出して、補正用辞書アドレス記
憶部７０６に記憶させる（ステップＳ８０７）。When "business card" is selected on the menu screen in step S803, the CPU 120 causes the proper noun dictionary section for storing proper noun dictionaries for proper nouns such as business cards and address books. The address of 403 is read from the proper noun dictionary address storage unit 408 in the dictionary management information storage unit 401 and stored in the correction dictionary address storage unit 706 (step S807).

【００８５】次いで、ＣＰＵ１２０は、後述の、名刺向
けのテキスト認識入力を行ったのち（ステップＳ８０
８）、認識した名刺データを、名簿情報記憶部６０１内
に記憶されている名簿情報と照合し、同一個人名が名簿
情報記憶部６０１内に記憶されていない名刺データにつ
いては、新規名簿情報として名簿情報記憶部６０１に新
規登録する。また、同一個人名が名簿情報記憶部６０１
内に記憶されているが他の部分が異なる名刺データにつ
いては、前記名刺データからの情報によって前記名簿情
報を更新する（ステップＳ８０９）。Next, the CPU 120 carries out text recognition input for business cards, which will be described later (step S80).
8), the recognized business card data is collated with the name list information stored in the name list information storage unit 601, and the business card data for which the same individual name is not stored in the name list information storage unit 601 is treated as new name list information. It is newly registered in the name list information storage unit 601. Further, the same personal name is stored in the name list information storage unit 601.
For the business card data stored in the other but different in other parts, the name list information is updated with the information from the business card data (step S809).

【００８６】ステップＳ８０３において、『帳票形式』
が前記メニュー画面で選択されると、ＣＰＵ１２０は、
一般辞書を格納する一般辞書部４０２に加えて、帳票に
高頻度で出現する帳票用語を収録した帳票用語辞書を格
納する帳票用語辞書部４０４のアドレスを、辞書管理情
報記憶部４０１内の一般辞書アドレス記憶部４０７と帳
票用語辞書アドレス記憶部４０９とから読み出して、補
正用辞書アドレス記憶部７０６に記憶させる（ステップ
Ｓ８１０）。In step S803, "form form"
When is selected on the menu screen, the CPU 120
In addition to the general dictionary unit 402 that stores the general dictionary, the address of the form term dictionary unit 404 that stores the form term dictionary in which the form terms that frequently appear in the form are stored is set to the general dictionary in the dictionary management information storage unit 401. It is read from the address storage unit 407 and the form term dictionary address storage unit 409 and stored in the correction dictionary address storage unit 706 (step S810).

【００８７】次いで、ＣＰＵ１２０は、罫線枠で囲われ
たブロックと、空白で囲われたテキスト領域ブロックと
を帳票から抽出するとともに、抽出された前記両者を、
それぞれ、罫線枠ブロック、一般ブロックとして個別に
ブロック管理情報２１０を作成し、ブロック管理情報記
憶部２０２に記憶させる（ステップＳ８１１）。なお、
罫線枠ブロックであるか、一般ブロックであるかは、ブ
ロック管理情報２１０内の罫線枠情報２１２の項目に書
き込まれる。また、これらの手法は、「表の構造理解の
ための罫線抽出と領域分け電子情報通信学会技術研究
報告Ｖｏｌ．９０Ｎｏ．２３５（ＰＲＵ９０６８−
８０）ｐ３３−３８」や「空白領域情報を用いた文書画
像の領域分割電子情報通信学会春季大会講演論文集
Ｄ−５６８ｐ７−４１０（１９９２）」等に記載され
た方式を用いており、またこの詳細手順は本発明の主眼
でもないので、詳細な説明を省略する。Next, the CPU 120 extracts the block surrounded by the ruled line frame and the text area block surrounded by the blank from the form, and the extracted both are
The block management information 210 is individually created as a ruled line frame block and a general block, and is stored in the block management information storage unit 202 (step S811). In addition,
Whether it is a ruled line frame block or a general block is written in the item of ruled line frame information 212 in the block management information 210. In addition, these methods are described in “Technical Report Vol.90 No. 235 (PRU90 68-
80) p33-38 "or" area division of document image using blank area information "Proceedings of IEICE Spring Conference
D-568 p7-410 (1992) ”and the like are used, and since this detailed procedure is not the main subject of the present invention, detailed description thereof will be omitted.

【００８８】ＣＰＵ１２０は、前記罫線枠ブロック、前
記一般ブロック毎に、内部のテキスト文字と、文字列方
向を認識したのち（ステップＳ８１２）、各ブロック内
で認識された文字列に、認識された文字列方向と対応し
た均等割付情報を付与する（ステップＳ８１３）。な
お、ステップＳ８１２およびステップＳ８１３の処理に
ついては、後に詳述する。After the CPU 120 recognizes the internal text characters and the character string direction for each of the ruled line frame block and the general block (step S812), the recognized character is recognized in the character string recognized in each block. Equal allocation information corresponding to the column direction is added (step S813). The processing of steps S812 and S813 will be described in detail later.

【００８９】図１０は、図８に示したステップＳ８０５
におけるテキスト認識処理の内容を詳細に説明するフロ
ーチャートである。ＣＰＵ１２０は、図９（ｂ）に示す
ように、スキャナー１０９から入力された文書を、ディ
スプレイ１０６に表示させる。オペレータは、ディスプ
レイ１０６に表示された文書の画像を確認しながら、マ
ウス１１５を用いて、前記認識対象ブロックを指定する
（ステップＳ１００１）。図９（ｂ）において網掛けさ
れた部分は、ステップＳ１００１においてオペレータに
指定された認識対象ブロックの一例を示す。FIG. 10 shows step S805 shown in FIG.
3 is a flowchart illustrating in detail the content of the text recognition process in FIG. As shown in FIG. 9B, the CPU 120 causes the display 106 to display the document input from the scanner 109. The operator designates the recognition target block by using the mouse 115 while checking the image of the document displayed on the display 106 (step S1001). The shaded area in FIG. 9B indicates an example of the recognition target block designated by the operator in step S1001.

【００９０】ＣＰＵ１２０は、指定された認識対象ブロ
ック内の画像を、横方向と縦方向に写像したヒストグラ
ムを基にして、当該認識対象ブロック内の文字領域およ
びその文字方向を推定する。具体的には、ＣＰＵ１２０
は、横方向と縦方向に写像した画像のヒストグラムを生
成し、画像および空白領域に周期性があって画像と画像
との間隔の狭い方向を文字方向と推定する。推定された
文字列方向およびステップＳ１００１において指定され
た認識対象ブロックの画像の位置は、それぞれ、ブロッ
ク管理情報２１０内の文字列方向２１３および画像位置
情報２０３の各項目に書き込まれる。さらに、ＣＰＵ１
２０は、前記表示データ記憶部２０３および文字列方向
２１３に基づいて、当該認識対象ブロックに含まれてい
る各認識列に対応した画像を切り出すとともに、前記各
認識列に含まれている１文字分の画像を切り出す。切り
出された各文字に対応する画像の位置は、認識文字情報
３０２の文字位置情報３０３の項目に書き込まれる（ス
テップＳ１００２）。The CPU 120 estimates a character area in the recognition target block and its character direction based on a histogram obtained by mapping the image in the designated recognition target block in the horizontal and vertical directions. Specifically, the CPU 120
Generates a histogram of images that are mapped in the horizontal and vertical directions, and estimates the direction in which there is periodicity in the image and the blank area and the distance between the images is narrow as the character direction. The estimated character string direction and the position of the image of the recognition target block designated in step S1001 are written in the items of the character string direction 213 and the image position information 203 in the block management information 210, respectively. Furthermore, CPU1
20 cuts out an image corresponding to each recognition string included in the recognition target block based on the display data storage unit 203 and the character string direction 213, and at the same time, extracts one character included in each recognition string. Cut out the image of. The position of the image corresponding to each cut out character is written in the item of the character position information 303 of the recognized character information 302 (step S1002).

【００９１】図９（ｃ）は、ステップＳ１００２の処理
によって、図９（ｂ）に示した認識対象ブロック内から
切り出された４個の認識列を示す。従って、図９（ｂ）
に示した認識対象ブロック内の画像に対して、４個の認
識列管理情報３０１が作成されるとともに、各認識列管
理情報３０１内には、当該認識列に含まれる文字数分、
例えば、認識列２については３２個の認識文字情報３０
２が作成され、前記認識列管理部２０４に記憶される。FIG. 9C shows four recognition columns cut out from the recognition target block shown in FIG. 9B by the processing of step S1002. Therefore, FIG. 9 (b)
4 pieces of recognition sequence management information 301 are created for the image in the recognition target block shown in FIG.
For example, for the recognition string 2, 32 pieces of recognition character information 30
2 is created and stored in the recognition column management unit 204.

【００９２】次いで、ＣＰＵ１２０は、文字位置情報３
０３で示される位置の画像から、文字識別用特徴を抽出
した上、抽出した前記文字識別用特徴と文書入力装置１
００が別途保有する活字認識用辞書とを比較照合し、類
似度が高いものから順に、第１位から第１０位までの候
補文字コード３０５を、尤度３０６とともに生成する
（ステップＳ１００３）。Next, the CPU 120 causes the character position information 3
The character identification feature is extracted from the image at the position indicated by 03, and the extracted character identification feature and the document input device 1 are also extracted.
00 is compared and collated with a type recognition dictionary that is separately held, and candidate character codes 305 from the first rank to the tenth rank are generated together with the likelihood 306 in descending order of similarity (step S1003).

【００９３】生成された候補文字コード３０５および尤
度３０６は、認識文字情報３０２内の候補文字情報３０
７として、認識列管理部２０４に格納される。この単位
文字認識手法は、「ＮａｔｉｏｎａｌＴｅｈｃｈｎｉ
ｃａｌｒｅｐｏｒｔｖｏｌ．３５Ｎｏ．１（Ｆｅ
ｂ．１９８９）」等に記載された手法であり、本発明の
主眼ではないので、説明を省略する。この結果、認識列
管理部２０４に、各認識文字に関する認識文字情報３０
２が格納される。また、このとき、ＣＰＵ１２０は、文
字領域の間の空白領域が全角幅以上の大きさである場合
には、前記文字領域間の空白領域を単語あるいは文節の
区切りとして有効とし、そうでない場合には無効とす
る。ＣＰＵ１２０は、有効な前記空白による単語区切り
を直後に有する画像には、対応した認識文字情報３０２
の区切り情報３０４に「１」を書き込み、そうでない画
像には、前記区切り情報３０４に「０」を書き込む。The generated candidate character code 305 and likelihood 306 are the candidate character information 30 in the recognized character information 302.
7 is stored in the recognition column management unit 204. This unit character recognition method is based on "National Tehchni".
cal report vol. 35 No. 1 (Fe
b. 1989) ”, etc., and is not the main subject of the present invention, and therefore the description thereof is omitted. As a result, the recognition string management unit 204 causes the recognition character information 30 regarding each recognition character to be recognized.
2 is stored. At this time, the CPU 120 validates the blank area between the character areas as a word or phrase delimiter when the blank area between the character areas is larger than the full width, and otherwise. Invalidate. The CPU 120 determines that the recognized character information 302 corresponding to an image immediately after which is a valid word segmentation by the blank.
“1” is written in the delimiter information 304 of the above, and “0” is written in the delimiter information 304 in the image other than that.

【００９４】図１１は、文字認識処理により生成された
認識文字情報３０２および認識誤り補正後に表示される
表示画面の具体例を示す説明図である。図１１（ａ）
は、ステップＳ１００３において、ＣＰＵ１２０が、図
９（ｃ）に示した認識列３の中で２０文字目に相当する
画像を、「日」と認識したときの認識文字情報３０２の
内容の一例を示す。FIG. 11 is an explanatory diagram showing a specific example of the recognized character information 302 generated by the character recognition processing and the display screen displayed after recognition error correction. FIG. 11 (a)
Shows an example of the contents of the recognized character information 302 when the CPU 120 recognizes the image corresponding to the 20th character in the recognition sequence 3 shown in FIG. 9C as “day” in step S1003. ..

【００９５】次いで、ＣＰＵ１２０は、文字認識結果の
適合率を上げ、不要な後処理照合を減らすため、候補文
字情報３０７内の候補文字コード３０５のうち、第５位
以下の候補文字コード３０５であって、その尤度３０６
が、尤度水準記憶部７０５に記憶されている尤度水準よ
り低いものを、候補文字情報３０７から取り除く（ステ
ップＳ１００４）。図１１（ａ）に示した認識文字情報
３０２の例では、例えば、前記尤度水準が「４０」の場
合、その尤度３０６が「４０」より低い第８位候補から
第１０位候補までが候補文字情報３０７から取り除か
れ、候補文字情報３０７内の認識文字候補は、第１位候
補から第７位候補の７つに限定される。なお、ステップ
Ｓ１００４の処理については、後にさらに詳述する。Next, in order to increase the matching rate of the character recognition result and reduce unnecessary post-processing collation, the CPU 120 selects the candidate character code 305 of the fifth rank or lower among the candidate character codes 305 in the candidate character information 307. And its likelihood 306
, Which is lower than the likelihood level stored in the likelihood level storage unit 705 is removed from the candidate character information 307 (step S1004). In the example of the recognized character information 302 shown in FIG. 11A, for example, when the likelihood level is “40”, the 8th to 10th candidates whose likelihood 306 is lower than “40” are The recognized character candidates that are removed from the candidate character information 307 and are in the candidate character information 307 are limited to seven candidates from the first rank candidate to the seventh rank candidate. The process of step S1004 will be described in more detail later.

【００９６】ＣＰＵ１２０は、認識列管理情報３０１に
対応する認識列単位で、候補文字情報３０７内の候補文
字コード３０５を、補正用辞書アドレス記憶部７０６で
指定された補正用辞書と文字列照合しながら配列し、所
定の優先順位に従って、文節として解釈可能な文節の並
びからなる補正結果を、補正結果３０８に書き込む（ス
テップＳ１００５）。なお、ステップＳ１００５の処理
については、後にさらに詳述する。The CPU 120 collates the candidate character code 305 in the candidate character information 307 with the correction dictionary designated by the correction dictionary address storage unit 706 in character string units in recognition string units corresponding to the recognition string management information 301. However, the correction result composed of a sequence of phrases that can be interpreted as a phrase according to a predetermined priority order is written in the correction result 308 (step S1005). The process of step S1005 will be described in detail later.

【００９７】ＣＰＵ１２０は、連続する所定数の、例え
ば、４つの認識列に対応する補正結果３０８が書き込ま
れる毎に（ステップＳ１００６）、後述の補正結果修正
処理を行い（ステップＳ１００７）、この段階で、まだ
認識残りのブロックが存在し、認識入力が完了していな
い場合（ステップＳ１００８）、ステップＳ１００３の
処理に戻り、そうでない場合、ステップＳ１００７の処
理と同様、後述の補正結果修正処理を行った後、処理を
終了する（ステップＳ１００９）。Each time the CPU 120 writes the correction results 308 corresponding to a predetermined number of consecutive recognition columns, for example, four recognition columns (step S1006), it performs a correction result correction process described later (step S1007), and at this stage. If the recognition remaining block still exists and the recognition input is not completed (step S1008), the process returns to the process of step S1003. If not, the correction result correction process described below is performed as in the process of step S1007. Then, the process ends (step S1009).

【００９８】図１０（ａ）は、ステップＳ１００７およ
びステップＳ１００９に示した前記補正結果修正処理の
具体的な処理手順を示す。ＣＰＵ１２０は、補正結果３
０８の第１候補を、認識対象ブロックの画像とともに、
ディスプレイ１０６に表示し、文書入力装置１００が別
途格納しているテキスト編集機能を起動する。図１１
（ｂ）は、このときのディスプレイ１０６の表示例を示
す。図１１（ｂ）に示す表示画面の上半分は認識対象ブ
ロックの画像を示し、表示画面の下半分は補正結果３０
８の第１位候補からなる補正結果を示す。オペレータ
は、表示された補正結果３０８が間違っている場合、起
動されたテキスト編集機能により、キーボード１１３を
用いて、表示された補正結果３０８の中で間違っている
箇所の文字を修正する（ステップＳ１０１１）。この
際、ＣＰＵ１２０は、オペレータが補正結果３０８の修
正のため使用した、キーボード１１３の中の後退キーと
削除キーとの打鍵を監視し、打鍵の度に修正キー頻度記
憶部７０７の値をインクリメントする。FIG. 10A shows a specific processing procedure of the correction result correction processing shown in steps S1007 and S1009. The CPU 120 displays the correction result 3
08 first candidate, together with the image of the recognition target block,
The text is displayed on the display 106 and the text editing function separately stored in the document input device 100 is activated. Figure 11
(B) shows a display example of the display 106 at this time. The upper half of the display screen shown in FIG. 11B shows the image of the recognition target block, and the lower half of the display screen shows the correction result 30.
8 shows the result of correction consisting of 8 first place candidates. When the displayed correction result 308 is incorrect, the operator uses the activated text editing function to use the keyboard 113 to correct the incorrect character in the displayed correction result 308 (step S1011). ). At this time, the CPU 120 monitors the keystrokes of the backward key and the delete key in the keyboard 113 used by the operator to correct the correction result 308, and increments the value of the corrected key frequency storage unit 707 each time the keystroke is made. .

【００９９】オペレータが修正を完了し、補正結果３０
８の確定指示をキーボード１１３から入力すると、ＣＰ
Ｕ１２０は、認識列管理部２０４に記憶されている候補
文字情報３０７から候補文字コード３０５の第１候補を
取り出して得られた文字列と、最終確定された文字列と
を比較照合し、前記文字列同士の一致の度合いを算出す
る。すなわち、ＣＰＵ１２０は、以下の式に基づいて、
精度ｒを算出する（ステップＳ１０１２）。The operator completes the correction, and the correction result 30
When the confirmation instruction of 8 is input from the keyboard 113, the CP
U120 compares and collates the character string obtained by extracting the first candidate of the candidate character code 305 from the candidate character information 307 stored in the recognition string management unit 204 with the final confirmed character string, and The degree of coincidence between columns is calculated. That is, the CPU 120, based on the following formula,
The accuracy r is calculated (step S1012).

【０１００】ｒ／１００＝（一致した文字数）／（照合した文字数）ＣＰＵ１２０は、算出された精度ｒと、精度水準記憶部
７０３内に記憶されている予め定められた精度水準とを
比較し（ステップＳ１０１３）、算出された精度ｒが前
記精度水準よりも低い場合、尤度水準記憶部７０５内の
尤度水準を１０％低い値に変更し、処理を終了する（ス
テップＳ１０１４）。なお、ここでは、算出された精度
ｒが前記精度水準記憶部７０３内に記憶されている精度
水準よりも低い場合についてだけ説明したが、逆に、算
出された精度ｒが前記精度水準記憶部７０３内に記憶さ
れている精度水準よりも高い場合には、尤度水準記憶部
７０５内の尤度水準を１０％高い値に変更するようにし
てもよい。R / 100 = (number of matched characters) / (number of collated characters) The CPU 120 compares the calculated accuracy r with a predetermined accuracy level stored in the accuracy level storage unit 703 ( (Step S1013), if the calculated precision r is lower than the precision level, the likelihood level in the likelihood level storage unit 705 is changed to a value 10% lower, and the process ends (step S1014). It should be noted that here, only the case where the calculated precision r is lower than the precision level stored in the precision level storage unit 703 has been described, but conversely, the calculated precision r is the precision level storage unit 703. If the accuracy level is higher than the accuracy level stored therein, the likelihood level in the likelihood level storage unit 705 may be changed to a value 10% higher.

【０１０１】なお、以上では、上記のように算出された
精度ｒと、前記精度水準記憶部７０３内に記憶されてい
る精度水準とを比較し、その比較結果に基づいて、尤度
水準記憶部７０５内の尤度水準を変更する場合について
説明したが、別の基準値をもとに尤度水準記憶部７０５
内の尤度水準を変更してもよい。具体的には、ステップ
Ｓ１０１１の処理において説明したように、ＣＰＵ１２
０は、オペレータが補正結果３０８を修正する際に用い
たキーボード１１３の中の後退キーと削除キーとの打鍵
数を、その都度、修正キー頻度記憶部７０７内に計数し
ている。これにより、オペレータが修正を完了し、補正
結果３０８の確定指示をキーボード１１３から入力する
と、ＣＰＵ１２０は、修正キー頻度記憶部７０７内に記
憶されている打鍵数と、操作水準記憶部７０４内に記憶
されている操作水準とを比較し、修正キー頻度記憶部７
０７内に記憶されている打鍵数が前記操作水準より大き
い場合には、尤度水準記憶部７０５内の尤度水準を１０
％低い値に変更するようにしてもよい。また、同様に、
修正キー頻度記憶部７０７内に記憶されている打鍵数が
前記操作水準より小さい場合には、尤度水準記憶部７０
５内の尤度水準を１０％高い値に変更するようにしても
よい。これにより、精度ｒを算出するまでもなく、容易
に文字認識の精度を判断することができる。In the above, the accuracy r calculated as described above is compared with the accuracy level stored in the accuracy level storage unit 703, and the likelihood level storage unit 703 is based on the comparison result. The case of changing the likelihood level in 705 has been described, but the likelihood level storage unit 705 is based on another reference value.
The likelihood level within may be changed. Specifically, as described in the processing of step S1011, the CPU 12
For 0, the number of times the backward key and the delete key in the keyboard 113 used by the operator to correct the correction result 308 are counted in the corrected key frequency storage unit 707 each time. As a result, when the operator completes the correction and inputs a confirmation instruction of the correction result 308 from the keyboard 113, the CPU 120 stores the number of keystrokes stored in the correction key frequency storage unit 707 and the operation level storage unit 704. The corrected key frequency storage unit 7 is compared with the operated operation level.
If the number of keystrokes stored in 07 is larger than the operation level, the likelihood level in the likelihood level storage unit 705 is set to 10
The value may be changed to a lower value. Similarly,
When the number of keystrokes stored in the modified key frequency storage unit 707 is smaller than the operation level, the likelihood level storage unit 70
The likelihood level in 5 may be changed to a value 10% higher. This makes it possible to easily determine the accuracy of character recognition without calculating the accuracy r.

【０１０２】図１２は、図１０のステップＳ１００４に
おける候補文字限定処理を説明するフローチャートであ
る。ＣＰＵ１２０は、限定開始候補番号Ｎを「５」に設
定する（ステップＳ１２０１）。ＣＰＵ１２０は、第Ｎ
位候補である候補文字コード３０５を尤度３０６ととも
に、認識列管理部２０４に記憶されている候補文字情報
３０７から取り出し（ステップＳ１２０２）、取り出さ
れた尤度３０６が、尤度水準記憶部７０５に記憶されて
いる尤度水準と比較して小さいか否かを判定する（ステ
ップＳ１２０３）。小さい場合、候補文字情報３０７内
の第Ｎ位以下の候補の尤度３０６の値を「０」にしてこ
の候補を無効にし、候補文字限定処理を終了する（ステ
ップＳ１２０４）。FIG. 12 is a flow chart for explaining the candidate character limiting process in step S1004 of FIG. The CPU 120 sets the limited start candidate number N to "5" (step S1201). The CPU 120 is the Nth
The candidate character code 305, which is a position candidate, is extracted together with the likelihood 306 from the candidate character information 307 stored in the recognition string management unit 204 (step S1202), and the extracted likelihood 306 is stored in the likelihood level storage unit 705. A comparison is made with the stored likelihood level to determine whether it is smaller (step S1203). If it is smaller, the value of the likelihood 306 of the Nth or lower candidate in the candidate character information 307 is set to “0” to invalidate this candidate, and the candidate character limiting process is ended (step S1204).

【０１０３】小さくない場合、Ｎをインクリメントし
（ステップＳ１２０５）、Ｎの値を最大候補数である
「１０」と比較して（ステップＳ１２０６）、Ｎの値が
「１０」以下である場合、ステップＳ１２０２の処理に
戻る。そうでなければ終了する。なお、変数Ｎは、処理
用バッファ７０８の領域に記憶される。図１３は、図１
０のステップＳ１００５における認識誤り補正処理を説
明するフローチャートである。If it is not small, N is incremented (step S1205), the value of N is compared with the maximum number of candidates, "10" (step S1206), and if the value of N is "10" or less, step It returns to the process of S1202. Otherwise it ends. The variable N is stored in the area of the processing buffer 708. 13 is the same as FIG.
It is a flowchart explaining the recognition error correction process in step S1005 of 0.

【０１０４】ＣＰＵ１２０は、解釈開始点を認識列管理
部２０４に格納された認識列の先頭位置に設定する（ス
テップＳ１３０１）。ＣＰＵ１２０は、認識列管理部２
０４内の候補文字情報３０７に格納された候補文字コー
ド３０５を、補正用辞書アドレス記憶部７０６で指定さ
れる一般辞書と照合し、解釈開始点から始まり、日本語
の文法として成り立つ文節解釈を生成する（ステップＳ
１３０２）。なお、ステップＳ１３０２の処理について
は、後に詳述する。The CPU 120 sets the interpretation start point to the head position of the recognition sequence stored in the recognition sequence management unit 204 (step S1301). The CPU 120 uses the recognition sequence management unit 2
The candidate character code 305 stored in the candidate character information 307 in 04 is collated with the general dictionary designated by the correction dictionary address storage unit 706 to generate a phrase interpretation that starts from the interpretation start point and is valid as a Japanese grammar. Yes (Step S
1302). The process of step S1302 will be described later in detail.

【０１０５】次に、ＣＰＵ１２０は、各認識列の中で、
処理し残した文字列があるか否かを判定する（ステップ
Ｓ１３０３）。処理し残した文字列があれば、次の解釈
開始点を設定したのち、ステップＳ１３０２の処理に戻
り、設定した解釈開始点から次の文節解釈を求める（ス
テップＳ１３０４）。処理し残した文字列がなければ、
処理対象となっている認識列について、文節として解釈
が可能な文節解釈の組み合わせからなる、すべての文節
解釈列が生成されたので、ＣＰＵ１２０は、生成された
文節解釈列を、文節解釈数が少ないものから順に並べ
る。また、生成された文節解釈列の中に、それを構成す
る文節解釈数が同じ文節解釈列がある場合には、各文節
解釈列を構成する各候補文字コード３０５の尤度３０６
の値の和（以下、「認識尤度得点」という。）が大きい
方を先にして並べる。ＣＰＵ１２０は、このようにして
並べられた各文節解釈列に、並びの順に優先順位を付け
る（ステップＳ１３０５）。Next, the CPU 120 selects, in each recognition sequence,
It is determined whether there is a character string left to be processed (step S1303). If there is a character string left unprocessed, the next interpretation start point is set, and then the process returns to step S1302 to obtain the next phrase interpretation from the set interpretation start point (step S1304). If there is no string left to process,
For the recognition sequence to be processed, all bunsetsu interpretation sequences, which are combinations of bunsetsu interpretations that can be interpreted as bunsetsu, have been generated. Arrange the items in order. In addition, when there is a bunsetsu interpretation string having the same number of bunsetsu interpretations in the generated bunsetsu interpretation string, the likelihood 306 of each candidate character code 305 forming each bunsetsu interpretation string.
The larger sum of the values of (hereinafter, referred to as "recognition likelihood score") is arranged first. The CPU 120 prioritizes the respective phrase interpretation strings arranged in this way in the order of arrangement (step S1305).

【０１０６】例えば、「現在／７は／日常的な」という
文節解釈列は、文節解釈の数が３であるが、「現在では
／日常的な」という候補は文節解釈の数が２なので、後
者の方が優先順が高い。また、例えば、「現在では／日
常的な」という文節解釈列の認識尤度得点は、「現」、
「在」、「で」、「は」、「日」、「常」、「的」、
「な」の各候補文字コード３０５の尤度３０６の値の合
計である。これに対して、「現在では／白樺的な」とい
う文節解釈列は、「現」、「在」、「で」、「は」、
「白」、「樺」、「的」、「な」の各候補文字コード３
０５の尤度３０６の値の合計となる。これらの認識尤度
得点の差により、生成された文節解釈列の中で、最終的
に、第１候補が「現在では／日常的な」、第２候補が
「現在では／白樺的な」、第３候補が「現在／７は／日
常的な」という順位がつく。なお、この解釈開始点も処
理用バッファ７０８に記憶される変数であり、例えば、
文節解釈の先頭文字の文字位置情報３０３で表される。For example, in the bunsetsu interpretation sequence "present / seven / daily", the number of bunsetsu interpretations is 3; The latter has higher priority. Further, for example, the recognition likelihood score of the phrase interpretation sequence "currently / daily" is "present",
"A", "de", "ha", "day", "constant", "target",
It is the sum of the values of the likelihood 306 of each candidate character code 305 of “NA”. On the other hand, the phrase interpretation sequence "currently / white birch" is "present", "present", "de", "ha",
Candidate character code 3 for "white", "birch", "target", and "na"
It is the sum of the values of the likelihood 306 of 05. Due to the difference in the recognition likelihood scores, the first candidate is “currently / daily”, the second candidate is “currently / birchish” in the generated phrase interpretation sequence. The third candidate is ranked as “current / 7 / daily”. Note that this interpretation start point is also a variable stored in the processing buffer 708.
It is represented by the character position information 303 of the first character of the phrase interpretation.

【０１０７】図１３（ｂ）は、図１３（ａ）のステップ
Ｓ１３０２における文節解釈生成処理をさらに詳細に説
明するフローチャートである。ＣＰＵ１２０は、解釈開
始点で指定された文字位置情報３０３に対応する候補文
字情報３０７を参照し、候補文字コード３０５に格納さ
れた認識文字候補を補正用辞書アドレス記憶部７０６で
指定された誤り補正用辞書と照合しながら、１文字ずつ
並べ、自立語として解釈可能な自立語候補をすべて生成
する（ステップＳ１３１１）。生成された自立語候補は
中間バッファとして確保された処理用バッファ７０８に
格納される。なお、ここで用いる辞書検索の手法は、
「文字認識における自然言語処理情報処理Ｖｏｌ．
３４Ｎｏ．１０ｐｐ．１２７４−１２８０（１９９
３）」などに記載された探索方式を基にしたもので、ト
ライ構造の辞書を状態遷移によってデジタル検索するこ
とにより辞書検索の高速化を図っている。FIG. 13B is a flow chart for explaining the phrase interpretation generation processing in step S1302 of FIG. 13A in more detail. The CPU 120 refers to the candidate character information 307 corresponding to the character position information 303 specified at the interpretation start point, and corrects the recognized character candidate stored in the candidate character code 305 in the correction dictionary address storage unit 706. All the independent word candidates that can be interpreted as an independent word are generated by collating the characters one by one while collating them with the dictionary for use (step S1311). The generated independent word candidate is stored in the processing buffer 708 secured as an intermediate buffer. The dictionary search method used here is
"Natural language processing in character recognition Information processing Vol.
34 No. 10 pp. 1274-1280 (199
3) ”, etc., and is designed to speed up the dictionary search by digitally searching a trie-structured dictionary by state transitions.

【０１０８】自立語の検索が終わると、ＣＰＵ１２０
は、対象自立語番号ｎを「１」に設定した上で（ステッ
プＳ１３１２）、ステップＳ１３１１で生成された第ｎ
番目の自立語について、次の解釈開始点を起点として、
それに続く文字列を、付属語辞書と照合する（ステップ
Ｓ１３１３）。前記付属語辞書には、自立語辞書と同
様、トライ構造で付属語が収録されている。また、ＣＰ
Ｕ１２０は、前記自立語辞書に基づいて、自立語と付属
語の接続、付属語同士の接続の検定を行い、日本語の文
法として成り立つものだけを付属語解釈として残す。こ
れらの、文法検定処理などは、従来のかな漢字変換など
でも用いられている手法であり、その詳細は本発明の主
眼ではないので、説明を省略する。以上の結果、一文節
として解釈できる文節解釈が、前記処理用バッファ７０
８上に生成される。When the retrieval of the independent word is completed, the CPU 120
Sets the target independent word number n to "1" (step S1312), and then sets the n-th word generated in step S1311.
For the th independent word, starting from the next starting point of interpretation,
The subsequent character string is collated with the attached word dictionary (step S1313). Similar to the independent word dictionary, the adjunct word dictionary has adjunct words in a trie structure. Also, CP
Based on the independent word dictionary, the U120 tests the connection between the independent word and the adjunct word and the connection between the adjunct words, and leaves only the one that holds as a Japanese grammar as the adjunct word interpretation. The grammar verification process and the like are methods that are also used in the conventional kana-kanji conversion and the like, and the details thereof are not the main subject of the present invention, and thus the description thereof will be omitted. As a result, the phrase interpretation that can be interpreted as one phrase is the processing buffer 70.
8 generated on.

【０１０９】ＣＰＵ１２０は、対象自立語番号ｎをイン
クリメントしたのち（ステップＳ１３１４）、対象自立
語番号ｎの自立語候補が、まだ、生成された自立語候補
のうちの最終候補でないと判定すると（ステップＳ１３
１５）、ステップＳ１３１３の処理に戻り、上記と同様
に、次の自立語候補について、それに続く文字列の付属
語辞書照合を行う。After incrementing the target independent word number n (step S1314), the CPU 120 determines that the independent word candidate of the target independent word number n is not yet the final candidate among the generated independent word candidates (step S1314). S13
15) Return to the processing of step S1313, and similarly to the above, for the next independent word candidate, perform the auxiliary word dictionary collation of the subsequent character string.

【０１１０】なお、上記ステップＳ１３１１およびステ
ップＳ１３１３の文節解釈生成処理における辞書照合で
は、認識文字情報３０２内の区切り情報３０４の単語区
切り情報が有効（すなわち「１」）になっている場合、
ＣＰＵ１２０は、この単語区切りを有する候補文字コー
ド３０５を語尾として完結する自立語候補、付属語候補
および文節解釈だけを有効とし、その他を無効とする。In the dictionary collation in the phrase interpretation generation processing of steps S1311 and S1313, when the word delimiter information of the delimiter information 304 in the recognized character information 302 is valid (that is, "1"),
The CPU 120 enables only the independent word candidate, the adjunct word candidate, and the phrase interpretation that are completed by using the candidate character code 305 having this word division as a word end, and invalidates the others.

【０１１１】図１４は、図８のステップＳ８０８におけ
る名刺テキスト認識処理を説明するフローチャートであ
る。ＣＰＵ１２０は、図１０のステップＳ１００２にお
ける認識対象領域抽出処理と同様に、スキャナー１０９
から読み取られた名刺の画像から、横方向、縦方向に写
像したヒストグラムに基づいて、名刺上のテキスト領域
を抽出する（ステップＳ１４０１）。FIG. 14 is a flow chart for explaining the business card text recognition processing in step S808 of FIG. The CPU 120, similarly to the recognition target area extraction processing in step S1002 of FIG.
A text area on the business card is extracted from the image of the business card read from (1) based on the histograms that are mapped in the horizontal and vertical directions (step S1401).

【０１１２】ＣＰＵ１２０は、図１０のステップＳ１０
０３における認識候補生成処理で説明した一般文書の場
合と同様に、１文字に対応する画像について、それぞれ
第１０位までの候補文字コード３０５と尤度３０６との
組を生成し、認識列管理部２０４に格納する（ステップ
Ｓ１４０２）。次いで、ＣＰＵ１２０は、基本的には図
１３（ａ）に示した処理手順と同様の手順で認識誤り補
正処理を行い、その補正結果を名刺データ認識結果一時
記憶部５０１に格納する（ステップＳ１４０３）。この
際、利用する辞書は、補正用辞書アドレス記憶部７０６
にアドレスが記憶されている固有名詞辞書である。な
お、ステップＳ１４０３の処理については、後に詳述す
る（ステップＳ１４０３）。The CPU 120 executes step S10 of FIG.
As in the case of the general document described in the recognition candidate generation processing in No. 03, for each image corresponding to one character, a set of candidate character codes 305 up to the tenth place and likelihood 306 is generated, and the recognition sequence management unit It stores in 204 (step S1402). Next, the CPU 120 basically performs the recognition error correction process in the same procedure as the processing procedure shown in FIG. 13A, and stores the correction result in the business card data recognition result temporary storage unit 501 (step S1403). . At this time, the dictionary to be used is the correction dictionary address storage unit 706.
It is a proper noun dictionary whose address is stored in. The process of step S1403 will be described later in detail (step S1403).

【０１１３】ＣＰＵ１２０は、名刺データ認識結果一時
記憶部５０１に格納された各名刺データの第１候補を、
認識対象ブロックの画像とともに、ディスプレイ１０６
に表示し、文書入力装置１００が別途格納しているテキ
スト編集機能を起動する。オペレータは、前記表示画面
を見ながら、補正結果修正処理を行う（ステップＳ１４
０４）。The CPU 120 selects the first candidate of each business card data stored in the business card data recognition result temporary storage unit 501,
The display 106 together with the image of the block to be recognized
, And the text editing function separately stored in the document input device 100 is activated. The operator performs the correction result correction process while looking at the display screen (step S14).
04).

【０１１４】図１４（ｂ）は、図１４（ａ）のステップ
Ｓ１４０３における名刺認識誤り補正処理を、より詳細
に説明するフローチャートである。ＣＰＵ１２０は、図
１３（ａ）に示した処理手順と同様の手順で認識誤り補
正を行う。この結果は、補正結果３０８に格納される
（ステップＳ１４１１）。この際、さらに、ＣＰＵ１２
０は、その補正結果３０８に含まれている「〒」などの
郵便番号マークや、「ＴＥＬ」、「ＦＡＸ」等の属性識
別文字列や、固有名詞辞書に格納された単語に付いてい
る、地名、人名、部署名を表す品詞情報を手がかりにし
て、その補正結果３０８が、「氏名」、「所属団体
名」、「所属部署名」、「住所」、「電話番号」、「Ｆ
ＡＸ番号」のいずれを表すデータであるかを推定し、名
刺データ認識結果一時記憶部５０１内の該当する記憶領
域に、前記補正結果３０８の第１候補から第５候補まで
を格納する。図１５は、名刺入力処理における表示画面
およびデータの具体例を示す説明図である。図１５
（ａ）は、ＣＰＵ１２０に認識され、固有名詞辞書を参
照して認識誤り補正処理が行われた結果、名刺データ認
識結果一時記憶部５０１の氏名記憶部５０２に格納され
た人名データの例を示す。FIG. 14B is a flow chart for explaining in more detail the business card recognition error correction processing in step S1403 of FIG. 14A. The CPU 120 performs recognition error correction in the same procedure as the processing procedure shown in FIG. This result is stored in the correction result 308 (step S1411). At this time, the CPU 12
0 is attached to a postal code mark such as “〒” included in the correction result 308, an attribute identification character string such as “TEL” or “FAX”, or a word stored in the proper noun dictionary. Using the part-of-speech information representing the place name, the person's name, and the department name as a clue, the correction result 308 shows the “name”, “affiliation name”, “affiliation department name”, “address”, “telephone number”, “F”.
Which of the “AX numbers” the data represents is estimated, and the first to fifth candidates of the correction result 308 are stored in the corresponding storage area in the business card data recognition result temporary storage unit 501. FIG. 15 is an explanatory diagram showing a specific example of a display screen and data in the business card input process. FIG.
9A shows an example of personal name data stored in the name storage unit 502 of the business card data recognition result temporary storage unit 501 as a result of recognition error correction processing performed by the CPU 120 with reference to the proper noun dictionary. .

【０１１５】次に、ＣＰＵ１２０は、郵便番号記憶部５
０７に格納された郵便番号候補をキーとして、郵便番号
に対応して郵便番号−住所照合テーブル部４０５内に格
納されている住所名データを取り出し、作業領域７００
内の処理用バッファ７０８に格納する（ステップＳ１４
１２）。同様に、ＣＰＵ１２０は、電話番号記憶部５０
６に格納された電話番号をキーとして、電話番号に対応
して電話番号−住所照合テーブル部４０６内に格納され
ている住所名データを取り出し、作業領域７００内の処
理用バッファ７０８に格納する（ステップＳ１４１
３）。Next, the CPU 120 causes the postal code storage unit 5
Using the postal code candidate stored in 07 as a key, the address name data stored in the postal code-address collation table unit 405 corresponding to the postal code is extracted, and the work area 700
It is stored in the internal processing buffer 708 (step S14).
12). Similarly, the CPU 120 controls the telephone number storage unit 50.
Using the telephone number stored in No. 6 as a key, the address name data stored in the telephone number-address matching table unit 406 corresponding to the telephone number is extracted and stored in the processing buffer 708 in the work area 700 ( Step S141
3).

【０１１６】次いで、ＣＰＵ１２０は、電話番号−住所
照合テーブル部４０６内に格納されている住所名データ
および郵便番号−住所照合テーブル部４０５内に格納さ
れている住所名データと、住所記憶部５０５内に格納さ
れている住所名データとを照合する（ステップＳ１４１
４、ステップＳ１４１５）。その結果、前記住所名デー
タに共通する住所名データが存在する場合、それを住所
記憶部５０５の住所に再設定する。すなわち、ＣＰＵ１
２０は、住所記憶部５０５に格納されている住所名デー
タのうち、処理用バッファ７０８に取り出された住所名
データと共通する住所名データだけを、住所記憶部５０
５に格納する。また、同様に、電話番号記憶部５０６に
格納されている電話番号データのうち、前記住所名デー
タと対応する電話番号データだけを、電話番号記憶部５
０６に格納し、郵便番号記憶部５０７に格納されている
郵便番号データのうち、前記住所名データと対応する郵
便番号データだけを、郵便番号記憶部５０７に格納する
（ステップＳ１４１６）。なお、ステップＳ１４１６で
は、処理用バッファ７０８に取り出された住所名データ
と共通する住所名データだけを住所記憶部５０５に格納
し、当該住所名データに対応する電話番号データだけを
電話番号記憶部５０６に、対応する郵便番号データだけ
を郵便番号記憶部５０７に格納するようにしたが、この
方法に限らず、該当する前記各データの優先順位を１ず
つ繰り上げてもよいし、前記各データに該当しない住所
名データ、電話番号データおよび郵便番号データを、住
所記憶部５０５、電話番号記憶部５０６および郵便番号
記憶部５０７内で無効とするようにしてもよい。Next, the CPU 120 stores the address name data stored in the telephone number / address collation table unit 406, the address name data stored in the postal code / address collation table unit 405, and the address storage unit 505. Is collated with the address name data stored in (step S141).
4, step S1415). As a result, when there is address name data common to the address name data, it is reset to the address in the address storage unit 505. That is, CPU1
Of the address name data stored in the address storage unit 505, 20 stores only the address name data common to the address name data fetched in the processing buffer 708.
Store in 5. Similarly, of the telephone number data stored in the telephone number storage unit 506, only the telephone number data corresponding to the address name data is stored in the telephone number storage unit 5
Of the zip code data stored in the zip code storage unit 507, the zip code data corresponding to the address name data is stored in the zip code storage unit 507 (step S1416). In step S1416, only the address name data common to the address name data fetched in the processing buffer 708 is stored in the address storage unit 505, and only the telephone number data corresponding to the address name data is stored in the telephone number storage unit 506. In the above, only the corresponding postal code data is stored in the postal code storage unit 507. However, the present invention is not limited to this method, and the priority of each of the corresponding data may be increased by one, or the corresponding postal data may be stored. The address name data, the telephone number data, and the postal code data that do not exist may be invalidated in the address storage unit 505, the telephone number storage unit 506, and the postal code storage unit 507.

【０１１７】前記照合の結果、共通する住所名データが
存在しない場合、ＣＰＵ１２０は、住所名データ、郵便
番号データまたは電話番号データのいずれかに間違いが
あることをディスプレイ１０６に表示させ、警告する
（ステップＳ１４１７）。具体的には、例えば、住所記
憶部５０５に格納されている第１位候補が「八尾市太田
新町」で、第２位候補が「七尾市大田新町」であると
き、前者の郵便番号は「５８１」、後者のそれは「９２
６」、前者の市外番号は「０７２９」、後者のそれは
「０７６７」である。郵便番号記憶部５０７に格納され
ている郵便番号の候補に、「９２６」しか現れていない
場合には、住所記憶部５０５に格納されている第１候補
「八尾市太田新町」は、住所記憶部５０５内の候補から
除かれる。If the common address name data does not exist as a result of the collation, the CPU 120 displays on the display 106 that there is an error in the address name data, the postal code data or the telephone number data, and warns ( Step S1417). Specifically, for example, when the first candidate stored in the address storage unit 505 is "Ota Shinmachi, Yao City" and the second candidate is "Ota Shinmachi, Nanao City", the former postal code is " 581 ", that of the latter is" 92 "
6 ", the former area code is" 0729 ", and the latter area code is" 0767 ". When only “926” appears in the postal code candidates stored in the postal code storage unit 507, the first candidate “Ota Shinmachi, Yao-shi” stored in the address storage unit 505 is the address storage unit. It is excluded from the candidates in 505.

【０１１８】なお、上記では、入力された文書が名刺の
場合に、郵便番号および電話番号と住所との関係から住
所名データを照合したが、郵便番号および電話番号と住
所との関係に限らず、社員名と社員番号との関係など、
関連性の有る他の固有名詞同士を対応づけて予め記憶し
ておき、照合に用いるようにしてもよい。また、入力さ
れた文書が名刺である場合に限らず、製品名と型番、書
籍名とＩＳＢＮ番号など、同様にして照合することによ
り、それらに対応する文字列を正しく認識することがで
きる。In the above description, when the input document is a business card, the address name data is collated based on the relationship between the postal code, the telephone number and the address, but it is not limited to the relationship between the postal code, the telephone number and the address. , Relationship between employee name and employee number,
Other proper nouns having relevance may be associated with each other, stored in advance, and used for collation. Further, not only when the input document is a business card, but also by collating the product name and the model number, the book name and the ISBN number, and the like, the corresponding character strings can be correctly recognized.

【０１１９】図１６は、図８のステップＳ８０９におけ
る名簿更新処理の手順を説明するフローチャートであ
る。ＣＰＵ１２０は、氏名記憶部５０２から前記氏名デ
ータを取り出し、名簿情報記憶部６０１の氏名領域６０
２に格納されている氏名データと比較し、一致する氏名
データを検索する（ステップＳ１６０１）。FIG. 16 is a flow chart for explaining the procedure of the name list update processing in step S809 of FIG. The CPU 120 extracts the name data from the name storage unit 502, and stores the name area 60 in the name list information storage unit 601.
The name data stored in No. 2 is compared and the matching name data is searched (step S1601).

【０１２０】名簿情報記憶部６０１の氏名領域６０２内
に該当する氏名データがない場合は（ステップＳ１６０
２）、名刺データ認識結果一時記憶部５０１内の当該候
補データを、新たな名簿情報としてディスプレイ１０６
に表示させ、ステップＳ１６０６の処理に移る（ステッ
プＳ１６０３）。図１５（ｂ）は、このときにディスプ
レイ１０６に表示される新規名簿情報の具体例を示す。
このとき、オペレータは、例えば、図１５（ｂ）に示し
た表示画面を見ながら、マウス１１５を操作し、表示画
面内の所定の領域にカーソルを位置させてクリックする
ことにより、ＣＰＵ１２０に対して、表示された新規名
簿情報の追加、あるいは、表示された各項目の修正など
を指示することができる。If there is no corresponding name data in the name area 602 of the name list information storage unit 601 (step S160).
2), the candidate data in the business card data recognition result temporary storage unit 501 is displayed on the display 106 as new name list information.
Displayed in step S1606, and the process proceeds to step S1606 (step S1603). FIG. 15B shows a specific example of the new name list information displayed on the display 106 at this time.
At this time, the operator operates the mouse 115 while looking at the display screen shown in FIG. It is possible to instruct addition of the displayed new name list information or correction of each displayed item.

【０１２１】氏名領域６０２内に該当する氏名データが
存在する場合は（ステップＳ１６０２）、ＣＰＵ１２０
は、名刺データ認識結果一時記憶部５０１の各名刺デー
タと、検索された前記氏名データに対応する名簿情報記
憶部６０１内の各名簿情報とを照合する。すなわち、Ｃ
ＰＵ１２０は、所属団体名記憶部５０３と所属団体名領
域６０３、所属部署名記憶部５０４と所属部署名領域６
０４、住所記憶部５０５と住所領域６０５、電話番号記
憶部５０６と電話番号領域６０６および郵便番号記憶部
５０７と郵便番号領域６０７をそれぞれ、照合する。照
合の結果、全て一致している場合は、修正の必要がない
とみなして、処理を終了する（ステップＳ１６０４）。
名刺データ認識結果一時記憶部５０１の各名刺データ
と、検索された前記氏名データに対応する名簿情報記憶
部６０１内の各名簿情報とに、一部でも異なる情報があ
れば、該当する名刺データ認識結果一時記憶部５０１の
名刺データと、名簿情報記憶部６０１内の名簿情報と
を、ディスプレイ１０６に対照表示させ、オペレータの
確認を求める（ステップＳ１６０５）。図１５（ｃ）
は、このときディスプレイ１０６に表示される、前記対
象表示画面の具体例を示す。また、このとき、オペレー
タは、例えば、図１５（ｂ）に示した表示画面と同様、
図１５（ｃ）に示した表示画面を見ながら、マウス１１
５を操作し、表示画面内の所定の領域にカーソルを位置
させてクリックすることにより、ＣＰＵ１２０に対し
て、表示された名簿データによる名簿情報の修正、ある
いは、表示された各項目の修正などを指示することがで
きる。If the corresponding name data exists in the name area 602 (step S1602), the CPU 120
Compares each business card data in the business card data recognition result temporary storage unit 501 with each name list information in the name list information storage unit 601 corresponding to the retrieved name data. That is, C
The PU 120 includes an affiliated organization name storage unit 503 and an affiliated organization name area 603, and an affiliated department signature storage unit 504 and an affiliated department signature area 6.
04, the address storage unit 505 and the address area 605, the telephone number storage unit 506 and the telephone number area 606, and the postal code storage unit 507 and the postal code area 607, respectively. As a result of the collation, if all of them match, it is determined that the correction is not necessary, and the process ends (step S1604).
If there is any difference between the name card data in the name card data recognition result temporary storage unit 501 and the name list information in the name list information storage unit 601 corresponding to the searched name data, the corresponding name card data recognition Result The business card data in the temporary storage unit 501 and the name list information in the name list information storage unit 601 are displayed on the display 106 in a corresponding manner, and the operator's confirmation is requested (step S1605). FIG. 15 (c)
Shows a specific example of the target display screen displayed on the display 106 at this time. Further, at this time, the operator, like the display screen shown in FIG.
While looking at the display screen shown in FIG.
5 is operated to position the cursor in a predetermined area on the display screen and click the button, so that the CPU 120 can correct the name list information based on the name list data displayed, or correct each displayed item. You can give instructions.

【０１２２】ＣＰＵ１２０は、オペレータから修正を指
示する入力があれば（ステップＳ１６０６）、文書入力
装置１００が別途保有しているテキスト編集機能を起動
する。オペレータは、前記テキスト編集機能用いて、表
示された名刺データの内容を修正する。ＣＰＵ１２０
は、オペレータによって修正された名刺データを、名刺
データ認識結果一時記憶部５０１内の該当する記憶部に
格納する（ステップＳ１６０７）。ステップＳ１６０６
において、オペレータから、名刺データの内容が妥当で
あることを示す入力があった場合は、名刺データ認識結
果一時記憶部５０１に格納されている名刺データで、名
簿情報記憶部６０１に格納されている名簿情報を書き換
える（ステップＳ１６０８）。When the operator inputs an instruction to make a correction (step S1606), the CPU 120 activates the text editing function of the document input device 100. The operator uses the text editing function to correct the contents of the displayed business card data. CPU120
Stores the business card data corrected by the operator in the corresponding storage unit in the business card data recognition result temporary storage unit 501 (step S1607). Step S1606
In the case where there is an input from the operator indicating that the contents of the business card data are valid, the business card data stored in the business card data recognition result temporary storage unit 501 is stored in the name book information storage unit 601. The name list information is rewritten (step S1608).

【０１２３】図１７は、図８のステップＳ８１２におけ
る帳票テキスト認識処理の手順を説明するフローチャー
トである。ＣＰＵ１２０は、処理対象ブロック番号Ｍを
「１」に設定する（ステップＳ１７０１）。次いで、Ｃ
ＰＵ１２０は、ブロック管理情報２１０の画像位置情報
２０３に基づいて、画像情報記憶部２０１から、第Ｍ認
識対象ブロック内の画像情報を取り出し、当該画像情報
に含まれるテキストを文字認識する（ステップＳ１７０
２）。この手順は、図１０のステップＳ１００３におけ
る認識候補生成処理と同様の手順で行われる。FIG. 17 is a flow chart for explaining the procedure of the form text recognition process in step S812 of FIG. The CPU 120 sets the processing target block number M to "1" (step S1701). Then C
The PU 120 extracts the image information in the Mth recognition target block from the image information storage unit 201 based on the image position information 203 of the block management information 210, and character-recognizes the text included in the image information (step S170).
2). This procedure is performed in the same procedure as the recognition candidate generation process in step S1003 of FIG.

【０１２４】ＣＰＵ１２０は、横方向と縦方向に写像し
た画像のヒストグラムを生成し、画像および空白領域に
周期性があって画像と画像との間隔の狭い方向が文字方
向と推定する。図１８は、帳票形式入力処理において入
力されるテキスト画像と、その文字列方向の推定方法と
の具体例を示す説明図である。図１８（ａ）は、１つの
認識対象ブロック内の文字画像の配置と、その横方向お
よび縦方向の写像との一例を示す。縦方向は、文字の間
隔に見られるはずの空白領域が不規則であるので、この
場合は文字列方向は横方向であると推定する（ステップ
Ｓ１７０３）。図１８（ｂ）は、１つの認識対象ブロッ
ク内の文字画像の配置と、その横方向および縦方向の写
像との他の例を示す。一方、図１８（ａ）の例に比べ
て、図１８（ｂ）のような帳票形式の場合は、どちらの
方向にも文字間隔は同じであるので、縦方向にも横方向
にも文字列の方向が考えられる。The CPU 120 generates a histogram of the images mapped in the horizontal direction and the vertical direction, and estimates that the direction in which the image and the blank area have periodicity and the interval between the images is narrow is the character direction. FIG. 18 is an explanatory diagram showing a specific example of a text image input in the form format input process and a method of estimating the character string direction. FIG. 18A shows an example of the layout of character images in one recognition target block and their horizontal and vertical mappings. In the vertical direction, the blank area that should be seen in the character spacing is irregular, so in this case, the character string direction is estimated to be the horizontal direction (step S1703). FIG. 18B shows another example of the layout of character images in one recognition target block and the mapping in the horizontal and vertical directions. On the other hand, compared to the example of FIG. 18A, in the case of the form format as shown in FIG. 18B, the character spacing is the same in both directions, so the character string is displayed in both the vertical and horizontal directions. The direction of

【０１２５】ＣＰＵ１２０は、縦方向の文字列方向が解
釈できるか否かを判定し（ステップＳ１７０４）、解釈
できる場合は、認識列管理部２０４内に、文字列方向２
１３を「縦」とし、縦方向の認識列と対応したブロック
管理情報２１０を生成し（ステップＳ１７０５）、解釈
できない場合は、ステップＳ１７０７の処理に移る。Ｃ
ＰＵ１２０は、横方向の文字列方向が解釈できるか否か
を判定し（ステップＳ１７０６）、解釈できる場合、認
識列管理部２０４内に、文字列方向２１３を「横」と
し、横方向の認識列と対応したブロック管理情報２１０
を生成し（ステップＳ１７０７）、横方向の文字列方向
が解釈できない場合、ステップＳ１７０８の処理に移
る。The CPU 120 determines whether or not the vertical character string direction can be interpreted (step S1704). If it can be interpreted, the character string direction 2 is stored in the recognition string management unit 204.
13 is set to “vertical”, the block management information 210 corresponding to the vertical recognition sequence is generated (step S1705), and if it cannot be interpreted, the process proceeds to step S1707. C
The PU 120 determines whether or not the horizontal character string direction can be interpreted (step S1706), and if the horizontal character string direction can be interpreted, the character string direction 213 is set to “horizontal” in the recognition string management unit 204, and the horizontal recognition string is recognized. Block management information 210 corresponding to
Is generated (step S1707), and if the horizontal character string direction cannot be interpreted, the process proceeds to step S1708.

【０１２６】次いで、ＣＰＵ１２０は、選択された文書
種別が帳票形式であるため、均等割付された表内タイト
ル中の空白による単語区切りを無視する。具体的には、
区切り情報３０４の単語区切り情報を、例えば「０」に
し、無効にする（ステップＳ１７０８）。これにより、
均等割り付けされた表内タイトル中の文字列が、均等割
り付けによる空白によって、誤って単語に区切られるこ
とを防止することができる。Next, since the selected document type is the form format, the CPU 120 ignores the word division due to the blank in the evenly assigned in-table titles. In particular,
The word delimiter information in the delimiter information 304 is set to, for example, "0" and invalidated (step S1708). This allows
It is possible to prevent the character strings in the evenly-allocated titles in the table from being mistakenly divided into words due to the blanks in the evenly-allocated space.

【０１２７】ＣＰＵ１２０は、図１３（ａ）に示した手
順と同様に、認識誤り補正処理を行う。ただし、ここで
は、補正用辞書アドレス記憶部７０６に記憶されている
一般辞書部４０２内の一般辞書と、帳票用語辞書部４０
４内の帳票用語辞書とを参照する（ステップＳ１７０
９）。この段階で、同一領域の画像に対して、文字列方
向を縦方向と解釈したブロック管理情報２１０と、文字
列方向を横方向と解釈したブロック管理情報２１０とが
生成されている場合、ＣＰＵ１２０は、各々の補正結果
３０８の第１候補に含まれる文節解釈の総数を比較する
（ステップＳ１７１０）。例えば、図１８（ｂ）に示し
たテキスト画像において、文字列方向を横と解釈した場
合では、文節解釈として「銀行」と「振込」が得られ、
その合計は２になるが、文字列方向を縦と解釈した場合
は、「銀」、「振」、「行進」（「込」を「進」と誤っ
て補正したとき）の３になることがある。The CPU 120 performs the recognition error correction process in the same manner as the procedure shown in FIG. However, here, the general dictionary in the general dictionary unit 402 stored in the correction dictionary address storage unit 706 and the form term dictionary unit 40 are stored.
Reference is made to the form term dictionary in 4 (step S170).
9). At this stage, when the block management information 210 in which the character string direction is interpreted as the vertical direction and the block management information 210 in which the character string direction is interpreted as the horizontal direction are generated for the image in the same region, the CPU 120 , And the total number of phrase interpretations included in the first candidate of each correction result 308 is compared (step S1710). For example, in the text image shown in FIG. 18B, when the character string direction is interpreted as horizontal, “bank” and “transfer” are obtained as the phrase interpretation,
The total is 2, but when the character string direction is interpreted as vertical, it becomes 3 for "silver", "shake", and "marching" (when "include" is mistakenly corrected as "advance"). There is.

【０１２８】この結果、ＣＰＵ１２０は、文節解釈の総
数が少ない方の文字列方向の解釈を採用し、採用されな
かった文字列方向解釈のブロック管理情報２１０は無効
にされる（ステップＳ１７１１、ステップＳ１７１
２）。最後に、ＣＰＵ１２０は、処理対象ブロック番号
Ｍをインクリメントした上（ステップＳ１７１３）、最
終認識対象ブロックまでこの処理を繰り返す（ステップ
Ｓ１７１４）。As a result, the CPU 120 adopts the interpretation in the character string direction having the smaller total number of phrase interpretations, and invalidates the block management information 210 of the character string direction interpretation that is not adopted (steps S1711, S171).
2). Finally, the CPU 120 increments the process target block number M (step S1713), and repeats this process until the final recognition target block (step S1714).

【０１２９】図１９は、図８のステップＳ８１３に示し
た印刷位置設定処理を説明するフローチャートである。
以下では、帳票テキストの認識処理が完了した後、各認
識対象ブロック内における認識文字の印刷位置を設定す
る処理の手順を示す。ＣＰＵ１２０は、まず処理対象ブ
ロック番号Ｍを「１」に設定し（ステップＳ１９０
１）、第Ｍ認識対象ブロックと対応する認識列管理部２
０４のブロック管理情報２１０を参照する（ステップＳ
１９０２）。FIG. 19 is a flow chart for explaining the print position setting process shown in step S813 of FIG.
In the following, the procedure of processing for setting the print position of the recognized character in each recognition target block after the recognition processing of the form text is completed will be described. The CPU 120 first sets the processing target block number M to "1" (step S190).
1), the recognition sequence management unit 2 corresponding to the Mth recognition target block
The block management information 210 of No. 04 is referred (step S
1902).

【０１３０】次いで、ＣＰＵ１２０は、罫線枠情報２１
２が「１」である認識対象ブロックについては、認識列
ごとに、補正結果３０８の先頭文字と末尾文字とに対応
する各候補文字情報３０７の区切り情報３０４に、それ
ぞれ、均等割付開始制御情報と均等割付終了制御情報と
を付与する。具体的には、例えば、均等割付開始制御情
報および均等割付終了制御情報として、区切り情報３０
４にそれぞれ、「１」を設定する（ステップＳ１９０
３）。なお、ステップＳ１９０３では、罫線枠内の文字
列をすべて均等割り付けするよう、各認識列に均等割付
開始制御情報および均等割付終了制御情報を付与した
が、本発明の均等割り付け対象は、前記認識列に限定さ
れない。例えば、まず、１つの認識列に含まれている文
字の文字位置情報３０３から、その認識列内で連続する
文字画像の文字間隔を算出し、原稿の認識列内の文字列
が均等割り付けされているか否かを判断するとともに、
前記判断結果に従って、文字列が均等割り付けされてい
る認識列については、均等割付開始制御情報および均等
割付終了制御情報を付与するようにしてもよい。Next, the CPU 120 causes the ruled line frame information 21
For the recognition target block in which 2 is “1”, the uniform allocation start control information is added to the delimiter information 304 of each candidate character information 307 corresponding to the first character and the last character of the correction result 308 for each recognition column. The uniform allocation end control information is added. Specifically, for example, the division information 30 is used as the even allocation start control information and the even allocation end control information.
4 is set to "1" (step S190).
3). In addition, in step S1903, the uniform allocation start control information and the uniform allocation end control information are added to each recognition column so that all the character strings in the ruled line frame are evenly allocated. Not limited to. For example, first, from the character position information 303 of characters included in one recognition string, the character spacing between consecutive character images in the recognition string is calculated, and the character strings in the recognition string of the original are evenly allocated. And determine whether or not
According to the determination result, the uniform allocation start control information and the uniform allocation end control information may be added to the recognition string to which the character strings are evenly allocated.

【０１３１】ＣＰＵ１２０は、処理対象ブロック番号Ｍ
をインクリメントしながら（ステップＳ１９０４）、ス
テップＳ１９０２からステップＳ１９０４までの処理を
認識対象ブロックが終了するまで繰り返す（ステップＳ
１９０５）。図２０は、入力された帳票を印刷する処理
の手順を示すフローチャートである。以下では、認識文
字の印刷位置の設定処理が完了した後、例えば、認識結
果がディスプレイ１０６に表示され、オペレータから認
識結果の印刷を指示する入力があった場合の帳票印刷処
理の手順を示す。The CPU 120 determines the block number M to be processed.
While incrementing (step S1904), the processing from step S1902 to step S1904 is repeated until the recognition target block ends (step S1904).
1905). FIG. 20 is a flowchart showing a procedure of processing for printing the input form. The following describes the procedure of the form printing process when the recognition result is displayed on the display 106 and the operator inputs an instruction to print the recognition result after the process of setting the print position of the recognition character is completed.

【０１３２】ＣＰＵ１２０は、表示データ記憶部２０３
をクリアし、処理対象ブロック番号Ｍを「１」に設定す
る（ステップＳ２００１）。ＣＰＵ１２０は、第Ｍ認識
対象ブロックと対応する画像位置情報２０３、罫線枠情
報２０４、文字列方向２１３を参照し（ステップＳ２０
０２）、罫線枠があるかないかの判定を行い（ステップ
Ｓ２００３）、ある場合は、表示データ記憶部２０３上
で、画像位置情報２０３と対応する位置に罫線枠を書き
込む（ステップＳ２００４）。The CPU 120 has a display data storage unit 203.
Is cleared and the block number M to be processed is set to "1" (step S2001). The CPU 120 refers to the image position information 203 corresponding to the Mth recognition target block, the ruled line frame information 204, and the character string direction 213 (step S20).
02), it is determined whether or not there is a ruled line frame (step S2003). If there is a ruled line frame, the ruled line frame is written in the display data storage unit 203 at a position corresponding to the image position information 203 (step S2004).

【０１３３】ＣＰＵ１２０は、処理対象認識列番号Ｌを
「１」に設定する（ステップＳ２００５）。ＣＰＵ１２
０は、第Ｌ認識列内の文字と対応する認識文字情報３０
２の文字位置情報３０３情報を参照し、プリンタ１１４
の印字フォントサイズの中からそれと最も近い印字フォ
ントサイズを選択する。例えば、プリンタ１１４の印字
フォントが１６×１６、２４×２４、３２×３２、４８
×４８で、文字位置情報３０３の座標値から算出された
文字サイズが３５×３４である場合、ＣＰＵ１２０は、
プリンタ１１４の印字フォントサイズと、算出された文
字サイズとの差の絶対値が最も小さい印字フォントサイ
ズである３２×３２を選択する（ステップＳ２００
６）。同時に、ＣＰＵ１２０は、選択された文字サイズ
が、所定の基準値で定められる許容範囲内にあるか否か
を判定し、必要な場合は再設定する。具体的には、選択
された文字サイズの文字高が、行間隔もしくは罫線枠と
の間隔より３倍以上大きいときは３倍までにとどめる。
選択された文字サイズの文字幅が、文字間隔もしくは罫
線枠との間隔よりも３倍以上大きいときは３倍までにと
どめる。The CPU 120 sets the processing target recognition column number L to "1" (step S2005). CPU12
0 is the recognized character information 30 corresponding to the character in the Lth recognition string
The printer 114 is referred to by referring to the second character position information 303 information.
From the print font sizes of, select the closest print font size. For example, the print font of the printer 114 is 16 × 16, 24 × 24, 32 × 32, 48.
When the character size calculated from the coordinate value of the character position information 303 is 35 × 34, the CPU 120
32 × 32, which is the print font size having the smallest absolute value of the difference between the print font size of the printer 114 and the calculated character size, is selected (step S200).
6). At the same time, the CPU 120 determines whether the selected character size is within an allowable range defined by a predetermined reference value, and resets it if necessary. Specifically, when the character height of the selected character size is three times or more larger than the line interval or the space between the character frame and the ruled line frame, the height is limited to three times.
When the character width of the selected character size is three times or more larger than the character spacing or the spacing with the ruled line frame, it is limited to three times.

【０１３４】ＣＰＵ１２０は、第Ｌ認識列と対応する認
識列管理部２０４の認識列管理情報３０１を参照し、参
照した第Ｌ認識列の補正結果３０８に含まれている文字
に対応した区切り情報３０４に、前記均等割付制御情報
が付与されている場合（ステップＳ２００７）、補正結
果３０８の先頭文字と対応する文字位置情報３０３と、
末尾文字と対応する文字位置情報３０３とを参照し、こ
の文字間に補正結果３０８の残りの文字列を、以下のよ
うにして均等間隔に配置する。The CPU 120 refers to the recognition sequence management information 301 of the recognition sequence management unit 204 corresponding to the L-th recognition sequence, and delimits information 304 corresponding to the character included in the correction result 308 of the referred L-th recognition sequence. In the case where the uniform allocation control information is added (step S2007), the character position information 303 corresponding to the first character of the correction result 308,
With reference to the last character and the corresponding character position information 303, the remaining character strings of the correction result 308 are arranged between these characters at equal intervals as follows.

【０１３５】すなわち、オペレータからの入力により印
刷指定されたとき、ＣＰＵ１２０は、この均等割り付け
情報が付与された部分は、認識対象ブロックの幅総ドッ
ト数をＷ、１文字の印字ドット数をＣ、文字数をｎと
し、以下の式から文字間隔幅ｉを求め、印字する。ｉ＝
（Ｗ−ｎ＊Ｃ）／（Ｃ＋１）例えば、認識対象ブロック
の幅総ドット数１８０ドットに、印字ドット数２４ドッ
トの文字を５文字印刷する場合、文字間隔幅ｉは、以下
のように１０ドット幅となる。That is, when the printing is designated by the input from the operator, the CPU 120 determines that the portion to which the uniform allocation information is added is W, the total number of dots of the recognition target block is W, the number of printing dots of one character is C, When the number of characters is n, the character interval width i is calculated from the following formula and printed. i =
(W-n * C) / (C + 1) For example, when printing 5 characters with a print dot number of 24 dots on the total width dot number of 180 dots in the recognition target block, the character interval width i is 10 as follows. It becomes the dot width.

【０１３６】ｉ＝（１８０−５＊２４）／（５＋１）＝（１８０−１２０）／６＝６０／６＝１０ＣＰＵ１２０は、これを１つの認識対象ブロック内の全
認識列に対して繰り返し、終了すると次の認識対象ブロ
ックの処理に移る（ステップＳ２００９、ステップＳ２
０１０）。さらに、ＣＰＵ１２０は、上記ステップＳ２
００２からステップＳ２０１０までの処理を、全認識対
象ブロックに対して行う（ステップＳ２０１１、ステッ
プＳ２０１２）。I = (180-5 * 24) / (5 + 1) = (180-120) / 6 = 60/6 = 10 The CPU 120 repeats this for all the recognition columns in one recognition target block, When finished, the process moves to the next block to be recognized (step S2009, step S2).
010). Further, the CPU 120 causes the above step S2.
The processing from 002 to step S2010 is performed for all the recognition target blocks (steps S2011 and S2012).

【０１３７】以上のように本実施例によれば、一般文書
を入力する場合、候補文字情報３０７の中から文字認識
結果の第１候補を取り出して得られた文字列と、オペレ
ータの確認により最終確定された文字列とを比較照合
し、文字認識の精度を求め、これに応じて文字認識候補
の候補数を変えることにより、文字認識精度の変動に適
応した認識誤り補正処理を行うことができる。As described above, according to the present embodiment, when a general document is input, the character string obtained by extracting the first candidate of the character recognition result from the candidate character information 307 and the final character by the operator's confirmation. By comparing and collating with the confirmed character string and obtaining the accuracy of character recognition, and changing the number of candidate character recognition candidates accordingly, it is possible to perform recognition error correction processing that adapts to variations in character recognition accuracy. .

【０１３８】具体的には、本実施例によれば、認識対象
となる文書の原稿が、何度もコピーされて文字のかすれ
や汚れを生じた原稿であって、文字認識精度が低いとき
には、認識誤り補正処理に用いる候補文字コード３０５
の候補数を多くして、認識誤り補正処理の補正の精度を
高めることができる。また、本実施例によれば、認識対
象となる文書の原稿が鮮明な画像で得られた場合のよう
に、文字認識精度が高いときには、認識誤り補正処理に
用いる候補文字コード３０５の候補数を少なくして、認
識誤り補正処理に要する時間を短縮し、結果的に、文書
入力装置１００全体の処理時間を短縮することができ
る。Specifically, according to the present embodiment, when the original of the document to be recognized is an original which has been copied many times to cause faint or smeared characters, and the character recognition accuracy is low, Candidate character code 305 used for recognition error correction processing
It is possible to increase the accuracy of the correction of the recognition error correction processing by increasing the number of candidates of. Further, according to the present embodiment, when the character recognition accuracy is high, such as when the document of the document to be recognized is obtained as a clear image, the number of candidates of the candidate character code 305 used for the recognition error correction process is set. By reducing the number, the time required for the recognition error correction processing can be shortened, and as a result, the processing time of the entire document input device 100 can be shortened.

【０１３９】さらに、オペレータが表示された補正結果
３０８を修正するために用いた修正用のキーの打鍵数を
予め定めた操作水準と比較することにより、大まかにで
はあるが、前記文字認識の精度を判断することができ
る。これにより、候補文字情報３０７の中から文字認識
結果の第１候補を取り出して得られた文字列と、オペレ
ータの確認により最終確定された文字列とを１文字ずつ
比較照合して精度を算出するという手間が省け、容易に
上記効果を得ることができる。Further, by comparing the number of keystrokes of the correction key used by the operator to correct the displayed correction result 308 with a predetermined operation level, the accuracy of the character recognition is roughly determined. Can be judged. Thus, the character string obtained by extracting the first candidate of the character recognition result from the candidate character information 307 and the character string finally confirmed by the operator's confirmation are compared and collated one by one to calculate the accuracy. Therefore, the above effect can be easily obtained.

【０１４０】また、帳票形式入力の場合、文字間の短い
空白による区切りを無視することにより、均等割付され
た文字も、単語の途中で分割されることなく正しく辞書
照合することができ、辞書照合による誤り補正効果を十
分発揮することができる。また、文書入力装置１００
は、原稿内の文字レイアウトを文字位置情報３０３とし
て記憶し、認識誤り補正処理によって得られた補正結果
３０８を文節解釈の並びからなる認識列を単位として記
憶する。従って、表示された補正結果３０８内の文字列
を訂正する場合にも、１つの文節解釈を単位として訂正
することができ、誤って文字認識された文字を１文字ず
つ訂正するわずらわしさをなくすことができる。ただ
し、帳票形式入力で、罫線枠内に均等割り付けして表示
された文字列を訂正する場合、訂正後の文字列を罫線枠
内に均等割り付けして表示させるために、文字列の先頭
文字と末尾文字とに均等割り付け情報を付与する必要が
ある。Further, in the case of inputting in the form of a form, by ignoring the delimiter due to the short space between the characters, evenly assigned characters can be correctly collated without being divided in the middle of the word, and the dictionary collation can be performed. The error correction effect due to can be sufficiently exerted. Also, the document input device 100
Stores the character layout in the manuscript as the character position information 303, and stores the correction result 308 obtained by the recognition error correction processing in units of a recognition sequence composed of a sequence of phrase interpretations. Therefore, even when the character string in the displayed correction result 308 is corrected, the correction can be made in units of one clause interpretation, and the trouble of correcting the characters erroneously recognized one by one can be eliminated. You can However, when correcting the character string that was displayed by evenly allocating it in the ruled line frame in the form input, in order to display the corrected character string evenly in the ruled line frame, the first character of the character string Even distribution information must be added to the last character.

【０１４１】また、本実施例によれば、文書入力装置１
００は、縦方向と横方向の両方の文字列方向の解釈がで
きるとき、当該認識対象ブロックについて、両方向の認
識列解釈を生成し、辞書照合による認識誤り補正処理の
結果を比較することにより正しい文字列方向の解釈を求
めることができ、文字列の方向が決まらず、そのため認
識誤り補正の効果が発揮されないといった問題点を解決
することができる。Further, according to this embodiment, the document input device 1
00 is correct by generating a recognition sequence interpretation in both directions for the recognition target block when the interpretation in both the vertical and horizontal character string directions is possible and comparing the results of the recognition error correction processing by dictionary matching. It is possible to solve the problem that the interpretation of the character string direction can be obtained, the direction of the character string is not determined, and therefore the effect of the recognition error correction is not exerted.

【０１４２】また、本実施例によれば、帳票形式入力の
場合、画像上の文字位置情報３０３および画像位置情報
２１１に基づいて、印字文字サイズを設定し、設定され
る印字文字サイズの範囲を制限することにより、罫線枠
内の文字が著しくバランスの悪い文字サイズで印字され
てしまうという問題点を解決することができる。また、
本実施例によれば、名刺入力の場合、郵便番号や電話番
号を手掛かりとして補正結果として得られた住所候補等
を制限することにより、地名認識の補正効果を高め、類
字形文字を多く含んだ地名の認識正解率が低いという問
題点を解決することができる。また、本実施例によれ
ば、郵便番号および電話番号と住所名との関係に限ら
ず、また、名刺入力の場合に限らず、同様にして、固有
名詞および一般名詞などの用語の正しい補正結果３０８
を得ることができる。Further, according to the present embodiment, in the case of the form format input, the print character size is set based on the character position information 303 on the image and the image position information 211, and the range of the set print character size is set. By restricting, it is possible to solve the problem that the characters in the ruled line frame are printed in a character size that is significantly unbalanced. Also,
According to the present embodiment, in the case of business card input, the correction effect of the place name recognition is enhanced by limiting the address candidates etc. obtained as the correction result by using the postal code or the telephone number as a clue, and a lot of glyph characters are included. It is possible to solve the problem that the recognition accuracy rate of a place name is low. Further, according to the present embodiment, the correct correction result of terms such as proper nouns and general nouns is not limited to the relationship between the postal code and the telephone number and the address name, and is not limited to the case of inputting a business card. 308
Can be obtained.

【０１４３】[0143]

【発明の効果】従って、請求項１記載の本発明によれ
ば、所定数の前記文字列画像に対する前記文字列の候補
が生成される毎に、前記正解率で表される文字認識の精
度を算出し、算出された文字認識の精度に応じて、前記
文字列候補生成手段が組み合わせに用いる文字コードの
候補の数を限定するので、文字認識精度の変動に適応し
た文字列候補生成処理を行うことができる。すなわち、
認識対象となる文書の原稿が、何度もコピーされて文字
のかすれや汚れを生じた原稿であって、文字認識精度が
低いときには、前記文字列候補生成手段が組み合わせに
用いる文字コードの候補の数を多くすることによって、
生成される文字列の候補の正解率を高めることができ
る。また、認識対象となる文書の原稿が鮮明な画像で得
られた場合のように、文字認識精度が高いときには、前
記文字列候補生成手段が組み合わせに用いる文字コード
の候補の数を少なくすることによって、前記文字列候補
生成手段の文字列候補生成速度を短縮することができ、
結果的に、文書入力装置全体の処理時間を短縮すること
ができる。Therefore, according to the present invention as set forth in claim 1, the accuracy of the character recognition represented by the correct answer rate is improved every time the character string candidates for a predetermined number of the character string images are generated. According to the calculated accuracy of character recognition, the number of character code candidates used for the combination by the character string candidate generation means is limited. Therefore, the character string candidate generation processing adapted to the fluctuation of the character recognition accuracy is performed. be able to. That is,
When the original of the document to be recognized is an original in which the characters are blurred or smeared by being copied many times, and the character recognition accuracy is low, the character string candidate generation means selects a character code candidate used for the combination. By increasing the number,
The accuracy rate of the generated character string candidates can be increased. When the character recognition accuracy is high, such as when the document of the document to be recognized is obtained as a clear image, the number of character code candidates used for combination by the character string candidate generation means is reduced. , The character string candidate generation speed of the character string candidate generating means can be shortened,
As a result, the processing time of the entire document input device can be shortened.

【０１４４】従って、請求項２記載の本発明によれば、
計数された修正指示数に応じて尤度基準記憶手段内の前
記尤度基準値を変更するので、請求項１における正解率
を算出する手間が省け、より容易に上記効果を得ること
ができる。従って、請求項３記載の本発明によれば、罫
線枠内の文字間の空白による単語区切りを無視すること
により、均等割付された文字も、単語の途中で分割され
ることなく正しく認識することができ、認識誤り補正効
果を十分発揮することができる。Therefore, according to the present invention described in claim 2,
Since the likelihood reference value in the likelihood reference storage means is changed according to the counted number of correction instructions, the time and effort for calculating the correct answer rate in claim 1 can be saved, and the above effect can be obtained more easily. Therefore, according to the present invention as set forth in claim 3, evenly-assigned characters can be correctly recognized without being divided in the middle of a word by ignoring word divisions due to spaces between characters in a ruled line frame. The recognition error correction effect can be sufficiently exerted.

【０１４５】従って、請求項４記載の本発明によれば、
識別された罫線枠内に均等割り付けして表示されている
所定の文字列の候補は、前記文字列候補記憶手段内に文
字列として格納されているので、修正の際に、文字列を
単位として修正することができる。このため、オペレー
タは、従来のように１文字毎に修正指示を入力する必要
がなく、均等割り付け表示された文字列を容易に修正す
ることができる。Therefore, according to the present invention of claim 4,
Since the predetermined character string candidates that are evenly displayed in the identified ruled line frame are stored as character strings in the character string candidate storage means, the character string is used as a unit for correction. Can be modified. Therefore, the operator does not need to input a correction instruction for each character as in the conventional case, and can easily correct the evenly displayed character strings.

【０１４６】従って、請求項５記載の本発明によれば、
取り込まれた文書の画像内の画像領域と空白領域との分
布からは、文字列の方向が縦横いずれの方向であるとも
解釈できる場合には、一旦、両方向について文字列の候
補を生成した上、前記文字列方向判定手段が、他の判断
基準に従って、文字列の方向を判定するので、より正確
に文字列の方向を判定することができる。従って、認識
誤り補正処理の効果を、十分得ることができる。Therefore, according to the present invention of claim 5,
From the distribution of the image area and the blank area in the image of the captured document, if it can be interpreted that the direction of the character string is either vertical or horizontal, once the character string candidates are generated in both directions, Since the character string direction determining means determines the direction of the character string according to another criterion, it is possible to more accurately determine the direction of the character string. Therefore, the effect of the recognition error correction processing can be sufficiently obtained.

【０１４７】従って、請求項６記載の本発明によれば、
印字された罫線枠内に、生成された文字列の候補のうち
の所定の候補を、取り込まれた文書の画像から切り出さ
れた文字の画像の文字サイズに最も近い印字フォントサ
イズで印字することができる。すなわち、本発明によれ
ば、印字された罫線枠内に、生成された文字列の候補の
うちの所定の候補を、認識対象の文書中に印字されてい
た文字サイズに応じた印字フォントサイズで印字するこ
とができる。Therefore, according to the present invention of claim 6,
It is possible to print a predetermined candidate of the generated character string candidates in the printed ruled frame with a print font size that is closest to the character size of the character image cut out from the captured document image. it can. That is, according to the present invention, a predetermined candidate of the generated character string candidates is printed in the printed ruled line frame with a print font size corresponding to the character size printed in the document to be recognized. Can be printed.

【０１４８】従って、請求項７記載の本発明によれば、
上記効果に加えて、印字された罫線枠内に、生成された
文字列の候補のうちの所定の候補を、罫線枠に対して釣
り合いがとれた文字サイズで印刷することができる。従
って、請求項８記載の本発明によれば、同一の属性を表
す用語の中で、例えば、「大田」と「太田」のように、
それぞれに正しい用語であるが類似字形を多く含むため
に認識誤りを生じやすい文字列を多く含んでいる住所表
示、氏名などの用語を認識する場合、上記検索結果との
照合を行うことにより、より正確な認識誤り補正結果を
得ることができる。Therefore, according to the present invention of claim 7,
In addition to the above effect, a predetermined candidate of the generated character string candidates can be printed in the printed ruled line frame in a character size balanced with the ruled line frame. Therefore, according to the present invention as set forth in claim 8, among terms representing the same attribute, for example, "Ota" and "Ota",
When recognizing a term such as an address display or a name, which is a correct term, but contains many character strings that tend to cause recognition errors because it contains many similar glyphs, by comparing with the above search results, An accurate recognition error correction result can be obtained.

【０１４９】従って、請求項９記載の本発明によれば、
請求項１記載の本発明と同様の効果を得ることができ
る。従って、請求項１０記載の本発明によれば、請求項
２記載の本発明と同様の効果を得ることができる。従っ
て、請求項１１記載の本発明によれば、請求項３記載の
本発明と同様の効果を得ることができる。Therefore, according to the present invention of claim 9,
It is possible to obtain the same effect as that of the present invention according to claim 1. Therefore, according to the present invention described in claim 10, it is possible to obtain the same effect as that of the present invention described in claim 2. Therefore, according to the present invention described in claim 11, it is possible to obtain the same effect as that of the present invention described in claim 3.

【０１５０】従って、請求項１２記載の本発明によれ
ば、請求項４記載の本発明と同様の効果を得ることがで
きる。従って、請求項１３記載の本発明によれば、請求
項５記載の本発明と同様の効果を得ることができる。従
って、請求項１４記載の本発明によれば、請求項６記載
の本発明と同様の効果を得ることができる。Therefore, according to the present invention of claim 12, it is possible to obtain the same effect as that of the present invention of claim 4. Therefore, according to the present invention of claim 13, it is possible to obtain the same effect as that of the present invention of claim 5. Therefore, according to the present invention of claim 14, it is possible to obtain the same effect as that of the present invention of claim 6.

【０１５１】従って、請求項１５記載の本発明によれ
ば、請求項７記載の本発明と同様の効果を得ることがで
きる。従って、請求項１６記載の本発明によれば、請求
項８記載の本発明と同様の効果を得ることができる。Therefore, according to the present invention of claim 15, it is possible to obtain the same effect as that of the present invention of claim 7. Therefore, according to the present invention of claim 16, it is possible to obtain the same effect as that of the present invention of claim 8.

[Brief description of drawings]

【図１】本発明の第１実施例である文書入力装置１００
のハードウェア構成を示すブロック図である。FIG. 1 is a document input device 100 according to a first embodiment of the present invention.
FIG. 3 is a block diagram showing the hardware configuration of the above.

【図２】文書入力装置１００の認識結果を格納するため
に主記憶１０１上に設けられる記憶領域の区分と、その
記憶内容とを示す説明図である。FIG. 2 is an explanatory diagram showing divisions of a storage area provided on a main memory 101 for storing a recognition result of the document input device 100 and storage contents thereof.

【図３】認識列管理部２０４に記憶される認識列管理情
報３０１のデータフォーマットを示す説明図である。FIG. 3 is an explanatory diagram showing a data format of recognition string management information 301 stored in a recognition string management unit 204.

【図４】主記憶１０１上に設けられる単語辞書記憶部４
１２の記憶領域区分を示す説明図である。FIG. 4 is a word dictionary storage unit 4 provided on the main memory 101.
It is explanatory drawing which shows 12 storage area divisions.

【図５】主記憶１０１上に設けられる名刺データ認識結
果一時記憶部５０１の記憶領域区分を示す説明図であ
る。5 is an explanatory diagram showing storage area divisions of a business card data recognition result temporary storage unit 501 provided on the main memory 101. FIG.

【図６】主記憶１０１上に設けられる名簿情報記憶部６
０１の記憶領域区分を示す説明図である。FIG. 6 is a name list information storage unit 6 provided on the main memory 101.
It is explanatory drawing which shows the memory area division of 01.

【図７】主記憶１０１上に設けられる作業領域の記憶領
域区分を示す説明図である。7 is an explanatory diagram showing storage area division of a work area provided on the main memory 101. FIG.

【図８】ＣＰＵ１２０の処理手順の全体の概要を示すフ
ローチャートである。FIG. 8 is a flowchart showing an overall outline of the processing procedure of the CPU 120.

【図９】文書入力処理の初期段階における表示画面およ
びデータの具体例を示す説明図である。FIG. 9 is an explanatory diagram showing a specific example of a display screen and data in an initial stage of document input processing.

【図１０】図８に示したステップＳ８０５におけるテキ
スト認識処理の内容を詳細に説明するフローチャートで
ある。FIG. 10 is a flowchart illustrating in detail the content of text recognition processing in step S805 shown in FIG.

【図１１】文字認識処理により生成された認識文字情報
３０２および認識誤り補正後に表示される表示画面の具
体例を示す説明図である。FIG. 11 is an explanatory diagram showing a specific example of the recognized character information 302 generated by the character recognition processing and the display screen displayed after recognition error correction.

【図１２】図１０のステップＳ１００４における候補文
字限定処理を説明するフローチャートである。FIG. 12 is a flowchart illustrating a candidate character limitation process in step S1004 of FIG.

【図１３】図１０のステップＳ１００５における認識誤
り補正処理を説明するフローチャートである。13 is a flowchart illustrating a recognition error correction process in step S1005 of FIG.

【図１４】図８のステップＳ８０８における名刺テキス
ト認識処理を説明するフローチャートである。FIG. 14 is a flowchart illustrating a business card text recognition process in step S808 of FIG.

【図１５】名刺入力処理における表示画面およびデータ
の具体例を示す説明図である。FIG. 15 is an explanatory diagram showing a specific example of a display screen and data in a business card input process.

【図１６】図８のステップＳ８０９における名簿更新処
理の手順を説明するフローチャートである。16 is a flowchart illustrating a procedure of name list update processing in step S809 of FIG.

【図１７】図８のステップＳ８１２における帳票テキス
ト認識処理の手順を説明するフローチャートである。FIG. 17 is a flowchart illustrating a procedure of form text recognition processing in step S812 of FIG.

【図１８】帳票形式入力処理において入力されるテキス
ト画像と、その文字列方向の推定方法との具体例を示す
説明図である。FIG. 18 is an explanatory diagram showing a specific example of a text image input in a form format input process and a method of estimating the character string direction.

【図１９】図８のステップＳ８１３に示した印刷位置設
定処理を説明するフローチャートである。19 is a flowchart illustrating the print position setting process shown in step S813 of FIG.

【図２０】入力された帳票を印刷する処理の手順を示す
フローチャートである。FIG. 20 is a flowchart illustrating a procedure of a process of printing an input form.

[Explanation of symbols]

１００文書入力装置１０１主記憶１０２バス制御ユニット１０４ローカルバス１０５グラフィクス制御ユニット１０６ディスプレイ１０７ＳＣＳＩ制御ユニット１０８ハードディスク装置１０９スキャナー１１０バスブリッジ回路１１１外部バス１１２入出力制御ユニット１１３キーボード１１４プリンタ１１５マウス１２０ＣＰＵ 100 Document Input Device 101 Main Memory 102 Bus Control Unit 104 Local Bus 105 Graphics Control Unit 106 Display 107 SCSI Control Unit 108 Hard Disk Device 109 Scanner 110 Bus Bridge Circuit 111 External Bus 112 Input / Output Control Unit 113 Keyboard 114 Printer 115 Mouse 120 CPU

Claims

[Claims]

1. A document input device for recognizing a character image included in an image of a captured document and inputting a document captured as an image as a document having a character code obtained as a recognition result. As a character recognition result of an image of one character, a character code candidate storage unit that stores a plurality of character code candidates in association with a likelihood indicating the likelihood, and stores a likelihood reference value set in advance for the likelihood. Likelihood criterion storage means, and for the same character string image representing the character string in the image of the captured document, of the character code having a likelihood equal to or greater than the likelihood reference value in the likelihood criterion storage means. From a combination of candidates, a character string candidate generation unit that generates a plurality of candidates for a character string that can be grammatically correctly interpreted, and a character string candidate corresponding to the character string image. In contrast, among the character string candidates generated for the same character string image, display means for displaying a predetermined candidate, and for the character string candidates generated corresponding to the character string image, An input unit for inputting a correction instruction from the operator, and the character string candidates generated corresponding to the character string image are corrected in accordance with the correction instruction to correct a character code that is erroneously recognized. A correction unit that corrects to a character code, a character string after correction by the correction unit, which corresponds to the character string image, and a character that is a candidate of a character code with the highest likelihood corresponding to the same character string image. Comparing means for comparing the column, after the comparison by the comparing means, based on the comparison result, from the ratio of the number of characters that differ between the two and the number of characters included in the one character string, character recognition A correct answer rate calculating means for calculating a correct answer rate, a correct answer rate reference storage means for storing a predetermined correct answer rate reference value of the correct answer rate, and a calculated correct answer rate after the correct answer rate is calculated, And a likelihood reference value changing means for changing the likelihood reference value in the likelihood reference storage means according to the calculated correct answer rate when the predetermined condition is satisfied by comparison with the reference value. Characteristic document input device.

2. A document input device for recognizing a character image included in an image of a captured document and inputting a document captured as an image as a document having a character code obtained as a recognition result. As a character recognition result of an image of one character, a character code candidate storage unit that stores a plurality of character code candidates in association with a likelihood indicating the likelihood, and stores a likelihood reference value set in advance for the likelihood. Likelihood criterion storage means, and for the same character string image representing the character string in the image of the captured document, of the character code having a likelihood equal to or greater than the likelihood reference value in the likelihood criterion storage means. From a combination of candidates, a character string candidate generation unit that generates a plurality of candidates for a character string that can be grammatically correctly interpreted, and a character string candidate corresponding to the character string image. In contrast, among the character string candidates generated for the same character string image, display means for displaying a predetermined candidate, and for the character string candidates generated corresponding to the character string image, An input means for inputting a correction instruction from the operator, a correction instruction counting means for counting the correction instruction input from the input means, and a correction instruction standard for storing a predetermined correction instruction number reference value of the correction instruction number. Storage means, correction means for correcting a character code that is erroneously recognized to a correct character code in accordance with the correction instruction for the character string candidates generated corresponding to the character string image, After the correction by the correction means, when the counted correction instruction number is compared with the correction instruction number reference value stored in the correction instruction reference storage means and satisfies a predetermined condition,
And a likelihood reference value changing unit that changes the likelihood reference value in the likelihood reference storage unit according to the counted number of correction instructions.

3. A document input device for recognizing an image of characters included in an image of a captured document and inputting the document captured as an image as a document having a character code obtained as a recognition result, which comprises one line Blank detecting means for detecting a blank between the image of one character cut out from the character string image representing the character string and the image of one character adjacent to the image, and when the detected blank is larger than a predetermined size, 1
The image of a character and the image of one character adjacent thereto are the word identification means for identifying the character as a character belonging to a different word, and the character string image in a predetermined area in the captured image of the document. A document input device, comprising: a word identification limiting unit that invalidates an identification result of the identification unit.

4. A document input device for recognizing a character image included in an image of a captured document and inputting the document captured as the image as a document having a character code obtained as a recognition result. Character code candidate storing means for storing a plurality of character code candidates as a character recognition result of an image of one character cut out from an image of a captured document, and ruled line frame identification for identifying a ruled line frame in the captured image of the document Means and a character string for generating a plurality of character string candidates that can be grammatically interpreted by combining the character code candidates for the same character string image representing the character string in the captured document image Candidate generating means, character string candidate storing means for storing the generated character string candidates, and a predetermined character string candidate corresponding to one character string image, It is read from the unit, a document input device comprising: a display means for displaying to Distribute identified border frame.

5. A document input device for recognizing a character image included in an image of a captured document and inputting a document captured as an image as a document having a character code obtained as a recognition result. Character code candidate storage means for storing a plurality of character code candidates as a character recognition result of an image of one character cut out from an image of a captured document, and distribution of an image area and a blank area in the captured image of the document A character string direction identification means for identifying whether the direction of the character string in the image of the captured document is vertical or horizontal, and the distribution of the image area and the blank area in the image of the captured document. From the above, when it can be interpreted that the direction of the character string is either the vertical or horizontal direction, the character code of the same character string image representing the character string in each of the two directions. A character string candidate generation unit that generates a plurality of character string candidates that can be combined with each other and can be correctly interpreted grammatically, and a predetermined candidate among the character string candidates generated for the vertical character string image. And a predetermined candidate of the character string candidates generated for the horizontal character string image are compared, and the character string direction and the character string of one of the character strings satisfying a predetermined condition by the predetermined candidate. A document input device, comprising: a character string direction determining means for determining a candidate as valid.

6. A document input device for recognizing a character image included in an image of a captured document and inputting the document captured as the image as a document having a character code obtained as a recognition result. Ruled line frame identifying means for identifying the ruled line frame in the image of the captured document and a plurality of character code candidates as the character recognition result of the image of one character cut out from the captured image of the document. Character code candidate storage means for storing together with the position and character size in the image of the document, and the same character string image representing the character string in the captured image of the document, from the combination of the character code candidates, the grammar Character string candidate generation means for generating a plurality of character string candidates that can be correctly interpreted, and a predetermined candidate among the generated character string candidates. When printing in the identified ruled line frame, the character size of the image corresponding to the character code candidate in the character string candidate is checked, and the character size checked from the printable font sizes that can be printed A printing character size selection means for selecting a printing font size closest to the above, and printing the identified ruled line frame, and selecting a predetermined candidate of the generated character string candidates in the printed ruled line frame. And a printing unit that prints in the specified print character size.

7. The document input device according to claim 6, wherein the position in the image of the document of the image corresponding to the character code candidate in the character string candidates is referred to with reference to the character code candidate storage means. By referring to the vertical space calculation means for calculating the size of the upper and lower blank areas sandwiching the image of the character and the character code candidate storage means, and the character code candidates in the character string candidates correspond to each other. The position of the image to be scanned in the image of the document is checked, and the left and right space calculating means for calculating the size of the left and right blank areas sandwiching the image of the character, and the vertical size of the selected print font size are
An upper and lower size limiter that limits the size of the selected print font size in the vertical direction so that it does not exceed the predetermined size compared to the size of the calculated upper and lower blank areas, and the left and right of the selected print font size. The size of the direction is
And a left and right size limiting means for limiting the size of the selected print font size in the left and right direction so as not to exceed a predetermined size as compared with the calculated left and right blank areas. The document input device is characterized in that a print font size closest to the examined character size is selected from the print font sizes limited by the vertical size limiting unit and the horizontal size limiting unit.

8. A document input device for recognizing a character image included in an image of a captured document and inputting the document captured as an image as a document having a character code obtained as a recognition result. Character code candidate storing means for storing a plurality of character code candidates as a result of character recognition of an image of one character cut out from an image of a document, and the character code candidates for the same character string image representing a character string. Character string candidate generating means for generating a plurality of character string candidates that can be correctly interpreted as terms with a predetermined priority, and a specific character code is identified from the generated character string candidates. Then, the attribute of the term represented by the character string candidate including the character code and the character string candidate generated for the same character string image as the character string candidate is determined. Attribute determination means, inter-attribute correspondence term storage means for storing terms having mutually related attributes in association with each other, and storing the character string candidates for which attributes have been determined together with the priority order for each predetermined attribute And a character string candidate stored in the attribute-based character string candidate storage means that represents the attribute of a term stored in the inter-attribute correspondence term storage means. Of the character string candidates stored in the attribute-based character string candidate storage means, a term search means for searching for a corresponding term from the inter-attribute correspondence term storage means using each of the character string candidates as a key , Priority changing means for increasing the priority of the character string candidates that are common to the search result of the term searching means using the character string candidates of other attributes that are mutually related as keys. A document input device characterized by:

9. A predetermined likelihood reference value and a predetermined correct answer rate reference value are stored in a likelihood reference storage section and a correct answer rate reference storage section, respectively, and a character included in an image of a captured document is stored. While recognizing an image as a character, as a character recognition result of an image of one character, a plurality of character code candidates are stored in a character code candidate storage unit in association with a likelihood indicating the certainty, and captured as an image. A document input method for inputting a document as a document including a character code obtained as a recognition result, wherein the same character string image representing a character string in an image of a captured document is stored in the likelihood criterion storage unit. A character string candidate generation step for generating a plurality of character string candidates that can be grammatically correctly interpreted from a combination of the character code candidates having a likelihood equal to or greater than the stored likelihood reference value. A display step of displaying a predetermined candidate among the character string candidates generated for the same character string image with respect to the character string candidate generated corresponding to the character string image, For the character string candidates generated corresponding to the character string image, an input step of inputting a correction instruction from an operator, and for the character string candidates generated corresponding to the character string image A correction step of correcting a character code that is erroneously recognized according to the correction instruction to a correct character code; A comparing step of comparing with a character string composed of the corresponding most likely character code candidates; and after the comparison, based on the comparison result, the number of characters different between the two and the one character string A correct answer rate calculation step of calculating a correct answer rate of character recognition from a ratio with the number of characters included, and a correct answer rate calculated after the correct answer rate is calculated, and the correct answer rate stored in the correct answer rate reference storage unit. When satisfying a predetermined condition compared with the rate reference value, change the likelihood reference value in the likelihood reference storage unit according to the calculated correct answer rate,
And a likelihood reference value changing step of returning to the character string candidate generating step.

10. A predetermined likelihood reference value and a predetermined correction instruction number reference value are stored in a likelihood reference storage unit and a correction instruction reference storage unit, respectively, and characters included in an image of a captured document are stored. While recognizing the image of the image, as a character recognition result of the image of one character, a plurality of candidates for the character code are stored in the character code candidate storage unit in association with the likelihood indicating the certainty, and captured as an image. A document input method for inputting a document as a document consisting of a character code obtained as a recognition result, wherein the same character string image representing the character string in the image of the captured document is stored in the likelihood reference storage unit. The character string candidate generation step for generating a plurality of character string candidates that can be grammatically correctly interpreted from a combination of the character code candidates having a likelihood equal to or higher than the likelihood reference value of A display step of displaying a predetermined candidate among the character string candidates generated for the same character string image with respect to the character string candidate generated corresponding to the character string image, An input step of inputting a correction instruction from an operator for the candidate of the character string generated corresponding to the character string image, a correction instruction counting step of counting the input correction instruction, and the character string image A correction step of correcting a character code that is erroneously recognized into a correct character code according to the correction instruction for the corresponding candidate of the character string generated, and the correction instruction counted after the correction. When the number satisfies the predetermined condition compared with the correction instruction number reference value stored in the correction instruction reference storage unit, the likelihood in the likelihood reference storage unit is calculated according to the counted correction instruction number. Document input method characterized by comprising the likelihood reference value changing step of changing the reference value.

11. A document input method for recognizing a character image included in an image of a captured document and inputting a document captured as an image as a document having a character code obtained as a result of recognition, comprising: A blank detection step of detecting a blank between an image of one character cut out from a character string image representing the character string of and the image of one character adjacent to the image, and when the detected blank is larger than a predetermined size, 1
The character image and the image of one character adjacent to the character image are identified as characters belonging to different words, and a character string image in a predetermined area in the captured image of the document is described above. And a word identification limiting step of invalidating the identification result.

12. While recognizing one character image cut out from the captured image of a document, a plurality of character code candidates are stored in the character code candidate storage unit as a recognition result, and the document captured as the image is stored. A document input method for inputting as a document composed of character codes obtained as a recognition result, the ruled line frame identifying step of identifying a ruled line frame in the image of the captured document, and the characters in the image of the captured document. A character string candidate generating step of generating a plurality of character string candidates that can be grammatically correctly interpreted by combining the character code candidates for the same character string image representing a string, and the generated character string candidates. Is stored in the character string candidate storage section, and a predetermined character string candidate corresponding to one character string image is read from the character string candidate storage section. To, document input method characterized by comprising a display step of displaying to Distribute the identified line border.

13. While recognizing a character image included in an image of a captured document, a plurality of character code candidates are stored in a character code candidate storage unit as a character recognition result of an image of one character and captured as an image. A document input method for inputting a document as a document composed of the character codes obtained as a recognition result, and the document is scanned based on the distribution of the image area and the blank area in the image of the captured document. From the character string direction identification step that identifies whether the direction of the character string in the image is vertical or horizontal, and the distribution of the image area and the blank area in the image of the captured document, the direction of the character string is vertical or horizontal. When it can be interpreted as either direction, the same character string image representing the character string in each of the two directions is grammatically correct from the combination of the character code candidates. A character string candidate generating step of generating a plurality of interpretable character string candidates; a predetermined candidate of character string candidates generated for the vertical character string image; and a horizontal character string. A character string that compares a predetermined candidate among the character string candidates generated for the image, and determines the character string direction and the character string candidate in which the predetermined candidate satisfies a predetermined condition. A document inputting method comprising: a direction determining step.

14. While character-recognizing a character image included in an image of a captured document, a plurality of character code candidates in the image of the character of the character image are detected as a character recognition result of the image of one character. A method for inputting a document stored as an image together with the position and character size in the character code candidate storage unit as a document consisting of the character code obtained as a recognition result. A ruled line frame identifying step of identifying a ruled line frame and a character string that can be grammatically correctly interpreted from the combination of the character code candidates for the same character string image representing the character string in the image of the captured document. And a predetermined candidate of the generated character string candidates is printed in the identified ruled line frame. In doing so, the character size of the image corresponding to the character code candidate in the character string candidate is checked, and the print font size closest to the checked character size is selected from the print font sizes that can be printed. Print character size selection step and printing of the identified ruled line frame, and printing of predetermined candidates of the generated character string candidates in the printed ruled line frame at the selected print character size A method for inputting a document, comprising:

15. The document input method according to claim 14, wherein, after the ruled line frame identifying step, the image corresponding to the character code candidate in the character string candidates is referred to by referring to the character code candidate storage unit. Of the character string candidates by referring to the character code candidate storage unit to check the position of the document in the image, and calculate the size of the upper and lower blank areas sandwiching the image of the character. Of the position of the image corresponding to the character code candidate in the image of the document, the left and right space calculation step of calculating the size of the left and right blank areas sandwiching the image of the character, and the selected print font size The vertical size is
The upper and lower size limiting step that limits the size of the selected print font size in the vertical direction so that it does not exceed the predetermined size compared to the calculated size of the upper and lower blank areas, and the left and right of the selected print font size. The size of the direction is
Document input characterized by comprising a left and right size limiting step of limiting the left and right size of the selected print font size so as not to exceed a predetermined size as compared with the calculated left and right blank areas. Method.

16. An image of one character while preliminarily storing, in an inter-attribute correspondence term storage unit, associating terms having mutually related attributes with each other and recognizing a character image included in an image of a captured document. Is a document input method for storing a plurality of character code candidates as a character recognition result in the character code candidate storage unit and inputting a document captured as an image as a document including the character code obtained as a recognition result. A character string candidate generating step of generating a plurality of character string candidates that can be correctly interpreted as terms by combining the character code candidates for the same character string image representing a column, with a predetermined priority order; A specific character code is identified from among the character string candidates, and a character string candidate containing the character code and the same character string image as the candidate character string are displayed. And an attribute determination step of determining the attribute of the term represented by the generated character string candidate, and storing the character string candidates for which the attribute has been determined together with the priority in the attribute-specific character string candidate storage unit for each predetermined attribute. An attribute-specific character string candidate storage step, which is a candidate for a character string stored in the attribute-specific character string candidate storage unit, and which represents an attribute of a term stored in the inter-attribute correspondence term storage unit Of the character string candidates stored in the attribute-specific character string candidate storage unit, a term search step of searching for a corresponding term from the inter-attribute correspondence term storage unit using the character string candidates as a key A priority order changing step for increasing the priority order of the character string candidates that are common to the search result using the character string candidates of other attributes that are mutually related as a key. Document input method.