JPH0355679A - Collating method between character recognition result and character input result - Google Patents
Collating method between character recognition result and character input result
- Publication number
- JPH0355679A (application: JP1191099A / JP19109989A)
- Authority
- JP
- Japan
- Prior art keywords
- character
- characters
- result
- input
- recognition result
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Character Discrimination (AREA)
Abstract
Description
This invention relates to a method for collating a character recognition result with a character input result; that is, a method of comparing a recognition result character, obtained when a target character is recognized by a character recognition device, with an input result character, which is the output produced when the same target character is entered via a keyboard.
Conventionally, when a large volume of characters has to be entered in order to encode documents, the following method has been used to improve reliability. First, the document is converted into an image by an image scanner, and a character recognition device recognizes characters in the converted output to create collation data. A program then collates this collation data against confirmation data entered by an operator. If the two differ at some position, it is determined which of the two is in error, and based on that determination the position is corrected by key input until no errors remain.
As explained above, in the conventional technology, even if the confirmation data entered by the operator is correct, an error in the collation data produced by the character recognition device causes a mismatch to be reported, and a judgment is then required as to which of the two is correct at the differing position. Such mismatches occurred frequently because of errors originating in the character recognition device, so needless judgments were often required and the efficiency of the collation work was significantly impaired.
An object of the present invention is to eliminate these problems of the conventional technology and to provide a method for collating character recognition results with character input results that more efficiently collates a recognition result character, obtained when a target character is recognized via a character recognition device, with an input result character, which is the output when the target character is entered via a keyboard.
To solve this problem, the method for collating a character recognition result with a character input result according to the present invention compares a recognition result character, obtained when a target character to be recognized is recognized via a character recognition device, with an input result character, which is the output when the target character is entered via a keyboard, and comprises the following steps:
selecting one or more candidate characters as the recognition result character based on similarity; and
determining that the collation result is a match when the input result character is the same as any of the candidate characters of the recognition result character.
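The candidate-selection step can be sketched as follows. This is only an illustration under stated assumptions: the patent does not say how similarity is computed, so the scores are taken as given, and the function name `select_candidates`, the `threshold` value, and the sample scores are hypothetical. The default limit of three follows the embodiment described later in the text.

```python
from typing import Dict, List


def select_candidates(
    similarities: Dict[str, float],
    max_candidates: int = 3,   # limit of three, as in the embodiment
    threshold: float = 0.5,    # assumption: illustrative similarity cutoff
) -> List[str]:
    """Return the characters whose similarity meets the threshold,
    best first, keeping at most max_candidates of them."""
    ranked = sorted(similarities.items(), key=lambda kv: kv[1], reverse=True)
    return [ch for ch, score in ranked if score >= threshold][:max_candidates]


# Hypothetical similarity scores for one scanned glyph.
scores = {"0": 0.91, "O": 0.88, "Q": 0.74, "D": 0.61, "8": 0.30}
print(select_candidates(scores))  # ['0', 'O', 'Q']
```

Lowering `threshold` corresponds to relaxing the similarity threshold, while `max_candidates` caps how many alternatives are passed on to collation.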
One or more candidate characters are selected as the recognition result character based on similarity. Each candidate character is then compared with the input result character. If the input result character is the same as any of the candidate characters, the collation result is determined to be a match; if it is the same as none of them, the collation result is determined to be a mismatch. A mismatch therefore indicates that at least one of the input result character and the candidate characters of the recognition result is erroneous, and the error can then be corrected.
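A minimal sketch of this match rule, assuming the candidate characters for each position are already available as a list: the names `collate` and `find_mismatches` and the sample data are hypothetical and do not come from the patent.

```python
from typing import List, Sequence


def collate(candidates: Sequence[str], typed: str) -> bool:
    """Match (True) if the keyed character equals any recognition candidate."""
    return typed in candidates


def find_mismatches(
    candidate_lists: Sequence[Sequence[str]], typed_chars: Sequence[str]
) -> List[int]:
    """Return the positions whose collation result is a mismatch,
    i.e. the positions an operator would still need to review."""
    return [
        pos
        for pos, (cands, ch) in enumerate(zip(candidate_lists, typed_chars))
        if not collate(cands, ch)
    ]


# Hypothetical data: recognition candidates per position, and the keyed text.
ocr = [["0", "O", "Q"], ["l", "1"], ["5"]]
typed = "O1S"
print(find_mismatches(ocr, typed))  # [2]
```

With a single candidate per position, the first two positions would also have been flagged; allowing several candidates leaves only position 2 for review.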
A collation device to which the method for collating character recognition results with character input results according to the present invention is applied is described below with reference to the drawings.
FIG. 2 is a block diagram showing the configuration of this collation device. In the figure, 1 is an image scanner for the target document, and 2 is a recognition unit that performs character recognition based on the output of the image scanner 1. Whereas a conventional recognition unit outputs only the single character with the highest similarity to the target character as the recognition result, the recognition unit 2 outputs up to three characters as candidate characters, either by taking characters in descending order of similarity or by relaxing the similarity threshold.
3 is a keyboard on which an operator enters characters while viewing the target document. 4 is a memory for data on the recognition result characters from the recognition unit 2, and 5 is a memory for data on the input result characters from the keyboard 3.
6 is a collation unit that collates the corresponding character data from the memories 4 and 5 and determines whether they match or mismatch. 7 is a CRT that displays the collation result on a screen.
A printer that prints the collation results may also be installed alongside the CRT 7.
FIG. 3 shows the structure of the data file of recognition result characters in the memory 4. In the figure, X1, X2, and X3 at addresses 001, 002, and 003 are candidate characters for the recognition result of a target character X (not shown); in this case the maximum of three candidates has been selected. Y1 and Y2 at addresses 004 and 005 are candidate characters for the recognition result of a target character Y (not shown); in this case only two candidates could be selected. Similarly, Z1 at address 006 is the sole candidate character for the recognition result of a target character Z (not shown).
The operation of the collation device is described below, referring mainly to the flowchart of FIG. 1 and, as a supplement, to the block diagram of FIG. 2. In step S1, the candidate characters Xi of the recognition result for the target character X are read from the memory 4 (see FIG. 2), and the input result character X0 for the target character X is read from the memory 5.
In step S2, the counter i for the candidate characters Xi is initialized to i = 1. In step S3, it is determined whether Xi exists; if YES (it exists), processing proceeds to step S4, and if NO (it does not), to step S7, where the collation result is set to NG. In step S4, it is determined whether the candidate character Xi matches the input result character X0; if YES (match), the collation result is set to OK in step S8. If NO (mismatch), processing passes through steps S5 and S6, and the procedure from step S3 onward is repeated for all candidate characters. When the repetition ends without a match, processing moves to step S7 and ends. After step S7 or S8, the collation result is displayed on the CRT in step S9, and the collation processing for the target character X is complete. The next target character is then collated in the same way.
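The flowchart steps can be traced in code roughly as follows. This is a sketch under assumptions, not the device's actual implementation: the text does not describe steps S5 and S6, so they are assumed here to advance the counter and check the three-candidate limit; the strings "X1", "Y2", and so on stand in for real characters; and the dictionaries `memory4` and `memory5`, keyed by target character, are a hypothetical simplification of the address-based file of FIG. 3.

```python
from typing import Dict, List

MAX_CANDIDATES = 3  # limit stated for the recognition unit


def collate_flowchart(candidates: List[str], x0: str) -> str:
    """Trace steps S1 to S9 for one target character; returns "OK" or "NG"."""
    # S1: the caller has read the candidates (memory 4) and x0 (memory 5).
    i = 1                                  # S2: initialize the counter
    result = "NG"                          # S7 outcome unless a match is found
    while i <= len(candidates):            # S3: does candidate Xi exist?
        if candidates[i - 1] == x0:        # S4: compare Xi with X0
            result = "OK"                  # S8: collation result = OK
            break
        i += 1                             # S5 (assumed): advance the counter
        if i > MAX_CANDIDATES:             # S6 (assumed): all candidates tried?
            break                          # fall through to S7
    print(f"collation result = {result}")  # S9: stand-in for the CRT display
    return result


# Candidates per target character, mirroring the example of FIG. 3.
memory4: Dict[str, List[str]] = {
    "X": ["X1", "X2", "X3"],
    "Y": ["Y1", "Y2"],
    "Z": ["Z1"],
}
# Hypothetical keyed-in results for the same three target characters.
memory5: Dict[str, str] = {"X": "X2", "Y": "Y3", "Z": "Z1"}

results = {t: collate_flowchart(memory4[t], memory5[t]) for t in memory4}
print(results)  # {'X': 'OK', 'Y': 'NG', 'Z': 'OK'}
```

An empty candidate list leaves the loop immediately and yields NG, which corresponds to the NO branch of step S3 on the first pass.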
As described above, because the present invention collates one or more candidate characters, serving as the recognition result character, against the input result character, it has the following advantages over the conventional technology. A match determination improves confidence in the accuracy of both the recognition result character and the input result character, and the waste in collation and subsequent correction processing caused by needless, frequently occurring mismatch determinations is avoided, making collation and correction more efficient.
FIG. 1 is a flowchart showing the collation operation of a collation device to which the method of the present invention is applied; FIG. 2 is a block diagram showing the configuration of this collation device; and FIG. 3 is a diagram showing the structure of the data file of recognition result characters in this collation device.
Explanation of reference numerals
1: image scanner, 2: recognition unit,
Claims (1)
1) A method of collating a character recognition result with a character input result, in which a recognition result character, obtained when a target character to be recognized is recognized via a character recognition device, is compared with an input result character, which is the output when the target character is input via a keyboard, the method comprising the steps of: selecting one or more candidate characters as the recognition result character based on similarity; and determining that the collation result is a match when the input result character is the same as any of the candidate characters of the recognition result character.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP1191099A JPH0355679A (en) | 1989-07-24 | 1989-07-24 | Collating method between character recognition result and character input result |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP1191099A JPH0355679A (en) | 1989-07-24 | 1989-07-24 | Collating method between character recognition result and character input result |
Publications (1)
Publication Number | Publication Date |
---|---|
JPH0355679A (en) | 1991-03-11 |
Family
ID=16268839
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP1191099A Pending JPH0355679A (en) | 1989-07-24 | 1989-07-24 | Collating method between character recognition result and character input result |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPH0355679A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7003162B2 (en) | 2000-11-27 | 2006-02-21 | Omron Corporation | Apparatus and method for examining images |
- 1989-07-24: JP application JP1191099A filed (publication JPH0355679A, status: pending)
Similar Documents
Publication | Title | Publication Date |
---|---|---|
JPH0355679A (en) | Collating method between character recognition result and character input result | |
JPH04124785A (en) | Confirmation and correction method for ocr recognition result | |
CN114741474B (en) | Data processing method applied to project declaration system | |
JPH05274467A (en) | Data input device | |
JP2908187B2 (en) | Confirmation and correction method of OCR recognition result | |
JPH05119896A (en) | Character string input system | |
JPH11272801A (en) | Method for recognizing series of patterns in image and its detecting deformation and record medium where its program is recorded | |
JPH02126384A (en) | Pattern recognizing method for character recognizing system | |
JPH0331981A (en) | Character recognizing device | |
JP3037727B2 (en) | OCR system | |
US6751353B1 (en) | Method for adaptive recognition of information images and system of implementation thereof | |
JP3033444B2 (en) | Database comparison method | |
JPH0291785A (en) | Character recognizing device | |
JPH03161844A (en) | Spelling correcting system | |
JPH02115930A (en) | Patch managing system | |
JPH0492973A (en) | Image information registration/retrieval device | |
JPH03161886A (en) | Method for correcting misreading of ocr | |
JPH06251187A (en) | Method and device for correcting character recognition error | |
JPH04220787A (en) | Character reader | |
JPH1159345A (en) | Safety check certificate reading device in vehicle data processing system | |
JPH0318987A (en) | Dictionary registering method | |
JPH03164887A (en) | Optical character reader | |
JPH08287190A (en) | Optical character reader | |
JPH06332956A (en) | Data processing method | |
JP2002123793A (en) | Slip reading method and terminal equipment |