JPH0325691A - Correcting method for misread character - Google Patents

Correcting method for misread character

Info

Publication number
JPH0325691A
JPH0325691A JP1159749A JP15974989A JPH0325691A JP H0325691 A JPH0325691 A JP H0325691A JP 1159749 A JP1159749 A JP 1159749A JP 15974989 A JP15974989 A JP 15974989A JP H0325691 A JPH0325691 A JP H0325691A
Authority
JP
Japan
Prior art keywords
character
misread
characters
misreading
correct
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP1159749A
Other languages
Japanese (ja)
Other versions
JP2669897B2 (en
Inventor
Akiko Konno
紺野 章子
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuji Electric Co Ltd
Fuji Facom Corp
Original Assignee
Fuji Electric Co Ltd
Fuji Facom Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuji Electric Co Ltd, Fuji Facom Corp filed Critical Fuji Electric Co Ltd
Priority to JP1159749A priority Critical patent/JP2669897B2/en
Publication of JPH0325691A publication Critical patent/JPH0325691A/en
Application granted granted Critical
Publication of JP2669897B2 publication Critical patent/JP2669897B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To effectively correct the misread characters when many misreading cases occur by storing the misreading contents and the preceding and following character strings to perform the automatic correction with presence of character strings regarded as equal to each other and to urge the corresponding operation with the characters similar to each other. CONSTITUTION:When a misread character is corrected, the misreading contents and the character strings preceding and following the misread character are stored and a character string equal to the misread character is searched out of the subsequent reading results. Then the misread character is automatically corrected if either one of preceding and following characters of the character strings regarded as equal to each other is coincident with the misread character. When both preceding and following characters are not coincident with the misread character, the positions of both characters are shown to decide whether the relevant character is correct or not, whether the correction equal to the automatic correction is required or not, and whether the candidate characters must be selected or not. Then the corresponding operation is urged. Thus the repetitive misreading cases are extracted and corrected automatically. As a result, such kinds of correction can be easily and accurately carried out.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は、入力された文字を認識し出力する文字認識装
置にて誤読された文字を修正する方法に関する. 〔従来の技術〕 文字認識後の誤読文字修正としては、画面上に表示され
た認識結果の文字列からオペレータが誤読文字を指示し
、その文字に対する候補文字群を表示させ、その中から
正しい文字を選択するという方法が広く行なわれている
. 〔発明が解決しようとする課題〕 しかしながら、このような方法では、文書中で同じ誤読
が複数個所で発生しても、その都度オペレータが誤読個
所を発見し候補文字選択を行なわなければならず、見落
としが起こることも多い。
DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a method for correcting characters misread by a character recognition device that recognizes and outputs input characters. [Prior art] To correct misread characters after character recognition, an operator indicates the misread character from the recognition result character string displayed on the screen, displays a group of candidate characters for that character, and selects the correct character from among them. The method of selecting is widely used. [Problems to be Solved by the Invention] However, in such a method, even if the same misreading occurs in multiple locations in a document, the operator must discover the misreading location and select candidate characters each time. Overlooks often occur.

したがって、本発明の課題は繰り返し起こる同じ誤読を
抽出して修正し得るようにすることにある. 〔課題を解決するための手段〕 入力された文字を認識し出力する文字認識装置で誤読さ
れた文字を修正するに当たり、まず誤読文字を修正して
誤読の内容とその前後の文字列を記憶しておき、以後の
読み取り結果の中から誤読文字と同じ文字列を探索し、
その結果同じ文字列とみなされたものの前後の文字のい
ずれか一方が一致するときは自動的に修正を施し、前後
の文字が一致しないものはその場所を示し、その文字が
正解か、または自動修正と同様の修正が必要か、もしく
は候補文字選択が必要かを表示して対応する操作を促す
Therefore, the problem of the present invention is to extract and correct the same misreadings that occur repeatedly. [Means for solving the problem] When correcting characters that are misread by a character recognition device that recognizes and outputs input characters, first correct the misread characters and memorize the content of the misread and the character strings before and after it. Then, search for the same string as the misread character from the subsequent reading results,
As a result, if one of the characters before or after matches the same character string, it will be automatically corrected, and if the characters before or after do not match, the location will be shown and the character will be correct or automatically corrected. Displays whether a similar correction is required or whether candidate character selection is required and prompts the corresponding operation.

〔作用〕[Effect]

1つの文書において、同じ誤読が頻繁に起こる可能性が
多いことに着目して、誤読文字の修正を効率的に行なお
うとするもので、繰り返し起こる同じ誤読を抽出して自
動的に修正を施すことにより、この種の修正を容易かつ
正確に行なう。
Focusing on the fact that the same misreadings are likely to occur frequently in a single document, the system attempts to efficiently correct misreading characters.It extracts the same misreadings that occur repeatedly and automatically corrects them. This makes this type of correction easy and accurate.

〔実施例〕〔Example〕

第1図は本発明の実施例を示すフローチャート、第2図
は本発明を具体的に説明するための説明図、第3図は本
発明が適用されるシステム概要図、第4図はその動作を
説明するための概要フローチャートである。
Fig. 1 is a flowchart showing an embodiment of the present invention, Fig. 2 is an explanatory diagram for specifically explaining the present invention, Fig. 3 is a schematic diagram of a system to which the present invention is applied, and Fig. 4 is its operation. 2 is a schematic flowchart for explaining.

まず、第3図から説明する. 同図において、1は文書画像を光学的走査により画像メ
モリへ人力するスキャナ、2は文書画像からl文字ずつ
の画像を取り出し認識する認識部および誤読文字の修正
を行なう修正部からなる文字認識装置、3は認識結果を
表示.i認するディスプレイ、4は誤読文字の指示等を
行なうキーボードである. その動作は第4図の如く、ステフブ1で文書全体の画像
を取り込み、ステップ2で1文字毎の画像に分割する。
First, let's explain from Figure 3. In the figure, 1 is a scanner that manually inputs a document image into an image memory by optical scanning, 2 is a character recognition device consisting of a recognition unit that extracts and recognizes images of l characters from the document image, and a correction unit that corrects misread characters. , 3 displays the recognition results. 4 is a keyboard for indicating misread characters, etc. The operation is as shown in FIG. 4, in step 1 an image of the entire document is captured, and in step 2 it is divided into images for each character.

そして、ステップ3で文字パターンの辞書と比較し、そ
の結果として特徴間距離の小さいN文字が出力される。
Then, in step 3, a comparison is made with a dictionary of character patterns, and as a result, N characters with small inter-feature distances are output.

さらに、ステップ4で本発明の特徴となる誤読文字の修
正を行ない、ステップ5で認識結果を出力する。
Furthermore, in step 4, misread characters, which are a feature of the present invention, are corrected, and in step 5, the recognition results are output.

以下、第1図,第2図を参照して本発明を説明する. まず、誤読文字を修正して誤読の内容とその前後の文字
列とを記憶しておき、以後の文字読み取り結果の中から
誤読文字と同一の文字列を探索する.そして、前後の文
字のうち、どちらかが一致する場合は自動的に修正を施
す. 第2図fatは誤読文字を含む、文字認識装置の読み取
り結果例を示す。この例では「一」(長音)を、「一」
(漢字)に誤読しているところが4ケ所ある(下線付の
文字参照).そして、そのうちの3ケ所が「メーカ」と
いう文字列に現われている。したがって、最初の「一」
(第2図(b)の■の個所)でこれを長音「一」に修正
すると、前後の文字列「メーカ」の中の「一」が第2図
山)の■,■の如<「一」へ自動的に修正される.ただ
し、修正したことを示す何らかの目印、例えば網かけ等
により表示し、オペレータのチェックを促す.また、前
後の文字列が一致しなかった「一」は「一層」の「一」
や「ディーラ」の「一」(第2図(C)の■,■参照)
のように2ケ所あるが、それらに対してはカーソルを「
一」の文字に移動させて第2図(e)に示すように、1
:正解,2:修正,3:候補選択なる指示を表示し、オ
ペレータの判断を求める。ここで、1:正解が選択され
た場合はそのまま、2:修正が選択されれば自動修正と
同様に修正を行なう。また、3:候補選択が選ばれると
、通常どおり候補文字中から正解を選択することができ
る。
The present invention will be explained below with reference to FIGS. 1 and 2. First, the system corrects the misread character, memorizes the content of the misread character and the character strings before and after it, and then searches for the same character string as the misread character among the subsequent character reading results. Then, if either of the preceding or following characters matches, it will be automatically corrected. FIG. 2 fat shows an example of the reading results of the character recognition device, including misread characters. In this example, "ichi" (long sound) is
(Kanji) has four mispronunciations (see the underlined characters). Three of them appear in the character string "manufacturer". Therefore, the first "one"
(The part marked with ■ in Figure 2 (b)), if this is corrected to the long sound ``ichi'', the ``ichi'' in the character string ``manufacturer'' before and after it becomes 1” automatically. However, some kind of mark, such as shading, will be displayed to indicate that the correction has been made, to prompt the operator to check. In addition, "ichi" in which the preceding and following character strings do not match is "ichi" in "ichilayer".
or “one” in “dealer” (see ■, ■ in Figure 2 (C))
There are two locations like this, but for them, move the cursor to "
As shown in Figure 2(e),
: Correct answer, 2: Correction, 3: Candidate selection instructions are displayed and the operator's judgment is requested. Here, if 1: Correct answer is selected, the answer is left unchanged, and if 2: Correction is selected, correction is made in the same way as automatic correction. Moreover, when 3: Candidate selection is selected, the correct answer can be selected from among the candidate characters as usual.

この例のように、誤読文字が単語の真中にある場合は前
後の文字列が一致するが、実際には誤読文字が単語の先
頭.末尾または活用語尾にある等さまざまなケースが考
えられる.したがって、誤読文字と同じ文字の前か後の
どちらか1文字が、修正を施した誤読文字と一致した場
合は自動修正を行なうものとする。これは、自動修正さ
れた場合は網かけ等で表示されるので、それが万一間違
っていても発見が容易だからである。
As in this example, if the misread character is in the middle of a word, the surrounding characters will match, but in reality the misread character is at the beginning of the word. Various cases are possible, such as at the end or at the end of a conjugated word. Therefore, if either one character before or after the same character as the misread character matches the corrected misread character, automatic correction is performed. This is because if it is automatically corrected, it will be displayed with shading, so even if it is wrong, it will be easy to find.

〔発明の効果〕〔Effect of the invention〕

本発明によれば、文字認識装置による誤読文字の修正段
階において、おなし誤読が多く発生する場合程効率的に
誤読文字修正が可能となる利点が得られる。
According to the present invention, in the stage of correcting misread characters by the character recognition device, there is an advantage that the more misreadings occur, the more efficiently misread characters can be corrected.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の実施例を示すフローチャート、第2図
は本発明を具体的に説明するための説明図、第3図は本
発明が適用されるシステム概要図、第4図はその動作を
説明するための概要フローチャートである。 符号説明 l・・・スキャナ、2・・・文字認識装置、3・・・デ
ィスプレイ、4・・・キーボード。
Fig. 1 is a flowchart showing an embodiment of the present invention, Fig. 2 is an explanatory diagram for specifically explaining the present invention, Fig. 3 is a schematic diagram of a system to which the present invention is applied, and Fig. 4 is its operation. 2 is a schematic flowchart for explaining. Code explanation 1...Scanner, 2...Character recognition device, 3...Display, 4...Keyboard.

Claims (1)

【特許請求の範囲】 1)入力された文字を認識し出力する文字認識装置で誤
読された文字を修正するに当たり、まず誤読文字を修正
して誤読の内容とその前後の文字列を記憶しておき、以
後の読み取り結果の中から誤読文字と同じ文字列を探索
し、 その結果同じ文字列とみなされたものの前後の文字のい
ずれか一方が一致するときは自動的に修正を施し、前後
の文字が一致しないものはその場所を示し、その文字が
正解か、または自動修正と同様の修正が必要か、もしく
は候補文字選択が必要かを表示して対応する操作を促す
ことを特徴とする誤読文字の修正方法。
[Claims] 1) When correcting a character that is misread by a character recognition device that recognizes and outputs input characters, first correct the misread character and memorize the content of the misread and the character strings before and after the misread character. search for the same character string as the misread character from the subsequent reading results, and if one of the characters before or after matches the same character string, it will automatically correct it and Misreading characterized by indicating the location of characters that do not match, and prompting the corresponding operation by displaying whether the character is correct, whether correction similar to automatic correction is required, or whether candidate character selection is required. How to correct text.
JP1159749A 1989-06-23 1989-06-23 How to correct misread characters Expired - Lifetime JP2669897B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1159749A JP2669897B2 (en) 1989-06-23 1989-06-23 How to correct misread characters

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1159749A JP2669897B2 (en) 1989-06-23 1989-06-23 How to correct misread characters

Publications (2)

Publication Number Publication Date
JPH0325691A true JPH0325691A (en) 1991-02-04
JP2669897B2 JP2669897B2 (en) 1997-10-29

Family

ID=15700426

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1159749A Expired - Lifetime JP2669897B2 (en) 1989-06-23 1989-06-23 How to correct misread characters

Country Status (1)

Country Link
JP (1) JP2669897B2 (en)

Also Published As

Publication number Publication date
JP2669897B2 (en) 1997-10-29

Similar Documents

Publication Publication Date Title
JPH11110480A (en) Method and device for displaying text
JPH0325691A (en) Correcting method for misread character
JPH05274467A (en) Data input device
JP2870375B2 (en) Sentence correction device
JP3221968B2 (en) Character recognition device
JP2002207960A (en) Method and program for recognized character correction
JPH0319590B2 (en)
JP3071745B2 (en) Post-processing method of character recognition result
JP2001283156A (en) Device and method for recognizing address and computer readable recording medium stored with program for allowing computer to execute the same method
JP2936761B2 (en) Proofreading device for Japanese documents
JPH05120472A (en) Character recognizing device
JPH0612520A (en) Confirming and correcting system for character recognizing device
JPH053631B2 (en)
JPH0484261A (en) Error notation retrieval system
JPH01134584A (en) Device for recognizing character
JP2907947B2 (en) Optical character reading system
JP2002123815A (en) Filing device
JPS6398788A (en) Recognizing device
JPH11143993A (en) Recognized character correction device and its method
JPH04268986A (en) Character recognizing device
JPH0793424A (en) Document input device
JPH0785050A (en) Automatic correction system
JPS62164181A (en) Character recognizing device
JPH0338788A (en) Character recognition processor
JPS60128579A (en) Information recognition system