JPH0250783A - Method for modifying recognition result in character recognizing device - Google Patents

Method for modifying recognition result in character recognizing device

Info

Publication number
JPH0250783A
JPH0250783A JP63202360A JP20236088A JPH0250783A JP H0250783 A JPH0250783 A JP H0250783A JP 63202360 A JP63202360 A JP 63202360A JP 20236088 A JP20236088 A JP 20236088A JP H0250783 A JPH0250783 A JP H0250783A
Authority
JP
Japan
Prior art keywords
word
image information
recognition result
coordinates
area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP63202360A
Other languages
Japanese (ja)
Inventor
Yoshihiro Kitamura
義弘 北村
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sharp Corp
Original Assignee
Sharp Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Corp filed Critical Sharp Corp
Priority to JP63202360A priority Critical patent/JPH0250783A/en
Publication of JPH0250783A publication Critical patent/JPH0250783A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To rapidly and easily execute modifying work by displaying the image information of a input character corresponding to the word of a recognition result based on a word coordinate from a recognizing part on a display scope, and making a position on the image information display of the word confirmable. CONSTITUTION:The word of a recognition result is stored at a word unit with cord information in a word storing area 18, and area coordinates (x1, y1) and (x2, y2) of the word segmented as one word are stored in a word coordinate storing area 19. The relation of the segmented word and the coordinate makes the coordinates of the front part upper edge and the rear part lower edge of a square area involving the segmented word as area coordinates. Based on word coordinates (x1, y1) and (x2, y2) in a word memory 14 of a recognizing part 1, since a storage result sentence and input image information are correspond-displayed and the both are indicated with making correspond, them to each other, the modification, etc. of the recognition result can be executed while watching the input image information.

Description

【発明の詳細な説明】 〈産業上の技術分野〉 この発明は文字認識装置における認識結果修正方法に関
する。
DETAILED DESCRIPTION OF THE INVENTION <Industrial Technical Field> The present invention relates to a recognition result correction method in a character recognition device.

く従来の技術〉 文書の文字情報をコンピュータ処理により認識する文字
認識装置として、認識しようとする文字情報、例えば英
数字を光電変換し、該光電変換された電気信号を1文字
単位で切り出し、認識部において所定の認識論理に従っ
て1文字ずつ認識を行う、光学式文字読取装fl(OC
R)が知られている。
Prior Art> A character recognition device that recognizes character information in a document by computer processing photoelectrically converts the character information to be recognized, such as alphanumeric characters, cuts out the photoelectrically converted electrical signals character by character, and performs recognition. The optical character reading system fl (OC
R) is known.

この種の文字認識装置において、従来は認識された文字
の正読率が低くて疑わしいと判定、いわゆるリジェクト
(否定)された場合、陰極線管(CRT)等を用いた表
示部にリジェクトされた文字のみが点滅又は反忙表示さ
れ、操作者は該表示を見ながら当該リジェクト文字を原
稿と照合して確認しつつキーボード等の修正手段を介し
てリジェクト文字の修正を行っていた。
In this type of character recognition device, conventionally, when a recognized character has a low correct reading rate and is judged to be suspicious, so-called rejected, the rejected character is displayed on a display unit using a cathode ray tube (CRT), etc. The operator corrects the rejected characters using a correction means such as a keyboard while checking the display and comparing the rejected characters with the original.

しかしながら、リジェクトされた文字が、例えばjfi
−Ill等の接触文字とか、「1liJ、[oO」等の
判別が回置な文字であるとか、文字行又は文字列の切り
出しエラーに起因する場合は、1文字のみのイメージ情
報によっては操作者は正しく判断することが困難でいち
いち原稿と照合しなければならず、修正作業に多大な手
間を要し、作業能率が良くないという欠点があった。
However, if the rejected characters are e.g.
- If the problem is due to contact characters such as Ill, inverted characters such as "1liJ, [oO", etc., or an error in cutting out a character line or string, the operator may It is difficult to judge correctly, and it has to be checked against the manuscript one by one, which requires a lot of time and effort to make corrections, which has the disadvantage of poor work efficiency.

、〈発明が解決しようとする問題点〉 この発明は上記欠点を解消して文字認識装置におけるa
識結果の修正を非常に能率的に行えるようにした認識結
果修正方法を提供することを目的とする。
, <Problems to be solved by the invention> This invention solves the above-mentioned drawbacks and improves the a
An object of the present invention is to provide a recognition result correction method that allows the recognition result to be corrected very efficiently.

く問題点を解決するための手段〉 上記目的を達成するために、この発明の認識結果修正方
法は、認識部からの単語座標をもとに認識結果の単語に
対応する入力文字のイメージ情報を同一画面上に表示し
、 前記単語のイメージ情報表示上での位置を確認出来るよ
うにしたことを特徴とするものである。
Means for Solving the Problems> In order to achieve the above object, the recognition result correction method of the present invention calculates image information of input characters corresponding to the word of the recognition result based on the word coordinates from the recognition unit. This is characterized in that the words are displayed on the same screen so that the position of the word on the image information display can be confirmed.

〈作用〉 制御部は入力部からの認識結果単語の指示情報に従って
、該単語に対応する画像メモリ内のイメージ情報部分を
他と区別して表示させる。これにより認識結果大表示と
入力イメージ情報との対応関係が一層明確になる。
<Operation> According to the instruction information of the recognition result word from the input unit, the control unit displays the image information portion in the image memory corresponding to the word in a manner that distinguishes it from others. This makes the correspondence between the large recognition result display and the input image information even clearer.

〈実施例〉 以下図面に基づいて本発明の詳細な説明する。<Example> The present invention will be described in detail below based on the drawings.

第1図は本発明に係る認識結果修正方法を適用できる文
字認識装置のブロック図を示す、1は認識部であり、4
は入力部、3は制御部及び2はB?a結果表示2aと画
像(イメージ情報)表示2bを有する表示部である。
FIG. 1 shows a block diagram of a character recognition device to which the recognition result correction method according to the present invention can be applied, 1 is a recognition unit, 4
is the input section, 3 is the control section, and 2 is B? This is a display unit having a result display 2a and an image (image information) display 2b.

認識部1は、第2図に示す通りイメージスキャナ10、
画像メモリ11、切り出し部12、認識部13及び単語
メモリ14がら成る。切り出し部12では単語間のスペ
ースを検出して単語の切り出しを行っており、該切り出
し情報は#112aを介して単語メモリ14に送られ単
語間区切り情報として利用される。単語の切り出しに関
しては同一出願人の出願(特願昭6l−310412)
に詳細に開示されているので説明は省略する。
The recognition unit 1 includes an image scanner 10, as shown in FIG.
It consists of an image memory 11, a cutting section 12, a recognition section 13, and a word memory 14. The cutting unit 12 detects spaces between words and cuts out words, and the cutting information is sent to the word memory 14 via #112a and used as inter-word delimiter information. Regarding word cutting, an application filed by the same applicant (Japanese Patent Application No. 6l-310412)
Since it is disclosed in detail in , the explanation will be omitted.

第3図(A)は認識部1中の単語メモリ14の記憶7オ
ーマツトの一構成例を示す図である。単語記憶領域18
、単語座標記憶領域19及びフラッグ記憶置載20から
成り、本実施例では、それぞれ50バイト、各16バイ
ト及び各1ビツトの記m容量を用いているが、これに限
定されるものではない。
FIG. 3(A) is a diagram showing an example of the structure of the word memory 14 in the recognition section 1 with seven storage formats. Word storage area 18
, a word coordinate storage area 19, and a flag storage area 20, and in this embodiment, the storage capacity is 50 bytes each, 16 bytes each, and 1 bit each, but the storage capacity is not limited to this.

単語記憶領域18には、認識結果の単語が単語単位にコ
ード情報で記憶されており、単語座標記憶領域19には
、1単語として切り出された単語の領域座標(xlwy
l)、(にLy2)が記憶されている。
In the word storage area 18, words resulting from recognition are stored in code information word by word, and in the word coordinate storage area 19, area coordinates (xlwy
l), (in Ly2) are stored.

切り出された単語と座標の関係は第3図(B)に示す通
りであり、切り出された単語を含む方形領域の前部上端
と後部下端の座標を領域座標としている。 フラッグ(
F)記憶領域20は、本実施例の場合、それぞれ1ビツ
トで成るリノエク)F20a、XペルチェツクF20b
、単語長F20c及1記号F20dを含んでいる。
The relationship between the cut out word and the coordinates is as shown in FIG. 3(B), and the coordinates of the front upper end and rear lower end of the rectangular area containing the cut out word are taken as area coordinates. Flag (
F) In the case of this embodiment, the memory area 20 is comprised of 1-bit memory area) F20a and X-pel check F20b.
, word length F20c and one symbol F20d.

リジェクトF 20gは、記憶部13で文字としてBn
でさなかった文字パターンを含む単語については「1」
となり、その他の場合は「0」となる。
Reject F 20g is written as Bn in the storage unit 13.
"1" for words containing character patterns that were not found.
In other cases, it is "0".

スペルチェックF20bは、単語単位に実施するスペル
チェック処理で失敗した単語については「1」、その他
は「0」に設定される。
The spell check F20b is set to "1" for words that fail in the spell check process performed on a word-by-word basis, and to "0" for other words.

単語長F20cは、切り出された単語長が所定文字数よ
り多い場合に「0」、少ない場合に「1」が設定される
。これは単語を摺成する文字数が多い場合は誤りにくい
という経験則に基づいて設けられている。記号F20d
は、認識単語が記号の場合に「1」、それ以外の場合に
「0」が設定される。
The word length F20c is set to "0" if the cut word length is greater than a predetermined number of characters, and is set to "1" if it is less. This is based on the empirical rule that when a word has a large number of characters, it is difficult to make mistakes. Symbol F20d
is set to "1" if the recognized word is a symbol, and "0" otherwise.

記号の場合はスペルチェックも効かず誤りやすいことに
起因する。
This is because spell checking is not effective in the case of symbols and they are prone to errors.

第5図にこの発明の動作フローを示す。まず第4図表示
部2の表示画面23に、切り出し等の前処理、文字単位
の認識処理及びスペルチェック等単語単位の認識確認処
理を終えた認識結果文23aと画像メモリ11に記憶さ
れている入力文字のイメージ情報23bが表示される(
Sl)、第4図では同一画面に両者を表示する場合を示
しているが、認識結果文とイメージ情報を別画面に表示
するようにしても良い。
FIG. 5 shows the operational flow of the present invention. First, on the display screen 23 of the display unit 2 in FIG. The image information 23b of the input characters is displayed (
SI), FIG. 4 shows a case where both are displayed on the same screen, but the recognition result sentence and the image information may be displayed on separate screens.

次いで認識結果文23a上の単語、例えば目視に上り認
識結果が誤っていると思われる単語を、入力部4上のカ
ーソル制御キー等で指定する(Sl)。
Next, a word on the recognition result sentence 23a, for example, a word whose recognition result appears to be incorrect upon visual inspection, is specified using the cursor control key on the input unit 4 (Sl).

すると指定単語に対応する単語メモリ17の単語座標記
憶領域19の指示された座標部分の入力文字のイメージ
情報を特定しくS3)、それが例えばm4図に示すよう
に囲い標n(ロ)k!?で囲まれて他と区別表示される
(S4)。
Then, the image information of the input character at the designated coordinate part of the word coordinate storage area 19 of the word memory 17 corresponding to the designated word is specified (S3), and it is, for example, marked n(b)k! as shown in figure m4. ? It is surrounded by and displayed to distinguish it from others (S4).

以上の実施例は、英単語を対象としているが、日本語文
についても同様に実施できる。尚この場合上記実施例に
おいて単語単位とあるのは語単位の取り扱いとなる。
Although the above embodiments are aimed at English words, they can be implemented similarly for Japanese sentences. In this case, what is referred to as word units in the above embodiments refers to word units.

く効 果〉 以上の説明から明らかな通り、本発明によれば認識部1
の乍語メモリ14中の単語座[(xi−yl)、(x2
+y2)をもとに、記憶結果文と入力イメージ情報を対
応表示すると共に両者を対応付けて指示できるようにし
たから、入力イメージ情報を見なからaa結果の修正等
ができるので、修正作業が迅速かつ容易に出来るように
なる。
Effect> As is clear from the above explanation, according to the present invention, the recognition unit 1
The word locus [(xi-yl), (x2
Based on +y2), we have made it possible to display the memorized result sentences and input image information in correspondence, and to give instructions by associating both, so it is possible to modify the aa results without looking at the input image information, so the correction work is easier. It can be done quickly and easily.

【図面の簡単な説明】[Brief explanation of the drawing]

!@1図は本発明の方法を適用できる文字認識装置のブ
ロック図、第2図は認識部のブロック図、第3図(A)
は単語メモリの記憶7オーマツトを示す図、第3図CB
)は単語と単語座標との関係を示す 図、第4図は本発明の表示例を示す図及びtj&5図は
本発明の動作フローを示す画である。 1:認識部、2:表示部、3:制御部、4:入力部、1
0:イメージスキャナ、11:画像メモリ、12:切り
出し部、13:認識処理部、14:単語メモリ、12a
:信号線、15:単語メモリ情報出力線、16:画像メ
モリ情報出力線、17:単語記憶7オーマツト、18:
単語記憶領域、19:単語座標記憶領域、20:フラッ
グ記憶頭載、23:表示画面、23a:認識結果文、2
3b二人力文字イメージ情報代理人 弁理士 杉 山 
毅 至(他1名)(A) 第 図 第 図
! @Figure 1 is a block diagram of a character recognition device to which the method of the present invention can be applied, Figure 2 is a block diagram of the recognition unit, and Figure 3 (A)
Figure 3 CB is a diagram showing the storage format of word memory.
) is a diagram showing the relationship between words and word coordinates, FIG. 4 is a diagram showing a display example of the present invention, and FIG. tj & 5 is a diagram showing the operation flow of the present invention. 1: Recognition unit, 2: Display unit, 3: Control unit, 4: Input unit, 1
0: Image scanner, 11: Image memory, 12: Cutting section, 13: Recognition processing section, 14: Word memory, 12a
: Signal line, 15: Word memory information output line, 16: Image memory information output line, 17: Word memory 7-ormat, 18:
Word storage area, 19: Word coordinate storage area, 20: Flag storage head, 23: Display screen, 23a: Recognition result sentence, 2
3b Two-person character image information agent Patent attorney Sugiyama
Itaru Tsuyoshi (1 other person) (A) Figure Figure

Claims (1)

【特許請求の範囲】 認識部において認識された結果を修正するにあたり、 認識部からの単語座標をもとに認識結果の単語に対応す
る入力文字のイメージ情報を表示画面上に表示し、 前記単語のイメージ情報表示上での位置を確認出来るよ
うにしたことを特徴とする文字認識装置における認識結
果修正方法。
[Scope of Claims] In correcting the result recognized by the recognition unit, image information of the input character corresponding to the word of the recognition result is displayed on the display screen based on the word coordinates from the recognition unit, and the word is displayed on the display screen. A method for correcting recognition results in a character recognition device, characterized in that the position of a character on an image information display can be confirmed.
JP63202360A 1988-08-12 1988-08-12 Method for modifying recognition result in character recognizing device Pending JPH0250783A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63202360A JPH0250783A (en) 1988-08-12 1988-08-12 Method for modifying recognition result in character recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63202360A JPH0250783A (en) 1988-08-12 1988-08-12 Method for modifying recognition result in character recognizing device

Publications (1)

Publication Number Publication Date
JPH0250783A true JPH0250783A (en) 1990-02-20

Family

ID=16456219

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63202360A Pending JPH0250783A (en) 1988-08-12 1988-08-12 Method for modifying recognition result in character recognizing device

Country Status (1)

Country Link
JP (1) JPH0250783A (en)

Similar Documents

Publication Publication Date Title
JP2835178B2 (en) Document reading device
JPH0250783A (en) Method for modifying recognition result in character recognizing device
JPH08329187A (en) Document reader
JPH117493A (en) Character recognition processor
JP3269889B2 (en) Optical character reading system
JPH08335248A (en) Document reader
JPH0388086A (en) Document reader
JPH0560876B2 (en)
JPH08272900A (en) Document reader
JP2586117B2 (en) Character recognition device
JPH01292586A (en) Back-up device for recognition of character
JPH0475184A (en) Input device
JPS6320584A (en) Document preparing device
JPH07192079A (en) Character recognition device
JPS6227867A (en) Picture data correcting system
JPH08153161A (en) Document image recognition device
JPS61198376A (en) Optical character reader
JP2002133367A (en) Character recognition device
JPH0258185A (en) Facsimile character recognition system
JPH01134584A (en) Device for recognizing character
JPS6267621A (en) Command input checking method
JPS63316189A (en) Optical character recognition device
JPS61198375A (en) Optical character reader
JPH08123896A (en) Handwritten character input device
JP2001222679A (en) Character read system