JPS6115285A - Optical character reader - Google Patents

Optical character reader

Info

Publication number
JPS6115285A
JPS6115285A JP59134624A JP13462484A JPS6115285A JP S6115285 A JPS6115285 A JP S6115285A JP 59134624 A JP59134624 A JP 59134624A JP 13462484 A JP13462484 A JP 13462484A JP S6115285 A JPS6115285 A JP S6115285A
Authority
JP
Japan
Prior art keywords
character
characters
frame
character frame
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP59134624A
Other languages
Japanese (ja)
Inventor
Keiichi Aoyama
恵一 青山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp filed Critical Toshiba Corp
Priority to JP59134624A priority Critical patent/JPS6115285A/en
Publication of JPS6115285A publication Critical patent/JPS6115285A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To secure adaptation between a character frame and a printed form by means of the ink of a color other than the drop-out colors, by forming the character frame and the characters within the frame into separate patterns and displaying the recognition unable characters in the images synthesized with the character frame. CONSTITUTION:The image of characters and a character frame held by a line buffer 13 are separated from each other by a detection/separation circuit 14 and divided into the character image and the character frame image. Then the separating and recognizing jobs are performed every character to reject the recognition unable characters. The dot pitch of the display color is changed for the character frame for discriminating from characters. Thus the read characters are displayed at the upper part of a screen of a CRT20. While a pattern of the rejected character is synthesized with the character frame and displayed at the lower part of the CRT screen.

Description

【発明の詳細な説明】 〔発明の技術分野〕 本発明は光学的文字読取装置(OCR)に関し、特に認
識不能なりジエクト文字の表示の改良に関するものであ
る。
DETAILED DESCRIPTION OF THE INVENTION [Technical Field of the Invention] The present invention relates to an optical character reader (OCR), and more particularly to an improvement in the display of unrecognizable characters.

〔発明の技術的背景〕[Technical background of the invention]

OCRで読取るための片仮名、平仮名、数字、記号等の
文字を記入するOCR用の帳票には、一般に文字の大き
さ、ピッチ等を制限するための文字枠があらかじめ印刷
されている。この文字枠はドロップアウトカラーと呼ば
れる特殊なインクで印刷されており、 OCRが文字を
読取る際には文字枠と帳票の白地部分が光学的に同等と
なるようになっている。
OCR forms in which characters such as katakana, hiragana, numbers, and symbols are written to be read by OCR generally have character frames printed in advance to limit the size, pitch, etc. of the characters. This character frame is printed with a special ink called dropout color, so that when the OCR reads the characters, the character frame and the white part of the form are optically equivalent.

添付図面の第6図はOCR用帳票の説明図である。なお
、以下の図面の説明において同一要素は同一符号で示し
である。帳W1にはドロップアウトカラーで文字枠2が
印刷されており、この中には図示の如< ”1” 、”
3” 、”5” 、”7”の文字(数字)が記入されて
いる。ここで、文字11111、“13+1.1“7″
に関しては正しく記入されているが、文字′″5″は正
しく記入されていない(認識不能)ものとする。
FIG. 6 of the accompanying drawings is an explanatory diagram of an OCR form. In addition, in the following description of the drawings, the same elements are indicated by the same reference numerals. Book W1 has a character frame 2 printed in dropout color, and within this is the characters <``1'' and `` as shown in the illustration.
The characters (numbers) ``3'', ``5'', and ``7'' are entered.Here, the characters 11111, ``13+1.1``7''
It is assumed that the character ``5'' has been correctly filled in, but the character ``5'' has not been correctly filled in (unrecognizable).

第7図は第6図の帳票の文字をOCRで読取ったときの
表示装置の画面の説明図である。画面3の左上部には帳
票1から読取られ認識された文字″’ 1 ” 、  
“°3゛″、“7″が表示され、認識不能(リジェクト
)の文字115 I+については記号゛?°゛を表示し
てリジェクト文字であることを示すと共に、リジェクト
した文字のパターンを符号4で示すように表示している
FIG. 7 is an explanatory diagram of the screen of the display device when the characters on the form shown in FIG. 6 are read by OCR. At the top left of screen 3 are the characters read and recognized from form 1, ``' 1 ''.
“°3゛” and “7” are displayed, and the unrecognized (reject) character 115. For I+, the symbol “?” is displayed. The symbol ``°'' is displayed to indicate that the character is a rejected character, and the pattern of the rejected character is also displayed as shown by the reference numeral 4.

(背景技術の問題点) このように従来は、帳票に記入された文字等のイメージ
のみを抜き出してOCRに取込み、lx処理およびリジ
ェクト処理を施すようにしていたので、文字枠はドロッ
プアウトカラーで印刷しな番プればならなかった。この
ためOCR用の帳票は一般の用紙印刷にように手軽には
印刷できず、また印刷単価が高いという欠点が生じてい
た。
(Problems with the background technology) In this way, conventionally, only the image of the characters written on the form was extracted, imported into OCR, and subjected to lx processing and reject processing, so the character frame was set in a dropout color. I had to print it out. For this reason, forms for OCR cannot be printed as easily as printing on general paper, and the printing unit cost is high.

〔発明の目的〕 本発明は上記の従来技術の欠点を克服づるためになされ
たもので、ドロップアウトカラー以外のインクによって
文字枠が印刷された帳票に適応する光学的文字読取装置
(OCR)を提供することを目的とする。
[Object of the Invention] The present invention has been made to overcome the above-mentioned drawbacks of the prior art, and provides an optical character reader (OCR) that is applicable to documents whose character frames are printed with ink other than dropout colors. The purpose is to provide.

〔発明の概要〕[Summary of the invention]

上記の目的を達成するため本発明は、文字枠および枠内
の文字を光電変換しパターン情報を出ツノする光電変換
手段と、このパターン情報から文字枠と文字とを分離す
る分離手段と、分離された文字パターンを認識し、認識
不能な文字に対して文字パターンと文字枠パターンとを
表示する手段とを偏える光学的文字読取装置を提供でる
もので・ある。
In order to achieve the above object, the present invention provides a photoelectric conversion means for photoelectrically converting a character frame and characters within the frame to output pattern information, a separating means for separating the character frame and characters from this pattern information, and a separating means for separating the character frame and characters from the pattern information. The present invention provides an optical character reading device that recognizes a character pattern that has been written and biases the display of a character pattern and a character frame pattern toward unrecognizable characters.

(発明の実施例) 以下、添付図面の第1図乃至第5図を参照して本発明の
一実施例を説明づる。第1図は同実施例の構成図である
。帳票上の文字および文字枠のイメージは光電変換回路
11に取り込まれて電気信号に変換され、前処理回路1
2で雑8の除去、イメージの整形等がなされる。ライン
バッファ13は1ラインごとの文字および文字枠をイメ
ージとして保持するもので、検出切出し回路14はライ
ンバッファ13のイメージを文字と文字枠に分離すると
共に、1ラインの文字から1文字づつを切出し分離する
。切出し分離された文字は正規化回路15で正規化され
る。万なわち、文字の大きさが揃えられ、文字枠の中心
位置に移される。認識回路16は正規化された文字を辞
書メモリ17の標準パターンを参考にして認識し、認識
不能な文字を判別する。
(Embodiment of the Invention) Hereinafter, an embodiment of the present invention will be described with reference to FIGS. 1 to 5 of the accompanying drawings. FIG. 1 is a block diagram of the same embodiment. Images of characters and character frames on the form are taken into the photoelectric conversion circuit 11 and converted into electrical signals, and then sent to the preprocessing circuit 1.
In step 2, the removal of miscellaneous parts, image shaping, etc. are performed. The line buffer 13 holds characters and character frames for each line as images, and the detection and extraction circuit 14 separates the image in the line buffer 13 into characters and character frames, and also extracts one character from each line of characters. To separate. The extracted and separated characters are normalized by a normalization circuit 15. In other words, the characters are aligned in size and moved to the center of the character frame. The recognition circuit 16 recognizes the normalized characters with reference to the standard patterns in the dictionary memory 17, and discriminates unrecognizable characters.

編集回路18は認識回路16で認識不能か否か判別され
た文字と、検出切出し回路14で分離された文字枠と、
KEYボード19からの入力等にもとづいて編集処理を
し、結果をCRT20で表示出力覆る。また、インター
フェース回路21を介して通信回線から外部に送信した
り、磁気テープ(MT)22やフロッピーディスク(F
D)23に記憶させたりづ−る。
The editing circuit 18 separates the characters determined to be unrecognizable by the recognition circuit 16 and the character frames separated by the detection cutting circuit 14.
Editing is performed based on the input from the KEY board 19, and the results are displayed and output on the CRT 20. It can also be sent to the outside from a communication line via the interface circuit 21, or can be sent to the outside via a magnetic tape (MT) 22 or floppy disk (F).
D) Let 23 memorize it.

第2図は第1図に示す実施例で用いる帳票の説明図であ
る。帳票1に印刷された文字枠2はドロップアラ1〜カ
ラーではなく、通常のインク(イメージとしてOCRに
取り込まれつるインク)で印刷されている。
FIG. 2 is an explanatory diagram of a form used in the embodiment shown in FIG. The character frame 2 printed on the form 1 is not printed in drop color, but in normal ink (ink that is captured by OCR as an image).

第3図は第2図に示す帳票の文字および文字枠イメージ
の説明図である。第1図に示1装置はこのイメージに対
して検出切出し、認識等の処理をし、第4図に示(よう
な表示をCRT20にて行う。ここで文字゛6″はIl
l不能であるとしてリジェクトされた文字である。
FIG. 3 is an explanatory diagram of characters and character frame images of the form shown in FIG. 2. The device 1 shown in FIG. 1 performs processing such as detection, cropping and recognition on this image, and displays the image as shown in FIG. 4 on the CRT 20. Here, the character "6" is
This is a character that was rejected as being impossible.

第5図は第1図に示す実施例によって第3図に示すイメ
ージを処理する動作の説明図である。ブロック101に
示す如くラインバッファ13に保持された文字および文
字枠のイメージは、検出切出し回路14で分離されて文
字のイメージ(ブロック102)と文字枠のイメージ(
ブロック103)とになる。この分離作業は文字枠があ
らかじめ定められた大きさとピッチで並べられているこ
とを利用すれば、比較的容易にt7うことができる。
FIG. 5 is an explanatory diagram of the operation of processing the image shown in FIG. 3 by the embodiment shown in FIG. As shown in block 101, the character and character frame images held in the line buffer 13 are separated by the detection cutout circuit 14 into a character image (block 102) and a character frame image (block 102).
block 103). This separation work can be performed relatively easily at t7 by utilizing the fact that the character frames are arranged at a predetermined size and pitch.

次いで、文字ごとの分離と[作業が実行され、認識不能
な文字(第5図の例では3番[lの文字” 6 ” )
はリジェクトされる(ブロック104)。
Next, character-by-character separation and [operation are performed, and unrecognized characters (in the example of Figure 5, number 3 [l character "6")]
is rejected (block 104).

また文字枠に関しては、文字との見分けをつけるために
ドツトピッチを粗くしたり表示色を変えたりするなどの
処理がされる(ブロック105)。
Regarding the character frame, processing is performed such as making the dot pitch coarser and changing the display color in order to distinguish it from the characters (block 105).

このようにしてCRTの画面3の上部には読取られた文
字が表示され、リジェクトされた文字のパターンと文字
枠が画面3のF部に合成されて表示される(ブロック1
05)。
In this way, the read characters are displayed at the top of the CRT screen 3, and the rejected character pattern and character frame are composited and displayed on the F section of the screen 3 (block 1
05).

このようにリジェクト表示の際に文字と文字枠のイメー
ジを合成して表示することができるので、字形の類似し
た文字等(例えばアポストロフィとカンマ)も文字枠と
の位置関係により容易に判別できる。また、文字枠も表
示Jることで文字の判別が難しくなることがないように
、文字と文字枠の表示方法を変えることもできる。
In this way, when displaying a reject, the image of the character and the character frame can be combined and displayed, so that characters with similar shapes (for example, an apostrophe and a comma) can be easily distinguished based on their positional relationship with the character frame. Furthermore, the display method of characters and character frames can be changed so that character frames are also displayed so that it does not become difficult to distinguish between characters.

〔発明の効果〕〔Effect of the invention〕

上記の如く本発明によれば、文字枠と枠内の文字等をイ
メージとして取込み、取込んだイメージを文字枠パター
ンと文字パターンに分離してこの文字の認識処理をし、
リジェクトされた文字の表示を文字枠ど合成したイメー
ジで行うようにしたので、リジェクトされた文字パター
ンを文字枠パターンとの位置関係からも判断できるので
、オペレータによるリジエク1−文字の認識および訂正
等が容易で、かつドロップアウトカラー以外のインクに
よって文字枠が印刷された帳票に適応する光学的文字読
取装置を得ることがでる。
As described above, according to the present invention, a character frame and characters within the frame are captured as an image, the captured image is separated into a character frame pattern and a character pattern, and the characters are recognized.
Rejected characters are displayed as a composite image of the character frame, so the rejected character pattern can also be determined from the positional relationship with the character frame pattern, allowing operators to perform rejig 1 - character recognition and correction, etc. It is possible to obtain an optical character reading device that is easy to read, and is applicable to documents whose character frames are printed with ink other than dropout colors.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の一実施例の構成図、第2図は第1図に
示す実施例で用いる軽輩の説明図、第3図は第2図に示
す帳票のイメージの説明図、第4図は第2図に示す軽輩
を第1図に示づ実施例で処理したときのCRT画面の説
明図、第5図は第1図に示す実施例のイメージ処理動作
の説明図、第6図は従来のOCR用帳票の説明図、第7
図は第6図に示す帳票を従来装置で処理したときの表示
画面の説明図である。 1・・・帳票、2・・・文字枠、3・・・画面。 駅 区     区 [F]ト 錬     緘
FIG. 1 is a block diagram of an embodiment of the present invention, FIG. 2 is an explanatory diagram of a junior member used in the embodiment shown in FIG. 1, FIG. 3 is an explanatory diagram of an image of the form shown in FIG. 2, and FIG. 4 is an explanatory diagram of the CRT screen when the light truck shown in FIG. 2 is processed by the embodiment shown in FIG. 1, FIG. 5 is an explanatory diagram of the image processing operation of the embodiment shown in FIG. Figure 6 is an explanatory diagram of a conventional OCR form, Figure 7
The figure is an explanatory diagram of a display screen when the form shown in FIG. 6 is processed by a conventional device. 1...Form, 2...Character frame, 3...Screen. Station Ward Ward [F] Toren Tan

Claims (1)

【特許請求の範囲】 帳票にあらかじめ印刷された文字枠およびこの文字枠内
に記録された文字を光電変換しパターン情報を出力する
光電変換手段と、 この出力されたパターン情報を文字パターンと文字枠パ
ターンに分離する分離手段と、 この分離された文字パターンを認識する文字認識手段と
、 この文字認識手段が認識不能な文字に対してその文字パ
ターンとその文字パターンに対応する文字枠パターンと
を表示する表示手段とを具備することを特徴とする光学
的文字読取装置。
[Claims] A photoelectric conversion means for photoelectrically converting a character frame preprinted on a form and characters recorded in the character frame to output pattern information; Separation means for separating into patterns; Character recognition means for recognizing the separated character patterns; and Displaying, for characters that the character recognition means cannot recognize, the character pattern and a character frame pattern corresponding to the character pattern. What is claimed is: 1. An optical character reading device comprising display means for
JP59134624A 1984-06-29 1984-06-29 Optical character reader Pending JPS6115285A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP59134624A JPS6115285A (en) 1984-06-29 1984-06-29 Optical character reader

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP59134624A JPS6115285A (en) 1984-06-29 1984-06-29 Optical character reader

Publications (1)

Publication Number Publication Date
JPS6115285A true JPS6115285A (en) 1986-01-23

Family

ID=15132724

Family Applications (1)

Application Number Title Priority Date Filing Date
JP59134624A Pending JPS6115285A (en) 1984-06-29 1984-06-29 Optical character reader

Country Status (1)

Country Link
JP (1) JPS6115285A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0679462U (en) * 1993-04-28 1994-11-08 昌一 柴田 Rotating adhesive cleaner holder
US7440617B2 (en) 1997-12-19 2008-10-21 Canon Kabushiki Kaisha Communication system and control method thereof, and computer-readable memory

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0679462U (en) * 1993-04-28 1994-11-08 昌一 柴田 Rotating adhesive cleaner holder
US7440617B2 (en) 1997-12-19 2008-10-21 Canon Kabushiki Kaisha Communication system and control method thereof, and computer-readable memory

Similar Documents

Publication Publication Date Title
EP1052593B1 (en) Form search apparatus and method
JPS5837779A (en) Document processor
JPS6115285A (en) Optical character reader
JPH0373914B2 (en)
CN204856534U (en) System of looking that helps is read to low eyesight based on OCR and TTS
JP3955467B2 (en) Image processing program and image processing apparatus
JP3276555B2 (en) Format recognition device and character reader
JPH0991371A (en) Character display device
US6678427B1 (en) Document identification registration system
JPH0749557Y2 (en) Character recognition device
JPH0498586A (en) Image forming device
JP2924356B2 (en) Optical character reader
JPH08202824A (en) Document picture recognition device
JP2001307020A (en) Character recognizing device
JP2922356B2 (en) Character recognition processor
JPS6111888A (en) Document reader
JPS61198378A (en) Optical character reader
JPH08129608A (en) Character recognition device
JP2570571B2 (en) Optical character reader
JPH0554178A (en) Character recognizing device and slip for correction
JPS61198375A (en) Optical character reader
JPH0789361B2 (en) Form registration device
JPS60160486A (en) Optical character reader
JPH06243285A (en) Character recognition method
JPH0475186A (en) Character reader