JPS59206987A - Letter recognizing device - Google Patents

Letter recognizing device

Info

Publication number
JPS59206987A
JPS59206987A JP58081474A JP8147483A JPS59206987A JP S59206987 A JPS59206987 A JP S59206987A JP 58081474 A JP58081474 A JP 58081474A JP 8147483 A JP8147483 A JP 8147483A JP S59206987 A JPS59206987 A JP S59206987A
Authority
JP
Japan
Prior art keywords
character
letter
segmenting
pattern
line
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP58081474A
Other languages
Japanese (ja)
Other versions
JPH039506B2 (en
Inventor
Katsuhiko Furuya
古屋 勝彦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp filed Critical Toshiba Corp
Priority to JP58081474A priority Critical patent/JPS59206987A/en
Publication of JPS59206987A publication Critical patent/JPS59206987A/en
Publication of JPH039506B2 publication Critical patent/JPH039506B2/ja
Granted legal-status Critical Current

Links

Landscapes

  • Character Input (AREA)
  • Character Discrimination (AREA)

Abstract

PURPOSE:To improve accuracy of letter segmenting even when fluctuation occurs in carrying speed etc. by correcting position of segmenting of other letter line according to the actual position of segmenting of letter segmenting of a continuous letter line. CONSTITUTION:A control section 4 receives reading control information (FC) from an external control device 5. A sheet of form is fed and reading pattern is stored in a buffer memory (BM) 2. Address of each letter position of a letter line to be recognized is calculated from the FC, and converted to addresses CA1, CA2-CAn in the BM2. The BM2 is scanned from address a specified value (e) prior to reading start position MS1 given by the FC, and detection and segmenting operation of letters are started. In detecting and segmenting of letters, if a black part is detected by scanning of the BM2, it is judged whether the black part is a part of a letter or not. If it is a letter, segmenting the first letter is performed as the beginning of the first letter.

Description

【発明の詳細な説明】 〔発明の技術分野〕 本発明は文字認識装置に係り、特に帳票の搬送速度の変
動等があっても確実に文字認識をおこなうことのできる
文字認識装置に関する。
DETAILED DESCRIPTION OF THE INVENTION [Technical Field of the Invention] The present invention relates to a character recognition device, and more particularly to a character recognition device that can reliably perform character recognition even when there are fluctuations in the conveyance speed of documents.

〔発明の技術的背景とその問題点〕[Technical background of the invention and its problems]

従来の光学的文字認識装置においては帳票の搬送速度の
変動により、文字切出しおよび認aを誤る場合が少なく
なく、特に帳票を搬送面に対し垂直に保持し搬送をおこ
なうドキュメント型光学的文字認識装置では搬送技術上
の制約から搬送速度の変動が大きく問題であった。搬送
速度の変動が大きいと特に空白(ヌペーヌ)の切出しお
よび認識が困難であった。これは速度変動による白部分
の連続と、ヌペースによる白部分の連続とが区別しにく
いからである。
In conventional optical character recognition devices, there are many cases where characters are cut out and recognized incorrectly due to fluctuations in the conveyance speed of the document, especially in document-type optical character recognition devices that carry the document while holding it perpendicular to the conveyance surface. However, due to limitations in transport technology, fluctuations in transport speed were a major problem. When the conveyance speed fluctuates greatly, it is particularly difficult to cut out and recognize blank spaces (nupaine). This is because it is difficult to distinguish between a continuous white area due to speed fluctuations and a continuous white area due to NuPace.

〔発明の目的〕[Purpose of the invention]

本発明は、上記事情を考慮してなされたもので、帳票の
搬送速度の変動等があっても正しい文字切出しおよび認
識をすることができる文字認識装置を提供することを目
的とする。
The present invention has been made in consideration of the above circumstances, and it is an object of the present invention to provide a character recognition device that can correctly cut out and recognize characters even when there are fluctuations in the transport speed of a form.

〔発明の概要〕[Summary of the invention]

この目的を達成するため本発明による文字認識装置は、
帳票の読取パターンのうち途中に空白がない連続文字行
について1文字パターンずつ切出し、この切出し位置と
予め定められた仮切出し位置との平均位置誤差を求め、
池の文字行について予め定められた仮切出し位#を前記
平均位置誤差により修正して池の文字行のパターンを切
出すことを特徴とする。
To achieve this objective, the character recognition device according to the present invention comprises:
Cutting out one character pattern at a time for continuous character lines with no blank spaces in the reading pattern of the form, calculating the average positional error between this cutting position and a predetermined temporary cutting position,
The present invention is characterized in that the predetermined temporary cutting position # for the character line of the character line is corrected by the average position error to cut out the pattern of the character line of the character character line.

〔発明の実施例〕[Embodiments of the invention]

第1図に本発明の一実施例による文字認識装置を示す。 FIG. 1 shows a character recognition device according to an embodiment of the present invention.

光電変換部1では文字等が記載された帳票を読取り、2
@化する。2直化された読取パターンは一帳票分がバッ
ファメモリ2に記憶される。
The photoelectric conversion unit 1 reads a form containing characters, etc., and
Change to @. The converted reading pattern for one form is stored in the buffer memory 2.

制御部4はバッファメモリ2に記憶された読取パターン
を1文字ずつ切出す能、文字認識装置全体の制御をする
ものである。制御部4により切出された1文字ずつのパ
ターンは文字認識部8で認識される。
The control section 4 has the ability to cut out the reading pattern stored in the buffer memory 2 one character at a time and controls the entire character recognition device. The character recognition unit 8 recognizes the character-by-character pattern cut out by the control unit 4.

本実施例による文字認識装置は、複数行記載された帳票
を一度に読取り、認識するものであるが、この帳票は、
記載された複数行のうち少なくとも1行は途中に空白が
なく連続して文字等が記載されているものとする。ここ
では例えば帳票の第1行目にそのような連続文字行があ
るものとして説明する。
The character recognition device according to this embodiment reads and recognizes a form with multiple lines written at once.
It is assumed that at least one line out of the plurality of lines is written with continuous characters, etc., with no spaces in the middle. Here, explanation will be given assuming that such a continuous character line exists in the first line of the form, for example.

次にこの文字認識装置の動作を第2図、第8図により説
明する。まず外部制御装置5より帳票の文字行位置や読
取開始位置、字種等の読取制御情報(以下「FC」とい
う)を制御部4が受取る(ブロック11)。次に帳票を
1枚フィードして、第2図に示すようと帳票全体の読取
パターンをバッファメモリ四に格納する(ブロック12
)。次に制御部4によりバッファメモリ廉を走査して帳
票左端アドレyf検出する(ブロック13)。さらにF
Cより認識する文字行の各文字位置のアドレスを計算し
、先に求めた帳票左端アドレスを加え、バッファメモリ
2内のアドレスOA、 、 OA、 、・・・。
Next, the operation of this character recognition device will be explained with reference to FIGS. 2 and 8. First, the control unit 4 receives reading control information (hereinafter referred to as "FC") such as character line position, reading start position, character type, etc. of the form from the external control device 5 (block 11). Next, one sheet is fed, and the reading pattern of the entire sheet is stored in buffer memory 4 as shown in FIG. 2 (block 12).
). Next, the control unit 4 scans the buffer memory to detect the left end address yf of the form (block 13). Further F
Calculate the address of each character position in the character line to be recognized from C, add the left end address of the form obtained earlier, and obtain the addresses OA, , OA, , . . . in the buffer memory 2.

OAn (nは文字数)に変換する(ブロック14)。Convert to OAn (n is the number of characters) (block 14).

次にブロック15で認識すべき文字行が1行目か否か判
断し、1行目の場合はブロック16へ処理が移る。ブロ
ック16では、FCにより与えられた読取開始位tMs
、より所定fff e前のアドレスよりバッファメモリ
2′Ik走査し、文字の検出、切出し動作を開始する。
Next, in block 15, it is determined whether the character line to be recognized is the first line, and if it is the first line, the process moves to block 16. In block 16, the reading start position tMs given by the FC
, the buffer memory 2'Ik is scanned from an address a predetermined value earlier than fffe, and character detection and extraction operations are started.

この所定直eは帳票搬送速度誤差、帳票印刷精吐等によ
り決定される直である。文字の検出、切出し動作は、バ
ッファメモリ2を走査して黒部分を検出する七、その黒
部分が文字の1部であるか否か判定し、文字であれば第
1文字目の始1りとして第1文字目の切出しをおこなう
This predetermined straightness e is determined by a form conveyance speed error, form printing ejection, and the like. Character detection and extraction operations involve scanning the buffer memory 2 to detect a black part, determining whether the black part is part of a character, and if it is a character, starting from the beginning of the first character. The first character is extracted as follows.

2文字目以後も同様にして切出しをおこなう。その後、
実際に切出された各文字の中心位置のアドレスTAl 
# T’* I・・・、 Tl1kn  と先に計算で
求めたアドレスOk1.OA、、・・・、CAn との
誤差”1 * dl e ”’m dyIを求め、その
平均iinを次式により求める。
Cut out the second and subsequent characters in the same manner. after that,
Address TAL of the center position of each character actually cut out
# T'* I..., Tl1kn and the address Ok1. which was calculated earlier. The error "1*dl e "'m dyI with respect to OA, .

第2図の場合には計算で求めたアドレスOA、より実際
に切出されたアドレスTAiの方が右にずれていること
がわかる。次に平均@D’に所定のメモリに格納しくブ
ロック17)た後、切出された文字の認識をおこがう(
ブロック18)。次にブロック20で次行があるか否か
判断し、あればブロック13に処理を移し次行の文字認
識をおこなう。
In the case of FIG. 2, it can be seen that the calculated address OA and the actually extracted address TAi are shifted to the right. Next, after storing the average @D' in a predetermined memory (block 17), the extracted characters are recognized (
Block 18). Next, in block 20, it is determined whether there is a next line, and if so, the process moves to block 13 to perform character recognition on the next line.

次行の文字認識はまずFCに従って各文字位置の計算上
のアドレスOA 1f求め(ブロック13゜14)、1
行目ではないのでブロック18に処理を移しくブロック
15)、各文字位置のアドレスCL tに先に求めた平
均位置誤差Djjy−加えて修正し、文字の切出しをお
こなう。次にその切出した文字の認識をおこなう(ブロ
ック19)。さらに次行があればブロック13に処理を
移して(ブロック20)、上述した処理を繰り返すが、
その帳票のすべての文字行について認識を終了している
場合はブロック12に処理を移して(ブロック20)、
新たな帳票について以上の処理を繰り返す。
To recognize the next line of characters, first calculate the calculated address OA 1f for each character position according to the FC (blocks 13 and 14).
Since it is not the line, the processing is moved to block 18 (block 15), where the previously determined average position error Djjy- is added to the address CLt of each character position to correct it and cut out the character. Next, the cut out characters are recognized (block 19). If there is another line, the process moves to block 13 (block 20) and the above process is repeated.
If recognition has been completed for all character lines in the form, the process moves to block 12 (block 20),
Repeat the above process for a new form.

このように本実施例によれば、連続文字行の文字切出し
の実際の切出し位置に従って、池の文字行の切出し位置
を修正することとしているため、帳票の搬送速度等に変
動があっても正しく文字を切出すことができる。
In this way, according to the present embodiment, the cutting position of the continuous character line is corrected in accordance with the actual character cutting position of the continuous character line, so even if there is a change in the conveyance speed of the form, etc., the cutting position can be corrected. You can cut out characters.

なお、文字の検出、切出しについては、バッファメモリ
ヲ直接走査するのではなく、各文字行の垂直射影パター
ンによりおこなってもよい。
Note that character detection and extraction may be performed using the vertical projection pattern of each character line instead of directly scanning the buffer memory.

また、連続文字行が1行目にない場合は、その連続文字
行から文字の検出、切出しをおこ彦う必要があることは
いうまでもない。
Furthermore, if there is no continuous character line in the first line, it goes without saying that it is necessary to detect and extract characters from the continuous character line.

〔発明の効果〕〔Effect of the invention〕

以上の通り、本発明によれば、帳票の搬送速度の変動、
帳票における印字または手書文字の位置変動等があって
も、非常に尚い精吐で文字の切出し、認識をおこなうこ
とができる。特に空白(ヌペーヌ)の切出し、認識に有
効である。
As described above, according to the present invention, fluctuations in the conveyance speed of documents,
Even if there is a change in the position of printed or handwritten characters on a form, the characters can be cut out and recognized with very little effort. It is particularly effective for cutting out and recognizing blank spaces (nupaine).

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例による文字認識装置のブロッ
ク図、第2図は同装置の読取パターンの一具体例を示す
パターン図、第8図は同装置の動作を示すフローチャー
トである。 1・・・光電変換部、2・・・バッファメモリ、8・・
・文字認識部、4・・・制御部、5・・・外部制御装置
。 出願人代理人  猪 股   清
FIG. 1 is a block diagram of a character recognition device according to an embodiment of the present invention, FIG. 2 is a pattern diagram showing a specific example of a reading pattern of the device, and FIG. 8 is a flowchart showing the operation of the device. 1... Photoelectric conversion unit, 2... Buffer memory, 8...
-Character recognition unit, 4...control unit, 5...external control device. Applicant's agent Kiyoshi Inomata

Claims (1)

【特許請求の範囲】 少なくともひとつの文字行は途中に空白がない連続文字
行である複数の文字行が記録された帳票を読取る読取部
と、この読取部により読取られた帳票の読取パターンを
記憶する記憶部と、この記憶部に記憶された前記読取パ
ターンの各文字行のパターンを1文字ずつ切出す切出部
と、この切出部により切出された文字パターンを認識す
る認識部とを備えた文字認識装置において、 前記切出部は、前記読取パターンのうち前記連続文字行
について1文字パターンずつ切出し、この切出し位置と
予め定められた仮切出し位置との平均位置誤差を求め、
能の文字行について予め定められた仮切出し位1fを前
記平均位置誤差によシ修正して能の文字行のパターンを
切出すこと全特徴とする文字認識装置。
[Claims] A reading unit that reads a form in which at least one character line is a continuous character line with no spaces in between, and a reading pattern of the form read by the reading unit. a storage section for cutting out the pattern of each character line of the reading pattern stored in the storage section, a cutting section for cutting out the pattern of each character line of the reading pattern stored in the storage section, one character at a time, and a recognition section for recognizing the character pattern cut out by the cutting section. In the character recognition device equipped with the above, the cutting unit cuts out each character pattern from the continuous character line in the reading pattern, and calculates an average positional error between this cutting position and a predetermined temporary cutting position;
A character recognition device characterized in that a pattern of Noh character lines is cut out by correcting a predetermined temporary cutting position 1f for Noh character lines using the average position error.
JP58081474A 1983-05-10 1983-05-10 Letter recognizing device Granted JPS59206987A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58081474A JPS59206987A (en) 1983-05-10 1983-05-10 Letter recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58081474A JPS59206987A (en) 1983-05-10 1983-05-10 Letter recognizing device

Publications (2)

Publication Number Publication Date
JPS59206987A true JPS59206987A (en) 1984-11-22
JPH039506B2 JPH039506B2 (en) 1991-02-08

Family

ID=13747396

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58081474A Granted JPS59206987A (en) 1983-05-10 1983-05-10 Letter recognizing device

Country Status (1)

Country Link
JP (1) JPS59206987A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63123181A (en) * 1986-11-12 1988-05-26 Sanyo Electric Co Ltd Printed character recognition system
JPS6419488A (en) * 1987-07-15 1989-01-23 Nec Corp Type recognizing device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63123181A (en) * 1986-11-12 1988-05-26 Sanyo Electric Co Ltd Printed character recognition system
JPS6419488A (en) * 1987-07-15 1989-01-23 Nec Corp Type recognizing device

Also Published As

Publication number Publication date
JPH039506B2 (en) 1991-02-08

Similar Documents

Publication Publication Date Title
JPS58103266A (en) Character image processor
JPS59206987A (en) Letter recognizing device
JPH07230525A (en) Method for recognizing ruled line and method for processing table
JPH0340430B2 (en)
JPH0557632B2 (en)
JPH036552B2 (en)
JPH11250179A (en) Character reocognition device and its method
JPH0696273A (en) Recognition field retrieving method in business form reader
JP2963474B2 (en) Similar character identification method
JPH0782524B2 (en) Optical character reader
JPS6027083A (en) Optical character reader
JP3356819B2 (en) Mark recognition device
JPH01283682A (en) Character reader
JPH05174184A (en) Optical character reader
JPH04139593A (en) Postal code number recognition device
JPS6252687A (en) Character detecting and segmenting system for character reader
JPS5860381A (en) Skew detecting system
JPS62279482A (en) Character recognizing device
JPH01245376A (en) Character segmenting device for character reader
JPS61196382A (en) Character segmenting system
JPS5818773A (en) Character cutting out system
JPS63195783A (en) Character segmenting system
JPS62200490A (en) Optical character reader
JPH0437969A (en) Optical character reader
JPS59206989A (en) Letter segmenting device