JPH0652351A - Optical character reader - Google Patents

Optical character reader

Info

Publication number
JPH0652351A
JPH0652351A JP4202809A JP20280992A JPH0652351A JP H0652351 A JPH0652351 A JP H0652351A JP 4202809 A JP4202809 A JP 4202809A JP 20280992 A JP20280992 A JP 20280992A JP H0652351 A JPH0652351 A JP H0652351A
Authority
JP
Japan
Prior art keywords
surrounding frame
unit
frame
description content
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP4202809A
Other languages
Japanese (ja)
Other versions
JP3006294B2 (en
Inventor
Kenji Ogawara
健志 大河原
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP4202809A priority Critical patent/JP3006294B2/en
Publication of JPH0652351A publication Critical patent/JPH0652351A/en
Application granted granted Critical
Publication of JP3006294B2 publication Critical patent/JP3006294B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Landscapes

  • Character Input (AREA)

Abstract

PURPOSE:To easily designate and manage a format of plip in the case of reading it by the optical character reader. CONSTITUTION:Data on the surface of the slip are optically acquired by an image input part 11, and a surrounding frame contained in the input image is detected by a surrounding frame detection part 12. Next, the class of the surrounding frame is judged by a surrounding frame identification part 13 and a recognition part 16 and described contents corresponding to the class of the surrounding frame are extracted from a described content group previously stored in a described content storage part 14. Finally, the recognition part 16 reads characters or images in the detected surrounding frame according to the extracted described contents.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は光学的文字読取装置に関
し、特に伝票等の帳票を読取る帳票読取用の光学的文字
読取装置に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an optical character reading device, and more particularly to an optical character reading device for reading a form such as a slip.

【0002】[0002]

【従来の技術】従来、帳票読取用の光学的文字読取装置
の使用においては、全額欄等の桁ズレ等に起因する誤読
防止,あるいは読取性能の向上のために、帳票上の各文
字の位置および文字種等の書式をあらかじめ記述してお
くのが一般的であった。これらの作業は繁雑で多くの時
間を要するものである。
2. Description of the Related Art Conventionally, in the use of an optical character reading device for reading a form, the position of each character on the form is prevented in order to prevent erroneous reading due to misalignment of the total amount column or the like or to improve reading performance. It was common to describe the format such as and character type in advance. These tasks are complicated and time consuming.

【0003】ところが、帳票の各項目の文字配置には一
定のパターンがあるため、帳票を新たに設計したり、変
更する場合、常に類似した書式を入力していることが多
い。
However, since there is a fixed pattern in the character layout of each item on the form, when a form is newly designed or changed, a similar format is often input.

【0004】[0004]

【発明が解決しようとする課題】従来の帳票読取用の光
学的文字読取装置では、帳票の新規作成あるいは変更に
よって帳票上の各文字の文字種や位置等の書式が変更と
なる場合、新たに書式を作成しなければならない。その
ため、1種類の帳票に1つの書式が必要となるので、帳
票の新規作成や変更を行うごとに書式の数が増え、その
管理方法が複雑になる。
In the conventional optical character reading apparatus for reading a form, when the form such as the character type and position of each character on the form is changed by newly creating or changing the form, a new form is newly created. Must be created. Therefore, one form is required for one type of form, and the number of forms increases each time a form is newly created or changed, and the management method becomes complicated.

【0005】さらに、同一帳票内に同一形式の項目がた
くさんある場合、すべての項目に対して同様な内容を繰
り返し指定しなければならず、書式指定が繁雑になると
いう問題点があった。
Further, when there are many items of the same format in the same form, the same content must be repeatedly specified for all the items, resulting in a complicated format specification.

【0006】なお、印刷文書や表の読取を対象として、
書式の自動解析が行なわれているが、通常の帳票に対し
て、その記載内容を正しく判別すること並びに文字を1
文字ずつ正しく抽出する性能が現段階では不十分である
という問題点があった。
[0006] Incidentally, for the purpose of reading printed documents and tables,
Although the format is automatically analyzed, it is necessary to correctly discriminate the contents of a normal form and set the characters to 1
There is a problem that the performance of extracting characters character by character is insufficient at this stage.

【0007】本発明の目的は上述した問題点を解決し、
事前に文字位置・文字種等の情報を与えておくことによ
り運用効率のよい帳票読取が可能となる光学的文字読取
装置を提供することにある。
The object of the present invention is to solve the above-mentioned problems,
An object of the present invention is to provide an optical character reading device capable of reading a form with high operational efficiency by giving information such as character position and character type in advance.

【0008】[0008]

【課題を解決するための手段】本発明の光学的文字読取
装置は、帳票などの囲み枠を含む紙面上のデータを光学
的に入力して入力画面を得る画像入力部と、前記入力画
像上の前記囲み枠の検知を行なう囲み枠検知部と、検知
された前記囲み枠の識別を行う囲み枠識別部と、前記囲
み枠の種別に対応した前記囲み枠内の文字や画像に関す
る文字種や文字位置情報を含む記載内容を記憶した記載
内容記憶部と、識別した前記囲み枠内の前記記載内容を
前記記載内容記憶部から読み出し抽出する記載内容抽出
部と、前記記載内容に応じて前記囲み枠内の文字あるい
は画像を抽出し読取処理を行なう認識部とを備えた構成
を有する。
An optical character reading apparatus according to the present invention comprises an image input section for optically inputting data on a sheet of paper including a surrounding frame such as a form to obtain an input screen, and an input image on the input image. Of the surrounding frame, a surrounding frame detecting unit for detecting the surrounding frame, a surrounding frame identifying unit for identifying the detected surrounding frame, and a character type or character relating to the characters or images in the surrounding frame corresponding to the type of the surrounding frame. A description content storage unit that stores description content including position information, a description content extraction unit that reads out and extracts the description content in the identified enclosure from the description content storage unit, and the enclosure frame according to the description content. And a recognition unit that extracts a character or an image in the image and performs a reading process.

【0009】[0009]

【実施例】次に、本発明について図面を参照して説明す
る。
DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, the present invention will be described with reference to the drawings.

【0010】図1は、本発明の一実施例のブロック図で
ある。図1に示す実施例は、帳票等の紙面上のデータを
光学的に撮像する画像入力部11と、画像入力部11で
取得した入力画像に含む囲み枠を検知する囲み枠検知部
12と、検知した囲み枠の種別の識別を行なう囲み枠識
別部13と、囲み枠の種別に対応した枠内記載内容を記
憶した記載内容記憶部14と、囲み枠識別部13で識別
した囲み枠の識別にもとづいて記載内容記憶部14から
囲み枠内の記載内容を抽出する記載内容抽出部15と、
記載内容の認識を行なう認識部16とを備えた構成を有
する。
FIG. 1 is a block diagram of an embodiment of the present invention. In the embodiment shown in FIG. 1, an image input unit 11 that optically captures data on a paper such as a form, a surrounding frame detection unit 12 that detects a surrounding frame included in an input image acquired by the image input unit 11, An enclosing frame identification unit 13 that identifies the type of the enclosing frame that has been detected, a description content storage unit 14 that stores in-frame description content that corresponds to the type of the enclosing frame, and an identification of the enclosing frame that is identified by the enclosing frame identification unit 13. A description content extraction unit 15 for extracting the description content in the box based on the description content storage unit 14;
And a recognition unit 16 that recognizes the described content.

【0011】次に、本実施例の動作について説明する。Next, the operation of this embodiment will be described.

【0012】帳票上のデータは、画像入力部11によっ
て光学的に取り込まれ、取り込まれた入力画像に囲み枠
が存在するかどうかが囲み枠検知部12によって検知さ
れる。
The data on the form is optically taken in by the image input section 11, and the enclosing frame detecting section 12 detects whether or not an enclosing frame exists in the input image taken in.

【0013】囲み枠が検知された場合には、その囲み枠
の種別を囲み枠識別部13によって判定する。
When the surrounding frame is detected, the type of the surrounding frame is determined by the surrounding frame identifying section 13.

【0014】囲み枠の種別が判定されると、その囲み枠
内に記述されている文字や画像に関する記載内容があら
かじめ格納されている記載内容記憶部14から、記載内
容を記載内容抽出部15が抽出し、抽出した記載内容に
対して認識部16が入力画像上の囲み枠内から文字ある
いは画像を抽出して読取による認識処理を行なう。
When the type of the enclosing frame is determined, the described content extraction unit 15 extracts the described content from the described content storage unit 14 in which the described contents regarding the characters and images described in the enclosed frame are stored in advance. The recognition unit 16 extracts a character or an image from the enclosed frame on the input image for the extracted description content, and performs recognition processing by reading.

【0015】図2は、本実施例において使用する帳票の
一例である。帳票5の紙面上には囲み枠21で読取領域
22が囲まれており、また囲み枠21の左上の隅に囲み
枠21の種別を判定するための識別枠23がある。
FIG. 2 shows an example of a form used in this embodiment. The reading area 22 is surrounded by a box 21 on the paper surface of the form 5, and an identification frame 23 for determining the type of the box 21 is provided at the upper left corner of the box 21.

【0016】その識別枠23には、記載内容記憶部14
に記憶されている囲み枠21に割り当てられた囲み枠識
別コード24たとえば「01」が記入されている。
In the identification frame 23, the description content storage unit 14
An enclosing frame identification code 24 such as “01” assigned to the enclosing frame 21 stored in FIG.

【0017】記載内容記憶部14には、囲み枠21の種
別枠23内に記入された囲み枠識別コードと、囲み枠識
別コード24が記入されている囲み枠21内の文字や画
像に関する文字種や位置情報といった記載内容情報が記
憶されている。
In the description content storage unit 14, the surrounding frame identification code entered in the type frame 23 of the surrounding frame 21 and the character type relating to the character or image in the surrounding frame 21 in which the surrounding frame identification code 24 is written, The description content information such as position information is stored.

【0018】囲み枠検知部12では、画像入力部11で
取得した入力画像上の囲み枠の位置を検知し、その位置
情報を囲み枠識別部13へ出力する。なお、囲み枠検知
部12における囲み枠の位置検知は、周知の技術を用い
て実現することができるのでここでは詳細な説明を省略
する。
The enclosing frame detector 12 detects the position of the enclosing frame on the input image acquired by the image input unit 11, and outputs the position information to the enclosing frame identifying unit 13. It should be noted that the position detection of the surrounding frame by the surrounding frame detection unit 12 can be realized by using a well-known technique, and thus detailed description thereof will be omitted here.

【0019】囲み枠識別部13は、囲み枠検知部12か
ら提供される囲み枠の位置情報にもとづき、囲み枠21
の識別枠23内に記述された囲み枠識別コード24を検
出する。このために、まず、囲み枠検知部12から入力
された格納位置にある囲み枠21から識別枠23を検出
する。次に識別枠23内の画像データを認識部16入力
し、画像データ内から囲み枠識別コード24を読み取ら
せ、その結果を認識部16から受ける。受け取った結果
は、囲み枠識別コード24として記載内容抽出部15へ
入力する。
The enclosing frame identifying section 13 is based on the position information of the enclosing frame provided from the enclosing frame detecting section 12 and is surrounded by the enclosing frame 21.
The enclosing frame identification code 24 described in the identification frame 23 is detected. For this purpose, first, the identification frame 23 is detected from the surrounding frame 21 at the storage position input from the surrounding frame detection unit 12. Next, the image data in the identification frame 23 is input to the recognition unit 16, the surrounding frame identification code 24 is read from the image data, and the result is received from the recognition unit 16. The received result is input to the description content extraction unit 15 as a box identification code 24.

【0020】本実施例では、囲み枠識別コード24を数
字で表現しているが、他の種類の文字であっても、また
バーコード等の符号であってもかまわない。また、帳票
内であればいずれの位置に設定してもかまわない。
In the present embodiment, the enclosing frame identification code 24 is expressed by numbers, but it may be a character of another type or a code such as a bar code. Further, it may be set at any position in the form.

【0021】記載内容抽出部15は、囲み枠識別部13
から入力された囲み枠識別コード24の指定内容を記載
内容記憶部14から読み出し、囲み枠識別コード24に
対応する記載内容情報を認識部16へ入力する。
The description content extraction unit 15 includes a surrounding frame identification unit 13
The specified contents of the surrounding frame identification code 24 input from the are read from the description contents storage unit 14, and the description contents information corresponding to the surrounding frame identification code 24 is input to the recognition unit 16.

【0022】認識部16は、前述した如く、囲み枠識別
部13が抽出した識別枠23内の画像データの文字読取
処理を行い、読取結果を囲み枠識別部13へ転送すると
ともに、記載内容抽出部15で抽出された記載内容に従
って入力画像上の囲み枠内の文字あるいは画像の読取処
理を行う。
As described above, the recognizing unit 16 performs a character reading process of the image data in the identification frame 23 extracted by the enclosing frame identifying unit 13, transfers the read result to the enclosing frame identifying unit 13, and extracts the description content. In accordance with the description content extracted by the unit 15, the character or image in the box on the input image is read.

【0023】[0023]

【発明の効果】以上説明したように本発明は、帳票上の
囲み枠位置を自動検知し、囲み枠内の文字や画像を抽出
する際に必要な書式を囲み枠を識別して設定することに
より、帳票の新規作成や変更によって新たに書式を作成
する必要がなく、書式管理が容易にでき、また同一帳票
内に同一形式の項目がたくさんある場合、1項目分の指
定ですむため、書式指定が簡単に行えるという効果があ
る。
As described above, according to the present invention, the position of the surrounding frame on the form is automatically detected, and the format required for extracting the characters and images in the surrounding frame is set by identifying the surrounding frame. Therefore, it is not necessary to create a new format by creating or changing a form, format management is easy, and if there are many items of the same format in the same form, one item can be specified. This has the effect of making it easy to specify.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の一実施例のブロック図である。FIG. 1 is a block diagram of an embodiment of the present invention.

【図2】図1の実施例で使用する帳票の一例を示す図で
ある。
FIG. 2 is a diagram showing an example of a form used in the embodiment of FIG.

【符号の説明】[Explanation of symbols]

11 画像入力部 12 囲み枠検知部 13 囲み枠識別部 14 記載内容記憶部 15 記載内容抽出部 16 認識部 21 囲み枠 22 読取領域 23 識別枠 24 囲み枠識別コード 11 Image Input Unit 12 Enclosed Frame Detection Unit 13 Enclosed Frame Identification Unit 14 Enclosed Content Storage Unit 15 Described Content Extraction Unit 16 Recognition Unit 21 Enclosed Frame 22 Reading Area 23 Identification Frame 24 Enclosed Frame Identification Code

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 帳票などの囲み枠を含む紙面上のデータ
を光学的に入力して入力画像を得る画像入力部と、前記
入力画像上の前記囲み枠の検知を行なう囲み枠検知部
と、検知された前記囲み枠の識別を行う囲み枠識別部
と、前記囲み枠の種別に対応した前記囲み枠内の文字や
画像に関する文字種や文字位置情報を含む記載内容を記
憶した記載内容記憶部と、識別した前記囲み枠内の前記
記載内容を前記記載内容記憶部から読み出し抽出する記
載内容抽出部と、前記記載内容に応じて前記囲み枠内の
文字あるいは画像を抽出し読取処理を行なう認識部とを
備えることを特徴とする光学的文字読取装置。
1. An image input unit for optically inputting data on a sheet including a surrounding frame such as a form to obtain an input image, and a surrounding frame detecting unit for detecting the surrounding frame on the input image. A surrounding frame identification unit that identifies the detected surrounding frame, and a description content storage unit that stores description content including character types and character position information regarding characters and images in the surrounding frame corresponding to the type of the surrounding frame. A description content extracting unit that reads out and extracts the described description content in the identified enclosure from the description content storage unit; and a recognition unit that extracts a character or image in the enclosure according to the description content and performs a reading process. An optical character reading device comprising:
JP4202809A 1992-07-30 1992-07-30 Optical character reader Expired - Lifetime JP3006294B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP4202809A JP3006294B2 (en) 1992-07-30 1992-07-30 Optical character reader

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP4202809A JP3006294B2 (en) 1992-07-30 1992-07-30 Optical character reader

Publications (2)

Publication Number Publication Date
JPH0652351A true JPH0652351A (en) 1994-02-25
JP3006294B2 JP3006294B2 (en) 2000-02-07

Family

ID=16463565

Family Applications (1)

Application Number Title Priority Date Filing Date
JP4202809A Expired - Lifetime JP3006294B2 (en) 1992-07-30 1992-07-30 Optical character reader

Country Status (1)

Country Link
JP (1) JP3006294B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6955436B2 (en) 2002-01-23 2005-10-18 Sony Corporation Image display device and image projector apparatus

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6955436B2 (en) 2002-01-23 2005-10-18 Sony Corporation Image display device and image projector apparatus
US7128426B2 (en) 2002-01-23 2006-10-31 Sony Corporation Method of producing image display device and image projector apparatus
KR100954016B1 (en) * 2002-01-23 2010-04-20 소니 주식회사 Image display and image projector

Also Published As

Publication number Publication date
JP3006294B2 (en) 2000-02-07

Similar Documents

Publication Publication Date Title
US8213717B2 (en) Document processing apparatus, document processing method, recording medium and data signal
CN102171708A (en) Business document processor
US5249240A (en) Program creating method
EP1202213B1 (en) Document format identification apparatus and method
US5854860A (en) Image filing apparatus having a character recognition function
JP3006294B2 (en) Optical character reader
JP3732254B2 (en) Format information generation method and format information generation apparatus
JP4517822B2 (en) Image processing apparatus and program
JP2877380B2 (en) Optical character reader
JPH07152856A (en) Optical character reader
JP2000339405A (en) Optical character recognition system, format control generation method of slip in the same and storage medium storing format control generation method
JPH06333085A (en) Optical character reader
JP3001618B2 (en) How to copy characters on paper and how to recognize symbols
JP3310063B2 (en) Document processing device
JP2925270B2 (en) Character reader
JPS63137383A (en) Character reader
JPH10116314A (en) Table processing method and its device
JPH04123262A (en) List type data processor
JPH0789361B2 (en) Form registration device
JPH01199285A (en) Optical character reader
JPH10154157A (en) Electronic filing system
JP2005092597A (en) Documents reader, its program, scanning device, invisible image print controlling unit, its program, and sheet shape medium
JP2000029986A (en) Method for reading slip data and recording medium and device for reading slip data
JPH06251191A (en) Image processing device
JPS62179079A (en) Character reader

Legal Events

Date Code Title Description
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 19991026