JPH03252785A

JPH03252785A - Optical character reader

Info

Publication number: JPH03252785A
Application number: JP2049207A
Authority: JP
Inventors: Toshiko Matsuo; 松尾　敏子
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1990-03-02
Filing date: 1990-03-02
Publication date: 1991-11-12

Abstract

PURPOSE:To facilitate document description by attaching data for recognition to recognize a left or right side bracket similar pattern in a dictionary for KATAKANA(square form of Japanese syllabary) character recognition. CONSTITUTION:A customer describes a left side bracket similar pattern or a right side bracket similar pattern in which a conventional left or right side bracket similar pattern is changed partially and distinction from the KATAKANA character (no), etc., is clarified on a document, etc. In an OCR to read and recognize the document, etc., the data for recognition to recognize the left side bracket similar pattern or the right side bracket similar pattern is attached on the dictionary for KATAKANA character recognition. Therefore, generally speaking, only the KATAKANA character is described in the column of the receiver name of a transfer document, however, an item for which the description of a left side bracket or right side bracket is permitted to be omitted can be recognized. Thereby, it is remarkably convenient for the description for a customer, and descriptive time can be shortened.

Description

【発明の詳細な説明】［産業上の利用分野〕本発明は光学式文字読取装置に関し、特にカタカナ文字
及びカッコを認識する光学式文字読取装置に関するもの
である。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to an optical character reading device, and more particularly to an optical character reading device that recognizes katakana characters and parentheses.

［従来の技術］例えば、金融機関における為替振込業務等において、光
学式文字読取装置（以下ＯＣＲという）を使用して、為
替帳票を自動的に読取り、その後の業務処理を行うこと
がある。[Prior Art] For example, in money transfer operations at financial institutions, an optical character reader (hereinafter referred to as OCR) may be used to automatically read money order forms and perform subsequent business processing.

第２図は上記為替帳票の記入例を示す図であり、第３図
は為替帳票をＯＣＲにより読取り認識した結果を表示部
の画面に表示する画面表示例を示す図であり、第４図は
為替帳票に省略形として金銀システムにおいて使用可能
とされている略号を説明する図である。FIG. 2 is a diagram showing an example of filling in the above-mentioned exchange form, FIG. 3 is a diagram showing an example of a screen display in which the result of reading and recognizing the exchange form by OCR is displayed on the screen of the display unit, and FIG. FIG. 2 is a diagram illustrating abbreviations that can be used in the gold and silver system as abbreviations for exchange forms.

第２図〜第４図を参照し、従来のＯＣＲを用いた帳票の
読取り認識方法について説明する。A conventional document reading and recognition method using OCR will be described with reference to FIGS. 2 to 4.

顧客は為替帳票などの帳票に、第２図に示される記入例
のように、必要事項（受取人名、振込先銀行名、金額、
依頼人名等）を記入し銀行窓口にいる銀行員（以下、テ
ラーと呼ぶ）に提出する。The customer enters the necessary information (recipient's name, transfer bank name, amount,
Fill in the client's name, etc.) and submit it to the bank employee at the bank counter (hereinafter referred to as the teller).

テラーは該帳票に銀行としての必要事項を記入後、該帳
票を上記テラーあるいは別の操作員がＯＣＲに読取りを
行わせる。ＯＣＲが帳票から読取ったデータは、第３図
の表示例に示されるように、それぞれの記入項目毎に、
ＯＣＲが認識した結果の文字と認識前の画像としてのイ
メージデータとして、上下に対比して表示部に表示され
る。この表承部に表示された結果をオペレータがチエツ
クして、もし誤りがあればこれを訂正し、訂正後の正し
いデータをホストコンピュータ等の上位局へ送信するべ
く操作を行う。After the teller fills in the necessary information for the bank on the form, the teller or another operator causes the OCR to read the form. The data read by OCR from the form is displayed for each entry as shown in the display example in Figure 3.
Characters as a result of OCR recognition and image data as an image before recognition are displayed on the display unit in vertical comparison. An operator checks the results displayed on this representation, corrects any errors, and performs operations to transmit the corrected data to a higher-level station such as a host computer.

ところで、ＯＣＲに読取らせるべき帳票への記入項目と
して例えば、為替振込帳票の受取人名があるが、この項
目はカタカナ文字にて記入することになっている。例え
ば受取人名が「東西株式会社」であれば「トウザイ力ブ
シキガイシャ」と記入する必要があるが、第４図の現用
略号の説明図に示されるように、為替取引においては省
略形の記入も許されており、「トウザイ力）」または「
トウザイ（力）」と記入しても同じ意味を持つ。Incidentally, an example of an item to be entered in a form to be read by OCR is the recipient's name on a money transfer form, but this item is to be entered in katakana characters. For example, if the beneficiary name is "Tozai Co., Ltd.," it is necessary to enter "Tozai Riki Bushiki Gaisha," but as shown in the diagram of current abbreviations in Figure 4, abbreviations are also allowed in foreign exchange transactions. ``Tozai force)'' or ``
It has the same meaning even if it is written as "Tozai (power)".

このような受取人名や依頼人名のような項目を含む帳票
を読取り認識するためのＯＣＲの内蔵辞書として、従来
技術においては、カタカナ文字のみの辞書を使用し、カ
ッコの認識ができないとするか、あるいは“（”や“）
”も認識させるためカタカナ文字と英記号や英文字を含
む辞書を使用するようにしていた。In the conventional technology, a dictionary containing only katakana characters is used as a built-in dictionary of OCR to read and recognize documents that include items such as recipient names and client names, and parentheses cannot be recognized. Or “(” or “)
In order to recognize ``, a dictionary containing katakana characters, English symbols, and letters was used.

［発明が解決しようとする課題］上記のような従来の光学式文字読取装置（ＯＣＲ）にお
いて、カタカナ文字のみの辞書を使用して前記の受取人
名や依頼人名のような項目を読取る場合には、ＯＣＲは
省略形については“（”や“）”が認識できないので、
顧客は省略形を記入することができず、帳票記入に時間
がかかり不便であった。[Problems to be Solved by the Invention] In the conventional optical character reading device (OCR) as described above, when reading items such as the above-mentioned recipient name and client name using a dictionary containing only katakana characters, , OCR cannot recognize "(" or ")" for abbreviations, so
Customers were not able to write in abbreviations, and it took time and inconvenience to fill in forms.

また、カタカナ文字と英記号や英文字を含む辞書を使用
する場合には、顧客には便利になるが、辞書の容量が大
きくなり装置のコストが増加するという問題点があった
。Furthermore, when using a dictionary that includes katakana characters, English symbols, and letters, it is convenient for the customer, but there is a problem in that the capacity of the dictionary becomes large and the cost of the device increases.

さらにこの場合には認識上の問題として、例えばカナ文
字の“ノ”と英記号の“）”との認識誤りが発生する可
能性があるという別の問題点もあった。Furthermore, in this case, there is another problem in recognition that, for example, there is a possibility that a recognition error may occur between the kana character "ノ" and the English symbol ")".

本発明は、上述した問題点を解決するためになされたも
ので、顧客には帳票記入が容品で、装置のコストも余り
増加せず、しかも認識率が向上する光学式文字読取装置
を提供することを目的とする。The present invention was made in order to solve the above-mentioned problems, and provides customers with an optical character reading device that allows them to easily fill out forms, does not significantly increase the cost of the device, and improves the recognition rate. The purpose is to

［課題を解決するための手段］本発明に係る光学式文字読取装置は、少くともカタカナ
文字を認識する光学式文字読取装置において、帳票等に
記入された、従来の左側カッコまたは右側カッコを一部
変形し、カタカナ文字の“ノ”等との区別を明確にした
左側カッコ類似パターンまたは右側カッコ類似パターン
を読取り認識するため、あらかじめカタカナ文字を認識
するためのカタカナ文字認識用辞書に、前記左側カッコ
類似パターンまたは右側カッコ類似パターンの少くとも
いずれか一方を認識するための認識用データを付加して
備えたものである。[Means for Solving the Problems] An optical character reading device according to the present invention is an optical character reading device that recognizes at least katakana characters, and is capable of changing the conventional left-hand parenthesis or right-hand parenthesis written on a form, etc. In order to read and recognize left parenthesis-like patterns or right-hand parenthesis similar patterns that have been deformed to clearly distinguish them from katakana characters such as “ノ”, the left side parenthesis-like pattern or right-hand parenthesis similar pattern is written in advance in a katakana character recognition dictionary for recognizing katakana characters. It is provided with additional recognition data for recognizing at least either a parenthesis similar pattern or a right parenthesis similar pattern.

［作用コ本発明においては、少くともカタカナ文字を認識する光
学式文字読取装置において、従来の左側カッコまたは右
側カッコを一部変形し、カタカナ文字の“）”等との区
別を明確にした左側カッコ類似パターンまたは右側カッ
コ類似パターンを帳票等に記入するようにして、前記帳
票等を読取り認識するため、あらかじめカタカナ文字を
認識するためのカタカナ文字認識用辞書に、前記左側カ
ッコ類似パターンまたは右側カッコ類似パターンの少く
ともいずれか一方を認識するための認識用データを付加
しておくことにより、カタカナ文字と誤認識することな
しに、正しく左側カッコ及び右側カッコを認識する。[Operations] In the present invention, in an optical character reading device that recognizes at least katakana characters, the conventional left-hand parenthesis or right-hand parenthesis is partially modified, and the left-hand side parenthesis is clearly distinguished from the ``)'' etc. of katakana characters. The left parenthesis similar pattern or the right parenthesis similar pattern is written in a form, etc., and in order to read and recognize the form, etc., the left parenthesis similar pattern or right parenthesis similar pattern is written in a katakana character recognition dictionary for recognizing katakana characters. To correctly recognize a left parenthesis and a right parenthesis without erroneously recognizing them as katakana characters by adding recognition data for recognizing at least one of similar patterns.

［実施例］第１図は本発明に係るＯＣＲの認識処理の一実施例を示
すフローチャートである。また、第５図は本発明に係る
ＯＣＲの辞書構成例を示す図、第６図は本発明に係るＯ
ＣＲのカッコ記入形式例を示す図である。[Embodiment] FIG. 1 is a flowchart showing an embodiment of OCR recognition processing according to the present invention. Further, FIG. 5 is a diagram showing an example of the dictionary configuration of the OCR according to the present invention, and FIG.
It is a figure which shows the example of parenthesis entry format of CR.

第７図は本発明に係るＯＣＲを用いた為替データ通信装
置の機器構成図である。同図において、１は為替帳票、
２は為替帳票１のキャラクタ及びイメージデータを読み
取る卓上型光学式データ読取装置（以下卓上ＯＣＲとい
う）、３は簡易キーボード、４はリモートＯＣＲ制御装
置（以下０ＣＲＣＴＬという）、６は公衆通信網、５及
び７は公衆通信網６を介してモデム信号の送受を行なう
ためのネットワーク・コントロール・ユニット（以下Ｎ
ＣＵという）、８はリモートＯＣＲ認識装置ｆ（以下Ｍ
ＣＵという）、９はＭＣＵワークスティジョン（以下Ｍ
ＣＵ−ＷＳという）、１０はＣＲ７表示器（以下ＣＲＴ
という）、１１はキーボード、１２はプリンタ、１３は
専用通信線、１４はホスト装置である。FIG. 7 is an equipment configuration diagram of a money exchange data communication device using OCR according to the present invention. In the same figure, 1 is the exchange form,
2 is a tabletop optical data reader (hereinafter referred to as a tabletop OCR) for reading the character and image data of the exchange form 1; 3 is a simple keyboard; 4 is a remote OCR control device (hereinafter referred to as 0CRCTL); 6 is a public communication network; and 7 is a network control unit (hereinafter N) for transmitting and receiving modem signals via the public communication network 6.
8 is a remote OCR recognition device f (hereinafter referred to as M
9 is the MCU Workstation (hereinafter referred to as M
CU-WS), 10 is a CR7 display (hereinafter referred to as CRT).
), 11 is a keyboard, 12 is a printer, 13 is a dedicated communication line, and 14 is a host device.

４１はマイクロプロセッサ（以下ＣＰＵという）で、５
メガバイト（以下ＭＢという）程度のメモリを内蔵する
。４２は通信制御部で、０ＣＲＣＴＬ４と外部機器との
間のデータ通信を行なう。４３はフロッピィディスク（
以下ＦＤという）で、例えば５インチのＦＤが２枚程度
の容量をもつ。４４はディスク（ＤＩＳＫ）で、例えば
２０Ｍ　Ｂ程度の容量をもつ。４５は圧伸プロセッサで
、データ圧縮又はデータ伸長を行なう。４６はＧＰＩＢ
インタフェイス（以下ＧＰＩＢという）で卓上０ＣＲ２
とのデータ通信用インタフェイスである。上記４１〜４
６の機器がすべて０ＣＲＣＴＬ４に内蔵される。41 is a microprocessor (hereinafter referred to as CPU);
It has a built-in memory of about megabytes (hereinafter referred to as MB). 42 is a communication control unit that performs data communication between OCRCTL4 and external equipment. 43 is a floppy disk (
For example, it has a capacity of about two 5-inch FDs (hereinafter referred to as FDs). 44 is a disk (DISK), which has a capacity of, for example, about 20 MB. A companding processor 45 performs data compression or data expansion. 46 is GPIB
Desktop 0CR2 with interface (hereinafter referred to as GPIB)
This is an interface for data communication with. 41-4 above
All 6 devices are built into 0CRCTL4.

８１はＣＰＵで５ＭＢ程度のメモリを内蔵する。The 81 is a CPU with about 5MB of built-in memory.

８２は通信制御部で専用通信線１３を介してホスト装置
１４との間のデータ通信を行なう。８３はＦＤで５イン
チのＦＤ２枚程度の容量をもつ。８４はディスクで、例
えば１３０ＭＢ程度の容量をもつ。８５は受信プロセッ
サで、本例では６チヤネル（以下ＣＨという）までのデ
ータ受信回線を備え、ＤＭＡインタフェイス方式で、ト
ータルスループットが２４ＫＢ（キロバイト）／秒程度
の速度によりデータ受信処理を行なう。従って各営業店
から公衆通信網６及びＮＣＵ７を介した通信回線がＣＨ
＃１〜ＣＨ＃６まで入力できる。８６は漢字認識プロセ
ッサで、共有メモリインタフェイスを備え、多階調例え
ば１８階調のイメージデータである漢字の認識処理を行
なう。８７はローカルインタフェイス（以下ＬＩＦとい
う）でＭＣＵ−ＷＳ９との間のデータ通信を行なうため
のインタフェイスである。上記８１〜８７の機器がすべ
てＭＣＵ３に内蔵される。A communication control unit 82 performs data communication with the host device 14 via the dedicated communication line 13. 83 is an FD and has a capacity of about two 5-inch FDs. A disk 84 has a capacity of, for example, about 130 MB. Reference numeral 85 denotes a reception processor, which in this example is equipped with data reception lines of up to 6 channels (hereinafter referred to as CH), and performs data reception processing using a DMA interface method with a total throughput of about 24 KB (kilobytes)/second. Therefore, the communication line from each branch via the public communication network 6 and NCU 7 is CH
You can input from #1 to CH#6. Reference numeral 86 denotes a kanji recognition processor, which is equipped with a shared memory interface and performs recognition processing of kanji, which is multi-gradation, for example, 18-gradation image data. Reference numeral 87 denotes a local interface (hereinafter referred to as LIF), which is an interface for performing data communication between the MCU and the WS 9. All of the above-mentioned devices 81 to 87 are built into the MCU 3.

９１はＣＰＵで５ＭＢ程度のメモリを内蔵する。91 has a built-in CPU and about 5MB of memory.

９２はＬＩＦでＭＣＵ３との間のデータ通信を行なうた
めのインタフェイスである。９３はＦＤで例えば５イン
チＦＤ２枚程度の容量をもつ。９４はディスクで例えば
２０Ｍ　Ｂ程度の容量をもつ。９５は圧伸プロセッサで
ある。上記９１〜９５の機器がすべてＭＣＵ−ＷＳ９に
内蔵される。Reference numeral 92 is an LIF interface for data communication with the MCU 3. 93 is an FD and has a capacity of, for example, two 5-inch FDs. 94 is a disk with a capacity of, for example, about 20 MB. 95 is a companding processor. All of the above devices 91 to 95 are built into the MCU-WS9.

卓上０ＣＲ２は手書ぎ又は活字で記載された帳票サイズ
Ａ６〜Ｂ４程度の為替帳票を、読取り速度４枚／分程度
で読み取ることができる。読み取られたデータはキャラ
クタデータ又は／及びイメージデータとして処理される
。簡易キーボード３は０ＣＲＣＴＬ４との通信用インタ
フェイスを備え、また４０文字、２行程度表示できる液
晶表示器又はＣＲＴを有する。また簡易キーボード３に
は、アイデンティフィケイション（以下ＩＤという）カ
ードを挿入しないと、キーボードの操作を行なうことが
できない。従ってこのＩＤカードの所有を許された特定
の操作員のみが簡易キーボード３の操作を行なうことが
できる。ＣＰ　Ｕ４１．８１及び９１はそれぞれ５ＭＢ
程度のＩＣメモリを内蔵するが、大容量の補助記憶装置
としてディスク４４．８４及び９４と、中容量の補助記
憶装置としてＦＤ（フロッピィディスク）４３．８３及
び９３をそれぞれ使用する。ＦＤは例えば初期動作時に
イニシャルプログラムをロードする等の補助制御プログ
ラムを格納するために使用される。The tabletop OCR2 can read money order forms written in handwriting or printed letters with a size of A6 to B4 at a reading speed of about 4 pages/minute. The read data is processed as character data and/or image data. The simple keyboard 3 has a communication interface with the OCRCTL 4, and also has a liquid crystal display or CRT capable of displaying about 40 characters and about 2 lines. Further, unless an identification (hereinafter referred to as ID) card is inserted into the simple keyboard 3, the keyboard cannot be operated. Therefore, only a specific operator who is allowed to possess this ID card can operate the simple keyboard 3. CPU U41.81 and 91 are each 5MB
It has a built-in IC memory of about 100 yen, but uses disks 44, 84 and 94 as large-capacity auxiliary storage devices, and FDs (floppy disks) 43, 83 and 93 as medium-capacity auxiliary storage devices, respectively. The FD is used to store an auxiliary control program such as loading an initial program at the time of initial operation.

まず、第７図の光学式文字読取装置（ＯＣＲ）を用いた
為替データ通信装置について、ハードウェアとしての各
機器の動作を説明する。First, the operation of each device as hardware in the money order data communication device using an optical character reader (OCR) shown in FIG. 7 will be explained.

営業店は為替帳票１を卓上０ＣＲ２で読み取る。The sales branch reads the money order form 1 on the tabletop 0CR2.

実際の操作手順としては、先ず操作を許可された操作員
がＩＤカードを簡易キーボード３に挿入し、自動送信す
る何枚（１枚以上何枚でもよい）かの為替帳票１を卓上
０ＣＲ２にセットした後、読み取り送信開始釦を押すこ
とにより帳票の読み取りが開始される。為替帳票１から
読み取るべき情報は、例えば第２図の帳票記入例に示さ
れる。The actual operating procedure is as follows: First, an authorized operator inserts an ID card into the simple keyboard 3, and sets a number of exchange forms 1 to be automatically sent (one or more is fine) on the tabletop 0CR2. After that, by pressing the reading/sending start button, reading of the form is started. The information to be read from the exchange form 1 is shown, for example, in the example of form entry in FIG.

卓上０ＣＲ２は第２図に示されるような漢字、キャラク
タ及びイメージ情報を読み取るが、漢字については多階
調データとして読み取り、キャラクタのうちカナ文字又
は数字は読取り後に認識処理を行なう。また為替帳票と
して必須項目である金額、振込口座、受取人氏名等の重
要情報は、卓上０ＣＲ２がキャラクタとして認識できて
も、照合のためイメージ情報としても並列的に処理され
る。さらに卓上０ＣＲ２が読み取ったキャラクタを認識
できないときには、これをイメージ情報として読取った
ものとして処理する。卓上０ＣＲ２は上記のように読み
取り、また一部認識処理を行ったデータを、０ＣＲＣＴ
Ｌ４内のインタフェイスＣＰＩ３４６を介して、０ＣＲ
ＣＴＬ４へ供給する。The tabletop OCR2 reads Kanji, characters, and image information as shown in FIG. 2. Kanji are read as multi-gradation data, and among the characters, kana letters or numbers are recognized after being read. Also, important information such as the amount, transfer account, and recipient's name, which are essential items in a money order form, are processed in parallel as image information for verification, even if the tabletop 0CR2 can be recognized as a character. Further, when the tabletop OCR2 cannot recognize the read character, it processes this read as image information. The tabletop 0CR2 reads the data as described above and performs some recognition processing, and converts it to 0CRCT.
0CR via interface CPI346 in L4
Supply to CTL4.

０ＣＲＣＴＬ４は内蔵するＣＰＵ４１の制御により、卓
上０ＣＲ２により読取ったデータをディスク４４に書込
む。ＣＰＵ４１はディスク４４に書込むデータがイメー
ジデータのときは、圧伸プロセッサ４５を用いてデータ
の圧縮処理を行ない、データ数を少なくしてディスク４
４に書込む。そして０ＣＲＣＴＬ４内のＣＰＵ４１によ
って図示されない送信電文が作成され、ＮＣＵ３、公衆
通信網６及びＭＣＵ７を介して地区センタのリモートＯ
ＣＲ処理装置（ＭＣＵ）８へ送信される。The OCRCTL4 writes the data read by the tabletop OCR2 to the disk 44 under the control of the built-in CPU41. When the data to be written on the disk 44 is image data, the CPU 41 compresses the data using the companding processor 45 to reduce the number of data and write the data on the disk 44.
Write in 4. Then, a transmission message (not shown) is created by the CPU 41 in the 0CRCTL 4, and sent to the remote office of the district center via the NCU 3, public communication network 6, and MCU
It is transmitted to the CR processing unit (MCU) 8.

地区センタでは営業店から送信されたＭＣＵ７を介して
入力されたデータをＭＣＵＳ内の受信プロセッサ８５が
受信する。この装置の例においては、受信プロセッサ８
５は最大６チヤネルまでの入力信号を、トータルスルー
ブツトが２４Ｋ　Ｂ　／秒の速度により受信することが
できる。At the district center, a reception processor 85 within the MCUS receives data transmitted from the sales branch and input via the MCU 7. In this device example, the receive processor 8
5 can receive up to 6 channels of input signals with a total throughput of 24 K B /sec.

ＭＣＵ３は受信した漢字イメージデータに対して、漢字
文字認識を行ないカナデータに変換する。The MCU 3 performs kanji character recognition on the received kanji image data and converts it into kana data.

これはＭＣＵＳ内の漢字認識プロセッサ８６が受信プロ
セッサ８５から供給される漢字イメージデータの認識処
理を行ない、カナデータに変換するものである。In this process, the kanji recognition processor 86 in the MCUS recognizes the kanji image data supplied from the reception processor 85 and converts it into kana data.

またＭＣＵ３内のＣＰＵ８１は、受信プロセッサ８５が
受信したデータをディスク８４に書込む。この場合漢字
イメージデータは前記変換されたカナデータとしてディ
スク８４に書込まれる。ディスク８４に書込まれた為替
帳票データはｎｃｕ−ｗｓ　９を介してＣＲＴＩＯに自
動表示される。即ちＭＣＵ３内のＣＰＵ８１がＬ　Ｉ　
Ｆ　８７ヨリＬＩＦ９２を介しテＭＣＵ−ＷＳ９に為替
帳票データを配信し、ＭＣＵ−ＷＳ　Ｑ内のＣＰＵ９１
は配信されたデータを、例えば第３図の画面表示例に示
されるように、各項目毎にＯＣＲが認識した結果の文字
と、認識前の画像としてのイメージデータとを対比させ
て、ＣＲＴＩＯに表示する。Further, the CPU 81 in the MCU 3 writes the data received by the reception processor 85 onto the disk 84 . In this case, the kanji image data is written to the disk 84 as the converted kana data. The exchange form data written to the disk 84 is automatically displayed on the CRTIO via the ncu-ws 9. That is, the CPU 81 in the MCU 3
The exchange form data is distributed to the MCU-WS9 via the F87 LIF92, and the CPU91 in the MCU-WS Q
For example, as shown in the screen display example in Figure 3, the distributed data is sent to CRTIO by comparing the characters recognized by OCR for each item with the image data as images before recognition. indicate.

この表示のときに、卓上０ＣＲ２や漢字認識プロセッサ
８６が認識不能のイメージデータもＣＲＴｌＯに表示さ
れる。従って操作員は本装置が認識処理を行ったが間違
って認識されたデータの訂正と、本装置が認識処理を実
行することができなかったイメージデータを判読し、そ
の判読データの追加をキーボード１１を用いて行なうこ
とができる。At the time of this display, image data that cannot be recognized by the tabletop OCR2 or the Kanji recognition processor 86 is also displayed on the CRTlO. Therefore, the operator must correct data that was incorrectly recognized by this device, read image data that could not be recognized by this device, and add the interpreted data using the keyboard. This can be done using

ＣＰＵ９１はＣＲＴ　１０にイメージデータを表示する
とき、圧伸プロセッサ９５を用いて、圧縮されたイメー
ジデータの復元（伸長）を行ない表示する。When displaying image data on the CRT 10, the CPU 91 uses the companding processor 95 to restore (expand) the compressed image data and display it.

このようにしてデータの訂正や追加により帳票データが
完成すると、操作員はキーボード１１上の送信釦を操作
して前記完成された為替帳票データを専用通信線１３を
介してホスト装置１４へ送信する。When the form data is completed by correcting or adding data in this manner, the operator operates the send button on the keyboard 11 to transmit the completed exchange form data to the host device 14 via the dedicated communication line 13. .

即ちＭＣＵ３内のＣＰＵ８１は通信制御部８２を用い、
専用通信線１３を介して、ホスト装置１４とデータの送
受信を行なう。この送受信の行なわれたデータは、ＭＣ
Ｕ−ＷＳ　９のＣＰＵ９１によりＣＲＴ　１０及びプリ
ンタ１２に出力され記録される。That is, the CPU 81 in the MCU 3 uses the communication control unit 82,
Data is sent and received to and from the host device 14 via the dedicated communication line 13. This transmitted and received data is transferred to the MC.
The CPU 91 of the U-WS 9 outputs the data to the CRT 10 and printer 12 for recording.

以上により本発明に係るＯＣＲを用いたハードウェアと
しての為替データ通信装置の動作説明を終了する。This concludes the explanation of the operation of the money exchange data communication device as hardware using OCR according to the present invention.

次に本発明に係るＯＣＲの辞書構成について第５図によ
り説明する。同図に示された辞書にはカナ文字の全部あ
るいは一部を認識するための認識用データ及びカナコー
ドを有しているくカナ辞書部〉と、これに加え本発明に
係る左側カッコ類似パターンと右側カッコ類似パターン
の両方、またはいずれか一方のカッコ類似パターンの認
識用データ及び該当コードとを有している。Next, the OCR dictionary structure according to the present invention will be explained with reference to FIG. The dictionary shown in the figure includes a kana dictionary part which has recognition data and kana codes for recognizing all or part of kana characters, and in addition, a left parenthesis similar pattern according to the present invention. and the right parenthesis similar pattern, or the recognition data and corresponding code for either one of the parenthesis similar patterns.

この本発明に係るカッコ類似パターンは、例えば第６図
の記入形式例に示される。同図における左側または右側
カッコ類似パターンは、従来の左側カッコ“（°または
右側カッコ“）”そのものではなく、カッコの弓形の中
央部からそれぞれ左側または右側に短い水平線を突出さ
せ、カナ文字の“ノ°との区別が明確につくように、カ
ッコに類似しているが一部変形したカッコ類似（カッコ
様）パターンを使用している。そして本発明においては
、顧客が帳票の受取人名等の項目に省略記号（第４図参
照）を記入する際に、前記カッコ類似パターンを記入し
てもらい、この帳票を読取り認識するＯＣＲは、内蔵す
る辞書を用いてこのカッコ類似パターンを認識して、正
しい左側カッコまたは右側カッコのコードを取出すよう
にしている。This parenthesis similar pattern according to the present invention is shown, for example, in the entry format example of FIG. The left- or right-side parenthesis-like pattern in the figure is not the traditional left-hand parenthesis “(° or right-hand parenthesis)” itself, but a short horizontal line protruding from the arcuate center of the parenthesis to the left or right, respectively, and the kana character “ In order to clearly distinguish it from ノ°, we use a parenthesis-like (bracket-like) pattern that is similar to parentheses but partially modified. In the present invention, when a customer enters an abbreviation symbol (see Figure 4) in an item such as the recipient's name on a form, the customer is asked to fill in the parenthetical similar pattern, and the built-in OCR reads and recognizes this form. This parenthesis-like pattern is recognized using a dictionary, and the code for the correct left parenthesis or right parenthesis is extracted.

また本発明に係る左側カッコ類似パターンと右側カッコ
類似パターンは左右対称のパターンであるので、両方の
カッコ類似パターンの認識データを認識用辞書に備えな
くとも、いずれか一方のカッコ類似パターンの認識デー
タを備えることにより、左右対称パターンとして識別の
上、両方のカッコ類似パターンの認識を行なうことが可
能となる。Furthermore, since the left parenthesis similar pattern and the right parenthesis similar pattern according to the present invention are left-right symmetrical patterns, the recognition data of either one of the parenthesis similar patterns does not need to be provided in the recognition dictionary. By providing this, it is possible to identify both left-right symmetrical patterns and to recognize both bracket-like patterns.

なお、本発明におけるカッコ類似パターンは、第６図に
示される記入形式例に限定されるものではなく、カナ文
字の“ノ”フ”り”等との区別が明確につくものであれ
ば、他のカッコ類似パターンでもよい。例えば三日月や
糸を張った弓のパターンであっても同様の動作を行うこ
とができる。Note that the parentheses-like pattern in the present invention is not limited to the entry format example shown in FIG. Other bracket-like patterns may also be used. For example, a similar action can be performed with a crescent moon or a stringed bow pattern.

次に本発明に係るＯＣＲの認識処理の一実施例について
、第１図を主にして、第５図及び第６図を参照して説明
する。Next, an embodiment of the OCR recognition process according to the present invention will be described with reference to FIG. 1 and FIGS. 5 and 6.

第２図の帳票記入例に示すように、顧客が帳票の受取人
名や依頼人名の項目にカナ文字を記入し、テラーに提出
する。テラー等の操作員はこの帳票を卓上０ＣＲ２にセ
ットし読取りを行なわせる。As shown in the example of filling out a form in FIG. 2, the customer writes kana characters in the recipient's name and client's name fields on the form and submits it to the teller. An operator such as a teller sets this form on the tabletop 0CR2 and causes it to be read.

第１図のステップＳ１０において、卓上０ＣＲ２は上記
該当項目に記入されているデータを読取り、ステップＳ
ｌｌにおいて、読取ったイメージデータを０ＣＲＣＴＬ
４内のメモリに一旦記憶する。In step S10 of FIG. 1, the tabletop 0CR2 reads the data entered in the corresponding item above, and in step S
ll, the read image data is 0CRCTL
Temporarily store it in the memory in 4.

次にステップＳ１２において、０ＣＲＣＴＬ４は一旦記
憶したイメージデータから１文字分に相当するデータを
読出す。Next, in step S12, OCRCTL4 reads data corresponding to one character from the once stored image data.

ステップＳ１３において、第５図に示されるＯＣＲの辞
書のうち〈カナ辞書部〉の認識用データに基づき認識処
理を行ない、対応するカナコードを検索する。そしてス
テップＳ１４において、対応するカナコードが有るかの
判別を行ない、有ればステップＳ１５へ移り、無ければ
ステップ８１８へ移る。In step S13, recognition processing is performed based on the recognition data of the <kana dictionary section> of the OCR dictionary shown in FIG. 5, and a corresponding kana code is searched. Then, in step S14, it is determined whether there is a corresponding kana code. If there is, the process moves to step S15, and if there is not, the process moves to step 818.

対応するカナコードが有れば、ステップＳ１５において
、該当するカナコード（例えばＪＩＳ　７単位の該当カ
ナコード）を取出し、このコードをステップＳ２０にお
いて、メモリに記憶しておく。If there is a corresponding kana code, the corresponding kana code (for example, the corresponding kana code in JIS 7 units) is extracted in step S15, and this code is stored in the memory in step S20.

対応するカナコードが無ければ、ステップＳｌＢにおい
て、第５図に示されるＯＣＲの辞書のうち左側カッコ類
似パターン認識用データに基づき認識処理を行ない、左
側カッコ類似パターンと認識できるかを判別する。判別
結果が認識できる場合はステップＳ１７へ移り、認識で
きない場合はステップ３１８へ移る。If there is no corresponding kana code, in step S1B, a recognition process is performed based on data for recognizing a left parenthesis similar pattern in the OCR dictionary shown in FIG. 5, and it is determined whether it can be recognized as a left parenthesis similar pattern. If the determination result can be recognized, the process moves to step S17, and if it cannot be recognized, the process moves to step S318.

ステップ３１Ｂにおいて認識できる場合は、ステップＳ
１７において、“（”コード（例えばＪＩＳ　７単位の
“（”コード）を取出し、このコードをステップＳ２０
において、メモリに記憶しておく。If it is recognized in step 31B, step S
In step S20, the "(" code (for example, the "(" code in JIS 7 units) is extracted, and this code is sent to step S20.
, and store it in memory.

ステップＳｌＢにおいて認識できない場合は、ステップ
８１８において、第５図に示されるＯＣＲの辞書のうち
右側カッコ類似パターン認識用データに基づき認識処理
を行ない、右側カッコ類似パターンと認識できるかを判
別する。判別結果が認識できる場合はステップＳ１９へ
移り、認識できない場合はステップＳ２１へ移る。If the pattern cannot be recognized in step S1B, a recognition process is performed in step 818 based on data for recognizing a right-side parenthesis similar pattern in the OCR dictionary shown in FIG. 5, and it is determined whether it can be recognized as a right-side parenthesis similar pattern. If the determination result can be recognized, the process moves to step S19, and if it cannot be recognized, the process moves to step S21.

ステップ３１ｇにおいて認識できる場合は、ステップＳ
１９において、“）”コード（例えばＪＩＳ　７単位の
“）”コード）を取出し、このコードをステップＳ２０
においてメモリに記憶しておく。If it is recognized in step 31g, step S
In step S19, the ")" code (for example, the ")" code in JIS 7 unit) is extracted, and this code is sent to step S20.
Store it in memory at .

上記のいずれとも認識できない場合は、ステップ５２１
において、該当文字は認識不能文字であるとして、この
旨メモリに記憶する。If none of the above is recognized, step 521
In this case, the corresponding character is assumed to be an unrecognized character, and this fact is stored in the memory.

ステップＳ２２においては、該当項目に記入された全文
字の認識処理が終了したかの判別を行ない、全文字終了
していない場合はステップＳ１２に戻り、上記処理を繰
り返して行なう。また全文字終了している場合はステッ
プＳ２３に移る。In step S22, it is determined whether the recognition process for all the characters entered in the corresponding item has been completed, and if all the characters have not been recognized, the process returns to step S12 and the above process is repeated. If all characters have been completed, the process moves to step S23.

ステップ８２３においては、該当項目の認識文字として
文字コード（認識不能文字のイメージデータを含む）を
メモリより取出して、例えばその他の必要情報と合わせ
て送信電文を組立て、第７図で説明したように地区セン
タのＭＣ０８等の上位局に送信する。ソノ後Ｍ　ＣＵ　
８　ハＭＣＵ−ＷＳ９　（７）　ＣＲＴＩＯに例えば第
３図の画面表示例のような表示を行なう。In step 823, the character code (including image data of unrecognized characters) is retrieved from the memory as the recognized character of the corresponding item, and is assembled into a transmission message together with other necessary information, as explained in FIG. It is sent to a higher-level station such as MC08 at the district center. Sonogo M CU
8 C MCU-WS9 (7) Display on the CRTIO, for example, the screen display example shown in FIG.

なお、前記為替処理の場合には、第５図に示される省略
形はそのままでホスト装置１４等の上位局へ送信するこ
とができる。しかしホスト装置１４等の上位局において
省略形が許されないシステムの場合には、例えば卓上０
ＣＲ２内に第５図に示される略号と正式名称との対応辞
書（図示せず）を内蔵し、第１図のステップＳ２２にお
いて、全文字の処理が終了後１、メモリに記憶している
文字コードと前記対応辞書とを比較検索して略号の部分
を正式名称に変換後、次のステップ８２３において電文
を組立て上位局に送信するようにしてもよい。In the case of the money exchange processing, the abbreviations shown in FIG. 5 can be sent as they are to a higher-level station such as the host device 14. However, in the case of a system where abbreviations are not allowed in a higher-level station such as the host device 14, for example,
A correspondence dictionary (not shown) between the abbreviations and official names shown in FIG. 5 is built into the CR2, and in step S22 of FIG. After comparing and searching the code with the corresponding dictionary and converting the abbreviation part into an official name, the message may be assembled in the next step 823 and sent to the higher-level station.

［発明の効果コ以上のように本発明によれば、少くともカタカナ文字を
認識するＯＣＲにおいて、従来の左側カッコまたは右側
カッコを一部変更し、カタカナ文字の“）”等との区別
を明確にした左側カッコ類似パターンまたは右側カッコ
類似パターンを顧客から帳票等に記入してもらうように
して、前記帳票等を読取り認識するＯＣＲには、あらか
じめカタカナ文字を認識するためのカタカナ文字認識用
辞書に、前記左側カッコ類似パターンまたは右側カッコ
類似パターンを認識するための認識用データを付加する
ようにしたので、為替取引のための金銀システムにおけ
る振込帳票の受取人８欄のように、−船釣にはカナ文字
のみで記入するが、記入を省略するために左側カッコや
右側カッコの記入を許す項目に対しても認識可能となり
、顧客にとっては記入に便利でかつ記入時間が短縮でき
る効果がある。[Effects of the Invention] As described above, according to the present invention, at least in OCR that recognizes katakana characters, the conventional left or right parentheses are partially changed to clearly distinguish them from katakana characters such as ")". The left parenthesis similar pattern or the right parenthesis similar pattern that was set in ``2'' is written by the customer on a form, etc., and the OCR that reads and recognizes the form etc. is equipped with a katakana character recognition dictionary to recognize katakana characters in advance. , Recognition data for recognizing the similar pattern in the left parenthesis or the similar pattern in the right parenthesis is added, so that, like the recipient column 8 of the transfer form in the gold and silver system for exchange transactions, is filled in only in kana characters, but in order to omit entry, it is possible to recognize items that allow entry in left parentheses or right parentheses, which is convenient for the customer and has the effect of shortening the entry time.

またＯＣＲにとっても、従来のカタカナ文字認識辞書に
左側又は右側類似パターンの認識用データと該当コード
を付加するのみでよいので、従来の英記号を認識するた
めの辞書を使用して認識する場合に比較して、辞書の記
憶容量も少くてすみ、余りコストアップすることなしに
、カタカナ文字との区別が明確になされ、認識率が向上
する効果が得られる。Also, for OCR, it is only necessary to add the recognition data and the corresponding code for the left or right similar pattern to the conventional katakana character recognition dictionary, so when recognizing using the conventional dictionary for recognizing English symbols, In comparison, the memory capacity of the dictionary is small, and the recognition rate can be improved by clearly distinguishing the characters from katakana characters without increasing the cost too much.

[Brief explanation of drawings]

第１図は本発明に係るＯＣＲの認識処理の一実施例を示
すフローチャート、第２図は帳票記入例を示す図、第３
図は画面表示例を示す図、第４図は現用略号を説明する
図、！５図は本発明に係るＯＣＲの辞書構成例を示す図
、第６図は本発明に係るＯＣＲのカッコ記入形式例を示
す図、第７図は本発明に係るＯＣＲを用いた為替データ
通信装置の機器構成図である。図において１は為替帳票、２は卓上ＯＣＲ，３は簡易キ
ーボード、４は０ＣＲＣＴＬ、　５．７はＮＣＵ。６は公衆通信網、８　ハＭ　ＣＵ　、　９　ハＭＣ１ｌ
−ＶＳ、　１０ハＣＲＴ、１１はキーボード、１２はプ
リンタ、１３は専用通信線、１４はホスト装置、４１．
８１．９１はＣＰＵ。４２．８２は通信制御部、４８．８３．９３はＦＤ、４
４．８４．９４はディスク、４５．４９は圧伸プロセッ
サ、４６はＣＰＩＢ、　８５は受信プロセッサ、８６は
漢字認識プロセッサ、８７．９２はＬＩＦである。第図現用略号１祝明する図第４ｒ！！Ｊワ甲（調に首己入イ万１１五示すｐり第２因６ｔＸｔｋｈ　イＭ’ｌｔ　プｉＹ５コ第３図２ト４し上目１でイＦろＯＣＲの右懇書手繭ｊχイ１１
を示γｂ弓第５囚第囚FIG. 1 is a flowchart showing an example of OCR recognition processing according to the present invention, FIG. 2 is a diagram showing an example of form entry, and FIG.
The figure shows an example of screen display, and Figure 4 is a figure explaining the currently used abbreviations. FIG. 5 is a diagram showing an example of an OCR dictionary structure according to the present invention, FIG. 6 is a diagram showing an example of an OCR bracket entry format according to the present invention, and FIG. 7 is a money exchange data communication device using OCR according to the present invention. FIG. In the figure, 1 is a money order form, 2 is a desktop OCR, 3 is a simple keyboard, 4 is 0CRCTL, and 5.7 is an NCU. 6 is a public communication network, 8 is MCCU, 9 is MC1l
-VS, 10 CRT, 11 keyboard, 12 printer, 13 dedicated communication line, 14 host device, 41.
81.91 is the CPU. 42.82 is the communication control unit, 48.83.93 is the FD, 4
4.84.94 is a disk, 45.49 is a companding processor, 46 is CPIB, 85 is a receiving processor, 86 is a Kanji recognition processor, and 87.92 is LIF. Diagram Current Abbreviation 1 Congratulations Diagram 4r! ! J wa Ko (key in the neck 1115 showing p 2nd cause 6t 11
Shows γb bow 5th prisoner 5th prisoner

Claims

[Claims] In an optical character reading device that recognizes at least katakana characters, at least one of a left parenthesis similar pattern or a right parenthesis similar pattern is included in a katakana character recognition dictionary for recognizing katakana characters. An optical character reading device characterized by being provided with additional recognition data for recognition.