JPS63311491A

JPS63311491A - optical character reader

Info

Publication number: JPS63311491A
Application number: JP62147340A
Authority: JP
Inventors: Mikio Yamaguchi; 幹雄山口
Original assignee: Sumitomo Electric Industries Ltd
Current assignee: Sumitomo Electric Industries Ltd
Priority date: 1987-06-13
Filing date: 1987-06-13
Publication date: 1988-12-20
Anticipated expiration: 2011-07-10
Also published as: JP2514663B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、手持ち式のスキャナで原稿上を走査すること
により文字・記号等（以下代表して文字のみに関して述
べるが記号に関しても全く同様である）を読み取る光学
文字読取装置に関するものである。[Detailed Description of the Invention] [Field of Industrial Application] The present invention is capable of scanning characters, symbols, etc. (hereinafter, only characters will be described as a representative, but the same applies to symbols as well) by scanning a document with a hand-held scanner. The present invention relates to an optical character reading device that reads (a).

[Conventional technology]

スーパーマーケットや百貨店等で、単品毎の売上げ情報
を収集して在庫管理を行うＰ　ＯＳ　（ＰａｉｎｔＯｆ
　５ａｌｅｓ　）システムが普及している。このＰ。At supermarkets, department stores, etc., POS (PaintOf
5ales) system is widespread. This P.

Ｓシステムでは手持ち式の光学文字読取装置がよく使用
されている。Hand-held optical character readers are often used in S systems.

このような装置として、本出願人は特願昭６２−１１０
８３号や特願昭６２−５６２９３号を特許出願している
。手持ち式の光学文字読取装置の代表的な構成を第２図
に示す。As such a device, the present applicant has filed a patent application No. 110/1986.
No. 83 and Japanese Patent Application No. 62-56293. FIG. 2 shows a typical configuration of a hand-held optical character reading device.

第２図において、１はスキャナであり、手２で、原稿３
に当てかうだけで原稿に記憶された文字を読み取るもの
である。原稿３はたとえば、ＰＯＳシステムで用いる値
札の用紙である。４は光源であり、５はレンズ系、６は
イメージセンサであり、少な（とも用紙３に記載された
文字の一行分の視野が必要であり、第２図では横は一行
分、縦は一文字の３倍くらいとしている。７は制御・二
値化回路であり、イメージセンサ６の出力信号であるア
ナログ信号を文字領域及び背景頭載のおのおの対応する
二値化信号に変換し、画面メモリ８に送る。In Fig. 2, 1 is a scanner, and a hand 2 is used to hold a document 3.
It reads the characters stored in the manuscript simply by matching them with the characters. The document 3 is, for example, a sheet of price tag used in a POS system. 4 is a light source, 5 is a lens system, and 6 is an image sensor. 7 is a control/binarization circuit that converts the analog signal that is the output signal of the image sensor 6 into binary signals corresponding to the character area and the background head. send to

９から１３は、画面メモリ８の中の各文字を認識し、そ
の文字の視野内の位置（Ｘ座標）を求める手段である。9 to 13 are means for recognizing each character in the screen memory 8 and determining the position (X coordinate) of the character within the visual field.

画面メモリ８はイメージセンサ６の視野のほぼ全体の二
値化データを格納する。第３図ｆａｔにイメージセンサ
６の二値化データの説明を示している。The screen memory 8 stores binarized data of almost the entire field of view of the image sensor 6. FIG. 3 fat shows an explanation of the binarized data of the image sensor 6.

横（Ｘ）×縦（Ｙ）の大きさがｐｘｑ画素のイメージセ
ンサであり、視野のなかの文字を写し込んでいる。It is an image sensor with horizontal (X) x vertical (Y) pixels of pxq pixels, and captures characters within its field of view.

文字、記号は文字識別回路１３で認識されるが、文字識
別回路１３は１文字ずつ認識するものであ、るので、画
面メモリ８からは１文字分のデータを取り出す必要があ
る。−桁切り出し回路９は画面メモリ８から一文字切り
出し回路１１の処理能力であるｍ　Ｘ　ｑ画素相当分の
データを取り出し一桁メモリ１０に格納する。−文字切
り出し回路１１は一桁メモリから文字識別回路１３の処
理能力であるｍｘｎ画素相当分のデータを取り出し、−
文字メモリ１２に格納するものである。Characters and symbols are recognized by the character recognition circuit 13, but since the character recognition circuit 13 recognizes one character at a time, it is necessary to retrieve data for one character from the screen memory 8. - The digit extraction circuit 9 extracts data equivalent to m×q pixels, which is the processing capacity of the single character extraction circuit 11, from the screen memory 8 and stores it in the single digit memory 10. -The character extraction circuit 11 extracts data equivalent to mxn pixels, which is the processing capacity of the character identification circuit 13, from the one-digit memory, and -
It is stored in the character memory 12.

第３図（ａｌにおいて、まず−桁切り出し回路９はＸ−
１からＸ−ｍ１Ｙ−１からｙ−ｑ迄のデータを画面メモ
リ８から取出し、−桁メモリ１０に転送する（第３図（
ｂｌ））。−桁切り出し回路９は一桁メモリ１０の内容
を見て文字像を含む範囲（この例ではＹ−１１からＹ−
１１＋ｎ−１）のｎ行分を一文字メモリ１２に転送する
。（第３図（ｃ、））、−文字メモリ１２に文字が入っ
ているときは文字識別回路１３により文字が認識される
。In FIG. 3 (al), the negative digit extraction circuit 9 is
The data from 1 to X-m1Y-1 to y-q is taken out from the screen memory 8 and transferred to the -digit memory 10 (Fig.
bl)). - The digit extraction circuit 9 looks at the contents of the single digit memory 10 and selects a range including character images (in this example, from Y-11 to Y-
11+n-1) for n lines are transferred to the one-character memory 12. (FIG. 3(c,)), - When a character is stored in the character memory 12, the character recognition circuit 13 recognizes the character.

次にＸ−２からＸ−ｍ＋１Ｙ＝１からＹ＝ｑ迄のデータ
を画面メモリ８から取り出し、−桁メモリ１０に転送す
る（第３図（ｂｚ））。そして文字像を含む範囲の画像
を一文字メモリ１２に転送する。以下、同様にして画面
メモリ８がら取り出す位置を順にずらして一桁メモリ１
０に転送し、文字像を含む画像を一文字メモリ１２に転
送し、文字識別回路１３で処理を行うことで一行分の認
識を行う。Next, data from X-2 to X-m+1 Y=1 to Y=q is taken out from the screen memory 8 and transferred to the negative digit memory 10 (FIG. 3 (bz)). Then, the image in the range including the character image is transferred to the one-character memory 12. Thereafter, in the same way, shift the position from which to take out screen memory 8 in order, and select single digit memory 1.
0, the image including the character image is transferred to the one-character memory 12, and the character recognition circuit 13 processes it to recognize one line.

一桁メモリ１０から一文字メモリ１２に転送する範囲の
求め方を第４図に示す、先ず一桁メモリ１０の各行に対
して横ＯＲを求める。The method for determining the range to be transferred from the one-digit memory 10 to the one-character memory 12 is shown in FIG. 4. First, horizontal OR is determined for each row of the one-digit memory 10.

横ＯＲとは横方向の一行に注目してその行に黒画素があ
れば１とし、黒画素がなければ０とする演算である。い
まセンサの無出力を１とし、白出力を０として表現する
と、横ＯＲの結果とはすなわち一行の各画素の論理和を
取った結果にほかならない、そこでこの演算を横ＯＲと
呼んでいる。Horizontal OR is an operation that focuses on one row in the horizontal direction and sets it to 1 if there is a black pixel in that row, and sets it to 0 if there is no black pixel. Now, if the no output of the sensor is expressed as 1 and the white output is expressed as 0, then the result of horizontal OR is nothing but the result of taking the logical sum of each pixel in one row, so this operation is called horizontal OR.

そして文字がある部分では第４開山）に示すように、そ
の範囲だけ横ＯＲの結果は黒となる。−行メモリから一
文字メモリに転送する範囲は、たとえばＹ−１３から横
ＯＲが黒になったとすると、文字の上方の白を含めてＹ
−１１からｎ画素とする。In the area where there are characters, the result of horizontal OR is black for that area, as shown in the fourth opening (4th opening). -For example, if the horizontal OR becomes black from Y-13, the range to be transferred from the line memory to the character memory is Y-13, including the white above the character.
−11 to n pixels.

以上の処理によって、センサ６の視野の中に含まれる、
文字、記号を読み取ることができる。イメージセンサ６
を走査して用紙３の画像を画面メモリ８に蓄え、画面メ
モリ８の中の各文字を認識する処理は３回行われ、文字
の認識結果とその文字の視野内の位置（文字が認識され
たときに一桁切り出し回路９が一桁メモリ１０に画面を
切り出したときのＸ座標）が１４．１５．１６の識別結
果バッファ＃１、＃２、＃３に蓄えられる。Through the above processing, the
Can read letters and symbols. Image sensor 6
The image of the paper 3 is stored in the screen memory 8 by scanning the image, and the process of recognizing each character in the screen memory 8 is performed three times. 14.15.16, the X coordinate at which the one-digit extraction circuit 9 cuts out the screen in the one-digit memory 10 is stored in the identification result buffers #1, #2, and #3 on 14.15.16.

第２図の文字認識装置においては、一つの文字に対して
繰り返し認識した結果の多数決を取ることで、認識率向
上を図っている。１４から１８は、３回画面を取り込ん
で認識したときの文字の認識結果の多数決を取る手段で
ある。まず、認識結果バンファ＃１から＃３に記憶され
ている文字のＸ座標の値と認識結果が桁合わせ処理部１
７に送られる０桁合わせ処理部１７は文字のＸ座標の値
に基づいて、同一桁と判断できる文字認識結果を対応づ
ける。多数決処理部１８は対応づけられた文字認識結果
同士の多数決をとり、その桁に対する最終的な認識結果
を得る。多数決の例を第５図に示す、（ａ）は原稿に記
載されている行である。In the character recognition device shown in FIG. 2, the recognition rate is improved by taking a majority vote from the results of repeated recognition of one character. 14 to 18 are means for taking a majority vote of the character recognition results when the screen is captured and recognized three times. First, the X coordinate values of the characters stored in the recognition result buffers #1 to #3 and the recognition results are set in the digit alignment processing unit 1.
The 0-digit alignment processing unit 17 sent to the 0-digit alignment processing unit 17 associates character recognition results that can be determined to be of the same digit based on the X-coordinate value of the characters. The majority decision processing unit 18 takes a majority decision between the matched character recognition results and obtains the final recognition result for that digit. An example of majority voting is shown in FIG. 5, where (a) is a line written in the manuscript.

（ｂｌ）（ｂｚ　）（ｂｓ　）はそれぞれ、１度目、２
度目、３度目の認識における認識結果を表している−（
ｂｌ）では「１」の文字が欠け、（ｂりでは「２」の文
字が欠け、（ｂ、）では「３」の文字が欠けているが、
文字の視野内におけるＸ座標を基にして各桁における認
識結果の対応を取ってから多数決を取ることで（ｂ４）
のように正解が得られている。なお、文字のＸ座標を用
いずに、単純に認識した文字の先頭から対応づけて多数
決を取ると・　（ｃ）のように正解が得られない。(bl) (bz) (bs) are the first and second times, respectively.
Represents the recognition results for the third and third recognitions - (
The character "1" is missing in (bl), the character "2" is missing in (b,), and the character "3" is missing in (b,).
By comparing the recognition results for each digit based on the X coordinate within the field of view of the character and then taking a majority vote (b4)
The correct answer is obtained as follows. Note that if you do not use the X coordinate of the characters and simply match them from the beginning of the recognized characters and take a majority vote, you will not get the correct answer as shown in (c).

第２図の１９から２３は原稿上の一つの行に対して正し
く読み取れた認識結果を１回だけ出力するための手段で
ある０行が視野の中に在るかぎり、文字が繰り返されて
認識され、１８から多数決結果が繰り返し出力される。23 from 19 to 23 in Figure 2 is a means to output the recognition result that has been correctly read for one line on the manuscript only once.As long as line 0 is within the field of view, the characters are recognized repeatedly. The majority decision result is repeatedly output from 18 onwards.

スキャナ１を「Ｃ１２３４５６７８９０Ｊの行に当てが
いながら上から下に動かしたときの、視野の動きと、多
数決結果の変化を第６図に示す、（ａ、）の位置では、
認識結果（ｂ、）はすべてリジェクト（認識不能：？で
表している）になっている、（ａ、）の位置の認識結果
（ｂｔ）も同様にすべてリジェクトである＊（ａｓ）位
置では、「０」の文字だけ視野の中に入って認識されて
いる。（ｂｌ）から（ｂｓ）までの多数決結果は（Ｃ７
）のようになる、ここで、「０」の文字に付いては（ｂ
、）の文字認識結果を（ｂｌ）（ｂ８）のりジェツトよ
りも優先している。すなわち、リジェクトよりも文字認
識結果に重みを設定している。スキャナを更に視野に動
かし、（ａ４）（ａｓ　）（ａ＝　）の位置における認
識結果Ｃｂａ　）（ｂｓ　）（ｂａ　）の多数決結果は
（Ｃ２）の通りである。同様に、（ａ、）（ａｍ　）（
ａ９）の位置における認識結果（ｂ、）（ｂｓ　）（ｂ
ａ　）の多数決結果は（Ｃ１）の通りである。Figure 6 shows the movement of the field of view and the change in the majority decision result when the scanner 1 is moved from top to bottom while applying the line "C1234567890J". At the position (a,),
The recognition results (b,) are all rejected (unrecognizable: represented by ?). Similarly, the recognition results (bt) at the position (a,) are also all rejected. *At the (as) position, Only the character "0" is recognized within the visual field. The majority vote result from (bl) to (bs) is (C7
), where, for the character “0”, (b
, ) are given priority over the (bl) (b8) paste jets. In other words, more weight is placed on character recognition results than on rejects. The scanner is moved further into the field of view, and the recognition result Cba ) (bs ) (ba ) at the position (a4) (as ) (a= ) is as shown in (C2). Similarly, (a,)(am)(
Recognition result (b,) (bs ) (b
The majority vote result for a) is as shown in (C1).

１８からは（ｃ、）（ｃ意）（Ｃ３）が逐次出力される
が、１９はフォーマットチェック部で、１８から得られ
る多数決結果が予め定めである所定のフォーマント（た
とえば、Ｃで始まる行はＣの後に数字が１０文字続かな
ければならない）を満たしているかどうかを判定する。18 sequentially outputs (c,) (c い) (C3), but 19 is a format checker that outputs the majority result obtained from 18 in a predetermined format (for example, a line starting with C). (C must be followed by 10 numbers).

タイマー２０は１８から多数決結果が得られてからの経
過時間を測定する。所定のフォーマットを満たす多数決
結果Ｒ＋が得られたなら、前回レジスタ２１、比較器２
２、出力制御部２３は、次のように動作する。まず、比
較器２２において、Ｒ１と前回レジスタ２１に記憶され
ている内容Ｒ１−１とが比較される。ＲｉとＲｉ、の内
容が一致しなければ比較器２２からはＮＥＷの信号が出
て出力制御部２３はＲ４をその行の認識結果ＲＬＩ□と
して出力する。The timer 20 measures the elapsed time from when a majority result is obtained from the timer 18. If a majority result R+ satisfying the predetermined format is obtained, the previous register 21 and comparator 2
2. The output control section 23 operates as follows. First, the comparator 22 compares R1 with the content R1-1 stored in the previous register 21. If the contents of Ri and Ri do not match, the comparator 22 outputs a NEW signal and the output control section 23 outputs R4 as the recognition result RLI□ of that row.

Ｒ１とＲ１−２の内容が一致すれば、比較器２２からは
ＮＥＷの信号が出ず、出力制御部２３はＲ１を出力しな
い（読み捨てる）、一方、前回レジスタ２１は１’２ｉ
−＋を比較器２２に送った後は、Ｒ１を記憶する。タイ
マー２０は１８から多数決結果が得られてからの時間を
測定し、あらかじめ定めた一定時間Ｔ　ｅＬｌ経過後に
前回レジスタ２１の内容を消去する。電源を入れた直後
の前回レジスタの状態は消去状態である。ＴＣＬＩは値
札を持ち換えるのに必要な時間（たとえば１秒）よりも
短く、たとえば０．６秒程度に設定してお（、なお、行
が視野の中にあって１８から繰り返し多数決結果が得ら
れるときの繰り返しの周期はたとえば０．２秒程度であ
る。If the contents of R1 and R1-2 match, the comparator 22 will not output a NEW signal and the output control unit 23 will not output R1 (discard it).
After sending -+ to the comparator 22, R1 is stored. The timer 20 measures the time since the majority vote result is obtained from the timer 18, and erases the contents of the previous register 21 after a predetermined fixed time T eLl has elapsed. The state of the previous register immediately after the power is turned on is the erased state. The TCLI is set to be shorter than the time required to change the price tag (for example, 1 second), for example, about 0.6 seconds (note that if the line is within the field of view and the majority result is repeatedly obtained from 18), The repetition period when this happens is, for example, about 0.2 seconds.

第６図、第７図を用いて、値札を読み取るときの１９か
ら２３の動作を説明する。（ｃ、）の多数決結果「？’
ｉｌ’ｉ１７’ｉｌ’７’ｉ’７’ｉ”１ｌＱＪは所定
のフォーマントを満たしていないので、フォーマットチ
ェック部９からは何も出力されない、スキャナを上から
下に動かして（Ｃ２）の多数決結果「Ｃ１２３４５６７
８９０Ｊが得られると、これは所定のフォーマットを満
たすのでフォーマントチェック部１９からはこの多数決
結果が出力Ｒｉ　される、比較器２２ではＲムと前回レ
ジスタの内容Ｒｉ、が比較されるが、電源を入れた直後
は前回レジスタの内容は消去されているので、Ｒ，とＲ
ｉ　−１の内容は必ず一致しない、そのため比較器２２
からはＮＥＷの信号が出力制御部２３に出て、出力制御
部２３からはｒｃ１２３４５６７８９０Ｊが、行認識結
果ＲＬＩＮＥとして出力される。一方、前回レジスタ２
１にはｒｃ１２３４５６７８９０Ｊが記憶される０次に
（Ｃコ）の多数決結果「Ｃ１２３４５６７８９０Ｊが得
られるが、（Ｃよ）の時と同様にフォーマットチェック
部を経て、比較器２２に送られる。しかし、前回レジス
タ２１の内容Ｒｉ、がｒｃ１２３４５６７８９０ｊにな
っておりＲｉ　と一致するので、比較器からはもはやＮ
ＥＷの信号が出ず、出力制御部２３からＲＬＩ□として
出力されない０以上の様にして、視野に入った一つの行
に対しては一回だけ行認識結果ＲＬＩ□を出力する。The operations 19 to 23 when reading a price tag will be explained using FIGS. 6 and 7. Majority result of (c,) "?'
Since il'i17'il'7'i'7'i"1lQJ does not satisfy the predetermined format, nothing is output from the format checker 9. Move the scanner from top to bottom and check the majority decision (C2). Result “C1234567
When 890J is obtained, it satisfies the predetermined format, so the formant check section 19 outputs this majority result Ri.The comparator 22 compares Rm with the previous register contents Ri, but the power supply Immediately after inputting , the contents of the previous register have been erased, so R, and R
The contents of i −1 do not necessarily match, so the comparator 22
A NEW signal is output from the output control unit 23, and the output control unit 23 outputs rc1234567890J as the line recognition result RLINE. On the other hand, the previous register 2
rc1234567890J is stored in 1. The majority result of the 0th order (C) is ``C1234567890J, but it is sent to the comparator 22 through the format check section as in the case of (C).However, the previous time Since the content Ri of register 21 is rc1234567890j and matches Ri, the comparator no longer outputs N.
The line recognition result RLI□ is output only once for one line that has entered the field of view, such that the EW signal is not output and the output control unit 23 does not output the RLI□.

スキャナを動かして複数行を読み取るときの動作を第７
図を用いて説明する。フォーマットチェック部１９には
ＣＳＮ、￥で始まる各行のフォーマットが登録されてい
るとする。まず＜ａ＞の値札でｒｃ１２３４５６７８９
０Ｊの行にスキャナを当てかったときは先程説明した通
り、−回だけ行認識結果ＲＬＩＮＩ！として出力される
０次にスキャナを下に動かしてｒＮ１２３４５６７８９
０Ｊの行にスキャナを当てかったとき、視野に「Ｎ１２
３４５６７８９０Ｊの行が入って、多数決処理部１８か
ら初めてｒＮ１２３４５６７８９０Ｊの認識結果が得ら
れたときは、前回レジスタの内容Ｒト１はｒｃ１２３４
５６７８９０Ｊになっているので、比較器２２からはＮ
ＥＷの信号が出て、ｒＮｌ　２３４５６７８９０Ｊが行
認識結果ＲＬＩ□として出力される。それ以降は繰り返
してｒＮ１２３４５６７８９０Ｊが多数決処理部１８か
ら出力されても前回レジスタの内容と一致するため、Ｒ
ＬＩ□として出力されない。すなわち、「Ｎ１２３４５
６７８９０Ｊは一回だけ出力される。同様にして、「￥
１２３．４５６．Ｊの行が視野に入ったときは、行認識
結果ＲＬＩ□として［￥１２３゜４５６、Ｊが出力され
る。なお、ＪＩＳ　　Ｂ９５５１によるＰｏＳ用値札で
は、一つの値札の中の各行は、異なる内容になっている
。このため、比較器２２で前回認識した結果Ｒ４−３と
今回認識した結果Ｒ１を比較することで、同じ行を読ん
だが否かが判別できるのである。Chapter 7 describes the operation when moving the scanner to read multiple lines.
This will be explained using figures. It is assumed that the format check unit 19 has registered the format of each line starting with CSN and ¥. First, the price tag of <a> is rc123456789.
As explained earlier, when the scanner hits the line 0J, the line recognition result RLINI! Moving the scanner down to the 0th order outputs rN123456789
When I applied the scanner to the 0J line, I saw "N12" in my field of view.
When the line 34567890J is entered and the recognition result rN1234567890J is obtained from the majority processing unit 18 for the first time, the previous register content R1 is rc1234.
Since it is 567890J, N is output from comparator 22.
The EW signal is output, and rNl 234567890J is output as the row recognition result RLI□. After that, even if rN1234567890J is repeatedly output from the majority processing unit 18, it matches the contents of the previous register, so R
It is not output as LI□. In other words, “N12345
67890J is output only once. Similarly, “￥
123.456. When the row J comes into view, [¥123°456, J] is output as the row recognition result RLI□. Note that in the PoS price tag according to JIS B9551, each line in one price tag has different contents. Therefore, by comparing the previous recognition result R4-3 with the current recognition result R1 using the comparator 22, it is possible to determine whether or not the same line has been read.

値札を（ａ）から（ｂ）に持ち換える間は、視野には文
字が入っていない、このときは、多数決処理部１８から
は何も出力が得られない、タイマー２０は多数決処理部
１８が認識結果を出力してからの経過時間を測定してお
り、値札を持ち換えたために、ＴＣＬＩ以上各文字認識
手段から出力が得られない状態が続くと、前回レジスタ
２１の内容を消去する。したがって、値札を（ｂ）に持
ち換えて「￥１２３．４５６．Ｊの行にスキャナを当て
かうと「￥１２３，４５６．Ｊは出力される。While the price tag is being changed from (a) to (b), there are no characters in the visual field.At this time, no output is obtained from the majority decision processing section 18, and the timer 20 indicates that the majority decision processing section 18 The time elapsed since the recognition result was output is measured, and if the condition in which no output is obtained from each character recognition means continues for more than TCLI because the price tag is changed, the contents of the previous register 21 are erased. Therefore, if you switch the price tag to (b) and apply the scanner to the line ``¥123,456.J,'' ¥123,456.J will be output.

すなわち、同じ内容の行であっても異なる値札ならば続
けて読み取ることができる０以上の説明から判るように
、タイマー２０は原稿（値札）の交換を検出する機能を
持っている。In other words, as can be seen from the explanation of 0 or more, the timer 20 has a function of detecting the exchange of manuscripts (price tags), so that even if the lines have the same content, different price tags can be read consecutively.

[Problem that the invention seeks to solve]

スキャナ１を行に対して傾けて（スキューをかけて）行
に当てかったときの様子を第８図に示す。FIG. 8 shows the situation when the scanner 1 is tilted (skewed) with respect to the row and applied to the row.

スキニー角θが小さく、（ａりのように行全体が視野の
中に入るタイミングのあるとき、（ａ、）（ａｓ　）（
ａｓ　）の認識結果（ｂ＋　）（ｂｚ　）（ｂ、）の多
数決結果（Ｃ）には行の認識結果が正しく得られている
。When the skinny angle θ is small and there is a timing when the entire row is within the field of view like (a), (a,) (as) (
The majority result (C) of the recognition result (b+) (bz) (b, ) for (as) shows that the line recognition result is correct.

スキュー角θを大きくしたときの様子を第９図に示す、
スキュー角θの大きさは視野に完全に入った一つの文字
が認識出来なくなる程大きくはないとする。スキュー角
が太き（なることで、行全体が視野に入るタイミングは
ないが、（ａｌ）で行の右端が視野に入り、（ａｓ）で
行の左端が視野に入ることで多数決結果（Ｃ）には行の
認識結果が正しく得られている。Figure 9 shows the situation when the skew angle θ is increased.
It is assumed that the skew angle θ is not so large that a single character completely within the visual field cannot be recognized. Due to the large skew angle, there is no timing when the entire row will be in the field of view, but the right end of the row will be in the field of view with (al) and the left end of the row will be in the field of view with (as), so the majority result (C ), the line recognition results are correctly obtained.

スキュー角θを大きくしたまま、ゆっくりとスキャナを
上から下に動かした時の、視野の動きと認識結果を第１
０図に示す、スキュー角θが大きいことで、行全体が視
野の中に入るタイミングはない、また、スキャナがゆっ
くりと動いているため、どの隣接している認識結果（ｂ
＋　）　　（ｂｔ４＋　）（ｂ＋＋＊）（ｉは１から８
までの整数）の多数決結果をとっても行全体の認識結果
を正しく得ることができない。The first graph shows the movement of the field of view and the recognition results when the scanner is slowly moved from top to bottom while keeping the skew angle θ large.
As shown in Figure 0, due to the large skew angle θ, there is no timing for the entire row to be within the field of view, and since the scanner is moving slowly, which adjacent recognition result (b
+ ) (bt4+ ) (b++*) (i is from 1 to 8
Even if you take the majority vote result for the whole line (up to an integer up to), it is not possible to obtain the correct recognition result for the entire line.

第１０図の説明から判るように、従来技術による光学文
字読取装置では、視野に行全体が一度に入らないような
大きなスキュー角で、ゆっくりとスキャナを動かして行
を捉えたときはその行全体の認識結果を得ることができ
ない０本発明はこの欠点を解消するために案出されたも
ので、視野に行全体が一度に入らないようなスキニー角
でも光学文字読取装置の行全体の認識を可能にすること
を目的としている。As can be seen from the explanation of FIG. 10, in the conventional optical character reading device, when the scanner is moved slowly and the line is captured at a large skew angle so that the entire line is not included in the field of view at once, the entire line is captured. The present invention was devised to solve this problem, and it is possible to recognize the entire line of an optical character reading device even at skinny angles where the entire line cannot be seen at once in the field of view. It aims to make it possible.

[Structure of the invention]

第１図に本発明を用いた光学文字読取装置の構成例を示
す。図中１から２３までの符号をつけた部分は第２図の
従来技術の光学文字読取装置で同符号を付けた部分と同
じ機能・構成を持つ、ただし、桁合わせ処理部１７′は
第２図の桁合わせ処理部１７の機能に付は加えて、桁合
わせ後の文字のＸ座標の値も出力する０桁合わせ後の文
字のＸ座標はたとえば、同一桁と見なせる文字のＸ座標
の値の平均値を用いる。８から１８までが、イメージセ
ンサの捉えた画面の中の各文字を認識し、その位置を出
力する文字認識手段である０文字認識手段から得られた
、視野に入った各文字の認識結果とその位置情報を以下
の説明では認識行と呼ぶことにする。FIG. 1 shows an example of the configuration of an optical character reading device using the present invention. The parts numbered 1 to 23 in the figure have the same functions and configurations as the parts numbered the same in the conventional optical character reading device shown in FIG. In addition to the functions of the digit alignment processing unit 17 shown in the figure, it also outputs the value of the X coordinate of the character after digit alignment.The X coordinate of the character after 0 digit alignment is, for example, the value of the Use the average value of 8 to 18 are the recognition results of each character that entered the field of view obtained from the 0 character recognition means, which is a character recognition means that recognizes each character on the screen captured by the image sensor and outputs its position. In the following explanation, this position information will be referred to as a recognized line.

３１．３２．３３は視野に複数の領域を設定したときに
、それぞれの領域に行の一部分が存在するか否かを検出
する行検出手段である。第１図では行検出手段が３つ存
在する場合を例示しているが、複数ならば３以外の数の
場合の構成も可能である。３４．３５．３６は、それぞ
れ対応する行存在検出手段が行を検出したときに認識行
を記憶する行記憶手段である。３７は、行記憶手段のそ
れぞれに記憶された認識行において、位置情報に基づい
て文字同士を対応づける桁合わせ処理手段である。３９
は、桁合わせ処理手段によって、各文字同士の対応が付
けられた認識行を総合して一つの行を得る行総合処理手
段である。以下の説明では、行総合処理手段によって得
られた行を総合行と呼ぶことにする。３９は、行入れ換
わり検出手段であり、イメージセンサ６の捉える行が入
れ換わった事を検出する手段である。Reference numerals 31, 32, and 33 are line detection means for detecting whether a part of a line exists in each area when a plurality of areas are set in the field of view. Although FIG. 1 shows an example in which there are three row detection means, a configuration in which there are a plurality of row detection means other than three is also possible. 34, 35, and 36 are line storage means that store recognized lines when the corresponding line existence detection means detects a line. Reference numeral 37 denotes a digit alignment processing means for associating characters with each other based on position information in the recognized lines stored in each of the line storage means. 39
is a line synthesis processing means that obtains one line by synthesizing recognized lines in which correspondences between characters have been added by a digit alignment processing means. In the following description, the line obtained by the line synthesis processing means will be referred to as a comprehensive line. Reference numeral 39 denotes a line change detection means, which is a means for detecting that the lines captured by the image sensor 6 have been changed.

[Effect]

スキャナ１を新たな行に当てかったときは、行入れ換わ
り検出手段３９の働きにより、行記憶手段３４．３５．
３６の内容は消去されている。以下に、行検出手段、行
記憶手段、桁合わせ処理手段、行総合処理手段の動作を
第１１図の例を用いて説明する。When the scanner 1 is applied to a new line, the line storage means 34, 35, .
The contents of 36 have been deleted. The operations of the line detection means, line storage means, digit alignment processing means, and line comprehensive processing means will be explained below using the example shown in FIG.

いま、視野３０が（ａ）で示すように、Ｓ７、Ｓｔ　、
Ｓｓの３つの領域に分割されており、それぞれの領域に
おいて行が入っているか否かを行検出手段＃１、＃２、
＃３がそれぞれ判定するとする。第１１図（ｂ）に示す
行をスキャナを傾けながら読みとると、視野と行の位置
関係ならびに認識行はそれぞれ（Ｃ＋　）（ｃ、）Ｃｃ
ｓ　）と（ｄ、）（ｄｇ　）（ｄｓ　）のように逐次変
化する。（ｄ、）（ｄｇ　）　　（ｄｓ　）の認識行が
逐次得られるときの行記憶手段の記憶内容と行総合処理
結果を（ｅ）に示す。Now, as the visual field 30 shows in (a), S7, St,
Ss is divided into three areas, and line detection means #1, #2,
Assume that #3 is determined respectively. When reading the line shown in Figure 11(b) while tilting the scanner, the positional relationship between the field of view and the line and the recognized line are (C+) (c,)Cc, respectively.
s) and (d,)(dg)(ds). (e) shows the memory contents of the line storage means and the result of the line synthesis process when the recognized lines (d,) (dg) (ds) are obtained one after another.

スキャナを行に当てかう前は、行入れ換わり検出手段３
９によって、行記憶手段＃１、＃２、＃３の内容は消去
状態にある。まず、（Ｃ，）のように視野のＳ、の領域
に行が入りて、（ｄ＋　）の認識行が得られたときの様
子を第■段階に示す。Before applying the scanner to a line, the line swap detection means 3
9, the contents of the row storage means #1, #2, #3 are in an erased state. First, the situation when a line enters the field of view S, as shown in (C,), and the recognized line (d+) is obtained is shown in step (2).

行がＳ、の領域に入ったことを行検出手段＃３が検出し
、認識行は行記憶手段＃３に記憶される。Line detection means #3 detects that the line has entered the area S, and the recognized line is stored in line storage means #3.

各行記憶手段に記憶されている内容１０１．１０２．１
０３は桁合わせ処理手段３７によって、桁合わせ処理が
行われる（１０４）、第■段階は１０１．１０２は消去
状態にあるので、行総合処理によって得られる総合行１
０５は１０３と同じになる。そして、１０５で得られた
総合行の文字はフォーマットチェック部１９に送られる
。今、フォーマットチェック部は第１１図（ｂ）に対応
してｒＶＪの文字の後には１２個の数字が続く、と登録
されているとする。１０５は所定のフォーマットに合致
しないので、フォーマントチェック部からは何も出力さ
れない。Contents stored in each row storage means 101.102.1
03 is subjected to digit alignment processing by the digit alignment processing means 37 (104), and since 101.
05 is the same as 103. Then, the characters of the general line obtained in step 105 are sent to the format check section 19. It is now assumed that the format check section has registered that 12 numbers follow the letters rVJ, corresponding to FIG. 11(b). 105 does not match the predetermined format, so nothing is output from the formant check section.

次に、第１１図（Ｃ３）のように視野のＳ、の領域が入
って、（ｄｔ　）の認識行が得られたときの様子を第■
段階に示す０行がＳ、の領域に入ったことを行検出手段
＃２が検出し、認識行は行記憶手段＃２に記憶される。Next, as shown in Fig. 11 (C3), the situation when the field of view S is entered and the recognition line (dt) is obtained is shown in Fig.
Line detecting means #2 detects that line 0 shown in the step enters the area S, and the recognized line is stored in line storing means #2.

このとき、行記憶手段＃１、＃３の内容は第Φ段階の時
と同じままである。At this time, the contents of row storage means #1 and #3 remain the same as in the Φth stage.

各行記憶手段に記憶されている内容１０６．１０７．１
０８は桁合わせ処理手段３７によって、桁合わせ処理が
行われる（１０９）、行総合処理においては、各桁にお
いて！！識された文字を選び出して総合行１１０を得る
。そして、１１０の総合行の文字はフォーマットチェッ
ク部１９に送られる。１１０は所定のフォーマントに合
致しないので、フォーマットチェック部からは何も出力
されない。Contents stored in each row storage means 106.107.1
08 is subjected to digit alignment processing by the digit alignment processing means 37 (109), in each digit in the line comprehensive processing! ! The identified characters are selected to obtain a composite line 110. The characters in the 110th general line are then sent to the format check section 19. 110 does not match the predetermined format, so nothing is output from the format check section.

次に、第１１図（Ｃ３）のように視野の８１の領域に行
が入って、（ｄ、）の認識行が得られたときの様子を第
■段階に示す０行がＳ、の領域に入ったことを行検出手
段＃１が検出し、認識行は行記憶手段＃１に記憶される
。第■、■段階のときと同様にして、１１１．１１２．
１１３は桁合わせ処理手段３７によって、桁合わせ処理
が行われる（１１４）、行総合処理においては、各桁に
おいて認識された文字を選び出して総合行１１５を得る
０行全体は第■段階の総合行で得られ、このときフォー
マットチェック部１９、出力制御部２３を経て出力され
る（第１図ＲＬ＋ｓｘ）　＊以上の処理により、スキャ
ナを原稿上の行に当てがって動かしたとき行全体が視野
に一度に入らなくても、行全体の認識結果を得ることが
できる。Next, as shown in FIG. 11 (C3), the line enters the 81 area of the visual field and the recognized line of (d,) is obtained. The line detection means #1 detects that the line has entered the line, and the recognized line is stored in the line storage means #1. 111.112.
113 is subjected to digit alignment processing by the digit alignment processing means 37 (114).In the line synthesis process, characters recognized in each digit are selected to obtain the overall line 115.The entire 0 line is the overall line of the stage At this time, it is output via the format check section 19 and the output control section 23 (RL+sx in Figure 1). It is possible to obtain recognition results for the entire row without having to enter the entire line at once.

スキャナをさらに動かして次の行の読み取りに移るとき
は、行入れ換わり検出手段３日が行の入れ換わりを検出
して、行記憶手段＃１．＃２、＃３の内容を消去する。When the scanner is moved further to read the next line, the line swap detection means #3 detects the line swap, and the line storage means #1. Delete the contents of #2 and #3.

なお、行全体が（ｆ）のように一度に視野に入って認識
行（ｇ）が得られた場合の動作は（ｈ）のようになる、
Ｓ７、Ｓ２、Ｓ３の各領域において、行が視野に入った
ことが行検出手段＃ｌ、＃２、＃３で検出されるので、
（ｇ）の認識行は行記憶手段＃ｌ、＃２、＃３のそれぞ
れに記憶される（１１６．１１７．１１８）。そして、
桁合わせ処理・行総合処理の結果１２０で、行全体が得
られる。In addition, when the entire line enters the field of view at once as in (f) and the recognized line (g) is obtained, the operation is as shown in (h).
In each region S7, S2, and S3, the row detection means #l, #2, and #3 detect that the row has entered the field of view, so
The recognized line in (g) is stored in each of line storage means #1, #2, and #3 (116.117.118). and,
The entire row is obtained as a result 120 of the column alignment processing/row integration processing.

〔Example〕

行検出手段＃１、＃２、＃３と行入れ換わり検出手段の
実施例を第１２図に示す、（ａ）はイメージセンサの視
野３０を表しており、Ｓｒ　、Ｓｔ、Ｓ、の領域に分割
する。そしてそれぞれの領域においてＲ＋　−Ｒｓ　、
Ｒｓで例示されているように行を検出する範囲を設定す
る０行検出手段はこの範囲の中に行があることを検出す
る。（ｂ）は行検出手段と行入れ換わり検出手段を実施
する回路である。１２１．１２２．１２３は、それぞれ
Ｒｌ　ｓＲｚ、Ｒｓの範囲で横ＯＲ演算を行う回路であ
る。An embodiment of the line detection means #1, #2, #3 and the line interchange detection means is shown in FIG. To divide. And in each region R+ −Rs,
The zero line detection means, which sets a range for detecting lines as exemplified by Rs, detects that there is a line within this range. (b) is a circuit implementing the row detection means and the row interchange detection means. 121, 122, and 123 are circuits that perform a horizontal OR operation in the ranges of Rl, sRz, and Rs, respectively.

１２４．１２５．１２６は横ＯＲの結果において連続す
る黒画素の長さが所定範囲であるか否かを判定する黒長
さ判定部である０文字行が範囲Ｒｉ（ｉ−１，２，３）
の中にあるときは、横ＯＲ結果はその文字に対応して黒
画素が文字の高さ分だけ連続するので、黒長さ判定部に
よって文字がその領域にあるか否かが判定され、その結
果がＥＸＩＳＴｉの信号線に出力される。ＥＸＩＳＴ、
の信号は対応する行記憶手段に送られ、各行記憶手段が
ｆ！識行を記憶するタイミングを与える。１２７はＥＸ
ＴＳＴ＋　、ＥＸＩＳＴｘ　、ＢＸＩＳＴ３の論理和を
とるオアゲートであるる。１２７の出力ＥＸ　Ｉ　ＳＴ
はＲ，、Ｒオ、Ｒｊの何れかに文字があれば真になる。124.125.126 is a black length determination unit that determines whether the length of consecutive black pixels in the horizontal OR result is within a predetermined range. )
If the horizontal OR result corresponds to that character, the black pixels are consecutive for the height of the character, so the black length determination unit determines whether the character is in that area, and The result is output to the EXISTi signal line. EXIST,
The signal f! is sent to the corresponding row storage means, and each row storage means receives f! Gives you the timing to memorize your knowledge. 127 is EX
This is an OR gate that takes the logical sum of TST+, EXISTx, and BXIST3. 127 output EX I ST
becomes true if there is a character in any of R,, Ro, and Rj.

行の入れ換わりを検出するには、行が視野から出ていく
こと、すなわち、ＢＸＩＳＴが偽になることを検出すれ
ばよい、１２８はＥＸ　Ｉ　ＳＴの論理を反転して、行
の入れ換わり信号ＡＬＬＣＬＲを作成するインバータゲ
ートである。ＡＬＬＣＬＲの信号はすべての行記憶手段
に送られ、行記憶手段の内容を消去するタイミングを与
える。To detect a row swap, it is sufficient to detect that the row leaves the field of view, that is, when BXIST becomes false. 128 inverts the logic of EX I ST and outputs the row swap signal. This is an inverter gate that creates ALL CLR. The ALLCLR signal is sent to all row storage means and provides the timing for erasing the contents of the row storage means.

行が視野の中に入っていないときは、多数決処理部１８
からは認識行が得られない、そこで、認識行が得られな
いことを検出して行の入れ換わりと判定する実施例も可
能である。認識行が得られない時間を測定してそれが所
定時間（ＴＬとする）を越えることで、行の入れ換わり
と判定する実施例を第１３図に示す、第１３図では、タ
イマー１３０は認識行が得られない時間がＴＬ以上続く
とＴ　Ｌｕｐの信号を出す。ＴＬは、行が視野の中にあ
るときに繰り返し認識行が得られる周期（たとえば０．
２秒）よりもやや大きく　（たとえば０．２５秒）設定
しておく。When the row is not within the field of view, the majority decision processing unit 18
Therefore, an embodiment is also possible in which the fact that no recognized line is obtained is detected and it is determined that the lines have been swapped. FIG. 13 shows an embodiment in which it is determined that a row has been replaced by measuring the time during which a recognized row is not obtained and the time exceeds a predetermined time (referred to as TL). In FIG. 13, the timer 130 If a row is not obtained for a period longer than TL, a T Lup signal is output. TL is the period (e.g., 0.
2 seconds) (for example, 0.25 seconds).

第１３図においては、１から２３までの符号をつけた部
分は第１図の同符号の部分と同じ機能・構成である。６
はイメージセンサであり、第２図の場合と同様に、少な
くとも用紙３に記載された文字の一行分の視野が必要で
あり、第１３図では横は一行分、縦は一文字の３倍くら
いとしている。また、複数行が原稿に記載されている場
合は、一度に複数行が視野に入ると複雑な処理が必要に
なるので、処理系を単純にするには、イメージセンサの
視野高さを原稿上の行間隔よりも小さくした方がよい。In FIG. 13, the parts numbered 1 to 23 have the same functions and configurations as the parts with the same numbers in FIG. 6
is an image sensor, and as in the case of Figure 2, it requires a field of view of at least one line of characters written on paper 3, and in Figure 13, the width is one line and the height is about three times the length of one character. There is. In addition, if multiple lines are written on the document, complex processing is required if multiple lines enter the field of view at once, so to simplify the processing system, the height of the image sensor's field of view should be set above the document. It is better to make it smaller than the line spacing.

第１３図では、行検出手段＃１から＃３と行記憶手段と
桁合わせ処理手段と行総合処理手段は、マイクロプロセ
ッサ１３１とＲＯＭ１３２とＲＡＭ１３３を用いて実施
されている。ＲＯＭ１３２には行検出手段と桁合わせ処
理手段と行総合処理手段を実施するためのマイクロプロ
セッサ１３１のプログラムが格納されている０行記憶手
段３４はＲＡＭ１３３上に設定された変数領域で実施さ
れている。In FIG. 13, the line detection means #1 to #3, the line storage means, the digit alignment processing means, and the line synthesis processing means are implemented using a microprocessor 131, ROM 132, and RAM 133. The ROM 132 stores a program for the microprocessor 131 for implementing the line detection means, digit alignment processing means, and line comprehensive processing means. The zero line storage means 34 is implemented in a variable area set on the RAM 133. .

マイクロプロセッサ１３１の処理の概略フローチャート
を第１４図に示す、光学文字読取装置の電源を投入した
ときは、■から処理が始まる。■はＲＡＭに設けた行記
憶域の内容を消去する処理である。■はタイマー１３０
がＴＬＵＰ信号を出しているか否かを判定する処理で、
ＴＬｔ＋？信号が出ているとき、すなわち行の入れ換わ
りがあったときは■の処理に進む、■は認識行が得られ
ているか否かの判定である。■は！！識行を読み込む処
理である。■から［株］が行検出手段を実施しており、
視野を複数の領域に分けたときに各領域において行が入
ったことを検出して、それぞれの領域に対応する行記憶
域に認識行を記憶する処理である。第１４図のフローチ
ャートは、視野を３つの領域Ｓ１、Ｓｔ、Ｓｓ（第１５
図（ａ））に分けた場合の処理過程であるｅｓＩの領域
に対応する行記憶域＃１にｔｇａｉ行を記憶する処理は
０である。■は認識行のＳｌの領域において認識された
（リジェクトでない）文字の数が、行記憶域＃１に記憶
されている内容のＳｌの領域において認識されている文
字の数よりも多いか否かを判定する処理である。■■、
■［相］はそれぞれＳｔ　、Ｓｓの領域に関して■■と
同様の処理を行うことを示している。■は行記憶域＃ｌ
から＃３に記憶されている内容を認識された文字の位置
情報に基づいて相対応する文字同士を求める桁合わせ処
理である。＠は■で行われた桁合わせ結果に基づいて総
合処理を行い、総合行を求める処理である。０は総合行
をフォーマットチェック部１９に送り出す処理である。A schematic flowchart of the processing of the microprocessor 131 is shown in FIG. 14. When the power of the optical character reading device is turned on, the processing starts from ①. (2) is a process for erasing the contents of the row storage area provided in the RAM. ■ is timer 130
In the process of determining whether or not the is outputting a TLUP signal,
TLt+? When the signal is being output, that is, when there is a change in the rows, the process proceeds to step (2), in which it is determined whether or not a recognized row has been obtained. ■Ha! ! This is the process of reading the information. From ■, [stock] has implemented line detection means,
This is a process of dividing the field of view into a plurality of areas, detecting the presence of a line in each area, and storing the recognized line in the line storage area corresponding to each area. The flowchart in FIG. 14 divides the visual field into three regions S1, St, and Ss (
The process of storing the tgai line in the line storage area #1 corresponding to the esI area, which is the processing process in the case of division in Figure (a)), is 0. ■Whether or not the number of characters recognized (not rejected) in the Sl area of the recognized line is greater than the number of characters recognized in the Sl area of the content stored in line storage area #1. This is the process of determining. ■■,
■[Phase] indicates that the same processing as ■■ is performed for the St and Ss regions, respectively. ■ is row storage area #l
This is a digit matching process for finding characters that correspond to each other based on the position information of the recognized characters stored in #3. @ is a process in which comprehensive processing is performed based on the result of digit alignment performed in ■ to obtain a comprehensive row. 0 is a process of sending a comprehensive line to the format check unit 19.

第１６図から第１８図に、第１４図の処理の詳細フロー
チャートを示す、以降の説明では、第１５図のように変
数、定数を用いるとする。すなわち、第１５図（ａ）の
ように、Ｓｌ　、Ｓｔ　、Ｓｓの各領域のＸ座標の範囲
は、Ｘ、からＸ８まで、Ｘ、からＸ。16 to 18 show detailed flowcharts of the process shown in FIG. 14. In the following explanation, variables and constants are used as shown in FIG. 15. That is, as shown in FIG. 15(a), the range of the X coordinate of each region of Sl, St, and Ss is from X to X8, and from X to X.

まで、Ｘ、からＸ４までとする。また、（ｂ）の表のよ
うに、ｉ！！ｍ行はＮ文字であり、その文字と位置をａ
＋：）　、　ｘｆ：ｌと表し、行記憶域＃１に記憶され
ている内容はＩ文字あり、その文字と位置をａｌ、ｘ（
！ｌと表しくｉ−１，２、・・・・・・、■）、また、
Ｓｌの領域にはいる認識された（リジェクトでない）文
字数をＰｌと表す０行記憶域＃２、＃３についても同様
に、Ｊ％　ａ７　、Ｘｊ−、ｐｇとＫｓａ＊ｓｘ′：）
、Ｐ、の記号を使うこととする０桁合わせ後の桁数はし
て表し、各桁の位置はｙ、で、文字の組合わせはす、、
ｂＨ、ｂｌで表すとする。総合行の文字数はＭで、文字
はＣ１で表すとする。, X, to X4. Also, as shown in table (b), i! ! Line m has N characters, and the characters and positions are a
+:), xf:l, the contents stored in row storage area #1 include I characters, and the characters and positions are expressed as al, x(
! i-1, 2, ......, ■), also expressed as l,
Similarly, for 0-line storage areas #2 and #3 where the number of recognized (non-rejected) characters in the area of Sl is expressed as Pl, J% a7, Xj-, pg and Ksa*sx':)
The number of digits after zero digit adjustment is expressed as , P, and the position of each digit is y, and the combination of letters is,
Let it be expressed as bH and bl. It is assumed that the number of characters in the total line is M, and the character is represented by C1.

第１６図に、第１４図■■の処理の詳細フローチャート
を示す、■はＳｌの領域に入る認識行の認識された文字
数を数える変数ｐを初期化する処理である。■■■は■
■■の処理を繰り返すための繰り返し処理である。■は
、■で処理対象とするａ７がＳｌの領域に入るか否かの
判定処理である。■■はａｌｌがリジェクト（？の記号
で表わす）でないときにｐの数を１増やす処理である。FIG. 16 shows a detailed flowchart of the process shown in FIG. ■■■ is■
This is an iterative process for repeating the process of ■■. (2) is a process for determining whether or not a7 to be processed in (2) falls within the area of Sl. ■■ is a process in which the number of p is increased by 1 when all is not rejected (represented by a ? symbol).

■はｐがＰ、よりも多いか否かを判定する処理である。(2) is a process of determining whether p is greater than P.

■から■までの処理が第１４図の■の処理の詳細な処理
である。■はｐを新たなＰ＋　として登録し、Ｎ゛　を
行記憶域＃１に記憶する文字数■として登録する処理で
ある。＠■＠０は認識行を行記憶域＃１に記憶する処理
である。The processing from ① to ② is the detailed processing of ② in FIG. (2) is a process in which p is registered as a new P+ and N' is registered as the number of characters (2) to be stored in the row storage area #1. @■@0 is a process of storing the recognized line in the line storage area #1.

第１７図に、第１４図■の桁合わせ処理の詳細フローチ
ャートを示す、第１７図のフローチャートにおいては■
、■で初期化を行っている。■はけ記憶域に記憶した内
容の最後を予め特別の大きな値Ｘ１１によって示してお
く処理である。Ｘ１４は第１５図（ａ）のＸ４に、後述
するＷの値を加えたものよりも大きな値にしておく、■
は注目している、行記憶域＃１のｉ番目の文字の座標ｘ
３：ゝ、行記憶域＃２の」番目の文字の座標Ｘ＜ｉ′、
行記憶域＃３のに番目の文字の座標ｘＴの最小値Ｘｍ１
ｎを求める処理である。■ではＸ■ｉｎが■の処理で用
いたＸＮと同じになっていれば、すべての文字の桁合わ
せ処理が終わったと判断している。■において用いてい
るＷは同じ桁であると判断できるＸ座標の幅を示してい
る。いま、ｘｌ：′≦Ｘ■ｉｎ＋Ｗが成り立てば、ｘＴ
は現在処理を進めている桁に入る場合の処理、すなわち
、■、■の処理を行う、■は行記憶域＃１のｉ番目の文
字ａ（１１を桁合わせ後の文字ｂｔ）として登録する処
理である。■は１番目の処理が終わったのでｉの値を増
やす処理である。FIG. 17 shows a detailed flowchart of the digit alignment process shown in FIG. 14.
,■ is used for initialization. (2) This is a process in which the end of the contents stored in the brush storage area is indicated in advance by a special large value X11. Set X14 to a value larger than the sum of X4 in Figure 15(a) and the value of W, which will be described later.■
is the coordinate x of the i-th character in the row storage area #1 that we are looking at
3: ゝ, coordinates of the ``th character in row storage area #2, X<i',
Minimum value Xm1 of the coordinate xT of the second character in row storage area #3
This is the process of finding n. In case (2), if X■in is the same as XN used in the process (2), it is determined that the digit alignment process for all characters has been completed. W used in (2) indicates the width of the X coordinate that can be determined to be the same digit. Now, if xl:'≦X■in+W holds, then xT
performs the processing when it falls into the digit currently being processed, that is, the processing of ■ and ■.■ is registered as the i-th character a (character bt after digit adjustment of 11) in row storage area #1. It is processing. (2) is a process in which the value of i is increased since the first process has been completed.

一方、ｘ１１ゝが現在処理を進めている桁に入らない場
合は■の処理を行う、■の処理は該当する桁に行記憶域
＃１が得られていない（文字欠け）という記号＃をす、
に登録する処理である（＃は第５図（ｂ）における文字
と文字の間の空白と同じ意味である）、■〜■と同様に
して、行記憶域＃２、行記憶域＃３に対する処理■〜＠
、０〜■を行う。On the other hand, if x11゜ does not fit into the column currently being processed, process ■ is performed. ,
(# has the same meaning as the space between characters in Fig. 5(b)). In the same way as ■ to ■, register for line storage area #2 and line storage area #3. Processing■〜＠
, 0 to ■.

Ｏは桁合わせ後の位置としてＸｍ１ｎをｙ、に登録する
処理である。［相］は桁合わせ後の文字数ｌを１増やす
処理である。［相］は最終的に得られた文字数ｚ−１を
変数りとして登録しておく処理である。O is a process of registering Xm1n in y as the position after digit alignment. [Phase] is a process in which the number l of characters after digit alignment is increased by one. [Phase] is a process in which the finally obtained number of characters z-1 is registered as a variable.

第１８図＜ａ＞は、第１４図＠の総合処理の詳細フロー
チャート（その１）である、■は総合行の文字を数える
変数ｍを初期化する処理である。■■＠は■から■の処
理を、桁合わせ後の桁数りだけ繰り返して行うための繰
り返し処理である。■によって、注目している１番目の
桁の座標ｙ、がＳｌの領域に入るか否かが判定され、Ｓ
、の領域に入るときは■に進む、■［Ｆ］においては、
行記憶域＃１の、桁合わせ後の文字ｂＴ１１が＃でない
ならば（文字欠けでないならば）、それを総合行の文字
Ｃ，とじて登録する処理を行っている。■では、ｙ。FIG. 18 <a> is a detailed flowchart (part 1) of the general processing shown in FIG. ■■@ is an iterative process in which the processes from ■ to ■ are repeated as many times as the number of digits after digit alignment. Based on ■, it is determined whether the coordinate y of the first digit of interest falls within the area of Sl, and S
When entering the area of , proceed to ■, and in ■[F],
If the character bT11 in the line storage area #1 after alignment is not # (if there is no missing character), a process is performed to register it as the character C in the general line. ■So, y.

がＳｌの領域に入るか否かが判定され、Ｓ２の領域に入
るときは■に進む、■■においては、行記憶域＃２の、
桁合わせ後の文字す、が＃でないならば（文字欠けでな
いならば）、それを総合行の文字Ｃ１とじて登録する処
理を行っている。［株］に達する場合はｙ、がＳｓの領
域に入る場合であり、０■においては、行記憶域＃３の
、桁合わせ後の文字す、が＃でないならば（文字欠けで
ないならば）、それを総合行の文字Ｃ１として登録する
処理を行っている。■は、総合行の文字数ｍ−１を変数
Ｍに登録する処理である。It is determined whether or not it enters the area of Sl, and if it enters the area of S2, proceed to ■. In ■■, the row storage area #2,
If the character S after alignment is not # (if there is no missing character), a process is performed to register it as the character C1 in the general line. If it reaches [stock], y enters the area of Ss, and in 0■, if the character S after column alignment in row storage area #3 is not # (if no character is missing) , a process is being performed to register it as the character C1 of the general line. (2) is a process of registering the number of characters (m-1) in the general line in the variable M.

第１８図（ａ）は各領域Ｓ＋　、Ｓｔ　、Ｓｓ毎に用い
る行記憶域＃１、＃２、＃３を分けた場合の処理である
が、領域毎に用いる行記憶域を限定しない処理例も可能
であり、第１８図（ｂ）に示す、第１８図（ｂ）におい
て、■は桁合わせ後の桁数りを総合行の文字数Ｍとして
登録する処理である。■■［相］は■から０の処理をＬ
だけ繰り返して行うための繰り返し処理である。FIG. 18(a) shows the process when the row storage areas #1, #2, and #3 used for each area S+, St, and Ss are divided, but it is an example of processing in which the row storage areas used for each area are not limited. is also possible, as shown in FIG. 18(b). In FIG. 18(b), ■ is a process of registering the number of digits after digit alignment as the total number of characters M in the line. ■■[Phase] is the process from ■ to 0.
This is an iterative process that is performed repeatedly.

第１８図（ｂ）の■０■においては、ｂ、が文字欠け（
桁合わせ処理において＃の記号を設定したとき）でもリ
ジェクト（？記号で示される）でもないときは、ｂ（１
１をＣ１にする。ｂ′′、′が文字欠けかりジェツトの
ときは■に進む、■■■においてはす、が文字欠けでも
リジェクトでもないときはす、をＣ６にしている。ｂ、
が文字欠けかりジェツトのときは［相］に進む、０■Ｏ
においてはす、が文字欠けでもリジェクトでもないとき
はす、をＣ１にしている。ｂ、が文字欠けかりジェツト
のときはＯに進む、０に進むのは結局す、とす、とす。In ■0■ of Fig. 18(b), b is a missing character (
b(1
1 to C1. When b'', ' is a missing character jet, proceed to ■, and when it is neither a missing character nor a reject, proceed to C6. b,
If is a missing character jet, proceed to [phase], 0■O
In this case, when ``su'' is neither a missing character nor a reject, ``su'' is set to C1. When b is a missing character jet, it advances to O, and it goes to 0 after all.

のいずれもがリジェクトか文字欠けのときであり、この
ときは、リジェクトをＣ５にしている。All of these are rejected or missing characters, and in this case, the reject is set to C5.

なお、第１４図の■の初期化の処理は、実際には■、Ｊ
、、に１Ｐ１、Ｐ！、Ｐ３の変数を０にすればよい。Note that the initialization process of ■ in Fig. 14 is actually performed by ■, J
,, 1P1,P! , P3 should be set to 0.

以上の実施例においては、視野に設定する領域の数が３
である場合を示したが、複数ならば３以外の領域を設定
する実施例も可能である。また、視野に複数の領域を設
定したとき、互いに排他的に（重なり合わないように）
設定する必要はなく、たとえば第１９図のように、一部
分を重なり合わせながら５つの領域を設定する実施例も
可能である。In the above embodiment, the number of areas set in the field of view is 3.
Although the case is shown in which there are a plurality of areas, an embodiment in which areas other than three are set is also possible. Also, when setting multiple areas in the field of view, mutually exclusive (so that they do not overlap)
It is not necessary to set these, and for example, as shown in FIG. 19, an embodiment in which five regions are set while partially overlapping is also possible.

〔Effect of the invention〕

本発明を用いることで、行全体が一度に視野に入らない
ようなスキュー角でも行全体を読み取ることのできる光
学文字読取装置が実現できる。このことは、逆に言えば
、従来技術による光学文字読取装置（第２０図（ａ））
に比べて視野の高さを小さくできることを意味する（第
２０図（ｂ））。By using the present invention, it is possible to realize an optical character reading device that can read an entire line even at a skew angle that prevents the entire line from entering the visual field at once. Conversely, this means that the conventional optical character reading device (Fig. 20(a))
This means that the height of the field of view can be made smaller compared to (Fig. 20(b)).

第２０図（ｂ）の視野高さを持つスキャナでも、本発明
を用いることで、行にスキャナを当てがって上下に動か
せば行全体を読み取ることができる。Even with a scanner having the field of view height shown in FIG. 20(b), by using the present invention, the entire line can be read by applying the scanner to the line and moving it up and down.

視野の高さを小さくできると次の様な効果がある。Reducing the height of the field of view has the following effects.

・イメージセンサに必要な画素数が少なくなるので、よ
り安価なイメージセンサを用いることができ、装置の低
廉化ができる。- Since the number of pixels required for the image sensor is reduced, a cheaper image sensor can be used and the cost of the device can be reduced.

・視野高さが小さくなることで、スキャナのサイズが小
さくなり、スキャナの操作性が増す。- By reducing the field of view height, the size of the scanner becomes smaller and the operability of the scanner increases.

・視野高さが小さくなるので、スキャナ内の照明光源が
原稿を照明するときのむらを少な（しやすい、このため
、従来技術による光学文字読取装置の照明系に比べて、
設計・開発が容易になる。- Because the field of view height is smaller, the illumination light source inside the scanner is less likely to cause unevenness when illuminating the document.
Design and development becomes easier.

このように、本発明のもたらす波及効果は大きい。As described above, the ripple effects brought about by the present invention are large.

[Brief explanation of the drawing]

第１図は本発明を使用した光学文字読取装置の構成例、
第２図は従来技術による光学文字読取装置、第３図は一
文字切り出し処理までの説明図、第４図は一文字切り出
し方法の説明図、第５図は桁合わせ・多数決結果処理の
説明図、第６図、第８図、第９図は視野内の行の動きと
多数決結果の説明図、第７図は値札の読み取り説明図、
第１０図は従来技術の問題点の説明図、第１１図は本発
明の詳細な説明図、第１２図は行検出手段と行入れ換わ
り検出手段の実施例図、第１３図は本発明の実施例図、
第１４図はマイクロブ凸セッサの処理の概略フローチャ
ート、第１５図は定数・変数の説明図、第１６図から第
１８図はそれぞれ、行検出処理、桁合わせ処理、行総合
処理の詳細フローチャート、第１９図は領域の設定の仕
方の説明図、第２０図は本発明を用いた光学文字読取装
置の視野の説明図である。１・・・・・・スキャナ　　　　２・・・・・・手３・
・・・・・原稿　　　　　　４・・・・・・光源５・・
・・・・レンズ系　　　　６・・・・・・イメージセン
サ７・・・・・・制御二値化回路　８・・・・・・画面
メモリ９・・・・・・−桁切り出し回路１０・・・・・
・−桁メモリ１１・・・・・・−文字切り出し回路１２・・・・・・−文字メモリ　１３・・・・・・文字
認識回路１４．１５．１６・・・・・・認識結果バッフ
ァ１７．１７′・・・・・・桁合わせ処理部１８・・・
・・・多数決処理部１９・・・・・・フォーマットチェック部２０．１３０
・・・・・・タイマー２１・・・・・・前回レジスタ　２２・・・・・・比較
器２３・・・・・・出力制御部３０・・・・・・イメージセンサの視野３Ｌ　３２．３
３・・・・・・行検出手段３４．３５．３６・・・・・
・行記憶手段３７・・・・・・桁合わせ手段３８・・・・・・行総合処理手段３９・・・・・・行入れ換わり検出手段１０１．１０６
．１１１．１１６・・・・・・行記憶手段＃ｌの内容１０２．１０７．１１２．１１７・・・・・・行記憶手
段＃２の内容１０３．１０８．１１３．１１８・・・・・・行記憶手
段＃３の内容１０４．１０９．１１４．１１９・・・・・・桁合わせ
結果１０５．１１０．１１５．１２０・・・・・・行総合結
果１２１．１２２．１２３・・・・・・横ＯＲ回路１２
４．１２５．１２６・・・・・・黒長さ判定部１２７・
・・用オアゲート１２８・・・・・・インバータゲート１３１・・・・・・マイクロプロセッサ１３２・・・・
・・ＲＯＭ　　　　１３３・・・・・・ＲＡＭ。特許出瀬人　　住友電気工業株式会社同　代理人　　鎌　　１）　文　　二第３図第５図第４図　　　　　　（ａ）　　￥１２３４５第６図ｎ仏５）口冒−一口（ｂ、）Ｃ１２３４５６７８９０，８
）　　　第７図第８図第９図第１０図 μ７牛第１５図（ａ）第１６図第１８ｖＡ（ａ）FIG. 1 shows an example of the configuration of an optical character reading device using the present invention.
Fig. 2 is an optical character reading device according to the prior art, Fig. 3 is an explanatory diagram of the process up to single character extraction processing, Fig. 4 is an explanatory diagram of the single character extraction method, Fig. 5 is an explanatory diagram of digit alignment/majority result processing, Figures 6, 8, and 9 are illustrations of the movement of lines within the field of view and majority voting results, and Figure 7 is an illustration of price tag reading.
FIG. 10 is an explanatory diagram of the problems of the prior art, FIG. 11 is a detailed explanatory diagram of the present invention, FIG. 12 is a diagram of an embodiment of the line detection means and line interchange detection means, and FIG. 13 is an illustration of the present invention. Example diagram,
Fig. 14 is a schematic flowchart of the microb convex processor processing, Fig. 15 is an explanatory diagram of constants and variables, and Figs. FIG. 19 is an explanatory diagram of how to set the area, and FIG. 20 is an explanatory diagram of the field of view of the optical character reading device using the present invention. 1...Scanner 2...Hand 3.
...Original 4...Light source 5...
... Lens system 6 ... Image sensor 7 ... Control binarization circuit 8 ... Screen memory 9 ... - Digit extraction circuit 10 ... ...
・-Digit memory 11...-Character cutting circuit 12...-Character memory 13...Character recognition circuit 14.15.16...Recognition result buffer 17 .17'... Digit alignment processing section 18...
...Majority processing section 19...Format check section 20.130
... Timer 21 ... Previous register 22 ... Comparator 23 ... Output control section 30 ... Image sensor field of view 3L 32.3
3...Line detection means 34.35.36...
・Line storage means 37...Column alignment means 38...Line comprehensive processing means 39...Line interchange detection means 101.106
．． 111.116... Contents of line storage means #l 102.107.112.117... Contents of line storage means #2 103.108.113.118... Lines Contents of storage means #3 104.109.114.119... Digit alignment result 105.110.115.120... Line total result 121.122.123... Horizontal OR circuit 12
4.125.126... Black length determination section 127.
... OR gate 128 ... Inverter gate 131 ... Microprocessor 132 ...
...ROM 133...RAM. Patent Deseto Sumitomo Electric Industries Co., Ltd. Agent Sickle 1) Text 2 Figure 3 Figure 5 Figure 4 (a) ¥12345 Figure 6 n French 5) Mouth - Mouthful (b,) C1234567890,8
) Figure 7 Figure 8 Figure 9 Figure 10 Figure μ7 Cow Figure 15 (a) Figure 16 Figure 18vA (a)

Claims

[Claims]

(1) In an optical character reading device that reads characters on a document by holding an image sensor housing (scanner) in your hand and applying it to the document, an image of how multiple characters, such as one line of characters or symbols, can be captured within the field of view. A sensor, a character recognition means that recognizes each character on the screen captured by the image sensor and outputs its position, and a system that determines whether a portion of a line exists in each area in multiple areas set in the field of view. A line detection means for detecting a line, a plurality of line storage means for storing recognized lines obtained from the character recognition means when the corresponding line detection means detects a line, and a recognized line stored in each of the line storage means. , a digit alignment processing means for matching characters with each other based on character position information, and a line synthesis means for obtaining a comprehensive line that integrates the recognized lines stored in the line storage means based on the result of the digit alignment processing means. and line transposition detection means for detecting a transposition of lines within the field of view, and when the line detection means detects the presence of a line, the line storage means corresponding to the line detection means is obtained from the character recognition means. The digit alignment processing means associates the characters of the recognized line stored in the line storage means with each other, and obtains a comprehensive line by the line synthesis means.Meanwhile, the line interchange detection means 1. An optical character reading device which erases the contents of a line storage means when detecting a line change.

(2) In the optical character reading device according to claim 1, the line detection means includes a horizontal OR circuit for calculating the logical sum of black pixels in the horizontal direction in a predetermined range provided in the field of view of the image sensor; It consists of a black length determination section that determines whether the calculation results of the OR circuit are continuous for a predetermined range of length in the vertical direction, and the black length determination section detects the calculation results of the horizontal OR circuit that are continuous for a predetermined range of length. An optical character reading device characterized in that it sometimes determines that a line exists.

(3) In the optical character reading device according to claim 1, the line detection means checks the positional information of each character in the recognized line obtained from the character recognition result, and determines whether the character is not unrecognizable and has positional information within a predetermined range. It is a means for determining that a line exists based on the presence of a character, and there are more non-unrecognizable characters in the line than there are non-unrecognizable characters having position information in the range in the recognized line already stored in the line storage means. When the range is obtained,
An optical character reading device characterized in that a newly obtained recognized line is stored in a line storage means.

(4) In the optical character reading device according to claims 1 to 3, the line transposition detection means determines that a line transposition occurs when none of the line detection means detects a line. An optical character reading device featuring:

(5) In the optical character reading device according to claim 1 or 3, the line change detection means measures the time during which no recognized line is obtained when the character recognition means repeatedly performs recognition processing. What is claimed is: 1. An optical character reading device comprising: a timer for determining that a line has been swapped when a recognized line is not obtained for a predetermined period of time;

(6) In the optical character reading device according to claims 1 to 5, the height of the field of view of the image sensor is 1 to 3 times the height of the character.
An optical character reading device characterized in that the positional information of a character used for digit alignment is horizontal positional information within a visual field of the character.

(7) In the optical character reading device according to claims 1 to 5, the height of the field of view of the image sensor is smaller than the line spacing between the character lines written on the document, and the position is used for digit alignment. An optical character reading device characterized in that the information is horizontal position information within a field of view of the character.