JP2514663B2

JP2514663B2 - Optical character reader

Info

Publication number: JP2514663B2
Application number: JP62147340A
Authority: JP
Inventors: 幹雄山口
Original assignee: Sumitomo Electric Industries Ltd
Current assignee: Sumitomo Electric Industries Ltd
Priority date: 1987-06-13
Filing date: 1987-06-13
Publication date: 1996-07-10
Anticipated expiration: 2011-07-10
Also published as: JPS63311491A

Description

DETAILED DESCRIPTION OF THE INVENTION

[Industrial field of application]

The present invention relates to an optical character reader that reads characters, symbols, and the like by scanning a hand-held scanner over a document. (The description below refers only to characters as representative, but it applies equally to symbols.)

[Conventional technology]

POS (Point of Sales) systems, which collect sales information for each individual item and use it for inventory management, have become widespread in supermarkets, department stores, and similar retailers. Hand-held optical character readers are commonly used in these POS systems.

The present applicant has filed Japanese Patent Application Nos. 62-11083 and 62-56293 for devices of this kind. FIG. 2 shows a typical configuration of a hand-held optical character reader.

In FIG. 2, reference numeral 1 denotes a scanner that reads the characters written on a document 3 simply by being held in a hand 2 and placed against the document. The document 3 is, for example, a price tag sheet used in a POS system. Reference numeral 4 denotes a light source, 5 a lens system, and 6 an image sensor. The image sensor needs a field of view covering at least one line of the characters written on the sheet 3; in FIG. 2 the field of view is one line wide and about three character heights tall. Reference numeral 7 denotes a control and binarization circuit, which converts the analog output signal of the image sensor 6 into a binary signal whose two values correspond to the character region and the background region, and sends it to a screen memory 8.

Reference numerals 9 to 13 denote means for recognizing each character in the screen memory 8 and determining the position (x coordinate) of that character within the field of view.

The screen memory 8 stores binarized data for almost the entire field of view of the image sensor 6. FIG. 3(a) illustrates the binarized data of the image sensor 6: the sensor measures p × q pixels (horizontal X × vertical Y), and the characters within the field of view are imaged onto it.

Characters and symbols are recognized by the character identification circuit 13. Because this circuit recognizes one character at a time, the data for a single character must be extracted from the screen memory 8. The column extraction circuit 9 takes from the screen memory 8 a block of m × q pixels, which is the amount of data the character extraction circuit 11 can process, and stores it in the column memory 10. The character extraction circuit 11 then takes from the column memory 10 a block of m × n pixels, which is the amount of data the character identification circuit 13 can process, and stores it in the character memory 12.

Referring to FIG. 3(a), the column extraction circuit 9 first takes the data from X = 1 to X = m and Y = 1 to Y = q out of the screen memory 8 and transfers it to the column memory 10 (FIG. 3(b₁)). The character extraction circuit 11 then examines the contents of the column memory 10 and transfers the n rows that contain the character image (Y = 11 to Y = 11 + n − 1 in this example) to the character memory 12 (FIG. 3(c₁)). If the character memory 12 contains a character, the character identification circuit 13 recognizes it. Next, the data from X = 2 to X = m + 1 and Y = 1 to Y = q is taken out of the screen memory 8 and transferred to the column memory 10 (FIG. 3(b₂)), and the range containing the character image is again transferred to the character memory 12. The same steps are then repeated, shifting the extraction position in the screen memory 8 one step at a time: each window is transferred to the column memory 10, the range containing the character image is transferred to the character memory 12, and the character identification circuit 13 processes it. In this way one full line of characters is recognized.
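The sliding extraction described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the screen memory is modeled as a list of q pixel rows of p bits each (1 = black, 0 = white), indices are 0-based rather than the 1-based X and Y of the text, and the function name `column_windows` is hypothetical.

```python
def column_windows(screen, m):
    """Yield (x, window) pairs, where window is the m-pixel-wide,
    full-height slice of `screen` starting at 0-based column x.
    Each step shifts the window one pixel to the right, mirroring
    X = 1..m, then X = 2..m+1, and so on in the text."""
    width = len(screen[0])
    for x in range(width - m + 1):
        yield x, [row[x:x + m] for row in screen]


# Hypothetical 3 x 5 binarized image (1 = black, 0 = white).
screen = [
    [0, 1, 1, 0, 0],
    [0, 1, 0, 0, 0],
    [0, 1, 1, 0, 0],
]
windows = list(column_windows(screen, m=3))
```

The x value yielded with each window corresponds to the position that is later stored alongside the recognition result.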

FIG. 4 shows how the range to be transferred from the column memory 10 to the character memory 12 is determined. First, the horizontal OR is computed for each row of the column memory 10.

The horizontal OR is an operation that examines one horizontal row of pixels and yields 1 if the row contains a black pixel and 0 if it does not. If the sensor's black output is represented as 1 and its white output as 0, the result is simply the logical OR of all the pixels in the row, hence the name. Where a character is present, the horizontal OR is black over exactly the rows the character occupies, as shown in FIG. 4(b). The range transferred from the column memory to the character memory is chosen to include some white above the character: for example, if the horizontal OR first turns black at Y = 13, the range is the n pixels starting at Y = 11.
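As a minimal sketch of the horizontal OR and the choice of transfer range, assuming the column memory is a list of pixel rows (1 = black, 0 = white) and 0-based row indices: the function names and the `margin` parameter are hypothetical, and the margin of 2 rows is only inferred from the example in the text (black starting at Y = 13, transfer starting at Y = 11).

```python
def horizontal_or(column):
    """Return one bit per pixel row: 1 if the row contains any black pixel."""
    return [1 if any(row) else 0 for row in column]


def char_row_range(column, n, margin=2):
    """Return (first_row, last_row) of the n-row band to transfer,
    or None if the column window contains no black pixel.
    The band starts `margin` white rows above the first black row."""
    profile = horizontal_or(column)
    if 1 not in profile:
        return None
    first_black = profile.index(1)
    start = max(0, first_black - margin)
    return start, start + n - 1


# Hypothetical 24-row, 4-pixel-wide column window with a character
# occupying 0-based rows 12..17 (Y = 13..18 in the 1-based text).
column = [[0, 0, 0, 0] for _ in range(24)]
for y in range(12, 18):
    column[y][1] = 1
band = char_row_range(column, n=10)
```

With this input the band starts at 0-based row 10, which is Y = 11 in the numbering used by the text.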

The processing described above reads the characters and symbols contained in the field of view of the sensor 6. The cycle of scanning the image sensor 6, storing the image of the sheet 3 in the screen memory 8, and recognizing each character in the screen memory 8 is carried out three times. Each character's recognition result and its position within the field of view (the x coordinate at which the column extraction circuit 9 cut the window into the column memory 10 when the character was recognized) are stored in the recognition result buffers #1, #2, and #3, denoted 14, 15, and 16.

The character recognition device of FIG. 2 improves its recognition rate by recognizing each character repeatedly and taking a majority decision over the results. Reference numerals 14 to 18 denote the means that take this majority decision over the recognition results from three captures of the screen. First, the x-coordinate values and recognition results of the characters stored in the recognition result buffers #1 to #3 are sent to the column alignment unit 17. Based on the x-coordinate values, the column alignment unit 17 matches up the recognition results that can be judged to belong to the same column. The majority decision unit 18 then takes a majority decision among the matched recognition results to obtain the final recognition result for that column.

FIG. 5 shows an example of the majority decision. (a) is the line written on the document, and (b₁), (b₂), and (b₃) are the recognition results of the first, second, and third recognition passes. The character "1" is missing in (b₁), "2" is missing in (b₂), and "3" is missing in (b₃). Because the results in each column are matched using the characters' x coordinates within the field of view before the majority decision is taken, the correct answer is obtained, as shown in (b₄). If instead the recognized characters were simply matched in order from the first character, without using their x coordinates, the majority decision would not give the correct answer, as shown in (c).
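The alignment and majority decision can be illustrated with the following Python sketch. It is an assumption-laden model, not the circuit of the patent: each pass is a list of `(x, char)` pairs, two results are treated as the same column when their x coordinates differ by at most a hypothetical `tolerance`, and a reject (`"?"`) loses to any recognized character. The sample line "123" and its x coordinates are invented for illustration and are not taken from FIG. 5.

```python
from collections import Counter

REJECT = "?"


def align_by_x(passes, tolerance=3):
    """Group (x, char) results from several passes into columns by x proximity."""
    columns = []  # each entry: {"x": representative x, "votes": [chars]}
    for results in passes:
        for x, ch in results:
            for col in columns:
                if abs(col["x"] - x) <= tolerance:
                    col["votes"].append(ch)
                    break
            else:
                columns.append({"x": x, "votes": [ch]})
    return sorted(columns, key=lambda c: c["x"])


def majority(votes):
    """Most frequent recognized character; rejects only win if nothing else exists."""
    recognized = [v for v in votes if v != REJECT]
    if not recognized:
        return REJECT
    return Counter(recognized).most_common(1)[0][0]


def vote_line(passes, tolerance=3):
    return "".join(majority(c["votes"]) for c in align_by_x(passes, tolerance))


# Three passes over a hypothetical line "123"; each pass misses one character.
passes = [
    [(30, "2"), (50, "3")],   # "1" missing
    [(10, "1"), (50, "3")],   # "2" missing
    [(10, "1"), (30, "2")],   # "3" missing
]
line = vote_line(passes)
```

Matching by position rather than by order is what lets the missing character in each pass be filled in from the other two.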

Reference numerals 19 to 23 in FIG. 2 denote the means that output the correctly read recognition result for a given line of the document exactly once. As long as a line remains in the field of view, its characters are recognized repeatedly and the majority decision unit 18 outputs a majority result each time. FIG. 6 shows how the field of view moves, and how the majority result changes, when the scanner 1 is placed over the line "C1234567890" and moved from top to bottom. At position (a₁), the recognition result (b₁) consists entirely of rejects (unrecognizable characters, shown as "?"). The recognition result (b₂) at position (a₂) likewise consists entirely of rejects. At position (a₃), only the character "0" has entered the field of view and been recognized. The majority result over (b₁) to (b₃) is as shown in (c₁). For the character "0", the recognition result in (b₃) takes precedence over the rejects in (b₁) and (b₂); that is, a recognized character is given more weight than a reject. As the scanner is moved further, the majority result of the recognition results (b₄), (b₅), and (b₆) at positions (a₄), (a₅), and (a₆) is as shown in (c₂). Similarly, the majority result of the recognition results (b₇), (b₈), and (b₉) at positions (a₇), (a₈), and (a₉) is as shown in (c₃).

The majority decision unit 18 outputs (c₁), (c₂), and (c₃) in succession. The format check unit 19 determines whether each majority result from 18 satisfies a predetermined format (for example, a line beginning with C must have 10 digits following the C). The timer 20 measures the time elapsed since a majority result was last obtained from 18.

When a majority result R_i that satisfies the predetermined format is obtained, the previous-result register 21, the comparator 22, and the output control unit 23 operate as follows. First, the comparator 22 compares R_i with the content R_i-1 stored in the previous-result register 21. If R_i and R_i-1 differ, the comparator 22 issues the NEW signal and the output control unit 23 outputs R_i as the recognition result R_LINE for that line. If R_i and R_i-1 match, the comparator 22 does not issue the NEW signal and the output control unit 23 does not output R_i (it is discarded). After sending R_i-1 to the comparator 22, the previous-result register 21 stores R_i.

The timer 20 measures the time since a majority result was last obtained from 18 and erases the content of the previous-result register 21 once a predetermined time T_CLR has elapsed. Immediately after power-on, the previous-result register is in the erased state. T_CLR is set shorter than the time needed to switch from one price tag to the next (for example, 1 second), for example to about 0.6 seconds. When a line is in the field of view and majority results are obtained repeatedly from 18, the repetition period is, for example, about 0.2 seconds.
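The interplay of the format check (19), timer (20), previous-result register (21), comparator (22), and output control (23) can be modeled roughly as below. This is a simplified software sketch, not the hardware described: the class and method names are hypothetical, formats are expressed as regular expressions (an assumption), and the timer is evaluated lazily when the next majority result arrives rather than running continuously.

```python
import re


class LineOutputController:
    def __init__(self, formats, t_clr=0.6):
        self.formats = [re.compile(f) for f in formats]
        self.t_clr = t_clr        # T_CLR in seconds
        self.previous = None      # previous-result register; None = erased
        self.last_time = None     # time of the last majority result from 18

    def feed(self, result, now):
        """Process one majority result arriving at time `now` (seconds).
        Return the line to output as R_LINE, or None if nothing is output."""
        # Timer: erase the register if no result arrived for T_CLR or longer.
        if self.last_time is not None and now - self.last_time >= self.t_clr:
            self.previous = None
        self.last_time = now
        # Format check: results that match no registered format are dropped.
        if not any(f.fullmatch(result) for f in self.formats):
            return None
        # Comparator: NEW is raised only when the result differs from R_i-1.
        is_new = result != self.previous
        self.previous = result
        return result if is_new else None


ctrl = LineOutputController([r"C\d{10}", r"N\d{10}"])
outputs = [
    ctrl.feed("??????????0", 0.0),   # fails the format check
    ctrl.feed("C1234567890", 0.2),   # first valid result: output
    ctrl.feed("C1234567890", 0.4),   # same line again: suppressed
    ctrl.feed("N1234567890", 0.6),   # different line: output
]
```

Feeding the same string again after a gap of at least `t_clr` produces output once more, which corresponds to reading an identical line on a different price tag.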

The operation of 19 to 23 when reading a price tag is described with reference to FIGS. 6 and 7. The majority result (c₁), "??????????0", does not satisfy the predetermined format, so the format check unit 19 outputs nothing. When the scanner is moved downward and the majority result (c₂), "C1234567890", is obtained, it satisfies the predetermined format, so the format check unit 19 outputs it as R_i. The comparator 22 compares R_i with the content R_i-1 of the previous-result register; immediately after power-on the register is erased, so R_i and R_i-1 necessarily differ. The comparator 22 therefore sends the NEW signal to the output control unit 23, which outputs "C1234567890" as the line recognition result R_LINE. Meanwhile, "C1234567890" is stored in the previous-result register 21. Next, the majority result (c₃), "C1234567890", is obtained and, as with (c₂), passes through the format check unit to the comparator 22. This time, however, the content R_i-1 of the previous-result register 21 is "C1234567890" and matches R_i, so the comparator no longer issues the NEW signal and nothing is output as R_LINE from the output control unit 23. In this way, the line recognition result R_LINE is output only once for each line that enters the field of view.

The operation when the scanner is moved to read several lines is described with reference to FIG. 7. Assume that the formats of lines beginning with C, N, and ¥ are registered in the format check unit 19. First, when the scanner is placed over the line "C1234567890" on price tag (a), the line is output once as the line recognition result R_LINE, as just described. Next, the scanner is moved down to the line "N1234567890". When that line enters the field of view and the majority decision unit 18 first produces the recognition result "N1234567890", the content R_i-1 of the previous-result register is "C1234567890", so the comparator 22 issues the NEW signal and "N1234567890" is output as R_LINE. After that, even though the majority decision unit 18 outputs "N1234567890" repeatedly, it matches the content of the previous-result register and is not output as R_LINE. That is, "N1234567890" is output only once. Likewise, when the line "¥123,456." enters the field of view, "¥123,456." is output as R_LINE. In POS price tags conforming to JIS B9551, every line within one price tag has different content. Comparing the previously recognized result R_i-1 with the current result R_i in the comparator 22 is therefore enough to determine whether the same line has been read again.

While the price tag is being switched from (a) to (b), there are no characters in the field of view, and the majority decision unit 18 produces no output. The timer 20 measures the time elapsed since the majority decision unit 18 last output a recognition result; if, because the price tag is being switched, no output is obtained from the character recognition means for T_CLR or longer, the content of the previous-result register 21 is erased. Consequently, when the tag is switched to (b) and the scanner is placed over the line "¥123,456.", "¥123,456." is output. In other words, lines with the same content can be read in succession as long as they are on different price tags. As this description shows, the timer 20 serves to detect a change of document (price tag).

[Problems to be solved by the invention]

FIG. 8 shows the scanner 1 placed over a line at an angle (with skew). When the skew angle θ is small and there is a moment, as in (a₂), at which the entire line is within the field of view, the majority result (c) of the recognition results (b₁), (b₂), and (b₃) at (a₁), (a₂), and (a₃) correctly gives the recognition result for the line.

FIG. 9 shows the situation with a larger skew angle θ. The skew angle is assumed not to be so large that an individual character lying entirely within the field of view can no longer be recognized. With the larger skew angle there is no moment at which the entire line is within the field of view, but because the right end of the line enters the field of view at (a₁) and the left end enters at (a₃), the majority result (c) still gives the correct recognition result for the line.

FIG. 10 shows the movement of the field of view and the recognition results when the scanner is moved slowly from top to bottom with the skew angle θ still large. Because the skew angle is large, there is no moment at which the entire line is within the field of view. Moreover, because the scanner moves slowly, no majority decision over adjacent recognition results (b_i), (b_i+1), (b_i+2) (i being an integer from 1 to 8) yields a correct recognition result for the whole line.

As the description of FIG. 10 shows, the conventional optical character reader cannot obtain the recognition result for a whole line when the scanner is moved slowly across the line at a skew angle so large that the whole line never fits within the field of view at once. The present invention was devised to eliminate this drawback, and its object is to enable an optical character reader to recognize a whole line even at such a skew angle.

[Structure of Invention]

FIG. 1 shows an example configuration of an optical character reader embodying the present invention. The parts numbered 1 to 23 have the same functions and configurations as the identically numbered parts of the conventional optical character reader in FIG. 2. The column alignment unit 17′, however, outputs the x-coordinate values of the aligned characters in addition to performing the functions of the column alignment unit 17 of FIG. 2. The x coordinate of an aligned character is taken to be, for example, the average of the x-coordinate values of the characters judged to belong to the same column. Reference numerals 8 to 18 together constitute the character recognition means, which recognizes each character in the image captured by the image sensor and outputs its position. In the description below, the set of recognition results and position information for the characters in the field of view, as obtained from the character recognition means, is called a recognized line.

Reference numerals 31, 32, and 33 denote line detection means, each of which detects whether part of a line is present in its own region when the field of view is divided into several regions. FIG. 1 illustrates the case of three line detection means, but any number of two or more is possible. Reference numerals 34, 35, and 36 denote line storage means, each of which stores the recognized line when its corresponding line detection means detects a line. Reference numeral 37 denotes a column alignment means, which matches up the characters of the recognized lines stored in the line storage means on the basis of their position information. Reference numeral 38 denotes a line integration means, which combines the recognized lines whose characters have been matched by the column alignment means into a single line. In the description below, the line obtained by the line integration means is called an integrated line. Reference numeral 39 denotes a line change detection means, which detects that the line captured by the image sensor 6 has changed to a different one.

[Action]

When the scanner 1 is placed over a new line, the contents of the line storage means 34, 35, and 36 have already been erased by the line change detection means 39. The operation of the line detection means, line storage means, column alignment means, and line integration means is described below using the example of FIG. 11.

Suppose that the field of view 30 is divided into three regions S₁, S₂, and S₃, as shown in (a), and that line detection means #1, #2, and #3 each determine whether a line is present in the corresponding region. When the line shown in FIG. 11(b) is read with the scanner tilted, the positional relationship between the field of view and the line changes successively as in (c₁), (c₂), and (c₃), and the recognized line changes as in (d₁), (d₂), and (d₃). Part (e) shows the contents of the line storage means and the results of line integration as the recognized lines (d₁), (d₂), and (d₃) are obtained in turn.

Before the scanner is placed over the line, the contents of line storage means #1, #2, and #3 are in the erased state because of the line change detection means 39. Stage 1 shows what happens when the line enters region S₃ of the field of view, as in (c₁), and the recognized line (d₁) is obtained. Line detection means #3 detects that the line has entered region S₃, and the recognized line is stored in line storage means #3. The contents 101, 102, and 103 of the line storage means are aligned by the column alignment means 37 (104). At stage 1, 101 and 102 are in the erased state, so the integrated line 105 produced by line integration is identical to 103. The characters of the integrated line 105 are then sent to the format check unit 19. Suppose that the format registered in the format check unit, corresponding to FIG. 11(b), is that the character "V" is followed by 12 digits. The integrated line 105 does not match this format, so the format check unit outputs nothing.

Next, stage 2 shows what happens when the line enters region S₂ of the field of view, as in FIG. 11(c₂), and the recognized line (d₂) is obtained. Line detection means #2 detects that the line has entered region S₂, and the recognized line is stored in line storage means #2. The contents of line storage means #1 and #3 remain as they were at stage 1. The contents 106, 107, and 108 of the line storage means are aligned by the column alignment means 37 (109). In line integration, the character that was recognized in each column is selected to form the integrated line 110. The characters of the integrated line 110 are then sent to the format check unit 19. The integrated line 110 does not match the predetermined format, so the format check unit outputs nothing.

Next, stage 3 shows what happens when the line enters region S₁ of the field of view, as in FIG. 11(c₃), and the recognized line (d₃) is obtained. Line detection means #1 detects that the line has entered region S₁, and the recognized line is stored in line storage means #1. As in stages 1 and 2, the contents 111, 112, and 113 are aligned by the column alignment means 37 (114). In line integration, the character that was recognized in each column is selected to form the integrated line 115. The integrated line at stage 3 contains the whole line, which is then output via the format check unit 19 and the output control unit 23 (R_LINE in FIG. 1).

With the processing described above, the recognition result for a whole line can be obtained even when the whole line never fits within the field of view at once as the scanner is moved across it. When the scanner is moved further to read the next line, the line change detection means 39 detects the change of line and erases the contents of line storage means #1, #2, and #3.

When the whole line enters the field of view at once, as in (f), and the recognized line (g) is obtained, operation proceeds as in (h). Line detection means #1, #2, and #3 each detect that the line has entered regions S₁, S₂, and S₃, so the recognized line (g) is stored in each of line storage means #1, #2, and #3 (116, 117, 118). The whole line is then obtained as the result 120 of column alignment and line integration.
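The three-stage example of FIG. 11 can be mimicked with a short Python sketch of the alignment and integration steps. Everything here is an illustrative assumption: the three storage means are modeled as a list whose entries are either `None` (erased) or a list of `(x, char)` pairs, characters are matched when their x coordinates differ by at most a hypothetical `tolerance`, and the sample line, its x spacing, and the portion visible at each stage are invented.

```python
REJECT = "?"


def integrate(line_memories, tolerance=3):
    """Merge the stored recognized lines into one integrated line by
    matching characters on x and keeping a recognized character per column."""
    columns = []  # each entry: [x, char]
    for line in line_memories:
        if line is None:          # erased storage contributes nothing
            continue
        for x, ch in line:
            for col in columns:
                if abs(col[0] - x) <= tolerance:
                    if col[1] == REJECT:
                        col[1] = ch   # a recognized character beats a reject
                    break
            else:
                columns.append([x, ch])
    columns.sort(key=lambda c: c[0])
    return "".join(ch for _, ch in columns)


TEXT = "V123456789012"  # hypothetical line: "V" followed by 12 digits


def part(lo, hi):
    """Characters lo..hi-1 of TEXT, spaced 10 pixels apart along x."""
    return [(10 * i, TEXT[i]) for i in range(lo, hi)]


memories = [None, None, None]      # storage #1, #2, #3, all erased
memories[2] = part(9, 13)          # stage 1: line seen in region S3
stage1 = integrate(memories)
memories[1] = part(4, 10)          # stage 2: line seen in region S2
stage2 = integrate(memories)
memories[0] = part(0, 6)           # stage 3: line seen in region S1
stage3 = integrate(memories)
```

Only the stage 3 result has the form "V" plus 12 digits, so only it would pass a format check of the kind described in the text.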

[Example]

FIG. 12 shows an embodiment of line detection means #1, #2, and #3 and of the line change detection means. (a) shows the field of view 30 of the image sensor, which is divided into regions S₁, S₂, and S₃. In each region, a range in which a line is to be detected is set, as illustrated by R₁, R₂, and R₃. Each line detection means detects that a line is present within its range. (b) is a circuit that implements the line detection means and the line change detection means. Reference numerals 121, 122, and 123 denote circuits that perform a horizontal OR operation over the ranges R₁, R₂, and R₃, respectively. Reference numerals 124, 125, and 126 denote black length determination units, which determine whether the length of consecutive black pixels in the horizontal OR result falls within a predetermined range. When a character line lies within range R_i (i = 1, 2, 3), the horizontal OR result contains a run of black pixels as long as the character height, so the black length determination unit can decide whether characters are present in that region, and the result is output on the EXIST_i signal line. The EXIST_i signal is sent to the corresponding line storage means and gives each line storage means the timing for storing the recognized line. Reference numeral 127 denotes an OR gate that takes the logical OR of EXIST₁, EXIST₂, and EXIST₃. Its output EXIST is true if characters are present in any of R₁, R₂, and R₃. To detect a line change, it suffices to detect that the line has left the field of view, that is, that EXIST has become false. Reference numeral 128 denotes an inverter gate that inverts EXIST to produce the line change signal ALLCLR. The ALLCLR signal is sent to all line storage means and gives the timing for erasing their contents.
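The logic of the circuit in FIG. 12(b) can be mimicked in software. The following is a minimal sketch, not the circuit itself. It assumes that the binarized image is a list of pixel rows (1 = black, 0 = background), that each range R_i is a span of columns, and that the black length determination looks at the longest vertical run in the horizontal OR result. The function names and the `min_len` / `max_len` parameters are hypothetical.

```python
from typing import List, Sequence, Tuple

Image = List[List[int]]  # binarized image, rows from top to bottom; 1 = black pixel


def horizontal_or(image: Image, x_start: int, x_end: int) -> List[int]:
    """For each pixel row, OR together the pixels in columns [x_start, x_end)."""
    return [1 if any(row[x_start:x_end]) else 0 for row in image]


def longest_black_run(profile: Sequence[int]) -> int:
    """Length of the longest run of consecutive 1s in the horizontal OR result."""
    longest = current = 0
    for value in profile:
        current = current + 1 if value else 0
        longest = max(longest, current)
    return longest


def line_exists(image: Image, x_start: int, x_end: int,
                min_len: int, max_len: int) -> bool:
    """EXIST_i: true when the black run length lies in the predetermined range.

    Using the longest run is an assumption made for this sketch.
    """
    run = longest_black_run(horizontal_or(image, x_start, x_end))
    return min_len <= run <= max_len


def detect(image: Image, ranges: Sequence[Tuple[int, int]],
           min_len: int, max_len: int) -> Tuple[List[bool], bool]:
    """Return ([EXIST_1, EXIST_2, ...], ALLCLR)."""
    exist = [line_exists(image, a, b, min_len, max_len) for a, b in ranges]
    exist_any = any(exist)   # role of OR gate 127
    allclr = not exist_any   # role of inverter gate 128
    return exist, allclr
```

In this sketch, `ALLCLR` becomes true only when none of the ranges contains a run of the expected height, which corresponds to the line having left the field of view.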

When no line is in the field of view, no recognized line is obtained from the majority processing unit 18. An embodiment is therefore also possible in which the absence of a recognized line is detected and treated as a line change. FIG. 13 shows an embodiment that measures the time during which no recognized line is obtained and determines that a line change has occurred when that time exceeds a predetermined time (T_L). In FIG. 13, the timer 130 outputs the T_LUP signal when no recognized line has been obtained for T_L or longer. T_L is set slightly longer (for example, 0.25 seconds) than the period (for example, 0.2 seconds) at which recognized lines are repeatedly obtained while a line is in the field of view.
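The timer-based line change detection can be sketched as follows. This is an illustrative model only: the class name, its methods, and the use of explicit timestamps in seconds are assumptions made here, and the 0.25-second default simply reuses the example value given for T_L.

```python
class LineChangeTimer:
    """Models timer 130: reports T_LUP when no recognized line arrives for T_L."""

    def __init__(self, t_l: float = 0.25, start: float = 0.0) -> None:
        self.t_l = t_l            # predetermined time T_L in seconds
        self.last_line_at = start  # time of the most recent recognized line

    def line_recognized(self, now: float) -> None:
        """Call whenever a recognized line is obtained; restarts the measurement."""
        self.last_line_at = now

    def t_lup(self, now: float) -> bool:
        """True when no recognized line has been obtained for T_L or longer."""
        return now - self.last_line_at >= self.t_l
```

With recognized lines arriving every 0.2 seconds, `t_lup` stays false. It becomes true only after a recognition cycle is skipped, which is why T_L is chosen slightly longer than the recognition period.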

In FIG. 13, the parts labeled 1 through 23 have the same functions and configuration as the parts with the same reference numerals in FIG. 1. Reference numeral 6 denotes the image sensor. As in FIG. 2, it needs a field of view covering at least one line of the characters written on the paper 3, and in FIG. 13 the field of view is one line wide and about three character heights tall. When the document contains multiple lines, complex processing is required if more than one line enters the field of view at once. To keep the processing simple, the height of the image sensor's field of view should therefore be smaller than the line spacing on the document.

In FIG. 13, line detection means #1 to #3, the line storage means, the digit alignment processing means, and the line integration processing means are implemented with the microprocessor 131, the ROM 132, and the RAM 133. The ROM 132 stores the program of the microprocessor 131 that implements the line detection means, the digit alignment processing means, and the line integration processing means. The line storage means 34 is implemented as a variable area set up in the RAM 133.

FIG. 14 shows a schematic flowchart of the processing performed by the microprocessor 131. Processing starts when the optical character reader is powered on. An initialization step erases the contents of the line storage areas provided in RAM. A subsequent step checks whether the timer 130 is outputting the T_LUP signal; when it is, that is, when a line change has occurred, processing branches to handle the line change. Another step checks whether a recognized line has been obtained, and the next step reads in the recognized line. The steps that follow implement the line detection means: with the field of view divided into several regions, they detect that a line has entered each region and store the recognized line in the line storage area corresponding to that region. The flowchart of FIG. 14 covers the case in which the field of view is divided into three regions S₁, S₂, and S₃ (FIG. 15(a)). For region S₁, one step checks whether the number of recognized (non-rejected) characters of the recognized line that fall within S₁ exceeds the number of recognized characters within S₁ in the contents already stored in line storage area #1, and a further step stores the recognized line in line storage area #1, which corresponds to S₁. The same processing is carried out for regions S₂ and S₃. The digit alignment step then matches corresponding characters across the contents of line storage areas #1 to #3 on the basis of the position information of the recognized characters. The integration step uses the digit alignment result to produce the integrated line, and a final step sends the integrated line to the format check unit 19.

FIGS. 16 to 18 show detailed flowcharts of the processing in FIG. 14. The following description uses the variables and constants shown in FIG. 15. As in FIG. 15(a), the x-coordinate ranges of regions S₁, S₂, and S₃ are X₁ to X₂, X₂ to X₃, and X₃ to X₄, respectively. As in the table of (b), the recognized line has N characters, whose characters and positions are written a⁽⁰⁾_n and x⁽⁰⁾_n. The contents stored in line storage area #1 have I characters, whose characters and positions are written a⁽¹⁾_i and x⁽¹⁾_i (i = 1, 2, ..., I), and the number of recognized (non-rejected) characters falling within region S₁ is written P₁. Likewise, for line storage areas #2 and #3 the symbols J, a⁽²⁾_j, x⁽²⁾_j, P₂ and K, a⁽³⁾_k, x⁽³⁾_k, P₃ are used. The number of digits after digit alignment is L, the position of each digit is y_l, and the combination of characters at each digit is b⁽¹⁾_l, b⁽²⁾_l, b⁽³⁾_l. The integrated line has M characters, written c_m.

FIG. 16 shows a detailed flowchart of the line detection processing of FIG. 14. A variable p, which counts the recognized characters of the recognized line that fall within region S₁, is first initialized. A loop then repeats the following for the characters of the recognized line: it checks whether the character a⁽⁰⁾_n being processed falls within region S₁, and increments p by 1 when a⁽⁰⁾_n is not a reject (denoted by the symbol ?). Next, p is compared with P₁ to determine whether it is larger. These steps together are the detailed form of the corresponding comparison step in FIG. 14. When p is larger, p is registered as the new P₁, N is registered as I, the number of characters stored in line storage area #1, and the recognized line is stored in line storage area #1.
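The update rule for one line storage area can be sketched as below. This is a minimal illustration under stated assumptions: a recognized line is modeled as a list of (character, x position) pairs, a region is the half-open interval from `x_lo` to `x_hi`, and the names `LineStorage`, `count_recognized`, and `update_storage` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

REJECT = "?"              # symbol for a rejected (unrecognized) character
Char = Tuple[str, float]  # (character, x position)


@dataclass
class LineStorage:
    """One line storage area together with its region and its count P_i."""
    x_lo: float
    x_hi: float
    chars: List[Char] = field(default_factory=list)
    p: int = 0  # recognized characters of the stored line inside the region


def count_recognized(line: List[Char], x_lo: float, x_hi: float) -> int:
    """Count non-rejected characters whose position falls inside the region."""
    return sum(1 for ch, x in line if x_lo <= x < x_hi and ch != REJECT)


def update_storage(storage: LineStorage, line: List[Char]) -> bool:
    """Store the new recognized line only if it has more recognized characters
    in the region than the line stored so far. Returns True when stored."""
    p = count_recognized(line, storage.x_lo, storage.x_hi)
    if p > storage.p:
        storage.p = p
        storage.chars = list(line)
        return True
    return False
```

Because a stored line is replaced only by a strictly better one, each storage area ends up holding the best reading of its own region seen since the last line change.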

FIG. 17 shows a detailed flowchart of the digit alignment processing of FIG. 14. The flowchart begins with initialization. A further step marks the end of the contents stored in each line storage area in advance with a special large value X_M. X_M is chosen to be larger than X₄ of FIG. 15(a) plus the value W described below. The next step finds Xmin, the minimum of the coordinates currently under consideration: x⁽¹⁾_i of the i-th character in line storage area #1, x⁽²⁾_j of the j-th character in line storage area #2, and x⁽³⁾_k of the k-th character in line storage area #3. If Xmin equals X_M, digit alignment is judged to be complete for all characters. W is the width of x coordinates within which characters can be judged to belong to the same digit. If the condition on W holds, x⁽¹⁾_i falls within the digit currently being processed, and two steps are carried out: the i-th character a⁽¹⁾_i of line storage area #1 is registered as the aligned character b⁽¹⁾_l, and i is incremented because the i-th character has been processed. If x⁽¹⁾_i does not fall within the current digit, the symbol #, meaning that nothing was obtained from line storage area #1 at that digit (a missing character), is registered in b⁽¹⁾_l (# has the same meaning as the blank between characters in FIG. 5(b)). Line storage areas #2 and #3 are processed in the same way. Xmin is then registered in y_l as the position of the aligned digit, and l, the count of aligned digits, is incremented by 1. Finally, the number of digits obtained, l − 1, is registered in the variable L.
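The digit alignment loop can be sketched as a merge of the stored lines by x coordinate. The sketch below is not the flowchart itself. It assumes that each stored line is a list of (character, x position) pairs sorted by x, and that a character belongs to the current digit when `x - xmin < w`; that inequality is an assumption, since the exact condition is not reproduced in the text. The function name `align_digits` is hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

MISSING = "#"  # symbol for "no character from this storage area at this digit"
Char = Tuple[Optional[str], float]


def align_digits(stores: Sequence[Sequence[Char]], w: float,
                 x_m: float) -> Tuple[List[float], List[List[str]]]:
    """Return (digit positions y_l, per-digit character combinations b_l)."""
    # Append the sentinel X_M to mark the end of each stored line.
    seqs = [list(s) + [(None, x_m)] for s in stores]
    idx = [0] * len(seqs)  # plays the role of the indices i, j, k
    positions: List[float] = []
    columns: List[List[str]] = []
    while True:
        xmin = min(seq[i][1] for seq, i in zip(seqs, idx))
        if xmin == x_m:  # every stored line is exhausted
            break
        column = []
        for n, seq in enumerate(seqs):
            ch, x = seq[idx[n]]
            if x - xmin < w:  # same digit (assumed form of the condition on W)
                column.append(ch)
                idx[n] += 1
            else:
                column.append(MISSING)
        positions.append(xmin)
        columns.append(column)
    return positions, columns
```

Choosing X_M larger than X₄ + W, as the description requires, guarantees in this sketch that the sentinel is never mistaken for a character of the current digit.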

FIG. 18(a) is a detailed flowchart (part 1) of the integration processing of FIG. 14. A variable m, which counts the characters of the integrated line, is first initialized. A loop then repeats the following steps once for each of the L aligned digits. The coordinate y_l of the l-th digit under consideration is tested to determine whether it falls within region S₁. If it does, and the aligned character b⁽¹⁾_l from line storage area #1 is not # (that is, not a missing character), it is registered as character c_m of the integrated line. Otherwise y_l is tested against region S₂; if it falls within S₂ and the aligned character b⁽²⁾_l from line storage area #2 is not #, that character is registered as c_m. The remaining case is that y_l falls within region S₃, and then the aligned character b⁽³⁾_l from line storage area #3 is registered as c_m if it is not #. Finally, the number of characters of the integrated line, m − 1, is registered in the variable M.
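The region-based integration of FIG. 18(a) can be sketched as follows. This is an illustration only: it assumes the aligned result is given as a list of digit positions and a parallel list of three-character combinations, and that the region test compares y_l with the boundaries X₂ and X₃ in turn. The function name and parameter names are hypothetical.

```python
from typing import List, Sequence

MISSING = "#"  # missing-character symbol set during digit alignment


def integrate_by_region(positions: Sequence[float],
                        columns: Sequence[Sequence[str]],
                        x2: float, x3: float) -> str:
    """For each digit, take the character from the storage area whose region
    (S1, S2, or S3) contains the digit position; skip missing characters."""
    result: List[str] = []
    for y, column in zip(positions, columns):
        if y < x2:
            k = 0  # digit lies in S1, use line storage area #1
        elif y < x3:
            k = 1  # digit lies in S2, use line storage area #2
        else:
            k = 2  # digit lies in S3, use line storage area #3
        if column[k] != MISSING:
            result.append(column[k])
    return "".join(result)
```

The idea behind this variant is that each storage area is trusted only inside its own region, where its stored line was selected for having the most recognized characters.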

FIG. 18(a) describes processing in which line storage areas #1, #2, and #3 are assigned separately to regions S₁, S₂, and S₃, but a processing example that does not tie a line storage area to each region is also possible and is shown in FIG. 18(b). In FIG. 18(b), the number of aligned digits L is first registered as the number of characters M of the integrated line. A loop then repeats the subsequent steps L times.

In FIG. 18(b), when b⁽¹⁾_l is neither a missing character (the # symbol set during digit alignment) nor a reject (denoted by ?), b⁽¹⁾_l is taken as c_l. When b⁽¹⁾_l is a missing character or a reject, b⁽²⁾_l is examined next: if it is neither a missing character nor a reject, b⁽²⁾_l is taken as c_l. When b⁽²⁾_l is a missing character or a reject, b⁽³⁾_l is examined: if it is neither a missing character nor a reject, b⁽³⁾_l is taken as c_l. The last branch is reached only when b⁽¹⁾_l, b⁽²⁾_l, and b⁽³⁾_l are all rejects or missing characters, and in that case c_l is set to reject.
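The fallback rule of FIG. 18(b) amounts to taking, for each digit, the first candidate that is neither missing nor rejected. A minimal sketch, assuming the aligned result is a list of per-digit character combinations and using the hypothetical function name `integrate_first_valid`:

```python
from typing import List, Sequence

MISSING = "#"  # missing-character symbol set during digit alignment
REJECT = "?"   # symbol for a rejected (unrecognized) character


def integrate_first_valid(columns: Sequence[Sequence[str]]) -> str:
    """For each digit, use the first of b1, b2, b3 that is a real character;
    if all are missing or rejected, the digit becomes a reject."""
    result: List[str] = []
    for column in columns:
        chosen = next((b for b in column if b not in (MISSING, REJECT)), REJECT)
        result.append(chosen)
    return "".join(result)
```

Unlike the region-based variant, this rule always outputs exactly one character per aligned digit, so the integrated line has M = L characters.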

Note that, in practice, the initialization processing of FIG. 14 only needs to set the variables I, J, K, P₁, P₂, and P₃ to 0.

The embodiments above use three regions in the field of view, but embodiments with any other number of regions are possible as long as there is more than one. Furthermore, the regions set in the field of view need not be mutually exclusive (non-overlapping); for example, an embodiment that sets five partially overlapping regions, as in FIG. 19, is also possible.

[Effect of the invention]

The present invention makes it possible to realize an optical character reader that can read an entire line even at a skew angle at which the whole line does not fit in the field of view at once. Put another way, the height of the field of view can be made smaller (FIG. 20(b)) than in an optical character reader of the prior art (FIG. 20(a)). Even with a scanner having the field-of-view height of FIG. 20(b), the present invention allows the entire line to be read by placing the scanner on the line and moving it up and down. Reducing the height of the field of view has the following effects.

- Fewer pixels are required of the image sensor, so a cheaper image sensor can be used and the cost of the device can be reduced.

- The smaller field-of-view height makes the scanner smaller and easier to handle.

- With a smaller field-of-view height, it is easier to reduce unevenness in the illumination that the light source inside the scanner casts on the document. The illumination system is therefore easier to design and develop than that of an optical character reader of the prior art.

Thus, the present invention has wide-ranging benefits.

[Brief description of drawings]

FIG. 1 is a configuration example of an optical character reader using the present invention; FIG. 2 shows an optical character reader of the prior art; FIG. 3 illustrates the processing up to single-character cutout; FIG. 4 illustrates the single-character cutout method; FIG. 5 illustrates digit alignment and majority result processing; FIGS. 6, 8, and 9 illustrate the movement of a line in the field of view and the majority results; FIG. 7 illustrates the reading of a price tag; FIG. 10 illustrates the problems of the prior art; FIG. 11 illustrates the operation of the present invention; FIG. 12 shows an embodiment of the line detection means and the line change detection means; FIG. 13 shows an embodiment of the present invention; FIG. 14 is a schematic flowchart of the processing of the microprocessor; FIG. 15 explains the constants and variables; FIGS. 16 to 18 are detailed flowcharts of the line detection processing, the digit alignment processing, and the line integration processing, respectively; FIG. 19 illustrates how regions are set; and FIG. 20 illustrates the field of view of an optical character reader using the present invention.

1: scanner; 2: hand; 3: document; 4: light source; 5: lens system; 6: image sensor; 7: control/binarization circuit; 8: screen memory; 9: single-digit cutout circuit; 10: single-digit memory; 11: single-character cutout circuit; 12: single-character memory; 13: character recognition circuit; 14, 15, 16: recognition result buffers; 17, 17′: digit alignment processing unit; 18: majority processing unit; 19: format check unit; 20, 130: timer; 21: previous-result register; 22: comparator; 23: output control unit; 30: field of view of the image sensor; 31, 32, 33: line detection means; 34, 35, 36: line storage means; 37: digit alignment means; 38: line integration processing means; 39: line change detection means; 101, 106, 111, 116: contents of line storage means #1; 102, 107, 112, 117: contents of line storage means #2; 103, 108, 113, 118: contents of line storage means #3; 104, 109, 114, 119: digit alignment results; 105, 110, 115, 120: line integration results; 121, 122, 123: horizontal OR circuits; 124, 125, 126: black length determination units; 127: OR gate; 128: inverter gate; 131: microprocessor; 132: ROM; 133: RAM.

Claims

(57) [Claims]

1. An optical character reader in which a housing (scanner) containing an image sensor is held in the hand and applied to a document to read characters on the document, the reader recognizing a plurality of characters and symbols (hereinafter simply called characters) and comprising: an image sensor whose field of view accommodates at least one line of characters; character recognition means that recognizes each character in the image captured by the image sensor and outputs its position; a plurality of line detection means that detect whether part of a line is present in each of a plurality of regions set in the field of view; a plurality of line storage means, each of which stores the recognized line obtained from the character recognition means when the corresponding line detection means detects a line; digit alignment processing means that associates the recognized lines stored in the line storage means with one another on the basis of the position information of their characters; line integration processing means that, on the basis of the digit alignment result, obtains an integrated line combining the recognized lines stored in the line storage means; and line change detection means that detects that the line in the field of view has changed; wherein, when a line detection means detects the presence of a line, the line storage means corresponding to that line detection means stores the recognized line obtained from the character recognition means, the digit alignment processing means associates the characters of the recognized lines stored in the line storage means with one another, the line integration processing means obtains the integrated line, and, when the line change detection means detects a line change, the contents of the line storage means are erased.

2. The optical character reader according to claim 1, wherein the line change detection means determines that a line change has occurred when all of the line detection means detect no line.

3. The optical character reader according to claim 1 or 2, wherein the line detection means consists of a horizontal OR circuit that computes the logical OR of black pixels in the horizontal direction within a predetermined range provided in the field of view of the image sensor, and a black length determination unit that determines whether the result of the horizontal OR circuit is continuous over a predetermined range in the vertical direction, and wherein a line is determined to exist when such continuity is detected in the result of the horizontal OR circuit.

4. The optical character reader according to claim 1 or 2, wherein the line detection means examines the position information of each character of the recognized line obtained from the character recognition result and determines that a line exists when there is a recognized (non-rejected) character whose position information falls within a predetermined range, and wherein the newly obtained recognized line is stored in the line storage means when it contains more non-rejected characters within that range than the recognized line already stored in the line storage means.

5. The optical character reader according to claim 1 or 4, wherein the line change detection means comprises time measuring means that measures the time during which no recognized line is obtained while the character recognition means repeatedly performs recognition processing, and determines that a line change has occurred when the time during which no recognized line is obtained exceeds a predetermined time.

6. The optical character reader according to any one of claims 1 to 5, wherein the height of the field of view of the image sensor is in the range of about 1 to 3 times the character height, and the position information of a character used for digit alignment is the horizontal position of the character within the field of view.

7. The optical character reader according to any one of claims 1 to 5, wherein the height of the field of view of the image sensor is smaller than the line spacing of the character lines written on the document, and the position information of a character used for digit alignment is the horizontal position of the character within the field of view.