JPS5824975A

JPS5824975A - Optical character reader

Info

Publication number: JPS5824975A
Application number: JP56122445A
Authority: JP
Inventors: Masaki Komiya; 小宮　雅紀
Original assignee: Toshiba Corp; Tokyo Shibaura Electric Co Ltd
Current assignee: Toshiba Corp
Priority date: 1981-08-06
Filing date: 1981-08-06
Publication date: 1983-02-15

Abstract

PURPOSE: To increase recognition speed and reduce device size by having a plurality of recognition sections share one dictionary memory. CONSTITUTION: A character pattern photoelectrically converted at a photoelectric conversion section 1 is written sequentially into a line buffer 3 via a preprocessing section 2. During this processing, a recognition control processor 4 accesses a format table 5, checks the style of the characters making up the character row being scanned, and selects the dictionary to be referenced. The content of that dictionary is then read out repeatedly from a dictionary memory 6. One character is cut out from the buffer 3, its character size and line width are normalized, and it is written into a recognition section 8a. Next, the second character pattern is cut out, normalized, and given to a recognition section 8b. Similarly, subsequent cut-out character patterns are given to whichever recognition section 8 is not in operation. When the processor 4 receives a recognition end signal from a recognition section, it refers to the similarity results stored in that recognition section 8 to determine the answer and writes the answer into an output buffer 9.

Description

[Detailed description of the invention] The present invention relates to an optical character reader and, in particular, to an optical character reader in which a plurality of recognition units share one dictionary memory.

In recent years, optical character readers (hereinafter referred to as OCR) have become able to read a wide variety of characters. Accordingly, the number of reference standard patterns stored in the dictionary memory within the OCR has been increasing steadily. As a result, the storage capacity of the dictionary memory has become enormous.

To improve the recognition speed of an OCR, it is effective to provide a plurality of recognition units that operate in parallel. However, if a large-capacity dictionary memory is attached to each of these recognition units, the device becomes large and expensive.

The present invention has been made in view of the above circumstances. An object of the present invention is therefore to provide an OCR that has high recognition speed and is small and inexpensive.

Another object of the present invention is to provide an OCR in which a plurality of recognition units share one dictionary memory.

An embodiment of the present invention is described in detail below with reference to the drawings.

FIG. 1 is a block diagram of an embodiment of the present invention.

In the figure, reference numeral 1 denotes a photoelectric conversion unit. The photoelectric conversion unit 1 includes a transport mechanism that transports a form on which the characters to be read are written, and a scanning mechanism that scans the character patterns on the form. Reference numeral 2 denotes a preprocessing unit. The preprocessing unit 2 applies preprocessing such as noise removal to the photoelectrically converted character pattern.

Reference numeral 3 denotes a line buffer. The line buffer 3 stores one line of preprocessed character patterns. Reference numeral 4 denotes a recognition control processor (hereinafter simply called the processor). The processor 4 performs the various data processing related to character recognition and controls each part of the device. Reference numeral 5 denotes a format table. The format table 5 stores the various format information needed for character reading. Reference numeral 6 denotes a dictionary memory. The dictionary memory 6 stores standard patterns of various characters. Reference numeral 7 denotes a dictionary control unit. The dictionary control unit 7 controls reading of the dictionary memory 6. Reference numeral 8a denotes a first recognition unit. The first recognition unit 8a compares the character pattern on the form with the standard patterns read from the dictionary memory 6 and performs character recognition by the pattern matching method.

Reference numeral 8b denotes a second recognition unit. The second recognition unit 8b is identical to the first recognition unit 8a but operates independently of it. Reference numeral 9 denotes an output buffer. The output buffer 9 stores the recognition results.

The components described above, except for the preprocessing unit 2 and the dictionary memory 6, are connected to a bus 10. The photoelectric conversion unit 1 and the preprocessing unit 2 are connected by a signal line 11. The preprocessing unit 2 and the line buffer 3 are connected by a signal line group 12. The dictionary memory 6 and the dictionary control unit 7 are connected by a signal line group 13. Furthermore, the first recognition unit 8a and the second recognition unit 8b are connected to the dictionary control unit 7 by a signal line group 14.

Next, the contents stored in the dictionary memory 6 are described with reference to FIG. 2. As shown in FIG. 2(a), in this embodiment the dictionary memory 6 holds three dictionaries D0, D1, and D2. The handwritten-character reading dictionary D0 is used when reading handwritten characters in a style such as that shown in FIG. 3(a). Dictionary D0 is stored in the region of the dictionary memory 6 from address SA0 to address EA0. The printed-character reading dictionary D1 is used when reading printed characters in a typeface such as that shown in FIG. 3(b). Dictionary D1 is stored in the region of the dictionary memory 6 from address SA1 to address EA1. The dot-character reading dictionary D2 is used when reading dot characters in a style such as that shown in FIG. 3(c). Dictionary D2 is stored in the region of the dictionary memory 6 from address SA2 to address EA2.

If printed characters in two or more typefaces must be read, two or more printed-character reading dictionaries may be provided. Similarly, if dot characters in two or more styles must be read, two or more dot-character reading dictionaries may be provided.

Next, the more detailed structure of dictionaries D0, D1, and D2 is described. FIG. 2(b) shows, as an example, the internal structure of dictionary D0. As illustrated, dictionary D0 holds standard patterns P1 to Pn for n kinds of characters (for example, the 10 digits, the 26 letters of the alphabet, and symbols such as "-").

A header H1 to Hn is attached to the head of each standard pattern P1 to Pn. Each header H1 to Hn contains the character code C1 to Cn of that character, the data length B1 to Bn (for example, the number of bits) of the corresponding standard pattern P1 to Pn, and other information. The other dictionaries D1 and D2 have the same internal structure.
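The header-plus-pattern layout described above can be sketched in software. The following is a minimal illustration only: the names `Record`, `pack_dictionary`, and `unpack_dictionary` are hypothetical, the header is reduced to two words (character code and data length), and the "other information" the source mentions is omitted.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Record:
    code: int           # character code C_k carried in header H_k
    pattern: List[int]  # standard pattern P_k, one bit per list element


def pack_dictionary(records: List[Record]) -> List[int]:
    """Flatten records into one word list: [code, length, bits...] per record."""
    words: List[int] = []
    for record in records:
        words.append(record.code)          # header word 1: character code
        words.append(len(record.pattern))  # header word 2: data length B_k
        words.extend(record.pattern)       # body: the standard pattern itself
    return words


def unpack_dictionary(words: List[int]) -> List[Record]:
    """Walk the flat word list, using each length field to find the next header."""
    records: List[Record] = []
    position = 0
    while position < len(words):
        code, length = words[position], words[position + 1]
        body = words[position + 2 : position + 2 + length]
        records.append(Record(code, body))
        position += 2 + length
    return records
```

Because each header carries the length of its own pattern, records of different sizes can be stored back to back without separators.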

Next, the more detailed configuration of the dictionary control unit 7 is described with reference to FIG. 4. As illustrated, the dictionary control unit 7 includes a dictionary number register 71, a dictionary address memory 72, a dictionary address counter 73, and a character code detection circuit 74. The input terminals of the dictionary number register 71 are connected to the bus 10. The output terminals of the dictionary number register 71 are connected to the address input terminals of the dictionary address memory 72. Address i of the dictionary address memory 72 stores the start address SAi and the end address EAi of dictionary Di. Therefore, setting dictionary number i in the dictionary number register 71 causes the start address SAi and the end address EAi of dictionary Di to be read from the dictionary address memory 72.

The start address SAi and end address EAi of dictionary Di read from the dictionary address memory 72 are supplied to the dictionary address counter 73. The dictionary address counter 73 then counts repeatedly from the start address SAi to the end address EAi. That is, the dictionary address counter 73 starts counting at the start address SAi and, on reaching the end address EAi, starts counting again from the start address SAi. The count value of the counter 73 is supplied to the address input terminals of the dictionary memory 6 via a signal line group 13a (part of the signal line group 13). The contents of dictionary Di are therefore read out of the dictionary memory 6 repeatedly. The contents of dictionary Di read out in this way are first taken into the dictionary control unit 7 via a signal line group 13b (part of the signal line group 13) and then supplied to the first recognition unit 8a and the second recognition unit 8b via a signal line group 14a (part of the signal line group 14). When one of the character codes C1 to Cn appears on the signal line group 13b, the character code detection circuit 74 detects it and generates a character code detection signal CODE. This signal CODE is supplied to the first recognition unit 8a and the second recognition unit 8b via a signal line 14b (part of the signal line group 14).
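The cyclic addressing performed by the dictionary number register 71, dictionary address memory 72, and dictionary address counter 73 can be modeled as a generator. This is a sketch under assumptions: the table `DICTIONARY_ADDRESS_MEMORY` and its address values are invented for illustration, and `address_counter` is a hypothetical name.

```python
from itertools import islice
from typing import Dict, Iterator, Tuple

# Hypothetical contents of the dictionary address memory:
# dictionary number i -> (start address SA_i, end address EA_i).
DICTIONARY_ADDRESS_MEMORY: Dict[int, Tuple[int, int]] = {
    0: (0, 3),
    1: (4, 9),
    2: (10, 12),
}


def address_counter(dictionary_number: int) -> Iterator[int]:
    """Yield SA_i, SA_i + 1, ..., EA_i and then wrap back to SA_i, forever."""
    start, end = DICTIONARY_ADDRESS_MEMORY[dictionary_number]
    address = start
    while True:
        yield address
        address = start if address == end else address + 1


# Take the first nine addresses produced for dictionary 0.
first_nine = list(islice(address_counter(0), 9))
```

Selecting a different dictionary only changes the pair of bounds, which matches the description that writing a number into the register is enough to switch dictionaries.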

Note that the dictionary address counter 73 generates a signal CI when its count value becomes equal to the start address SAi. This signal CI informs the character code detection circuit 74 that the first character code C1 in dictionary Di is being sent onto the signal line group 13b. Detection of the second and subsequent character codes C2 to Cn is performed within the character code detection circuit 74. To locate the next character code, the circuit uses the data B1 to Bn (the data lengths of standard patterns P1 to Pn) contained in the headers H1 to Hn.
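One way to picture the character code detection is as a small state machine that skips over each pattern by its stated length. The sketch below assumes a simplified word stream in which every record is `[code, length, pattern words...]`; that layout and the function name `code_detect_flags` are illustrative assumptions, not details given in the source.

```python
from typing import List


def code_detect_flags(words: List[int]) -> List[bool]:
    """Return, per word of one dictionary pass, whether CODE would be asserted.

    The pass is assumed to begin at the start address, so the first word
    is a character code (the role the source assigns to signal CI).
    """
    flags: List[bool] = []
    remaining = 0          # pattern words left before the next character code
    expect_length = False  # True when the next word is a length field
    for word in words:
        if expect_length:
            flags.append(False)
            remaining = word       # header tells us how far away the next code is
            expect_length = False
        elif remaining > 0:
            flags.append(False)    # inside a standard pattern
            remaining -= 1
        else:
            flags.append(True)     # this word is a character code
            expect_length = True
    return flags
```

The detector never inspects pattern contents, so a pattern word that happens to equal a valid character code cannot trigger a false detection.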

Next, the more detailed configuration of the first recognition unit 8a and the second recognition unit 8b is described. In this embodiment, the internal configurations of the first recognition unit 8a and the second recognition unit 8b are identical, so hereinafter they are simply called the recognition unit 8 when there is no need to distinguish them. FIG. 5 shows the internal configuration of the recognition unit 8. As illustrated, the recognition unit 8 includes a one-character buffer 801 and a similarity calculation circuit 802. As described later, the one-character buffer 801 stores the character pattern of the one character to be recognized. The similarity calculation circuit 802 calculates the similarity between the character pattern stored in the one-character buffer 801 and the standard pattern sent via the signal line group 14a.

The calculated similarity is stored inside the similarity calculation circuit 802 and is sent onto the bus 10 in response to a request from the processor 4.

Note that the similarity calculation circuit 802 operates only when an enable signal EN, described later, is applied.

Therefore, if the kind of character pattern stored in the one-character buffer 801 (digit, letter, symbol, and so on) is known in advance, the similarity calculation circuit 802 can be operated only while a needed standard pattern is being sent on the signal line group 14a. In addition, the similarity calculation circuit 802 generates a calculation end signal FIN that goes low when the similarity calculation ends.

Next, the circuit configuration for generating the enable signal described above is explained. As shown in FIG. 5, the recognition unit 8 has a first memory 803 and a second memory 804. If the character code has m bits, the first memory 803 and the second memory 804 each have 2^m storage areas (the number of possible character codes). Each storage area stores one bit of data. The first memory 803 is in read mode when the recognition command signal RECGN sent from the processor 4 via the bus 10 is high, and in write mode when it is low. The data input terminal of the first memory 803 is supplied with one-bit data WDATA sent from the processor 4 via the bus 10. The second memory 804, on the other hand, is in read mode when the output signal of an AND gate 805 is high, and in write mode when it is low. The signal RECGN and the signal FIN are applied to the two input terminals of the AND gate 805. The signal RECGN is applied to the data input terminal of the second memory 804.

The first memory 803 and the second memory 804 are addressed by an address counter 806.

The counter 806 is cleared when a clear command signal CLEAR is sent from the processor 4 via the bus 10, and is incremented when a clock signal CLOCK is sent. When an AND gate 807 generates a load command signal LOAD, the data present on the signal line group 14a at that moment is loaded into the counter 806. The AND gate 807 is supplied with the recognition command signal RECGN and the character code detection signal CODE sent from the dictionary control unit 7.

The output of the first memory 803 is applied to an AND gate 808. The output of the second memory 804 is sent as a recognition end signal END to the bus 10 and to an inverter 809. The output signal of the inverter 809 is applied to the AND gate 808. The recognition command signal RECGN is also applied to the AND gate 808. The output signal of the AND gate 808 is supplied to the similarity calculation circuit 802 as the enable signal EN.
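The gate connections described for the recognition unit 8 reduce to a few Boolean expressions. The sketch below restates them as a pure function; the function name `recognition_gates` and the dictionary keys are hypothetical, and timing (when each memory output is valid) is ignored.

```python
from typing import Dict


def recognition_gates(
    recgn: bool, code: bool, fin: bool, m1_bit: bool, m2_bit: bool
) -> Dict[str, bool]:
    """Combinational view of gates 805, 807, 808 and inverter 809.

    m1_bit and m2_bit stand for the bits currently read from the first
    memory 803 and the second memory 804.
    """
    load = bool(recgn and code)                  # AND gate 807: load code into counter 806
    end = bool(m2_bit)                           # second memory output is the END signal
    enable = bool(recgn and m1_bit and not end)  # AND gate 808 fed through inverter 809
    m2_read_mode = bool(recgn and fin)           # AND gate 805: high selects read mode
    return {"LOAD": load, "END": end, "EN": enable, "M2_READ": m2_read_mode}
```

Read this way, EN is asserted only for a pattern that is wanted (first memory bit set) and not yet processed (second memory bit clear) while recognition is commanded.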

The contents of the first memory 803 and the second memory 804 are initialized by the processor 4 before recognition starts. To do so, the processor 4 first drives the recognition command signal RECGN low. The first memory 803 and the second memory 804 then enter write mode. In addition, the AND gate 808 stops generating the enable signal EN, and the AND gate 807 stops generating the load command signal LOAD. The recognition unit 8 thereby enters an idle state. Next, the processor 4 generates the clear command signal CLEAR.

As a result, data is written to address 0 of the first memory 803 and the second memory 804. The data WDATA written to the first memory 803 is sent from the processor 4 via the bus 10. Because the signal RECGN is low, "0" is written to the second memory 804.

When the write to address 0 is finished, the processor 4 generates a count signal COUNT and sets the count value of the counter 806 to "1". Writing to address 1 is thereby performed. In the same way, data is written to all storage areas of the first memory 803 and the second memory 804.

The value of the data WDATA written to the first memory 803 is set to "1" if the similarity must be calculated for the standard pattern whose character code has the same value as the address of that storage area, and to "0" otherwise. The second memory 804, on the other hand, is always written with "0".
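The initialization rule can be expressed as building two bit arrays indexed by character code. In this sketch the function name `init_memories` is hypothetical, and the example character codes (0x30 to 0x39 for digits, with a 7-bit code) are an assumption chosen for illustration; the source does not name a character code set.

```python
from typing import List, Set, Tuple


def init_memories(needed_codes: Set[int], code_bits: int) -> Tuple[List[int], List[int]]:
    """Build the initial contents of the first and second memories.

    first[a]  is 1 when the standard pattern with character code a must be compared.
    second[a] starts at 0 for every address (nothing processed yet).
    """
    size = 1 << code_bits  # 2^m storage areas for an m-bit character code
    first = [1 if address in needed_codes else 0 for address in range(size)]
    second = [0] * size
    return first, second


# Example: a digits-only field, assuming digits use codes 0x30..0x39.
digit_codes = set(range(0x30, 0x3A))
first_memory, second_memory = init_memories(digit_codes, code_bits=7)
```

Restricting the first memory to the codes a field can contain is what lets the recognition unit skip every standard pattern that cannot be the answer.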

When initialization of the first memory 803 and the second memory 804 is complete, the processor 4 drives the recognition command signal RECGN high. The first memory 803 and the second memory 804 then enter read mode. In addition, the AND gate 807 now generates the load command signal LOAD each time the character code detection signal CODE is generated. Therefore, when a character code appears on the signal line group 14a, its value is loaded into the counter 806, and the contents of the corresponding storage areas in the first memory 803 and the second memory 804 are read out. When "0" is read from the first memory 803, the AND gate 808 does not generate the enable signal EN. That is, in that case no similarity calculation is performed.

When "1" is read from the first memory 803 and "0" is read from the second memory 804, the AND gate 808 generates the enable signal EN and the similarity calculation is performed. When the similarity calculation ends, the calculation end signal FIN goes low. The second memory 804 thereby enters write mode. Because the signal RECGN is high at this time, "1" is written to the second memory 804. In this way, "1" is written to each storage area in the second memory 804 each time the similarity calculation for the corresponding standard pattern ends. Therefore, when a second similarity calculation is about to be performed for the same standard pattern, "1" is read from the second memory 804 and the recognition end signal END is generated. The processor 4 is thereby notified that recognition has ended.
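The interplay of the two memories over a repeating dictionary stream can be simulated in a few lines. This is a sketch under stated assumptions: `similarity` is a stand-in measure (fraction of matching bits), since the source does not define the similarity formula; `run_recognition_unit` and `start_offset` are hypothetical names; and the memories are modeled as Python dicts keyed by character code rather than 2^m-entry arrays.

```python
from itertools import cycle, islice
from typing import Dict, List, Set, Tuple


def similarity(a: List[int], b: List[int]) -> float:
    """Stand-in measure: fraction of positions where the two bit patterns agree."""
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)


def run_recognition_unit(
    character: List[int],
    dictionary: List[Tuple[int, List[int]]],
    wanted: Set[int],
    start_offset: int = 0,
) -> Dict[int, float]:
    """Consume the cyclic dictionary stream until a wanted code comes round again."""
    codes = {code for code, _ in dictionary}
    if not (wanted & codes):
        raise ValueError("no wanted code in dictionary; END would never be raised")
    first = {code: code in wanted for code in codes}  # first memory: compare this code?
    second = {code: False for code in codes}          # second memory: already compared?
    scores: Dict[int, float] = {}
    # The unit may join the repeating stream at any record, not only the first.
    for code, pattern in islice(cycle(dictionary), start_offset, None):
        if second[code]:
            return scores      # END: this pattern was already processed
        if first[code]:
            scores[code] = similarity(character, pattern)  # EN asserted
            second[code] = True                            # written when FIN goes low
    return scores  # unreachable: cycle() never ends
```

The done flags appear to be what allows each recognition unit to start at an arbitrary point in the dictionary cycle and still see every needed pattern exactly once.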

Next, the overall operation of this embodiment is described. The OCR of this embodiment reads the characters on a form one line at a time. Before reading each line, the processor 4 accesses the format table 5 and looks up the position of the character line to be read next. According to the result, the processor 4 issues a high-speed transport command to the photoelectric conversion unit 1 so that the form is transported at high speed to the next line position. The processor 4 then issues a scan start command to the photoelectric conversion unit 1, and the photoelectric conversion unit 1 starts scanning the character patterns. Depending on the scanning method, the form may be transported at low speed during scanning. The photoelectrically converted character patterns are sent successively to the preprocessing unit 2 via the signal line 11. The preprocessing unit 2 quantizes the photoelectrically converted character patterns and then applies preprocessing such as noise removal. The preprocessed character patterns are written successively into the line buffer 3 via the signal line group 12. When one line of character patterns has been written, the line buffer 3 notifies the processor 4.

While the character patterns are being written into the line buffer 3, the processor 4 accesses the format table 5 again, determines the style of the characters making up the character line being scanned (handwritten, printed, or dot characters), and selects the dictionary to be referenced.

The number i of the selected dictionary is written into the dictionary number register 71 of the dictionary control unit 7. From then on, the contents of dictionary Di are read repeatedly from the dictionary memory 6. Note that the OCR of this embodiment does not allow characters of different styles, such as handwritten and printed characters, to be mixed within the same character line.

Next, the processor 4 accesses the format table 5 once more and looks up the start and end positions of each field contained in the character line being scanned (product name field, unit price field, quantity field, and so on), as well as the kind of characters entered in each field (digits, letters, symbols, and so on).

When the line buffer 3 gives notice that writing has finished, the processor 4 segments the leftmost character from the one line of character patterns stored in the line buffer 3. The start position of the first field (the leftmost field), read earlier from the format table 5, is referenced at this time. The processor 4 then normalizes the character size and line width of the one-character pattern segmented from the line buffer 3. The normalized character pattern is written into the one-character buffer 801 of the first recognition unit 8a. The processor 4 then initializes the first memory 803 in the first recognition unit 8a according to the kind of characters entered in the first field. When this processing is finished, the processor 4 sends the recognition command signal RECGN to the first recognition unit 8a.

This starts the recognition operation in the first recognition unit 8a.

Next, the processor 4 segments and normalizes the character pattern of the second character. The normalized second character pattern is sent to the second recognition unit 8b. Thereafter, in the same way, the character patterns segmented one after another are sent to whichever recognition unit 8 is not in operation.

On receiving the recognition end signal END from a recognition unit 8, the processor 4 refers to the similarity calculation results stored in that recognition unit 8, determines the answer, and writes it into the output buffer 9.
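The processor's two decisions in this flow, which unit receives the next character and which candidate becomes the answer, can be sketched as follows. Both function names are hypothetical, and choosing the highest similarity is an assumption: the source says only that the answer is determined by referring to the similarity results.

```python
from typing import Dict, List, Optional


def assign_to_idle_unit(busy: List[bool]) -> Optional[int]:
    """Return the index of the first recognition unit not in operation, if any."""
    for index, is_busy in enumerate(busy):
        if not is_busy:
            return index
    return None  # every unit is busy; the processor would have to wait


def pick_answer(scores: Dict[int, float]) -> int:
    """Assumed decision rule: the character code with the highest similarity wins."""
    return max(scores, key=scores.get)
```

With two units, `assign_to_idle_unit([True, False])` selects the second unit, mirroring how the second character goes to recognition unit 8b while 8a is still working.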

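When recognition of all characters in the first field is finished, recognition of the second field starts in the same way. However, the contents written to the first memory 803 at initialization are changed according to the kind of characters entered in the second field.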
Recognition of the second field is started in the same manner. However, the contents written in the first memo 9803K at the time of initialization are changed depending on the type of characters written in the second field.

The above describes the case where the present invention is applied to a pattern-matching OCR, but the present invention is also applicable to a feature-extraction OCR. In that case, instead of normalizing the segmented character pattern, the processor 4 creates a feature pattern by extracting geometric and topological features (for example, blocks, loops, strokes, and concavities and convexities) or by creating a direction code string indicating the slope direction of the character line edges, and writes the result into the one-character buffer 801. The dictionary memory 6 stores the standard feature patterns. Furthermore, a correlator is used in place of the similarity calculation circuit 802.

In the above embodiment, an example with two recognition units 8 was described, but three or more recognition units may also be provided.

In the above embodiment, three dictionaries D0 to D2 are provided according to character style (handwritten, printed, or dot characters), but these may be subdivided further, for example into dictionaries for handwritten digits, handwritten letters, handwritten symbols, printed digits, printed letters, printed symbols, dot digits, dot letters, and dot symbols.

As described in detail above, the present invention provides an OCR in which a plurality of recognition units share one dictionary memory. Consequently, an OCR that has high recognition speed and is small and inexpensive is provided.

[Brief explanation of drawings]

FIG. 1 is a block diagram of an embodiment of the present invention, FIG. 2 shows the contents of the dictionary memory, FIG. 3 shows examples of character styles, FIG. 4 is a block diagram of the dictionary control unit, and FIG. 5 is a block diagram of the recognition unit.

1: photoelectric conversion unit, 2: preprocessing unit, 3: line buffer, 4: recognition control processor, 5: format table, 6: dictionary memory, 7: dictionary control unit, 8a: first recognition unit, 8b: second recognition unit, 9: output buffer.

Patent applicant: Tokyo Shibaura Electric Co., Ltd. Agent: patent attorney Kensuke Norichika (則近憲佑) and one other.

Claims

[Claims] An optical character reader comprising a plurality of recognition units, a dictionary memory, a dictionary control unit, and a recognition control unit, wherein the dictionary control unit selectively reads a plurality of standard patterns from the dictionary memory and supplies them to the plurality of recognition units in response to a command from the recognition control unit, and each of the recognition units performs character recognition by selecting a plurality of standard patterns from among the supplied standard patterns in accordance with a command from the recognition control unit.