JPH03246781A

JPH03246781A - Feature vectoring circuit

Info

Publication number: JPH03246781A
Application number: JP2042641A
Authority: JP
Inventors: Hirotomo Aso; 阿曽　弘具; Masayuki Kimura; 木村　正行; Kenji Suzuki; 健司鈴木; Hisayoshi Hayasaka; 早坂　久義; Yoshiyuki Sakurai; 桜井　義之
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1990-02-26
Filing date: 1990-02-26
Publication date: 1991-11-05

Abstract

PURPOSE: To increase processing speed by performing the arithmetic for generating a feature vector in a dedicated circuit configuration. CONSTITUTION: Line element information of a character pattern, in units of at least one dot, is applied to and stored in a first data storage means 1. A second address generating means 4 generates the address for reading the line element information stored in the first data storage means 1 and the address for reading the weight data corresponding to that line element information from a second data storage means 3. The weight data that is read out is applied to accumulating means 5-1 to 5-n, which are provided one per line element direction. A decoding means 6 decodes the line element information and, according to the result, enables the accumulating means 5-1 to 5-n so that the weight information is accumulated per direction. A feature vector is thus obtained quickly even when line element data is obtained dot by dot.

Description

[Detailed Description of the Invention]

[Summary]

The invention relates to a feature vectorization circuit in a recognition device that recognizes patterns by means of feature vectors. Its object is to provide a feature vectorization circuit that obtains feature vectors quickly even when, for example, line element data is obtained dot by dot. The circuit comprises: a first data storage means that receives and stores line element information of a character pattern in units of at least one dot; a first address generation means that generates an address indicating where the line element information is stored in the first data storage means; a second data storage means that stores weight data for the line element information; a second address generation means that generates an address for reading the line element information stored in the first data storage means and an address for reading, from the second data storage means, the weight data corresponding to the line element information read at that address; a number of accumulating means, provided in correspondence with the line element directions, to which the weight data output from the second data storage means is applied; and a decoding means that decodes the line element information stored in the first data storage means and, with the decoding result, enables the accumulating means so that the weight information is accumulated for each direction.

[Industrial application field]

The present invention relates to a recognition device for character patterns and the like, and more particularly to a feature vectorization circuit in a recognition device that recognizes patterns by means of feature vectors.

[Background Art]

With the development of computer systems, reading devices have come into practical use that capture image data, cut characters out of the captured image data, and recognize each character of the document that was read. Such a reading device divides the dot data read by, for example, an image scanner into predetermined regions, compares the character within each region (the character in the box) with predetermined character data, and outputs the most similar character as the result. The predetermined character data is generally stored in a dictionary memory, which holds, for example, data characterizing each prescribed character. When a character to be recognized is input, it is characterized in the same way, and its distance from the predetermined feature data stored in the dictionary memory is computed. The character with the smallest distance is output as the recognition result.

In character recognition by such conventional computer systems, feature vectors are used both for the characterized data stored in the dictionary memory and for characterizing the input character (image data). A feature vector is obtained, for example, by determining the direction of the line elements that make up a character dot by dot, dividing the single character region into sub-regions, and totaling the directions found within each sub-region. Using feature vectors raises the recognition rate for input characters.

[Problem to be solved by the invention]

Although the feature vector raises the recognition rate, totaling the directions for each region from the line element data has conventionally meant reading out the individual line element data region by region and, for example, incrementing a register provided in advance for each direction to count the occurrences of that direction. When weighting by position within the region is applied, the weighted value is added to the register for the direction that was read, one dot at a time. All of this processing is performed on the individual items of line element data; when line elements are obtained dot by dot, the data is processed dot by dot.

Obtaining the feature vector therefore takes a long time.

In addition, to raise the recognition rate, characters are now read at higher resolution, and recognition is performed on more dots than before.

A single character therefore consists of more dot information when it is read, and obtaining the feature vector takes even longer.

An object of the present invention is to provide a feature vectorization circuit that obtains feature vectors quickly even when, for example, line element data is obtained dot by dot.

[Means to solve the problem]

FIG. 1 is a block diagram showing the principle of the present invention.

The first data storage means 1 receives and stores line element information of a character pattern in units of at least one dot. This line element information indicates, for example, the direction in which the line continues at that dot.

The first address generation means 2 generates an address indicating where in the first data storage means 1 the line element information is to be stored.

The second data storage means 3 stores weight data for the line element information. The weight depends, for example, on the position of the dot of each line element.

The second address generation means 4 generates an address for reading the line element information stored in the first data storage means 1 and an address for reading, from the second data storage means 3, the weight data corresponding to the line element information read at that address.

The accumulating means 5-1 to 5-n receive the weight data output from the second data storage means 3 and are provided one for each line element direction.

The decoding means 6 decodes the line element information stored in the first data storage means 1 and, with its output, enables the accumulating means 5-1 to 5-n so that the weight information is accumulated for each direction.

[Operation]

For example, the line element information within the region of one character is stored in the first data storage means 1, in units of one dot or several dots, at the address positions designated by the first address generation means 2.

To obtain the feature vector within a specific region from the stored data, the second address generation means 4 applies to the first data storage means 1 the addresses that designate the dots in that region, and the first data storage means 1 applies the data at each address, that is, the line element data, to the decoding means 6.

In step with each address applied to the first data storage means 1, the second address generation means 4 applies the address of the corresponding position to the second data storage means 3, which stores the weight information for the region. The weight data output from the second data storage means 3 is applied to the accumulating means 5-1 to 5-n. The accumulating means 5-1 to 5-n are provided one per line element direction; the decoding means 6 decodes the value representing the stored direction and designates one of the accumulating means 5-1 to 5-n. In this way the weights are accumulated direction by direction, and the accumulating means 5-1 to 5-n end up holding the accumulated value for each direction.

The addresses generated by the first address generation means 2 place each dot at a specific address position so that accumulation can proceed region by region within each character. When the second address generation means 4 reads the data out, it can therefore be read in step with the weights stored in the second data storage means 3, which speeds up the processing.
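The operation described above can be modeled in software as a minimal sketch. It assumes that the line element codes and the weights for one region arrive in the same order, one pair per dot, and that code 0 means "no line element" while codes 1 to n select accumulators 5-1 to 5-n. The function name and the code values are illustrative assumptions and do not appear in the specification.

```python
NO_LINE = 0  # assumed code for "no line element"; the encoding is not given in the text


def accumulate_region(line_codes, weights, n_directions=4):
    """Software model of accumulating means 5-1..5-n for one region.

    line_codes: per-dot direction codes as read from the first data storage means
    weights:    per-dot weights as read from the second data storage means
    """
    accumulators = [0] * n_directions
    for code, weight in zip(line_codes, weights):
        if code == NO_LINE:
            continue  # the decoder enables no accumulator for this dot
        accumulators[code - 1] += weight  # the decoder enables exactly one accumulator
    return accumulators
```

For example, `accumulate_region([1, 0, 2, 1], [4, 3, 2, 1])` adds 4 and 1 to the first accumulator and 2 to the second. In the circuit the per-direction additions are selected by hardware enables rather than by a branch, which is where the speed-up is claimed.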

[Embodiment]

The present invention will now be described in detail with reference to the drawings.

FIG. 2 is a system configuration diagram of an embodiment of the present invention.

Information read by an image scanner or the like is stored in the image memory 10 as image data.

The image memory 10 has the capacity to hold one page read by the image scanner and stores each dot of the read information as a binary value, white or black, that is, 0 or 1.

The image data stored in the image memory 10 is applied to the noise removal module 11, which removes noise introduced during reading. The noise removed is unrelated to the character information: for example, when in a 3×3 mask the center dot is black and the eight surrounding dots are white, the noise removal module 11 sets the center dot to white. The noise removal module is located in the character recognition preprocessing section 12, but this is not a requirement; noise removal may instead be performed when each character is stored in the normalization module 16 described later, or during thinning or line element extraction.
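The isolated-dot rule given as an example can be sketched as follows. This is only an illustration of that one rule; treating positions outside the image as white is an assumption, and the function name is hypothetical.

```python
def remove_isolated_dots(img):
    """Set a black dot (1) to white (0) when all eight neighbours are white.

    img is a list of rows of 0/1 values. Positions outside the image are
    treated as white (an assumption; the text does not cover the border).
    """
    height, width = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(height):
        for x in range(width):
            if img[y][x] != 1:
                continue
            # Count black dots in the 3x3 window, excluding the center itself.
            black_neighbours = sum(
                img[ny][nx]
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
            ) - 1
            if black_neighbours == 0:
                out[y][x] = 0
    return out
```

Because the decision for every dot is taken on the unmodified input, the result does not depend on scan order.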

The image information from which noise has been removed by the noise removal module 11 is applied to the row histogram module 13, the column histogram module 14, and the readout control module 15. The row histogram module 13 projects the read information, for example the contents of the sheet read by the image scanner, dot by dot, and obtains the number of dots in each one-dot row.

That is, for each one-dot row (horizontal direction), it determines how many black dots that row contains. The column histogram module 14 likewise projects in the column direction and determines the number of black dots projected.

The data is read out of the image memory 10 one dot at a time in the row direction, in the same order as a raster scan, and passes through the noise removal module 11. From this data the row histogram module 13 counts the black dots one after another (for one dot row) and so obtains the number of black dots row by row. These counts form the row histogram. The column histogram module 14 has one counter for each dot position in a dot row, and as the dots of a row arrive one after another it increments the counter at each position where the dot is black. When this has been done for one page, the row histogram module 13 and the column histogram module 14 yield the row histogram and the column histogram, which give the number of dots at each row position and each column position. The results are applied to the readout control module 15.
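The two projections can be written compactly as below. This is a minimal sketch over a page held as a list of rows of 0/1 values; the function names are illustrative.

```python
def row_histogram(img):
    """Number of black dots (1s) in each one-dot row."""
    return [sum(row) for row in img]


def column_histogram(img):
    """Number of black dots (1s) in each one-dot column."""
    return [sum(column) for column in zip(*img)]
```

The hardware described above obtains both histograms in a single raster pass, with one running count for the current row and one counter per column position; the sketch makes two passes for clarity.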

From the row and column histograms the readout control module 15 determines the row positions and column positions in turn. These positions can be obtained, for example, from the period of the row histogram or of the column histogram.

Besides determining the row and column positions, the readout control module 15 performs the following processing. Image data, for example information read by an image scanner, may be skewed, depending on how the paper was placed.

The readout control module 15 therefore varies, step by step, the angle at which the histograms are taken so that the column and row histograms reach their maximum values, and thereby obtains a correction angle.

The image information from the noise removal module 11 is then input again to obtain the final histograms. Starting at the point where the row histogram obtained at the corrected angle (the angle at which the histogram takes its maximum) changes from 0 to positive (the point where it changes from positive to 0 may also be used), one line of data, covering one period and following that angle, is read out and stored in a row buffer provided in the readout control module 15.

For the one line of data stored in the row buffer, the readout control module 15 then recomputes the column histogram within that line, cuts the data out starting at the positions where the column histogram changes from 0 to positive, and outputs it to the normalization module 16. The data is also output to the conversion table creation module 17.
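Cutting at the points where a histogram changes from 0 to positive amounts to finding the runs of non-zero bins. The sketch below illustrates that idea only; it ignores skew correction and the use of the histogram period, and the function name is an assumption.

```python
def nonzero_runs(histogram):
    """Return (start, end) index pairs, end exclusive, of consecutive non-zero bins.

    Applied to a row histogram this gives candidate text lines; applied to the
    column histogram of one line it gives candidate character regions.
    """
    runs = []
    start = None
    for i, count in enumerate(histogram):
        if count > 0 and start is None:
            start = i  # histogram changes from 0 to positive
        elif count == 0 and start is not None:
            runs.append((start, i))  # histogram changes from positive to 0
            start = None
    if start is not None:
        runs.append((start, len(histogram)))
    return runs
```

Characters that touch or that contain internal gaps would need the additional period information mentioned in the text.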

The data cut out in this way is the data of one character region.

The conversion table creation module 17 obtains the conversion data that the normalization module 16 uses to normalize one character. It projects the one-character region cut out by the readout control module 15 in the column and row directions and, starting from the first column and row that contain a black dot, increments the column and row counters dot by dot (that is, per row or column) to obtain the values up to the final value within the one-character region.

From the final row and column values for the cut-out character, that is, from the size of the cut-out character, the normalization module 16 enlarges the character so that it fills the whole of the character region. For example, the character is enlarged to fill a one-character region of 64×64 dots. If the conversion table creation module 17 finds the character to measure 48 dots in both the column and row directions, the 48-dot character is converted to 64 dots.

In this processing the character is enlarged by repeating the data of rows and columns at specific positions. For reduction, rows and columns at specific positions are read out repeatedly and ORed together so that they collapse into a single row or column.
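One way to picture this normalization is the sketch below, which enlarges by repeating rows and reduces by ORing rows together, then does the same for columns. The index mapping `i * n // target` stands in for the conversion table and is an assumption; the specification does not state which rows or columns are repeated or merged.

```python
def scale_rows(rows, target):
    """Scale a list of 0/1 rows to `target` rows.

    Enlargement repeats source rows; reduction ORs several source rows
    into one output row so that no black dot is lost.
    """
    n = len(rows)
    if target >= n:
        return [rows[i * n // target][:] for i in range(target)]
    out = [[0] * len(rows[0]) for _ in range(target)]
    for i, row in enumerate(rows):
        j = i * target // n
        out[j] = [a | b for a, b in zip(out[j], row)]
    return out


def normalize(img, size=64):
    """Scale a character bitmap to size x size dots."""
    rows = scale_rows(img, size)
    transposed = scale_rows([list(column) for column in zip(*rows)], size)
    return [list(row) for row in zip(*transposed)]
```

With a 48×48 input and `size=64`, every third source row and column is used twice, which matches the 48-to-64 example in the text.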

Once the normalization module 16 has enlarged the character to fill the one-character region of, for example, 64×64 dots, the thinning module 18 thins the character. The thinning module 18 uses an 11-dot mask: the 3×3 block formed by the center dot and the dots within one dot of it vertically and horizontally, plus the dot one further to the left and the dot two above the center.

Setting the center dot to 0 whenever the mask matches one of a set of predetermined patterns strips one dot's width from around the strokes of the character in a single pass. Repeating this mask-based thinning reduces the character to lines one dot wide.

The thinned character of, for example, 64×64 dots obtained by the thinning module 18 is applied to the line element extraction module 19 and converted into line elements. The module represents each target dot, that is, each center dot, by one of four kinds of line element according to where neighboring black dots lie: above or below; to the left or right; to the upper right or lower left; or to the upper left or lower right. When a dot falls into more than one of these four kinds, priority is applied, for example vertical first and then horizontal, so that a single direction is determined for each dot.

If the center dot is 0, that is, white, no line is deemed to exist.

The line element extraction module 19 thus distinguishes five states: the four directions vertical, horizontal, diagonal rising to the right, and diagonal rising to the left, plus the case where no line element exists. It expresses the state of each dot as a 3-bit value and applies the resulting 3×64×64 bits of information to the feature vector module 20.
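The per-dot classification with priority can be sketched as follows. The numeric codes, the priority of the two diagonals, and the handling of a black dot with no black neighbour (classified here as "no line element") are assumptions; the text only fixes vertical before horizontal as an example and says the state fits in 3 bits.

```python
# Assumed 3-bit codes; the specification does not give the actual values.
NONE, VERTICAL, HORIZONTAL, DIAG_UP_RIGHT, DIAG_UP_LEFT = 0, 1, 2, 3, 4

# Neighbour offsets (dy, dx) for each direction, listed in priority order.
_PRIORITY = [
    (VERTICAL, [(-1, 0), (1, 0)]),
    (HORIZONTAL, [(0, -1), (0, 1)]),
    (DIAG_UP_RIGHT, [(-1, 1), (1, -1)]),
    (DIAG_UP_LEFT, [(-1, -1), (1, 1)]),
]


def line_element(img, y, x):
    """Return the direction code for the dot at (y, x) of a thinned 0/1 bitmap."""
    if img[y][x] == 0:
        return NONE  # white center: no line exists
    height, width = len(img), len(img[0])
    for code, offsets in _PRIORITY:
        for dy, dx in offsets:
            ny, nx = y + dy, x + dx
            if 0 <= ny < height and 0 <= nx < width and img[ny][nx]:
                return code  # first matching direction wins
    return NONE
```

Applying `line_element` to every dot of a 64×64 bitmap yields the 64×64 array of 3-bit codes that is passed on to feature vectorization.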

The feature vector module 20 divides the line element information obtained by the line element extraction module 19 into blocks of 8 dots horizontally and vertically. One vector module region is formed by a block together with its neighbors below and to the right (2×2 blocks), giving a region of 16×16 dots, and the module counts how many line elements of each of the four directions (vertical, horizontal, upper right, upper left) exist within that region. Feature vectors are thus obtained over regions of 16×16 dots, and because the region is moved in steps of 8 dots, there are 7 positions in each of the row and column directions, for a total of 7×7 feature vector regions.
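The count of 7×7 regions follows from sliding a 16-dot window over 64 dots in steps of 8: (64 - 16) / 8 + 1 = 7 positions per axis. A short sketch, with illustrative names, lists the top-left corner of each region:

```python
def window_origins(size=64, window=16, step=8):
    """Top-left (y, x) corners of the overlapping feature vector regions."""
    starts = range(0, size - window + 1, step)
    return [(y, x) for y in starts for x in starts]
```

With the default arguments this gives 49 origins, from (0, 0) to (48, 48), and each interior 8×8 block is shared by four overlapping regions.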

In counting the directions within each region, the feature vectorization module 20 applies a weight that is highest at the center and decreases toward the edge. For example, each dot in the central 4×4 area has weight 4, each dot in the 2-dot-wide band around it has weight 3, each dot in the next 2-dot-wide band has weight 2, and each dot in the outermost 2-dot-wide band has weight 1; the feature vector is obtained with these weights applied.
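The example weighting can be generated from each dot's distance to the nearest edge of the 16×16 region. The sketch below is one way to build such a table in software; in the circuit the weights would simply be held in the second data storage means, and the function name is illustrative.

```python
def weight_table(n=16):
    """Weights for an n x n region: 1 in the outer 2-dot band, rising by 1 every 2 dots."""
    return [
        [min(y, x, n - 1 - y, n - 1 - x) // 2 + 1 for x in range(n)]
        for y in range(n)
    ]
```

For n = 16 the table holds 16 dots of weight 4, 48 of weight 3, 80 of weight 2, and 112 of weight 1, which matches the central 4×4 area and the three surrounding 2-dot bands described above.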

Because the normalization module 16 brings every character to be recognized to the same size, identical characters have nearly identical feature vectors, and the feature vectors differ from character to character. Very similar characters nevertheless exist. In this embodiment, therefore, to speed up the computation and improve the recognition rate, standard patterns of feature vectors are used and are classified into classes within each feature vectorization region, that is, each cell, and within each cell the distances between the standard patterns of the L classes (for example, L = 20) and the unknown input are obtained. In other words, the distance between the feature vector in each cell of the standard pattern and the feature vector of the same cell obtained by the feature vector module 20 is computed cell by cell. Each cell is divided into classes (class 1 to class L), and for each cell the classes are ranked by distance and the five classes with the smallest distances are found.

The distance calculation module 21 computes these distances using the class dictionary 23-1, which stores the standard patterns class by class. When distances are instead obtained individually for particular candidate characters, the candidate dictionary 23-2 is used (the switch SW then selects the candidate dictionary 23-2).

The top selection and score assignment module 22 finds the top five classes and determines the score for each of them, cell by cell. That is, from the distances obtained by the distance calculation module 21 it determines the score to be given to each of the first- to fifth-ranked classes, from which the score of each character is obtained. For example, the class with the first-ranked (shortest) distance receives 5 points, the next 4 points, and the following ones 3, 2, and 1. This is done separately for each of cells 1 to 49. The results from the top selection and score assignment module 22 are applied to the overall evaluation module 24.
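For one cell, the ranking and score assignment can be sketched as below. The function name and the dictionary return type are assumptions made for illustration; classes outside the top five simply receive no entry.

```python
def assign_scores(distances, top=5):
    """Map class index -> score for the `top` classes with the smallest distances.

    distances[k] is the distance between the input and class k in one cell.
    The closest class gets `top` points, the next `top - 1`, and so on down to 1.
    """
    ranked = sorted(range(len(distances)), key=lambda k: distances[k])[:top]
    return {k: top - rank for rank, k in enumerate(ranked)}
```

For example, with distances `[0.9, 0.1, 0.5, 0.3, 0.7, 0.2, 0.8]` class 1 receives 5 points and class 4 receives 1 point. The same function would be applied independently to each of the 49 cells.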

The overall evaluation module 24 calculates the degree to which the input, that is, the input character, matches each candidate. It has three modes of operation: associative matching mode, exhaustive matching mode, and individual matching mode.

In associative matching mode the score of a candidate is calculated from the masks corresponding to the candidate, as stored in the associative dictionary 23-3, and the classes to which it belongs.

As shown in FIG. 2(b), the associative dictionary stores, for each mask and addressed by candidate ID, the class ID of the class to which the candidate belongs in that mask.

This data is obtained by clustering, according to their (weighted) distances, the set of D-dimensional partial vectors of all candidates for each mask ID; only the result is stored in the associative dictionary. The class dictionary 23-1 used by the distance calculation module is created at the same time so that the two correspond.

The associative dictionary 23-3 and the class dictionary 23-1 correspond to each other and come in the same varieties. When two or more kinds of dictionary are stored in one memory, the dictionary to use is specified by the position at which dictionary lookup starts. (The dictionary can be partitioned by candidate ID and the overall evaluation carried out on each part in parallel, which makes higher speed easy to achieve where it is required.)

The associative dictionary 23-3 is a table giving the class ID k of the class to which candidate a belongs in mask m. Writing this as C(m, a) = k, the overall evaluation value V(a) for candidate a (a = 1 to C) is obtained by

$$V(a) = \sum_{m=1}^{M} P(m, C(m, a)) \qquad (M = 49)$$

where P(m, k) denotes the score.
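Associative matching can be sketched as a table lookup followed by a sum. The sketch assumes that the overall evaluation value of a candidate is the sum, over all masks, of the score awarded to the class the candidate belongs to in that mask, and that a class outside the top five contributes 0. The data layout and names are illustrative.

```python
def associative_scores(class_of, points):
    """Overall evaluation value V(a) for every candidate a.

    class_of[m][a]: class ID of candidate a in mask m (the associative dictionary)
    points[m]:      dict mapping class ID -> score awarded in mask m
    """
    n_masks = len(class_of)
    n_candidates = len(class_of[0])
    return [
        sum(points[m].get(class_of[m][a], 0) for m in range(n_masks))
        for a in range(n_candidates)
    ]
```

Because each candidate costs only one lookup and one addition per mask, the evaluation can be split by candidate ID and run in parallel.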

The exhaustive matching mode and individual matching mode of the overall evaluation module compute a value for each candidate.

With a = 1 to C in exhaustive matching mode, and j = 1 to ck with a = b(j) in individual matching mode, and with the distance written d(m, a), the value

$$V(a) = \sum_{m=1}^{M} d(m, a)$$

is obtained. This value V(a) is the (weighted) distance between the feature vectors of candidate a and the input.

The top candidate selection module 25 selects and outputs a predetermined number of the highest-ranked characters, for example five.

These top five characters are the recognition result for the image data that was read.

All of the operations described above are carried out as pipeline processing. That is, the data in the image memory 10, for example one page, is read out in pipeline fashion; the readout control module 15 divides it into lines and outputs it to the normalization module 16 one character at a time.

Thinning, line element extraction, feature vectorization, and recognition are then carried out character by character.

The top selection module 25 ranks the candidates by their overall evaluation values and selects the top five. In associative or exhaustive matching mode its input is the set {(a', V(a)) | a = 1 to C}, where a' is a modified form of a; in individual matching mode it is {(j, V(a)) | j = 1 to ck, a = b(j)}, the overall evaluation output of individual matching. The sort order is descending (largest first) for associative matching and ascending (smallest first) otherwise. The output consists of the candidate IDs (or input order numbers) arranged in sorted order, together with their overall evaluation values.
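The mode-dependent ordering can be sketched as follows: scores from associative matching are ranked largest first, whereas distances from the other modes are ranked smallest first. The function and parameter names are illustrative assumptions.

```python
def select_top(values, associative, top=5):
    """Return up to `top` (candidate index, value) pairs in ranked order.

    values[a] is the overall evaluation value V(a) of candidate a.
    associative=True  -> values are scores, so larger is better (descending).
    associative=False -> values are distances, so smaller is better (ascending).
    """
    order = sorted(range(len(values)), key=lambda a: values[a], reverse=associative)
    return [(a, values[a]) for a in order[:top]]
```

For example, `select_top([10, 9, 3], associative=True, top=2)` ranks candidate 0 first, while `select_top([0.4, 0.1, 0.9], associative=False, top=2)` ranks candidate 1 first.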

The embodiment described so far is a pattern recognition device that uses the feature vectorization circuit of the present invention. The feature vectorization circuit in that pattern recognition device is described in more detail below.

FIG. 3 is a detailed circuit configuration diagram of an embodiment of the present invention.

A single extracted character pattern is converted by the thinning module 18 into a character with a stroke width of one dot, and the line segmentation module 19 then determines, for each dot, the direction in which the thin line continues (the line element). The data obtained by the line segmentation module 19 (line element data) is applied to the data buffer 31. The data buffer 31 is, for example, a FIFO with enough capacity to store the line element data for one character. The data stored in the data buffer 31 is read out 16 bits at a time and applied to the data exchange memory 32.

Meanwhile, clock pulses are applied to the sequence counter 33 in step with reads issued by a CPU or the like (not shown), and the sequence counter 33 counts them. One clock pulse is generated for each read from the data buffer 31, so the sequence counter increments its count once per data read.

The count value of the sequence counter 33 is applied to the address generation ROM 34, which stores the locations in the data exchange memory 32 at which the data is to be written. The count value of the sequence counter 33 corresponds one-to-one to the position of the data read from the data buffer 31. For example, when the sequence counter 33 is "1", it designates the corresponding group of four dot data items shown in FIG. 4.

In the embodiment shown in FIG. 3, the line element data is read out 16 bits at a time. For each dot, the line element data indicates one of four directions, the absence of a line element, or a crossing, so the direction of one dot is expressed in 3 bits. To make readout more efficient, however, each dot occupies 4 bits, and the line element information for four dots is applied to the data exchange memory 32 in a single read.
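The word layout described here, four dots of 4 bits each in one 16-bit word with a 3-bit code per dot, can be sketched in Python as follows. The nibble order (dot 0 in the least significant nibble) and the function names are assumptions made for illustration; the text does not state the bit order.

```python
def pack_dots(codes):
    """Pack four 3-bit line element codes into one 16-bit word.

    Each dot occupies a 4-bit field (the top bit of each field is unused).
    Assumption: dot 0 goes in the least significant nibble.
    """
    if len(codes) != 4:
        raise ValueError("exactly four dots per word")
    word = 0
    for i, code in enumerate(codes):
        if not 0 <= code <= 0b111:
            raise ValueError("line element code must fit in 3 bits")
        word |= code << (4 * i)
    return word


def unpack_dots(word):
    """Recover the four 3-bit codes from a 16-bit word."""
    return [(word >> (4 * i)) & 0b111 for i in range(4)]


if __name__ == "__main__":
    w = pack_dots([1, 2, 3, 4])
    print(hex(w), unpack_dots(w))
```

Padding each dot to 4 bits wastes one bit per dot but lets a single 16-bit read deliver exactly four dots, one per decoder.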

The data buffer 31 has a FIFO structure, so the data item that was input first, FIFO-DATA1 (four dot data items), is the first to be applied to the data exchange memory 32.

At this time the address generation ROM 34 outputs "0000", that is, 0, and the data exchange memory 32 stores FIFO-DATA1 at address 0. FIFO-DATA2 is applied next. The sequence counter 33 now reads 0001, and location 0001 of the address generation ROM 34 holds 64 (decimal). This value is applied to the input address of the data exchange memory 32, so FIFO-DATA2 (again four dot data items) is stored at address 64. The address generation ROM 34 continues to supply addresses to the data exchange memory 32 in this way, so that successive data items are stored 64 addresses apart. When the 17th data item, FIFO-DATA17, is applied, the address becomes 1, and the data exchange memory 32 stores FIFO-DATA17 at address 1. FIFO-DATA18 is then stored at address 65, and so on, each item being stored immediately after the corresponding one of the 16 items stored earlier.
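The write-address pattern above (0, 64, 128, ... for the first 16 words, then 1, 65, ... for the next 16) amounts to a fixed table lookup. The following Python sketch reproduces it with arithmetic; the function name `write_address` and the closed-form expression are illustrative assumptions, since the patent stores the addresses in a ROM rather than computing them.

```python
WORDS_PER_COLUMN = 16   # 64 vertical dots / 4 dots per word
ADDRESS_STRIDE = 64     # successive words are stored 64 addresses apart
TOTAL_WORDS = 1024      # 64 x 64 dots / 4 dots per word


def write_address(count):
    """Map the sequence counter value (0-based FIFO read index) to the
    write address in the data exchange memory.

    count = 0 -> 0, 1 -> 64, ..., 15 -> 960, 16 -> 1, 17 -> 65, ...
    """
    if not 0 <= count < TOTAL_WORDS:
        raise ValueError("count out of range")
    return (count % WORDS_PER_COLUMN) * ADDRESS_STRIDE + count // WORDS_PER_COLUMN


if __name__ == "__main__":
    print([write_address(n) for n in (0, 1, 2, 15, 16, 17)])
```

Because every counter value maps to a distinct address, the 1024 words of one character fill the memory exactly once, which is what allows the later readout to fetch any 16 x 16 region by address alone.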

In this embodiment the area of one character is 64 dots × 64 dots, and the data is applied to the data buffer 31 in parallel, four dots at a time.

That is, the data buffer 31 is a 1024-stage FIFO that stores the 4096 dots of the 64 × 64 area in units of four dots. The data items FIFO-DATA1 to FIFO-DATA1024 held in the data buffer 31 (16 bits each) are stored in sequence at 64-address intervals by the operation described above, as shown in FIG. 5. As the relationship between the converted image data and the memory addresses in FIG. 6 makes clear, each 4-dot unit of line element data consists of four vertically adjacent dots that arrive together. With four vertical dots per word as in FIG. 6, addresses 0 to 63 run in the horizontal direction. In other words, there are 64 addresses horizontally (0 to 63), one per dot, and 16 addresses vertically, because four dots are read out together.

As a result of this operation, the data exchange memory 32 holds the data as shown in FIG. 5, with vertically consecutive 4-dot words stored 64 addresses apart.

After this storing operation is finished, the sequence counter 35 starts counting, and its count is applied to the address generation ROM 36 and, via the buffer 37, to the weighting table RAM 38. The sequence counter 35 starts counting on an instruction from the CPU (not shown). As its incrementing count is applied to the address input of the address generation ROM 36, the ROM generates the addresses to be accessed for one square (16 × 16 dots) at a time. FIG. 7 is a table of the addresses generated for square 1. For square 1 the addresses are 0-15, 64-79, 128-143, and 192-207: the ROM outputs 0 when the sequence counter is 0, 1 when it is 1, and so on; 64 when it is 16, 65 when it is 17, and so on; 128 when it is 32, 129 when it is 33, and so on; and 192 when it is 48, 193 when it is 49, and so on. The output of the address generation ROM 36 is applied to the output address of the data exchange memory 32. In terms of the relationship between the converted image data and the memory addresses in FIG. 6, this selects 0 to 15 in the horizontal direction and 0, 64, 128, and 192 in the vertical direction, so the data exchange memory 32 outputs the contents of 64 addresses in total. The data output from the data exchange memory is latched temporarily in a flip-flop (latch) 39 for timing alignment and is then applied to decoders 40-1 to 40-4 in 4-bit units, that is, one dot per decoder.
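The read addresses for square 1 follow directly from the counter values quoted above (16 gives 64, 17 gives 65, 32 gives 128, 48 gives 192). A minimal Python sketch, assuming a closed-form expression in place of the ROM contents and a hypothetical function name `read_address_square1`:

```python
DOTS_PER_SIDE = 16      # a square is 16 x 16 dots
ADDRESS_STRIDE = 64     # one memory row holds 64 horizontal positions


def read_address_square1(count):
    """Read address for square 1 as a function of the sequence counter.

    The counter runs 0..63: 16 horizontal positions for each of the four
    4-dot-high rows that make up 16 vertical dots.
    """
    if not 0 <= count < 64:
        raise ValueError("square 1 uses counter values 0..63")
    row = count // DOTS_PER_SIDE      # 0..3, selects base 0, 64, 128, 192
    column = count % DOTS_PER_SIDE    # 0..15
    return row * ADDRESS_STRIDE + column


if __name__ == "__main__":
    print([read_address_square1(n) for n in (0, 15, 16, 17, 32, 48, 63)])
```

The sketch covers square 1 only. The tables for other squares (FIG. 9 for square 2) are not reproduced in the text, so no offset scheme for them is assumed here.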

Decoders 40-1 to 40-4 each decode the vector direction data (0°, 90°, 45°, 135°) of their dot and output a high-level (H) signal when a component in the corresponding direction is present. The 0° decode outputs of decoders 40-1 to 40-4 are applied to AND gates 41-1 to 44-1, and the 90°, 45°, and 135° outputs are likewise applied to their corresponding AND gates.

In this embodiment, weighted vectors are obtained for each square within the one-character area. An AND gate that is turned on by the decoded value receives 4-bit data from the weighting table RAM 38 via a flip-flop (latch) 45 for timing alignment. Because weights are set for cells of 2 dots within the 16 dots of a square, adjacent dots use the same weight value. The weighting table RAM is therefore 8 bits wide, and each 4-bit half, which covers two dots, is applied to the corresponding AND gates. For 0°, for example, AND gates 44-1 and 43-1 correspond to adjacent dots, and AND gates 42-1 and 41-1 correspond to adjacent dots, so the upper 4 bits and the lower 4 bits each feed one pair of AND gates. The outputs of AND gates 41-1 to 44-1 are applied to accumulation circuits 46-1 to 49-1.

In this embodiment, weighted feature vectors are also obtained for 90°, 45°, and 135°, so corresponding AND gates are provided in the same way, and their outputs are applied to accumulation circuits 46-2 to 49-2, 46-3 to 49-3, and 46-4 to 49-4, respectively.

Since 0°, 90°, 45°, and 135° are all handled in the same way, only 0° is described in detail below. When decoders 40-1 to 40-4 detect 0°, the corresponding AND gate turns on and passes the 4-bit data output from the weighting table RAM to its accumulation circuit. Accumulation circuits 46-1 to 49-1 accumulate the values they receive.

When all operations for mask 1 are finished, accumulation circuits 46-1 to 48-1 add their data in turn to accumulation circuit 49-1 on an instruction from the CPU (not shown). Accumulation circuit 49-1 thus obtains the weighted feature vector component for the 0° direction.
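The decode, gate, accumulate, and merge sequence can be modelled in software as a sketch. Everything specific in it is an assumption made for illustration: the 3-bit code assignment in `CODE_TO_DIRECTION` is hypothetical (the text does not give the encoding), codes outside that table are simply ignored, and the per-lane weights are passed in directly rather than read from a table.

```python
DIRECTIONS = (0, 90, 45, 135)

# Hypothetical 3-bit encoding; the patent does not specify the actual codes.
CODE_TO_DIRECTION = {1: 0, 2: 90, 3: 45, 4: 135}


def accumulate_square(words, weights):
    """Model of four decoder lanes feeding per-direction accumulators.

    words:   sequence of 4-tuples, one line element code per lane per read.
    weights: sequence of 4-tuples, the weight presented to each lane per read.
    Returns {direction: total}, i.e. the four lane accumulators of each
    direction summed at the end, as when the partial sums are added into
    the last accumulator.
    """
    lanes = {d: [0, 0, 0, 0] for d in DIRECTIONS}
    for codes, lane_weights in zip(words, weights):
        for lane, (code, weight) in enumerate(zip(codes, lane_weights)):
            direction = CODE_TO_DIRECTION.get(code)
            if direction is not None:            # AND gate enabled by the decoder
                lanes[direction][lane] += weight
    return {d: sum(partial) for d, partial in lanes.items()}


if __name__ == "__main__":
    words = [(1, 1, 0, 2), (3, 1, 4, 0)]
    weights = [(2, 2, 5, 5), (1, 1, 7, 7)]
    print(accumulate_square(words, weights))
```

Keeping one accumulator per lane and per direction means no adder has to handle more than one input per clock, and the final merge happens only once per square.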

The weighting table RAM 38 stores weights in units of four dots (2 × 2). Where the data is vertical dot data, the weight is shared by two vertically adjacent dots: as described above, one 4-bit value is applied to AND gates 44-1 and 43-1, and the other to AND gates 42-1 and 41-1. Two horizontally adjacent dots also share a weight. As FIG. 8 shows, the address that the sequence counter 35 presents to the weighting table RAM 38 is the same for two horizontally adjacent dots, so the RAM outputs the same value for both. More precisely, of the 12-bit count that the sequence counter applies to the address generation ROM, the lower 6 bits excluding the LSB are output to the RAM. When the sequence counter is 0 or 1, address 0 is applied to the weighting table RAM 38 for both dots. Addresses 1, 2, 3, and so on follow, each address being specified for two dots, which therefore receive the same weight data. Each time a dot is read, the decoders decode it and control the gates of the accumulation circuits, but the weight does not change across the two dots. Weighting can thus be applied in 2 × 2-dot units within each 16-dot square, and the weighted values can be accumulated.
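The sharing of one weight across a 2 × 2 block can be sketched in Python. Two points are assumptions: "the lower 6 bits excluding the LSB" is read here as a right shift by one followed by a 6-bit mask, and the assignment of the lower nibble to the first pair of lanes and the upper nibble to the second pair is arbitrary, since the text does not say which half feeds which gate pair.

```python
def weight_address(count):
    """Weight table address for a sequence counter value.

    Dropping the LSB makes two consecutive counts (two horizontally
    adjacent dots) address the same table entry.
    Assumption: the result is masked to 6 bits.
    """
    return (count >> 1) & 0x3F


def lane_weights(weight_byte):
    """Split an 8-bit table entry into the weights seen by the four lanes.

    Each 4-bit half is shared by two vertically adjacent dots.
    Assumption: the lower nibble serves lanes 0-1, the upper nibble lanes 2-3.
    """
    lower = weight_byte & 0xF
    upper = (weight_byte >> 4) & 0xF
    return (lower, lower, upper, upper)


if __name__ == "__main__":
    print([weight_address(k) for k in range(8)])
    print(lane_weights(0xA3))
```

Horizontal sharing comes from the address (two counts per entry) and vertical sharing from the data (two lanes per nibble), so each stored 4-bit weight covers a 2 × 2 block without any extra logic.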

With this operation, the addresses listed in the generated-address table for square 1 (see FIG. 7) read out the data for 16 × 16 dots, and the corresponding weighting table is accessed in step (see FIG. 8), with the weight data changing every two dots. The address generation ROM has a 12-bit address input and a 10-bit output, so it can generate the addresses for every square. By pairing the address generation ROM 36 with the weighting table RAM 38, a feature vector can be obtained for each square.

FIG. 9 is a table of the addresses generated for square 2, and FIG. 10 is the corresponding weighting table address table. The weighting can thus be changed from square to square.

In this embodiment the output of the data buffer 31 is connected to a buffer 51 on the CPU bus, so the stored data can be monitored. The buffer 51 is also connected to, for example, the sequence counter 35, so the CPU can perform operations such as initializing the sequence counter.

Buffers 52 and 53 are provided at the address input and the data terminals of the weighting table RAM 38, respectively. If, for example, there are feature vector dictionaries that use different weightings, the weighting must match the dictionary in use. In that case the CPU can address the weighting table RAM 38 through the buffer 52 and write weighting data through the buffer 53. The contents of the weighting table RAM 38 can therefore be changed freely to suit the purpose.

A data buffer 55 and a buffer 54 are connected to accumulation circuits 49-1 to 49-4. Through the buffer 54, the accumulated results (0° to 135°) can be read into the CPU. For example, when recognition is to be repeated for a character (confirmation recognition), the CPU can read the data of accumulation circuits 49-1 to 49-4 through the buffer 54.

When recognition is carried out by the circuits shown in FIG. 2, on the other hand, the data stored in the data buffer 55, for example per square, is applied to the distance calculation module 21, so that distances can be computed by pipeline processing from the feature vectors obtained at high speed.

〔Effect of the invention〕

As described above, the present invention avoids complicated arithmetic processing when generating feature vectors and performs the equivalent computation with a simple circuit configuration. Even when the feature vector generation circuit used by a character recognition algorithm must process a large amount of information, the feature vector generation algorithm is realized by a simple circuit, which makes the circuit smaller and faster. A recognition device that uses such a character recognition algorithm can in turn be made smaller and faster.

[Brief Description of the Drawings]
FIG. 1 is a block diagram of the principle of the present invention.
FIG. 2 is a system configuration diagram of an embodiment of the present invention.
FIG. 3 is a detailed circuit configuration diagram of an embodiment of the present invention.
FIG. 4 is an explanatory diagram of the input data.
FIG. 5 is a diagram of the data storage arrangement in memory.
FIG. 6 is a diagram of the relationship between the converted image data and the memory addresses.
FIG. 7 is a table of the addresses generated for <Mask No. 1>.
FIG. 8 is the corresponding weight table address table.
FIG. 9 is a table of the addresses generated for <Mask No. 2>.
FIG. 10 is the corresponding weight table address table.

1: first data storage means; 2: first address generation means; 3: second data storage means; 4: second address generation means; 5: accumulating means; 6: decoding means.

Claims

[Scope of Claims]
1) A feature vectorization circuit comprising:
a first data storage means (1) that receives line element information, at least in units of one dot, of a character pattern and stores the line element information;
a first address generating means (2) that generates an address indicating the storage position of the line element information in the first data storage means (1);
a second data storage means (3) that stores weight data for the line element information;
a second address generating means (4) that generates an address for reading the line element information stored in the first data storage means (1) and an address for reading, from the second data storage means (2), the weight data corresponding to the line element information read at that address;
accumulating means (5-1 to 5-n), provided in a number corresponding to the directions of the line elements, to which the weight data output from the second data storage means is applied; and
a decoding means (6) that decodes the line element information stored in the first data storage means (1) and enables the accumulating means (5-1 to 5-n) according to the decoding result so that the weight information is accumulated for each direction.
2) The feature vectorization circuit according to claim 1, wherein m sets of the accumulating means (5-1 to 5-n) are provided, and the line element information applied in units of m bits is processed in parallel.