JPS6327991A

JPS6327991A - Formation of histogram for input information recognizing device

Info

Publication number: JPS6327991A
Application number: JP61172515A
Authority: JP
Inventors: Masahiro Nakamura; 昌弘中村
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1986-07-22
Filing date: 1986-07-22
Publication date: 1988-02-05

Abstract

PURPOSE:To decrease the dimension number of the feature quantity of a dictionary and to increase the recognition speed of an input information recognizing device by integrating some of plural blocks divided according to the identification ability of the feature quantity, and thus generating a histogram. CONSTITUTION:An image information processor which uses an optical information input device consists of a keyboard 1 for KANA (Japanese- syllabary)/KANJI (Chinese-character) conversion or the like as well as character input, an input device 2 which reads an original optically, a processor 3 which performs operation in a mode specified by them, an output device 4 composed of a display device or printer, a program ROM 5 required for the operation of the device 3, a RAM 6 stored with common algorithm, and a word dictionary memory 7. Consequently, a direction code is added to the contour part of an input information pattern, which is divided into plural blocks according to the feature quantity identifying ability of the dictionary; and thus histogram of direction codes of each block is generated and compared and calculated by using the dictionary to determine a candidate and information according to the distance.

Description

【発明の詳細な説明】［技術分野］本発明は１文字Ｌｙ！、識装置における特徴量抽出技術
に関し、特に、文字認識装置用ヒストグラム作成技術に
関するものである。[Detailed Description of the Invention] [Technical Field] The present invention is based on one character Ly! The present invention relates to a feature extraction technique for a character recognition device, and in particular to a histogram creation technique for a character recognition device.

［従来技術］コンピュータを用いた画像情報処理装置の入力装置、例
えば原稿上の情報を光学的に読取り入力する光学的情報
入力装置（以下、ＯＣＲという）においては、一般に、
辞書（テンプレート）マツチング法をベースとした認識
法を用いて入力文字（入力画像情報）の認識を行ってい
る。[Prior Art] In an input device of an image information processing apparatus using a computer, for example, an optical information input device (hereinafter referred to as OCR) that optically reads and inputs information on a document, generally,
Input characters (input image information) are recognized using a recognition method based on a dictionary (template) matching method.

このようなＯＣＲ等における文字認識装置においては、
入力文字パターンの輪郭部分に方向コードを付け、前記
入力文字パターンを複数ブロックに分割し、この分割さ
れたブロック毎に、その方向コードのヒストグラムを作
成し、このヒストグラムとあらかじめ用意した辞書とを
比較演算し、その距離により候補文字を決定して１文字
認識を行っている。すなわち、前記分割した領域につい
て、それぞれの方向別にヒストグラムを作成し。In character recognition devices such as OCR,
A direction code is attached to the outline of the input character pattern, the input character pattern is divided into multiple blocks, a histogram of the direction code is created for each divided block, and this histogram is compared with a dictionary prepared in advance. A candidate character is determined based on the calculated distance, and single character recognition is performed. That is, a histogram is created for each direction for the divided regions.

この各ヒストグラムを特徴量として、距離演算を行い、
文字を決定する型の文字認識装置においては、例えば、
領域を４×４に分割すると１２８（＝４ｘ４ｘ８）次元
の特徴量が出現する。この特徴量の中には辞書としての
識別能力の高いものと低いものが混在している。これら
の高いものと低いものを同じレベル（取扱）で距離演算
を行っている。Using each histogram as a feature, distance calculation is performed,
In a character recognition device that determines characters, for example,
When the area is divided into 4×4, 128 (=4×4×8) dimensional features appear. Among these feature quantities, there are a mixture of those with high discrimination ability and those with low discrimination ability as a dictionary. Distance calculations are performed on these high and low values at the same level (handling).

しかしながら、前記の文字認識装置におけるヒストグラ
ム作成方法では、辞書としての識別能力の高い特徴量と
低い特徴量を同じレベル（取扱）で距離演算を取扱って
いるため、認識速度が遅いという問題があった。However, in the above-mentioned method for creating a histogram in a character recognition device, the distance calculation is handled at the same level (handling) for features with high and low discrimination ability as a dictionary, so there is a problem that the recognition speed is slow. .

［目的］本発明の目的は、入力情報認識装置のＬ＆識速度を速く
することができる技術を提供することにある。[Objective] An object of the present invention is to provide a technique that can increase the L& recognition speed of an input information recognition device.

本発明の他の目的は１文字認識装置用ヒストグラムの作
成を能率的に行うことができる技術を提供することにあ
る。Another object of the present invention is to provide a technique that can efficiently create a histogram for a single character recognition device.

本発明の前記ならびにその他の目的と新規な特徴は１本
明細書の記述及び添付図面によって明らかになるであろ
う。The above and other objects and novel features of the present invention will become apparent from the description of this specification and the accompanying drawings.

［構成コ本発明は、入力情報パターンの輪郭部分に方向コードを
付け、前記入力情報パターンを辞書の特徴量の識別能力
に応じて複数ブロックに分割し。[Configuration] In the present invention, a direction code is attached to the contour portion of an input information pattern, and the input information pattern is divided into a plurality of blocks according to the discriminating ability of the feature amount of a dictionary.

この分割されたブロック毎に、その方向コードのヒスト
グラムを作成し、このヒストグラムとあらかじめ用意し
た辞書とを比較演算し、その距離により候補文字を決定
する入力情報認識装置用ヒストグラム作成方法であって
、前記辞書の特徴量の識別能力に応じて分割された複数
のブロックの一部を統合してヒストグラムを作成する手
段を備えたことを特徴とするものである。A histogram creation method for an input information recognition device that creates a histogram of the direction code for each divided block, compares this histogram with a dictionary prepared in advance, and determines candidate characters based on the distance, the method comprising: The present invention is characterized by comprising means for creating a histogram by integrating a portion of a plurality of blocks divided according to the discrimination ability of the feature amount of the dictionary.

［実施例コ以下１本発明の一実施例を図面を用いて具体的に説明す
る。[Example 1] An example of the present invention will be specifically described below with reference to the drawings.

なお、実施例を説明するための全図において。In addition, in all the figures for explaining an example.

同一機能を有するものは同一符号を付け、その繰り返し
の説明は省略する。Components having the same function are given the same reference numerals, and repeated explanations thereof will be omitted.

第１図は１本発明の一実施例の入力情報認識装置用ヒス
トグラム作成方法を実施するためのＯＣＲの概略構成を
示すブロック図、第２図は、第１図に示すＯＣＲを用いた画像情報処理装
置の概略構成を示すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of an OCR for carrying out a histogram creation method for an input information recognition device according to an embodiment of the present invention, and FIG. 2 shows image information using the OCR shown in FIG. FIG. 1 is a block diagram showing a schematic configuration of a processing device.

第２図において、キーボード１は、文字を入力する他に
各種のモード（仮名漢字変換、漢字仮名変換、ＯＣＲ文
字認識等）を指定するものに用いられる。ＯＣＲ入力装
置２は、原稿を光学的に読取り入力する。処理装置３は
、キーボード１や０ＣＲ２からの入力情報について指定
されたモードに従った処理を実行し、出力装置４に出力
する。In FIG. 2, a keyboard 1 is used not only to input characters but also to designate various modes (kana-kanji conversion, kanji-kana conversion, OCR character recognition, etc.). The OCR input device 2 optically reads and inputs a document. The processing device 3 executes processing according to the specified mode on the input information from the keyboard 1 and OCR 2, and outputs it to the output device 4.

出力装置４は、ディスプレイ装置、プリンタ等を総称し
て示したものである。処理装置３の処理に必要なプログ
ラムメモリ（ＲＯＭ）５に格納されるが、キーボード入
力による仮名漢字変換、ＯＣＲ文字認識の後処理、ＯＣ
Ｒ入力された文字等の仮名漢字変換や漢字仮名変換につ
いてできるだけ共通のアルゴリズムが利用される。デー
タメモリ（ＲＡＭ）６は、処理装置３での処理途中のデ
ータやパラメータを格納するのに用いられる。単語辞書
メモリ７には読み表記対応データを付加した単語辞書が
格納されている。The output device 4 is a general term for a display device, a printer, etc. It is stored in the program memory (ROM) 5 that is necessary for the processing of the processing device 3, but it also performs kana-kanji conversion by keyboard input, post-processing of OCR character recognition, OC
R A common algorithm is used as much as possible for kana-kanji conversion and kanji-kana conversion of input characters. A data memory (RAM) 6 is used to store data and parameters that are being processed by the processing device 3. The word dictionary memory 7 stores a word dictionary to which reading orthography correspondence data is added.

前記第２図に示す０ＣＲ２は、第１図に示すように、光
源と電荷結合素子（ＣＯＤ）等からなる光学的スキャナ
ー１１により、原稿上の文字等の画像情報を読み取って
入力する。この入力された仮名文字列又は仮名漢字混合
文字列、英字列等の画像情報を１文字切出しユニット１
２により、１文字毎に切出され、特徴抽出ユニット１３
でその切出された文字の特徴を抽出する。この抽出され
たデータは、特徴マツチングユニット１４で特徴辞書メ
モリ（ＲＯＭ又はＲＡＭ）１５に格納されている特徴辞
書データとのマツチングがとられる。As shown in FIG. 1, the OCR 2 shown in FIG. 2 reads and inputs image information such as characters on a document using an optical scanner 11 comprising a light source, a charge-coupled device (COD), and the like. Unit 1 extracts one character of image information such as the input kana character string, kana-kanji mixed character string, alphabetic character string, etc.
2, each character is extracted by the feature extraction unit 13.
Extract the features of the extracted characters. This extracted data is matched with feature dictionary data stored in a feature dictionary memory (ROM or RAM) 15 in a feature matching unit 14.

マツチングがとられれば、入力文字がＬｙ！、識され処
理装置３に送られる。If matching is achieved, the input characters are Ly! , and sent to the processing device 3.

次に、本発明の一実施例の入力情報認識装置用ヒストグ
ラム作成方法における前記入力情報パターンを辞書の特
徴量の識別能力に応じて複数のブロック（領域）に分割
する処理プロセスを第３図に示すそのフローチャートに
従って説明する。Next, FIG. 3 shows the process of dividing the input information pattern into a plurality of blocks (regions) according to the discriminating ability of the feature amount of the dictionary in the histogram creation method for an input information recognition device according to an embodiment of the present invention. The explanation will be given according to the flowchart shown below.

段階１０１で入力情報パターンの輪郭部に方向別コード
（チェインコード）を付ける処理を行い、段階１０２に
移る。前記方向別コードは、第４図に示す方向ベクトル
を用いて符号化を行うものである。At step 101, a direction code (chain code) is added to the outline of the input information pattern, and the process moves to step 102. The direction-specific code is encoded using the direction vector shown in FIG.

段階１０２で前記入力情報パターンの輪郭部についた方
向別のコード数をカウントし、その総数Ｓを求める０次
に、段階１０３でＸ方向への最初の分割点（Ｓ／ｎ）を
求め、段階１０４でＸ方向へのそれ以降の分割点を求め
る。例えば、領域ｎ×ｍに分割するとしてコードの総数
をＳとしたとき１分割点はそれぞれＳ　／　ｎ　＋　２
　Ｓ　／　ｎ　＋・・・。In step 102, the number of codes attached to the outline of the input information pattern in each direction is counted, and the total number S is determined.Next, in step 103, the first dividing point (S/n) in the X direction is determined, and step At step 104, subsequent dividing points in the X direction are determined. For example, if the area is divided into n×m and the total number of codes is S, each division point is S / n + 2
S/n+...

（ｎ−１）Ｓ／ｎとなる座標である１次に、段階１０５
でＸ方向への各分割座標を求める。(n-1)S/n coordinates, step 105
Find the coordinates of each division in the X direction.

同様に、段階１０６でＹ方向の最初の分割点（Ｓ／ｍ）
を求め１段階１０７でＹ方向へのそれ以降の分割点を求
める。例えば、Ｓ／ｍ、２Ｓ／ｍ、・・・、（ｍ−１）
Ｓ／ｍとなる座標である。Similarly, in step 106, the first dividing point in the Y direction (S/m)
In step 107, subsequent division points in the Y direction are determined. For example, S/m, 2S/m,..., (m-1)
The coordinates are S/m.

次に１段階１０８でＹ方向への各分割座標を求めて入力
画像の分割処理が終了する。Next, in step 108, each division coordinate in the Y direction is determined, and the input image division processing is completed.

次に、原稿上の情報パターンをＸ方向、Ｙ方向にスキャ
ンし、それぞれコードの数が各分割点となる座標を求め
る。Next, the information pattern on the document is scanned in the X and Y directions, and the coordinates at which the number of codes corresponds to each dividing point are determined.

第５図は、４×４に領域を分割したときの辞書の各ブロ
ックのヒストグラムの分散を示している。FIG. 5 shows the distribution of the histogram of each block of the dictionary when the area is divided into 4×4.

中央の４ブロツクは周辺の１２ブロツクと比較すると、
その数値がかなり小さい（すなわち、識別能力が低い）
。ここで、数値が大きい程分散度が大きい（分散度が大
きい程情報量が多いといわれている）ことを示している
。Comparing the 4 blocks in the center with the 12 blocks around it,
The number is quite small (i.e., the discrimination ability is low)
. Here, the larger the value, the greater the degree of dispersion (it is said that the greater the degree of dispersion, the greater the amount of information).

このヒストグラムの分散度の小さい中央部分に注目して
、この中央部分の複数のブロックを統合してヒストグラ
ムを作成する方法が本発明の最も特徴とする点である。The most distinctive feature of the present invention is a method of creating a histogram by focusing on the central portion of the histogram where the degree of dispersion is small and integrating a plurality of blocks in this central portion.

次に本発明の一実施例におけるヒストグラムの作成の処
理プロセスを第６図に示すフローチャートに従って説明
する。Next, a process for creating a histogram in an embodiment of the present invention will be described with reference to the flowchart shown in FIG.

段階２０１でブロック毎に方向別にコード数をカウント
してヒストグラムを作成する処理を行い。In step 201, the number of codes is counted in each direction for each block and a histogram is created.

段階２０２で中央ブロックについてそれぞれの方向のヒ
ストグラム加算してヒストグラム作成処理は終了する。At step 202, histograms in each direction are added for the central block, and the histogram creation process ends.

例えば、第７図に示す入力情報パターンにおいて１周辺
の１２ブロツク（第７図の６゜７．１０．１１を除いた
ブロック）はそれぞれヒストグラムを作成し、中央の４
ブロツク（６，７゜１０．１１）は、これを１つにまと
めてヒストグラムを作成する。For example, in the input information pattern shown in Figure 7, a histogram is created for each of the 12 blocks around 1 (blocks excluding 6°7, 10, 11 in Figure 7), and the 4 blocks in the center are
Block (6, 7° 10.11) combines these into one to create a histogram.

このようにして作成した１２ｘ８＋８＝１０４次元の特
徴量と辞書とにより、距離演算を行い候補情報（例えば
文字）を選択する。Using the thus created 12x8+8=104-dimensional features and the dictionary, distance calculations are performed to select candidate information (for example, characters).

以上１本発明を実施例にもとずき具体的に説明したが、
本発明は、前記実施例に限定されるものではなく、その
要旨を逸脱しない範囲において種々変更可能であること
は言うまでもない。The present invention has been specifically explained above based on examples, but
It goes without saying that the present invention is not limited to the embodiments described above, and can be modified in various ways without departing from the spirit thereof.

〔Effect of the invention〕

以上、説明したように５本発明によれば、辞書の特徴量
の識別能力に応じて分割された複数のブロックの一部を
統合してヒストグラムを作成する手段を備えたことによ
り、特徴量の次元数を低減することができるので、ｉｌ
ｌ速度を向上することができる。As explained above, according to the present invention, by providing a means for creating a histogram by integrating a part of a plurality of blocks divided according to the discriminating ability of the feature amount of the dictionary, it is possible to Since the number of dimensions can be reduced, il
l speed can be improved.

また、前記特徴量の次元数を低減することにより、メモ
リを低減することができる。Further, by reducing the number of dimensions of the feature amount, memory can be reduced.

[Brief explanation of the drawing]

第１図は１本発明の一実施例の入力情報認識装置用ヒス
トグラム作成方法を実施するためのＯＣＲの概略構成を
示すブロック図、第２図は、第１図に示すＯＣＲを用いた画像情報処理装
置の概略構成を示すブロック図、第３図は、本発明の一
実施例の入力情報認識装置用ヒストグラム作成方法にお
ける入力情報パターンを辞書の特徴量の識別能力に応じ
て複数のブロック（領域）に分割する処理プロセスのフ
ローチャート。第４図は、方向別コードと方向ベクトルとの関係を示す
図、第５図は、４Ｘ４に領域を分割した時の辞書の各ブロッ
クのヒストグラムの分散を示す図、第６図は、本発明の
一実施例のヒストグラム作成の処理プロセスのフローチ
ャート、第７図は、入力情報パターンの分割ブロック例を示す図
である。図中、３・・・処理装置、１１・・・スキャナー、１２
・・・文字切出しユニット、１３・・・特徴抽出ユニッ
ト、１４・・・特徴マツチングユニット、１５・・・特
徴辞書メモリである。FIG. 1 is a block diagram showing a schematic configuration of an OCR for implementing a histogram creation method for an input information recognition device according to an embodiment of the present invention, and FIG. 2 shows image information using the OCR shown in FIG. FIG. 3, a block diagram showing a schematic configuration of a processing device, shows an input information pattern in a histogram creation method for an input information recognition device according to an embodiment of the present invention, which is divided into a plurality of blocks (regions) according to the feature quantity discrimination ability of the dictionary. ) Flow chart of the processing process. FIG. 4 is a diagram showing the relationship between direction codes and direction vectors. FIG. 5 is a diagram showing the distribution of the histogram of each block of the dictionary when the area is divided into 4×4 areas. FIG. Flowchart of Process for Creating Histogram in One Embodiment FIG. 7 is a diagram showing an example of divided blocks of an input information pattern. In the figure, 3...processing device, 11...scanner, 12
. . . character cutting unit, 13 . . . feature extraction unit, 14 . . . feature matching unit, 15 . . . feature dictionary memory.

Claims

[Claims]

(1) Attach a direction code to the outline of the input information pattern, divide the input information pattern into multiple blocks according to the feature recognition ability of the dictionary, and create a histogram of the direction code for each divided block. A histogram creation method for an input information recognition device, in which candidate information is determined based on the distance by comparing the histogram with a dictionary prepared in advance, and determining candidate information based on the distance between the histogram and a dictionary prepared in advance. 1. A histogram creation method for an input information recognition device, comprising means for creating a histogram by integrating parts of a plurality of blocks.