JP2000067164A

JP2000067164A - Method and device for pattern recognition and record medium where template generating program is recorded

Info

Publication number: JP2000067164A
Application number: JP10240796A
Authority: JP
Inventors: Minako Sawaki; 美奈子澤木; Hiroshi Murase; 洋村瀬; Norihiro Hagita; 紀博萩田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1998-08-26
Filing date: 1998-08-26
Publication date: 2000-03-03

Abstract

PROBLEM TO BE SOLVED: To provide the pattern recognizing method which is robust to deterioration and deformation by making use of the presence of restrictions on the contiguity relation between characters, etc., when a recognition object is limited. SOLUTION: This device has a learning image storage means 1 which stores an image for learning, a learning pattern extracting means 2 which extracts a learning pattern included in the image for learning including circumferential image information, a template generating means 4 which generates a template by using the learning pattern, a template storage means 5 which stores the template, a pattern storage means 6 which stores an input image pattern, and a recognizing means 7 which recognizes the input image pattern by using the template.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文字や図形を中心
とする画像パターンの認識技術にかかわり、特に、印刷
漢字、手書き漢字、英数字、記号、数式、図形など多く
のカテゴリを対象として、しかも、それらパターンの劣
化の度合いが大きく変化する場合に好適な認識方法及び
装置並びにテンプレート作成プログラムを記録した記録
媒体に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a technology for recognizing image patterns centered on characters and figures. In addition, the present invention relates to a recognition method and an apparatus suitable for a case where the degree of deterioration of these patterns greatly changes, and a recording medium on which a template creation program is recorded.

【０００２】[0002]

【従来の技術】ここでは、パターン認識の一例として、
文字認識を対象として説明する。従来、漢字ＯＣＲ(Ｏp
tical Ｃharacter Ｒeader）などの文字認識装置では、
入力された文書画像パターンに対して、入力パターンと
テンプレートとのパターン照合により認識を行う方法が
知られている。このような手法は、一般的に、広告や新
聞の見出しなどにみられる、特異なテクスチャで構成さ
れているデザイン処理された文字や、かすれ、つぶれ、
文字背景雑音などが激しい画像に対して、比較的ロバス
トであることが報告されている。2. Description of the Related Art Here, as an example of pattern recognition,
A description will be given for character recognition. Conventionally, Kanji OCR (Op
character recognition devices such as tical
There is known a method of performing recognition on an input document image pattern by performing pattern matching between the input pattern and a template. Such techniques are commonly used in design-processed characters with unusual textures, such as those found in advertisements and newspaper headlines, as well as blurring, crushing,
It has been reported that the image is relatively robust to an image in which the character background noise is severe.

【０００３】しかし、照明条件などの画像の観測条件な
どにより字形が変動した場合や、周囲に他の文字などの
他の情報が存在する場合には、入力パターンとテンプレ
ートの類似度が下がり、認識率が低下する問題点があっ
た。However, when the character shape fluctuates due to image observation conditions such as lighting conditions, or when there is other information such as other characters in the surroundings, the similarity between the input pattern and the template decreases, and the recognition pattern decreases. There was a problem that the rate decreased.

【０００４】このような問題を解決するために、従来
は、（１）入力パターン内の他の情報を雑音とみなして
取り除いたり、字形の変動を補正してから、個別文字テ
ンプレートを用いて認識を行う方法、（２）単語テンプ
レートを用いることにより個別文字の変形の影響を吸収
する方法の、主に２つのアプローチが取られてきた。In order to solve such a problem, conventionally, (1) other information in an input pattern is regarded as noise and removed, or a variation in character shape is corrected, and then recognition is performed using an individual character template. , And (2) a method of absorbing the influence of individual character deformation by using a word template.

【０００５】しかし、個別文字テンプレートを用いる方
法の場合、入力パターン内の他の情報のみを除去するこ
とは困難である。これは、文字のストロークと他の情報
が非常に類似していることが頻繁に起こるためである。
そのため、しばしば他の情報が残ったり、着目している
情報が損失したりする。その結果、テンプレートとの照
合の際の類似度が下がり、認識率が低下する。特に、
「ｌ」や「ｒ」のような小さいパターンでは、周囲の他
の情報の影響を受けやすく認識が困難となる場合が多
い。[0005] However, in the case of the method using the individual character template, it is difficult to remove only other information in the input pattern. This is because it often happens that character strokes and other information are very similar.
Therefore, other information often remains or information of interest is lost. As a result, the similarity at the time of matching with the template decreases, and the recognition rate decreases. In particular,
Small patterns such as "l" and "r" are often affected by other information around and difficult to recognize.

【０００６】一方、単語テンプレートを用いる方法で
は、複数文字同士の照合を行うため、小さい文字など字
形が変動し誤認識しやすい文字が部分的に含まれている
場合でも、単語全体としては認識可能となることが期待
できる。しかし、単語数に応じたテンプレートを作成す
ることが必要となり、単語数が増えるほどその組み合わ
せが膨大になる。また、それに伴い、照合時間も増加す
る。さらに、入力パターンに未知語が含まれている場
合、対応するテンプレートがないため、認識できない。On the other hand, in the method using a word template, a plurality of characters are collated with each other. Therefore, even when a character such as a small character has a fluctuating character shape and is apt to be erroneously recognized, the entire word can be recognized. Can be expected. However, it is necessary to create a template according to the number of words, and as the number of words increases, the number of combinations increases. In addition, the matching time increases accordingly. Further, if an unknown word is included in the input pattern, it cannot be recognized because there is no corresponding template.

【０００７】[0007]

【発明が解決しようとする課題】上述のように、従来の
ＯＣＲをはじめとするパターン認識技術は、劣化や変形
が大きいパターンを正しく認識できる手法が十分に確立
していなかった。As described above, in the conventional pattern recognition technology such as OCR, a technique for correctly recognizing a pattern having large deterioration or deformation has not been sufficiently established.

【０００８】本発明の目的は、上記のようなテンプレー
ト照合に基づくパターン認識の問題点に対して、認識対
象が限られている場合、文字等の隣接関係に制約がある
ことを利用し、周囲の画像情報を加味したテンプレート
を作成することにより、劣化や変形にロバストなパター
ン認識方法及び装置を提供することである。ここで、周
囲の画像情報の例としては、着目している文字パターン
の前後の文字パターンや、アンダーラインなどの周囲に
付与される装飾などがあげられる。An object of the present invention is to solve the above-mentioned problem of pattern recognition based on template matching by using the fact that there are restrictions on the adjacency of characters and the like when recognition targets are limited. It is an object of the present invention to provide a pattern recognition method and apparatus which is robust against deterioration and deformation by creating a template in consideration of the above image information. Here, examples of the surrounding image information include character patterns before and after the character pattern of interest, and decorations provided around the underline and the like.

【０００９】さらに、学習用画像から学習パターンを自
動的に切り出しテンプレートを学習することにより、人
手による作業なしに高い認識率が得られる辞書を作成す
るためのコンピュータ読み取り可能なプログラムを記録
した記録媒体を提供することを目的とする。Further, a recording medium storing a computer-readable program for creating a dictionary capable of obtaining a high recognition rate without manual work by automatically extracting a learning pattern from a learning image and learning a template. The purpose is to provide.

【００１０】[0010]

【課題を解決するための手段】上記目的を達成するため
に、請求項１の発明は、入力画像パターンと予め学習用
画像を用いて作成してある画像カテゴリのテンプレート
との間の類似度あるいは距離値を計算し、該入力画像パ
ターンのカテゴリを認識するパターン認識方法におい
て、学習用画像に含まれる学習パターンを周囲の画像情
報も含めて抽出し、前記抽出した学習パターンを用いて
テンプレートを作成し、前記テンプレートを用いて入力
画像パターンを認識することを特徴とする。In order to achieve the above object, the present invention is directed to an image processing apparatus comprising the steps of: providing a similarity degree between an input image pattern and a template of an image category which has been created in advance using a learning image; In a pattern recognition method for calculating a distance value and recognizing a category of the input image pattern, a learning pattern included in a learning image is extracted including surrounding image information, and a template is created using the extracted learning pattern. And recognizing an input image pattern using the template.

【００１１】請求項２の発明は、請求項１記載のパター
ン認識方法において、手動で指示された位置情報を用い
て学習用画像から学習パターンを抽出することを特徴と
する。According to a second aspect of the present invention, in the pattern recognition method according to the first aspect, a learning pattern is extracted from a learning image by using position information manually designated.

【００１２】請求項３の発明は、請求項１記載のパター
ン認識方法において、学習用画像をあらかじめ用意され
ている初期辞書で認識し、その結果を用いて学習用画像
から学習パターンを抽出することを特徴とする。According to a third aspect of the present invention, in the pattern recognition method according to the first aspect, a learning image is recognized by an initial dictionary prepared in advance, and a learning pattern is extracted from the learning image using the result. It is characterized by.

【００１３】請求項４の発明は、請求項１乃至請求項３
記載のパターン認識方法において、作成された複数テン
プレートをクラスタリングすることによりテンプレート
数を削減することを特徴とする。The invention according to claim 4 is the invention according to claims 1 to 3.
The described pattern recognition method is characterized in that the number of templates is reduced by clustering a plurality of created templates.

【００１４】請求項５の発明は、請求項１乃至請求項４
記載のパターン認識方法において、周囲の画像情報を含
まない学習パターンを学習用画像から抽出し、該学習パ
ターンからもテンプレートを作成することを特徴とす
る。The invention according to claim 5 is the invention according to claims 1 to 4.
The described pattern recognition method is characterized in that a learning pattern that does not include surrounding image information is extracted from a learning image, and a template is also created from the learning pattern.

【００１５】請求項６の発明は、入力画像パターンと予
め学習用画像を用いて作成してある画像カテゴリのテン
プレートとの間の類似度あるいは距離値を計算し、該入
力画像パターンのカテゴリを認識するパターン認識装置
において、入力画像パターンを記憶するパターン記憶手
段と、学習用画像を記憶する学習用画像記憶手段と、前
記学習用画像に含まれる学習パターンを周囲の画像情報
も含めて抽出する学習パターン抽出手段と、前記学習パ
ターンを用いてテンプレートを作成するテンプレート作
成手段と、前記テンプレートを記憶するテンプレート記
憶手段と、前記テンプレートを用いて、前記入力画像パ
ターンを認識する認識手段とを有することを特徴とす
る。According to a sixth aspect of the present invention, a similarity or a distance value between an input image pattern and an image category template created in advance using a learning image is calculated, and the category of the input image pattern is recognized. A pattern storage device for storing an input image pattern, a learning image storage device for storing a learning image, and learning for extracting a learning pattern included in the learning image including surrounding image information. Pattern extraction means, template creation means for creating a template using the learning pattern, template storage means for storing the template, and recognition means for recognizing the input image pattern using the template. Features.

【００１６】請求項７の発明は、請求項６記載のパター
ン認識装置において、更に、周囲の画像情報を含まない
学習パターンを学習用画像から抽出する単独学習パター
ン抽出手段を有することを特徴とする。According to a seventh aspect of the present invention, there is provided the pattern recognition apparatus according to the sixth aspect, further comprising a single learning pattern extracting means for extracting a learning pattern not including surrounding image information from the learning image. .

【００１７】請求項８の発明は、入力画像パターンと予
め学習用画像を用いて作成してある画像カテゴリのテン
プレートとの間の類似度あるいは距離値を計算し、該入
力画像パターンのカテゴリを認識するためのテンプレー
ト作成プログラムを記録したコンピュータ読み取り可能
な記録媒体あって、学習用画像に含まれる学習パターン
を周囲の画像情報も含めて抽出する処理プロセスと、前
記抽出した学習パターンを用いてテンプレートを作成す
る処理プロセスとを有することを特徴とする。The invention according to claim 8 calculates a similarity or a distance value between an input image pattern and a template of an image category created using a learning image in advance, and recognizes the category of the input image pattern. There is a computer-readable recording medium recording a template creation program for performing a processing process of extracting a learning pattern included in a learning image including surrounding image information, and a template using the extracted learning pattern. And a creating process.

【００１８】[0018]

【発明の実施の形態】以下、本発明の実施の形態につい
て図面を参照して詳細に説明する。なお、以下では、画
像パターンの代表例として白または黒の２値からなる文
字パターンについて説明するが、他の画像パターンにつ
いても同様に実現できる。Embodiments of the present invention will be described below in detail with reference to the drawings. In the following, a character pattern composed of binary values of white or black will be described as a typical example of an image pattern, but other image patterns can be similarly realized.

【００１９】図１は本発明の一実施の形態のパターン認
識装置の構成例を示すブロック図で、１は学習用画像記
憶回路、２は学習パターン抽出回路、３は単独学習パタ
ーン抽出回路、４はテンプレート作成回路、５はテンプ
レート記憶回路、６はパターン記憶回路、７は認識回路
である。なお、本パターン認識装置は、実際には、所謂
コンピュータのハードウエア及びソフトウエア資源を利
用して実現されるものである。FIG. 1 is a block diagram showing a configuration example of a pattern recognition apparatus according to an embodiment of the present invention. 1 is a learning image storage circuit, 2 is a learning pattern extraction circuit, 3 is a single learning pattern extraction circuit, Is a template creation circuit, 5 is a template storage circuit, 6 is a pattern storage circuit, and 7 is a recognition circuit. The pattern recognition apparatus is actually realized using so-called computer hardware and software resources.

【００２０】学習用画像記憶回路１はテンプレート作成
で用いる学習用画像を記憶している。この学習用画像に
は、入力画像パターンの周囲のカテゴリの組のすべてに
対して学習パターンが得られるような画像を利用する。
該学習用画像は、ｎ画素からなり、例えば、画素の値
は、白画素を「０」、黒画素を「１」で表す。The learning image storage circuit 1 stores learning images used for creating a template. As the learning image, an image is used in which a learning pattern can be obtained for all the sets of categories around the input image pattern.
The learning image is composed of n pixels. For example, pixel values are represented by “0” for white pixels and “1” for black pixels.

【００２１】学習パターン抽出回路２は、学習用画像記
憶回路１から学習用画像を入力し、該学習用画像に含ま
れる学習パターンを周囲の画像情報も含めて抽出する。
学習パターン抽出の方法については後述する。抽出され
た学習パターンは、テンプレート作成回路４に渡され
る。The learning pattern extraction circuit 2 inputs a learning image from the learning image storage circuit 1 and extracts a learning pattern included in the learning image including surrounding image information.
The method of extracting the learning pattern will be described later. The extracted learning pattern is passed to the template creation circuit 4.

【００２２】単独学習パターン抽出回路３は、学習用画
像記憶回路１から学習用画像を入力し、着目しているパ
ターンのみからなる学習パターンを抽出する。抽出され
た学習パターンは、テンプレート作成回路４に渡され
る。該単独学習パターン抽出回路３は、単独学習パター
ンを使用しない場合は、省略することも可能である。The single learning pattern extraction circuit 3 receives a learning image from the learning image storage circuit 1 and extracts a learning pattern consisting of only the pattern of interest. The extracted learning pattern is passed to the template creation circuit 4. The single learning pattern extraction circuit 3 can be omitted when no single learning pattern is used.

【００２３】テンプレート作成回路４は、学習パターン
抽出回路２あるいは単独学習パターン抽出回路３から学
習パターンを入力し、テンプレートを作成する。テンプ
レート作成回路４で作成されたテンプレートが、テンプ
レート記憶回路５に新規に登録、あるいは、追加・更新
される。The template creation circuit 4 receives a learning pattern from the learning pattern extraction circuit 2 or the single learning pattern extraction circuit 3 and creates a template. The template created by the template creation circuit 4 is newly registered in the template storage circuit 5, or added / updated.

【００２４】テンプレート記憶回路５は、入力画像パタ
ーンの認識判定に用いる各カテゴリのテンプレートを、
各カテゴリ毎に一つあるいは複数（一般には複数）記憶
している。The template storage circuit 5 stores templates of each category used for recognition determination of an input image pattern.
One or more (generally a plurality) are stored for each category.

【００２５】パターン記憶回路６は、認識される１もし
くはそれ以上の画像パターンを入力パターンとして記憶
している。これらの画像パターンはＭ画素からなり、例
えば、２値画像パターンの場合には各画素は「０」また
は「１」で示されている。The pattern storage circuit 6 stores one or more recognized image patterns as input patterns. These image patterns are composed of M pixels. For example, in the case of a binary image pattern, each pixel is indicated by “0” or “1”.

【００２６】認識回路７は、テンプレート記憶回路５か
らテンプレートを入力し、また、パターン記憶回路５か
ら認識対象の入力パターンを入力し、テンプレートと入
力パターンを照合（ずらし照合）することによりパター
ン認識を行い、その認識結果を出力する。The recognition circuit 7 receives a template from the template storage circuit 5, inputs an input pattern to be recognized from the pattern storage circuit 5, and performs pattern recognition by comparing the template with the input pattern (shifted matching). And outputs the recognition result.

【００２７】本発明の特徴は、学習用画像に含まれてい
る学習パターンを、周囲の情報も含めて抽出し、その学
習パターンを用いて認識用のテンプレートを作成すると
ころにある。以下では、学習パターンの抽出およびテン
プレート作成の詳細な説明を行う。A feature of the present invention resides in that a learning pattern included in a learning image is extracted including surrounding information, and a template for recognition is created using the learning pattern. In the following, a detailed description will be given of the extraction of a learning pattern and the creation of a template.

【００２８】はじめに、学習パターン抽出回路２および
テンプレート作成回路４により、学習用画像から自動的
にテンプレートを作成する処理について、図２のフロー
チャートを用いて説明する。なお、この処理手順は、テ
ンプレート作成プログラムとして、コンピュータ読み取
り可能な記録媒体、例えばフロッピーディスク、メモリ
カード、コンパクトディスク（ＣＤ−ＲＯＭ）などに記
録して提供することが可能である。First, the process of automatically creating a template from a learning image by the learning pattern extraction circuit 2 and the template creation circuit 4 will be described with reference to the flowchart of FIG. This processing procedure can be provided as a template creation program recorded on a computer-readable recording medium such as a floppy disk, a memory card, or a compact disk (CD-ROM).

【００２９】学習用画像記憶回路１から、学習用画像が
読み出され、学習パターン抽出回路２に入力される（ス
テップ２０１）。学習用画像は、一つもしくは複数の学
習パターンを含んでいるとする。学習用画像の例を図３
の画像３０１に示す。A learning image is read from the learning image storage circuit 1 and input to the learning pattern extraction circuit 2 (step 201). It is assumed that the learning image includes one or a plurality of learning patterns. Fig. 3 shows an example of a learning image
The image 301 is shown in FIG.

【００３０】学習パターン抽出回路２は、まず、テンプ
レート記憶回路５から初期辞書を読み込む（ステップ２
０２）。初期辞書は、例えば、各カテゴリごとに１つの
テンプレートを学習用画像から手動等で切り出して作成
してあるとする。図４に、カテゴリー「ｅ」の初期辞書
内のテンプレートの例を示す。The learning pattern extraction circuit 2 first reads an initial dictionary from the template storage circuit 5 (step 2).
02). It is assumed that the initial dictionary is created by, for example, manually extracting one template for each category from the learning image. FIG. 4 shows an example of a template in the initial dictionary of the category “e”.

【００３１】次に、学習パターン抽出回路２は、学習用
画像を、例えば初期辞書を用いて認識する（ステップ２
０３）。認識手法としては、例えば、入力画像に対し傾
き補正などの前処理を行うことにより前処理済画像を作
成し、その前処理済画像に対し補完類似度を用いたずら
し照合による方法を用いることができる（例えば、澤木
ほか；「辞書の自動選択による本棚画像中の文字認
識」、１９９８年電子情報通信学会総合大会（１９９８
年３月）Ｄ−１３−３３）。具体的には、傾き補正後の
入力画像上を、認識用観測窓を垂直方向に走査し、各認
識用観測窓と初期辞書内の参照パターンとの照合を行
う。この照合に補完類似度を用い、位置ずれを考慮して
垂直方向のみでなく水平方向への走査も行う。照合した
各位置での最大類似度があらかじめ学習してある各文字
カテゴリの類似度しきい値を越えた場合、そのカテゴリ
を初期辞書による認識結果とする。この認識方法によ
り、高い類似度を示した部分の画像位置およびそのテン
プレートのカテゴリが得られる。図３の画像３０２に、
学習用画像３０１に傾き補正を行って得られた前処理済
画像を示す。また、図３の３０３に、図３の３０２の上
の行を初期辞書によるずらし照合を行った際に、高い類
似度を示したカテゴリを、高い類似度を示した位置に示
す。Next, the learning pattern extraction circuit 2 recognizes the learning image using, for example, an initial dictionary (step 2).
03). As a recognition method, for example, a pre-processed image is created by performing pre-processing such as tilt correction on an input image, and a method based on shifted matching using complementary similarity is used for the pre-processed image. Yes (for example, Sawaki et al., “Character Recognition in Bookshelf Images by Automatic Dictionary Selection”, IEICE General Conference 1998 (1998
D-13-33). Specifically, the observation window for recognition is scanned in the vertical direction on the input image after the inclination correction, and each observation window for recognition is compared with the reference pattern in the initial dictionary. Scanning is performed not only in the vertical direction but also in the horizontal direction in consideration of misalignment by using the complementary similarity in this comparison. When the maximum similarity at each collated position exceeds the similarity threshold of each character category that has been learned in advance, the category is used as a recognition result by the initial dictionary. With this recognition method, the image position of the portion showing the high similarity and the category of the template can be obtained. In the image 302 of FIG.
A preprocessed image obtained by performing a tilt correction on the learning image 301 is shown. Further, at 303 in FIG. 3, a category having a high similarity when the line above 302 in FIG. 3 is shifted and compared by the initial dictionary is shown at a position having a high similarity.

【００３２】次に、学習パターン抽出回路２は、高い類
似度を示した位置およびカテゴリを用いて、学習パター
ンを抽出する（スナップ２０４）。学習パターンの抽出
方法の例としては、入力された位置を中心として、矩形
領域を抽出する。図５の画像５０１に、前処理済画像の
例を示す。また、図５の画像５０２に、画像５０１から
抽出されたカテゴリー「ｅ」の学習パターンの一例を示
す。Next, the learning pattern extraction circuit 2 extracts a learning pattern by using the position and the category showing the high similarity (snap 204). As an example of a method of extracting a learning pattern, a rectangular area is extracted around an input position. An example of a preprocessed image is shown in an image 501 of FIG. An example of a learning pattern of the category “e” extracted from the image 501 is shown in an image 502 of FIG.

【００３３】抽出された学習パターンは、テンプレート
作成回路８に入力される。テンプレート作成回路４で
は、入力された学習パターンを用いて、テンプレートを
作成する（ステップ２０５）。テンプレート作成方法と
しては、例えば、文字列から学習パターンを抽出した場
合、学習パターンのカテゴリとその前後の文字のカテゴ
リからなる３つのカテゴリの組が同じである学習パター
ンを集め、各画素位置で平均値をとる方法や、クラスタ
リングによる方法などを用いることができる。作成され
たテンプレートの例を図６に挙る。The extracted learning pattern is input to the template creating circuit 8. The template creation circuit 4 creates a template using the input learning pattern (step 205). As a template creation method, for example, when a learning pattern is extracted from a character string, learning patterns having the same set of three categories consisting of the category of the learning pattern and the categories of characters before and after the learning pattern are collected, and averaged at each pixel position. A method of taking a value, a method by clustering, or the like can be used. FIG. 6 shows an example of the created template.

【００３４】作成されたテンプレートは、テンプレート
記憶回路５に入力される（ステップ２０６）。テンプレ
ート記憶回路４では、入力されたテンプレートを新規保
存・追加・更新する。The created template is input to the template storage circuit 5 (step 206). In the template storage circuit 4, the input template is newly stored / added / updated.

【００３５】次に、他の実施例として、学習用画像か
ら、手動で学習パターンを抽出する実施例について説明
する。Next, as another embodiment, an embodiment in which a learning pattern is manually extracted from a learning image will be described.

【００３６】この実施例では、図２のステップ２０２お
よび２０３の代わりに、読み込まれた学習用画像をディ
スプレィに表示し、学習用画像中の学習パターンの位置
およびカテゴリを手動により入力する。その後、上記の
実施例と同様である。この実施例による方法で作成され
たテンプレートセットの例を同じく図６に示す。In this embodiment, instead of steps 202 and 203 in FIG. 2, the read learning image is displayed on the display, and the position and category of the learning pattern in the learning image are manually input. After that, it is the same as the above embodiment. FIG. 6 also shows an example of a template set created by the method according to this embodiment.

【００３７】図６のテンプレートを用いて、本棚画像中
の３４６８文字を認識したときの結果を表１に示す。表
１では、比較のため、図６のテンプレートから周囲の情
報を削除したテンプレートを用いて認識した場合の認識
率も同時に掲載してある。Table 1 shows the results when 3468 characters in the bookshelf image were recognized using the template of FIG. In Table 1, for the sake of comparison, the recognition rate when recognition is performed using a template in which surrounding information has been deleted from the template in FIG. 6 is also shown.

【表１】 [Table 1]

【００３８】次に、学習用画像から周囲の画像情報を含
まない学習パターンを抽出してテンプレートに用いる例
について説明する。即ち、入力画像パターンの周囲のカ
テゴリの組のすべてに対してあらかじめ学習パターンが
入手できない場合などには、周囲の画像情報を含まない
学習パターンからなるテンプレートも用意する必要があ
る。Next, an example will be described in which a learning pattern that does not include surrounding image information is extracted from a learning image and used as a template. That is, when learning patterns cannot be obtained in advance for all of the set of categories around the input image pattern, it is necessary to prepare a template including a learning pattern that does not include surrounding image information.

【００３９】図１において、学習用画像記憶回路１から
学習用画像を読み出し、単独学習パターン抽出回路３に
入力する。単独学習パターン抽出回路３では、該学習用
画像から、例えば手動で周囲情報を含まない学習パター
ンを切り出す。これは、フィルタリングなどによりノイ
ズ成分を除去し、周囲情報を含まない学習パターンを抽
出してもよい。該学習パターンはテンプレート作成回路
４に送られる。テンプレート作成回路４では、入力され
た学習パターンからテンプレートを作成し、テンプレー
ト記憶回路５４に送る。テンプレート記憶回路５では、
入力された個別パターンからなるテンプレートを、周囲
の情報も含めたテンプレートに追加・更新する。In FIG. 1, a learning image is read from the learning image storage circuit 1 and is input to a single learning pattern extraction circuit 3. In the independent learning pattern extraction circuit 3, for example, a learning pattern that does not include surrounding information is manually cut out from the learning image. In this case, a noise component may be removed by filtering or the like, and a learning pattern not including surrounding information may be extracted. The learning pattern is sent to the template creation circuit 4. The template creation circuit 4 creates a template from the input learning pattern and sends it to the template storage circuit 54. In the template storage circuit 5,
Adds / updates the template composed of the input individual patterns to a template including surrounding information.

【００４０】いずれの実施例においても、テンプレート
作成後に、従来から公知のクラスタリング手法を用いて
テンプレート数を削減する処理を行い、新たに作成され
たテンプレートを保存することも可能である。In any of the embodiments, after the template is created, a process for reducing the number of templates can be performed by using a conventionally known clustering method, and the newly created template can be stored.

【００４１】なお、以上の説明では、文字を中心に示し
たが、文字以外の画像パターンについても同様が処理が
可能である。In the above description, characters are mainly shown, but the same processing can be applied to image patterns other than characters.

【００４２】[0042]

【発明の効果】以上説明したように、発明によれば、以
下のような効果が得られる。（１）周囲の情報を含んだテンプレートを作成し、この
テンプレートを用いて認識を行うことにより、個別文字
の変形が大きい場合でも周囲の文字などの情報を利用す
ることにより、正しく認識できる利点がある。さらに、
従来法が行ったいたような単語テンプレートを作成する
必要がないため、テンプレート数を減らすことができ、
また未知語が入力された場合でも柔軟に対処することが
できる。As described above, according to the present invention, the following effects can be obtained. (1) By creating a template including surrounding information and performing recognition using this template, even when individual characters are greatly deformed, the advantage that the information such as surrounding characters can be used for correct recognition can be obtained. is there. further,
Since there is no need to create word templates as in the conventional method, the number of templates can be reduced,
Also, it is possible to flexibly cope with a case where an unknown word is input.

【００４３】（２）学習用画像中の学習パターンの位置
を手動で入力することにより、学習パターンの劣化の品
質を目で確認しながら抽出でき、抽出誤りがない、とい
う利点がある。(2) By manually inputting the position of the learning pattern in the learning image, it is possible to extract the learning pattern while checking the quality of the deterioration of the learning pattern visually, and there is an advantage that there is no extraction error.

【００４４】（３）学習用画像をあらかじめ用意されて
いる初期辞書で認識し、その結果を用いて学習用画像か
ら学習パターンを自動的に抽出し、テンプレートの作成
・追加・更新を行うことにより、自動的に学習用画像に
適した辞書が構成され、認識率を向上させるという利点
がある。(3) By recognizing a learning image with an initial dictionary prepared in advance, automatically extracting a learning pattern from the learning image using the result, and creating, adding, and updating a template. Thus, there is an advantage that a dictionary suitable for the learning image is automatically formed, and the recognition rate is improved.

【００４５】（４）作成された辞書にクラスタリングを
適用することにより、冗長なテンプレートを削減するこ
とが可能となる。(4) By applying clustering to the created dictionary, redundant templates can be reduced.

【００４６】（５）単独学習パターン抽出手段を持たせ
ることで、入力パターンの文字カテゴリと周囲のカテゴ
リの組み合わせと同じ組み合わせのテンプレートを持っ
ていない場合でも、入力パターンを正しく認識できるよ
うになる。(5) By providing the independent learning pattern extracting means, the input pattern can be correctly recognized even if the user does not have a template having the same combination as the combination of the character category of the input pattern and the surrounding categories.

[Brief description of the drawings]

【図１】本発明によるパターン認識装置の構成例を示す
ブロック図である。FIG. 1 is a block diagram illustrating a configuration example of a pattern recognition device according to the present invention.

【図２】本発明の一実施形態のテンプレート作成方法を
説明するためのフローチャートである。FIG. 2 is a flowchart illustrating a template creation method according to an embodiment of the present invention.

【図３】学習用画像および前処理済画像および認識結果
の例を示す図である。FIG. 3 is a diagram illustrating an example of a learning image, a preprocessed image, and a recognition result.

【図４】初期辞書中のテンプレートの例を示す図であ
る。FIG. 4 is a diagram showing an example of a template in an initial dictionary.

【図５】前処理済画像から学習パターンを抽出する方法
を説明するための図である。FIG. 5 is a diagram for explaining a method of extracting a learning pattern from a preprocessed image.

【図６】作成されたテンプレートの例を示す図である。FIG. 6 is a diagram illustrating an example of a created template.

[Explanation of symbols]

１学習用画像記憶回路２学習パターン抽出回路３単独学習パターン抽出回路４テンプレート作成回路５テンプレート記憶回路６パターン記憶回路７認識回路 REFERENCE SIGNS LIST 1 image storage circuit for learning 2 learning pattern extraction circuit 3 independent learning pattern extraction circuit 4 template creation circuit 5 template storage circuit 6 pattern storage circuit 7 recognition circuit

フロントページの続き (72)発明者萩田紀博東京都新宿区西新宿三丁目19番２号日本電信電話株式会社内Ｆターム(参考） 5B064 AB02 AB03 AB13 AB16 AB17 DA27 DB06 5L096 FA19 HA08 JA03 JA09 JA22 KA04 LA17 MA07 Continued on the front page (72) Inventor Norihiro Hagita 3-19-2 Nishi-Shinjuku, Shinjuku-ku, Tokyo F-term (reference) in Nippon Telegraph and Telephone Corporation 5B064 AB02 AB03 AB13 AB16 AB17 DA27 DB06 5L096 FA19 HA08 JA03 JA09 JA22 KA04 LA17 MA07

Claims

[Claims]

1. A pattern recognition method for calculating a similarity or a distance value between an input image pattern and a template of an image category created using a learning image in advance, and recognizing the category of the input image pattern. Extracting a learning pattern included in a learning image including surrounding image information, creating a template using the extracted learning pattern, and recognizing an input image pattern using the template. Recognition method.

2. The pattern recognition method according to claim 1, wherein a learning pattern is extracted from a learning image using manually designated position information.

3. A pattern recognition method according to claim 1, wherein the learning image is recognized by an initial dictionary prepared in advance, and a learning pattern is extracted from the learning image by using the recognition result. Recognition method.

4. The pattern recognition method according to claim 1, wherein the number of templates is reduced by clustering a plurality of created templates.

5. The pattern recognition method according to claim 1, wherein a learning pattern that does not include surrounding image information is extracted from the learning image, and a template is created from the learning pattern. Pattern recognition method.

6. A pattern recognition apparatus that calculates a similarity or a distance value between an input image pattern and a template of an image category created using a learning image in advance, and recognizes the category of the input image pattern. A pattern storage unit that stores an input image pattern, a learning image storage unit that stores a learning image, and a learning pattern extraction unit that extracts a learning pattern included in the learning image including surrounding image information, Pattern recognition, comprising: template creation means for creating a template using the learning pattern; template storage means for storing the template; and recognition means for recognizing the input image pattern using the template. apparatus.

7. The pattern recognition apparatus according to claim 6, further comprising a single learning pattern extraction unit for extracting a learning pattern not including surrounding image information from the learning image.

8. A template for calculating a similarity or a distance value between an input image pattern and a template of an image category created using a learning image in advance, and recognizing the category of the input image pattern. A computer-readable recording medium storing a program, a processing process of extracting a learning pattern included in a learning image including surrounding image information, and a processing process of creating a template using the extracted learning pattern. A recording medium on which a template creation program is recorded.