JPH1063779A

JPH1063779A - Character recognition method, and device therefor

Info

Publication number: JPH1063779A
Application number: JP8223873A
Authority: JP
Inventors: Kazuo Shimura; 一男志村
Original assignee: Fuji Photo Film Co Ltd
Current assignee: Fujifilm Holdings Corp
Priority date: 1996-08-26
Filing date: 1996-08-26
Publication date: 1998-03-06

Abstract

PROBLEM TO BE SOLVED: To recognize character from a picture including an inverted character in a character recognizing device with high precision. SOLUTION: A binarized picture 3 is generated by extracting and binarizing the contour of a picture from an original picture 1 including an inverted character 2 by a binarized picture generating means 30 and an inner part of the contour is painted out by setting pixel values of pixels in a closed area inside the contour of the character whose contour is extracted to be equal to the pixel value of the contour by a painting-out means 35. Then, the character is recognized by performing pattern matching using a template 7 for a binarized picture 3' in which inner part of its contour is painted out by a matching means 40.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文字を含む画像中
から文字を認識する方法および装置に関するものであ
り、詳細にはテンプレートを用いたパターンマッチング
による文字認識を行う文字認識方法および装置に関する
ものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and an apparatus for recognizing characters in an image including characters, and more particularly to a method and an apparatus for recognizing characters by pattern matching using a template. It is.

【０００２】[0002]

【従来の技術】従来より、人体等の被写体をＣＴ、ＭＲ
Ｉ等によって撮影して得られた医用画像の画像データ
を、各画像データに適切な画像処理を施した後、画像を
再生記録することが行われており、これらの画像データ
に基づき被写体の画像を写真感光材料等の記録材料、Ｃ
ＲＴ等に可視像として出力させる画像記録再生システム
がすでに実用化されている。2. Description of the Related Art Conventionally, a subject such as a human body is subjected to CT, MR, or the like.
The image data of a medical image obtained by photographing with I or the like is subjected to appropriate image processing for each image data, and then the image is reproduced and recorded. Based on these image data, the image of the subject is Is a recording material such as a photosensitive material, C
An image recording / reproducing system for outputting a visible image to an RT or the like has already been put to practical use.

【０００３】通常医用画像には、患者名、ＩＤ番号、撮
影日、生年月日等のＩＤ情報が付されており、これらの
ＩＤ情報は、一般に英文字、数字、記号等（以下「文
字」と総称する）から構成されている。従って、医用画
像には絵柄と文字とが混在している。この絵柄と文字が
混在した医用画像から自動的に文字のみを識別すること
が望まれる場合がある。それは、例えば以下のような場
合である。Normally, medical images are provided with ID information such as a patient name, an ID number, an imaging date, a date of birth, and the like. These ID information generally include English characters, numerals, symbols, and the like (hereinafter, “characters”). ). Therefore, a pattern and a character are mixed in the medical image. There is a case where it is desired to automatically identify only characters from a medical image in which pictures and characters are mixed. This is the case, for example, as follows.

【０００４】通常、画像記録再生システムにおいては、
撮影された画像データが検索情報となるＩＤ情報と関連
づけてファイリング装置に記憶されている。このファイ
リング装置への画像データファイリング時における画像
データとＩＤ情報との関連づけは、外部入力により行わ
れており操作者側の負担となっていた。そのため、撮影
時に画像中に付されたＩＤ情報を自動的に認識できるよ
うにすることが望まれている。Normally, in an image recording / reproducing system,
The captured image data is stored in the filing device in association with ID information serving as search information. The association between the image data and the ID information at the time of filing the image data to the filing device is performed by an external input, and is a burden on the operator side. Therefore, it is desired to be able to automatically recognize the ID information added to the image at the time of shooting.

【０００５】従来、文字を含む画像から文字を認識する
ための方法としては、認識すべき文字のテンプレートを
用意し、画像と比較して認識を行うテンプレートマッチ
ングという方法が知られている。この方法は、画像上の
認識対象領域と全てのテンプレートとを順次マッチング
させ、マッチング結果が最も良いテンプレートを認識結
果とするものであり、認識対象領域が指定されていない
場合は画像上での認識対象領域を順次ずらして、画像全
面に対してマッチングをとっていく必要がある。Conventionally, as a method for recognizing a character from an image including the character, there is known a method called template matching in which a template of a character to be recognized is prepared and compared with an image to perform recognition. In this method, the recognition target area on the image and all templates are sequentially matched, and the template with the best matching result is determined as the recognition result. If no recognition target area is specified, the recognition on the image is performed. It is necessary to match the entire image by shifting the target area sequentially.

【０００６】以下、一般的なテンプレートマッチングに
よる認識方法を説明する。画像パターンｆとテンプレー
トパターンｇが、それぞれ図４に示すような大きさと位
置関係にあるとき、画像の点（m，n）における画像パタ
ーンｆと、テンプレートパターンｇの類似度は、以下に
示す評価関数で表される。Hereinafter, a general recognition method using template matching will be described. When the image pattern f and the template pattern g have a size and a positional relationship as shown in FIG. 4, respectively, the similarity between the image pattern f and the template pattern g at the point (m, n) of the image is evaluated as follows. Expressed by a function.

【０００７】[0007]

【数１】 (Equation 1)

【０００８】ここで通常、Ｒ(m,n)は相互相関係数と呼
ばれ、領域Ｄは、|i-m|＜M/2，|j-n|＜N/2を満たす
（i，j）の範囲である。画像上でテンプレートを動か
して順次、画像とテンプレートとの類似度を上記の相互
相関係数Ｒにより判別する。相互相関係数Ｒの値は大き
いほど画像の（m，n）を中心としてテンプレートに近い
図形が存在すること、すなわち、画像とテンプレートと
の相関が強いことを示し、画像とテンプレートとが完全
一致で１の値、全くの不一致で０の値をとる。通常、最
も相互相関係数Ｒの大きいものを認識結果とする。Here, usually, R (m, n) is called a cross-correlation coefficient, and a region D has a range of (i, j) satisfying | im | <M / 2, | jn | <N / 2. It is. By moving the template on the image, the similarity between the image and the template is sequentially determined based on the cross-correlation coefficient R described above. A larger value of the cross-correlation coefficient R indicates that a figure closer to the template exists around the image (m, n), that is, the image and the template have a stronger correlation, and the image and the template are completely matched. Takes a value of 1 and a value of 0 when there is no match. Normally, the one having the largest cross-correlation coefficient R is regarded as the recognition result.

【０００９】しかしながら、上記のテンプレートマッチ
ング方法では、上記式に示されるように２乗や平方根の
計算が必要であること、また、通常多階調の画像に対し
てマッチングを行うことから、計算量が多くなり認識に
時間がかかるという欠点がある。However, the above-described template matching method requires the calculation of the square or the square root as shown in the above equation, and usually requires matching for a multi-tone image. However, there is a drawback that recognition takes a long time.

【００１０】そこで、高速な認識処理のため、画像を２
値化して、２値化画像に対してマッチングを行う文字認
識装置、更には、文字が絵柄と重なった部分についても
精度良く認識可能とするため、画像の輪郭抽出した２値
化画像に対してマッチングを行う文字認識装置が、既に
本出願人により、提案されている（特願平８−167302号
等）。Therefore, for high-speed recognition processing, two images are used.
A character recognition device that performs binarization and performs matching on a binarized image. Furthermore, in order to accurately recognize a portion where a character overlaps a picture, a binarized image extracted from the outline of the image is used. A character recognition device that performs matching has already been proposed by the present applicant (Japanese Patent Application No. 8-167302).

【００１１】[0011]

【発明が解決しようとする課題】上述の輪郭抽出した２
値化画像におけるマッチングにより、認識の精度はある
程度向上する。しかし、図２に示すような反転文字２を
含む画像１を輪郭抽出２値化すると、２値化画像３にお
いて文字の輪郭内部にもエッジ４が残る。そのため、テ
ンプレートとマッチングが完全なものとならず、上述の
評価関数の評価値が下がってしまうという欠点がある。
この評価値の低下は誤認識の原因となりうるものであ
る。SUMMARY OF THE INVENTION
The accuracy of the recognition is improved to some extent by the matching in the binarized image. However, when the image 1 including the inverted character 2 as shown in FIG. 2 is subjected to outline extraction binarization, the edge 4 remains inside the outline of the character in the binarized image 3. For this reason, there is a drawback that the template and the matching are not perfect, and the evaluation value of the above-mentioned evaluation function is lowered.
This decrease in the evaluation value can cause erroneous recognition.

【００１２】そこで、本発明は上記事情に鑑み、反転文
字を含む画像からも精度良く文字を認識する文字認識方
法および装置を提供することを目的とする。In view of the above circumstances, it is an object of the present invention to provide a character recognition method and apparatus for recognizing characters with high accuracy even from an image including inverted characters.

【００１３】[0013]

【課題を解決するための手段】本発明の文字認識方法
は、文字を含む画像中から該文字を認識する文字認識方
法であって、前記画像から画像の輪郭を抽出した２値化
画像を作成し、前記輪郭に囲まれた閉領域を前記輪郭の
画素値と同じ値とし、２値化された文字のテンプレート
を有する辞書を用意し、前記２値化画像に対して前記テ
ンプレートの文字を用いてパターンマッチングを行い文
字を認識することを特徴とするものである。A character recognition method according to the present invention is a character recognition method for recognizing a character from an image including the character, and generates a binary image by extracting an outline of the image from the image. Then, a closed region surrounded by the outline is set to the same value as the pixel value of the outline, a dictionary having a template of a binarized character is prepared, and the character of the template is used for the binarized image. And performs pattern matching to recognize characters.

【００１４】本発明の文字認識装置は、文字を含む画像
中から該文字を認識する文字認識装置であって、前記画
像から画像の輪郭を抽出した２値化画像を生成する２値
化画像生成手段と、前記輪郭に囲まれた閉領域の画素値
を前記輪郭の画素値と同じ値にする手段と、２値化され
た文字のテンプレートを有する辞書と、前記２値化画像
に対して前記テンプレートを用いてパターンマッチング
させることにより文字を認識するマッチング手段とを備
えることを特徴とするものである。A character recognition apparatus according to the present invention is a character recognition apparatus for recognizing a character from an image including the character, and generates a binary image by extracting a contour of the image from the image. Means, means for making the pixel value of a closed region surrounded by the contour the same as the pixel value of the contour, a dictionary having a binarized character template, and A matching unit that recognizes characters by performing pattern matching using a template.

【００１５】前記画像の輪郭を抽出した２値化画像の生
成により、画像中の文字は、一般に中抜き文字、若しく
は袋文字と呼ばれる文字として抽出される。それ故、前
記「輪郭に囲まれた閉領域」とは、主として中抜き文字
の中抜き部分をいう。しかし、例えば図３の２値化画像
３に示す中抜き文字「Ａ」，「Ｂ」等のように外側の輪
郭によって囲まれた閉領域の内部にさらに閉領域を有す
る輪郭（内部輪郭）がある文字の場合、該内部輪郭に囲
まれた閉領域をも前記「輪郭に囲まれた閉領域」に相当
する。そのため、これらの文字に関しては、通常の中抜
き文字の中抜き部だけでなく内部輪郭に囲まれた閉領域
をも輪郭の画素値と同じ値としているテンプレートを用
いてマッチングすることが望ましい。By generating a binarized image by extracting the outline of the image, characters in the image are extracted as characters generally called hollow characters or bag characters. Therefore, the “closed region surrounded by the outline” mainly refers to a hollow portion of a hollow character. However, for example, a contour (inner contour) having a further closed area inside a closed area surrounded by an outer contour, such as hollow characters “A” and “B” shown in the binarized image 3 in FIG. In the case of a certain character, the closed area surrounded by the internal outline also corresponds to the “closed area surrounded by the outline”. Therefore, for these characters, it is desirable to perform matching using a template having the same value as the pixel value of the outline not only of the hollow portion of the normal hollow character but also of a closed region surrounded by the internal outline.

【００１６】前記「閉領域を前記輪郭の画素値と同じ値
とし」とは、例えば、画像が「０」、「１」に２値化さ
れており、輪郭の画素値が「１」である場合、閉領域の
画素値を「１」とすることを意味する。The phrase "the closed area has the same value as the pixel value of the contour" means, for example, that the image is binarized into "0" and "1" and the pixel value of the contour is "1". In this case, it means that the pixel value of the closed area is “1”.

【００１７】すなわち、本発明の文字認識方法および装
置においては、中抜き部分を輪郭部の画素値と同じ画素
値とすることによって、画像の輪郭抽出２値化の際に抽
出された中抜き文字の内部に生じうるエッジによるマッ
チング誤差をなくし、マッチング精度の向上を図ったも
のである。That is, in the character recognition method and apparatus according to the present invention, the outline character is extracted at the time of binarizing the outline of the image by setting the outline portion to the same pixel value as the pixel value of the outline portion. This eliminates a matching error due to an edge that may occur inside the symbol and improves the matching accuracy.

【００１８】なお、「画像の輪郭を抽出した２値化」の
方法としては、例えば、特願平８−167302号に開示され
ている２値化方法を用いればよく、また文字の輪郭抽出
方法としては、その他に一次微分フィルタ（ロバーツの
フィルタ、ブレヴィットのフィルタ、ソーベルのフィル
タ等）あるいはラプラシアンフィルタ等を用いる方法が
挙げられる。As the method of "binarization by extracting the outline of an image", for example, the binarization method disclosed in Japanese Patent Application No. 8-167302 may be used. Other examples include a method using a first-order differential filter (Roberts filter, Brevit filter, Sobel filter, or the like), a Laplacian filter, or the like.

【００１９】ここで画像とは文字と絵柄が混在したもの
をいう。Here, the image means a mixture of characters and pictures.

【００２０】[0020]

【発明の効果】本発明の文字認識方法および装置では、
画像の輪郭を抽出２値化した２値化画像において、抽出
されている輪郭に囲まれる閉領域を、該輪郭の画素値と
同じ画素値として文字のテンプレートを用いてパターン
マッチングを行うので、反転文字等を輪郭抽出２値化す
る際に中抜き文字の中抜き部分に生じうるエッジによる
マッチング評価値の低下を防ぐことができ、誤認識を低
減した高精度な文字認識を行うことができる。According to the character recognition method and apparatus of the present invention,
In a binarized image obtained by extracting and binarizing an outline of an image, a closed region surrounded by the extracted outline is subjected to pattern matching using a character template as the same pixel value as the pixel value of the outline. When a character or the like is subjected to contour extraction binarization, it is possible to prevent a decrease in a matching evaluation value due to an edge that may occur in a hollow portion of a hollow character, thereby performing highly accurate character recognition with reduced erroneous recognition.

【００２１】[0021]

【発明の実施の形態】以下、本発明の文字認識方法およ
び装置の具体的な実施の形態を図面を用いて説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, specific embodiments of the character recognition method and apparatus of the present invention will be described with reference to the drawings.

【００２２】図１に示すように、本発明の文字認識装置
10は、各種撮影装置９から入力された原画像データＳに
基づく原画像の文字領域を認識する文字領域認識手段20
と、該文字領域認識手段20により認識された文字領域の
画像の輪郭を抽出すると共に２値化して２値化画像を生
成する２値化画像生成手段30と、該２値化画像生成手段
30によって生成された２値化画像において抽出された輪
郭で囲まれた閉領域の画素値をエッジ部の画素値と同じ
値とする塗りつぶし手段35と、辞書45の有する各種、各
サイズの文字のテンプレート７を用いてパターンマッチ
ングさせることにより文字を認識するマッチング手段40
とから成る。As shown in FIG. 1, the character recognition device of the present invention
Reference numeral 10 denotes a character area recognizing means 20 for recognizing a character area of the original image based on the original image data S input from the various photographing devices 9.
A binarized image generating means 30 for extracting the outline of the image of the character area recognized by the character area recognizing means 20 and binarizing the image to generate a binarized image;
A filling means 35 for setting the pixel value of the closed region surrounded by the contour extracted in the binarized image generated by the pixel 30 to the same value as the pixel value of the edge portion; Matching means 40 for recognizing characters by performing pattern matching using template 7
Consisting of

【００２３】文字領域認識手段20は、詳しくは、画像構
成判定手段22を備え、該画像構成判定手段22は、原画像
を複数の領域に分割し各領域の画像構成を表すデータを
作成するデータ作成手段24と、該データ作成手段24によ
り作成されたデータを基に、各領域が文字領域であるか
否かを判定する判定手段26とからなる。More specifically, the character area recognizing means 20 includes an image structure determining means 22. The image structure determining means 22 divides the original image into a plurality of areas and generates data representing the image structure of each area. It comprises a creating means 24 and a judging means 26 for determining whether each area is a character area based on the data created by the data creating means 24.

【００２４】２値化画像生成手段30は、原画像の各画素
毎に該画素の所定近傍範囲中の画素濃度の最大値と最小
値との差を求める演算手段32と、該演算手段32によって
求められた濃度差と所定の閾値との比較により各画素を
２値化して画像の輪郭を抽出する２値化手段34とから成
る。The binarized image generating means 30 includes, for each pixel of the original image, a calculating means 32 for calculating a difference between a maximum value and a minimum value of a pixel density in a predetermined neighborhood of the pixel, and the calculating means 32 A binarizing means 34 for binarizing each pixel by comparing the obtained density difference with a predetermined threshold to extract the outline of the image;

【００２５】以下に、本文字認識装置の作用を説明す
る。The operation of the character recognition device will be described below.

【００２６】各種撮影装置９によって撮影された原画像
を担持する画像データはデジタル化されて原画像データ
Ｓとして文字認識装置10に入力される。文字認識装置10
においては、まず、文字領域認識手段20による文字領域
の認識が行われる。文字領域の認識は例えば、特願平８
−179344号記載の画像判定装置等を用いて行うことがで
きる。Image data carrying an original image photographed by the various photographing devices 9 is digitized and input to the character recognition device 10 as original image data S. Character recognition device 10
In, first, a character area is recognized by the character area recognition means 20. For example, Japanese Patent Application No. 8
The image determination can be performed using an image determination device described in JP-A-179344.

【００２７】画像を複数の領域に分割し、画像判定手段
22内のデータ作成手段24により各領域の画像構成を表す
ヒストグラムデータを作成する。The image is divided into a plurality of areas, and
The data creating means 24 in the area 22 creates histogram data representing the image configuration of each area.

【００２８】ヒストグラムデータは、一般に、絵柄領域
については中間濃度域の頻度が大きく低濃度および高濃
度の頻度が小さいグラフ、文字領域については中間濃度
域の頻度が小さく低濃度および高濃度に大きな頻度を持
つグラフとなる。なお、文字と絵柄の組み合わさった領
域（以下、混合領域という）は全体的に頻度を持つグラ
フとなる。そこで、判定手段24においては、各領域のヒ
ストグラムデータを基に、各領域が文字を含む領域（文
字領域あるいは混合領域）であるか否かを判定する。こ
のように文字を含む領域を予め識別することにより、マ
ッチング範囲を限定することを可能とし、処理の高速化
を図っている。Generally, the histogram data is a graph in which the frequency of the intermediate density area is large and the frequency of the low density and the high density are small for the picture area, and the frequency of the intermediate density area is small and the frequency of the low density and the high density are large for the character area. Becomes a graph with. Note that an area in which characters and patterns are combined (hereinafter, referred to as a mixed area) is a graph having frequency as a whole. Therefore, the determining means 24 determines whether each area is an area including a character (a character area or a mixed area) based on the histogram data of each area. By previously identifying the area including the character, the matching range can be limited, and the processing can be speeded up.

【００２９】文字領域判定手段20によって文字を含む領
域であると判定された領域（以下、文字付与領域とい
う）の画像について、２値化画像生成手段30による２値
化が行われる。画像の輪郭抽出２値化は例えば、特願平
８−167302号等に開示されている方法を用いて行う。The binarized image generation unit 30 performs binarization on an image of an area (hereinafter, referred to as a character addition area) determined to be an area including a character by the character area determination unit 20. The image contour extraction binarization is performed using, for example, a method disclosed in Japanese Patent Application No. 8-167302.

【００３０】文字付与領域の画像データについて、まず
演算手段32において画像の各画素の所定の近傍範囲中に
おける濃度の最大値と最小値との差が求められる。例え
ば注目画素を２×２画素マスク中の左下に位置させた場
合、その２×２画素マスク（所定近傍範囲）中の計４画
素における階調濃度（原画像データＳ）の最大値および
最小値を求め、これを注目画素の濃度差とする。当然な
がら、２×２という範囲は一例であり、これに限る必要
はない。次に、２値化手段34において、前記演算手段32
によって求められた前記濃度差を所定の閾値と比較し
て、例えば閾値より濃度差が大きい場合を「１」、小さ
い場合を「０」として注目画素を２値化する。この２値
化を画像中の各画素について行うことにより、画像中の
近傍画素同志の濃度差の大きい部分、すなわち画像の輪
郭が抽出され、２値化画像が生成される。通常、文字と
周辺画像の濃度差は大きいため、文字の輪郭が抽出され
る。なお、反転文字を含む場合には、例えば、図２に示
すような反転文字Ｅを含む原画像文字領域１からは２値
化画像３が生成され、２値化画像３には中抜き文字
「Ｅ」の中抜き部に反転境界線がエッジ４として抽出さ
れる。For the image data of the character providing area, first, the difference between the maximum value and the minimum value of the density in a predetermined vicinity range of each pixel of the image is calculated by the calculating means 32. For example, when the target pixel is located at the lower left of the 2 × 2 pixel mask, the maximum value and the minimum value of the gradation density (original image data S) at a total of 4 pixels in the 2 × 2 pixel mask (predetermined neighborhood range) , And this is used as the density difference of the pixel of interest. Naturally, the range of 2 × 2 is an example, and it is not necessary to limit to this. Next, in the binarizing means 34, the arithmetic means 32
The target pixel is binarized by comparing the density difference obtained by the above with a predetermined threshold value, for example, setting “1” when the density difference is larger than the threshold value and “0” when the density difference is smaller than the threshold value. By performing this binarization for each pixel in the image, a portion where the density difference between neighboring pixels in the image is large, that is, the outline of the image is extracted, and a binarized image is generated. Usually, since the density difference between the character and the surrounding image is large, the outline of the character is extracted. In the case of including the inverted character, for example, a binary image 3 is generated from the original image character area 1 including the inverted character E as shown in FIG. The inversion boundary line is extracted as the edge 4 in the hollow portion of “E”.

【００３１】その他、輪郭抽出２値化処理として、注目
画素とその周囲の各画素の信号（濃度）差（Ｓ(i,j)−
Ｓ(i+m,j+n)）を求め、この信号差の絶対値（｜Ｓ(i,j)
−Ｓ(i+m,j+n)|）が最大のもの（max|Ｓ(i,j)−Ｓ(i+m,
j+n)|）を注目画素における信号差とみなして、この濃
度差が所定の閾値（Th)よりも大きい場合に注目画素の
画素値を「１」、濃度差が所定の閾値以下であれば、
「０」とする、即ち、Ｓ'(i,j)＝max(|Ｓ(i,j)−Ｓ(i+m,j+n）｜), (m=-1,0,
+1，n=-1,0,1) if Ｓ'(i,j)＞Th，Ｓ"(i,j)＝１ else Ｓ"(i,j)＝０なる演算により２値化してもよい。In addition, as a contour extraction binarization process, the signal (density) difference (S (i, j) −
S (i + m, j + n)), and the absolute value of this signal difference (| S (i, j)
−S (i + m, j + n) |) (max | S (i, j) −S (i + m,
j + n) |) is regarded as a signal difference in the target pixel, and when the density difference is larger than a predetermined threshold (Th), the pixel value of the target pixel is “1”, and the density difference is equal to or smaller than the predetermined threshold. If
"0", that is, S '(i, j) = max (| S (i, j) -S (i + m, j + n) |), (m = -1,0,
+ 1, n = -1,0,1) if S '(i, j)> Th, S "(i, j) = 1 else S" (i, j) = 0 Good.

【００３２】次に、塗りつぶし手段35において、図２の
２値化画像３において抽出された輪郭に囲まれた部分
（閉領域）の各画素の画素値を該輪郭の画素値（ここで
は、「１」）と同じ値にする。即ち、輪郭抽出されて中
抜き文字となっている２値化画像中の中抜き文字「Ｅ」
の輪郭内部（中抜き部）を塗り潰した２値化画像３’と
する。これによって、輪郭抽出時に生じる輪郭内部のエ
ッジ４は見かけ上なくなる。Next, in the filling means 35, the pixel value of each pixel of the portion (closed area) surrounded by the contour extracted in the binarized image 3 of FIG. 1)). That is, the outline character “E” in the binarized image whose outline is extracted and becomes the outline character
Is defined as a binarized image 3 ′ in which the inside of the outline (the hollow portion) is painted out. As a result, the edge 4 inside the contour generated at the time of contour extraction disappears.

【００３３】その後、辞書45の有する各種、各サイズの
文字のテンプレート７を用いてマッチング手段40におい
てパターンマッチングが行われる。Thereafter, pattern matching is performed by the matching means 40 using the template 7 of characters of various sizes and sizes of the dictionary 45.

【００３４】ここで、マッチングは２値化された画像に
対して行うため、排他的積和（exclusive or）による類
似度評価関数Ｒ’を用いることができ、既述の評価関数
Ｒを用いる場合と比較して計算時間を短縮することがで
きる。なお、画像上の点（m，n）における画像パターン
ｆとテンプレートｇのＲ'(m,n)は、Here, since the matching is performed on the binarized image, a similarity evaluation function R 'based on exclusive product-sum (exclusive or) can be used. The calculation time can be shortened as compared with. Note that R ′ (m, n) of the image pattern f and the template g at the point (m, n) on the image is

【００３５】[0035]

【数２】 (Equation 2)

【００３６】で表され、前述の相互相関係数Ｒの場合と
は逆に、Ｒ’が最も小さいテンプレートが認識結果とさ
れる。In contrast to the case of the aforementioned cross-correlation coefficient R, the template having the smallest R ′ is determined as the recognition result.

【００３７】なお、上述の２値化の際に、「１」，
「０」ではなく、「１」，「−１」に２値化して単純な
積和値を評価関数に用いてもよい。その場合の評価関数
Ｒ"(m,n)は、In the above-mentioned binarization, "1",
Instead of “0”, the sum may be binarized to “1” or “−1” and a simple sum of products value may be used for the evaluation function. The evaluation function R "(m, n) in that case is

【００３８】[0038]

【数３】 (Equation 3)

【００３９】で表され、Ｒ”が最も大きいテンプレート
が認識結果とされる。And the template having the largest R ″ is the recognition result.

【００４０】ここで、領域Ｄは、|i-m|＜Ｍ／２，|j-n|
＜Ｎ／２を満たす(i,j) の範囲である。Here, the area D is | im | <M / 2, | jn |
<N / 2 is satisfied (i, j).

【００４１】このように、本発明の文字認識装置におい
ては、画像の輪郭抽出２値化により抽出された文字の輪
郭の内部を塗りつぶした上でマッチングを行うので、輪
郭抽出２値化時に文字の輪郭内部に生じるエッジにより
マッチング評価値が下がることはない。As described above, in the character recognition device of the present invention, since the matching is performed after the inside of the outline of the character extracted by the outline extraction binarization of the image is filled, the character is extracted at the time of outline extraction binarization. The matching evaluation value does not decrease due to the edge generated inside the contour.

【００４２】なお、Ａ，Ｂ，Ｄ，等の中抜き文字のよう
に、外側の輪郭に囲まれる閉領域内に内側の輪郭に囲ま
れる閉領域が存在する場合があり、図３にこの例を示
す。図３に示すように、反転文字Ａを含む原画像１は、
画像の輪郭抽出２値化により、２値化画像３とされる。
このとき、中抜き文字「Ａ」の外側の輪郭5aおよび該輪
郭5a内部にある輪郭5bおよび反転境界エッジ４がそれぞ
れ抽出される。その後、塗りつぶし手段35によって、輪
郭に囲まれた閉領域を塗りつぶす場合、外側輪郭5aおよ
び内部輪郭5bに囲まれる領域、即ちこれが中抜き文字の
中抜き部分であるが、本来この部分のみ塗りつぶすこと
が理想的である。しかしながら、該塗りつぶし手段35に
よって、内部輪郭5bに囲まれる領域も「輪郭で囲まれる
閉領域」と認識される場合には、この部分も塗りつぶし
てしまうこととなり２値化画像３’となる。このような
２値化画像３’に対して通常の文字のテンプレート７を
用いてマッチングを行ったのでは、マッチング評価値が
低くなりマッチング精度の向上が図れない。そこで、テ
ンプレート７’のように、内部輪郭に囲まれる領域を塗
りつぶした文字のテンプレートを予め用意しておき、こ
れを用いてマッチングを行うこととする。これにより、
完全なマッチングが可能となる。ただし、このような内
部輪郭による閉領域をも塗りつぶして、マッチング可能
なものは、アルファベット、数字、平仮名等の平易な文
字に限られ、漢字等の複雑な文字の認識には適さない。
しかしながら、一般に医用画像中にＩＤ情報として用い
られるのは、アルファベット、数字程度に限られるため
上記方法を用いることができる。It should be noted that there are cases where a closed region surrounded by an inner contour exists within a closed region surrounded by an outer contour, such as a hollow character such as A, B, D, etc. FIG. Is shown. As shown in FIG. 3, the original image 1 including the inverted character A is
The binarized image 3 is obtained by binarizing the outline of the image.
At this time, the outline 5a outside the hollow character "A", the outline 5b inside the outline 5a, and the inverted boundary edge 4 are respectively extracted. Thereafter, when the closed area surrounded by the outline is painted out by the painting means 35, the area surrounded by the outer outline 5a and the inner outline 5b, that is, the hollow part of the hollow character, is originally filled only with this part. Ideal. However, if the area surrounded by the internal outline 5b is also recognized as the “closed area surrounded by the outline” by the filling means 35, this part is also filled and becomes the binarized image 3 ′. If matching is performed on such a binarized image 3 'using a normal character template 7, the matching evaluation value becomes low, and the matching accuracy cannot be improved. Therefore, a template of a character in which an area surrounded by an internal contour is filled like a template 7 'is prepared in advance, and matching is performed using the template. This allows
Perfect matching is possible. However, what can be matched by also filling a closed area with such an internal contour is limited to plain characters such as alphabets, numbers, and hiragana, and is not suitable for recognition of complicated characters such as kanji.
However, in general, ID information used in medical images is limited to alphabets and numerals, so that the above method can be used.

【００４３】上述のようにして、画像の文字付与領域か
ら文字認識が行われ、認識されたＩＤ番号等の文字情報
Ｉは、原画像データＳと共にファイリング装置50へ出力
される。As described above, the character recognition is performed from the character providing area of the image, and the recognized character information I such as the ID number is output to the filing device 50 together with the original image data S.

【００４４】このようにして、認識結果である文字情報
Ｉを自動的にＣＴ、ＭＲＩ等の医用画像データと対応付
けてファイリングすることができ、各画像データの検索
情報として用いることができる。In this manner, the character information I as the recognition result can be automatically filed in association with medical image data such as CT and MRI, and can be used as search information of each image data.

[Brief description of the drawings]

【図１】本発明の文字認識装置の概略ブロック図。FIG. 1 is a schematic block diagram of a character recognition device of the present invention.

【図２】本発明の文字認識装置におけるマッチングの一
例を説明する図。FIG. 2 is a view for explaining an example of matching in the character recognition device of the present invention.

【図３】本発明の文字認識装置におけるマッチングの別
の一例を説明する図。FIG. 3 is a view for explaining another example of matching in the character recognition device of the present invention.

【図４】マッチング方法を説明する図。FIG. 4 is a diagram illustrating a matching method.

[Explanation of symbols]

１原画像２反転文字３輪郭抽出２値化画像３’ 閉領域塗りつぶし２値化画像４エッジ５輪郭６閉領域塗りつぶし文字７，７’ テンプレート９撮影装置 10 文字認識装置 20 文字領域認識手段 22 画像構成判定手段 30 ２値化画像生成手段 35 塗りつぶし手段 40 マッチング手段 45 辞書 50 ファイリング装置 DESCRIPTION OF SYMBOLS 1 Original image 2 Inverted character 3 Outline extraction binarized image 3 'Closed area filling binarized image 4 Edge 5 Outline 6 Closed area filling character 7, 7' Template 9 Imaging device 10 Character recognition device 20 Character region recognition means 22 Image Configuration determining means 30 Binary image generating means 35 Filling means 40 Matching means 45 Dictionary 50 Filing device

Claims

[Claims]

1. A character recognition method for recognizing a character from an image including the character, comprising: creating a binarized image by extracting an outline of the image from the image; A dictionary having a binarized character template is prepared with the same value as the pixel value of the contour, and pattern matching is performed on the binarized image using the character of the template to recognize the character. Character recognition method.

2. A character recognition device for recognizing a character from an image including the character, comprising: a binarized image generating unit configured to generate a binarized image obtained by extracting an outline of the image from the image; Means for making the pixel value of the enclosed closed area the same as the pixel value of the outline; a dictionary having a binarized character template; and pattern matching using the template for the binarized image. And a matching unit for recognizing a character by causing the character to be recognized.