JP3413236B2

JP3413236B2 - Character recognition method

Info

Publication number: JP3413236B2
Application number: JP06866693A
Authority: JP
Inventors: 康司山中; 哲也岩田; 義典佐藤; 美楯松井
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 1993-03-26
Filing date: 1993-03-26
Publication date: 2003-06-03
Anticipated expiration: 2018-06-03
Also published as: EP0617380A2; JPH06282688A; EP0617380A3

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、Ｘ線フィルムに記され
る文字情報等の文字を合成識別関数を用いて認識するも
のにおいて、識別性能の高い合成識別関数を形成する方
法、該形成された合成識別関数を用いた文字認識方法及
び文字認識の前処理方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method of forming a synthetic discriminant function having a high discriminating ability in recognizing a character such as character information written on an X-ray film by using the synthetic discriminant function. The present invention relates to a character recognition method using the composite identification function and a preprocessing method for character recognition.

【０００２】[0002]

【従来の技術】現在、医療画像の多くはＸ線フィルムで
あり、Ｘ線フィルムのデジタル化はデジタイザーにより
自動的に行われるが、フィルムのＩＤ番号や患者名等の
属性を記した文字情報の入力は現在人手により行われて
いる。大きな病院では、一日に処理しなければならない
Ｘ線フィルムの数は数百枚にも昇り、現状のままでは多
大な労力をフィルムの文字情報の入力にかけなければな
らない状態であり、かかる文字情報を自動認識するシス
テムの開発が望まれている。2. Description of the Related Art Currently, most medical images are X-ray films, and digitization of X-ray films is automatically performed by a digitizer. Input is currently being done manually. In a large hospital, the number of X-ray films that must be processed in one day has risen to several hundreds, and under the current circumstances, a great deal of effort has to be put into inputting text information on the film. The development of a system that automatically recognizes is desired.

【０００３】[0003]

【発明が解決しようとする課題】従来より文字認識に関
して種々の方式が試みられているが、文字認識において
は、認識率を１００％にすることは非常に難しく、ま
た、周知のベイズ決定則等を用いて認識率を上げようと
すると、認識に要する時間が掛かってしまうという問題
があった。Various methods have been attempted for character recognition, but it is very difficult to achieve a recognition rate of 100% in character recognition, and the well-known Bayes decision rule, etc. There is a problem that it takes a long time to recognize when trying to increase the recognition rate using.

【０００４】[0004]

【０００５】本発明は、このような従来の課題に鑑みな
されたもので、文字認識の精度を高めつつ短時間で認識
できるようにすることを目的とする。 The present invention has been made in view of such conventional problems, and an object of the present invention is to improve the accuracy of character recognition and enable recognition in a short time .

【０００６】[0006]

【課題を解決するための手段】このため、本発明に係る
文字認識方法は、認識すべき文字の種類を２つのクラス
に分類し、該分類の各組み合わせに対して夫々次式で示
されるフィシャー比を最大とするように形成した合成識
別関数のグループをコンピュータに記憶し、該コンピュ
ータで演算処理を行って、前記記憶された中で最もフィ
シャー比が大きい所定の分類の組み合わせに対応する合
成識別関数を選択し、次に前記合成識別関数で分類され
る２つのクラスのうち分類可能なクラスについて、再度
２つのクラスに分類して前記同様にして所定の分類の組
み合わせに対応する合成識別関数を選択し、かかる選択
を繰り返して全ての種類の文字を認識することを特徴と
する。Therefore, according to the present invention,
The character recognition method has two classes for the types of characters to be recognized.
, And each combination of the classification is shown by the following formula.
The synthetic knowledge formed to maximize the Fischer ratio
A group of different functions is stored in the computer and the computer
The data is processed by the computer and the
When the combination of a predetermined classification with a large shear ratio
Select the discriminant function, and then sort by the composite discriminant function
Of the two classes that can be classified,
Classify into two classes and set a predetermined class as above.
Select the composite discriminant function corresponding to the matching, and select
Characteristic of recognizing all kinds of characters by repeating .

【数２】 [Equation 2]

【０００７】[0007]

【数２】 [Equation 2]

【０００８】[0008]

【０００９】その場合、前記合成識別関数のグループ
は、まず、１つの文字と残りの文字とを識別する第１の
合成識別関数を形成し、次いで、前記残りの文字の中か
ら１つの文字と当該文字を除く残りの文字とを識別する
第２の合成識別関数を形成し、以下同様の操作を繰り返
して全ての文字を認識できるように形成されたものであ
ってもよい。In that case, the group of composite discriminant functions first forms a first composite discriminant function for discriminating one character from the rest of the characters, and then one of the remaining characters. A second composite discriminant function for discriminating the remaining characters excluding the character may be formed, and the same operation may be repeated thereafter to recognize all the characters.

【００１０】そして、前記のようにして形成された各合
成識別関数を用いて認識しようとする文字との相関演算
を行い、相関値の大きい方のクラスを選択して大きなク
ラスから小さなクラスへと分類識別していって特定の文
字として認識するものである。また、文字を認識するま
でに用いられた各合成識別関数との相関演算により求め
られた相関値に基づいて、当該識別結果に誤りを生じる
確率を求めて記憶しておくようにしてもよい。 Then, a correlation calculation with a character to be recognized is performed using each composite discriminant function formed as described above , and a class having a larger correlation value is selected to change from a large class to a small class. They are classified and recognized as specific characters. Further, the probability of causing an error in the identification result may be obtained and stored based on the correlation value obtained by the correlation calculation with each synthetic identification function used until the character is recognized.

【００１１】[0011]

【課題を解決するための手段】そして、文字が誤って認
識されたことが判明したときに、前記記憶された識別結
果の誤りの確率が最も高い合成識別関数を検出し、該識
別結果を変えて識別しなおして文字を認識しなおすよう
にしてもよい。その場合、各文字を認識して得られた文
字の組み合わせを、正しい文字の組み合わせの記録デー
タと照合して確認を行うことにより、誤って認識されて
いる文字の判明を行うようにしてもよい。また、本発明
にかかる文字認識方法として、文字情報の記録されてい
る文字領域を検出するステップと、前記検出された文字
領域より文字情報を抽出するステップと、前記抽出され
た文字情報と、文字に対応した複数の合成識別関数との
演算を行ない、それぞれの演算結果に基づく相関値を出
力するステップと、前記相関値の出力結果に基づいて、
前記複数の合成識別関数のうちから１つを選択し、文字
を特定するステップと、特定された文字をデータベース
中のデータと照合を行なうステップと、前記照合により
不一致となった場合は、前記合成識別関数の出力に基づ
いて前記特定された文字を異なる文字へ変換するステッ
プと、を有するようにしてもよい。 When it is determined that a character is erroneously recognized, the composite discriminant function having the highest probability of error in the stored discrimination result is detected and the discrimination result is changed. The characters may be recognized again by re-identifying. In this case, the character combination obtained by recognizing each character may be collated with the recorded data of the correct character combination for confirmation, so that the character recognized by mistake may be identified. . Also, the present invention
Character information is recorded as a character recognition method for
The step of detecting a character area that is detected, and the detected character
Extracting character information from the area, and
Character information and multiple composite discriminant functions corresponding to the character
The calculation is performed and the correlation value based on each calculation result is output.
Force step, and based on the output result of the correlation value,
Select one from the plurality of composite discriminant functions
Identifying steps and database of identified characters
Performing a data collation in, by the comparison
If they do not match, based on the output of the composite discriminant function,
To convert the specified character to a different character.
And-flops, may have a.

【００１２】[0012]

【００１３】[0013]

【００１４】[0014]

【作用】複数の異なる文字を合成識別関数との相関演算
によって識別する場合、フォントの異なる同一の文字に
対しての相関性が高くとも、他の文字との非相関性が高
くなければ、必ずしも良好な識別機能を有しているとは
いえない。特にフォントの数が増えるに従って同一の文
字に対する相関性を高めようとすると、他の文字との非
相関性は薄れてくる。したがって、識別性能の高い合成
識別関数とは同一の文字に対する相関性の高さと、異な
る文字に対する非相関性の高さとのバランスにより決定
されるといえる。When a plurality of different characters are identified by the correlation calculation with the composite identification function, even if the correlation with the same character with different font is high, but the decorrelation with other characters is not high, it is not always necessary. It cannot be said that it has a good identification function. In particular, if the correlation with the same character is increased as the number of fonts increases, the non-correlation with other characters decreases. Therefore, it can be said that the composite discriminant function having high discriminative ability is determined by a balance between high correlation with respect to the same character and high non-correlation with respect to different characters.

【００１５】本発明でかかる合成識別関数の評価基準と
するフィシャー比において、分子の値は識別すべき２つ
のクラスに対する相関値の相違度を表し、分母の値は各
クラスにおける相関値の分散度の合計を表している。つ
まり、同一の文字に対する相関性が高いほど相関値の分
散度は小さくなるため、分母の値は小さくなり、異なる
文字の非相関性が高いほど、相関値の相違度が大きくな
るため、分子の値は大きくなって、夫々フィシャー比を
大きくさせることとなる。したがって、同一の文字に対
する相関性が高く、異なる文字に対する非相関性も高い
という条件を満たすとき、フィシャー比は最も大きくな
って、識別機能も最大となる。In the Fischer ratio used as the evaluation criterion of the composite discriminant function in the present invention, the value of the numerator represents the degree of difference in the correlation value between the two classes to be discriminated, and the value of the denominator is the degree of dispersion of the correlation value in each class. Represents the total of. In other words, the higher the correlation with respect to the same character, the smaller the degree of dispersion of the correlation value, the smaller the value of the denominator, and the higher the decorrelation of different characters, the greater the degree of difference in the correlation value. The larger the value, the larger the Fischer ratio. Therefore, when the condition that the correlation with the same character is high and the decorrelation with the different character is high, the Fischer ratio becomes the maximum and the identification function also becomes the maximum.

【００１６】以上の点から、フィシャー比を評価基準と
して合成識別関数を形成することにより、識別機能の高
い合成識別関数を得ることができる。また、２つの文字
を識別するだけであれば、単純にフィシャー比が最大と
なるように１つの合成識別関数を形成すればよいが、そ
れ以上の数の文字を識別する場合には、合成識別関数の
数、識別に要する時間等を考慮する必要がある。From the above points, by forming a composite discriminant function using the Fischer ratio as an evaluation criterion, a composite discriminant function having a high discriminating function can be obtained. Further, if only two characters are to be identified, one synthetic identification function may be simply formed so that the Fischer ratio is maximized. It is necessary to consider the number of functions and the time required for identification.

【００１７】所謂ツリー構造等の識別手順により、例え
ば前記１０種類の文字を２クラスに分類し、該分類され
たものを夫々更に２クラスに分類していくというように
大クラスから小クラスへと分類し、これらの分類を合成
識別関数により識別する方法がある。その場合の合成識
別関数を形成する方法として、考えられる分類の組み合
わせの全てに対して、夫々フィシャー比が最大となるよ
うに合成識別関数を形成し、これらの中でフィシャー比
が最大となる分類の組み合わせを選択するようにすれ
ば、大〜小の各クラスで分類識別性能を可及的に高めた
合成識別関数を得ることができる。For example, from the large class to the small class, for example, the above 10 kinds of characters are classified into 2 classes and the classified characters are further classified into 2 classes according to the identification procedure such as a so-called tree structure. There is a method of classifying and classifying these classifications by a composite classification function. As a method of forming a composite discriminant function in that case, a composite discriminant function is formed so that the Fischer ratio is maximized for all possible combinations of classifications, and the classification having the maximum Fischer ratio among these is formed. If a combination of is selected, it is possible to obtain a composite discriminant function in which the classification / discrimination performance is improved as much as possible in each of large and small classes.

【００１８】したがって、前記方法で形成された合成識
別関数を用いて文字認識を行うことにより、高精度で、
かつ、短時間で文字認識を行うことができる。上記の場
合に、各合成識別関数を１つの文字と残りの文字とを識
別するものを形成すれば、より、少ない識別回数で認識
できる確率が向上する。また、各合成識別関数との相関
値に基づいて、識別結果に誤りを生じる確率を記憶して
おくことにより、後に誤りを生じたことが判明したとき
に、該誤識別確率のデータに基づいて、誤識別確率の高
かった合成識別関数の識別結果を変更して、識別しなお
すことができる。かかる誤識別確率最大の合成識別関数
を検出して自動的に識別しなおすようにすれば、より効
率的である。Therefore, by performing character recognition by using the composite discriminant function formed by the above-mentioned method, high accuracy
Moreover, character recognition can be performed in a short time. In the above case, if each synthetic discriminant function is formed so as to discriminate one character from the rest of the characters, the probability of recognition with a smaller number of times of discrimination is improved. Further, by storing the probability that an error will occur in the identification result based on the correlation value with each composite identification function, when it is found that an error occurs later, based on the data of the misidentification probability. The identification result of the composite identification function, which has a high probability of incorrect identification, can be changed and identification can be performed again. It is more efficient to detect such a composite discriminant function having the maximum false discrimination probability and automatically perform discrimination again.

【００１９】一方、前記ツリー構造とは、別に各文字に
対して夫々対応する合成識別関数を形成する方法もあ
る。即ち、各合成識別関数は、所定の文字とそれ以外の
文字とを識別するものであって、該所定の文字との相関
性は高く、それ以外の文字に対しては非相関性を高くす
る必要がある。その場合もフィシャー比を評価基準とし
て形成することで、夫々識別機能を可及的に高めた合成
識別関数のグループを形成することができる。On the other hand, apart from the tree structure, there is also a method of forming a composite discriminant function corresponding to each character. That is, each composite discriminant function discriminates a predetermined character from other characters and has a high correlation with the predetermined character and a high non-correlation with other characters. There is a need. Also in this case, by forming the Fischer ratio as the evaluation reference, it is possible to form a group of composite discriminant functions each having a discriminant function enhanced as much as possible.

【００２０】この合成識別関数のグループを用いて文字
認識を行う場合には、認識しようとする文字を各合成識
別関数に対して相関演算を行い、相関値が最大のものを
選択することで認識が行われる。また、この方法で認識
に誤りがあることが判明した場合には、次に相関値が高
いものを選択して認識することで、正しく訂正される確
率を高めることができる。When character recognition is performed using this group of synthetic discriminant functions, the character to be recognized is subjected to a correlation operation with respect to each synthetic discriminant function and recognized by selecting the one having the maximum correlation value. Is done. Further, when it is found that the recognition has an error by this method, the probability of correct correction can be increased by selecting and recognizing the one having the next highest correlation value.

【００２１】更に、医療用等文字認識を確実なものとす
るために、認識された文字の組み合わせを別途記録され
た正しい文字の組み合わせとの照合を行えばよく、認識
された文字が記録されたデータの中に存在しない場合に
認識に誤りがあると判明するため、それによって前記の
ように識別しなおすことができる。 Further, in order to ensure character recognition for medical purposes, it is sufficient to compare the recognized character combination with a separately recorded correct character combination, and the recognized character is recorded. If it is not present in the data, it will prove to be erroneous and thus can be re-identified as described above .

【００２２】[0022]

【実施例】以下に本発明の実施例を図に基づいて説明す
る。図１は、合成識別関数を用いてＸ線フィルムに記さ
れた文字情報を認識し、かつ、該文字情報認識に先立っ
て本発明にかかる方法で前処理を行う文字情報自動認識
システムの構成を示す。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 shows a configuration of a character information automatic recognition system that recognizes character information written on an X-ray film by using a composite discriminant function and performs preprocessing by the method according to the present invention prior to the character information recognition. Show.

【００２３】図において、Ｘ線が照射され周縁部に文字
情報が書き込まれたＸ線フィルムＦがデジタイザー１に
セットされると、デジタイザー１はＸ線画像及び文字情
報のデータをデジタル化し、画像データとして本発明に
係る文字認識を行う処理コンピュータ２に入力される。
処理コンピュータ２による文字情報自動認識のプロセス
は、図に示すように大きく分けて前処理と文字認識・照
合に分けられる。In the figure, when the X-ray film F irradiated with X-rays and having the character information written in the peripheral portion is set in the digitizer 1, the digitizer 1 digitizes the X-ray image and the character information data to obtain the image data. Is input to the processing computer 2 which performs character recognition according to the present invention.
The process of automatic recognition of character information by the processing computer 2 is roughly divided into preprocessing and character recognition / collation as shown in the figure.

【００２４】前処理では、まず入力されてきたＸ線フィ
ルムＦ上から文字情報の写しこまれている領域を特定
し、文字の傾きを検出して補正し、フィルム挿入方向を
検出して画像を正規の位置に補正した後、文字を切り出
す。文字認識・照合では、合成識別関数（以下ＳＤＦと
いう）を用いて切り出された１文字１文字を認識して文
字情報を構成し、文字情報に付されたチェックサム等の
エラーコレクションコード（ＥＣＣ）を利用してチェッ
クを行い、正しければそのまま、また間違いがあれば文
字の入替えを行った上で、放射線情報システム３に記憶
された患者情報のデータベースとの照合を行う。In the preprocessing, first, the area on the input X-ray film F in which the character information is copied is specified, the inclination of the character is detected and corrected, and the film insertion direction is detected to detect the image. After correcting to the regular position, cut out the characters. In the character recognition / collation, a character is extracted by using a composite identification function (hereinafter referred to as SDF) to form character information, and an error collection code (ECC) such as a checksum attached to the character information is formed. Is used to check, if correct, and if there is a mistake, the letters are replaced, and then collated with the database of patient information stored in the radiation information system 3.

【００２５】次に、各処理の詳細を説明する。前処理
は、入力されたＸ線フィルムの画像データから文字情報
の認識に必要な文字を切り出すことを行う。Ｘ線フィル
ムには、そのサイズにより幾つかの種類があり、例えば
大角と呼ばれる縦１４インチ、横１４インチのＸ線フィ
ルムはサンプリング間隔０．１７５ｍｍ、１０ビット
の階調で２０００×２０００画素にデジタル化する。文
字情報には検査日、患者ＩＤ、患者氏名、生年月日、性
別が記録されている。前処理を大きく文字情報部領域の
特定、文字の傾き補正、フィルムの挿入方向の検出、文
字の切り出し、の４つのプロセスに分け各々の段階での
処理について述べる。Next, the details of each process will be described. In the pre-processing, the characters necessary for recognizing the character information are cut out from the input X-ray film image data. There are several types of X-ray film depending on their size. For example, a 14-inches long and 14-inches long X-ray film called a large angle has a sampling interval of 0. 175 mm, digitized into 2000 × 2000 pixels with 10-bit gradation. The textual information includes the examination date, patient ID, patient name, date of birth, and sex. The pre-processing is roughly divided into four processes of specifying the character information area, correcting the inclination of the character, detecting the insertion direction of the film, and cutting the character, and the processing at each stage will be described.

【００２６】前処理の第１段階はＸ線フィルム上から文
字情報が記載されている領域を特定することを行う。図
２にある病院で使用されている大角のＸ線フィルムの文
字情報が写しこまれている場所の一覧表を示した。候補
となる場所は９箇所であるが、図３に示すようにＸ線フ
ィルムが上下あるいは表裏逆にデジタイズされた場合に
も対応しなければならないので候補となる箇所は合計４
０か所に及ぶ。Ｘ線フィルムにはその他大四つ、六つ
切、半切等の種類があり、種類によって文字情報の写し
こまれている箇所、サイズ等は異なっている。さらに病
院によってもその写し込み方式は異なっており、細かな
対応が必要である。The first step of the pre-processing is to specify the area on the X-ray film where the character information is written. FIG. 2 shows a list of places where the character information of the large-angle X-ray film used in the hospital is copied. There are nine candidate locations, but there are a total of four candidate locations because X-ray film must be digitized upside down or upside down as shown in FIG.
It reaches 0 places. There are other types of X-ray film, such as large four, six-section, and half-section, and the location where the character information is imprinted and the size are different depending on the type. Furthermore, the method of imprinting differs depending on the hospital, and detailed measures are required.

【００２７】そこで、本発明では、文字の傾きの検出及
びフィルムの入力状態の判断を容易に行うため図４に示
すような縦横のラインを文字情報部に付加した。即ち、
縦のラインは文字情報の左側，横のラインは文字情報の
下側に付加してある。かかる文字情報部領域の特定処理
を図５に示したフローチャートに従って説明する。Therefore, in the present invention, vertical and horizontal lines as shown in FIG. 4 are added to the character information portion in order to easily detect the inclination of the character and judge the input state of the film. That is,
The vertical line is added to the left side of the character information, and the horizontal line is added to the lower side of the character information. The process of identifying the character information area will be described with reference to the flowchart shown in FIG.

【００２８】前記メモリに登録された候補となる各領域
毎に画像データの分散値を次式に従って演算する (ステ
ップ１) 。The variance value of the image data is calculated according to the following equation for each candidate area registered in the memory (step 1).

【００２９】[0029]

【数３】 [Equation 3]

【００３０】演算された分散値が最大となる領域を文字
情報部領域として検出する。文字等エッジがあり、か
つ、濃淡がある部分の分散値は大きくなることを利用す
るものである (ステップ２) 。ここで、誤検出されるこ
とを考慮して大きい方から上位３つの領域を候補領域と
して記憶しておき、前記縦横ラインの成分の有無を同時
に検出して、該成分が検出されない場合は、第２, 第３
の候補の領域に切り換える。The area where the calculated variance value is maximum is detected as the character information area. The fact that the variance value of a portion where there is an edge such as a character and there is shading is large is used (step 2). Here, in consideration of being erroneously detected, the top three areas from the largest one are stored as candidate areas, the presence or absence of the components of the vertical and horizontal lines are detected at the same time, and if the components are not detected, 2nd and 3rd
Switch to the candidate area.

【００３１】次に、領域の長辺を水平方向と一致させる
ために分散値が最大の領域が左隅上辺に位置するように
画像データを回転操作する (ステップ３) 。ここで、領
域が角部に位置して直角な両辺を含むような場合は、領
域を左隅上辺に位置させても領域の長辺が垂直方向に一
致することがあるので判別できない。そのような場合
は、後述する縦横のプロジェクション処理における最大
値のもの同士を比較して長辺と短辺とを区別すればよ
い。Next, in order to match the long side of the area with the horizontal direction, the image data is rotated so that the area having the maximum dispersion value is located on the upper side of the left corner (step 3). Here, in the case where the region is located at the corner and includes both sides that are perpendicular to each other, even if the region is located at the upper side of the left corner, the long sides of the region may coincide with each other in the vertical direction, and therefore it cannot be determined. In such a case, the long sides and the short sides may be distinguished by comparing the maximum values in the vertical and horizontal projection processing described later.

【００３２】第２段階は、文字の傾きを検出して該傾き
に対する補正を行う。現在のＸ線検査はネームカードと
呼ばれる患者の属性情報が書き込まれたカードを機械に
Ｘ線検査技師が手で装置に挿入し、光学的にＸ線フィル
ムに焼き込み、患者の情報をフィルムに記録する形式で
ある。ネームカードを手で挿入する方式であるために、
文字がフィルムに対して著しく傾いているものがあり、
それが文字の切り出しの失敗の原因となる場合がある。
そのため文字の傾きの補正が必要である。In the second step, the inclination of the character is detected and the inclination is corrected. In the current X-ray inspection, an X-ray inspection technician manually inserts a card, which is called a name card, into which the attribute information of the patient is written, into the machine, and optically burns it onto an X-ray film to record the patient information on the film. This is the format for recording. Because it is a method to insert the name card by hand,
Some characters are significantly inclined with respect to the film,
It may cause the character cutout failure.
Therefore, it is necessary to correct the inclination of the character.

【００３３】かかる文字の傾き検出及び傾き補正の処理
を、図６に示したフローチャートに従って説明する。特
定された文字情報記載領域のデータ (濃度データ) に対
し、次式で表される微分演算を行う (ステップ11) 。 f(i,j)＝［｛f(i ＋1,j)−f(i,j)｝²＋｛f(i,j ＋1)−
f(i,j)｝²］^1/2 かかる微分演算を行った微分画像に対して、ｎ値化 (閾
値処理) を行う (ステップ12) 。The processing for detecting and correcting the inclination of the character will be described with reference to the flowchart shown in FIG. The differential operation represented by the following equation is performed on the data (density data) in the specified character information description area (step 11). f (i, j) = [ {f (i + 1, j) -f (i, j)} 2 + {f (i, j +1) -
f (i, j)} ² ] ^{1/2 The} n-valued (threshold processing) is performed on the differential image subjected to the differential operation (step 12).

【００３４】かかる処理を施した画像データに対してプ
ロジェクション処理を行う (ステップ13) 。プロジェク
ション処理とは図３に示したように画像データを一方向
に投影したときのデータの合計値の分布を求める処理で
ある。最初は、投影方向を横(水平) 方向に設定して求
める。そして、水平方向に対して所定の傾き角度±β
(例えば±４°) まで単位角度αずつ傾斜させつつ投影
方向のプロジェクション処理を行う (ステップ14→ステ
ップ15→ステップ13) 。前記傾斜は、画像に対してアフ
ィン変換を施すことにより行う。アフィン変換は、変換
後の画像のピクセル（ｘ, ｙ）の階調値を、次式で決定
される変換前の画像のピクセル（ｘ’, ｙ’）の階調値
で決定するものである。A projection process is performed on the image data that has undergone such a process (step 13). The projection process is a process of obtaining the distribution of the total value of the data when the image data is projected in one direction as shown in FIG. Initially, the projection direction is set to the horizontal (horizontal) direction. Then, a predetermined tilt angle ± β with respect to the horizontal direction
Projection processing in the projection direction is performed while inclining each unit angle α to (for example, ± 4 °) (step 14 → step 15 → step 13). The inclination is performed by performing affine transformation on the image. In the affine transformation, the gradation value of the pixel (x, y) of the image after conversion is determined by the gradation value of the pixel (x ', y') of the image before conversion, which is determined by the following equation. .

【００３５】[0035]

【数４】 [Equation 4]

【００３６】但し、ｎ＝１〜β／α かかるプロジェクション処理の終了後、各投影方向のプ
ロジェクション処理におけるデータ合計値の最大値を有
する角度θを求める (ステップ16) 。ここで、文字配列
方向のラインが投影方向と一致したときに該ラインデー
タの合計値が最大となるから、最大値を有する投影方向
は文字配列方向のラインの方向と一致する。したがっ
て、該投影方向の水平方向となす角度θを文字の傾き角
として求めることができるのである。ここで、単位角度
αを小さくするほど、検出精度が上がることは勿論であ
るが、補間によって精度を高めることもできる。However, n = 1 to β / α After completion of the projection process, the angle θ having the maximum value of the total data value in the projection process in each projection direction is obtained (step 16). Here, when the line in the character arrangement direction coincides with the projection direction, the total value of the line data becomes maximum, so the projection direction having the maximum value coincides with the direction of the line in the character arrangement direction. Therefore, the angle θ formed by the projection direction and the horizontal direction can be obtained as the inclination angle of the character. Here, it goes without saying that the smaller the unit angle α, the higher the detection accuracy, but the interpolation can also increase the accuracy.

【００３７】このようにして求めた文字の傾き角θ分だ
け前記アフィン変換により画像を回転させることにより
傾き補正を行う (ステップ17) 。第３段階は、Ｘ線フィ
ルムの挿入方向の判断である。前記したようにＸ線フィ
ルムの正規の挿入方向に対して上下，表裏が逆となる場
合があるため、それらを判別して正規の位置に補正する
ものである。The tilt is corrected by rotating the image by the affine transformation by the tilt angle θ of the character thus obtained (step 17). The third step is to judge the insertion direction of the X-ray film. As described above, there are cases where the top and bottom and the front and back are opposite to the normal insertion direction of the X-ray film, so these are discriminated and corrected to the normal position.

【００３８】かかる、上下及び表裏の正否判別処理につ
いて図７，図８に示したフローチャートに従って説明す
る。前記傾き補正を行った画像に対して、微分演算 (ス
テップ21) ，ｎ値化 (ステップ22) ，横方向のプロジェ
クション処理 (ステップ23) を行った後、最大値が上下
いずれにあるかを判別する (ステップ24) 。簡易的に
は、前記ステップ13〜15で画像を回転させつつ行ったプ
ロジェクション処理において最大値となったものを用い
て、ステップ24へ進んで判別を行ってもよい。The up / down and front / back correctness determination processing will be described with reference to the flowcharts shown in FIGS. After performing the differential operation (step 21), the n-value conversion (step 22), and the horizontal projection process (step 23) on the image for which the tilt correction is performed, it is determined whether the maximum value is above or below. Yes (step 24). In a simple manner, it is possible to proceed to step 24 and make the determination by using the projection image having the maximum value in the projection processing performed while rotating the image in steps 13 to 15.

【００３９】そして、最大値が下側にあると判定された
ときは、そのまま終了するが、上側にあると判定された
ときは、画像の上下方向を反転させる (ステップ25) 。
即ち、横方向のラインは、文字情報の下側に記されてい
るので、最大値として検出される横方向のラインが上側
に位置する場合は、上下反転させることにより上下方向
の向きを正しくすることができる。Then, when it is determined that the maximum value is on the lower side, the process is finished as it is, but when it is determined that it is on the upper side, the vertical direction of the image is reversed (step 25).
That is, since the horizontal line is written on the lower side of the character information, if the horizontal line detected as the maximum value is located on the upper side, it is inverted upside down to correct the vertical direction. be able to.

【００４０】次いで、縦 (垂直) 方向のプロジェクショ
ン処理を行い (ステップ26) 、最大値が左右いずれにあ
るかを判別する (ステップ27) 。そして、左側にあると
判定されたときは、そのまま終了するが、右側にあると
判定されたときは、画像の左右方向を反転させる (ステ
ップ28) 。即ち、縦方向のラインは、文字情報の左側に
記されているので、最大値として検出される縦方向のラ
インが右側に位置する場合は、左右反転させることによ
り左右方向の向きを正しくすることができる (ステップ
29) 。Next, projection processing in the vertical (vertical) direction is performed (step 26), and it is determined whether the maximum value is on the left or right (step 27). Then, when it is determined that the image is on the left side, the process ends as it is, but when it is determined that the image is on the right side, the horizontal direction of the image is reversed (step 28). That is, since the vertical line is written on the left side of the character information, when the vertical line detected as the maximum value is located on the right side, the horizontal direction should be reversed to correct the horizontal direction. Can (step
29).

【００４１】以上の処理により、画像を傾きなく正常な
方向にセットされることとなる。第４段階は、文字の切
り出しである。即ち、文字の認識に際して一つ一つの文
字が記載されている領域を検出することである。かかる
文字の切り出し処理について図９に示したフローチャー
トに従って説明する。By the above processing, the image is set in the normal direction without inclination. The fourth stage is the cutting of characters. That is, it is to detect an area in which each character is described when recognizing a character. The character cutting process will be described with reference to the flowchart shown in FIG.

【００４２】前記のようにして入力方向の補正を行った
画像に対し、微分演算，ｎ値化を行った後、縦方向，横
方向のプロジェクション処理を行う (ステップ31，32)
。但し、かかるプロジェクション処理を新たに行うこ
となく、前記入力方向の補正の際に傾き補正後の画像に
対して縦横方向のプロジェクション処理を行っているの
で、そのデータを正しい方向にセットしなおすようにし
てもよい。The image corrected in the input direction as described above is subjected to the differential operation and the n-value conversion, and then the vertical and horizontal projection processes are performed (steps 31 and 32).
. However, since the projection processing in the vertical and horizontal directions is performed on the image after the tilt correction at the time of correcting the input direction without newly performing such projection processing, it is necessary to reset the data in the correct direction. May be.

【００４３】かかる縦横のプロジェクション処理によ
り、文字の記載部分はデータ量の合計値が大きいので山
状となり、文字と文字との間の部分は空白であるためデ
ータ値が小さくなって谷となるので、この谷の部分を通
る縦横の格子軸で囲まれる領域を１つの文字領域とし
て、各文字の切り出し位置を決定し (ステップ33) 、該
決定された切り出し位置に基づいて文字の切り出しを行
う (ステップ34) 。By the vertical and horizontal projection processing, the character description portion has a large sum of the data amount and thus becomes a mountain shape, and the portion between the characters is a blank because the data value becomes small and becomes a valley. , The area surrounded by the vertical and horizontal grid axes passing through the valley is defined as one character area, the cut-out position of each character is determined (step 33), and the character is cut out based on the determined cut-out position ( Step 34).

【００４４】このようにして前処理を行った後、切り出
された文字の認識・照合を行う。かかる文字認識・照合
の処理を図10に示したフローチャートに従って説明す
る。まず、切り出された文字とＳＤＦとの相関演算を行
うことにより、文字情報の読み取り結果を出力する (ス
テップ41) 。相関演算とは、簡単にいえば入力される文
字パターンベクトルとＳＤＦベクトルとの内積を演算す
ることであり、相関値は、該相関演算された内積値であ
る。文字情報として０〜９までの１０文字を認識する必
要があり、本実施例では、その１０文字を標準パターン
にして１０枚のＳＤＦが合成され、夫々相関値は各文字
のパターンに対しては＋１、その他のパターンに対して
は−１を設定してある。After the preprocessing is performed as described above, the cut out characters are recognized and collated. The character recognition / collation processing will be described with reference to the flowchart shown in FIG. First, the result of reading the character information is output by performing a correlation calculation between the cut out character and the SDF (step 41). In short, the correlation calculation is to calculate the inner product of the input character pattern vector and the SDF vector, and the correlation value is the inner product value obtained by the correlation calculation. It is necessary to recognize 10 characters from 0 to 9 as the character information. In the present embodiment, 10 SDFs are combined using the 10 characters as a standard pattern, and the correlation value is different for each character pattern. +1 is set to +1 and other patterns are set to -1.

【００４５】そして、１つの文字に対して前記１０枚の
ＳＤＦと相関演算を行いその出力値を比較して最も＋１
に近い相関値を出力したＳＤＦを調べることによりその
パターンを決定する (ステップ42) 。図11にその模式図
を示す。ＳＤＦによって読み取られた文字情報は文字情
報の冗長な部分をチェックした後、それを満たしていれ
ばデータベースと照合を行う (ステップ43) 。例えば、
文字情報の冗長性には文字情報に付加されているチェッ
クサムを用いる。チェックサムとは、文字情報のある桁
に冗長性を持たせその桁の番号を他の番号の関数として
決定するもので、誤りの検出能力がある。データベース
中に一致する番号がある場合には、文字情報は正しく読
み取られたものとして認識される。無い場合には、読み
誤りがあるものとして文字情報のある桁に訂正を加え
(ステップ44) 、チェックサムを調べた後再度照合を行
う (ステップ45) 。訂正すべき桁の特定とどの文字に訂
正するかの判断はＳＤＦの相関出力値を比較して行う。
ある程度まで訂正を加えても一致する番号がないもの
は、読み取り不能とする (ステップ46，47) 。また照合
は誤認識を防ぐための照合済の文字情報とも照合を行
い、該当する番号が２組以上存在する場合には共に読み
取り不能を出力する。Then, a correlation operation is performed on one character and the above 10 SDFs, and the output values are compared to obtain the most +1.
The pattern is determined by examining the SDF that has output a correlation value close to (step 42). Figure 11 shows the schematic diagram. In the character information read by the SDF, the redundant part of the character information is checked, and if it is satisfied, then it is collated with the database (step 43). For example,
The checksum added to the character information is used for the redundancy of the character information. The checksum is for determining the number of the digit as a function of other numbers by making a digit of the character information redundant and has an error detecting ability. If there is a matching number in the database, the character information is recognized as having been correctly read. If there is not, correct the digit with the character information as a reading error.
After checking the checksum (step 44), the verification is performed again (step 45). The digit to be corrected and the character to be corrected are determined by comparing the correlation output values of SDF.
If there is no matching number even if it is corrected to some extent, it is considered unreadable (steps 46 and 47). In addition, the collation also collates with the collated character information for preventing erroneous recognition, and when there are two or more sets of corresponding numbers, both are output as unreadable.

【００４６】尚、新たなフォントの文字を識別する場合
には、かかるフォントに対応する標準パターンを加えた
ＳＤＦに修正して追加登録すればよい。認識プロセスの
各段階について計算機シミュレーションを行った。まず
実際のＸフィルムからテストパターンとして文字を切り
出し、文字認識の正読率を調べた。認識手法は既述した
通りである。正読率は、９８．９％であった。この正読
率をもとに文字情報 (ＩＤ番号) の照合と訂正のシミュ
レーションを行った。チェックサムを満たすＩＤ番号を
ランダムに１０００個作りデータベースとする。その各
々のＩＤ番号と同じ番号を上記のテストパターンを組み
合わせて作り、それをＳＤＦで識別した後照合、訂正の
シミュレーションを行った。文字の訂正は７桁のＩＤ番
号のうち３桁まで行った。結果を表１に示す。シミュレ
ーションで読み取り不能とされたＩＤ番号は文字の誤読
が４文字以上あったもので、訂正しきれなかったことが
原因である。各文字に対する正読率が９８．９％である
という結果から７桁全て正しく読む確率は９２．５％で
ある。本手法はＩＤ番号の認識率を９９．７％まで向上
することができることを示した。この結果から前記アル
ゴリズムが有効であるといえる。To identify a character of a new font, the SDF added with a standard pattern corresponding to the font may be modified and additionally registered. Computer simulation was performed for each stage of the recognition process. First, characters were cut out from an actual X film as a test pattern, and the correct reading rate of character recognition was examined. The recognition method is as described above. The correct reading rate was 98.9%. A simulation of collation and correction of character information (ID number) was performed based on this correct reading rate. A database is created in which 1000 ID numbers satisfying the checksum are randomly created. The same number as each ID number was created by combining the above test patterns, and the IDs were identified by SDF, and then simulations of collation and correction were performed. Correction of characters was performed up to 3 digits of the 7-digit ID number. The results are shown in Table 1. The ID number that was made unreadable in the simulation was because there were four or more misreading characters, and it could not be corrected. From the result that the correct reading rate for each character is 98.9%, the probability that all 7 digits are correctly read is 92.5%. It was shown that this method can improve the recognition rate of ID numbers to 99.7%. From this result, it can be said that the above algorithm is effective.

【００４７】[0047]

【表１】 [Table 1]

【００４８】また、Ｘ線フィルムをデジタル化してから
ＩＤ番号を読み取り、データベースと照合した後認識す
る一連のプロセスを通しての実験も行った。これは本研
究で提案したシステムを実際のフィルムを使って作動さ
せてみることにより明らかになる問題点や、変更すべき
点を確認するためのものである。実験に用いた撮影フィ
ルムは、文字情報部に前記縦横のラインを施してある。
これをフィルムの入力の仕方に制限を加えることなくデ
ジタル化し、認識を行った。結果は表２に示したとおり
認識は１２７枚テストした結果、１２４枚 (９７．６
％) であった。読み取り不能とされた３枚の原因はＩＤ
番号の最後の桁の数が文字領域部の端の部分に写しこま
れ、コントラストが悪いため前処理を行っても正しく読
み取れなかったことである。これらＸ線フィルムは本シ
ミュレーションのために作成したもので通常よりもネー
ムカードを斜めに写しこむ等悪い条件のＸ線フィルムで
あることを考慮すると、実際の使用時には認識率はより
向上すると考えられる。Experiments were also carried out through a series of processes in which the X-ray film was digitized, the ID number was read, the ID number was checked against the database, and then recognized. This is to confirm the problems and changes to be clarified when the system proposed in this research is operated by using an actual film. The photographic film used in the experiment has the vertical and horizontal lines in the character information portion.
This was digitized without any restrictions on how to input the film, and recognition was performed. The results are shown in Table 2. As a result of testing 127 sheets of recognition, 124 sheets (97.6 sheets)
%) Met. The three causes of being unreadable are ID
The number of the last digit of the number was imprinted on the edge of the character area and the contrast was poor, so it could not be read correctly even after pre-processing. Considering that these X-ray films were created for this simulation and are in bad conditions such as imprinting a name card obliquely than usual, the recognition rate is considered to be further improved in actual use. .

【００４９】[0049]

【表２】 [Table 2]

【００５０】次に前記文字認識に用いられるＳＤＦをフ
ィシャー比で評価しつつ最適なものに形成する方法の実
施例を説明する。まず、ＳＤＦの機能から説明する。フ
ォントの数がＭ個とすると、同一文字に対するＭ個のフ
ォントのパターンはＮ次元ベクトルで次式のように示さ
れる。Next, an embodiment of a method for forming an optimum SDF used for character recognition while evaluating it with a Fischer ratio will be described. First, the function of the SDF will be described. Assuming that the number of fonts is M, the pattern of M fonts for the same character is represented by an N-dimensional vector as the following equation.

【００５１】[0051]

【数５】 [Equation 5]

【００５２】このように標準パターンとその出力値を決
定すれば、ＳＤＦは設計することができる。(３) 式で
表されるＳＤＦに関する条件式 (２) はＳＤＦをＮ次元
空間のＭ個の標準パターンで張られる部分空間の中のベ
クトルに限定することを意味しており、これより導いた
(３) 式のＳＤＦは数学的な線形方程式 (１) の一般解
とはなっていない。これに対して、Z.Bahri 等は１９８
８年に線形方程式 (１) の式の一般的な解として一般化
ＳＤＦを導き、それまで導かれていた様々なＳＤＦが一
般化ＳＤＦを用いて表せることを示した。By thus determining the standard pattern and its output value, the SDF can be designed. The conditional expression (2) regarding the SDF expressed by the expression (3) means that the SDF is limited to the vector in the subspace defined by the M standard patterns of the N-dimensional space, and is derived from this.
The SDF of equation (3) is not a general solution of mathematical linear equation (1). In contrast, Z. Bahri et al.
In 1988, we derived a generalized SDF as a general solution of the equation of linear equation (1), and showed that various SDFs that had been derived so far can be expressed using the generalized SDF.

【００５３】従来までのＳＤＦ (従来型ＳＤＦ) 、一般
化ＳＤＦを特異値分解法 (ＳＶＤ)による表現で表すと
以下のようになる。The conventional SDF (conventional SDF) and the generalized SDF are expressed as follows by the singular value decomposition method (SVD).

【００５４】[0054]

【数６】 [Equation 6]

【００５５】また、標準パターンに加法的に物体依存で
ないノイズによる設定相関値からのずれを、従来型ＳＤ
Ｆと一般化ＳＤＦについてＳＶＤによる表現で表すと以
下のようになる。Further, the deviation from the set correlation value due to noise that is not additive to the standard pattern and is not object-dependent
F and the generalized SDF are expressed by the SVD expression as follows.

【００５６】[0056]

【数７】 [Equation 7]

【００５７】一般化ＳＤＦは標準パターンに直交する成
分も用いて設計を行うため、設計の際の自由度が従来の
ＳＤＦに比較して非常に多いという特徴がある。前記ノ
イズによるずれから、従来型ＳＤＦと一般化ＳＤＦにつ
いて分散を示すと以下のようになる。Since the generalized SDF is designed also by using the component orthogonal to the standard pattern, it has a feature that the degree of freedom in design is much larger than that of the conventional SDF. From the shift due to the noise, the variances of the conventional SDF and the generalized SDF are shown below.

【００５８】[0058]

【数８】 [Equation 8]

【００５９】ここで、従来のＳＤＦの設定値からのずれ
の分散は標準パターンにのみ依存しているが、一般化Ｓ
ＤＦは標準パターンに直交している成分にも依存してい
る。この点に注目すると、一般化ＳＤＦでは標準パター
ンに直交する成分を次式を満たすように決定すれば、有
色である一般のノイズに対しても安定なＳＤＦの設計を
行える。Here, the variance of the deviation from the set value of the conventional SDF depends only on the standard pattern, but the generalized SDF
The DF also depends on the component that is orthogonal to the standard pattern. Paying attention to this point, in the generalized SDF, if the component orthogonal to the standard pattern is determined so as to satisfy the following equation, a stable SDF can be designed even with general colored noise.

【００６０】[0060]

【数９】 [Equation 9]

【００６１】そこで、フィシャー比を評価基準にしてこ
のβ_iの決定を行い、最適なＳＤＦを形成する。線形判
別関数ｈの関数であるフィシャー比は次式で表される。Therefore, this β _i is determined using the Fischer ratio as an evaluation criterion, and an optimum SDF is formed. The Fischer ratio, which is a function of the linear discriminant function h, is expressed by the following equation.

【００６２】[0062]

【数10】 [Equation 10]

【００６３】このフィシャー比を最大にする線形判別関
数は２クラスの分離を最も強調する多次元空間の射影軸
であり、パターンが多次元空間でガウス分布になると仮
定すればフィシャー比を最大にする条件より最適なＳＤ
Ｆを形成することができる (図12参照) 。さて、フィシ
ャー比は前記ｈ_gを用いて次のように書くことができ
る。クラス１、２には夫々Ｍ₁個、Ｍ₂個の標準パター
ンがあるとし、各クラスの各パターンをｘ₁ ⁽ⁱ⁾等と表
し、その平均ベクトルをバーを付して表せば、分母は、The linear discriminant function that maximizes the Fischer ratio is the projection axis of the multidimensional space that most emphasizes the separation of the two classes, and the Fischer ratio is maximized assuming that the pattern has a Gaussian distribution in the multidimensional space. SD that is more optimal than the conditions
F can be formed (see FIG. 12). Now, the Fischer ratio can be written as follows using the above h _g . If there are M ₁ and M ₂ standard patterns in classes 1 and ₂ , respectively, and each pattern of each class is represented as x ₁ ^(i), etc., and the average vector is represented by a bar, the denominator is ,

【００６４】[0064]

【数11】 [Equation 11]

【００６５】この固有値問題を解き、最大の固有値を持
つ固有ベクトルが求めるｈ_gであるしかし、２クラスの
フィシャー比の場合、Ｄの定義式 (４) より (６) 式の
右辺は、The eigenvector having the maximum eigenvalue is h _{g obtained} by solving this eigenvalue problem. However, in the case of the Fischer ratio of 2 classes, the right side of the equation (6) from the defining equation (4) of D is

【００６６】[0066]

【数12】 [Equation 12]

【００６７】という結論を得る。このようにフィシャー
比を評価基準としてＳＤＦを最適に設計することができ
ることを、明らかにした。かかるフィシャー比を評価基
準として形成された一般化ＳＤＦと前記従来のＳＤＦと
の比較を以下に示す。比較は「０」，「１」の２つの数
字の識別問題に対して行った。標準パターンはＸ線フィ
ルムの文字に前処理を行い、それを上下左右にシフトし
たものを各パターンに対して５枚、計１０枚である。図
13に、これらの標準パターンに対し、(７) 式で決定さ
れるＳＶＤ解析を加え求めた固有値 (図示左上欄) 、同
じく (７) 式で決定される相関出力値 (図示右上欄) 、
最適化されたＳＤＦ及び従来型ＳＤＦのＳＶＤの固有ベ
クトルｕ_iに対する係数 (図示左下欄及び右下欄) を図
13に示す。従来のＳＤＦの相関出力値はパターン「０」
に対して＋１、パターン「１」に対しては−１を設定し
たものである。各パターンに対する相関出力値は図14に
模式的に示すように、多次元空間に分布するパターンを
フィシャー比が最大になるように形成したＳＤＦに射影
したものであり、パターンを分離する際の閾値は射影軸
上での相関値の分布の平均値、及び分散を調べることに
より、解析的に決定することができる。そのためには標
準パターンは可能な限り多くしてＳＤＦを形成すること
が必要である。またＳＤＦの固有ベクトルｕ_iに対する
係数を比較してみると、固有値の比較的小さい第３ベク
トル以下 (ｊ≧３) の係数が従来のＳＤＦは最適なＳＤ
Ｆと比較して大きくなっている。 (＋１，−１) 出力の
ＳＤＦのこの成分が大きくなっている原因は、異なるパ
ターンに対しての同一の＋１、あるいは−１の相関値を
出力するように設計したためである。ＳＶＤの固有値が
小さい成分は相関値への伝達の感度が小さいものであ
り、識別を行う際に重要となるものではない。この成分
が大きくなっていることは、ノイズの成分を拡大する原
因となっている。これに対し、フィシャー比を最大とな
るよに形成されたＳＤＦでは、これらの成分が小さく抑
えられているのは最適化の現れである。We conclude that In this way, it was clarified that the SDF can be optimally designed by using the Fischer ratio as an evaluation criterion. A comparison between the generalized SDF formed using the Fischer ratio as an evaluation criterion and the conventional SDF will be shown below. The comparison was made for the identification problem of two numbers "0" and "1". As for the standard pattern, the characters of the X-ray film are pre-processed, and the characters are shifted vertically and horizontally, 5 sheets for each pattern, that is, 10 sheets in total. Figure
The eigenvalues obtained by adding SVD analysis determined by equation (7) to these standard patterns (in the upper left column in the figure), the correlation output values similarly determined in equation (7) (the upper right column in the figure),
Figure showing the coefficient (bottom left column and bottom right column in the figure) for the eigenvector u _i of the optimized SDF and the SVD of the conventional SDF
Shown in 13. The correlation output value of the conventional SDF is the pattern "0"
Is set to +1 and -1 is set to the pattern "1". As shown schematically in FIG. 14, the correlation output value for each pattern is obtained by projecting a pattern distributed in a multidimensional space onto an SDF formed so as to maximize the Fischer ratio, and a threshold for separating patterns. Can be analytically determined by examining the mean and variance of the distribution of correlation values on the projection axis. For that purpose, it is necessary to form the SDF with as many standard patterns as possible. Further, comparing the coefficients of the SDF with respect to the eigenvector u _i , the coefficient of the third vector or less (j ≧ 3) having a relatively small eigenvalue has the optimum SD in the conventional SDF.
It is larger than F. The reason why this component of the (+1, -1) output SDF is large is that the SDF is designed to output the same +1 or -1 correlation value for different patterns. The component having a small eigenvalue of SVD has low sensitivity of transmission to the correlation value, and is not important when performing discrimination. The increase in this component causes the noise component to expand. On the other hand, in the SDF formed so as to maximize the Fischer ratio, the fact that these components are kept small is a manifestation of optimization.

【００６８】次に、このようなフィシャー比を評価基準
とするＳＤＦの具体的な形成方法の実施例について説明
する。２つの文字を識別するだけであれば、２つの文字
の標準パターンを用いて単純にフィシャー比が最大とな
るように１つのＳＤＦを形成すればよいが、前記Ｘ線フ
ィルムの文字情報認識のように数字のように０〜９まで
１０種類の文字を識別する場合には、複数のＳＤＦを用
いて１つの番号に特定する必要がある。Next, an example of a specific method of forming an SDF using such a Fischer ratio as an evaluation standard will be described. If only two characters are to be identified, one SDF may be simply formed by using a standard pattern of two characters so that the Fischer ratio is maximized. When identifying 10 kinds of characters from 0 to 9 like numbers, it is necessary to specify a single number using a plurality of SDFs.

【００６９】ところで、所謂ツリー構造等の識別手順に
より、例えば前記１０種類の文字を２クラスの文字群に
分類し、該分類されたものを夫々更に２クラスに分類し
ていくというように大クラスから小クラスへと分類し、
これらの分類を合成識別関数により識別する方法があ
る。そこで、第１の方法として、かかるツリー構造にお
けるＳＤＦを形成する方法の実施例を図15に示した識別
ツリー構造の模式図及び図16のフローチャートに従って
説明する。まず、「０」〜「９」の各１個の数字とそれ
以外の９個の数字群との２クラスに識別する10通りの組
み合わせについて、夫々フィシャー比が最大となるよう
にＳＤＦを形成する（ステップ51) 。By the way, for example, the above 10 kinds of characters are classified into a character group of 2 classes by a classification procedure such as a so-called tree structure, and the classified characters are further classified into 2 classes, respectively. Into small classes,
There is a method of identifying these classifications by a composite identification function. Therefore, as a first method, an embodiment of a method for forming an SDF in such a tree structure will be described with reference to the schematic diagram of the identification tree structure shown in FIG. 15 and the flowchart of FIG. First, the SDF is formed so that the Fischer ratio is maximized for each of 10 combinations of two numbers, each of which is a number from "0" to "9" and a group of 9 numbers other than that. (Step 51).

【００７０】次に、前記10個形成されたＳＤＦの中でフ
ィシャー比が最大となるＳＤＦを選択する (ステップ5
2) 。本実施例の場合、フィシャー比が最大となるの
は、「７」とそれ以外の数字群とを識別するＳＤＦであ
り、該フィシャー比の最大値は35．１であった。そこ
で、第１のＳＤＦとして「７」とそれ以外の数字群とを
識別するＳＤＦを選択する。Next, among the 10 SDFs formed, the SDF having the maximum Fischer ratio is selected (step 5
2). In the case of the present example, the SDF that identifies "7" and the other numeral groups has the maximum Fischer ratio, and the maximum value of the Fischer ratio is 35.1. Therefore, as the first SDF, the SDF that identifies “7” and the other number groups is selected.

【００７１】次に前記選択された合成識別関数により識
別される１個の数字、つまり「７」を除く残りの数字群
について、前記同様に１個の数字とそれ以外の８個の数
字群との２クラスに識別する９通りの組み合わせについ
て夫々フィシャー比がフィシャー比が最大となるように
９個のＳＤＦを形成し、その中でフィシャー比が最大と
なるＳＤＦを選択する (ステップ53) 。このとき選択さ
れた第２のＳＤＦは、「６」とそれ以外の数字群とを識
別するＳＤＦであり、フィシャー比は17．５であった。Next, for one number identified by the selected composite discriminant function, that is, for the remaining number groups except "7", one number and eight other number groups other than the same as above. 9 SDFs are formed so that the Fischer ratio is maximized for each of the 9 combinations identified in the two classes, and the SDF that maximizes the Fischer ratio is selected (step 53). The second SDF selected at this time was an SDF that discriminates between “6” and other numeral groups, and the Fischer ratio was 17.5.

【００７２】以下同様にして、残りの数字について１個
の数字と残りの数字とを識別するＳＤＦでフィシャー比
が最大となるものを順次選択していく (ステップ54・・
ステップ70) 。最後に選択される第10のＳＤＦは、２つ
の数字を識別するＳＤＦとなる。本実施例では、「５」
と「３」を識別するＳＤＦであった。結果は、図15に示
すようであった。ＳＤＦにより順次識別される１個の数
字はは、「７」→「６」→「０」→「１」→「４」→
「２」→「８」→「９」→「５」，「３」であり、各Ｓ
ＤＦのフィシャー比は図示の通りである。In the same manner, the SDF for discriminating one number from the remaining numbers and the remaining numbers are sequentially selected so as to maximize the Fisher ratio (step 54 ...
Step 70). The tenth SDF selected last is the SDF that identifies the two numbers. In this embodiment, "5"
It was an SDF that identified "3". The results were as shown in Figure 15. One number sequentially identified by SDF is "7" → "6" → "0" → "1" → "4" →
“2” → “8” → “9” → “5”, “3”, and each S
The Fischer ratio of DF is as shown.

【００７３】このようにすれば、第１〜第10の各識別に
おいて、フィシャー比を評価基準として求められた可及
的に高い識別性能を有するＳＤＦを得ることができ、そ
れによって０〜９の数字を最も良好に認識できる10個の
ＳＤＦグループが得られたことになる。次にフィシャー
比を評価基準として形成されたＳＤＦを用いて文字情報
の認識を行う方法の実施例について説明する。本実施例
では、前記のようにして形成された10個のＳＤＦグルー
プにより前記Ｘ線フィルムの文字情報部に記載されてい
るＩＤ番号の数字を認識する実施例を図17に示したフロ
ーチャートに従って説明する。By doing so, in each of the first to tenth discriminations, it is possible to obtain an SDF having the highest possible discrimination performance obtained by using the Fischer ratio as an evaluation criterion, whereby 0 to 9 are obtained. We now have 10 SDF groups that can best recognize the numbers. Next, an example of a method of recognizing character information using an SDF formed using the Fischer ratio as an evaluation standard will be described. In the present embodiment, an example in which the numeral of the ID number described in the character information section of the X-ray film is recognized by the 10 SDF groups formed as described above will be described with reference to the flowchart shown in FIG. To do.

【００７４】まず、前記のようにして文字情報部から文
字 (数字) の切り出しを行った後、入力された数字のパ
ターンについて、前記第１のＳＤＦと相関演算を行って
相関値ｘを求め、該相関値ｘを閾値と比較して入力数字
パターンが「７」に属するかそれ以外の数字群に属する
かを識別する（ステップ81）。そして、「７」と認識さ
れた場合は、対応する文字情報部の領域における数字が
「７」であることを記憶する。同時に、該第１のＳＤＦ
による識別結果の誤りを生じる確率を前記相関値ｘに基
づいて求める。具体的には、図18に示すように、第１の
ＳＤＦによって相関値ｘに対応する数字が「７」に属す
る確率密度Ｐ１と、「７」以外の数字で「７」に最も相
関性が高い数字「Ｎ」に属する確率密度Ｐ２との比Ｐ２
／Ｐ１を取ることにより求める。Ｐ２／Ｐ１の値は閾値
においては１となり、１に近い値となるほど２つの数字
を正しく識別する確率が小さい、つまり、識別結果に誤
りを生じている確率が高いと評価できるので、この結果
を、誤認識確率Ｅとして同時に記憶しておく（ステップ
82）。First, after cutting out characters (numbers) from the character information section as described above, the correlation pattern x is obtained by performing a correlation operation with the first SDF for the inputted number pattern. The correlation value x is compared with a threshold value to identify whether the input numeral pattern belongs to "7" or other numeral groups (step 81). Then, when it is recognized as "7", it is stored that the number in the area of the corresponding character information portion is "7". At the same time, the first SDF
The probability of causing an error in the discrimination result by is determined based on the correlation value x. Specifically, as shown in FIG. 18, the probability density P1 in which the number corresponding to the correlation value x belongs to “7” by the first SDF and the number other than “7” have the highest correlation with “7”. Ratio P2 with probability density P2 belonging to high number "N"
Calculate by taking / P1. The value of P2 / P1 is 1 at the threshold value, and the closer to 1 the value is, the smaller the probability of correctly distinguishing two numbers, that is, it can be evaluated that the probability of an error in the identification result is high. , And the false recognition probability E is stored at the same time (step
82).

【００７５】また、入力数字パターンが前記第１のＳＤ
Ｆによって「７」以外の数字群に属すると識別された場
合には、前記同様相関値ｘに基づいて該識別結果に誤り
が生じている確率としてＰ１／Ｐ２の値を誤識別確率Ｅ
として記憶しておく（ステップ83）。この場合のＰ１
は、前記「Ｎ」に属する確率密度で、Ｐ２が「７」に属
する確率密度となる。The input numeral pattern is the first SD.
When F is identified as belonging to a number group other than “7”, the value of P1 / P2 is used as the probability that an error has occurred in the identification result based on the correlation value x as described above, and the misidentification probability E
(Step 83). P1 in this case
Is a probability density belonging to “N”, and P2 is a probability density belonging to “7”.

【００７６】次いで、入力数字パターンが第２のＳＤＦ
との相関演算を行い、同様にして「６」と残りの数字群
とのいずれに属するかを識別し、識別結果と誤認識確率
Ｅとを記憶する（ステップ84，85）。かかる操作を、入
力数字パターンが特定の数字に認識されるまで行う (ス
テップ85，86・・・105,106 ) 。前記したように、各Ｓ
ＤＦは、夫々の識別においてフィシャー比を最大にする
ものが用いられるため、高い認識性能が得られる。Next, the input numeral pattern is the second SDF.
Correlation calculation is performed to identify which of "6" and the remaining numeral group belongs, and the identification result and the false recognition probability E are stored (steps 84 and 85). This operation is repeated until the input numeral pattern is recognized by a specific numeral (steps 85, 86 ... 105, 106). As mentioned above, each S
As the DF, one that maximizes the Fischer ratio is used in each identification, and thus high recognition performance is obtained.

【００７７】このようにして、各入力数字パターンにつ
いて認識が終了し、その組み合わせによりＩＤ番号を決
定し、チェックサム等によるチェックを行った後、デー
タベースとの照合を行い、誤りがあると判明したときに
は、以下のように再識別しなおす。各認識された数字に
ついて、それらの識別時に記憶された誤認識確率Ｅの値
に基づいて誤りを生じた確率が最も高いと判断される数
字を選択し、該数字についてＥの値が最大となっている
ＳＤＦの識別結果を見つけ出し、該識別結果を変更して
識別しなおし別の数字を認識する。例えば、図15におい
て「２」と認識した数字が誤りと判明した場合（ステッ
プ201 ）、その認識に用いられた第１〜第６のＳＤＦの
識別結果の誤認識確率Ｅ１〜Ｅ６の値を比較し、第４の
ＳＤＦによる識別結果の誤認識確率Ｅ４が最大である場
合には、その識別結果を変更して「１」として認識しな
おす（ステップ202 ）。Ｅ１〜Ｅ５の中に最大値がある
場合は、識別結果を変更することで、特定の数字を認識
することができる。Ｅ６が最大である場合は、第７のＳ
ＤＦから識別しなおすことになる。In this way, the recognition of each input numeral pattern is completed, the ID number is determined by the combination, and after checking by the checksum etc., it is verified that there is an error by collating with the database. Sometimes it is necessary to re-identify as follows. For each recognized number, the number that is determined to have the highest probability of error based on the value of the misrecognition probability E stored at the time of identification is selected, and the value of E becomes the maximum for that number. The SDF identification result is found, the identification result is changed and identification is performed again, and another number is recognized. For example, when the number recognized as "2" in FIG. 15 is found to be incorrect (step 201), the values of the misrecognition probabilities E1 to E6 of the identification results of the first to sixth SDFs used for the recognition are compared. However, if the erroneous recognition probability E4 of the identification result by the fourth SDF is the maximum, the identification result is changed and recognized again as "1" (step 202). When there is a maximum value among E1 to E5, a specific number can be recognized by changing the identification result. If E6 is the maximum, the seventh S
It will be identified again from the DF.

【００７８】前記の場合、「２」の数字を「１」に変更
して、再度照合を行い、それで見つからない場合は、２
番目に誤認識確率Ｅが高いものの識別結果を変更して識
別しなおすといった方法で訂正を行う。かかる方法によ
れば、誤認識確率Ｅに基づいて訂正精度を高められると
同時に訂正時間を可及的に短縮することができる。In the above case, the number of "2" is changed to "1" and the collation is performed again.
Second, although the recognition error probability E is high, the identification result is changed and the identification is performed again. According to this method, the correction accuracy can be improved based on the error recognition probability E, and at the same time, the correction time can be shortened as much as possible.

【００７９】本実施例では、１個の数字と残りの数字群
とを識別するツリー構造のＳＤＦグループとしたが、例
えば、０〜９を、まず、５個と５個との数字群のクラス
に識別する第１のＳＤＦと、各５個の数字群のクラスを
更に２個と３個との数字群のクラスに識別する第２，第
３のＳＤＦを形成し、更に各クラスを識別していくとい
ったツリー構造のＳＤＦグループを形成するようにして
もよい。In this embodiment, an SDF group having a tree structure for identifying one number and the remaining number group is used, but for example, 0-9 are the classes of five and five number groups first. To form a first SDF for identifying each of the five number groups, and a second and third SDF for identifying each of the five number group classes into two and three number group classes, and further identifying each class. You may make it form the SDF group of the tree structure which goes up.

【００８０】かかるツリー構造の認識方法によれば、認
識までの識別回数が文字数より大幅に少なくて済むた
め、演算時間も短縮できる利点があるが、認識しようと
する各文字とそれ以外の各文字とを識別するＳＤＦをフ
ィシャー比を評価基準として形成し、全ての文字に対応
して文字数分ＳＤＦを形成する方法もある。この方法で
は、識別しようとする所定の文字画像の標準パターン
と、それ以外の文字を重ね合わせた画像の標準パターン
とを用いて、フィシャー比を最大とするＳＤＦを形成す
ればよい。According to such a tree structure recognition method, the number of times of recognition until recognition is significantly smaller than the number of characters, which has the advantage that the calculation time can be shortened. However, each character to be recognized and other characters There is also a method in which SDFs for identifying and are formed using the Fischer ratio as an evaluation criterion, and SDFs corresponding to the number of characters are formed corresponding to all characters. In this method, an SDF that maximizes the Fischer ratio may be formed using a standard pattern of a predetermined character image to be identified and a standard pattern of an image in which other characters are superimposed.

【００８１】かかる方法により形成されたＳＤＦを用い
た文字認識については、前記前処理の実施例で説明した
通りである。Character recognition using the SDF formed by such a method is as described in the embodiment of the preprocessing.

【００８２】[0082]

【発明の効果】以上説明してきたように本発明にかかる
文字認識方法によれば、フィシャー比を評価基準として
形成した識別機能が高い合成識別関数を用いて、ツリー
構造用で大クラスから小クラスへと細分類していくもの
或いは識別しようとする各文字とそれ以外の文字とを識
別するものに対して夫々識別機能を可及的に高めること
ができる。As described above, according to the present invention
According to the character recognition method, the Fischer ratio is used as the evaluation standard.
For a tree structure for subclassifying from a large class to a small class or for distinguishing each character to be distinguished from other characters by using a formed composite function having a high classification function The identification function can be enhanced as much as possible.

【００８３】また、認識に誤りを生じた場合でも、誤り
の高い識別結果を推定して、その結果を変更して再度認
識を行うことにより、正答を獲得するまでの演算時間を
可及的に短縮できる。 Even if an error occurs in the recognition, a highly erroneous identification result is estimated, the result is changed, and the result is recognized again, so that the calculation time until the correct answer is obtained is minimized. Can be shortened .

[Brief description of drawings]

【図１】本発明にかかる文字認識システムの全体構成，
機能を示す図FIG. 1 is an overall configuration of a character recognition system according to the present invention,
Diagram showing functions

【図２】同上実施例に使用されるＸ線フィルムにおける
文字情報記載領域の例を示す図FIG. 2 is a diagram showing an example of a character information writing area in an X-ray film used in the above-mentioned embodiment.

【図３】前記Ｘ線フィルムの入力状態の態様を示す図FIG. 3 is a diagram showing a mode of an input state of the X-ray film.

【図４】同上実施例における文字情報記載部分とプロジ
ェクション処理の結果を示す図FIG. 4 is a diagram showing a character information description portion and a result of projection processing in the embodiment.

【図５】同上実施例の文字情報記載領域の検出を示すフ
ローチャートFIG. 5 is a flowchart showing detection of a character information writing area according to the embodiment.

【図６】同じく文字の傾きの検出及び補正を示すフロー
チャートFIG. 6 is a flow chart showing the detection and correction of the inclination of a character.

【図７】同じく文字の方向，位置を検出して正常に直す
ための前半の処理を示すフローチャートFIG. 7 is a flowchart showing the first half of the process for detecting the direction and position of a character and correcting it normally.

【図８】図６に続く後半の処理を示すフローチャートFIG. 8 is a flowchart showing the latter half of the process following FIG.

【図９】文字の切り出し位置の決定及び切り出し処理を
示すフローチャートFIG. 9 is a flowchart showing determination of a character cutout position and cutout processing.

【図10】同じく文字認識・照合処理を示すフローチャー
トFIG. 10 is a flowchart showing the same character recognition / collation processing.

【図11】ＳＤＦを用いた文字の識別機能を示す図FIG. 11 is a diagram showing a character identification function using SDF.

【図12】複数のパターンを２クラスを分離するＳＤＦへ
の射影状態を示す図FIG. 12 is a diagram showing a projection state of a plurality of patterns onto an SDF that separates two classes.

【図13】フィシャー比に基づいて最適化されたＳＤＦと
従来のＳＤＦとの固有ベクトルｕ_iに対する係数を比較
する図FIG. 13 is a diagram comparing coefficients for the eigenvector u _i of the SDF optimized based on the Fischer ratio and the conventional SDF.

【図14】最適化されたＳＤＦの出力値を示す図FIG. 14 is a diagram showing optimized SDF output values.

【図15】ツリー構造用のＳＤＦによる識別法の模式図FIG. 15 is a schematic diagram of an SDF identification method for a tree structure.

【図16】ツリー構造用のＳＤＦの形成方法の一実施例を
示すフローチャートFIG. 16 is a flowchart showing an embodiment of a method for forming an SDF for tree structure.

【図17】同上方法により形成されたＳＤＦを用いた文字
認識方法の一実施例を示すフローチャートFIG. 17 is a flowchart showing an embodiment of a character recognition method using an SDF formed by the above method.

【図18】同上実施例における誤認識確率を説明するため
の図FIG. 18 is a diagram for explaining an erroneous recognition probability in the same embodiment.

[Explanation of symbols]

２処理用コンピュータＦＸ線フィルム 2 Processing computer F X-ray film

フロントページの続き (72)発明者松井美楯東京都新宿区西新宿１丁目26番２号コニカ株式会社内 (56)参考文献特開昭58−166490（ＪＰ，Ａ) 特開昭63−55677（ＪＰ，Ａ) 特開平２−68684（ＪＰ，Ａ) 特開平４−273590（ＪＰ，Ａ) 特開平６−28525（ＪＰ，Ａ) 大山永昭吉本賢祐山口雅弘，１. ＩＤ認識方法の研究フィルムＩＤの自動認識 −合成識別関数とデータ照合技術を用いて−，ＩＮＮＥＲＶＩＳＩＯＮ，日本，株式会社医療科学社，1993年３月，第８巻第４号，ｐ．34−37 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06K 9/00 - 9/82 Front page continuation (72) Inventor Mitsui Matsui 1-26-2, Nishishinjuku, Shinjuku-ku, Tokyo Konica Corporation (56) Reference JP-A-58-166490 (JP, A) JP-A-63- 55677 (JP, A) JP-A-2-68684 (JP, A) JP-A-4-273590 (JP, A) JP-A-6-28525 (JP, A) Oyama Yamaaki Akira Yoshimoto Kensuke Yamaguchi, 1. ID recognition Research on Method Automatic Recognition of Film ID-Using Synthetic Discriminant Function and Data Matching Technology-, INNERVISION, Japan, Medical Science Co., Ltd., March 1993, Volume 8, No. 4, p. 34-37 (58) Fields investigated (Int.Cl. ⁷ , DB name) G06K 9/00-9/82

Claims

(57) [Claims]

1. A type of character to be recognized is divided into two classes.
, And for each combination of the classification,
The synthetic discriminant function formed to maximize the Fischer ratio
Storing a group of numbers in a computer, the computer
The arithmetic processing is performed in the
-Composite knowledge corresponding to a combination of predetermined classifications with a large ratio
Select another function and then classify by the composite discriminant function 2
Reclassify two of the two classes again
And classify into the same class as above
Select the corresponding composite discriminant function and repeat the selection.
Characterized by returning to recognize all types of characters
Character recognition method. [Equation 1]

2. The group of composite discriminant functions is first 1
Forming a first composite discriminant function for discriminating one character from the remaining letters, and then discriminating one character from the rest of the characters and the rest of the characters excluding the character; characterized in that to form a, and is formed so as to be able to recognize all characters by repeating the same procedure the following
The character recognition method according to claim 1.

3. The group of composite discriminant functions is formed so as to maximize the Fischer ratio of each of the composite discriminant functions for discriminating between each character to be recognized and other characters .
The character recognition method according to claim 1, wherein:

4. A correlation calculation with a character to be recognized is performed by using each composite discriminant function , a class having a larger correlation value is selected, and classification and discrimination are performed from a large class to a small class. 請 characterized by recognizing a character
The character recognition method according to claim 1 or claim 2 .

5. The probability of causing an error in the discrimination result is calculated and stored based on the correlation value obtained by the correlation calculation with each synthetic discrimination function used until the character is recognized. The character recognition method according to claim 4 .

When the 6. was misrecognized character is found, the probability of error in the stored identification result detects the highest synthetic discriminant function, again identified by changing the identification result character The character recognition method according to claim 5 , wherein the character recognition is performed again.

7. A method of performing a correlation operation between each synthetic discriminant function and a character to be recognized to obtain a correlation value respectively, and recognizing as a character corresponding to the largest correlation value among these. The character recognition method according to claim 3 .

8. The character recognition method according to claim 7, wherein when the character is found to be erroneously recognized, it is recognized again as a character corresponding to the next largest correlation value.

9. The character which is erroneously recognized is identified by checking the combination of characters obtained by recognizing each character with the recorded data of the correct combination of characters. The character recognition method according to any one of claims 4 to 8 .

10. A step of detecting a character area in which character information is recorded, a step of extracting character information from the detected character area, the extracted character information, and a group of the composite identification function.
Performs calculation of flops, and outputting a correlation value based on the respective operation result based on the output result of the correlation value, of the synthetic discriminant function
The step of selecting one from the group and specifying the character, the step of matching the specified character with the data in the database, and the step of outputting the composite discriminant function when there is a mismatch The character recognition method according to any one of claims 1 to 9, further comprising: converting the specified character into a different character based on the character.