JP2865611B2

JP2865611B2 - Character feature extraction method and device, and character recognition method and device

Info

Publication number: JP2865611B2
Application number: JP8055124A
Authority: JP
Inventors: 博光川尻; 隆敏吉川; 洋堀井; 田中　　淳司
Original assignee: Tottori Sanyo Electric Co Ltd; Sanyo Denki Co Ltd
Current assignee: Sanyo Electric Co Ltd; Sanyo Denki Co Ltd
Priority date: 1996-03-12
Filing date: 1996-03-12
Publication date: 1999-03-08
Anticipated expiration: 2016-03-12
Also published as: JPH09245123A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、文字の特徴を抽出
する方法及び装置に係り、詳しくは、読み取った文字を
認識する文字認識の前処理として行われる文字の特徴抽
出方法及び装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and an apparatus for extracting a characteristic of a character, and more particularly, to a method and an apparatus for extracting a characteristic of a character which are performed as preprocessing of character recognition for recognizing a read character.

【０００２】[0002]

【従来の技術】従来、用紙に印刷あるいは手書きされた
文字をＣＣＤ（固体撮像素子）イメージセンサ等を用い
て読み取り、その文字を認識する装置が知られている。
読み取られた文字の画像データは、２値化回路により２
値画像データに変換され、この２値画像から各文字領域
が１文字単位で切り出される。さらに、切り出された各
文字領域は、マトリクス状（例えば８×８）のメッシュ
領域（小領域）に分割され、正規化される（所謂、大き
さの正規化）。そして、正規化された文字領域の各メッ
シュ領域（この場合、６４個）内に存在する黒画素の数
が求められる。次に、各メッシュ領域の黒画素数が全メ
ッシュ領域中の最大黒画素数で除算され、更に、得られ
た値に２５５がそれぞれ乗算されて、２５６階調のメッ
シュ濃度が得られる（所謂、濃度の正規化）。このメッ
シュ濃度を得ることで、切り出された個々の文字の特徴
が抽出される。そして、抽出された文字の特徴量と、予
め用意された辞書中の各文字パターンの特徴量との比較
演算処理による文字認識、あるいは、文字パターンを学
習するニューラルネットワークを用いた文字認識が行わ
れ、その文字認識結果が表示装置の画面に表示される。2. Description of the Related Art Conventionally, there has been known an apparatus which reads characters printed or handwritten on paper using a CCD (solid-state image sensor) image sensor or the like and recognizes the characters.
The image data of the read character is converted into two by the binarization circuit.
It is converted into value image data, and each character area is cut out from the binary image in units of one character. Furthermore, each cut-out character area is divided into a matrix area (for example, 8 × 8) mesh area (small area) and normalized (so-called size normalization). Then, the number of black pixels existing in each mesh area (64 in this case) of the normalized character area is obtained. Next, the number of black pixels in each mesh area is divided by the maximum number of black pixels in all the mesh areas, and the obtained values are multiplied by 255, respectively, to obtain a mesh density of 256 gradations (so-called, Concentration normalization). By obtaining this mesh density, the characteristics of the cut-out individual characters are extracted. Then, character recognition is performed by comparing and calculating the feature amount of the extracted character and the feature amount of each character pattern in a dictionary prepared in advance, or character recognition using a neural network for learning the character pattern is performed. The character recognition result is displayed on the screen of the display device.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、上記し
た文字認識処理においては、読み取り時における用紙と
ＣＣＤイメージセンサとの相対的な位置ずれ等により、
図５に示すように、同じ文字「１」の１回目の読み取り
で切り出された文字領域ＷＥ１と、２回目の読み取りで
切り出された文字領域ＷＥ２とは、異なる黒画素の分布
を示す。従って、大きさの正規化後、文字領域ＷＥ１に
おける各メッシュ領域の黒画素数と、文字領域ＷＥ２に
おける各メッシュ領域の黒画素数も異なることになる。
この結果、濃度の正規化を行ったときに文字領域のメッ
シュ濃度が大きく変動するという問題がある。すなわ
ち、同じ文字であるにもかかわらず、メッシュ濃度が変
動することにより、辞書やニューラルネットワークを用
いた文字認識において誤認識をもたらす原因となる。However, in the above-described character recognition processing, due to the relative displacement between the sheet and the CCD image sensor during reading, etc.
As shown in FIG. 5, the character region WE1 cut out by the first reading of the same character “1” and the character region WE2 cut out by the second reading show different distributions of black pixels. Therefore, after the size is normalized, the number of black pixels in each mesh area in the character area WE1 is different from the number of black pixels in each mesh area in the character area WE2.
As a result, there is a problem that the mesh density of the character area fluctuates greatly when the density is normalized. That is, even though the characters are the same, a change in the mesh density causes erroneous recognition in character recognition using a dictionary or a neural network.

【０００４】又、全く同じ形状サイズの文字でも上述し
たことが発生する。ましてや、実際に文字認識装置にお
いて文字の大きさの正規化及び濃度の正規化を行った場
合、元の文字の大きさや形状等により、メッシュ濃度が
全体的に薄い場合や逆に濃い場合があり、文字の特徴に
ばらつきが生じる。このような場合でも、上記した文字
認識において誤認識を引き起こす要因となり得る。[0004] In addition, the above-described problem occurs even with characters having exactly the same shape and size. Furthermore, when character size normalization and density normalization are actually performed in a character recognition device, the mesh density may be low or high depending on the size and shape of the original character. , The characteristics of the characters vary. Even in such a case, the above-described character recognition may cause erroneous recognition.

【０００５】本発明は上記問題点を解決するためになさ
れたものであって、その目的は、文字の特徴量のばらつ
きを少なくすることができる文字の特徴抽出方法及びそ
の装置を提供することにある。SUMMARY OF THE INVENTION The present invention has been made to solve the above problems, and an object of the present invention is to provide a character feature extraction method and apparatus capable of reducing variations in character feature amounts. is there.

【０００６】又、本発明は、文字認識のために抽出され
る文字の特徴量のばらつきを少なくして、文字認識を容
易に行うことができる文字認識方法及びその装置を提供
することにある。Another object of the present invention is to provide a character recognizing method and a character recognizing method capable of easily performing character recognition by reducing the variation in the characteristic amount of characters extracted for character recognition.

【０００７】[0007]

【課題を解決するための手段】上記問題点を解決するた
め請求項１に記載の発明では、文字データ領域を形成す
る各小領域の濃度値を算出し、その算出された前記各小
領域の濃度値の平均が、その濃度値のダイナミックレン
ジを保持しつつ、所定の平均濃度値に近似するように、
前記各小領域の濃度値を変換する文字の特徴抽出方法を
要旨とする。According to the first aspect of the present invention, a density value of each small area forming a character data area is calculated, and the density value of each of the calculated small areas is calculated. The average of the density values is similar to a predetermined average density value while maintaining the dynamic range of the density value,
The gist is a character feature extraction method for converting the density value of each small area.

【０００８】請求項２に記載の発明では、文字データ領
域を形成する各小領域における文字データの構成要素の
占める割合に基づいて、前記各小領域の濃度値を算出す
るステップを備えた文字の特徴抽出方法において、前記
算出ステップは、前記各小領域の濃度値の平均が、その
濃度値のダイナミックレンジを保持しつつ、所定の平均
濃度値に近似するように、前記各小領域の濃度値を変換
するステップを含むことを要旨とする。[0008] In the invention according to the second aspect, a step of calculating a density value of each of the small areas based on a ratio of constituent elements of the character data in each of the small areas forming the character data area is provided. In the feature extraction method, the calculating step may be performed such that an average of the density values of the respective small areas is close to a predetermined average density value while maintaining a dynamic range of the density values. Is included.

【０００９】請求項３に記載の発明では、文字データ領
域を形成する各小領域における文字データの構成要素の
占める割合に基づいて、前記各小領域の濃度値を算出す
るステップを備えた文字の特徴抽出方法において、前記
算出ステップは、前記各小領域の濃度値の平均が、その
濃度値のダイナミックレンジを保持しつつ、所定の平均
濃度値に近似するように、前記各小領域の濃度値を変換
するステップを含み、前記所定の平均濃度値は、変換さ
れた前記各小領域の濃度値を、変換前の前記各小領域の
濃度値のバランス状態に保つことが可能な値に設定され
ることを要旨とする。According to a third aspect of the present invention, there is provided a method for calculating the density value of each of the small areas based on the ratio of constituent elements of the character data in each of the small areas forming the character data area. In the feature extraction method, the calculating step may be performed such that an average of the density values of the respective small areas is close to a predetermined average density value while maintaining a dynamic range of the density values. The predetermined average density value is set to a value that can maintain the converted density value of each of the small areas in a balanced state of the density value of each of the small areas before conversion. The point is that

【００１０】請求項４に記載の発明では、請求項２又は
３に記載の方法において、前記構成要素は、文字データ
を構成する黒画素データであり、前記変換ステップにお
いて、その黒画素データが存在する各小領域の濃度値が
変換されることを要旨とする。According to a fourth aspect of the present invention, in the method according to the second or third aspect, the constituent element is black pixel data constituting character data, and the black pixel data exists in the conversion step. The point is that the density value of each small area to be converted is converted.

【００１１】請求項５に記載の発明では、文字データ領
域を形成する各小領域において文字データの構成要素の
占める割合を示す値を算出する第１ステップと、その算
出された各小領域の値を正規化して、濃度値を算出する
第２ステップとを備えた文字の特徴抽出方法において、
前記第１ステップの後、前記第２ステップの前に、前記
各小領域における濃度値の平均が、その濃度値のダイナ
ミックレンジを保持しつつ、所定の平均濃度値に近似す
るように、前記各小領域における構成要素の割合を示す
値を変換するステップを備えた文字の特徴抽出方法を要
旨とする。According to the fifth aspect of the present invention, a first step of calculating a value indicating a ratio of constituent elements of character data in each small area forming a character data area, and a calculated value of each small area And a second step of calculating a density value.
After the first step and before the second step, the average of the density values in each of the small areas is approximated to a predetermined average density value while maintaining the dynamic range of the density values. A gist of the present invention is a character feature extraction method including a step of converting a value indicating a ratio of a component in a small region.

【００１２】請求項６に記載の発明では、請求項５に記
載の方法において、前記構成要素は、文字データを構成
する黒画素データであり、前記変換ステップにおいて、
その黒画素データが存在する小領域について、当該黒画
素データの割合を示す値が変換されることを要旨とす
る。According to a sixth aspect of the present invention, in the method according to the fifth aspect, the constituent element is black pixel data constituting character data, and in the conversion step,
The gist is that the value indicating the ratio of the black pixel data is converted for the small area where the black pixel data exists.

【００１３】請求項７に記載の発明では、文字データの
領域を形成する各小領域の濃度値を算出し、この算出さ
れた前記各小領域の濃度値を所定のレンジ幅の濃度値に
変換する文字の特徴抽出方法において、この文字データ
の各領域の変換前の濃度値の上下関係を保ちながら、こ
の文字データの各構成要素の占める領域の変換後の濃度
値の平均値が予め定めた所定値に近づくように、前記濃
度値の変換を行う文字の特徴抽出方法を要旨とする。According to the present invention, the density value of each small area forming the character data area is calculated, and the calculated density value of each small area is converted into a density value of a predetermined range width. In the character feature extraction method, the average value of the converted density values of the area occupied by each component of the character data is determined in advance while maintaining the vertical relationship of the density values of each area of the character data before conversion. A gist of the present invention is a character feature extraction method for converting the density value so as to approach a predetermined value.

【００１４】請求項８に記載の発明では、文字データ領
域を複数の小領域に分割して各小領域の大きさを正規化
するステップと、その正規化された各小領域における文
字データの構成要素の占める割合に基づいて、前記各小
領域の濃度値を算出することにより文字の特徴を抽出す
るステップと、その抽出された文字の特徴に基づいて当
該文字を識別するステップとを備えた文字の特徴抽出方
法において、前記抽出ステップは、前記各小領域の濃度
値の平均が、その濃度値のダイナミックレンジを保持し
つつ、所定の平均濃度値に近似するように、前記各小領
域の濃度値を変換するステップを含む文字認識方法を要
旨とする。According to the present invention, the character data area is divided into a plurality of small areas to normalize the size of each small area, and the structure of the character data in each normalized small area. A character extracting step of calculating a density value of each of the small areas based on a ratio of the element to extract a characteristic of the character; and a step of identifying the character based on the characteristic of the extracted character. In the feature extraction method, the extracting step may be performed so that an average of the density values of the respective small areas is close to a predetermined average density value while maintaining a dynamic range of the density values. The gist is a character recognition method including a step of converting a value.

【００１５】請求項９に記載の発明では、文字データ領
域を複数の小領域に分割して各小領域の大きさを正規化
するステップと、その正規化された各小領域における文
字データの構成要素の占める割合に基づいて、前記各小
領域の濃度値を算出することにより文字の特徴を抽出す
るステップと、その抽出された文字の特徴に基づいて当
該文字を識別するステップとを備えた文字認識方法にお
いて、前記抽出ステップは、前記各小領域の濃度値の平
均が、その濃度値のダイナミックレンジを保持しつつ、
所定の平均濃度値に近似するように、前記各小領域の濃
度値を変換するステップを含み、前記所定の平均濃度値
は、変換された前記各小領域の濃度値を、変換前の前記
各小領域の濃度値のバランス状態に保つことが可能な値
に設定される文字認識方法を要旨とする。According to the ninth aspect of the present invention, the step of dividing the character data area into a plurality of small areas to normalize the size of each small area, and the structure of the character data in each normalized small area A character extracting step of calculating a density value of each of the small areas based on a ratio of the element to extract a characteristic of the character; and a step of identifying the character based on the characteristic of the extracted character. In the recognition method, in the extracting step, the average of the density values of each of the small areas holds a dynamic range of the density value,
Converting the density value of each of the small areas so as to approximate a predetermined average density value, wherein the predetermined average density value is the density value of each of the converted small areas, The gist is a character recognition method that is set to a value that can maintain the balance of the density value of a small area.

【００１６】請求項１０に記載の発明では、請求項８又
は９に記載の方法において、前記構成要素は、文字デー
タを構成する黒画素データであり、前記変換ステップに
おいて、その黒画素データが存在する各小領域の濃度値
が変換されることを要旨とする。According to a tenth aspect of the present invention, in the method according to the eighth or ninth aspect, the constituent element is black pixel data constituting character data, and the black pixel data exists in the conversion step. The point is that the density value of each small area to be converted is converted.

【００１７】請求項１１に記載の発明では、文字データ
領域を形成する各小領域内における前記文字データの構
成要素の占める割合に基づいて、前記各小領域の濃度値
を算出する手段と、前記算出手段により算出された前記
各小領域の濃度値の平均が、その濃度値のダイナミック
レンジを保持しつつ、所定の平均濃度値に近似するよう
に、前記各小領域の濃度値を変換する手段とを備えた文
字の特徴抽出装置を要旨とする。The invention according to claim 11, wherein the density value of each of the small areas is calculated based on the ratio of the constituent elements of the character data in each of the small areas forming the character data area; Means for converting the density value of each of the small areas so that the average of the density values of each of the small areas calculated by the calculation means is close to a predetermined average density value while maintaining the dynamic range of the density value. The gist is a character feature extraction device including

【００１８】請求項１２に記載の発明では、請求項１１
に記載の装置において、前記所定の平均濃度値は、前記
変換手段により変換された前記各小領域の濃度値を、変
換前の前記各小領域の濃度値のバランス状態に保つこと
が可能な値に設定されることを要旨とする。[0018] According to the invention described in claim 12, in the invention described in claim 11,
In the apparatus according to (1), the predetermined average density value is a value capable of keeping the density value of each of the small areas converted by the conversion unit in a balanced state of the density value of each of the small areas before conversion. It is assumed that it is set to.

【００１９】請求項１３に記載の発明では、文字データ
領域を複数の小領域に分割して各小領域の大きさを正規
化する手段と、前記正規化手段により正規化された各小
領域内における前記文字データの構成要素の占める割合
に基づいて、前記各小領域の濃度値を算出する手段と、
前記算出手段により算出された前記各小領域の濃度値の
平均が、その濃度値のダイナミックレンジを保持しつ
つ、所定の平均濃度値に近似するように、前記各小領域
の濃度値を変換する手段と、前記変換手段により変換さ
れた前記各小領域の濃度値に基づいて、当該文字を識別
する手段とを備えた文字認識装置を要旨とする。According to the thirteenth aspect of the present invention, the character data area is divided into a plurality of small areas to normalize the size of each small area, and the size of each small area normalized by the normalizing means is reduced. Means for calculating the density value of each of the small areas, based on the ratio of the components of the character data in
The density value of each of the small areas is converted such that the average of the density values of each of the small areas calculated by the calculation means is close to a predetermined average density value while maintaining the dynamic range of the density value. A gist of the present invention is a character recognition device comprising: means for identifying the character based on the density value of each of the small areas converted by the conversion means.

【００２０】請求項１４に記載の発明では、請求項１３
に記載の装置において、前記所定の平均濃度値は、前記
変換手段により変換された前記各小領域の濃度値を、変
換前の前記各小領域の濃度値のバランス状態に保つこと
が可能な値に設定されることを要旨とする。According to the fourteenth aspect of the present invention, in the thirteenth aspect,
In the apparatus according to (1), the predetermined average density value is a value capable of keeping the density value of each of the small areas converted by the conversion unit in a balanced state of the density value of each of the small areas before conversion. It is assumed that it is set to.

【００２１】請求項１５に記載の発明では、請求項１３
又は１４に記載の文字認識装置を備えたファクシミリ装
置を要旨とする。（作用）請求項１，２及び１１に記載の発明によれば、
各小領域の濃度値の平均が、その濃度値のダイナミック
レンジを保持しつつ、所定の平均濃度値に近似するよう
に、各構成要素の濃度値が変換される。従って、変換後
の濃度値の平均がほぼ一定に維持されて、文字の特徴量
のばらつきが少なくなる。According to the invention of claim 15, claim 13 is provided.
Or a facsimile device provided with the character recognition device described in 14 above. (Operation) According to the invention described in claims 1, 2, and 11,
The density value of each component is converted such that the average of the density values of each small area approximates a predetermined average density value while maintaining the dynamic range of the density value. Therefore, the average of the converted density values is maintained substantially constant, and the variation in the character feature amount is reduced.

【００２２】請求項３及び１２に記載の発明によれば、
変換前の各構成要素の濃度値のバランス状態を保ちつ
つ、各小領域の濃度値の変換を行うことが可能となる。
請求項４及び１０に記載の発明によれば、黒画素データ
が存在する各小領域の濃度値のみが効率よく変換され
る。According to the third and twelfth aspects of the invention,
The conversion of the density value of each small area can be performed while maintaining the balance state of the density values of the respective components before conversion.
According to the fourth and tenth aspects of the present invention, only the density value of each small area where the black pixel data exists is efficiently converted.

【００２３】請求項５に記載の発明によれば、各小領域
の濃度値が算出される前に、各構成要素の占める割合を
示す値の変換が行われる。請求項６に記載の発明によれ
ば、黒画素データが存在する各小領域について、当該黒
画素データの割合を示す値のみが効率よく変換される。According to the fifth aspect of the present invention, before the density value of each small area is calculated, the value indicating the ratio of each constituent element is converted. According to the sixth aspect of the present invention, for each small area where black pixel data exists, only the value indicating the ratio of the black pixel data is efficiently converted.

【００２４】請求項７に記載の発明によれば、文字デー
タの各領域の変換前の濃度値の上下関係が保たれつつ、
この文字データの各構成要素の占める領域の変換後の濃
度値の平均値が予め定めた所定値に近づくように、濃度
値の変換が行われる。According to the seventh aspect of the present invention, the vertical relationship between the density values before conversion of each area of the character data is maintained,
The conversion of the density values is performed so that the average value of the converted density values of the area occupied by the respective constituent elements of the character data approaches a predetermined value.

【００２５】請求項８及び１３に記載の発明によれば、
各構成要素の濃度値が変換されて文字の特徴量のばら
つきが少ない状態で文字認識を容易に行うことが可能と
なる。According to the invention described in claims 8 and 13,
Character density can be easily recognized in a state where the density value of each component is converted and the variation in the characteristic amount of the character is small.

【００２６】請求項９及び１４に記載の発明によれば、
各構成要素の濃度値が変換され文字の特徴量のばらつき
が少なく、かつ、変換前の各構成要素の濃度値のバラン
ス状態が保たれた状態で文字認識を容易かつ正確に行う
ことが可能となる。According to the ninth and fourteenth aspects of the invention,
It is possible to easily and accurately perform character recognition in a state in which the density value of each component is converted, the variation in the characteristic amount of the character is small, and the balance state of the density value of each component before conversion is maintained. Become.

【００２７】請求項１５に記載の発明によれば、請求項
１２又は１３に記載の文字認識装置を備えたファクシミ
リ装置が提供される。According to a fifteenth aspect of the present invention, there is provided a facsimile apparatus including the character recognition device according to the twelfth or thirteenth aspect.

【００２８】[0028]

【発明の実施の形態】以下、本発明を具体化した一実施
の形態について図１〜図４に従って説明する。本実施の
形態では、文字の特徴抽出を行う文字認識装置がファク
シミリ装置に適用されている。なお、この種のファクシ
ミリ装置の更に詳細な説明については、特願平７−２９
９６０９号に記載されているので、ここでは簡単に説明
する。図１にファクシミリ装置１の電気的構成を示すブ
ロック図である。同図に示すように、ファクシミリ装置
１は、用紙挿入部１ａに用紙挿入用ローラ２を備え、そ
の挿入口には用紙検知器３及びＣＣＤ（Charge Coupled
Device ）イメージセンサ４が配設されている。ＣＣＤ
イメージセンサ４は２値化回路５，画像メモリ６を介し
て制御部７に接続されており、この制御部７には、用紙
検知器３，用紙挿入用ローラ２を駆動するモータ８の駆
動制御を行うモータ駆動回路９，入力部１０，表示部１
１の表示制御を行う表示制御回路１２，トーンダイヤル
部１３及びモデム１４が接続されている。トーンダイヤ
ル部１３及びモデム１４は、電話回線に接続されたＮＣ
Ｕ（網制御装置）１５に接続されている。制御部７は、
文字切り出し処理部１６，正規化処理部１７，文字認識
部１８，辞書１９，電話番号管理処理部２０，ＦＡＸ符
号化処理部２１を備えている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be described below with reference to FIGS. In the present embodiment, a character recognition device that performs character feature extraction is applied to a facsimile device. For a more detailed description of this type of facsimile apparatus, refer to Japanese Patent Application No. 7-29.
No. 9609, it will be briefly described here. FIG. 1 is a block diagram showing an electrical configuration of the facsimile machine 1. As shown in FIG. 1, the facsimile apparatus 1 includes a paper insertion roller 2 in a paper insertion section 1a, and a paper detector 3 and a CCD (Charge Coupled) at its insertion opening.
Device) An image sensor 4 is provided. CCD
The image sensor 4 is connected to a control unit 7 via a binarization circuit 5 and an image memory 6. The control unit 7 includes a paper detector 3, a drive control of a motor 8 for driving the paper insertion roller 2. Drive circuit 9, input unit 10, display unit 1 for performing
1, a display control circuit 12 for performing display control, a tone dial unit 13, and a modem 14 are connected. The tone dial unit 13 and the modem 14 are connected to an NC connected to a telephone line.
U (network control device) 15. The control unit 7
It includes a character cutout processing section 16, a normalization processing section 17, a character recognition section 18, a dictionary 19, a telephone number management processing section 20, and a FAX encoding processing section 21.

【００２９】用紙検知器３は、例えば光学式センサから
なり、用紙挿入部１ａに用紙Ｐが挿入されたことを検知
するもので、この用紙検知器３からの検知信号は制御部
７に出力される。ＣＣＤイメージセンサ４は多数の固体
撮像素子より構成され、用紙Ｐに記載された文字の光学
像に応じたアナログ画像信号ＡＳを生成する。この生成
された画像信号ＡＳは２値化回路５に出力される。２値
化回路５は、ＣＣＤイメージセンサ４から順次出力され
たアナログ画像信号ＡＳを、白黒２値のデジタル画像信
号（以下、２値画像信号という）ＩＤに変換する。この
変換された２値画像信号ＩＤは画像メモリ６に出力され
る。The paper detector 3 comprises, for example, an optical sensor and detects that the paper P has been inserted into the paper insertion section 1a. A detection signal from the paper detector 3 is output to the control section 7. You. The CCD image sensor 4 is composed of a number of solid-state imaging devices, and generates an analog image signal AS corresponding to an optical image of a character written on the paper P. The generated image signal AS is output to the binarization circuit 5. The binarizing circuit 5 converts the analog image signal AS sequentially output from the CCD image sensor 4 into a black and white binary digital image signal (hereinafter, referred to as a binary image signal) ID. The converted binary image signal ID is output to the image memory 6.

【００３０】画像メモリ６は、ＲＡＭ（ランダムアクセ
スメモリ）より構成され、２値化回路５から入力される
２値画像信号ＩＤを、制御部７からのアドレス信号に応
じたメモリ領域に画像データとして読出し可能に記憶す
る。制御部７は用紙検知器３から検知信号に基づき、Ｃ
ＣＤイメージセンサ４に読み取り速度を制御する読み取
りタイミング信号を出力するとともに、この読み取りタ
イミング信号に同期してモータ８が駆動されるようにモ
ータ駆動回路９に駆動制御信号を出力する。The image memory 6 is composed of a RAM (random access memory), and stores a binary image signal ID input from the binarization circuit 5 as image data in a memory area corresponding to an address signal from the control unit 7. It is stored in a readable manner. The control unit 7 performs C based on the detection signal from the paper detector 3.
A read timing signal for controlling the read speed is output to the CD image sensor 4 and a drive control signal is output to the motor drive circuit 9 so that the motor 8 is driven in synchronization with the read timing signal.

【００３１】入力部１０は、電話番号を入力する番号入
力キーやＦＡＸ送信キーなどを備えており、キー入力デ
ータ信号を制御部７に出力する。制御部７は、図２に示
すように、用紙Ｐの先頭の所定領域Ａに書かれた電話番
号ＴＮの読み取り画像データが格納された画像メモリ６
からその画像データを読み出して、文字切り出し処理部
１６，正規化処理部１７及び文字認識部１８による文字
認識処理を行う。The input unit 10 includes a number input key for inputting a telephone number, a facsimile transmission key, and the like, and outputs a key input data signal to the control unit 7. As shown in FIG. 2, the control unit 7 stores the image memory 6 in which the read image data of the telephone number TN written in the predetermined area A at the head of the paper P is stored.
Then, the image data is read out from, and character recognition processing is performed by the character cutout processing unit 16, the normalization processing unit 17, and the character recognition unit 18.

【００３２】文字切り出し部１６は、画像メモリ６から
読み出した２値画像データの画素に対して文字列の傾斜
角を想定して種々の傾斜角に対する垂直成分ヒストグラ
ムを求める。そして、文字切り出し部１６は、各ヒスト
グラム値の分布に基づいて、文字の切り出し方向を判定
する。すなわち、ヒストグラム値が１画素以上の領域が
最多となる傾斜角を文字列方向とする。更に、文字切り
出し部１６は、１画素以上のヒストグラム値が連続的に
現れる領域を一文字分とし、判定された切り出し方向に
沿って、各文字を切り出す。The character cutout section 16 obtains vertical component histograms for various inclination angles by assuming the inclination angle of the character string with respect to the pixels of the binary image data read from the image memory 6. Then, the character cutout unit 16 determines a character cutout direction based on the distribution of each histogram value. That is, the inclination angle at which the area where the histogram value is one pixel or more is the largest is defined as the character string direction. Further, the character cutout unit 16 sets an area where a histogram value of one pixel or more continuously appears as one character, and cuts out each character along the determined cutout direction.

【００３３】正規化処理部１７は、文字切り出し部１６
により切り出された各文字を形成する画素データの連接
情報に基づいて、各文字の外接矩形をとり、文字領域を
設定する。正規化処理部１７は、更に、各文字領域を８
×８のマトリクス状のメッシュ領域（小領域）に分割し
て、正規化する（所謂、大きさの正規化）。正規化処理
部１７は、大きさの正規化がなされた文字領域の各メッ
シュ領域（この場合、６４個）内に存在する正規化前濃
度としての黒画素の数を求める。更に、各メッシュ領域
の黒画素数を全メッシュ領域中の最大黒画素数で除算
し、その除算により得られた値に２５５をそれぞれ乗算
して、２５６階調のメッシュ濃度を得る（所謂、濃度の
正規化）。このメッシュ濃度は個々の文字の特徴を表し
ている。The normalization processing section 17 includes a character cutout section 16
A circumscribed rectangle of each character is set based on the connection information of the pixel data forming each character cut out by the above, and a character area is set. The normalization processing unit 17 further sets each character area to 8
It is divided into mesh areas (small areas) of a matrix of × 8 and normalized (so-called size normalization). The normalization processing unit 17 obtains the number of black pixels as the density before normalization existing in each mesh area (64 in this case) of the character area whose size has been normalized. Further, the number of black pixels in each mesh area is divided by the maximum number of black pixels in all the mesh areas, and the value obtained by the division is multiplied by 255 to obtain a mesh density of 256 gradations (so-called density). Normalization). This mesh density represents the characteristics of each character.

【００３４】正規化処理部１７は、各文字のメッシュ濃
度値を得る毎に、そのメッシュ濃度の平均値を算出し、
得られた平均濃度値Ｋと予め設定された補正濃度値Ｃを
用いて、その各メッシュ濃度の値Ｘ0 を変換する。すな
わち、メッシュ濃度値Ｘ0 が以下に示す条件（i ）に当
てはまるとき、式（１）に従って値Ｘ1 に変換され、条
件（ii）に当てはまるとき、式（２）に従って値Ｘ2 に
変換される。Each time the mesh density value of each character is obtained, the normalization processing unit 17 calculates an average value of the mesh density,
Using the obtained average density value K and a preset correction density value C, the value X0 of each mesh density is converted. That is, when the mesh density value X0 satisfies the following condition (i), it is converted to a value X1 according to equation (1), and when it satisfies condition (ii), it is converted to a value X2 according to equation (2).

【００３５】（i ）Ｘ0 ≧Ｋのとき、Ｘ1 ＝（Ｘ0 −Ｋ）×（255 −Ｃ）／（255 −Ｋ）＋Ｃ・・・（１）（ii）Ｘ0 ＜Ｋのとき、Ｘ2 ＝Ｃ×（１−（Ｋ−Ｘ0 ）／Ｋ）・・・（２）平均濃度値Ｋは、メッシュ濃度値Ｘ0 の合計を、０を越
える濃度値をもつ（黒点が存在する）メッシュ領域の数
で除算することにより得られた値である。補正濃度値Ｃ
は、現在の平均濃度値Ｋを補正して、その平均濃度値Ｋ
のばらつきを最小化するために用いられる値であって、
この値は実験的に求められる。本実施の形態において、
補正濃度値Ｃは１８０に設定されている。すなわち、各
文字のメッシュ濃度の現在の平均濃度値Ｋが補正濃度値
Ｃに近い値に補正されるように、式（１）及び（２）に
従う変換が行われる。(I) When X0 ≧ K, X1 = (X0−K) × (255−C) / (255−K) + C (1) (ii) When X0 <K, X2 = C × (1− (K−X0) / K) (2) The average density value K is obtained by calculating the sum of the mesh density values X0 by the number of mesh areas having a density value exceeding 0 (black spots exist). This is the value obtained by the division. Correction density value C
Corrects the current average density value K and calculates the average density value K
Value used to minimize the variation of
This value is determined experimentally. In the present embodiment,
The correction density value C is set to 180. That is, the conversion according to the equations (1) and (2) is performed so that the current average density value K of the mesh density of each character is corrected to a value close to the correction density value C.

【００３６】文字認識部１８は、予め用意された辞書１
９に接続されており、辞書１９に記憶された基本文字パ
ターンの特徴量と、入力された変換後の文字パターンの
特徴量との比較演算処理を行って類似性を判断する。そ
して、最も類似した特徴をもつ文字が認識結果とされ
る。辞書１９は、認識の対象となる全ての文字の図形特
徴を記憶した認識辞書であり、入力部１０の番号入力キ
ーに備えられた全ての数字，記号に対し、その書体（斜
め文字、癖字など）等の違い毎のパターンの図形特徴
（メッシュ特徴）を記憶している。文字認識部１８によ
り認識された文字（本実施の形態では電話番号又はＦＡ
Ｘ番号に使用される数字，記号）は、電話番号管理処理
部２０に出力される。The character recognizing unit 18 stores the dictionary 1 prepared in advance.
9 is compared with the feature amount of the basic character pattern stored in the dictionary 19 and the feature amount of the input converted character pattern to determine similarity. Then, the character having the most similar feature is set as the recognition result. The dictionary 19 is a recognition dictionary in which graphic features of all characters to be recognized are stored. For all numbers and symbols provided on the number input keys of the input unit 10, the typefaces (diagonal characters, habit characters) are used. , Etc.) are stored for each pattern. Characters recognized by the character recognition unit 18 (in this embodiment, telephone numbers or FA
The numbers and symbols used for the X number are output to the telephone number management processing unit 20.

【００３７】表示部１１は、例えば液晶表示装置（ＬＣ
Ｄ）からなり、入力部１０から制御部７に入力された番
号入力データや、制御部７にて読み取られた文字データ
の文字認識結果を画面に表示する。この表示部１１は、
制御部７により表示制御回路１２を介して表示制御され
る。入力部１０及び表示制御部１２は電話番号管理処理
部２０に接続されている。The display unit 11 is, for example, a liquid crystal display (LC)
D), the number input data input from the input unit 10 to the control unit 7 and the character recognition result of the character data read by the control unit 7 are displayed on the screen. This display unit 11
The display is controlled by the control unit 7 via the display control circuit 12. The input unit 10 and the display control unit 12 are connected to a telephone number management processing unit 20.

【００３８】電話番号管理処理部２０は、入力部１０か
ら出力された電話番号データ及び文字認識部１８から出
力された文字認識結果を記憶する記憶部と、入力部１０
からの入力データに基づいて、その記憶部に対して処理
を施す処理部とを備えている。文字認識による番号の確
認モードにおいて、数字キーの操作により入力部１０か
ら番号データを入力したとき、電話番号管理処理部２０
は、その番号データを表示制御回路１２を介して表示部
１１の画面に表示する。更に、自己の記憶部に記憶され
た文字認識結果としての電話番号データをその入力され
た番号データに置き換える。更に、管理処理部２０は、
入力部１０からの送信キーの操作信号に基づき、そのと
き表示部１１に表示中の電話番号に対応する電話番号デ
ータを、記憶部から読出してトーンダイヤル部１３に転
送する。The telephone number management processing unit 20 stores a telephone number data output from the input unit 10 and a character recognition result output from the character recognition unit 18.
And a processing unit for performing processing on the storage unit based on input data from the storage unit. In the number confirmation mode by character recognition, when the number data is input from the input unit 10 by operating the numeric keys, the telephone number management processing unit 20
Displays the number data on the screen of the display unit 11 via the display control circuit 12. Further, the telephone number data as the character recognition result stored in its own storage unit is replaced with the input number data. Further, the management processing unit 20
Based on a transmission key operation signal from the input unit 10, telephone number data corresponding to the telephone number currently displayed on the display unit 11 is read out from the storage unit and transferred to the tone dial unit 13.

【００３９】トーンダイヤル部１３は、電話番号管理処
理部２０から転送された電話番号データに応じてプッシ
ュ回線対応のトーン信号を発生する。発生されたトーン
信号はＮＣＵ１５に送信される。ＮＣＵ１５は、制御部
７からの制御信号に基づき電話回線に対するトーンダイ
ヤル部１３とモデム１４との接続を選択的に切り換える
とともに、電話回線が相手先のファクシミリ装置と接続
されたことを確認する。ＦＡＸ符号化処理部２１は、入
力部１０からのＦＡＸ送信キーの操作信号に基づき、画
像メモリ６から順次読出される画像データの符号化処理
を行って、ＦＡＸ送信信号を生成する。生成されたＦＡ
Ｘ送信信号は、電話回線接続後にモデム１４，ＮＣＵ１
５を介して電話回線から回線接続先に送信される。The tone dial unit 13 generates a tone signal corresponding to a push line in accordance with the telephone number data transferred from the telephone number management processing unit 20. The generated tone signal is transmitted to the NCU 15. The NCU 15 selectively switches the connection between the tone dial unit 13 and the modem 14 to the telephone line based on the control signal from the control unit 7, and confirms that the telephone line has been connected to the other party's facsimile machine. The FAX encoding processing unit 21 performs an encoding process on image data sequentially read from the image memory 6 based on an operation signal of a FAX transmission key from the input unit 10 to generate a FAX transmission signal. Generated FA
The X transmission signal is transmitted to the modem 14, NCU 1 after connection to the telephone line.
5 is transmitted from the telephone line to the line connection destination.

【００４０】次に上記のように構成されたファクシミリ
装置１の文字認識までの作用を図３のフローチャートに
従って説明する。ステップＳ１において、ユーザが、所
定領域に電話番号が手書きされた送信用紙Ｐを用紙挿入
部１ａの挿入口に挿入すると、用紙検知器３により用紙
Ｐが検知される。すると、制御部７はＣＣＤイメージセ
ンサ４を作動させるとともに、モータ駆動回路９を介し
てモータ８を駆動させて、ローラ２を回転させる。これ
により、用紙Ｐが所定速度で先頭の所定領域だけ送り込
まれ、ＣＣＤイメージセンサ４により電話番号が読み取
られる。Next, the operation of the facsimile apparatus 1 configured as described above up to character recognition will be described with reference to the flowchart of FIG. In step S <b> 1, when the user inserts the transmission sheet P in which the telephone number is handwritten in the predetermined area into the insertion slot of the sheet insertion unit 1 a, the sheet P is detected by the sheet detector 3. Then, the controller 7 operates the CCD image sensor 4 and drives the motor 8 via the motor drive circuit 9 to rotate the roller 2. As a result, the paper P is fed at a predetermined speed in only a predetermined area at the head, and the telephone number is read by the CCD image sensor 4.

【００４１】ステップＳ２において、ＣＣＤイメージセ
ンサ４からの読み取りアナログ画像信号ＡＳは、２値化
回路５により２値画像信号ＩＤに変換されて画像メモリ
６に蓄積され、さらに画像メモリ６に蓄積された画像デ
ータは、文字切り出し処理部１６に順次に転送される。
文字切り出し処理部１６は、画像メモリ６から入力した
２値画像データの画素に対して、種々の傾斜角に対する
垂直成分ヒストグラムを求め、各ヒストグラム値の分布
に基づいて、文字の切り出し方向を判定する。こうし
て、判定された切り出し方向に沿って、各文字が切り出
される。In step S2, the analog image signal AS read from the CCD image sensor 4 is converted into a binary image signal ID by the binarization circuit 5, and stored in the image memory 6, and further stored in the image memory 6. The image data is sequentially transferred to the character cutout processing unit 16.
The character cutout processing unit 16 obtains vertical component histograms for various inclination angles for the pixels of the binary image data input from the image memory 6, and determines the character cutout direction based on the distribution of each histogram value. . Thus, each character is cut out along the determined cutout direction.

【００４２】ステップＳ３において、正規化処理部１７
は、切り出された各文字の外接矩形をとって文字領域を
設定する。更に、各文字領域を８×８のマトリクス状の
メッシュ領域に分割して、大きさの正規化を行う。In step S3, the normalization processing section 17
Sets the character area by taking the circumscribed rectangle of each cut-out character. Further, each character area is divided into 8 × 8 matrix mesh areas, and the size is normalized.

【００４３】ステップＳ４において、大きさの正規化が
なされた文字領域の各メッシュ領域内に存在する黒画素
の数が求められ、各メッシュ領域についての２５６階調
のメッシュ濃度が算出されて、濃度の正規化が行われ
る。すなわち、メッシュ濃度は、従来の技術で説明した
ように、各メッシュ領域について黒画素数（正規化前濃
度）を全メッシュ領域中の最大黒画素数で除算し、その
除算により得られた値に２５５をそれぞれ乗算すること
により、得られる。図４は、濃度の正規化が行われた後
の文字領域ＷＥ１，ＷＥ２における各メッシュ領域３０
の濃度をそれぞれ示している。この文字領域ＷＥ１，Ｗ
Ｅ２は、同じ文字「１」の１回目と２回目の読み取りに
よりそれぞれ得られたものであり、従来の技術の説明で
用いた図５の各文字領域ＷＥ１，ＷＥ２とそれぞれ対応
している。In step S4, the number of black pixels existing in each mesh area of the character area whose size has been normalized is determined, and the mesh density of 256 gradations for each mesh area is calculated. Is normalized. That is, as described in the background art, the mesh density is obtained by dividing the number of black pixels (density before normalization) for each mesh area by the maximum number of black pixels in all mesh areas, and obtaining a value obtained by the division. 255 respectively. FIG. 4 shows each mesh area 30 in the character areas WE1 and WE2 after density normalization has been performed.
Are shown respectively. This character area WE1, W
E2 is obtained by the first and second readings of the same character "1", and corresponds to each of the character areas WE1 and WE2 in FIG. 5 used in the description of the conventional technique.

【００４４】ステップＳ５において、正規化処理部１７
は、各文字のメッシュ濃度値を得る毎に、そのメッシュ
濃度の平均値を算出し、得られた平均濃度値Ｋと予め設
定された補正濃度値Ｃを用いて、上記した式（１），
（２）に従って、その各メッシュ濃度の値Ｘ0 をＸ1 及
びＸ2 に変換する。例えば、図４に示すように、文字領
域ＷＥ１について、そのメッシュ濃度の平均濃度値Ｋ＝
１５０と算出された場合、その平均濃度値Ｋと予め設定
された補正濃度値Ｃ＝１８０を用いて、式（１）（２）
に従って各メッシュ濃度値Ｘ0 がＸ1 ，Ｘ2 に変換され
る。この変換によって、文字「１」のメッシュ濃度の平
均濃度値Ｋ＝１６２となり、補正濃度値Ｃ＝１８０に近
い値に補正される。In step S5, the normalization processing section 17
Is obtained by calculating the average value of the mesh density every time the mesh density value of each character is obtained, and using the obtained average density value K and the preset correction density value C, the above equation (1),
According to (2), the value X0 of each mesh density is converted into X1 and X2. For example, as shown in FIG. 4, for the character area WE1, the average density value K of the mesh density
When it is calculated as 150, the average density value K and the preset correction density value C = 180 are used to calculate the equations (1) and (2).
Is converted into X1 and X2. By this conversion, the average density value K of the mesh density of the character “1” becomes K = 162, and the density is corrected to a value close to the corrected density value C = 180.

【００４５】更に、文字領域ＷＥ２について、そのメッ
シュ濃度の平均濃度値Ｋ＝１６７と算出された場合、同
様に式（１），（２）に従って、各メッシュ濃度値Ｘ0
がＸ1 ，Ｘ2 に変換される。この変換によって、文字
「１」のメッシュ濃度の平均濃度値Ｋ＝１７４となり、
補正濃度値Ｃ＝１８０に近い値に補正される。このよう
に、変換後において、メッシュ濃度の平均値のばらつき
（Ｋ＝１５０，Ｋ＝１６７）が少なくなっている（Ｋ＝
１６２，Ｋ＝１７４）ことが分かる。なお、説明を分か
りやすくするために、ステップＳ４の濃度の正規化と、
ステップＳ５の濃度値変換とを別ステップで行うように
したが、これらを当然一つのステップにまとめてもよ
い。Further, when the average density K of the mesh density of the character area WE2 is calculated to be K = 167, the mesh density values X0 are similarly calculated according to the equations (1) and (2).
Is converted into X1 and X2. By this conversion, the average density K = 174 of the mesh density of the character “1” is obtained.
The correction density value is corrected to a value close to 180. Thus, after the conversion, the variation in the average value of the mesh density (K = 150, K = 167) is reduced (K =
162, K = 174). In order to make the explanation easy to understand, the normalization of the density in step S4 and
Although the density value conversion in step S5 is performed in a separate step, these may be naturally combined into one step.

【００４６】ここで、補正濃度値Ｃ＝１８０は実験を行
った結果得られた値であるが、この値が適正である理由
は、平均濃度値Ｋのばらつきを最小化することができる
というだけでなく、メッシュ濃度値のバランスを保つこ
とが可能であるという点にあるものと推定される。例え
ば、補正濃度値Ｃを高め（２５５に近い値）に設定した
場合、変換により比較的低い側のメッシュ濃度値が上に
シフトして特徴が強調されるという良さはあるが、逆に
高い側のメッシュ濃度値の差がなくなり（つまり、値の
変化が保存されなくなり）、メッシュ濃度値のバランス
がくずれてしまうという欠点がある。従って、比較的低
い側のメッシュ濃度値をシフトしつつ、高い側のメッシ
ュ濃度値の変化を保存して、メッシュ濃度値のバランス
を保つことが可能な補正濃度値が１８０程度であるもの
と推定される。従って、平均濃度値Ｋのばらつきを最小
化し、かつ、メッシュ濃度値のバランスを保つことがで
きれば、補正濃度値Ｃを１８０以外の１７０や１６０等
の値に変更可能である。Here, the corrected density value C = 180 is a value obtained as a result of an experiment. The reason that this value is appropriate is that the variation of the average density value K can be minimized. It is presumed that the balance of the mesh density values can be maintained. For example, when the correction density value C is set to a high value (a value close to 255), there is a merit that the mesh density value on the relatively low side is shifted upward by the conversion to emphasize the feature, but conversely, the high density side There is a disadvantage that the difference between the mesh density values is lost (that is, the change in the value is not preserved), and the balance of the mesh density values is lost. Therefore, it is estimated that the correction density value capable of maintaining the balance of the mesh density value while preserving the change of the mesh density value on the high side while shifting the mesh density value on the relatively low side is about 180. Is done. Therefore, if the dispersion of the average density value K can be minimized and the balance of the mesh density values can be maintained, the corrected density value C can be changed to a value other than 180, such as 170 or 160.

【００４７】ステップ６において、文字認識部１８によ
り、入力された文字パターンの特徴量と、辞書１９に記
憶された基本文字パターンの特徴量との比較演算処理に
よる文字認識が行われる。こうして、用紙Ｐの所定領域
に記載された各文字が認識されて、その文字認識結果が
電話番号データとして、電話番号管理処理部２０に記憶
されるとともに、表示制御回路１２を介して表示部１１
の画面Ｇに表示される。In step 6, the character recognition unit 18 performs character recognition by comparing and calculating the characteristic amount of the input character pattern and the characteristic amount of the basic character pattern stored in the dictionary 19. In this way, each character described in the predetermined area of the paper P is recognized, and the character recognition result is stored as telephone number data in the telephone number management processing unit 20 and is displayed via the display control circuit 12 on the display unit 11.
Is displayed on the screen G.

【００４８】以上詳述したように本実施の形態のファク
シミリ装置１は、以下のような効果を奏する。（i ）メッシュ濃度の平均濃度値Ｋが補正濃度値Ｃに近
い値となるように、上記式（１），（２）を用いて各メ
ッシュ濃度値を変換するようにしたことにより、平均濃
度値Ｋをほぼ均一にして、メッシュ濃度の変動を少なく
することができる。この結果、個々の文字の特徴量のば
らつきが少なくなって、辞書１９を用いた文字認識が容
易となり、誤認識の発生を少なくすることができる。As described in detail above, the facsimile apparatus 1 of the present embodiment has the following effects. (I) Each mesh density value is converted by using the above equations (1) and (2) so that the average density value K of the mesh density is close to the corrected density value C. By making the value K substantially uniform, the fluctuation of the mesh density can be reduced. As a result, the variation in the feature amount of each character is reduced, and character recognition using the dictionary 19 is facilitated, and occurrence of erroneous recognition can be reduced.

【００４９】（ii）上記式（１），（２）で用いられる
補正濃度値Ｃを、変換前のメッシュ濃度値のバランスを
保つことが可能な値（１８０）に設定したことにより、
変換前の特徴量を保持しつつ、正確な文字認識を行うこ
とができる。(Ii) By setting the correction density value C used in the above equations (1) and (2) to a value (180) capable of maintaining the balance of the mesh density value before conversion,
Accurate character recognition can be performed while retaining the feature amount before conversion.

【００５０】（iii ）文字のメッシュ濃度の変化の幅
（ダイナミックレンジの幅）は０から２５５となり、従
来と同様であり、かつ、文字のメッシュ濃度の上下関係
も保ったままである。つまり、認識時における個々の文
字の特徴量は、さほど減少していない。しかも、同じ文
字種を認識する場合における特徴量のばらつきを抑制す
ることができる。従って、正確な文字認識を行うことが
できる。(Iii) The width of the change in the mesh density of the character (the width of the dynamic range) is from 0 to 255, which is the same as the conventional one, and the vertical relationship of the mesh density of the character is maintained. That is, the feature amount of each character at the time of recognition does not decrease so much. In addition, it is possible to suppress variations in the feature amounts when the same character type is recognized. Therefore, accurate character recognition can be performed.

【００５１】なお、本発明は上記実施の形態に限定され
るものではなく、以下のように実施してもよい。（イ）上記した式（１），（２）以外の変換式を用いて
メッシュ濃度値を変換するようにしてもよい。The present invention is not limited to the above embodiment, but may be implemented as follows. (A) The mesh density value may be converted using a conversion formula other than the above formulas (1) and (2).

【００５２】（ロ）上記実施の形態では、濃度の正規化
（濃度を０〜２５５の所定のレンジ幅に納める）を行っ
た後に、上記した式（１），（２）の変換を行ったが、
これは、逆でもよい。つまり、まず最初に濃度補正を行
う。この場合の補正濃度値Ｃ’は、Ｃ' ＝((正規化前濃度（黒画素数）の最高値) −（正規
化前濃度の最低値))×（補正濃度値Ｃ) ／２５６通常、正規化前濃度の最低値は０であるため、Ｃ' ＝正
規化前濃度の最高値×補正濃度値Ｃ／２５６となる。(B) In the above embodiment, after the density is normalized (concentration is set within a predetermined range of 0 to 255), the above equations (1) and (2) are converted. But,
This may be reversed. That is, first, density correction is performed. In this case, the corrected density value C ′ is: C ′ = ((highest value of density before normalization (the number of black pixels)) − (lowest value of density before normalization)) × (corrected density value C) / 256 Since the minimum value of the density before normalization is 0, C ′ = the maximum value of the density before normalization × the corrected density value C / 256.

【００５３】そして、式（１）は、Ｚ1 ＝（正規化前濃
度Ｚ0 −Ｋ）×（正規化前濃度の最高値−Ｃ’）／（正
規化前濃度の最高値−Ｋ）＋Ｃ’となり、式（２）は、
Ｚ2 ＝Ｃ’×（１−（Ｋ−Ｚ0 ）／Ｋ）となる。なお、
正規化前濃度Ｚ0 としては、黒点が存在するメッシュ領
域についてのみ変換の対象となる。この後、変換された
正規化前濃度Ｚ1 ，Ｚ2 を用いて濃度の正規化が行われ
て、メッシュ濃度値が算出される。Then, equation (1) becomes Z1 = (concentration before normalization Z0−K) × (highest value of density before normalization−C ′) / (highest value of density before normalization−K) + C ′. , Equation (2)
Z2 = C'.times. (1- (K-Z0) / K). In addition,
As the density Z0 before normalization, only the mesh area where the black point exists is subject to conversion. Thereafter, density normalization is performed using the converted pre-normalization densities Z1 and Z2, and a mesh density value is calculated.

【００５４】（ハ）上記実施の形態では、辞書１９を用
いた文字認識としたが、代わりに、ニューラルネットワ
ーク型文字認識手段としてのニューロコンピュータ又は
ノイマン型コンピュータを用いた文字認識としてもよ
い。ニューラルネットワーク型文字認識に関しては、特
開平４−１７７４８５号、特開平４−２６０９８７号、
特開平６−４４４０９号公報等に記載されており周知の
技術である。(C) In the above embodiment, character recognition is performed using the dictionary 19. Alternatively, character recognition may be performed using a neural computer or a Neumann computer as a neural network type character recognition means. Regarding neural network type character recognition, JP-A-4-177485, JP-A-4-260987,
This is a well-known technique described in JP-A-6-44409.

【００５５】（ニ）上記実施の形態では、認識対象文字
を電話番号に使用される数字，記号としたが、その他の
例えばひらがな、カタカナ、英字、漢字などをその認識
対象文字に含んでもよい。(D) In the above embodiment, the characters to be recognized are numbers and symbols used in the telephone number. However, other characters to be recognized may include, for example, hiragana, katakana, English characters, and kanji.

【００５６】（ホ）ファクシミリ装置に限定されず、本
発明をその他の装置に適用することができる。(E) The present invention is not limited to a facsimile apparatus, but can be applied to other apparatuses.

【００５７】[0057]

【発明の効果】以上詳述したように本発明によれば、文
字データを形成する各小領域の濃度値の平均を、その濃
度値のダイナミックレンジを保持しつつ、所定の平均濃
度値に近似するように、各小領域の濃度値の変換を行う
ようにしたことにより、文字の特徴量のばらつきを少な
くすることができる。As described above in detail, according to the present invention, the average of the density values of the respective small areas forming the character data is approximated to a predetermined average density value while maintaining the dynamic range of the density values. As described above, by performing the conversion of the density value of each small area, it is possible to reduce the variation in the characteristic amount of the character.

[Brief description of the drawings]

【図１】ファクシミリ装置を示すブロック図。FIG. 1 is a block diagram showing a facsimile apparatus.

【図２】用紙に記入された電話番号を示す説明図。FIG. 2 is an explanatory diagram showing telephone numbers written on a sheet.

【図３】ファクシミリ装置の作用を示すフローチャー
ト。FIG. 3 is a flowchart showing the operation of the facsimile machine.

【図４】メッシュ濃度値の変換処理を示す説明図。FIG. 4 is an explanatory diagram showing a conversion process of a mesh density value.

【図５】従来例の正規化及び特徴抽出処理を示す説明
図。FIG. 5 is an explanatory diagram showing normalization and feature extraction processing of a conventional example.

[Explanation of symbols]

１７…算出手段、変換手段及び正規化手段としての正規
化処理部、１８…文字認識手段としての文字認識部、Ｋ
…平均濃度値、Ｃ…補正濃度値（所定の平均濃度値）、
Ｘ0 …（変換前の）濃度値、Ｘ1,Ｘ2 …（変換後の）濃
度値。17 ... Normalization processing unit as calculation means, conversion means and normalization means, 18 ... Character recognition unit as character recognition means, K
... average density value, C ... correction density value (predetermined average density value),
X0 ... density value (before conversion), X1, X2 ... density value (after conversion).

───────────────────────────────────────────────────── フロントページの続き (72)発明者堀井洋大阪府守口市京阪本通２丁目５番５号三洋電機株式会社内 (72)発明者田中淳司鳥取県鳥取市南吉方３丁目201番地鳥取三洋電機株式会社内 (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06K 9/62 630──────────────────────────────────────────────────続き Continuing on the front page (72) Hiroshi Horii 2-5-5 Keihanhondori, Moriguchi-shi, Osaka Sanyo Electric Co., Ltd. (72) Inventor Junji Tanaka 3-201 Minamiyoshikata, Tottori-city, Tottori (58) Field surveyed (Int.Cl. ⁶ , DB name) G06K 9/62 630

Claims

(57) [Claims]

1. A density value (X0) of each small area forming a character data area is calculated, and an average (K) of the calculated density values (X0) of each small area is calculated as the density value (X0). A) converting the density value (X0) of each of the small areas so as to approximate a predetermined average density value (C) while maintaining the dynamic range of (1).

2. A character feature comprising a step (S4) of calculating a density value (X0) of each small area based on a ratio of constituent elements of character data in each small area forming a character data area. In the extraction method, the calculating step (S4) may include determining a density value (X0) of each of the small areas.
Converting the density value (X0) of each of the small areas such that the average (K) of the small areas approximates a predetermined average density value (C) while maintaining the dynamic range of the density value (X0) ( S5) A character feature extraction method characterized by including:

3. A character feature comprising a step (S4) of calculating a density value (X0) of each small area based on a ratio of constituent elements of character data in each small area forming the character data area. In the extraction method, the calculating step (S4) may include determining a density value (X0) of each of the small areas.
Converting the density value (X0) of each of the small areas such that the average (K) of the small areas approximates a predetermined average density value (C) while maintaining the dynamic range of the density value (X0) ( S5), the predetermined average density value (C) converts the converted density value (X1, X2) of each small area into a balance state of the density value (X0) of each small area before conversion. A character feature extraction method characterized by being set to a value that can be maintained.

4. The method according to claim 2, wherein
The component is black pixel data constituting character data, and in the conversion step, the density value (X0) of each small area where the black pixel data exists is converted. Method.

5. A value (Z0) indicating a ratio of constituent elements of character data in each small area forming a character data area.
, And a second step of normalizing the calculated value (Z0) of each small area to calculate a density value. Before the second step, the average (K) of the density values in each of the small areas is close to a predetermined average density value (C ′) while maintaining the dynamic range of the density value (X0). And a step of converting a value (Z0) indicating the ratio of the constituent elements in each of the small areas.

6. The method according to claim 5, wherein the component is black pixel data constituting character data;
In the conversion step, a value (Z0) indicating a ratio of the black pixel data is converted for a small area in which the black pixel data exists, wherein the character is extracted.

7. A character feature extraction method for calculating a density value of each small area forming an area of character data and converting the calculated density value of each small area into a density value of a predetermined range width. The average value of the converted density values of the area occupied by the constituent elements of the character data approaches a predetermined value (C) while maintaining the vertical relationship of the density values of each area of the character data before conversion. In this manner, the character feature extraction method is characterized in that the conversion of the density value is performed.

8. A step of dividing the character data area into a plurality of small areas and normalizing the size of each small area, and based on the ratio of the constituent elements of the character data in each normalized small area. Extracting a character feature by calculating a density value (X0) of each of the small areas; and identifying the character based on the extracted character feature. The extracting step is such that the average (K) of the density values (X0) of each of the small areas is close to a predetermined average density value (C) while maintaining the dynamic range of the density value (X0). A character recognition method comprising a step (S5) of converting a density value (X0) of each of the small areas.

9. A step of dividing a character data area into a plurality of small areas to normalize the size of each small area, and based on a ratio of a component of character data in each normalized small area. Extracting a character feature by calculating a density value (X0) of each of the small areas; and identifying the character based on the extracted character feature. The extracting step is such that the average (K) of the density values (X0) of each of the small areas is close to a predetermined average density value (C) while maintaining the dynamic range of the density value (X0). A step (S5) of converting the density value (X0) of each small area, wherein the predetermined average density value (C) is obtained by converting the converted density value (X1, X2) of each small area before conversion. Of the density value (X0) of each of the small areas Character recognition method characterized in that it is set to the possible values keeping.

10. The method according to claim 8, wherein the component is black pixel data constituting character data, and the density value of each small area where the black pixel data exists in the conversion step. (X0) is converted.

11. A means (1) for calculating a density value (X0) of each of the small areas based on a ratio of constituent elements of the character data in each of the small areas forming the character data area.
7), and the average (K) of the density values (X0) of the small areas calculated by the calculation means (17) is a predetermined average density value while maintaining the dynamic range of the density value (X0). (C)
Means (17) for converting the density value (X0) of each of the small areas so as to approximate

12. The apparatus according to claim 11, wherein the predetermined average density value (C) is a density value (X1, X2) of each of the small areas converted by the conversion means (17).
A character feature extraction apparatus characterized in that the density value (X0) of each of the small areas before the conversion is set to a value capable of maintaining a balance state.

13. A means (17) for dividing a character data area into a plurality of small areas to normalize the size of each small area, and a means for normalizing each small area by said normalizing means (17). Means (1) for calculating the density value (X0) of each of the small areas based on the ratio of the constituent elements of the character data
7), and the average (K) of the density values (X0) of the small areas calculated by the calculation means (17) is a predetermined average density value while maintaining the dynamic range of the density value (X0). (C)
Means (17) for converting the density value (X0) of each of the small areas so as to approximate to the following: based on the density values (X1, X2) of each of the small areas converted by the conversion means (17). And a means (18) for identifying the character.

14. The apparatus according to claim 13, wherein the predetermined average density value (C) is a density value (X1, X2) of each of the small areas converted by the conversion means (17).
A character recognition device characterized in that the character recognition device is set to a value that can maintain the balance of the density value (X0) of each of the small areas before conversion.

15. A facsimile device comprising the character recognition device according to claim 13.