JPH06274695A

JPH06274695A - Character recognition method

Info

Publication number: JPH06274695A
Application number: JP5058434A
Authority: JP
Inventors: Tamotsu Nakajima; 有中島
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1993-03-18
Filing date: 1993-03-18
Publication date: 1994-09-30

Abstract

PURPOSE:To obtain a character recognition method in which a dictionary can be easily generated by calculating the distribution of a chain code in the prescribed area with designated contour of a character image, and recognizing a character in detail based on the distribution. CONSTITUTION:A binarization processing part 1 binarizes image data, and a runlength data generating part 2 generates runlength data from binary image data, and a vertical phase expression data generating part 3 generates vertical phase expression. Also, a middle classification condition generating part 4 generates a middle classification condition by using the position of a run in the vertical phase expression. Furthermore, a chain code distribution generating part 5 generates chain code distribution normalized for a partial contour and normalized code length distribution, and delivers them to a character recognizing part 7, and the character recognition part 7 performs recognition processing based on the distribution. In such a way, since the dictionary can be generated from sampling data by using a statistical method, the dictionary can be easily generated.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、辞書作成を容易にした
文字認識方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition method that facilitates dictionary creation.

【０００２】[0002]

【従来の技術】２値化された文字画像を認識する方法に
おいて、本出願人は先に、２値化された画像データから
行毎にランデータを生成し、隣接する行間のランデータ
パターンが異なる一対の行（位相変化ペア）を抽出し、
該一対の行の連なりによって線図形を表現し（縦位相表
現）、該縦位相表現を用いて文字を認識する方法を提案
した（特願平４−４６５７４号）。この縦位相表現を用
いた方法は、画素全体に対してノイズ除去を行っていた
従来の方法に比べて、圧縮された非常に少ないデータ上
でノイズ除去（前掲した出願では、これをつの削り処理
という）が行えるので、高速であるという利点がある。2. Description of the Related Art In a method for recognizing a binarized character image, the present applicant has previously generated run data for each row from binarized image data, and a run data pattern between adjacent rows can be obtained. Extract different pairs of rows (phase change pairs),
A method has been proposed in which a line figure is expressed by a series of a pair of lines (vertical phase expression), and a character is recognized using the vertical phase expression (Japanese Patent Application No. 4-46574). The method using this vertical phase representation removes noise on a very small amount of compressed data as compared to the conventional method that performs noise removal on the entire pixel (in the above-mentioned application, this noise removal is That is, there is an advantage that it is fast.

【０００３】縦位相表現を用いた文字認識方法について
簡単に説明すると、２値化された画像データ（図１０）
は、位相変化ペア２１〜２４で構成された縦位相に表現
される（図１１の２５）。A character recognition method using the vertical phase representation will be briefly described. Binarized image data (FIG. 10)
Is represented by a vertical phase composed of the phase change pairs 21 to 24 (25 in FIG. 11).

【０００４】縦位相表現の接続状態情報を一つの文字列
で表現したものを、縦位相表現におけるトポロジーコー
ドといい、“モデル”とも呼ばれる。分類は、モデルが
辞書にあるものと同一か否かを判定することにより行
う。図１２には、１０種類のトポロジーコード（ただ
し、その他のコード＊を含む）を示す。例えば、数字
「２」については、ｄ，Ａ，ｐＩ，ｐとなる（図１
３）。先に提案した文字認識方法では、縦位相表現のト
ポロジーコードの組み合わせによって文字を大分類し
た。A representation of the connection state information in the vertical phase representation by one character string is called a topology code in the vertical phase representation and is also called a "model". Classification is done by determining whether the model is the same as in the dictionary. FIG. 12 shows ten types of topology codes (including other codes *). For example, the number "2" is d, A, pI, p (see FIG. 1).
3). In the previously proposed character recognition method, the characters are roughly classified according to the combination of the topology codes of the vertical phase representation.

【０００５】しかしながら、図１４に示すような文字を
認識する場合、文字“６”，“４”，“０”のいずれも
同一のトポロジーコード（ｄ，Ｉｄ，ＩＡ，ＶＩ，Ｖ，
ｐ）となるので、縦位相表現のトポロジーコードを用い
るだけでは不十分であった。このように、縦位相表現の
トポロジーコードは、文字の大分類までしか行うことが
できず、したがって更に詳細に分類を行う必要がある。However, when recognizing a character as shown in FIG. 14, all the characters "6", "4" and "0" have the same topology code (d, Id, IA, VI, V,
p), it is not enough to use the topology code of the vertical phase expression. As described above, the topology code of the vertical phase representation can perform only the major classification of characters, and thus the classification needs to be performed in more detail.

【０００６】このような点を解決した文字認識方法とし
て、本出願人は、ｇｉデータを用いた文字認識方法およ
び方向コードを用いた文字認識方法を既に提案した（特
願平４−５２２１３号）。すなわち、ｇｉデータによる
分類は、縦位相表現から容易に得られる量（数値）の情
報を用いて行う分類であり、縦位相表現に含まれている
位相変化ペアの高さ情報と、縦位相表現の示すランの位
置情報を用いる。そして、分類は、閾値による分類を組
み合わせた簡単な論理式によって行われる。As a character recognition method that solves such a point, the present applicant has already proposed a character recognition method using gi data and a character recognition method using a direction code (Japanese Patent Application No. 4-52213). . That is, the classification based on the gi data is a classification performed using the amount (numerical value) information easily obtained from the vertical phase expression, and the height information of the phase change pair included in the vertical phase expression and the vertical phase expression. The position information of the run indicated by is used. Then, the classification is performed by a simple logical expression in which the classification by the threshold value is combined.

【０００７】例えば、図１５のトポロジーコード（ｄ，
Ｉｄ，ＩＡ，ＶＩ，Ｖ，ｐ）を生成する文字は、９８文
字であり、その内訳は文字“６”が９６例、文字“０”
および文字“４”がそれぞれ１例ずつであった。そし
て、図１５のモデル（ｄ，Ｉｄ，ＩＡ，ＶＩ，Ｖ，ｐ）
において、各位相変化ペアを上から順に、ｇｉ０，ｇｉ
１，・・・ｇｉ５とし、ｇｉ５とｇｉ４の高さの差を
ａ、ｇｉ１とｇｉ０の高さの差をｂ、ｇｉ４とｇｉ２の
高さの差をｃ、ｇｉ５とｇｉ０の高さの差をｈとしたと
き、上記したモデル（ｄ，Ｉｄ，ＩＡ，ＶＩ，Ｖ，ｐ）
に属する９８文字のデータは、ａ，ｂ，ｃ，ｈを用いた
次の条件によって完全に分離することができた。For example, the topology code (d,
The characters that generate Id, IA, VI, V, p) are 98 characters, and the breakdown is 96 examples of the character "6" and the character "0".
And the letter "4" was one example each. Then, the model of FIG. 15 (d, Id, IA, VI, V, p)
In the above, each phase change pair is gi0, gi in order from the top.
1, ... gi5, the height difference between gi5 and gi4 is a, the height difference between gi1 and gi0 is b, the height difference between gi4 and gi2 is c, and the height difference between gi5 and gi0 is If h, then the above model (d, Id, IA, VI, V, p)
The data of 98 characters belonging to could be completely separated by the following conditions using a, b, c and h.

【０００８】文字“６”の条件：ａ／ｈ＜０．１５かつ
０．２５≦ｂ／ｈ＜０．８０文字“０”の条件：ａ／ｈ＜０．１５かつｂ／ｈ＜０．
１５文字“４”の条件：０．２５≦ａ／ｈ＜０．６０。Condition for character "6": a / h <0.15 and 0.25≤b / h <0.80 Condition for character "0": a / h <0.15 and b / h <0.
Condition of 15 characters “4”: 0.25 ≦ a / h <0.60.

【０００９】また、方向コードによる分類は、ｇｉデー
タを用いた方法では分類できない数字の分類を可能とす
るものであり、この方向コード（図１６）は、輪郭の一
部分に対して等間隔でサンプリング点をとり、各点の傾
きをコード化して並べたものである。そして、詳細分類
は、文字認識用のオートマトンを用いて行う。Further, the classification by the direction code enables the classification of numbers which cannot be classified by the method using the gi data, and the direction code (FIG. 16) is sampled at equal intervals for a part of the contour. The points are taken and the slopes of the points are coded and arranged. Then, the detailed classification is performed using an automaton for character recognition.

【００１０】[0010]

【発明が解決しようとする課題】しかしながら、上記し
た詳細分類の方法では、データを参照しながらオートマ
トンの構成を考えなければならず、オートマトン作成に
多大の労力を要するという問題がある。また、上記した
方法は、オートマトンが人の直観と離れているため、オ
ートマトン作成に訓練を必要とするという欠点もある。However, in the above detailed classification method, it is necessary to consider the structure of the automaton while referring to the data, and there is a problem that a great amount of labor is required to create the automaton. In addition, the above-mentioned method has a drawback that training is required to create an automaton because the automaton is far from the human intuition.

【００１１】本発明の目的は、辞書作成を容易にした文
字認識方法を提供することにある。An object of the present invention is to provide a character recognition method that facilitates dictionary creation.

【００１２】[0012]

【課題を解決するための手段】前記目的を達成するため
に、請求項１記載の発明では、２値化された画像データ
から行毎にランデータを生成し、隣接する行間のランデ
ータパターンが異なる位相変化ペアを抽出し、該位相変
化ペアにおけるランの接続状態で構成された縦位相表現
を用いて文字を認識する方法において、文字画像の輪郭
の一部または全部を指定し、該指定された輪郭の所定領
域におけるチェインコードの分布を算出し、該分布に基
づいて文字を詳細認識することを特徴としている。In order to achieve the above object, in the invention according to claim 1, run data is generated for each row from binarized image data, and a run data pattern between adjacent rows is generated. In a method of extracting a different phase change pair and recognizing a character by using a vertical phase expression composed of connected states of runs in the phase change pair, a part or all of the outline of a character image is specified and the specified It is characterized in that the distribution of the chain code in a predetermined area of the contour is calculated and the characters are recognized in detail based on the distribution.

【００１３】請求項２記載の発明では、２値化された画
像データから行毎にランデータを生成し、隣接する行間
のランデータパターンが異なる位相変化ペアを抽出し、
該位相変化ペアにおけるランの接続状態で構成された縦
位相表現を用いて文字を認識する方法において、文字画
像の輪郭の一部または全部を指定し、該指定された輪郭
を多角形で近似表現し、該表現を用いて文字を詳細認識
することを特徴としている。According to the second aspect of the invention, run data is generated for each row from the binarized image data, and phase change pairs having different run data patterns between adjacent rows are extracted.
In a method for recognizing a character by using a vertical phase expression composed of connected states of runs in the phase change pair, a part or all of the outline of a character image is specified, and the specified outline is approximated by a polygon. However, the feature is that the character is recognized in detail by using the expression.

【００１４】請求項３記載の発明では、２値化された画
像データから行毎にランデータを生成し、隣接する行間
のランデータパターンが異なる位相変化ペアを抽出し、
該位相変化ペアにおけるランの接続状態で構成された縦
位相表現を用いて文字を認識する方法において、文字画
像の輪郭の一部または全部を指定し、該指定された輪郭
をシンボルの集まりとして表現し、該表現を用いて文字
を詳細認識することを特徴としている。According to the third aspect of the invention, run data is generated for each row from the binarized image data, and phase change pairs having different run data patterns between adjacent rows are extracted.
In a method for recognizing a character by using a vertical phase expression composed of connected states of runs in the phase change pair, a part or all of the outline of a character image is specified, and the specified outline is expressed as a collection of symbols. However, the feature is that the character is recognized in detail by using the expression.

【００１５】請求項４記載の発明では、２値化された画
像データから行毎にランデータを生成し、隣接する行間
のランデータパターンが異なる位相変化ペアを抽出し、
該位相変化ペアにおけるランの接続状態で構成された縦
位相表現を用いて文字を認識する方法において、文字画
像の輪郭の一部または全部を指定し、該指定された輪郭
をフーリエ変換し、該フーリエ変換された特徴に基づい
て文字を詳細認識することを特徴としている。According to a fourth aspect of the present invention, run data is generated for each row from the binarized image data, and phase change pairs having different run data patterns between adjacent rows are extracted.
In a method for recognizing a character by using a vertical phase representation composed of connected states of runs in the phase change pair, a part or all of the outline of a character image is specified, the specified outline is Fourier-transformed, and The feature is that the character is recognized in detail based on the Fourier-transformed feature.

【００１６】[0016]

【作用】２値化処理部では画像データを２値化し、ラン
レングスデータ生成部では２値化された画像データから
ランレングスデータを作成し、縦位相表現データ生成部
では、縦位相表現を作成する。中分類条件生成部は、縦
位相表現におけるランの位置を用いて中分類条件を生成
する。チェインコード分布生成部は、部分輪郭に対して
正規化されたチェインコード分布と正規化されたコード
長分布を生成し、文字認識部に渡す。文字認識部は、そ
の分布に基づいて認識処理する。The binarization processing unit binarizes the image data, the run length data generation unit creates run length data from the binarized image data, and the vertical phase expression data generation unit creates vertical phase expression. To do. The middle classification condition generation unit generates the middle classification condition using the position of the run in the vertical phase expression. The chain code distribution generation unit generates a normalized chain code distribution and a normalized code length distribution for the partial contour, and passes them to the character recognition unit. The character recognition unit performs recognition processing based on the distribution.

【００１７】[0017]

【実施例】以下、本発明の一実施例を図面を用いて具体
的に説明する。〈チェインコードを用いる方法〉図１は、本発明の第１
の実施例のブロック構成図である。図において、１は、
画像を読み取って２値化する２値化処理部、２は、２値
化された画像データからランレングスデータを生成する
ランレングスデータ生成部、３は、隣接する行のランレ
ングスデータから位相変化ペアを抽出して縦位相表現を
生成する縦位相表現データ生成部である。これら２値化
処理部１、ランレングスデータ生成部２、縦位相表現デ
ータ生成部３は、前掲した出願（特願平４−４６５７４
号）に記載されたものと同一の機能、構成を有するもの
である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be specifically described below with reference to the drawings. <Method Using Chain Code> FIG. 1 shows a first method of the present invention.
It is a block diagram of an embodiment of the. In the figure, 1 is
A binarization processing unit that reads an image and binarizes it, a run length data generation unit that generates run length data from the binarized image data, and a phase change from run length data of adjacent rows. A vertical phase expression data generation unit that extracts a pair and generates a vertical phase expression. The binarization processing unit 1, the run length data generation unit 2, and the vertical phase expression data generation unit 3 are described in the above-mentioned application (Japanese Patent Application No. 4-46574).
No.) has the same function and configuration as those described in No.

【００１８】４は、縦位相表現におけるランの位置を用
いて中分類条件を生成する中分類条件生成部であり、縦
位相表現データとランレングスデータを参照しながら簡
単な算術式と閾値を用いて中分類条件を生成する。この
中分類条件生成部４は、前掲した出願（特願平４−５２
２１３号）に記載されたものと同一の機能、構成を有す
るものである（ただし、本発明では、先の出願のｇｉデ
ータ分類部を中分類条件生成部という）。Reference numeral 4 denotes a middle classification condition generating section for generating middle classification conditions by using run positions in the vertical phase expression, and uses simple arithmetic expressions and threshold values while referring to the vertical phase expression data and the run length data. To generate middle classification conditions. The middle classification condition generating unit 4 is the same as the above-mentioned application (Japanese Patent Application No. 4-52).
No. 213) has the same function and configuration (however, in the present invention, the gi data classification unit of the previous application is referred to as a middle classification condition generation unit).

【００１９】５は、部分輪郭に対してチェインコードの
分布を求めるチェインコード分布生成部である。６は、
縦位相表現データを参照してつの削り処理を行うノイズ
除去部であり、縦位相表現データとランレングスデータ
に変更を加える。このノイズ除去部は、前掲した出願
（特願平４−４６５７４号）に記載されたものと同一の
機能、構成を有している。７は、縦位相表現、中分類条
件、およびチェインコード分布に基づいて詳細認識を行
う文字認識部である。Reference numeral 5 is a chain code distribution generator for obtaining the distribution of chain codes for the partial contour. 6 is
It is a noise removing unit that performs one shaving process by referring to the vertical phase expression data, and changes the vertical phase expression data and the run length data. This noise removing unit has the same function and configuration as those described in the above-mentioned application (Japanese Patent Application No. 4-46574). A character recognition unit 7 performs detailed recognition based on the vertical phase expression, the middle classification condition, and the chain code distribution.

【００２０】図２に示す画像データを例にして、本発明
の文字認識方法について詳述する。図２に示すように、
２値化処理部１によって２値化された画像データから、
ランレングスデータ生成部２はランレングスデータを作
成し、縦位相表現データ生成部３は、このランレングス
データから前述したようにして縦位相表現を作成する。
図２の画像データの縦位相表現（トポロジーコード）
は、“ｄ，Ａ，Ｖ，ｐ”となる。The character recognition method of the present invention will be described in detail with reference to the image data shown in FIG. As shown in FIG.
From the image data binarized by the binarization processing unit 1,
The run length data generation unit 2 creates run length data, and the vertical phase representation data generation unit 3 creates vertical phase representation from this run length data as described above.
Vertical phase representation of image data in Figure 2 (topology code)
Becomes "d, A, V, p".

【００２１】中分類条件生成部４は、図３に示すような
中分類条件を生成する。つまり、モデル“ｄ，Ａ，Ｖ，
ｐ”に対して、ｘ＝ａ／ｂとしたとき、ｘ＜０．８５
（グループ１），０．８５≦ｘ＜１．２５（グループ
２），１．２５≦ｘ（グループ３）によってデータを分
類する。図２に示す画像データは、“ｄ，Ａ，Ｖ，ｐ”
のグループ１に分類される。The middle classification condition generating unit 4 generates middle classification conditions as shown in FIG. That is, the model “d, A, V,
When x = a / b for p ″, x <0.85
The data is classified by (group 1), 0.85 ≦ x <1.25 (group 2), 1.25 ≦ x (group 3). The image data shown in FIG. 2 is “d, A, V, p”.
It is classified into Group 1 of.

【００２２】チェインコード分布生成部５は、図４
（ａ）に示すように、分類された“ｄ，Ａ，Ｖ，ｐ”の
グループ１の部分輪郭（ｒ，ｒ’）に対して、チェイン
コードを生成する。図６（ａ）、（ｂ）は、チェインコ
ードの定義を説明する図で、図６（ａ）に示すように８
方向のコードを決め、図６（ｂ）に示すように、図形の
輪郭を該コードのチェーンで表現する。The chain code distribution generator 5 is shown in FIG.
As shown in (a), a chain code is generated for the classified “d, A, V, p” partial contours (r, r ′) of group 1. FIGS. 6A and 6B are diagrams for explaining the definition of the chain code, and as shown in FIG.
The code of the direction is determined, and the outline of the figure is expressed by the chain of the code as shown in FIG. 6 (b).

【００２３】チェインコード分布生成部５は、文字認識
部７によって指定された部分輪郭に対して、以下に示す
ようにチェインコード分布を生成する。具体的には、図
２の画像データの場合、文字認識部７からチェインコー
ド分布生成部５に対して、「図４（ａ）の部分輪郭ｒ
を、１／３，１／３，１／３に分割したときのチェイン
コード分布」と、「図４（ａ）の部分輪郭ｒ’を、１／
４，１３／２０，１／１０に分割したときのチェインコ
ード分布」を求める指示が与えられる。The chain code distribution generation unit 5 generates a chain code distribution for the partial contour designated by the character recognition unit 7 as shown below. Specifically, in the case of the image data of FIG. 2, the character recognition unit 7 instructs the chain code distribution generation unit 5 to display “partial contour r of FIG.
Is divided into ⅓, ⅓ and ⅓, and “partial contour r ′ in FIG.
4, 13/20, 1/10, and a chain code distribution ”is given.

【００２４】以下、図４（ａ）の部分輪郭ｒを１／３，
１／３，１／３に分割してチェインコード分布を得る場
合を例にして、図５のフローチャートを参照して説明す
る。ステップ５００では、部分輪郭と分割比（１／３，
１／３，１／３）が与えられる。ステップ５０１では、
与えられた輪郭を分割比に基づいて分割する。以下、分
割された輪郭を分割輪郭という。図４（ｂ）は、部分輪
郭ｒを１：１：１に分割した例を示し、分割されたそれ
ぞれをｒ１，ｒ２，ｒ３とする。In the following, the partial contour r of FIG.
An example of obtaining a chain code distribution by dividing into 1/3 and 1/3 will be described with reference to the flowchart of FIG. In step 500, the partial contour and the division ratio (1/3,
1/3, 1/3) is given. In step 501,
The given contour is divided based on the division ratio. Hereinafter, the divided contour is referred to as a divided contour. FIG. 4B shows an example in which the partial contour r is divided into 1: 1: 1, and the respective divided portions are designated as r1, r2, and r3.

【００２５】ステップ５０２では、まず最初の分割輪郭
ｒ１がＲとなる。ステップ５０３では、Ｒに対してチェ
インコードが生成される。この例の場合、「５４５４４
５５４５５５５６５５６５５６」なるチェインコードが
生成される。ステップ５０４では、ステップ５０３で得
られたチェインコードの分布と長さが求められる。この
場合、分布は「コード４が４回、コード５が１２回、コ
ード６が３回」となり、長さは１９文字となる。In step 502, the first divided contour r1 becomes R. In step 503, a chain code is generated for R. In the case of this example, “54544
A chain code "554555556565556" is generated. In step 504, the distribution and length of the chain code obtained in step 503 are obtained. In this case, the distribution is “4 times for code 4, 12 times for code 5, 3 times for code 6”, and the length is 19 characters.

【００２６】ステップ５０５では、ステップ５０４で得
られたチェインコードの分布が正規化される。この例の
場合、「０が０％、１が０％、２が０％、３が０％、４
が２１％、５が６３％、６が１６％、７が０％」とな
る。以後、これを「０，０，０，０，２１，６３，１
６，０」のように表す。In step 505, the chain code distribution obtained in step 504 is normalized. In the case of this example, “0 is 0%, 1 is 0%, 2 is 0%, 3 is 0%, 4
Is 21%, 5 is 63%, 6 is 16%, and 7 is 0%. " After that, this is "0,0,0,0,21,63,1
6, 0 ".

【００２７】ステップ５０６では、正規化されたチェイ
ンコード分布が出力される。ステップ５０７では、全て
の分割輪郭に対して処理が終わったかを判定する。この
場合、まだ分割輪郭ｒ２，ｒ３が処理されていないの
で、ステップ５０２に行く。このループをあと２回繰り
返すと、分割輪郭ｒ２に対して、「５６６６６６６０７
７０７７０００１０１０７６５６６５」なる長さ２６の
チェインコードが得られた後、「２７，８，０，０，
０，１２，３４，１９」なる正規化されたチェインコー
ドが出力され、分割輪郭ｒ３に対して、「６６５６６５
６５６６５６６７」長さ１４のチェインコードが得られ
た後、「０，０，０，０，０，２９，６４，７」なる正
規化されたチェインコードが出力され、ステップ５０８
に行く。In step 506, the normalized chain code distribution is output. In step 507, it is determined whether the processing has been completed for all the divided contours. In this case, since the divided contours r2 and r3 have not been processed yet, the process goes to step 502. When this loop is repeated two more times, “566666607” for the divided contour r2.
After obtaining the chain code of length 26 of 707700010107656665 "," 27,8,0,0,
The normalized chain code "0, 12, 34, 19" is output, and "665665" is output for the divided contour r3.
65665667 "After the length 14 chain code is obtained, the normalized chain code" 0,0,0,0,0,29,64,7 "is output, and step 508 is performed.
go to.

【００２８】ステップ５０８では、各分割輪郭の長さ１
９文字、２６文字、１４文字が正規化され、「ｒ１，ｒ
２，ｒ３に対しそれぞれ３２％，４４％，２４％」とい
う結果が得られる。ステップ５０９では、この結果であ
る「３２，４４，２４」を出力する。In step 508, the length of each divided contour is 1
9 characters, 26 characters and 14 characters are normalized to "r1, r
The results are 32%, 44%, and 24% for 2 and r3, respectively. In step 509, the result "32, 44, 24" is output.

【００２９】上記したようにして得られた正規化された
チェインコード分布「０，０，０，０，２１，６３，１
６，０」，「２７，８，０，０，０，１２，３４，１
９」，「０，０，０，０，０，２９，６４，７」および
正規化されたコード長分布「３２，４４，２４」を基
に、文字認識部７が文字認識を行う。文字認識部７で
は、予めサンプルデータから各要素の平均と標準偏差を
求めておいた辞書を使い、各分布値が平均とどれだけ離
れているかに応じて認識を行う。辞書に必要な平均と標
準偏差は自動的に求めておくことができる。The normalized chain code distribution "0,0,0,0,21,63,1" obtained as described above
6,0 "," 27,8,0,0,0,12,34,1
The character recognition unit 7 performs character recognition based on "9", "0,0,0,0,0,29,64,7" and the normalized code length distribution "32,44,24". The character recognition unit 7 uses a dictionary in which the average and standard deviation of each element have been obtained in advance from sample data, and performs recognition according to how far each distribution value is from the average. The average and standard deviation required for the dictionary can be calculated automatically.

【００３０】上記した例では、「“ｄ，Ａ，Ｖ，ｐ”の
グループ１」ならば「図４（ａ）の部分輪郭ｒを１／
３，１／３，１／３に分割してチェインコード分布を求
めよ」というルールは、人間が作成しなければならない
が、その労力はオートマトン作成に比べて大幅に少な
い。In the above example, if "the group 1 of" d, A, V, p "is", the partial contour r of FIG.
The rule "obtain the chain code distribution by dividing it into 3, 1/3, 1/3" has to be created by a human, but the labor is significantly less than that in creating an automaton.

【００３１】〈多角形近似を用いる方法〉図７は、本発
明の第２の実施例のブロック構成図である。図７におい
て、２値化処理部１、ランレングスデータ生成部２、縦
位相表現データ生成部３、中分類条件生成部４、ノイズ
除去部６は、前述した実施例１のものと同一である。５
は、部分輪郭に対して、多角形表現を求める多角形表現
生成部、７は、縦位相表現、中分類条件、および多角形
表現に基づいて詳細認識を行う文字認識部である。<Method Using Polygonal Approximation> FIG. 7 is a block diagram of the second embodiment of the present invention. In FIG. 7, the binarization processing unit 1, the run length data generation unit 2, the vertical phase expression data generation unit 3, the middle classification condition generation unit 4, and the noise removal unit 6 are the same as those in the first embodiment described above. . 5
Is a polygon expression generation unit for obtaining a polygon expression for the partial contour, and 7 is a character recognition unit for performing detailed recognition based on the vertical phase expression, the middle classification condition, and the polygon expression.

【００３２】図２に示す画像データを例にして、本実施
例２の文字認識方法について説明すると、前述したと同
様にして、図２に示す画像データは、“ｄ，Ａ，Ｖ，
ｐ”のグループ１に分類され、図４（ａ）の部分輪郭
ｒ，ｒ’に対して多角形表現が生成される。The character recognition method of the second embodiment will be described by taking the image data shown in FIG. 2 as an example. In the same manner as described above, the image data shown in FIG.
The polygonal representation is generated for the partial contours r and r ′ of FIG.

【００３３】多角形表現生成部５では、文字認識部７に
よって指定された輪郭に対して多角形で近似した表現を
生成する。この多角形の近似表現の生成としては、例え
ば、「Ｗｉｌｌｉａｍｓの方法」、「Ｒａｍｅｒの方
法」、「Ｌ１による近似」、「分割融合法」などの方法
を用いる（詳細については、例えば、森俊二他，画像認
識の基礎［１］，オーム社，１９８６年、ｐｐ１７３−
１９０を参照）。文字認識部７は、このようにして得ら
れた多角形表現を用いて認識処理を行う。The polygonal expression generating unit 5 generates a polygonal approximation of the contour specified by the character recognition unit 7. For the generation of the approximate expression of the polygon, for example, methods such as "Williams method", "Ramer method", "L1 approximation", and "division fusion method" are used. Others, Basics of Image Recognition [1], Ohmsha, 1986, pp173-
190). The character recognition unit 7 performs recognition processing using the polygonal expression thus obtained.

【００３４】〈シンボルを用いる方法〉図８は、本発明
の第３の実施例のブロック構成図である。図８におい
て、２値化処理部１、ランレングスデータ生成部２、縦
位相表現データ生成部３、中分類条件生成部４、ノイズ
除去部６は、前述した実施例１のものと同一である。５
は、部分輪郭に対してシンボル表現を求めるシンボル表
現生成部、７は、縦位相表現、中分類条件、およびシン
ボル表現に基づいて詳細認識を行う文字認識部である。<Method Using Symbols> FIG. 8 is a block diagram of the third embodiment of the present invention. In FIG. 8, the binarization processing unit 1, the run length data generation unit 2, the vertical phase expression data generation unit 3, the middle classification condition generation unit 4, and the noise removal unit 6 are the same as those of the above-described first embodiment. . 5
Is a symbol expression generation unit that obtains a symbol expression for the partial contour, and 7 is a character recognition unit that performs detailed recognition based on the vertical phase expression, the middle classification condition, and the symbol expression.

【００３５】図２に示す画像データを例にして、本実施
例３の文字認識方法について説明すると、前述したと同
様にして、図２に示す画像データは、“ｄ，Ａ，Ｖ，
ｐ”のグループ１に分類され、図４（ａ）の部分輪郭
ｒ，ｒ’に対してシンボル表現が生成される。The character recognition method of the third embodiment will be described by taking the image data shown in FIG. 2 as an example. In the same manner as described above, the image data shown in FIG.
The symbol representation is generated for the partial contours r and r ′ of FIG.

【００３６】シンボル表現生成部５では、文字認識部７
によって指定された輪郭に対してシンボル表現を生成す
る。このシンボル表現の生成としては、本出願人が先に
提案した、重なりを有する直線群によって輪郭を表現す
る方法（特願平３−２５３１８５号）を用いる。文字認
識部７は、このようにして得られたシンボル表現を用い
て認識処理を行う。In the symbol expression generation unit 5, the character recognition unit 7
Generates a symbolic representation for the contour specified by. For the generation of this symbol expression, the method proposed by the present applicant to express an outline by a group of straight lines having an overlap (Japanese Patent Application No. 3-253185) is used. The character recognition unit 7 performs recognition processing using the symbolic expression thus obtained.

【００３７】〈フーリエ変換を用いる方法〉図９は、本
発明の第４の実施例のブロック構成図である。図９にお
いて、２値化処理部１、ランレングスデータ生成部２、
縦位相表現データ生成部３、中分類条件生成部４、ノイ
ズ除去部６は、前述した実施例１のものと同一である。
５は、部分輪郭に対してフーリエ変換表現を求めるフー
リエ変換表現生成部、７は、縦位相表現、中分類条件、
およびフーリエ変換表現に基づいて詳細認識を行う文字
認識部である。<Method Using Fourier Transform> FIG. 9 is a block diagram of the fourth embodiment of the present invention. In FIG. 9, a binarization processing unit 1, a run length data generation unit 2,
The vertical phase expression data generation unit 3, the middle classification condition generation unit 4, and the noise removal unit 6 are the same as those in the first embodiment described above.
Reference numeral 5 is a Fourier transform expression generation unit that obtains a Fourier transform expression for a partial contour, 7 is a vertical phase expression, a middle classification condition,
And a character recognition unit for performing detailed recognition based on Fourier transform representation.

【００３８】図２に示す画像データを例にして、本実施
例４の文字認識方法について説明すると、前述したと同
様にして、図２に示す画像データは、“ｄ，Ａ，Ｖ，
ｐ”のグループ１に分類され、図４（ａ）の部分輪郭
ｒ，ｒ’に対してフーリエ変換表現が生成される。The character recognition method of the fourth embodiment will be described by taking the image data shown in FIG. 2 as an example. In the same manner as described above, the image data shown in FIG.
It is classified into the group 1 of p ″, and the Fourier transform expression is generated for the partial contours r and r ′ of FIG.

【００３９】すなわち、フーリエ変換表現生成部５で
は、文字認識部７によって指定された輪郭に対してフー
リエ変換することによって、空間周波数領域における特
徴を生成する。このフーリエ変換表現の生成方法として
は、ＦＦＴ（ＦａｓｔＦｏｕｒｉｅｒＴｒａｎｓｆ
ｏｒｍ）など種々の方法を用いる。文字認識部７は、フ
ーリエ変換された特徴に基づいて認識処理を行う。That is, the Fourier transform representation generating section 5 generates a feature in the spatial frequency domain by performing a Fourier transform on the contour designated by the character recognizing section 7. As a method of generating this Fourier transform expression, an FFT (Fast Fourier Transform) is used.
orm). The character recognition unit 7 performs recognition processing based on the Fourier-transformed features.

【００４０】なお、本発明は、上記したものに限定され
るものではなく、例えば上記した方法を組み合わせたり
あるいはオートマトンを用いて特徴を得て、これに基づ
いて認識を行うように変更することも可能である。It should be noted that the present invention is not limited to the above-described ones, but may be modified so that, for example, the above-mentioned methods are combined, or features are obtained by using an automaton, and recognition is performed based on the obtained features. It is possible.

【００４１】[0041]

【発明の効果】以上、説明したように、請求項１、４記
載の発明によれば、サンプルデータから統計的手法を用
いて辞書を作成することができるので、辞書の作成が容
易になる。As described above, according to the first and fourth aspects of the present invention, the dictionary can be created from the sample data by using the statistical method, so that the dictionary can be created easily.

【００４２】請求項２、３記載の発明によれば、人の直
観に合った多角形、シンボル表現を用いて辞書を作成す
ることができるので、辞書の作成が容易になる。According to the second and third aspects of the present invention, the dictionary can be created using polygons and symbolic expressions that match the intuition of a person, so that the dictionary can be created easily.

[Brief description of drawings]

【図１】本発明の第１の実施例のブロック構成図であ
る。FIG. 1 is a block configuration diagram of a first embodiment of the present invention.

【図２】２値化された画像データの例である。FIG. 2 is an example of binarized image data.

【図３】モデル“ｄ，Ａ，Ｖ，ｐ”の中分類条件を説明
する図である。FIG. 3 is a diagram illustrating medium classification conditions of a model “d, A, V, p”.

【図４】（ａ）は部分輪郭を示し、（ｂ）は部分輪郭を
分割輪郭に分けた例を示す図である。4A is a diagram showing a partial contour, and FIG. 4B is a diagram showing an example in which the partial contour is divided into divided contours.

【図５】チェインコード分布生成部の処理フローチャー
トである。FIG. 5 is a processing flowchart of a chain code distribution generation unit.

【図６】（ａ）、（ｂ）は、チェインコードの定義を説
明する図である。6A and 6B are diagrams for explaining the definition of a chain code.

【図７】本発明の第２の実施例のブロック構成図であ
る。FIG. 7 is a block configuration diagram of a second embodiment of the present invention.

【図８】本発明の第３の実施例のブロック構成図であ
る。FIG. 8 is a block configuration diagram of a third embodiment of the present invention.

【図９】本発明の第４の実施例のブロック構成図であ
る。FIG. 9 is a block configuration diagram of a fourth embodiment of the present invention.

【図１０】２値化された画像データである。FIG. 10 is binarized image data.

【図１１】図１０の縦位相表現図である。11 is a vertical phase representation diagram of FIG. 10. FIG.

【図１２】トポロジーコードを示す図である。FIG. 12 is a diagram showing a topology code.

【図１３】数字２の縦位相表現とトポロジーコードを示
す図である。FIG. 13 is a diagram showing a vertical phase representation of a numeral 2 and a topology code.

【図１４】トポロジーコードのみでは文字を分類できな
い例を示す図である。FIG. 14 is a diagram showing an example in which characters cannot be classified only by a topology code.

【図１５】ｇｉデータによる分類を説明する図である。FIG. 15 is a diagram illustrating classification by gi data.

【図１６】方向コードを示す図である。FIG. 16 is a diagram showing a direction code.

[Explanation of symbols]

１２値化処理部２ランレングスデータ生成部３縦位相表現データ生成部４中分類条件生成部５チェインコード分布生成部６ノイズ除去部７文字認識部 1 Binarization processing unit 2 Run length data generation unit 3 Vertical phase expression data generation unit 4 Medium classification condition generation unit 5 Chain code distribution generation unit 6 Noise removal unit 7 Character recognition unit

Claims

[Claims]

1. Run data is generated for each row from binarized image data, phase change pairs with different run data patterns between adjacent rows are extracted, and the run connection state in the phase change pair is configured. In the method for recognizing a character using the vertical phase representation, a part or all of the outline of the character image is specified, the chain code distribution in a predetermined area of the specified outline is calculated, and the character is calculated based on the distribution. A character recognition method characterized by recognizing in detail.

2. Run data is generated for each row from binarized image data, phase change pairs with different run data patterns between adjacent rows are extracted, and the run connection state in the phase change pair is configured. In a method for recognizing a character using the vertical phase expression, a part or all of the outline of the character image is specified, and the specified outline is approximated by a polygon,
A character recognition method characterized in that a character is recognized in detail using the expression.

3. Run data is generated for each row from binarized image data, phase change pairs having different run data patterns between adjacent rows are extracted, and the run connection state in the phase change pair is configured. In a method for recognizing a character using a vertical phase expression, a part or all of the outline of a character image is specified, the specified outline is expressed as a set of symbols, and the character is recognized in detail using the expression. A character recognition method characterized by the above.

4. Run data is generated for each row from binarized image data, phase change pairs having different run data patterns between adjacent rows are extracted, and the run connection state in the phase change pair is configured. In a method for recognizing a character using a vertical phase representation, a part or all of the outline of a character image is specified, the specified outline is Fourier-transformed, and the character is recognized in detail based on the Fourier-transformed feature. A character recognition method characterized by: