JPH05290217A

JPH05290217A - Character recognition method

Info

Publication number: JPH05290217A
Application number: JP4095007A
Authority: JP
Inventors: Tamotsu Nakajima; 有中島
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1992-04-15
Filing date: 1992-04-15
Publication date: 1993-11-05

Abstract

PURPOSE:To broadly classify characters through the use of codes expressing the connection sate of a graphic. CONSTITUTION:A binarization processing part 1 binarizes an image by reading it and a run length data generation part 2 generates run data. A horizontal phase expression data generation part 3 extracts a pair of phase change from run length data at an adjacent string to generate a horizontal phase expression. A dependance relation generation part 4 generates a dependance relation (a graph consists of a node, a branch, start and end) of the horizontal phase expression. A sedin code generation part 5 outputs a code consisting of the combination of the six characters of 's', 'e', 'd', 'i', 'n' and a comma from the dependance relation. A character recognition part 6 broadly classifies characters with the sedin code.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、図形の接続状態を表す
コード表現を用いて文字を認識する文字認識方法に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition method for recognizing a character by using a code expression representing a connection state of figures.

【０００２】[0002]

【従来の技術】２値化された文字画像を認識する方法に
おいて、本出願人は先に、２値化された画像データから
行毎にランデータを生成し、隣接する行間のランデータ
パターンが異なる一対の行（位相変化ペア）を抽出し、
該一対の行の連なりによって線図形を表現し（縦位相表
現）、該縦位相表現を用いて文字を認識する方法を提案
した（特願平３−２５３１８６号）。この縦位相表現に
は、位相的情報である接続状態情報と、若干の量的情報
が含まれる。2. Description of the Related Art In a method for recognizing a binarized character image, the present applicant has previously generated run data for each line from binarized image data, and a run data pattern between adjacent lines can be obtained. Extract different pairs of rows (phase change pairs),
A method has been proposed in which a line figure is expressed by a series of the pair of lines (vertical phase expression) and a character is recognized using the vertical phase expression (Japanese Patent Application No. 3-253186). This vertical phase expression includes connection state information, which is topological information, and some quantitative information.

【０００３】すなわち、２値化された画像データ（図４
４）は、位相変化ペア４５１〜４５４で構成された縦位
相に表現される（図４５の４５５）。これはまたスタン
プ（白丸と黒丸）によっても表現される（図４６）。That is, binarized image data (see FIG.
4) is expressed in the vertical phase composed of the phase change pairs 451 to 454 (455 in FIG. 45). This is also represented by stamps (white and black circles) (Fig. 46).

【０００４】縦位相表現の接続状態情報を一つの文字列
で表現したものを、縦位相表現におけるトポロジーコー
ドといい、“モデル”とも呼ばれる。分類は、モデルが
辞書にあるものと同一か否かを判定することにより行
う。図４７には、１０種類のトポロジーコード（ただ
し、その他のコード＊を含む）を示す。例えば、数字
「２」については、ｄ，Ａ，ｐＩ，ｐとなる（図４
８）。先に提案した文字認識方法では、縦位相表現のト
ポロジーコードの組み合わせによって文字を大分類し
た。A representation of the connection state information in the vertical phase representation by one character string is called a topology code in the vertical phase representation and is also called a "model". Classification is done by determining whether the model is the same as in the dictionary. FIG. 47 shows ten types of topology codes (including other codes *). For example, the number "2" is d, A, pI, p (see FIG. 4).
8). In the previously proposed character recognition method, the characters are roughly classified according to the combination of the topology codes of the vertical phase representation.

【０００５】しかしながら、図４９に示すような文字を
認識する場合、文字“６”，“４”，“０”のいずれも
同一のトポロジーコード（ｄ，Ｉｄ，ＩＡ，ＶＩ，Ｖ，
ｐ）となるので、縦位相表現のトポロジーコードを用い
るだけでは不十分であった。このように、縦位相表現の
トポロジーコードは、文字の大分類までしか行うことが
できず、したがって更に詳細に分類を行う必要がある。However, when recognizing a character as shown in FIG. 49, all the characters "6", "4" and "0" have the same topology code (d, Id, IA, VI, V,
p), it is not enough to use the topology code of the vertical phase representation. As described above, the topology code of the vertical phase representation can perform only the major classification of characters, and thus the classification needs to be performed in more detail.

【０００６】このような点を解決した文字認識方法とし
て、本出願人は、ｇｉデータを用いた文字認識方法およ
び方向コードを用いた文字認識方法を既に提案した（特
願平４−５２２１３号）。すなわち、ｇｉデータによる
分類は、縦位相表現から容易に得られる量（数値）の情
報を用いて行う分類であり、縦位相表現に含まれている
位相変化ペアの高さ情報と、縦位相表現の示すランの位
置情報を用いる。そして、分類は、閾値による分類を組
み合わせた簡単な論理式によって行われる。As a character recognition method that solves such a point, the present applicant has already proposed a character recognition method using gi data and a character recognition method using a direction code (Japanese Patent Application No. 4-52213). .. That is, the classification based on the gi data is a classification performed using the amount (numerical value) information easily obtained from the vertical phase expression, and the height information of the phase change pair included in the vertical phase expression and the vertical phase expression. The position information of the run indicated by is used. Then, the classification is performed by a simple logical expression in which the classification by the threshold value is combined.

【０００７】例えば、図５０のトポロジーコード（ｄ，
Ｉｄ，ＩＡ，ＶＩ，Ｖ，ｐ）を生成する文字は、９８文
字であり、その内訳は文字“６”が９６例、文字“０”
および文字“４”がそれぞれ１例ずつであった。そし
て、図５０のモデル（ｄ，Ｉｄ，ＩＡ，ＶＩ，Ｖ，ｐ）
において、各位相変化ペアを上から順に、ｇｉ０，ｇｉ
１，・・・ｇｉ５とし、ｇｉ５とｇｉ４の高さの差を
ａ、ｇｉ１とｇｉ０の高さの差をｂ、ｇｉ４とｇｉ２の
高さの差をｃ、ｇｉ５とｇｉ０の高さの差をｈとしたと
き、上記したモデル（ｄ，Ｉｄ，ＩＡ，ＶＩ，Ｖ，ｐ）
に属する９８文字のデータは、ａ，ｂ，ｃ，ｈを用いた
次の条件によって完全に分離することができた。For example, the topology code (d,
The characters that generate Id, IA, VI, V, p) are 98 characters, the breakdown of which is 96 examples of the character "6" and the character "0".
And the letter "4" was one example each. Then, the model of FIG. 50 (d, Id, IA, VI, V, p)
In the above, each phase change pair is gi0, gi in order from the top.
1, ... gi5, the height difference between gi5 and gi4 is a, the height difference between gi1 and gi0 is b, the height difference between gi4 and gi2 is c, and the height difference between gi5 and gi0 is If h, then the above model (d, Id, IA, VI, V, p)
The data of 98 characters belonging to could be completely separated by the following conditions using a, b, c and h.

【０００８】文字“６”の条件：ａ／ｈ＜０．１５かつ
０．２５≦ｂ／ｈ＜０．８０文字“０”の条件：ａ／ｈ＜０．１５かつｂ／ｈ＜０．
１５文字“４”の条件：０．２５≦ａ／ｈ＜０．６０。Condition for character "6": a / h <0.15 and 0.25≤b / h <0.80 Condition for character "0": a / h <0.15 and b / h <0.
Condition of 15 characters “4”: 0.25 ≦ a / h <0.60.

【０００９】また、方向コードによる分類は、ｇｉデー
タを用いた方法では分類できない数字の分類を可能とす
るものであり、この方向コードは、輪郭の一部分に対し
て等間隔でサンプリング点をとり、各点の傾きをコード
化して並べたものである。分類は、文字認識用のオート
マトンを用いて行う。The direction code classification enables classification of numbers that cannot be classified by the method using gi data. This direction code takes sampling points at equal intervals with respect to a part of the contour, The inclination of each point is coded and arranged. Classification is performed using an automaton for character recognition.

【００１０】[0010]

【発明が解決しようとする課題】しかしながら、上記提
案した方法では、グループｄ，ｐに対してオートマトン
の作成が非常に困難になるという問題が生じた。すなわ
ち、グループｄ，ｐは、最も簡単な縦位相表現モデルで
あるとともに、出現頻度の高いモデルの一つでもある
（約１３０００文字の手書き数字データの内、２４６３
文字がそのモデルに該当し、第２位であった）。このグ
ループｄ，ｐには多くの字種（数字、１，２，３，５，
７）が含まれるばかりでなく、同一の字種であっても異
なる形と分類すべき多くの変形が含まれる。However, the method proposed above has a problem that it is very difficult to create an automaton for the groups d and p. That is, the groups d and p are not only the simplest model for expressing the vertical phase but also one of the models with a high frequency of appearance (of the handwritten numeral data of about 13,000 characters, 2463).
The letters corresponded to the model and were in second place). There are many character types (numbers, 1, 2, 3, 5, 5) in these groups d and p.
Not only 7) is included, but also many variations that should be classified as different shapes even if they are of the same character type are included.

【００１１】このような場合、他のモデルでは、ｇｉデ
ータによる分類が有効であり、字種、変形の数共に絞る
ことが可能である。しかし、グループｄ，ｐでは、一番
上の黒ランの位置と一番下の黒ランの位置しか分からな
いため、ｇｉデータによる分類は、ほとんど機能しな
い。In such a case, in other models, classification by gi data is effective, and both the character type and the number of deformations can be narrowed down. However, in the groups d and p, only the positions of the uppermost black run and the lowermost black run are known, so that the classification by gi data hardly functions.

【００１２】さらに、他のモデルでは、分類に有効な特
徴を含むよう、狭い範囲で輪郭の一部を選び、その部分
の方向コードに対して着目した特徴を考慮しながらオー
トマトンを作成することができた。これはまた、「まず
輪郭のａ部分で大分類し、次にｂ部分で分類し、・・
・」の如く、段階的な分類が有効に利用可能であること
をも意味していた。Further, in other models, it is possible to select a part of the contour within a narrow range so as to include a feature effective for classification, and to create an automaton while taking into consideration the feature focused on the direction code of that part. did it. It also says, "First, roughly classify in the a part of the contour, then in the b part, ...
It also meant that stepwise classification could be effectively used, such as "."

【００１３】ところが、グループｄ，ｐでは、輪郭の一
部として右側面または左側面（あるいは両側面）しか選
ぶことができず、このため多くの特徴が一度に方向コー
ドに影響を及ぼすことになり、分類に有効な輪郭の一部
を選ぶことができず、この結果、段階的な分類も不可能
となった。However, in the groups d and p, only the right side surface or the left side surface (or both side surfaces) can be selected as a part of the contour, and therefore many characteristics affect the direction code at one time. , It was not possible to select a part of the contour that is effective for classification, and as a result, stepwise classification was also impossible.

【００１４】このように、グループｄ，ｐは、多くの字
種と多種類の変形を有するにも係らず、ｇｉデータを用
いた分類ができず、さらに輪郭の部分指定も有効に機能
しないため、オートマトンの作成は非常に困難となり、
例えば、数字“１”および“７”（および若干の数字
“２”と“３”）を認識するオートマトンを作成した
が、約７００例のデータが認識不能となった。As described above, although the groups d and p have many character types and many kinds of deformation, they cannot be classified using gi data, and the contour portion designation does not function effectively. , Making automata becomes very difficult,
For example, an automaton that recognizes the numbers "1" and "7" (and some numbers "2" and "3") was created, but about 700 cases of data became unrecognizable.

【００１５】本発明の目的は、図形の接続状態を表すコ
ードを用いて文字（モデルｄ，ｐ）を大分類する文字認
識方法を提供することにある。An object of the present invention is to provide a character recognition method for roughly classifying characters (models d and p) using a code representing the connection state of figures.

【００１６】[0016]

【課題を解決するための手段】前記目的を達成するため
に、請求項１記載の発明では、２値化された画像データ
から列毎にランデータを生成し、隣接する列間のランデ
ータパターンが異なる位相変化ペアを抽出し、該位相変
化ペアで構成された横位相表現を用いて文字を認識する
方法において、前記横位相表現された図形の接続関係を
生成し、該接続関係に対応したコードを生成し、該コー
ドによって文字を分類することを特徴としている。In order to achieve the above object, in the invention according to claim 1, run data is generated for each column from binarized image data and a run data pattern between adjacent columns is generated. In the method of extracting a phase change pair different from each other, and recognizing a character by using a horizontal phase expression composed of the phase change pair, a connection relation of the horizontal phase represented figures is generated, and a correspondence relation is generated. A feature is that a code is generated and characters are classified by the code.

【００１７】請求項２記載の発明では、前記コードは、
少なくとも線の発生を表す第１の文字と、線の消失を表
す第２の文字と、線の合流を表す第３の文字と、線の分
岐を表す第４の文字と、変化のない線を表す第５の文字
とを組み合わせてなることを特徴としている。According to a second aspect of the invention, the code is
At least a first character indicating the occurrence of a line, a second character indicating the disappearance of the line, a third character indicating the joining of the lines, a fourth character indicating the branch of the line, and a line that does not change It is characterized in that it is formed by combining it with the fifth character that represents it.

【００１８】請求項３記載の発明では、縦位相表現によ
る文字の分類と、請求項１に記載の文字の分類とを組み
合わせたことを特徴としている。The invention according to claim 3 is characterized in that the character classification according to the vertical phase representation and the character classification according to claim 1 are combined.

【００１９】[0019]

【作用】２値化処理部では、画像を読み取って２値化
し、ランレングスデータ生成部においてランデータが生
成され、横位相表現データ生成部では、隣接する列のラ
ンレングスデータから位相変化ペアを抽出して横位相表
現を生成する。依存関係生成部は、横位相表現から依存
関係（グラフは、４種類のノード、白枝、黒枝、スター
ト、エンドからなる）を生成する。ｓｅｄｉｎコード生
成部は、依存関係から“ｓ”，“ｅ”，“ｄ”，
“ｉ”，“ｎ”とコンマの６文字の組み合わせからなる
コードを出力し、文字認識部では、ｓｅｄｉｎコードに
よって文字を大分類する。これにより、モデルｄ，ｐに
ついての分類が可能になる。In the binarization processing unit, the image is read and binarized, and the run length data generation unit generates run data. In the horizontal phase expression data generation unit, the phase change pair is generated from the run length data of the adjacent columns. Extract to generate a horizontal phase representation. The dependency relationship generation unit generates a dependency relationship (the graph includes four types of nodes, white branches, black branches, start, and end) from the horizontal phase representation. The sedin code generation unit determines "s", "e", "d", from the dependency relationship.
A code consisting of a combination of six characters "i" and "n" and a comma is output, and the character recognition unit roughly classifies the characters by the sedin code. As a result, the models d and p can be classified.

【００２０】[0020]

【実施例】以下、本発明の一実施例を図面を用いて具体
的に説明する。図１は、本発明の実施例のブロック構成
図である。図において、１は、画像を読み取って２値化
する２値化処理部、２は、２値化された画像データから
ランレングスデータを生成するランレングスデータ生成
部、３は、隣接する列のランレングスデータから位相変
化ペアを抽出して横位相表現を生成する横位相表現デー
タ生成部、４は、横位相表現から依存関係を生成する依
存関係生成部、５は、依存関係からｓｅｄｉｎコードを
生成するｓｅｄｉｎコード生成部、６は、文字認識部で
ある。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be specifically described below with reference to the drawings. FIG. 1 is a block diagram of an embodiment of the present invention. In the figure, 1 is a binarization processing unit that reads an image and binarizes it, 2 is a run-length data generation unit that generates run-length data from binarized image data, 3 is an adjacent column A horizontal phase expression data generation unit 4 that generates a horizontal phase expression by extracting a phase change pair from the run length data, a dependency relationship generation unit 4 that generates a dependency relationship from the horizontal phase expression, and a sedin code from the dependency relationship. The generated sedin code generation unit 6 is a character recognition unit.

【００２１】ここで、２値化処理部１、ランレングスデ
ータ生成部２は、前掲した出願に記載されたものと同一
の機能、構成を有するものである。また、本実施例で
は、横位相表現（つまり隣合う二列の接続状態の変化に
着目して得た縦位相表現）を用いているが、前掲した出
願に記載された縦位相表現と本質的な違いはない。図２
は、横位相表現の例とそのトポロジーコードを示す。Here, the binarization processing unit 1 and the run length data generation unit 2 have the same functions and configurations as those described in the above-mentioned application. Further, in the present embodiment, the horizontal phase expression (that is, the vertical phase expression obtained by paying attention to the change in the connection state of two adjacent columns) is used, but it is essentially the same as the vertical phase expression described in the above-mentioned application. There is no difference. Figure 2
Shows an example of horizontal phase representation and its topology code.

【００２２】ところで、横位相表現で手書き数字を分類
しようとすると、数字の３に対して多量のモデルができ
る（約１３０００字の手書き数字データに対して２４０
種類）。この中には、同一のモデルとみなすことができ
るグループが多数出現する。図３は、同一のモデルとみ
なすことができる１０種類の異なるトポロジーコードの
例を示す。前述したモデルｄ，ｐに限定しても、数字３
のモデルは７９種類となり、このモデルをｄ，ｐの詳細
分類に用いると記憶すべきモデル数が非常に多くなる。
そこで、本実施例では、図３に示した例に対し、同一の
文字列を出力する、トポロジーコードに代わる接続状態
の表現方法として、以下に詳述するｓｅｄｉｎコードを
提案するものである。By the way, when trying to classify handwritten numbers by the horizontal phase expression, a large number of models can be created for the number 3 (240 for handwritten number data of about 13,000 characters).
type). In this, many groups that can be regarded as the same model appear. FIG. 3 shows examples of 10 different topology codes that can be regarded as the same model. Even if limited to the models d and p described above, the number 3
There are 79 types of models, and if this model is used for the detailed classification of d and p, the number of models to be stored becomes very large.
In view of this, the present embodiment proposes the sedin code described below in detail as a method of expressing the same character string as the connection state alternative to the topology code, which is an alternative to the topology code shown in FIG.

【００２３】〈ｓｅｄｉｎコード〉ｓｅｄｉｎコード
は、図形の接続状態を一つの文字列で表現したもので、
これは幾つかのトポロジーコードを「同じ」とみなして
得た同値類別でもある。ｓｅｄｉｎコードは、“ｓ”，
“ｅ”，“ｄ”，“ｉ”，“ｎ”およびコンマの６文字
からなり、コンマで区切られた各部分を“スライス”と
呼ぶ。<Sedin code> The sedin code expresses the connection state of figures by one character string.
This is also an equivalence classification obtained by regarding some topology codes as “same”. The sedin code is "s",
Each part composed of 6 characters "e", "d", "i", "n" and a comma, and separated by a comma is called a "slice".

【００２４】ｓｅｄｉｎコードは、図４に示すように、
各スライスが表す接続状態を横につなげて得られた接続
状態を表している。各スライスは、コンマ以外の文字列
からなるが、これらはそれぞれ図５に示す接続状態を示
す。すなわち、ｓは、線の発生（Ｓｔａｒｔ）であり、
ｅは、線の消失（Ｅｎｄ）であり、ｄは、合流（Ｄｅｃ
ｒｅａｓｅ）であり、ｉは、分岐（Ｉｎｃｒｅａｓｅ）
であり、ｎは、変化のない線（Ｎｏｃｈａｎｇｅ）で
ある。スライスの接続状態は、図６に示すように、各文
字の表すグラフをその順序で上から下に並べたものであ
る。The sedin code is, as shown in FIG.
The connection state obtained by horizontally connecting the connection states represented by the slices is shown. Each slice consists of a character string other than commas, and these indicate the connection state shown in FIG. 5, respectively. That is, s is the generation of a line (Start),
e is the disappearance of the line (End), and d is the confluence (Dec).
i) and i is a branch (Increase)
And n is a line without change (No change). As shown in FIG. 6, the connection state of slices is a graph in which each character is arranged in that order from top to bottom.

【００２５】ｓｅｄｉｎコードは、以下に詳述するよう
にして生成される。まず、横位相表現から依存関係グラ
フが作成され、次いで、この依存関係グラフから文字列
であるｓｅｄｉｎコードが生成される。The sedin code is generated as described in detail below. First, a dependency graph is created from the horizontal phase representation, and then a sedin code that is a character string is generated from this dependency graph.

【００２６】〈依存関係グラフ〉依存関係グラフの例を
図７に示す。このグラフは、有向グラフであり、図８の
図形、および図２の横位相表現に対応している。図７の
依存関係グラフは、４種類のノード（前述したｓ，ｅ，
ｉ，ｄ）と２種類の枝（細線で表された白枝と太線で表
された黒枝）とｓｔａｒｔ、ｅｎｄからなる。枝を０〜
１６の番号で表し、ノードをＡ〜Ｈの記号で表してい
る。また、枝２，５，８，１０，１２，１３，１４は黒
枝で、枝０，１，３，４，６，７，９，１１，１５，１
６は白枝であり、各枝は、接続状態が変化しない白ラン
または黒ランの連なりに対応する。<Dependency Graph> An example of the dependency graph is shown in FIG. This graph is a directed graph and corresponds to the figure in FIG. 8 and the horizontal phase representation in FIG. The dependency graph of FIG. 7 has four types of nodes (s, e,
i, d), two types of branches (white branches represented by thin lines and black branches represented by thick lines), and start and end. 0 to branches
It is represented by the number 16 and the nodes are represented by symbols A to H. The branches 2, 5, 8, 10, 12, 13, 14 are black branches, and the branches 0, 1, 3, 4, 6, 7, 9, 11, 15, 1 are
6 is a white branch, and each branch corresponds to a series of white runs or black runs whose connection state does not change.

【００２７】また、依存関係グラフのノードは、それぞ
れ図９の極値点（輪郭点のｙ座標が極大値あるいは極小
値となる部分）に対応していて、依存関係グラフのエッ
ジは、それぞれ図１０の各領域に対応していて、これら
の横方向の接続関係を表したものが依存関係グラフであ
る。Further, the nodes of the dependency graph correspond to the extreme points (portions where the y-coordinates of the contour points are the maximum value or the minimum value) of FIG. 9, and the edges of the dependency graph are respectively shown in the figure. The dependency relationship graph corresponds to each of the ten areas and represents the connection relationship in the horizontal direction.

【００２８】依存関係グラフのノードは、接続状態の変
化するところに対応して作られる。ノードには、一本の
枝を入力とし、三本の枝を出力するものと、三本の枝を
入力とし、一本の枝を出力する、２種類のノードがあ
り、前者のノードを増加ノード、後者のノードを減少ノ
ードと呼ぶ。図１１に示すように、一つのノードでは表
現できない接続状態変化点は、基本となる増加ノードと
減少ノードを組み合わせることによって表現する。従っ
て、図１２に示す図形は、本来、図１３に示すような依
存関係グラフで表すべきであるが、これを基本ノードに
変換して、図１４のように表す。The nodes of the dependency graph are created corresponding to the change of the connection state. There are two types of nodes: one that inputs one branch and outputs three branches, and one that inputs three branches and outputs one branch. The node and the latter node are called reduced nodes. As shown in FIG. 11, a connection state change point that cannot be expressed by one node is expressed by combining a basic increasing node and a decreasing node. Therefore, the figure shown in FIG. 12 should originally be represented by a dependency graph as shown in FIG. 13, but this is converted into a basic node and represented as shown in FIG.

【００２９】また、依存関係グラフは、次のような形式
でメモリなどに保存される。すなわち、各枝の番号を用
いて全ての増加ノード、減少ノードを図１５に示すよう
に記録する。なお、この図には、ノードのタイプが書か
れていないが、実際にはこれらもメモリに格納される。
また、左端の枝の番号は０とする。図１５の各行（つま
り、各増加ノード、減少ノードに対応するフィールド）
を依存関係ルールと呼び、枝の順序は重要な情報である
ので、保存される。The dependency graph is stored in a memory or the like in the following format. That is, all increasing nodes and decreasing nodes are recorded as shown in FIG. 15 using the numbers of the branches. Although the node types are not shown in this figure, they are actually stored in the memory.
The number of the leftmost branch is 0. Each row in FIG. 15 (that is, fields corresponding to each increasing node and decreasing node)
Is called a dependency rule, and the order of the branches is important information, so it is preserved.

【００３０】〈依存関係生成部〉以下に、依存関係を生
成する方法について説明する。本実施例の生成手法は、
図７に示す依存関係を左右二つに分ける「断面」が、左
から右に移動していくという、モデルによって作成され
る。この断面は、図１６に示すように、上から下に枝番
号が連なったものであり、リストの形式で格納され、以
降これを断面リストという。断面リスト上の各ノード
は、依存関係の枝に相当し、枝番号を持ち、以降これを
断面ノードという。<Dependency Relationship Generation Unit> A method of generating a dependency relationship will be described below. The generation method of this embodiment is
The “cross section” that divides the dependency relationship shown in FIG. 7 into two parts, left and right, is created by a model that moves from left to right. As shown in FIG. 16, this cross section is a series of branch numbers from top to bottom and is stored in the form of a list, and this is hereinafter referred to as a cross section list. Each node on the cross-section list corresponds to a branch of the dependency relationship and has a branch number, and this is hereinafter referred to as a cross-section node.

【００３１】図１７、図１８、図１９は、依存関係を生
成する処理フローチャートである。本手法では、「原画
像は白地に黒の図形であり、端に触れていない」ことを
前提にしている。断面リストの初期状態は、「ｉｄ＝０
の枝（断面ノード）一つ」である。そして、位相変化ペ
アを越える度に、断面リストが右に移動し、依存関係ル
ールが作られる。終了時の断面リストは、「ｉｄ＝ｎの
断面ノード（枝）一つ」となる。なお、このｎを記録す
ることにより、以後の処理が若干簡単になる。FIG. 17, FIG. 18, and FIG. 19 are processing flow charts for generating a dependency relationship. In this method, it is premised that "the original image is a black figure on a white background and the edges are not touched". The initial state of the cross section list is "id = 0.
One branch (section node) ”. Then, each time the phase change pair is exceeded, the cross-section list moves to the right, and a dependency rule is created. The section list at the end is “one section node (branch) with id = n”. Incidentally, by recording this n, the subsequent processing is slightly simplified.

【００３２】枝の番号（つまり断面ノードの番号）は、
処理フローチャート中の「断面ノードの生成」ステップ
で付与される。断面ノード生成時には、前回生成した断
面ノードの番号を記録しておき、それより一つ多い番号
を持つノードを生成する。この処理によって、「枝」の
認識と枝番号の割り付けが行われることになる。The branch number (that is, the section node number) is
It is given in the step of "generation of section node" in the processing flowchart. When a cross-section node is generated, the number of the cross-section node generated last time is recorded, and a node having a number one more than that is generated. By this processing, the "branch" is recognized and the branch number is assigned.

【００３３】以下に、図２０に示す横位相表現から図１
５に示す依存関係を生成する場合を具体例にして説明す
る。図２０の横位相表現において、ｇ０，ｇ１は、始め
の２つの位相変化ペアを表し、図２１は、各位相変化ペ
アｇ０，ｇ１の内部表現であり、１０１〜１１２はそれ
ぞれスタンプを示し、各スタンプ間の直線はブランチを
示す。The horizontal phase representation shown in FIG.
A case where the dependency shown in FIG. 5 is generated will be described as a specific example. In the horizontal phase representation of FIG. 20, g0 and g1 represent the first two phase change pairs, FIG. 21 is an internal representation of each phase change pair g0 and g1, and 101 to 112 represent stamps, respectively. The straight line between stamps indicates a branch.

【００３４】図１７のステップ００１において、断面リ
ストが図２２（ａ）のように初期化される。図２２にお
いて、２０１から２０７で示される数字０から６は、断
面ノード番号であり、最初の断面ノード番号は、ｉｄ＝
０となる。ステップ００２では、図２０の初めの位相変
化ペアｇ０（＝ｇ）が処理対象となる。ステップ００３
では、ｕ＝（図２１のスタンプ１０１）、ｄ＝（図２１
のスタンプ１０２）とされる。ステップ００４では、ｃ
ｎ＝（図２２（ａ）の２０１）とされる。In step 001 of FIG. 17, the section list is initialized as shown in FIG. In FIG. 22, numerals 0 to 6 denoted by 201 to 207 are section node numbers, and the first section node number is id =
It becomes 0. In step 002, the first phase change pair g0 (= g) in FIG. 20 is processed. Step 003
Then, u = (stamp 101 in FIG. 21), d = (FIG. 21
Stamp 102). In step 004, c
n = (201 in FIG. 22A) is set.

【００３５】ステップ００５では、ｕもｄもｎｉｌでな
いので、ステップ００６に進む。ステップ００６におい
ては、ｕのｎｅｘｔが存在し、且つｕのｎｅｘｔのブラ
ンチ数が０で、且つｕのｎｅｘｔのｎｅｘｔの最初のブ
ランチがｄであるかがチェックされ、この場合、ｕのｎ
ｅｘｔが存在しないので、ステップ００８に進む。ステ
ップ００８で、ｄ（＝スタンプ１０２）のｎｅｘｔは、
スタンプ１０３であり存在し、スタンプ１０３のブラン
チ数は０であり、ｄ（＝スタンプ１０２）のｎｅｘｔの
ｎｅｘｔ（＝スタンプ１０４）の最初のブランチはｕ
（＝１０１）であるので、ステップ００９に進む。図１
９は、ステップ００９の処理を示す。At step 005, since neither u nor d is nil, the routine proceeds to step 006. In step 006, it is checked whether u next exists, the number of branches of u next is 0, and the first branch of next of u next is d, and in this case, n of u
Since ext does not exist, the process proceeds to step 008. In step 008, the next of d (= stamp 102) is
The stamp 103 exists, the number of branches of the stamp 103 is 0, and the first branch of the next (= stamp 104) of d (= stamp 102) is u.
Since (= 101), the process proceeds to step 009. Figure 1
9 indicates the processing of step 009.

【００３６】図１９のステップ０１５において、まず、
断面ノードが３つ生成される。このノード生成時に、こ
れまで使われていない枝番号を、これら３つのノードに
割り当てる。これまで使われているｉｄは０のみである
ので、ここでは１，２，３が割り当てられる。これらの
ｉｄを用いて、図１５の最初の行である、０−＞１，２，３という依存関係ルールが作成され、出力される。In step 015 of FIG. 19, first,
Three cross-section nodes are generated. When this node is generated, branch numbers that have not been used so far are assigned to these three nodes. Since the only id that has been used so far is 0, 1, 2, 3 are assigned here. Using these ids, a dependency rule of 0-> 1,2,3, which is the first line in FIG. 15, is created and output.

【００３７】次いで、ステップ０１６において、断面リ
ストの変形が行われ、図２２（ａ）の断面リストが図２
２（ｂ）になる。ステップ０１７で、ｃｎを３つの断面
ノードの内、最後のノードとし（ｃｎ＝２０４）、ｄ
（＝スタンプ１０２）を、ｄのｎｅｘｔのｎｅｘｔ、つ
まりｄ＝スタンプ１０４とする。Next, in step 016, the section list is modified so that the section list of FIG.
2 (b). In step 017, cn is set as the last node of the three cross-section nodes (cn = 204), and d
Let (= stamp 102) be the next of d's next, that is, d = stamp 104.

【００３８】図１７のステップ００５に戻り、ステップ
００５で、ｕ＝１０１，ｄ＝１０４であるので、ステッ
プ００６に進み、ｕのｎｅｘｔは存在しないので、ステ
ップ００８に進み、ステップ００８で、ｄのｎｅｘｔは
存在しないので、ステップ０１０に進む。Returning to step 005 in FIG. 17, since u = 101 and d = 104 in step 005, the process proceeds to step 006. Since there is no next of u, the process proceeds to step 008, and at step 008, d Since next does not exist, the process proceeds to step 010.

【００３９】ステップ０１０で、ｕは、ｕのｎｅｘｔと
され、つまりｕ＝ｎｉｌとされ、ｄについてもｄ＝ｎｉ
ｌとされ、ｃｎについてもｃｎ＝ｎｉｌ（ｅｎｄ）とさ
れる。ステップ００５で、ｕもｄもｎｉｌであるので、
ステップ０１１に進み、全ての位相変化ペアについて処
理が終了していないので（この時点では、位相変化ペア
ｇ０のみの処理が終了）、ステップ００２に戻る。In step 010, u is next to u, that is, u = nil, and d is also d = ni.
1 and cn is also set to cn = nil (end). Since u and d are both nil in step 005,
The process proceeds to step 011 and the processing is not completed for all the phase change pairs (at this point, the processing for only the phase change pair g0 is completed), and therefore the process returns to step 002.

【００４０】ステップ００２で、次の位相変化ペアｇ
を、ｇ＝ｇ１とする。ステップ００３で、ｕ＝スタンプ
１０５、ｄ＝スタンプ１０８とされる。ステップ００４
で、断面リストは、図２２（ｂ）に示すものであるの
で、ｃｎ＝２０２（最初のノード）となる。ステップ０
０５で、ｕ＝１０５≠ｎｉｌ，ｄ＝１０８≠ｎｉｌであ
るので、ステップ００６に進む。ステップ００６で、ｕ
のｎｅｘｔは、スタンプ１０６で存在するが、スタンプ
１０６のブランチ数は１であるので、ステップ００６の
条件を満たさず、ステップ００８に進む。In step 002, the next phase change pair g
Is set to g = g1. In step 003, u = stamp 105 and d = stamp 108. Step 004
Since the cross section list is that shown in FIG. 22B, cn = 202 (first node). Step 0
In 05, since u = 105 ≠ nil and d = 108 ≠ nil, the process proceeds to step 006. In step 006, u
Next exists in the stamp 106, but since the number of branches of the stamp 106 is 1, the condition of step 006 is not satisfied and the process proceeds to step 008.

【００４１】ステップ００８で、ｄのｎｅｘｔはスタン
プ１０９で存在し、スタンプ１０９のブランチ数は０
で、ｄのｎｅｘｔのｎｅｘｔはスタンプ１１０であり、
スタンプ１１０の最初のブランチはスタンプ１０５であ
り、これはｕに等しいので、ステップ００８の条件が満
たされ、ステップ００９に進む。In step 008, the next of d exists in the stamp 109, and the number of branches of the stamp 109 is 0.
And the next of d's next is the stamp 110,
The first branch of stamp 110 is stamp 105, which is equal to u, so the condition of step 008 is met and step 009 is proceeded to.

【００４２】図１９のステップ０１５において、新しく
生成された断面ノードのｉｄをそれぞれ４，５，６とす
る。ｃｎ＝２０２（ｉｄ＝１）であるので、図１５の第
２行の依存関係ルール１−＞４，５，６が生成され、出力される。ステップ０１６で、断面リス
トの変形が行われ、図２２（ｂ）から図２２（ｃ）にな
る。ステップ０１７で、ｃｎ＝２０７（図２２（ｃ））
とされるとともに、ｄ＝スタンプ１１０とされる。In step 015 of FIG. 19, the ids of the newly generated cross section nodes are set to 4, 5, and 6, respectively. Since cn = 202 (id = 1), the dependency rule 1-> 4,5,6 in the second row of FIG. 15 is generated and output. In step 016, the cross-section list is modified so as to be changed from FIG. 22B to FIG. In step 017, cn = 207 (FIG. 22 (c))
And d = stamp 110.

【００４３】図１７のステップ００５に戻り、ステップ
００５で、ｕ＝１０５≠ｎｉｌ，ｄ＝１１０≠ｎｉｌで
あるので、ステップ００６に進む。ステップ００６で、
ｕのｎｅｘｔは、スタンプ１０６で存在するが、スタン
プ１０６のブランチ数は１であるので、ステップ００６
の条件を満たさず、ステップ００８に進む。Returning to step 005 in FIG. 17, since u = 105 ≠ nil and d = 110 ≠ nil in step 005, the process proceeds to step 006. In step 006,
The u next exists in the stamp 106, but since the number of branches of the stamp 106 is 1, step 006.
If the condition is not satisfied, the process proceeds to step 008.

【００４４】ステップ００８で、ｄのｎｅｘｔはスタン
プ１１１で存在するが、スタンプ１１１のブランチ数は
１であるので、ステップ００８の条件を満たさず、ステ
ップ０１０に進む。ステップ０１０で、ｕ＝１０６，ｄ
＝１１１，ｃｎ＝２０３（図２２（ｃ））とされる。In step 008, the next of d exists in the stamp 111, but since the number of branches of the stamp 111 is 1, the condition of step 008 is not satisfied and the process proceeds to step 010. In step 010, u = 106, d
= 111, cn = 203 (FIG. 22 (c)).

【００４５】ステップ００５ではＹｅｓとなり、ステッ
プ００６ではＮｏとなって、ステップ００８に進み、ス
テップ００８でＮｏとなって、ステップ０１０に進み、
ステップ０１０で、ｕ＝スタンプ１０７、ｄ＝スタンプ
１１２，ｃｎ＝２０４（図２２（ｃ））とされる。ステ
ップ００５に戻り、ステップ００５は、Ｙｅｓ、ステッ
プ００６は、Ｎｏ、ステップ００８は、Ｎｏとなり、ス
テップ０１０で、ｕ＝ｎｉｌ，ｄ＝ｎｉｌ，ｃｎ＝ｎｉ
ｌ（ｅｎｄ）とされる。ステップ００５に戻り、ｕ＝ｎ
ｉｌ，ｄ＝ｎｉｌなので、Ｎｏとなり、ステップ０１１
で、まだ、位相変化ペアｇ０，ｇ１までしか処理が終了
していないので、ステップ００２に戻る。以下、同様に
して、全ての位相変化ペアについて処理され、依存関係
が生成される。Step 005 is Yes, Step 006 is No, Step 008 is proceeded to, Step 008 is No, Step 010 is proceeded to,
In step 010, u = stamp 107, d = stamp 112, and cn = 204 (FIG. 22C) are set. Returning to step 005, step 005 is Yes, step 006 is No, step 008 is No, and in step 010, u = nil, d = nil, cn = ni.
It is set to l (end). Return to step 005, u = n
Since il and d = nil, the result is No, and step 011
Since the processing has been completed up to the phase change pair g0 and g1, the process returns to step 002. Hereinafter, in the same manner, all the phase change pairs are processed to generate the dependency relationship.

【００４６】〈ｓｅｄｉｎコードの生成〉前述した依存
関係から文字列であるｓｅｄｉｎコードを生成する方法
について以下、説明する。ｓｅｄｉｎコードは、エネル
ギー伝搬という考え方に基づいて作成される。すなわ
ち、依存関係グラフ上で、各ノードおよび枝に「活性化
している／していない」の二つの状態を定め、初期状態
を「全てのノードおよび枝は非活性状態」とする。図１
５に示す依存関係をグラフで表すと、図２３になる（こ
の図は、既に説明した図７と同一の図である）。<Generation of sedin code> A method of generating a sedin code which is a character string from the above-described dependency will be described below. The sedin code is created based on the idea of energy propagation. That is, two states of "activated / not activated" are defined for each node and branch on the dependency graph, and the initial state is set to "all nodes and branches are inactive state". Figure 1
A graphical representation of the dependency shown in FIG. 5 is FIG. 23 (this figure is the same as FIG. 7 already described).

【００４７】まず、全てのノード、枝を非活性状態とす
る。図２３の全てのノード（ＡからＨ）、枝（０から１
６）は非活性状態にある。なお、図２４は、ノードと枝
の活性、非活性状態を表した図である。First, all nodes and branches are made inactive. All nodes (A to H), branches (0 to 1) in FIG.
6) is in the inactive state. Note that FIG. 24 is a diagram showing active and inactive states of nodes and branches.

【００４８】ステップ１で、まず枝０を活性化させる
（図２５）。ステップ２で、活性化しているノードから
出る全ての枝を活性化させる。この時点では活性化して
いるノードはないので、何もしない。ステップ３で、入
力の枝全てが活性化しているノードを全て活性化させ
る。入力の枝全てが活性化しているノードは、Ａのみで
あるので、ノードＡを活性化させる（図２６）。ステッ
プ４で、今新しく活性化したノードのコードを上から順
に出力する。上から順に辿るとき、これらノードにつな
がっていない黒枝があったら、その位置にコード“ｎ”
を出力する。今新しく活性化したノードのコード“ｓ”
を出力する。In step 1, branch 0 is first activated (FIG. 25). In step 2, all branches coming out of the activated node are activated. At this point, no node is active, so do nothing. In step 3, all the nodes whose input branches are activated are activated. Since only A is the node in which all the input branches are activated, node A is activated (FIG. 26). In step 4, the code of the newly activated node is output in order from the top. When tracing from top to bottom, if there is a black branch that is not connected to these nodes, the code is "n" at that position.
Is output. Code "s" of newly activated node
Is output.

【００４９】ステップ５で、全てのノードが活性化して
いたら処理を終了する。まだ、活性化していないノード
があるので、処理を続ける。ステップ６で、コンマを出
力する（全出力は、“ｓ，”となる）。ステップ２に戻
り、枝１，２，３が活性化される（図２７）。ステップ
３で、ノードＢが活性化される（図２８）。ステップ４
で、図２８を、ノードＢを辿るように左右に切ると、こ
の断面は黒枝２を通るので、コード“ｓｎ”が出力され
る。ステップ５で処理が続行され、ステップ６でコンマ
を出力する（全出力は、“ｓ，ｓｎ”となる）。ステッ
プ２に戻り、枝４，５が活性化される（図２９）。以
下、同様に処理され、最終的にはコード“ｓ，ｓｎ，ｎ
ｓｎ，ｎｉｎ，ｄｄ，ｎｅ，ｅ”が生成される。In step 5, if all the nodes have been activated, the process ends. Since there is a node that is not activated yet, the processing is continued. In step 6, a comma is output (the total output is "s,"). Returning to step 2, the branches 1, 2, 3 are activated (FIG. 27). In step 3, Node B is activated (FIG. 28). Step 4
Then, when FIG. 28 is cut to the left and right so as to follow the node B, this section passes through the black branch 2, so the code "sn" is output. The process is continued in step 5, and a comma is output in step 6 (all outputs are "s, sn"). Returning to step 2, the branches 4 and 5 are activated (FIG. 29). Thereafter, the same processing is performed, and finally the code “s, sn, n
sn, nin, dd, ne, e ″ are generated.

【００５０】本ｓｅｄｉｎコードは、横位相表現よりも
若干粗く文字を分類することができるが、図３で示した
水準に比べてまだ、詳細に分類しているので、本実施例
では、更に文字を粗く分類するための追加規則（１）、
（２）を用いる。この規則は、上記したステップ３で再
帰的に働く。以後、コードｓにあたる増加ノードを、ノ
ードｓ、コードｅにあたる減少ノードを、ノードｅと書
く。Although this sedin code can classify characters slightly coarser than the horizontal phase representation, since it classifies characters in detail compared to the level shown in FIG. 3, in the present embodiment, further character classification is performed. An additional rule (1) for roughly classifying
(2) is used. This rule works recursively in step 3 above. Hereinafter, the increasing node corresponding to the code s will be referred to as a node s, and the decreasing node corresponding to the code e will be referred to as a node e.

【００５１】（１）；ノードｓが活性化して、かつそれ
により生じた白枝の活性化により、別のノードｓの入力
が活性化されるならば、そのノードも活性化させる。（２）；ノードｅが活性化して、かつそれにより生じた
白枝の活性化により、別のノードｅの全ての入力が活性
化されるならば、そのノードも活性化させる。(1) If the node s is activated and the white branch generated thereby activates the input of another node s, that node is also activated. (2); If the node e is activated and the activation of the white branch caused thereby activates all the inputs of another node e, that node is also activated.

【００５２】この他にも、例えば次のような追加規則も
ある。（３）；ノードｅに関する規則であり、図３０に示すよ
うに、ノードｅの入力枝を上から順にａ，ｂ，ｃとし、
出力枝をｄとする。ステップ３において、ａ，ｂ，ｃ全
てではなく、真中の黒枝ｂのみが活性化していれば、そ
のノードｅを活性化させるが、ステップ２において、枝
ｄは、ａ，ｂ，ｃ全てが活性化するまで活性化させな
い。In addition to this, there are the following additional rules, for example. (3); a rule relating to node e. As shown in FIG. 30, the input branches of node e are a, b, and c in order from the top,
Let the output branch be d. In step 3, if only the middle black branch b is activated instead of all a, b, and c, the node e is activated, but in step 2, the branch d has all a, b, and c. Do not activate until activated.

【００５３】図３１、図３２、図３３は、依存関係（図
１５）から上記した追加規則（１）、（２）を用いて、
ｓｅｄｉｎコードを生成する処理フローチャートであ
る。本実施例の処理においては、どこまで活性化してい
るかを断面リストを用いて管理している。すなわち、断
面リストの左側は活性化しているノード、右側は活性化
していないノードを意味する。31, 32, and 33 use the above-mentioned additional rules (1) and (2) from the dependency relationship (FIG. 15),
It is a processing flowchart which produces | generates a sedin code. In the processing of the present embodiment, the extent of activation is managed using a cross section list. That is, the left side of the cross-section list means an activated node and the right side means a non-activated node.

【００５４】関数ｍａｋｅ−ｓｔｒｉｎｇ（図３２、３
３）は、断面リストを変形することでノードの活性化を
管理するとともに、コードｓ，ｉ，ｅ，ｄを出力する。
前述した追加規則は、この関数中に組み込まれている。
また、ステップ中に、ルールの左辺、右辺とあるのは、
図１５の依存関係ルールにおける矢印を挾んだ左辺と右
辺を指す。また、増加ノードに対応したルールの場合、
左辺は１個、右辺は３個のｉｄからなり、減少ノードに
対応したルールの場合、左辺は３個、右辺は１個のｉｄ
からなる。Function make-string (FIGS. 32 and 3)
3) manages the activation of the node by transforming the section list and outputs the codes s, i, e, d.
The additional rules mentioned above are incorporated into this function.
Also, in the step, the left side and right side of the rule are
It refers to the left side and the right side of the dependency rule in FIG. Also, in the case of a rule corresponding to an increasing node,
The left side consists of 1 id, and the right side consists of 3 ids. In the case of a rule corresponding to a decreasing node, the left side has 3 ids and the right side has 1 id.
Consists of.

【００５５】また、ステップ中に、「断面ノードｃｎに
マッチするルールはあるか？」という、記述の「ルー
ル」は、依存関係ルールであり、「マッチする」は、次
のことを意味する。すなわち、ルールの左辺が１項の場
合、断面ノードｃｎのｉｄが左辺と等しいこと、ルール
の左辺が３項の場合、ｃｎ，ｃｎのｎｅｘｔ，ｃｎのｎ
ｅｘｔのｎｅｘｔの３つの断面ノードが存在し、かつそ
れらのｉｄがこの順番でルールの左辺に対応しているこ
とを意味する。In the step, the "rule" in the description "Is there a rule that matches the section node cn?" Is a dependency rule, and "match" means the following. That is, if the left side of the rule is one term, the id of the cross-section node cn is equal to the left side, and if the left side of the rule is three terms, cn, cn next, and n of cn.
It means that there are three cross-section nodes next to ext, and their ids correspond in this order to the left side of the rule.

【００５６】以下に、図１５の依存関係から追加規則
（１）、（２）を用いて、ｓｅｄｉｎコード“ｓｓｓ，
ｎｉｎ，ｄｄ，ｅｅ”を生成する例を挙げて、図３１、
図３２、図３３の処理動作を説明する。Below, using the additional rules (1) and (2) from the dependency relation of FIG. 15, the sedin code “sss,
As an example of generating "nin, dd, ee", FIG.
The processing operation of FIGS. 32 and 33 will be described.

【００５７】図３１のステップ３０１において、依存関
係ルールの検索を高速かつ容易に行うために索引を作
る。ステップ３０２で、図３４（ａ）に示す断面リスト
を作成する。ステップ３０３で、使用していないルール
がまだ（この時点では全部）残っているので、Ｎｏとな
り、ステップ３０４に進む。In step 301 of FIG. 31, an index is created in order to search the dependency rule quickly and easily. In step 302, a cross section list shown in FIG. 34 (a) is created. In step 303, the unused rules still remain (all at this point), so the result is No and the process proceeds to step 304.

【００５８】ステップ３０４で、ｃｎ＝４０１（図３４
（ａ））となる。ステップ３０５で、枝の白黒を管理す
る変数ｗｈｉｔｅに１を代入する。ステップ３０６で、
ｃｎ＝４０１≠（ｅｎｄ）であるので、Ｎｏとなり、ス
テップ３０７に進み、ステップ３０７で、ｃｎのｉｄは
０であり、０のみを入力とするルール（０−＞１，２，
３）が存在するので、ｙｅｓとなり、ステップ３０８に
進む。In step 304, cn = 401 (FIG. 34)
(A)). In step 305, 1 is substituted for the variable white that manages the black and white of the branch. In step 306,
Since cn = 401 ≠ (end), the determination result is No, the process proceeds to step 307, and in step 307, the id of cn is 0, and a rule (0-> 1, 2,
Since 3) exists, the answer is yes, and the process proceeds to step 308.

【００５９】ステップ３０８では、ｃｎ＝４０１，ｗｈ
ｉｔｅ＝１で関数ｍａｋｅ−ｓｔｒｉｎｇを呼び出す
（ｃａｌｌ１）。呼び出された関数ｍａｋｅ−ｓｔｒ
ｉｎｇ（図３２）において、渡された引数をｃｎ＝４０
１，ｗｈｉｔｅ＝１とし（ステップ３１４）、ステップ
３１５で、マッチしたルール（０−＞１，２，３）は増
加ノードのものであるので、ｙｅｓとなり、ステップ３
１６になる。ステップ３１６で、生成した断面ノードｃ
１，ｃ２，ｃ３にルールの右辺のｉｄ（１，２，３）を
割り当て、断面リストを図３４（ｂ）のように変形す
る。In step 308, cn = 401, wh
The function make-string is called with ite = 1 (call 1). Called function make-str
In ing (FIG. 32), the passed argument is cn = 40
1, white = 1 (step 314), and in step 315, since the matched rule (0-> 1,2,3) belongs to the increasing node, it becomes yes, and step 3
Become 16. Section node c generated in step 316
IDs (1, 2, 3) on the right side of the rule are assigned to 1, c2, c3, and the cross-section list is transformed as shown in FIG. 34 (b).

【００６０】ステップ３１７で、ｃ３＝４０４なので、
ｃｎ＝４０４とする。ステップ３１８で、ｗｈｉｔｅ＝
１であるので、Ｙｅｓとなり、ステップ３１９で、ｃ１
＝４０２なので、ｃｃ＝４０２とする。ステップ３２０
で、ｃｃのｉｄは１であり、１のみを入力とするルール
が存在し（１−＞４，５，６）、かつこのルールは増加
ノードのものであるので、Ｙｅｓとなって、ステップ３
２１に進み、ｃｃ＝４０２，ｗｈｉｔｅ＝１を引数とし
て関数ｍａｋｅ−ｓｔｒｉｎｇを再帰的に呼び出す（ｃ
ａｌｌ２）。Since c3 = 404 in step 317,
Let cn = 404. In step 318, white =
Since it is 1, it becomes Yes, and in step 319, c1
= 402, so cc = 402. Step 320
Since the id of cc is 1, and there is a rule that inputs only 1 (1-> 4, 5, 6), and this rule is for the increasing node, the answer is Yes, and Step 3
21, the function make-string is recursively called with cc = 402 and white = 1 as arguments (c
all 2).

【００６１】ステップ３１４で、ローカル変数ｃｎ＝４
０２，ｗｈｉｔｅ＝１とする。ステップ３１５で、ルー
ルは増加ノードのものだったので、Ｙｅｓとなって、ス
テップ３１６で、断面ノードｃ１，ｃ２，ｃ３を生成
し、これらのｉｄをルールの右辺（４，５，６）の値と
する。断面リストは、図３４（ｃ）のように変形され
る。ステップ３１７で、ｃ３＝４０７なので、ｃｎ＝４
０７とする。ステップ３１８で、ｗｈｉｔｅ＝１なので
Ｙｅｓとなり、ステップ３１９で、ｃ１＝４０５なの
で、ｃｃ＝４０５とする。ステップ３２０で、ｃｃのｉ
ｄは４であり、４のみを入力とするルールは存在しな
い。また、ｃｃのｎｅｘｔ（４０６），ｃｃのｎｅｘｔ
のｎｅｘｔ（４０７）のｉｄはそれぞれ５，６であり、
４，５，６を入力とするルールを探しても存在しないの
で、ｎｏとなって、ステップ３２２に進み、ステップ３
２２でコード“ｓ”を出力する。In step 314, the local variable cn = 4
02, white = 1. In step 315, since the rule belongs to the increasing node, it becomes Yes, and in step 316, the cross-section nodes c1, c2, and c3 are generated, and these ids are the values on the right side (4,5, 6) of the rule. And The section list is transformed as shown in FIG. In step 317, c3 = 407, so cn = 4
07. In step 318, since white = 1, the result is Yes, and in step 319, c1 = 405, and therefore cc = 405. In step 320, i in cc
d is 4, and there is no rule that inputs 4 only. Also, cc next (406), cc next
Next (407) has ids 5 and 6, respectively,
Even if a rule with 4, 5, 6 as input is searched for, it does not exist, so the result is no, the process proceeds to step 322, and step 3
At 22, the code "s" is output.

【００６２】ステップ３２３で、ｃ３＝４０７なので、
ｃｃ＝４０７とする。ステップ３２４で、ｃｃのｉｄは
６であり、ｃｃのみを入力とするルールは存在し（６−
＞７，８，９）、かつまたそのルールは増加ノードのも
のであるので、Ｙｅｓとなってステップ３２５に進む。
ステップ３２５で、ｃｃ＝４０７，ｗｈｉｔｅ＝１を引
数として関数ｍａｋｅ−ｓｔｒｉｎｇを再帰的に呼び出
す（ｃａｌｌ３）。Since c3 = 407 in step 323,
Let cc = 407. In step 324, the id of cc is 6, and there is a rule that inputs cc only (6-
> 7,8,9) and again the rule is for the increasing node, so Yes and go to step 325.
In step 325, the function make-string is recursively called with cc = 407 and white = 1 as arguments (call 3).

【００６３】ステップ３１４で、ローカル変数ｃｎ＝４
０７，ｗｈｉｔｅ＝１とする。ステップ３１５で、ルー
ルは増加ノードのものだったので、Ｙｅｓとなって、ス
テップ３１６で、断面ノードｃ１，ｃ２，ｃ３を生成
し、これらのｉｄをルールの右辺（７，８，９）の値と
する。断面リストは、図３４（ｄ）のように変形され
る。ステップ３１７で、ｃ３＝４１０なので、ｃｎ＝４
１０とする。ステップ３１８で、ｗｈｉｔｅ＝１なので
Ｙｅｓとなり、ステップ３１９で、ｃ１＝４０８なの
で、ｃｃ＝４０８とする。ステップ３２０で、ｃｃのｉ
ｄは７であり、７のみを入力とするルールは存在しな
い。また、ｃｃのｎｅｘｔ（４０９），ｃｃのｎｅｘｔ
のｎｅｘｔ（４１０）のｉｄはそれぞれ８，９であり、
７，８，９を入力とするルールを探しても存在しないの
で、ｎｏとなって、ステップ３２２に進み、ステップ３
２２でコード“ｓ”を出力し、ここまでの処理で出力し
たコードは“ｓｓ”となる。In step 314, the local variable cn = 4
07 and white = 1. In step 315, since the rule belongs to the increasing node, it becomes Yes, and in step 316, the cross-section nodes c1, c2, and c3 are generated, and these ids are the values on the right side (7, 8, 9) of the rule. And The section list is transformed as shown in FIG. In step 317, c3 = 410, so cn = 4
Set to 10. In step 318, since white = 1, the result is Yes, and in step 319, c1 = 408, so cc = 408. In step 320, i in cc
d is 7, and there is no rule that inputs 7 only. Also, cc next (409), cc next
Next (410) has ids of 8 and 9, respectively.
Even if a rule with 7, 8 and 9 as input is searched for, it does not exist, so the result is no, the process proceeds to step 322, and
The code “s” is output at 22, and the code output in the processing up to this point becomes “ss”.

【００６４】ステップ３２３で、ｃｃ＝４１０となる。
ステップ３２４で、ｃｃのｉｄは９であり、９のみを入
力とするルールは存在しない。また９，２，３を入力と
するルールもないので、Ｎｏとなって、ステップ３２７
に進み、リターンバリューとしてｃｎ＝４１０を返す。At step 323, cc = 410.
In step 324, the id of cc is 9, and there is no rule that inputs 9 alone. Further, since there is no rule for inputting 9, 2 and 3, the result is No and step 327
And returns cn = 410 as the return value.

【００６５】ｃａｌｌ３に対して制御が戻る（ステッ
プ３２５）。変数ｃｃ，ｃ１，ｃ２，ｃ３，ｗｈｉｔｅ
はそれぞれ呼び出した時点の値、すなわち、４０７、４
０５、４０６、４０７、１に戻る。ｃｎは、呼び出した
関数のリターンバリューすなわち、４１０となる。ステ
ップ３２７で、リターンバリューとして４１０を返す。The control returns to the call 3 (step 325). Variables cc, c1, c2, c3, white
Are the values at the time of calling, that is, 407, 4
Return to 05, 406, 407, and 1. cn is the return value of the called function, that is, 410. In step 327, 410 is returned as the return value.

【００６６】ｃａｌｌ２に対して制御が戻る（ステッ
プ３２１）。変数ｃｎ，ｃｃ，ｃ１，ｃ２，ｃ３，ｗｈ
ｉｔｅはそれぞれ呼び出した時点の値、すなわち、４０
４、４０２、４０２、４０３、４０４、１に戻る。ステ
ップ３２２で、コード“ｓ”を出力し、これまでの全出
力は、“ｓｓｓ”となる。Control is returned to call 2 (step 321). Variables cn, cc, c1, c2, c3, wh
ite is the value at the time of each call, that is, 40
Return to 4, 402, 402, 403, 404, 1. In step 322, the code "s" is output, and all the outputs so far are "sss".

【００６７】ステップ３２３で、ｃｃ＝４０４となる。
ステップ３２４で、ｃｃのｉｄは３、３のみを入力とす
るルールはない。またｃｃのｎｅｘｔはｅｎｄなので、
減少ノードに相当するルールもないので、Ｎｏとなり、
ステップ３２７でリターンバリューとして４０４を返
す。At step 323, cc = 404.
In step 324, there is no rule that only inputs 3 and 3 as the id of cc. Also, because the cc next is end,
Since there is no rule corresponding to the decreasing node, it becomes No,
In step 327, 404 is returned as the return value.

【００６８】ｃａｌｌ１に制御が戻る（ステップ３０
８）。変数ｗｈｉｔｅは呼び出した時点の値、すなわち
１となる。ｃｎはリターンバリュー４０４となる。ステ
ップ３１１で、１−ｗｈｉｔｅ＝０（すなわち黒）、ｃ
ｎ＝ｅｎｄとなる。ステップ３０６で、ｃｎ＝ｅｎｄで
あるのでＹｅｓとなり、ステップ３１２で、コンマを出
力する。これまでの処理によって全出力は“ｓｓｓ”と
なる。Control returns to call 1 (step 30).
8). The variable white has a value at the time of calling, that is, 1. cn becomes the return value 404. In step 311, 1-white = 0 (that is, black), c
n = end. In step 306, since cn = end, the result is Yes, and in step 312, a comma is output. All the outputs become "sss" by the processing so far.

【００６９】以下、同様に処理して、最終的には、図１
５のルールからコード“ｓｓｓ，ｎｉｎ，ｄｄ，ｅｅ”
が得られる。Thereafter, the same processing is performed, and finally, FIG.
Code "sss, nin, dd, ee" from rule 5
Is obtained.

【００７０】上記したようにして生成される、本発明の
ｓｅｄｉｎコードの特徴について説明すると、例えば、
図３５に示す図形を従来からの手法である有向グラフで
表現すると、図３６に示すグラフとなる。ところが、図
３７、図３８、図３９に示すような図形に対しては全て
同一の有向グラフになり、各図形の分類が不可能にな
る。この対して、本発明のｓｅｄｉｎコードによると、
図３７、図３８、図３９に示す図形の分類が可能にな
る。図４０、図４１、図４２は、それぞれ図３７、図３
８、図３９に対応した依存関係グラフとｓｅｄｉｎコー
ドを示す。The features of the sedin code of the present invention generated as described above will be explained.
When the figure shown in FIG. 35 is represented by a directed graph which is a conventional method, the graph shown in FIG. 36 is obtained. However, for the graphics as shown in FIGS. 37, 38, and 39, all have the same directed graph, making it impossible to classify the graphics. On the other hand, according to the sedin code of the present invention,
The figures shown in FIGS. 37, 38, and 39 can be classified. 40, 41, and 42 are shown in FIGS. 37 and 3, respectively.
8 shows a dependency graph and sedin code corresponding to FIG. 39.

【００７１】また、図４３に示すように、本発明のｓｅ
ｄｉｎコードの各文字と、スタンプとを対応させること
ができるという特徴もある。従って、スタンプを指定
し、そのスタンプに対応するランの位置が分かるので、
本出願人が既に提案した「ｇｉデータによる分類」と同
様の方法で、つまり例えば、選択されたスタンプからＬ
１，Ｌ２を算出し、Ｌ１／Ｌ２によって文字を分類する
ことができる。Further, as shown in FIG.
Another feature is that each character of the din code can be associated with a stamp. Therefore, you can specify the stamp and know the position of the run corresponding to that stamp.
In the same manner as the “classification by gi data” that the applicant has already proposed, that is, for example, from the selected stamp, L
It is possible to calculate 1, L2 and classify characters by L1 / L2.

【００７２】さらに、各スタンプは、グラフ上で接続し
ている相手が分かるので（前掲した出願に記載のｎｅｘ
ｔ，ｐｒｅｖ，ｐａｒｔｎｅｒ，ｂｒａｎｃｈの各情
報）、ｓｅｄｉｎコード中の文字に対応したスタンプの
ｎｅｘｔ−＞ｐａｒｔｎｅｒに対応したランの位置情報
を利用したり、あるいはあるスタンプのｂｒａｎｃｈ
［０］−＞ｎｅｘｔ−＞ｎｅｘｔに対応したランの位置
情報などを利用して、文字を分類することも可能であ
る。Further, since each stamp can identify the connected party on the graph (see the above-mentioned application
(t, prev, partner, branch information), the position information of the run corresponding to the next-> partner of the stamp corresponding to the character in the sedin code is used, or the branch of a stamp is used.
It is also possible to classify characters by using the position information of the run corresponding to [0]->next-> next.

【００７３】本実施例の文字認識部６においては、モデ
ルｄ，ｐに関して次のようにして文字の分類を行う。す
なわち、（１）；縦位相表現のトポロジーコードによって文字を
大分類し、さらにｇｉデータおよび方向コードを用いた
詳細分類によって認識可能な文字については、該手法に
よって認識を行う。（２）；上記した（１）の手法で認識できない文字につ
いては、図形を横位相で表現し、ｓｅｄｉｎコードによ
り分類する。（３）；さらに、ｓｅｄｉｎコード中の各文字に対応す
る原図形中のランの位置を用いて「ｇｉデータによる分
類」と同様の方法で分類を行う。In the character recognition unit 6 of this embodiment, the characters are classified for the models d and p as follows. That is, (1); the characters are roughly classified according to the topology code of the vertical phase representation, and the characters that can be recognized by the detailed classification using the gi data and the direction code are recognized by the method. (2); For the characters that cannot be recognized by the above method (1), the figure is expressed in the horizontal phase and classified by the sedin code. (3); Further, using the position of the run in the original figure corresponding to each character in the sedin code, classification is performed by the same method as "classification by gi data".

【００７４】横位相表現だけで１３５４９字の手書き数
字データを大分類しようとすると、数字“３”のモデル
が多くなる（２４０種類）が、これを横位相で表現し、
ｓｅｄｉｎコードにより分類すると、３４種類のモデル
で済む。モデルｄ，ｐの数字“３”に限定すると、横位
相のみで７９種類のモデルが必要となるのに対し、ｓｅ
ｄｉｎコードでは８種類のモデルで済む。If the handwritten numeral data of 13549 characters is roughly classified only by the horizontal phase representation, the number of models of the number "3" increases (240 types), but this is expressed by the horizontal phase.
If classified by the sedin code, 34 types of models are sufficient. If models d and p are limited to the number “3”, 79 types of models are required only for the lateral phase, whereas se
With the din code, only 8 models are required.

【００７５】また、上記したデータ全体を縦位相表現の
トポロジーコードで分類すると、２９３種類のモデルが
必要になり、このうちトポロジーコードのみで判別でき
ないものが５６種類、１２５７６文字であった。Further, if the above-mentioned whole data is classified by the topology code of the vertical phase representation, 293 kinds of models are required, and of these, 56 kinds and 12576 characters cannot be identified only by the topology code.

【００７６】これに対して、横位相表現のｓｅｄｉｎコ
ードで文字を大分類した場合、ｓｅｄｉｎコードは２７
８種類、このうちｓｅｄｉｎコードのみで判別できない
ものが３８種類、９９４２文字であった。すなわち、本
実施例の横位相表現のｓｅｄｉｎコードの方が分解能が
高く、これにより横位相表現のｓｅｄｉｎコードを用い
て図形の大分類を行う文字認識方法が実現可能となっ
た。On the other hand, when the characters are roughly classified by the horizontal phase expression sedin code, the sedin code is 27.
Eight types, of which 38 types and 9,942 characters could not be identified only by the sedin code. That is, the horizontal phase expression sedin code of this embodiment has a higher resolution, which makes it possible to realize a character recognition method for classifying figures by using the horizontal phase expression sedin code.

【００７７】[0077]

【発明の効果】以上、説明したように、請求項１記載の
発明によれば、横位相表現された図形の接続関係に対応
したコードによって文字を大分類しているので、分類の
精度が高く、またモデル数を少なくすることができる。As described above, according to the invention described in claim 1, since the characters are largely classified by the code corresponding to the connection relation of the figures expressed in the horizontal phase, the classification accuracy is high. Also, the number of models can be reduced.

【００７８】請求項２記載の発明によれば、ｓｅｄｉｎ
コードを用いてモデルｄ，ｐを分類することが可能とな
り、分類されたデータ量を少なくすることができる。According to the invention of claim 2, sedin
It becomes possible to classify the models d and p using a code, and the amount of classified data can be reduced.

【００７９】請求項３記載の発明によれば、縦位相表現
による分類と横位相表現による分類とを組み合わせて文
字を分類しているので、大分類の精度を向上させること
ができる。According to the third aspect of the invention, since the characters are classified by combining the classification by the vertical phase representation and the classification by the horizontal phase representation, the accuracy of the major classification can be improved.

[Brief description of drawings]

【図１】本発明の実施例のブロック構成図である。FIG. 1 is a block diagram of an embodiment of the present invention.

【図２】横位相表現の例とそのトポロジーコードを示す
図である。FIG. 2 is a diagram showing an example of a horizontal phase expression and its topology code.

【図３】同一のモデルとみなすことができる１０種類の
異なるトポロジーコードの例を示す図である。FIG. 3 is a diagram illustrating an example of ten different topology codes that can be regarded as the same model.

【図４】本発明のｓｅｄｉｎコードを例示した図であ
る。FIG. 4 is a diagram illustrating a sedin code of the present invention.

【図５】ｓｅｄｉｎコードを構成する各文字（ｓ，ｅ，
ｉ，ｄ，ｎ）の接続状態を示す図である。[Fig. 5] Each character (s, e,
It is a figure which shows the connection state of i, d, n).

【図６】ｓｅｄｉｎコードと、ｓｅｄｉｎコードの表す
接続状態を示す図である。FIG. 6 is a diagram showing a sedin code and a connection state represented by the sedin code.

【図７】依存関係グラフの例を示す図である。FIG. 7 is a diagram showing an example of a dependency graph.

【図８】図７の依存関係グラフに対応する図である。8 is a diagram corresponding to the dependency graph of FIG. 7. FIG.

【図９】依存関係グラフのノードに対応した極値点を示
す図である。FIG. 9 is a diagram showing extreme points corresponding to nodes in the dependency graph.

【図１０】依存関係グラフのエッジに対応した図形の各
領域を示す図である。FIG. 10 is a diagram showing each region of a graphic corresponding to an edge of a dependency graph.

【図１１】一つのノードでは表現できない接続状態変化
点を示す図である。FIG. 11 is a diagram showing connection state change points that cannot be expressed by one node.

【図１２】図形の一例を示す図である。FIG. 12 is a diagram showing an example of a graphic.

【図１３】図１２の図形を依存関係グラフで表した図で
ある。FIG. 13 is a diagram showing the graphic of FIG. 12 as a dependency graph.

【図１４】図１３の依存関係グラフを基本ノードに変換
した図である。FIG. 14 is a diagram in which the dependency graph of FIG. 13 is converted into basic nodes.

【図１５】依存関係をメモリに格納するときの形式を示
す図である。FIG. 15 is a diagram showing a format in which a dependency relationship is stored in a memory.

【図１６】断面リストを示す図である。FIG. 16 is a diagram showing a cross-section list.

【図１７】依存関係を生成処理するフローチャートであ
る。FIG. 17 is a flowchart of a dependency relationship generation process.

【図１８】図１７のフローチャートの一部詳細フローで
ある。FIG. 18 is a partial detailed flow of the flowchart of FIG.

【図１９】図１７のフローチャートの一部詳細フローで
ある。19 is a partial detailed flow of the flowchart of FIG.

【図２０】横位相表現の一例を示す図である。FIG. 20 is a diagram showing an example of horizontal phase representation.

【図２１】位相変化ペアの内部表現図である。FIG. 21 is an internal representation diagram of a phase change pair.

【図２２】（ａ），（ｂ），（ｃ）は、断面リストが変
形する過程を示す図である。22 (a), (b), and (c) are diagrams showing a process in which the cross-section list is deformed.

【図２３】全てのノード、枝が非活性状態にある依存関
係グラフである。FIG. 23 is a dependency graph in which all nodes and branches are inactive.

【図２４】ノードと枝の活性、非活性状態を表した図で
ある。FIG. 24 is a diagram showing active and inactive states of nodes and branches.

【図２５】ノード、枝が非活性状態から活性状態に変化
する依存関係グラフである。FIG. 25 is a dependency graph in which nodes and branches change from an inactive state to an active state.

【図２６】ノード、枝が非活性状態から活性状態に変化
する依存関係グラフである。FIG. 26 is a dependency graph in which nodes and branches change from an inactive state to an active state.

【図２７】ノード、枝が非活性状態から活性状態に変化
する依存関係グラフである。FIG. 27 is a dependency graph in which nodes and branches change from an inactive state to an active state.

【図２８】ノード、枝が非活性状態から活性状態に変化
する依存関係グラフである。FIG. 28 is a dependency graph in which nodes and branches change from an inactive state to an active state.

【図２９】ノード、枝が非活性状態から活性状態に変化
する依存関係グラフである。FIG. 29 is a dependency graph in which nodes and branches change from an inactive state to an active state.

【図３０】ノードｅに関する規則を説明する図である。FIG. 30 is a diagram illustrating a rule regarding node e.

【図３１】ｓｅｄｉｎコードを生成する処理フローチャ
ートである。FIG. 31 is a processing flowchart for generating a sedin code.

【図３２】関数ｍａｋｅ−ｓｔｒｉｎｇの処理フローチ
ャートである。FIG. 32 is a processing flowchart of a function make-string.

【図３３】関数ｍａｋｅ−ｓｔｒｉｎｇの処理フローチ
ャートの続きである。FIG. 33 is a continuation of the processing flowchart of the function make-string.

【図３４】（ａ），（ｂ），（ｃ），（ｄ）は、断面リ
ストの変形を示す図である。34 (a), (b), (c) and (d) are diagrams showing modifications of the cross-section list.

【図３５】図形の一例である。FIG. 35 is an example of a graphic.

【図３６】図３５の図形を有向グラフで表した図であ
る。36 is a diagram showing the figure of FIG. 35 as a directed graph.

【図３７】有向グラフ表現では区別できない図形の例で
ある。FIG. 37 is an example of a figure that cannot be distinguished by the directed graph representation.

【図３８】有向グラフ表現では区別できない図形の例で
ある。FIG. 38 is an example of a figure that cannot be distinguished by the directed graph representation.

【図３９】有向グラフ表現では区別できない図形の例で
ある。FIG. 39 is an example of a figure that cannot be distinguished by the directed graph representation.

【図４０】図３７に対応した依存関係グラフとｓｅｄｉ
ｎコードを示す図である。40 is a dependency graph and sedi corresponding to FIG.
It is a figure which shows an n code.

【図４１】図３８に対応した依存関係グラフとｓｅｄｉ
ｎコードを示す図である。41 is a dependency graph and sedi corresponding to FIG.
It is a figure which shows an n code.

【図４２】図３９に対応した依存関係グラフとｓｅｄｉ
ｎコードを示す図である。42 is a dependency graph and sedi corresponding to FIG. 39;
It is a figure which shows an n code.

【図４３】ｓｅｄｉｎコードの各文字とスタンプの対応
を説明する図である。FIG. 43 is a diagram for explaining the correspondence between each character of the sedin code and the stamp.

【図４４】２値化された画像データである。FIG. 44 is binarized image data.

【図４５】図４４の縦位相表現図である。45 is a vertical phase representation diagram of FIG. 44.

【図４６】図４５をスタンプで表した図である。FIG. 46 is a view showing FIG. 45 by a stamp.

【図４７】縦位相表現のトポロジーコードを示す図であ
る。FIG. 47 is a diagram showing a topology code for vertical phase expression.

【図４８】数字２の縦位相表現とトポロジーコードを示
す図である。FIG. 48 is a diagram showing a vertical phase representation of number 2 and a topology code.

【図４９】トポロジーコードのみでは文字を分類できな
い例を示す図である。FIG. 49 is a diagram showing an example in which characters cannot be classified only by the topology code.

【図５０】モデル（ｄ，Ｉｄ，ＩＡ，ＶＩ，Ｖ，ｐ）の
ｇｉデータによる分類を説明する図である。FIG. 50 is a diagram illustrating classification of models (d, Id, IA, VI, V, p) by gi data.

[Explanation of symbols]

１２値化処理部２ランレングスデータ生成部３横位相表現データ生成部４依存関係生成部５ｓｅｄｉｎコード生成部６文字認識部 1 Binarization processing unit 2 Run length data generation unit 3 Horizontal phase expression data generation unit 4 Dependency relation generation unit 5 sedin code generation unit 6 Character recognition unit

Claims

[Claims]

1. A run data is generated for each column from binarized image data, a phase change pair having a different run data pattern between adjacent columns is extracted, and a horizontal phase representation composed of the phase change pair. In the method of recognizing characters using
A character recognition method characterized in that a connection relation of the figures expressed in the horizontal phase is generated, a code corresponding to the connection relation is generated, and characters are classified by the code.

2. The code includes at least a first character indicating occurrence of a line, a second character indicating disappearance of the line, a third character indicating confluence of the lines, and a fourth character indicating branching of the lines. And the
The character recognition method according to claim 1, wherein the character recognition method is a combination with a fifth character representing a line that does not change.

3. A character recognition method, characterized in that the character classification according to the vertical phase representation and the character classification according to claim 1 are combined.