JP3313272B2

JP3313272B2 - Address reading method and identification function weight vector generation method

Info

Publication number: JP3313272B2
Application number: JP00043896A
Authority: JP
Inventors: 昌史古賀; 達彦影広; 裕酒匂; 浩道藤澤; 日佐男緒方; 好博嶋; 正人寺本
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1996-01-08
Filing date: 1996-01-08
Publication date: 2002-08-12
Anticipated expiration: 2016-01-08
Also published as: JPH09185681A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、郵便物等の文書上
に記載された住所を読み取り、文書を区分する、住所読
取り区分装置にかかわる。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an address reading / separating apparatus which reads an address written on a document such as a mail and sorts the document.

【０００２】[0002]

【従来の技術】住所が記載されている文字の並び（住所
文字行）から住所を読み取るためには、以下の３つの機
能が必要である。2. Description of the Related Art The following three functions are required to read an address from a character sequence (address character line) in which the address is described.

【０００３】（１）文字パターンを切り出す。（文字切
り出し）（２）各々の文字パターンの文字種（文字コード）を識
別する。（文字識別）（３）文字の識別結果を住所文字列として解釈する。
（文字列照合）これらの内、手書きで書かれた文字行は文字のピッチの
ばらつきが大きいため、（１）の文字切り出しが特に難
しい。(1) Cut out a character pattern. (Character cutout) (2) The character type (character code) of each character pattern is identified. (Character identification) (3) The character identification result is interpreted as an address character string.
(Character string collation) Of these, character lines written by hand have large variations in character pitch, and thus character extraction in (1) is particularly difficult.

【０００４】従来から、手書きの文字行から正確に文字
を切り出すためには、多重仮説検定法が有効であること
が知られている。これは、文字の切り出し方に複数の仮
説を立て、文字パターンの候補を切り出した後、文字識
別や文字列照合によって正しい仮説を決定する方式であ
る。例えば、文字識別結果で仮説を検定する藤澤らの方
式（昭和５９年度電子通信学会総合全国大会１５５８
「接触した手書き文字の自動分離を行う文字切り出し方
式」）がある。また、文字の外形により、文字パターン
の候補の切り出しの成否を判定する方式として石寺らの
方式（１９９５年電子情報通信学会総合大会Ｄー５７６
「手書き住所読み取りのための文字切り出し法」）があ
る。また、文字識別結果および文字列照合結果により、
仮説を検定する方式としては、村瀬等の方式（信学論
（D）vol. J69-D、 No. 9「言語情報を利用した手書き
文字列からの文字切り出しと認識」）、および大井の方
式（信学技報PRU92-40「住所読取りにおける丁目・街区
認識方式」）等がある。Conventionally, it has been known that a multiple hypothesis testing method is effective in accurately extracting a character from a handwritten character line. This is a method in which a plurality of hypotheses are set for character extraction, character pattern candidates are extracted, and then a correct hypothesis is determined by character identification or character string collation. For example, Fujisawa et al.'S method of testing a hypothesis based on the results of character identification (1558, IEICE General Conference 1558)
“Character cutout method that automatically separates touched handwritten characters”). Also, as a method for determining the success or failure of cutting out a character pattern candidate based on the outline of a character, the method of Ishidera et al. (1995 IEICE General Conference D-576).
"Character segmentation method for reading handwritten addresses"). Also, according to the character identification result and the character string collation result,
Murase et al.'S method for testing hypotheses (IEICE (D) vol. J69-D, No. 9 "Character extraction and recognition from handwritten character strings using linguistic information") and Oi's method (Technical Report PRU92-40 "Chome / Block Recognition Method in Address Reading").

【０００５】この多重仮説検定方式とは、文字パターン
の候補を抽出し、文字識別や文字列照合の結果を用いて
検定する方式である。例えば、図１９（ｂ）に示す住所
文字列に対しては、図２０（ａ）に示すような文字パタ
ーンの候補が生成される。図２０（ａ）の中では、パタ
ーンとパターンの境界を節、パターンを弧とするグラフ
で、文字パターンの候補の関係が表現されている。この
ように、文字パターンの候補の関係をグラフで表現した
ものを切り出しグラフと呼ぶ。このように文字パターン
の候補の関係を表現すれば、文字パターンの切り出し
は、切り出しグラフの始点から終点へ至る最適な経路を
見いだす問題に置き換えられる。[0005] The multiple hypothesis test method is a method in which candidates for character patterns are extracted and tested using the results of character identification and character string collation. For example, for the address character string shown in FIG. 19B, character pattern candidates as shown in FIG. 20A are generated. In FIG. 20A, the relationship between character pattern candidates is represented by a graph in which a boundary between patterns is a node and the pattern is an arc. In this manner, the relationship between the character pattern candidates represented by a graph is referred to as a cut-out graph. By expressing the relationship between the character pattern candidates in this way, the extraction of the character pattern can be replaced with the problem of finding the optimal path from the start point to the end point of the extraction graph.

【０００６】[0006]

【発明が解決しようとする課題】以上述べたように従来
技術においては、例えば、図１９の（ａ）と（ｂ）の住
所は、どちらも（ｃ）に示すようなパターンを含んでい
る。（ｃ）を、１文字（「板」）に切出すか、２文字
「水反」と切り出すべきかは、文字識別結果を用いる従
来方式では判断できない。なぜなら、どちらの切り出し
方でも、切り出されたパターンを文字識別した結果、高
い類似度が得られるからである。また仮に、「中住市板
町」と「中住市水反町」の両方が住所としてあり得るな
ら、上記の二つの切り出し方のいずれが正しいかは、文
字列照合を用いる従来の方式でも判断できない。文字パ
ターンの大きさと前後のパターンとの位置関係を利用す
れば、このような場合でも正しく文字を切り出すことが
できる。例えば、（ａ）は文字の間隔を大きく取って書
かれているのに対し、（ｃ）中のパターンの間隙ｇは小
さい。したがって、（ｃ）は１文字だと判断できる。こ
れに対し、（ｂ）は文字の間隔を小さく書かれており、
（ｃ）中のパターンの間隙ｇは文字の間隔として解釈す
ることは妥当である。したがって、（ｂ）では２文字だ
と判断できる。As described above, in the prior art, for example, the addresses shown in FIGS. 19A and 19B both include the pattern shown in FIG. 19C. Whether the character (c) should be cut into one character (“plate”) or two characters “water counter” cannot be determined by the conventional method using the character identification result. This is because, in either of the extraction methods, a high similarity is obtained as a result of character identification of the extracted pattern. Also, if both "Takamachi, Nakazumi City" and "Mizutani, Nakazumi City" can be used as the address, which of the above two cutting methods is correct is determined by the conventional method using character string matching. Can not. If the positional relationship between the size of the character pattern and the preceding and succeeding patterns is used, characters can be correctly cut out even in such a case. For example, (a) is written with a large space between characters, whereas the gap g of the pattern in (c) is small. Therefore, it can be determined that (c) is one character. On the other hand, (b) is written with a small character interval,
It is appropriate to interpret the gap g of the pattern in (c) as the space between characters. Therefore, in (b), it can be determined that there are two characters.

【０００７】一方、図２０（ａ）から、文字識別結果に
より不適切な弧を削除すると、図２０（ｂ）のような切
り出しグラフになる。しかし、図１９（ｃ）の部分に
は、複数の切り出し方が候補として残っており、従来の
方法では文字切り出しを誤る可能性がある。しかし、従
来の方式では、このような前後のパターンとの関係に基
づいて文字の切り出し方を決定するのは困難であった。
例えば、藤沢等の方式、石寺らの方式は、パターン一つ
一つの妥当性を判断するものであり、前後のパターンと
の関係を利用することはできない。また、大井の方式、
村瀬等の方式は、文字列照合で各パターンの前後との関
係を利用しているが、前後の文字との間隔などの相対的
特徴量の情報は利用できない。On the other hand, if an inappropriate arc is deleted from the character identification result in FIG. 20A, a cut-out graph as shown in FIG. 20B is obtained. However, in the portion of FIG. 19C, a plurality of extraction methods remain as candidates, and there is a possibility that character extraction is erroneous in the conventional method. However, in the conventional method, it is difficult to determine how to cut out characters based on such a relationship with the preceding and following patterns.
For example, the method of Fujisawa et al. And the method of Ishidera et al. Determine the validity of each pattern, and cannot use the relationship with the preceding and succeeding patterns. Also, Oi's method,
The method of Murase et al. Uses the relationship between the front and back of each pattern in character string collation, but cannot use information on relative feature amounts such as the spacing between the preceding and following characters.

【０００８】本発明が解決しようとする課題は、多重仮
説検定法に関わることで、文字識別結果および文字列照
合のみでは正しく切出の候補の検定ができない住所文字
行から、注目パターンおよびその前後のパターンの相対
的特徴量を利用し、正確に文字を切り出す事である。The problem to be solved by the present invention is related to the multiple hypothesis test method, in which a character pattern is extracted from an address character line that cannot be correctly detected as a candidate for extraction by only character identification results and character string collation. The character is accurately cut out using the relative feature amount of the pattern.

【０００９】[0009]

【課題を解決するための手段】上記の課題を解決するた
め、本発明では、多重仮説検定方式に、文字パターンの
大きさや前後のパターンとの位置関係に基づく評価値
（概形ペナルティ）を導入する。また、本発明は、種々
の切り出し誤りの仮説に対応したペナルティ算出手段を
設けることにより、大きさや位置関係の情報を容易に扱
えることを特徴とする。ここで、図２０において節４と
節６の間に張られる弧に対応するパターン（以下「パタ
ーン４−６」と呼ぶ）を例に挙げて、本方法の原理を説
明する。まず、パターン４−６近傍は、実は仮説２０２
の破線で示された矩形のように切り出されるべきであっ
たと仮定し、この仮説が妥当かどうかを文字パターンの
大きさや前後のパターンとの位置関係に基づき評価す
る。同様に、切り出し方の辞書２０１中の７つの仮説に
基づきパターン４−６を評価する。この場合、以下の理
由により、仮説２０３が妥当と判断される。In order to solve the above-mentioned problems, the present invention introduces an evaluation value (rough penalty) based on the size of a character pattern and the positional relationship with preceding and succeeding patterns in a multiple hypothesis testing method. I do. Further, the present invention is characterized in that information of size and positional relationship can be easily handled by providing a penalty calculating means corresponding to various cutoff error hypotheses. Here, the principle of the present method will be described with reference to an example of a pattern (hereinafter, referred to as “pattern 4-6”) corresponding to an arc stretched between nodes 4 and 6 in FIG. First, in the vicinity of the pattern 4-6, the hypothesis 202
It is assumed that it should have been cut out like a rectangle shown by a broken line, and the validity of this hypothesis is evaluated based on the size of the character pattern and the positional relationship with the preceding and following patterns. Similarly, the pattern 4-6 is evaluated based on seven hypotheses in the cutout dictionary 201. In this case, the hypothesis 203 is determined to be valid for the following reason.

【００１０】（１）パターン４−５の間隔とパターン
５−６の間隔は、前後の間隔（パターン４−６に対する
パターン６−７、パターン３−４の間隔）と比べて十分
大きい。(1) The interval between the pattern 4-5 and the interval between the patterns 5-6 is sufficiently larger than the interval before and after (the interval between the pattern 6-7 and the pattern 3-4 with respect to the pattern 4-6).

【００１１】（２）パターン４−５とパターン５−６
は、いずれも幅が十分に広い。(2) Pattern 4-5 and Pattern 5-6
Are all wide enough.

【００１２】このように、仮説２０１中のいづれかの仮
説が妥当と判断された場合には、当該パターン（この例
ではパターン４−６）を切り出しグラフから削除する。
なお、このような判断は、あらかじめ収集したサンプル
を学習して求めた線形識別関数により行なう。このこと
により、切り出しグラフから弧を削減し、従来の方式で
は対応困難であった文字行から高い精度で文字パターン
が切り出せるようになる。また、このように切り出しの
誤り方を体系化することにより、切り出し候補の検定に
必要なパラメータの学習が容易になる。As described above, when any one of the hypotheses 201 is determined to be valid, the pattern (in this example, pattern 4-6) is cut out and deleted from the graph.
Such a determination is made by a linear discriminant function obtained by learning samples collected in advance. As a result, arcs can be reduced from the cut-out graph, and a character pattern can be cut out with high precision from a character line that was difficult to handle in the conventional method. In addition, by systematizing the cutout error in this way, it becomes easy to learn the parameters necessary for the test of the cutout candidate.

【００１３】以上のようにして本発明の実現方法は説明
されるが、このために、請求項１においては、本発明の
対象となる住所読取装置において、画像上から住所の記
載されている文字行を抽出する文字行抽出処理を行い、
ここで文字パターンを一意に定められない場合は複数の
文字パターンの候補を切り出すパターン切出処理を行
い、この切り出されたパターンを文字識別する文字識別
処理を行う。続いて、想定し得る文字切り出しの各種の
誤り方毎に、それぞれの読み出されたパターンが当該の
誤り方で切り出されたパターンであると言う仮説の妥当
性を、各パターン及びその前後複数のパターンとの相対
的特徴量に基づき概形ペナルティとして算出し、その後
に文字識別結果と概形ペナルティに応じて文字パターン
を絞り込み、予め記憶してある住所文字列の辞書と照合
する住所辞書照合処理を行うことにより住所読み取りを
行っている。The method of realizing the present invention is described above. For this purpose, according to the first aspect of the present invention, in an address reading device to which the present invention is applied, a character in which an address is described from an image on an image is described. Perform character line extraction processing to extract lines,
If the character pattern cannot be uniquely determined, pattern extraction processing for extracting a plurality of character pattern candidates is performed, and character identification processing for character identifying the extracted pattern is performed. Subsequently, for each of the various types of conceivable character segmentation errors, the validity of the hypothesis that each read pattern is a pattern segmented in the error manner is determined for each pattern and a plurality of before and after the hypothesis. An address dictionary matching process that calculates an approximate penalty based on the relative feature amount with the pattern, then narrows down the character pattern according to the character identification result and the approximate penalty, and matches it with a dictionary of address character strings stored in advance. The address is read.

【００１４】請求項２においては、上記各概形ペナルテ
ィを算出するために、正しく切り出されたパターン候補
と、誤って切り出されたパターン候補とを識別する識別
関数を用いている。In the second aspect, in order to calculate each of the outline penalties, an identification function for identifying a correctly extracted pattern candidate and an erroneously extracted pattern candidate is used.

【００１５】請求項３においては、上記識別関数におい
て、少なくとも各パターンの前後のパターンとの間隔を
特徴量の一つとして処理している。According to a third aspect of the present invention, in the discrimination function, at least an interval between each pattern and a pattern before and after each pattern is processed as one of the feature amounts.

【００１６】請求項４においては、上記識別関数におい
て、少なくとも各パターンの前後のパターン中の間隙を
特徴量の一つとしている。According to a fourth aspect of the present invention, in the discrimination function, at least a gap in a pattern before and after each pattern is set as one of the feature quantities.

【００１７】請求項５においては、上記識別関数を作成
するために、先ず文字行の画像を入力データとし、文字
パターンの切り出し方が一意に定められない場合は複数
の文字パターンの候補を切り出し、これらの文字パター
ンの候補が正しく切り出されているか否か、また、誤っ
ている場合には如何なる種類の誤りかを分類することに
より、各文字パターンの候補の切り出し誤りを判別し、
この結果に応じてパターン候補を分類格納し、更に分類
格納されているパターン候補を用いて識別関数を学習決
定する方法としている。In the fifth aspect, in order to create the identification function, first, an image of a character line is used as input data, and a plurality of character pattern candidates are cut out when a method of cutting out a character pattern is not uniquely determined. Whether or not these character pattern candidates are correctly cut out, and if they are wrong, by classifying what kind of error, the cutout error of each character pattern candidate is determined,
According to this method, the pattern candidates are classified and stored, and the classification function is learned and determined using the classified and stored pattern candidates.

【００１８】[0018]

【発明の実施の形態】本発明の実施の形態の一例の構成
を、図１のデータ・フロー・ダイアグラムを用いて説明
する。本明細書では、データ・フロー・ダイアグラムは
ゲーン・サクソン記法に従う（Ｊ．マーチン「ソフトウ
エア構造化技法」近代科学社、ISBN4- 7649 - 0124 - 2
C3050 P5562E）。本例は、住所の記載された面を光電
変換して得られた全体画像を入力とし、住所読取結果の
住所文字列を出力とする住所認識処理である。まず、文
字行抽出処理３０１により、全体画像から住所文字行を
抽出する。次に、文字パターン候補抽出処理３０２は、
住所文字行中より文字パターンの候補を抽出し、切り出
しグラフを生成する。概形ペナルティ算出処理３０３
は、各パターン候補の概形ペナルティ（ｐ）を算出す
る。文字識別処理３０４は、各文字パターン候補を文字
識別し、複数の候補文字のコードと各候補文字の類似度
を出力する。パターン確信度算出処理３０５は、文字識
別結果の類似度、概形ペナルティを基に、各パターン候
補の確からしさ（パターン確信度）を算出する。住所辞
書照合処理３０６は、パターン確信度に基づきパターン
候補を取捨選択し、文字識別結果の候補文字を住所文字
列辞書と照合する。DESCRIPTION OF THE PREFERRED EMBODIMENTS The configuration of an embodiment of the present invention will be described with reference to a data flow diagram of FIG. As used herein, data flow diagrams follow the Gaine-Saxon notation (J. Martin "Software Structuring Techniques", Modern Science, ISBN 4-7649-0124-2.
C3050 P5562E). This example is an address recognition process in which an entire image obtained by photoelectrically converting a surface on which an address is described is input and an address character string of an address read result is output. First, a character line extraction process 301 extracts an address character line from the entire image. Next, the character pattern candidate extraction processing 302
Character pattern candidates are extracted from the address character line, and a cut-out graph is generated. Outline penalty calculation processing 303
Calculates the approximate penalty (p) of each pattern candidate. The character identification processing 304 identifies each character pattern candidate, and outputs codes of a plurality of candidate characters and the degree of similarity between the candidate characters. The pattern certainty calculation process 305 calculates the certainty (pattern certainty) of each pattern candidate based on the similarity of the character identification result and the approximate penalty. The address dictionary matching process 306 selects and discriminates pattern candidates based on the pattern certainty factor, and matches candidate characters of the character identification result with the address character string dictionary.

【００１９】図２は、本例の処理の手順を示す。まず、
ステップ４０１で、全体画像より文字行を抽出する。次
に、ステップ４０２で文字行中より、文字パターンの候
補を抽出し、切り出しグラフを生成する。次に、切り出
しグラフ中の全ての文字パターン候補について、制御ル
ープ４０３の中で、ステップ４０５からステップ４０７
を繰り返す。制御ループ４０３の中では、まずステップ
４０５で、各文字パターン候補の概形ペナルティを算出
する。次に、各文字パターン候補をステップ４０６で文
字識別する。次に、ステップ４０７で、切り出しグラフ
上の各弧に対応するパターンの確信度を計算する。制御
ループ４０３を終了後、ステップ４０４で、文字識別結
果を住所辞書と照合する。FIG. 2 shows the procedure of the processing of this embodiment. First,
In step 401, a character line is extracted from the entire image. Next, in step 402, character pattern candidates are extracted from the character line, and a cut-out graph is generated. Next, for all character pattern candidates in the cut-out graph, steps 405 to 407 are performed in the control loop 403.
repeat. In the control loop 403, first, in step 405, the approximate penalty of each character pattern candidate is calculated. Next, in step 406, each character pattern candidate is identified. Next, in step 407, the certainty factor of the pattern corresponding to each arc on the cut-out graph is calculated. After the control loop 403 ends, in step 404, the character identification result is compared with the address dictionary.

【００２０】次に、文字行抽出処理３０１の入力および
出力を、図３を用いて説明する。文字行抽出処理３０１
は、全体画像中の郵便物の像５０１から、住所文字行を
抽出する処理である。住所文字行とは、宛名の都道府県
名、市名から、丁目・番地までを含む矩形の領域５０２
のことである。宛名が２行以上に及ぶ場合は、行数分だ
けの矩型領域５０２を出力する。このように住所文字行
を抽出する方法としては、東芝レビュー1993 Vol. 48 N
o. 7「郵便機械における画像処理技術」３章３節にある
様な方法を用いる。Next, the input and output of the character line extraction processing 301 will be described with reference to FIG. Character line extraction processing 301
Is a process for extracting an address character line from the postal matter image 501 in the entire image. The address character line is a rectangular area 502 that includes from the name of the prefecture and city of the address to the street and address.
That is. When the address extends over two or more lines, rectangular areas 502 corresponding to the number of lines are output. As a method for extracting address character lines in this way, Toshiba Review 1993 Vol. 48 N
o. Use the method described in Chapter 3 Section 3 of "Image Processing Technology for Postal Machines".

【００２１】次に、文字パターン候補抽出処理３０２の
原理を、図４を用いて説明する。まず、住所文字行よ
り、文字の境界の候補を抽出する。図中で０から９まで
の番号（境界番号）を付与している縦棒が、境界の候補
である。境界の候補は、信学技報IE88-138「不定ピッチ
文字列を含む印刷文書における印刷文書における文字切
出手法」に記載があるような統合矩形同士の間隙とす
る。ここで、境界の右のパターンの左端のｘ座標から境
界の左のパターンの右端のｘ座標を引いた値を境界間
隙、境界の右のパターンの左端のｘ座標から境界の左の
パターンの右端のｘ座標の平均を境界座標と定義する。
例えば、境界番号４に対応する境界座標は６０４のｘ座
標、境界間隙は６０５で表される幅となる。次に、境界
座標の差が、文字行の高さから推定した文字サイズを越
えないような境界の組み合わせを調べ、それら境界に挾
まれるパターンを文字パターンの候補として登録する。
図４の例では、境界座標の差６０１及び６０２は推定し
た文字サイズを越えず、境界座標の差６０３は越える。
このため、パターン０−１とパターン０−２は登録する
が、パターン０−３は登録しない。Next, the principle of the character pattern candidate extraction processing 302 will be described with reference to FIG. First, character boundary candidates are extracted from the address character line. The vertical bars to which numbers (boundary numbers) from 0 to 9 are assigned in the drawing are candidates for the boundary. A candidate for the boundary is a gap between integrated rectangles as described in IEICE Technical Report IE88-138, "Character extraction method in a printed document containing a character string with an indefinite pitch". Here, a value obtained by subtracting the x coordinate of the right end of the pattern on the left side of the pattern from the x coordinate of the left end of the pattern on the right side of the boundary is the boundary gap, and the right end of the pattern on the left side of the boundary from the x coordinate of the left end of the right pattern of the boundary. Is defined as boundary coordinates.
For example, the boundary coordinate corresponding to the boundary number 4 is the x coordinate of 604, and the boundary gap is the width represented by 605. Next, a combination of boundaries is checked so that the difference between the boundary coordinates does not exceed the character size estimated from the height of the character line, and a pattern sandwiched between the boundaries is registered as a character pattern candidate.
In the example of FIG. 4, the difference 601 between the boundary coordinates does not exceed the estimated character size, and the difference 603 between the boundary coordinates does.
Therefore, the pattern 0-1 and the pattern 0-2 are registered, but the pattern 0-3 is not registered.

【００２２】図５は、文字パターン候補抽出処理３０２
で生成される切り出しグラフの弧を表現するパターンテ
ーブルの形式を示す。パターンテーブルの各レコード
が、それぞれパターン候補一つに対応する。パターンテ
ーブルの各レコードには、パターンの輪郭を格納する領
域７０１がり、パターンの輪郭はチェーンコード等で記
述する。また、パターンの左側の境界の番号７０２、右
側の境界の番号を格納する領域７０３があり、境界の番
号は、図４に示すように、文字行の左端を０番とし、左
から右へ順に１刻みに増えるよう定める。さらに、文字
識別の結果の候補文字を格納する領域７０４があり、本
実施の形態では、類似度が高い順に最大３文字の候補文
字を格納する。上記領域７０４に格納した候補文字の類
似度を格納する領域７０５があり、候補文字、類似度は
左詰めとし、候補文字数が３以下の場合には、候補文字
の領域の余白にはヌルコードを、類似度の領域の余白に
は０を埋める。なお、パターン確信度を格納するために
領域７０６がある。FIG. 5 shows a character pattern candidate extraction process 302.
Shows the format of a pattern table that represents the arcs of the cut-out graph generated by. Each record of the pattern table corresponds to one pattern candidate. Each record of the pattern table has an area 701 for storing the outline of the pattern, and the outline of the pattern is described by a chain code or the like. In addition, there is an area 703 for storing the number 702 of the left boundary of the pattern and the number of the right boundary, and the number of the boundary is 0 in the left end of the character line, as shown in FIG. Decide to increase in increments of one. Further, there is an area 704 for storing candidate characters as a result of character identification. In the present embodiment, up to three candidate characters are stored in descending order of similarity. There is an area 705 for storing the similarity of the candidate characters stored in the area 704. The candidate characters and the similarity are left-justified. If the number of candidate characters is 3 or less, a null code is set in the blank of the candidate character area. Zeros are filled in the margins of the areas of similarity. Note that there is an area 706 for storing the pattern certainty factor.

【００２３】図６は、文字パターン候補抽出処理３０２
で生成される切り出しグラフの節を表現する境界テーブ
ルの形式を示す。本テーブルの各レコードが１つの境界
を表す。各レコードは境界番号８０１、境界座標８０
２、境界間隙８０３をそれぞれ格納する領域である。FIG. 6 shows a character pattern candidate extraction process 302.
Shows the format of the boundary table that represents the nodes of the cut-out graph generated by. Each record in this table represents one boundary. Each record has a boundary number 801 and boundary coordinates 80
2. This is an area for storing the boundary gap 803.

【００２４】文字識別処理３０４は、例えば森「パター
ン認識」（ISBN4 - 88552 - 075 -4 C3055、電子情報通
信学会）pp. 32 - 109 に記載のあるような既知の方式
を用いて実現する。文字識別対象のカテゴリーは、漢
字、平仮名、片仮名、アラビア数字および記号のうち、
地名もしくは丁目・街区で用いられるものとする。文字
識別の出力としては、複数の候補文字および各候補文字
の標準パターンに対する入力パターンの類似度が得られ
るものとする。The character identification processing 304 is realized by using a known method as described in, for example, Mori “Pattern Recognition” (ISBN4-88552-075-4C3055, IEICE) pp. 32-109. Categories for character identification include kanji, hiragana, katakana, Arabic numerals and symbols.
It shall be used for the place name or the street / block. As the output of character identification, a plurality of candidate characters and the similarity of the input pattern to the standard pattern of each candidate character are obtained.

【００２５】次に、概形ペナルティ算出処理３０３の構
成を、図７のデータ・フロー・ダイアグラムを用いて説
明する。本処理の入力である文字パターン候補は、パタ
ーンテーブル中の１つのレコードおよび境界テーブルで
表現される。ここで、各パターンの候補に対し、それが
誤って切り出されたものであるという仮定を評価する処
理として、切り出し誤り仮説評価処理９０１、９０２、
９０３等を行う。本実施の形態では、図８に示す切り出
し誤りの種類Ｅ１からＥ７に対応する７種類の切り出し
誤り仮説評価処理を用いる。切り出し誤り仮説評価処理
の出力ｐｉは、大きければ大きいほど切り出し誤り仮説
が確からしい事を示す。これらの切り出し誤り仮説評価
の出力ｐｉは、処理９０４で加算され、概形ペナルティ
ｐとして出力される。Next, the configuration of the outline penalty calculation processing 303 will be described with reference to the data flow diagram of FIG. A character pattern candidate which is an input of this process is represented by one record in the pattern table and a boundary table. Here, as processing for evaluating the assumption that each pattern candidate is erroneously cut out, each of the pattern error candidates is evaluated by a cut-out error hypothesis evaluation processing 901, 902,
903 and so on. In the present embodiment, seven types of extraction error hypothesis evaluation processes corresponding to the types of extraction errors E1 to E7 shown in FIG. 8 are used. The output pi of the extraction error hypothesis evaluation process indicates that the larger the output pi, the more likely the extraction error hypothesis is. The outputs pi of these cut-out error hypothesis evaluations are added in the process 904 and output as the approximate penalty p.

【００２６】図８は、切り出し誤りの仮説の種類を説明
する図である。図中で、黒い太線は注目しているパター
ン候補を、破線の矩形は正しい文字パターンの外接矩形
を、白抜きの線は注目しているパターン候補の周辺のパ
ターンの一部を表す。例えば、Ｅ１は注目しているパタ
ーン候補が実は文字の左側の一部を誤って切り出したも
のであるという仮説を表す。また、Ｅ７は注目している
パターン候補が実は２文字を誤って１つのパターンとし
たものであるという仮説を示す。FIG. 8 is a diagram for explaining the types of hypotheses of cutout errors. In the figure, a thick black line represents a pattern candidate of interest, a dashed rectangle represents a circumscribed rectangle of a correct character pattern, and a white line represents a part of a pattern around the pattern candidate of interest. For example, E1 represents a hypothesis that the pattern candidate of interest is actually a part of the left side of a character cut out by mistake. E7 indicates a hypothesis that the pattern candidate of interest is actually one pattern in which two characters are erroneously set.

【００２７】図９は、切り出し誤り仮説評価処理の構成
を示すデータ・フロー・ダイアグラムである。本処理の
入力であるパターン候補はパターンテーブル中の１つの
レコードおよび境界テーブルで表現される。ここで、仮
説評価処理１１０１は、図７の仮説評価処理９０１、９
０２、９０３等の切り出し誤り仮説評価処理のいづれか
に対応する。特徴量抽出処理１１０２は、入力された文
字パターン候補から、文字パターンの大きさや前後のパ
ターンとの位置関係等の特徴量を抽出する。特徴量は、
以下の式で表わされるｎ次元のベクトルとする。FIG. 9 is a data flow diagram showing the configuration of the cut-out error hypothesis evaluation process. A pattern candidate which is an input of this processing is represented by one record in the pattern table and a boundary table. Here, the hypothesis evaluation processing 1101 is the hypothesis evaluation processing 901 and 9 shown in FIG.
02, 903, etc., corresponding to any of the cut-out error hypothesis evaluation processes. The feature amount extraction processing 1102 extracts feature amounts such as the size of the character pattern and the positional relationship with the preceding and following patterns from the input character pattern candidate. Features are
Let it be an n-dimensional vector represented by the following equation.

【００２８】Ｆ＝（ｆ１、ｆ２、・・、ｆｎ）次に、処理１１０３で、特徴量Ｆより、ｐｉをもとめ
る。ｐｉは、正しく切り出された候補パターンとＥｉの
ような誤りをした候補パターンを識別する線形識別関数
の値であり、以下のような式で定義できる。F = (f1, f2,..., Fn) Next, in processing 1103, pi is obtained from the feature value F. pi is a value of a linear discriminant function for identifying a correctly extracted candidate pattern and an erroneous candidate pattern such as Ei, and can be defined by the following expression.

【００２９】ｐｉ＝Ｆ・Ｖｉ＋ｃｉ（Ｆ・Ｖ
ｉ：ＶｉとＦの内積）ここでＶｉ：線形識別関数の重みベクトルｃｉ：定数Ｖｉ、ｃｉは、後述する方法で予め学習しておき、パラ
メータ辞書１１０４に記憶しておく。本例とは別の例と
して、文字識別結果の候補文字に応じて、パラメータ辞
書を切替えてもよい。Pi = F · Vi + ci (F · V
i: inner product of Vi and F) Here, Vi: weight vector of the linear discriminant function ci: constants Vi and ci are learned in advance by a method described later and stored in the parameter dictionary 1104. As another example different from this example, the parameter dictionary may be switched according to the candidate character of the character identification result.

【００３０】図１０に、本例で用いる特徴量を説明す
る。図中で、黒い太線で表されたパターン１２０１は注
目している文字パターン候補を、白抜きで表されたパタ
ーン１２０２、１２０３は周辺のパターンを示す。ま
た、破線の矩形は、各パターンに外接する矩形を示す。
本例では、特徴量の次元数ｎ＝６とし、各次元ｆｊは以
下のように定義する。FIG. 10 illustrates the feature values used in this example. In the figure, a pattern 1201 represented by a thick black line indicates a character pattern candidate of interest, and patterns 1202 and 1203 represented by white indicate peripheral patterns. The dashed rectangle indicates a rectangle circumscribing each pattern.
In this example, the number of dimensions of the feature amount is n = 6, and each dimension fj is defined as follows.

【００３１】ｆ１：注目しているパターンの高さｆ２：注目しているパターンの幅ｆ３：注目しているパターンと左隣のパターンの間隔ｆ４：注目しているパターンと右隣のパターンの間隔ｆ５：注目しているパターン中の最大の間隙ｆ６：注目しているパターン中の連結成分数本例では、全ての切り出し誤り仮説評価処理で同じ特徴
量を用いるが、各々の処理で異なる特徴量を用いてもよ
い。また、各特徴量を、文字行の高さｈ等の文字行の全
体的な特徴量で正規化してもよい。F1: height of the pattern of interest f2: width of the pattern of interest f3: interval between the pattern of interest and the pattern on the left f4: interval between the pattern of interest and the pattern on the right f5: Maximum gap in the pattern of interest f6: Number of connected components in the pattern of interest In this example, the same feature is used in all the cut-out error hypothesis evaluation processes, but different features are used in each process. May be used. Further, each feature amount may be normalized by the overall feature amount of the character line such as the character line height h.

【００３２】図１１は、切り出し誤り仮説評価法の原理
を示す。座標軸１３０１、１３０２は、実際にはｎ次元
である特徴空間を仮想的に示す。パターングループ１３
０３は、正しく切り出されている文字パターン候補の特
徴量Ｆの分布、パターングループ１３０４は、切り出し
誤り仮説Ｅｉが真である文字パターン候補の特徴量Ｆの
分布を示す。図中のWｉは、パターングループ１３０３
及び１３０４を識別する識別関数の重みベクトルであ
り、これらグループ１３０３と１３０４を識別する識別
平面Ｂ１３０５であり、Ｗｉと直交する。ここでａ：原点から超平面Ｂまでのユークリッド距離とすると、超平面Ｂは、以下の式を満たすＦの集合とし
て表わされる。FIG. 11 shows the principle of the method for evaluating a cut-out error hypothesis. Coordinate axes 1301 and 1302 virtually show a feature space that is actually n-dimensional. Pattern group 13
03 indicates the distribution of the feature amount F of the correctly extracted character pattern candidate, and the pattern group 1304 indicates the distribution of the feature amount F of the character pattern candidate for which the extraction error hypothesis Ei is true. Wi in the figure is a pattern group 1303
And 1304, which are identification function weight vectors for identifying the groups 1303 and 1304, which are orthogonal to Wi. Here, assuming that a is the Euclidean distance from the origin to the hyperplane B, the hyperplane B is expressed as a set of F satisfying the following equation.

【００３３】（Ｗｉ・Ｆ）＝ａ・｜Ｗｉ｜（Ｗｉ・Ｆ：ＷｉとＦの内積、｜Ｗｉ｜：Ｗｉのノル
ム）また、グループ１３０３と１３０４を識別する線形識別
関数の値ｄは、以下のようになり、ｄ＞０ならばＦはグ
ループ１３０４に、ｄ＜０ならグループ１３０３と識別
される。(Wi · F) = a · | Wi | (Wi · F: inner product of Wi and F, | Wi |: norm of Wi) Further, the value d of the linear discriminant function that identifies the groups 1303 and 1304 is If d> 0, F is identified as group 1304, and if d <0, F is identified as group 1303.

【００３４】ｄ＝（Ｗｉ・Ｆ） − ａ・｜Ｗｉ｜Ｗｉおよびａ・｜Ｗｉ｜は、鳥脇「認識工学」（ISBN4
- 339 - 01059 - 6 C3355 P2781E、コロナ社）pp. 113
- 119の様な方式でもとめられる。しかし、このｄの値
をそのままｐｉとして用いるのは、各Ｅｉの特徴空間上
での分布が異なるため、適切でない。このため、以下の
式のように正規化した線形識別関数の値をｐｉとして用
いる。D = (Wi · F) −a · | Wi | Wi and a · | Wi | are Toriwaki “recognition engineering” (ISBN4
-339-01059-6 C3355 P2781E, Corona) pp. 113
-It can be stopped by a method like 119. However, it is not appropriate to use the value of d as pi as it is because the distribution of each Ei in the feature space is different. Therefore, the value of the linear discriminant function normalized as in the following equation is used as pi.

【００３５】ｐｉ＝ｄ／（ｓ・｜Ｗｉ｜）＝（Ｗｉ・Ｆ）／（ｓ・｜Ｗｉ｜） − ａ／ｓ（ｓ：１３０３と１３０４をあわせた集合に対するｄの
分散）したがって、パラメータ辞書１１０４に格納するＶｉ、
ｃｉは、以下の様に求められる。Pi = d / (s · | Wi |) = (Wi · F) / (s · | Wi |) −a / s (s: variance of d with respect to the set of 1303 and 1304) Vi stored in the dictionary 1104,
ci is determined as follows.

【００３６】Ｖｉ＝Ｗｉ／（ｓ・｜Ｗｉ｜）ｃｉ＝ａ／ｓ次に、パターン確信度算出処理３０５の概略を説明す
る。パターン確信度とは、切り出しグラフ上の各弧、す
なわち各パターン候補が、どれだけ確からしいかを示す
尺度であり、以下の式で求める。Vi = Wi / (s · | Wi |) ci = a / s Next, the outline of the pattern certainty factor calculation processing 305 will be described. The pattern certainty factor is a measure indicating how probable each arc on the cut-out graph, that is, each pattern candidate, is obtained by the following equation.

【００３７】（パターン確信度）＝｛ｃ１・（１位候
補文字類似度）−ｃ２・ｐ｝式中のｐは概形ペナルティである。また、ｃ１、ｃ２は
定数であり、システム毎に適宜調整する。(Pattern certainty factor) = {c1. (First-order candidate character similarity) -c2.p} In the equation, p is a rough penalty. Further, c1 and c2 are constants, and are appropriately adjusted for each system.

【００３８】次に、住所辞書照合処理３０６の構成を、
図１２のデータ・フロー・ダイアグラムを用いて説明す
る。本処理の入力であるパターン候補、パターン確信度
候補文字、類似度は、パターンテーブルおよび境界テー
ブルにより渡される。まず、パターン確信度が一定値以
下のパターン候補を、パターン候補選別処理１４０１で
選別する。図２０（ａ）の例では、パターン０−２、０
−３等が、文字識別結果の類似度が小さいために、パタ
ーン確信度も小さくなり削除され、図２０（ｂ）のよう
になる。また、パターン４−６が、概形ペナルティが大
きいために、パターン確信度も小さくなり、削除され
る。次に、辞書照合処理１４０３で、各パターン候補に
対応する文字識別結果の候補文字を、あらかじめ住所文
字列辞書１４０４に格納してある住所文字列と照合し、
照合に成功した住所文字列の候補（候補文字列）を出力
する。候補住所文字列ソート１４０５は、候補文字列を
照合確信度の大きい順に並べ変え、住所読取結果として
出力する。照合確信度とは、文字識別結果の候補文字と
候補文字列を照合した際の一致の度合いであり、照合確
信度の値が大きい程、当該候補文字列は確からしい候補
と推定できる。Next, the configuration of the address dictionary matching process 306 will be described.
This will be described with reference to the data flow diagram of FIG. The pattern candidate, the pattern certainty degree candidate character, and the similarity, which are the inputs of this processing, are passed by the pattern table and the boundary table. First, pattern candidates whose pattern certainty factor is equal to or less than a certain value are selected in a pattern candidate selection process 1401. In the example of FIG. 20A, patterns 0-2, 0
-3, etc., because the similarity of the character identification result is small, the pattern certainty is also reduced and deleted, as shown in FIG. In addition, since the pattern 4-6 has a large outline penalty, the pattern certainty becomes small and is deleted. Next, in a dictionary matching process 1403, candidate characters of a character identification result corresponding to each pattern candidate are compared with an address character string stored in an address character string dictionary 1404 in advance.
Outputs the candidate (candidate character string) of the address character string that has been successfully verified. The candidate address character string sort 1405 sorts the candidate character strings in descending order of the degree of verification certainty and outputs the result as an address read result. The collation certainty is the degree of matching when the candidate character of the character identification result is collated with the candidate character string, and the larger the value of the collation certainty, the more likely the candidate character string is a probable candidate.

【００３９】図１３は、辞書照合処理１４０３の処理の
概略を示す。本処理では、文字識別結果に基づいて生成
したオートマトンが受理する住所文字列を、住所文字列
辞書１４０４中より選び出す。オートマトンが受理する
住所文字列を求める方式は、丸川等の方式（情報処理学
会論文誌第３５巻第６号「手書き漢字住所認識のための
エラー修正アルゴリズム」）に準じる。ここで図１３に
おける中央の枠は、補パターン選別後、文字識別結果の
候補文字より生成したオートマトン１５０１を模式的て
おり、パターン間の境界が状態、文字識別結果の候補文
字が遷移である。各状態の番号は、切り出しグラフの節
の番号と同じにしてある。なお、オートマトンは、パタ
ーンテーブルと同じ形式のテーブルで実現する。オート
マトン１５０１中の太線は、住所文字列辞書１４０４中
の文字列１５０３（「中住市水反町１２」）が、どのよ
うな遷移を経てオートマトン１５０１に受理されたかを
示す。このように、住所文字列辞書１４０４中の文字列
が受理された場合には、当該文字列を候補文字列の一つ
として出力する。また、照合確信度ｍｃは、照合の際の
各遷移に対応する確信度ｔｃ（遷移確信度）の総和とす
る。FIG. 13 shows an outline of the dictionary collation processing 1403. In this processing, an address character string received by the automaton generated based on the character identification result is selected from the address character string dictionary 1404. The method of obtaining the address character string accepted by the automaton conforms to the method of Marukawa et al. (Information Processing Society of Japan, Vol. 35, No. 6, "Error Correction Algorithm for Handwritten Kanji Address Recognition"). Here, the center frame in FIG. 13 schematically shows the automaton 1501 generated from the candidate characters of the character identification result after the complementary pattern selection. The boundary between the patterns is a state, and the candidate characters of the character identification result are transitions. The number of each state is the same as the number of the section of the cut-out graph. The automaton is realized by a table having the same format as the pattern table. The bold line in the automaton 1501 indicates what transition has been made by the automaton 1501 through the character string 1503 (“Mizutani-cho 12, Nakazumi City”) in the address character string dictionary 1404. As described above, when a character string in the address character string dictionary 1404 is received, the character string is output as one of the candidate character strings. The verification certainty mc is a sum of certainty tc (transition certainty) corresponding to each transition at the time of verification.

【００４０】ｍｃ＝ Σｔｃ遷移確信度は、以下の式で求める。Mc = Σtc The transition confidence is obtained by the following equation.

【００４１】ｔｃ＝｛ｃ１・ｓｍ−ｃ２・ｐ｝・ｊｍただしｓm：各遷移に対応する候補文字の類似度ｊｍ：遷移の前後の状態番号の差式中の定数ｃ１、ｃ２は、パターン確信度を求めるのに
用いたものと同じものを用いる。図１３の例では、文字
列「中住市水反町１１」も受理され、住所認識結果１５
０２に出力される。しかし、後者の場合は、前者より小
さい類似度の候補文字を使って受理されているため、照
合確信度は小さくなる。Tc = {c1.sm-c2.p} .jm where sm: similarity of candidate characters corresponding to each transition jm: difference between state numbers before and after the transition The constants c1 and c2 in the formula are pattern certainty. Use the same one used to determine the degree. In the example of FIG. 13, the character string “11, Sumimachi, Nakazumi-shi” is also accepted, and the address recognition result 15
02 is output. However, in the case of the latter, since the candidate character is received using a candidate character having a similarity smaller than the former, the verification certainty is reduced.

【００４２】図１４は、本実施の形態におけるハードウ
エア構成を示す。上に述べてきた本発明の実施形態の例
は、同図のような郵便物宛名読み取り区分機の、住所読
み取りを司る認識装置１６０４に適用される。図中の太
線は、郵便物の流れを示す。画像はスキャナ１６０１に
より入力される。ここで、住所読取り時間を確保するた
めに郵便物搬送路上にディレイライン１６０２が設けら
れている。この住所読み取り結果に基づき郵便物はソー
タ１６０３により区分される。スキャナ１６０１と認識
装置１６０４は入出力用ケーブル１６１２で接続する。
ソータ１６０３と認識装置１６０４は、入出力用ケーブ
ル１６１３で接続する。認識装置１６０４には、認識装
置内部の各部分を接続するためのバス１６１１、スキャ
ナ１６０１との通信を司る入出力用インタフェース１６
０５、認識装置全体の制御および住所認識処理を司る演
算処理装置１６０６、ソータ１６０３との通信を司る入
出力用インタフェース１６０７、起動等の操作を行うた
めのキーボード１６０８、実行状況を必要に応じ表示す
るためのＣＲＴ１６０９、住所認識に必要なテーブル、
プログラム、辞書などを格納するためのメモリ１６１０
を備える。FIG. 14 shows a hardware configuration in this embodiment. The example of the embodiment of the present invention described above is applied to a recognition device 1604 for reading an address of a mail address reading / sorting machine as shown in FIG. The thick line in the figure indicates the flow of mail. The image is input by the scanner 1601. Here, a delay line 1602 is provided on the mail conveyance path to secure the address reading time. The mail is sorted by the sorter 1603 based on the address reading result. The scanner 1601 and the recognition device 1604 are connected by an input / output cable 1612.
The sorter 1603 and the recognition device 1604 are connected by an input / output cable 1613. A recognition device 1604 includes a bus 1611 for connecting each part inside the recognition device, and an input / output interface 16 for controlling communication with the scanner 1601.
05, an arithmetic processing unit 1606 for controlling the overall recognition device and address recognition processing, an input / output interface 1607 for communicating with the sorter 1603, a keyboard 1608 for performing operations such as activation, and an execution status are displayed as necessary. CRT 1609, table required for address recognition,
Memory 1610 for storing programs, dictionaries, etc.
Is provided.

【００４３】図１５は、図９における切り出し誤り仮説
評価１１０１に用いるパラメータ辞書１１０４を学習す
るためのサンプルを収集するためのサンプル収集ツール
の表示画面の例である。図１５において、１７０１は画
面を表示するＣＲＴであり、１７０２は文字行画像表示
用のウインドウである。ウインドウ内には文字行画像全
体が表示されるが、この中で現在注目しているパターン
は、色を変えて表示する（図中の黒太線で描かれた部
分）。オペレータは、ウインドウ１７０２中の画像を観
察し、これが正しく切り出されたパターンか、誤ってい
るか、誤っている場合には図８のどの誤り方かを判断
し、それぞれに対応するボタンをマウスカーソル１７０
４でクリックする。ボタンは、パネル１７０３上に配置
する。ボタンのクリックのイベントを受けると、サンプ
ル収集ツールは、注目していたパターンの特徴量を、誤
りの種類に対応したファイルに格納し、新たなパターン
をウインドウ１７０２に表示する。FIG. 15 is an example of a display screen of a sample collection tool for collecting a sample for learning the parameter dictionary 1104 used for the extraction error hypothesis evaluation 1101 in FIG. In FIG. 15, reference numeral 1701 denotes a CRT for displaying a screen, and 1702, a window for displaying a character line image. The entire character line image is displayed in the window, and the pattern currently focused on is displayed in a different color (portion drawn by a thick black line in the figure). The operator observes the image in the window 1702, determines whether the image is a correctly cut out pattern, is incorrect, or determines which error in FIG. 8 is wrong.
Click with 4. The buttons are arranged on the panel 1703. Upon receiving the button click event, the sample collection tool stores the feature amount of the pattern of interest in a file corresponding to the type of error, and displays a new pattern in the window 1702.

【００４４】図１６は、図９におけるパラメータ辞書１
１０４を学習するためのシステムのデータ・フロー・ダ
イアグラムである。あらかじめ収集された文字行画像Ｄ
Ｂ（データベース）１８０１を用いて、上記のサンプル
収集ツール１８０２が切り出し正解のパターンＤＢ（デ
ータベース）１８０３、および図８における切り出し誤
りの仮説における各種パターンデータベースＥ１からＥ
７に対応した切り出し誤りのパターンＤＢ（１８０４、
１８０５等）を生成する。学習ツール１８０６は、切り
出し正解のパターンＤＢ１８０３およびＥ１に対応した
切り出し誤りパターンのデータベース１８０４を入力と
し、図１１で説明したような方式でＶ１、ｃ１を求め、
パラメータ辞書１８０８に出力する。以下、Ｅ２からＥ
７に対応した切り出し誤りのパターンデータベース（１
８０５等）に対しても同様に、学習ツール（１８０７
等）を用いてＶｉ、ｃｉを求め、パラメータ辞書１８０
８に出力する。FIG. 16 shows the parameter dictionary 1 in FIG.
4 is a data flow diagram of a system for learning 104. Character line image D collected in advance
Using the B (database) 1801, the sample collection tool 1802 extracts the correct pattern DB (database) 1803 and the various pattern databases E1 to E in the hypothesis of the cut error in FIG.
7 corresponding to the extraction error pattern DB (1804,
1805). The learning tool 1806 receives as input the pattern DB 1803 of the cut-out correct answer and the database 1804 of the cut-out error pattern corresponding to E1, and obtains V1 and c1 by the method described with reference to FIG.
Output to the parameter dictionary 1808. Hereinafter, E2 to E
Pattern database (1)
805 etc.) in the same manner.
, Etc.) to obtain Vi and ci, and obtain the parameter dictionary 180
8 is output.

【００４５】図１７は、パラメータ辞書を格納するテー
ブルの形式を示す。テーブル中の各レコード pdic[i]
に、各Ｅｉに対応するパラメータ、Ｖｉ、ｃｉを格納す
る。例えば、テーブルの先頭１９０３は pdic[1] に対
応し、Ｖ１、ｃ１を格納する。また、先頭からｉ番目レ
コード目１９０４は pdic[i] に対応し、Ｖｉ、ｃｉを
格納する。各レコードにおいて、ｃｉは領域１９０１に
格納され、Ｖｉは領域１９０２に格納される。FIG. 17 shows the format of a table for storing parameter dictionaries. Each record in the table pdic [i]
, The parameters Vi, ci corresponding to each Ei are stored. For example, the head 1903 of the table corresponds to pdic [1] and stores V1 and c1. The i-th record 1904 from the top corresponds to pdic [i] and stores Vi and ci. In each record, ci is stored in area 1901, and Vi is stored in area 1902.

【００４６】図１８は、概形ペナルティ算出処理の処理
手順を示す。まず、ステップ２００１で変数ｐを０に初
期化する。次に、制御ループ２００２で、変数ｉをイン
クリメントしながら、ステップ２００３、２００４を繰
り返す。ステップ２００３は、切り出し誤り仮説評価を
起動するステップである。ステップ２００４は、切り出
し誤り仮説評価の結果得られたｐｉを、ｐに加算するス
テップである。ステップ２００８は、変数ｐを概形ペナ
ルティとして出力する。ステップ２００５から２００６
は、切り出し誤り仮説評価を行なうサブルーチンの処理
手順を示す。ステップ２００５は変数ｐｉにパラメータ
辞書より読み込んだｃｉの値（pdic[i].c）を代入する
ステップである。ステップ２００６は、特徴抽出によっ
てえられたＦとパラメータ辞書より読み込んだＶｉの内
積を求めるための制御ループで、特徴量の次元数だけ変
数ｊをインクリメントしながら、Ｖｉの各次元の値（pd
ic[i].v[j]）とＦの各次元の値（f[j]）の積をｐｉに加
算する。FIG. 18 shows a processing procedure of the rough penalty calculation processing. First, in step 2001, a variable p is initialized to 0. Next, in a control loop 2002, steps 2003 and 2004 are repeated while incrementing the variable i. Step 2003 is a step of activating the segmentation error hypothesis evaluation. Step 2004 is a step of adding pi obtained as a result of the segmentation error hypothesis evaluation to p. Step 2008 outputs the variable p as the approximate penalty. Steps 2005 to 2006
Shows a processing procedure of a subroutine for performing cutout error hypothesis evaluation. Step 2005 is a step of substituting the value of ci (pdic [i] .c) read from the parameter dictionary into the variable pi. Step 2006 is a control loop for calculating the inner product of F obtained by the feature extraction and Vi read from the parameter dictionary, and while increasing the variable j by the number of dimensions of the feature amount, the value of each dimension of Vi (pd
ic [i] .v [j]) and the value of each dimension of F (f [j]) is added to pi.

【００４７】[0047]

【発明の効果】以上の本発明の方法により、文字識別結
果および文字列照合のみでは正しく切り出しの候補の検
定ができない住所文字行からでも、文字パターンの大き
さと前後のパターンとの位置関係を利用し、正確に文字
を切り出す事が可能となる。According to the method of the present invention described above, the size of the character pattern and the positional relationship between the preceding and succeeding patterns can be used even from an address character line in which it is not possible to correctly test a candidate for extraction by only character identification results and character string collation. Then, characters can be accurately cut out.

[Brief description of the drawings]

【図１】本発明の実施の形態の一例を示す構成図。FIG. 1 is a configuration diagram illustrating an example of an embodiment of the present invention.

【図２】本発明の実施の形態の一例における処理手順を
示す図。FIG. 2 is a diagram showing a processing procedure in an example of an embodiment of the present invention.

【図３】郵便物の全体画像と文字行の関係を示す文字配
置図。FIG. 3 is a character arrangement diagram showing a relationship between a whole mail image and character lines.

【図４】パターンと境界の関係を示す図。FIG. 4 is a diagram showing a relationship between a pattern and a boundary.

【図５】切り出しグラフの弧の格納するパターンテーブ
ルの形式を示す図表。FIG. 5 is a table showing a format of a pattern table for storing arcs of a cut-out graph;

【図６】切り出しグラフの節の格納するノードテーブル
の形式を示す図表。FIG. 6 is a table showing a format of a node table in which nodes of a cut-out graph are stored.

【図７】概形ペナルティ算出処理の手順を示すデータ・
フロー・ダイヤグラム図。FIG. 7 shows data indicating a procedure of a rough penalty calculation process.
Flow diagram.

【図８】切り出し誤りの仮説の種類を示すデータ・フロ
ー・ダイヤグラム図。FIG. 8 is a data flow diagram showing the types of cutoff error hypotheses.

【図９】切り出し誤り仮説評価処理の手順を示す図。FIG. 9 is a diagram illustrating a procedure of a cutout error hypothesis evaluation process.

【図１０】切り出し誤り仮説評価処理に用いる特徴量を
示す図。FIG. 10 is a diagram showing feature amounts used in a cutout error hypothesis evaluation process.

【図１１】切り出し誤り仮説評価処理の原理を示す図。FIG. 11 is a diagram showing the principle of a cutout error hypothesis evaluation process.

【図１２】住所辞書照合処理の手順を示すデータ・フロ
ー・ダイヤグラム図。FIG. 12 is a data flow diagram illustrating a procedure of an address dictionary matching process.

【図１３】辞書照合処理の原理を示す図。FIG. 13 is a view showing the principle of dictionary matching processing.

【図１４】本発明の実施形態の一例を示すハードウエア
構成図。FIG. 14 is a hardware configuration diagram showing an example of an embodiment of the present invention.

【図１５】サンプル収集ツールの表示画面の例を示す
図。FIG. 15 is a view showing an example of a display screen of a sample collection tool.

【図１６】パラメータ辞書を学習するためのシステム構
成図。FIG. 16 is a system configuration diagram for learning a parameter dictionary.

【図１７】パラメータ辞書の格納形式を示す図表。FIG. 17 is a table showing a storage format of a parameter dictionary.

【図１８】概形ペナルティ算出処理の手順を示すフロー
チャート図。FIG. 18 is a flowchart showing the procedure of a rough penalty calculation process.

【図１９】読取り対象の住所文字列の例を示す図。FIG. 19 is a diagram showing an example of an address character string to be read.

【図２０】切り出しグラフおよび切り出し誤り仮説の例
を示す図。FIG. 20 is a diagram showing an example of a cutout graph and a cutout error hypothesis.

[Explanation of symbols]

３０１……文字行抽出処理、３０２……文字パターン候
補抽出処理、３０３……概形ペナルティ算出処理、３０
４……文字識別処理、３０５……パターン確信度算出、
３０６……住所辞書照合処理、５０１……郵便物の
像５０２……宛名住所書き込み領域６０１……境界座標の差６０２……境界座
標の差６０３……境界座標の差６０４……境界座
標６０５……境界間隙７０１……パター
ン輪郭格納領域７０２……パターン左側境界番号欄７０３……パター
ン右側境界番号欄７０４……候補文字格納領域７０５……候補文
字類似度格納領域７０６……パターン確信度格納領域８０１……境界番
号欄８０２……境界座標欄８０３……境界間
隙欄１１０１……切り出し誤り仮説評価処理１２０１……注目文字パターン候補１２０２……周辺パターン１２０３……周辺
パターン１３０１……座標軸１３０２……座標
軸１３０３……正しく切り出されている文字パターン候補
グループ１３０４……切り出し誤り仮説が真である文字パターン
候補グループ１４０４……住所文字列辞書１５０２……住所
認識結果１６０４……認識装置１６１１……バス１６１２……入出力ケーブル１６１３……入出
力ケーブル１７０１……ＣＲＴ表示画面１７０２……文字
画像表示ウインドウ１７０３……パネル301: character line extraction processing, 302: character pattern candidate extraction processing, 303: outline penalty calculation processing, 30
4 ... Character identification processing, 305 ... Calculation of pattern certainty factor,
306: Address dictionary collation processing, 501: Mail image 502: Address / address writing area 601: Boundary coordinate difference 602: Boundary coordinate difference 603: Boundary coordinate difference 604: Boundary coordinate 605 ... Boundary gap 701 ... Pattern contour storage area 702 ... Pattern left boundary number field 703 ... Pattern right boundary number field 704 ... Candidate character storage area 705 ... Candidate character similarity storage area 706 ... Pattern certainty storage area 801 Boundary number field 802 Boundary coordinate field 803 Boundary gap field 1101 Extraction error hypothesis evaluation processing 1201 Attention character pattern candidate 1202 Peripheral pattern 1203 Peripheral pattern 1301 Coordinate axis 1302 Coordinate axes 1303... Correctly cut out character pattern candidate group 1304. A character pattern candidate group for which the error hypothesis is true 1404... An address character string dictionary 1502... An address recognition result 1604... A recognition device 1611... A bus 1612... An I / O cable 1613. Screen 1702: Character image display window 1703: Panel

フロントページの続き (72)発明者藤澤浩道東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内 (72)発明者緒方日佐男東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内 (72)発明者嶋好博東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内 (72)発明者寺本正人愛知県尾張旭市晴丘町池上１番地株式会社日立製作所オフィスシステム事業部内 (56)参考文献特開平４−299485（ＪＰ，Ａ) 特開昭61−175878（ＪＰ，Ａ) 特開平７−6203（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06K 9/62 G06K 9/34 Continuing from the front page (72) Inventor Hiromichi Fujisawa 1-280 Higashi-Koikekubo, Kokubunji-shi, Tokyo Inside the Hitachi, Ltd. Central Research Laboratory (72) Inventor Hisao Ogata 1-280 Higashi-Koikekubo, Kokubunji-shi, Tokyo Hitachi, Ltd. Central Inside the research institute (72) Yoshihiro Shima 1-280 Higashi Koikebo, Kokubunji-shi, Tokyo Central Research Laboratory, Hitachi, Ltd. (72) Inventor Masato Teramoto 1 Ikegami, Haruoka-cho, Owariasahi-shi, Aichi Pref. Hitachi, Ltd. Office Systems Division (56) reference Patent flat 4-299485 (JP, a) JP Akira 61-175878 (JP, a) JP flat 7-6203 (JP, a) (58 ) investigated the field (Int.Cl. ⁷ G06K 9/62 G06K 9/34

Claims

(57) [Claims]

An image signal described on a document is converted into an electric signal.
Image input means for converting and inputting
Means for reading the address, Possible character segmentation
Record the weight vector of the discriminant function for each type of error.
Recording meansCharacter that extracts a character line on which an address is described from an image in an address reading device having
If the line extraction process and how to extract the character pattern cannot be uniquely determined
Pattern extraction to extract multiple character pattern candidates
Processing, character identification processing for character identifying the extracted pattern,Regarding the cut out pattern, the respective patterns and
And relative features of multiple patterns before and after
Any of the above types of error using the function weight vector
Is calculated, and from the identification function
The process of calculating the approximate penalty indicating the validity of the extraction method
And Character pattern according to character recognition result and outline penalty
To narrow down the address candidates, and
And sequentially execute address dictionary matching processing for matching
Characteristic address reading method.

2. The address reading method according to claim 1, wherein at least an interval between patterns before and after each pattern is one of the feature amounts.

3. The address reading method according to claim 1, wherein at least a gap in each pattern is one of the feature quantities.

4. A method for creating a discriminant function weight vector for each outline penalty calculation process according to claim 1, wherein an image of a character line is input and a plurality of characters are extracted when a method of extracting a character pattern cannot be uniquely determined. Pattern cutout processing for cutting out pattern candidates, and whether character pattern candidates are correctly cut out,
Error How to cut out that if you are wrong stores pre-Me
A cut-out error type discriminating process for classifying what kind of error among the types, a pattern storing means for classifying and storing each character pattern candidate according to a result of the cut-out error discriminating process, characterized by sequentially executing the process of learning the identification function weight vectors for each said pattern using a character pattern candidates are identified function
Number weight vector generation method.