JPH11316801A

JPH11316801A - Character read method and address read method

Info

Publication number: JPH11316801A
Application number: JP11057792A
Authority: JP
Inventors: Tatsuhiko Kagehiro; 達彦影広; Masashi Koga; 昌史古賀; Yutaka Sako; 裕酒匂; Hiromichi Fujisawa; 浩道藤澤; Hisao Ogata; 日佐男緒方; Yoshihiro Shima; 好博嶋; Shigeru Watanabe; 成渡辺
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1999-03-05
Filing date: 1999-03-05
Publication date: 1999-11-16

Abstract

PROBLEM TO BE SOLVED: To efficiently read a handwritten address character string on a postal matter or the like. SOLUTION: Characters are temporarily segmented from picture information of an address character line (206) to generate a table 219 of hypothesis character patterns, and character discrimination is performed for hypothesis character patterns (207), and especially, peripheral information (the length, the width, the aspect ratio, spaces before and after patterns, etc.), are extracted with respect to characters in the block part of the address character string (212), and this information is used to segment characters with a high precision (215). Thus, the character segmentation precision is improved to improve the recognition precision of the address character string. The precision of character segmentation of the handwritten address character string and that of a character string written in the vertical direction with Chinese numerical characters are improved.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明が属する技術分野】本発明は文字読取り方法に関
し、特に郵便物等に記載されている住所文字列を読取る
場合いに好ましい文字読取り方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character reading method, and more particularly to a character reading method suitable for reading an address character string described in a mail or the like.

【０００２】[0002]

【従来の技術】郵便物の宛名自動読取りを自動的に行う
方法として、通便物上に記載された宛名の都道府県名、
市区群名、町名等の地名を電子装置等で自動適に読み取
る方法が種々考案されており、例えば特許出願公開平２
−６４８８２号では、住所の先頭から町名までの文字列
と丁目番地部分の文字列において異なった文字切出し処
理を行って宛名を認識している。また、郵便物上の宛名
記載領域を検出する方法として特許出願公開平５−１５
１３８９号があり、これは郵便番号位置を検出してこの
位置から宛名記載領域を検出する方法である。宛名記載
領域から文字行を切出す方法としては特許出願公告昭６
０−４１３９６号があり、これは記載されている文字等
のブロックパターンの高さを検出し、同じ高さのパター
ンを行として検出する方法である。切出された文字行の
縦横書き判別法には特許出願公開昭６３−１８７８５号
があり、これは宛名領域の文字群の水平方向周辺長と垂
直方向周辺長を求め、これらの周辺長を比較することに
より文字群の行方向を判別する。2. Description of the Related Art As a method of automatically reading an address of a postal matter, a prefecture name of an address written on a mail,
Various methods have been devised for automatically and appropriately reading place names such as city names, town names, and the like using an electronic device.
In Japanese Patent No. -64882, an address is recognized by performing different character extraction processing on a character string from the head of an address to a town name and a character string on a street address. As a method of detecting an address description area on a mail, Japanese Patent Application Laid-Open No.
No. 1389 is a method of detecting a postal code position and detecting an address writing area from this position. As a method of extracting a character line from the address description area, a patent application publication 6
No. 0-41396, which is a method of detecting the height of a block pattern of a written character or the like and detecting a pattern having the same height as a line. Japanese Patent Application Laid-Open No. 63-18785 discloses a method for judging the vertical and horizontal writing of the extracted character line. This method determines the horizontal and vertical peripheral lengths of a character group in an address area, and compares these peripheral lengths. To determine the line direction of the character group.

【０００３】文字行からの個々の文字切出す技術に関し
ては、電子通信学会論文誌（Ｄ）Ｊ６８−Ｄ，Ｎｏ．１
２、第２１２３頁ないし第２１３１頁に示されているよ
うに、文字幅、文字ピッチ、文字間隔等を基に文字切出
しを決定している。その他に、電子通信学会（Ｄ）Ｊ６
８−Ｄ，Ｎｏ．４，第７６５頁ないし第７７２頁に示
されているように、文字識別結果の情報を用いて文字切
出しを決定する方法もある。しかし、これも分離文字、
特に縦書きの漢数字の羅列における文字切出しの曖昧性
を解決するには不十分である。A technique for extracting individual characters from a character line is described in IEICE Transactions (D) J68-D, No. 1
2. As shown on pages 2123 to 2131, character extraction is determined based on character width, character pitch, character spacing, and the like. In addition, IEICE (D) J6
8-D, no. 4, page 765 to page 772, there is also a method of determining character cutout using information on character identification results. But this is also a separator,
In particular, it is insufficient to resolve the ambiguity of character segmentation in a series of vertical Chinese characters.

【０００４】[0004]

【発明が解決しようとする課題】上記従来技術、特に文
字行からの個々の文字切出しに関しては、文字パターン
の文字幅、文字ピッチ、文字間隔等の特徴量を用いた
り、文字識別結果得られる候補文字の類似度を用いて文
字切出しを決定する方法では、手書き文字列の場合、文
字幅、文字ピッチ、文字間長等の特徴量は一定でなく、
曖昧なため、これらの情報を基に手書き文字列における
文字切出しを決定するのは難しく、正確な文字切出しは
困難であった。また、パターンの縦横長、縦横比、パタ
ーンの前後の隙間等の特徴量と、文字識別結果得られる
候補文字の類似度を統合して文字切出しを決定した場合
でも、文字種に応じて最適な縦横長、縦横比、文字間隔
等が異なっており、文字切出しの情報として用いるには
不十分である。例えば、図１の宛名記載面１０１のに示
すように、丁目番地記載部分（以下街区記載部分とも略
称）１０２の漢数字「三−二−一」の場合、パターンの
縦横長、縦横比は文字種に依存して大きく変動するた
め、文字切出しに曖昧性が生じ多様な認識結果の解釈が
存在する。そのため、１０３に示すように、街区記載部
分の文字切出しの曖昧性により発生する多様な認識結果
（６通り）が予想される。認識結果１０４は正しい認識
結果である。As for the above-mentioned prior art, particularly for individual character extraction from a character line, a character pattern such as a character width, a character pitch, a character interval, or the like can be used. In the method of determining character segmentation using character similarity, in the case of a handwritten character string, feature amounts such as character width, character pitch, character length, etc. are not constant,
Because of the ambiguity, it is difficult to determine character extraction in a handwritten character string based on such information, and it has been difficult to accurately extract characters. In addition, even when the character cutout is determined by integrating the feature amounts such as the length and width of the pattern, the aspect ratio, the gap before and after the pattern, and the similarity of the candidate character obtained from the character identification result, the optimum length and width depending on the character type is determined. The length, the aspect ratio, the character spacing, and the like are different, and are insufficient for use as information for character cutout. For example, as shown in the address description surface 101 in FIG. 1, in the case of the Chinese numeral "3-2-1" of the address description portion 102 (hereinafter abbreviated as the block description portion) 102, the length and width of the pattern and the aspect ratio are determined by the character type. , The character segmentation becomes ambiguous, and there are various interpretations of recognition results. Therefore, as shown in 103, various recognition results (six patterns) generated due to the ambiguity of character segmentation of the block description part are expected. The recognition result 104 is a correct recognition result.

【０００５】従って、本発明の目的はこの問題を解決
し、文字行から各文字を精度よく識別する文字読み取り
方法を実現することである。本発明の他の目的は町域記
載部分と街区記載部分からなる住所文字を精度良く読み
取る方法を実現することである。Accordingly, it is an object of the present invention to solve this problem and to realize a character reading method for accurately identifying each character from a character line. It is another object of the present invention to realize a method for accurately reading an address character including a town area description part and a block description part.

【０００６】[0006]

【課題を解決するための手段】上記目的を達成するた
め、本発明の文字読み取り方法は、画像入力装置によっ
て電子画像化された識別すべき文字行中の連結成分（例
えば黒画素の連続であるストローク）を組み合わせて、
文字として可能性のある複数の文字切出しを行う（この
切り出したパターンを以下仮説文字パタンと略称）第１
ステップと、上記仮説文字パターンに対し文字識別用辞
書を参照し文字識別し、その文字識別結果（認識候補文
字及びその類似度）を上記仮説文字パターンの補助情報
として得る第２ステップと、上記補助情報に加えて、上
記仮説文字パターン周辺情報と得る第３ステップと、上
記第２ステップで得えた認識候補文字をキーとして文字
の周辺情報をもつ切出用辞書を参照し、上記第３ステッ
プで抽出した上記仮説文字パターンの周辺情報の確信度
を求め、仮説文字パターンの重み付けをする第４ステッ
プと、上記仮説文字パターンの重み付けを用いて文字切
出しを決定する第５ステップと、上記第５ステップによ
り決定された文字切出し結果から生成された文字識別結
果の集合を基に文字識別辞書を用いて単語照合を行い上
記識別すべき文字情報の認識を行い上記文字行を読み取
る第６ステップをもつ。In order to achieve the above object, a character reading method according to the present invention provides a connected component (for example, a continuation of black pixels) in a character line to be identified which is electronically imaged by an image input device. Stroke)
Cut out a plurality of characters that may be characters (this cut-out pattern is hereinafter abbreviated as a hypothetical character pattern).
A second step of performing character identification on the hypothetical character pattern by referring to a character identification dictionary and obtaining a character identification result (recognition candidate character and similarity thereof) as auxiliary information of the hypothetical character pattern; In addition to the information, the third step of obtaining the hypothetical character pattern peripheral information and the extraction dictionary having character peripheral information with the recognition candidate character obtained in the second step as a key, and in the third step, A fourth step of obtaining the certainty factor of the peripheral information of the extracted hypothetical character pattern and weighting the hypothetical character pattern; a fifth step of determining a character cutout using the weight of the hypothetical character pattern; The characters to be identified by performing word matching using a character identification dictionary based on a set of character identification results generated from the character cutout results determined by Performs recognition of broadcast having a sixth step of reading the character line.

【０００７】また、上記文字文字読み取り方法を用いて
町域と街区が書かれた住所の文字列の読み取りを行う場
合、上記住所の文字列を含む画像情報を電気信号に変換
し、電気信号の画像情報から上記住所の文字列の文字行
領域を切出し、上記第１ステップによって切出された文
字行中の連結成分を組み合わせて、複数の仮説文字パタ
ーンを生成する。When reading a character string of an address in which a town area and a block are written by using the above-described character / character reading method, image information including the character string of the address is converted into an electric signal, and the electric signal is read. A character line area of the character string of the address is cut out from the image information, and a plurality of hypothetical character patterns are generated by combining the connected components in the character line cut out in the first step.

【０００８】上記複数の仮説文字パターンのそれぞれに
対し文字識別用辞書をもちいて文字識別し、上記仮説文
字パターンに類似する候補文字の情報を得る。上記候補
文字の情報を用いて町域辞書を参照して上記住所の町域
の読み取りを行うと共に、上記街区の先頭位置を検出す
る。町域辞書は有りうる全ての町名が格納されている。A character identification dictionary is used to identify each of the plurality of hypothetical character patterns, and information on candidate characters similar to the hypothetical character patterns is obtained. The town area of the address is read by referring to the town area dictionary using the information of the candidate characters, and the head position of the block is detected. The town area dictionary stores all possible town names.

【０００９】上記街区の先頭位置を検出すると、街区の
仮説文字パターンのパターン周辺情報を求め、そのパタ
ーン周辺情報を基に文字切出用辞書を参照し上記パター
ン周辺情報の確信度を求める。上記確信度を考慮し上記
街区の文字に対する上記仮説文字パターンの文字切りだ
しの再決定を行い、上記再決定において得られた上記仮
説文字パターンに類似する候補文字の情報を用いて街区
辞書を参照し上記街区の文字の読み取りを決定する。街
区辞書は街区として有りえる全ての街区の文字情報が格
納されている。When the head position of the block is detected, pattern peripheral information of the hypothetical character pattern of the block is obtained, and a certainty factor of the pattern peripheral information is obtained by referring to a character extraction dictionary based on the pattern peripheral information. Considering the certainty factor, re-determining the character cutout of the hypothetical character pattern for the character of the block, and referring to the block dictionary using information on candidate characters similar to the hypothetical character pattern obtained in the re-determination. Then, the character reading of the above-mentioned block is determined. The block dictionary stores character information of all blocks that can be considered as blocks.

【００１０】上述のように本発明の文字読み取り方法
は、識別すべき文字行の切出しを、文字識別により得ら
れる類似度を文字切出しに反映する手法と、文字パター
ンの縦横長、縦横比、パターン間隔長の特徴量を文字切
出しに反映する手法を統合することにより、互いの手法
の短所を補い合い、有効な情報を最適に用いて、文字列
の認識において、文字幅、文字ピッチ、文字間隔等が不
規則であっても、正確な文字切出しを可能にしている。As described above, according to the character reading method of the present invention, the extraction of a character line to be identified is reflected on the character extraction based on the similarity obtained by character identification, and the aspect ratio, aspect ratio and pattern By integrating the method of reflecting the feature amount of the interval length in character segmentation, the disadvantages of each other's method are compensated for and the effective information is optimally used to recognize the character string, and character width, character pitch, character spacing, etc. Even if is irregular, it enables accurate character extraction.

【００１１】また、手書き文字列の認識における文字切
出しで、全ての文字に対して共通の文字幅、文字ピッ
チ、文字間隔長を推定し、その推定値を元に文字切出し
を行うことが困難であるという問題に対しては、文字切
出し仮説検定用の文字切出辞書を用意し、文字識別結果
の文字種をキーにして辞書を参照して文字パターンの特
徴量と比較し、確信度を算出して文字切出しに反映す
る。In character extraction in recognition of a handwritten character string, it is difficult to estimate a common character width, character pitch, and character interval length for all characters, and to extract characters based on the estimated values. In order to solve this problem, prepare a character extraction dictionary for character extraction hypothesis testing, use the character type of the character identification result as a key, refer to the dictionary, compare it with the character pattern feature, and calculate the confidence factor. To reflect on character extraction.

【００１２】この文字切出辞書に、予め各特徴量に対す
るパターンの確信度として尤度分布を算出し辞書に格納
しておく。ただし、この確信度の算出には、多大な労力
と時間、経験的知識が必要であるという問題があるが、
本発明によると、文字切出し仮説を画面に表示し、正解
の文字切出し仮説を選択する作業のみにより、文字種毎
に最適な重み付けの確信度を生成することが可能にな
る。In this character extraction dictionary, a likelihood distribution is calculated in advance as a certainty factor of a pattern with respect to each feature amount and stored in the dictionary. However, there is a problem that calculating this confidence requires a great deal of effort, time, and empirical knowledge.
According to the present invention, it is possible to generate an optimal weighted certainty factor for each character type only by displaying the character extraction hypothesis on the screen and selecting the correct character extraction hypothesis.

【００１３】[0013]

【発明の実施の形態】図２は本発明による文字読み取り
方法の一実施の形態を示す処理フロー図である。本実施
の形態は文字読み取り方法を宛名の住所の自動読み取り
に適用したものである。町域と街区からなる住所の読み
取り処理は、郵便物等の宛名記載面２０１を撮像手段
（スキャナ部）により画像入力として取り込み（２０
２）、画像入力から、住所情報のみを含んだ住所記載領
域の情報を抽出し（２０３）、上記住所記載の情報から
文字行の切り出しを行う（２０４）。文字行の切出しが
行われた画像情報は縦横書き判定で、文字行中の文字列
が縦書きか横書きかを判定し（２０５）、その判定結果
を用いて縦横書きの処理モードの切り替えをする（２２
１）。FIG. 2 is a processing flowchart showing an embodiment of a character reading method according to the present invention. In this embodiment, the character reading method is applied to automatic reading of an address of an address. In the reading process of the address consisting of the town area and the block, the address writing surface 201 of a mail or the like is captured as an image input by the imaging means (scanner unit) (20).
2) The information of the address description area including only the address information is extracted from the image input (203), and a character line is cut out from the address description information (204). The image information from which the character line has been cut out is subjected to vertical / horizontal writing determination to determine whether the character string in the character line is vertical or horizontal writing (205), and the vertical / horizontal writing processing mode is switched using the determination result. (22
1).

【００１４】次ぎに、文字行の中の文字黒画素の塊（連
結成分）の組み合わせにより、文字を構成する可能性が
ある所で文字切出しの処理を行う。以下、この文字切出
し処理を文字切出し仮説生成（２０６）と略称し、切出
し仮説生成で切り出された文字パターンを仮説文字パタ
ーンと略称する。複数の仮説文字パターンの中には読み
取るべき正しい文字パターンの他に不適当なパターンが
含まれている。生成された仮説文字パターンをパターン
テーブル２１９に登録する。仮説文字パターン及びパタ
ーンテーブル２１９については図６及び図７によって詳
細に説明する。Next, character extraction processing is performed at a place where a character may be constituted by a combination of character black pixel blocks (connected components) in a character line. Hereinafter, this character extraction processing is abbreviated as character extraction hypothesis generation (206), and the character pattern extracted by the extraction hypothesis generation is abbreviated as a hypothesis character pattern. A plurality of hypothetical character patterns include an inappropriate pattern in addition to a correct character pattern to be read. The generated hypothetical character pattern is registered in the pattern table 219. The hypothetical character pattern and pattern table 219 will be described in detail with reference to FIGS.

【００１５】パターンテーブル２１９に登録された全て
の仮説文字パターンに対して文字識別辞書２０８を用い
て文字識別を行う（２０７）。上記文字識別は各仮説文
字パターンに類似した複数の認識候補文字、各認識候補
の仮説文字パターンに対する類似度、文字行における仮
説文字パターンの位置、連結成分数等の情報をそれぞれ
の仮説文字パターンの属性情報として求める。上記認識
候補文字及び属性情報をパターンテーブル２１９に各仮
説文字パターンに対して書き込む。上記認識候補文字及
び属性情報が書き込まれたパターンテーブル２１９を基
に仮説文字パターンの中から文字切出しとして妥当なも
のを選択し、文字切出しを決定する（２０９）。文字切
出し決定により選択された仮説文字パターンに対応した
認識候補文字列を基に認識候補文字列の集合を作る（２
１０）。この認識候補文字列の集合には、仮説文字パタ
ーン毎に下位候補文字までの文字種を登録する。この登
録された文字種をラティスと呼ぶ。Character identification is performed on all the hypothetical character patterns registered in the pattern table 219 using the character identification dictionary 208 (207). The character identification is performed by recognizing information such as a plurality of recognition candidate characters similar to each hypothesis character pattern, the degree of similarity of each recognition candidate to the hypothesis character pattern, the position of the hypothesis character pattern in the character line, and the number of connected components. Request as attribute information. The recognition candidate character and the attribute information are written in the pattern table 219 for each hypothetical character pattern. Based on the pattern table 219 in which the recognition candidate characters and the attribute information are written, a proper character cutout is selected from the hypothetical character patterns, and character cutout is determined (209). A set of recognition candidate character strings is created based on the recognition candidate character strings corresponding to the hypothetical character pattern selected by the character extraction determination (2).
10). In this set of recognition candidate character strings, character types up to lower-order candidate characters are registered for each hypothetical character pattern. This registered character type is called a lattice.

【００１６】このラティスを用いて町域辞書２２０を用
いて町域照合を行い（２１１）、住所の町域部の文字と
して妥当な文字列を生成する。町域辞書２２０は町域と
して有りうる全ての町域名称が記載されているものであ
る。また、町域照合２１１により町域を記載した文字の
読み取りが完了すると共に、町域を記載した文字の最後
尾の文字が確定し、この次が街区先頭位置であるという
街区先頭位置情報を得る。Using this lattice, town area matching is performed using the town area dictionary 220 (211), and a character string appropriate as a character in the town area of the address is generated. The town area dictionary 220 describes all possible town area names. In addition, the reading of the character describing the town area is completed by the town area collation 211, and the last character of the character describing the town area is determined, and the block start position information that the next is the block start position is obtained. .

【００１７】上記街区先頭位置情報を得て、パターンテ
ーブル２１９中の街区部分のみについて仮説文字パター
ンの縦長、横長、縦横比、パターン間隔長、連結成分数
（パターン周辺情報と呼ぶ）の情報を抽出する（２１
２）。抽出されたパターン周辺情報に対する確信度を文
字切出用辞書２１４を用いて算出する（２１３）。算出
された確信度は対応した仮説文字パターンの属性として
パターンテーブル２１９中に書き込まれる。上記確信度
を利用してパターンテーブル２１９中の仮説文字パター
ンを選び直す、すなわち、文字切出し決定２０９の際に
選ばれた仮説文字パターンと街区部分のみ異なった文字
切出し再決定が行われる（２１５）。文字切出し再決定
２１５が行われた後に、その情報を基にラティス再生成
を行う（２１６）。再生成されたラティスを入力として
街区辞書２２２を用いて街区照合を行い（２１７）、街
区文字の読み取りを行う。街区辞書は街区として有りえ
る全ての街区文字が用意されている。これにより町域照
合（２１１）で読み取った町名の文字と共に住所の読み
取りが完了する（宛名認識結果２１８）。The above-mentioned block head position information is obtained, and information on the vertical length, horizontal length, aspect ratio, pattern interval length, and number of connected components (referred to as pattern peripheral information) of only the block portion in the pattern table 219 is extracted. Yes (21
2). The degree of certainty for the extracted pattern peripheral information is calculated using the character extraction dictionary 214 (213). The calculated certainty factor is written in the pattern table 219 as the attribute of the corresponding hypothetical character pattern. The hypothesis character pattern in the pattern table 219 is reselected using the above-mentioned certainty factor, that is, the character extraction re-determination is performed which differs from the hypothesis character pattern selected at the time of the character extraction determination 209 only in the block (215). . After the character cutout redetermination 215 is performed, the lattice is regenerated based on the information (216). Using the regenerated lattice as an input, block matching is performed using the block dictionary 222 (217), and block characters are read. The block dictionary prepares all block characters that can be used as blocks. Thus, the reading of the address is completed together with the characters of the town name read in the town area comparison (211) (address recognition result 218).

【００１８】以下、図２の処理の各部の詳細及び処理を
行う装置について説明する。なお、画像入力２０２ない
し縦横書き切り替え２２１部は従来の技術と同様であ
り、また、文字切出仮説生成２０６ないし町域照合２１
１部は実質的に前記電子通信学会（Ｄ）Ｊ６８−Ｄ，
Ｎｏ．４，第７６５頁ないし第７７２頁の技術が適用で
きる。Hereinafter, details of each unit of the processing of FIG. 2 and an apparatus for performing the processing will be described. The image input 202 and the vertical / horizontal switching 221 are the same as those in the prior art.
One part is substantially the aforementioned IEICE (D) J68-D,
No. 4, pages 765 to 772 can be applied.

【００１９】図３は上記住所読み取り方法を実施する装
置の構成を示す。同図において、３０１は画像入力２０
２を行うスキャナ部、３０３は住所読み取り結果、すな
わち図２の宛名認識結果２１８の区分情報に基づき郵便
物を区分するソーター部、３１２は住所読取る、即ち、
図２における処理２０２から２２２を行う認識装置であ
る。スキャナ部３０１と認識装置３１２は入出力用ケー
ブル３０４で接続する。ソーター部３０３と認識装置３
１２は入出力用ケーブル３０５で接続する。FIG. 3 shows the configuration of an apparatus for implementing the above address reading method. In the figure, reference numeral 301 denotes an image input 20
2, a sorter unit for sorting mails based on the address reading result, that is, the sorting information of the address recognition result 218 in FIG.
This is a recognition device that performs processes 202 to 222 in FIG. The scanner unit 301 and the recognition device 312 are connected by an input / output cable 304. Sorter unit 303 and recognition device 3
12 is connected by an input / output cable 305.

【００２０】認識装置３１２には認識装置内部の各部分
を接続するためのバス３１３、スキャナ部３０１との通
信を行うインターフェイス部３０６、認識装置３１２全
体の制御及び住所認識処理を行う演算処理装置部３０
７、ソーター部３０３との通信を行うインターフェイス
部３０８、起動時の操作等を行うためのキーボード３０
９、実行状況を必要に応じて表示するためのＣＲＴ３１
０、住所の読み取りに必要なテーブル、プログラム、辞
書等を格納するためのメモリ３１１を備える。The recognition device 312 includes a bus 313 for connecting various parts inside the recognition device, an interface unit 306 for communicating with the scanner unit 301, and an arithmetic processing unit for controlling the entire recognition device 312 and performing address recognition processing. 30
7, an interface unit 308 for communicating with the sorter unit 303, and a keyboard 30 for performing operations at the time of startup and the like
9. CRT 31 for displaying the execution status as needed
0, a memory 311 for storing tables, programs, dictionaries, etc. necessary for reading addresses.

【００２１】図４は図２の画像入力２０２から文字行抽
出２０４の処理を説明する図である。４０７は、画像入
力２０２から宛名領域切出２０３により切り出された宛
名領域の画像である。宛名領域４０７中の黒画素をｙ軸
４０２と平行の軸４０８上に投影したヒストグラムが４
０３、４０４、４０５である。これらのヒストグラムの
値を検出することにより、点線４０６のような文字行の
上下のｙ座標を抽出し、文字行を抽出する（２０４）。FIG. 4 is a view for explaining the processing of character line extraction 204 from the image input 202 of FIG. Reference numeral 407 denotes an image of the destination area cut out from the image input 202 by the destination area cutout 203. The histogram in which the black pixels in the address area 407 are projected on an axis 408 parallel to the y-axis 402 is 4
03, 404, and 405. By detecting the values of these histograms, the upper and lower y-coordinates of the character line such as the dotted line 406 are extracted, and the character line is extracted (204).

【００２２】図５は図２の縦横書き判定２０５の処理を
説明する図である。５０１は抽出された横書きの文字列
の画像を示している。５０２、５０３は文字行の行頭と
行末の文字パターンを示し、文字パターン５０２及び５
０３そのまま切出した文字パターンが５０５及び５０９
であり、パターン回転処理５０４及び５１１で９０度回
転させた文字パターンが５０６及び５１０である。これ
らの文字パターンを、文字認識をする（５０７）。その
結果の類似度を比較（５０８）し、その結果を基に縦横
書き判別２０５を行う。この判別結果を基に縦書き用特
徴抽出と横書き用特徴抽出の入力切り替えを行う（図２
の２２１）。FIG. 5 is a diagram for explaining the processing of the vertical / horizontal writing determination 205 in FIG. Reference numeral 501 denotes an image of the extracted horizontally written character string. Numerals 502 and 503 denote character patterns at the beginning and end of the character line.
03 505 and 509
The character patterns 506 and 510 are rotated by 90 degrees in the pattern rotation processes 504 and 511. Character recognition is performed on these character patterns (507). The similarity of the result is compared (508), and vertical / horizontal writing determination 205 is performed based on the result. Based on this determination result, input switching between vertical writing feature extraction and horizontal writing feature extraction is performed (FIG. 2).
221).

【００２３】画像中のレイアウト情報を用いて判別する
方法の場合、規定のレイアウト以外の住所文字列を含ん
だ画像が入力された際に誤判別する可能性が高くなる
が、本実施例によると文字認識そのものを用いて判別を
行うため安定して縦横書き判別を行うことができる。ま
た、住所文字列の最初と最後の一文字づつにおいてその
ままの向きの文字と９０度回転した文字と類似度の差が
大きくない場合は、住所文字列の最初からと最後からの
文字の切出しを一文字づつずらして同様の判別方法を行
う。これにより、「田」や「山」等の９０度回転しても
類似度に差が生じにくい文字を避け、判別に適した文字
による縦横書き判別が行われ、精度の向上を図ることが
できる。In the case of using the layout information in the image, the possibility of erroneous determination is high when an image including an address character string other than the prescribed layout is input. However, according to the present embodiment, Since the discrimination is performed using the character recognition itself, the vertical and horizontal writing can be discriminated stably. If the difference between the first and last characters of the address string and the similarity between the character in the same direction and the character rotated 90 degrees is not large, cut out one character from the beginning and end of the address string. A similar discrimination method is performed by shifting them one by one. As a result, characters that are unlikely to cause a difference in similarity even when rotated by 90 degrees such as “field” or “mountain” are avoided, and vertical / horizontal writing determination using characters suitable for determination is performed, thereby improving accuracy. .

【００２４】図６は仮説文字パターンを入力画像と対応
付けて説明する図である。識別すべき手書き住所文字列
６０１を入力画像として図２の文字切出し仮説生成２０
６で、１文字として切出すことが可能な境界線（点線６
０３−ｉ（ｉ＝１，２…ｎ）で表す）で入力画像のパタ
ーンを切り出す。説明の都合上点線６０３−ｉに対応し
た丸で囲んだ（０）、、、、…６０２−ｉ（ｉ
＝１，２…ｎ）をノードと呼ぶ。ノードとノードを結ん
だ線６０４をアークと呼び、アーク６０４に対応するパ
ターン６０５、６０６…６１１等が仮説文字パターンで
ある。例えば、読み取られるべき文字パターン「兵」は
文字切出し仮説の可能性としてパターン「兵」６０５の
外パターン「丘」６０６とパターン「八」６０７の仮説
文字パターンに分割可能である。また、パターン「可」
も同様にパターン「可」６１０の他にパターン「一」６
０９とパターン６１１に分割可能である。このように複
数の仮説文字パターンはノード間をアークで結ぶ間に有
る。FIG. 6 is a diagram for explaining a hypothetical character pattern in association with an input image. Using the handwritten address character string 601 to be identified as an input image, the character extraction hypothesis generation 20 of FIG.
6, a boundary line that can be cut out as one character (dotted line 6)
03-i (i = 1, 2,..., N)) is used to cut out the pattern of the input image. (0),..., 602-i (i
= 1, 2,..., N) are called nodes. A line 604 connecting the nodes is called an arc, and patterns 605, 606... 611, etc. corresponding to the arc 604 are hypothetical character patterns. For example, the character pattern “Hyo” to be read can be divided into a hypothesis character pattern of a pattern “Hoka” 606 and a pattern “Hachi” 607 outside the pattern “Hyo” 605 as the possibility of the character cutout hypothesis. Also, the pattern "OK"
Similarly, in addition to the pattern “OK” 610, the pattern “One” 6
09 and a pattern 611. As described above, a plurality of hypothetical character patterns exist between the nodes connected by the arc.

【００２５】図７はパターンテーブル２１９のデータ構
造を示す。７０１は仮説文字パターンとして切出された
パターンの画像情報が格納されたメモリの場所を示すポ
インタである。７０２にはこの仮説文字パターンに対応
したアーク上の確信度（重みを表し、ノードの始点と終
点の距離によって異なり、この距離が長い方が重みは大
きくなる。）が格納されている。７０３には仮説文字パ
ターン中の連結成分数（例えば、文字パターン「三」は
連結成分数が３、「八」は連結成分数が２）が格納さ
れ、７０４には仮説文字パターンのｘ，ｙ座標（例え
ば、図６の仮説文字パターン「丘」を囲むブロック左
上、右下点の座標）が格納されている。７０５はアーク
の前ノードの番号が格納され、７０６にはアークの後ノ
ードの番号が格納され、このノード番号を参照すること
により仮説文字パターンのネットワークを表現できる。
７０７は仮説文字パターンを文字識別辞書２０８を参照
して文字識別２０７した際に得られる複数の候補文字列
が格納され、７０８は各候補文字が仮説文字パターン類
似する程度を表す類似度が格納してある。FIG. 7 shows the data structure of the pattern table 219. Reference numeral 701 denotes a pointer indicating a location in a memory where image information of a pattern cut out as a hypothetical character pattern is stored. 702 stores the degree of certainty on the arc corresponding to this hypothetical character pattern (representing the weight, which depends on the distance between the start point and the end point of the node, and the longer the distance, the greater the weight). Reference numeral 703 stores the number of connected components in the hypothetical character pattern (for example, the character pattern “3” has three connected components, and “eight” has two connected components). The coordinates (for example, the coordinates of the upper left and lower right points of the block surrounding the hypothetical character pattern “hill” in FIG. 6) are stored. Reference numeral 705 stores the number of the node before the arc, and reference numeral 706 stores the number of the node after the arc. By referring to this node number, a network of a hypothetical character pattern can be expressed.
Reference numeral 707 stores a plurality of candidate character strings obtained when the hypothetical character pattern is subjected to character identification 207 with reference to the character identification dictionary 208. Reference numeral 708 stores a degree of similarity indicating the degree to which each candidate character is similar to the hypothetical character pattern. It is.

【００２６】類似度の計算方法は特に特定されず、従来
知られている方法を使用できる。太線７０９は１つの仮
設文字パターンのテーブルの範囲を示し、また、この範
囲が１つのアークに対応する。例えば、仮設文字パター
ンが「兵」のとき１つのアーク６０４−１に対応する。
従って、前ノード番号７０５は（０）で、後ノード番号
７０６はである。The method of calculating the similarity is not particularly specified, and a conventionally known method can be used. A bold line 709 indicates the range of one temporary character pattern table, and this range corresponds to one arc. For example, when the temporary character pattern is “soldier”, it corresponds to one arc 604-1.
Therefore, the previous node number 705 is (0) and the rear node number 706 is

【００２７】図８は図６の文字切出し決定処理２０９を
説明する図である。図はパターンテーブル２１９のデー
タを基に文字切出し決定処理２０９を行い、一意に決定
された仮説文字パターン列を示す。文字切出し決定処理
２０９は、パターンテーブルの全ての仮説文字パターン
に対し文字識別を行ったときに得られた候補文字の類似
度を各アークの確信度７０２として登録しておき、ノー
ド番号０からノード番号８へたどるアークの組み合わせ
による複数のルートについて、各ルートのアークの確信
度の合計を求め、その確信度の合計の最も高いルートに
ある仮説文字パターン列を文字切出しとして決定する。
図８は確信度の合計の最も高いルートがノード（0）−
−−−−−を経ることを示している。FIG. 8 is a view for explaining the character cutout determination processing 209 of FIG. The figure shows a hypothetical character pattern string uniquely determined by performing the character extraction determination processing 209 based on the data of the pattern table 219. In the character cutout determination process 209, the similarity of candidate characters obtained when character identification is performed on all the hypothetical character patterns in the pattern table is registered as the certainty factor 702 of each arc. For a plurality of routes based on the combination of the arcs following the number 8, the sum of the certainty factors of the arcs of each route is obtained, and the hypothetical character pattern sequence in the route having the highest total certainty factor is determined as character extraction.
FIG. 8 shows that the root with the highest total confidence is node (0) −
−−−−−.

【００２８】図８と図６とを比較すると、図８では、図
６のノード番号（0）から、から、からのア
ーク６０４が無く、また、ノード番号から、から
、からのアーク６０４も無くなっている。このよ
うに、パターンテーブル２１９中の全ての仮説文字パタ
ーンを文字識別によって評価することにより文字らしい
パターンの確信度が高くなり、文字切出しが決定（図２
の２０９）される。When FIG. 8 is compared with FIG. 6, in FIG. 8, there is no arc 604 from the node number (0) of FIG. 6, and no arc 604 from the node number. ing. As described above, by evaluating all the hypothetical character patterns in the pattern table 219 by character identification, the certainty of the character-like pattern is increased, and character cutout is determined (FIG. 2).
209) is performed.

【００２９】図９は一意に決定された文字切出しに対す
る文字識別結果を入力とした町域照合処理２１１により
得られる町域部分の認識結果と街区部分先頭位置を説明
するための図である。６０１は読み取られる住所文字列
の画像を、点線９０２は文字切出しが決定した境界線
を、７０７は文字切出しに対応した仮説文字パターンの
文字識別結果の候補文字列を示す。候補文字列７０７を
基に町域照合（図２の２１１）を行った結果選択された
文字が丸で囲まれた文字９０３であり、選択された各文
字「兵」、「庫」…「荷」を組み合わせて町域認識結果
の文字列９１０を生成する。点線９０５は町域照合によ
り得られた文字列が入力画像６０１中で対応する範囲を
示している。町域照合処理２１１により、街区部分の先
頭位置９１１が確定される。FIG. 9 is a diagram for explaining the recognition result of the town area portion obtained by the town area collation processing 211 with the input of the character identification result for the uniquely determined character segmentation and the start position of the town block portion. Reference numeral 601 denotes an image of an address character string to be read, dotted line 902 denotes a boundary determined by character extraction, and 707 denotes a candidate character string of a character identification result of a hypothetical character pattern corresponding to character extraction. The character selected as a result of performing the town area matching (211 in FIG. 2) based on the candidate character string 707 is a character 903 surrounded by a circle, and each selected character “Holder”, “Storage”. To generate a character string 910 of the town area recognition result. A dotted line 905 indicates a range in the input image 601 where the character string obtained by the town area matching corresponds. The start position 911 of the block is determined by the town area matching process 211.

【００３０】同図において、９０６、９０７、９０８、
９０９は街区部分の仮説文字パターンであり、９１２…
９１６はそれぞれ街区部分の仮説文字パターンに対する
文字識別結果の候補文字列を示す。これらは図２のラテ
ィス生成２１１までの処理で得られている。街区先頭位
置９１１以降は漢数字又は算用数字で記述されているこ
とが多く、街区部分の文字切出し処理は、漢字で記述さ
れている町域部分の文字切出しと異なった処理を行う。
つまり、町域部分の文字切出しと同様の手法を街区部分
に適用すると、パターン「三」を仮説文字パターン９０
６、９０７のように分離したり、仮説文字パターン
「二」を９０８、９０９のように分離したりする場合が
多い。また、使用される文字も限定される。In the figure, 906, 907, 908,
Reference numeral 909 denotes a hypothetical character pattern in the block, and 912...
Reference numeral 916 denotes a candidate character string of the character identification result for the hypothetical character pattern in the block. These are obtained by the processing up to lattice generation 211 in FIG. The block starting position 911 and thereafter are often described with kanji numerals or arithmetic numerals, and the character segmentation process of the block portion is different from the character segmentation process of the town area portion described with kanji.
In other words, when the same method as the character extraction of the town area is applied to the block, the pattern “three” is changed to the hypothetical character pattern 90.
6, 907, and the hypothetical character pattern “two” is often separated as 908, 909. Also, the characters used are limited.

【００３１】図１０は街区部分の文字切出し再決定の処
理フロー図を示す。即ち、図２のパターン周辺情報抽出
２１２ないし文字切出し再決定処理２１５の処理フロー
図である。なお、具体的文字パターンの例については後
で図１１ないし図１４を用いて詳細に説明する。町域照
合２１１から得られた街区先頭位置の情報９１１を入力
として街区先頭検出１０１３を行い、パターンテーブル
２１９より街区部分の仮説文字パターンの認識候補文字
を文字種情報としてを抽出する（１００２）。本実施例
の場合は、文字識別２０７の結果の候補文字列の中で類
似度が最も高い候補文字を文字種情報としている。抽出
され文字種情報をキーとして、切出し用辞書２１４を参
照する。文字種抽出１００２の際に、文字種に対応する
仮説文字パターンの周辺情報を抽出する（２１２）。上
記周辺情報は仮説文字パターンの縦長、横長、縦横比、
パターン間隔長、連結成分数のデータである。FIG. 10 is a flowchart showing the process of re-determining the character segmentation of the block. That is, it is a processing flow diagram of the pattern peripheral information extraction 212 or the character cutout re-determination processing 215 of FIG. An example of a specific character pattern will be described later in detail with reference to FIGS. Block top detection 1013 is performed using the block top position information 911 obtained from the town area comparison 211 as input, and recognition candidate characters for the hypothetical character pattern in the block are extracted from the pattern table 219 as character type information (1002). In the case of this embodiment, the candidate character having the highest similarity among the candidate character strings resulting from the character identification 207 is set as the character type information. The extraction dictionary 214 is referred to using the extracted character type information as a key. At the time of character type extraction 1002, peripheral information of a hypothetical character pattern corresponding to the character type is extracted (212). The above peripheral information is the length, width, aspect ratio,
This is data on the pattern interval length and the number of connected components.

【００３２】切出し用辞書２１４の参照は、上記文字種
をキーにして辞書２１４で上記抽出された縦長、横長、
縦横比、パターン間隔長、連結成分数の周辺情報に対す
る尤度比を求める。ここで、辞書２１４には確信度とし
て各周辺情報の値に対する尤度比が登録されており、各
周辺情報の値に対する尤度比が確信度として算出される
（１００５、１００６、１００７、１００８）。算出さ
れたそれぞれの確信度を統合する（１０１０）。切出し
た仮説文字パターンが文字識別された文字種として正し
かったという事象をＨとし、ｅ₁，ｅ₂，ｅ₃，…ｅ_nをパ
ターンの周辺情報の各特徴量として、Ｐ（ｅ｜Ｈ）をＨ
だった場合のｅの生起確率とすると、ある特徴量ｅ_kに
対する尤度比Ｌ（ｅ_k｜Ｈ）は以下の式で求められる。The extraction dictionary 214 is referred to by using the character type as a key to extract the portrait, landscape,
The likelihood ratio of the aspect ratio, the pattern interval length, and the number of connected components to peripheral information is obtained. Here, the likelihood ratio for each value of the peripheral information is registered as a certainty factor in the dictionary 214, and the likelihood ratio for each peripheral information value is calculated as the certainty factor (1005, 1006, 1007, 1008). . The calculated certainty factors are integrated (1010). An event that was correct and H as character types cutout hypotheses character pattern is character _{_{identification, e 1, e 2, e}} 3, ... as the feature amount of the peripheral information of e _n pattern, P | a (e H) H
Assuming that the occurrence probability of e is the case, the likelihood ratio L (e _k | H) with respect to a certain feature value e _k is obtained by the following equation.

【００３３】[0033]

【００３４】[0034]

【数１】 (Equation 1)

【００３５】ここでＰ（Ｈ）をＨである生起確率とし、
式（１）で求められる複数の尤度比を用いて、ｅ₁，
ｅ₂，ｅ₃．．．，ｅ_nが得られた際のＨの生起確率Ｐ
（Ｈ｜ｅ₁，ｅ₂，ｅ₃，．．．，ｅ_n）を求める際にＢａ
ｙｅｓｒｕｌｅを用いると以下の式のようになる。Here, let P (H) be the occurrence probability of H,
Using a plurality of likelihood ratios determined by equation (1), e ₁ ,
e ₂ , e ₃ . . . , The probability of occurrence of H when e _n is obtained P
_{_{(H | e 1, e 2}} , e 3, ..., e n) Ba when obtaining the
If yes rule is used, the following equation is obtained.

【００３６】[0036]

【数２】 (Equation 2)

【００３７】式（２）を用いて、確信度統合処理１０１
０では、１００５、１００６、１００７、１００８のそ
れぞれで算出された尤度比をＢａｙｅｓｒｕｌｅを用
いて統合する。次に、統合された確信度を文字識別によ
り得られた類似度に掛けた値をアークに反映して重み付
けする（１０１１）。このようにアークに重み付けされ
た仮説文字パターンのデータを基に最初のノードから最
後のノードへたどる最適なルートを探索し、文字切出し
パスを決定する（１０１２）。その結果を街区照合（図
２の２１７）で使う。Using formula (2), certainty integration processing 101
In the case of 0, the likelihood ratios calculated in each of 1005, 1006, 1007, and 1008 are integrated using Bayes rule. Next, a value obtained by multiplying the integrated certainty by the similarity obtained by character identification is reflected on the arc and weighted (1011). The optimal route from the first node to the last node is searched based on the hypothetical character pattern data weighted to the arc as described above, and a character extraction path is determined (1012). The result is used in block matching (217 in FIG. 2).

【００３８】図１１は住所文字列の街区部分に対応した
仮説文字パターンを説明する図である。住所文字列の入
力画像の街区部分１１０２は図２の文字切出し仮説生成
２０６、境界線１１０４ように文字切出だされ、仮説文
字パターンとして、パターンテーブル２１９に記録され
ている。パタンテーブルでのデータの構成は図６、図７
で説明したものと同じである。点線１１０３は切出しの
境界線１１０４とノード番号１１０５（丸で囲んだ番号
５１、５２、…５５など）との対応を示している。例え
ば、仮説文字パターン「三」１１０６（アーク１１０７
に対応）を文字識別処理すると、「三」「３」「五」の
候補文字列１１０８が得られる。仮説文字パターン「三
｜」１１０９（アーク１１１１に対応）は文字識別処理
すると、「示」「未」「身」の候補文字列１１１０が得
られる。FIG. 11 is a diagram for explaining a hypothetical character pattern corresponding to a block in an address character string. The street section 1102 of the input image of the address character string is extracted as a character extraction hypothesis generation 206 and a boundary line 1104 in FIG. 2 and recorded in the pattern table 219 as a hypothesis character pattern. The data structure in the pattern table is shown in FIGS.
This is the same as that described above. A dotted line 1103 indicates the correspondence between the cutout boundary line 1104 and the node number 1105 (numbers 51, 52,. For example, the hypothetical character pattern “three” 1106 (arc 1107)
), A candidate character string 1108 of “3”, “3”, “5” is obtained. When the hypothetical character pattern “3 |” 1109 (corresponding to arc 1111) is subjected to character identification processing, candidate character strings 1110 of “show”, “not”, and “body” are obtained.

【００３９】また。各仮説文字パターンに対応するアー
クについて重み付けについては、図１０で述べた重み付
けがなされる。例えば上記文字種「示」「未」「身」１
１１０は、街区部分の文字として発生しない不適当な文
字であるため、仮説文字パターン１１０９に対応するア
ーク１１１１の重み付けを下げる。この重み付け処理に
より、街区部分として明らかに不適当であるアークを削
除する。他の仮説文字パターン（例えば「二」、
「一」、「一｜」「｜」等）についても対応するアーク
の重み付けを行い、不適当であるアークを削除する。Also, Regarding the weighting of the arc corresponding to each hypothesis character pattern, the weighting described in FIG. 10 is performed. For example, the above character type "show""not""body" 1
Since 110 is an inappropriate character that does not occur as a character in the block, the weight of the arc 1111 corresponding to the hypothetical character pattern 1109 is reduced. By this weighting process, arcs that are clearly inappropriate as a block portion are deleted. Other hypothetical character patterns (eg, "two",
For "1", "1 |", "|", etc., the corresponding arcs are weighted, and inappropriate arcs are deleted.

【００４０】図１２は図１０の街区部分の仮説文字パタ
ーンに対するアークの重み付け処理１０１１を説明する
図である。仮説文字パターン「三」１１０６を例に詳細
に説明する。先ず、仮説文字パターン「三」から周辺情
報を抽出する（２１２）。ここで、周辺情報とはパター
ンの縦長、横長、アスペクト比、連結成分数、文字間隔
長の値であり、また、文字識別結果の候補文字「三、
３、五」１１０８の１位の候補文字「三」もここで参照
する。これらの情報をまとめたものがブロック１２１３
内に１２１４、１２１５として示されている。FIG. 12 is a diagram for explaining arc weighting processing 1011 for the hypothetical character pattern in the block section in FIG. The hypothetical character pattern “3” 1106 will be described in detail as an example. First, peripheral information is extracted from the hypothetical character pattern "3" (212). Here, the peripheral information refers to values of the vertical length, horizontal length, aspect ratio, number of connected components, and character interval length of the pattern, and the candidate characters “3,
The first place candidate character “3” of “3, 5” 1108 is also referred to here. Block 1213 summarizes this information.
Are shown as 1214 and 1215 in FIG.

【００４１】文字種抽出１００２において、文字種
「三」の情報１２１４を、文字切出し用辞書２１４に送
る。切出し用辞書２１４には文字種「三」をキーにして
参照可能なアーク重みづけのためのデータ１２０５が格
納されており、文字種抽出１００２で得られた文字種を
キーにして切出し用辞書２１４中のインデックス「三」
を検索する。文字種「三」の検索する際、抽出した文字
種「三」と一致したインデックスの付いたデータ１２０
５から縦長１２０６、横長１２０７、アスペクト比１２
０８、連結成分数１２０９、文字間長１２１０に対応し
た尤度比を引き出し、各周辺情報の確信度を求め（１２
０４）、確信度統合１２１６で各尤度比（確信度１、
２、３、４、５）の統合を行い（１０１０）、重み付け
１０１１で仮説文字パターン「三」１１０６に対応した
アーク１１０７に重み付けを行う。上述のように、パタ
ーンの周辺情報を仮説文字パターン１１０６のアーク１
１０７に反映し、文字種に応じて最適な重み付けを行
う。In character type extraction 1002, information 1214 of character type “three” is sent to character extraction dictionary 214. The extraction dictionary 214 stores data 1205 for arc weighting that can be referred to using the character type “3” as a key, and the index in the extraction dictionary 214 using the character type obtained in the character type extraction 1002 as a key. "three"
Search for. When searching for the character type "3", the indexed data 120 that matches the extracted character type "3"
5 to vertical 1206, horizontal 1207, aspect ratio 12
08, the number of connected components 1209, and the likelihood ratio corresponding to the character length 1210, and the confidence of each peripheral information is obtained (12
04), the likelihood ratios (confidence 1,
2, 3, 4, and 5 are integrated (1010), and the weight 1101 is used to weight the arc 1107 corresponding to the hypothetical character pattern “three” 1106. As described above, the peripheral information of the pattern is converted to the arc 1 of the hypothetical character pattern 1106.
In step 107, optimal weighting is performed according to the character type.

【００４２】図１３は上記重み付けの結果をアークの線
の太さで表した図である。１３０１は仮説文字パターン
「三」１１０６に対応した周辺情報を加味して重み付け
をしたアークを示す。また、１３０３は仮説文字パター
ン「｜」に対応した周辺情報を加味して重み付けをした
アークで、これも周辺情報を加味すると非常に重みが大
きくなる。また、１３０２は、パターン「三」の下２つ
の連結成分を組み合わせて構成された「二」の文字パタ
ーンに非常によく似た仮説文字パターンのアークでノー
ド番号５２と５４を結ぶ。この仮説文字パターン「二」
を文字識別すると図１１に示すように文字種「二」
「三」「２」の候補文字が得られ、その中で文字種
「二」は類似度も大きな値が得られるため、文字切出し
としては、この仮説は妥当であるという判定される危険
性がある。しかし、周辺情報を加味すると、アーク１３
０２に対応したパターンは１つ上のパターンとの隙間が
狭いことにより文字切出し用辞書２１４を参照するとア
ーク１３０２上のパターンの確信度は低くなり、アーク
１３０２の重み付けも類似度だけの場合と比較して低く
なる。このため、アーク１３０１上の重み付けの方が大
きくなり、相対的にアーク１３０２を通るルートの方が
重みの合計の方が小さくなる。FIG. 13 is a diagram showing the result of the weighting in terms of the thickness of the arc line. Reference numeral 1301 denotes an arc that is weighted in consideration of the peripheral information corresponding to the hypothetical character pattern “three” 1106. Numeral 1303 denotes an arc weighted in consideration of the peripheral information corresponding to the hypothetical character pattern "|". The weight also becomes extremely large when the peripheral information is added. The numeral 1302 connects the node numbers 52 and 54 with a hypothetical character pattern arc very similar to the character pattern of “2” formed by combining the lower two connected components of the pattern “3”. This hypothetical character pattern "two"
When the character is identified, as shown in FIG.
Candidate characters “3” and “2” are obtained, and among them, the character type “2” also has a large similarity. Therefore, there is a risk that this hypothesis is determined to be valid as character extraction. . However, considering surrounding information, arc 13
Since the pattern corresponding to 02 has a narrow gap with the pattern one immediately above, when the character extraction dictionary 214 is referred to, the confidence of the pattern on the arc 1302 is low, and the weight of the arc 1302 is compared with the case where only the similarity is used. And then lower. For this reason, the weighting on the arc 1301 is larger, and the total of the weights of the route passing through the arc 1302 is relatively smaller.

【００４３】図１４は街区部分の文字切出し再決定を行
って選択された仮説文字パターン列を示している。すな
わち、アークへの重み付け（図１０の１０１１）により
仮説文字パターンに対応するアークに対し重み付けを行
い、重み付けの和が最大になるようにパスを決定する
と、仮説文字パターン「三」のアーク１３０１と仮説文
字パターン「｜」のアーク１３０３を選択してルートが
選択される。このようにして街区部分の文字切出し再決
定では、仮説文字パターン「三」、「｜」、「二」、
「｜」、「一」に対応したアークを選択してパスを構成
している。これらの選択されたアークのパターンに付属
した候補文字列を用いて、街区部分のラティスを生成す
る（２１６）。FIG. 14 shows a hypothetical character pattern string selected by re-determining the character segmentation of the block. That is, when the arc corresponding to the hypothetical character pattern is weighted by weighting the arc (1011 in FIG. 10) and the path is determined so that the sum of the weights is maximized, the arc 1301 of the hypothetical character pattern “three” is determined. The arc 1303 of the hypothetical character pattern "|" is selected, and the route is selected. In this manner, in the character segmentation redetermination of the block portion, the hypothetical character patterns “3”, “|”, “2”,
The arcs corresponding to "|" and "one" are selected to form a path. Using the candidate character string attached to the selected arc pattern, a lattice of the block is generated (216).

【００４４】図１５は上記実施の形態における住所文字
列全体の認識結果を示す。即ち街区専用の文字切出しを
行い、ラティス生成、街区照合処理を行った結果の文字
列に町域照合結果を結合することによる住所文字列全体
の認識結果を示す。入力した住所文字列画像６０１にお
いて、点線９０５は町域部分の範囲を示し、９１０は図
９で示した町域照合結果を示している。点線１５１０は
図１４で示した文字切出し再決定の切出し境界を示し、
文字列１５０６は文字切出し結果の１文字毎の文字識別
結果の候補文字列を示す。点線１５０７、１５０９は街
区部分の範囲を示し、文字列１５０８は文字識別結果の
候補文字列から街区照合２１７により得られた街区認識
すなわち読み取り結果を示す。文字列１５０４は町域照
合結果１５０２と街区照合結果１５０８を結合して生成
した宛名文字列全体の認識結果を示している。このよう
に、街区部分のみ文字切出しを再試行し、町域照合結果
と組み合わせることにより、住所文字列全体の認識精度
が向上する。FIG. 15 shows the result of recognition of the entire address character string in the above embodiment. That is, the recognition result of the entire address character string is shown by combining the town area collation result with the character string obtained by performing character extraction for the city block, lattice generation, and city area collation processing. In the input address character string image 601, a dotted line 905 indicates a range of a town area portion, and 910 indicates a town area collation result shown in FIG. Dotted line 1510 indicates the cutout boundary of character cutout redetermination shown in FIG.
A character string 1506 indicates a candidate character string of a character identification result for each character of the character cutout result. Dotted lines 1507 and 1509 indicate the range of the block, and a character string 1508 indicates the block recognition or reading result obtained by the block verification 217 from the candidate character string of the character identification result. A character string 1504 indicates a recognition result of the entire destination character string generated by combining the town area comparison result 1502 and the block comparison result 1508. As described above, the character segmentation is retried only for the block portion and is combined with the town area comparison result, thereby improving the recognition accuracy of the entire address character string.

【００４５】図１６は入力した宛名文字列と文字切出し
仮説のパターンテーブルと文字識別結果を画面上に表示
した例を示す。図３の表示装置３１０の画面上１６００
には、入力された住所文字列の画像１６０１、パターン
テーブルのノード１６０２−ｉ（ｉ＝１，２…８）、ノ
ード間を結んだアーク１６０３−１、１６０３−４、相
隣るノード間を結ぶアーク、仮説文字パターン１６０４
−ｊ（ｊ＝１，２…１０）、仮説文字パターン１６０４
−ｊに対する文字識別により得られた候補文字列１６０
５が表示されている。このように、表示装置３１０の画
面上１６００に画面表示を行うことにより、住所文字列
認識処理の際の文字切出し、文字識別処理の途中経過が
直感的に把握しやすくなり、装置の保守、機能拡張の際
に有用である。また、文字切出し辞書２１４を作成、更
新する際に仮説により切出されたパターンの周辺情報の
収集が必要である。FIG. 16 shows an example in which the input destination character string, the pattern table of the character extraction hypothesis, and the character identification result are displayed on the screen. 1600 on the screen of the display device 310 in FIG.
Includes an input address character string image 1601, pattern table nodes 1602-i (i = 1, 2,..., 8), arcs 1603-1 and 1603-4 connecting nodes, and adjacent nodes. Connecting arc, hypothetical character pattern 1604
−j (j = 1, 2,..., 10), hypothetical character pattern 1604
Candidate character string 160 obtained by character identification for −j
5 is displayed. In this manner, by displaying the screen on the screen 1600 of the display device 310, it becomes easy to intuitively grasp the progress of character segmentation and character identification processing during address character string recognition processing, and to maintain and function the apparatus. Useful for expansion. In addition, when creating and updating the character extraction dictionary 214, it is necessary to collect peripheral information of a pattern extracted based on a hypothesis.

【００４６】式（１）を参照すると、文字切出し用辞書
に格納されている尤度比とは、文字切出しが正解だった
仮説のパターンの周辺情報の分布を、文字切出しが不正
解だった仮説のパターンの周辺情報の分布で割った値で
ある。そこで、図１６のような画面表示に対しポインタ
により、あるアークを指定すると周辺情報の各パラメー
タ、文字識別結果等の情報を出力できるようにすると容
易にパターンの周辺情報を正解不正解に分類して収集で
きる。また、この画面表示をしたツールは文字識別処理
の文字識別用辞書の生成、更新の際に必要な文字パター
ンの収集にも有効であり、作業量を削減できる。Referring to the equation (1), the likelihood ratio stored in the character extraction dictionary indicates the distribution of the peripheral information of the pattern of the hypothesis where the character extraction was correct, and the hypothesis that the character extraction was incorrect. Is divided by the distribution of the peripheral information of the pattern. Therefore, when a certain arc is designated by the pointer on the screen display as shown in FIG. 16 and information such as each parameter of the peripheral information and the character identification result can be output, the peripheral information of the pattern is easily classified as the correct answer. Can be collected. The tool displaying the screen is also effective for collecting a character pattern required for generating and updating a character identification dictionary for character identification processing, and can reduce the amount of work.

【００４７】[0047]

【発明の効果】以上説明したように本発明によると、文
字切出し、即ち文字読み取りの精度が向上する。特に郵
便物の手書きされた宛名の住所の読み取りを行う場合、
町名記載部と街区記載部の処理を異ならしめ、街区記載
部の文字読み取りに本発明を適用した場合、街区に使用
される文字は限定されるので、文字切出し、辞書は小容
量で良く、そのため処理時間を少なくすることができ
る。町名記載部と街区記載部を合わせた住所の読み取り
を効率的かつ精度良く実現できる。As described above, according to the present invention, the accuracy of character extraction, that is, character reading, is improved. Especially when reading the address of a handwritten address on mail,
When the processing of the street name description section and the block description section are made different, and the present invention is applied to the character reading of the block description section, the characters used for the block are limited, so the character extraction and the dictionary may be small in size, and therefore, Processing time can be reduced. It is possible to efficiently and accurately read an address including the street name description section and the block description section.

[Brief description of the drawings]

【図１】従来技術における文字切出しの曖昧性が生じる
入力画像例を示す図である。FIG. 1 is a diagram illustrating an example of an input image in which ambiguity in character segmentation occurs in the related art.

【図２】本発明による文字読み取り方法の一実施形態を
示す処理フロー図である。FIG. 2 is a processing flowchart showing an embodiment of a character reading method according to the present invention.

【図３】本発明による文字識別方法を実施する文字識別
装置の一実施の構成図である。FIG. 3 is a configuration diagram of one embodiment of a character identification device that performs a character identification method according to the present invention.

【図４】図２の文字行抽出２０４の処理を説明する図で
ある。FIG. 4 is a diagram illustrating a process of character line extraction 204 of FIG. 2;

【図５】図２の縦横書き判定２０５の処理を説明する図
である。FIG. 5 is a diagram illustrating a process of vertical / horizontal writing determination 205 in FIG. 2;

【図６】図２の文字切出し仮説生成２０６を入力画像と
対応付けて説明する図である。FIG. 6 is a diagram illustrating the character segmentation hypothesis generation 206 of FIG. 2 in association with an input image.

【図７】図２のパターンテーブル２１９のデータ構造を
示す図である。FIG. 7 is a diagram showing a data structure of a pattern table 219 in FIG.

【図８】図２の文字切出し決定処理２０９により一意に
決定された仮説文字パターンの列を概念的に示す図であ
る。8 is a diagram conceptually showing a sequence of hypothetical character patterns uniquely determined by the character cutout determination process 209 in FIG.

【図９】図２のラティス生成２１０及び町域照合２１１
の処理を説明する図である。FIG. 9 shows the lattice generation 210 and the town area collation 211 of FIG.
It is a figure explaining the processing of.

【図１０】図２の街区部分のみ漢数字、算用数字用に文
字切出し再決定の処理２１５を説明する図である。FIG. 10 is a diagram illustrating a process 215 of character segmentation and redetermination for Chinese numerals and arithmetic numerals only in the block portion of FIG. 2;

【図１１】街区部分の入力画像に対する仮説文字パター
ンの対応を示した文字識別を説明する図である。FIG. 11 is a diagram illustrating character identification indicating correspondence of a hypothetical character pattern to an input image of a block section.

【図１２】図１０の仮説中文字パターンに対する確信度
を算出する処理を説明する図である。12 is a diagram illustrating a process of calculating the degree of certainty for the hypothetical character pattern in FIG. 10;

【図１３】街区部分の文字切出し仮説に対し、パターン
の確信度を算出しアークの重み付けに反映した結果を示
した図である。FIG. 13 is a diagram showing a result obtained by calculating the certainty factor of a pattern and reflecting it on the weighting of an arc with respect to a character segmentation hypothesis in a block area.

【図１４】街区部分の文字切出しの再決定を行った際に
選択された文字切出しを示す図である。FIG. 14 is a diagram illustrating a character cutout selected when the character cutout of the block is re-determined.

【図１５】住所文字列の町域部分と街区部分の認識結果
を組み合わせて住所文字列全体の認識結果を示した図で
ある。FIG. 15 is a diagram showing the recognition result of the entire address character string by combining the recognition results of the town area part and the block part of the address character string.

【図１６】本発明による宛名認識装置のメンテナンス、
機能拡張、辞書の生成更新のために用いるツールの画面
表示例を示した図である。FIG. 16 shows maintenance of an address recognition device according to the present invention;
FIG. 7 is a diagram showing an example of a screen display of a tool used for function extension and dictionary generation / update.

[Explanation of symbols]

２０６：文字切出し仮説生成、２０９：文字切出し決
定、２１０：ラティス生成、２１１：町域照合、２
１２：パターン周辺情報抽出２１３：パターン確信度算出、２１４：文字切出し用
辞書、２１５：文字切出し再決定、２１６：ラティス
再生成。206: character extraction hypothesis generation, 209: character extraction determination, 210: lattice generation, 211: town area collation, 2
12: Pattern peripheral information extraction 213: Pattern certainty factor calculation, 214: Character extraction dictionary, 215: Character extraction redetermination, 216: Lattice regeneration.

フロントページの続き (72)発明者藤澤浩道東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内 (72)発明者緒方日佐男東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内 (72)発明者嶋好博東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内 (72)発明者渡辺成愛知県尾張旭市晴丘町池上１番地株式会社日立製作所オフィスシステム事業部内Continuing from the front page (72) Inventor Hiromichi Fujisawa 1-280 Higashi Koikekubo, Kokubunji-shi, Tokyo Inside the Hitachi, Ltd. Central Research Laboratory (72) Inventor Hisao Ogata 1-280 Higashi Koikekubo, Kokubunji-shi, Tokyo Hitachi, Ltd. Inside the research institute (72) Yoshihiro Shima 1-280 Higashi Koikekubo, Kokubunji-shi, Tokyo Inside the Central Research Laboratory, Hitachi, Ltd. (72) Inventor Shigeru Watanabe 1 Ikegami, Haruoka-cho, Owariasahi-shi, Aichi Pref. Inside

Claims

[Claims]

A first step of converting image information including character line information into an electric signal; extracting a character line region of the image information converted into the electric signal to form a character in the character line region; Second to cut out a combination of connected components as multiple hypothetical character patterns
A third step of identifying a character by referring to a character identification dictionary for the hypothetical character pattern to obtain a plurality of candidate characters for recognition, a fourth step of extracting peripheral information of the hypothetical character pattern and the third step, With reference to the extraction dictionary having the peripheral information of the character using the recognition candidate character obtained in the step as a key, the confidence of the peripheral information of the hypothetical character pattern extracted in the fourth step is obtained, and the hypothetical character pattern is weighted. A fifth step, a sixth step of determining character extraction using the weighting of the hypothetical character pattern, and a character identification dictionary based on a set of character identification results generated from the character extraction results determined in the sixth step. 7. A character reading method comprising the steps of: performing a word collation to recognize the character information to be identified; and reading the character line. Method.

2. The character reading method according to claim 1, wherein the peripheral information of the hypothetical character pattern is information of a vertical length, a horizontal length, an aspect ratio, a pattern interval length, and the number of connected components of the hypothetical character pattern. The cut-out dictionary stores likelihood distribution information for each of the characters in the portrait, landscape, aspect ratio, pattern interval length, and number of connected components. Reference to the character cut-out dictionary in the fifth step is a candidate for the character identification result. A method of reading an address, wherein the character extraction dictionary is referred to by using a character type of a character as a key.

3. The method according to claim 1, wherein the fifth step includes a generation / update step of generating / updating the character extraction dictionary, wherein the generation / update step includes a step of generating a character string to be read on a screen of a display device. An input image, the hypothetical character pattern, a display step of displaying candidate characters of a character identification result for the hypothetical character pattern, and an updating step of updating a character extraction dictionary by using a screen display of the display device. And a step of graphically displaying a character string recognition processing state on a screen.

4. A first step of converting image information including character line information having a street name description portion and a block description portion into an electric signal, extracting a character line region of the image information converted into the electric signal, Second to cut out a combination of connected components constituting the characters in the character line area as a plurality of hypothetical character patterns
And a third step of identifying a character by referring to a character identification dictionary for the hypothetical character pattern to obtain a plurality of recognition candidate characters and the recognition candidate character. A fourth step of recognizing the character line of the street name description portion from the plurality of hypothetical character patterns and detecting the top position information of the block description portion, and a block description portion based on the top position information obtained in the fourth step. A fifth step of extracting pattern peripheral information of a hypothetical character corresponding to a candidate character for recognition of the hypothetical character pattern of
Fifth, weighting a hypothetical character pattern for obtaining a certainty factor of the pattern peripheral information of the hypothetical character pattern with reference to the extraction dictionary in which the likelihood corresponding to the pattern peripheral information is described using the recognition candidate character as a key. A sixth step of extracting a character of a block described by the step and the weighting, and a second step of performing word matching with a block dictionary based on a set of character identification results generated from the sixth step and reading the block character line to be identified. An address reading method comprising seven steps.

5. The method according to claim 4, wherein the second step is a step of extracting characters from the beginning and end of the extracted character line and identifying vertical and horizontal writing by identifying the extracted character pattern. Address reading method characterized by having.

6. The method according to claim 4, wherein the pattern peripheral information includes a vertical length, a horizontal length, an aspect ratio, a pattern interval length,
The character extraction dictionary has information on the number of connected components, and stores the vertical length, the horizontal length, the aspect ratio, the pattern interval length, and the likelihood distribution information for the number of connected components for each character used in the block description. An address reading method using the characters of the character extraction dictionary using the recognition candidate characters of the step as keys.

7. The method according to claim 4, wherein the fifth step includes a generation / update step of generating / updating the character extraction dictionary, wherein the generation / update step includes an address character string to be read on a screen of a display device. A display step of displaying the input image, the hypothetical character pattern, and candidate characters of the character identification result for the hypothetical character pattern, and an updating step of updating a character extraction dictionary using the screen display of the display device. And a step of graphically displaying, on a screen, a recognition processing state of the address character string, the address reading method.