JPH1185909A

JPH1185909A - Address recognizing method

Info

Publication number: JPH1185909A
Application number: JP9238032A
Authority: JP
Inventors: Masashi Koga; 昌史古賀; Ryuji Mine; 竜治嶺; Hiroshi Ueda; 洋上田; Yutaka Sako; 裕酒匂; Hiromichi Fujisawa; 浩道藤澤
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1997-09-03
Filing date: 1997-09-03
Publication date: 1999-03-30
Anticipated expiration: 2017-09-03
Also published as: JP3468668B2

Abstract

PROBLEM TO BE SOLVED: To decrease the frequency of character recognition and to improve the character recognition precision by limiting the character category of standard patterns to be compared with respective input patterns to only character categories which can be connected by referring to an address tree when an address character string is recognized. SOLUTION: Character segmentation 401 cuts a character pattern candidate out of a character line image. Then character discrimination 402 discriminates the respective character pattern candidates and passes the results to a search processing 404. Then the search processing 404 dynamically generate a search tree for searching an address tree stored in an address dictionary 408 according to the character recognition result. After the generation of the search tree is completed, character string candidate selection 405 selects and outputs the path reaching a leaf from the starting node of the search tree, i.e., a recognized character string. Standard pattern selection 403 selects a standard pattern in a character category needed to expand a next node of the search tree is selected out of a standard pattern dictionary 406 and outputted to a character discrimination processing 402.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は文字読取りに関係
し、特に郵便物等の文書上に記載された住所を読取る住
所読取りにかかわる。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to character reading, and more particularly to address reading for reading an address written on a document such as a mail.

【０００２】[0002]

【従来の技術】郵便物に記載されている宛名の読取りと
しては、一般的には以下の様な手順が知られている。2. Description of the Related Art The following procedure is generally known for reading an address written on a mail.

【０００３】（１）宛名文字行の位置を決定する（文字
行抽出）。(1) The position of a destination character line is determined (character line extraction).

【０００４】（２）文字パタンを切り出す（文字切出
し）。(2) Extract a character pattern (character extraction).

【０００５】（３）各々の文字パタンの文字カテゴリ
（文字コード）を識別する（文字識別）。(3) A character category (character code) of each character pattern is identified (character identification).

【０００６】（４）文字の識別結果を住所文字列として
解釈する（文字列照合）。(4) Interpret the character identification result as an address character string (character string collation).

【０００７】ここで、（３）には、予め記憶している標
準パタン中で最も入力パタンに類似したものを選び出す
パタン整合法がよく用いられる。Here, for (3), a pattern matching method for selecting the most similar to the input pattern from the standard patterns stored in advance is often used.

【０００８】今、例として、図１（Ａ）の様な文字行か
ら（Ｂ）に示す様な５つの文字パタンが切出された場合
を考えてみる。また、認識対象の住所が図２の様なもの
（仮想的な住所）であったとする。Now, as an example, consider a case where five character patterns as shown in FIG. 1B are cut out from a character line as shown in FIG. It is also assumed that the address to be recognized is as shown in FIG. 2 (virtual address).

【０００９】従来の方式では、この場合、文字識別処理
において、５つの文字パタンをそれぞれ読み取り対象の
文字カテゴリである「旭安井口山市住所松上新中町田波
板北本里」の１９の文字カテゴリの標準パタンと比較す
る（現実には、全国の地名を読み取るためには約４００
０字、１つの市内の地名を読み取るためには数百文字が
読み取り対象となる）。そして、文字識別処理の結果得
られた候補文字を、住所文字列と比較し、最も類似した
ものを選択していた。In the conventional method, in this case, in the character identification processing, 19 characters of "Asahi Yasuiguchi City City Address Matsugami Shinnakamachi Tambaita Kitamotori" which is a character category to be read in each of the five character patterns. Compare with the standard pattern of the category (actually, about 400
In order to read zero characters and place names in one city, several hundred characters are read.) Then, the candidate character obtained as a result of the character identification processing is compared with the address character string, and the most similar character is selected.

【００１０】[0010]

【発明が解決しようとする課題】上記の様な処理手順に
おいては、切出された文字パタンを、住所中に有り得る
全ての文字カテゴリの標準パタンと比較する必要があ
り、処理量が大きくなってしまう、という問題点があっ
た。In the above-described processing procedure, it is necessary to compare the extracted character pattern with the standard patterns of all the character categories that can be present in the address. There was a problem that

【００１１】本発明で解決しようとする第１の課題は、
住所に現われる文字の並びに関する知識を用い、各パタ
ンと比較する標準パタンの文字カテゴリを絞り込み、処
理量の削減を図ることである。なお、この前述の例で
は、住所中の市、町の部分を対象としているが、本課題
は、市名、町名のみならず、丁目番地の表記、アパート
やビルの名称、氏名などを含めた住所を対象とする。A first problem to be solved by the present invention is as follows.
The purpose is to reduce the processing amount by narrowing down the character category of the standard pattern to be compared with each pattern by using knowledge about the arrangement of characters appearing in the address. In the above example, the city and town in the address are targeted.However, in this project, not only the city and town names, but also the names of street addresses, names of apartments and buildings, names, etc. were included. Target address.

【００１２】また、文字切出しに多重仮説検定法（昭和
５９年度電子通信学会総合全国大会１５５８「接触した
手書き文字の自動分離を行う文字切り出し方式」）を用
いる場合、文字識別を行う必要のある文字候補のパタン
の数が多くなり、処理量が増大するという問題がある。When a multiple hypothesis testing method (character extraction method for automatically separating contacted handwritten characters) is used for character extraction (1558, National Institute of Electronics and Communication Engineers, 1558, National Convention, 1984), characters that need to be subjected to character identification need to be identified. There is a problem that the number of candidate patterns increases and the processing amount increases.

【００１３】本発明で解決しようとする第２の課題は、
多重仮説検定法において必要な文字識別の回数を削減を
図り、更に識別精度の向上を図ることである。The second problem to be solved by the present invention is as follows.
An object of the present invention is to reduce the number of times of character identification required in the multiple hypothesis testing method and further improve the identification accuracy.

【００１４】[0014]

【課題を解決するための手段】本発明では、住所の文字
列の知識を図３に記すように住所文字列の各文字が節点
に対応するような木構造で表現する。以下、このような
木構造を「住所木」と呼ぶ。そして、文字識別を利用し
て住所木中の子節点（木構造の各節点の末端側に枝を介
して連接している節点）を選択しながら住所木を探索す
ることにより、住所文字列を認識する。この際、各入力
パタンと比較する標準パタンの文字カテゴリを、住所木
を参照して接続可能な文字カテゴリのみに限定すること
により、上記の課題が解決できる。According to the present invention, knowledge of a character string of an address is represented by a tree structure in which each character of the address character string corresponds to a node as shown in FIG. Hereinafter, such a tree structure is referred to as an “address tree”. Then, by using the character identification to search the address tree while selecting child nodes in the address tree (nodes connected to the terminal side of each tree structure via a branch), the address character string is obtained. recognize. At this time, the above problem can be solved by limiting the character category of the standard pattern to be compared with each input pattern to only the character categories that can be connected with reference to the address tree.

【００１５】例えば、出発節点から次の節点を選択する
場合には、図１の第一の文字パタンを「中」「所」の２
字種の標準パタンとマッチングする。もし、いずれかの
文字のみに類似している場合には、類似している方の文
字に対応する節点を選択し、以下同様に節点の探索を進
める。もし、入力文字パタンが次の節点の両方の文字に
十分類似している場合には、両方の文字に対応する両方
の節点を次の節点の候補（複数）として展開し、以下同
様に探索を進める。もし、入力文字パタンが次の節点の
いずれの文字とも類似していない場合には、その枝の探
索を停止する。こうして、木構造の葉、すなわち「町」
「里」「旭」「井」「松」「口」等の文字に対応する節
点に到達したならば、出発節点から当該の葉に至る文字
列を読み取り結果とする。もし、複数の葉に達した場合
（これは、複数の住所読み取り結果が候補として残った
ことを意味する）は、より詳細な判定（例えば、各節点
の展開の際の文字識別結果の確からしさの平均値）によ
り候補を選択する。For example, when selecting the next node from the starting node, the first character pattern in FIG.
Matches with the standard pattern of the character type. If the character is similar to only one of the characters, a node corresponding to the character that is more similar is selected, and the search for a node is performed in the same manner. If the input character pattern is sufficiently similar to both characters at the next node, both nodes corresponding to both characters are expanded as candidates for the next node (s), and so on. Proceed. If the input character pattern is not similar to any character at the next node, the search for that branch is stopped. Thus, the leaves of the tree structure, or "town"
When a node corresponding to a character such as “Sato”, “Asahi”, “I”, “Matsu” or “Mouth” is reached, a character string from the starting node to the leaf is read. If a plurality of leaves are reached (this means that a plurality of address reading results remain as candidates), a more detailed judgment (for example, the certainty of the character identification result at the time of developing each node). Are selected based on the average value of.

【００１６】仮に、各文字カテゴリに１つの標準パタン
を用いるとすると、従来の方式だと１９（カテゴリ）×
５（パタン）＝９５回の標準パタンとのマッチングが必
要であったのに対し、本方式だと約１０回程度のマッチ
ングで十分であり、大幅な処理量の削減が可能である。
また、全国の地名を読み取ることを想定し、文字種を４
０００字種、文字列長さの平均値を１０字とすると、１
つの文字列を読み取るのに従来方式では約４万回のマッ
チングが必要なのに対し、本方式では数百回程度のマッ
チングで十分である。Assuming that one standard pattern is used for each character category, the conventional method requires 19 (category) ×
Whereas 5 (patterns) = 95 matchings with the standard pattern are required, with this method, about 10 matchings are sufficient, and a significant reduction in processing amount is possible.
In addition, assuming that the place names in the whole country are read,
If the average value of 000 character types and character string length is 10 characters, 1
The conventional method requires about 40,000 matches to read one character string, whereas the present method requires several hundred matches.

【００１７】また、上で説明したような方式を多重仮説
検定法に応用した場合も、同様に文字識別の処理量を削
減できる。また、文字カテゴリを限定しているので、誤
った位置でパタンが切出された場合にそのパタンを文字
識別でリジェクトできる確率は高く、以降のパタンの検
定を中断でき、さらに処理量を削減することができる。Also, when the above-described method is applied to the multiple hypothesis testing method, the processing amount of character identification can be similarly reduced. In addition, since the character category is limited, when a pattern is cut out at an incorrect position, the probability that the pattern can be rejected by character identification is high, and subsequent pattern verification can be interrupted, further reducing the processing amount be able to.

【００１８】本方式は、読み取り精度の向上にも効果が
ある。各文字カテゴリで複数の標準パタンをマッチング
する Parzen Density Estimation 法やｋ−ＮＮ法が、
文字識別の精度向上に有効な事が知られている。しか
し、標準パタン数の増加は処理量の増加につながり、従
来の方式では標準パタン数を十分に増やすことは出来な
かった。しかし、本方式では識別対象の文字カテゴリが
大幅に削減されているため、標準パタン数を多くするこ
とが可能となり、読み取り精度向上が図れる。This method is also effective in improving the reading accuracy. Parzen Density Estimation method and k-NN method that match multiple standard patterns in each character category,
It is known that it is effective for improving the accuracy of character identification. However, an increase in the number of standard patterns leads to an increase in the amount of processing, and the conventional method cannot sufficiently increase the number of standard patterns. However, in this method, since the character categories to be identified are greatly reduced, the number of standard patterns can be increased, and reading accuracy can be improved.

【００１９】[0019]

【発明の実施の形態】本発明の実施形態の一例を、図４
のデータ・フロー・ダイアグラムを用いて説明する。本
明細書では、データ・フロー・ダイアグラムはゲーン・
サクソン記法に従う（Ｊ．マーチン「ソフトウエア構造
化技法」近代科学社、ISBN4 - 7649 -0124 - 2 C3050 P
5562E）。FIG. 4 shows an example of the embodiment of the present invention.
This will be described with reference to the data flow diagram of FIG. In this specification, the data flow diagram is
Follows Saxon notation (J. Martin, "Software Structuring Technique", Modern Science, ISBN4-7649-0124-2 C3050P
5562E).

【００２０】本実施例は、住所の記載された文字行の画
像を入力とし、文字列認識結果の住所文字列を出力とす
る住所認識処理である。まず、文字切出し４０１は、文
字行画像中より文字パタン候補を切出す。文字識別４０
２は、各文字パタン候補の文字識別を行い、探索処理４
０４に結果を渡す。探索処理４０４は、文字識別結果に
応じ、住所辞書４０８に格納されている住所木を探索す
る探索木を動的に生成する。探索木の生成が終了した
後、文字列候補選択４０５は、探索木の出発節点から葉
に至る経路、すなわち認識された文字列を選びだし、出
力する。標準パタン選択４０３は、探索木の次の節点を
展開するのに必要な文字カテゴリの標準パタンを標準パ
タン辞書４０６から選択し、文字識別処理４０２に出力
する。The present embodiment is an address recognition process in which an image of a character line on which an address is described is input and an address character string as a result of character string recognition is output. First, the character extraction 401 extracts character pattern candidates from a character line image. Character identification 40
2 is to perform character identification of each character pattern candidate,
Pass the result to 04. The search processing 404 dynamically generates a search tree for searching for an address tree stored in the address dictionary 408 according to the character identification result. After the generation of the search tree is completed, the character string candidate selection 405 selects and outputs a path from the starting node of the search tree to the leaf, that is, a recognized character string. The standard pattern selection 403 selects a standard pattern of a character category required for expanding the next node of the search tree from the standard pattern dictionary 406, and outputs it to the character identification processing 402.

【００２１】本発明で生成される探索木の生成過程の例
を図５に模式的に示す。ここでは、図１に示す様なパタ
ン列を、図２に示す様な住所のいずれであるかを認識す
る処理を例としている。５０６が探索木の出発節点であ
る。出発節点５０６からは、節点５０１すなわち図３の
Ａに対応する節点が展開される。節点５０１から次の節
点を展開するために、第一の文字パタンを「中」「所」
の２字種の標準パタンとマッチングする。もし、いずれ
かの文字に類似している場合には、類似している文字に
対応する節点を新たに生成する。本例の様に「中」に類
似していると判断された場合には、図３の住所木中の節
点Ｂに対応する節点５０２を生成する（図５では、参考
のため、文字識別の結果選択されなかった枝も破線で記
載してある）。次に、節点５０２の次の節点を生成する
ため、第２のパタンを「住」「山」の２文字カテゴリ、
すなわち図３のＢの次の節点として有り得る文字カテゴ
リ、の標準パタンとマッチングする。入力文字パタンが
両方の文字に十分類似している場合には、両方の文字に
対応する節点を生成する。以下、同様に節点の展開を繰
り返す。もし、５０３の様にリジェクトとなった場合、
すなわち入力文字パタンが次の節点のいずれの文字とも
類似していない場合には、その枝の探索を停止する。こ
うして節点５０６の様な住所木の葉に対応する節点に達
したら、その枝の探索を停止する。探索を継続すべき枝
が全て無くなったら全ての探索処理は終了である。FIG. 5 schematically shows an example of a process of generating a search tree generated by the present invention. Here, an example is given of a process for recognizing which of the addresses shown in FIG. 2 is the pattern sequence shown in FIG. 506 is a starting node of the search tree. From the starting node 506, a node 501, that is, a node corresponding to A in FIG. To develop the next node from node 501, change the first character pattern to “medium” “place”
Matches with the standard pattern of the two-character type. If the character is similar to any of the characters, a node corresponding to the similar character is newly generated. If it is determined that the character is similar to "medium" as in this example, a node 502 corresponding to the node B in the address tree in FIG. 3 is generated (in FIG. Branches that were not selected as a result are also indicated by broken lines). Next, in order to generate a node next to the node 502, the second pattern is a two-letter category of "living" and "mountain",
That is, matching is performed with a standard pattern of a character category that can be used as a node next to B in FIG. If the input character pattern is sufficiently similar to both characters, nodes corresponding to both characters are generated. Hereinafter, the expansion of the nodes is similarly repeated. If it is rejected like 503,
That is, if the input character pattern is not similar to any character at the next node, the search for that branch is stopped. When the node corresponding to the leaf of the address tree such as the node 506 is reached, the search for the branch is stopped. When all the branches for which the search should be continued are gone, all the search processes are completed.

【００２２】探索終了後、生成された探索木中の出発節
点から葉に至る経路上の文字の列を、文字列認識結果と
する。もし、複数の葉に到達した場合には、より詳細な
判定により候補を選択する。候補の選択には、例えば経
路上のパタンの文字識別結果の類似度の平均値が最大の
ものを選ぶ等の手法がある。After the search is completed, a character string on the path from the starting node to the leaf in the generated search tree is set as a character string recognition result. If a plurality of leaves are reached, a candidate is selected through more detailed judgment. For example, there is a method of selecting a candidate, such as selecting a candidate having a maximum average similarity of the character identification results of the patterns on the route.

【００２３】なお、上記実施例では、市名、町名から成
る住所を用いて処理の一連の流れを説明したが、更に、
市名や町名に続けて、丁目番地の表記、アパートやビル
の名称等を含めた形での住所の一連の処理にも本願発明
を適用できる。これは、丁目、番地名、アパート名、氏
名も有限の文字列であるので、前述の木で表現できるこ
とによる。In the above-described embodiment, a series of processing has been described using an address consisting of a city name and a town name.
The invention of the present application can be applied to a series of processing of addresses including notation of street addresses, names of apartments and buildings, etc., following city names and town names. This is because the street, street name, apartment name, and name are also finite character strings, and can be represented by the tree described above.

【００２４】本発明は、文字切出しに多重仮説検定法を
用いた場合にも応用できる。多重仮説検定法において
は、図６に示す様に文字の切出し方の仮説がネットワー
クで表現される。以下、このようなネットワークを切出
し仮説ネットワークと呼ぶ。切出し仮説ネットワーク上
では、切出されたパタンが辺、パタンの境界が節点で表
現されている。また、文字行中の文字の切出し方の仮説
は、切出し仮説ネットワークの左端の出発節点６０１か
ら右端への終点節点６０２への経路で表現される。The present invention can be applied to the case where multiple hypothesis testing is used for character segmentation. In the multiple hypothesis test method, as shown in FIG. 6, a hypothesis on how to extract characters is expressed in a network. Hereinafter, such a network is referred to as a cut-out hypothesis network. On the extraction hypothesis network, the extracted pattern is represented by an edge, and the boundary of the pattern is represented by a node. The hypothesis on how to extract characters in a character line is expressed by a path from the leftmost starting node 601 to the rightmost end node 602 of the extracted hypothesis network.

【００２５】図４の文字切出し処理４０１が、図６の様
な切出し仮説ネットワークを生成する場合の探索木が生
成される過程を図７に示す。多重仮説検定法では、ある
パタンの後続するパタンを予め一意に決めない。このた
め、図５で示した探索の過程に加え、各節点から次の節
点を展開する際に、次に文字識別すべきパタンを切出し
仮説ネットワーク中から選択する過程が必要となる。図
７中の７０１〜７０５は、この選択の過程を模式的に示
している。図７のような探索木でも、図５と同様に、各
節点は図３の住所木の節点へ対応している。例えば、７
１０と７１１はＡに、７１２と７１３はＢに対応してい
る。ただし、出発節点７１４は、図５の例と同様に住所
木への対応を持たない。FIG. 7 shows a process of generating a search tree when the character extraction process 401 of FIG. 4 generates an extraction hypothesis network as shown in FIG. In the multiple hypothesis testing method, a pattern following a certain pattern is not uniquely determined in advance. Therefore, in addition to the search process shown in FIG. 5, when developing the next node from each node, a process of extracting a pattern to be next character-identified and selecting it from the hypothesis network is required. Reference numerals 701 to 705 in FIG. 7 schematically show the selection process. In the search tree as shown in FIG. 7, each node corresponds to the node in the address tree in FIG. 3, as in FIG. For example, 7
10 and 711 correspond to A, and 712 and 713 correspond to B. However, the departure node 714 does not have a correspondence to the address tree as in the example of FIG.

【００２６】図８は、探索木の情報を格納する探索木テ
ーブルの形式を示す。探索木テーブルは各レコードが３
つのデータ項目を有するテーブルで表現される。各レコ
ードは、探索木の枝に対応する。図８は図７の様な探索
木を表わしており、レコード８０１は枝７１４、レコー
ド８０２は枝７０６、レコード８０３は枝７０７、レコ
ード８０４は枝７０８、レコード８０５は枝７０９に対
応する。第一のデータ項目ａｄｒＮｏｄｅ（次接点）
は、当該の枝の末端側の節点に対応する住所木の節点へ
のポインタを格納する。例えば８０２に対応する探索木
の枝は７０６であり、７０６の子節点７１２および７１
３に対応する住所木の節点はＢであるので、８０１のａ
ｄｒＮｏｄｅにはＢへのポインタを格納する。第二のデ
ータ項目ｐｔｎＣｒｄ（パタン確信度）は、子節点の展
開の確からしさの値を格納する。ｐｔｎＣｒｄの値とし
ては、例えば後述する文字識別処理の過程で得られる確
率密度等を用いる。第三のデータ項目ｓｔｒＣｒｄ（文
字列確信度）は、探索木の根から子節点に至る一連の展
開の確からしさを表わす値を格納する。ｓｔｒＣｒｄ
（文字列確信度）の値としては、例えば根から当該子節
点に至る経路上の枝のｐｔｎＣｒｄの平均値等を用い
る。第四のデータ項目ｐｔｎ（入力パターン）は、子節
点を展開する際に用いたパタンへのポインタを格納す
る。親節点が出発節点の場合には、ヌルポインタ（ポイ
ンタとしてありえない値）を格納し、これより上位の節
点が無いことを記憶する。第五のデータ項目ｐｒｖ（直
前の枝）は、当該節点の親節点へのポインタを格納す
る。出発節点の場合は、ｐｒｖにはヌルポインタを格納
する。この探索木テーブルは、住所中の文字の連接関係
を木構造で記憶したテーブルといえる。なお本例では、
後述する処理手順により、７１５の様に次節点を展開す
る事が出来なかった枝はテーブルに格納しない様になっ
ている。FIG. 8 shows the format of a search tree table for storing search tree information. In the search tree table, each record is 3
It is represented by a table with two data items. Each record corresponds to a branch of the search tree. FIG. 8 shows a search tree as shown in FIG. 7, in which record 801 corresponds to branch 714, record 802 corresponds to branch 706, record 803 corresponds to branch 707, record 804 corresponds to branch 708, and record 805 corresponds to branch 709. First data item adrNode (next contact)
Stores a pointer to a node of the address tree corresponding to the node on the terminal side of the branch. For example, the branch of the search tree corresponding to 802 is 706, and child nodes 712 and 71 of 706
Since the node of the address tree corresponding to No. 3 is B, a
A pointer to B is stored in drNode. The second data item ptnCrd (pattern certainty factor) stores a value of the likelihood of child node expansion. As the value of ptnCrd, for example, a probability density or the like obtained in the process of character identification processing described later is used. The third data item strCrd (character string certainty factor) stores a value indicating the likelihood of a series of expansion from the root of the search tree to the child node. strCrd
As the value of (character string certainty factor), for example, an average value of ptnCrd of a branch on a path from the root to the child node or the like is used. The fourth data item ptn (input pattern) stores a pointer to a pattern used when expanding child nodes. If the parent node is the starting node, a null pointer (a value that cannot be used as a pointer) is stored, and the fact that there is no higher node is stored. The fifth data item prv (previous branch) stores a pointer to the parent node of the node. In the case of the starting node, a null pointer is stored in prv. This search tree table can be said to be a table in which the connection relation of the characters in the address is stored in a tree structure. In this example,
According to the processing procedure described later, a branch in which the next node cannot be expanded like 715 is not stored in the table.

【００２７】図９に、住所木を格納する住所木テーブル
のデータ形式を示す。住所木テーブルは各レコードが３
つのデータ項目を有するテーブルで表現される。各レコ
ードが、住所木の枝に対応する。また、各節点には固有
な整数値（ｉｄ）を割り付ける。第一のデータ項目ｐｒ
ｔＮｏｄｅ（親接点）は、枝の上位側の節点のｉｄを格
納する。第二のデータ項目ｃｌｄＮｏｄｅ（子接点）に
は、枝の下位側の節点のｉｄを格納する。図中では、図
３に対応するよう、節点のｉｄを「Ａ」「Ｂ」と記号で
記しているが、実際には各節点に固有な整数値を格納す
る。第三のデータ項目ｃｏｄｅ（文字コード）には、枝
に対応する文字コードを格納する。FIG. 9 shows a data format of an address tree table for storing an address tree. In the address tree table, each record is 3
It is represented by a table with two data items. Each record corresponds to a branch of the address tree. Further, a unique integer value (id) is assigned to each node. First data item pr
tNode (parent contact) stores the id of the node on the upper side of the branch. In the second data item cldNode (child contact), the id of the lower node of the branch is stored. In the figure, the ids of the nodes are indicated by symbols "A" and "B" to correspond to FIG. 3, but an integer value unique to each node is actually stored. In the third data item code (character code), a character code corresponding to the branch is stored.

【００２８】本発明の処理手順を図１０に示す。第１の
ステップは、文字行画像から切出し仮説ネットワークを
生成する処理、すなわち文字切出し処理１００１であ
る。文字切出し処理には、例えば村瀬他「言語情報を利
用した手書き文字列からの文字切り出しと認識」（信学
論（D）vol. J69-D, No. 9）にあるような既知の方式を
用いる。第２のステップは、文字識別・探索処理１００
２である。文字識別と探索処理は、後述するように、互
いに連携しながら動作し、探索木テーブルにデータを書
き込んでいく。第３のステップは、探索木中より最も確
からしい文字列の候補を求める文字列候補選択処理１０
０３である。文字列候補選択処理では、ａｄｒＮｏｄｅ
が住所木の葉に相当するレコードを探索木テーブルから
もとめ、これらレコードのうちでｓｔｒＣｒｄの値が最
も大きいものを選択する。次に、このレコードから次々
とポインタｐｒｖをたどることにより、出発節点から葉
に至る経路を求め、経路上の文字の列を認識結果の文字
列候補とする。FIG. 10 shows the processing procedure of the present invention. The first step is a process of generating a cut-out hypothesis network from a character line image, that is, a character cut-out process 1001. For the character extraction process, for example, a known method such as that described in Murase et al., “Character Extraction and Recognition from Handwritten Character Strings Using Linguistic Information” (IEICE (D) vol. J69-D, No. 9) Used. The second step is a character identification / search processing 100
2. As will be described later, the character identification and the search process operate in cooperation with each other, and write data in the search tree table. The third step is a character string candidate selection process 10 for finding the most likely character string candidate from the search tree.
03. In the character string candidate selection process, adrNode
Finds a record corresponding to the leaf of the address tree from the search tree table, and selects a record having the largest strCrd value among these records. Next, by sequentially following the pointer prv from this record, a path from the starting node to the leaf is obtained, and a character string on the path is set as a character string candidate as a recognition result.

【００２９】文字識別・探索処理の処理手順を図１１に
示す。まずステップ１１０１において、第一のレコード
に、出発節点を登録する。次にステップ１１０２におい
て、注目している枝データへのポインタｐｃと新規枝デ
ータ書き込み先ｐｗを探索木テーブルの２レコード目に
セットする。次にループ１１０３において、文字行の左
端のパタンの候補（図６の６０３、６０４に示すように
複数個有り得る）各々を、住所木の出発節点から始まる
枝に対応する文字カテゴリ（住所木テーブルのデータ項
目「ｃｏｄｅ」に格納してある）に対して文字識別処理
を行う。文字識別がアクセプトとなった場合には、ポイ
ンタpwが示すアドレスに探索木の枝を登録する。次にル
ープ１１０４において、探索木テーブルに登録されてい
る枝の子節点の展開を繰り返し、文字識別結果がアクセ
プトの場合には探索木テーブルに新たな枝を追加する。
こうしてこれ以上探索する枝がなくなった場合、すなわ
ちポインタpc と pw の値が等しくなったらループを終
了する。次にステップ１１０５において、探索木中より
最も確からしい文字列を求め、認識結果として出力す
る。FIG. 11 shows the processing procedure of the character identification / search processing. First, in step 1101, the starting node is registered in the first record. Next, in step 1102, a pointer pc to the branch data of interest and a new branch data write destination pw are set in the second record of the search tree table. Next, in a loop 1103, each of pattern candidates at the left end of the character line (there can be a plurality of patterns as indicated by 603 and 604 in FIG. 6) is classified into the character category (address tree table) corresponding to the branch starting from the starting node of the address tree. Character identification processing is performed on the data item “code”). If the character identification is accepted, the branch of the search tree is registered at the address indicated by the pointer pw. Next, in a loop 1104, expansion of child nodes of the branches registered in the search tree table is repeated, and if the character identification result is accept, a new branch is added to the search tree table.
Thus, when there are no more branches to be searched, that is, when the values of the pointers pc and pw become equal, the loop is ended. Next, in step 1105, the most probable character string in the search tree is obtained and output as a recognition result.

【００３０】ループ１１０４では、未展開の節点から次
節点を展開する処理ループ１１０６、展開した節点を文
字列確信度でソートするステップ１１０７、予め定めら
れた定数ｓｒｃＷｄより多くの子節点が展開された場合
には上位ｓｒｃＷｄ個の節点のみを選択するステップを
実行する。本ステップにより、仮に多くの文字識別結果
がアクセプトとなった場合でも、節点の数を削減するこ
とにより、処理に要する時間の増大を抑えることが出来
る。In the loop 1104, a processing loop 1106 for expanding the next node from the unexpanded node, a step 1107 for sorting the expanded nodes by the character string certainty factor, and expanding more child nodes than a predetermined constant srcWd. In this case, the step of selecting only the upper srcWd nodes is executed. According to this step, even if many character identification results are accepted, an increase in the time required for processing can be suppressed by reducing the number of nodes.

【００３１】図１２は、本実施例におけるハードウエア
構成を示す。上に述べてきた本発明は、例えば同図のよ
うな郵便物宛名読取区分機の、住所読取りをつかさどる
認識装置１２０５に適用される。図中の太線は、郵便物
の流れを示す。１２０１は、画像入力をつかさどるスキ
ャナである。スキャナ１２０１は、住所文字行の画像を
抽出をつかさどる前処理装置１２０４に接続する。前処
理装置１２０４は、入出力用インタフェース１２０７を
介して認識装置のバス１２１０に接続する。１２０２
は、住所読取り時間を確保するために郵便物搬送路上に
設けるディレイライン（遅延用搬走路）である。１２０
３は、住所読取結果に基づき郵便物を区分するソータで
ある。ソータ１２０３は入出力用インタフェース１２０
８を介して認識装置のバス１２１０に接続する。１２０
６は、本発明に必要なプログラム、文字識別用標準パタ
ン、住所木等を格納しておくための外部記憶装置であ
る。外部記憶装置にはハードディスク等を用いる。外部
記憶装置は入出力用インタフェース１２０９を介して認
識装置のバス１２１０に接続する認識装置１２０５に
は、認識装置内部の各部分を接続するためのバス１２１
０、認識装置全体の制御および住所認識処理をつかさど
る演算処理装置１２１１、外部の装置との入出力をつか
さどる入出力用インタフェース１２０７、１２０８、１
２０９、住所認識に必要なプログラム、テーブル、辞書
などを格納するためのメモリ１２１２を備える。FIG. 12 shows a hardware configuration in this embodiment. The present invention described above is applied to, for example, a recognition device 1205 for reading an address of a mail address reading / sorting machine as shown in FIG. The thick line in the figure indicates the flow of mail. A scanner 1201 controls image input. The scanner 1201 is connected to a preprocessing device 1204 that controls an image of an address character line. The preprocessing device 1204 connects to the bus 1210 of the recognition device via the input / output interface 1207. 1202
Is a delay line (delay transport path) provided on the mail transport path to secure the address reading time. 120
Reference numeral 3 denotes a sorter for sorting mail items based on the address reading result. The sorter 1203 is an input / output interface 120
8 to the bus 1210 of the recognition device. 120
Reference numeral 6 denotes an external storage device for storing a program required for the present invention, a standard pattern for character identification, an address tree, and the like. A hard disk or the like is used as the external storage device. An external storage device is connected to a bus 1210 of the recognition device via an input / output interface 1209. A recognition device 1205 has a bus 121 for connecting each part inside the recognition device.
0, an arithmetic processing unit 1211 for controlling the entire recognition device and for address recognition processing, and input / output interfaces 1207, 1208, and 1 for inputting and outputting to and from external devices.
209, a memory 1212 for storing programs, tables, dictionaries, etc. necessary for address recognition.

【００３２】装置起動時には、まず外部記憶装置１２０
６からプログラム、標準パタン辞書、住所木がメモリ１
２１２へロードされる。ロード完了後、前処理装置１２
０４から文字行画像が認識装置１２０５へ文字行画像が
供給されると、文字行画像はメモリ１２１２に複写され
る。さらに演算処理装置１２１２は本発明で説明した方
式に従って切出し仮説ネットワーク、探索木をメモリ１
２１２上に生成しながら住所を認識し、認識結果をソー
タ１２０３へ発信する。ソータ１２０３は認識結果に応
じて郵便物を区分する。When the device is started, first, the external storage device 120
From 6, the program, standard pattern dictionary, address tree is memory 1
212. After loading is completed, the pre-processing device 12
When the character line image is supplied from 04 to the recognition device 1205, the character line image is copied to the memory 1212. Further, the arithmetic processing unit 1212 stores the cut-out hypothesis network and the search tree in the memory 1 according to the method described in the present invention.
The address is recognized while being generated on the 212, and the recognition result is transmitted to the sorter 1203. Sorter 1203 sorts mail items according to the recognition result.

【００３３】図１０および図１１で説明した処理手順
は、入力の文字行が正しく切り出されていることを前提
としている。このため、図１３に示すような例には対応
することができない。図１３（Ａ）は、住所先頭の１文
字が完全に欠落してしまった例、図１３（Ｂ）は、行頭
にノイズが混入した例、図１３（Ｃ）は行の頭の文字の
一部が欠けてしまった例である。以下に、上述の文字識
別・探索処理方式を拡張し、行頭の欠けやノイズの混入
に対応する方式を説明する。The processing procedure described with reference to FIGS. 10 and 11 is based on the premise that the input character line is correctly cut out. For this reason, it cannot respond to the example as shown in FIG. 13A shows an example in which one character at the beginning of the address is completely missing, FIG. 13B shows an example in which noise is mixed at the beginning of the line, and FIG. 13C shows one example of the character at the beginning of the line. This is an example in which a part is missing. In the following, a description will be given of a method for expanding the above-described character identification / search processing method and coping with the lack of a line head and the inclusion of noise.

【００３４】図１４は、行頭の欠けに対応する処理手順
である。ステップ１４０１は、図１０で説明したフロー
と同様の文字切出し処理である。ステップ１４０２は、
図１１で説明したような、文字行が正しく切り出されて
いることを前提とした読取り処理である。もし、ステッ
プ１４０２でアクセプトとなる認識結果が得られれば、
図１０で説明したフローと同じく、文字列候補処理１４
０５を実行する。もし、アクセプトとなる候補が得られ
なかった場合には、ステップ１４０３を実行する。FIG. 14 shows a processing procedure corresponding to the lack of a head of a line. Step 1401 is a character cutout process similar to the flow described in FIG. Step 1402 is:
This is a reading process assuming that a character line is correctly cut out as described with reference to FIG. If an acceptable recognition result is obtained in step 1402,
As in the flow described with reference to FIG.
Execute 05. If no candidate to be accepted is obtained, step 1403 is executed.

【００３５】ステップ１４０３は、行頭の何文字かがそ
っくり欠落した場合に対応する処理である。この場合、
住所木の根に対応するパターンは文字行中からは欠落し
ていることを前提とする必要があり、行頭が住所木のど
の節点に対応しているかを調べてから文字識別・探索を
開始する必要がある。まず、切り出し仮説ネットワーク
中から行頭となりうるパターンを求める。ここで、行頭
となりうるパターン」とは、ネットワークの出発節点に
接しているパターンのことである。たとえば、図１３
（Ｃ）の様な入力画像に対しては、図１５に示すような
文字切り出し仮説ネットワークが生成され、出発節点は
１５０１であり、「行頭となりうるパターン」とはパタ
ーン１５０３と１５０４である。次に、全ての行頭とな
りうるパターンに対して、住所中にありうる全ての文字
カテゴリで識別する。もし、アクセプトとなる文字カテ
ゴリがあれば、当該カテゴリに対応する節点を住所木中
から求める。そして、当該節点を根とする住所木の部分
木を住所木とみなして、図１１で説明したの同様の処理
手順で文字識別・探索を行う。もし、ステップ１４０３
でアクセプトとなる認識結果が得られれば、図１０で説
明したフローと同じく、文字列候補処理１４０５を実行
する。もし、アクセプトとなる候補が得られなかった場
合には、ステップ１４０４を実行する。Step 1403 is a process corresponding to a case where some characters at the beginning of a line are completely missing. in this case,
It is necessary to assume that the pattern corresponding to the root of the address tree is missing from the character line, and it is necessary to start character identification and search after checking which node of the address tree corresponds to the beginning of the line. is there. First, a pattern that can be the beginning of a line is obtained from the cut-out hypothesis network. Here, the “pattern that can be the beginning of a line” is a pattern that is in contact with the departure node of the network. For example, FIG.
For the input image as in (C), a character segmentation hypothesis network as shown in FIG. 15 is generated, the starting node is 1501, and the "patterns that can be the beginning of a line" are patterns 1503 and 1504. Next, all possible patterns are identified by all possible character categories in the address. If there is a character category to be accepted, a node corresponding to the category is obtained from the address tree. Then, a partial tree of the address tree having the node as a root is regarded as an address tree, and character identification and search are performed in the same processing procedure as described with reference to FIG. If step 1403
If a recognition result that is acceptable in is obtained, character string candidate processing 1405 is executed as in the flow described with reference to FIG. If no candidate to be accepted is obtained, step 1404 is executed.

【００３６】ステップ１４０４は、図１３（Ｂ）のよう
に行頭にノイズが混入した場合、および図１３（Ｃ）の
ように行頭の文字の１部が欠落した場合に対応する処理
である。この場合、１文字目のパタンを文字識別した結
果がリジェクトとなり、ステップ１４０３ではアクセプ
トとなる候補が挙がらないことが予想される。このた
め、２文字目となるパターンを手がかりに探索開始節点
を求める必要がある。まず、行の２番目の文字となりう
るパターン、すなわち切出し仮説ネットワークの出発節
点から１つの辺を介して到達しうるパターン（図１５の
例では、パターン１５０５、１５０６と１５０７）を、
住所中にありうる全ての文字カテゴリで識別する。も
し、アクセプトとなる文字カテゴリがあれば、当該カテ
ゴリに対応する節点を住所木中から求める。そして、当
該節点を根とする住所木の部分木を住所木とみなして、
図１１で説明したの同様の処理手順で文字識別・探索を
行う。Step 1404 is a process corresponding to a case where noise is mixed at the head of the line as shown in FIG. 13B and a case where a part of the character at the head of the line is missing as shown in FIG. 13C. In this case, the result of character identification of the pattern of the first character is rejected, and it is expected that no candidate to be accepted will be given in step 1403. For this reason, it is necessary to find the search start node based on the pattern serving as the second character. First, a pattern that can be the second character of the line, that is, a pattern that can be reached via one edge from the starting node of the cut-out hypothesis network (in the example of FIG. 15, patterns 1505, 1506, and 1507) is
Identify by all possible character categories in the address. If there is a character category to be accepted, a node corresponding to the category is obtained from the address tree. Then, a partial tree of the address tree having the node as a root is regarded as an address tree,
Character identification and search are performed in the same processing procedure as described with reference to FIG.

【００３７】本発明で用いる文字識別処理の原理の一例
を図１６に示す。図１６は文字識別に用いる特徴のベク
トル空間を模式的に示している。特徴量には、例えば森
「パターン認識」（ISBN4 - 88552 - 075 - 4 C3055、
電子情報通信学会）pp. 32 -109 に記載のあるような既
知のものを用いることができる。予め、各文字カテゴリ
毎に複数の文字パターンを収集しておき、これらの特徴
量を標準パタンとして標準パタン辞書４０６に格納して
おく。文字識別処理４０２において、入力パタンがある
文字に類似しているか否か判定する場合には、Parzen D
ensity Estimation （"Statistical Pattern Recogniti
on," ISBN 0-12-269851-7）により、当該文字カテゴリ
における当該文字パタンの条件つき確率密度分布が一定
値以上か否かで判定する。すなわち、入力パタンの特徴
ベクトルｘから予め設定してある半径ｒ以内に当該文字
カテゴリの標準パタンがいくつあるかを調べ、一定値数
以上であればアクセプト、下回ればリジェクトとする。FIG. 16 shows an example of the principle of the character identification processing used in the present invention. FIG. 16 schematically shows a vector space of a feature used for character identification. Features include, for example, forest “pattern recognition” (ISBN4-88552-075-4 C3055,
A known device as described in IEICE (pp. 32-109) can be used. A plurality of character patterns are collected in advance for each character category, and these feature amounts are stored in the standard pattern dictionary 406 as standard patterns. In the character identification process 402, if it is determined whether or not the input pattern is similar to a certain character, Parzen D
ensity Estimation ("Statistical Pattern Recogniti
on, "ISBN 0-12-269851-7), it is determined whether or not the conditional probability density distribution of the character pattern in the character category is equal to or greater than a certain value. It is checked how many standard patterns of the character category exist within a given radius r.

【００３８】[0038]

【発明の効果】住所の文字列の知識を用い、各パタンと
比較する標準パタンの文字カテゴリを絞り込み、住所認
識の際の処理量の削減を図ることが可能となる。また、
文字切出しに多重仮説検定法を用いる場合において、文
字識別を行う対象のパタンの数を削減することが可能と
なる。According to the present invention, the character category of the standard pattern to be compared with each pattern can be narrowed down using the knowledge of the character string of the address, and the processing amount at the time of address recognition can be reduced. Also,
When multiple hypothesis testing is used for character extraction, the number of patterns to be subjected to character identification can be reduced.

[Brief description of the drawings]

【図１】読取り対象の住所文字列の例を示す。FIG. 1 shows an example of an address character string to be read.

【図２】読み取り対象の住所文字列を示す。FIG. 2 shows an address character string to be read.

【図３】住所文字列の木構造表現の例を示す。FIG. 3 shows an example of a tree structure expression of an address character string.

【図４】本発明の一実施例のデータフローを示す。FIG. 4 shows a data flow of one embodiment of the present invention.

【図５】本発明において生成される探索木の生成過程の
例を示す。FIG. 5 shows an example of a search tree generation process generated in the present invention.

【図６】切出し仮説ネットワークの例を示す。FIG. 6 shows an example of a cut-out hypothesis network.

【図７】多重仮説検定方を用いた際に本発明において生
成される探索木の生成過程の例を示す。FIG. 7 shows an example of a search tree generation process generated in the present invention when the multiple hypothesis test method is used.

【図８】探索木テーブルの形式を示す。FIG. 8 shows a format of a search tree table.

【図９】住所木テーブルの形式を示す。FIG. 9 shows a format of an address tree table.

【図１０】本発明の一実施例の処理フローを示す。FIG. 10 shows a processing flow of an embodiment of the present invention.

【図１１】文字識別・探索の処理フローを示す。FIG. 11 shows a processing flow of character identification / search.

【図１２】本発明の構成を示す。FIG. 12 shows a configuration of the present invention.

【図１３】認識が困難な入力画像の例を示す。FIG. 13 shows an example of an input image that is difficult to recognize.

【図１４】行頭の欠けおよびノイズ混入に対応する処理
フローを示す。FIG. 14 shows a processing flow corresponding to a missing line at the beginning of a line and noise contamination.

【図１５】行頭が欠けた場合の切出し仮説ネットワーク
の例を示す。FIG. 15 shows an example of a cut-out hypothesis network when a head of a line is missing.

【図１６】文字識別の原理の一例を示す。FIG. 16 shows an example of the principle of character identification.

[Explanation of symbols]

４０１……文字切出し、４０２……文字識別処理、４０
３……標準パターン選択処理、４０４……探索処理、４
０５……文字列候補選択処理、４０７……探索木、４０
８……住所木。401 ... character extraction, 402 ... character identification processing, 40
3 ... standard pattern selection processing, 404 ... search processing, 4
05 ... character string candidate selection processing, 407 ... search tree, 40
8 ... Address tree.

───────────────────────────────────────────────────── フロントページの続き (72)発明者酒匂裕東京都国分寺市東恋ケ窪一丁目280番地株式会社日立製作所中央研究所内 (72)発明者藤澤浩道東京都国分寺市東恋ケ窪一丁目280番地株式会社日立製作所中央研究所内 ──────────────────────────────────────────────────続き Continuing from the front page (72) Inventor Hiroshi Sakami 1-280 Higashi Koikekubo, Kokubunji-shi, Tokyo Inside the Central Research Laboratory, Hitachi, Ltd. Central Research Laboratory

Claims

[Claims]

An address reading method for converting image information written on a document into an electric signal, inputting the converted signal, and reading an address described in the image, wherein a character having an address described on the image is provided. Character line extraction processing to extract lines, pattern extraction processing to extract character patterns from character lines, and reference to an address tree that stores the concatenation of characters in an address in a tree structure A character category limiting process for limiting a character category of a character to be identified in the pattern, a character identifying process for identifying a character of the extracted pattern, and an address for searching a tree structure stored in the address tree storage unit according to the character identification result. An address reading method, comprising a tree search process.

2. The method according to claim 1, wherein when the pattern extraction processing cannot uniquely determine the character extraction method, a plurality of extraction method hypotheses are output, and the address tree search processing controls the search in accordance with the plurality of character extraction hypotheses. 2. The address reading method according to claim 1, wherein: