JP5217127B2

JP5217127B2 - Collective place name recognition program, collective place name recognition apparatus, and collective place name recognition method

Info

Publication number: JP5217127B2
Application number: JP2006207066A
Authority: JP
Inventors: ▲升▼竹黄; 俊孫; 悦伸堀田; 聡直井
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2005-07-29
Filing date: 2006-07-28
Publication date: 2013-06-19
Anticipated expiration: 2026-07-28
Also published as: CN100555308C; CN1904906A; JP2007042097A

Description

本発明は、一括地名認識プログラム、一括地名認識装置および一括地名認識方法に関し、特に、枠のない領域内に手書きされた文字から住所を読み取って認識する一括地名認識プログラム、一括地名認識装置および一括地名認識方法に関する。 The present invention is Bulk place name recognition program relates collectively place name recognition apparatus and collectively place name recognition method, in particular, Bulk place name recognition program that recognizes reads the address from the handwritten characters without frame region, collectively place name recognition The present invention relates to an apparatus and a batch place name recognition method.

従来、二つの手書き住所認識方法が提案されている。 Conventionally, two handwritten address recognition methods have been proposed.

第１の従来方法においては、まず、入力された住所画像から複数の単一文字領域（すなわち、文字を一つだけ含む領域）を抽出する。次に、当該複数の単一文字領域からキー文字を抽出する（手書き住所認識においては、キー文字とは、省、区、州、市、町、郡、丁、村等の、行政区域を表すことができる一文字を指す。）。そして、連続したキー文字二つで区切られる地名領域を検出した後、地名領域内の文字を一つ一つ切り出し、切り出した文字を一つずつ認識する。これによって、地名領域内の地名が認識される。 In the first conventional method, first, a plurality of single character areas (that is, areas including only one character) are extracted from the input address image. Next, key characters are extracted from the plurality of single character areas (in handwritten address recognition, key characters represent administrative districts such as provinces, districts, states, cities, towns, counties, dings, villages, etc. Refers to a single character that can be.) And after detecting the place name area | region divided by two continuous key characters, the character in the place name area is cut out one by one, and the cut out character is recognized one by one. Thereby, the place name in the place name area is recognized.

しかし、上記第１の従来方法では、手書きされた住所の文字を一つ一つ分離する際、しばしばエラーが発生するという問題があった。特に、枠のない領域内に、文字が互いに接触した状態で住所が書かれている場合、この方法では認識ができない可能性があった。手書き文字は形が整えられておらず、また、ばらつきも大きいため、かかる不都合がしばしば発生していた。 However, the first conventional method has a problem that an error often occurs when handwritten address characters are separated one by one. In particular, when an address is written in a region without a frame with characters in contact with each other, there is a possibility that this method cannot be recognized. Such inconveniences often occur because handwritten characters are not shaped and have large variations.

第２の従来方法においても、まず切り出し文字を抽出し、連続したキー文字二つで区切られる地名領域を検出する。第２の従来方法が第１の従来方法と異なるのは、地名領域内の地名を一括認識する点である。 Also in the second conventional method, a cut-out character is first extracted, and a place name area delimited by two consecutive key characters is detected. The second conventional method is different from the first conventional method in that the place names in the place name area are collectively recognized.

図１は、第２の従来方法の処理を詳細に示すブロック図である。図１に示すように、文字切り出し手段６０１は、入力された住所画像からすべてのあり得る単一文字領域を切り出す。キー文字抽出手段６０２は、キー文字辞書６０３を参照して、あり得る単一文字領域の各々を認識し、当該単一文字領域内の文字がキー文字（例えば、「省」、「市」、
［文字１］

、「路」など）であるか否かを決定する。当該単一文字領域の文字に対する認識候補文字（すなわち、予備的にその領域の文字である可能性があると判断された文字）として、複数のキー文字があった場合、照合距離が最小のキー文字のみを選択して当該単一文字領域を認識する。なお、照合距離とは、入力文字の画像から文字の特徴を抽出した特徴ベクトル（数値列）とあらかじめ特徴辞書に持つ各文字の特徴ベクトルとを比較して得られる距離値を言う。完全一致すれば０になるので、小さいほど両者が近い。照合距離が小さいほど、キー文字と当該文字領域内の画像との類似性が高い。 FIG. 1 is a block diagram showing in detail the processing of the second conventional method. As shown in FIG. 1, the character cutout unit 601 cuts out all possible single character regions from the input address image. The key character extraction unit 602 refers to the key character dictionary 603 to recognize each possible single character area, and the characters in the single character area are key characters (for example, “Ministry”, “City”,
[Character 1]

, “Road”, etc.). If there are multiple key characters as recognition candidate characters for the characters in the single character region (that is, characters that are preliminarily determined to be characters in the region), the key character with the smallest matching distance Select only to recognize the single character area. The collation distance is a distance value obtained by comparing a feature vector (numerical string) obtained by extracting a character feature from an input character image with a feature vector of each character previously stored in the feature dictionary. Since it becomes 0 if it completely coincides, the smaller the closer, the closer. The smaller the collation distance, the higher the similarity between the key character and the image in the character area.

キー文字組決定手段６０４は、住所階層構造知識部６０５に基づいて、すべての可能性のあるキー文字候補をキー文字パスとして組み合わせる。各キー文字パスにおいては、前にあるキー文字は後ろにあるどのキー文字よりも法的に上位にある行政単位である、という要件が満たされねばならない。例えば、
［文字２］

はキー文字パスである。他方、
［文字１］

は「市」の上位行政単位ではなく、逆に、行政的な意味では「市」が
［文字１］

の上位にあるため、
［文字３］

はキー文字パスではない。地名語領域候補切り出し手段６０６は、キー文字パスにおける二つのキー文字によって区切られる地名語領域を抽出する。地名語一括認識手段６０７は、地名語認識辞書６０８を参照して、地名語を一括認識する。一文字認識手段６０９は、まず、一文字認識辞書６１０を参照して、地名でもキー文字でもない部分を入力画像から抽出する。例えば、
［文字４］

という住所からは、建物の名称と建物内での番号、つまり
［文字５］

を抽出する。そして、抽出した部分から複数の文字を切り出し、切り出した文字一つ一つを認識する。住所決定手段６１１は、住所知識部６１２内に入力された住所知識を使用し、認識結果と実際の住所とを比較して両者が一致するか否かを調べ、住所の全照合距離が所定の閾値よりも大きいか否かを調べる。所定の閾値よりも大きい場合、この住所の認識結果は信頼できないと判断され、システムはこの認識をリジェクトする。所定の閾値以下の場合、実際の住所と一致した認識結果が最終的に出力される。 Based on the address hierarchy structure knowledge unit 605, the key character set determination unit 604 combines all possible key character candidates as key character paths. In each key character path, the requirement that the preceding key character is an administrative unit that is legally higher than any following key character must be met. For example,
[Character 2]

Is the key character path. On the other hand
[Character 1]

Is not a higher-level administrative unit of "city", but conversely, in the administrative sense, "city" is [character 1]

Because it is at the top of
[Character 3]

Is not a key character path. The place name area candidate cutout unit 606 extracts a place name area divided by two key characters in the key character path. The place name word collective recognition unit 607 refers to the place name word recognition dictionary 608 and collectively recognizes place name words. First, the single character recognition unit 609 refers to the single character recognition dictionary 610 and extracts a portion that is neither a place name nor a key character from the input image. For example,
[Character 4]

From the address, the name of the building and the number in the building, that is, [Character 5]

To extract. Then, a plurality of characters are cut out from the extracted portion, and the cut out characters are recognized one by one. The address determination unit 611 uses the address knowledge input in the address knowledge unit 612 and compares the recognition result with the actual address to check whether or not they match, and the total collation distance of the address is a predetermined value. It is checked whether it is larger than the threshold value. If it is greater than the predetermined threshold, the address recognition result is determined to be unreliable and the system rejects this recognition. When the value is equal to or less than the predetermined threshold, a recognition result that matches the actual address is finally output.

図２は、地名語一括認識処理を示す。第１レベル地名照合処理１０５において、まず、第１のキー文字により区切られた地名領域を切り出し、地名語認識辞書６０８を使用して切り出された地名領域の地名を一括認識する。次に、第２レベル地名照合処理１０６で、第１のキー文字および（もし第２のキー文字があれば）第２のキー文字によって区切られた地名領域を切り出し、第１レベルの地名照合結果と図１に示す地名語認識辞書６０８とを参照して切り出された地名領域を認識する。例えば、実際に入力された住所が
［文字６］

であれば、キー文字パスは、「市−区−路」である。この場合、第１のキー文字「市」によって区切られる第１レベル地名領域は「北京」と認識される。次に、第１および第２のキー文字「市−区」によって区切られる地名領域が抽出される。そして、抽出された地名領域全体と、地名語認識辞書６０８中に記憶されている北京市内の全ての「区」とを照合することで地名領域が認識される。さらに、第３レベル地名照合処理１０７において、まず、（もし第２および第３のキー文字があれば）第２および第３のキー文字によって区切られる地名領域を切り出す。次に、第２レベルの地名照合結果と地名語認識辞書６０８とを参照して、切り出された地名領域を認識する。そして、最後のレベルになるまで同様の処理を繰り返す。第Ｎレベルの地名照合処理１０８においては、（もし第Ｎ−１のキー文字および第Ｎのキー文字があれば）第Ｎ−１のキー文字と第Ｎのキー文字とによって区切られる地名領域を抽出し、第Ｎ−１レベルの地名照合結果と地名語認識辞書６０８とを参照して、抽出した地名領域を認識する。 FIG. 2 shows a place word collective recognition process. In the first level place name matching process 105, first, place name areas delimited by the first key characters are cut out, and place names of the place name areas cut out using the place name word recognition dictionary 608 are collectively recognized. Next, in the second level place name matching process 106, the place name area delimited by the first key character and the second key character (if there is a second key character) is cut out, and the first level place name matching result is obtained. And the place name area dictionary 608 shown in FIG. For example, if the address actually entered is [Character 6]

If so, the key character path is “city-ku-road”. In this case, the first level place name area delimited by the first key character “city” is recognized as “Beijing”. Next, the place name area delimited by the first and second key characters “city-ku” is extracted. Then, the place name area is recognized by collating the entire extracted place name area with all “gu” in Beijing city stored in the place name word recognition dictionary 608. Further, in the third level place name matching process 107, first, the place name area delimited by the second and third key characters is cut out (if there are second and third key characters). Next, the extracted place name region is recognized by referring to the place name collation result of the second level and the place name word recognition dictionary 608. The same processing is repeated until the last level is reached. In the Nth level place name matching process 108, the place name area divided by the N-1th key character and the Nth key character (if there is an N-1th key character and an Nth key character) The extracted place name region is recognized by referring to the place name collation result of the (N-1) th level and the place name word recognition dictionary 608.

米国特許第６５３５６１９（Ｂ１）号明細書US Pat. No. 6,535,619 (B1) Bob Fisher, Simon Perkins, Ashley Walker and Erik Wolfart, "Hypermedia Image Processing Reference",[online], Department of Artificial Intelligence, University of Edinburgh, UK, インターネット＜URL: http://www.cee.hw.ac.uk/hipr/html/label.html＞Bob Fisher, Simon Perkins, Ashley Walker and Erik Wolfart, "Hypermedia Image Processing Reference", [online], Department of Artificial Intelligence, University of Edinburgh, UK, Internet <URL: http://www.cee.hw.ac. uk / hipr / html / label.html> Qiuqi RUAN, "Mathematical ImageProcessing", the Publishing House of the Electronics Industry, page 325Qiuqi RUAN, "Mathematical ImageProcessing", the Publishing House of the Electronics Industry, page 325 E. Ukkonen, "On approximate atring matching", Proc. Int. Conf. on Foundations ofComp. Theory, Springer-Verlag, LNCS 158 p 487-495,1983E. Ukkonen, "On approximate atring matching", Proc. Int. Conf. On Foundations of Comp. Theory, Springer-Verlag, LNCS 158 p 487-495,1983 S.B.Needleman and C.D.Wunsch,"A general method applicable to the search for similarities in the aminoacid sequence of two proteins", Journal Molec.Biol. 48 p 443-453, 1970S.B. Needleman and C.D.Wunsch, "A general method applicable to the search for similarities in the aminoacid sequence of two proteins", Journal Molec. Biol. 48 p 443-453, 1970 Christopher J.C. Burges, "A Tutorialon Support Vector Machines for Pattern Recognition, Data Mining and KnowledgeDiscovery", volume 2, No 2, pages 121-167, 1998.Christopher J.C. Burges, "A Tutorialon Support Vector Machines for Pattern Recognition, Data Mining and KnowledgeDiscovery", volume 2, No 2, pages 121-167, 1998.

上に説明した従来方法の問題点の一つは、正しいキー文字が全く抽出されなかった場合には認識も失敗するため、システムの精度が低下するという点である。具体的には、キー文字抽出においては、上記従来方法には二つの問題点が存在する。まず、（１）図１の一文字認識手段６０９は、全ての単一文字領域と一文字認識辞書中の全ての文字（中国語の単語に関する限り、使用される漢字は数千個存在する）とを照合するため、キー文字認識において不可避的にエラーが発生し、それによってシステムの認識精度が低下する。次に、（２）キー文字抽出手段６０２は、認識候補中に出現した照合距離が最小の切り出し文字をキー文字として認識するにすぎないため、一定の状況下ではキー文字認識において不可避的にエラーが発生する。例えば、ある単一文字領域内の画像が実際には「市」に対応するものの、照合距離に従って単語を並べると認識候補列が
［文字７］

となるような状況下では、上記従来方法では、「市」ではなく、
［文字８］

の方が当該単一文字領域のキー文字であると認識される可能性が高いため、システムの精度を低下させる。 One of the problems with the conventional method described above is that the accuracy of the system is lowered because recognition fails if no correct key characters are extracted. Specifically, the key character extraction has two problems with the conventional method. First, (1) the one-character recognition means 609 in FIG. 1 collates all single character areas with all characters in the one-character recognition dictionary (as far as Chinese words are concerned, there are thousands of kanji used). Therefore, an error inevitably occurs in key character recognition, thereby reducing the recognition accuracy of the system. Next, (2) the key character extracting means 602 only recognizes a cut-out character that appears in the recognition candidate with the smallest collation distance as a key character. Will occur. For example, an image in a single character area actually corresponds to “city”, but when words are arranged according to a collation distance, a recognition candidate string is [character 7].

Under such circumstances, the above conventional method is not “city”,
[Character 8]

Since there is a high possibility that this is recognized as a key character of the single character area, the accuracy of the system is lowered.

さらに、従来方法には次のような問題点もある。従来方法のような順次処理構造は、先行するいずれかの地名認識が失敗している場合には機能しない。より具体的には、後続のモジュールは先行するモジュールに大幅に依存する。先行するレベルでの地名認識が失敗すると、それに続く地名認識は常に誤りとなる。例えば、第１レベル地名照合１０５で正しい結果が出力されなかった場合、仮に、
［文字６］

という住所文字列が入力されたときに、「市」で区切られた第１レベルの地名が誤って
［文字９］

と照合されてしまうと、第１のキー文字「市」と第２のキー文字「区」とで区切られる実際の地名
［文字１０］

は、誤った第１レベルの地名
［文字９］

に含まれる行政単位ではないため、第２レベルの照合結果は常に誤りとなってしまう。さらに、住所が手書き入力された場合、地名の字体は様々に変形し、大きさもまちまちであるため、認識された単語が実際の住所と常に一致するとは限らない。このため、システムの精度が著しく低下してしまう。 Further, the conventional method has the following problems. The sequential processing structure as in the conventional method does not work when any preceding place name recognition fails. More specifically, subsequent modules are highly dependent on previous modules. If place name recognition at the preceding level fails, subsequent place name recognition will always be an error. For example, if the first level place name matching 105 does not output a correct result,
[Character 6]

When the address string is entered, the first level place name separated by "city" is mistakenly [character 9]

The actual place name delimited by the first key character “city” and the second key character “ku” [character 10].

Is the wrong first-level place name [character 9]

Because it is not an administrative unit included in, the second level collation result is always incorrect. Furthermore, when the address is input by handwriting, the place name fonts are variously deformed and the sizes thereof vary, so that the recognized word does not always match the actual address. For this reason, the accuracy of the system is significantly reduced.

さらに、上記従来方法には次のような問題点もある。手書き入力住所の質がまちまちであるという特異な性質のために、固定された閾値を用いて認識結果をリジェクトすると、住所の読み取りエラーが発生する。具体的には、閾値として小さな値を設定すると、上記従来方法では、質が低い住所に対する認識結果の大部分が拒絶されてしまう一方、閾値として大きな値を設定すると、質が高い住所に対する認識結果の大部分が承認されてしまう。このためシステムの信頼性が著しく損なわれる。 Further, the conventional method has the following problems. Due to the unique property that the quality of the handwritten input address varies, if the recognition result is rejected using a fixed threshold value, an address reading error occurs. Specifically, if a small value is set as a threshold value, the above-described conventional method rejects most of the recognition results for low-quality addresses, whereas if a large value is set as a threshold value, recognition results for high-quality addresses are rejected. Most of them are approved. For this reason, the reliability of the system is significantly impaired.

本発明の第１の目的は、効率よく住所認識を行うことができる住所認識装置を提供することである。 A first object of the present invention is to provide an address recognition apparatus that can perform address recognition efficiently.

本発明の第２の目的は、高い精度で住所認識を行うことができる住所認識装置を提供することである。 The second object of the present invention is to provide an address recognition apparatus capable of performing address recognition with high accuracy.

本発明の第３の目的は、信頼性のある住所認識を行うことができる住所認識装置を提供することである。 A third object of the present invention is to provide an address recognition device that can perform reliable address recognition.

上記の目的を達成するため、本発明は、複数の単一文字領域からのキー文字の抽出を複数の単一文字領域の各々について実行するキー文字抽出プログラムであって、前記単一文字領域に対応する画像から特徴ベクトルを抽出する特徴ベクトル抽出手順と、前記特徴ベクトル抽出手順により抽出された特徴ベクトルと、あり得るすべてのキー文字を記憶したキー文字辞書におけるキー文字の特徴ベクトルとを照合し、あり得るすべてのキー文字候補を検索する照合手順と、前記照合手順により検索された１つ以上のキー文字候補をそれぞれ有する複数の単一文字領域を認識する場合に、正当なキー文字パスを決定するキー文字パス決定手順と、をコンピュータに実行させることを特徴とする。 In order to achieve the above object, the present invention provides a key character extraction program that executes extraction of key characters from a plurality of single character regions for each of the plurality of single character regions, and an image corresponding to the single character region. A feature vector extraction procedure for extracting a feature vector from the feature vector, a feature vector extracted by the feature vector extraction procedure, and a feature vector of a key character in a key character dictionary storing all possible key characters; A collation procedure for retrieving all key character candidates and a key character for determining a valid key character path when recognizing a plurality of single character regions each having one or more key character candidates retrieved by the collation procedure And causing the computer to execute a path determination procedure.

また、本発明は、複数の単一文字領域からキー文字を抽出するキー文字抽出装置であって、前記単一文字領域の各々に対応する画像から特徴ベクトルを抽出する特徴ベクトル抽出手段と、あり得るすべてのキー文字のみを記憶するキー文字辞書部と、前記特徴ベクトル抽出手段により抽出された特徴ベクトルと、前記キー文字辞書部に記憶されたあり得るキー文字の特徴ベクトルとを照合し、あり得るすべてのキー文字候補を検索する照合手段と、前記照合手段により検索された１つ以上のキー文字候補をそれぞれ有する複数の単一文字領域を認識する場合に、正当なキー文字パスを決定するキー文字パス決定手段と、を備えたことを特徴とする。 The present invention is also a key character extraction device for extracting a key character from a plurality of single character areas, and feature vector extraction means for extracting a feature vector from an image corresponding to each of the single character areas, and all possible The key character dictionary unit that stores only the key characters of the key, the feature vector extracted by the feature vector extraction means, and the feature vector of the possible key characters stored in the key character dictionary unit, and all possible A key character path for determining a valid key character path when recognizing a plurality of single character areas each having one or more key character candidates searched by the matching means And a determining means.

また、本発明は、複数の単一文字領域からのキー文字の抽出を複数の単一文字領域の各々について実行するキー文字抽出方法であって、前記単一文字領域に対応する画像から特徴ベクトルを抽出する特徴ベクトル抽出工程と、前記特徴ベクトル抽出工程により抽出された特徴ベクトルと、あり得るすべてのキー文字を記憶したキー文字辞書におけるキー文字の特徴ベクトルとを照合し、あり得るすべてのキー文字候補を検索する照合工程と、前記照合工程により検索された１つ以上のキー文字候補をそれぞれ有する複数の単一文字領域を認識する場合に、正当なキー文字パスを決定するキー文字パス決定工程と、を含んだことを特徴とする。 The present invention is also a key character extraction method for extracting key characters from a plurality of single character areas for each of the plurality of single character areas, and extracting a feature vector from an image corresponding to the single character area. The feature vector extraction step, the feature vector extracted by the feature vector extraction step, and the feature vector of the key character in the key character dictionary storing all possible key characters are collated, and all possible key character candidates are obtained. A matching step for searching, and a key character path determining step for determining a valid key character path when recognizing a plurality of single character regions each having one or more key character candidates searched by the matching step. It is characterized by including.

また、本発明は、ある行政レベルの地名を含む地名画像の地名を認識する一括地名認識手順をコンピュータに実行させる一括地名認識プログラムであって、前記一括地名認識手順は、前記地名画像から特徴ベクトルを抽出する抽出手順と、前記抽出手順により抽出された特徴ベクトルと、地名辞書に記憶された地名の特徴ベクトルとを照合し、あり得るすべての地名候補を検索する照合手順と、をコンピュータに実行させ、前記地名辞書に記憶された地名のうち、前記抽出手順により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応する地名をあり得る地名候補として決定し、さらに前記地名画像の単一文字領域の各々に含まれる文字の文字候補を認識する個別文字認識手順と、前記個別文字認識手順により得られた認識結果に基づいて、前記一括地名認識手順により決定された地名候補のフィルタリングをおこなう地名検証手順と、をコンピュータに実行させることを特徴とする。 The present invention is also a collective place name recognition program for causing a computer to execute a collective place name recognition procedure for recognizing place names of place name images including place names at a certain administrative level, wherein the collective place name recognition procedure includes a feature vector from the place name image. The computer executes an extraction procedure for extracting a name, and a matching procedure for searching for all possible place name candidates by matching the feature vector extracted by the extraction procedure with the feature vector of the place name stored in the place name dictionary. A place name corresponding to a feature vector whose matching distance between the place name stored in the place name dictionary and the feature vector extracted by the extraction procedure is smaller than a predetermined threshold is determined as a possible place name candidate, Furthermore, an individual character recognition procedure for recognizing character candidates for characters included in each single character region of the place name image, and the individual character recognition procedure Based on more obtained recognition result, characterized in that to perform said a collective place name recognition procedure place name verification procedure for filtering the place name candidate determined by, to the computer.

前記一括地名認識手順は、個別文字認識手順と、地名検証手順とを含むことが望ましい。 The collective place name recognition procedure preferably includes an individual character recognition procedure and a place name verification procedure.

また、本発明は、ある行政レベルの地名を含む地名画像の地名を認識する一括地名認識工程を含む一括地名認識方法であって、前記一括地名認識工程は、前記地名画像から特徴ベクトルを抽出する抽出工程と、前記抽出工程により抽出された特徴ベクトルと、地名辞書に記憶された地名の特徴ベクトルとを照合し、あり得るすべての地名候補を検索する照合工程と、を含み、前記地名辞書に記憶された地名のうち、前記抽出工程により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応する地名をあり得る地名候補として決定し、さらに前記地名画像の単一文字領域の各々に含まれる文字の文字候補を認識する個別文字認識工程と、前記個別文字認識工程により得られた認識結果に基づいて、前記一括地名認識工程により決定された地名候補のフィルタリングをおこなう地名検証工程と、を含んだことを特徴とする。 The present invention is also a collective place name recognition method including a collective place name recognition process for recognizing place names of place name images including place names at a certain administrative level, wherein the collective place name recognition step extracts a feature vector from the place name image. An extraction step; a collation step of collating the feature vector extracted in the extraction step with the feature vector of the place name stored in the place name dictionary and searching for all possible place name candidates; Among the stored place names, a place name corresponding to a feature vector whose collation distance with the feature vector extracted by the extraction step is smaller than a predetermined threshold is determined as a possible place name candidate. An individual character recognition step for recognizing character candidates for characters included in each character region, and the batch location based on the recognition result obtained by the individual character recognition step. A place name verification step of performing filtering of place name candidates determined by the recognition process, characterized in that it contains.

前記一括地名認識方法は、個別文字認識工程と、地名検証工程とを含むことが望ましい。 The collective place name recognition method preferably includes an individual character recognition step and a place name verification step.

また、本発明は、一括地名認識装置であって、入力された地名画像から特徴ベクトルを抽出する抽出手段と、地名を記憶する地名辞書部と、前記地名辞書部に記憶された地名の特徴ベクトルと前記抽出手段により抽出された特徴ベクトルとを照合し、あり得るすべての地名候補および照合距離を求め、地名辞書に記憶された地名のうち、前記抽出手段により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応する地名をあり得る地名候補として決定する照合手段と、降順に並んだ行政レベルの地名を含む複数の地名画像が入力され、当該地名画像の認識により行政レベルの地名を降順に含む複数の住所パスが形成される場合に、複数の住所パス候補の評価および決定をおこなう照合評価手段と、をさらに備えたことを特徴とする。 Further, the present invention is a collective place name recognition device, wherein an extraction means for extracting a feature vector from an inputted place name image, a place name dictionary unit for storing a place name, and a place name feature vector stored in the place name dictionary unit Are compared with the feature vector extracted by the extraction means, and all possible place name candidates and collation distances are obtained, and among the place names stored in the place name dictionary, between the feature vectors extracted by the extraction means A collation means for determining a place name candidate corresponding to a feature vector whose collation distance is smaller than a predetermined threshold and a plurality of place name images including administrative place names arranged in descending order are input, and the place name image is recognized. And a collation evaluation means for evaluating and determining a plurality of address path candidates when a plurality of address paths including the administrative level place names in descending order are formed. And said that there were pictures.

また、本発明は、単一文字領域を分離する文字分離工程、前記文字分離工程により分離された単一文字領域の各々に対してキー文字を抽出するキー文字抽出工程、前記キー文字抽出工程によるキー文字の抽出結果に基づいて地名を一括して認識する一括地名認識工程、前記一括地名認識工程による地名認識結果に基づいて住所を決定する住所決定工程、を含んだ住所認識方法であって、前記キー文字抽出工程は、前記単一文字領域に対応する画像から特徴ベクトルを抽出する特徴ベクトル抽出工程と、前記特徴ベクトル抽出工程により抽出された特徴ベクトルと、すべてのあり得るキー文字を記憶したキー文字辞書におけるあり得るキー文字の特徴ベクトルとを照合し、あり得るすべてのキー文字候補を検索する照合工程と、前記照合工程により検索された１つ以上のキー文字候補をそれぞれ有する複数の単一文字領域を認識する場合に、正当なキー文字パスを決定するキー文字パス決定工程と、を含んだことを特徴とする。 Further, the present invention provides a character separation step for separating a single character region, a key character extraction step for extracting a key character for each single character region separated by the character separation step, and a key character by the key character extraction step. An address recognition method including a collective place name recognition step for collectively recognizing place names based on an extraction result of the address, an address determination step for determining an address based on a place name recognition result by the collective place name recognition step, and the key The character extraction step includes a feature vector extraction step of extracting a feature vector from an image corresponding to the single character region, a feature vector extracted by the feature vector extraction step, and a key character dictionary storing all possible key characters A collation step of collating with a feature vector of possible key characters and searching for all possible key character candidates; and Ri retrieved one or more key character candidates when recognizing a plurality of single-character area having respectively, characterized in that it includes a key character path determination step of determining a valid key characters path, the.

本発明の特徴の一つによれば、キー文字認識において、本発明は（Ｍ個のキー文字のみを含む）キー文字辞書とのみ比較を行う。これにより処理量が減り、認識精度が高まる。さらに、キー文字認識時に、結合パターンの評価結果の候補リストに一つだけキー文字がある場合には、あり得るキー文字候補の１つとして当該結合パターンを出力する。さらに、結合パターンがＬ（Ｌ<=Ｍ）個のあり得るキー文字候補を含んでいる場合には、Ｌ個のあり得るキー文字候補が出力される。したがって、例えば、結合パターンに二つのキー文字候補（「省」および「市」）が含まれている場合、当該結合パターンについては、「省」および「市」という二つの可能性のあるキー文字が出力される。これによって、正しいキー文字が失われる危険性が減り、住所認識の正確性が高まる。また、１つ以上のキー文字候補をそれぞれ有する複数の単一文字領域を認識する場合に、正当なキー文字パスを決定するので、従来の方法のように全てのキー文字パスを評価する必要はなく、平均照合距離が最小である高順位のＫ個のキー文字パスを選択するのみでよいため、住所認識の効率および正確性を高めることができる。 According to one aspect of the present invention, in key character recognition, the present invention compares only with a key character dictionary (including only M key characters). This reduces the amount of processing and increases the recognition accuracy. Further, when there is only one key character in the candidate list of the combination pattern evaluation result at the time of key character recognition, the combination pattern is output as one of the possible key character candidates. Further, when the combination pattern includes L (L <= M) possible key character candidates, L possible key character candidates are output. Thus, for example, if two key character candidates ("Province" and "City") are included in the combined pattern, the two possible key characters "Province" and "City" are used for the combined pattern. Is output. This reduces the risk of losing the correct key characters and increases the accuracy of address recognition. In addition, when recognizing a plurality of single character areas each having one or more key character candidates, since a valid key character path is determined, it is not necessary to evaluate all the key character paths as in the conventional method. Since it is only necessary to select high-rank K key character paths having the smallest average collation distance, the efficiency and accuracy of address recognition can be improved.

本発明の他の特徴によれば、地名検証方法が提供され、この地名検証方法では、一括地名認識による地名領域の認識結果と、当該地名から単一文字を切り出して一つ一つを認識した結果とが組み合わされ、一括地名認識の結果がフィルタリングされる。この方法では、個別文字認識の結果と実際の地名との間の編集距離を定義することにより、一括地名認識結果と個別文字認識結果とを効率的に組み合わせて検証をおこなう。一括地名認識が多数の同様の地名を区別できない場合に通常の認識では両者を別々に扱っており、住所認識の正確性が高まる。 According to another aspect of the present invention, a place name verification method is provided. In this place name verification method, a place name region recognition result by batch place name recognition, and a result of recognizing each one by cutting out a single character from the place name. Are combined, and the result of collective place name recognition is filtered. In this method, by defining the edit distance between the individual character recognition result and the actual place name, the verification is performed by efficiently combining the collective place name recognition result and the individual character recognition result. When collective place name recognition cannot distinguish a large number of similar place names, normal recognition treats both separately, which increases the accuracy of address recognition.

本発明のさらに他の特徴によれば、あるレベルの地名認識を行う場合、第１の候補から第Ｍの候補にいたる候補が生成され得る。当該レベルの認識結果が実際の地名と一致しなかった場合、後続のレベルの地名認識もエラーとなり、照合距離の大きいＭ個の候補を出力してしまう。したがって、地名認識によって照合距離の大きい候補が生成された場合、その前のレベルの地名認識が誤っている可能性が極めて高い。よって、最終レベルから第１レベルへのフィードバック処理がおこなわれる。すなわち、後のレベルの認識結果を前のレベルで発生したエラーを訂正するために使用する。これにより、住所認識の精度が高まる。本発明のさらに他の特徴によれば、全ての候補を評価しなくとも、フィードバック処理を伴う地名認識を行うことができるため、地名認識の効率および正確性が著しく高まる。また、降順に並んだ行政レベルの地名を含む複数の地名画像が入力され、当該地名画像の認識により行政レベルの地名を降順に含む複数の住所パスが形成される場合に、複数の住所パス候補の評価および決定をおこなうので、住所認識の信頼性を高めることができる。 According to still another feature of the present invention, when performing place name recognition at a certain level, candidates from the first candidate to the Mth candidate can be generated. If the recognition result of the level does not match the actual place name, the place name recognition of the subsequent level also becomes an error, and M candidates having a large collation distance are output. Therefore, when a candidate with a large collation distance is generated by place name recognition, there is a very high possibility that the place name recognition at the previous level is incorrect. Therefore, feedback processing from the final level to the first level is performed. That is, the recognition result of the subsequent level is used to correct an error that has occurred at the previous level. This increases the accuracy of address recognition. According to still another feature of the present invention, place name recognition with feedback processing can be performed without evaluating all candidates, and the efficiency and accuracy of place name recognition are significantly increased. In addition, when a plurality of place name images including administrative level place names arranged in descending order are input, and a plurality of address paths including the administrative level place names in descending order are formed by recognition of the place name images, a plurality of address path candidates are formed. This makes it possible to improve the reliability of address recognition.

以下に、図面を参照して、本発明の住所認識方法および住所認識装置を詳細に説明する。図面中、同一または類似の構成要素には同一の参照番号を付す。 Below, with reference to drawings, the address recognition method and address recognition device of the present invention are explained in detail. In the drawings, the same or similar components are denoted by the same reference numerals.

図３は、本発明の実施例に係る住所認識装置を示すブロック図である。図３において、単一文字領域検出手段１は、入力された住所画像から単一文字を１つずつ切り出す。単一文字の各画像領域を、単一文字領域と呼ぶ。キー文字抽出手段２は、単一文字領域検出手段１の結果に従い、キー文字を抽出する。次に、地名領域抽出手段３は、キー文字抽出手段２により抽出されたキー文字にもとづき、地名領域を抽出する。一括地名認識手段４は、地名領域抽出手段３が抽出した地名領域を再帰的に一括認識する。単一文字地名認識手段Ｅ１は、単一文字領域検出手段１から出力された文字を、地名領域抽出手段３が抽出した地名領域内の地名と対応付けて認識する。地名検証手段５は、一括地名認識手段４および単一文字地名認識手段Ｅ１が出力した地名認識結果を検証し決定する。リジェクト手段６は、地名認識結果をリジェクトするか承認するかを決定する。 FIG. 3 is a block diagram illustrating an address recognition apparatus according to an embodiment of the present invention. In FIG. 3, the single character area detecting means 1 cuts out single characters one by one from the input address image. Each image area of a single character is called a single character area. The key character extraction means 2 extracts key characters according to the result of the single character area detection means 1. Next, the place name area extracting unit 3 extracts a place name area based on the key characters extracted by the key character extracting unit 2. The collective place name recognition means 4 recursively recognizes the place name areas extracted by the place name area extraction means 3. The single character place name recognition means E1 recognizes the character output from the single character area detection means 1 in association with the place name in the place name area extracted by the place name area extraction means 3. The place name verification means 5 verifies and determines the place name recognition results output by the collective place name recognition means 4 and the single character place name recognition means E1. The reject means 6 determines whether to reject or approve the place name recognition result.

具体的には、認識対象である住所画像が入力されると、まず、単一文字領域検出手段１が、入力された住所画像から複数の単一文字領域を切り出す。単一文字領域を切り出した後、キー文字抽出手段２が単一文字領域の各々を認識し、キー文字を抽出する。キー文字を抽出した後、地名領域抽出手段３が、キー文字によって区切られた領域を入力画像から分離し、次に、一括地名認識手段４が、キー文字によって区切られた領域の特徴ベクトルと、地名辞書中の地名の特徴ベクトルとを照合し、照合距離が所定の閾値よりも小さい地名候補を地名候補列としてそれぞれ取得する。ここで、照合距離とは、キー文字により区切られた画像と、地名候補との非類似度を示すものである。照合距離の値が小さいほど、その時点で決定されている画像と地名候補との相違が少ないことになる。すなわち、両者はより類似していることになる。照合距離の計算については後述する。同時に、地名候補列中の各地名候補の照合距離と地名候補の最小の照合距離との間の差の、地名候補の最小の照合距離に対する比を順次見積もる。見積もった比が所定の閾値よりも大きい場合、その地名候補は当該画像と著しく相違すると判断され、その地名候補は除外される。したがって、本発明では、すべての地名候補を評価する必要がない。すなわち、決定されている画像と地名候補とが著しく異なっていれば、さらに比較を行う必要はないため（詳しくは図５および図６の説明を参照）、認識時間が短縮される。この時、単一文字地名認識手段Ｅ１および地名検証手段５を使用し、個別文字認識結果と一括地名認識結果とを組み合わせることにより、一括認識された地名をフィルタリングすることができる。同様に、異なったレベルの地名領域が一つずつ認識される。多くの場合、地名認識における第１地名候補は通常誤りであり、他の地名候補が正しい結果となる。したがって、対応する各パスの照合距離を順次計算し、平均照合距離が最も小さいパスを見つけ、当該パスを最終的な地名パスと決定する。このように、本発明において用いられる多候補再帰認識によって、認識の正確さが向上する。加えて、平均照合距離が最も小さいパスに近いパスが複数ある場合には、単一文字地名認識手段Ｅ１と地名検証手段５とを使用し、個別文字認識結果と一括地名認識結果とを組み合わせることにより、当該地名画像に対して一括認識された地名候補のうち当該複数のパスと食い違う地名候補をフィルタリングすることができる。 Specifically, when an address image to be recognized is input, first, the single character area detecting means 1 cuts out a plurality of single character areas from the input address image. After cutting out the single character area, the key character extracting means 2 recognizes each single character area and extracts the key character. After extracting the key character, the place name area extracting means 3 separates the area delimited by the key character from the input image, and then the collective place name recognizing means 4 includes the feature vector of the area delimited by the key character, The place name feature vectors in the place name dictionary are collated, and place name candidates whose collation distance is smaller than a predetermined threshold are respectively obtained as place name candidate strings. Here, the collation distance indicates the degree of dissimilarity between an image divided by key characters and a place name candidate. The smaller the collation distance value, the smaller the difference between the image determined at that time and the place name candidate. That is, both are more similar. The calculation of the verification distance will be described later. At the same time, the ratio of the difference between the collation distance of each place name candidate in the place name candidate string and the minimum collation distance of the place name candidate to the minimum collation distance of the place name candidates is sequentially estimated. If the estimated ratio is greater than a predetermined threshold, it is determined that the place name candidate is significantly different from the image, and the place name candidate is excluded. Therefore, in the present invention, it is not necessary to evaluate all the place name candidates. That is, if the determined image and the place name candidate are significantly different from each other, it is not necessary to perform further comparison (refer to the description of FIGS. 5 and 6 for details), so that the recognition time is shortened. At this time, by using the single character place name recognition means E1 and the place name verification means 5 and combining the individual character recognition result and the collective place name recognition result, the collectively recognized place names can be filtered. Similarly, place name areas of different levels are recognized one by one. In many cases, the first place name candidates in place name recognition are usually incorrect, and other place name candidates have the correct result. Therefore, the matching distance of each corresponding path is calculated sequentially, the path with the smallest average matching distance is found, and the path is determined as the final place name path. Thus, the multi-candidate recursive recognition used in the present invention improves the recognition accuracy. In addition, when there are a plurality of paths close to the path with the smallest average collation distance, the single character place name recognition means E1 and the place name verification means 5 are used to combine the individual character recognition result and the collective place name recognition result. The place name candidates that are inconsistent with the plurality of paths among the place name candidates collectively recognized for the place name image can be filtered.

単一文字地名認識手段Ｅ１は、キー文字により区切られた領域内に存在し、かつ、単一文字領域検出手段１により得られた単一文字領域の特徴を抽出し、抽出した特徴と単一文字辞書内の文字の特徴とを順次比較して認識する。地名が認識された後、地名検証手段５は、一括地名認識結果と個別文字認識結果とを組み合わせ、認識結果を最終的にチェックする。これによって、単一文字領域認識結果と再帰的一括認識結果とが組み合わされ、認識の正確さが向上する。 The single character place name recognition means E1 is present in the area delimited by the key characters, and extracts the characteristics of the single character area obtained by the single character area detection means 1, and extracts the extracted features and the single character dictionary. Recognize by sequentially comparing the character characteristics. After the place name is recognized, the place name verification means 5 combines the batch place name recognition result and the individual character recognition result, and finally checks the recognition result. As a result, the single character area recognition result and the recursive collective recognition result are combined to improve the recognition accuracy.

図３の各手段を以下順番に説明する。 Each means of FIG. 3 is demonstrated in order below.

図４は、図３の単一文字領域検出手段１を示すブロック図である。 FIG. 4 is a block diagram showing the single character area detecting means 1 of FIG.

図４において、結合要素ラベリング手段３４は、結合要素検出アルゴリズムによって、入力画像中の結合要素をすべてラベリングする（非特許文献１を参照）。結合要素とは、画像中の（通常、黒である）文字表示色の画素ポイントの集合である。当該集合中の画素によって、当該集合中の任意の２つの画素ポイントを結合することができる。 In FIG. 4, the coupling element labeling means 34 labels all the coupling elements in the input image by a coupling element detection algorithm (see Non-Patent Document 1). A combination element is a set of pixel points of a character display color (usually black) in an image. Any two pixel points in the set can be combined by pixels in the set.

単一文字領域候補決定手段３５は、ラベリングされた結合要素のサイズ、ラベリングされた結合要素の位置、および隣接する要素からの距離等に基づいて、対応する結合要素を統合して、新しい結合要素にする。新しい結合要素は、単一文字領域候補である。例えば、入力された住所画像が横向きに書かれていた場合、結合要素Ａが結合要素Ｂの上に位置していれば、ＢとＡとは同一の文字に属するはずであるので、ＢとＡとは新しい結合要素に統合されねばならない。結合要素の統合の詳細については、特許文献１の図１１Ａ、図１１Ｂ、および図１１Ｃならびに特許文献１の記述を参照されたい。 Based on the size of the labeled coupling element, the position of the labeled coupling element, the distance from the adjacent element, etc., the single character region candidate determination means 35 integrates the corresponding coupling elements into a new coupling element. To do. The new combined element is a single character region candidate. For example, if the input address image is written sideways, if the coupling element A is positioned above the coupling element B, B and A should belong to the same character, so B and A And must be integrated into a new coupling element. For details of the integration of the coupling elements, refer to FIGS. 11A, 11B, and 11C of Patent Document 1 and the description of Patent Document 1.

平均文字サイズ計算手段３６は、単一文字領域候補決定手段３５により得られた新しい結合要素中、明らかに異常なサイズの結合要素を除去し、残りの結合要素および文字の平均サイズを計算する。ここで、ＣＣ_i（ｉ＝１，２，．．．，ｍ_cc）が、単一文字領域候補決定手段３５により得られたｍ_cc個の結合要素のすべてを表し、Ｗｉｄｔｈ（ＣＣ_i）が、結合要素ＣＣ_iの幅を表し、Ｈｅｉｇｈｔ（ＣＣ_i）がその高さを表すとすれば、以下の条件（１）〜（４）のいずれか一つを満たすＣＣ_kはすべて除去される。 The average character size calculating means 36 removes the apparently abnormal size of the combined elements obtained by the single character area candidate determining means 35 and calculates the average size of the remaining combined elements and characters. Here, CC _i (i = 1, 2,..., M _cc ) represents all m _cc combined elements obtained by the single character region candidate determining means 35, and Width (CC _i ) is If the width of the coupling element CC _i is represented and Height (CC _i ) represents its height, all CC _k satisfying any one of the following conditions (1) to (4) are removed.

（１）Width(CC_k) < WCC_median− Th_{width_low}
（２）Width(CC_k) > WCC_median+ Th_{width_low}
（３）Height(CC_k) < HCC_median− Th_{height_low}
（４）Height(CC_k) > HCC_median+ Th_{height_low}
ここで、ＷＣＣ_medianは、全てのＷｉｄｔｈ（ＣＣ_i）（ｉ＝１，２，．．．，ｍ_cc）から得られる中央値であり、ＨＣＣ_medianは、全てのＨｅｉｇｈｔ（ＣＣ_i）（ｉ＝１，２，．．．，Ｍ_cc）から得られる中央値であり、Ｔｈ_{width_low}およびＴｈ_{height_low}は、予め定められた２つの正の閾値である。中央値の計算の詳細については、非特許文献２を参照されたい。 (1) Width (CC _k ) <WCC _median − Th _{width_low}
(2) Width (CC _k )> WCC _median + Th _{width_low}
(3) Height (CC _k ) <HCC _median − Th _{height_low}
(4) Height (CC _k )> HCC _median + Th _{height_low}
Here, WCC _median is a _median value obtained from all Width (CC _i ) (i = 1, 2,..., M _cc ), and HCC _median is all Height (CC _i ) (i = 1, 2,..., M _cc ), and Th _{width_low} and Th _{height_low} are two predetermined positive thresholds. Refer to Non-Patent Document 2 for details of the median calculation.

画素投影手段３７は、（画像が横方向に書かれている場合は）入力された住所画像を縦方向に投影し、（画像が縦方向に書かれている場合は）入力された住所画像を横方向に投影する。縦方向（または横方向）画像投影によって、横方向（または縦方向）の画像の各ポイントの黒い画素の数が得られる。 The pixel projecting unit 37 projects the input address image in the vertical direction (if the image is written in the horizontal direction), and converts the input address image (if the image is written in the vertical direction). Project in the horizontal direction. Vertical (or horizontal) image projection yields the number of black pixels at each point in the horizontal (or vertical) image.

一時的分離ポイント候補抽出手段３８および一時的分離ポイント候補決定手段３９は、単一文字領域候補決定手段３５から出力された新しい結合要素を実際の文字領域に分離するために使用される。すなわち、結合文字を個々の単一文字に分離するために使用される。 Temporary separation point candidate extraction means 38 and temporary separation point candidate determination means 39 are used to separate the new combined elements output from single character area candidate determination means 35 into actual character areas. That is, it is used to separate the combined characters into individual single characters.

画素投影手段３７により得られる黒い画素の数にしたがって、一時的分離ポイント候補抽出手段３８は、以下の二つの条件を満たす位置ポイントを一時的分離ポイント候補として抽出する。これらの位置ポイントは、結合文字の分離ポイントとなる可能性のあるポイントである。 According to the number of black pixels obtained by the pixel projection unit 37, the temporary separation point candidate extraction unit 38 extracts position points that satisfy the following two conditions as temporary separation point candidates. These position points are points that can become separation points for combined characters.

（条件１）位置ポイントは、投影によって得られた黒い画素の数のヒストグラムにおいて最小値をとるポイントである。
（条件２）当該位置ポイント上の黒い画素の数は、所定の閾値Ｔｈ_segよりも少ない。 (Condition 1) The position point is a point having a minimum value in the histogram of the number of black pixels obtained by projection.
(Condition 2) The number of black pixels on the position point is smaller than a predetermined threshold Th _seg .

分離ポイント候補の抽出は、例えば、特許文献１に記載の方法を用いて行うことができる。 The extraction of the separation point candidates can be performed using, for example, the method described in Patent Document 1.

一時的分離ポイント候補決定手段３９は、一時的分離ポイント候補抽出手段３８から得られる一時的分離ポイントが隣りあった文字領域の真の分離ポイントであるか否かを決定する。 Temporary separation point candidate determination means 39 determines whether or not the temporary separation point obtained from temporary separation point candidate extraction means 38 is a true separation point of the adjacent character area.

入力された文字パターンが横向きに書かれている場合は、分離される結合要素のいずれについても、当該結合要素の開始位置から平均文字幅の２分の１の倍数に相当する位置に隣接する領域内に位置する一時的分離ポイント候補が、分離ポイントとして決定される。具体的には、Ｐ_i（０＜ｉ＜ｓ：ｉは正の整数）を当該結合要素の開始位置から平均文字幅の２分の１の倍数に相当する地点にある位置ポイントとした場合に、
P_i - Th_delta < C <= Th_delta + P_i
を満足する分離ポイントＣおよびｉが存在すれば、Ｃが分離ポイントであると決定される。そして、当該結合要素は、ポイントＣにおいて、二つの新しい結合要素に縦方向に切断される。 When the input character pattern is written horizontally, for each of the separated coupling elements, an area adjacent to a position corresponding to a multiple of one half of the average character width from the start position of the coupling element Temporary separation point candidates located within are determined as separation points. Specifically, when _Pi (0 <i <s: i is a positive integer) is a position point at a point corresponding to a multiple of one half of the average character width from the start position of the coupling element. ,
P _i -Th _delta <C <= Th _delta + P _i
If there are separation points C and i satisfying the above, it is determined that C is a separation point. The connecting element is then cut longitudinally into two new connecting elements at point C.

入力された文字パターンが縦向きに書かれている場合は、分離される結合要素のいずれについても、当該結合要素の開始位置から平均文字幅の２分の１の倍数に相当する位置に隣接する領域内に位置する一時的分離ポイント候補が、分離ポイントとして決定される。具体的には、Ｐ_i（０＜ｉ＜ｓ：ｉは正の整数）を当該結合要素の開始位置から平均文字幅の２分の１の倍数に相当する地点にある位置ポイントとした場合に、
P_i - Th_delta < C <= Th_delta + P_i
を満足する分離ポイントＣおよびｉが存在すれば、Ｃが分離ポイントであると決定される。そして、当該結合要素は、ポイントＣにおいて、二つの新しい結合要素に横方向に切断される。 When the input character pattern is written vertically, any of the separated coupling elements is adjacent to a position corresponding to a multiple of one half of the average character width from the start position of the coupling element. Temporary separation point candidates located within the region are determined as separation points. Specifically, when _Pi (0 <i <s: i is a positive integer) is a position point at a point corresponding to a multiple of one half of the average character width from the start position of the coupling element. ,
P _i -Th _delta <C <= Th _delta + P _i
If there are separation points C and i satisfying the above, it is determined that C is a separation point. The connecting element is then cut transversely into two new connecting elements at point C.

図５は、本発明の実施の形態に係るキー文字抽出手段２を示すブロック図である。 FIG. 5 is a block diagram showing the key character extraction means 2 according to the embodiment of the present invention.

図５において、キー文字候補抽出手段２０は、単一文字領域検出手段１から入力された単一文字領域（または結合要素）からキー文字候補を抽出する。キー文字パス決定手段２１は、住所階層構造知識部２４を参照し、妥当なキー文字パスを決定する。ここで、住所階層構造知識部２４とは、地名の階層知識を提供することができるデータベースまたはプログラム等である。キー文字パスフィルタ手段２２は、複数のキー文字パスを記憶し、各パス中のキー文字の平均照合距離（すなわち、様々なレベルの地名の照合距離の合計と住所レベルの数との比）を計算し、全てのパスを昇順に（小さいものから大きいものの順に）並べ、最後に、より高順位のＫ個のキー文字パスを最終キー文字パスとして選択する。ここで、Ｋはあらかじめ定められた正の整数である。 In FIG. 5, the key character candidate extraction unit 20 extracts key character candidates from the single character region (or combination element) input from the single character region detection unit 1. The key character path determining means 21 refers to the address hierarchy structure knowledge unit 24 and determines an appropriate key character path. Here, the address hierarchical structure knowledge unit 24 is a database or a program that can provide hierarchical knowledge of place names. The key character path filter means 22 stores a plurality of key character paths, and calculates the average collation distance of key characters in each path (that is, the ratio of the sum of collation distances of various levels of place names to the number of address levels). All paths are calculated in ascending order (from smallest to largest), and finally K key character paths with higher ranks are selected as final key character paths. Here, K is a predetermined positive integer.

具体的には、認識のために住所画像が入力されたとき、まず入力された住所画像から結合要素が抽出され、次にキー文字候補抽出手段２０が、当該結合要素からキー文字を抽出する（詳細については、以下の図６の説明を参照）。キー文字を抽出した後、キー文字パス決定手段２１が、地名階層知識と適合するキー文字パスを決定する。決定されたキー文字パスは、キー文字の順序を記述するもので、先行するキー文字は後続のキー文字の上位の行政単位となっている。例えば、
［文字２］

はキー文字パスであるが、
［文字１］

は「市」の上位行政単位ではなく、「市」は行政上の意味で
［文字１］

を包含するものであるから、
［文字３］

はキー文字パスではない。次に、キー文字パスフィルタ手段２２が、認識結果中の各キー文字の平均照合距離を抽出する。各キー文字パスは、順次順位付けられる。地名領域抽出および認識においては、平均照合距離が最も小さいＫ個の文字パスのみについて評価を行う。ここで、Ｋは、あらかじめ定められた正の整数である。例えば、入力された住所が
［文字１１］

である場合、”京”および”市”に対応する画像領域について、「市」がキー文字と認識され、かつ、”区”に対応する画像領域について、「区」がキー文字と認識された場合、キー文字パスは、「市」（”京”）−「区」（”区”）、「市」（”市”）−「区」（”区”）となる。この場合、「市」（”京”）−「区」（”区”）において”京”および”区”が「市」および「区」として認識された場合の平均照合距離と、「市」（”市”）−「区」（”区”）において”市”および”区”が「市」および「区」として認識された場合の平均照合距離とが計算される。キー文字の平均照合距離が大きいほど、それが正しいキー文字パスである可能性は低いため、可能な限り除去せねばならない。このように、本発明の地名領域抽出・認識においては、従来の方法のように全てのキー文字パスを評価する必要はなく、平均照合距離が最小である高順位のＫ個のキー文字パスを選択するのみでよいため、住所認識の効率および正確性を高めることができる（ただし、Ｋはあらかじめ定められた正の整数である）。 Specifically, when an address image is input for recognition, first, a combination element is extracted from the input address image, and then a key character candidate extraction unit 20 extracts a key character from the combination element ( For details, see the description of FIG. 6 below). After extracting the key characters, the key character path determining means 21 determines a key character path that matches the place name hierarchy knowledge. The determined key character path describes the order of the key characters, and the preceding key character is an administrative unit higher than the succeeding key character. For example,
[Character 2]

Is the key character path,
[Character 1]

Is not a higher-level administrative unit of “city”, but “city” is an administrative meaning [letter 1]

Because it includes
[Character 3]

Is not a key character path. Next, the key character path filter means 22 extracts the average collation distance of each key character in the recognition result. Each key character path is ranked sequentially. In place name area extraction and recognition, only K character paths with the smallest average collation distance are evaluated. Here, K is a predetermined positive integer. For example, if the input address is [Character 11]

, “City” is recognized as a key character for the image areas corresponding to “Kyo” and “City”, and “Ku” is recognized as a key character for the image area corresponding to “Ku”. In this case, the key character path is “city” (“Kyo”) − “ku” (“ku”), “city” (“city”) − “ku” (“ku”). In this case, the average collation distance when “Kyo” and “Ku” are recognized as “City” and “Ku” in “City” (“Kyo”) − “Ku” (“Ku”), and “City” The average collation distance when “city” and “city” are recognized as “city” and “city” in (“city”) − “city” (“city”) is calculated. The greater the average collation distance of key characters, the less likely it is to be a correct key character path, so it must be removed as much as possible. As described above, in the place name area extraction / recognition of the present invention, it is not necessary to evaluate all the key character paths as in the conventional method, and K key character paths with high ranks having the smallest average collation distance are used. Since only selection is required, the efficiency and accuracy of address recognition can be improved (where K is a predetermined positive integer).

図６は、本発明の実施の形態に係るキー文字候補抽出手段２０のブロック図である。 FIG. 6 is a block diagram of key character candidate extraction means 20 according to the embodiment of the present invention.

特徴抽出手段４２は、単一文字領域検出手段１から入力された結合要素（すなわち、単一文字領域）に対応する画像から特徴ベクトルを抽出する。辞書照合手段４３は、キー文字辞書部４１を参照して、特徴抽出手段４２から出力された特徴ベクトルと、キー文字辞書部４１中のキー文字の特徴ベクトルとを照合し、可能性のあるキー文字を候補として出力する。キー文字候補決定手段４４は、これら選択されたキー文字候補が真のキー文字であるか否かを決定する。 The feature extraction unit 42 extracts a feature vector from the image corresponding to the coupling element (that is, the single character region) input from the single character region detection unit 1. The dictionary collating unit 43 refers to the key character dictionary unit 41, collates the feature vector output from the feature extracting unit 42 with the feature vector of the key character in the key character dictionary unit 41, and has a possible key Output characters as candidates. The key character candidate determining means 44 determines whether or not these selected key character candidates are true key characters.

具体的には、単一文字領域検出手段１から結合要素が入力されると、まず、特徴抽出手段４２が、結合要素の特徴を抽出する。次に、辞書照合手段４３が、これらの結合要素の画像から抽出された（ｆＣＣ_iとラベリングされた）特徴ベクトルと、キー文字辞書部４１中の全てのキー文字の特徴ベクトルとを照合する。キー文字辞書部４１中のキー文字は、ｆｋｅｙ_kとラベリングされており（ただし、ｋ＝１，２，３，．．．，２２）、中国の住所における全２２のキー文字に対応する。すなわち、
［文字１２］

である。結合要素との照合距離Ｄｉｓ（ＣＣ_i，Ｋｃ）が所定の閾値Ｔｈ_keyよりも小さいキー文字Ｋｃが存在する場合、当該結合要素は、キー文字の可能性がある候補Ｋｃであると決定される。照合距離Ｄｉｓ（ＣＣ_i，Ｋｃ）は、ｆＣＣ_iとｆｋｅｙ_Kcとの間のユークリッド距離と定義される。すなわち、

である。 Specifically, when a coupling element is input from the single character area detection unit 1, the feature extraction unit 42 first extracts the feature of the coupling element. Next, the dictionary collation means 43 collates the feature vectors extracted from the images of these coupling elements (labeled as fCC _i ) with the feature vectors of all the key characters in the key character dictionary unit 41. The key characters in the key character dictionary section 41 are labeled fkey _k (where k = 1, 2, 3,..., 22), and correspond to all 22 key characters in a Chinese address. That is,
[Character 12]

It is. When there is a key character Kc whose collation distance Dis (CC _i , Kc) with the combined element is smaller than a predetermined threshold Th _key , the combined element is determined to be a candidate Kc that may be a key character. . The collation distance Dis (CC _i , Kc) is defined as the Euclidean distance between fCC _i and fkey _Kc . That is,

It is.

上記のように、ｆＣＣ_iおよびｆｋｅｙ_Kcの次元はｍである。 As described above, the dimensions of fCC _i and fkey _Kc are m.

このように、キー文字候補抽出手段２０は、単一文字領域と限られた数（中国語の場合は２２個）のキー文字のみとを照合し、従来方法の場合のように数千の漢字すべてとの照合を行う必要がない。これによって、システムの効率性が高められ、同時に、キー文字の認識精度が上がる。 In this way, the key character candidate extracting means 20 collates a single character area with only a limited number (22 in the case of Chinese) of key characters, and all thousands of Chinese characters as in the conventional method. There is no need to check against. This increases the efficiency of the system and at the same time increases the accuracy of key character recognition.

加えて、結合要素の認識において、Ｌ個（Ｌ<=２２）の候補との照合距離が所定の閾値Ｔｈ_keyよりも小さい場合には、当該結合要素は、Ｌ個のキー文字の可能性があるものと認識される。したがって、例えば、結合要素の可能性があるキー文字が「省」および「市」であれば、当該結合要素は二つのキー文字「省」および「市」であると認識される。これによって、正しい文字が失われる危険性が減るとともに、住所認識の正確性が高まる。 In addition, in the recognition of the coupling element, when the collation distance with L (L <= 22) candidates is smaller than a predetermined threshold Th _key , the coupling element may be L key characters. It is recognized that there is. Therefore, for example, if the key characters that may be the combination elements are “Province” and “City”, the combination element is recognized as two key characters “Province” and “City”. This reduces the risk of losing correct characters and increases the accuracy of address recognition.

さらに、結合要素ＣＣ_iのキー文字候補がＬ個のキー文字（Ｋ₁，Ｋ₂，・・・，Ｋ_L）（ただし、Ｋ₁，Ｋ₂，・・・，Ｋ_Lは、照合距離にしたがって昇順に並べられている）を含む場合、キー文字候補決定手段４４は、当該結合要素を、次式を満たすキー文字のみを含むものとして認識する。
[Dis(CC_i, K_n) − Dis(CC_i, K₁)] / Dis(CC_i, K₁) < Th_{dis_k}
ここで、Ｄｉｓ（ＣＣ_i，Ｋ_n）は、キー文字Ｋ_nと結合要素ＣＣ_iとの間の照合距離を表し、Ｔｈ_{dis_k}は、正の定数であり、あらかじめ定められた閾値を表す。このようなキー文字除去方法を用いることにより、第１の文字候補に近いキー文字が保持され、その後のキー文字パス決定および地名照合において評価がなされる。上記の条件を満たさないキー文字がある場合、かかるキー文字の照合距離は第１候補の照合距離とはかなり遠いことを示しているので、当該キー文字が正しいキー文字である可能性は低く、そのようなキー文字は除去される。これにより、キー文字抽出の後は、可能性のある全てのキー文字について評価を行うために地名領域抽出および認識を行う必要はなく、住所認識の効率性および正確性が高められる。 Furthermore, the key character candidates coupling element CC _i are L key character _{_{(K 1, K 2, ···}} , K L) ( _{_{but, K 1, K 2, ···}} , K L is the matching distance Therefore, the key character candidate determination unit 44 recognizes the combination element as including only key characters satisfying the following expression.
[Dis (CC _i , K _n ) − Dis (CC _i , K ₁ )] / Dis (CC _i , K ₁ ) <Th _{dis_k}
Here, Dis (CC _i , K _n ) represents a collation distance between the key character K _n and the coupling element CC _i, and Th _{dis_k} is a positive constant and represents a predetermined threshold value. By using such a key character removal method, a key character close to the first character candidate is held, and evaluation is performed in subsequent key character path determination and place name verification. If there is a key character that does not satisfy the above conditions, the collation distance of the key character indicates that it is far from the collation distance of the first candidate, so it is unlikely that the key character is a correct key character, Such key characters are removed. Thereby, after the key character extraction, there is no need to extract and recognize the place name area in order to evaluate all the possible key characters, and the efficiency and accuracy of address recognition are improved.

図７は、本発明の実施の形態に係る一括地名認識手段４を示すブロック図である。図７に示すように、地名照合手段２７は、地名領域抽出手段３から入力された様々なレベルの地名に対応する画像から特徴を抽出し、（地名データベースおよび地名辞書等を記憶する）地名辞書部５４に記憶された地名特徴と照合し、それによって、１からＮｃ番目までの候補を出力する。地名候補評価手段２８は、１からＮｃ番目までの候補を評価し、それによって数個の候補を下位の住所に対する上位の行政単位として決定する。再帰的照合評価手段２９は、すべての住所候補の平均照合距離を計算し、そこから平均照合距離が最も小さい住所を選択する。地名照合手段２７は、地名画像中の特徴ベクトルを抽出するための特徴抽出手段（図示せず）と、抽出された特徴ベクトルと地名辞書中の地名の特徴ベクトルとを照合して可能性のある地名候補全てを見つけるための照合手段とを備える。 FIG. 7 is a block diagram showing the collective place name recognition means 4 according to the embodiment of the present invention. As shown in FIG. 7, the place name collating means 27 extracts features from images corresponding to place names at various levels inputted from the place name area extracting means 3, and stores place name dictionary (place name database, place name dictionary, etc.). The place name feature stored in the unit 54 is collated, thereby outputting the first to Ncth candidates. The place name candidate evaluation means 28 evaluates the first to Ncth candidates, and thereby determines several candidates as higher administrative units for lower addresses. The recursive matching evaluation means 29 calculates an average matching distance of all address candidates, and selects an address having the smallest average matching distance therefrom. The place name collating unit 27 may collate the extracted feature vector with the feature vector of the place name in the place name dictionary, and a feature extracting unit (not shown) for extracting a feature vector in the place name image. Collation means for finding all place name candidates.

地名認識においては第１候補は通常正しい地名ではないため、本発明の多候補再帰的認識方法によって、システムの認識精度が著しく向上する。さらに、地名の評価において、全ての候補を調べる必要がなく、システムの認識効率が向上する。 In place name recognition, since the first candidate is usually not a correct place name, the multi-candidate recursive recognition method of the present invention significantly improves the recognition accuracy of the system. Furthermore, in the evaluation of place names, it is not necessary to examine all candidates, and the recognition efficiency of the system is improved.

図８に、本発明の実施の形態に係る一括地名認識手段４がおこなう処理のフローチャートを例として示す。 FIG. 8 shows as an example a flowchart of processing performed by the collective place name recognition unit 4 according to the embodiment of the present invention.

入力された住所画像中にＮレベルのキー文字が存在すると仮定する。ステップＳＦ１において、住所認識はレベル１（ｉ＝１）から始まる。ステップＳＦ２において、ｉレベルの地名照合が行われる。地名照合において、上位の住所が存在する場合、当該ｉレベルの地名に対応する地名画像から抽出された特徴が、当該ｉレベルの地名のキー文字によって区切られ、かつ、地名辞書中にある上位の住所に包含される地名と照合される。１からＮｃ番目までの地名候補が生成され、照合距離に基づいて昇順に並べられる。ステップＳＦ３において、候補連続番号カウンタｋを２からカウントが始まるように設定する。ステップＳＦ４において、ｋ番目の地名候補の照合距離と第１の候補の照合距離との差の第１の候補の照合距離に対する比を計算する。ステップＳＦ５で、当該比が、所定の閾値Ｔｈ_disよりも大きいか否か、または、当該候補が最後の候補であるか否かを判定する。具体的には、
[Dis(cand1_k) − Dis(cand1₁)]/ Dis(cand1₁) < Th_dis
が成立するか、または、その時点での候補が最後の候補であると判定された場合には、ステップＳＦ７に進み、次のレベルの地名認識で使用するために１からｋ番目の候補を上位の行政単位として保持する。ここで、Ｄｉｓ（Ｃａｎｄ１_k）は、第１レベルの地名認識におけるｋ番目の候補の照合距離を表し、照合距離は、キー文字によって区切られた画像と候補との間の非類似度を表す。照合距離の値が小さいほど、その時点で確定されている画像と候補との間の相違は小さい。すなわち、両者はより近い。照合距離の計算については後述する。Ｔｈ_disは正の定数であり、あらかじめ定められた閾値である。照合距離の値がこの閾値を上回る場合は、すべてその時点で確定されている画像が候補とは著しく異なっていることを示しており、さらに比較する必要はない。さらに、地名候補は照合距離に基づいて並べられているため、その時点の候補について計算された比がＴｈ_disを上回れば、続く候補の比も必然的に当該閾値を上回ることになるため、それ以上評価する必要はない。他方、ステップＳＦ５において比が所定の閾値Ｔｈ_disを上回らないと決定され、かつ、当該候補が最後の候補ではないと判定されれば、ステップＳＦ６においてｋの値を増分し、ステップＳＦ４に戻る。 Assume that there are N-level key characters in the input address image. In step SF1, address recognition starts from level 1 (i = 1). In step SF2, i-level place name matching is performed. In the place name matching, if a higher-order address exists, the features extracted from the place-name image corresponding to the i-level place name are separated by the key characters of the i-level place name and Matched with the place name included in the address. First to Nc-th place name candidates are generated and arranged in ascending order based on the collation distance. In step SF3, the candidate serial number counter k is set so that counting starts from 2. In step SF4, the ratio of the difference between the collation distance of the kth place name candidate and the collation distance of the first candidate to the collation distance of the first candidate is calculated. In step SF5, it is determined whether or not the ratio is greater than a predetermined threshold value Th _dis or whether or not the candidate is the last candidate. In particular,
[Dis (cand1 _k ) − Dis (cand1 ₁ )] / Dis (cand1 ₁ ) <Th _dis
Or if it is determined that the current candidate is the last candidate, the process proceeds to step SF7, and the 1st to kth candidates are ranked higher for use in the next level place name recognition. As an administrative unit. Here, Dis (Cand1 _k ) represents the collation distance of the kth candidate in the first-level place name recognition, and the collation distance represents the dissimilarity between the image and the candidate divided by the key characters. The smaller the value of the collation distance, the smaller the difference between the image and the candidate determined at that time. That is, both are closer. The calculation of the verification distance will be described later. Th _dis is a positive constant and is a predetermined threshold value. If the value of the collation distance exceeds this threshold, it means that all the images determined at that time are significantly different from the candidates, and no further comparison is necessary. Furthermore, since the place name candidates are arranged based on the collation distance, if the ratio calculated for the current candidate exceeds Th _dis , the ratio of the subsequent candidates will necessarily exceed the threshold value. There is no need to evaluate above. On the other hand, if it is determined in step SF5 that the ratio does not exceed the predetermined threshold Th _dis and it is determined that the candidate is not the last candidate, the value of k is incremented in step SF6, and the process returns to step SF4.

全ての地名候補について評価を行った後、１からｋ番目までの地名候補を下位の地名とマッチする上位住所として保持する。各レベルの住所照合において保持される地名候補の数ｋは動的に変化し、各レベルにおいて異なる数となりうることがわかる。当該地名候補の照合距離が第１候補の照合距離と著しく異なっている場合、当該地名候補が正しい地名である可能性は低く、最終的なｋは比較的小さな数となる。それ以外の場合には、比較的大きい数となる。 After all the place name candidates have been evaluated, the 1st to kth place name candidates are held as upper addresses matching lower place names. It can be seen that the number k of place name candidates held in the address verification at each level changes dynamically and can be different at each level. If the collation distance of the place name candidate is significantly different from the collation distance of the first candidate, it is unlikely that the place name candidate is a correct place name, and the final k is a relatively small number. In other cases, the number is relatively large.

ステップＳＦ８においては、Ｎ個の地名レベル全てが認識されたか否かを決定する。全てのレベルが認識されていない場合には、ステップＳＦ９でｉの値を増分した後、次のレベルの地名照合を行う。例えば、第２レベルの地名照合においては、第１レベルの地名候補から出力された上位行政単位の範囲内で、第１および第２のキー文字間のパターンを、上位の地名により包含され、かつ、第２のキー文字によって区切られる行政単位の地名と照合する必要がある。例えば、第１レベルの地名候補が、「北京」と
［文字９］

であり、第１のキー文字が「市」、第２のキー文字が「区」である場合、第１のキー文字および第２のキー文字の間の画像と、「北京」市に包含される全ての名前とが可能性のある住所の一つとして照合される。さらに、他の可能性のある住所として、第１のキー文字および第２のキー文字の間の画像と、
［文字９］

市に包含される全ての名前とが照合される。そして、１からＮｃ番目までの可能性のある候補が出力される。 In step SF8, it is determined whether or not all N place name levels have been recognized. If all levels are not recognized, the value of i is incremented in step SF9, and then the next level place name verification is performed. For example, in the second level place name verification, the pattern between the first and second key characters is included by the upper place name within the range of the higher administrative unit output from the first level place name candidates, and It is necessary to collate with the place name of the administrative unit delimited by the second key character. For example, if the first level place name candidates are “Beijing” and [Character 9]

And the first key character is “city” and the second key character is “gu”, and the image between the first key character and the second key character is included in the “Beijing” city. All names are matched as one possible address. In addition, as other possible addresses, images between the first key character and the second key character,
[Character 9]

All names included in the city are matched. Then, the 1st to Ncth possible candidates are output.

さまざまなレベルでの地名の照合が終わると、つまり、ＳＦ８において「はい」であると判定されると（最後のレベルの地名については、それより下位の地名がないため、地名候補の評価を行う必要はなく、地名候補を他の地名を包含する上位地名として出力する必要はない）、ＳＦ１０において、各住所候補におけるすべての地名の照合距離を累積的に加算する。ＳＦ１１で、ＳＦ１０で得られた累積的に加算された照合距離にしたがって住所候補を順位付けし、累積的に加算された照合距離が最も小さい住所を認識結果として出力する。 When place name collation at various levels is completed, that is, when it is determined to be “Yes” in SF8 (the place name of the last level has no lower place names, and therefore place name candidates are evaluated. There is no need, and there is no need to output the place name candidates as upper place names including other place names.) In SF10, the collation distances of all place names in each address candidate are cumulatively added. In SF11, the address candidates are ranked according to the cumulatively added collation distance obtained in SF10, and the address having the smallest cumulatively added collation distance is output as a recognition result.

上述した処理は再帰的処理である。すなわち、ｉレベルの地名照合は、ｉ−１レベルから上位住所として得られたｋ個の候補すべてについてそれぞれ行わねばならない。同様に、ステップＳＦ７の後にｉレベルの地名照合から得られたｋ個の地名候補は、次のレベルの地名照合において上位住所としてすべて処理されねばならない。 The process described above is a recursive process. That is, i-level place name matching must be performed for all k candidates obtained as higher addresses from the i-1 level. Similarly, all k place name candidates obtained from the i-level place name matching after step SF7 must be processed as higher-order addresses in the next level place name matching.

照合距離Ｄｉｓ(Ｃａｎｄ_i)は、ｉ番目の地名候補の特徴ベクトルｆ_iと、地名画像から抽出されたそれに対応する特徴ベクトルｖ_iとの間のユークリッド距離と定義される。すなわち、次式が成立する。

上記のとおり、特徴ベクトルｆ_iおよび特徴ベクトルｖ_iの次元はｍである。 The collation distance Dis (Cand _i ) is defined as the Euclidean distance between the feature vector f _i of the i-th place name candidate and the corresponding feature vector v _i extracted from the place name image. That is, the following equation is established.

As described above, the dimension of the feature vector f _i and the feature vector v _i is m.

図８は、一括地名認識の動作フローの一例を示しているに過ぎない。一括地名認識処理においては、ステップＳＦ２における照合結果はさらに単一文字地名認識結果と組み合わされ、得られた住所候補を地名検証手段を用いてフィルタリングすることができる。その後、ステップＳＦ３以降の処理を行うことで、地名認識の精度が高められる。同様に、ステップＳＦ７の認識結果を単一文字地名認証結果と組み合わせ、得られた住所候補を地名検証手段を用いてフィルタリングすることができる。地名検証手段５および単一文字地名認証手段Ｅ１については後述する。 FIG. 8 only shows an example of an operation flow for collective place name recognition. In the collective place name recognition process, the matching result in step SF2 is further combined with the single character place name recognition result, and the obtained address candidates can be filtered using place name verification means. Then, the accuracy of place name recognition is improved by performing the processing after step SF3. Similarly, the recognition result of step SF7 can be combined with the single character place name authentication result, and the obtained address candidates can be filtered using place name verification means. The place name verification means 5 and the single character place name authentication means E1 will be described later.

図９に、本発明の実施の形態にかかる再帰的一括住所認識方法を例示的に示す。 FIG. 9 exemplarily shows a recursive collective address recognition method according to an embodiment of the present invention.

図９では、はじめに入力された手書き文字画像が
［文字１１］

であると仮定する。この場合、キー文字パスは、「市−区」となる。そして、キー文字抽出手段２を用いて、入力された画像から「市−区」パスに含まれるキー文字を抽出する。 In FIG. 9, the handwritten character image input first is [character 11].

Assume that In this case, the key character path is “city-ku”. Then, the key character extraction means 2 is used to extract key characters included in the “city-ward” path from the input image.

次に、第１のキー文字によって区切られた画像を抽出することにより、地名領域Ｒ１およびＲ２を切り出す。図８に示すように、ステップＳＦ２において地名領域Ｒ１が一括認識され、Ｎｃ個（この例では３個）の候補ＳＲ１が出力される。さらに、各候補について、照合距離が出力される。照合距離は、地名領域Ｒ１と地名辞書中の地名との非類似度を示す。地名領域Ｒ１は、
［文字９］

、「北京」、
［文字１３］

のそれぞれと比較され、それぞれ照合距離は、２３０、２４０、３１０となる。したがって、Ｒ１は、「北京」および
［文字１３］

よりも
［文字９］

である可能性が高く、かつ、
［文字１３］

よりも「北京」である可能性が高いことになる。 Next, the place name areas R1 and R2 are cut out by extracting an image delimited by the first key characters. As shown in FIG. 8, the place name region R1 is collectively recognized in step SF2, and Nc (three in this example) candidate SR1 are output. Further, the collation distance is output for each candidate. The collation distance indicates the dissimilarity between the place name region R1 and the place name in the place name dictionary. The place name area R1 is
[Character 9]

,"Beijing",
[Character 13]

The comparison distances are 230, 240, and 310, respectively. Therefore, R1 is "Beijing" and [letter 13]

Than [character 9]

Is likely, and
[Character 13]

More likely to be “Beijing”.

次に、Ｎｃ個の候補すべてについて、次のレベルの地名（たとえば、第２レベルの地名）に対して上位の行政単位とみなしうるか否かを評価する。
［文字１３］

と
［文字９］

との照合距離の間の差の
［文字９］

の照合距離に対する比は、（３１０−２３０）／２３０であり、比較的大きい。他方、「北京」と
［文字９］

との照合距離の間の差の
［文字９］

の照合距離に対する比は、（２４０−２３０）／２３０であり、比較的小さい。この場合、
［文字１３］

が、Ｒ１の実際の地名である可能性は低いため、ＳＲ２で除外される。具体的には、あらかじめ定められた閾値が０．２５であるとすれば、（３１０−２３０）／２３０＞０．２５、かつ、（２４０−２３０）／２３０＜０．２５であるため、
［文字１３］

は除外され、
［文字９］

および「北京」はそのまま上位行政単位として使用される。 Next, it is evaluated whether or not all Nc candidates can be regarded as higher administrative units with respect to the next level place name (for example, the second level place name).
[Character 13]

And [character 9]

[Character 9] of the difference between the reference distance and

Is a relatively large ratio of (310-230) / 230. On the other hand, "Beijing" and [letter 9]

[Character 9] of the difference between the reference distance and

The ratio to the collation distance is (240−230) / 230, which is relatively small. in this case,
[Character 13]

However, since it is unlikely that it is the actual place name of R1, it is excluded by SR2. Specifically, if the predetermined threshold is 0.25, (310-230) / 230> 0.25 and (240-230) / 230 <0.25,
[Character 13]

Are excluded,
[Character 9]

And “Beijing” is used as a higher administrative unit as it is.

次に、上記のステップを繰り返して、第２レベルの地名認識を行う。前のレベルが
［文字９］

であれば、地名辞書にもとづいて
［文字９］

に包含されるすべての地区の実際の地名パターンを地名領域Ｒ２中の地名と比較し、Ｎｃ個の候補を出力する。これらの候補およびその照合距離をＳＲ３に示す。前のレベルが「北京」であれば、地名辞書にもとづいて「北京」に包含されるすべての地区の実際の地名パターンを地名領域Ｒ２中の地名と比較し、Ｎｃ個の候補を出力する。これらの候補およびその照合距離をＳＲ４に示す。 Next, the above steps are repeated to perform second level place name recognition. The previous level is [Character 9]

Then, based on the place name dictionary [Character 9]

Are compared with the place names in the place name region R2, and Nc candidates are output. These candidates and their collation distances are shown in SR3. If the previous level is “Beijing”, the actual place name patterns of all the areas included in “Beijing” are compared with the place names in the place name region R2 based on the place name dictionary, and Nc candidates are output. These candidates and their collation distances are shown in SR4.

結果として、ＳＲ５に示すように、すべての候補パス（たとえば、
［文字９］

＋
［文字１４］

）をそれ以外の他の候補パスと比較した場合、平均照合距離の平均値が最短となるのは「北京」＋
［文字１０］

であり、その平均照合距離は（２４０＋１６０）／２＝２００となる。したがって、
［文字１１］

が認識結果として出力される。 As a result, all candidate paths (eg,
[Character 9]

+
[Character 14]

) Is compared to other candidate paths, the average average collation distance is “Beijing” +
[Character 10]

The average collation distance is (240 + 160) / 2 = 200. Therefore,
[Character 11]

Is output as a recognition result.

図１０は、本発明の実施の形態に係る単一文字地名認識手段を示すブロック図である。 FIG. 10 is a block diagram showing single character place name recognition means according to the embodiment of the present invention.

上述のとおり、単一文字地名認識手段および地名検証手段は、一括地名認識により取得された、可能性のある複数の地名候補をフィルタリングすることができる。加えて、単一文字地名認識手段および地名検証手段は、一括地名認識手段４の照合評価手段の評価結果（すなわち地名パス）における最短の平均照合距離により近い結果（もしあれば）をフィルタリングして決定することができる。 As described above, the single character place name recognition unit and the place name verification unit can filter a plurality of possible place name candidates acquired by batch place name recognition. In addition, the single character place name recognition means and place name verification means determine the result (if any) closer to the shortest average collation distance in the evaluation result (ie, place name path) of the collation evaluation means of the collective place name recognition means 4 can do.

図１０に示すように、単一文字地名認識手段Ｅ１は、特徴抽出手段６１、辞書照合手段６２、および単一文字辞書部５９を備える。 As shown in FIG. 10, the single character place name recognition unit E1 includes a feature extraction unit 61, a dictionary collation unit 62, and a single character dictionary unit 59.

特徴抽出手段６１は、地名領域抽出手段３により抽出された地名領域内に含まれ、かつ、単一文字領域検出手段１によって出力された単一の文字領域の特徴をひとつずつ抽出する。次に、辞書照合手段６２が、特徴抽出手段６１により得られた単一文字領域の各々の特徴を、単一文字辞書部５９内に記憶された単一文字の中国語単語の特徴と照合する。そして、各単一文字領域の１からＮ番目までの候補が出力される。 The feature extracting means 61 extracts the features of the single character area included in the place name area extracted by the place name area extracting means 3 and outputted by the single character area detecting means 1 one by one. Next, the dictionary collating unit 62 collates each feature of the single character area obtained by the feature extracting unit 61 with the feature of the single-character Chinese word stored in the single character dictionary unit 59. Then, the first to Nth candidates for each single character area are output.

図１１は、本発明の実施の形態に係る地名検証手段５を示すブロック図である。 FIG. 11 is a block diagram showing the place name verification means 5 according to the embodiment of the present invention.

単一文字地名認識手段Ｅ１の地名認識結果、および、一括地名認識手段４により得られた地名認識結果に基づいて、地名検証手段５は、これらの検証および組み合わせを行う。図１１は、本発明の実施の形態に係る地名検証手段５を示すブロック図である。図１１に示すように、地名検証手段５は、編集距離計算手段５５と、一括認識・個別文字認識組み合わせ手段５８とを備える。 Based on the place name recognition result of the single character place name recognition means E1 and the place name recognition result obtained by the collective place name recognition means 4, the place name verification means 5 performs these verifications and combinations. FIG. 11 is a block diagram showing the place name verification means 5 according to the embodiment of the present invention. As shown in FIG. 11, the place name verification unit 5 includes an edit distance calculation unit 55 and a batch recognition / individual character recognition combination unit 58.

編集距離計算手段５５は、単一文字地名認識手段Ｅ１により得られた地名結果と、キー文字の定義に合致し、かつ、前レベルの地名から認識された住所に包含されるすべての地名との編集距離を計算する。一括認識・個別文字認識組み合わせ手段５８は、一括地名認識手段４により得られた結果と、単一文字地名認識手段Ｅ１により得られた結果とを、編集距離計算手段５５を用いて組み合わせる。一括地名認識手段４により得られた候補シーケンスまたは候補列に従い、一括認識・個別文字認識組み合わせ手段５８は、各地名の全体的順位を計算し、最終的に第１位の地名を出力する。なお、編集距離とは、２つの文字列間がどの程度異なっているかを表わす数値であり、例えば「東京都」と「東束都」の間の編集距離は「京」を「束」の違いのみなので「１」となる。編集距離はレーベンシュタイン距離とも呼ばれる。この場合も小さいほど両者が近いことになる。 The edit distance calculation means 55 edits the place name result obtained by the single character place name recognition means E1 and all place names that match the definition of the key character and are included in the address recognized from the place name of the previous level. Calculate the distance. The batch recognition / individual character recognition combination means 58 uses the edit distance calculation means 55 to combine the result obtained by the batch place name recognition means 4 and the result obtained by the single character place name recognition means E1. In accordance with the candidate sequence or candidate string obtained by the collective place name recognizing means 4, the collective recognition / individual character recognition combination means 58 calculates the overall rank of each place name and finally outputs the first place name. Note that the edit distance is a numerical value indicating how different two character strings are different. For example, the edit distance between “Tokyo” and “Higashizukato” is the difference between “Kyo” and “Bunch”. Because it is only, it becomes “1”. The edit distance is also called Levenshtein distance. In this case as well, the smaller they are, the closer they are.

地名検証の具体的処理は以下のとおりである。 The specific processing of place name verification is as follows.

まず、編集距離計算手段５５は、ダイナミック・プログラミングを用いて（非特許文献３および４を参照）、キー文字により区切られた合理的な地名であり、かつ、上位レベルにおいて得られた地名領域に包含される地名と、単一文字地名認識手段Ｅ１により得られた結果Ｗとの間の（以下に定義する）式（１）により定義される編集距離を計算する。編集距離とは、挿入、置換、削除等の操作による単一文字部の認識結果から合理的な地名のオペランドへの変換を意味する（オペランドについては後述する）。したがって、合理的な地名の編集距離が小さい場合、地名と個別文字認識結果との差はきわめて小さく、他方、合理的な地名の編集距離が大きい場合、地名と個別文字認識結果との差は大きい。 First, the edit distance calculating means 55 uses the dynamic programming (see Non-Patent Documents 3 and 4) to obtain a place name area that is a reasonable place name separated by key characters and obtained at a higher level. The edit distance defined by the equation (1) (defined below) between the included place name and the result W obtained by the single character place name recognition means E1 is calculated. The edit distance means conversion from a recognition result of a single character part to an operand of a reasonable place name by an operation such as insertion, replacement, or deletion (the operand will be described later). Therefore, when the rational place name editing distance is small, the difference between the place name and the individual character recognition result is very small. On the other hand, when the rational place name editing distance is large, the difference between the place name and the individual character recognition result is large. .

本発明では、編集距離を以下のように定義する。 In the present invention, the edit distance is defined as follows.

Ｓはｎ次元ベクトルであり、ｎ個の文字を含む地名を指し、ＷはＮ×ｍの行列であり、Ｗ_ijは入力された地名画像中のｉ番目の単一文字領域または結合要素のｊ番目の認識候補を指し、Ｗ_iはｉ番目の単一文字領域または結合要素の認識結果、すなわち、ｍ次元の認識候補リストを指すものとする。ＳとＷとの間の編集距離は、挿入、削除、置換等によって候補を調整することを考慮すれば、ＷをＳに変換した場合に、もっとも小さいコストとなる。Ｓ_kとＷ_iとの間のコストは、以下のように計算される。

式（１）において、ｋは入力された地名画像中の単一文字領域の総数であり、ｗは当該地名画像中の各単一文字領域の認識候補の数である。 S is an n-dimensional vector, indicating a place name including n characters, W is an N × m matrix, and W _ij is the i-th single character region or j-th connected element in the inputted place name image. Let W _i denote the recognition result of the i-th single character region or coupling element, that is, the m-dimensional recognition candidate list. The edit distance between S and W is the smallest cost when W is converted to S, considering that candidates are adjusted by insertion, deletion, replacement, or the like. The cost between S _k and W _i is calculated as follows:

In Equation (1), k is the total number of single character regions in the input place name image, and w is the number of recognition candidates for each single character region in the place name image.

したがって、結合要素の認識候補Ｗ_i中に照合すべき地名Ｓの文字Ｓ_kが存在する場合、候補列Ｗ_i中においてその文字が位置づけられる順位が高ければ高いほど（すなわち、照合距離がＷ_ij=Ｓ_kを満たす小さなｊが存在すれば）、照合コストは小さくなり、Ｗ_iがＳ_kである可能性が高くなる。逆に、候補列Ｗ_i中においてその文字が位置づけられる順位が低ければ低いほど（すなわち、照合距離がＷ_ij=Ｓ_kを満たす大きなｊが存在すれば）、照合コストは大きくなり、Ｗ_iがＳ_kである可能性が低くなる。 Therefore, when the character S _{k of the} place name S to be collated exists in the recognition candidate W _i of the coupling element, the higher the rank in which the character is positioned in the candidate column W _i (that is, the collation distance is W _ij If there is a small j satisfying = S _k ), the matching cost is reduced and the possibility that W _i is S _k increases. Conversely, the lower the order in which the character is positioned in the candidate string W _i (that is, if there is a large j that satisfies the collation distance W _ij = S _k ), the collation cost increases and W _i becomes smaller. The possibility of being S _k is reduced.

一括認識・個別文字認識組み合わせ手段５８は、編集距離計算手段５５を利用して、一括認識結果と、単一文字地名認識結果とを以下のステップによって組み合わせる。 The collective recognition / individual character recognition combination means 58 uses the edit distance calculation means 55 to combine the collective recognition result and the single character place name recognition result in the following steps.

（１）上位レベルの地名に包含され、かつ、現時点でのキー文字によって区切られる、各地名Ａ_iの重み付け値を計算する。重み付け値は、以下の式（２）によって計算される。
Weight(A_i) = (1 − t1) × Rank_ED(A_i)＋ t1 × i （２）
式（２）において、Ａ_iは一括地名認識手段４により取得されたｉ番目の地名候補であり、ＥＤ(Ａ_i)はＡ_iの編集距離であり、Ｒａｎｋ＿ＥＤ(Ａ_i)はＡ_iと個別文字認識候補行列との間の編集距離の順位であり、ｔ１はｒｏｕｎｄ(ＥＤ(Ａ_i))／ｋに等しく、ｒｏｕｎｄ(ＥＤ(Ａ_i))はＥＤ(Ａ_i)を四捨五入することを示す。 (1) The weight value of each place name A _i included in the upper level place name and separated by the current key character is calculated. The weight value is calculated by the following equation (2).
Weight (A _i ) = (1 − t1) × Rank_ED (A _i ) + t1 × i (2)
In equation (2), A _i is the i-th place name candidate acquired by the collective place name recognition means 4, ED (A _i ) is the edit distance of A _i , and Rank_ED (A _i ) is individually independent of A _i It is the rank of the edit distance with the character recognition candidate matrix, t1 is equal to round (ED (A _i )) / k, and round (ED (A _i )) indicates that ED (A _i ) is rounded off. .

（２）上位レベルの地名に包含され、かつ、現時点でのキー文字によって区切られるすべての地名Ａ_iの重み付け値を昇順で順位付けし、重み付け値が最小のＡ_iを現時点で認識された地名とする。 (2) is included in the higher level of place names, and place names ranks weighting values of all place names A _i, delimited by the key character at the current time in ascending order, the weighting value is recognized at present the smallest A _i And

Ａ_iの編集距離が著しく小さい場合には、個別文字認識結果の信頼性が高いこととなり、より大きな重み付け値(１−ｔ１)がＲａｎｋ＿ＥＤ(Ａ_i)に割り当てられる。Ａ_iの編集距離が著しく大きい場合には、ｔ１が著しく大きくなり、一括認識部から得られた候補順位ｉが重み付け計算において重要な役割を果たすようになる。つまり、Ａ_iの編集距離が十分に小さければ個別文字認識の結果は信頼でき、他方、すべての合理的な住所の編集距離が十分大きければ個別文字認識結果はあまり信頼できない。この場合には、一括地名認識手段から得られた認識結果を組み合わされた地名として出力する必要がある。このように、一括地名認識手段から得られる認識結果と、個別文字認識の結果とが組み合わされるため、システムの正確性が著しく向上する。 When the edit distance of A _i is extremely small, the reliability of the individual character recognition result is high, and a larger weighting value (1-t1) is assigned to Rank_ED (A _i ). When the editing distance of A _i is remarkably large, t1 becomes remarkably large, and the candidate rank i obtained from the collective recognition unit plays an important role in the weighting calculation. That is, if the editing distance of A _i is sufficiently small, the result of individual character recognition is reliable, while if the editing distance of all reasonable addresses is sufficiently large, the result of individual character recognition is not very reliable. In this case, it is necessary to output the recognition result obtained from the collective place name recognition means as a combined place name. Thus, since the recognition result obtained from the collective place name recognition means and the result of the individual character recognition are combined, the accuracy of the system is remarkably improved.

図１２に、本発明に係る住所検証の一例を示す。 FIG. 12 shows an example of address verification according to the present invention.

図１２において、（Ａ）および（Ｂ）に示される入力された住所画像の実際の住所は、
［文字１５］

である。ここで、第１レベルの地名
［文字１６］

がすでに認識され、キー文字
［文字１７］

もすでに認識されていると仮定する。（Ａ）に、「二盛一」の画像領域から単一文字領域検出手段１により得られた単一文字領域の個別文字認識結果の例を示す。Ｗ₁、Ｗ₂、およびＷ₃は、それぞれ、単一文字領域「二」、「盛」、「一」に対応する認識候補列を示す。候補の数は３と仮定する。（Ｂ）に、「二盛一」の画像領域を一括認識することによって得られた地名候補の例を示す。ここで、
［文字１６］

市には、
［文字１７］

がつく地名は３つしかなく、それらはそれぞれ「二盛」、「二盛一」、「二旺一」であると仮定する。 In FIG. 12, the actual address of the input address image shown in (A) and (B) is
[Character 15]

It is. Here, the first level place name [character 16]

Is already recognized and the key character [character 17]

Is also already recognized. (A) shows an example of an individual character recognition result of a single character area obtained by the single character area detection means 1 from an image area of “Niimoriichi”. W ₁ , W ₂ , and W ₃ indicate recognition candidate strings corresponding to single character areas “two”, “high”, and “one”, respectively. Assume that the number of candidates is three. (B) shows an example of a place name candidate obtained by collectively recognizing the image area of “Niimoriichi”. here,
[Character 16]

The city has
[Character 17]

It is assumed that there are only three place names that are ticked, and that they are “Nisemori”, “Nishimori”, and “Ninoichi”, respectively.

地名検証によって、（Ａ）における行列Ｗ＝｛Ｗ₁,Ｗ₂,Ｗ₃｝と、
［文字１６］

市の領域に包含され、かつ、
［文字１７］

によって区切られるすべての地名（すなわち、「二盛」、「二盛一」、「二旺一」）との間の編集距離を計算する。たとえば、「二盛」とＷとの間の最小編集距離は、ダイナミック・プログラミングによって、「二」をＷ₁と照合し、「盛」をＷ₂と照合し、無効文字（ＮＵＬＬとラベリングする）を追加してこれをＷ₃と照合することで計算される。この場合の照合コストは以下のようになる。 By the place name verification, the matrix W = {W ₁ , W ₂ , W ₃ } in (A) and
[Character 16]

Included in the city territory, and
[Character 17]

The edit distance between all place names (ie, “Niimori”, “Niimoriichi”, “Ninoichi”) delimited by is calculated. For example, the minimum edit distance between the W and the "two-Sheng", where the dynamic programming, to match the "two" W ₁ and, the "Sheng" against the W _2, (to NULL and labeling) invalid character This is calculated by matching the W ₃ by adding. The verification cost in this case is as follows.

Cost("二", W₁) = (2 − 1) / (3 × 3) = 1 / 9;
Cost("盛", W₂) = (1 − 1) / (3 × 3) = 0;
Cost(NULL, W₃) = 1. Cost ("Two", W ₁ ) = (2 − 1) / (3 × 3) = 1/9;
Cost ("Sai", W ₂ ) = (1 − 1) / (3 × 3) = 0;
Cost (NULL, W ₃ ) = 1.

したがって、Ｗと「二盛」との間の編集距離は、１／９＋０＋１＝１０／９となる。 Therefore, the edit distance between W and “Niimori” is 1/9 + 0 + 1 = 10/9.

同様に、Ｗと「二盛一」との間の編集距離は以下のように計算される。
Cost("二", W₁) = (2 − 1) / (3 × 3) = 1 / 9;
Cost("盛", W₂) = (1 − 1) / (3 × 3) = 0;
Cost("一", W₃) = (2 − 1) / (3 × 3) = 1 / 9.
したがって、Ｗと「二盛一」との間の編集距離は、１／９＋０＋１／９＝２／９となる。 Similarly, the edit distance between W and “Nisseiichi” is calculated as follows.
Cost ("Two", W ₁ ) = (2 − 1) / (3 × 3) = 1/9;
Cost ("Sai", W ₂ ) = (1 − 1) / (3 × 3) = 0;
Cost ("One", W ₃ ) = (2 − 1) / (3 × 3) = 1/9.
Therefore, the edit distance between W and “Futakuichi” is 1/9 + 0 + 1/9 = 2/9.

同様に、Ｗと「二旺一」との間の編集距離は以下のように計算される。
Cost("二", W₁) = (2 − 1) / (3 × 3) = 1 / 9;
Cost("旺", W₂) = 1;
Cost("一", W₃) = (2 − 1) / (3 × 3) = 1 / 9.
したがって、Ｗと「二旺一」との間の編集距離は、１／９＋１＋１／９＝１１／９となる。 Similarly, the edit distance between W and “Two-one” is calculated as follows.
Cost ("Two", W ₁ ) = (2 − 1) / (3 × 3) = 1/9;
Cost ("旺", W ₂ ) = 1;
Cost ("One", W ₃ ) = (2 − 1) / (3 × 3) = 1/9.
Therefore, the edit distance between W and “Two-by-one” is 1/9 + 1 + 1/9 = 11/9.

したがって、
［文字１６］

市に包含されるすべての地名
［文字１７］

の編集距離の順位付けシーケンスは、「二盛一」、「二盛」、「二旺一」となる。 Therefore,
[Character 16]

All place names included in the city [Character 17]

The editing distance ranking sequence is “Niimoriichi”, “Niimori”, “Ninoichi”.

このとき「二盛一」の編集距離１／９が最も小さいため、検証ルール（１）により「二盛一」が正しい地名であると検証される。 At this time, since the edit distance 1/9 of “Nisseiichi” is the shortest, it is verified by the verification rule (1) that “Nisseiichi” is a correct place name.

図１３は、本発明の実施の形態に係るリジェクト手段を示すブロック図である。 FIG. 13 is a block diagram showing reject means according to the embodiment of the present invention.

図１３に示すように、リジェクト手段６は、特徴抽出手段６３と、リジェクト決定手段６４と、リジェクト学習手段６５とを備える。 As shown in FIG. 13, the rejection unit 6 includes a feature extraction unit 63, a rejection determination unit 64, and a rejection learning unit 65.

具体的には、特徴抽出手段６３は、地名画像領域のさまざまなレベルから特徴｛ｖ₁,ｖ₂,ｖ₃,…,ｖ_N｝を抽出する。ここで、ｖ_i（０＜ｉ＜Ｎ＋１）は、ｉレベルの地名から抽出された特徴であり、Ｎは地名レベルの総数である。リジェクト決定手段６４は、特徴抽出手段６３により得られた特徴を、リジェクト学習手段６５により得られたマッピング関数ｆに代入し、関数ｆ（ｖ₁,ｖ₂,ｖ₃,…,ｖ_N）の関数値を計算する。この値がゼロより大きければ、認識結果は正しい住所とされる。この値がゼロ以下の場合は、認識結果はリジェクトされる。 Specifically, the feature extraction unit 63 extracts features {v ₁ , v ₂ , v ₃ ,..., V _N } from various levels of the place name image region. Here, v _i (0 <i <N + 1) is a feature extracted from an i-level place name, and N is the total number of place name levels. The reject determining means 64 substitutes the feature obtained by the feature extracting means 63 into the mapping function f obtained by the reject learning means 65, and obtains the function f (v ₁ , v ₂ , v ₃ ,..., V _N ). Calculate the function value. If this value is greater than zero, the recognition result is a correct address. If this value is less than or equal to zero, the recognition result is rejected.

リジェクト学習手段６５は、実際の住所画像から集められた学習セットを用いて学習を行う。各住所サンプルに対する各レベルの地名認識の照合距離が特徴として集められる。住所サンプルが正しく認識される場合、住所画像には承認のラベリングがされるか、＋１のラベリングがされる。それ以外の場合には、住所画像にはリジェクトのラベリングがされるか、−１のラベリングがされる。 The reject learning means 65 performs learning using a learning set collected from actual address images. The collation distance for each level of place name recognition for each address sample is collected as a feature. If the address sample is recognized correctly, the address image is labeled for approval or +1. In other cases, the address image is rejected or -1 labeled.

上記の処理は、非特許文献５に基づく。マッピング関数（以下、分類子と呼ぶ）を構築するのは、学習データにおける特徴に基づいて自動的に分類を実行するためである。この分類子は、新しい住所を評価し、関数出力が正である（＞０）か負である（＜０）かによって認識結果をリジェクトするか承認するかを決定するためのリジェクトルールとして用いられる。この関数は、住所の質に適応的に適合するリジェクトルールを自動的に構築できるため、住所認識の信頼性が高まる。 The above processing is based on Non-Patent Document 5. The reason why the mapping function (hereinafter referred to as a classifier) is constructed is to automatically perform classification based on features in the learning data. This classifier is used as a reject rule to evaluate a new address and decide whether to reject or accept the recognition result depending on whether the function output is positive (> 0) or negative (<0). . This function can automatically build reject rules that adaptively adapt to the quality of the address, increasing the reliability of address recognition.

図１４は、本発明の実施の形態に係る住所認識装置を示す詳細なブロック図である。 FIG. 14 is a detailed block diagram showing the address recognition apparatus according to the embodiment of the present invention.

図１４に示すように、本発明に係る住所認識装置は、単一文字領域検出手段１と、キー文字候補抽出手段２０と、キー文字パス選択手段１５と、地名領域抽出手段３と、単一文字地名認識手段Ｅ１と、一括地名認識手段４と、地名検証手段５と、個別文字認識手段６９と、リジェクト手段６とを備える。単一文字領域検出手段１は、結合要素ラベリング手段３４と、単一文字領域候補決定手段３５と、平均文字サイズ計算手段３６と、画素投影手段３７と、一時的分離ポイント候補抽出手段３８と、一時的分離ポイント候補決定手段３９とを備えてなる。 As shown in FIG. 14, the address recognition apparatus according to the present invention includes a single character area detecting means 1, a key character candidate extracting means 20, a key character path selecting means 15, a place name area extracting means 3, a single character place name. Recognizing means E1, collective place name recognizing means 4, place name verifying means 5, individual character recognizing means 69, and rejecting means 6 are provided. The single character area detecting means 1 includes a combination element labeling means 34, a single character area candidate determining means 35, an average character size calculating means 36, a pixel projecting means 37, a temporary separation point candidate extracting means 38, Separation point candidate determination means 39 is provided.

結合要素ラベリング手段３４は、入力された二値画像から結合要素をラベリングする。単一文字領域候補決定手段３５は、ラベリングされた要素のサイズ、ラベリングされた要素の位置、隣りあう要素からの距離等に基づいて結合要素各々を統合し、最終的に単一文字領域候補を決定する。 The coupling element labeling means 34 labels the coupling element from the input binary image. The single character area candidate determining means 35 integrates each combination element based on the size of the labeled element, the position of the labeled element, the distance from the adjacent element, etc., and finally determines the single character area candidate. .

平均文字サイズ計算手段３６は、他の結合要素とサイズが合致しないパターンを除去し、文字の平均サイズを計算する。 The average character size calculation means 36 removes a pattern whose size does not match that of other coupling elements, and calculates the average character size.

画素投影手段３７は、入力された住所画像を（画像が横向きに書かれている場合は）縦向きに、または、（画像が縦向きに書かれている場合は）横向きに投影する。縦向き（または横向き）の画像投影によって、横（または縦）方向における画像の各ポイントにある黒画素数が得られる。 The pixel projecting unit 37 projects the input address image vertically (when the image is written horizontally) or horizontally (when the image is written vertically). With vertical (or horizontal) image projection, the number of black pixels at each point of the image in the horizontal (or vertical) direction is obtained.

一時的分離ポイント候補抽出手段３８および一時的分離ポイント候補決定手段３９は、単一文字領域決定手段３５によって出力された新しい結合要素を、実際の文字領域へと分離するために使用される。すなわち、両者は、結合された文字を個々の単一文字へと分離するために用いられる。 Temporary separation point candidate extraction means 38 and temporary separation point candidate determination means 39 are used to separate the new combined elements output by single character area determination means 35 into actual character areas. That is, both are used to separate the combined characters into individual single characters.

一時的分離ポイント候補決定手段３９は、一時的分離ポイント候補抽出手段３８によって得られた一時的分離ポイントが隣りあう文字領域の真の分離ポイントであるか否かを決定する。 Temporary separation point candidate determination means 39 determines whether or not the temporary separation point obtained by temporary separation point candidate extraction means 38 is a true separation point of adjacent character regions.

キー文字辞書４１は、すべてのキー文字の特徴を記録および記憶する。 The key character dictionary 41 records and stores the characteristics of all key characters.

キー文字候補抽出手段２０は、特徴抽出手段４２、辞書照合手段４３、およびキー文字候補決定手段４４を備える。 The key character candidate extraction unit 20 includes a feature extraction unit 42, a dictionary collation unit 43, and a key character candidate determination unit 44.

特徴抽出手段４２は、単一文字領域検出手段１により分離された領域の特徴を抽出する。辞書照合手段４３は、特徴抽出手段４２により抽出された特徴と、キー文字辞書４１に記憶された特徴とを照合し、１からＮ番目の文字候補を出力する。次に、キー文字候補決定手段４４は、キー文字候補が真のキー文字であるか否かを決定する。当該キー文字候補の照合距離の第１文字候補の照合距離に対する比が所定の閾値よりも小さい場合、当該キー文字候補は真のキー文字である。 The feature extraction means 42 extracts the features of the areas separated by the single character area detection means 1. The dictionary collating unit 43 collates the feature extracted by the feature extracting unit 42 with the feature stored in the key character dictionary 41 and outputs the 1st to Nth character candidates. Next, the key character candidate determination means 44 determines whether or not the key character candidate is a true key character. When the ratio of the collation distance of the key character candidate to the collation distance of the first character candidate is smaller than a predetermined threshold, the key character candidate is a true key character.

住所階層構造知識部２４は、住所階層構造の知識を記憶する。たとえば、キー文字「省」はキー文字「市」を包含し得るが、キー文字「市」はキー文字「省」を包含し得ない。 The address hierarchy structure knowledge unit 24 stores knowledge of the address hierarchy structure. For example, the key character “Province” may include the key character “City”, but the key character “City” cannot include the key character “Province”.

キー文字パス選択手段１５は、キー文字パス決定手段２１とキー文字パスフィルタ手段２２とを備える。 The key character path selection unit 15 includes a key character path determination unit 21 and a key character path filter unit 22.

キー文字パス決定手段２１は、住所階層構造知識部２４を参照して、合理的なキー文字パスを決定する。キー文字パスフィルタ手段２２は、複数のキー文字パスを記憶し、各パスにおけるキー文字間の平均照合距離を計算し、最終的にＫ個の主要パスを出力する。ここで、Ｋは所定の正の定数である。 The key character path determining means 21 refers to the address hierarchical structure knowledge unit 24 to determine a reasonable key character path. The key character path filter means 22 stores a plurality of key character paths, calculates an average collation distance between key characters in each path, and finally outputs K main paths. Here, K is a predetermined positive constant.

地名領域抽出手段３は、特定のキー文字パス内のキー文字によって区切られた地名領域を抽出する。 The place name area extraction means 3 extracts place name areas delimited by key characters in a specific key character path.

地名辞書部５４は、住所中の各地名の特徴ベクトルを記憶する。 The place name dictionary unit 54 stores a feature vector of each name in the address.

一括地名認識手段４は、地名抽出手段３により抽出された地名を順次一括認識する。一括地名認識手段４は、地名照合手段２７と、地名候補評価手段２８と、再帰的照合評価手段２９とを備える。 The collective place name recognition unit 4 sequentially recognizes the place names extracted by the place name extraction unit 3. The collective place name recognition means 4 includes place name collating means 27, place name candidate evaluating means 28, and recursive collating and evaluating means 29.

地名照合手段２７は、さまざまなレベルの地名に対応する画像から特徴を抽出し、地名辞書部５４に記憶された地名特徴と照合し、それによって１からＮｃ番目の候補を出力する。地名候補評価手段２８は、１からＮｃ番目の候補を評価し、下位の住所に対して上位の行政単位となるいくつかの候補を決定する。再帰的照合評価手段２９は、すべての住所候補の平均距離を計算し、そこから平均照合距離が最小である住所を最終的な認識結果として選択する。 The place name collation means 27 extracts features from images corresponding to place names at various levels, collates with the place name features stored in the place name dictionary unit 54, and thereby outputs the 1st to Ncth candidates. The place name candidate evaluation means 28 evaluates the 1st to Ncth candidates, and determines some candidates that are upper administrative units for lower addresses. The recursive matching evaluation means 29 calculates an average distance of all address candidates, and selects an address having the smallest average matching distance as a final recognition result.

単一文字辞書部５９は、住所内の各文字の特徴ベクトルを記憶する。 The single character dictionary unit 59 stores a feature vector of each character in the address.

単一文字地名認識手段Ｅ１は、単一文字領域検出手段１から出力された単一文字領域を順次認識する。これらの単一文字領域は、地名領域抽出手段３により抽出された地名領域内に含まれるものである。単一文字地名認識手段Ｅ１は、特徴抽出手段６１と辞書照合手段６２とを備える。 The single character place name recognition unit E1 sequentially recognizes the single character regions output from the single character region detection unit 1. These single character areas are included in the place name area extracted by the place name area extracting means 3. The single character place name recognition unit E1 includes a feature extraction unit 61 and a dictionary collation unit 62.

特徴抽出手段６１は、単一文字領域検出手段１により得られた単一文字領域の特徴ベクトルを抽出する。これらの単一文字領域は、地名領域抽出手段３により抽出された地名領域内に含まれるものである。辞書照合手段６２は、次に、特徴抽出手段６１により得られた特徴ベクトルと単一文字辞書部５９に記憶された特徴ベクトルとを照合し、各単一文字領域の１からＮ番目の候補を出力する。 The feature extraction means 61 extracts the feature vector of the single character area obtained by the single character area detection means 1. These single character areas are included in the place name area extracted by the place name area extracting means 3. Next, the dictionary collating unit 62 collates the feature vector obtained by the feature extracting unit 61 with the feature vector stored in the single character dictionary unit 59, and outputs the 1st to Nth candidates of each single character region. .

地名検証手段５は、単一文字地名認識手段Ｅ１により得られた地名認識結果と、一括地名認識手段４により取得された地名認識結果とを検証し、組み合わせる。地名検証手段６８は、編集距離計算手段５５と一括認識・個別文字認識組み合わせ手段５８とを備える。 The place name verification means 5 verifies the place name recognition result obtained by the single character place name recognition means E1 and the place name recognition result acquired by the collective place name recognition means 4 and combines them. The place name verification means 68 includes an edit distance calculation means 55 and a collective recognition / individual character recognition combination means 58.

編集距離計算手段５５は、単一文字地名認識手段Ｅ１により得られた地名結果と、キー文字の定義に合致し、かつ、前のレベルの地名から認識された住所に包含されるすべての地名との間の編集距離を計算する。一括認識・個別文字認識組み合わせ手段５８は、一括地名認識手段４により得られた結果と、単一文字地名認識手段Ｅ１により得られた結果とを組み合わせる。一括認識・個別文字認識組み合わせ手段５８は、計算した各地名を順位付けし、最終的に第１位の地名を出力する。 The edit distance calculation means 55 calculates the place name result obtained by the single character place name recognition means E1 and all place names that match the definition of the key character and are included in the address recognized from the place name of the previous level. Calculate the edit distance between. The batch recognition / individual character recognition combination means 58 combines the result obtained by the batch place name recognition means 4 and the result obtained by the single character place name recognition means E1. The collective recognition / individual character recognition combination means 58 ranks the calculated names of places and finally outputs the first place name.

個別文字認識手段６９は、地名でもキー文字でもないパターンを認識する。個別文字認識手段６９は、特徴抽出手段６１と辞書照合手段６２とを備える。 The individual character recognition means 69 recognizes a pattern that is neither a place name nor a key character. The individual character recognition unit 69 includes a feature extraction unit 61 and a dictionary collation unit 62.

リジェクト学習手段６５は、ひとまとまりの住所に基づいてサポートベクターマシーン分類子の学習を行う。なお、サポートベクターマシーンとは、２クラスの分類に優れた方式であり、住所が４階層（省−市−区−街）あるときは、全部で４単語あるので、正しい４単語の距離値の並びとそうでないものの距離値の並びをあらかじめ学習しておく。次に入力された４階層の文字列から４単語の距離の並びを算出し、それが学習したうちの正しい側かそうでない側のどちらに属するかを判定する、というものである。 The reject learning means 65 learns a support vector machine classifier based on a group of addresses. Note that the support vector machine is an excellent method for classification into two classes, and when there are four levels (province-city-district-town) of the address, there are four words in total. Learn in advance the sequence of distance values for sequences and those that are not. Next, a sequence of distances of four words is calculated from the input character string of the four layers, and it is determined whether it belongs to the correct side or not to be learned.

リジェクト手段６は、認識結果を承認するか否かを決定するものであり、特徴抽出手段６３と、リジェクト決定手段６４とを備える。特徴抽出手段６３は特徴ベクトルを抽出する。リジェクト決定手段６４は、リジェクト学習手段６５から出力されるサポートベクターマシーン分類子に基づき、認識結果を承認するか否かを決定し出力する。 The reject unit 6 determines whether or not to approve the recognition result, and includes a feature extraction unit 63 and a reject determination unit 64. The feature extraction unit 63 extracts feature vectors. Based on the support vector machine classifier output from the reject learning unit 65, the reject determining unit 64 determines whether to recognize the recognition result and outputs it.

最後に、結果がリジェクトされなかった場合、リジェクト手段６は認識結果を出力する。結果がリジェクトされた場合は、リジェクト手段６はエラーを出力する。 Finally, if the result is not rejected, the reject means 6 outputs the recognition result. If the result is rejected, the reject means 6 outputs an error.

これまでの本発明の実施の形態の説明は例示的なものであり、本質的に他の例を排除するものではなく本願を明確化するものであり、開示される特定の形態に本発明を限定するものではない。多くの変更および変形が可能であることはこの分野の通常の技術を有する者には明らかであろう。実施の形態の選択および説明は、本発明の実際の原理および応用例をよりよく説明することを意図するものであり、それによってこの分野の技術を有する他の者が、本発明のさまざまな実施例への応用および特殊な応用のための本発明の変更を理解できるようにすること意図するものである。たとえば、本発明は、日本語または韓国語で書かれた手書きの地名の認識にも適用できる。さらに、本発明は、類似度の決定に照合距離を利用しているが、類似度は照合距離以外の方法を用いて決定することもできる。たとえば、特徴ベクトル間のコサイン傾斜、特徴ベクトル間のブロック距離等を用いることができる。したがって、本発明における照合距離は、この分野の技術を有する者が想起できる類似度の明確な量の表現として広義に捉えられねばならない。 The foregoing descriptions of the embodiments of the present invention have been given by way of illustration, and are not intended to preclude other examples, but are to clarify the present application. It is not limited. It will be apparent to those skilled in the art that many modifications and variations are possible. The selection and description of the embodiments are intended to better illustrate the actual principles and applications of the present invention, so that others having skill in this field can implement various implementations of the present invention. It is intended to be able to understand the modifications of the present invention for example applications and special applications. For example, the present invention can be applied to recognition of handwritten place names written in Japanese or Korean. Furthermore, although the present invention uses the collation distance to determine the similarity, the similarity can be determined using a method other than the collation distance. For example, a cosine gradient between feature vectors, a block distance between feature vectors, or the like can be used. Therefore, the collation distance in the present invention must be broadly understood as an expression of a definite amount of similarity that can be recalled by those skilled in the art.

（付記１）複数の単一文字領域からのキー文字の抽出を複数の単一文字領域の各々について実行するキー文字抽出プログラムであって、
前記単一文字領域に対応する画像から特徴ベクトルを抽出する特徴ベクトル抽出手順と、
前記特徴ベクトル抽出手順により抽出された特徴ベクトルと、あり得るすべてのキー文字を記憶したキー文字辞書におけるキー文字の特徴ベクトルとを照合し、あり得るすべてのキー文字候補を検索する照合手順と、
前記照合手順により検索された１つ以上のキー文字候補をそれぞれ有する複数の単一文字領域を認識する場合に、正当なキー文字パスを決定するキー文字パス決定手順と、
をコンピュータに実行させることを特徴とするキー文字抽出プログラム。 (Supplementary note 1) A key character extraction program for executing extraction of a key character from a plurality of single character regions for each of the plurality of single character regions,
A feature vector extraction procedure for extracting a feature vector from an image corresponding to the single character region;
A collation procedure for collating the feature vector extracted by the feature vector extraction procedure with a key character feature vector in a key character dictionary storing all possible key characters, and searching for all possible key character candidates;
A key character path determination procedure for determining a valid key character path when recognizing a plurality of single character regions each having one or more key character candidates searched by the matching procedure;
Key character extraction program characterized by causing a computer to execute

（付記２）前記キー文字辞書は、行政区域レベルを表す文字の組を記憶したことを特徴とする付記１に記載のキー文字抽出プログラム。 (Supplementary note 2) The key character extraction program according to supplementary note 1, wherein the key character dictionary stores a set of characters representing an administrative district level.

（付記３）前記照合手順は、前記キー文字辞書に含まれるあり得るキー文字であって、前記特徴ベクトル抽出手順により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応するあり得るキー文字をキー文字候補として決定することを特徴とする付記１に記載のキー文字抽出プログラム。 (Additional remark 3) The said collation procedure is a key character which can be included in the said key character dictionary, Comprising: The feature vector with which the collation distance between the feature vectors extracted by the said feature vector extraction procedure is smaller than a predetermined threshold value The key character extraction program according to appendix 1, wherein a possible key character corresponding to is determined as a key character candidate.

（付記４）複数の単一文字領域からのキー文字の抽出を複数の単一文字領域の各々について実行するキー文字抽出方法であって、
前記単一文字領域に対応する画像から特徴ベクトルを抽出する特徴ベクトル抽出工程と、
前記特徴ベクトル抽出工程により抽出された特徴ベクトルと、あり得るすべてのキー文字を記憶したキー文字辞書におけるキー文字の特徴ベクトルとを照合し、あり得るすべてのキー文字候補を検索する照合工程と、
前記照合工程により検索された１つ以上のキー文字候補をそれぞれ有する複数の単一文字領域を認識する場合に、正当なキー文字パスを決定するキー文字パス決定工程と、
を含んだことを特徴とするキー文字抽出方法。 (Supplementary note 4) A key character extraction method for executing extraction of key characters from a plurality of single character areas for each of the plurality of single character areas,
A feature vector extraction step of extracting a feature vector from an image corresponding to the single character region;
Collating the feature vector extracted by the feature vector extracting step with the feature vector of the key character in the key character dictionary storing all possible key characters, and searching for all possible key character candidates;
A key character path determining step for determining a valid key character path when recognizing a plurality of single character regions each having one or more key character candidates searched by the matching step;
Key character extraction method characterized by including

（付記５）前記キー文字辞書は、行政区域レベルを表す文字の組を記憶したことを特徴とする付記４に記載のキー文字抽出方法。 (Additional remark 5) The said key character dictionary memorize | stored the group of the character showing the administrative district level, The key character extraction method of Additional remark 4 characterized by the above-mentioned.

（付記６）前記照合工程は、前記キー文字辞書に含まれるあり得るキー文字であって、前記特徴ベクトル抽出工程により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応するあり得るキー文字をキー文字候補として決定することを特徴とする付記４に記載のキー文字抽出方法。 (Additional remark 6) The said collation process is a key character which can be contained in the said key character dictionary, Comprising: The collation distance between the feature vectors extracted by the said feature vector extraction process is smaller than a predetermined threshold value The key character extraction method according to appendix 4, wherein a possible key character corresponding to is determined as a key character candidate.

（付記７）前記照合工程により単一文字領域に対するあり得るキー文字候補が決定され、照合距離に基づいてキー文字候補の順位付けをおこなうことにより第１位となったあり得るキー文字候補が第１キー文字候補とされた場合に、キー文字候補の照合距離と第１キー文字候補の照合距離との間の差の第１キー文字の照合距離に対する比を算出し、当該比が所定の閾値よりも小さくなる照合距離であるキー文字候補をキー文字候補結果として決定するキー文字候補決定工程をさらに含んだことを特徴とする付記６に記載のキー文字抽出方法。 (Additional remark 7) The key character candidate with respect to a single character area is determined by the said collation process, and the key character candidate which became the 1st place by ranking a key character candidate based on collation distance is 1st. When the key character candidate is selected, a ratio of the difference between the collation distance of the key character candidate and the collation distance of the first key character candidate to the collation distance of the first key character is calculated. The key character extraction method according to appendix 6, further comprising a key character candidate determination step of determining a key character candidate having a collation distance that becomes smaller as a key character candidate result.

（付記８）前記キー文字パス決定工程により正当なキー文字パスが複数決定された場合に、所定のルールに従って複数の正当なキー文字パスから一部の正当なキー文字パスをフィルタリングするキー文字パスフィルタリング工程をさらに含んだことを特徴とする付記４〜７のいずれか１つに記載のキー文字抽出方法。 (Supplementary Note 8) A key character path for filtering a part of valid key character paths from a plurality of valid key character paths according to a predetermined rule when a plurality of valid key character paths are determined in the key character path determining step. The key character extraction method according to any one of supplementary notes 4 to 7, further comprising a filtering step.

（付記９）前記所定のルールは、妥当なキー文字パスに含まれるキー文字候補の平均照合距離を算出し、すべてのキー文字パスを平均照合距離が昇順となるように順位付けし、第１の所定数のキー文字パスのみを選択するルールであることを特徴とする付記８に記載のキー文字抽出方法。 (Supplementary Note 9) The predetermined rule calculates an average collation distance of key character candidates included in a valid key character path, ranks all key character paths so that the average collation distance is in ascending order, The key character extraction method according to appendix 8, wherein the rule is to select only a predetermined number of key character paths.

（付記１０）複数の単一文字領域からキー文字を抽出するキー文字抽出装置であって、
前記単一文字領域の各々に対応する画像から特徴ベクトルを抽出する特徴ベクトル抽出手段と、
あり得るすべてのキー文字のみを記憶するキー文字辞書部と、
前記特徴ベクトル抽出手段により抽出された特徴ベクトルと、前記キー文字辞書部に記憶されたあり得るキー文字の特徴ベクトルとを照合し、あり得るすべてのキー文字候補を検索する照合手段と、
前記照合手段により検索された１つ以上のキー文字候補をそれぞれ有する複数の単一文字領域を認識する場合に、正当なキー文字パスを決定するキー文字パス決定手段と、
を備えたことを特徴とするキー文字抽出装置。 (Supplementary note 10) A key character extraction device for extracting a key character from a plurality of single character regions,
Feature vector extraction means for extracting a feature vector from an image corresponding to each of the single character regions;
A key character dictionary that stores only all possible key characters;
Collating means for collating the feature vector extracted by the feature vector extracting means with the feature vector of possible key characters stored in the key character dictionary unit, and searching for all possible key character candidates;
A key character path determining unit that determines a valid key character path when recognizing a plurality of single character regions each having one or more key character candidates searched by the matching unit;
A key character extraction device comprising:

（付記１１）前記キー文字辞書部に記憶されるあり得るキー文字は、行政区域レベルを表す文字であることを特徴とする付記１０に記載のキー文字抽出装置。 (Additional remark 11) The key character extraction device of Additional remark 10 characterized by the possible key character memorize | stored in the said key character dictionary part is a character showing an administrative district level.

（付記１２）前記照合手段は、前記キー文字辞書部に記憶されたあり得るキー文字であって、前記特徴ベクトル抽出手段により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応するあり得るキー文字をキー文字候補として決定することを特徴とする付記１０に記載のキー文字抽出装置。 (Additional remark 12) The said collation means is a possible key character memorize | stored in the said key character dictionary part, Comprising: The collation distance between the feature vectors extracted by the said feature vector extraction means is smaller than a predetermined threshold value The key character extraction device according to appendix 10, wherein a possible key character corresponding to the feature vector is determined as a key character candidate.

（付記１３）前記照合手段により単一文字領域に対するあり得るキー文字候補が決定され、照合距離に基づいてキー文字候補の順位付けをおこなうことにより第１位となったあり得るキー文字候補が第１キー文字候補とされた場合に、キー文字候補の照合距離と第１キー文字候補との間の差の第１キー文字の照合距離に対する比を算出し、当該比が所定の閾値よりも小さくなる照合距離であるキー文字候補をキー文字候補結果として決定するキー文字候補決定手段をさらに備えたことを特徴とする付記１２に記載のキー文字抽出装置。 (Additional remark 13) The key character candidate with respect to a single character area is determined by the said collation means, and the key character candidate which became the 1st place by ranking a key character candidate based on collation distance is 1st. When a key character candidate is selected, a ratio of a difference between the collation distance of the key character candidate and the first key character candidate to the collation distance of the first key character is calculated, and the ratio becomes smaller than a predetermined threshold value. 13. The key character extraction device according to appendix 12, further comprising key character candidate determination means for determining a key character candidate that is a collation distance as a key character candidate result.

（付記１４）前記キー文字パス決定手段により正当なキー文字パスが複数決定された場合に、所定のルールに従って複数の正当なキー文字パスから一部の正当なキー文字パスをフィルタリングするキー文字パスフィルタリング手段をさらに備えたことを特徴とする付記１０〜１３のいずれか１つに記載のキー文字抽出装置。 (Supplementary Note 14) A key character path for filtering a part of valid key character paths from a plurality of valid key character paths according to a predetermined rule when a plurality of valid key character paths are determined by the key character path determining means. 14. The key character extraction device according to any one of appendices 10 to 13, further comprising filtering means.

（付記１５）前記所定のルールは、妥当なキー文字パスに含まれるキー文字候補の平均照合距離を算出し、すべてのキー文字パスを平均照合距離が昇順となるように順位付けし、第１の所定数のキー文字パスのみを選択するルールであることを特徴とする付記１４に記載のキー文字抽出装置。 (Supplementary Note 15) The predetermined rule calculates an average collation distance of key character candidates included in a valid key character path, ranks all the key character paths so that the average collation distance is in ascending order, 15. The key character extraction device according to appendix 14, wherein the rule selects only a predetermined number of key character paths.

（付記１６）ある行政レベルの地名を含む地名画像の地名を認識する一括地名認識手順をコンピュータに実行させる一括地名認識プログラムであって、
前記一括地名認識手順は、
前記地名画像から特徴ベクトルを抽出する抽出手順と、
前記抽出手順により抽出された特徴ベクトルと、地名辞書に記憶された地名の特徴ベクトルとを照合し、あり得るすべての地名候補を検索する照合手順と、
をコンピュータに実行させ、
前記地名辞書に記憶された地名のうち、前記抽出手順により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応する地名をあり得る地名候補として決定し、
さらに前記地名画像の単一文字領域の各々に含まれる文字の文字候補を認識する個別文字認識手順と、
前記個別文字認識手順により得られた認識結果に基づいて、前記一括地名認識手順により決定された地名候補のフィルタリングをおこなう地名検証手順と、
をコンピュータに実行させることを特徴とする一括地名認識プログラム。 (Supplementary Note 16) A collective place name recognition program for causing a computer to execute a collective place name recognition procedure for recognizing place names of place name images including place names at a certain administrative level,
The collective place name recognition procedure includes:
An extraction procedure for extracting a feature vector from the place name image;
A collation procedure for collating the feature vectors extracted by the extraction procedure with the feature vectors of the place names stored in the place name dictionary and searching for all possible place name candidates;
To the computer,
Of the place names stored in the place name dictionary, a place name corresponding to a feature vector whose collation distance with the feature vector extracted by the extraction procedure is smaller than a predetermined threshold is determined as a possible place name candidate,
Further, an individual character recognition procedure for recognizing character candidates for characters included in each single character region of the place name image,
Based on the recognition result obtained by the individual character recognition procedure, a place name verification procedure for filtering place name candidates determined by the collective place name recognition procedure;
Collective place name recognition program characterized by causing computer to execute.

（付記１７）前記一括地名認識手順は、前記照合手順によりあり得る地名候補が複数得られた場合に、所定のルールに従って複数のあり得る地名候補をフィルタリングする地名候補評価手順をさらにコンピュータに実行させることを特徴とする付記１６に記載の一括地名認識プログラム。 (Supplementary Note 17) The batch place name recognition procedure further causes the computer to execute a place name candidate evaluation procedure for filtering a plurality of possible place name candidates according to a predetermined rule when a plurality of possible place name candidates are obtained by the matching procedure. The collective place name recognition program according to supplementary note 16, characterized by:

（付記１８）前記所定のルールは、複数のあり得る地名候補を照合距離に基づいて順位付けし、順位付けにより第１位となったあり得る地名候補を第１地名候補とした場合に、地名候補の照合距離と第１地名候補の照合距離との間の差の第１地名候補の照合距離に対する比を算出し、当該比が所定の閾値よりも小さくなる照合距離である地名候補のみを選択するルールであることを特徴とする付記１７に記載の一括地名認識プログラム。 (Supplementary Note 18) The predetermined rule is that a plurality of possible place name candidates are ranked based on the collation distance, and a place name candidate that is ranked first by the ranking is set as the first place name candidate. Calculate the ratio of the difference between the collation distance of the candidate and the collation distance of the first place name candidate to the collation distance of the first place name candidate, and select only the place name candidates whose collation distance is smaller than a predetermined threshold. 18. The collective place name recognition program according to appendix 17, wherein

（付記１９）ある行政レベルの地名を含む地名画像の地名を認識する一括地名認識工程を含む一括地名認識方法であって、
前記一括地名認識工程は、
前記地名画像から特徴ベクトルを抽出する抽出工程と、
前記抽出工程により抽出された特徴ベクトルと、地名辞書に記憶された地名の特徴ベクトルとを照合し、あり得るすべての地名候補を検索する照合工程と、
を含み、
前記地名辞書に記憶された地名のうち、前記抽出工程により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応する地名をあり得る地名候補として決定し、
さらに前記地名画像の単一文字領域の各々に含まれる文字の文字候補を認識する個別文字認識工程と、
前記個別文字認識工程により得られた認識結果に基づいて、前記一括地名認識工程により決定された地名候補のフィルタリングをおこなう地名検証工程と、
を含んだことを特徴とする一括地名認識方法。 (Supplementary note 19) A collective place name recognition method including a collective place name recognition process for recognizing place names of place name images including place names at a certain administrative level,
The collective place name recognition process includes:
An extraction step of extracting a feature vector from the place name image;
A collation step of collating the feature vector extracted by the extraction step with the feature vector of the place name stored in the place name dictionary and searching for all possible place name candidates;
Including
Of the place names stored in the place name dictionary, a place name corresponding to a feature vector whose collation distance with the feature vector extracted by the extraction step is smaller than a predetermined threshold is determined as a possible place name candidate,
Further, an individual character recognition step for recognizing character candidates for characters included in each single character region of the place name image,
Based on the recognition result obtained by the individual character recognition step, a place name verification step for filtering place name candidates determined by the collective place name recognition step;
Collective place name recognition method characterized by including.

（付記２０）前記一括地名認識工程は、前記照合工程によりあり得る地名候補が複数得られた場合に、所定のルールに従って複数のあり得る地名候補をフィルタリングする地名候補評価工程を含んだことを特徴とする付記１９に記載の一括地名認識方法。 (Supplementary note 20) The collective place name recognition step includes a place name candidate evaluation step of filtering a plurality of possible place name candidates according to a predetermined rule when a plurality of possible place name candidates are obtained by the matching step. The collective place name recognition method according to appendix 19.

（付記２１）前記所定のルールは、複数のあり得る地名候補を照合距離に基づいて順位付けし、順位付けにより第１位となったあり得る地名候補を第１地名候補とした場合に、地名候補の照合距離と第１地名候補の照合距離との間の差の第１地名候補の照合距離に対する比を算出し、当該比が所定の閾値よりも小さくなる照合距離である地名候補のみを選択するルールであることを特徴とする付記２０に記載の一括地名認識方法。 (Supplementary Note 21) The predetermined rule is that a plurality of possible place name candidates are ranked based on a collation distance, and a place name candidate that is ranked first by the ranking is set as a first place name candidate. Calculate the ratio of the difference between the collation distance of the candidate and the collation distance of the first place name candidate to the collation distance of the first place name candidate, and select only the place name candidates whose collation distance is smaller than a predetermined threshold. The collective place name recognition method according to appendix 20, wherein the rule is a rule for

（付記２２）前記地名検証工程は、前記一括地名認識工程により決定された地名候補の各々と、前記個別文字認識工程により得られた認識結果との間の編集距離を算出する工程と、各編集距離の順位を算出する工程と、前記編集距離の順位に基づいて、重みを算出する工程と、算出された重みに基づいて前記一括地名認識工程により得られた地名候補をフィルタリングする工程とを含むことを特徴とする付記１９、２０または２１に記載の一括地名認識方法。 (Supplementary note 22) The place name verification step includes a step of calculating an edit distance between each of the place name candidates determined by the collective place name recognition step and a recognition result obtained by the individual character recognition step, and each edit A step of calculating a rank of the distance; a step of calculating a weight based on the rank of the edit distance; and a step of filtering the place name candidates obtained by the collective place name recognition step based on the calculated weight. The collective place name recognition method according to appendix 19, 20 or 21, characterized in that:

（付記２３）前記一括地名認識工程は、地名の行政レベルが降順となっている複数の地名画像のそれぞれについて１つ以上の地名候補を認識した場合に、最上位レベルから最下位レベルまでの住所候補により形成されるパスの各々の平均照合距離を算出し、平均照合距離が最小のパスのみを住所パス候補として選択する照合評価工程をさらに含んだことを特徴とする付記１９、２０または２１に記載の一括地名認識方法。 (Supplementary Note 23) In the collective place name recognition step, when one or more place name candidates are recognized for each of a plurality of place name images in which the place name administrative level is in descending order, addresses from the highest level to the lowest level Additional description 19, 20 or 21 further comprising a collation evaluation step of calculating an average collation distance of each path formed by the candidates and selecting only a path having the smallest average collation distance as an address path candidate. Collective place name recognition method of description.

（付記２４）前記一括地名認識工程は、地名の行政レベルが降順となっている複数の地名画像のそれぞれについて１つ以上の地名候補を認識した場合に、最上位レベルから最下位レベルまでの住所候補により形成されるパスの各々の平均照合距離を算出し、平均照合距離が最小のパスのみを住所パス候補として選択する照合評価工程をさらに含んだことを特徴とする付記１９に記載の一括地名認識方法。 (Supplementary Note 24) In the collective place name recognition process, when one or more place name candidates are recognized for each of a plurality of place name images whose place names are in descending order, addresses from the highest level to the lowest level The collective place name according to appendix 19, further comprising a collation evaluation step of calculating an average collation distance of each of the paths formed by the candidates and selecting only a path having the smallest average collation distance as an address path candidate. Recognition method.

（付記２５）最小平均照合距離に近い平均照合距離を有するパスが複数ある場合に、複数のパスにおいて複数のあり得る地名候補を有する地名画像について個別文字認識を実行するパス個別文字認識工程と、前記パス個別文字認識工程において得られた認識結果に基づいて、地名画像のあり得る地名候補をフィルタリングするパス地名検証工程と、をさらに含み、前記地名検証工程は、地名画像のあり得る地名候補の各々と、前記個別文字認識工程により得られた認識結果との間の編集距離を算出する工程と、各編集距離の順位を算出する工程と、前記編集距離の順位に基づいて、重みを算出する工程と、算出された重みに基づいて地名を決定する工程とを含むことを特徴とする付記２４に記載の一括地名認識方法。 (Supplementary Note 25) When there are a plurality of paths having an average collation distance close to the minimum average collation distance, a path individual character recognition step of executing individual character recognition for a place name image having a plurality of possible place name candidates in the plurality of paths; A path place name verification step of filtering possible place name candidates of the place name image based on the recognition result obtained in the path individual character recognition step, and the place name verification step includes a place name candidate of the place name image. A step of calculating an edit distance between each and a recognition result obtained by the individual character recognition step, a step of calculating a rank of each edit distance, and a weight based on the rank of the edit distance 25. The collective place name recognition method according to appendix 24, comprising a step and a step of determining a place name based on the calculated weight.

（付記２６）前記照合評価工程により選択された住所パス候補を承認あるいはリジェクトする地名リジェクト工程をさらに含み、前記地名リジェクト工程は、各レベルの地名領域から特徴を抽出する工程と、各レベルの地名領域から抽出された特徴をリジェクト学習により得られたマッピング関数を用いて分析する工程と、分析の結果に基づいて住所パス候補を承認あるいはリジェクトする工程とを含んだことを特徴とする付記２５に記載の一括地名認識方法。 (Supplementary note 26) Further includes a place name rejecting step for approving or rejecting the address path candidate selected in the collation evaluation step, wherein the place name rejecting step includes a step of extracting features from the place name areas of each level, and a place name of each level Appendix 25 characterized by including a step of analyzing a feature extracted from a region using a mapping function obtained by reject learning, and a step of approving or rejecting an address path candidate based on a result of the analysis Collective place name recognition method of description.

（付記２７）一括地名認識装置であって、
入力された地名画像から特徴ベクトルを抽出する抽出手段と、
地名を記憶する地名辞書部と、
前記地名辞書部に記憶された地名の特徴ベクトルと前記抽出手段により抽出された特徴ベクトルとを照合し、あり得るすべての地名候補および照合距離を求め、地名辞書に記憶された地名のうち、前記抽出手段により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応する地名をあり得る地名候補として決定する照合手段と、
降順に並んだ行政レベルの地名を含む複数の地名画像が入力され、当該地名画像の認識により行政レベルの地名を降順に含む複数の住所パスが形成される場合に、複数の住所パス候補の評価および決定をおこなう照合評価手段と、
をさらに備えたことを特徴とする一括地名認識装置。 (Supplementary note 27) A collective place name recognition device,
Extracting means for extracting feature vectors from the input place name image;
A place name dictionary section for storing place names;
The place name feature vector stored in the place name dictionary unit is matched with the feature vector extracted by the extracting means to obtain all possible place name candidates and matching distances, and among the place names stored in the place name dictionary, Collating means for determining a place name corresponding to a feature vector whose matching distance between the feature vector extracted by the extracting means is smaller than a predetermined threshold as possible place name candidates;
Evaluation of multiple address path candidates when multiple place name images containing administrative level place names arranged in descending order are input and multiple place name paths containing administrative level place names are formed by recognition of the place name images. And a collation evaluation means for making a decision,
A collective place name recognition apparatus characterized by further comprising:

（付記２８）前記照合手段によりあり得る地名候補が複数得られた場合に、所定のルールに従って複数のあり得る地名候補をフィルタリングする地名候補評価手段をさらに備えたことを特徴とする付記２７に記載の一括地名認識装置。 (Supplementary note 28) The supplementary note 27, further comprising place name candidate evaluation means for filtering a plurality of possible place name candidates according to a predetermined rule when a plurality of possible place name candidates are obtained by the collating means. Collective place name recognition device.

（付記２９）前記所定のルールは、複数のあり得る地名候補を照合距離に基づいて順位付けし、順位付けにより第１位となったあり得る地名候補を第１地名候補とした場合に、地名候補の照合距離と第１地名候補の照合距離との間の差の第１地名候補の照合距離に対する比を算出し、当該比が所定の閾値よりも小さくなる照合距離である地名候補のみを選択するルールであることを特徴とする付記２８に記載の一括地名認識装置。 (Supplementary note 29) When the predetermined rule ranks a plurality of possible place name candidates based on the collation distance and sets the first place name candidate as the first place name candidate as the first place name candidate by the ranking, Calculate the ratio of the difference between the collation distance of the candidate and the collation distance of the first place name candidate to the collation distance of the first place name candidate, and select only the place name candidates whose collation distance is smaller than a predetermined threshold. 29. The collective place name recognition apparatus according to appendix 28, wherein

（付記３０）前記照合評価手段は、住所パスの各々について平均照合距離を計算し、平均照合距離が最小の住所パスを住所パス候補として決定することを特徴とする付記２７、２８または２９に記載の一括地名認識装置。 (Additional remark 30) The said collation evaluation means calculates an average collation distance about each address path, and determines an address path with the smallest average collation distance as an address path candidate, The additional remark 27, 28 or 29 characterized by the above-mentioned Collective place name recognition device.

（付記３１）前記照合評価手段により決定された住所パス候補を承認あるいはリジェクトするリジェクト手段をさらに備えたことを特徴とする付記３０に記載の一括地名認識装置。 (Supplementary note 31) The collective place name recognition apparatus according to supplementary note 30, further comprising reject means for approving or rejecting the address path candidate determined by the collation evaluation means.

（付記３２）前記リジェクト手段は、各地名領域から特徴を抽出する特徴抽出手段と、リジェクト学習を実行してマッピング関数を得るリジェクト学習手段と、前記マッピング関数を用い、各レベルの地名領域の抽出された特徴に基づいて前記住所パス候補を承認するかまたはリジェクトするかを決定するリジェクト決定手段とを備えたことを特徴とする付記３１に記載の一括地名認識装置。 (Additional remark 32) The rejection means uses the feature extraction means for extracting features from each name area, reject learning means for executing a rejection learning to obtain a mapping function, and extraction of place name areas at each level using the mapping function. 32. The collective place name recognition apparatus according to supplementary note 31, further comprising: a rejection determination unit that determines whether to approve or reject the address path candidate based on the made feature.

（付記３３）最小平均照合距離に近い平均照合距離を有する住所パスが複数存在する場合に、複数の住所パス中に複数のあり得る地名候補を有する地名画像に対して個別文字認識を実行し、または、前記照合手段が地名画像に対して複数のあり得る地名候補を決定した場合に、あり得る地名候補を複数有する地名画像に対して個別文字認識を実行する個別文字認識手段と、前記個別文字認識手段により得られた認識結果、および、前記照合評価手段による決定結果に基づいて、複数のあり得る地名候補を有する地名画像の地名候補をフィルタリングする地名検証手段と、をさらに備えたことを特徴とする付記２７〜３２のいずれか１つに記載の一括地名認識装置。 (Supplementary Note 33) When there are a plurality of address paths having an average collation distance close to the minimum average collation distance, individual character recognition is performed on a place name image having a plurality of possible place name candidates in the plurality of address paths, Alternatively, when the collating unit determines a plurality of possible place name candidates for the place name image, individual character recognition means for performing individual character recognition on the place name image having a plurality of possible place name candidates; and the individual character And a place name verification means for filtering place name candidates of place name images having a plurality of possible place name candidates based on the recognition result obtained by the recognition means and the determination result by the collation evaluation means. The collective place name recognition apparatus according to any one of Supplementary notes 27 to 32.

（付記３４）前記地名検証手段は、複数のあり得る地名候補を有する地名画像の地名候補の各々と、前記個別文字認識手段による認識結果との間の編集距離を計算する編集距離計算手段と、前記編集距離計算手段により計算された編集距離に基づいて複数のあり得る地名候補を有する地名画像の住所を決定する一括認識・個別文字認識組み合わせ手段と、を備えたことを特徴とする付記３３に記載の一括地名認識装置。 (Supplementary Note 34) The place name verification means includes an edit distance calculation means for calculating an edit distance between each place name candidate of a place name image having a plurality of possible place name candidates and a recognition result by the individual character recognition means, (Supplementary note 33) characterized by comprising batch recognition / individual character recognition combination means for determining addresses of place name images having a plurality of possible place name candidates based on the edit distance calculated by the edit distance calculation means. Collective place name recognition device of description.

（付記３５）単一文字領域を分離する文字分離工程、前記文字分離工程により分離された単一文字領域の各々に対してキー文字を抽出するキー文字抽出工程、前記キー文字抽出工程によるキー文字の抽出結果に基づいて地名を一括して認識する一括地名認識工程、前記一括地名認識工程による地名認識結果に基づいて住所を決定する住所決定工程、を含んだ住所認識方法であって、
前記キー文字抽出工程は、
前記単一文字領域に対応する画像から特徴ベクトルを抽出する特徴ベクトル抽出工程と、
前記特徴ベクトル抽出工程により抽出された特徴ベクトルと、すべてのあり得るキー文字を記憶したキー文字辞書におけるあり得るキー文字の特徴ベクトルとを照合し、あり得るすべてのキー文字候補を検索する照合工程と、
前記照合工程により検索された１つ以上のキー文字候補をそれぞれ有する複数の単一文字領域を認識する場合に、正当なキー文字パスを決定するキー文字パス決定工程と、
を含んだことを特徴とする住所認識方法。 (Supplementary Note 35) Character separation step for separating single character regions, key character extraction step for extracting key characters for each single character region separated by the character separation step, and extraction of key characters by the key character extraction step An address recognition method including a collective place name recognition step for collectively recognizing place names based on a result, an address determination step for determining an address based on a place name recognition result by the collective place name recognition step,
The key character extraction step includes:
A feature vector extraction step of extracting a feature vector from an image corresponding to the single character region;
A collation step of collating the feature vector extracted by the feature vector extraction step with a feature vector of possible key characters in a key character dictionary storing all possible key characters, and searching for all possible key character candidates. When,
A key character path determining step for determining a valid key character path when recognizing a plurality of single character regions each having one or more key character candidates searched by the matching step;
A method for recognizing an address, comprising:

（付記３６）前記キー文字抽出工程は、前記キー文字辞書に含まれるあり得るキー文字であって、前記特徴ベクトル抽出工程により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応するあり得るキー文字をキー文字候補として決定することを特徴とする付記３５に記載の住所認識方法。 (Additional remark 36) The said key character extraction process is a possible key character contained in the said key character dictionary, Comprising: The collation distance between the feature vectors extracted by the said feature vector extraction process is smaller than a predetermined threshold value 36. The address recognition method according to appendix 35, wherein a possible key character corresponding to the feature vector is determined as a key character candidate.

（付記３７）前記照合工程により単一文字領域に対するあり得るキー文字候補が決定され、照合距離に基づいてキー文字候補の順位付けをおこなうことにより第１位となったあり得るキー文字候補が第１キー文字候補とされた場合に、キー文字候補の照合距離と第１キー文字候補の照合距離との間の差の第１キー文字の照合距離に対する比を算出し、当該比が所定の閾値よりも小さくなる照合距離であるキー文字候補をキー文字候補結果として決定するキー文字候補決定工程をさらに含んだことを特徴とする付記３５に記載の住所認識方法。 (Additional remark 37) The key character candidate with respect to a single character area is determined by the collation process, and the key character candidate which became the first place by ranking the key character candidates based on the collation distance is the first. When the key character candidate is selected, a ratio of the difference between the collation distance of the key character candidate and the collation distance of the first key character candidate to the collation distance of the first key character is calculated. 36. The address recognition method according to supplementary note 35, further comprising a key character candidate determination step of determining a key character candidate having a matching distance that is smaller as a key character candidate result.

（付記３８）前記キー文字抽出工程は、前記キー文字パス決定工程により正当なキー文字パスが複数決定された場合に、所定のルールに従って複数の正当なキー文字パスから一部の正当なキー文字パスをフィルタリングするキー文字パスフィルタリング工程をさらに含んだことを特徴とする付記３５、３６または３７に記載の住所認識方法。 (Supplementary Note 38) In the key character extraction step, when a plurality of valid key character paths are determined in the key character path determination step, some valid key characters are extracted from a plurality of valid key character paths according to a predetermined rule. 38. The address recognition method according to appendix 35, 36 or 37, further comprising a key character path filtering step of filtering the path.

（付記３９）前記一括地名認識工程は、ある行政レベルの地名を含む地名画像の地名を認識する一括認識工程を含み、前記一括認識工程は、前記地名画像から特徴ベクトルを抽出する抽出工程と、前記抽出工程により抽出された特徴ベクトルと地名辞書に記憶された地名の特徴ベクトルとを照合し、あり得るすべての地名候補を検索する照合工程とを含み、前記地名辞書に記憶された地名のうち、前記抽出工程により抽出された特徴ベクトルとの間の照合距離が所定の閾値よりも小さい特徴ベクトルに対応する地名をあり得る地名候補として決定することを特徴とする付記３８に記載の住所認識方法。 (Supplementary Note 39) The collective place name recognition step includes a collective recognition step of recognizing place names of place name images including place names at a certain administrative level, and the collective recognition step includes an extraction step of extracting a feature vector from the place name images. A collation step of collating the feature vector extracted by the extraction step with the feature vector of the place name stored in the place name dictionary and searching for all possible place name candidates, and among the place names stored in the place name dictionary The address recognition method according to appendix 38, wherein a place name corresponding to a feature vector whose collation distance with the feature vector extracted by the extraction step is smaller than a predetermined threshold is determined as a possible place name candidate. .

（付記４０）前記一括認識工程は、前記照合工程によりあり得る地名候補が複数得られた場合に、所定のルールに従って複数のあり得る地名候補をフィルタリングする地名候補評価工程をさらに含み、前記所定のルールは、複数のあり得る地名候補を照合距離に基づいて順位付けし、順位付けにより第１位となったあり得る地名候補を第１地名候補とした場合に、地名候補の照合距離と第１地名候補の照合距離との間の差の第１地名候補の照合距離に対する比を算出し、当該比が所定の閾値よりも小さくなる照合距離である地名候補のみを選択するルールであることを特徴とする付記３９に記載の住所認識方法。 (Supplementary Note 40) The batch recognition step further includes a place name candidate evaluation step of filtering a plurality of possible place name candidates according to a predetermined rule when a plurality of possible place name candidates are obtained by the matching step, The rule ranks a plurality of possible place name candidates based on the collation distance, and sets the first place name candidate as the first place name candidate as the first place name candidate as a result of the ranking. The rule is to calculate a ratio of the difference between the collation distance of the place name candidates and the collation distance of the first place name candidates, and to select only the place name candidates whose collation distance is smaller than a predetermined threshold. The address recognition method according to appendix 39.

（付記４１）前記地名画像の単一文字領域の各々に含まれる文字の文字候補を認識する個別文字認識工程と、前記個別文字認識工程により得られた認識結果に基づいて、前記一括地名認識工程により決定された地名候補のフィルタリングをおこなう地名検証工程とをさらに含み、前記地名検証工程は、前記一括地名認識工程により決定された地名候補の各々と、前記個別文字認識工程により得られた認識結果との間の編集距離を算出する工程と、各編集距離の順位を算出する工程と、前記編集距離の順位に基づいて、式
Weight(A_i) = (1 − t1) × Rank_ED(A_i)＋ t1 × i
を用いることにより重みを算出する工程と、算出された重みに基づいて前記一括地名認識工程により得られた地名候補をフィルタリングする工程とを含み、Ａ_iは前記一括地名認識工程により得られたｉ番目の地名候補であり、ＥＤ(Ａ_i)はＡ_iの編集距離であり、Ｒａｎｋ＿ＥＤ(Ａ_i)は編集距離の順位であり、ｔ１はｒｏｕｎｄ(ＥＤ(Ａ_i))／ｋに等しく、ｒｏｕｎｄ(ＥＤ(Ａ_i))はＥＤ(Ａ_i)の四捨五入演算を意味し、ｋは前記地名画像における単一文字領域の総数であり、ｗｅｉｇｈｔ(Ａ_i)はｉ番目の地名候補の重みを意味することを特徴とする付記４０に記載の住所認識方法。 (Additional remark 41) Based on the recognition result obtained by the individual character recognition process which recognizes the character candidate of the character contained in each of the single character area | region of the said place name image, and the said individual character recognition process, by the said batch place name recognition process A place name verification step for filtering the determined place name candidates, wherein the place name verification step includes each of the place name candidates determined by the batch place name recognition step, and a recognition result obtained by the individual character recognition step; A step of calculating the edit distance between the steps, a step of calculating the rank of each edit distance, and the rank of the edit distance,
Weight (A _i ) = (1 − t1) × Rank_ED (A _i ) + t1 × i
And a step of filtering the place name candidates obtained by the collective place name recognition process based on the calculated weight, and A _i is the i obtained by the collective place name recognition process. a th place name candidate, ED (a _i) is the edit distance of a _i, a Rank_ED (a _i) is the edit distance of rank, t1 is round (ED (a _i)) equal to / k, round english (us) (ED (A _i )) means a rounding operation of ED (A _i ), k is the total number of single character areas in the place name image, and weight (A _i ) means the weight of the i-th place name candidate. 41. The address recognition method according to appendix 40, wherein

（付記４２）前記一括地名認識工程により地名の行政レベルが降順となっている複数の地名画像のそれぞれについて１つ以上の地名候補が認識された場合に、最上位レベルから最下位レベルまでの住所候補により形成されるパスの各々の平均照合距離を算出し、平均照合距離が最小のパスのみを住所パス候補として選択する照合評価工程を含んだことを特徴とする付記４１に記載の住所認識方法。 (Additional remark 42) When one or more place name candidates are recognized for each of a plurality of place name images in which the place name administrative level is descending in the collective place name recognition step, addresses from the highest level to the lowest level 42. The address recognition method according to appendix 41, further comprising a collation evaluation step of calculating an average collation distance of each of the paths formed by the candidates and selecting only a path having the smallest average collation distance as an address path candidate. .

（付記４３）前記照合評価工程により選択された住所パス候補を承認あるいはリジェクトする地名リジェクト工程をさらに含み、前記地名リジェクト工程は、各レベルの地名領域から特徴を抽出する工程と、各レベルの地名領域から抽出された特徴をリジェクト学習により得られたマッピング関数を用いて分析する工程と、分析の結果に基づいて住所パス候補を承認あるいはリジェクトする工程とを含んだことを特徴とする付記４２に記載の住所認識方法。 (Additional remark 43) It further includes the place name rejection process which approves or rejects the address path candidate selected by the said collation evaluation process, The said place name rejection process extracts the characteristic from the place name area | region of each level, The place name of each level Appendix 42 characterized by including a step of analyzing a feature extracted from a region using a mapping function obtained by reject learning, and a step of approving or rejecting an address path candidate based on a result of the analysis The address recognition method described.

以上のように、本発明は、キー文字抽出プログラム、キー文字抽出装置、キー文字抽出方法、一括地名認識プログラム、一括地名認識装置および一括地名認識方法に関し、特に、枠のない領域内に手書きされた文字から住所を読み取り認識するキー文字抽出プログラム、キー文字抽出装置、キー文字抽出方法、一括地名認識プログラム、一括地名認識装置および一括地名認識方法に有用である。 As described above, the present invention relates to a key character extraction program, a key character extraction device, a key character extraction method, a collective place name recognition program, a collective place name recognition apparatus, and a collective place name recognition method, and is particularly handwritten in an area without a frame. The present invention is useful for a key character extraction program, a key character extraction device, a key character extraction method, a collective place name recognition program, a collective place name recognition device, and a collective place name recognition method that recognize and read an address from a character.

図１は、従来の住所認識装置を示すブロック図である。FIG. 1 is a block diagram showing a conventional address recognition device. 図２は、図１の従来の住所認識装置に備えられ、一括地名認識を行う一括地名認識手段の処理を示す概略図である。FIG. 2 is a schematic diagram showing the processing of a collective place name recognition unit that is provided in the conventional address recognition apparatus of FIG. 1 and performs collective place name recognition. 図３は、本発明の実施例に係る住所認識装置を示すブロック図である。FIG. 3 is a block diagram illustrating an address recognition apparatus according to an embodiment of the present invention. 図４は、単一文字領域検出手段の一例を示すブロック図である。FIG. 4 is a block diagram showing an example of the single character area detecting means. 図５は、本発明の実施例に係るキー文字抽出手段を示すブロック図である。FIG. 5 is a block diagram showing key character extraction means according to an embodiment of the present invention. 図６は、本発明の実施例に係るキー文字候補抽出手段を示すブロック図である。FIG. 6 is a block diagram showing key character candidate extraction means according to the embodiment of the present invention. 図７は、本発明の実施例に係る一括住所認識手段を示すブロック図である。FIG. 7 is a block diagram showing collective address recognition means according to an embodiment of the present invention. 図８は、本発明の実施例に係る一括地名認識手段を示すフローチャートである。FIG. 8 is a flowchart showing the collective place name recognition means according to the embodiment of the present invention. 図９は、本発明の実施例に係る一括住所認識方法を例示的に示す図である。FIG. 9 is a diagram exemplarily illustrating a collective address recognition method according to an embodiment of the present invention. 図１０は、本発明の実施例に係る単一文字地名認識手段を示すブロック図である。FIG. 10 is a block diagram showing single character place name recognition means according to an embodiment of the present invention. 図１１は、本発明の実施例に係る地名検証手段を示すブロック図である。FIG. 11 is a block diagram showing place name verification means according to the embodiment of the present invention. 図１２は、本発明に係る地名検証方法を例示的に示す図である。FIG. 12 is a view exemplarily showing a place name verification method according to the present invention. 図１３は、本発明の実施例に係るリジェクト手段を示すブロック図である。FIG. 13 is a block diagram showing reject means according to the embodiment of the present invention. 図１４は、本発明の実施例に係る住所認識装置を示す詳細なブロック図である。FIG. 14 is a detailed block diagram illustrating an address recognition apparatus according to an embodiment of the present invention.

Explanation of symbols

１単一文字領域検出手段
２キー文字抽出手段
３地名領域抽出手段
４一括地名認識手段
５地名検証手段
６リジェクト手段
１６キー文字パス選択手段
２０キー文字候補抽出手段
２１キー文字パス決定手段
２２キー文字パスフィルタ手段
２４住所階層構造知識部
２７地名照合手段
２８地名候補評価手段
２９再帰的照合評価手段
３４結合要素ラベリング手段
３５単一文字領域候補決定手段
３６平均文字サイズ計算手段
３７画素投影手段
３８一時的分離ポイント候補抽出手段
３９一時的分離ポイント候補決定手段
４１キー文字辞書部
４２特徴抽出手段
４３辞書照合手段
４４キー文字候補決定手段
５４地名辞書部
５５編集距離計算手段
５８一括認識・個別文字認識組み合わせ手段
５９単一文字辞書部
６１特徴抽出手段
６２辞書照合手段
６３特徴抽出手段
６４リジェクト決定手段
６５リジェクト学習手段
６９個別文字認識手段
Ｅ１単一文字地名認識手段 DESCRIPTION OF SYMBOLS 1 Single character area detection means 2 Key character extraction means 3 Place name area extraction means 4 Collective place name recognition means 5 Place name verification means 6 Reject means 16 Key character path selection means 20 Key character candidate extraction means 21 Key character path determination means 22 Key character path Filter means 24 Address hierarchy structure knowledge part 27 Place name matching means 28 Place name candidate evaluation means 29 Recursive matching evaluation means 34 Joining element labeling means 35 Single character area candidate determination means 36 Average character size calculation means 37 Pixel projection means 38 Temporary separation point Candidate extraction means 39 Temporary separation point candidate determination means 41 Key character dictionary part 42 Feature extraction means 43 Dictionary collation means 44 Key character candidate determination means 54 Place name dictionary part 55 Edit distance calculation means 58 Collective recognition / individual character recognition combination means 59 One-character dictionary part 61 Feature extraction Stage 62 Dictionary collation means 63 feature extracting means 64 reject determining unit 65 reject learning means 69 individual character recognition means E1 single character place name recognition means

Claims

A collective place name recognition program that causes a computer to execute a collective place name recognition procedure for recognizing place names of place name images including place names at a certain administrative level,
The collective place name recognition procedure includes:
An extraction procedure for extracting a feature vector from the place name image;
A collation procedure for collating the feature vectors extracted by the extraction procedure with the feature vectors of the place names stored in the place name dictionary and searching for all possible place name candidates;
To the computer,
Of the place names stored in the place name dictionary, a place name corresponding to a feature vector whose collation distance with the feature vector extracted by the extraction procedure is smaller than a predetermined threshold is determined as a possible place name candidate,
Further, an individual character recognition procedure for recognizing character candidates for characters included in each single character region of the place name image,
Wherein the recognition result obtained by the individual character recognition procedure, the batch place name recognition procedure whether the different extent is between strings between the determined place name candidates calculated by, unlike the individual characters if small A place name verification procedure for performing filtering so that the recognition result obtained by the recognition procedure is superior, or if the difference is large, the place name candidates determined by the collective place name recognition procedure are superior ;
Collective place name recognition program characterized by causing computer to execute.

The collective place name recognition procedure further causes the computer to execute a place name candidate evaluation procedure for filtering a plurality of possible place name candidates according to a predetermined rule when a plurality of possible place name candidates are obtained by the collating procedure. The collective place name recognition program according to claim 1.

The predetermined rule is that when a plurality of possible place name candidates are ranked based on the collation distance, and the possible place name candidates ranked first by the ranking are set as the first place name candidates, the collation distance of the place name candidates This is a rule that calculates a ratio of the difference between the collation distance of the first place name candidate and the collation distance of the first place name candidate to the collation distance of the first place name candidate, and selects only the place name candidates whose collation distance is smaller than a predetermined threshold. The collective place name recognition program according to claim 2 characterized by things.

  A collective place name recognition device,
  Extracting means for extracting feature vectors from the input place name image;
  A place name dictionary section for storing place names;
  The place name feature vector stored in the place name dictionary unit is matched with the feature vector extracted by the extracting means to obtain all possible place name candidates and matching distances, and among the place names stored in the place name dictionary, Collating means for determining a place name corresponding to a feature vector whose matching distance between the feature vector extracted by the extracting means is smaller than a predetermined threshold as possible place name candidates;
  Evaluation of multiple address path candidates when multiple place name images containing administrative level place names arranged in descending order are input and multiple place name paths containing administrative level place names are formed by recognition of the place name images. And a collation evaluation means for making a decision,
  When there are a plurality of address paths having an average collation distance close to the minimum average collation distance, individual character recognition is performed on a place name image having a plurality of possible place name candidates in the plurality of address paths, or the collation Individual character recognition means for performing individual character recognition on a place name image having a plurality of possible place name candidates when the means determines a plurality of possible place name candidates for the place name image;
  Calculate how much the character string differs between the recognition result obtained by the individual character recognition means and the determination result by the collation evaluation means, and if the difference is small, the individual character recognition means A place name verification unit that performs filtering so that the obtained recognition result is superior, or when the difference is large, the determination result by the collation evaluation unit is superior;
  A collective place name recognition device characterized by comprising:

A collective place name recognition method including a collective place name recognition process for recognizing place names of place name images including place names at a certain administrative level,
The collective place name recognition process includes:
An extraction step of extracting a feature vector from the place name image;
A collation step of collating the feature vector extracted by the extraction step with the feature vector of the place name stored in the place name dictionary and searching for all possible place name candidates;
Including
Of the place names stored in the place name dictionary, a place name corresponding to a feature vector whose collation distance with the feature vector extracted by the extraction step is smaller than a predetermined threshold is determined as a possible place name candidate,
Further, an individual character recognition step for recognizing character candidates for characters included in each single character region of the place name image,
Wherein the recognition result obtained by the individual character recognition process, the batch place names recognized whether the different extent is between strings between the determined place name candidates calculated by step, unlike the individual characters if small A place name verification step for performing filtering so that the recognition result obtained by the recognition step is dominant, or if the difference is large, the place name candidates determined by the collective place name recognition step are dominant ;
Collective place name recognition method characterized by including.