JPH02308384A

JPH02308384A - Address recognizing device

Info

Publication number: JPH02308384A
Application number: JP1130709A
Authority: JP
Inventors: Masaaki Shizuno; 静野　正明
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1989-05-24
Filing date: 1989-05-24
Publication date: 1990-12-21

Abstract

PURPOSE: To shorten processing time and recognize addresses at high speed by designating, among the words used for recognition processing, the word at which address recognition should start as a keyword. CONSTITUTION: Among the words prepared in advance in a word dictionary 15, the word that is to be the starting point of address recognition, namely the word in the lowest hierarchy level of the address to be sorted, is registered as a keyword. Among the word candidates recognized by a word recognizing means 14, the words registered as keywords are retrieved, and each retrieved word is used as the starting point of collation. The word candidates are then collated successively toward the higher hierarchy levels, and a series of words that could be collated is recognized as the address by an address recognizing means 16. The number of words to be collated is thereby reduced, and the address can be recognized at high speed.

Description

DETAILED DESCRIPTION OF THE INVENTION [Object of the Invention] (Industrial Application Field) The present invention relates to an address recognition device that reads and recognizes an address, for example the destination written on a delivery item.

(Prior Art) In a conventional address recognition device, the character string constituting the address on a delivery item, which serves as sorting information, is read by, for example, an optical character reader. Words are recognized from the read character string by referring to a word dictionary, the address formed by combining the recognized words is recognized by referring to an address dictionary, and the delivery item is sorted on the basis of the recognized address.

In such an address recognition device, an address is recognized by searching the address dictionary in order from the words in the upper hierarchy levels of the address to the words in the lower levels. For example, to recognize the address "東京都足立区千住曙町" (Tokyo-to, Adachi-ku, Senju Akebono-cho), the address dictionary is searched in the order "東京都" (Tokyo-to) → "足立区" (Adachi-ku) → "千住" (Senju) → "曙町" (Akebono-cho), and the address is recognized by finding a matching entry in the address dictionary. With this method, however, every word candidate obtained by word recognition must be collated against the address dictionary, and collation proceeds partway even when it is already known that the address cannot be read through to the end, so the processing time becomes long.

(Problem to be Solved by the Invention) As described above, a device that recognizes words by collating sequentially from the upper hierarchy levels of the address must collate every word candidate against the address dictionary, and it performs collation partway even when it is known that the address cannot be read through to the end, so the processing time becomes long. The present invention eliminates this drawback, and its object is to provide an address recognition device that shortens the processing time and recognizes addresses at high speed.

[Structure of the Invention] (Means for Solving the Problems) The address recognition device of the present invention detects character information and recognizes an address by collating it with information in a dictionary, and is characterized by comprising: character reading means for reading a character string constituting an address; character recognition means for performing character recognition on the character string read by the character reading means and outputting character candidates; a word dictionary that stores, as keywords, those words among the words used for recognition processing at which address recognition should start; word recognition means for performing word recognition by collating the character candidates output from the character recognition means with the word dictionary and outputting word candidates; and address recognition means for retrieving, from among the word candidates output from the word recognition means, the words stored as keywords, and performing address recognition by sequentially collating the word candidates with each retrieved word as the starting point.

(Operation) In the present invention, among the words prepared in advance in the word dictionary, the words that should be the starting point of address recognition, that is, the words in the lowest hierarchy level of the addresses to be sorted, are registered as keywords. Among the word candidates recognized by the word recognition means, the words registered as keywords are retrieved; with each retrieved word as the starting point of collation, the word candidates are collated sequentially toward the upper hierarchy levels, and the series of words that could be collated is recognized as the address. This reduces the number of collations to be performed and allows addresses to be recognized at high speed.

(Embodiment) An embodiment of the present invention will now be described with reference to the drawings.

FIG. 1 schematically shows the address recognition device. Reference numeral 11 denotes a scanning section (photoelectric conversion section), which is arranged to face a delivery item P conveyed in the direction of arrow a along a conveyance path (not shown).

The scanning section 11 obtains a pattern signal by optically scanning the delivery item P and photoelectrically converting the result. It comprises, for example, a light source that illuminates the delivery item P and a self-scanning CCD image sensor that receives the reflected light and converts it into an electrical signal. The read signal output from the scanning section 11 is supplied to an address area detection section 12 and to a binarization circuit 20 in a character recognition section 13.

The address area detection section 12 is of a well-known type that detects, from among the various items of information written on the delivery item P, the area in which the address information is written, on the basis of the shape, position and so on of each area. Information on the area detected as the address area by the address area detection section 12, for example its coordinate values, is output to a selection circuit 21 in the character recognition section 13.

The character recognition section 13 comprises: the binarization circuit 20, which binarizes the read signal from the scanning section 11; the selection circuit 21, which selects and outputs only the binarized signal within the area specified by the coordinate values supplied from the address area detection section 12, that is, only the read signal of the address area out of the read signal from the scanning section 11; a character detection and segmentation circuit 22, which detects and cuts out, one character at a time, the characters of the address information in the address area supplied from the selection circuit 21; a normalization circuit 23, which normalizes and samples the segmented character information supplied from the character detection and segmentation circuit 22; and a recognition circuit 24, which recognizes the character information supplied from the normalization circuit 23 by, for example, matching it against the reference pattern for each character in a character dictionary 25, and outputs a plurality of character candidates. The output of the recognition circuit 24 of the character recognition section 13 is supplied to a word recognition section 14.
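The first two stages of the character recognition section (binarization circuit 20 and selection circuit 21) can be pictured in software as follows. This is only a minimal sketch: the patent describes hardware circuits and does not give a threshold or an image format, so the fixed threshold of 128, the list-of-rows image and the `(top, left, bottom, right)` box layout are all assumptions made for illustration.

```python
from typing import List, Tuple

Image = List[List[int]]  # grey levels, 0 (black) .. 255 (white); assumed format


def binarize(image: Image, threshold: int = 128) -> Image:
    """Map each grey level to 1 (ink) or 0 (background).

    The threshold value is an assumption; the source does not specify one.
    """
    return [[1 if px < threshold else 0 for px in row] for row in image]


def select_region(binary: Image, box: Tuple[int, int, int, int]) -> Image:
    """Keep only the pixels inside box = (top, left, bottom, right).

    top/left are inclusive, bottom/right are exclusive (assumed convention).
    """
    top, left, bottom, right = box
    return [row[left:right] for row in binary[top:bottom]]


scan = [[255, 10, 200],
        [0, 255, 30],
        [255, 255, 255]]
binary = binarize(scan)
address_area = select_region(binary, (0, 1, 2, 3))
```

Binarizing the whole scan first and cropping afterwards mirrors the order in the text, where the selection circuit 21 operates on the output of the binarization circuit 20.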

The word recognition section 14 recognizes words by collating the character candidates supplied from the character recognition section 13 with the words read out from a word dictionary 15, and outputs the plurality of word candidates it was able to recognize. Among the words stored in the word dictionary 15, those that must be recognized for address recognition processing to succeed, that is, the words in the lowest hierarchy level, which are used as sorting information according to the recognition result, carry in advance identification information indicating that the word is to be treated as a keyword. The word dictionary 15 is of a well-known type implemented in, for example, a ROM. The output of the word recognition section 14, that is, the plurality of word candidates, is supplied to an address recognition section 16.

The address recognition section 16 applies predetermined processing to the word candidates supplied from the word recognition section 14 to recognize the address, and outputs sorting designation data for the recognized address, that is, data indicating a sorting pocket (not shown) of the sorting section provided downstream of the address recognition device.

Next, the operation of the above configuration will be described with reference to FIGS. 2 to 4.

Suppose, for example, that the delivery item P shown in FIG. 1 is conveyed in the direction of arrow a and reaches the scanning section 11. The scanning section 11 optically scans the delivery item P and photoelectrically converts the result, and the pattern signal is supplied to the address area detection section 12. The address area detection section 12 then detects, from among the items of information written on the delivery item P, the address area in which the address information is written.

The output of the scanning section 11 is also supplied to the binarization circuit 20 in the character recognition section 13, where it is binarized and then output to the selection circuit 21.

The selection circuit 21 then selectively outputs to the character detection and segmentation circuit 22 only that part of the output of the binarization circuit 20 that corresponds to the coordinate information supplied from the address area detection section 12, that is, only the read signal of the address area. The character detection and segmentation circuit 22 detects and cuts out the characters of the address information in the supplied address area one character at a time and outputs them to the normalization circuit 23.

The normalization circuit 23 normalizes and samples the segmented character information and outputs it to the recognition circuit 24 as a binarized character string. This embodiment is described for the case in which a string of eight characters is input, as shown in FIG. 2(a).

The recognition circuit 24 recognizes the supplied character information by, for example, matching it against the reference patterns for the characters in the character dictionary 25, and outputs the result to the word recognition section 14. Because it is difficult to settle on a single recognition result for freely handwritten kanji and the like, the circuit outputs a plurality of character candidates as possible answers for each input character. In the illustrated example, four character candidates are output per character.

The word recognition section 14 recognizes words by collating the character candidates supplied from the character recognition section 13 with the words read out from the word dictionary 15, and outputs the resulting word string to the address recognition section 16. Specifically, it takes a part of the input character string, combines the character candidates within that part, and collates the combinations with the word dictionary to obtain word candidates. For example, combining the fourth, third and fourth answer candidates for the second, third and fourth characters, respectively, yields "関谷町" (Sekiya-cho), which matches a word contained in the word dictionary, so the word candidate "関谷町" is obtained as shown in FIG. 2(b). In FIG. 2(b), the words marked with "*" are keywords. A keyword is a word that must be recognized for address recognition processing to succeed; for example, the words in the lowest hierarchy level used when sorting by address are selected as keywords.
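The word-candidate step described above can be sketched in a few lines. Note the assumptions: FIG. 2(a) is not reproduced in this text, so the character-candidate lists below are invented to be consistent with the one detail the description gives (the fourth, third and fourth candidates at positions 2 to 4 spell 関谷町); the function name and the dictionary layout (word mapped to its keyword flag) are likewise hypothetical. Instead of enumerating every combination of character candidates, the sketch checks each dictionary word against each position, which yields the same set of matches.

```python
from typing import Dict, List, Tuple

# (word name, left end L, right end R, keyword flag); positions are 1-based
Candidate = Tuple[str, int, int, bool]


def find_word_candidates(char_candidates: List[List[str]],
                         dictionary: Dict[str, bool]) -> List[Candidate]:
    """Return every dictionary word that can be spelled from the candidates."""
    found: List[Candidate] = []
    n = len(char_candidates)
    for word, is_keyword in dictionary.items():
        for start in range(n - len(word) + 1):
            # The word fits here if each of its characters is among the
            # candidates recognized at the corresponding input position.
            if all(ch in char_candidates[start + i] for i, ch in enumerate(word)):
                found.append((word, start + 1, start + len(word), is_keyword))
    return sorted(found, key=lambda c: (c[1], c[2]))


# Invented candidate lists for an eight-character input (four per character).
CHAR_CANDIDATES = [
    ["足", "是", "定", "走"],
    ["立", "並", "位", "関"],
    ["区", "凶", "谷", "図"],
    ["千", "干", "十", "町"],
    ["住", "往", "任", "佳"],
    ["宮", "官", "営", "富"],
    ["元", "完", "天", "冗"],
    ["町", "田", "畑", "叮"],
]
# Hypothetical dictionary excerpt: True marks a keyword (lowest-level word).
DICTIONARY = {"足立区": False, "関谷町": True, "千住": False,
              "宮元町": True, "元町": True}

word_candidates = find_word_candidates(CHAR_CANDIDATES, DICTIONARY)
```

Each result carries the left and right end positions of the word in the input string, which is the information the address recognition section needs later to check that two words are adjacent.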

FIG. 3 shows the memory image of the word candidates shown in FIG. 2(b). In the figure, the "left word (LW)" is the word that stands immediately to the left of the "word name (NM)" in a candidate address. For example, for the address "東京都足立区足立" (Tokyo-to, Adachi-ku, Adachi), the left word of "足立" (Adachi) is "足立区" (Adachi-ku). When there is no left word, that is, when the word is in the top hierarchy level, as "東京都" (Tokyo-to) is in this example, "END" is stored as the left word.

The "left end position (L)" and "right end position (R)" indicate where the word candidate lies within the input character string. For example, if the input character string is "足立区千住宮元町" (Adachi-ku, Senju Miyamoto-cho), the left end of "宮元町" (Miyamoto-cho) is "6" and its right end is "8".
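One row of the memory image of FIG. 3 can be modelled as a small record. The field abbreviations NM, LW, L, R and K come from the description; the class name, the field names and the `locate` helper are hypothetical, introduced here only to show how the 1-based, inclusive L and R positions relate to the input string.

```python
from dataclasses import dataclass

END = "END"  # stored as the left word of a top-level word such as 東京都


@dataclass(frozen=True)
class WordCandidate:
    nm: str        # word name (NM)
    lw: str        # left word (LW): the word one level up, or END
    left: int      # left end position (L), 1-based
    right: int     # right end position (R), 1-based, inclusive
    keyword: bool  # keyword flag (K)


def locate(text: str, word: str, left_word: str, is_keyword: bool) -> WordCandidate:
    """Build a record for the first occurrence of `word` in `text`."""
    start = text.index(word)  # 0-based offset; raises ValueError if absent
    return WordCandidate(word, left_word, start + 1, start + len(word), is_keyword)


row = locate("足立区千住宮元町", "宮元町", "千住", True)
length = row.right - row.left + 1  # the quantity later added to register LNG
```

With inclusive end positions, the character count of a word is `R - L + 1`, which is exactly the value accumulated in step S5 of the flowchart.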

When such word candidates are obtained from the word recognition section 14, the address recognition section 16 starts the address recognition processing shown in the flowchart of FIG. 4.

First, the word candidates are searched for words that are keywords, and the address candidates containing them are selected.

That is, a variable n set in a counter (not shown) is initialized to "1" (step S1), and the keyword flag (K) field of the n-th word candidate is referred to in order to determine whether that word is a keyword (step S2).

If the word is determined not to be a keyword, the process branches to step S11, where the variable n is incremented, and it is then determined whether the word candidates have been examined through to the last one, that is, in the illustrated example, whether the variable n has reached "9" (step S12). If it is determined that the last word has not yet been examined, the process returns to step S2, and steps S2, S11 and S12 are repeated until a word candidate defined as a keyword appears.

When, in the course of this repetition, step S2 determines that a word candidate that is a keyword has been found, a register LNG (not shown), which stores the length of the words collated so far, that is, the length of the address candidate, is cleared to "0", and the current value of the variable n is set as a variable m in a register (not shown) (step S3). In the illustrated example, the first word candidate found to be a keyword is the fourth word candidate, "関谷町" (Sekiya-cho). The current value of n, "4", is therefore set as the variable m.

Next, the left word LW, left end position L and right end position R of the m-th word candidate are taken as LWm, Lm and Rm, respectively (step S4). That is, for the fourth word candidate, "関谷町" (Sekiya-cho), the left word LW is "足立区" (Adachi-ku), the left end position L is "2" and the right end position R is "4".

Next, the length of the words in the address candidate is obtained. That is, the value of "Rm - Lm + 1", here "4 - 2 + 1 = 3", is added to the register LNG (step S5). It is then checked whether the left word of the word candidate is "END", that is, whether the word candidate is a word in the top hierarchy level (step S6). If it is determined not to be "END", the word candidates are searched for a word whose word name NM is the left word LWm, that is, "足立区" (Adachi-ku), and whose right end position R is "Lm - 1", that is, "2 - 1 = 1" (step S7).

If step S8 determines that there is such a word candidate, its number becomes the new value of the variable m (step S9). If it is determined that there is none, the variable n and the length LNG of the collated address candidate are kept in a memory (not shown) as an address candidate (step S10). In the illustrated example there is no such candidate, so n = "4" and LNG = "3" are kept in the memory. In other words, the above processing was able to collate only "関谷町" (Sekiya-cho) as an address candidate.

Next, the variable n is incremented (step S11) and, as before, steps S2, S11 and S12 are repeated until the next word candidate defined as a keyword appears. When step S2 finds such a word candidate, the register LNG is cleared to "0" and the current value of the variable n is stored as the variable m in a register (not shown) (step S3). In the illustrated example, the second word candidate found to be a keyword is the sixth word candidate, "前片". The current value of n, "6", is therefore set as the variable m. The subsequent processing is the same as for the keyword "関谷町" (Sekiya-cho) and its description is omitted; in this case, only "前片" can be collated as an address candidate.

Next, the variable n is incremented (step S11) and, as before, steps S2, S11 and S12 are repeated until the next word candidate defined as a keyword appears.

When, in the course of this repetition, step S2 finds a word candidate that is a keyword, the register LNG is cleared to "0" and the current value of the variable n is set as the variable m in a register (not shown) (step S3). In the illustrated example, the third word candidate found to be a keyword is the eighth word candidate, "宮元町" (Miyamoto-cho).

The current value of n, "8", is therefore set as the variable m. Next, the left word LW, left end position L and right end position R of the m-th word candidate are taken as LWm, Lm and Rm, respectively (step S4). That is, for the eighth word candidate, "宮元町" (Miyamoto-cho), the left word LW is "千住" (Senju), the left end position L is "6" and the right end position R is "8".

Next, the length of the address candidate is obtained. That is, the value of "Rm - Lm + 1", here "8 - 6 + 1 = 3", is added to the register LNG (step S5).

It is then checked whether the left word of the word candidate is "END", that is, whether the word candidate is a word in the top hierarchy level (step S6). Since it is not "END", the word candidates are searched for a word whose word name NM is the left word LWm, that is, "千住" (Senju), and whose right end position R is "Lm - 1", that is, "6 - 1 = 5" (step S7). If step S8 determines that there is such a word candidate, its number becomes the new value of the variable m (step S9). In this case the fifth word candidate qualifies, so the process proceeds to step S9, sets the number of the matching word candidate, "5", as the new value of m, and returns to step S4. In the second pass, for the fifth word candidate, "千住" (Senju), the left word LW is "足立区" (Adachi-ku), the left end position L is "4" and the right end position R is "5".

Next, the length of the address candidate is obtained. That is, the value of "Rm - Lm + 1", here "5 - 4 + 1 = 2", is added to the register LNG (step S5).

As a result, the content of the register LNG becomes "5".

It is then checked whether the left word of the word candidate is "END", that is, whether the word candidate is a word in the top hierarchy level (step S6). Since it is not "END", the word candidates are searched for a word whose word name NM is the left word LWm, that is, "足立区" (Adachi-ku), and whose right end position R is "Lm - 1", that is, "4 - 1 = 3" (step S7). In this case the second word candidate qualifies, so the process proceeds to step S9, sets the number of the matching word candidate, "2", as the new value of m, and returns to step S4. In the third pass, for the second word candidate, "足立区" (Adachi-ku), the left word LW is "東京都" (Tokyo-to), the left end position L is "1" and the right end position R is "3".

Next, the length of the address candidate is obtained. That is, the value of "Rm - Lm + 1", here "3 - 1 + 1 = 3", is added to the register LNG (step S5).

As a result, the content of the register LNG becomes "8".
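The running total held in the register LNG for the keyword 宮元町 can be written out as one worked sum over the three passes described above (m = 8, 5, 2), using only the L and R values given in the text:

$$
\mathrm{LNG} = \sum_{m \in \{8,\,5,\,2\}} (R_m - L_m + 1) = (8 - 6 + 1) + (5 - 4 + 1) + (3 - 1 + 1) = 3 + 2 + 3 = 8
$$

The total equals the length of the eight-character input, which reflects that the chain 足立区, 千住, 宮元町 covers every input position.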

It is then checked whether the left word of the word candidate is "END", that is, whether the word candidate is a word in the top hierarchy level (step S6). Since it is not "END", the word candidates are searched for a word whose word name NM is the left word LWm, that is, "東京都" (Tokyo-to), and whose right end position R is "Lm - 1", that is, "1 - 1 = 0" (step S7). In this case there is no such candidate, so the process branches to step S10, and n = "8" and LNG = "8" are kept as an address candidate. In other words, "足立区千住宮元町" (Adachi-ku, Senju Miyamoto-cho) has been collated.

Further, when processing is performed with "元町" (Motomachi) as the keyword, only "元町" can be collated, in the same way as above. Step S12 then determines that the end has been reached, and the series of address recognition processing ends.
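The loop of FIG. 4 walked through above can be sketched as follows, with the step labels S1 to S12 from the description marked in comments. Several things here are assumptions: the record and function names are hypothetical; the table contains only the candidates whose L, R and LW values the text actually states, so the indices are renumbered relative to FIG. 3 and 前片 is left out because its positions are not given; the left word of 元町 is assumed to be 千住; and the branch taken when step S6 finds "END" is assumed to lead to step S10, which the text does not spell out.

```python
from dataclasses import dataclass
from typing import List, Tuple

END = "END"


@dataclass(frozen=True)
class WordCandidate:
    nm: str        # word name (NM)
    lw: str        # left word (LW), or END at the top level
    left: int      # left end position (L), 1-based
    right: int     # right end position (R), 1-based, inclusive
    keyword: bool  # keyword flag (K)


def recognize(cands: List[WordCandidate]) -> List[Tuple[int, int]]:
    """Return (n, LNG) for every keyword candidate, n being its 1-based index."""
    results: List[Tuple[int, int]] = []
    for n, start in enumerate(cands, start=1):       # S1, S11, S12
        if not start.keyword:                        # S2
            continue
        lng, m = 0, n                                # S3
        while True:
            cur = cands[m - 1]                       # S4
            lng += cur.right - cur.left + 1          # S5
            if cur.lw == END:                        # S6 (assumed to go to S10)
                break
            # S7/S8: look for the left word ending just before this word.
            nxt = next((i for i, c in enumerate(cands, start=1)
                        if c.nm == cur.lw and c.right == cur.left - 1), None)
            if nxt is None:
                break
            m = nxt                                  # S9
        results.append((n, lng))                     # S10
    return results


# Reduced, renumbered table (see the assumptions in the lead-in).
CANDIDATES = [
    WordCandidate("足立区", "東京都", 1, 3, False),
    WordCandidate("関谷町", "足立区", 2, 4, True),
    WordCandidate("千住", "足立区", 4, 5, False),
    WordCandidate("宮元町", "千住", 6, 8, True),
    WordCandidate("元町", "千住", 7, 8, True),
]
results = recognize(CANDIDATES)
```

The inner loop always terminates because each step moves to a word whose right end is strictly smaller than the current left end. Only keyword candidates start a chain, which is the source of the reduction in collations that the description claims.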

As shown in FIG. 2(c), a plurality of address candidates are obtained in this way. In this case, the candidate with the largest number of collated characters (those circled in the figure) is output as the recognition result, for example by referring to the contents of the register LNG. In the illustrated example, "足立区千住宮元町" (Adachi-ku, Senju Miyamoto-cho) is the recognition result. Sorting designation data corresponding to the recognized address, that is, data indicating a sorting pocket in the downstream sorting section, is then output to that sorting section. The delivery item P is thereby sorted into the sorting pocket corresponding to its address.
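The final selection amounts to taking the stored candidate with the largest LNG. A minimal sketch, assuming the address candidates are kept as (n, LNG) pairs as in step S10; the function name is hypothetical, and the description does not say how ties are broken, so the sketch simply keeps the first maximum.

```python
from typing import List, Optional, Tuple


def pick_longest(results: List[Tuple[int, int]]) -> Optional[Tuple[int, int]]:
    """Return the (n, LNG) pair with the largest LNG, or None if there is none."""
    if not results:
        return None
    # max() returns the first maximal element, so ties go to the earlier entry.
    return max(results, key=lambda pair: pair[1])


# The two pairs whose values the description states: 関谷町 and 宮元町.
best = pick_longest([(4, 3), (8, 8)])
```

Here `best` identifies the chain that started at the eighth word candidate, 宮元町, which matches the recognition result given in the text.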

As described above, among the words prepared in advance in the word dictionary 15, the words that should be the starting point of address recognition, that is, the words in the lowest hierarchy level of the addresses to be sorted, are registered as keywords. Among the word candidates recognized by the word recognition section 14, the words registered as keywords are retrieved; with each retrieved keyword as the collation start word, the word candidates are collated sequentially toward the upper hierarchy levels, and the series of words that could be collated is recognized as the address. Whereas the conventional method must collate all nine word candidates, this embodiment needs only four collations, fewer than half, and because the number of word candidates to be collated is reduced, addresses can be recognized at high speed. In addition, because collation proceeds sequentially from the word in the lowest hierarchy level, no collation is performed that is known to be unable to read through to the end, so the collation processing is efficient.

[Effects of the Invention] As described in detail above, the present invention provides an address recognition device that shortens the processing time and recognizes addresses at high speed.

[Brief Description of the Drawings]

The drawings show an embodiment of the present invention. FIG. 1 schematically shows the overall configuration, FIG. 2 illustrates the processing up to obtaining address candidates, FIG. 3 is an explanatory diagram showing the memory image of the word candidates, and FIG. 4 is a flowchart for explaining the operation. P: delivery item; 11: scanning section (character reading means); 12: address area detection section; 13: character recognition section (character recognition means); 14: word recognition section (word recognition means); 15: word dictionary; 16: address recognition section (address recognition means).

Claims

[Claims] An address recognition device that recognizes an address by detecting character information and collating it with information in a dictionary, comprising: character reading means for reading a character string constituting an address; character recognition means for performing character recognition on the character string read by the character reading means and outputting character candidates; a word dictionary that stores, as keywords, those words among the words used for recognition processing at which address recognition should start; word recognition means for performing word recognition by collating the character candidates output from the character recognition means with the word dictionary and outputting word candidates; and address recognition means for retrieving, from among the word candidates output from the word recognition means, the words stored as keywords, and performing address recognition by sequentially collating the word candidates with each retrieved word as the starting point.