JPH10328623A

JPH10328623A - Postal address recognition device

Info

Publication number: JPH10328623A
Application number: JP13958097A
Authority: JP
Inventors: Kiyouka Chiba; 京香千葉; Masato Teramoto; 正人寺本; Masashi Koga; 昌史古賀; Shoji Ikeda; 尚司池田; Tatsuhiko Kagehiro; 達彦影広
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1997-05-29
Filing date: 1997-05-29
Publication date: 1998-12-15

Abstract

PROBLEM TO BE SOLVED: To provide a postal address recognition device in which detection precision of an address region is raised and recognition fate of an addressee, an address and a character string is enhanced. SOLUTION: The postal address recognition device is equipped with an image input part 12, an image memory 13 storing image data fetched in the image input part 12 and an image processing part (a recognition processor 14, a character recognition dictionary 15, a work memory 16 and an address dictionary 17) performing address region cut-out, character cut-out, character recognition and adress collating for the image data stored in the image memory 13. Furthermore, when a tack seal, on which the address character string of an addressee is printed, is stuck on a postal item, the postal address recognition device is provided with both an edge detection means for detecting the edge of the tack seal and an address region judgement means for judging the address region by the position of the edge. Both a method for utilizing variable-density difference of ground color of the postal item and the tack seal and a method for utilizing thickness of the tack seal are used as the edge detection means.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、郵便物に記載され
ている宛名の住所文字列を認識するための郵便宛名認識
装置に関し、特に、短時間にかつ正確に宛名住所文字列
領域を検出することが可能な郵便宛名認識装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a postal address recognition device for recognizing an address character string of an address described in a postal matter, and more particularly, to a method for detecting an address character string region accurately in a short time. Mail address recognition device capable of.

【０００２】[0002]

【従来の技術】郵便宛名認識装置において、郵便物に記
載されている宛名の住所文字列を文字認識するために
は、宛名が記載されている領域を検出する必要がある。
従来は、この宛名領域を検出する場合、横方向および縦
方向に対して、微分を利用して白黒の変化点を求め、そ
の累計を行う黒ドットの投影処理により検出する方法が
あるが、この方法では宛名の住所文字列であるのか広告
や差出人住所であるのか区別することは困難である。2. Description of the Related Art In a postal address recognizing apparatus, it is necessary to detect an area in which a postal address is described in order to recognize the address character string of the postal address described in a mail.
Conventionally, when this address area is detected, there is a method in which a black-and-white changing point is obtained using differentiation in the horizontal and vertical directions, and the black-dot changing point is calculated by projecting black dots. With the method, it is difficult to distinguish between an address character string of an address, an advertisement and a sender address.

【０００３】また、郵便宛名認識装置の宛名領域検出に
関する関連特許として、例えば、特開平７−９３４７４
号公報に記載されたようなラベリング処理を用いたもの
がある。ここでは、郵便物から読み取った２値のデジタ
ル画像で黒ドットのつながりに対してラベリングを施
し、得られたラベルを統合処理し、統合されたラベルの
うち、宛名が手書か印活かを判定し、手書または印活候
補サイズ大のラベルだけを抽出し、抽出したラベルの集
中するエリアを検出し、そのエリアが宛名記載フォーマ
ットに適合しているかを判定することにより、宛名領域
を検出するようにしている。しかしながら、この場合で
も宛名住所文字列と広告や差出人住所文字列等を正確に
区別するのは困難である。[0003] Also, as a related patent relating to address area detection of a postal address recognition apparatus, for example, Japanese Patent Application Laid-Open No. Hei 7-93474.
There is one using a labeling process as described in Japanese Unexamined Patent Publication (Kokai) No. H10-26095. Here, labeling is performed on the connection of the black dots in the binary digital image read from the postal matter, the obtained labels are integrated, and it is determined whether the address of the integrated labels is a handwritten stamp. Detects the address area by extracting only the label with the size of the handwriting or stamp candidate size, detecting the area where the extracted label concentrates, and determining whether the area conforms to the address description format. I have to. However, even in this case, it is difficult to accurately distinguish the destination address character string from the advertisement, the sender address character string, and the like.

【０００４】[0004]

【発明が解決しようとする課題】上記従来の技術は、前
者では黒ドットの投影処理を行う方法により、また後者
では黒ドットの集合に対してラベリング処理を行う方法
により、郵便物の宛名領域を判別しているようにしてい
るが、上述したように、これらの方法では宛名住所文字
列と広告や差出人住所文字列等とを区別するのが困難で
あるという課題があった。また、傾いた画像に対して、
黒ドットの投影処理を行うと、傾きにより縦横方向の累
計がずれるために正しい領域が得られないという課題が
あった。また、黒ドットの投影処理を行う前者の方法
も、ラベリング処理を行う後者の方法も、両方とも、郵
便画像全ての黒ドットに対して処理を行う必要があるた
め、処理時間がかかるという課題があった。本発明の目
的は、上記課題を解決し、宛名住所文字列と、広告や差
出人住所文字列等とを区別することが可能で、傾いた画
像に対しても有効な、また、短い時間で正確に郵便の宛
名を認識することが可能な郵便宛名認識装置を提供する
ことにある。In the prior art described above, the addressing area of a postal matter is determined by a method of projecting black dots in the former method and a method of labeling a set of black dots in the latter method. However, as described above, there is a problem that it is difficult to distinguish the address and address string from the advertisement and the sender address string and the like as described above. Also, for tilted images,
When black dot projection processing is performed, there is a problem that a correct area cannot be obtained because the total in the vertical and horizontal directions is shifted due to the inclination. In addition, both the former method of performing the black dot projection process and the latter method of performing the labeling process require processing for all the black dots of the postal image, so that the processing time is long. there were. SUMMARY OF THE INVENTION The object of the present invention is to solve the above-mentioned problems, and it is possible to distinguish an address character string from an advertisement, a sender address character string, etc., which is effective even for a tilted image, and is accurate in a short time. Another object of the present invention is to provide a postal address recognition device capable of recognizing postal addresses.

【０００５】[0005]

【課題を解決するための手段】本発明は、上記課題を解
決するために、タックシールに印刷されている文字列
は、通常宛名住所文字列であることを利用し、タックシ
ールを検出することによって郵便の宛名を認識するよう
にしたものである。SUMMARY OF THE INVENTION In order to solve the above-mentioned problems, the present invention utilizes a fact that a character string printed on a sticker is a normal address and address character string, and detects the sticker. This recognizes the mail address of the mail.

【０００６】さらに詳しくは、郵便物のイメージデータ
を取り込む画像入力部（１２）と、該画像入力部（１
２）で取り込んだイメージデータを格納する画像メモリ
（１３）と、該画像メモリ（１３）に格納されたイメー
ジデータに対して宛名領域切り出し、文字切り出し、文
字認識、住所照合を行う画像処理部（認識プロセッサ１
４，文字認識辞書１５，ワークメモリ１６，住所辞書１
７）を有する郵便宛名認識装置に、さらに宛名の住所文
字列が印刷されたタックシール（セロファンを含む）が
郵便物に貼り付けられている場合、そのタックシールの
エッジを検出するエッジ検出手段（図３〜図５）とエッ
ジの位置によって宛名領域を判定する宛名領域判定手段
（認識プロセッサ１４による）を設けたことを特徴とし
ている。エッジ検出手段としては、郵便物の地色とタッ
クシールの濃淡差を利用し、タックシールの貼られた郵
便物をスキャンして濃淡変化点の位置を検出し、該濃淡
変化点の位置に基づいてエッジを検出する方法と、タッ
クシールの厚みを利用し、タックシールの貼られた郵便
物に投射された光によって生じる影の位置を検出し、そ
の影の位置に基づいてエッジを検出する方法がある。More specifically, an image input unit (12) for receiving image data of a mail, and the image input unit (1)
An image memory (13) for storing the image data captured in 2), and an image processing unit (105) for extracting an address area, extracting characters, recognizing characters, and comparing addresses with respect to the image data stored in the image memory (13). Recognition processor 1
4, character recognition dictionary 15, work memory 16, address dictionary 1
In the case where the postal address recognition device having 7) is further attached with a tack seal (including cellophane) on which the address character string of the address is printed, the edge detecting means (for detecting the edge of the tack seal) 3 to 5) and a destination area determining means (by the recognition processor 14) for determining a destination area based on the position of an edge. As the edge detecting means, the background color of the postal matter and the density difference of the tack seal are used to scan the postal matter to which the tack seal is affixed to detect the position of the density change point, and based on the position of the density change point. And a method of detecting the position of a shadow caused by light projected on a postal matter to which a tack seal is attached, using the thickness of the tack seal, and detecting the edge based on the position of the shadow. There is.

【０００７】[0007]

【発明の実施の形態】本発明では、郵便宛名認識装置
に、宛名の住所文字列がタックシール（セロファンを含
む）に印刷されている場合、そのタックシールのエッジ
を検出するエッジ検出手段とエッジの位置によって宛名
領域を判定する宛名領域判定手段を設けている。エッジ
検出手段としては、郵便物の地色とタックシールの濃淡
差を利用し、タックシールの貼られた郵便物をスキャン
して濃淡変化点の位置を検出し、該濃淡変化点の位置に
基づいてエッジを検出する方法か、タックシールの厚み
を利用し、タックシールの貼られた郵便物に投射された
光によって生じる影の位置を検出し、その影の位置に基
づいてエッジを検出する方法を利用している。本構成
は、タックシールに印刷されている文字列は、通常宛名
住所文字列であることを利用したものである。郵便物、
特にダイレクトメールの宛名住所文字列は、タックシー
ルを利用している場合が多く非常に有効である。また、
タックシールの傾きは、エッジの傾きにより得られる
が、これは、文字列の傾きと一致しているため、文字列
の傾き調整を行うことで、高精度の認識処理が可能とな
る。DESCRIPTION OF THE PREFERRED EMBODIMENTS In the present invention, when an address character string of an address is printed on a tack seal (including cellophane), an edge detecting means for detecting the edge of the tack seal, and an edge Addressing area determining means for determining an addressing area according to the position of the addressing area is provided. As the edge detecting means, the background color of the postal matter and the density difference of the tack seal are used to scan the postal matter to which the tack seal is affixed to detect the position of the density change point, and based on the position of the density change point. Or a method of detecting the position of a shadow caused by light projected on a postal matter with a tack seal using the thickness of the tack seal, and detecting the edge based on the position of the shadow. I use. This configuration utilizes the fact that the character string printed on the tack seal is a normal address / address character string. Mail,
In particular, the address / address character string of direct mail is very effective in many cases using a tack seal. Also,
The inclination of the tack seal is obtained by the inclination of the edge. Since the inclination of the sticker matches the inclination of the character string, by performing the inclination adjustment of the character string, a highly accurate recognition process can be performed.

【０００８】以下、図面を用いて本発明の郵便宛名認識
装置の一実施例を詳細に説明する。図１は、本発明の郵
便宛名認識装置を説明するための全体のブロック図であ
る。図１に示したように、本発明の郵便宛名認識装置１
０は、ＣＣＤスキャナなどからイメージデータを読み取
って入力する画像入力部１２，読み取ったイメージデー
タを格納する画像メモリ１３，イメージデータから文字
を認識する認識プロセッサ１４，文字認識用の標準文字
パターンを格納している文字認識辞書１５，中間結果な
どを記憶するワークメモリ１６，および住所を記憶しし
ている住所辞書１７から構成されている。また、郵便物
１１上には宛名住所文字列が記載されている。Hereinafter, an embodiment of the mail address recognition apparatus of the present invention will be described in detail with reference to the drawings. FIG. 1 is an overall block diagram for explaining a postal address recognition apparatus of the present invention. As shown in FIG. 1, a postal address recognition device 1 of the present invention
Reference numeral 0 denotes an image input unit 12 for reading and inputting image data from a CCD scanner or the like, an image memory 13 for storing the read image data, a recognition processor 14 for recognizing characters from the image data, and storing a standard character pattern for character recognition. It comprises a character recognition dictionary 15, a work memory 16 for storing intermediate results and the like, and an address dictionary 17 for storing addresses. In addition, a character string of an address and address is described on the mail 11.

【０００９】次に、郵便宛名認識装置１０の処理を説明
する。まず、宛名住所文字列の記載されている郵便物１
１上のイメージデータを画像入力部１２から入力し、画
像メモリ１３に格納する。格納されたイメージデータに
対し、認識プロセッサ１４を用い、宛名領域の検出、認
識対象文字列の検出、文字方向の検出、１文字単位の切
り出し等を行った後、文字認識辞書１５を用い文字認識
を行う。文字認識結果を住所辞書１７を用い住所列との
照合処理等を行い認識結果をワークメモリ１６に格納す
る。このようにして郵便宛名は認識される。ワークメモ
リ１６に格納された住所列は郵便番号等に変換され、郵
便物の区分等に利用される。Next, the processing of the postal address recognition device 10 will be described. First, a mail item 1 with an address / address string
1 is input from the image input unit 12 and stored in the image memory 13. The stored image data is subjected to address recognition, a character string to be recognized, a character direction, a character-by-character cutout, etc., using a recognition processor 14, and then a character recognition dictionary 15 is used to perform character recognition. I do. The character recognition result is compared with an address string by using the address dictionary 17 and the recognition result is stored in the work memory 16. In this way, the mail address is recognized. The address string stored in the work memory 16 is converted into a zip code or the like, and is used for sorting mails.

【００１０】図２は、本発明に有効な、画像入力部１２
から入力された郵便物の画像データの１例である。同図
において、２０は郵便物の画像（以下単に郵便物とい
う），２１はタックシールの画像（以下単にタックシー
ルという），２２はタックシールに印刷されている宛名
住所文字列の画像（以下単に宛名住所文字列という），
２３および２４は郵便物２０に前もって印刷されている
差出人住所文字列および郵便番号の画像を示している。
従来は、宛名住所文字列２２のエリアと差出人住所文字
列２３のエリアは、同格であり、文字認識する場合、ど
ちらの文字列が認識対象となる住所文字列であるのか判
別が困難であった。本発明では、濃淡変化点等を利用し
て、タックシールの画像２１の枠を検出することによ
り、正しい宛名住所文字列の画像２２の認識処理が可能
となる。FIG. 2 shows an image input unit 12 useful for the present invention.
1 is an example of image data of a mail item input from the Internet. In the figure, reference numeral 20 denotes an image of a postal matter (hereinafter, simply referred to as a postal matter), reference numeral 21 denotes an image of a tack seal (hereinafter, simply referred to as a tack seal), and reference numeral 22 denotes an image of an address / address character string printed on the sticker (hereinafter, simply referred to as a sticker). Address mail address string),
Reference numerals 23 and 24 denote images of the sender address character string and the postal code which are printed on the mail 20 in advance.
Conventionally, the area of the destination address character string 22 and the area of the sender address character string 23 have the same rank, and when performing character recognition, it is difficult to determine which character string is the address character string to be recognized. . In the present invention, the recognition process of the image 22 of the correct address / address character string can be performed by detecting the frame of the image 21 of the tack seal using the shading change point or the like.

【００１１】図３〜図５は、本発明における宛名領域検
出方法の１例である。図３は、画像入力部１２で光学的
スキャンによって読み取られ画像メモリ１３に格納され
た郵便物２０からタックシール２１を検出する方法を説
明するための図，図４は画像メモリ１３の画像データを
読み出して得た濃淡変化点テーブルである。タックシー
ル２１を検出するためには、タックシール２１のエッジ
を検出する必要がある。このため、例えば、色の違いに
着目し濃淡差を検出する。すなわち図３において、画像
メモリ１３に格納されたイメージデータは単なる白か黒
で表されるデータ（２値データ）ではなく、濃淡をもっ
たデータ（多値データ）である。図３において、背景は
濃淡レベル０，郵便物２０は濃淡レベル７，タックシー
ル２１は濃淡レベル０と仮定する。画像メモリ１３から
の読み出しはスキャン水平方向３１（ｘ座標軸方向）お
よびスキャン垂直方向３２（ｙ座標軸方向）に行われ、
その読み出しデータから濃淡変化点３３を検出し濃淡変
化点テーブルに格納し、その後、濃淡変化点テーブルに
基づいてエッジを検出する。FIGS. 3 to 5 show an example of a destination area detecting method according to the present invention. FIG. 3 is a view for explaining a method of detecting the tack seal 21 from the mail 20 read by the optical scanning by the image input unit 12 and stored in the image memory 13, and FIG. It is a shading change point table obtained by reading. In order to detect the tack seal 21, it is necessary to detect the edge of the tack seal 21. For this reason, for example, a difference in density is detected by focusing on a difference in color. That is, in FIG. 3, the image data stored in the image memory 13 is not simply data expressed in white or black (binary data), but data having shades (multi-valued data). In FIG. 3, it is assumed that the background is at the gray level 0, the mail 20 is at the gray level 7, and the tack seal 21 is at the gray level 0. Reading from the image memory 13 is performed in the scan horizontal direction 31 (x coordinate axis direction) and the scan vertical direction 32 (y coordinate axis direction).
The gray level change point 33 is detected from the read data, stored in the gray level change point table, and then the edge is detected based on the gray level change point table.

【００１２】画像メモリ１３に格納された多値データか
ら濃淡変化点を検出する方法の一例を説明する。画像メ
モリの多値データパターン上において、水平方向にｘ座
標軸、垂直方向にｙ座標軸をとり、ｘ座標軸方向および
ｙ座標方向に沿って濃度変化点を検出し、濃淡変化点テ
ーブルに登録する。An example of a method for detecting a gray-scale change point from multi-value data stored in the image memory 13 will be described. On the multivalued data pattern in the image memory, the x-coordinate axis is set in the horizontal direction and the y-coordinate axis is set in the vertical direction. The density change points are detected along the x-coordinate axis direction and the y-coordinate direction, and registered in the density change point table.

【００１３】すなわち、まず、ｘ座標軸方向に多値デー
タを読み出し、多値データすなわち濃度が変化した点が
検出されたら、濃淡変化点テーブルに、そのときの座標
値，変化後の濃度，それがｘ座標軸方向の読み出し時に
検出されたことを示す水平フラグに１を登録する。この
処理をｙ座標値を０から郵便物の幅に相当する画像メモ
リの格納幅Ｙまで変えて繰り返す。That is, first, multi-value data is read in the x-coordinate direction, and when multi-value data, that is, a point at which the density has changed is detected, the coordinate value at that time, the density after the change, and 1 is registered in the horizontal flag indicating that the detection is performed at the time of reading in the x coordinate axis direction. This process is repeated by changing the y coordinate value from 0 to the storage width Y of the image memory corresponding to the width of the mail.

【００１４】例えば、ｙ座標値ｂでスキャン水平方向３
１（ｘ座標軸方向）の読み出しをすると、ｘ座標値０で
濃淡レベルが０から７に，ｘ座標値ａで濃淡レベルが７
から０に変化している。従って、濃淡変化点テーブルに
は、ｘ座標値に０，ｙ座標値にｂ，濃淡レベルに７，水
平フラグに１を格納するとともに、ｘ座標値にａ，ｙ座
標値にｂ，濃淡レベルに０，水平フラグに１を格納す
る。同様にｙ座標値（ｂ＋ｉ）でスキャン水平方向３１
（ｘ座標軸方向）の読み出しをした場合も、ｘ座標値０
で濃淡レベルが０から７に，ｘ座標値ａで濃淡レベルが
７から０に変化する。従って、濃淡変化点テーブルに
は、ｘ座標値に０，ｙ座標値に（ｂ＋ｉ），濃淡レベル
に７，水平フラグに１を格納するとともに、ｘ座標値に
ａ，ｙ座標値に（ｂ＋ｉ），濃淡レベルとして０，水平
フラグに１を格納する。なお、この際、ｙ座標値を変え
て繰り返した場合に、前回と同じｘ座標値で濃淡に変化
があった場合には濃淡変化点テーブルへの登録を省略す
ることにより、濃淡変化点テーブルに使用するメモリ領
域を節約できる。For example, the scan horizontal direction 3 with the y coordinate value b
When reading 1 (x-coordinate axis direction), the x-coordinate value 0 changes the gray level from 0 to 7, and the x-coordinate value a changes the gray level to 7
From 0 to 0. Therefore, in the gradation change point table, 0 is stored in the x coordinate value, b is stored in the y coordinate value, 7 is stored in the gray level, 1 is stored in the horizontal flag, and a is stored in the x coordinate value, b is stored in the y coordinate value, and the gray level is stored. 0 and 1 are stored in the horizontal flag. Similarly, the scan horizontal direction 31 is calculated using the y coordinate value (b + i).
(X-coordinate axis direction), the x-coordinate value 0
Changes the gray level from 0 to 7 at x-coordinate value a. Accordingly, 0 is stored as the x coordinate value, (b + i) is stored as the y coordinate value, 7 is stored as the gray level, and 1 is stored as the horizontal flag, and the x coordinate value is a and the y coordinate value is (b + i). , 0 as the gray level, and 1 as the horizontal flag. At this time, when the y-coordinate value is changed and the repetition is performed, if the shading changes at the same x-coordinate value as the previous time, the registration in the shading change point table is omitted, so that the shading change point table is saved. Uses less memory space.

【００１５】次に、ｙ座標軸方向に多値データを読み出
し、多値データすなわち濃度が変化した点が検出された
ら、濃淡変化点テーブルに、そのときの座標値，変化後
の濃度，それがｙ座標軸方向の読み出し時に検出された
ことを示す垂直フラグに１を登録する。次に、この処理
をｘ座標値を０から郵便物の長さに相当する画像メモリ
の格納幅Ｘまで変えて繰り返す。Next, the multivalued data is read in the y-coordinate axis direction, and when the multivalued data, that is, the point where the density is changed is detected, the coordinate value at that time, the density after the change, and y 1 is registered in the vertical flag indicating that the detection is performed at the time of reading in the coordinate axis direction. Next, this processing is repeated while changing the x coordinate value from 0 to the storage width X of the image memory corresponding to the length of the mail.

【００１６】例えば、ｘ座標値ｃでスキャン垂直方向３
２（ｙ座標軸方向）の読み出しをすると、ｙ座標値０で
濃淡レベルが０から７に，ｙ座標値ｄで濃淡レベルが７
から０に変化している。従って、濃淡変化点テーブルに
は、ｘ座標値にｃ，ｙ座標値に０，濃淡レベルに７，垂
直フラグに１を格納するとともに、ｙ座標値にｄ，ｘ座
標値にｃ，濃淡レベルに０，垂直フラグに１を格納す
る。同様に、ｘ座標値（ｃ＋ｊ）でスキャン垂直方向３
２（ｙ座標軸方向）にスキャンした場合も、ｙ座標値０
で濃淡レベルが０から７に，ｙ座標値ｄで濃淡レベルが
７から０に変化する。従って、濃淡変化点テーブルに
は、ｘ座標値に（ｃ＋ｊ），ｙ座標値に０，濃淡レベル
に７，垂直フラグに１を格納するとともに、ｘ座標値に
（ｃ＋ｊ），ｙ座標値にｄ，濃淡レベルに０，垂直フラ
グに１を格納する。なお、この際、ｘ座標値を変えて繰
り返した場合に、前回と同じｙ座標値で濃淡に変化があ
った場合には濃淡変化点テーブルへの登録を省略するこ
とにより、濃淡変化点テーブルに使用するメモリ領域を
節約できる。図４にこのようにして生成した濃淡変化点
テーブルの例を示す。For example, in the x-coordinate value c, the scanning vertical direction 3
2 (in the y coordinate axis direction), when the y coordinate value is 0, the gray level is changed from 0 to 7, and when the y coordinate value is d, the gray level is 7
From 0 to 0. Therefore, in the gray level change point table, c is stored in the x coordinate value, 0 is stored in the y coordinate value, 7 is stored in the gray level, and 1 is stored in the vertical flag. 0 and 1 are stored in the vertical flag. Similarly, the scan vertical direction 3 is determined by the x coordinate value (c + j).
2 (y coordinate axis direction), the y coordinate value is 0
Changes the gray level from 0 to 7 and the gray level from 7 to 0 at the y-coordinate value d. Therefore, in the gray point change point table, (c + j) is stored in the x coordinate value, 0 is stored in the y coordinate value, 7 is stored in the gray level, and 1 is stored in the vertical flag, and (c + j) is stored in the x coordinate value and d is stored in the y coordinate value. , 0 in the gray level, and 1 in the vertical flag. At this time, when the x coordinate value is changed and repeated, if there is a change in the shade with the same y coordinate value as the previous time, the registration in the shade change point table is omitted, so that the shade change point table is saved. Uses less memory space. FIG. 4 shows an example of the shading change point table generated in this way.

【００１７】以上の説明した処理、すなわち郵便物を光
学的にスキャンして求めた画像メモリの多値データ（濃
淡度）に対してスキャン水平方向（ｘ座標軸方向）およ
びスキャン垂直方向（ｙ座標軸方向）に上述した処理を
繰り返すことにより、図４に示したような濃淡変化点テ
ーブルが作成される。このようにして作成された濃淡変
化点テーブルを参照することによって濃淡変化点のエッ
ジの座標（変化点の水平方向，垂直方向の位置）が求め
られ、この座標を分析することにより、濃淡が郵便物の
地色と異なっている部分の形状と大きさを認識できる。
このようにして認識した形状が略四辺形で、大きさが通
常の郵便物の宛名欄（住所，名前）に相当する大きさ
（例えば、数ｃｍ×（５〜１５）ｃｍ程度）であれば、
この濃淡が郵便物の地色と異なっている部分は、タック
シールである可能性が高いと考えられ、その内側にあた
るエリアを宛名領域とする。また、上記形状が小さいも
のであったり、複雑・不規則な形状のものであれば、広
告やマークの可能性が高いと考えられるので、宛名領域
の候補から除外する。本実施例によれば、郵便物の宛名
領域の検出が容易となる。The above-described processing, that is, the scanning horizontal direction (x-coordinate axis direction) and the scanning vertical direction (y-coordinate axis direction) are performed on the multivalued data (shading) of the image memory obtained by optically scanning the mail. By repeating the above-mentioned processing, a gray level change point table as shown in FIG. 4 is created. The coordinates of the edge of the gradation change point (the horizontal and vertical positions of the change point) are obtained by referring to the gradation change point table created in this manner. The shape and size of the part different from the ground color of the object can be recognized.
If the shape recognized in this way is a substantially quadrilateral and the size is equivalent to the address column (address, name) of ordinary mail (for example, about several cm × (5 to 15) cm) ,
It is considered that there is a high possibility that a portion where the shading is different from the ground color of the mail is a tack seal, and an area inside the portion is set as a destination area. In addition, if the shape is small or has a complicated or irregular shape, it is considered that there is a high possibility of an advertisement or a mark, and is excluded from candidates for the address area. According to this embodiment, it is easy to detect a mailing address area.

【００１８】上述したタックシールのエッジ検出方法
は、地色とタックシールに濃淡差がある場合には有効で
あるが、例えば、地色が白色で、タックシールも白色の
場合など両者に濃淡差がない場合には濃度による検出は
できない。そのため、地色とタックシールに濃淡差がな
い場合にも有効なタックシールのエッジ検出方法が要望
される。このようなタックシールのエッジの検出方法の
一つとして、タックシールにある程度以上の厚みがある
場合には、次のようなタックシールのエッジの検出方法
が考えられる。The above-described method of detecting the edge of the tack seal is effective when the ground color and the tack seal have a difference in shading. For example, when the ground color is white and the tack seal is also white, the shading difference between the two is used. If there is no, detection by concentration cannot be performed. Therefore, there is a demand for a tack seal edge detection method that is effective even when there is no difference in shading between the ground color and the tack seal. As one of the methods of detecting the edge of the tack seal, when the tack seal has a certain thickness or more, the following method of detecting the edge of the tack seal can be considered.

【００１９】図５は、タックシールの厚みを利用してタ
ックシールのエッジを検出する実施例を説明するための
図である。同図（ａ）に示すように、タックシールの付
いていない領域は、表面が平らであるが、タックシール
２２の部分だけ厚み５１がある。そのため、イメージデ
ータを入力する際、領域検出用として郵便物２０の左右
上下の斜め上方に光源５２１〜５２４を設けておく。同
図（ｂ）に示すように、光源５２１〜５２４からの光を
郵便物に斜め上方からあてるとタックシールの影５３が
生成する。この影５３を検出し、領域検出用イメージデ
ータとして入力する。このようにして入力した領域検出
用イメージデータは、基本的に厚みがある部分しか線と
して現れないため、タックシールの検出が容易である。
検出したタックシールの形状、大きさによって上述した
場合と同様に宛名領域を検出できる。すなわち、検出し
た形状が略四辺形で、大きさが通常の郵便物の宛名欄
（住所，名前）に相当する大きさ（例えば、数ｃｍ×
（５〜１５）ｃｍ程度）であれば、その内側にあたるエ
リアを宛名領域とする。また、上記形状が小さいもので
あったり、複雑・不規則な形状のものであれば、広告や
マークの可能性が高いと考えられるので、宛名領域の候
補から除外する。本実施例によっても、郵便物の宛名領
域の検出が容易となる。FIG. 5 is a diagram for explaining an embodiment in which the edge of the tack seal is detected using the thickness of the tack seal. As shown in FIG. 3A, the area without the tack seal has a flat surface, but has a thickness 51 only at the tack seal 22. For this reason, when inputting image data, light sources 521 to 524 are provided obliquely above and to the left, right, up and down of the mail 20 for area detection. As shown in FIG. 3B, when light from the light sources 521 to 524 is directed obliquely to the mail, a shadow 53 of the tack seal is generated. This shadow 53 is detected and input as image data for area detection. Since the area detection image data thus input basically appears as a line only in a thick portion, the detection of the tack seal is easy.
The address area can be detected in the same manner as described above depending on the detected shape and size of the tack seal. That is, the detected shape is substantially quadrilateral and the size is equivalent to the address column (address, name) of ordinary mail (for example, several cm ×
(Approximately 5 to 15 cm), the area inside the area is the destination area. In addition, if the shape is small or has a complicated or irregular shape, it is considered that there is a high possibility of an advertisement or a mark, and is excluded from candidates for the address area. According to the present embodiment as well, it is easy to detect a mailing address area.

【００２０】図６は、本発明に関連する宛名認識処理の
流れを説明するためのフローチャートである。まず、画
像メモリ１３に格納されている郵便物のイメージデータ
に対し、タックシールのエッジ生成処理を行なう（ステ
ップ６１）。これは、図３および図４に示した濃淡変化
点を検出する方法、または図５に示したタックシールの
厚みによる影検出する方法を用いることによって行なう
ことが可能である。こうして生成されたタックシールの
エッジに対し、その形状および大きさを分析してタック
シールの有無を判別（ステップ６２）する。タックシー
ルと判断された場合に、そのタックシール内部を宛名領
域とする（ステップ６３）。その場合、図３〜５の説明
で述べたように、エッジで囲まれた領域の大きさが小さ
かったり形状が複雑であった場合には宛名領域とせずに
除外し、別の領域を宛名領域の候補とする。タックシー
ルが存在しない場合は、別の宛名領域検出処理（ステッ
プ６４）、例えば、黒ドットの投影処理（ステップ６
５）等を行い検出する。その検出した領域を宛名領域と
する（ステップ６６）。FIG. 6 is a flowchart for explaining the flow of an address recognition process related to the present invention. First, a tack seal edge generation process is performed on the mail image data stored in the image memory 13 (step 61). This can be performed by using the method of detecting the gray level change point shown in FIGS. 3 and 4, or the method of detecting the shadow based on the thickness of the tack seal shown in FIG. The shape and size of the edge of the tack seal generated in this way are analyzed to determine the presence or absence of the tack seal (step 62). If it is determined that the area is a tack seal, the inside of the tack seal is set as a destination area (step 63). In this case, as described in the description of FIGS. 3 to 5, when the size of the area surrounded by the edge is small or the shape is complicated, the area is excluded without being the destination area, and another area is excluded. As a candidate. If there is no tack seal, another address area detection process (step 64), for example, a black dot projection process (step 6)
5) Perform detection and the like. The detected area is set as a destination area (step 66).

【００２１】次に、検出した宛名領域に対して、例えば
黒ドットのつながり等を利用して文字切り出し処理を行
なう（ステップ６７）。次に、例えば文字候補の１文字
について、０°，９０°，１８０°，２７０°の４種類
の回転処理を行い、それぞれについて文字認識辞書１５
とマッチング処理を行うことにより、最も類似度の高い
方向を文字列の文字方向とすることにより文字方向判別
処理を行なう（ステップ６８）。この文字方向判別処理
を複数の切り出し文字に対して行い、多数決でタックシ
ールの方向を判別する。なお、上記では４種類の角度の
回転処理を行い、文字認識辞書１５とマッチング処理を
行っているが、回転角度を細かくしてさらに多種類の回
転処理を行うことにより、タックシールが斜めに貼られ
ていても宛名住所文字列を精度よく認識することが可能
になる。文字方向判別後、それらによる住所文字列の文
字認識処理を、文字認識辞書１５とのマッチング処理等
により行う（ステップ６９）。この１文字ごとの文字認
識結果に対して、住所辞書１７を用いた住所列の照合処
理等を行い（ステップ７０）、認識結果をワークメモリ
１６に出力する。Next, a character cutout process is performed on the detected address area using, for example, the connection of black dots (step 67). Next, for example, four types of rotation processing of 0 °, 90 °, 180 °, and 270 ° are performed on one character of a character candidate, and the character recognition dictionary 15
By performing the matching process, the character direction determining process is performed by setting the direction having the highest similarity as the character direction of the character string (step 68). This character direction determination processing is performed on a plurality of cut-out characters, and the direction of the tack seal is determined by majority decision. In the above description, rotation processing of four types of angles is performed and matching processing with the character recognition dictionary 15 is performed. However, by making the rotation angle finer and performing more types of rotation processing, the tack seal is attached diagonally. Even if it is, it becomes possible to recognize the address / address character string with high accuracy. After the determination of the character direction, the character recognition process of the address character string is performed by the matching process with the character recognition dictionary 15 (step 69). The character recognition result for each character is subjected to an address string collation process using the address dictionary 17 (step 70), and the recognition result is output to the work memory 16.

【００２２】なお、ステップ７０の住所列の照合処理
で、住所が照合できない場合は、当該領域は宛名を記載
したものではないと考えられるので、別の領域に対して
同様の処理を行なう。すなわち、ステップ６３でエッジ
の大きさが小さすぎたり、形状が複雑・不規則であるた
め不適当として除外したものを宛名領域の候補として採
用することも可能である。If the address cannot be compared in the address string matching process in step 70, it is considered that the area does not include an address, and the same processing is performed for another area. That is, it is also possible to adopt, as the candidates for the destination area, those which are excluded as inappropriate because the size of the edge is too small or the shape is complicated or irregular in step 63.

【００２３】[0023]

【発明の効果】本発明の郵便宛名認識装置によれば、宛
名住所文字列と、広告や差出人住所文字列等とを区別す
ることが可能で、傾いた画像に対しても有効な、また、
短い時間で正確に郵便の宛名を認識することが可能にな
る。According to the postal address recognition apparatus of the present invention, it is possible to distinguish between an address character string and an advertisement or a sender address character string, which is effective for a tilted image.
It is possible to accurately recognize the mail address in a short time.

[Brief description of the drawings]

【図１】本発明における郵便宛名認識装置全体のブロッ
ク図である。FIG. 1 is a block diagram of an entire postal address recognition device according to the present invention.

【図２】本発明に有効な郵便物の画像例である。FIG. 2 is an image example of a postal matter effective for the present invention.

【図３】本発明における宛名領域検出方法を説明するた
めの図である。FIG. 3 is a diagram for explaining a destination area detection method according to the present invention.

【図４】本発明における濃淡変化点テーブルの１例であ
る。FIG. 4 is an example of a shading change point table according to the present invention.

【図５】本発明におけるタックシールの影を用いた宛名
領域検出方法を説明するための図である。FIG. 5 is a diagram for explaining a destination area detection method using a tack seal shadow according to the present invention.

【図６】本発明における処理の流れを示すフローチャー
トである。FIG. 6 is a flowchart showing a flow of processing in the present invention.

[Explanation of symbols]

１０：郵便宛名認識装置、１１：郵便物、１２：画像入
力部、１３：画像メモリ、１４：認識プロセッサ、１
５：文字認識辞書、１６：ワークメモリ、１７：住所辞
書、２０：郵便物（の画像）、２１：タックシール（の
画像）、２２：宛名住所文字列（の画像）、２３：差出
人住所文字列（の画像）、２４：郵便番号（の画像）、
５２１〜５２４：光源、５３：影10: Postal address recognition device, 11: Mail, 12: Image input unit, 13: Image memory, 14: Recognition processor, 1
5: character recognition dictionary, 16: work memory, 17: address dictionary, 20: postal matter (image), 21: tack seal (image), 22: address / address character string (image), 23: sender address character Column (image of), 24: Zip code (image of),
521 to 524: light source, 53: shadow

───────────────────────────────────────────────────── フロントページの続き (72)発明者池田尚司東京都国分寺市東恋ケ窪一丁目280番地株式会社日立製作所中央研究所内 (72)発明者影広達彦東京都国分寺市東恋ケ窪一丁目280番地株式会社日立製作所中央研究所内 ──────────────────────────────────────────────────続き Continuing on the front page (72) Inventor Naoji Ikeda 1-280 Higashi Koikekubo, Kokubunji-shi, Tokyo Inside the Central Research Laboratory, Hitachi, Ltd. Inside the Central Research Laboratory

Claims

[Claims]

1. An image input unit for receiving image data of a postal matter, an image memory for storing the image data captured by the image input unit, and an address area cut out from the image data stored in the image memory. Cut out,
In a postal address recognition device having an image processing unit for character recognition and address collation, an edge detection means for detecting an edge of a tack seal on which an address is printed, and a destination area is determined based on the edge detected by the edge detection means. A postal address recognition device, further comprising an address area determining means for performing the operation.

2. The method according to claim 1, wherein the edge detecting means scans the postal matter to which a sticker is attached to detect a position of a shading change point, and detects an edge based on the position of the shading change point. Postal address recognition device.

3. The mail according to claim 1, wherein the edge detecting means detects a position of a shadow generated by light projected on the postal matter to which the tack seal is attached, and detects an edge based on the position of the shadow. Address recognition device.