JP2001109844A

JP2001109844A - Character string extracting method, handwritten character extracting method, character string extraction device, and image processor

Info

Publication number: JP2001109844A
Application number: JP28517799A
Authority: JP
Inventors: Hiroyasu Takahashi; 弘晏高橋
Original assignee: International Business Machines Corp
Current assignee: International Business Machines Corp
Priority date: 1999-10-06
Filing date: 1999-10-06
Publication date: 2001-04-20
Anticipated expiration: 2019-10-06
Also published as: US6798895B1; JP3204259B2

Abstract

PROBLEM TO BE SOLVED: To extract the character string of a handwritten address, etc., fast with high precision while evading a complicated integrating process in all pixels. SOLUTION: A connecting pixel group (CC) detection part 13 detects a connecting pixel group (CC) formed of black connecting pixels in a binarized image, a character size connecting pixel group (CharCC) extraction part 14 extracts a character size connecting pixel group (CharCC) which is neither too large nor too small among detected connecting pixel groups, and a lateral expansion part 21 and a longitudinal expansion part 22 which expand and contract the extracted character size connecting pixel group by expanding it in a virtual character string direction and contracting it at right angles to the virtual character string direction. Then thin and long connecting pixel group (LongCC) extraction parts 25 and 26 combine expanded and contracted connecting pixel groups in the virtual character string direction to extract thin and long connecting pixel groups and a character string selection part 30 selects an object character string of image recognition according to the extracted long and thin connecting pixel groups.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、例えば郵便物の宛
名自動読み取り機などに用いられる画像処理装置等に関
し、特に日本語手書き宛名文字等の不整合な文字列を抽
出するのに適当な画像処理装置等に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image processing apparatus and the like used for, for example, an automatic address reader for mail, and more particularly, to an image processing apparatus suitable for extracting an inconsistent character string such as Japanese handwritten address characters. The present invention relates to a processing device and the like.

【０００２】[0002]

【従来の技術】例えば、郵便物の処理分野では、大量に
存在する郵便物を連日、短時間にて処理する必要があ
る。従前では、人間による目視と手作業による分類がな
されており、郵便局員の負担が非常に大きかった。その
ために、自動化読み取り装置による機械化が進められ、
例えば、我が国における郵便番号欄に記載された郵便番
号の読み取り等、送達区域毎の区分には大きな進展が見
られている。しかしながら、郵便番号欄が設けられてい
ない諸外国の郵便物では、郵便番号の位置を認識するこ
とが難しい。また、ますます増加が予想される今後の郵
便物処理において、郵便番号以外に郵便物上の宛名情報
を即座に読み取り、送達区域に区分する技術の発展は、
将来においても強く望まれている技術事項である。2. Description of the Related Art For example, in the field of mail processing, it is necessary to process a large number of mails every day in a short time. In the past, humans were classified visually and manually, and the burden on postal workers was very large. To this end, the mechanization with automated reading devices has been promoted,
For example, great progress has been made in the classification of each service area, such as reading postal codes written in the postal code column in Japan. However, it is difficult to recognize the position of the postal code for postal items from other countries without a postal code column. Also, in the future mail processing, which is expected to increase more and more, the development of technology to immediately read the address information on the mail other than the zip code and classify it into service areas,
It is a technical matter that is strongly desired in the future.

【０００３】このように、郵便番号欄が無い郵便物から
郵便番号を抽出することや、宛名情報の読み取りについ
て、その要望は強いものの、現状の技術では郵便物から
これらの情報を読み取ることは非常に難しい。これは、
郵便物には、宛名領域以外に差出人の領域や広告情報、
絵、写真等の様々な模様が含まれており、かつ、手書き
と印刷文字の混在や、縦書きや横書き等、バリエーショ
ンの数があまりにも多いことがその主たる理由である。
その為に、これらの混在する情報の中から郵便番号や宛
名情報を正しく抽出するためには、全ての領域をサーチ
する等、非常に複雑な処理が必要となる。特に日本語に
よる手書きの宛名は、縦書き、横書きの混在による問題
の他、宛名を形成する手書き文字が不規則に並んでいる
場合が多く、良好な宛名読み取りを実行することが特に
難しい。As described above, although there is a strong demand for extracting a postal code from a mail having no postal code column and reading address information, it is very difficult to read such information from a mail using current technology. Difficult. this is,
In addition to the address area, the mail area includes the sender's area, advertising information,
The main reason is that various patterns such as pictures and photographs are included, and there are too many variations such as a mixture of handwritten and printed characters, vertical writing and horizontal writing.
Therefore, in order to correctly extract postal code and address information from such mixed information, very complicated processing such as searching all areas is required. In particular, in handwritten addresses in Japanese, in addition to the problem of mixed vertical and horizontal writing, handwritten characters forming the addresses are often arranged irregularly, and it is particularly difficult to perform good address reading.

【０００４】ここで、宛名情報を抽出する既存の方法と
して、まず、宛名領域を抽出した後に実際の宛名を読み
取る技術が存在する。例えば、印字宛名はラベルとして
郵便物に貼り付けられていると仮定し、その反射属性や
ラベルの有するエッジの影を検出して宛名領域を抽出す
る方法や、横方向と縦方向に関して投影を取り、濃度の
高い場所を文字行として宛名領域を推定する方法があ
る。また、特開平７-２６５８０７号公報では、近接し
ている画素連結成分を統合化して文字行を推定し、また
更にそれを統合化して宛名領域の候補とする技術が開示
されている。Here, as an existing method for extracting address information, there is a technique of first extracting an address area and then reading an actual address. For example, assuming that the print address is attached to a postal matter as a label, the method of extracting the address area by detecting its reflection attribute and the shadow of the edge of the label, or projecting in the horizontal and vertical directions There is a method of estimating a destination area by using a high density place as a character line. Japanese Patent Application Laid-Open No. Hei 7-265807 discloses a technique of estimating a character line by integrating adjacent pixel connected components, and further integrating them to be a candidate for a destination area.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、印字宛
名がラベルとして貼り付けられていると仮定して領域を
抽出する方法では、ラベルが貼られておらず印字宛名が
直接、郵便物に記載されている場合には全く対応するこ
とができず、抽出できる郵便物が大きく限定される。特
にラベルが殆ど用いられない手書き日本語宛名に対して
この技術を適用することは困難である。また、濃度の高
い場所を文字行として推定する方法では、背景部に多く
の画像情報が存在し複雑な模様のある郵便物に対して宛
名領域以外との区別をつけることが難しく、正確な宛名
領域を検出することができない。更に、特開平７-２６
５８０７号公報に開示された技術では、文字が比較的き
れいに並んでおり、且つ文字列が比較的密に並んでいる
ことを想定しており、印刷文字の宛名にはある程度適用
できても、不規則な手書きの宛名に適用した場合、充分
な結果は期待できない。また、この技術では、全画素か
ら固まりを拾い、それを全部サーチする作業を固まり毎
に全て繰り返す、といった全画素内における複雑な統合
化処理を施す必要性がある。その為に、ロジックが非常
に難しく、処理速度が非常に遅くなると共に、構成があ
まりにも複雑となり、システム上からみて現実的なもの
とは言い得ない。However, in the method of extracting an area on the assumption that the print address is attached as a label, the label is not attached and the print address is directly written on the mail. Cannot be handled at all, and the mail that can be extracted is greatly limited. In particular, it is difficult to apply this technique to handwritten Japanese addresses where labels are rarely used. Also, in the method of estimating a place with high density as a character line, it is difficult to distinguish a mail having a complicated pattern with a lot of image information in a background portion from an area other than the address area, so that accurate addressing is not possible. The area cannot be detected. Further, JP-A-7-26
The technology disclosed in Japanese Patent No. 5807 assumes that characters are relatively neatly arranged and character strings are relatively densely arranged. When applied to regular handwritten addresses, satisfactory results cannot be expected. Further, in this technique, it is necessary to perform a complicated integration process in all pixels, such as picking up a cluster from all the pixels and repeating the operation of searching for all the clusters for each cluster. Therefore, the logic is very difficult, the processing speed is extremely slow, and the configuration is too complicated, so that it cannot be said to be realistic from the viewpoint of the system.

【０００６】本発明はこのような課題を解決するために
なされたものであって、全画素内での複雑な統合化処理
を避け、精度高く高速に手書き宛名等の文字列を抽出す
ることを主たる目的とする。また、他の目的は、文字列
の並びが不規則な宛名であっても実用に耐え得る文字列
抽出を提供することにある。更に他の目的は、縦書き、
横書きが混在した郵便物等の画像に対しても、柔軟性を
持たせ、且つ容易な処理により文字列を抽出することに
ある。また更に他の目的は、印刷文字列等の整合がとれ
た文字列と、日本語手書き宛名等のように不規則な文字
列に対して、アルゴリズムを変えて処理することで、柔
軟性が高く、かつ高精度な文字列抽出を提供することに
ある。SUMMARY OF THE INVENTION The present invention has been made to solve such a problem, and it is an object of the present invention to accurately and quickly extract a character string such as a handwritten address by avoiding complicated integration processing in all pixels. Main purpose. It is another object of the present invention to provide a character string extraction that can be put to practical use even if the arrangement of the character strings is irregular. Still other purposes are vertical writing,
It is an object of the present invention to extract character strings by providing flexibility and easy processing even for an image of a postal matter or the like in which horizontal writing is mixed. Still another object is to provide a high flexibility by processing an algorithm for a character string matched with a print character string or the like and an irregular character string such as a Japanese handwritten address. And highly accurate character string extraction.

【０００７】[0007]

【課題を解決するための手段】かかる目的のもと、本発
明の文字列抽出方法は、入力された画像を構成する画素
の連結画素群を抽出するステップと、抽出されたこの連
結画素群の大きさを予め定められた閾値と比較して閾値
内連結画素群を抽出するステップと、抽出されたこの閾
値内連結画素群の縦方向または横方向を伸ばして結合し
た細長連結画素群を抽出するステップと、抽出されたこ
の細長連結画素群の連結状態に基づいて文字列候補を抽
出するステップとを備えたことを特徴としている。With this object in mind, the character string extracting method according to the present invention comprises the steps of: extracting a connected pixel group of pixels constituting an input image; and extracting the connected pixel group of the extracted connected pixel group. Extracting a connected pixel group within the threshold by comparing the size with a predetermined threshold, and extracting a connected elongated pixel group extending in the vertical or horizontal direction of the extracted connected pixel group within the threshold. And a step of extracting a character string candidate based on the connected state of the extracted elongated connected pixel group.

【０００８】この連結画素群を抽出するステップとして
は、上下、左右、斜めの８連結画素群や、上下、左右の
４連結画素群を用いて抽出し、所定の閾値に入るものを
抽出する方法が挙げられる。また、この閾値内連結画素
群を抽出するステップは、連結画素群の縦方向および/
または横方向を、想定される文字サイズが有する所定の
縦方向および/または横方向と比較することにより閾値
内連結画素群を抽出することを特徴とすることもでき
る。これによれば、例えば、手書き文字と予想される文
字サイズを考慮して閾値を定めて比較することで、模様
等が混在した郵便物からノイズを取り除いた状態で連結
画素群を検出することが可能となる。より具体的には、
例えば、連結画素群の高さまたは幅が第１の定数値(例
えば４本/ｍｍに圧縮した解像度で１１０画素)以下であ
れば閾値内連結画素群とする構成が挙げられる。As a step of extracting the connected pixel group, a method of extracting the connected pixel group by using an eight-connected pixel group vertically, horizontally, and diagonally or a four-connected pixel group vertically, horizontally, and extracting a pixel which falls within a predetermined threshold value is used. Is mentioned. In addition, the step of extracting the connected pixel group within the threshold value is performed in a vertical direction of the connected pixel group and / or
Alternatively, the intra-threshold connected pixel group may be extracted by comparing the horizontal direction with predetermined vertical and / or horizontal directions of the assumed character size. According to this, for example, by determining a threshold value in consideration of a character size expected to be a handwritten character and a character size, it is possible to detect a connected pixel group in a state where noise is removed from a postal matter in which patterns and the like are mixed. It becomes possible. More specifically,
For example, if the height or width of the connected pixel group is equal to or smaller than a first constant value (for example, 110 pixels at a resolution of 4 lines / mm), the connected pixel group is set to a threshold connected pixel group.

【０００９】また、この細長連結画素群を抽出するステ
ップは、抽出された閾値内連結画素群の大きさに基づい
て伸ばす大きさを変更することにより細長連結画素群を
抽出することを特徴とすれば、バリエーションの多い手
書き文字のサイズに対応できる点で好ましい。より具体
的には、閾値内連結画素群の伸ばす方向の長さがＬであ
れば、その両側にＬずつ伸ばすことが挙げられる。更
に、この細長連結画素群を抽出するステップは、縦方向
を伸ばした場合には横方向を縮め、横方向を伸ばした場
合には縦方向を縮めて細長連結画素群を抽出するように
構成すれば、隣り合う文字同士の接触を防いで文字列を
抽出することが可能となる点で優れている。また更に、
この細長連結画素群を抽出するステップは、抽出された
閾値内連結画素群の大きさに基づいて縮める量を変更す
ることにより細長連結画素群を抽出することを特徴とす
れば、前述と同様に、バリエーションの多い手書き文字
のサイズに対応できる点で好ましい。Further, the step of extracting the elongated connected pixel group is characterized in that the elongated connected pixel group is extracted by changing the extension size based on the size of the extracted intra-threshold connected pixel group. This is preferable because it can correspond to the size of handwritten characters with many variations. More specifically, if the length of the in-threshold connected pixel group in the extending direction is L, it may be extended on both sides by L. Further, the step of extracting the elongated connected pixel group is configured to extract the elongated connected pixel group by contracting the horizontal direction when the vertical direction is extended and contracting the vertical direction when the horizontal direction is extended. For example, it is excellent in that a character string can be extracted while preventing contact between adjacent characters. Moreover,
The step of extracting the elongated connected pixel group is characterized in that the elongated connected pixel group is extracted by changing the amount of contraction based on the size of the extracted connected pixel group within the threshold, as described above. This is preferable in that it can correspond to the size of handwritten characters having many variations.

【００１０】また、画像を所定の大きさに区分したメッ
シュ画像を想定し、抽出された閾値内連結画素群に対応
するメッシュ画像をＯＮとするステップと、ＯＮとされ
たメッシュ画像の連結状態から特定領域を抽出するステ
ップとを更に具備することを特徴とすることができる。
更に、画像が印刷文字を宛名文字とする場合にはこのメ
ッシュ画像をＯＮとするステップに移行し、この画像が
手書き文字を宛名文字とする場合には細長連結画素群を
抽出するステップに移行することを特徴とすれば、印刷
文字についてはより簡単な領域抽出方法にて宛名領域を
抽出することができ、処理の高速化が図れる点で優れて
いる。尚、このメッシュ画像により特定領域を抽出する
方法は、１画素１ピクセルの世界から所定の矩形エリア
であるメッシュ画像の世界に移行し、このメッシュ画像
を１ポイントとして簡易化する方法であることから、よ
り簡単なアルゴリズムで特定領域を抽出することが可能
となる。Further, assuming a mesh image obtained by dividing the image into a predetermined size, turning on a mesh image corresponding to the extracted connected pixel group within the threshold value, and determining a connected state of the mesh images turned on. Extracting a specific region.
Further, if the image is a print character, the process proceeds to a step of turning on the mesh image. If the image is a handwritten character, the process proceeds to a step of extracting a group of elongated connected pixels. This feature is advantageous in that a destination area can be extracted by a simpler area extraction method for print characters, and the processing can be speeded up. Note that the method of extracting a specific region from the mesh image is a method of shifting from the world of one pixel per pixel to the world of the mesh image which is a predetermined rectangular area, and simplifying this mesh image as one point. It is possible to extract a specific area with a simpler algorithm.

【００１１】また、本発明の文字列抽出方法は、入力さ
れた手書き文字を含む画像から所定の大きさを有する画
素群を抽出し、この画素群の一方向を伸ばし他方向を縮
めることで伸ばした方向への文字列の紐を想定し、複数
からなる想定されたこの文字列の紐の状態を検出し、検
出されたこの文字列の紐の状態から宛名パターンを推定
することを特徴としている。ここで、検出されるこの文
字列の紐の状態は、紐の長さおよび紐の凹凸状態を含
み、この文字認識の認識処理を実行する順位は、紐の長
さの長いものまたは/および凹凸状態の少ないものに対
して順位を上げることを特徴とすれば、宛名等の文字列
候補として、より適当な文字列を抽出できる点で好まし
い。また、想定される文字列の紐は、画素群の一方であ
る縦または横に対してこの画素群の大きさに所定の割合
を付加して伸ばし、その他方である横または縦に対して
この画素群の大きさに所定の割合を減じて縮め、伸ばさ
れおよび縮められた複数の画素群の連結状態により文字
列の紐が想定されることを特徴とすることができる。In the character string extracting method according to the present invention, a pixel group having a predetermined size is extracted from an image including an input handwritten character, and the pixel group is expanded by extending one direction and contracting the other direction. Character string in the direction of the character string, detecting a string state of the assumed string of plural strings, and estimating an address pattern from the detected string state of the string. . Here, the detected string state of the character string includes the string length and the unevenness state of the string, and the order in which the character recognition processing is performed is that of the long string length and / or the unevenness of the string. It is preferable to increase the order of the items having a small number of states, since a more appropriate character string can be extracted as a character string candidate for an address or the like. In addition, the string of the assumed character string is extended by adding a predetermined ratio to the size of this pixel group with respect to the vertical or horizontal one of the pixel groups, and is expanded with respect to the other horizontal or vertical. A character string may be assumed based on a connection state of a plurality of pixel groups that are reduced, expanded, and reduced by reducing a predetermined ratio to the size of the pixel group.

【００１２】また、本発明の文字列抽出装置は、２値化
された画像中の黒の連結画素から形成される連結画素群
を検出する連結画素群検出部と、この連結画素群検出部
により検出された連結画素群のうち、大きすぎず小さす
ぎない文字サイズ連結画素群を抽出する文字サイズ連結
画素群抽出部と、この文字サイズ連結画素群抽出部によ
り抽出された文字サイズ連結画素群に対し、仮定文字列
方向には伸ばしこの仮定文字列方向と直交する方向には
縮めることで文字サイズ連結画素群を伸縮させる連結画
素群伸縮部と、この連結画素群伸縮部により伸縮された
複数の連結画素群を、仮定文字列方向に結合させて細長
連結画素群を抽出する細長連結画素群抽出部と、この細
長連結画素群抽出部により抽出された細長連結画素群に
基づいて画像認識すべき文字列を選定する文字列選定部
とを備えたことを特徴としている。更に、この文字サイ
ズ連結画素群抽出部は、郵便物上に記載された手書き文
字のサイズを考慮した閾値を基に文字サイズ連結画素群
を抽出することを特徴とすれば、手書き文字からなる文
字列を抽出する点から好ましい。ここで、この仮定文字
列方向は、縦書き宛名の文字列を考慮する場合には縦方
向であり、横書き宛名の文字列を考慮する場合には横方
向が該当する。Further, the character string extracting apparatus according to the present invention includes a connected pixel group detecting section for detecting a connected pixel group formed from black connected pixels in a binarized image, and a connected pixel group detecting section. Among the detected connected pixel groups, a character size connected pixel group extraction unit that extracts a character size connected pixel group that is neither too large nor too small, and a character size connected pixel group extracted by the character size connected pixel group extraction unit. On the other hand, a connected pixel group expansion / contraction unit that expands / contracts the character size connection pixel group by extending in the assumed character string direction and contracting in the direction orthogonal to the assumed character string direction, An elongated connected pixel group extraction unit for connecting the connected pixel groups in the assumed character string direction to extract an elongated connected pixel group, and image recognition based on the elongated connected pixel group extracted by the elongated connected pixel group extraction unit. It is characterized in that a character string selection unit for selecting a character string to. Further, the character size connected pixel group extracting unit extracts the character size connected pixel group based on a threshold value in consideration of the size of the handwritten character written on the mail, and the character size connected pixel group This is preferable in terms of extracting columns. Here, the assumed character string direction is a vertical direction when considering a character string of a vertically written address, and a horizontal direction when considering a character string of a horizontally written address.

【００１３】また、本発明の画像処理装置は、画像を入
力する画像入力手段と、この画像入力手段により入力さ
れた画像から所定の大きさを有する画素群を抽出する画
素群抽出手段と、この画素群抽出手段により抽出された
画素群に対して縦方向と横方向を定義し、この縦方向に
ついては画素群が伸ばされ横方向には縮められた縦方向
伸長画素群を生成する縦方向伸長画素群生成手段と、一
方、画素群抽出手段により抽出された画素群に対して縦
方向と横方向を定義し、この横方向については画素群が
伸ばされ縦方向には縮められた横方向伸長画素群を生成
する横方向伸長画素群生成手段と、縦方向伸長画素群生
成手段により生成された複数の縦方向伸長画素群および
横方向伸長画素群生成手段により生成された複数の横方
向伸長画素群を、それぞれ伸ばされた方向に結合して細
長連結画素群を抽出する細長連結画素群抽出手段と、こ
の細長連結画素群抽出手段により抽出された細長連結画
素群に基づいて文字列候補を抽出する文字列候補抽出手
段と、この文字列候補抽出手段により抽出された文字列
候補に対して画像の認識を行う画像認識手段とを備えた
ことを特徴としている。The image processing apparatus according to the present invention comprises: an image input means for inputting an image; a pixel group extracting means for extracting a pixel group having a predetermined size from the image input by the image input means; A vertical extension that defines a vertical direction and a horizontal direction with respect to the pixel group extracted by the pixel group extraction means, and generates a vertical extension pixel group in which the pixel group is extended and contracted in the horizontal direction. A pixel group generating means and a vertical direction and a horizontal direction are defined with respect to the pixel group extracted by the pixel group extracting means, and in this horizontal direction, the pixel group is expanded and the horizontal direction is reduced in the vertical direction. A horizontally elongated pixel group generating means for generating a pixel group; a plurality of vertically elongated pixel groups generated by the vertically expanded pixel group generating means and a plurality of horizontally expanded pixels generated by the horizontally expanded pixel group generating means Group Elongated connected pixel group extracting means for extracting elongated connected pixel groups by combining in the extended directions, and extracting a character string candidate based on the elongated connected pixel group extracted by the elongated connected pixel group extracting means. It is characterized by comprising a character string candidate extracting means and an image recognizing means for recognizing an image with respect to the character string candidate extracted by the character string candidate extracting means.

【００１４】この文字列候補抽出手段は、細長連結画素
群抽出手段により抽出された複数の細長連結画素群に対
し、その長さの長いものに対して優先度を上げて出力す
ることを特徴とすることができる。更に、この文字列候
補抽出手段は、縦方向伸長画素群と横方向伸長画素群と
の交差する画素群に対して評価点を下げて優先度を決定
することを特徴とすれば、偽りの文字列抽出を防ぎ、よ
り実用レベルとして優れた画像処理装置を提供できる点
で好ましい。The character string candidate extracting means outputs a plurality of elongated connected pixel groups extracted by the elongated connected pixel group extracting means with higher priority to those having a longer length. can do. Further, the character string candidate extracting means determines a priority by lowering an evaluation point with respect to a pixel group intersecting the vertical direction expanded pixel group and the horizontal direction expanded pixel group. This is preferable in that column extraction can be prevented and an image processing apparatus which is more practical can be provided.

【００１５】[0015]

【発明の実施の形態】以下、添付図面に示す実施の形態
に基づいて、この発明を詳細に説明する。図１は、本実
施の形態における画像処理装置の全体構成を示した説明
図である。同図に示すように、本実施の形態における画
像処理装置は、大きく、文字列抽出装置１と、画像入力
装置２と、画像認識装置３とにより構成される。この画
像入力装置２は、例えば主走査方向にラインセンサを備
え、副走査方向にスキャンして画像を読み取るスキャナ
ーや、フォトによる画像全面の読み取り等の光学的文字
読取装置から構成され、郵便物の全体イメージを取得で
きるように構成されている。また、この画像入力装置２
には、図示しない光電変換回路が備えられ、光学的に読
み取った画像を電気信号に変換するように構成されてい
る。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, the present invention will be described in detail based on an embodiment shown in the accompanying drawings. FIG. 1 is an explanatory diagram illustrating the overall configuration of the image processing apparatus according to the present embodiment. As shown in the figure, the image processing apparatus according to the present embodiment is largely composed of a character string extraction device 1, an image input device 2, and an image recognition device 3. The image input device 2 includes, for example, a scanner that has a line sensor in the main scanning direction and scans an image by scanning in the sub-scanning direction, and an optical character reading device that reads an entire image by a photo. It is configured so that an entire image can be obtained. Also, the image input device 2
Is provided with a photoelectric conversion circuit (not shown), and is configured to convert an optically read image into an electric signal.

【００１６】次に、本実施の形態における文字列抽出装
置１について説明する。画像入力装置２により出力され
る電気信号は、２値化部１１により白画素と黒画素の２
値信号に変換される。また、この２値化部１１では、小
さいノイズを除去するノイズ除去も実行される。尚、場
合によっては、画像入力装置２に２値化手段を備え、２
値化された状態で文字列抽出装置１に入力される場合も
あり得る。この場合には、文字列抽出装置１には２値化
部１１を有さない態様となるであろう。次に２値化され
た画像データは縮小部１２にて縮小される。例えばこの
縮小部１２では、８本/ｍｍの解像度を有する画像を４
本/ｍｍの解像度に変換するように構成されており、処
理スピードを上げるために縮小作業が実行される。縮小
方法としては、簡易な方法として、例えば、連続ライン
のＡＮＤをとって出力する方法等があるが、領域抽出精
度に影響が出ることから、処理スピードに問題がない場
合には縮小部１２を設ける必要はない。また、２値化部
１１と縮小部１２との順序を逆にすることも可能であ
る。Next, the character string extracting device 1 according to the present embodiment will be described. The electric signal output from the image input device 2 is converted by the binarization unit 11 into two signals of a white pixel and a black pixel.
It is converted to a value signal. The binarizing unit 11 also performs noise removal for removing small noise. In some cases, the image input device 2 includes a binarizing unit.
The value may be input to the character string extraction device 1 in some cases. In this case, the character string extracting device 1 will not have the binarizing unit 11. Next, the binarized image data is reduced by the reduction unit 12. For example, in the reduction unit 12, an image having a resolution of 8 lines / mm
It is configured to convert to a resolution of book / mm, and a reduction operation is performed to increase the processing speed. As a simple reduction method, for example, there is a method of performing an AND operation on continuous lines and outputting the result. However, since the accuracy of region extraction is affected, if there is no problem in the processing speed, the reduction unit 12 is used. No need to provide. It is also possible to reverse the order of the binarizing unit 11 and the reducing unit 12.

【００１７】この縮小部１２から縮小された画像データ
は、連結画素群(ＣＣ)検出部１３に入力される。本実施
の形態における連結画素群(ＣＣ)検出部１３では、２値
化された全体イメージから、８連結画素群、即ち黒画素
の８連結からなる連結画素群(Connected Component:Ｃ
Ｃ)を全て抽出している。この８連結画素群とは、中心
画素に対して左右、上下、斜めの８画素を捉え、これら
の黒画素の連結を取り出すことで連結画素群(ＣＣ)を検
出するものである。尚、連結画素群(ＣＣ)の検出方法と
しては、左右、上下である４連結画素群を取り出す方法
もあるが、斜め方向の画素が切れてしまうことから、斜
め方向の繋がりも考慮して、経験上、８連結画素群を採
用することが好ましい。The image data reduced from the reduction unit 12 is input to a connected pixel group (CC) detection unit 13. In the connected pixel group (CC) detection unit 13 in the present embodiment, an eight connected pixel group, that is, a connected pixel group (Connected Component: C
C) is all extracted. The eight connected pixel group is for detecting a connected pixel group (CC) by capturing eight pixels that are left, right, up, down, and oblique with respect to the center pixel, and extracting the connection of these black pixels. As a method of detecting the connected pixel group (CC), there is also a method of extracting four connected pixel groups, that is, left, right, and up and down. From experience, it is preferable to employ an 8-connected pixel group.

【００１８】閾値格納部１５には文字サイズを決定する
際に用いる連結画素群(ＣＣ)の閾値が格納されている。
文字サイズ連結画素群(CharＣＣ)抽出部１４では、連結
画素群(ＣＣ)検出部１３により検出された連結画素群
(ＣＣ)をこの閾値格納部１５に格納された閾値と比較
し、閾値内連結画素群である文字サイズ連結画素群(Cha
rＣＣ)を抽出している。この閾値格納部１５には、ま
ず、連結画素群(ＣＣ)の高さまたは幅が所定量以上の値
を取らないものとして最大定数値が格納されており、例
えば１１０といった閾値が格納されている。この値は、
印刷等がなされた郵便物における大きな模様や長い罫線
を取り除くために設けられるもので、４本/ｍｍの解像
度で１１０画素、即ち約２７ｍｍを超える連結画素群
(ＣＣ)は文字サイズ連結画素群(CharＣＣ)から除かれる
ように定められている。また、連結画素群(ＣＣ)の長方
形領域(矩形領域)が所定大きさ以下の値を取らないよう
な条件として、長方形領域の短いものが３以上、長いも
のが８以上という値も格納されている。この値は、細い
線はラベルにおけるエッジ等、ノイズの可能性があるこ
とや、破線等を取り除く意味から、３画素×８画素以下
の長方形領域を取り除くように定められている。The threshold value storage unit 15 stores a threshold value of a connected pixel group (CC) used for determining a character size.
In the character size connected pixel group (CharCC) extraction unit 14, the connected pixel group detected by the connected pixel group (CC) detection unit 13
(CC) is compared with the threshold value stored in the threshold value storage unit 15, and a character size connected pixel group (Cha
rCC). First, the threshold storage unit 15 stores a maximum constant value assuming that the height or width of the connected pixel group (CC) does not exceed a predetermined amount, and stores a threshold value such as 110, for example. . This value is
It is provided to remove large patterns and long ruled lines in printed mail, etc., and has 110 pixels at a resolution of 4 lines / mm, that is, a connected pixel group exceeding about 27 mm.
(CC) is defined so as to be excluded from the character size connected pixel group (CharCC). Also, as a condition that the rectangular area (rectangular area) of the connected pixel group (CC) does not take a value smaller than a predetermined size, a value that the rectangular area is 3 or more and the rectangular area is 8 or more is also stored. I have. This value is determined so as to remove a rectangular area of 3 pixels × 8 pixels or less in order to remove a thin line from a noise, such as an edge in a label, and to remove a broken line.

【００１９】一方、文字サイズ連結画素群(CharＣＣ)抽
出部１４により抽出された文字サイズ連結画素群(Char
ＣＣ)は、サイズ判定部１６に入力される。このサイズ
判定部１６では、例えば、大き目の文字サイズ連結画素
群(CharＣＣ)のピッチを測定し、そのピッチが大きいも
のは手書き文字の可能性が高く、小さいものは印刷文字
の可能性が高い、との判定がなされるように構成されて
いる。これは、一般に手書き文字だと文字間のピッチが
ばらつくことに鑑み、ばらつきが見られた場合には印刷
文字の可能性が低いと判定するものである。このサイズ
判定部１６の結果を受けて、手書き文字の可能性が高い
と判定される場合には手書き文字列抽出部２０にて文字
列の抽出を実行し、印刷文字の可能性が高いと判定され
る場合には印刷文字列抽出部４０にて文字列の抽出が実
行される。尚、サイズ判定部１６の判定は、上記の方法
に限られず、例えば、サイズ毎の文字サイズ連結画素群
(CharＣＣ)の頻度分布を測定して判定する等、任意の方
法により、またこれらの組み合わせにて判定できるよう
に構成することも可能である。On the other hand, the character size connected pixel group (CharCC) extracted by the character size connected pixel group (CharCC)
CC) is input to the size determination unit 16. In the size determination unit 16, for example, the pitch of a large character size connected pixel group (CharCC) is measured, and the larger the pitch, the higher the possibility of a handwritten character, and the smaller the pitch, the higher the possibility of a printed character. Is determined. This is because, in general, the pitch between characters varies in the case of handwritten characters, and if variations occur, it is determined that the possibility of a printed character is low. When it is determined that the possibility of a handwritten character is high based on the result of the size determination unit 16, a character string is extracted by the handwritten character string extraction unit 20, and it is determined that the possibility of a print character is high. In this case, a character string is extracted by the print character string extracting unit 40. Note that the determination by the size determination unit 16 is not limited to the above method.
It is also possible to configure so that the determination can be made by an arbitrary method, such as by measuring the frequency distribution of (CharCC), or by a combination thereof.

【００２０】次に、本実施の形態における手書き文字列
抽出部２０の構成について説明する。サイズ判定部１６
により手書き文字の可能性が高いと判定された場合に、
文字列方向が縦方向と横方向とを仮定し、それぞれの処
理を独立に実行する。まず、文字列方向が横方向である
と仮定する処理の場合、横方向伸長部２１では、抽出さ
れた文字サイズ連結画素群(CharＣＣ)に対し、その仮定
文字列方向である横方向に伸ばし、その仮定文字列方向
と直交する方向である縦方向に縮める処理を行う。この
伸縮された文字サイズ連結画素群(CharＣＣ)は、Ｈｏｒ
第１バッファ２３に格納される。また、細長連結画素群
(LongＣＣ)抽出部２５では、この伸縮された文字サイズ
連結画素群(CharＣＣ)を結合させて横方向の細長連結画
素群(LongＣＣ)を抽出する。横方向評価部２７は、抽出
された細長連結画素群(LongＣＣ)の連結状態を評価し、
その評価結果を文字列選定部３０に出力する。Next, the configuration of the handwritten character string extracting unit 20 in the present embodiment will be described. Size determination unit 16
If it is determined that the possibility of handwritten characters is high,
Assuming that the character string direction is the vertical direction and the horizontal direction, each processing is executed independently. First, in the case of a process assuming that the character string direction is the horizontal direction, the horizontal direction expansion unit 21 expands the extracted character size concatenated pixel group (CharCC) in the horizontal direction that is the assumed character string direction, Processing for contracting in the vertical direction which is a direction orthogonal to the assumed character string direction is performed. This expanded and contracted character size connected pixel group (CharCC) is Hor
The data is stored in the first buffer 23. Also, the elongated connected pixel group
The (LongCC) extraction unit 25 combines the expanded and contracted character size connected pixel groups (CharCC) to extract a horizontally elongated connected pixel group (LongCC). The horizontal evaluation unit 27 evaluates the connected state of the extracted elongated connected pixel group (LongCC),
The evaluation result is output to the character string selection unit 30.

【００２１】一方、文字列方向が縦方向であると仮定す
る処理の場合、縦方向伸長部２２では、抽出された文字
サイズ連結画素群(CharＣＣ)に対し、その仮定文字列方
向である縦方向に伸ばし、その仮定文字列方向と直交す
る方向である横方向に縮める処理を行う。この伸縮され
た文字サイズ連結画素群(CharＣＣ)は、Ｖｅｒ第１バッ
ファ２４に格納される。また、細長連結画素群(LongＣ
Ｃ)抽出部２６では、この伸縮された文字サイズ連結画
素群(CharＣＣ)を結合させて縦方向の細長連結画素群(L
ongＣＣ)を抽出する。縦方向評価部２８は、抽出された
細長連結画素群(LongＣＣ)の連結状態を評価し、その評
価結果を文字列選定部３０に出力する。On the other hand, in the case of processing assuming that the character string direction is the vertical direction, the vertical extension unit 22 applies the extracted character size concatenated pixel group (CharCC) to the vertical direction which is the assumed character string direction. , And shrink in the horizontal direction, which is a direction orthogonal to the assumed character string direction. The expanded and contracted character size connected pixel group (CharCC) is stored in the Ver first buffer 24. In addition, the elongated connected pixel group (Long C
C) The extraction unit 26 combines the expanded and contracted character size connected pixel groups (CharCC) to form a vertically elongated connected pixel group (L
ongCC). The vertical evaluation unit 28 evaluates the connected state of the extracted elongated connected pixel group (LongCC) and outputs the evaluation result to the character string selecting unit 30.

【００２２】文字列選定部３０では、Ｈｏｒ第１バッフ
ァ２３とＶｅｒ第１バッファ２４に格納されている細長
連結画素群(LongＣＣ)を混ぜた状態にて、横方向評価部
２７および縦方向評価部２８から出力された評価点の高
いものから順にソートする。また、文字列閾値格納部２
９に格納されている閾値情報に基づき、ソートされた順
番に閾値以上の評価点であれば文字列と決定している。
ここで、本実施の形態では、後述するように、ある横方
向の細長連結画素群(LongＣＣ)が文字列として選定され
た場合、この横方向の細長連結画素群(LongＣＣ)は、直
交する縦方向の文字列をその後に選定する際の減点材料
とするように構成されている。その為、細長連結画素群
(LongＣＣ)が横方向であれば、直交する減点用のＶｅｒ
第２バッファ３２に減点用矩形領域情報(LongＣＣの外
接長方形)を格納している。同様に、細長連結画素群(Lo
ngＣＣ)が縦方向であれば、直交する減点用のＨｏｒ第
２バッファ３１に減点用矩形領域情報を格納している。
文字列選定部３０では、順次、評価点の減点を実施しな
がら文字列候補に順序を付けてソートし直し、この評価
点の高いものから画像認識装置３に対して文字列情報を
送り込んでいる。画像認識装置３では、２値化部１１か
らの全体画像の２値化信号と、文字列選定部３０からの
文字列情報に基づいて、通常の場合、複数の宛名候補領
域が抽出されるが、評価点の高い、優先順の高い領域か
ら順に宛名読み取りが実行され、満足な結果が得られた
段階で読み取り作業を終了させている。The character string selection unit 30 mixes the elongated connected pixel group (LongCC) stored in the Hor first buffer 23 and the Ver first buffer 24 with the horizontal evaluation unit 27 and the vertical evaluation unit. 28 are sorted in descending order of evaluation score. In addition, the character string threshold storage unit 2
Based on the threshold information stored in No. 9, if the evaluation points are equal to or higher than the threshold in the sorted order, the character string is determined.
Here, in the present embodiment, as described later, when a certain horizontal elongated connected pixel group (LongCC) is selected as a character string, this horizontal elongated connected pixel group (LongCC) The character string of the direction is configured to be used as a deduction material when subsequently selected. Therefore, the elongated connected pixel group
If (LongCC) is the horizontal direction, the orthogonal deduction points for Ver
The second buffer 32 stores deduction point rectangular area information (circumscribed rectangle of LongCC). Similarly, the elongated connected pixel group (Lo
If (ngCC) is the vertical direction, the deduction point rectangular area information is stored in the orthogonal second point deduction Hor second buffer 31.
The character string selecting unit 30 sequentially sorts and sorts the character string candidates while sequentially reducing the evaluation points, and sends the character string information to the image recognition device 3 in descending order of the evaluation points. . In the image recognition device 3, a plurality of address candidate areas are normally extracted based on the binarized signal of the entire image from the binarizing unit 11 and the character string information from the character string selecting unit 30. The address reading is performed in order from the area having the highest evaluation score and the highest priority, and the reading operation is completed when a satisfactory result is obtained.

【００２３】次に、本実施の形態における印刷文字列抽
出部４０の構成について説明する。この印刷文字列抽出
部４０では、文字サイズ連結画素群(CharＣＣ)の大きさ
が比較的小さい、例えば閾値として４本/ｍｍの解像度
で３０画素、即ち約８ｍｍ以下の文字について処理され
る。まず、メッシュ画像形成部４１では、画像を小エリ
アに分割して１ポイントとして簡易化した矩形領域を想
定する所謂メッシュ画像を形成している。このメッシュ
の分割方法は、例えば、想定される宛名の中から大き目
の文字サイズに均等する範囲でメッシュの大きさを決定
することが有効であり、例えば、最大の文字サイズ連結
画素群(CharＣＣ)が３０画素であれば、予め定めた大き
さである２４画素×２４画素の小エリアからなる矩形領
域でメッシュに分割するように構成されている。Next, the configuration of the print character string extracting unit 40 in the present embodiment will be described. The print character string extraction unit 40 processes characters having a relatively small character size concatenated pixel group (CharCC), for example, 30 pixels at a resolution of 4 lines / mm as a threshold, that is, characters of about 8 mm or less. First, the mesh image forming unit 41 forms a so-called mesh image that divides an image into small areas and assumes a simplified rectangular area as one point. This mesh division method is effective, for example, to determine the size of the mesh in a range equivalent to the large character size from the assumed address, for example, the largest character size connected pixel group (CharCC) Is 30 pixels, the mesh is divided into meshes in a rectangular area consisting of a small area of 24 pixels × 24 pixels, which is a predetermined size.

【００２４】次に、対応メッシュ画像検出部４２では、
抽出された閾値内連結画素群である文字サイズ連結画素
群(CharＣＣ)が、メッシュ画像形成部４１により想定さ
れた矩形領域であるメッシュ画像のどれに属するか(ど
れに対応するか)を検出し、対応矩形領域を選定してメ
ッシュ画像をＯＮとしている。この検出の方法として
は、例えば、抽出された文字サイズ連結画素群(CharＣ
Ｃ)の中心座標が属するメッシュ画像をＯＮとする方法
が挙げられる。また、メッシュ画像形成部４１にて小さ
目のメッシュが想定されている場合には、文字サイズ連
結画素群(CharＣＣ)の長方形が少しでも重なっているメ
ッシュ画像をＯＮとする方法もある。この対応メッシュ
画像検出部４２からの出力を受け、メッシュ画像連結検
出部４３では、ＯＮとなったメッシュの連結状態を検出
する。例えば、一行分離れていても同一領域とするため
に縦と横に関してＯＮとなったメッシュに挟まれている
１単位のメッシュをＯＮとする。また、メッシュ画像の
中で、メッシュが縦、横に繋がっている４連結や、それ
に斜めの繋がりを考慮した８連結の連結メッシュ(Conne
cted Mesh:ＣＭ)を検出している。Next, in the corresponding mesh image detecting section 42,
The character size connected pixel group (CharCC), which is the extracted intra-threshold connected pixel group, detects which (corresponds to) which mesh image is a rectangular area assumed by the mesh image forming unit 41. , The corresponding rectangular area is selected and the mesh image is turned ON. As a method of this detection, for example, an extracted character size connected pixel group (CharC
A method of turning on the mesh image to which the center coordinate of C) belongs is exemplified. Further, when a small mesh is assumed by the mesh image forming unit 41, there is a method of turning on a mesh image in which the rectangles of the character size connected pixel group (CharCC) slightly overlap. Upon receiving the output from the corresponding mesh image detection unit 42, the mesh image connection detection unit 43 detects the connection state of the meshes that have been turned ON. For example, in order to make the same area even if separated by one line, one unit of mesh sandwiched between meshes that are turned on in the vertical and horizontal directions is turned on. Also, in the mesh image, a 4-connected mesh in which the mesh is connected vertically and horizontally or an 8-connected connected mesh (Conne
cted Mesh: CM).

【００２５】このメッシュ画像連結検出部４３からの結
果を受け、属性情報格納部４４からの属性情報を加味し
て特定領域抽出部４５により宛名領域候補が抽出され
る。この属性情報格納部４４には、宛名領域候補とし
て、そのサイズが小さすぎず大きすぎないものを選定す
るための閾値情報が格納されている。また、例えば、２
行以上の文字列があるか、十分な数の連結画素群(ＣＣ)
があるか、文字列の形がそれらしいか等の属性情報が格
納されている。また、郵便物の宛名情報は、通常、その
郵便物の中央に寄っていることが多いことから、全体イ
メージの端に寄っていないか、領域が中央に近いか等の
属性情報も格納されている。特定領域抽出部４５では、
この属性情報格納部４４からの属性情報に基づいて、例
えばメッシュ画像連結検出部４３による連結メッシュ
(ＣＭ)に対して評価点を付けることで宛名領域候補であ
る特定領域を抽出している。この結果、宛名領域を数個
に限定することができ、実際に宛名を認識する画像認識
装置３にこの結果を出力することで、宛名の読み取り精
度を格段に向上させることができると共に、宛名読み取
りの高速化を図ることができる。印刷文字列抽出部４０
の後処理装置である画像認識装置３では、２値化部１１
からの全体画像の２値化信号と、特定領域抽出部４５か
らの領域情報に基づいて、例えば評価点の高い、優先順
の高い領域から順に宛名読み取りが実行され、満足な結
果が得られた段階で読み取り作業を終了させている。Upon receiving the result from the mesh image connection detecting section 43, the specific area extracting section 45 extracts a destination area candidate in consideration of the attribute information from the attribute information storing section 44. The attribute information storage unit 44 stores threshold information for selecting a destination area candidate whose size is neither too small nor too large. Also, for example, 2
There is a character string of more than lines, or a sufficient number of connected pixel groups (CC)
Is stored, and attribute information such as whether or not the character string is in the form is stored. In addition, since the address information of a mail is usually shifted to the center of the mail, attribute information such as whether or not the area is closer to the end of the entire image or whether the area is closer to the center is also stored. I have. In the specific area extraction unit 45,
Based on the attribute information from the attribute information storage unit 44, for example, the connected mesh by the mesh image connection detection unit 43
By assigning an evaluation point to (CM), a specific area that is a destination area candidate is extracted. As a result, the address area can be limited to several, and by outputting this result to the image recognition device 3 that actually recognizes the address, the address reading accuracy can be remarkably improved and the address reading can be performed. Can be speeded up. Print character string extraction unit 40
In the image recognition device 3 which is a post-processing device, the binarizing unit 11
For example, based on the binarized signal of the whole image from and the area information from the specific area extraction unit 45, the address reading is performed in order from the area having the highest evaluation point and the highest priority, and a satisfactory result is obtained. The reading operation is completed at the stage.

【００２６】このように、本実施の形態における印刷文
字列抽出部４０では、１画素１ピクセルである画素単位
の世界から、所定の大きさを有するエリアであるメッシ
ュの世界へ移し、このメッシュを１ポイントとして簡易
化する方法を採用しているために、既存の方式に比べて
簡単なアルゴリズムで高速に宛名領域を抽出できる。更
に、宛名の文字サイズやメッシュの連結性、属性情報等
を加味して抽出することで、精度の高い領域抽出を実現
している。また、評価点を付与して優先順位の高い領域
から宛名読み取りを実行することで、無駄な宛名読み取
り作業を極力、少なくし、更に効率的かつ迅速な宛名読
み取りが可能となる。As described above, the print character string extraction unit 40 according to the present embodiment shifts the world from a pixel unit, which is one pixel per pixel, to a mesh world, which is an area having a predetermined size. Since the simplification method is adopted as one point, the address area can be extracted at a higher speed with a simpler algorithm than the existing method. Furthermore, highly accurate area extraction is realized by performing extraction in consideration of the character size of the address, the connectivity of the mesh, attribute information, and the like. In addition, by performing the address reading from an area having a high priority with an evaluation point assigned, useless address reading work can be minimized and the address reading can be performed more efficiently and quickly.

【００２７】次に、本実施の形態における郵便物の宛名
読み取り作業の流れを、図１〜図１４を用いて説明す
る。ここで、図２は、本実施の形態における手書き文字
列を主とする宛名読み取りの大まかなフローを示してい
る。また、図３〜図８は第１の実用例として、元画像
(図３)、文字サイズ連結画素群(CharＣＣ)の抽出状態
(図４)、横方向細長連結画素群(横方向LongＣＣ)の抽出
状態(図５)、縦方向細長連結画素群(縦方向LongＣＣ)の
抽出状態(図６)、文字列を選定して順位付けをした状態
(図７)、元画像と重ね合わせて抽出文字列を表現したも
の(図８)を示している。Next, the flow of a mail address reading operation according to the present embodiment will be described with reference to FIGS. Here, FIG. 2 shows a rough flow of address reading mainly with a handwritten character string in the present embodiment. FIGS. 3 to 8 show a first practical example as an original image.
(Figure 3), extraction state of character size connected pixel group (CharCC)
(FIG. 4), the extraction state of the horizontal elongated connected pixel group (horizontal LongCC) (FIG. 5), the extraction state of the vertical elongated connected pixel group (vertical LongCC) (FIG. 6), and the character strings are selected and ranked. Attached
FIG. 7 shows an extracted character string superimposed on the original image (FIG. 8).

【００２８】まず、画像入力装置２により郵便物の画像
が読み取られる(ステップ１０１)。図３に示すような郵
便物５０には、宛名情報５１の他、郵便番号５２や配信
情報５３、差出人情報５４や絵柄５５等、多種多様な画
像情報が含まれている。これらの画像情報は２値化部１
１により適切な方法にて２値化され(ステップ１０２)、
その２値化画像に対して連結画素群(ＣＣ)検出部１３に
より連結画素群(ＣＣ)が検出されてメモリ(図示せず)に
登録される(ステップ１０３)。この連結画素群(ＣＣ)
は、文字サイズ連結画素群(CharＣＣ)抽出部１４にて閾
値格納部１５に格納されている閾値と比較され、大きす
ぎず小さすぎない文字サイズ連結画素群(CharＣＣ)が抽
出される(ステップ１０４)。図４は、文字サイズ連結画
素群(CharＣＣ)５６を黒い長方形にし、理解し易いよう
に元画像に重ね合わせた状態を示している。図４に示す
ように、ほぼ全ての文字が多種サイズの文字サイズ連結
画素群(CharＣＣ)５６として抽出されている。但し、
「・」(点)や「二」の文字は、前述の閾値の条件を満た
していないことから抽出されていない。また、絵柄５５
を構成する多くの部分も同様に抽出されていないことが
理解できる。First, an image of a mail is read by the image input device 2 (step 101). The postal matter 50 as shown in FIG. 3 includes, in addition to the address information 51, various kinds of image information such as a postal code 52, distribution information 53, sender information 54, a picture 55, and the like. These image information are stored in a binarizing unit 1
1 is binarized by an appropriate method (step 102),
The connected pixel group (CC) is detected from the binarized image by the connected pixel group (CC) detection unit 13 and registered in a memory (not shown) (step 103). This connected pixel group (CC)
Is compared with the threshold value stored in the threshold value storage unit 15 by the character size connected pixel group (CharCC) extraction unit 14, and a character size connected pixel group (CharCC) that is neither too large nor too small is extracted (step 104). ). FIG. 4 shows a state in which the character size connected pixel group (CharCC) 56 is formed into a black rectangle and is superimposed on the original image for easy understanding. As shown in FIG. 4, almost all characters are extracted as a character size connected pixel group (CharCC) 56 of various sizes. However,
The characters “•” (dot) and “2” are not extracted because they do not satisfy the threshold condition described above. Also, picture 55
It can be understood that many parts constituting are not similarly extracted.

【００２９】次に、文字列方向が縦方向と横方向を仮定
し、以下の処理が独立して実行される(ステップ１０
５)。まず、連結画素群伸縮部としての横方向伸長部２
１および縦方向伸長部２２では、各文字サイズ連結画素
群(CharＣＣ)を囲む長方形を形成し、その縦と横の長い
方をＬとし、その長方形を仮定文字列方向(縦または横)
の両側にＬずつ伸ばし、仮定文字列方向と直交する方向
には、例えば(２/３)・Ｌだけ縮小され(ステップ１０
６)、横方向と縦方向に、それぞれ細長連結画素群(Long
ＣＣ)が形成される。Next, assuming that the character string direction is the vertical direction and the horizontal direction, the following processing is executed independently (step 10).
5). First, a horizontal extension unit 2 as a connected pixel group expansion / contraction unit
1 and the vertical extension unit 22 form a rectangle surrounding each character size concatenated pixel group (CharCC), the longer one of the length and width is L, and the rectangle is assumed to be the character string direction (vertical or horizontal).
Is extended by L on each side, and is reduced by, for example, (2/3) · L in the direction orthogonal to the assumed character string direction (step 10).
6) In the horizontal direction and the vertical direction, the elongated connected pixel group (Long
CC) is formed.

【００３０】図５は、各文字サイズ連結画素群(CharＣ
Ｃ)を囲む長方形を横方向に伸ばし、縦方向に縮めた状
態を、理解し易いように元画像に重ね合わせた図であ
る。但し、元画像の文字イメージは以下の文字列選定の
処理には用いられない。図５に示すように、文字サイズ
連結画素群(CharＣＣ)の外接長方形を横方向に伸長する
ことにより、横方向に連なった横方向の細長連結画素群
(LongＣＣ)５７が形成される。また、縦方向に縮小する
ことで隣り合う文字列同士の接触を防いでいる。また、
図６は、各文字サイズ連結画素群(CharＣＣ)を囲む長方
形を縦方向に伸ばし、横方向に縮めた状態を、理解し易
いように元画像に重ね合わせた図である。同図に示すよ
うに、文字サイズ連結画素群(CharＣＣ)の外接長方形の
長い方をＬとし、その長方形を縦方向の両側にＬずつ伸
ばし、横方向には長方形を縮小(例えば(２/３)・Ｌ)する
ことにより、縦方向に連なった縦方向の細長連結画素群
(LongＣＣ)５８が形成される。ここで、本実施の形態で
は、外接長方形の伸縮の値を固定値とはしておらず、文
字サイズ連結画素群(CharＣＣ)のサイズ(Ｌ)から決定し
ている。これは、文字(特に手書き)サイズのバリエーシ
ョンが大きく、固定値とするのが妥当でないと共に、文
字間隔は文字サイズにほぼ比例すると考えられるためで
ある。FIG. 5 is a diagram showing each character size connected pixel group (CharC
FIG. 11 is a diagram in which a rectangle surrounding C) is extended in the horizontal direction and contracted in the vertical direction, and is superimposed on the original image for easy understanding. However, the character image of the original image is not used for the following character string selection processing. As shown in FIG. 5, the circumscribed rectangle of the character size connected pixel group (CharCC) is expanded in the horizontal direction, so that the horizontal elongated connected pixel group connected in the horizontal direction is formed.
(LongCC) 57 is formed. Also, by reducing the size in the vertical direction, contact between adjacent character strings is prevented. Also,
FIG. 6 is a diagram in which a rectangle surrounding each character size concatenated pixel group (CharCC) is extended in the vertical direction and contracted in the horizontal direction, and is superimposed on the original image for easy understanding. As shown in the figure, the longer one of the circumscribed rectangles of the character size connected pixel group (CharCC) is L, the rectangle is extended by L on both sides in the vertical direction, and the rectangle is reduced in the horizontal direction (for example, (2/3) ) · L) to form a vertically elongated connected pixel group connected in the vertical direction.
(LongCC) 58 is formed. Here, in the present embodiment, the value of expansion and contraction of the circumscribed rectangle is not a fixed value, but is determined from the size (L) of the character size connected pixel group (CharCC). This is because the variation in character (especially handwriting) size is large, and it is not appropriate to use a fixed value, and the character spacing is considered to be almost proportional to the character size.

【００３１】次に、その仮定文字列方向への伸長によ
り、細長連結画素群(LongＣＣ)抽出部２５および２６で
は、文字サイズ連結画素群(CharＣＣ)を結合した細長連
結画素群(LongＣＣ)を抽出し、横方向評価部２７と縦方
向評価部２８にて評価点が付けられる(ステップ１０
７)。例えば、仮定文字列方向に長いほど評価点が高
く、例えば、連結画素群(ＣＣ)が、細長連結画素群(Lon
gＣＣ)の中に凹凸が少なく並んでいるほど評価点が高く
なる。但し、この段階では、図３に示される横方向の文
字群である「田」「営」「庁」「(株)」は、本来の文字
列とは異なり直交方向に並ぶ擬似文字列であるにもかか
わらず、図５の細長連結画素群(LongＣＣ)６１のように
比較的長くなることから、高い評価点が付与される。Next, the elongated connected pixel group (LongCC) extraction units 25 and 26 extract the elongated connected pixel group (LongCC) obtained by combining the character size connected pixel group (CharCC) by the extension in the assumed character string direction. Then, evaluation points are given by the horizontal evaluation unit 27 and the vertical evaluation unit 28 (step 10).
7). For example, the evaluation point is higher as the length is longer in the assumed character string direction.For example, the connected pixel group (CC) is
The less the unevenness in gCC), the higher the evaluation point. However, at this stage, unlike the original character strings, the character groups in the horizontal direction shown in FIG. 3, such as “ta”, “ei”, “government”, and “stock”, are pseudo character strings arranged in the orthogonal direction. Nevertheless, since it is relatively long like the elongated connected pixel group (LongCC) 61 in FIG. 5, a high evaluation point is given.

【００３２】次に、文字列選定部３０では、２つのバッ
ファ(Ｈｏｒ第１バッファ２３、Ｖｅｒ第１バッファ２
４)内の細長連結画素群(LongＣＣ)を混ぜて、評価点の
高い順にソートする(ステップ１０８)。そして、最も評
価点の高い細長連結画素群(LongＣＣ)から順にチェック
し、文字列閾値格納部２９に格納されている閾値以上の
評価点を有していれば、文字列と決定する。決定された
文字列と交差している直交細長連結画素群(直交LongＣ
Ｃ)は、その交差面積などに応じて評価点を減点される
(ステップ１０９)。前述のように、文字列と決定された
細長連結画素群(LongＣＣ)に対し、直交バッファ(Ｈｏ
ｒ第２バッファ３１、Ｖｅｒ第２バッファ３２)に減点
用の黒長方形(LongＣＣの外接長方形)が書き込まれてい
る。この構成により、例えば交差部分の長さだけ短くさ
れる等、決定された文字列と交差している直交方向の細
長連結画素群(LongＣＣ)は、その黒長方形との交差面積
などに応じて評価点が減点される。以降、文字列選定部
３０は、次の順位の細長連結画素群(LongＣＣ)を直交バ
ッファ(Ｈｏｒ第２バッファ３１、Ｖｅｒ第２バッファ
３２)内の黒長方形による減点を考慮しながら再評価
し、合格点(例えば長さが閾値以上で減点面積が２０％
以下)の細長連結画素群(LongＣＣ)を文字列として抽出
する。この抽出結果は、順次、減点用の直交バッファ
(Ｈｏｒ第２バッファ３１、Ｖｅｒ第２バッファ３２)に
書き込まれる。不合格の場合には、減点用の黒長方形は
書き込まれない。これにより、文字列の密な領域であっ
ても本来の文字方向に統一することが可能となる。Next, the character string selection unit 30 includes two buffers (the first Hor buffer 23 and the first Ver buffer 2).
(4) The elongated connected pixel group (LongCC) in (4) is mixed and sorted in descending order of evaluation points (step 108). Then, checking is performed in order from the longest connected pixel group (LongCC) having the highest evaluation point. If the evaluation point is equal to or more than the threshold value stored in the character string threshold value storage unit 29, the character string is determined. Orthogonal elongated connected pixel group that intersects with the determined character string (orthogonal LongC
In C), the evaluation points are deducted according to the intersection area etc.
(Step 109). As described above, for the elongated connected pixel group (LongCC) determined to be a character string, the orthogonal buffer (Ho) is used.
A black rectangle for deduction (circumscribed rectangle of LongCC) is written in the r second buffer 31 and the Ver second buffer 32). With this configuration, for example, the elongated connected pixel group (LongCC) in the orthogonal direction that intersects with the determined character string, such as being shortened by the length of the intersection, is evaluated according to the area of the intersection with the black rectangle. Is deducted. Thereafter, the character string selection unit 30 re-evaluates the elongated connected pixel group (LongCC) of the next rank in consideration of the deduction due to the black rectangle in the orthogonal buffer (Hor second buffer 31, Ver second buffer 32), Passing point (for example, if the length is above the threshold and the deduction area is 20%
The following is extracted as a character string. This extraction result is sequentially stored in a quadrature buffer for deduction points.
(Hor second buffer 31, Ver second buffer 32). In case of failure, no black rectangle for deduction is written. This makes it possible to unify the original character direction even in a dense area of the character string.

【００３３】ここで、減点用の黒長方形は、細長連結画
素群(LongＣＣ)そのものでも良い。また、文字列をチェ
ックする順序は、最初のソート順で固定せず、１つ文字
列が決定される毎にダイナミックに残りの細長連結画素
群(LongＣＣ)をソートし直すことが効果的である。但
し、かかる場合には、処理スピードが遅くなることが懸
念されるため、処理スピードとその効果との兼ね合いで
採用することが好ましい。図７は、文字列を選定して順
位付けをした状態を具体的に示している。図中の数字
(１〜１０)は、上位１位から１０位までの文字列候補に
順位付けをしたものである。また、英字(ａ〜ｆ)は、縦
の候補と横の候補の交差部分を、処理順に示したもので
ある。縦長の１位、３位、４位、９位、横長の２位、６
位、７位の文字列候補は、減点なしに文字列として選定
される。図３の郵便番号５２に対応する４位の横長文字
列は、「ａ」の交差部分だけ減点されるが、その比率が
小さいので選択される。８位の縦長文字列は「ｂ」の交
差部分だけ減点されて選択される。１０位(図５の細長
連結画素群(LongＣＣ)６１)である図３の文字列「田」
「営」「庁」「(株)」は、「ｃ」「ｄ」「ｅ」［ｆ］の
交差部分が減点されることで選択されない。その結果、
本来の文字列ではない、直交方向に並ぶ擬似文字列の選
定を防ぐことが可能となる。Here, the black rectangle for deduction may be the elongated connected pixel group (LongCC) itself. In addition, it is effective that the order of checking the character strings is not fixed in the initial sort order, and that the remaining elongated connected pixel group (LongCC) is dynamically rearranged each time one character string is determined. . However, in such a case, there is a concern that the processing speed will be slowed down. Therefore, it is preferable to adopt the method in consideration of the processing speed and its effect. FIG. 7 specifically shows a state in which character strings are selected and ranked. Number in figure
(1 to 10) ranks character string candidates from the first to tenth. The alphabetic characters (a to f) indicate the intersection of the vertical candidate and the horizontal candidate in the order of processing. 1st, 3rd, 4th, 9th in portrait, 2nd in landscape, 6
The character string candidates at the 7th and 7th places are selected as character strings without deduction. The horizontal character string at the fourth place corresponding to the postal code 52 in FIG. 3 is deducted only at the intersection of "a", but is selected because its ratio is small. The eighth position of the vertically long character string is selected by deducting only the intersection of "b". The character string “ta” in FIG. 3 which is the tenth place (the elongated connected pixel group (LongCC) 61 in FIG. 5)
“Y”, “Agency”, and “Co.” are not selected because the intersections of “c”, “d”, “e”, and “f” are deducted. as a result,
It is possible to prevent the selection of a pseudo character string that is not an original character string and is arranged in the orthogonal direction.

【００３４】尚、細長連結画素群(LongＣＣ)の評価点の
閾値によっては、細長連結画素群(LongＣＣ)の交差部分
が縦と横の両方向の文字列に属することがあるが、処理
画像がメモリ(図示せず)に前もって登録してある郵便全
体の宛名パターンのどれに属するかを推定するなどし
て、各文字サイズ連結画素群(CharＣＣ)が複数の文字列
に属さないようにする(ステップ１１０)。また別法とし
て、文字列選定部３０は、縦と横の重複に関し、それぞ
れの文字列の整合性等を再吟味して、各文字サイズ連結
画素群(CharＣＣ)が唯一の文字列に属するようにするこ
ともできる。図８は、元画像と重ね合わせて抽出文字列
を表現したものである。同図において、例えば、郵便番
号文字列６２は、宛名文字列５９や文字列６３と交差し
ているが、この画像は、文字列パターンから、郵便番号
枠付きと、縦書き手書き宛名との組み合わせであると推
定できる。例えば、細長連結画素群(LongＣＣ)の配置
が、図７に示すような上方に存在する５位の横長の文字
列と、その横長文字列の下方に交差し、または少し離れ
た位置に配置された複数の縦長の文字列群１位、８位、
３位、９位、４位が存在する場合を考察する。この場
合、横長の文字列が郵便番号文字列であり、縦長の文字
列群は、右側に存在するほど住所である確率が高く、左
側に存在するほど宛名である可能性が高いと判断し、認
識処理を行う。また、細長連結画素群(LongＣＣ)が横方
向に数段を構成して配置されている場合には、１番上が
郵便番号、そして、住所と宛名が順に出現する可能性が
高いと判断し、その後の認識処理を行う。このようにし
て、この実用例では、郵便番号文字列６２が優先され、
宛名文字列５９および文字列６３から「０」「３」
「５」等の文字サイズ連結画素群(CharＣＣ)が除かれ
る。文字列群中からの郵便番号文字列の推定は、文字列
中の文字サイズ連結画素群(CharＣＣ)の個数、その文字
列の位置、宛名パターンに関する知識などを用いて行う
ことができる。郵便番号文字列と認識された文字列に対
しては、他の文字列の認識処理に先立って、手書きの数
字の辞書とのマッチングにより認識処理が行われ、ま
た、宛名文字列に対しては、手書き名前辞書を用いた認
識処理が行われる。住所文字列に対しては、郵便番号か
ら検索された住所との比較、手書き住所辞書を用いた認
識処理等を複合した認識処理が実行される。ここで、図
８における「ＩＢＭ」の文字群６０は、横書きではある
が、縦書きの文字列６３が先に選定されたために、それ
によって減点されて文字列としては選択されていない。
これは、文字列の抽出という単独処理ではなく、文字列
６３における認識のフィードバック等によって処理され
るべきものである。Depending on the threshold value of the evaluation point of the elongated connected pixel group (LongCC), the intersection of the elongated connected pixel group (LongCC) may belong to both the vertical and horizontal character strings. (Not shown) so that each character size connected pixel group (CharCC) does not belong to a plurality of character strings by estimating to which of the address patterns of the entire mail registered in advance in advance (step 110). Alternatively, the character string selection unit 30 may reconsider the consistency of each character string with respect to vertical and horizontal overlap, and make sure that each character size connected pixel group (CharCC) belongs to only one character string. You can also FIG. 8 shows an extracted character string superimposed on the original image. In the figure, for example, the postal code character string 62 intersects with the address character string 59 and the character string 63, and this image is obtained by combining a postal code frame with a vertically written handwritten address from a character string pattern. It can be estimated that For example, the arrangement of the elongated connected pixel group (LongCC) is such that the fifth-place horizontally long character string as shown in FIG. First and eighth vertical character string groups,
Consider the case where the third, ninth and fourth positions exist. In this case, it is determined that the horizontal character string is a postal code character string, and the vertical character string group is more likely to be an address as it exists on the right side, and is more likely to be an address as it exists on the left side, Perform recognition processing. In addition, when the elongated connected pixel group (LongCC) is arranged in several rows in the horizontal direction, it is determined that there is a high possibility that the postal code appears at the top, and that the address and the address appear in order. , And perform subsequent recognition processing. Thus, in this practical example, the zip code string 62 has priority,
"0""3" from the address character string 59 and the character string 63
The character size connected pixel group (CharCC) such as “5” is excluded. The estimation of the postal code character string from the character string group can be performed using the number of character size concatenated pixel groups (CharCC) in the character string, the position of the character string, knowledge of the address pattern, and the like. For a character string recognized as a postal code character string, recognition processing is performed by matching a handwritten numeral with a dictionary prior to recognition processing of another character string. , A recognition process using a handwritten name dictionary is performed. For the address character string, a recognition process is performed in which a comparison with an address retrieved from a postal code, a recognition process using a handwritten address dictionary, and the like are performed. Here, the character group 60 of “IBM” in FIG. 8 is written horizontally, but since the character string 63 written vertically is selected first, it is deducted and is not selected as a character string.
This is not a single process of extracting a character string, but is to be processed by feedback of recognition in the character string 63 or the like.

【００３５】このように、本実施の形態における手書き
文字列からなる宛名文字列の抽出方法によれば、論理的
に文字サイズ連結画素群(CharＣＣ)を結合する他の手法
に比べ、柔軟性があり処理も容易である。また、優先順
位の高い細長連結画素群(LongＣＣ)から文字列を決定
し、且つそれに直交する細長連結画素群(LongＣＣ)の評
価点を減点するように構成しており、直交方向への偽り
の細長連結画素群(LongＣＣ)を文字列として抽出するの
を防いでいる。その結果、文字列の並びが不規則な宛名
でも実用に充分に耐え得るものである。As described above, according to the method of extracting a destination character string composed of a handwritten character string in the present embodiment, flexibility is higher than that of another method of logically combining character size connected pixel groups (CharCC). Yes, processing is easy. In addition, a character string is determined from the narrow connected pixel group (LongCC) having a high priority, and the evaluation point of the narrow connected pixel group (LongCC) orthogonal to the character string is reduced. This prevents extraction of the elongated connected pixel group (LongCC) as a character string. As a result, even an address having an irregular arrangement of character strings can sufficiently withstand practical use.

【００３６】次に、本実施の形態における、主として印
刷文字の宛名読み取り作業の流れを、図１、および図９
〜図１４を用いて説明する。ここで、図９は印刷文字の
宛名読み取りの大まかな流れを説明するためのフローチ
ャートである。また、図１０〜図１４は第２の実用例と
して、元画像(図１０)、文字サイズ連結画素群(CharＣ
Ｃ)の抽出状態(図１１)、メッシュ画像をＯＮした状態
(図１２)、統合化してメッシュ画像をＯＮした状態(図
１３)、抽出された宛名領域候補(図１４)を示した図で
ある。Next, FIG. 1 and FIG.
This will be described with reference to FIGS. FIG. 9 is a flowchart for explaining a general flow of reading the address of a print character. FIGS. 10 to 14 show a second practical example of an original image (FIG. 10) and a character size connected pixel group (CharC
C) extraction state (Fig. 11), mesh image ON
FIG. 12 is a diagram showing extracted address area candidates (FIG. 14) in a state where mesh images are turned on after integration (FIG. 13).

【００３７】まず、画像入力装置２により郵便物の画像
が読み取られる(ステップ１２１)。図１０に示すような
郵便物７０には、宛名情報７１の他、料金別納印刷７
２、配信情報７３、商標等の表示７４、連絡情報７５、
背景印刷情報７６等、多種多様な画像情報が含まれてい
る。これらの画像情報は２値化部１１により適切な方法
にて２値化され(ステップ１２２)、連結画素群(ＣＣ)検
出部１３により連結画素群(ＣＣ)が検出されてメモリ
(図示せず)に登録される(ステップ１２３)。この連結画
素群(ＣＣ)の中から、大きすぎず小さすぎない文字サイ
ズ連結画素群(CharＣＣ)が文字サイズ連結画素群(Char
ＣＣ)抽出部１４にて抽出される(ステップ１２４)。
尚、前述の手書き文字列抽出時と異なり、この印刷文字
の宛名領域の読み取りでは、閾値の最大値が小さい(例
えば３０画素)ことが挙げられる。図１１では、文字サ
イズ連結画素群(CharＣＣ)を黒い長方形にし、理解し易
いように元画像に重ね合わせたものである。この実用例
では、多くの文字情報が抽出されているが、商標等の表
示７４の文字や模様、背景印刷情報７６の「Ｍ」の文字
８２や「２０００」の文字８３は、３０画素を超える大
きな文字として除外されている。また、絵柄８４も省か
れている。尚、宛名情報７１の「川」の文字８０や
「二」の文字８１は抽出されていない。First, an image of a mail is read by the image input device 2 (step 121). The mail 70 as shown in FIG.
2, distribution information 73, display 74 of trademark, etc., contact information 75,
Various types of image information such as background print information 76 are included. The image information is binarized by an appropriate method by the binarization unit 11 (step 122), and the connected pixel group (CC) is detected by the connected pixel group (CC) detection unit 13 and stored in the memory.
(Not shown) (step 123). From the connected pixel group (CC), a character size connected pixel group (CharCC) that is not too large or not too small is a character size connected pixel group (Char
CC) is extracted by the extraction unit 14 (step 124).
Note that, unlike the above-described handwritten character string extraction, the maximum value of the threshold is small (for example, 30 pixels) in reading the address area of the printed character. In FIG. 11, the character size connected pixel group (CharCC) is a black rectangle, and is superimposed on the original image for easy understanding. In this practical example, a lot of character information is extracted, but the characters and patterns of the display 74 such as a trademark and the character 82 of “M” and the character 83 of “2000” in the background print information 76 exceed 30 pixels. Excluded as large letters. Also, the picture 84 is omitted. It should be noted that characters 80 of “river” and characters 81 of “two” in the address information 71 are not extracted.

【００３８】次に、文字サイズ連結画素群(CharＣＣ)の
中心画素を、メッシュ画像形成部４１にて想定された矩
形領域である、簡易化したメッシュ画像のどれに属する
かを対応メッシュ画像検出部４２にて計算し、そのメッ
シュを黒メッシュとしている(ステップ１２５)。図１２
に示されるように、ここでは、画像を２４画素×２４画
素の小エリアでメッシュ８８に分割している。ここで
は、メッシュ８８の境界線は表示されていない。また、
図１０にて説明した宛名情報７１の「―」文字部が白に
なっている。このような場合を補間するために、次のス
テップ処理が実行される。即ち、メッシュ画像連結検出
部４３では、１行分や１文字分の隙間があっても統合化
するために、上下か左右が黒メッシュである白メッシュ
を黒に変換する(ステップ１２６)。その結果が図１３に
示されている。図１２と比較して明らかなように、１行
分や１文字分の隙間があっても統合化がなされて黒に変
換されていることが理解できる。尚、この実用例では行
間を埋める必要性は低いが、通常は、ギャップを埋める
ことにより行間が埋められ、離れた２行を統合化できる
点でこの処理は有効である。これにより、宛名領域の可
能性のある部分が１つの連結メッシュ(ＣＭ)としてのブ
ロックとして把握することが可能となる。Next, a corresponding mesh image detecting unit determines which central pixel of the character size connected pixel group (CharCC) belongs to a simplified mesh image which is a rectangular area assumed by the mesh image forming unit 41. The calculation is made at 42, and the mesh is set as a black mesh (step 125). FIG.
Here, the image is divided into meshes 88 in a small area of 24 pixels × 24 pixels. Here, the boundary of the mesh 88 is not displayed. Also,
The “−” character part of the address information 71 described in FIG. 10 is white. In order to interpolate such a case, the following step processing is executed. That is, the mesh image connection detection unit 43 converts a white mesh, which is a black mesh on the top and bottom or left and right, into black in order to integrate even if there is a gap for one line or one character (step 126). The result is shown in FIG. As is apparent from comparison with FIG. 12, it can be understood that even if there is a gap corresponding to one line or one character, it is integrated and converted to black. In this practical example, although it is not necessary to fill the space between lines, this processing is effective in that the space between lines is usually filled by filling the gap, and two separated lines can be integrated. As a result, it is possible to grasp a portion that may be an address area as a block as one connected mesh (CM).

【００３９】次に、特定領域抽出部４５では、１メッシ
ュを１画素のごとく見做し、８連結を囲む長方形を宛名
領域の候補とする(ステップ１２７)。図１４の破線で囲
まれた部分がこのステップにより抽出された部分であ
り、領域９０〜９４が宛名領域の候補に該当する。ここ
で、図１４に示されている郵便物７０では、図１０で説
明した料金別納印刷７２、配信情報７３、商標等の表示
７４、および背景印刷情報７６の一部の連結メッシュ
(ＣＭ)が選定されていないことが解る。これらは、例え
ば文字列が１行であったり、充分な連結画素群(ＣＣ)が
存在しない等の理由により、宛名領域の候補から除外さ
れたものである。次に、領域内で文字抽出等が行われ、
属性情報格納部４４が有する種々の属性で各領域が評価
され、宛名候補として順位付けがなされる(ステップ１
２８)。この種々の属性は、前述したように、画像の端
に寄っていないか、文字列の形がそれらしいか等の内容
であり、これによって、図１４では、上から順に、領域
９０が第１候補、９１が第２、９２が第３、９３が第
４、９４が第５として順位付けがなされる。次に、優先
順位の高い領域順に、画像認識装置３にて宛名の認識が
行われ、満足な結果が得られた場合には、そこで宛名認
識を終了させる(ステップ１２９)。これらの処理の流れ
によって、本実施の形態における一連の郵便宛名の読み
取りが終了する。Next, the specific area extracting section 45 regards one mesh as one pixel, and sets a rectangle surrounding eight links as a candidate for a destination area (step 127). The part enclosed by the broken line in FIG. 14 is the part extracted in this step, and the areas 90 to 94 correspond to the candidates for the destination area. Here, in the postal matter 70 shown in FIG. 14, the fee-based payment printing 72, the distribution information 73, the display 74 of the trademark and the like described in FIG.
(CM) is not selected. These are excluded from the candidates for the destination area because, for example, the character string is one line, or a sufficient connected pixel group (CC) does not exist. Next, character extraction is performed in the area,
Each area is evaluated based on various attributes of the attribute information storage unit 44, and is ranked as address candidates (step 1).
28). As described above, these various attributes are contents such as whether the character is not close to the end of the image or whether the character string is appropriate. Thus, in FIG. Candidates, 91 are second, 92 are third, 93 are fourth, and 94 are fifth. Next, the address recognition is performed by the image recognition device 3 in the order of the regions having the highest priority, and if a satisfactory result is obtained, the address recognition is terminated there (step 129). With the flow of these processes, a series of reading of postal addresses in the present embodiment ends.

【００４０】本実施の形態による印字文字列抽出のアル
ゴリズムによれば、宛名領域以外に差出人の領域や様々
な模様を含んだ郵便物に対し、簡単なアルゴリズムで宛
名領域を含む矩形領域を抽出することが可能となる。即
ち、各領域候補が、全体イメージに比べて面積が非常に
狭く、模様やノイズから開放されていること、また、領
域内の文字列が垂直方向か水平方向かが予測できること
から、かかる領域候補に対して宛名認識を実行すること
で検出精度の向上と処理の簡潔化、およびそれに伴う処
理速度の向上を図ることができる。その為に、このアル
ゴリズムによれば、領域内の特にプリンタで印刷された
印字の宛名領域はかなり高い確率にて抽出することが可
能となる。また、手書きであっても、欧米の郵便のよう
に横書きで纏めて書かれた宛名に対して精度良く抽出す
ることが可能となる。According to the print character string extraction algorithm according to the present embodiment, a rectangular area including the address area is extracted by a simple algorithm for a mail including the sender area and various patterns in addition to the address area. It becomes possible. That is, since each area candidate has a very small area compared to the whole image and is free from patterns and noises, and since it is possible to predict whether the character string in the area is vertical or horizontal, such area candidates are By performing the address recognition for, the detection accuracy can be improved, the processing can be simplified, and the processing speed can be improved accordingly. For this reason, according to this algorithm, it is possible to extract, with a considerably high probability, a destination area of a print in the area, in particular, a print printed by a printer. Further, even with handwriting, it is possible to accurately extract an address written in a horizontal manner such as a postal mail in Europe and America.

【００４１】このように、本実施の形態によれば、印刷
文字のように纏まって宛名が記載されている場合や、手
書きであっても欧米の郵便のように横書きで纏めて書か
れた宛名に対しては、前述の印刷文字列抽出のアルゴリ
ズムによって簡単に処理し、日本語手書き宛名のよう
に、横書き、縦書きが入り混じって不規則な宛名に対し
ては、前述の手書き文字列抽出のアルゴリズムを適用し
て文字列を抽出することができる。その結果、宛名書き
の種類を選ぶことなく、精度高く高速に、宛名領域を含
む文字列を抽出することが可能となる。As described above, according to the present embodiment, a case where addresses are collectively written like printed characters, or a case where addresses are written in a horizontal manner like European and American mail even if they are handwritten Is easily processed by the above-described algorithm for extracting printed character strings. For irregular addresses that are mixed with horizontal and vertical writing, such as Japanese handwritten addresses, the aforementioned handwritten character string extraction is performed. A character string can be extracted by applying the algorithm of (1). As a result, it is possible to accurately and quickly extract a character string including an address area without selecting a type of address writing.

【００４２】[0042]

【発明の効果】以上説明したように、本発明によれば、
全画素内での複雑な統合化処理を避け、精度高く高速に
手書き宛名等の文字列を抽出することができる。As described above, according to the present invention,
A complicated integration process in all pixels can be avoided, and a character string such as a handwritten address can be extracted with high accuracy and high speed.

[Brief description of the drawings]

【図１】本実施の形態における画像処理装置の全体構
成を示した説明図である。FIG. 1 is an explanatory diagram illustrating an overall configuration of an image processing apparatus according to an embodiment.

【図２】本実施の形態における手書き文字列を主とす
る宛名読み取りの大まかな流れを説明するためのフロー
チャートである。FIG. 2 is a flowchart for explaining a general flow of address reading mainly with a handwritten character string in the present embodiment.

【図３】第１の実用例における元画像を示す図であ
る。FIG. 3 is a diagram showing an original image in a first practical example.

【図４】第１の実用例における文字サイズ連結画素群
(CharＣＣ)の抽出状態を示した図である。FIG. 4 shows a character size connected pixel group in the first practical example.
It is the figure which showed the extraction state of (CharCC).

【図５】第１の実用例における横方向細長連結画素群
(横方向LongＣＣ)の抽出状態を示した図である。FIG. 5 shows a horizontally elongated connected pixel group in the first practical example.
It is the figure which showed the extraction state of (lateral direction LongCC).

【図６】第１の実用例における縦方向細長連結画素群
(縦方向LongＣＣ)の抽出状態を示した図である。FIG. 6 shows a vertically elongated connected pixel group in the first practical example.
FIG. 14 is a diagram illustrating an extraction state of (vertical LongCC).

【図７】第１の実用例における文字列を選定して順位
付けをした状態を示した図である。FIG. 7 is a diagram showing a state in which character strings in the first practical example are selected and ranked.

【図８】第１の実用例における元画像と重ね合わせて
抽出文字列を表現した図である。FIG. 8 is a diagram showing an extracted character string superimposed on an original image in the first practical example.

【図９】印刷文字の宛名読み取りの大まかな流れを説
明するためのフローチャートである。FIG. 9 is a flowchart for explaining a general flow of reading an address of a print character.

【図１０】第２の実用例における元画像を示す図であ
る。FIG. 10 is a diagram showing an original image in a second practical example.

【図１１】第２の実用例における文字サイズ連結画素
群(CharＣＣ)の抽出状態を示した図である。FIG. 11 is a diagram showing an extraction state of a character size connected pixel group (CharCC) in a second practical example.

【図１２】第２の実用例におけるメッシュ画像をＯＮ
した状態を示した図である。FIG. 12 turns on the mesh image in the second practical example.
FIG.

【図１３】第２の実用例における統合化してメッシュ
画像をＯＮした状態を示した図である。FIG. 13 is a diagram showing a state in which mesh images are turned on after being integrated in the second practical example.

【図１４】第２の実用例における抽出された宛名領域
候補を示した図である。FIG. 14 is a diagram illustrating extracted destination area candidates in a second practical example.

[Explanation of symbols]

１…文字列抽出装置、２…画像入力装置、３…画像認識
装置、１１…２値化部、１２…縮小部、１３…連結画素
群(ＣＣ)検出部、１４…文字サイズ連結画素群(CharＣ
Ｃ)抽出部、１５…閾値格納部、１６…サイズ判定部、
２１…横方向伸長部、２２…縦方向伸長部、２３…Ｈｏ
ｒ第１バッファ、２４…Ｖｅｒ第１バッファ、２５,２
６…細長連結画素群(LongＣＣ)抽出部、２７…横方向評
価部、２８…縦方向評価部、２９…文字列閾値格納部、
３０…文字列選定部、３１…Ｈｏｒ第２バッファ、３２
…Ｖｅｒ第２バッファ、４０…印刷文字列抽出部、４１
…メッシュ画像形成部、４２…対応メッシュ画像検出
部、４３…メッシュ画像連結検出部、４４…属性情報格
納部、４５…特定領域抽出部DESCRIPTION OF SYMBOLS 1 ... Character string extraction device, 2 ... Image input device, 3 ... Image recognition device, 11 ... Binarization part, 12 ... Reduction part, 13 ... Connection pixel group (CC) detection part, 14 ... Character size connection pixel group ( CharC
C) Extraction unit, 15: threshold storage unit, 16: size determination unit,
21: horizontal extension, 22: vertical extension, 23: Ho
r first buffer, 24... Ver first buffer, 25, 2
6 ... elongated connected pixel group (LongCC) extraction unit, 27 ... horizontal evaluation unit, 28 ... vertical evaluation unit, 29 ... character string threshold storage unit,
30: character string selection unit, 31: Hor second buffer, 32
... Ver second buffer, 40... Print character string extraction unit, 41
... Mesh image forming section, 42... Corresponding mesh image detecting section, 43... Mesh image connection detecting section, 44... Attribute information storing section, 45.

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 3F079 AA02 AA03 CA02 CA03 CA11 CB08 CB29 CB31 CB35 5B029 AA01 CC28 EE14 5B064 AA04 CA08 5L096 AA07 BA08 BA17 EA02 EA03 FA64 FA67 GA15 GA51 ──────────────────────────────────────────────────続き Continued on the front page F term (reference) 3F079 AA02 AA03 CA02 CA03 CA11 CB08 CB29 CB31 CB35 5B029 AA01 CC28 EE14 5B064 AA04 CA08 5L096 AA07 BA08 BA17 EA02 EA03 FA64 FA67 GA15 GA51

Claims

[Claims]

1. A step of extracting a connected pixel group of pixels constituting an input image, and comparing a size of the extracted connected pixel group with a predetermined threshold to extract a connected pixel group within a threshold. And extracting a connected elongated pixel group in the vertical or horizontal direction of the extracted connected pixel group within the threshold, and a character string based on the connected state of the extracted connected elongated pixel group. Extracting a candidate.

2. The step of extracting a connected pixel group within a threshold value includes comparing a vertical direction and / or a horizontal direction of the connected pixel group with a predetermined vertical direction and / or a horizontal direction of an assumed character size. 2. The character string extracting method according to claim 1, wherein the intra-threshold connected pixel group is extracted by:

3. The method according to claim 1, wherein the step of extracting the elongated connected pixel group comprises extracting the elongated connected pixel group by changing a size of the extended connected pixel group based on the size of the extracted intra-threshold connected pixel group. 2. The character string extraction method according to claim 1, wherein:

4. The method according to claim 1, wherein the step of extracting the group of elongated connected pixels includes the step of contracting the horizontal direction when the group of connected pixels within the threshold value is elongated in the vertical direction, and shrinking the vertical direction when the group of elongated connected pixels is elongated in the horizontal direction. 2. A connected pixel group is extracted.
The described character string extraction method.

5. The step of extracting the elongated connected pixel group is characterized in that the elongated connected pixel group is extracted by changing an amount of contraction based on the size of the extracted connected pixel group within the threshold. The character string extraction method according to claim 4.

6. Assuming a mesh image obtained by dividing the image into a predetermined size, turning on the mesh image corresponding to the extracted connected pixel group within the threshold, and concatenating the mesh images turned on. 6. The method according to claim 5, further comprising: extracting a specific area from the state.

7. The mesh image is turned on when the image is a matched character string such as a print character as an address character string.
7. The method according to claim 6, further comprising the step of: extracting the elongated connected pixel group when the image is an inconsistent character string such as Japanese handwritten characters as an address character string. The described character string extraction method.

8. A pixel group having a predetermined size is extracted from an image including an input handwritten character, and a string of a character string in the expanded direction is obtained by extending one direction of the pixel group and contracting the other direction. A handwritten character string extraction method, comprising: detecting a string state of a plurality of assumed character strings; and estimating an address pattern from the detected string state of the character string.

9. The detected state of the string of the character string includes the length of the string and the unevenness state of the string, and the order in which the character recognition processing is performed is the one in which the length of the string is long or 9. The method for extracting a handwritten character string according to claim 8, wherein the order is raised for / and the one with the small unevenness state.

10. The string of the assumed character string is extended by adding a predetermined ratio to the size of the pixel group with respect to the vertical or horizontal one of the pixel groups, 9. The string of the character string is assumed based on a connection state of a plurality of pixel groups that are reduced, reduced, expanded, and reduced by reducing a predetermined ratio to the size of the pixel group with respect to the vertical. Extraction method of handwritten character string of description.

11. A connected pixel group detecting section for detecting a connected pixel group formed from black connected pixels in a binarized image, and among the connected pixel groups detected by the connected pixel group detecting section. A character size connected pixel group extraction unit that extracts a character size connected pixel group that is not too large and not too small, and the character size connected pixel group extracted by the character size connected pixel group extraction unit, in the assumed character string direction. Is expanded and contracted in the direction orthogonal to the assumed character string direction to expand and contract the character size connected pixel group, and a plurality of the connected pixel groups expanded and contracted by the connected pixel group expandable unit. An elongated connected pixel group extraction unit for extracting an elongated connected pixel group by combining in the assumed character string direction; and performing image recognition based on the elongated connected pixel group extracted by the elongated connected pixel group extraction unit. Character string extraction apparatus characterized by comprising a string selection unit for selecting a can string.

12. The character size connected pixel group extracting unit,
12. The character string extraction device according to claim 11, wherein the character size connection pixel group is extracted based on a threshold value in consideration of a size of a handwritten character written on a postal matter.

13. An image input unit for inputting an image, a pixel group extraction unit for extracting a pixel group having a predetermined size from the image input by the image input unit, and an image extracted by the pixel group extraction unit. A vertically-extended pixel group generating means for defining a vertical direction and a horizontal direction with respect to the pixel group, and generating a vertically-extended pixel group in which the pixel group is expanded and contracted in the horizontal direction in the vertical direction; A vertical direction and a horizontal direction are defined with respect to the pixel group extracted by the pixel group extracting unit, and a horizontal expanded pixel group is generated in which the pixel group is expanded in the horizontal direction and contracted in the vertical direction. A horizontal extension pixel group generation unit; a plurality of the vertical extension pixel groups generated by the vertical extension pixel group generation unit; and a plurality of the horizontal extension pixels generated by the horizontal extension pixel group generation unit. The,
Elongate connected pixel group extraction means for extracting an elongate connected pixel group connected in the extended direction, and a character string for extracting a character string candidate based on the elongate connected pixel group extracted by the elongate connected pixel group extraction means An image processing apparatus comprising: a candidate extracting unit; and an image recognizing unit that performs image recognition on a character string candidate extracted by the character string candidate extracting unit.

14. The character string candidate extraction unit outputs a plurality of the elongated connected pixel groups extracted by the elongated connected pixel group extraction unit with a higher priority to those having a longer length. The image processing apparatus according to claim 13, wherein:

15. The character string candidate extraction means determines a priority by lowering an evaluation point for a pixel group at which the vertical extension pixel group and the horizontal extension pixel group intersect. The image processing device according to claim 14.