JPH0765124A

JPH0765124A - Character recognition device

Info

Publication number: JPH0765124A
Application number: JP5213885A
Authority: JP
Inventors: Katsunosuke Kiko; 克之助喜古
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1993-08-30
Filing date: 1993-08-30
Publication date: 1995-03-10

Abstract

PURPOSE:To provide the character recognition device which can evade a scan on the whole input image for character detection and segmentation before recognition and can recognize complicated character detection and segmentation candidates. CONSTITUTION:A memory 21 is stored with a binary image of the entire surface of postal matter which is divided, line by line. An MPU 22 traces an outline from the 1st line of the memory 21 and writes structure feature information on the coordinate values, the number of black dots, and respective outline points of a block connected in black each time the block is found. An MPU 24 performs a recognition processing by matching arithmetic between the structure feature information in the memory 23 and dictionary information of a dictionary 26 and writes the recognition result in the memory 23. An MPU 25 performs a recognition processing by composite similarity arithmetic, based upon the block coordinate values in the memory 23, between the character image in the memory 21 and the dictionary information in a dictionary 27 and writes the recognition result in the memory 23. An MPU 28 stops the MPUs 22, 24, and 25 from operating when a limit time is elapsed, and determines a final recognition result uniquely by referring to a section address data base 29.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、たとえば、郵便物上の
宛名情報を光学的に読取って郵便物を区分する郵便物宛
名自動読取区分機などにおいて、郵便物上に自由手書き
によって表記された宛名情報の文字を認識する文字認識
装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention is, for example, a mail address automatic reading / sorting device that optically reads address information on a mail item to classify the mail item. The present invention relates to a character recognition device that recognizes characters of address information.

【０００２】[0002]

【従来の技術】一般に、たとえば、郵便物上に自由手書
きによって表記された宛名情報の文字を認識する文字認
識装置においては、画像処理により、冗長な文字検出切
出し候補を生成する。そして、この文字検出切出し候補
に対して、複合類似度法や構造解析によるパターン認識
により認識処理を行なう。2. Description of the Related Art Generally, for example, in a character recognition device for recognizing a character of address information written by handwriting on a postal matter, redundant character detection cutout candidates are generated by image processing. Then, the character detection / cutout candidate is subjected to recognition processing by pattern recognition by the composite similarity method or structure analysis.

【０００３】ところで、文字検出切出し候補は、入力画
像の文字記載の自由度に比例して増加する。自由手書き
文字の認識の場合、文字検出切出し候補は非常に冗長に
なり、文字認識に時間的負担をかける。また、文字認識
の過程で算出した特徴を保持することは、大きな負担と
なる。By the way, the number of character detection / cutout candidates increases in proportion to the degree of freedom of character description of an input image. In the case of recognition of free handwritten characters, the character detection / cutout candidates become very redundant, which imposes a time burden on the character recognition. In addition, holding the features calculated in the process of character recognition is a heavy burden.

【０００４】また、文字検出切出し処理は、入力画像全
体を走査して行なうため、文字認識を開始する前に一度
は全画像を走査する必要がある。しかし、入力画像は冗
長であり、不必要な画像処理に多くの時間を割当てるこ
とになる。さらに、入力画像全体に対する画像処理は、
不要領域が多く存在する場合は、認識が困難なほど冗長
な文字検出切出し候補を生成する。Further, since the character detection / cutout processing is performed by scanning the entire input image, it is necessary to scan the entire image at least once before starting character recognition. However, the input image is redundant, and a lot of time is allotted to unnecessary image processing. Furthermore, image processing for the entire input image is
When there are many unnecessary areas, redundant character detection cutout candidates are generated that are difficult to recognize.

【０００５】[0005]

【発明が解決しようとする課題】従来は、文字検出切出
しの際に、入力画像全領域に対して走査するために、不
要領域が多い場合、非常に冗長な文字検出切出し候補を
生成する。また、文字検出切出し処理を行ないながら認
識処理を行なう場合でも、認識に必要なラべリング処理
のため、認識処理の前に全画像を走査する必要がある。
全画像を走査するということは、文字検出切出しの処理
を一部行なっていることになる。Conventionally, in the case of character detection / cutout, a very redundant character detection / cutout candidate is generated when there are many unnecessary areas because the entire area of the input image is scanned. Even when the recognition process is performed while the character detection / cutout process is performed, the entire image needs to be scanned before the recognition process because of the labeling process necessary for the recognition.
Scanning the entire image means that part of the character detection / cutout processing is performed.

【０００６】従来は、文字検出切出し処理と認識処理が
独立して行なわれていたため、時間的コストの増大や文
字検出切出し候補の頻発が発生していた。文字検出切出
しの時点では、認識対象物の概念は存在しないので、認
識による文字検出切出しも提案されているが、文字検出
切出しの前処理が存在する。Conventionally, since the character detection / cutout processing and the recognition processing are performed independently, the time cost increases and the character detection / cutout candidates frequently occur. At the time of character detection cutout, since the concept of the recognition target object does not exist, character detection cutout by recognition is also proposed, but there is preprocessing for character detection cutout.

【０００７】そこで、本発明は、認識前の文字検出切出
しのための入力画像全体に対する走査を回避することが
でき、しかも、単純な多角形で表現できない複雑な文字
検出切出し候補の認識も可能となる文字認識装置を提供
することを目的とする。Therefore, according to the present invention, it is possible to avoid scanning the entire input image for character detection cutout before recognition, and it is also possible to recognize a complicated character detection cutout candidate that cannot be expressed by a simple polygon. An object of the present invention is to provide a character recognition device.

【０００８】[0008]

【課題を解決するための手段】本発明の文字認識装置
は、被読取物上の文字が存在する画像を入力する画像入
力手段と、この画像入力手段で入力された画像に対して
逐次輪郭追跡を行なうことにより画像の構造特徴を抽出
する構造特徴抽出手段と、この構造特徴抽出手段と並列
的に動作し、前記構造特徴抽出手段で構造特徴が抽出さ
れると、その構造特徴を用いて構造特徴マッチング演算
を行なうことにより認識処理を行なう認識手段とを具備
している。A character recognition apparatus according to the present invention comprises an image inputting means for inputting an image in which characters are present on an object to be read, and successive contour tracing for the image input by the image inputting means. And a structure feature extracting unit that extracts the structure feature of the image by performing the above operation, and the structure feature extracting unit operates in parallel. When the structure feature extracting unit extracts the structure feature, the structure feature is used. A recognition means for performing recognition processing by performing feature matching calculation is provided.

【０００９】また、本発明の文字認識装置は、被読取物
上の文字が存在する画像を入力する画像入力手段と、こ
の画像入力手段で入力された画像に対して逐次輪郭追跡
を行なうことにより画像の構造特徴を抽出する構造特徴
抽出手段と、この構造特徴抽出手段と並列的に動作し、
前記構造特徴抽出手段で構造特徴が抽出されると、その
構造特徴に基づき、前記画像入力手段で入力された画像
を用いて複合類似度演算を行なうことにより認識処理を
行なう認識手段とを具備している。Further, the character recognition apparatus of the present invention comprises an image input means for inputting an image in which a character is present on an object to be read, and a contour tracing for the image input by the image input means. Structural feature extraction means for extracting structural features of an image, and the structural feature extraction means operate in parallel,
And a recognition unit that performs a recognition process by performing a composite similarity calculation using the image input by the image input unit based on the structure feature when the structure feature extraction unit extracts the structure feature. ing.

【００１０】さらに、本発明の文字認識装置は、被読取
物上の文字が存在する画像を入力する画像入力手段と、
この画像入力手段で入力された画像に対して逐次輪郭追
跡を行なうことにより画像の構造特徴を抽出する構造特
徴抽出手段と、この構造特徴抽出手段と並列的に動作
し、前記構造特徴抽出手段で構造特徴が抽出されると、
その構造特徴を用いて構造特徴マッチング演算を行なう
ことにより認識処理を行なう第１の認識手段と、前記構
造特徴抽出手段と並列的に動作し、前記構造特徴抽出手
段で構造特徴が抽出されると、その構造特徴に基づき、
前記画像入力手段で入力された画像を用いて複合類似度
演算を行なうことにより認識処理を行なう第２の認識手
段と、前記第１の認識手段の認識結果と前記第２の認識
手段の認識結果とにより最終的な１つの認識結果を出力
する手段とを具備している。Further, the character recognition device of the present invention comprises image input means for inputting an image in which characters on the object to be read exist.
Structural feature extracting means for extracting structural features of the image by sequentially performing contour tracing on the image input by the image inputting means, and operating in parallel with the structural feature extracting means. Once the structural features are extracted,
When the structure feature extraction unit operates in parallel with the first recognition unit that performs the recognition process by performing the structure feature matching operation using the structure feature, and the structure feature extraction unit extracts the structure feature. , Based on its structural features,
Second recognition means for performing recognition processing by performing composite similarity calculation using the image input by the image input means, recognition result of the first recognition means and recognition result of the second recognition means And means for outputting one final recognition result.

【００１１】[0011]

【作用】画像処理の段階で保持している画像の中に認識
対象物が含まれているので、認識対象概念を直接作用さ
せることにより、文字検出切出し処理と認識処理を同時
に行なうことができ、認識前の文字検出切出しのための
入力画像全体に対する走査を回避することができる。Since the recognition target object is included in the image held at the image processing stage, the character detection cut-out processing and the recognition processing can be simultaneously performed by directly applying the recognition target concept. It is possible to avoid scanning the entire input image for character detection clipping before recognition.

【００１２】また、認識処理の一部が文字検出切出し処
理を兼ねていて、文字検出切出しの段階で完全に対象画
像を捕えているので、単純な多角形で表現できない複雑
な文字検出切出し候補の認識も可能である。Further, a part of the recognition processing also serves as character detection / cutout processing, and the target image is completely captured at the stage of character detection / cutout, so that a complicated character detection / cutout candidate that cannot be expressed by a simple polygon is used. Recognition is also possible.

【００１３】[0013]

【実施例】以下、本発明の一実施例について図面を参照
して説明する。図７は、本実施例が適用される郵便物宛
名自動読取区分機の構成を概略的に示すものである。す
なわち、この郵便物宛名自動読取区分機は、多数の郵便
物Ｐが一括かつ立位状態でセットされる供給部１、この
供給部１にセットされた郵便物Ｐを最前部から順に１枚
ずつ取出す取出部２、この取出部２で取出された郵便物
Ｐを搬送する取込搬送路３、この取込搬送路３で搬送さ
れる郵便物Ｐ上の宛名情報を光学的に読取る読取部４、
この読取部４で宛名情報が読取られた郵便物Ｐをその読
取結果に基づいて区分する区分部５によって構成されて
いる。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. FIG. 7 schematically shows the configuration of a mailing address automatic reading / sorting machine to which this embodiment is applied. In other words, this automatic mail address sorter and sorter has a supply unit 1 in which a large number of postal products P are set in a batch and standing state, and one postal product P set in the supply unit 1 in order from the front. A take-out section 2 for taking out, a take-in conveyance path 3 for conveying the postal matter P taken out by the take-out section 2, and a reading section 4 for optically reading address information on the postal matter P conveyed by the take-in conveyance path 3 ,
The reading unit 4 is configured by a sorting unit 5 that sorts the postal matter P whose address information has been read based on the reading result.

【００１４】区分部５は、読取部４を通過した郵便物Ｐ
を搬送する搬送路６、鉛直方向に設けられた複数段の区
分搬送路７ａ〜７ｇ、この各区分搬送路７ａ〜７ｇに沿
って設けられた多数のポケット（区分箱）８，…から構
成されている。The sorting section 5 is for the postal matter P that has passed through the reading section 4.
And a plurality of pockets (section boxes) 8 provided along each of the segmented transport paths 7a to 7g. ing.

【００１５】なお、供給部１の上部には、オペレータが
操作する操作パネル９が設けられている。しかして、操
作パネル９において、読取モードを指定した後、スター
トボタンを押下することにより、供給部１にセットされ
た郵便物Ｐは、取出部２によって１枚ずつ取出され、取
込搬送路３を搬送されて読取部４に送られる。読取部４
は、搬送される郵便物Ｐ上の宛名情報を光学的に読取
り、その読取結果に基づいて区分部５が動作することに
より、区分指定データに対応するポケット８に区分集積
されるようになっている。An operation panel 9 operated by an operator is provided above the supply unit 1. Then, by pressing the start button after designating the reading mode on the operation panel 9, the postal matter P set in the feeding section 1 is taken out one by one by the take-out section 2, and the take-in conveyance path 3 Is conveyed to the reading unit 4. Reading unit 4
Optically reads the address information on the conveyed postal matter P, and the sorting unit 5 operates based on the read result, whereby the sorting information is sorted and accumulated in the pocket 8 corresponding to the sorting designation data. There is.

【００１６】図６は、読取部４の構成を概略的に示すも
のである。すなわち、画像入力手段としての光電変換部
１１は、郵便物Ｐ上の全面画像を光電変換するもので、
たとえば、郵便物Ｐ上を照明する光源、および、郵便物
Ｐ上からの反射光を受光して電気信号に変換する自己走
査形のＣＣＤイメージセンサなどによって構成されてい
る。FIG. 6 schematically shows the structure of the reading unit 4. That is, the photoelectric conversion unit 11 as an image input unit photoelectrically converts the entire image on the mail P.
For example, it is configured by a light source that illuminates the mail P, a self-scanning CCD image sensor that receives reflected light from the mail P and converts it into an electric signal.

【００１７】光電変換部１１の出力は、住所画像処理部
１２に送られる。住所画像処理部１２では、まず、光電
変換部１１からの入力画像により住所領域を検出し、こ
の検出した住所領域から行領域を検出し、この検出した
行領域内から文字の検出切出しを行ない、文字検出切出
し候補を住所文字認識部１３に送る。The output of the photoelectric conversion unit 11 is sent to the address image processing unit 12. In the address image processing unit 12, first, an address area is detected from the input image from the photoelectric conversion unit 11, a line area is detected from the detected address area, and characters are detected and cut out from the detected line area. The character detection / cutout candidate is sent to the address character recognition unit 13.

【００１８】住所文字認識部１３では、住所画像処理部
１２からの文字検出切出し候補にしたがって住所認識処
理を行ない、その認識結果を数字認識部１４に送る。数
字認識部１４では、住所情報に続く画像領域を検出切出
ししながら数字認識処理を行ない、住所および数字の認
識結果を評価統合して、最終的に認識結果を一意に決定
し、その認識結果（答）を第１のＣＰＵ（セントラル・
プロセッシング・ユニット）１５に送る。The address character recognition unit 13 performs address recognition processing according to the character detection cutout candidates from the address image processing unit 12, and sends the recognition result to the number recognition unit 14. The number recognition unit 14 performs number recognition processing while detecting and cutting out the image area following the address information, evaluates and integrates the recognition results of the address and the number, and finally determines the recognition result uniquely, and the recognition result ( Answer) to the first CPU (Central
To the processing unit) 15.

【００１９】第１のＣＰＵ１５は、読取部４全体の制御
を司るもので、数字認識部１４からの認識結果を第２の
ＣＰＵ１６に転送する。第２のＣＰＵ１６は、区分機全
体の制御を司るもので、認識結果に基づき駆動部１７を
制御することにより、特定のポケット８に郵便物Ｐを区
分集積する。The first CPU 15 controls the entire reading unit 4, and transfers the recognition result from the numeral recognition unit 14 to the second CPU 16. The second CPU 16 controls the entire sorting machine, and controls the drive unit 17 based on the recognition result to sort and collect the postal matter P in a specific pocket 8.

【００２０】図１は、数字認識部１４の構成を詳細に示
すものである。すなわち、共有画像メモリ２１には、住
所画像処理部１２から行単位に分割された郵便物Ｐ全面
の２値画像が転送される。輪郭追跡ＭＰＵ（マイクロ・
プロセッサ・ユニット）２２は、共有画像メモリ２１内
の画像の１行目から輪郭追跡を行なうことにより、黒で
連結されたブロックを発見するごとに、共有画像メモリ
２１中のその座標値、黒点数、各輪郭点などの構造特徴
情報を共有メモリ２３に書込む。FIG. 1 shows the configuration of the numeral recognition unit 14 in detail. That is, the binary image of the entire surface of the postal matter P divided line by line is transferred from the address image processing unit 12 to the shared image memory 21. Contour tracking MPU (micro
The processor unit) 22 performs contour tracing from the first line of the image in the shared image memory 21 to find each block connected in black, the coordinate value and the number of black points in the shared image memory 21. , And writes structural feature information such as each contour point in the shared memory 23.

【００２１】構造マッチングＭＰＵ２４、および、複合
類似度ＭＰＵ２５は、それぞれ複数存在し、共有メモリ
２３に構造特徴情報が書込まれると、認識処理を開始す
る。構造マッチングＭＰＵ２４は、共有メモリ２３内の
構造特徴情報を用い、構造辞書２６内の構造辞書情報と
マッチング演算を行なうことにより、認識処理を行な
い、その認識結果を共有メモリ２３に書込む。There are a plurality of structure matching MPUs 24 and a plurality of composite similarity MPUs 25, respectively, and when structural feature information is written in the shared memory 23, recognition processing is started. The structure matching MPU 24 performs recognition processing by using the structure feature information in the shared memory 23 and matching operation with the structure dictionary information in the structure dictionary 26, and writes the recognition result in the shared memory 23.

【００２２】複合類似度ＭＰＵ２５は、共有画像メモリ
２１内の文字画像に対して、共有メモリ２３内の黒で連
結されたブロックの座標値にしたがって、複合類似度辞
書２７内の辞書情報との複合類似度演算を行なうことに
より、認識処理を行ない、その認識結果を共有メモリ２
３に書込む。The composite similarity MPU 25 combines the character image in the shared image memory 21 with the dictionary information in the composite similarity dictionary 27 according to the coordinate values of the blocks connected in black in the shared memory 23. The recognition process is performed by calculating the similarity, and the recognition result is stored in the shared memory 2.
Write to 3.

【００２３】制御ＭＰＵ２８は、共有メモリ２３を常に
監視しており、認識開始点につながらない認識を制限す
る。制限時間が過ぎると、輪郭追跡ＭＰＵ２２、構造マ
ッチングＭＰＵ２４、および、複合類似度ＭＰＵ２５の
各動作をそれぞれ停止させ、街区住所データベース２９
を参照することにより、最終的な認識結果（答）を一意
に決定する。決定された認識結果は、数字認識部全体の
制御を司るＣＰＵ３０に転送される。ＣＰＵ３０では、
受取った認識結果を数字認識部１４の認識結果として外
部に転送する。The control MPU 28 constantly monitors the shared memory 23 and limits the recognition that does not lead to the recognition start point. When the time limit has passed, the contour tracking MPU 22, the structure matching MPU 24, and the composite similarity MPU 25 are stopped, and the block address database 29 is stopped.
The final recognition result (answer) is uniquely determined by referring to. The determined recognition result is transferred to the CPU 30 that controls the entire number recognition unit. In CPU30,
The received recognition result is transferred to the outside as the recognition result of the numeral recognition unit 14.

【００２４】図２は、数字認識部１４の一部を示してお
り、以下、この図を用いて数字認識部１４をさらに詳細
に説明する。共有画像メモリ２１には、行検出切出しさ
れた画像が格納される。輪郭追跡ＭＰＵ２２は、住所文
字認識部１３の認識結果により、共有画像メモリ２１内
のｃ行（街区情報を示す数字部分）から処理を開始す
る。輪郭追跡ＭＰＵ２２による輪郭追跡は、画像のＹ軸
を主走査としてラスタスキャンすることにより行なう。
そして、黒ブロックを検出すると、その輪郭を追跡し、
構造特徴情報を共有メモリ２３に書込む。FIG. 2 shows a part of the numeral recognition unit 14, and the numeral recognition unit 14 will be described in more detail below with reference to this drawing. The shared image memory 21 stores the line-detected and cut-out image. The contour tracking MPU 22 starts the process from the line c (the number portion indicating the block information) in the shared image memory 21 according to the recognition result of the address character recognition unit 13. The contour tracking by the contour tracking MPU 22 is performed by raster scanning with the Y axis of the image as the main scan.
Then, when a black block is detected, its contour is tracked,
The structural characteristic information is written in the shared memory 23.

【００２５】構造マッチングＭＰＵ２４および複合類似
度ＭＰＵ２５は、常に共有メモリ２３を監視しており、
輪郭追跡ＭＰＵ２２によって新たに構造特徴情報が書込
まれると、処理を開始する。共有メモリ２３内にはメッ
セージ待ち行列があり、構造マッチングＭＰＵ２４およ
び複合類似度ＭＰＵ２５は、待ち行列の先頭をポーリン
グしている。The structure matching MPU 24 and the composite similarity MPU 25 constantly monitor the shared memory 23,
When the structure tracking information is newly written by the contour tracking MPU 22, the processing is started. There is a message queue in the shared memory 23, and the structure matching MPU 24 and the composite similarity MPU 25 poll the head of the queue.

【００２６】輪郭追跡ＭＰＵ２２は、黒ブロックの輪郭
特徴を演算終了時点で、メッセージ待ち行列にフラグを
立て、処理可能なパターンがあることを知らせる。構造
マッチングＭＰＵ２４および複合類似度ＭＰＵ２５は、
上記フラグにより共有メモリ２３から処理可能なパター
ンの情報を得て認識処理を開始する。The contour tracking MPU 22 sets a flag in the message queue at the time when the calculation of the contour feature of the black block is completed, and notifies that there is a pattern that can be processed. The structure matching MPU 24 and the composite similarity MPU 25 are
The flag is used to obtain information on a pattern that can be processed from the shared memory 23 and the recognition process is started.

【００２７】構造マッチングＭＰＵ２４および複合類似
度ＭＰＵ２５は、認識処理が終わると、再びメッセージ
待ち行列をポーリングする。構造マッチングＭＰＵ２４
は、共有メモリ２３から構造特徴情報を読出して、更に
各輪郭点を円弧近似した輪郭特徴情報を演算し、構造辞
書２６の辞書情報とマッチング演算を行ない、その結果
を共有メモリ２３に書込む。The structure matching MPU 24 and the composite similarity MPU 25 poll the message queue again after the recognition process is completed. Structure matching MPU24
Reads the structure characteristic information from the shared memory 23, further calculates the contour characteristic information in which each contour point is approximated by an arc, performs the matching calculation with the dictionary information of the structure dictionary 26, and writes the result to the shared memory 23.

【００２８】複合類似度ＭＰＵ２５は、共有メモリ２３
内の画像の黒で連結されたブロックの座標値を参照する
ことにより、共有画像メモリ２１から対応する画像のみ
を取得する。こうして取得した画像は、正規化および標
本化などの処理を行なった後、複合類似度辞書２７の辞
書情報との間で複合類似度演算を行ない、その結果を共
有メモリ２３に書込む。The composite similarity MPU 25 is shared by the shared memory 23.
Only the corresponding image is acquired from the shared image memory 21 by referring to the coordinate values of the black-connected blocks of the image inside. The image thus obtained is subjected to processing such as normalization and sampling, and is then subjected to composite similarity calculation with the dictionary information of the composite similarity dictionary 27, and the result is written in the shared memory 23.

【００２９】図３は、数字認識部１４における文字検出
切出し候補の例を示している。共有メモリ２３の内容
は、輪郭追跡ＭＰＵ２２によって黒ブロックが検出され
るごとに追加される。認識ユニット１、認識ユニット
２、認識ユニット３、認識ユニット４は、それぞれ構造
マッチングＭＰＵ２４と複合類似度ＭＰＵ２５とから構
成される。それぞれの認識ユニットは、共有メモリ２３
が追加されるごとに、１ブロックないし４ブロックの範
囲で文字検出切出しを行ない、認識する。FIG. 3 shows an example of character detection cutout candidates in the numeral recognition unit 14. The content of the shared memory 23 is added each time the contour tracking MPU 22 detects a black block. The recognition unit 1, the recognition unit 2, the recognition unit 3, and the recognition unit 4 are composed of a structure matching MPU 24 and a composite similarity MPU 25, respectively. Each recognition unit has a shared memory 23.
Each time a is added, character detection cutout is performed and recognized in the range of 1 block to 4 blocks.

【００３０】図中のＰ１では、共有メモリ２３にブロッ
クが１つしかないので、認識ユニット１のみ動作する。
図中のＰ２では、共有メモリ２３にブロックが１つ追加
されて２つになったので、新たに追加されたブロックと
合せた２ブロックのパターンを認識ユニット２で認識
し、新たに追加されたブロックのみを認識ユニット３で
認識する。このように、最大４ブロックを限度とし、認
識ユニット１，２，３，４がそれぞれ違うブロック数を
担当して認識処理を行なう。At P1 in the figure, since the shared memory 23 has only one block, only the recognition unit 1 operates.
In P2 in the figure, one block is added to the shared memory 23 to make two blocks. Therefore, the recognition unit 2 recognizes the pattern of two blocks including the newly added block, and the block is newly added. Only the block is recognized by the recognition unit 3. In this way, the recognition units 1, 2, 3, and 4 are responsible for different numbers of blocks, and the recognition processing is performed with a maximum of 4 blocks.

【００３１】制御ＭＰＵ２８は、認識処理が終了した時
点で認識の精度が高い認識結果を検索する。認識精度の
高い候補Ａを選んだ時点で、その候補Ａを含む他の候補
Ｂは除外される。残った候補のうち、認識できなかった
候補Ｃは除外し、認識できた候補のうち複数の文字検出
切出し候補があるものに関しては、認識結果を再評価し
て答を一意に決定する。The control MPU 28 searches for a recognition result having high recognition accuracy when the recognition process is completed. When the candidate A having high recognition accuracy is selected, other candidates B including the candidate A are excluded. Of the remaining candidates, the unrecognizable candidate C is excluded, and for the recognized candidates that have a plurality of character detection cutout candidates, the recognition result is re-evaluated to uniquely determine the answer.

【００３２】図４は、数字認識部１４における答編集の
例を示している。図中、枠内の太い文字は認識対象を示
し、その隣の（）内の文字は認識結果を示している。ま
た、斜線が引いてある枠の部分は、認識精度の高い候補
を採用することにより、廃棄されるグラフのアークであ
る。また、枠内の左上の数字はアーク番号である。FIG. 4 shows an example of answer editing in the numeral recognition unit 14. In the figure, the thick character in the frame indicates the recognition target, and the character in () next to it indicates the recognition result. Further, the shaded frame portion is an arc of a graph that is discarded by adopting a candidate with high recognition accuracy. The upper left number in the frame is the arc number.

【００３３】認識候補はグラフで表現され、認識ユニッ
ト１，２，３，４の認識結果はグラフのアークに対応す
る。グラフは通常、１つのノードから複数のアークを生
成して冗長な候補を含んでいる。高い認識精度で認識で
きたアーク（アーク番号は５）Ｄが発見された場合は、
アーク番号が４，６，７，９，１０，１１，１２，１５
のアークがグラフから削除される。The recognition candidates are represented by a graph, and the recognition results of the recognition units 1, 2, 3, 4 correspond to arcs on the graph. Graphs typically generate redundant arcs from one node to include redundant candidates. If an arc (arc number is 5) D that can be recognized with high recognition accuracy is found,
Arc number is 4,6,7,9,10,11,12,15
Arcs are removed from the graph.

【００３４】同様に、高い認識精度で認識できたアーク
（アーク番号は２２）Ｅが発見された場合は、アーク番
号が１９，２０，２１，２３，２４，２５，２７，２
８，３１のアークがグラフから削除される。そして、認
識結果の存在しない、アーク番号が１３，１６のアーク
Ｆがグラフから削除される。Similarly, when an arc (arc number 22) E that can be recognized with high recognition accuracy is found, the arc number is 19, 20, 21, 23, 24, 25, 27, 2
The 8,31 arcs are deleted from the graph. Then, the arc F having arc numbers 13 and 16 in which no recognition result exists is deleted from the graph.

【００３５】図５に示すように、えだかりされたグラフ
は最終的な認識候補となり、あらかじめ決められた手続
きでグラフを検索することにより、認識結果「２−１８
−３」を得ることができる。As shown in FIG. 5, the extruded graph becomes a final recognition candidate, and the recognition result "2-18" is obtained by searching the graph by a predetermined procedure.
-3 "can be obtained.

【００３６】なお、図２において、構造マッチングＭＰ
Ｕ２４および複合類似度ＭＰＵ２５の各認識結果が相違
した場合、または、構造マッチングＭＰＵ２４の認識処
理において複数肯定が発生した場合には、あらかじめ決
められた手続きによって答を一意に決定するが、組合わ
せによっては一意に決定できない場合が多い。In FIG. 2, the structure matching MP
If the recognition results of the U24 and the composite similarity MPU25 are different, or if multiple positives occur in the recognition processing of the structure matching MPU24, the answer is uniquely determined by a predetermined procedure. Is often not uniquely determined.

【００３７】共有メモリ２３には、認識対象領域の全構
造特徴情報が保持されているので、必要な候補の構造特
徴をさらに詳細にマッチング演算することにより、新た
な情報を得ることができる。このように、共有メモリ２
３に認識対象領域の全構造特徴情報を保持することによ
り、認識結果を一意に決定できない時点での高速なフィ
ードバックを可能にしている。Since the shared memory 23 holds all the structural feature information of the recognition target area, new information can be obtained by performing a more detailed matching calculation of the necessary candidate structural features. In this way, the shared memory 2
By holding all the structural feature information of the recognition target area in 3, it is possible to perform high-speed feedback when the recognition result cannot be uniquely determined.

【００３８】以上説明したように上記実施例によれば、
次のような効果が期待できる。すなわち、従来、文字認
識は入力画像の全体を走査して文字検出切出しを行な
い、その後で、認識処理を行なっていた。しかし、画像
処理の段階で保持している画像の中に認識対象物が含ま
れているので、認識対象概念を直接作用させることによ
り、文字検出切出し処理と認識処理を同時に行なうこと
ができ、認識前の文字検出切出しのための入力画像全体
に対する走査を回避することができる。As described above, according to the above embodiment,
The following effects can be expected. That is, conventionally, in character recognition, the entire input image is scanned to perform character detection cutout, and then the recognition process is performed. However, since the recognition target object is included in the image held at the image processing stage, the character detection cutout processing and the recognition processing can be performed simultaneously by directly acting the recognition target concept. It is possible to avoid scanning the entire input image for the previous character detection crop.

【００３９】また、従来は座標指定が矩形または多角形
であったので、複雑な文字検出切出しを指定できなかっ
た。しかし、本実施例では認識処理の一部が文字検出切
出しであって、文字検出切出しの段階で完全に対象画像
をとらえているので、単純な多角形で表現できない複雑
な文字検出切出し候補の認識処理も可能である。Further, in the past, since the coordinate designation was rectangular or polygonal, complicated character detection cutout could not be designated. However, in the present embodiment, part of the recognition processing is character detection cutout, and since the target image is completely captured at the stage of character detection cutout, recognition of a complicated character detection cutout candidate that cannot be expressed by a simple polygon. Processing is also possible.

【００４０】さらに、従来は全て原画像に対して処理を
行なっていたが、本実施例のように原画像に対する構造
特徴を保持することにより、答を一意に決定する際にフ
ィードバックでき、画像全体に対する構造特徴を参照で
きる。Further, conventionally, all the original images were processed, but by retaining the structural features of the original images as in the present embodiment, it is possible to feed back when the answer is uniquely determined, and the whole image is fed back. See the structural features for.

【００４１】[0041]

【発明の効果】以上詳述したように本発明によれば、認
識前の文字検出切出しのための入力画像全体に対する走
査を回避することができ、しかも、単純な多角形で表現
できない複雑な文字検出切出し候補の認識も可能となる
文字認識装置を提供できる。As described above in detail, according to the present invention, it is possible to avoid scanning the entire input image for character detection and clipping before recognition, and moreover, to complicate a complicated character that cannot be represented by a simple polygon. It is possible to provide a character recognition device that can also recognize a detection cutout candidate.

[Brief description of drawings]

【図１】数字認識部の構成を示すブロック図。FIG. 1 is a block diagram showing the configuration of a numeral recognition unit.

【図２】数字認識部の一部を示すブロック図。FIG. 2 is a block diagram showing a part of a numeral recognition unit.

【図３】数字認識部における文字検出切出し候補の例を
示す図。FIG. 3 is a diagram showing an example of character detection cutout candidates in a number recognition unit.

【図４】数字認識部における答編集の例を示す図。FIG. 4 is a diagram showing an example of answer editing in a number recognition unit.

【図５】最終的な認識候補のグラフを示す図。FIG. 5 is a diagram showing a graph of final recognition candidates.

【図６】読取部の構成を概略的に示すブロック図。FIG. 6 is a block diagram schematically showing the configuration of a reading unit.

【図７】郵便物宛名自動読取区分機の構成を概略的に示
す構成図。FIG. 7 is a configuration diagram schematically showing a configuration of a mail address automatic reading / sorting machine.

[Explanation of symbols]

Ｐ……郵便物（被読取物）、４……読取部、１１……光
電変換部（画像入力手段）、１２……住所画像処理部、
１３……住所文字認識部、１４……数字認識部、２１…
…共有画像メモリ、２２……輪郭追跡ＭＰＵ、２３……
共有メモリ、２４……構造マチッングＭＰＵ、２５……
複合類似度ＭＰＵ、２６……構造辞書、２７……複合類
似度辞書、２８……制御ＭＰＵ、２９……街区住所デー
タベース。P: postal matter (object to be read), 4 ... reading section, 11 ... photoelectric conversion section (image input means), 12 ... address image processing section,
13: Address character recognition unit, 14: Number recognition unit, 21 ...
… Shared image memory, 22 …… Contour tracking MPU, 23 ……
Shared memory, 24 …… Structure mating MPU, 25 ……
Compound similarity MPU, 26 ... Structural dictionary, 27 ... Compound similarity dictionary, 28 ... Control MPU, 29 ... Block address database.

Claims

[Claims]

1. An image input means for inputting an image in which characters are present on an object to be read, and a structure for extracting structural features of the image by successively performing contour tracing on the image input by the image input means. Feature recognition means operates in parallel with the structure feature extraction means, and when the structure feature extraction means extracts a structure feature, a recognition process is performed by performing a structure feature matching operation using the structure feature. A character recognition device comprising means and.

2. An image input means for inputting an image in which characters are present on an object to be read, and a structure for extracting structural features of the image by successively performing contour tracing on the image input by the image input means. The feature extracting means operates in parallel with the structure feature extracting means, and when the structure feature is extracted by the structure feature extracting means, based on the structure feature, a composite image is obtained using the image input by the image input means. A character recognition device comprising: a recognition unit that performs recognition processing by performing similarity calculation.

3. An image input means for inputting an image in which characters are present on an object to be read, and a structure for extracting structural features of the image by successively performing contour tracing on the image input by the image input means. The feature extracting means operates in parallel with the structure feature extracting means, and when the structure feature is extracted by the structure feature extracting means, recognition processing is performed by performing a structure feature matching operation using the structure feature. When the structural feature extracting unit operates in parallel with the recognizing unit No. 1 and the structural feature extracting unit and the structural feature extracting unit extracts the structural feature, the image input by the image input unit is used based on the structural feature. A second recognition unit that performs recognition processing by performing a composite similarity calculation, and a final recognition result by the recognition result of the first recognition unit and the recognition result of the second recognition unit. Character recognition apparatus characterized by comprising means for force, the.