JP2006309402A

JP2006309402A - Character string recognition method, number plate recognition method and device

Info

Publication number: JP2006309402A
Application number: JP2005129585A
Authority: JP
Inventors: Satoshi Chazono; 聡茶園; Kiichi Sugimoto; 喜一杉本; Masahiro Inoue; 正博井上
Original assignee: Mitsubishi Heavy Industries Ltd
Current assignee: Mitsubishi Heavy Industries Ltd
Priority date: 2005-04-27
Filing date: 2005-04-27
Publication date: 2006-11-09

Abstract

<P>PROBLEM TO BE SOLVED: To provide a character recognition technology adaptive to an imaged image of low contrast, and combining reduction in data processing quantity necessary for character recognition with adaptivity to the variety of a character string. <P>SOLUTION: For the whole part of a character string image constituted by including a plurality of character areas, character segmentation position candidates as the candidates of the boundary of the character areas are determined (A), and the character segmentation position candidates are made to correspond to the boundary of the plurality of character areas and the character candidate areas as the candidates of the plurality of character areas can be determined, and the area arrangement candidates expressed as the combination of the character candidate areas are determined (B), and image recognition is performed for each of the character candidate areas, and the optimal area arrangement candidate is selected from among the area arrangement candidates based on the result of the image recognition (C), and a character string constituted of characters recognized by the image recognition (C) for the character candidate areas corresponding to the optimal area arrangement candidate is output as a recognized result character string. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、文字列認識方法、及びそれを行うための装置に関しており、特に、ナンバープレートの小文字列の認識に特に適した文字列認識方法、及び装置に関する。 The present invention relates to a character string recognition method and an apparatus for performing the method, and more particularly, to a character string recognition method and apparatus particularly suitable for recognition of a lowercase character string of a license plate.

撮像装置によって撮像された画像から文字を認識する文字認識は、様々な分野で応用されている。ナンバープレートに記載された文字を認識するナンバープレート認識は、文字認識の典型的な応用の一つである。 Character recognition for recognizing characters from an image captured by an imaging device is applied in various fields. License plate recognition for recognizing characters written on a license plate is one of typical applications of character recognition.

最も典型的な文字列認識方法は、撮像画像を２値化して２値化画像を生成し、その２値化画像とテンプレートとのイメージマッチングによって文字を認識する方法である。 The most typical character string recognition method is a method in which a captured image is binarized to generate a binarized image, and a character is recognized by image matching between the binarized image and a template.

この文字列認識方法の一つの問題点は、撮像画像のコントラストが低い場合に、文字を誤って認識しやすいことである。撮像画像を撮像する際に充分な光量が得られない場合、例えば、撮像装置に照明装置が設けられていない場合、撮像画像のコントラストが低下することがある。撮像画像のコントラストが低いと、２値化により文字線と背景を分離するための閾値を決定することが困難になるため、２値化画像から文字を構成する線が欠落したり、或いは、本来は分離されるべき２本の線が２値化画像において連結したりする。線の不所望な欠落や連結は、文字の誤認識の原因となる。 One problem with this character string recognition method is that when a captured image has a low contrast, characters are easily recognized by mistake. When a sufficient amount of light cannot be obtained when capturing a captured image, for example, when an illumination device is not provided in the imaging device, the contrast of the captured image may be reduced. If the contrast of the captured image is low, it is difficult to determine a threshold value for separating the character line and the background by binarization. Therefore, the line constituting the character is missing from the binarized image, or originally The two lines to be separated are connected in the binarized image. Undesirable missing or connected lines cause misrecognition of characters.

このような背景から、２値化処理を行わずに文字認識を行う技術の検討が行われている。例えば、特開平８−１４７４１４号公報（特許文献１）及び特開２００３−１４７４１４号公報（特許文献２）は、いずれも、画像データから正規化相互相関処理を用いて文字位置の検出及び文字認識を行う技術を開示している。詳細には、特許文献１に開示された文字認識技術は、文字列を構成する個々の文字位置を画像の全体について正規化相互相関法及びＤＰマッチングを行うことによって推定している。一方、特許文献２に開示された文字認識技術では、１文字ごとに正規化相互相関法を用いて位置検出を行い、その検出された位置を基準に次の文字位置を検出する方法を採用している。 From such a background, a technique for performing character recognition without performing binarization processing has been studied. For example, JP-A-8-147414 (Patent Document 1) and JP-A-2003-147414 (Patent Document 2) both detect character positions and recognize characters using normalized cross-correlation processing from image data. The technology to perform is disclosed. Specifically, the character recognition technique disclosed in Patent Document 1 estimates individual character positions constituting a character string by performing a normalized cross-correlation method and DP matching on the entire image. On the other hand, the character recognition technique disclosed in Patent Document 2 employs a method of detecting the position for each character using the normalized cross-correlation method and detecting the next character position based on the detected position. ing.

２値化処理を行わずに文字認識を行う技術の一つの課題は、文字認識に必要なデータ処理量の低減と、文字列の多様性への対応の向上とを両立させることである。例えば、特許文献１に開示された技術では、正規化相互相関法に加えてＤＰマッチングを採用することにより、文字数が不定もしくは不明な撮像画像の文字位置を検出している。しかし、画像の全体について正規化相互相関法及びＤＰマッチングを行うため、文字認識に必要なデータ処理量が大きい。一方、特許文献２に開示された技術は、正規化相互相関法が行われる範囲が小さく、データ処理量の低減が可能である。しかしながら、特許文献２に開示された技術は、文字数が定まっている撮像画像にしか適用できない（例えば、特許文献２の段落〔０００６〕参照）。 One problem of the technology for performing character recognition without performing binarization processing is to achieve both a reduction in the amount of data processing necessary for character recognition and an improvement in handling character string diversity. For example, in the technique disclosed in Patent Document 1, DP matching is employed in addition to the normalized cross-correlation method to detect the character position of a captured image with an indefinite or unknown number of characters. However, since the normalized cross correlation method and DP matching are performed on the entire image, the amount of data processing required for character recognition is large. On the other hand, the technique disclosed in Patent Document 2 has a small range in which the normalized cross correlation method is performed, and can reduce the data processing amount. However, the technique disclosed in Patent Document 2 can be applied only to a captured image in which the number of characters is fixed (see, for example, paragraph [0006] of Patent Document 2).

文字列の多様性に対応できないこと、特に、文字数が可変である文字列の文字認識ができないことは、ナンバープレート認識、特にナンバープレートの小文字列の文字認識においては特に問題である；小文字列とは、車両の本拠の運輸支局又は自動車検査登録事務所を表示する文字である陸支コード（例えば、「神戸」、「品川」）と、自動車の種別及び用途を示す車種コード（例えば、「３３」、「５５」「５０１」）とから構成される文字列である。車種コードは２桁又は３桁の分類番号から構成されており、従って、ナンバープレートの小文字列の文字数は不定である。従って、文字列の文字数が一定であることを必要とする文字認識技術は、ナンバープレートの小文字列の認識には適用できない。 The inability to deal with the diversity of character strings, especially the inability to recognize characters with variable number of characters, is particularly problematic in license plate recognition, especially in lower-case character plate recognition of license plates; Is a land branch code (for example, “Kobe” or “Shinagawa”) which is a character for displaying a vehicle-based transportation branch office or automobile inspection registration office, and a vehicle type code (for example, “33” ”,“ 55 ”,“ 501 ”). The vehicle type code is composed of a 2-digit or 3-digit classification number, and therefore, the number of characters in the lower-case letter plate of the license plate is indefinite. Therefore, the character recognition technique that requires that the number of characters in the character string is constant cannot be applied to the recognition of the lowercase character string in the license plate.

このような背景から、文字認識に必要なデータ処理量の低減と文字列の多様性への対処とを両立させることができる文字認識技術の提供が求められている。 Against this background, there is a need to provide a character recognition technology that can simultaneously reduce the amount of data processing required for character recognition and cope with the diversity of character strings.

特開平８−１４７４１４号公報JP-A-8-147414 特開２００３−１４７４１４号公報JP 2003-147414 A

本発明の目的は、コントラストが低い撮像画像に対応できる一方で、文字認識に必要なデータ処理量の低減と文字列の多様性への対処とを両立させることができる文字認識技術を提供することにある。 An object of the present invention is to provide a character recognition technology that can cope with a low-contrast captured image while simultaneously reducing the amount of data processing necessary for character recognition and dealing with the diversity of character strings. It is in.

上記の目的を達成するために、本発明は、以下に述べられる手段を採用する。その手段を構成する技術的事項の記述には、［特許請求の範囲］の記載と［発明を実施するための最良の形態］の記載との対応関係を明らかにするために、［発明を実施するための最良の形態］で使用される番号・符号が付加されている。但し、付加された番号・符号は、［特許請求の範囲］に記載されている発明の技術的範囲を限定的に解釈するために用いてはならない。 In order to achieve the above object, the present invention employs the means described below. In the description of technical matters constituting the means, in order to clarify the correspondence between the description of [Claims] and the description of [Best Mode for Carrying Out the Invention] Number / symbol used in the best mode for doing this is added. However, the added number / symbol should not be used to limit the technical scope of the invention described in [Claims].

本発明による文字列認識方法は、
（Ａ）それぞれに一の文字が写されている複数の文字領域（１４〜１６）を含んで構成される文字列画像（１１）の全体について、前記文字領域（１４〜１６）の境界の候補である文字切り出し位置候補（１８、２５）を定めるステップと、
（Ｂ）前記文字切り出し位置候補（１８、２５）を前記複数の文字領域（１４〜１６）の境界に対応付けて前記複数の文字領域（１４〜１６）の候補である文字候補領域（２１〜２４）を決定し、前記文字候補領域（２１〜２４）の組み合わせとして表現される領域配置候補を決定するステップと、
（Ｃ）前記文字候補領域（２１〜２４）のそれぞれについて画像認識を行い、前記画像認識の結果に基づいて、前記領域配置候補のうちから最適領域配置候補を選択するステップと、
（Ｄ）前記最適領域配置候補に対応する前記文字候補領域（２１〜２４）について前記（Ｃ）ステップの画像認識において認識された文字からなる文字列を、認識結果文字列として出力するステップ
とを具備する。 A character string recognition method according to the present invention includes:
(A) With respect to the whole character string image (11) including a plurality of character regions (14 to 16) in which one character is copied, candidates for boundaries of the character regions (14 to 16) Determining character extraction position candidates (18, 25) that are:
(B) Character candidate regions (21 to 21) that are candidates for the plurality of character regions (14 to 16) by associating the character cutout position candidates (18, 25) with boundaries of the plurality of character regions (14 to 16). 24) and determining a region arrangement candidate expressed as a combination of the character candidate regions (21 to 24);
(C) performing image recognition for each of the character candidate areas (21 to 24), and selecting an optimal area arrangement candidate from the area arrangement candidates based on the result of the image recognition;
(D) outputting a character string made up of characters recognized in the image recognition in step (C) as the recognition result character string for the character candidate regions (21 to 24) corresponding to the optimal region arrangement candidate. It has.

本発明による文字列認識方法では、文字列画像の全体ではなく、文字切り出し位置候補（１８、２５）から決定される文字候補領域（２１〜２４）について文字認識が行われるため、文字認識に必要なデータ処理量を低減させることができる。その一方で、本発明による文字列認識方法では、文字候補領域（２１〜２４）の組み合わせである領域配置候補から選択された最適領域配置候補に対応する文字列が認識結果として出力されるため、文字数が可変である文字列の文字認識が実現できる。このような文字列認識方法は、ナンバープレートの小文字列の認識に特に有効である。 In the character string recognition method according to the present invention, character recognition is performed not on the entire character string image but on the character candidate areas (21 to 24) determined from the character cutout position candidates (18, 25). The amount of data processing can be reduced. On the other hand, in the character string recognition method according to the present invention, since the character string corresponding to the optimal area arrangement candidate selected from the area arrangement candidates that are combinations of the character candidate areas (21 to 24) is output as the recognition result, Character recognition of a character string having a variable number of characters can be realized. Such a character string recognition method is particularly effective for the recognition of the lowercase character string in the license plate.

前記文字切り出し位置候補（１８、２５）は、最も典型的には、前記文字列画像の各画素の輝度から決定される。また、前記文字切り出し位置候補（１８、２５）は、前記文字列画像の微分画像から決定されることも可能である。 The character cutout position candidates (18, 25) are most typically determined from the luminance of each pixel of the character string image. The character cutout position candidates (18, 25) may be determined from a differential image of the character string image.

前記（Ｃ）ステップは、
（Ｃ１）前記文字切り出し位置候補（１８、２５）によって定められる前記文字候補領域（２１〜２４）のそれぞれの幅を算出するステップと、
（Ｃ２）前記文字切り出し位置候補（１８、２５）によって定められる前記文字候補領域（２１〜２４）のそれぞれについて、前記文字候補領域（２１〜２４）の画像と、予め用意されたテンプレート画像との相違度を算出するステップと、
（Ｃ３）前記文字候補領域（２１〜２４）の幅、及び前記相違度に基づいて文字らしさ評価値を算出するステップ
とを備え、前記最適領域配置候補は、前記文字らしさ評価値に基づいて決定されることが好ましい。文字候補領域（２１〜２４）の幅から文字らしさ評価値を算出し、その文字らしさ評価値から最適領域配置候補を選択することは、簡便な処理で、正しい最適領域配置候補を選択するために有効である。 The step (C) includes:
(C1) calculating a width of each of the character candidate regions (21 to 24) defined by the character cutout position candidates (18, 25);
(C2) For each of the character candidate areas (21-24) defined by the character cutout position candidates (18, 25), an image of the character candidate areas (21-24) and a template image prepared in advance Calculating the dissimilarity;
(C3) calculating a character likelihood evaluation value based on the width of the character candidate region (21 to 24) and the degree of difference, and determining the optimal region arrangement candidate based on the character likelihood evaluation value It is preferred that To calculate the character likelihood evaluation value from the width of the character candidate area (21 to 24) and select the optimum area arrangement candidate from the character likelihood evaluation value, in order to select the correct optimum area arrangement candidate with simple processing It is valid.

前記（Ｃ）ステップは、更に、
（Ｃ４）前記文字切り出し位置候補（１８、２５）によって定められる前記文字候補領域（２１〜２４）の、隣接する２つの文字候補領域（２１〜２４）の隙間の幅を算出するステップと、
（Ｃ５）前記２つの文字候補領域（２１〜２４）の隙間の幅に基づいて、隙間らしさ評価値を算出するステップ
を備え、前記最適領域配置候補は、前記隙間らしさ評価値に基づいて決定されることも好ましい。 The step (C) further includes:
(C4) calculating a width of a gap between two adjacent character candidate regions (21 to 24) of the character candidate regions (21 to 24) defined by the character cutout position candidates (18 and 25);
(C5) comprising a step of calculating a clearance likelihood evaluation value based on a width of a gap between the two character candidate regions (21 to 24), wherein the optimum region arrangement candidate is determined based on the clearance likelihood evaluation value. It is also preferable.

本発明の他の観点において、本発明によるナンバープレート認識方法は、
（Ｅ）ナンバープレートの小文字列が写されている小文字列画像（１１）の全体について、それぞれに前記小文字列の一の文字が写されている文字領域（１４〜１６）の境界の候補である文字切り出し位置候補（１８、２５）を定めるステップと、
（Ｆ）前記文字切り出し位置候補（１８、２５）を前記複数の文字領域（１４〜１６）の境界に対応付けて前記複数の文字領域（１４〜１６）の候補である文字候補領域（２１〜２４）を決定し、前記文字候補領域（２１〜２４）の組み合わせとして表現される領域配置候補を決定するステップと、
（Ｇ）前記文字候補領域（２１〜２４）のそれぞれについて画像認識を行い、前記画像認識の結果に基づいて、前記領域配置候補のうちから最適領域配置候補を選択するステップと、
（Ｈ）前記最適領域配置候補に対応する前記文字候補領域（２１〜２４）について前記（Ｇ）ステップの画像認識において認識された文字からなる文字列を、認識結果の少なくとも一部として出力するステップ
とを具備する。 In another aspect of the present invention, a license plate recognition method according to the present invention comprises:
(E) The entire lowercase character string image (11) in which the lowercase character string of the license plate is copied is a candidate for the boundary of the character region (14-16) in which one character of the lowercase character string is copied. Determining character cutout position candidates (18, 25);
(F) Associating the character cut-out position candidates (18, 25) with the boundaries of the plurality of character regions (14-16), character candidate regions (21-21) that are candidates for the plurality of character regions (14-16) 24) and determining a region arrangement candidate expressed as a combination of the character candidate regions (21 to 24);
(G) performing image recognition for each of the character candidate areas (21 to 24), and selecting an optimal area arrangement candidate from the area arrangement candidates based on the result of the image recognition;
(H) A step of outputting, as at least a part of a recognition result, a character string made up of characters recognized in the image recognition in the step (G) for the character candidate regions (21 to 24) corresponding to the optimum region arrangement candidates. It comprises.

一実施形態では、当該ナンバープレート認識方法は、
更に、
（Ｉ）前記文字切り出し位置候補（１８、２５）から前記小文字列の陸支コードが写されている陸支コード領域を決定するステップと、
（Ｊ）前記小文字列画像（１１）のうち前記陸支コード領域以外の領域に、車種コード領域を決定するステップ
とを具備し、前記領域配置候補は、前記車種コード領域について定められた車種コード領域配置候補である。 In one embodiment, the license plate recognition method comprises:
Furthermore,
(I) determining a land support code area in which the land support code in the lower case string is copied from the character cutout position candidates (18, 25);
(J) including a step of determining a vehicle type code region in a region other than the land support code region in the lower case sequence image (11), wherein the region arrangement candidate is a vehicle type code determined for the vehicle type code region. It is a region arrangement candidate.

この場合、前記（Ｉ）ステップは、
（Ｉ１）前記文字切り出し位置候補（１８、２５）を境界とし、前記小文字列画像（１１）の端に接し、且つ、所定範囲の幅を有する領域を陸支コード候補領域として定めるステップと、
（Ｉ２）前記陸支コード候補領域のそれぞれについて画像認識を行い、前記画像認識の結果に基づいて前記陸支コード候補領域のうちから陸支コードが写されている陸支コード領域を決定するステップ
とを備え、前記陸支コード領域として決定された前記陸支コード候補領域について認識された陸支コードが、前記認識結果の一部として出力され得る。 In this case, the step (I)
(I1) defining a region having a predetermined range of width as a land support code candidate region, with the character cutout position candidate (18, 25) as a boundary, touching an end of the lowercase character string image (11), and
(I2) performing image recognition for each of the land support code candidate areas, and determining a land support code area in which the land support code is copied from the land support code candidate areas based on the result of the image recognition The land support code recognized for the land support code candidate region determined as the land support code region can be output as a part of the recognition result.

他の実施形態では、前記（Ｇ）ステップの前記画像認識では、前記小文字列の最も左に位置する文字に対応する前記文字候補領域（２１〜２４）については陸支コードのテンプレート画像が使用され、それ以外の文字に対応する前記文字候補領域（２１〜２４）については数字のテンプレート画像が使用され、前記認識結果は、前記ナンバープレートの陸支コードの認識結果と前記ナンバープレートの車種コードの認識結果とを含む。 In another embodiment, in the image recognition in the step (G), a land code template image is used for the character candidate region (21 to 24) corresponding to the leftmost character in the lowercase character string. Numeral template images are used for the character candidate areas (21 to 24) corresponding to the other characters, and the recognition result includes the recognition result of the land plate code of the license plate and the vehicle type code of the license plate. Including recognition results.

また、前記（Ｇ）ステップにおいて前記文字候補領域（２１〜２４）のうちの或る文字候補領域について行われた前記画像認識によって「１」という認識結果が得られた場合には、前記或る文字候補領域の周囲を含む拡大領域について再度の画像認識が行われ、前記拡大領域について「１」以外の認識結果が得られた場合には、前記或る文字候補領域を含む領域配置候補は、前記最適領域配置候補として選択されないことが好ましい。 If the recognition result “1” is obtained by the image recognition performed for a certain character candidate region in the character candidate regions (21 to 24) in the step (G), the certain When the image recognition is performed again for the enlarged region including the periphery of the character candidate region and a recognition result other than “1” is obtained for the enlarged region, the region arrangement candidate including the certain character candidate region is It is preferable that the optimal region arrangement candidate is not selected.

前記文字候補領域（２１〜２４）が矩形である場合、前記文字切り出し位置候補（１８、２５）は、前記文字候補領域（２１〜２４）の左端及び右端を決定する横方向文字切り出し位置候補（１８）と、前記文字候補領域（２１〜２４）の上端及び下端を決定する縦方向文字切り出し位置候補（２５）とを備えることが好ましい。 When the character candidate area (21-24) is a rectangle, the character cutout position candidates (18, 25) are the horizontal character cutout position candidates (18, 25) that determine the left and right ends of the character candidate areas (21-24). 18) and vertical character cutout position candidates (25) for determining the upper and lower ends of the character candidate areas (21 to 24).

また、前記（Ｅ）ステップは、
（Ｅ１）前記小文字列画像（１１）を上側部分と下側部分とに区分するステップと、
（Ｅ２）前記下側部分の画像から前記文字切り出し位置候補（１８）を決定するステップ
とを備えることが好ましい。このような処理手順によれば、小文字列画像（１１）に写り得るビスの影響を排除することができる。 In addition, the step (E)
(E1) dividing the lowercase sequence image (11) into an upper part and a lower part;
(E2) It is preferable to include the step of determining the character cutout position candidate (18) from the image of the lower part. According to such a processing procedure, it is possible to eliminate the influence of screws that may appear in the lowercase string image (11).

本発明の更に他の観点において、本発明の文字列認識装置は、それぞれに一の文字が写されている複数の文字領域を含んで構成される文字列画像を取得する撮像装置（１）と、前記文字列画像の文字認識を行う画像処理装置（２）とを備えている。前記画像処理装置（２）は、下記ステップ群：
（Ａ）文字列画像の全体について、前記文字領域の境界の候補である文字切り出し位置候補（１８、２５）を定めるステップ、
（Ｂ）前記文字切り出し位置候補（１８、２５）を前記複数の文字領域の境界に対応付けて前記複数の文字領域の候補である文字候補領域（２１〜２４）を決定し、前記文字候補領域（２１〜２４）の組み合わせとして表現される領域配置候補を決定するステップ、
（Ｃ）前記文字候補領域（２１〜２４）のそれぞれについて画像認識を行い、前記画像認識の結果に基づいて、前記領域配置候補のうちから最適領域配置候補を選択するステップ、及び
（Ｄ）前記最適領域配置候補に対応する前記文字候補領域（２１〜２４）について前記（Ｃ）ステップの画像認識において認識された文字からなる文字列を、認識結果文字列として出力するステップ
とを実行するように構成されている。 In still another aspect of the present invention, a character string recognition device according to the present invention includes an imaging device (1) that acquires a character string image including a plurality of character regions in which one character is copied. And an image processing device (2) for performing character recognition of the character string image. The image processing apparatus (2) includes the following steps:
(A) A step of determining character cutout position candidates (18, 25) that are candidates for the boundary of the character region for the entire character string image;
(B) The character candidate regions (21 to 24) which are candidates for the plurality of character regions are determined by associating the character cutout position candidates (18, 25) with boundaries of the plurality of character regions, and the character candidate regions Determining a region arrangement candidate expressed as a combination of (21-24);
(C) performing image recognition for each of the character candidate regions (21 to 24), and selecting an optimum region arrangement candidate from the region arrangement candidates based on the result of the image recognition; and (D) Executing a step of outputting a character string made up of characters recognized in the image recognition in the step (C) as a recognition result character string for the character candidate regions (21 to 24) corresponding to the optimum region arrangement candidate. It is configured.

本発明の更に他の観点において、本発明のナンバープレート認識装置は、ナンバープレートの撮像画像を取得する撮像装置（１）と、前記撮像画像から前記ナンバープレートの文字認識を行うための画像処理装置（２）とを備えている。前記画像処理装置（２）は、下記ステップ群：
（Ｋ）前記撮像画像から、前記ナンバープレートの小文字列が写されている小文字列画像（１１）を切り出すステップ、
（Ｅ）’前記小文字列画像（１１）の全体について、それぞれに前記小文字列の一の文字が写されている文字領域の境界の候補である文字切り出し位置候補（１８、２５）を定めるステップ、
（Ｆ）前記文字切り出し位置候補（１８、２５）を前記複数の文字領域の境界に対応付けて前記複数の文字領域の候補である文字候補領域（２１〜２４）を決定し、前記文字候補領域（２１〜２４）の組み合わせとして表現される領域配置候補を決定するステップと、
（Ｇ）前記文字候補領域（２１〜２４）のそれぞれについて画像認識を行い、前記画像認識の結果に基づいて、前記領域配置候補のうちから最適領域配置候補を選択するステップ、及び
（Ｈ）前記最適領域配置候補に対応する前記文字候補領域（２１〜２４）について前記（Ｇ）ステップの画像認識において認識された文字からなる文字列を、認識結果の少なくとも一部として出力するステップ
を実行するように構成されている。 In still another aspect of the present invention, a license plate recognition device according to the present invention includes an imaging device (1) that acquires a captured image of a license plate, and an image processing device that performs character recognition of the license plate from the captured image. (2). The image processing apparatus (2) includes the following steps:
(K) a step of cutting out a lowercase string image (11) in which the lowercase string of the license plate is copied from the captured image;
(E) ′ determining a character cutout position candidate (18, 25) that is a candidate for a boundary of a character region in which one character of the lowercase character sequence is copied for each of the lowercase character sequence image (11);
(F) associating the character cutout position candidates (18, 25) with boundaries of the plurality of character regions to determine character candidate regions (21 to 24) that are candidates for the plurality of character regions; Determining a region arrangement candidate expressed as a combination of (21-24);
(G) performing image recognition for each of the character candidate areas (21 to 24), and selecting an optimum area arrangement candidate from the area arrangement candidates based on the result of the image recognition; and (H) the above Executing a step of outputting a character string made up of characters recognized in the image recognition in the step (G) as at least a part of the recognition result for the character candidate regions (21 to 24) corresponding to the optimum region arrangement candidate. It is configured.

本発明によれば、コントラストが低い撮像画像に対応できる一方で、文字認識に必要なデータ処理量の低減と文字列の多様性への対処とを両立させることができる文字認識技術を提供できる。 According to the present invention, it is possible to provide a character recognition technology that can cope with a captured image with low contrast, while at the same time reducing both the amount of data processing required for character recognition and dealing with the diversity of character strings.

第１第１の実施形態：
１．概略
図１は、本発明の第１の実施形態に係るナンバープレート認識装置１０の構成を示している。ナンバープレート認識装置１０は、撮像装置１と、画像処理装置２とを備えている。撮像装置１は、車両のナンバープレートを撮像し、ナンバープレート画像を得る。画像処理装置２は、そのナンバープレート画像に対してナンバープレート認識を行う。 First Embodiment:
1. Outline FIG. 1 shows a configuration of a license plate recognition apparatus 10 according to a first embodiment of the present invention. The license plate recognition device 10 includes an imaging device 1 and an image processing device 2. The imaging device 1 captures a license plate of a vehicle and obtains a license plate image. The image processing apparatus 2 performs license plate recognition on the license plate image.

画像処理装置２は、記憶装置３と、演算装置４と、撮像装置１に接続されたインターフェース５とを備えている。記憶装置３には、ナンバープレート認識プログラム６と、陸支コード用テンプレート７と、車種コード用テンプレート８と、用途コード用テンプレート９ａと、一連番号用テンプレート９ｂとが格納されている。 The image processing device 2 includes a storage device 3, a calculation device 4, and an interface 5 connected to the imaging device 1. The storage device 3 stores a license plate recognition program 6, a land code template 7, a vehicle type code template 8, a use code template 9a, and a serial number template 9b.

ナンバープレート認識プログラム６は、ナンバープレート認識を行うアルゴリズムを記述しており、ナンバープレート認識は、演算装置４がナンバープレート認識プログラム６を実行することによって行われる。 The license plate recognition program 6 describes an algorithm for license plate recognition. The license plate recognition is performed by the arithmetic device 4 executing the license plate recognition program 6.

陸支コード用テンプレート７には、陸支コードのテンプレート画像と、各テンプレート画像の陸支コードクラスを示す情報（即ち、各テンプレート画像が、どの陸支コードに対応しているかを示す情報）とが対応付けて格納されている。後述されるように、ナンバープレートに記述されている陸支コードの文字認識は、陸支コード用テンプレート７を用いたパターン認識によって行われる。 The land code template 7 includes a land image code image, information indicating the land code class of each template image (that is, information indicating which land code each template image corresponds to) Are stored in association with each other. As will be described later, the character recognition of the land code described in the license plate is performed by pattern recognition using the land code template 7.

車種コード用テンプレート８には、車種コードを構成する数字のテンプレート画像と、各テンプレートの文字クラスを示す情報（即ち、各テンプレート画像が、どの数字に対応しているかを示す情報）とが対応付けて格納されている。後述されるように、ナンバープレートに記述されている車種コードの文字認識は、車種コード用テンプレート８を用いたパターン認識によって行われる。 The vehicle type code template 8 is associated with a template image of a number constituting the vehicle type code and information indicating a character class of each template (that is, information indicating which number each template image corresponds to). Stored. As will be described later, the character recognition of the vehicle type code described on the license plate is performed by pattern recognition using the vehicle type code template 8.

用途コード用テンプレート９ａには、用途コード（即ち、事業用かどうかの別などを表示するかな文字及び英文字）のテンプレート画像と、各テンプレート画像の用途コードクラスを示す情報（即ち、各テンプレート画像が、どの用途コードに対応しているかを示す情報）とが対応付けて格納されている。ナンバープレートに記述されている用途コードの文字認識は用途コード用テンプレート９ｂを用いたパターン認識によって行われる。 The usage code template 9a includes a template image of a usage code (that is, kana characters and English characters that indicate whether it is for business use, etc.) and information indicating the usage code class of each template image (that is, each template image). Are stored in association with each other). Character recognition of the application code described on the license plate is performed by pattern recognition using the application code template 9b.

一連番号用テンプレート９ｂには、一連番号を構成する数字のテンプレート画像と、各テンプレート画像の文字クラスを示す情報（即ち、各テンプレート画像が、どの数字に対応しているかを示す情報）とが対応付けて格納されている。ナンバープレートに記述されている一連番号の文字認識は一連番号用テンプレート９ｂを用いたパターン認識によって行われる。 Corresponding to the template for serial number 9b is a template image of numbers constituting the serial number and information indicating the character class of each template image (that is, information indicating which number each template image corresponds to). It is stored with. Character recognition of serial numbers described on the license plate is performed by pattern recognition using the serial number template 9b.

図２は、画像処理装置２が行うナンバープレート認識の処理を示すフローチャートである。撮像装置１からナンバープレート画像を取得すると（ステップＳ０１）、画像処理装置２は、取得したナンバープレート画像から小文字列画像と大文字列画像を分離する（ステップＳ０２）；ここで小文字列画像とは、上述の小文字列が写されている画像であり、大文字列画像とは、用途コードと一連番号とが写されている画像である。図３は、ナンバープレート画像と、小文字列画像の例を示している。小文字列画像及び大文字列画像は、いずれも、濃淡画像（grayscale image）であり、２値化画像ではない。続いて、小文字列画像、及び大文字列画像のそれぞれについて文字認識が行われ、ナンバープレートに記載された文字が認識される（ステップＳ０３、Ｓ０４）。ナンバープレートに記載された文字の文字認識は、濃淡画像である小文字列画像、及び大文字列画像から正規化相互相関等によって行われる。本実施の形態の文字認識では、濃淡画像から２値化画像は生成されない。 FIG. 2 is a flowchart showing a license plate recognition process performed by the image processing apparatus 2. When the license plate image is acquired from the imaging device 1 (step S01), the image processing device 2 separates the lowercase letter image and the uppercase letter image from the acquired license plate image (step S02); The above-mentioned lowercase character string is an image, and the uppercase character string image is an image in which a usage code and a serial number are copied. FIG. 3 shows an example of a license plate image and a lower case string image. Both the lowercase sequence image and the uppercase sequence image are grayscale images and are not binarized images. Subsequently, character recognition is performed for each of the lowercase character string image and the uppercase character string image, and the characters written on the license plate are recognized (steps S03 and S04). Character recognition of the characters written on the license plate is performed by normalization cross-correlation or the like from the lowercase character sequence image that is a grayscale image and the uppercase character sequence image. In the character recognition of the present embodiment, a binarized image is not generated from a grayscale image.

本実施の形態の主題は、小文字列画像の文字認識の最適化である。以下では、小文字列画像について文字認識を行う処理について詳細に説明される。 The subject of this embodiment is optimization of character recognition of lowercase sequence images. Below, the process which performs character recognition about a small letter sequence image is demonstrated in detail.

本実施の形態における文字認識の最終的な目標は、図４に示されているように、
（１）小文字列画像１１に陸支コード領域１２と車種コード領域１３を最適に規定した上で、更に車種コード領域１３をそれぞれに１つの数字が存在する２つ又は３つの文字領域：第１文字領域１４、第２文字領域１５、第３文字領域１６に最適に区分すること、及び
（２）文字認識により、陸支コード領域１２に記載されている陸支コードと、車種コード領域１３に記載されている車種コード（即ち、第１文字領域１４、第２文字領域１５、第３文字領域１６に記載されている数字からなる数字列）を取得すること
の２つである。図４には、車種コード領域１３が３つの文字領域で構成されている場合が図示されているが、車種コード領域１３は、２つの文字領域で構成され得ることに留意されたい。 The final goal of character recognition in this embodiment is as shown in FIG.
(1) The land support code area 12 and the vehicle type code area 13 are optimally defined in the lower case sequence image 11, and the vehicle type code area 13 is further divided into two or three character areas each having one number: first By optimally dividing the character area 14, the second character area 15, and the third character area 16, and (2) by character recognition, the land branch code described in the land branch code area 12 and the vehicle type code area 13 This is two ways of acquiring the described vehicle type code (that is, a numeric string composed of the numbers described in the first character area 14, the second character area 15, and the third character area 16). Although FIG. 4 illustrates a case where the vehicle type code area 13 is configured by three character areas, it should be noted that the vehicle type code area 13 may be configured by two character areas.

小文字列画像１１、陸支コード領域１２と車種コード領域１３の境界、及び、文字領域１４の境界は、文字切り出し位置１７と呼ばれる。従って、上記の目標（１）は、小文字列画像１１に適正な文字切り出し位置１７を設定する、と言い換えることができる。以下では、上記の２つの目標を達成するための手順が詳細に説明される。 The lower case sequence image 11, the boundary between the land code area 12 and the vehicle type code area 13, and the boundary between the character area 14 are called a character cutout position 17. Therefore, the above target (1) can be paraphrased as setting an appropriate character cutout position 17 in the lowercase character string image 11. In the following, the procedure for achieving the above two goals will be described in detail.

図５は、第１の実施形態における小文字列画像１１の文字認識のアルゴリズムを示すフローチャートである。小文字列画像１１の文字認識の処理手順は、概略的には、文字切り出し位置候補の抽出（ステップＳ１１）と、陸支コードの認識（ステップＳ１２）と、車種コードの認識（ステップＳ１３）とで構成される。以下では、これらの処理が詳細に説明される。 FIG. 5 is a flowchart showing an algorithm for character recognition of the lowercase sequence image 11 in the first embodiment. The processing procedure for character recognition of the lowercase sequence image 11 is roughly as follows: extraction of a character extraction position candidate (step S11), recognition of a land code (step S12), and recognition of a vehicle type code (step S13). Composed. In the following, these processes will be described in detail.

２．文字切り出し位置候補の抽出
図６Ａを参照して、小文字列画像１１の文字認識では、まず、文字切り出し位置候補１８が抽出される（ステップＳ１１）。文字切り出し位置候補１８とは、小文字列画像１１に規定されるべき文字切り出し位置１７の候補である。小文字列画像１１が横方向（ｘ軸方向）に長い本実施の形態では、横方向の文字切り出し位置候補１８が規定される。各文字切り出し位置候補１８は、小文字列画像１１の縦方向（ｙ軸方向）に延伸するように規定されている。 2. Extraction of Character Cutout Position Candidate With reference to FIG. 6A, in character recognition of the lowercase character string image 11, first, a character cutout position candidate 18 is extracted (step S11). The character cutout position candidate 18 is a candidate for the character cutout position 17 to be defined in the lowercase character string image 11. In the present embodiment where the lowercase character string image 11 is long in the horizontal direction (x-axis direction), a character cutout position candidate 18 in the horizontal direction is defined. Each character cutout position candidate 18 is defined so as to extend in the vertical direction (y-axis direction) of the lowercase sequence image 11.

文字切り出し位置候補１８は、小文字列画像１１から直接に、又は、小文字列画像１１の微分画像から決定される。詳細には、小文字列画像１１の画素の輝度データの値（即ち、輝度値）から、小文字列画像１１の背景に該当すると推定される部分が決定され、小文字列画像１１の両端、及び、小文字列画像１１の背景と推定される部分に文字切り出し位置候補１８が定められる。また、輝度データの値の代わりに、小文字列画像１１の微分画像が文字切り出し位置候補１８の抽出に使用され得る。この場合、小文字列画像１１の各画素の輝度データから小文字列画像１１の微分画像が算出され、その微分画像から小文字列画像１１の背景と推定される部分が特定される。この場合にも、小文字列画像１１の両端、及びその背景に該当する部分に文字切り出し位置候補１８が定められる。 The character cutout position candidate 18 is determined directly from the lowercase sequence image 11 or from the differential image of the lowercase sequence image 11. Specifically, a portion estimated to correspond to the background of the lowercase sequence image 11 is determined from the luminance data value of the pixels of the lowercase sequence image 11 (that is, the brightness value), and both ends of the lowercase sequence image 11 and the lowercase letters are determined. A character cutout position candidate 18 is determined in a portion estimated as the background of the row image 11. Further, a differential image of the lowercase character string image 11 can be used for extraction of the character cutout position candidate 18 instead of the luminance data value. In this case, a differential image of the lower case sequence image 11 is calculated from the luminance data of each pixel of the lower case sequence image 11, and a portion estimated as the background of the lower case sequence image 11 is specified from the differential image. Also in this case, character cutout position candidates 18 are determined at both ends of the lowercase character string image 11 and portions corresponding to the background thereof.

より具体的には、ナンバープレートの塗色が何らかの手段によって既知である場合には、例えば、下記のような手順で文字切り出し位置候補１８の抽出が行われ得る。まず、小文字列画像１１の縦方向の輝度射影が算出される。縦方向の輝度射影とは、小文字列画像１１を構成する画素列のそれぞれについて算出された輝度値の和である；ここで画素列とは、縦方向に１列に並べられた画素の列のことである。座標ｘに位置する画素列の輝度射影Ｆ（ｘ）は、
Ｆ（ｘ）＝Σσ（ｘ，ｙ），
で算出される。ここでσ（ｘ，ｙ）は、小文字列画像１１の座標（ｘ，ｙ）に位置する画素の輝度（又は階調）であり、Σは、小文字列画像１１のｙ座標の全範囲についての和を表している。 More specifically, when the paint color of the license plate is known by some means, for example, the character cutout position candidate 18 can be extracted by the following procedure. First, the luminance projection in the vertical direction of the lowercase string image 11 is calculated. The luminance projection in the vertical direction is the sum of the luminance values calculated for each of the pixel columns constituting the lowercase column image 11; here, the pixel column is a column of pixels arranged in one column in the vertical direction. That is. The luminance projection F (x) of the pixel row located at the coordinate x is
F (x) = Σσ (x, y),
Is calculated by Here, σ (x, y) is the luminance (or gradation) of the pixel located at the coordinates (x, y) of the lowercase sequence image 11, and Σ is the entire range of the y coordinates of the lowercase sequence image 11. Represents the sum.

ナンバープレートの塗色が、背景より文字の輝度が高くなるような色である場合（例えば、ナンバープレートの塗色が白や黄である場合）には、輝度射影Ｆ（ｘ）が極小となる位置、及び、輝度射影Ｆ（ｘ）が所定の広さよりも広い範囲に渡って閾値より小さい場合には当該範囲において輝度射影Ｆ（ｘ）が立ち上がる位置（図６Ａでは、「川」と「１」の間の変化が大きい箇所）が、文字切り出し位置候補１８として抽出される。 When the license plate paint color is such that the brightness of the characters is higher than the background (for example, when the license plate paint color is white or yellow), the brightness projection F (x) is minimized. When the position and the brightness projection F (x) are smaller than the threshold over a range wider than a predetermined area, the position where the brightness projection F (x) rises in the range (“river” and “1” in FIG. 6A). ”Is extracted as a character cutout position candidate 18.

一方、図６Ｂに示されているように、ナンバープレートの塗色が、背景より文字の輝度が高くなるような色である場合（例えば、ナンバープレートの塗色が緑や黒である場合）には、輝度射影Ｆ（ｘ）が極大となる位置、及び、輝度射影Ｆ（ｘ）が所定の広さよりも広い範囲に渡って閾値より大きい場合には当該範囲において輝度射影Ｆ（ｘ）が立ち下がる位置が、文字切り出し位置候補１８として抽出される。 On the other hand, as shown in FIG. 6B, when the license plate paint color is such that the brightness of the characters is higher than the background (for example, the license plate paint color is green or black). If the luminance projection F (x) is larger than the threshold over a position where the luminance projection F (x) is maximum, and the luminance projection F (x) is larger than a predetermined area, the luminance projection F (x) rises in that range. The lowering position is extracted as the character cutout position candidate 18.

一方、ナンバープレートの塗色が未知である場合には、例えば、下記のような手順で文字切り出し位置候補１８の抽出が行われ得る。まず、小文字列画像１１の微分処理が行われ、各画素についてｘ方向、ｙ方向の微分値の和が求められる。更に、各画素列についてのｘ方向、ｙ方向の微分値の総和である微分値射影が求められる。その微分値射影が、上記の輝度射影の代わりに使用されて上記と同様の処理が行われ、これにより、文字切り出し位置候補１８が抽出される。 On the other hand, when the paint color of the license plate is unknown, the character cutout position candidate 18 can be extracted by the following procedure, for example. First, a differentiation process is performed on the lowercase sequence image 11 to obtain the sum of the differential values in the x and y directions for each pixel. Further, a differential value projection that is the sum of differential values in the x and y directions for each pixel column is obtained. The differential value projection is used in place of the luminance projection, and the same processing as described above is performed, whereby the character cutout position candidate 18 is extracted.

文字切り出し位置候補１８の抽出において考慮されるべきことは、小文字列画像１１は、ナンバープレートの上部から切り出されるため、小文字列画像１１にはナンバープレートを車体に固定するビスが写り得ることである。ビスは、文字切り出し位置候補１８の適切な抽出を妨げ得る。 What should be considered in the extraction of the character cutout position candidate 18 is that the lowercase string image 11 is cut out from the upper part of the license plate, and thus the lowercase string image 11 can include a screw for fixing the license plate to the vehicle body. . The screw may prevent proper extraction of the character cutout position candidate 18.

ビスの影響を排除するためには、図６Ｃに示されているように、小文字列画像１１のうち、ビスが写っていない下側の部分から文字切り出し位置候補１８が決定されることが好ましい。より具体的には、小文字列画像１１が、ビスが写っている上側部分１１ａと、写っていない下側部分１１ｂとに区分される。その下側部分について上述と同様の処理が行われ、文字切り出し位置候補１８が抽出される。これにより、ビスの影響を排除し、文字切り出し位置候補１８を適切に抽出することが可能になる。 In order to eliminate the influence of screws, as shown in FIG. 6C, it is preferable that the character cutout position candidate 18 is determined from the lower part of the lowercase character string image 11 where no screw is shown. More specifically, the lower-case string image 11 is divided into an upper portion 11a in which screws are shown and a lower portion 11b in which screws are not shown. The lower portion is subjected to the same processing as described above, and the character cutout position candidate 18 is extracted. As a result, the influence of the screw can be eliminated and the character cutout position candidate 18 can be appropriately extracted.

３．陸支コードの認識
図５に示されているように、文字切り出し位置候補１８の抽出に続いて、陸支コードの認識が行われる（ステップＳ１２）。 3. Recognition of Land Support Code As shown in FIG. 5, the land support code is recognized following the extraction of the character cutout position candidate 18 (step S12).

陸支コードの認識処理では、まず、陸支コード候補領域の設定が行われる（ステップＳ１２−１）。陸支コード候補領域とは、陸支コード領域１２の候補である。図７に示されているように、陸支コード候補領域１９の左の境界は、小文字列画像１１の左端で固定されており、右の境界は、ステップＳ１１において抽出された文字切り出し位置候補１８から選択される。ただし、陸支コード候補領域１９は、横方向の幅Ｗ_Ｒ（即ち、小文字列画像１１の左端から、対応する文字切り出し位置候補１８までの距離）が、下記の条件を満足するように決定される：
ｈ・ｋ_{ｍｉｎ＿Ｒ}＜Ｗ_Ｒ＜ｈ・ｋ_{ｍａｘ＿Ｒ}，・・・（１）
ここで、ｈは、小文字列画像１１の高さであり、ｋ_{ｍｉｎ＿Ｒ}，ｋ_{ｍａｘ＿Ｒ}は、所定のパラメータである。このような陸支コード候補領域１９の決定方法は、陸支コード領域１２の縦横比が、ある一定の範囲に収まることを基礎としている。陸支コード領域１２の縦横比として想定できないような縦横比を有する陸支コード候補領域１９は設定されない。図７の例では、４つの陸支コード候補領域１９_１乃至１９_４が設定されている。 In the land support code recognition process, first, a land support code candidate area is set (step S12-1). The land support code candidate area is a candidate for the land support code area 12. As shown in FIG. 7, the left boundary of the land code candidate area 19 is fixed at the left end of the lowercase character string image 11, and the right boundary is the character cutout position candidate 18 extracted in step S <b> 11. Selected from. However, the land code candidate area 19 is determined so that the horizontal width W _R (that is, the distance from the left end of the lowercase character string image 11 to the corresponding character cutout position candidate 18) satisfies the following condition. R:
h · k _{min —} _R <W _R <h · k _{max —} _R , (1)
Here, h is the height of the lower case sequence image 11, and k _{min — R} and k _{max — R} are predetermined parameters. Such a method for determining the land support code candidate area 19 is based on the aspect ratio of the land support code area 12 being within a certain range. The land code candidate area 19 having an aspect ratio that cannot be assumed as the aspect ratio of the land code area 12 is not set. In the example of FIG. 7, four land supporting code candidate regions 19 ₁ to 19 ₄ is set.

陸支コード候補領域１９の設定に続いて、図５に示されているように、陸支コード領域１２の決定及び陸支コードの認識が行われる（ステップＳ１２−２）。より具体的には、図８に示されているように、陸支コード候補領域１９の画像のそれぞれについて陸支コード用テンプレート７に格納されたテンプレート画像を用いたパターン認識が行われ、最近接クラス（即ち、最も相違度Ｓが小さくなる陸支コードクラス）、及び、その相違度Ｓが算出される。陸支コード候補領域１９のうち、最も相違度が小さくなるものが陸支コード領域１２として決定され、その陸支コード候補領域１９について決定された最近接クラスが最終的に求めるべき陸支コードとして決定される。図７の例では、最も相違度Ｓを小さくする陸支コード候補領域１９_４が、陸支コード領域１２として決定され、その陸支コード候補領域１９_４に対応する最近接クラス「品川」が最終的に求めるべき陸支コードと決定される。 Following the setting of the land support code candidate area 19, as shown in FIG. 5, the land support code area 12 is determined and the land support code is recognized (step S12-2). More specifically, as shown in FIG. 8, pattern recognition is performed using the template image stored in the land code template 7 for each of the images of the land code candidate area 19, and the closest The class (that is, the land support code class with the smallest difference S) and the difference S are calculated. Of the land support code candidate areas 19, the one with the smallest difference is determined as the land support code area 12, and the nearest branch class determined for the land support code candidate area 19 is the land support code to be finally obtained. It is determined. In the example of FIG. 7, the most dissimilarity Riku支code candidate regions 19 ₄ to reduce the S it is determined as Riku支coding region 12, closest class "Shinagawa" final corresponding to the Riku支code candidate region 19 ₄ It is determined that the land support code should be obtained.

陸支コードは、一般に複数の文字で構成されるが、本実施の形態の画像処理装置２は、陸支コードを全体として１つの文字として取り扱うことに留意されたい。本実施の形態でいう「１文字」とは、１つのテンプレート画像に写されている図形を意味している。 Although the land support code is generally composed of a plurality of characters, it should be noted that the image processing apparatus 2 of the present embodiment handles the land support code as one character as a whole. The term “one character” in the present embodiment means a figure that is copied in one template image.

４．車種コードの認識
図５に示されているように、陸支コードの認識の後、車種コードの認識が行われる（ステップＳ１３）。 4). Recognition of vehicle type code As shown in FIG. 5, after the land code is recognized, the vehicle type code is recognized (step S13).

車種コードの認識は、まず、車種コード領域１３を決定することで開始される（ステップＳ１３−１）。図９に示されているように、車種コード領域１３は、陸支コードの認識処理において決定された陸支コード領域１２から決定される。より具体的には、陸支コード領域１２からの距離が所定の範囲にある一の文字切り出し位置候補１８が、車種コード領域１３の左端として決定され、小文字列画像１１の右端が車種コード領域１３の右端として決定される。 The recognition of the vehicle type code is first started by determining the vehicle type code area 13 (step S13-1). As shown in FIG. 9, the vehicle type code area 13 is determined from the land code area 12 determined in the land code recognition process. More specifically, one character cutout position candidate 18 whose distance from the land code region 12 is within a predetermined range is determined as the left end of the vehicle type code region 13, and the right end of the lowercase character sequence image 11 is the vehicle type code region 13. Is determined as the right end of

続いて、決定された車種コード領域１３に対して車種コード領域配置候補が設定される（ステップＳ１３−２）。車種コード領域配置候補とは、車種コード領域１３における文字領域１４〜１６の配置の候補である。車種コードが２文字又は３文字から構成されることから、２文字の車種コードに対応する車種コード領域配置候補と、３文字の車種コードに対応する車種コード領域配置候補の２種類が決定される。図１０に示されているように、３文字の車種コードに対応する車種コード領域配置候補は、第１文字候補領域２１、第２文字候補領域２２及び第３文字候補領域２３の組み合わせで表現される。ここで、第１文字候補領域２１とは、車種コードの第１文字目が存在する第１文字領域１４の候補であり、第２文字候補領域２２とは、車種コードの第２文字目が存在する第２文字領域１５の候補であり、第３文字候補領域２３とは、車種コードの第３文字目が存在する第３文字領域１６の候補である。文字候補領域２１〜２３の両端は、２つの文字切り出し位置候補１８によって規定される。同様に、２文字の車種コードに対応する車種コード領域配置候補は、第１文字候補領域２１及び第２文字候補領域２２の組み合わせで表現される。 Subsequently, a vehicle type code area arrangement candidate is set for the determined vehicle type code area 13 (step S13-2). The vehicle type code area arrangement candidate is a candidate for arrangement of the character areas 14 to 16 in the vehicle type code area 13. Since the vehicle type code is composed of two or three characters, two types of vehicle type code region arrangement candidates corresponding to the two-character vehicle type code and vehicle type code region arrangement candidates corresponding to the three-character vehicle type code are determined. . As shown in FIG. 10, the vehicle type code area arrangement candidate corresponding to the 3-character vehicle type code is expressed by a combination of the first character candidate area 21, the second character candidate area 22, and the third character candidate area 23. The Here, the first character candidate area 21 is a candidate for the first character area 14 where the first character of the vehicle type code exists, and the second character candidate area 22 exists for the second character of the vehicle type code. The third character candidate area 23 is a candidate for the third character area 16 in which the third character of the vehicle type code exists. Both ends of the character candidate areas 21 to 23 are defined by two character cutout position candidates 18. Similarly, the vehicle type code area arrangement candidate corresponding to the 2-character vehicle type code is expressed by a combination of the first character candidate area 21 and the second character candidate area 22.

車種コード領域配置候補の決定は、具体的には、文字切り出し位置候補１８を、第１文字候補領域２１、第２文字候補領域２２、第３文字候補領域２３の始点（左端）及び終点（右端）に対応付けることによって行われる。一例として、図１０に示されているように、車種コード領域１３の位置”１”〜”６”に対して６つの文字切り出し位置候補１８_１〜１８_６が存在する場合を考える。例えば、ある車種コード領域配置候補では、文字切り出し位置候補１８_１、１８_２が、それぞれ、第１文字候補領域２１の始点（左端）及び終点（右端）にそれぞれ対応付けられ、文字切り出し位置候補１８_２、１８_３が第２文字候補領域２２の始点及び終点にそれぞれ対応付けられ、文字切り出し位置候補１８_３、１８_４が、第３文字候補領域２３の始点及び終点にそれぞれ対応付けられる。他の車種コード領域配置候補では、図１１に示されているように、異なる対応付けが行われる。図１１の例では、文字切り出し位置候補１８_１、１８_２が、それぞれ、第１文字候補領域２１の始点（左端）及び終点（右端）にそれぞれ対応付けられ、文字切り出し位置候補１８_３、１８_４が第２文字候補領域２２の始点及び終点にそれぞれ対応付けられ、文字切り出し位置候補１８_５、１８_６が、第３文字候補領域２３の始点及び終点にそれぞれ対応付けられる。 Specifically, the determination of the vehicle type code area arrangement candidate is performed by setting the character extraction position candidate 18 as the start point (left end) and end point (right end) of the first character candidate area 21, the second character candidate area 22, and the third character candidate area 23. ). As an example, let us consider a case where there are _six character cutout position candidates 18 ₁ to 18 ₆ for the positions “1” to “6” of the vehicle type code area 13 as shown in FIG. For example, in a certain car model code area arrangement candidate, the character cutout position candidates 18 ₁ and 18 ₂ are associated with the start point (left end) and the end point (right end) of the first character candidate area 21, respectively. ₂ and 18 ₃ are associated with the start point and end point of the second character candidate region 22, respectively, and the character cutout position candidates 18 ₃ and 18 ₄ are associated with the start point and end point of the third character candidate region 23, respectively. In other vehicle type code region arrangement candidates, different associations are performed as shown in FIG. In the example of FIG. 11, the character cutout position candidates 18 ₁ and 18 ₂ are respectively associated with the start point (left end) and the end point (right end) of the first character candidate area 21, and character cutout position candidates 18 ₃ and 18 _4. Are respectively associated with the start point and end point of the second character candidate region 22, and the character cutout position candidates 18 ₅ and 18 ₆ are associated with the start point and end point of the third character candidate region 23, respectively.

ただし、車種コード領域配置候補は、それに含まれる第１文字候補領域２１、第２文字候補領域２２及び第３文字候補領域２３それぞれの幅Ｗ_１、Ｗ_２，Ｗ_３が、いずれも最小文字幅Ｗ_{ｍｉｎ＿Ｎ}以上、最大文字幅Ｗ_{ｍａｘ＿Ｎ}以下であるように決定される。ここで、最小文字幅Ｗ_{ｍｉｎ＿Ｎ}、最大文字幅Ｗ_{ｍａｘ＿Ｎ}は、下記式で表される定数である：
Ｗ_{ｍｉｎ＿Ｎ}＝ｈ×ｋ_{ｍｉｎ＿Ｎ}，・・・（２ａ）
Ｗ_{ｍａｘ＿Ｎ}＝ｈ×ｋ_{ｍａｘ＿Ｎ}，・・・（２ｂ）
ここで、ｈは、小文字列画像１１の高さであり、ｋ_{ｍｉｎ＿Ｎ}、ｋ_{ｍａｘ＿Ｎ}は、所定のパラメータである。このような文字候補領域２１〜２３の組み合わせの決定方法は、車種コードとなり得る文字（即ち、数字）の縦横比が、ある一定の範囲に収まることを基礎としている。車種コードを構成する文字の縦横比として想定できないような縦横比を有する文字候補領域を含む車種コード領域配置候補は設定されない。これは、車種コード領域配置候補の数を減らし、これによって演算量を減らすために有効である。 However, the vehicle type code area arrangement candidates include the width W ₁ , W ₂ , and W _{3 of} the first character candidate area 21, the second character candidate area 22, and the third character candidate area 23, all of which are the minimum character width. It is determined to be not less than W _{min — N and not} more than the maximum character width W _{max — N.} Here, the minimum character width W _{min — N} and the maximum character width W _{max — N} are constants represented by the following expressions:
W _min — _N = h × k _{min — N} , (2a)
W _max — _N = h × k _{max — N} , (2b)
Here, h is the height of the lowercase sequence image 11, and _{kmin_N} and _{kmax_N} are predetermined parameters. Such a method for determining the combination of the character candidate areas 21 to 23 is based on the fact that the aspect ratio of characters (that is, numbers) that can be a vehicle type code falls within a certain range. A vehicle type code area arrangement candidate including a character candidate area having an aspect ratio that cannot be assumed as the aspect ratio of the characters constituting the vehicle type code is not set. This is effective in reducing the number of vehicle type code area arrangement candidates and thereby reducing the amount of calculation.

例えば、図１２に示されているように、幅を考慮しなければ、５つの文字候補領域が第１文字候補領域として設定され得る；しかし、上記の幅の制限により、始点が位置”１”であり終点が位置”２”である文字候補領域（１）、及び始点が位置”１”であり終点が位置”３”である文字候補領域（２）のみが、第１文字候補領域２１として許容される。 For example, as shown in FIG. 12, if the width is not considered, five character candidate regions can be set as the first character candidate regions; however, the start point is located at the position “1” due to the above width limitation. And the character candidate region (1) whose end point is the position “2” and the character candidate region (2) whose start point is the position “1” and whose end point is the position “3” are the first character candidate regions 21 Permissible.

加えて、車種コード領域配置候補は、それに含まれる文字候補領域２１〜２３の全てについて、第ｉ文字候補領域の左端が、車種コード領域１３の左端からの距離が（ｉ−１）×Ｗ_{ｍｉｎ＿Ｎ}以上の位置にあり、第ｉ文字候補領域の右端が、車種コード領域１３の左端からの距離がｉ×Ｗ_{ｍａｘ＿Ｎ}以下の位置にあるように決定される。例えば、図１３に示されているように、第２文字候補領域２２は、車種コード領域１３の左端からの距離がＷ_{ｍｉｎ＿Ｎ}以上である位置にその左端が位置しており、距離が２×Ｗ_{ｍａｘ＿Ｎ}以下である位置にその右端が位置するように決定される。これは、第１文字候補領域２１、第２文字候補領域２２及び第３文字候補領域２３のいずれか一つでもが妥当でない位置に存在するような車種コード領域配置候補を排除するためである。 In addition, the vehicle type code region arrangement candidate is such that, for all of the character candidate regions 21 to 23 included therein, the left end of the i-th character candidate region is a distance (i−1) × W _min — _N from the left end of the vehicle type code region 13. At the above position, the right end of the i-th character candidate area is determined such that the distance from the left end of the vehicle type code area 13 is at a position equal to or less than i × W _{max_N} . For example, as shown in FIG. 13, the second character candidate region 22 is located at a position where the distance from the left end of the vehicle type code region 13 is _{equal to} or greater than W _{min — N} , and the distance is 2 × W. It is determined so that the right end is located at a position that is less than or equal to _{max_N} . This is to eliminate vehicle type code area arrangement candidates in which any one of the first character candidate area 21, the second character candidate area 22, and the third character candidate area 23 is present at an invalid position.

例えば、図１２の例では、これらの条件を満足するように車種コード領域配置候補が決定されると、文字候補領域（１）、（２）が第１文字候補領域２１として、文字候補領域（３）〜（６）が第２文字候補領域２２として、文字候補領域（５）〜（９）が第３文字候補領域２３として決定されることになる。車種コード領域配置候補は、このようにして決められた第１文字候補領域２１、第２文字候補領域２２及び第３文字候補領域２３の組み合わせである。 For example, in the example of FIG. 12, when the vehicle type code area arrangement candidate is determined so as to satisfy these conditions, the character candidate areas (1) and (2) are set as the first character candidate area 21 as the character candidate area ( 3) to (6) are determined as the second character candidate region 22, and the character candidate regions (5) to (9) are determined as the third character candidate region 23. The vehicle type code region arrangement candidate is a combination of the first character candidate region 21, the second character candidate region 22, and the third character candidate region 23 determined in this way.

続いて、図５に示されているように、車種コード領域配置候補のそれぞれについて、評価値φ_{ｓｔｒｉｎｇ}が算出される（ステップＳ１３−３）。車種コード領域配置候補の評価値φ_{ｓｔｒｉｎｇ}は、各文字候補領域の「文字らしさ」を表す文字らしさ評価値φ_ｃｈａｒと、隣接する文字候補領域の間の「隙間らしさ」を表す隙間らしさ評価値φ_ｇａｐの総和で表される。文字らしさ評価値φ_ｃｈａｒは、対応する文字候補領域の画像が「文字らしい」ほど増大し、「文字らしくない」ほど減少する値である。一方、隙間らしさ評価値φ_ｇａｐは、隣接する２つ文字候補領域の間の領域が「隙間らしくない」（例えば、隙間としては狭すぎる、又は広すぎる）ほど減少される値である。より具体的には、３文字の車種コードに対応している車種コード領域配置候補の評価値φ_{ｓｔｒｉｎｇ}は、下記式：
φ_{ｓｔｒｉｎｇ}＝φ^１ _ｃｈａｒ＋φ^１２ _ｇａｐ＋φ^２ _ｃｈａｒ＋φ^２３ _ｇａｐ＋φ^３ _ｃｈａｒ，・・・（３ａ）
によって算出される。ここで、φ^ｉ _ｃｈａｒは、第ｉ文字候補領域の文字らしさ評価値であり、φ^ｉｊ _ｇａｐは、第ｉ文字候補領域と第ｊ文字候補領域とについて算出された隙間らしさ評価値である。一方、２文字の車種コードに対応している車種コード領域配置候補の評価値φ_{ｓｔｒｉｎｇ}は、下記式：
φ_{ｓｔｒｉｎｇ}＝φ^１ _ｃｈａｒ＋φ^１２ _ｇａｐ＋φ^２ _ｃｈａｒ，・・・（３ｂ）
によって算出される。 Subsequently, as illustrated in FIG. 5, the evaluation value φ _string is calculated for each of the vehicle type code area arrangement candidates (step S13-3). The evaluation value φ _string of the vehicle type code region arrangement candidate is a character likelihood evaluation value φ _char representing “character character” of each character candidate region and a character likelihood evaluation value φ representing “gap character” between adjacent character candidate regions. It is expressed as the sum of _gaps . The character-likeness evaluation value φ _char is a value that increases as the image of the corresponding character candidate region “looks like a character” and decreases as “not like a character”. On the other hand, the gap-likeness evaluation value _φgap is a value that decreases as the area between two adjacent character candidate areas is “not like a gap” (for example, too narrow or too wide as a gap). More specifically, evaluation value phi _string models coding region arrangement candidates that correspond to the three letter models code is represented by the following formula:
φ _string = φ ¹ _char + φ ¹² _gap + φ ² _char + φ ²³ _gap + φ ³ _char , (3a)
Is calculated by Here, φ ⁱ _char is a character likelihood evaluation value of the i-th character candidate area, and φ ^ij _gap is a gap likelihood evaluation value calculated for the i-th character candidate area and the j-th character candidate area. On the other hand, the evaluation value φ _string of the vehicle type code area arrangement candidate corresponding to the 2-character vehicle type code is expressed by the following formula:
φ _string = φ ¹ _char + φ ¹² _gap + φ ² _char , (3b)
Is calculated by

第ｉ文字候補領域の文字らしさ評価値φ^ｉ _ｃｈａｒの算出は、下記の３つの手順で行われる：まず、第ｉ文字候補領域の画像について文字認識が行われ、最近接文字クラス、及び、その相違度Ｓ_ｉが算出される。例えば、図１４に示されているように、第１文字候補領域２１の左端が文字切り出し位置候補１８_１、右端が文字切り出し位置候補１８_２である場合、第１文字候補領域２１には、「１」の文字が完全な状態で含まれる。このような場合、最近接文字クラスとして「１」が選択され、更に、小さな相違度が与えられる。一方、第１文字候補領域２１の左端が文字切り出し位置候補１８_１、右端が文字切り出し位置候補１８_３である場合には、第１文字候補領域２１には「１」の文字及び「８」の文字の左半分が含まれる。このような場合には、最近接文字クラスとして不適切なクラスが選択され、大きな相違度が与えられる。 The character likelihood evaluation value φ ⁱ _char of the i-th character candidate region is calculated in the following three steps: First, character recognition is performed on the image of the i-th character candidate region, the closest character class, and The dissimilarity S _i is calculated. For example, as illustrated in FIG. 14, when the left end of the first character candidate area 21 is the character cutout position candidate 18 ₁ and the right end is the character cutout position candidate 18 ₂ , the first character candidate area 21 includes “ The character “1” is included in the complete state. In such a case, “1” is selected as the closest character class, and a small difference is given. On the other hand, if the left edge of the first character candidate region 21 character segmentation position candidate 18 _1, right end is a character segmenting position candidate 18 _3, the first character candidate region 21 characters and "8" and "1" Contains the left half of the character. In such a case, an inappropriate class is selected as the closest character class, and a large difference is given.

更に、第ｉ文字候補領域の文字幅減点値Ｐ^ｉ _ｃｈａｒが算出される。文字幅減点値Ｐ^ｉ _ｃｈａｒとは、第ｉ文字候補領域の幅Ｗ_ｉが、文字が存在する文字領域の幅として妥当でない場合に増大される数値であり、具体的には、下記式：
Ｐ^ｉ _ｃｈａｒ＝ｃ_１（Ｗ_{ｍｉｎ＿ｃｈａｒ}−Ｗ_ｉ），（Ｗ_ｉ＜Ｗ_{ｍｉｎ＿ｃｈａｒ}の場合）・・・（４ａ）
Ｐ^ｉ _ｃｈａｒ＝ｃ_２（Ｗ_ｉ−Ｗ_{ｍａｘ＿ｃｈａｒ}），（Ｗ_{ｍａｘ＿ｃｈａｒ}＜Ｗ_ｉの場合）・・・（４ｂ）
Ｐ^ｉ _ｃｈａｒ＝０，（Ｗ_{ｍｉｎ＿ｃｈａｒ}≦Ｗ_ｉ≦Ｗ_{ｍａｘ＿ｃｈａｒ}の場合）・・・（４ｃ）
で定められる。ここで、ｃ_１，ｃ_２は、重みパラメータであり、Ｗ_{ｍｉｎ＿ｃｈａｒ}、Ｗ_{ｍａｘ＿ｃｈａｒ}は、それぞれ、「文字らしい」画像が含まれている文字候補領域の幅の下限値、及び上限値を表すパラメータであり、下記式で決定される：
Ｗ_{ｍｉｎ＿ｃｈａｒ}＝ｈ・ｋ_{ｍｉｎ＿ｃｈａｒ}，
Ｗ_{ｍａｘ＿ｃｈａｒ}＝ｈ・ｋ_{ｍａｘ＿ｃｈａｒ}，
ただし、ｈは、小文字列画像の高さであり、ｋ_{ｍｉｎ＿ｃｈａｒ}、ｋ_{ｍａｘ＿ｃｈａｒ}は、所定の定数である。このような文字幅減点値Ｐ^ｉ _ｃｈａｒの決定方法は、文字領域１４〜１６の縦横比が、ある一定の範囲に収まることを基礎としている。文字領域１４〜１６の縦横比として想定できないような縦横比を有する文字候補領域は設定されない。 Further, a character width deduction point P ⁱ _{char of} the i-th character candidate area is calculated. The character width deduction value P ⁱ _char is a numerical value that is increased when the width W _i of the i-th character candidate region is not valid as the width of the character region in which the character exists, and specifically, the following formula:
P ⁱ _char = c ₁ (W _min — _char −W _i ), (W _i <W _min — _char ) (4a)
P ⁱ _char = c ₂ (W _i −W _{max_char} ), (in the case of W _{max_char} <W _i ) (4b)
P ⁱ _char = _{0, (the} case of _{_{_{W min_char ≦ W i ≦ W max_char}}} ) ··· (4c)
Determined by Here, c ₁ and c ₂ are weight parameters, and W _{min_char} and W _{max_char} are parameters respectively _indicating a lower limit value and an upper limit value of the width of a character candidate area including a “character-like” image. Yes, determined by:
W _{min_char} = h · k _{min_char} ,
W _{max_char} = h · k _{max_char} ,
However, h is the height of the lowercase string _{_image, k} _{min_char,} k _{max_char} is a predetermined constant. Such a method for determining the character width deduction point P ⁱ _char is based on the fact that the aspect ratio of the character regions 14 to 16 falls within a certain range. A character candidate area having an aspect ratio that cannot be assumed as the aspect ratio of the character areas 14 to 16 is not set.

図１５を参照して、第ｉ文字候補領域の幅Ｗ_ｉが、Ｗ_{ｍｉｎ＿ｃｈａｒ}以上Ｗ_{ｍａｘ＿ｃｈａｒ}以下である場合には、文字幅減点値Ｐ^ｉ _ｃｈａｒは０である。一方、第ｉ文字候補領域の幅Ｗ_ｉがＷ_{ｍｉｎ＿ｃｈａｒ}よりも小さい場合には第ｉ文字候補領域の画像が文字らしくないと判断され、文字幅減点値Ｐ^ｉ _ｃｈａｒがＷ_{ｍｉｎ＿ｃｈａｒ}と幅Ｗ_ｉとの差に応じて増大される。同様に、第ｉ文字候補領域の幅Ｗ_ｉが、Ｗ_{ｍａｘ＿ｃｈａｒ}よりも大きい場合には第ｉ文字候補領域の画像が文字らしくないと判断され、文字幅減点値Ｐ^ｉ _ｃｈａｒが幅Ｗ_ｉとＷ_{ｍａｘ＿ｃｈａｒ}との差に応じて増大される。 Referring to FIG. 15, when the width W _i of the i-th character candidate region is not _less than W _{min_char and not} more than W _{max_char} , the character width deduction point P ⁱ _char is 0. On the other hand, if the width W _i of the i-th character candidate region is smaller than W _{min_char} , it is determined that the image of the i-th character candidate region does not _look like a character, and the character width deduction point P ⁱ _char is set to W _{min_char} and the width W _i . It is increased in accordance with the difference. Similarly, if the width W _i of the i-th character candidate area is larger than W _{max_char} , it is determined that the image of the i-th character candidate area does not look like a character, and the character width deduction point P ⁱ _char is set to the widths W _i and W Increased according to the difference from _{max_char} .

文字らしさ評価値φ^ｉ _ｃｈａｒは、最小相違度Ｓ_ｉが増大するほど、また、文字幅減点値Ｐ^ｉ _ｃｈａｒが増大するほど小さくなるように決定される。例えば、文字らしさ評価値φ^ｉ _ｃｈａｒは、下記式：
φ^ｉ _ｃｈａｒ＝Ｃ−Ｓ_ｉ−Ｐ^ｉ _ｃｈａｒ，・・・（５）
によって算出され得る。ここでＣは、定数である。式（５）から理解されるように、文字らしさ評価値φ^ｉ _ｃｈａｒは、第ｉ文字候補領域の幅Ｗ_ｉを加味した値になる。相違度のみならず第ｉ文字候補領域の幅Ｗ_ｉを用いて文字らしさ評価値φ^ｉ _ｃｈａｒを算出することは、第ｉ文字候補領域の「文字らしさ」を簡便に、且つ、適切に評価するために有効である。 The character likelihood evaluation value φ ⁱ _char is determined so as to decrease as the minimum dissimilarity S _i increases and as the character width deduction point P ⁱ _char increases. For example, the character-likeness evaluation value φ ⁱ _char is expressed by the following formula:
φ ⁱ _char = C−S _i −P ⁱ _char , (5)
Can be calculated by: Here, C is a constant. As understood from the equation (5), the character likelihood evaluation value φ ⁱ _char is a value in consideration of the width W _i of the i-th character candidate region. Calculating the character-likeness evaluation value φ ⁱ _char using not only the degree of difference but also the width W _i of the i-th character candidate region simply and appropriately evaluates the “character-likeness” of the i-th character candidate region. It is effective for.

文字らしさ評価値φ^ｉ _ｃｈａｒの算出の際に起こり得る一つの問題は、「０」、「８」等の数字の中間に文字切り出し位置候補１８が存在し、その結果、文字候補領域の画像として「０」、「８」等の数字が縦に半分に切断された画像が存在しうることである。このような画像は、図１６に示されているように、最近接文字クラスが「１」であると誤って認識され、且つ、その相違度が小さくなる可能性がある。このような事態を回避するためには、文字らしさ評価値φ^ｉ _ｃｈａｒの算出のときに行われる第ｉ文字候補領域の文字認識の手順は、図１７に示されているようなフローで行われることが好適である。 One problem that may occur when calculating the character-likeness evaluation value φ ⁱ _char is that the character extraction position candidate 18 exists in the middle of the numbers such as “0” and “8”. There is an image in which numbers such as “0” and “8” are vertically cut in half. As shown in FIG. 16, such an image may be erroneously recognized as having the closest character class “1”, and the difference may be reduced. In order to avoid such a situation, the procedure for character recognition of the i-th character candidate region performed when calculating the character likelihood evaluation value φ ⁱ _char is performed according to the flow shown in FIG. Is preferred.

まず、車種コード用テンプレート８を用いて第ｉ文字候補領域の文字認識が行われ、最近接クラスと最小相違度Ｓ_ｉとが取得される（ステップＳ２１）。文字認識によって得られた最近接クラスが「１」である場合には（ステップＳ２２）、当該第ｉ文字候補領域を左右に数画素広げた領域が設定され（ステップＳ２３）、設定されたその領域について、車種コード用テンプレート８を用いて再度に文字認識が行われる（ステップＳ２４）。これは、図１７から理解されるように、第ｉ文字候補領域を左右に数画素広げた領域を使用することにより、第ｉ文字候補領域が本当に「１」を含んでいるのか、或いは、「０」、「８」等の数字が縦に半分に切断された画像を含んでいるのかを判別できるからである。ステップＳ２４で得られた最近接文字クラスが再度に「１」であった場合には（ステップＳ２５）、ステップＳ２１において得られた最小相違度Ｓ_ｉが採用され、文字らしさ評価値φ^ｉ _ｃｈａｒの算出が行われる。一方、ステップＳ２４で得られた最近接クラスが「１」でない場合には当該第ｉ文字候補領域を含む車種コード領域配置候補が無効化される（ステップＳ２６）。より具体的には、ステップＳ２４で得られた最近接クラスが「１」でない場合には第ｉ文字候補領域の最小相違度Ｓ_ｉが、非常に大きな所定の数に書き換えられる。これにより、文字らしさ評価値φ^ｉ _ｃｈａｒが小さくなり、当該第ｉ文字候補領域を含む車種コード領域配置候補の評価値φ_{ｓｔｒｉｎｇ}が小さくなる。これにより、当該第ｉ文字候補領域を含む車種コード領域配置候補が無効化され（即ち、当該車種コード領域配置候補が最適な車種コード領域配置候補として選択されなくなり）、誤った文字認識が行われる確率を減らすことができる。 First, character recognition of the i-th character candidate area is performed using the vehicle type code template 8, and the closest class and the minimum difference degree _Si are acquired (step S21). When the closest class obtained by character recognition is “1” (step S22), an area obtained by expanding the i-th character candidate area by several pixels to the left and right is set (step S23), and the set area The character recognition is performed again using the vehicle type code template 8 (step S24). As can be understood from FIG. 17, by using an area where the i-th character candidate area is expanded by several pixels to the left and right, the i-th character candidate area really includes “1” or “ This is because it is possible to determine whether numbers such as “0” and “8” include an image that is vertically cut in half. When the closest character class obtained in step S24 is “1” again (step S25), the minimum difference S _i obtained in step S21 is adopted, and the character likelihood evaluation value φ ⁱ _char is determined. Calculation is performed. On the other hand, if the closest class obtained in step S24 is not “1”, the vehicle type code area arrangement candidate including the i-th character candidate area is invalidated (step S26). More specifically, when the closest class obtained in step S24 is not “1”, the minimum difference S _i of the i-th character candidate region is rewritten to a very large predetermined number. As a result, the character likelihood evaluation value φ ⁱ _char decreases, and the evaluation value φ _string of the vehicle type code area arrangement candidate including the i-th character candidate area decreases. As a result, the vehicle type code region arrangement candidate including the i-th character candidate region is invalidated (that is, the vehicle type code region arrangement candidate is not selected as the optimum vehicle type code region arrangement candidate), and erroneous character recognition is performed. Probability can be reduced.

一方、第ｉ文字候補領域と第ｊ文字候補領域との隙間の隙間らしさ評価値φ^ｉｊ _ｇａｐは、図１８に示されているように、第ｉ文字候補領域と第ｊ文字候補領域との隙間の幅Ｗ^ｉｊ _ｇａｐに依存する隙間減点値Ｐ^ｉｊ _ｇａｐから算出される。隙間減点値Ｐ^ｉｊ _ｇａｐとは、第ｉ文字候補領域と第ｊ文字候補領域との隙間の幅Ｗ^ｉｊ _ｇａｐが、隙間として妥当でない場合に増大される数値であり、具体的には、下記式：
Ｐ^ｉｊ _ｇａｐ＝ｃ_ｇａｐ（Ｗ_{ｍｉｎ＿ｇａｐ}−Ｗ^ｉｊ _ｇａｐ），
（Ｗ^ｉｊ _ｇａｐ＜Ｗ_{ｍｉｎ＿ｇａｐ}の場合）・・・（６ａ）
Ｐ^ｉｊ _ｇａｐ＝ｃ_ｇａｐ（Ｗ_ｉ−Ｗ_{ｍａｘ＿ｃｈａｒ}），
（Ｗ_{ｍａｘ＿ｇａｐ}＜Ｗ^ｉｊ _ｇａｐの場合）・・・（６ｂ）
Ｐ^ｉｊ _ｇａｐ＝０，（Ｗ_{ｍｉｎ＿ｇａｐ}≦Ｗ^ｉｊ _ｇａｐ≦Ｗ_{ｍａｘ＿ｇａｐ}の場合）・・・（６ｃ）
で定められる。ここで、ｃ_ｇａｐは、所定のパラメータであり、また、Ｗ_{ｍｉｎ＿ｇａｐ}、Ｗ_{ｍａｘ＿ｇａｐ}は、それぞれ、「文字らしい」画像が含まれている文字候補領域の幅の下限値、及び上限値を表すパラメータであり、下記式によって定められる：
Ｗ_{ｍｉｎ＿ｇａｐ}＝ｈ・ｋ_{ｍｉｎ＿ｇａｐ}，
Ｗ_{ｍａｘ＿ｇａｐ}＝ｈ・ｋ_{ｍａｘ＿ｇａｐ}，
ただし、ｈは、小文字列画像の高さであり、ｋ_{ｍｉｎ＿ｇａｐ}、ｋ_{ｍａｘ＿ｇａｐ}は、所定の定数である。このような隙間らしさ評価値φ^ｉｊ _ｇａｐの決定方法は、文字領域１４〜１６の間の隙間の縦横比が、ある一定の範囲に収まることを基礎としている。文字領域１４〜１６の隙間の縦横比として想定できないような縦横比を有する隙間がある場合には、隙間減点値Ｐ^ｉｊ _ｇａｐが増大される。 On the other hand, the gap-likeness evaluation value φ ^ij _gap between the i-th character candidate area and the j-th character candidate area is, as shown in FIG. 18, a gap between the i-th character candidate area and the j-th character candidate area. It is calculated from the gap deduction point value P ^ij _gap depending on the width W ^ij _gap of. The gap deduction value P ^ij _gap is a numerical value that is increased when the width W ^ij _gap of the gap between the i-th character candidate area and the j-th character candidate area is not valid as a gap. :
P ^ij _gap = c _gap (W _{min —gap} −W ^ij _gap ),
(W ^ij _gap <W _{min_gap} ) (6a)
P ^ij _gap = c _gap (W _i −W _{max_char} ),
(In the case of _{^{_{W max_gap <W ij gap) ···}}} (6b)
P ^ij _gap = 0, (when W _{min_gap} ≦ W ^ij _gap ≦ W _{max_gap} ) (6c)
Determined by Here, c _gap is a predetermined parameter, and W _{min_gap} and W _{max_gap} are parameters respectively _indicating a lower limit value and an upper limit value of the width of the character candidate area including the “character-like” image. Yes, determined by the following formula:
W _{min_gap} = h · k _{min_gap} ,
W _{max_gap} = h · k _{max_gap} ,
However, h is the height of the lowercase string _{_image, k} _{min_gap,} k _{max_gap} is a predetermined constant. Such a method of determining the evaluation value φ ^ij _gap of the gap is based on the fact that the aspect ratio of the gap between the character areas 14 to 16 falls within a certain range. When there is a gap having an aspect ratio that cannot be assumed as the aspect ratio of the gap between the character areas 14 to 16, the gap deduction point value P ^ij _gap is increased.

隙間らしさ評価値φ^ｉｊ _ｇａｐは、隙間減点値Ｐ^ｉｊ _ｇａｐが増大するほど小さくなるように決定される。例えば、隙間らしさ評価値φ^ｉｊ _ｇａｐは、下記式：
φ^ｉｊ _ｇａｐ＝−Ｐ^ｉｊ _ｇａｐ，・・・（７）
によって算出され得る。式（７）から理解されるように、隙間らしさ評価値φ^ｉｊ _ｇａｐは、第ｉ文字候補領域と第ｊ文字候補領域との隙間の幅Ｗ^ｉｊ _ｇａｐに依存する値である。隙間の幅Ｗ^ｉｊ _ｇａｐを用いて隙間らしさ評価値φ^ｉｊ _ｇａｐを算出することは、第ｉ文字候補領域と第ｊ文字候補領域との隙間の「隙間らしさ」を簡便に、且つ、適切に評価するために有効である。 The gap likelihood evaluation value φ ^ij _gap is determined so as to decrease as the gap deduction point value P ^ij _gap increases. For example, the clearance likelihood evaluation value φ ^ij _gap is given by the following formula:
φ ^ij _gap = −P ^ij _gap , (7)
Can be calculated by: As understood from the equation (7), the gap likelihood evaluation value φ ^ij _gap is a value depending on the width W ^ij _gap of the _gap between the i-th character candidate region and the j-th character candidate region. The calculation of the gap likelihood evaluation value φ ^ij _gap using the gap width W ^ij _gap easily and appropriately evaluates the “gap likelihood” of the gap between the i-th character candidate area and the j-th character candidate area. It is effective to do.

以上の手順により、文字らしさ評価値φ^ｉ _ｃｈａｒと隙間らしさ評価値φ^ｉｊ _ｇａｐとが算出され、更に式（３ａ）（３ｂ）を用いて評価値φ_{ｓｔｒｉｎｇ}が算出された後、算出された評価値φ_{ｓｔｒｉｎｇ}から、３文字の車種コードに対応する車種コード領域配置候補のうちで最適な車種コード領域配置候補と、２文字の車種コードに対応する車種コード領域配置候補のうちで最適な車種コード領域配置候補とが決定される（ステップＳ１３−４）。式（３ａ）から算出された評価値φ_{ｓｔｒｉｎｇ}を最も大きくする車種コード領域配置候補が、３文字の車種コードに対応する車種コード領域配置候補のうちで最適な車種コード領域配置候補であると決定される。同様に、式（３ｂ）から算出された評価値φ_{ｓｔｒｉｎｇ}を最も大きくする車種コード領域配置候補が、２文字の車種コードに対応する車種コード領域配置候補のうちで最適な車種コード領域配置候補であると決定される。 By the above procedure, the character likelihood evaluation value φ ⁱ _char and the gap likelihood evaluation value φ ^ij _gap are calculated, and further, the evaluation value φ _string is calculated using the equations (3a) and (3b), and then the calculated evaluation is calculated. From the value φ _string , the optimal vehicle type code area arrangement candidate among the vehicle type code area arrangement candidates corresponding to the 3-character vehicle type code and the optimal vehicle type code among the vehicle type code area arrangement candidates corresponding to the 2-character vehicle type code A region arrangement candidate is determined (step S13-4). The vehicle type code area arrangement candidate that maximizes the evaluation value φ _string calculated from the equation (3a) is determined to be the optimum vehicle type code area arrangement candidate among the vehicle type code area arrangement candidates corresponding to the three-character vehicle type code. Is done. Similarly, the vehicle type code region arrangement candidate that maximizes the evaluation value φ _string calculated from the expression (3b) is the optimum vehicle type code region arrangement candidate among the vehicle type code region arrangement candidates corresponding to the two-character vehicle type code. Determined to be.

全ての車種コード領域配置候補について個別に評価値φ_{ｓｔｒｉｎｇ}を算出し、算出された評価値φ_{ｓｔｒｉｎｇ}から（２文字の車種コード、及び３文字の車種コードのそれぞれについて）最適な車種コード領域配置候補を選択することは、可能ではあるが演算量を過剰に増大させ得る。車種コード領域配置候補の評価値を算出するために必要な演算量を小さくするためには、最適な車種コード領域配置候補の決定にＤＰマッチングが採用されることが好ましい。ＤＰマッチングを採用することにより、演算の重複を避け、少ない演算量で車種コード領域配置候補の評価値φ_{ｓｔｒｉｎｇ}を算出することができる。 The evaluation value φ _string is calculated individually for all the vehicle type code area arrangement candidates, and the optimal vehicle type code area arrangement candidate (for each of the two-character vehicle code and the three-character vehicle code) is calculated from the calculated evaluation value φ _string. However, it is possible to excessively increase the calculation amount. In order to reduce the amount of calculation required for calculating the evaluation value of the vehicle type code area arrangement candidate, it is preferable to employ DP matching for determining the optimal vehicle type code area arrangement candidate. By adopting DP matching, it is possible to avoid the duplication of calculation and calculate the evaluation value φ _string of the vehicle type code area arrangement candidate with a small calculation amount.

図１０（及び図１１）を参照して、本実施の形態のＤＰマッチングでは、車種コード領域配置候補（即ち、文字切り出し位置候補１８_１〜１８_６と第１〜第３文字候補領域２１〜２３との対応付け）が、文字切り出し位置候補１８に対応するＡ軸と、第１〜第３文字候補領域２１〜２３の始点及び終点に対応するＢ軸とからなる座標系上に展開された対応グラフとして表現される。Ａ軸上の要素ａ_１〜ａ_６は、車種コード領域１３に属する文字切り出し位置候補１８_１〜１８_６に対応しており、Ｂ軸上の要素ｂ_１〜ｂ_６は、第１文字候補領域２１、第２文字候補領域２２及び第３文字候補領域２３の始点及び終点に対応している；Ａ軸上の要素ａ_ｊの数は、車種コード領域１３に属する文字切り出し位置候補１８の数に合わせて調節されることに留意されたい。各対応グラフは、原点（ａ_１，ｂ_１）から座標（ａ_ｅｎｄ，ｂ_４）までの経路（２文字の車種コードに対応する場合）、又は、原点（ａ_１，ｂ_１）から座標（ａ_ｅｎｄ，ｂ_６）までの経路である；ここで、ａ_ｅｎｄは、対応グラフの終点のＡ軸上の要素であり、第２文字候補領域２２又は第３文字候補領域２３の終点として選ばれた文字切り出し位置候補１８に対応する要素である。 Referring to FIG. 10 (and FIG. 11), in the DP matching according to the present embodiment, vehicle type code region arrangement candidates (that is, character cut-out position candidates 18 ₁ to 18 ₆ and first to third character candidate regions 21 to 23). Is developed on a coordinate system consisting of the A axis corresponding to the character cutout position candidate 18 and the B axis corresponding to the start and end points of the first to third character candidate areas 21 to 23. Expressed as a graph. Elements a _{1 to} a ₆ on the A axis correspond to character cutout position candidates 18 ₁ to 18 ₆ belonging to the vehicle type code area 13, and elements b _{1 to} b ₆ on the B axis correspond to the first character candidate area. 21 corresponds to the start and end points of the second character candidate region 22 and the third character candidate region 23; the number of elements a _j on the A axis is the number of character cutout position candidates 18 belonging to the vehicle type code region 13 Note that they are adjusted accordingly. Each correspondence graph has a route from the origin (a ₁ , b ₁ ) to the coordinates (a _end , b ₄ ) (when it corresponds to a two-character vehicle model code), or coordinates from the origin (a ₁ , b ₁ ) ( a _end , b ₆ ); where a _end is an element on the A axis of the end point of the corresponding graph and is selected as the end point of the second character candidate region 22 or the third character candidate region 23 This is an element corresponding to the character cutout position candidate 18.

原点（ａ_１，ｂ_１）から座標（ａ_ｊ，ｂ_ｉ）までの対応グラフに対応する車種コード領域配置候補の評価値φ_{ｓｔｒｉｎｇ}の最大値をΦ（ａ_ｊ，ｂ_ｉ）と表すものとすれば、
Φ（ａ_ｊ，ｂ_ｉ）＝ＭＡＸ（ａ_ｋ，ｂ_ｐ）［Φ（ａ_ｋ，ｂ_ｐ）
＋φ_（ｉ）（ａ_ｋ，ｂ_ｐ，ａ_ｊ，ｂ_ｉ）］，・・・（８）
が成立する。ここで、ｂ_ｐは、ｂ_ｉの一つ前の要素であり、
ｂ_ｐ＝ｂ_ｉ−１，・・・（９）
である。また、ＭＡＸ（ａ_ｋ，ｂ_ｐ）［Ｘ（ａ_ｋ，ｂ_ｐ）］は、ａ_ｋ、ｂ_ｐを変化させたときのＸ（ａ_ｋ，ｂ_ｐ）の最大値を表す。更に、φ_（ｉ）（ａ_ｋ，ｂ_ｐ，ａ_ｊ，ｂ_ｉ）は、
（α）ｉが奇数のとき、要素ａ_ｋ，ａ_ｊに対応する文字切り出し位置候補の間に形成される隙間の隙間らしさ評価値φ_ｇａｐであり、
（β）ｉが偶数のとき、要素ａ_ｋ，ａ_ｊに対応する文字切り出し位置候補の間の文字候補領域の文字らしさ評価値φ_ｃｈａｒである。
また、ｂ_ｉ＝ｂ_１のときには、特別に、Φ＝０と決定される。 Origin _(a 1, _{b 1)} from the coordinates _(a j, _{b i)} up to the maximum value of the evaluation values phi _string models coding region arrangement candidate corresponding to the corresponding graph Φ _(a j, _{b i)} and represents the if,
Φ (a _j , b _i ) = MAX (a _k , b _p ) [Φ (a _k , b _p )
+ Φ _(i) (a _k , b _p , a _j , b _i )], (8)
Is established. Here, _{b p} is the previous element of the _{b i,}
b _p = b _i−1 , (9)
It is. MAX (a _k , b _p ) [X (a _k , b _p )] represents the maximum value of X (a _k , b _p ) when a _k and b _p are changed. Furthermore, φ _(i) (a _k , b _p , a _j , b _i ) is
(Α) When i is an odd number, a gap likelihood evaluation value φ _gap of a gap formed between character cutout position candidates corresponding to the elements a _k and a _j ,
(Β) When i is an even number, the character likelihood evaluation value φ _char of the character candidate area between the character cutout position candidates corresponding to the elements a _k and a _j .
Further, when b _i = b ₁ , it is determined that Φ = 0.

このΦ（ａ_ｊ，ｂ_ｉ）は、ｂ_ｉの漸化式となっているため、座標（ａ_１，ｂ_１）（即ち、原点）からスタートして座標（ａ_６，ｂ_６）まで、順次にΦ（ａ_ｊ，ｂ_ｉ）を決定することができる。加えて、Φ（ａ_ｊ，ｂ_ｉ）を決定するとき、Φ（ａ_ｋ，ｂ_ｐ）＋φ_（ｉ）（ａ_ｋ，ｂ_ｐ，ａ_ｊ，ｂ_ｉ）を最大にする座標(ａ_ｋ，ｂ_ｐ）が、座標（ａ_ｊ，ｂ_ｉ）と対応付けられて保存される。具体的には、座標（ａ_ｊ，ｂ_ｉ）から座標(ａ_ｋ，ｂ_ｐ）へのポインタが記憶装置に保存される。 Since this Φ (a _j , b _i ) is a recurrence formula of b _i , starting from the coordinates (a ₁ , b ₁ ) (that is, the origin) to the coordinates (a ₆ , b ₆ ), Φ (a _j , b _i ) can be determined sequentially. In addition, when Φ (a _j , b _i ) is determined, the coordinates (a _k , b _p ) + φ _(i) (a _k , b _p , a _j , b _i ) that maximize Φ (a _k , b _p ) + φ _(i) b _p ) is stored in association with the coordinates (a _j , b _i ). Specifically, a pointer from the coordinates (a _j , b _i ) to the coordinates (a _k , b _p ) is stored in the storage device.

このような方法によって、上述の処理によって決定された車種コード領域配置候補に対応するｊ，ｉの組み合わせのそれぞれについてΦ（ａ_ｊ，ｂ_ｉ）が算出される。算出されたΦ（ａ_ｊ，ｂ_ｉ）を用いて、下記の方法により、最適な対応グラフ、即ち、最適な車種コード領域配置候補が、２文字の車種コード、３文字の車種コードのそれぞれについて決定される。２文字の車種コードに対応する対応グラフの決定では、まず、Φ（ａ_ｋ，ｂ_４）の最大値が求められ、Φ（ａ_ｋ，ｂ_４）を最大にする対応グラフが、最適な対応グラフとして選択される。Φ（ａ_ｋ，ｂ_４）を最大にする対応グラフは、上述のポインタから決定可能である。同様に、３文字の車種コードに対応する最適な対応グラフの決定では、まず、Φ（ａ_ｋ，ｂ_６）の最大値が求められ、Φ（ａ_ｋ，ｂ_６）を最大にする対応グラフが、最適な対応グラフとして選択される。Φ（ａ_ｋ，ｂ_６）を最大にする対応グラフは、上述のポインタから決定可能である。 By such a method, Φ (a _j , b _i ) is calculated for each combination of j and i corresponding to the vehicle type code area arrangement candidate determined by the above-described processing. Using the calculated Φ (a _j , b _i ), an optimal correspondence graph, that is, an optimal vehicle type code area arrangement candidate is obtained for each of the 2-character vehicle type code and the 3-character vehicle type code by the following method. It is determined. In the determination of the corresponding graph corresponding to two-letter models code, _{first, Φ (a k, b 4} ) the maximum value of is determined, the corresponding graph [Phi the (a _{k, b} ₄₎ to the maximum, optimum response Selected as a graph. The correspondence graph that maximizes Φ (a _k , b ₄ ) can be determined from the pointers described above. Similarly, in the determination of the optimal response graph corresponding to the three letter models code, _{first, Φ (a k, b 6} ) the maximum value of is determined, the corresponding graph to maximize [Phi the (a _{k, b} ₆₎ Is selected as the optimal correspondence graph. The correspondence graph that maximizes Φ (a _k , b ₆ ) can be determined from the pointers described above.

図５に示されているように、３文字の車種コードについての最適な車種コード領域配置候補と、２文字の車種コードについての最適の車種コード領域配置候補とが決定された後、車種コード領域１３の文字数、即ち、いずれの車種コード領域配置候補が最適であるかが最終的に決定される（ステップＳ１３−５）。 As shown in FIG. 5, after the optimal vehicle type code area arrangement candidate for the 3-character vehicle type code and the optimal vehicle type code area arrangement candidate for the 2-character vehicle code are determined, the vehicle type code area The number of characters of 13, that is, which vehicle type code area arrangement candidate is optimal is finally determined (step S13-5).

ステップＳ１３−５における最適な車種コード領域配置候補の決定では、まず、文字の車種コードについての最適な車種コード領域配置候補と、２文字の車種コードについての最適の車種コード領域配置候補とのそれぞれが、有効であるかが判定される。具体的には、２つの車種コード領域配置候補について下記の４つの項目がチェックされ、いずれか一つでも該当する場合には、当該車種コード領域配置候補が無効であると判断される。 In the determination of the optimal vehicle type code area arrangement candidate in step S13-5, first, each of the optimal vehicle type code area arrangement candidate for the character vehicle type code and the optimal vehicle type code area arrangement candidate for the 2-character vehicle type code, respectively. Is determined to be valid. Specifically, the following four items are checked for two vehicle type code area arrangement candidates, and if any one of them corresponds, it is determined that the vehicle type code area arrangement candidate is invalid.

（１）極端に文字らしさ評価値が小さい文字候補領域が存在する
図１９Ａを参照して、第１文字候補領域２１、第２文字候補領域２２、及び存在する場合には第３文字候補領域２３に、極端に文字らしさ評価値φ_ｃｈａｒが小さいものが含まれている場合には、当該車種コード領域配置候補が無効であると判定される。具体的には、第ｉ文字候補領域の文字らしさ評価値φ^ｉ _ｃｈａｒの文字らしさ平均値φ_{ｍｅａｎ＿ｃｈａｒ}に対する比率が所定値よりも小さい場合、第ｉ文字候補領域の文字らしさ評価値φ^ｉ _ｃｈａｒが極端に小さいと判断され、対応する車種コード領域配置候補が無効であると判定される；平均値φ_{ｍｅａｎ＿ｃｈａｒ}とは、下記式で定義される値である：

ｎは、対象の車種コード領域配置候補の文字数である。 (1) Character candidate areas with extremely small character-likeness evaluation values exist With reference to FIG. 19A, a first character candidate area 21, a second character candidate area 22, and a third character candidate area 23 if present. In addition, when the character evaluation value φ _char is extremely small, it is determined that the vehicle type code area arrangement candidate is invalid. Specifically, when the ratio of the character likelihood evaluation value φ ⁱ _char of the i-th character candidate region to the character-like average value φ _{mean_char} is smaller than a predetermined value, the character likelihood evaluation value φ ⁱ _{char of} the i-th character candidate region is extremely It is determined that the corresponding vehicle type code area arrangement candidate is invalid; the average value φ _{mean_char} is a value defined by the following formula:

n is the number of characters of the target vehicle type code area arrangement candidate.

より具体的には、下記式が成立する場合に、第ｉ文字候補領域の文字らしさ評価値φ^ｉ _ｃｈａｒが極端に小さいと判断される：
φ^ｉ _ｃｈａｒ＜α・φ_{ｍｅａｎ＿ｃｈａｒ}，・・・（１１）
ここで、αは、１より小さい所定のパラメータである。図１９の例では、第２文字候補領域２２の文字らしさ評価値φ^ｉ _ｃｈａｒが極端に小さいため、当該車種コード領域配置候補が無効であると判定される。 More specifically, it is determined that the character-likeness evaluation value φ ⁱ _{char of} the i-th character candidate region is extremely small when the following formula is satisfied:
φ ⁱ _char <α · φ _{mean_char} , (11)
Here, α is a predetermined parameter smaller than 1. In the example of FIG. 19, since the character likelihood evaluation value φ ⁱ _{char of} the second character candidate area 22 is extremely small, it is determined that the vehicle type code area arrangement candidate is invalid.

（２）文字幅のバランスが悪い
図１９Ｂを参照して、第１文字候補領域２１、第２文字候補領域２２、及び存在する場合には第３文字候補領域２３の文字幅のバランスが悪い場合には、当該車種コード領域配置候補が無効であると判定される。文字幅のバランスは、文字幅（即ち、文字候補領域の幅）の最大値と、文字幅の最小値との比率に基づいて判断される。より具体的には、２文字の車種コードに対応している車種コード配置候補については、下記条件を満足する場合に文字幅のバランスが悪いと判断される：
Ｗ_ＭＡＸ／Ｗ_ＭＩＮ＞β，・・・（１２ａ）
Ｗ_ＭＡＸ＝ｍａｘ（Ｗ_１，Ｗ_２），・・・（１２ｂ）
Ｗ_ＭＩＮ＝ｍｉｎ（Ｗ_１，Ｗ_２），・・・（１２ｃ）
ここでＷ_ｉは、第ｉ文字候補領域の幅であり、βは、１より大きい所定のパラメータである。 (2) Poor character width balance Referring to FIG. 19B, character width of first character candidate region 21, second character candidate region 22, and, if present, third character candidate region 23 is poorly balanced. Is determined to be invalid. The balance of the character width is determined based on the ratio between the maximum value of the character width (that is, the width of the character candidate area) and the minimum value of the character width. More specifically, a vehicle type code arrangement candidate corresponding to a two-character vehicle type code is determined to have a poor character width balance when the following conditions are satisfied:
W _MAX / W _MIN > β, (12a)
W _MAX = max (W ₁ , W ₂ ), (12b)
W _MIN = min (W ₁ , W ₂ ), (12c)
Here, _Wi is the width of the i-th character candidate region, and β is a predetermined parameter larger than 1.

同様に、３文字の車種コードに対応している車種コード配置候補については、下記条件を満足する場合に文字幅のバランスが悪いと判断される：
Ｗ_ＭＡＸ／Ｗ_ＭＩＮ＞β，・・・（１２ｄ）
Ｗ_ＭＡＸ＝ｍａｘ（Ｗ_１，Ｗ_２，Ｗ_３），・・・（１２ｅ）
Ｗ_ＭＩＮ＝ｍｉｎ（Ｗ_１，Ｗ_２，Ｗ_３）．・・・（１２ｆ） Similarly, for the vehicle type code arrangement candidate corresponding to the 3-character vehicle type code, it is determined that the balance of the character width is bad when the following conditions are satisfied:
W _MAX / W _MIN > β, ... (12d)
W _MAX = max (W ₁ , W ₂ , W ₃ ), (12e)
W _MIN = min (W ₁ , W ₂ , W ₃ ). ... (12f)

例えば、図１９Ｂの例では、第１文字候補領域２１の幅が第２文字候補領域２２の幅と比べて極端に狭く、文字幅のバランスが悪いと判断される。 For example, in the example of FIG. 19B, it is determined that the width of the first character candidate region 21 is extremely narrow compared to the width of the second character candidate region 22, and the balance of character width is poor.

（３）文字幅に比べて極端に大きい隙間が存在する
図１９Ｃを参照して、ある一対の文字候補領域の間の隙間が、各文字候補領域の文字幅と比較して極端に大きい場合には、当該車種コード領域配置候補が無効であると判定される。文字幅と隙間の幅のバランスは、隙間の幅の最大値と、文字幅の最小値との比率に基づいて判断される。 (3) An extremely large gap exists compared to the character width Referring to FIG. 19C, when the gap between a pair of character candidate areas is extremely large compared to the character width of each character candidate area Is determined that the vehicle type code area arrangement candidate is invalid. The balance between the character width and the gap width is determined based on the ratio between the maximum gap width value and the minimum character width value.

より具体的には、２文字の車種コードに対応している車種コード配置候補については、下記条件を満足する場合に文字幅のバランスが悪いと判断される：
Ｗ^１２ _ｇａｐ／Ｗ_ＭＩＮ＞γ，・・・（１３ａ）
Ｗ_ＭＩＮ＝ｍｉｎ（Ｗ_１，Ｗ_２），・・・（１３ｂ）
ここで、Ｗ^１２ _ｇａｐは、第１文字候補領域２１と第２文字候補領域２２との間の隙間の幅であり、γは、所定のパラメータである。２文字の車種コードに対応している車種コード配置候補については、隙間は１つしかないことに留意されたい。 More specifically, a vehicle type code arrangement candidate corresponding to a two-character vehicle type code is determined to have a poor character width balance when the following conditions are satisfied:
W ¹² _gap / W _MIN > γ, (13a)
W _MIN = min (W ₁ , W ₂ ), (13b)
Here, W ¹² _gap is the width of the gap between the first character candidate region 21 and the second character candidate region 22, and γ is a predetermined parameter. It should be noted that there is only one gap for the vehicle type code arrangement candidate corresponding to the 2-character vehicle type code.

一方、３文字の車種コードに対応している車種コード配置候補については、下記条件を満足する場合に文字幅のバランスが悪いと判断される：
Ｗ_{Ｇ＿ＭＡＸ}／Ｗ_ＭＩＮ＞γ，・・・（１３ｃ）
Ｗ_{Ｇ＿ＭＡＸ}＝ｍａｘ（Ｗ^１２ _ｇａｐ，Ｗ^２３ _ｇａｐ），・・・（１３ｄ）
Ｗ_ＭＩＮ＝ｍｉｎ（Ｗ_１，Ｗ_２，Ｗ_３），・・・（１３ｅ）
ここで、Ｗ^ｉｊ _ｇａｐは、第ｉ文字候補領域と第ｊ文字候補領域の間の隙間の幅である。 On the other hand, for vehicle type code arrangement candidates corresponding to the 3-character vehicle type code, it is determined that the balance of the character width is poor when the following conditions are satisfied:
W _{G — MAX} / W _MIN > γ, (13c)
_{^{_{^{_{W G_MAX = max (W 12 gap}}}}} , W 23 gap), ··· (13d)
W _MIN = min (W ₁ , W ₂ , W ₃ ), (13e)
Here, W ^ij _gap is the width of the gap between the i-th character candidate region and the j-th character candidate region.

例えば図１９Ｃの例では、第１文字候補領域２１と第２文字候補領域２２の間の隙間が、第１文字候補領域２１の幅Ｗ_１（＝Ｗ_ＭＩＮ）と比して極端に大きいと判断される。 For example, in the example of FIG. 19C, it is determined that the gap between the first character candidate region 21 and the second character candidate region 22 is extremely larger than the width W ₁ (= W _MIN ) of the first character candidate region 21. Is done.

（４）隙間の領域に、文字らしさ評価値の大きい画像が存在する
図１９Ｄを参照して、ある一対の文字候補領域の間の隙間に、文字らしい画像が存在する場合には、当該車種コード領域配置候補が無効であると判定される。まず、文字候補領域の間の隙間のそれぞれについて、文字候補領域と同様の処理によって文字らしさ評価値が算出される。隙間の文字らしさ評価値が、上述の文字らしさ平均値φ_{ｍｅａｎ＿ｃｈａｒ}と対比できる程度に大きい場合には、当該車種コード領域配置候補が無効であると判定される。より具体的には、第ｉ文字候補領域と第ｊ文字候補領域との間の隙間が、下記式を成立させる場合には、当該隙間の領域に文字らしさの大きい画像が存在すると判断される：
φ^ｉｊ _{ｇａｐ＿ｃｈａｒ}＞δ・φ_{ｍｅａｎ＿ｃｈａｒ}，・・・（１４）
ここで、φ^ｉｊ _{ｇａｐ＿ｃｈａｒ}は、第ｉ文字候補領域と第ｊ文字候補領域との間の隙間について算出された文字らしさ評価値であり、δは、所定のパラメータである。 (4) An image with a large character-likeness evaluation value exists in the gap area Referring to FIG. 19D, if an image that seems to be a character exists in a gap between a pair of character candidate areas, the vehicle type code It is determined that the region arrangement candidate is invalid. First, for each of the gaps between character candidate areas, a character likelihood evaluation value is calculated by the same processing as that for the character candidate areas. When the character likelihood evaluation value of the gap is large enough to be compared with the above-described character likelihood average value φ _{mean_char} , it is determined that the vehicle type code area arrangement candidate is invalid. More specifically, when the gap between the i-th character candidate area and the j-th character candidate area satisfies the following expression, it is determined that an image having a high character character exists in the gap area:
φ ^ij _{gap_char} > δ · φ _{mean_char} , (14)
Here, φ ^ij _{gap_char} is a character likelihood evaluation value calculated for the gap between the i-th character candidate region and the j-th character candidate region, and δ is a predetermined parameter.

上記の４項目のチェックの結果、３文字の車種コードについての最適な車種コード領域配置候補と、２文字の車種コードについての最適の車種コード領域配置候補のうちの一方のみが有効であると判断された場合、有効である車種コード領域配置候補が最終的に最適な車種コード領域配置候補であると判断される。 As a result of checking the above four items, it is determined that only one of the optimal vehicle type code area arrangement candidate for the 3-character vehicle type code and the optimal vehicle type code area arrangement candidate for the 2-character vehicle type code is valid. If it is determined, the effective vehicle type code area arrangement candidate is finally determined to be the optimum vehicle type code area arrangement candidate.

両方の車種コード領域配置候補が無効であると判断された場合には、車種コード領域１３の文字認識が失敗したと判断される。 When it is determined that both the vehicle type code area arrangement candidates are invalid, it is determined that the character recognition of the vehicle type code area 13 has failed.

両方の車種コード領域配置候補が有効であると判断された場合、図２０に示されているように、上述の文字らしさ平均値φ_{ｍｅａｎ＿ｃｈａｒ}が大きい車種コード領域配置候補が、最終的に最適な車種コード領域配置候補であると判断される。例えば、図２３の例では、文字らしさ平均値φ_{ｍｅａｎ＿ｃｈａｒ}が大きい、２文字の車種コードの最適な車種コード領域配置候補が最終的に最適な車種コード領域配置候補であると判断される。 If it is determined that both the vehicle type code area arrangement candidates are valid, as shown in FIG. 20, the above-described vehicle type code area arrangement candidate having a large character-like average value φ _{mean_char} is finally the optimum vehicle type. It is determined that it is a code area arrangement candidate. For example, in the example of FIG. 23, it is _determined that the optimal vehicle type code area arrangement candidate of the two-character vehicle type code having a large character- _like average value φ _{mean_char} is finally the optimal vehicle type code area arrangement candidate.

最終的に決定された車種コード領域配置候補の各文字候補領域について認識された数字からなる数字列が、最終的に求めるべき車種コードとして決定され、最終的に最適と決定された車種コード領域配置候補に含まれる文字候補領域の数が、車種コード領域１３の文字数として決定される。 A numeric string consisting of the numbers recognized for each character candidate area of the finally determined vehicle type code area arrangement candidate is determined as the vehicle code to be finally obtained, and finally the vehicle type code area arrangement determined to be optimal The number of character candidate areas included in the candidates is determined as the number of characters in the vehicle type code area 13.

以上に説明された本実施の形態の車種コードの認識のアルゴリズムの一つの利点は、文字認識に必要なデータ処理量を低減できることである。これは、主として、車種コード領域１３の全体ではなく、文字切り出し位置候補１８から決定される第１文字領域候補２１〜第３文字領域候補２３についてのみ文字認識が行われることによって達成されている。このとき、ＤＰマッチングを採用すれば、更に少ない演算量で最適な車種コード領域配置を決定し、各文字の文字認識を行うことができる。 One advantage of the vehicle type code recognition algorithm of the present embodiment described above is that the amount of data processing required for character recognition can be reduced. This is mainly achieved by performing character recognition only on the first character region candidate 21 to the third character region candidate 23 determined from the character cutout position candidate 18, not the entire vehicle type code region 13. At this time, if DP matching is adopted, an optimal vehicle type code area arrangement can be determined with a smaller amount of calculation, and character recognition of each character can be performed.

もう一つの利点は、認識対象が、２文字の車種コードと、３文字の車種コードのいずれであっても、適切に車種コードの文字認識が行えることである。この利点は、車種コード領域１３の全体に渡って抽出された文字切り出し位置候補１８を境界とする第１文字領域候補２１〜第３文字領域候補２３が決定され、それらの組み合わせである車種コード領域配置候補のうちの最適な車種コード領域配置候補に対応する文字列が認識結果として出力されることによって得られている。 Another advantage is that the character recognition of the vehicle type code can be appropriately performed regardless of whether the recognition target is a two-character vehicle type code or a three-character vehicle type code. This advantage is that the first character region candidate 21 to the third character region candidate 23 with the character cutout position candidate 18 extracted over the entire vehicle type code region 13 as a boundary are determined, and the vehicle type code region that is a combination thereof. It is obtained by outputting a character string corresponding to an optimal vehicle type code region arrangement candidate among the arrangement candidates as a recognition result.

このように、本実施形態の車種コードの認識のアルゴリズムでは、車種コードの文字認識に必要なデータ処理量の低減と文字列の多様性への対処とを両立させることができる。 As described above, the vehicle type code recognition algorithm according to the present embodiment can achieve both reduction in the amount of data processing necessary for character recognition of the vehicle type code and coping with the diversity of character strings.

第２第２の実施形態
図２１は、第２の実施形態における小文字列画像１１の文字認識のアルゴリズムを示すフローチャートである。第２の実施形態のアルゴリズムと、第１の実施形態のアルゴリズムとの最も重要な相違は、第２の実施形態では陸支コードの認識と車種コードの認識とが同時的に行われる点である；第１の実施形態のように、陸支コード領域１２の決定及び陸支コードの認識の後に、車種コード領域１３の決定及び車種コードの認識が行われるのではない。 Second Second Embodiment FIG. 21 is a flowchart showing an algorithm for character recognition of the lowercase sequence image 11 in the second embodiment. The most important difference between the algorithm of the second embodiment and the algorithm of the first embodiment is that in the second embodiment, land code recognition and vehicle type code recognition are performed simultaneously. As in the first embodiment, the determination of the vehicle type code region 13 and the recognition of the vehicle type code are not performed after the determination of the land support code region 12 and the recognition of the land support code.

より具体的には、まず、文字切り出し位置候補１８が抽出される（ステップＳ３１）。文字切り出し位置候補１８の抽出の手順は、第１の実施形態で説明されたとおりである。図２２に示されているように、以下の説明では、１１本の文字切り出し位置候補１８_１〜１８_１１が決定されたとして説明が行われる。 More specifically, first, the character cutout position candidate 18 is extracted (step S31). The procedure for extracting the character cutout position candidate 18 is as described in the first embodiment. As illustrated in FIG. 22, the following description will be made assuming that ₁₁ character cutout position candidates 18 ₁ to 18 ₁₁ have been determined.

続いて、小文字列画像１１の全体に対して領域配置候補が設定される（ステップＳ３２）。領域配置候補とは、小文字列画像１１における陸支コード領域１２、第１文字領域１４、第２文字領域１５、及び第３文字領域１６の配置の候補である。ただし、本実施の形態では、陸支コード領域１２は、一つの文字領域として扱われる；即ち、本実施の形態では、小文字列画像１１が３つ又は４つの文字領域を含むとして小文字列画像１１の文字認識が行われる。既述のように、画像処理装置２の内部では、陸支コードが一つの文字として扱われていることに留意されたい。 Subsequently, region arrangement candidates are set for the entire lowercase character string image 11 (step S32). The region arrangement candidate is a candidate for arrangement of the land code area 12, the first character area 14, the second character area 15, and the third character area 16 in the lowercase character string image 11. However, in the present embodiment, the land code region 12 is treated as one character region; that is, in the present embodiment, the lowercase character sequence image 11 is assumed to include three or four character regions. Character recognition is performed. As described above, it should be noted that the land code is handled as one character in the image processing apparatus 2.

車種コードが２文字又は３文字から構成されることから、２文字の車種コードに対応する領域配置候補と、３文字の車種コードに対応する領域配置候補の２種類が決定される。図２２に示されているように、３文字の車種コードに対応する領域配置候補は、第１〜第４文字候補領域２１〜２４の組み合わせで表現される。本実施の形態では、第１文字候補領域２１とは、小文字列画像１１の第１文字目が存在する文字領域の候補、即ち、陸支コード領域１２の候補であり、第２文字候補領域２２は、車種コードの第１文字目が存在する第１文字領域１４の候補であり、第３文字候補領域２３とは、車種コードの第２文字目が存在する第２文字領域１５の候補であり、第４文字候補領域２４とは、車種コードの第３文字目が存在する第３文字領域１６の候補である。文字候補領域２１〜２４の両端は、２つの文字切り出し位置候補１８によって規定される。同様に、２文字の車種コードに対応する車種コード領域配置候補は、第１文字候補領域２１、第２文字候補領域２２及び第３文字候補領域２３の組み合わせで表現される。 Since the vehicle type code is composed of two or three characters, two types of region arrangement candidates corresponding to the two-character vehicle type code and region arrangement candidates corresponding to the three-character vehicle type code are determined. As shown in FIG. 22, the region arrangement candidate corresponding to the three-character vehicle type code is expressed by a combination of the first to fourth character candidate regions 21 to 24. In the present embodiment, the first character candidate area 21 is a character area candidate in which the first character of the lowercase character string image 11 exists, that is, a candidate for the land code area 12, and the second character candidate area 22 Is a candidate for the first character region 14 in which the first character of the vehicle type code exists, and the third character candidate region 23 is a candidate for the second character region 15 in which the second character of the vehicle type code exists. The fourth character candidate area 24 is a candidate for the third character area 16 in which the third character of the vehicle type code exists. Both ends of the character candidate areas 21 to 24 are defined by two character cutout position candidates 18. Similarly, the vehicle type code region arrangement candidate corresponding to the two-character vehicle type code is expressed by a combination of the first character candidate region 21, the second character candidate region 22, and the third character candidate region 23.

領域配置候補の設定は、具体的には、文字切り出し位置候補１８を、文字候補領域２１〜２４の始点（左端）及び終点（右端）に対応付けることによって行われる。一例として、図２２に示されているように、車種コード領域１３の位置”１”〜”１１”に対して１１本の文字切り出し位置候補１８_１〜１８_１１が存在する場合を考える。例えば、ある領域配置候補では、文字切り出し位置候補１８_１、１８_５が、それぞれ、第１文字候補領域２１の始点（左端）及び終点（右端）にそれぞれ対応付けられ、文字切り出し位置候補１８_６、１８_７が第２文字候補領域２２の始点及び終点にそれぞれ対応付けられる。更に、文字切り出し位置候補１８_７、１８_８が、第３文字候補領域２３の始点及び終点にそれぞれ対応付けられ、文字切り出し位置候補１８_８、１８_９が、第４文字候補領域２４の始点及び終点にそれぞれ対応付けられる。他の領域配置候補では、異なる対応付けが行われる。 Specifically, the region arrangement candidate is set by associating the character cutout position candidate 18 with the start point (left end) and end point (right end) of the character candidate regions 21 to 24. As an example, let us consider a case where there are ₁₁ character cutout position candidates 18 ₁ to 18 ₁₁ for the positions “1” to “11” of the vehicle type code area 13 as shown in FIG. For example, in a certain area arrangement candidate, the character cutout position candidates 18 ₁ and 18 ₅ are respectively associated with the start point (left end) and the end point (right end) of the first character candidate area 21, and the character cutout position candidates 18 ₆ , 18 ₇ are respectively associated with the start and end points of the second character candidate region 22. Furthermore, the character cutout position candidates 18 ₇ and 18 ₈ are associated with the start point and the end point of the third character candidate area 23, respectively, and the character cutout position candidates 18 ₈ and 18 ₉ are the start point and the end point of the fourth character candidate area 24. Respectively. Different associations are performed in other region arrangement candidates.

ただし、第１の実施形態と同様に、文字領域の幅として妥当でない幅を有するような文字候補領域を含むような領域配置候補は設定されない。詳細には、陸支コード領域１２に対応する第１文字候補領域２１は、式（１）を満足するように決定され、残りの文字候補領域２２〜２４は、式（２ａ）、（２ｂ）を満足するように決定される。これは、領域配置候補の数を減らし、もって演算量を減らすために有効である。 However, as in the first embodiment, an area arrangement candidate that includes a character candidate area that has an invalid width as the width of the character area is not set. Specifically, the first character candidate area 21 corresponding to the land code area 12 is determined so as to satisfy the expression (1), and the remaining character candidate areas 22 to 24 are determined according to the expressions (2a) and (2b). To be satisfied. This is effective for reducing the number of region arrangement candidates and thereby reducing the amount of calculation.

図２１に戻り、領域配置候補の設定に続いて、各領域配分候補の評価値φ_{ｓｔｒｉｎｇ}が算出される（Ｓ３４）。各領域配分候補の評価値φ_{ｓｔｒｉｎｇ}の算出の手順は、（１）文字認識が行われる文字領域候補の数が異なること、（２）第１文字候補領域２１の文字認識には、車種コード用テンプレート８ではなく陸支コード用テンプレート７が使用されること、（３）文字幅減点値Ｐ^ｉ _ｃｈａｒの算出に使用されるパラメータ：Ｗ_{ｍｉｎ＿ｃｈａｒ}及びＷ_{ｍａｘ＿ｃｈａｒ}が、第１文字候補領域２１と残りの文字候補領域２２〜２４とで異なることを除けば、第１の実施形態で行われる車種コード領域配分候補の評価値φ_{ｓｔｒｉｎｇ}の算出の手順と同様である。 Returning to FIG. 21, following the setting of the region arrangement candidates, the evaluation value φ _string of each region distribution candidate is calculated (S34). The procedure for calculating the evaluation value φ _string of each area allocation candidate is as follows: (1) the number of character area candidates to be recognized is different, and (2) the character recognition of the first character candidate area 21 is for vehicle type code. The land code template 7 is used instead of the template 8, and (3) the parameters used for calculating the character width deduction point P ⁱ _char : W _{min_char} and W _{max_char} are the first character candidate area 21 and the remaining Except for the difference between the character candidate areas 22 to 24, the procedure is the same as the procedure for calculating the evaluation value φ _string of the vehicle type code area distribution candidate performed in the first embodiment.

更に、２文字の車種コードと３文字の車種コードのそれぞれについて最適な領域配分候補が決定され（ステップＳ３４）、それら２つの領域配分候補から車種コードの文字数、即ち、最適な領域配分候補が最終的に決定される（ステップＳ３５）。最終的に最適な領域配分候補として決定された領域配分候補に対応する陸支コード、車種コード、及び文字数が、認識結果として出力される。 Further, the optimum region allocation candidate is determined for each of the 2-character vehicle type code and the 3-character vehicle type code (step S34), and the number of characters of the vehicle type code, that is, the optimal region allocation candidate is finally determined from these two region allocation candidates. (Step S35). The land code, the vehicle type code, and the number of characters corresponding to the area allocation candidate finally determined as the optimum area allocation candidate are output as the recognition result.

第２の実施形態で使用される文字認識のアルゴリズムは、第１の実施形態の文字認識のアルゴリズムと比べて演算量は多いものの、文字認識精度を高めることができる。第１の実施形態では、陸支コードの位置（即ち、陸支コード領域１２）を最適に決定できなかった場合に、車種コードの文字認識の精度が低下してしまう。一方、第２の実施形態では、小文字列画像１１の全体として最適に陸支コード領域１２及び文字領域１４〜１６を決定できるため、文字認識精度を高めることができる。 Although the character recognition algorithm used in the second embodiment has a larger amount of computation than the character recognition algorithm of the first embodiment, it can increase the character recognition accuracy. In the first embodiment, when the position of the land support code (that is, the land support code area 12) cannot be determined optimally, the accuracy of character recognition of the vehicle type code is lowered. On the other hand, in the second embodiment, since the land code area 12 and the character areas 14 to 16 can be determined optimally as the whole lowercase character string image 11, the character recognition accuracy can be improved.

第３第３の実施形態
第３の実施形態では、図２３に示されているように、横方向の文字切り出し位置候補１８に加えて、縦方向の文字切り出し位置候補２５が抽出され、これらの切り出し位置候補１８、２５から、陸支コード候補領域１９と、第１乃至第３文字候補領域２１〜２３が決定される。横方向及び縦方向の切り出し位置候補１８、２５を、明確に区別するために、以下では、横方向の文字切り出し位置候補を横方向文字切り出し位置候補１８と、縦方向の文字切り出し位置候補を縦方向文字切り出し位置候補２５と呼ぶことにする。第３の実施形態における文字認識のフローは、概略的には、図５に示されているフローと同様であるが、縦方向文字切り出し位置候補２５が抽出されることに伴い、各ステップで行われる処理が変更される。 Third Embodiment In the third embodiment, as shown in FIG. 23, in addition to the character cutout position candidate 18 in the horizontal direction, the character cutout position candidate 25 in the vertical direction is extracted. From the cutout position candidates 18 and 25, the land code candidate area 19 and the first to third character candidate areas 21 to 23 are determined. In order to clearly distinguish the horizontal and vertical cutout position candidates 18 and 25, in the following, the horizontal character cutout position candidates will be referred to as the horizontal character cutout position candidate 18 and the vertical character cutout position candidates as vertical. It will be called a directional character cutout position candidate 25. The flow of character recognition in the third embodiment is roughly the same as the flow shown in FIG. 5, but is performed at each step as the vertical character cutout position candidate 25 is extracted. The processing that is performed is changed.

具体的には、文字切り出し位置候補の抽出の処理（ステップＳ１１）では、横方向文字切り出し位置候補１８に加えて縦方向文字切り出し位置候補２５が抽出される。図２４は、本実施形態における文字切り出し位置候補の抽出の処理手順を示す概念図である。まず、第１の実施形態と同様の手順により、横方向文字切り出し位置候補１８が抽出される。続いて、隣接する横方向文字切り出し位置候補１８の組のそれぞれについて、隣接する横方向文字切り出し位置候補１８に挟まれた領域の横方向の輝度射影が算出される。ある領域の横方向の輝度射影とは、当該領域に位置する画素の行のそれぞれについて算出された輝度値の和である；ここで画素行とは、横方向に１列に並べられた画素の行のことである。座標ｘに位置する画素行の輝度射影Ｆ（ｙ）は、
Ｆ（ｙ）＝Σσ（ｘ，ｙ），
で算出される。ここでσ（ｘ，ｙ）は、当該領域の座標（ｘ，ｙ）に位置する画素の輝度（又は階調）であり、Σは、当該領域のｘ座標の全範囲についての和を表している。続いて、横方向の輝度射影Ｆ（ｙ）が極小になる位置が検出される。更に、横方向の輝度射影Ｆ（ｙ）が極小になる位置のうち、例えば、上下１ヶ所ずつ合計２ヶ所、並びに小文字列画像１１の上端と下端が、縦方向文字切り出し位置候補２５として抽出される。 Specifically, in the process of extracting character cutout position candidates (step S11), in addition to the horizontal character cutout position candidate 18, a vertical character cutout position candidate 25 is extracted. FIG. 24 is a conceptual diagram showing a processing procedure for extracting character cutout position candidates in the present embodiment. First, the horizontal character cutout position candidate 18 is extracted by the same procedure as in the first embodiment. Subsequently, for each set of adjacent horizontal character cutout position candidates 18, a horizontal luminance projection of an area sandwiched between adjacent horizontal character cutout position candidates 18 is calculated. The horizontal luminance projection of a region is the sum of the luminance values calculated for each row of pixels located in that region; where a pixel row is the horizontal alignment of pixels arranged in one column It is a line. The luminance projection F (y) of the pixel row located at the coordinate x is
F (y) = Σσ (x, y),
Is calculated by Here, σ (x, y) is the luminance (or gradation) of the pixel located at the coordinate (x, y) of the region, and Σ represents the sum of the entire range of the x coordinate of the region. Yes. Subsequently, a position where the horizontal luminance projection F (y) is minimized is detected. Further, out of the positions where the luminance projection F (y) in the horizontal direction is minimized, for example, a total of two positions one by one at the top and bottom, and the upper end and the lower end of the lowercase character string image 11 are extracted as the vertical character cutout position candidates 25. The

図５に示されているように、文字切り出し位置候補の抽出に続いて、陸支コード候補領域１９の設定が行われる（ステップＳ１２−１）。図２５Ａ乃至２５Ｅは、本実施形態における陸支コード候補領域１９の決定の手順を示す概念図である。図２５Ａを参照して、陸支コード候補領域１９の決定では、まず、２つの横方向文字切り出し位置候補１８が選択され、これにより陸支コード候補領域１９の始点（左端）及び終点（右端）の位置ｘ_Ｌ、ｘ_Ｒが決定される。 As shown in FIG. 5, following the extraction of the character cutout position candidates, the land code candidate area 19 is set (step S12-1). 25A to 25E are conceptual diagrams showing a procedure for determining the land code candidate area 19 in the present embodiment. Referring to FIG. 25A, in the determination of land support code candidate area 19, first, two horizontal character cutout position candidates 18 are selected, whereby the start point (left end) and end point (right end) of land support code candidate area 19 are selected. Positions x _L and x _R are determined.

続いて、左側の横方向文字切り出し位置候補１８（即ち、始点ｘ_Ｌに対応する横方向文字切り出し位置候補１８）に接しており、且つ、小文字列画像１１の下端よりも上端に近い縦方向文字切り出し位置候補２５が選択され、左側の横方向文字切り出し位置候補１８と選択された縦方向文字切り出し位置候補２５との接点が、陸支コード候補領域１９の左上の頂点として選択される；選択された左上の頂点は、図２５Ａでは丸で示されている。 Subsequently, the left lateral character segmentation position candidate 18 (i.e., lateral character segmentation position candidate 18 corresponding to the starting point x _L) is in contact with the, and the vertical direction character closer to the upper than the lower end of the lower case string image 11 A cutout position candidate 25 is selected, and a contact point between the left side character cutout position candidate 18 on the left side and the selected vertical character cutout position candidate 25 is selected as the top left vertex of the land code candidate area 19; The top left vertex is indicated by a circle in FIG. 25A.

更に、右側の横方向文字切り出し位置候補１８（即ち、終点ｘ_Ｒに対応する横方向文字切り出し位置候補１８）に接しており、且つ、小文字列画像１１の上端よりも下端に近い縦方向文字切り出し位置候補２５が選択され、右側の横方向文字切り出し位置候補１８と選択された縦方向文字切り出し位置候補２５との接点が、陸支コード候補領域１９の右下の頂点として選択される；選択された右下の頂点は、図２５Ａでは、逆三角で示されている。 Furthermore, the right lateral character segmentation position candidate 18 (i.e., lateral character segmentation position candidate 18 corresponding to the end point x _R) is in contact with the, and, cut longitudinally character closer to the lower than the upper end of the lower case string image 11 The position candidate 25 is selected, and the contact point between the right lateral character cutout position candidate 18 and the selected vertical character cutout position candidate 25 is selected as the lower right vertex of the land code candidate area 19; The lower right vertex is indicated by an inverted triangle in FIG. 25A.

選択された左上及び右下の頂点で規定される矩形領域が、陸支コード候補領域１９として設定される。許容される２つの横方向文字切り出し位置候補１８の組み合わせ、及び、横方向文字切り出し位置候補１８の各組み合わせについて許容される縦方向文字切り出し位置候補２５の全てについて、陸支コード候補領域１９が設定される。例えば、図２５Ｂ乃至２５Ｅに図示されている４種類の陸支コード候補領域１９も設定され得る。 A rectangular area defined by the selected upper left and lower right vertices is set as the land support code candidate area 19. Land support code candidate areas 19 are set for all of the combinations of two allowable horizontal character cutout position candidates 18 and all of the vertical character cutout position candidates 25 allowed for each combination of the horizontal character cutout position candidates 18. Is done. For example, four types of land code candidate areas 19 shown in FIGS. 25B to 25E can be set.

陸支コード候補領域１９の設定の後、陸支コード候補領域１９のそれぞれについて文字認識が行われる（ステップＳ１２−２）。最適な陸支コード候補領域１９が陸支コード領域１２として決定され、最適な陸支コード候補領域１９について認識された陸支コードが、認識結果として出力される。 After the land support code candidate area 19 is set, character recognition is performed for each land support code candidate area 19 (step S12-2). The optimal land support code candidate area 19 is determined as the land support code area 12, and the land support code recognized for the optimal land support code candidate area 19 is output as a recognition result.

続いて、陸支コード領域１２の位置から、車種コード領域１３が決定され（Ｓ１３−１）、その車種コード領域１３について車種コード配分候補が設定される（Ｓ１３−２）。図２６に示されているように、一の車種コード配分候補は、第１の実施形態と同様に、３つの文字候補領域２１〜２３によって表される。第１〜第３文字候補領域２１〜２３は、車種コード領域１３に属する横方向文字切り出し位置候補１８及び縦方向文字切り出し位置候補２５を、第１〜第３文字候補領域２１〜２３の左端、右端、上端、及び下端に対応付けることによって設定される。文字候補領域２１〜２３のそれぞれの左端及び右端は、一対の横方向文字切り出し位置候補１８によって決定され、上端及び下端は一対の縦方向文字切り出し位置候補２５によって決定される。 Subsequently, a vehicle type code region 13 is determined from the position of the land support code region 12 (S13-1), and a vehicle type code distribution candidate is set for the vehicle type code region 13 (S13-2). As shown in FIG. 26, one vehicle type code distribution candidate is represented by three character candidate areas 21 to 23 as in the first embodiment. The first to third character candidate areas 21 to 23 are the horizontal character cutout position candidate 18 and the vertical character cutout position candidate 25 belonging to the vehicle type code area 13, the left end of the first to third character candidate areas 21 to 23, It is set by associating with the right end, upper end, and lower end. The left and right ends of each of the character candidate areas 21 to 23 are determined by a pair of horizontal character cutout position candidates 18, and the upper and lower ends are determined by a pair of vertical character cutout position candidates 25.

更に、各車種コード配分候補について評価値φ_{ｓｔｒｉｎｇ}が算出され（ステップＳ１３−４）、２文字の車種コード及び３文字の車種コードのそれぞれについて最適な車種コード配分候補が決定される（ステップＳ１３−５）。評価値φ_{ｓｔｒｉｎｇ}の算出方法は、第１の実施形態と同様である。 Further, an evaluation value φ _string is calculated for each vehicle type code distribution candidate (step S13-4), and an optimal vehicle type code distribution candidate is determined for each of the two-character vehicle type code and the three-character vehicle type code (step S13-). 5). The calculation method of the evaluation value φ _string is the same as that of the first embodiment.

更に、第１の実施形態と同様の手順により、２文字の車種コード及び３文字の車種コードの最適な車種コード配分候補のうちの一方が、最適な車種コード配分候補と決定される（ステップＳ１３−３）。最終的に決定された車種コード領域配置候補の各文字候補領域について認識された数字からなる数字列が、最終的に求めるべき車種コードとして決定され、最終的に最適と決定された車種コード領域配置候補に含まれる文字候補領域の数が、車種コード領域１３の文字数として決定される。 Further, according to the same procedure as in the first embodiment, one of the optimal vehicle type code distribution candidates of the 2-character vehicle type code and the 3-character vehicle type code is determined as the optimal vehicle type code distribution candidate (step S13). -3). A numeric string consisting of the numbers recognized for each character candidate area of the finally determined vehicle type code area arrangement candidate is determined as the vehicle code to be finally obtained, and finally the vehicle type code area arrangement determined to be optimal The number of character candidate areas included in the candidates is determined as the number of characters in the vehicle type code area 13.

本実施形態では、横方向文字切り出し位置候補１８に加え、縦方向文字切り出し位置候補２５が第１〜第３文字候補領域２１〜２３の設定に使用されるため、車種コード配分候補の数が増加する。車種コード配分候補の数の増加は、データ処理量を増加させるために好ましくない。 In the present embodiment, in addition to the horizontal character cutout position candidate 18, the vertical character cutout position candidate 25 is used for setting the first to third character candidate areas 21 to 23, so the number of vehicle type code distribution candidates increases. To do. An increase in the number of vehicle type code allocation candidates is not preferable because the data processing amount is increased.

車種コード配分候補の数の増加によるデータ処理量の増加を抑制するためには、第１の実施形態と同様に、ＤＰマッチングが使用されることが好適である。図２６を参照して、本実施の形態のＤＰマッチングでは、車種コード領域配置候補（即ち、横方向文字切り出し位置候補１８及び縦方向文字切り出し位置候補２５と第１〜第３文字候補領域２１〜２３との対応付け）が、横方向文字切り出し位置候補１８に対応するＡ軸と、第１〜第３文字候補領域２１〜２３の始点及び終点に対応するＢ軸と、縦方向文字切り出し位置候補２５に対応するＣ軸からなる３次元座標系上に展開された対応グラフとして表現される。図２６の例では、Ａ軸上の要素ａ_１〜ａ_４は、横方向文字切り出し位置候補１８_１〜１８_４に対応しており、Ｂ軸上の要素ｂ_１〜ｂ_６は、第１文字候補領域２１、第２文字候補領域２２及び第３文字候補領域２３の始点及び終点に対応しており、Ｃ軸上の要素ｃ_１〜ｃ_６は、縦方向文字切り出し位置候補２５_１〜２５_６に対応している。Ａ軸上の要素ａ_ｊ、及びＣ軸上の要素ｃ_ｍの数は、車種コード領域１３に属する縦方向文字切り出し位置候補１８及び縦方向文字切り出し位置候補２５の数に合わせて調節されることに留意されたい。各対応グラフは、原点（ａ_１，ｂ_１，ｃ_{ｓｔａｒｔ}）から座標（ａ_ｅｎｄ，ｂ_４，ｃ_ｅｎｄ）までの経路（２文字の車種コードに対応する場合）、又は、原点（ａ_１，ｂ_１，ｃ_{ｓｔａｒｔ}）から座標（ａ_ｅｎｄ，ｂ_６，ｃ_ｅｎｄ）までの経路である；ここで、ｃ_{ｓｔａｒｔ}は、対応グラフの始点のＣ軸上の要素であり、ａ_ｅｎｄ，ｃ_ｅｎｄは、対応グラフの終点のＡ軸、Ｃ軸上の要素であり、第２文字候補領域２２又は第３文字候補領域２３の終点として選ばれた横方向文字切り出し位置候補１８及び縦方向文字切り出し位置候補２５に対応する要素である。 In order to suppress an increase in the amount of data processing due to an increase in the number of vehicle type code distribution candidates, it is preferable to use DP matching as in the first embodiment. Referring to FIG. 26, in the DP matching according to the present embodiment, vehicle type code area arrangement candidates (that is, horizontal character cutout position candidate 18 and vertical character cutout position candidate 25 and first to third character candidate areas 21 to 21 are included. 23) is the A axis corresponding to the horizontal character cutout position candidate 18, the B axis corresponding to the start and end points of the first to third character candidate areas 21 to 23, and the vertical character cutout position candidate. 25 is expressed as a correspondence graph developed on a three-dimensional coordinate system composed of the C-axis corresponding to 25. In the example of FIG. 26, the elements a _{1 to} a ₄ on the A axis correspond to the horizontal character cutout position candidates 18 ₁ to 18 ₄ , and the elements b _{1 to} b ₆ on the B axis are the first characters. Corresponding to the start and end points of the candidate area 21, the second character candidate area 22, and the third character candidate area 23, the elements c _{1 to} c ₆ on the C axis are the vertical character cut position candidates 25 _{1 to} 25 _6. It corresponds to. The numbers of the elements a _j on the A axis and the elements _cm on the C axis are adjusted according to the numbers of the vertical character cutout position candidates 18 and the vertical character cutout position candidates 25 belonging to the vehicle type code area 13. Please note that. Each correspondence graph includes a route from the origin (a ₁ , b ₁ , c _start ) to the coordinates (a _end , b ₄ , c _end ) (when corresponding to a two-character vehicle type code), or the origin (a ₁ , b ₁ , c _start ) to coordinates (a _end , b ₆ , c _end ); where c _start is an element on the C axis at the start point of the corresponding graph, and a _end , c _end are The horizontal character cutout position candidate 18 and the vertical character cutout position candidate which are elements on the A axis and C axis of the end point of the corresponding graph and selected as the end point of the second character candidate area 22 or the third character candidate area 23 This is an element corresponding to 25.

対応グラフが３次元座標上で表されることに対応して、式（８）を用いた評価値φｓ_{ｔｒｉｎｇ}の算出方法も、３次元に拡張される。具体的には、
対応グラフの始点（ａ_１，ｂ_１，ｃ_{ｓｔａｒｔ}）から座標（ａ_ｊ，ｂ_ｉ，ｃ_ｎ）までの対応グラフに対応する車種コード領域配置候補の評価値φ_{ｓｔｒｉｎｇ}の最大値をΦ（ａ_ｊ，ｂ_ｉ，ｃ_ｎ）と表すものとすれば、
Φ（ａ_ｊ，ｂ_ｉ，ｃ_ｎ）＝ＭＡＸ（ａ_ｋ，ｂ_ｐ，ｃ_ｍ）［Φ（ａ_ｋ，ｂ_ｐ，ｃ_ｍ）
＋φ_（ｉ）（ａ_ｋ，ｂ_ｐ，ｃ_ｍ，ａ_ｊ，ｂ_ｉ，ｃ_ｎ）］，・・・（８）’
が成立する。ここで、ｂ_ｐは、ｂ_ｉの一つ前の要素であり、
ｂ_ｐ＝ｂ_ｉ−１，・・・（９）
である。また、ＭＡＸ（ａ_ｋ，ｂ_ｐ，ｃ_ｍ）［Ｘ（ａ_ｋ，ｂ_ｐ，ｃ_ｍ）］は、ａ_ｋ、ｂ_ｐ、ｃ_ｍを変化させたときのＸ（ａ_ｋ，ｂ_ｐ，ｃ_ｍ）の最大値を表す。更に、φ_（ｉ）（ａ_ｋ，ｂ_ｐ，ｃ_ｍ，ａ_ｊ，ｂ_ｉ，ｃ_ｎ）は、
（α）ｉが奇数のとき、要素（ａ_ｋ，ｃ_ｍ）に対応する文字切り出し位置候補と、（ａ_ｊ，ｃ_ｎ）に対応する文字切り出し位置候補の間に形成される隙間の隙間らしさ評価値φ_ｇａｐであり、
（β）ｉが偶数のとき、要素（ａ_ｋ，ｃ_ｍ）に対応する文字切り出し位置候補と、（ａ_ｊ，ｃ_ｎ）に対応する文字切り出し位置候補の間に形成される文字候補領域の文字らしさ評価値φ_ｃｈａｒである。
また、ｂ_ｉ＝ｂ_１のときには、特別に、Φ＝０と決定される。 Corresponding to the correspondence graph being represented on three-dimensional coordinates, the method of calculating the evaluation value φs _string using the equation (8) is also expanded to three dimensions. In particular,
The maximum value of the evaluation value φ _string of the vehicle type code area arrangement candidate corresponding to the correspondence graph from the _start point (a ₁ , b ₁ , c _start ) of the correspondence graph to the coordinates (a _j , b _i , c _n ) is represented by Φ (a _j , b _i , c _n )
Φ (a _j , b _i , c _n ) = MAX (a _k , b _p , c _m ) [Φ (a _k , b _p , c _m )
+ Φ _(i) (a _k , b _p , c _m , a _j , b _i , c _n )]], (8) ′
Is established. Here, _{b p} is the previous element of the _{b i,}
b _p = b _i−1 , (9)
It is. _{_{_{Furthermore, MAX (a k, b p}}} , c m) [X (a k, b p, c m)] _{_{_{is, a k, b p, c}}} m X (a k when changing _{the, b} p, c _m ) represents the maximum value. _{_{_{Further, φ (i) (a k}}} , b p, c m, a j, b i, c n) is
(Α) When i is an odd number, the gap likelihood formed between the character cutout position candidate corresponding to the element (a _k , c _m ) and the character cutout position candidate corresponding to (a _j , c _n ) Evaluation value φ _gap
(Β) When i is an even number, a character candidate region formed between a character cutout position candidate corresponding to the element (a _k , c _m ) and a character cutout position candidate corresponding to (a _j , c _n ) Characteristic evaluation value φ _char .
Further, when b _i = b ₁ , it is determined that Φ = 0.

このΦ（ａ_ｊ，ｂ_ｉ，ｃ_ｎ）は、ｂ_ｉの漸化式となっているため、座標（ａ_１，ｂ_１，ｃ_{ｓｔａｒｔ}）（即ち、対応グラフの始点）からスタートして座標（ａ_６，ｂ_６，ｃ_ｅｎｄ）まで、順次にΦ（ａ_ｊ，ｂ_ｉ，ｃ_ｎ）を決定することができる。加えて、Φ（ａ_ｊ，ｂ_ｉ，ｃ_ｎ）を決定するとき、Φ（ａ_ｋ，ｂ_ｐ，ｃ_ｍ）＋φ_（ｉ）（ａ_ｋ，ｂ_ｐ，ｃ_ｍ，ａ_ｊ，ｂ_ｉ，ｃ_ｎ）を最大にする座標(ａ_ｋ，ｂ_ｐ，ｃ_ｍ）が、座標（ａ_ｊ，ｂ_ｉ）と対応付けられて保存される。具体的には、座標（ａ_ｊ，ｂ_ｉ，ｃ_ｎ）から座標(ａ_ｋ，ｂ_ｐ，ｃ_ｍ）へのポインタが記憶装置に保存される。 Since Φ (a _j , b _i , c _n ) is a recurrence formula of b _i , the coordinates _{start from} the coordinates (a ₁ , b ₁ , c _start ) (that is, the start point of the corresponding graph). Up to (a ₆ , b ₆ , c _end ), Φ (a _j , b _i , c _n ) can be sequentially determined. In addition, when determining Φ (a _j , b _i , c _n ), Φ (a _k , b _p , c _m ) + φ _(i) (a _k , b _p , _cm , a _j , b _i , Coordinates (a _k , b _p , c _m ) that maximize c _n ) are stored in association with the coordinates (a _j , b _i ). Specifically, pointers from the coordinates (a _j , b _i , c _n ) to the coordinates (a _k , b _p , c _m ) are stored in the storage device.

このような方法によって、車種コード領域配置候補に対応するｊ，ｉ，ｎの組み合わせのそれぞれについてΦ（ａ_ｊ，ｂ_ｉ，ｃ_ｎ）が算出される。算出されたΦ（ａ_ｊ，ｂ_ｉ，ｃ_ｎ）を用いて、下記の方法により、最適な対応グラフ、即ち、最適な車種コード領域配置候補が、２文字の車種コード、３文字の車種コードのそれぞれについて決定される。２文字の車種コードに対応する対応グラフの決定では、まず、Φ（ａ_ｋ，ｂ_４，ｃ_ｅｎｄ）の最大値が求められ、Φ（ａ_ｋ，ｂ_４，ｃ_ｅｎｄ）を最大にする対応グラフが、最適な対応グラフとして選択される。Φ（ａ_ｋ，ｂ_４，ｃ_ｅｎｄ）を最大にする対応グラフは、上述のポインタから決定可能である。同様に、３文字の車種コードに対応する最適な対応グラフの決定では、まず、Φ（ａ_ｋ，ｂ_６，ｃ_ｅｎｄ）の最大値が求められ、Φ（ａ_ｋ，ｂ_６，ｃ_ｅｎｄ）を最大にする対応グラフが、最適な対応グラフとして選択される。Φ（ａ_ｋ，ｂ_６，ｃ_ｅｎｄ）を最大にする対応グラフは、上述のポインタから決定可能である。 By such a method, Φ (a _j , b _i , c _n ) is calculated for each combination of j, i, n corresponding to the vehicle type code area arrangement candidate. Using the calculated Φ (a _j , b _i , c _n ), an optimal correspondence graph, that is, an optimal vehicle type code area arrangement candidate is a 2-character vehicle type code, a 3-character vehicle type code, by the following method. For each of them. In the determination of the corresponding graph corresponding to two-letter models code, first, [Phi maximum value of _{_{_{(a k, b 4, c}}} end) is obtained, corresponding to [Phi the _{_{_{(a k, b 4, c}}} end) to the maximum The graph is selected as the optimal correspondence graph. A correspondence graph that maximizes Φ (a _k , b ₄ , c _end ) can be determined from the pointers described above. Similarly, in the determination of the optimal response graph corresponding to the three letter models code, first, [Phi maximum value of _{_{_{(a k, b 6, c}}} end) is _{_{_{obtained, Φ (a k, b 6}}} , c end) The corresponding graph that maximizes is selected as the optimal corresponding graph. The correspondence graph that maximizes Φ (a _k , b ₆ , c _end ) can be determined from the pointers described above.

第３の実施形態による文字認識の処理方法では、横方向文字切り出し位置に加え、縦方向文字切り出し位置が設定されるため、文字が少し傾いている場合や、文字の上下に隙間がある場合にも、正しく文字候補領域を設定することが可能である。 In the character recognition processing method according to the third embodiment, since the vertical character cutout position is set in addition to the horizontal character cutout position, the character is slightly tilted or there is a gap above and below the character. Also, it is possible to set the character candidate area correctly.

以上には、本発明の実施形態が詳細に説明されているが、本発明は、実施形態に記載のものに限定して解釈されてはならない。本発明は、ナンバープレート認識に特に好適であるが、適切なテンプレート画像を使用することにより他の文字列の認識にも使用できることは自明的であろう。 Although the embodiments of the present invention have been described in detail above, the present invention should not be construed as being limited to those described in the embodiments. Although the present invention is particularly suitable for license plate recognition, it will be obvious that it can also be used for recognition of other character strings by using appropriate template images.

図１は、本発明の第１実施形態に係るナンバープレート認識装置の構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a license plate recognition apparatus according to the first embodiment of the present invention. 図２は、本実施形態に係るナンバープレート認識装置の動作を示すフローチャートである。FIG. 2 is a flowchart showing the operation of the license plate recognition apparatus according to the present embodiment. 図３は、ナンバープレート画像と小文字列画像とを示す概念図である。FIG. 3 is a conceptual diagram showing a license plate image and a lowercase letter row image. 図４は、小文字列画像の構成を示す概念図である。FIG. 4 is a conceptual diagram showing the configuration of a lowercase string image. 図５は、小文字列画像について行われる文字認識の処理手順を示すフローチャートである。FIG. 5 is a flowchart showing a character recognition processing procedure performed for a lowercase string image. 図６Ａは、文字切り出し位置候補を決定する手順を示す概念図である。FIG. 6A is a conceptual diagram illustrating a procedure for determining a character cutout position candidate. 図６Ｂは、文字切り出し位置候補を決定する手順を示す概念図である。FIG. 6B is a conceptual diagram illustrating a procedure for determining a character extraction position candidate. 図６Ｃは、文字切り出し位置候補を決定する好適な手順を示す概念図である。FIG. 6C is a conceptual diagram illustrating a preferred procedure for determining character cutout position candidates. 図７は、陸支コード候補領域を決定する好適な手順を示す概念図である。FIG. 7 is a conceptual diagram showing a preferred procedure for determining a land support code candidate area. 図８は、陸支コード候補領域のうち最適なものを陸支コード領域として選択する手順を示す概念図である。FIG. 8 is a conceptual diagram showing a procedure for selecting an optimum land code code area as a land code area. 図９は、陸支コード領域から車種コード領域を決定する手順を示す概念図である。FIG. 9 is a conceptual diagram showing a procedure for determining the vehicle type code area from the land support code area. 図１０は、車種コード領域に決定される文字候補領域及び車種コード領域配置候補を説明する概念図である。FIG. 10 is a conceptual diagram illustrating character candidate areas and vehicle code area arrangement candidates determined as vehicle type code areas. 図１１は、車種コード領域に決定される文字候補領域及び車種コード領域配置候補を説明する他の概念図である。FIG. 11 is another conceptual diagram illustrating character candidate areas and vehicle code area arrangement candidates determined as vehicle type code areas. 図１２は、本実施形態において考えられる文字候補領域を示す概念図である。FIG. 12 is a conceptual diagram showing character candidate areas that can be considered in the present embodiment. 図１３は、本実施形態における文字候補領域の決定方法を説明する概念図である。FIG. 13 is a conceptual diagram illustrating a method for determining a character candidate area in the present embodiment. 図１４は、各文字候補領域について行われる文字認識の手順を説明する概念図である。FIG. 14 is a conceptual diagram illustrating the procedure of character recognition performed for each character candidate region. 図１５は、文字幅減点値Ｐ^ｉ _ｃｈａｒを算出する手順を説明する概念図である。FIG. 15 is a conceptual diagram illustrating a procedure for calculating the character width deduction point P ⁱ _char . 図１６は、本実施形態において生じ得る誤認識の例を説明する概念図である。FIG. 16 is a conceptual diagram illustrating an example of misrecognition that may occur in the present embodiment. 図１７は、文字候補領域の誤認識を防ぐための好適な処理手順を示すフローチャートである。FIG. 17 is a flowchart showing a preferred processing procedure for preventing erroneous recognition of a character candidate area. 図１８は、隙間らしさ評価値を算出する手順を説明する概念図である。FIG. 18 is a conceptual diagram illustrating a procedure for calculating a clearance likelihood evaluation value. 図１９Ａは、２文字、３文字の車種コードに対応する車種コード領域配置候補の有効性を判断するための手順を説明する概念図である。FIG. 19A is a conceptual diagram illustrating a procedure for determining the validity of a vehicle type code area arrangement candidate corresponding to a 2-character, 3-character vehicle type code. 図１９Ｂは、２文字、３文字の車種コードに対応する車種コード領域配置候補の有効性を判断するための手順を説明する概念図である。FIG. 19B is a conceptual diagram illustrating a procedure for determining the validity of a vehicle type code area arrangement candidate corresponding to a 2-character, 3-character vehicle type code. 図１９Ｃは、２文字、３文字の車種コードに対応する車種コード領域配置候補の有効性を判断するための手順を説明する概念図である。FIG. 19C is a conceptual diagram illustrating a procedure for determining the validity of a vehicle type code area arrangement candidate corresponding to a 2-character, 3-character vehicle type code. 図１９Ｄは、２文字、３文字の車種コードに対応する車種コード領域配置候補の有効性を判断するための手順を説明する概念図である。FIG. 19D is a conceptual diagram illustrating a procedure for determining the validity of a vehicle type code area arrangement candidate corresponding to a 2-character, 3-character vehicle type code. 図２０は、２文字、３文字の車種コードに対応する車種コード領域配置候補から最適な車種コード領域配置候補を選択する手順を説明する概念図である。FIG. 20 is a conceptual diagram illustrating a procedure for selecting an optimal vehicle type code area arrangement candidate from vehicle type code area arrangement candidates corresponding to 2-character and 3-character vehicle type codes. 図２１は、第２の実施形態に係る小文字列画像の文字認識処理を説明するフローチャートである。FIG. 21 is a flowchart for explaining character recognition processing of a lowercase string image according to the second embodiment. 図２２は、第２の実施形態における文字候補領域の決定方法を説明する概念図である。FIG. 22 is a conceptual diagram illustrating a method for determining a character candidate area in the second embodiment. 図２３は、第３の実施形態において規定される横方向文字切り出し位置候補及び縦方向文字切り出し位置を説明する概念図である。FIG. 23 is a conceptual diagram illustrating horizontal character cutout position candidates and vertical character cutout positions defined in the third embodiment. 図２４は、横方向文字切り出し位置候補及び縦方向文字切り出し位置を決定する処理手順を説明する概念図である。FIG. 24 is a conceptual diagram illustrating a processing procedure for determining a horizontal character cutout position candidate and a vertical character cutout position. 図２５Ａは、第３の実施形態における陸支コード候補領域の決定手順を説明する概念図である。FIG. 25A is a conceptual diagram illustrating a procedure for determining a land support code candidate area in the third embodiment. 図２５Ｂは、第３の実施形態における陸支コード候補領域の決定手順を説明する概念図である。FIG. 25B is a conceptual diagram illustrating a procedure for determining a land support code candidate region in the third embodiment. 図２５Ｃは、第３の実施形態における陸支コード候補領域の決定手順を説明する概念図である。FIG. 25C is a conceptual diagram illustrating a procedure for determining a land support code candidate region in the third embodiment. 図２５Ｄは、第３の実施形態における陸支コード候補領域の決定手順を説明する概念図である。FIG. 25D is a conceptual diagram illustrating a procedure for determining a land support code candidate region in the third embodiment. 図２５Ｅは、第３の実施形態における陸支コード候補領域の決定手順を説明する概念図である。FIG. 25E is a conceptual diagram illustrating a procedure for determining land code candidate regions in the third embodiment. 図２６は、第３の実施形態における文字候補領域の決定方法を説明する概念図である。FIG. 26 is a conceptual diagram illustrating a method for determining a character candidate area in the third embodiment.

Explanation of symbols

１：撮像装置
２：画像処理装置
３：記憶装置
４：演算装置
５：インターフェース
６：ナンバープレート認識プログラム
７：陸支コード用テンプレート
８：車種コード用テンプレート
１０：ナンバープレート認識装置
１１：小文字列画像
１２：陸支コード領域
１３：車種コード領域
１４：第１文字領域（文字領域）
１５：第２文字領域
１６：第３文字領域
１７：文字切り出し位置
１８、１８_１、１８_２、１８_３、１８_５：文字切り出し位置候補（横方向文字切り出し位置候補）
１９、１９_１、１９_４：陸支コード候補領域
２１：第１文字候補領域（文字候補領域）
２２：第２文字候補領域
２３：第３文字候補領域
２４：第４文字候補領域
２５：縦方向文字切り出し位置候補 1: Image pickup device 2: Image processing device 3: Storage device 4: Computing device 5: Interface 6: License plate recognition program 7: Land code template 8: Vehicle type code template 10: License plate recognition device 11: Lower case image 12: Land support code area 13: Vehicle type code area 14: First character area (character area)
15: 2nd character area 16: 3rd character area 17: Character cut-out positions 18, 18 ₁ , 18 ₂ , 18 ₃ , 18 ₅ : Character cut-out position candidates (horizontal character cut-out position candidates)
19, 19 ₁ , 19 ₄ : Land support code candidate area 21: First character candidate area (character candidate area)
22: Second character candidate area 23: Third character candidate area 24: Fourth character candidate area 25: Vertical character cutout position candidate

Claims

(A) determining a character cutout position candidate that is a candidate for the boundary of the character region for the entire character string image including a plurality of character regions in which one character is copied;
(B) A character candidate region that is a candidate for the plurality of character regions is determined by associating the character cut-out position candidates with boundaries of the plurality of character regions, and region arrangement candidates expressed as combinations of the character candidate regions are determined. The steps to decide;
(C) performing image recognition for each of the character candidate regions, and selecting an optimal region placement candidate from the region placement candidates based on the result of the image recognition;
(D) A step of outputting a character string made up of characters recognized in the image recognition in the step (C) for the character candidate region corresponding to the optimal region arrangement candidate as a recognition result character string. Method.

The character string recognition method according to claim 1,
The character cutout recognition method is a character string recognition method in which the character cutout position candidate is determined from the luminance of each pixel of the character string image.

The character string recognition method according to claim 1,
The character cutout position candidate is determined from a differential image of the character string image.

The character string recognition method according to claim 1,
The step (C) includes:
(C1) calculating a width of each of the character candidate areas defined by the character cutout position candidates;
(C2) calculating a difference between the image of the character candidate area and a template image prepared in advance for each of the character candidate areas defined by the character cutout position candidates;
(C3) calculating a character-likeness evaluation value based on the width of the character candidate region and the degree of difference,
The optimum region arrangement candidate is a character string recognition method that is determined based on the character likelihood evaluation value.

The character string recognition method according to claim 1,
The step (C) further includes:
(C4) calculating a width of a gap between two adjacent character candidate areas of the character candidate area defined by the character cutout position candidates;
(C5) comprising a step of calculating a clearance likelihood evaluation value based on the width of the gap between the two character candidate regions,
The optimum region arrangement candidate is a character string recognition method that is determined based on the gap likelihood evaluation value.

(E) determining a character extraction position candidate that is a candidate for a boundary of a character region in which one character of the lowercase character string is copied for each of the lowercase character string images in which the lowercase character string of the license plate is copied; ,
(F) associating the character cutout position candidates with the boundaries of the plurality of character regions, determining character candidate regions that are candidates for the plurality of character regions, and determining region arrangement candidates expressed as combinations of the character candidate regions The steps to decide;
(G) performing image recognition for each of the character candidate regions, and selecting an optimal region placement candidate from the region placement candidates based on the result of the image recognition;
(H) a step of outputting, as at least part of a recognition result, a character string made up of characters recognized in the image recognition in the step (G) for the character candidate region corresponding to the optimum region arrangement candidate. Plate recognition method.

The license plate recognition method according to claim 6,
Furthermore,
(I) determining a land code area where a land code of the lowercase character string is copied from the character cutout position candidates;
(J) determining a vehicle type code area in an area other than the land support code area in the lower case sequence image,
The region arrangement candidate is a vehicle type code region arrangement candidate defined for the vehicle type code region.

The character string recognition method according to claim 7,
The step (G) includes:
(G1) calculating a width of each of the character candidate areas defined by the character cutout position candidates;
(G2) calculating a difference between the image of the character candidate area and a template image prepared in advance for each of the character candidate areas defined by the character cutout position candidates;
(G3) calculating a character likelihood evaluation value based on the width of the character candidate region and the degree of difference,
The optimum region arrangement candidate is a character string recognition method that is determined based on the character likelihood evaluation value.

The character string recognition method according to claim 8,
The step (G) further includes:
(G4) calculating a width of a gap between two adjacent character candidate areas of the character candidate area defined by the character cutout position candidates;
(G5) comprising a step of calculating a clearance likelihood evaluation value based on the width of the gap between the two character candidate regions,
The optimum region arrangement candidate is a character string recognition method that is determined based on the gap likelihood evaluation value.

The license plate recognition method according to claim 7,
The step (I) includes:
(I1) a step of defining an area having a predetermined range of width as a land support code candidate area with the character cutout position candidate as a boundary, in contact with an end of the lowercase character string image, and
(I2) performing image recognition for each of the land support code candidate areas, and determining a land support code area in which the land support code is copied from the land support code candidate areas based on the result of the image recognition And
A license plate recognition method in which a land support code recognized for the land support code candidate region determined as the land support code region is output as a part of the recognition result.

The license plate recognition method according to claim 6,
In the image recognition in the step (G), a land code code template image is used for the character candidate region corresponding to the leftmost character in the lowercase character string, and the character candidates corresponding to other characters are used. A number template image is used for the area,
The recognition result includes a recognition result of a land code of the license plate and a recognition result of a vehicle type code of the license plate.

The license plate recognition method according to claim 6,
When the recognition result of “1” is obtained by the image recognition performed for a certain character candidate region in the character candidate region in the step (G), the surroundings of the certain character candidate region are included. When image recognition is performed again for the enlarged area and a recognition result other than “1” is obtained for the enlarged area, the area arrangement candidate including the certain character candidate area is selected as the optimum area arrangement candidate. No. License plate recognition method.

The license plate recognition method according to claim 6,
The character candidate area is a rectangle,
The character cutout position candidates are
A horizontal character cutout position candidate for determining the left end and the right end of the character candidate area;
A license plate recognition method comprising: a vertical character cutout position candidate for determining an upper end and a lower end of the character candidate region.

The license plate recognition method according to claim 6,
The step (E)
(E1) dividing the lower-case string image into an upper part and a lower part;
(E2) A license plate recognition method comprising: determining the character cutout position candidates from the image of the lower part.

An imaging device for acquiring a character string image including a plurality of character regions each of which one character is copied;
An image processing device that performs character recognition of the character string image,
The image processing apparatus includes the following steps:
(A) a step of determining a character cutout position candidate that is a candidate for the boundary of the character region for the entire character string image;
(B) A character candidate region that is a candidate for the plurality of character regions is determined by associating the character cut-out position candidates with boundaries of the plurality of character regions, and region arrangement candidates expressed as combinations of the character candidate regions are determined. Steps to determine,
(C) performing image recognition for each of the character candidate areas, and selecting an optimum area arrangement candidate from the area arrangement candidates based on the result of the image recognition; and (D) selecting the optimum area arrangement candidate A character string recognition device configured to execute a step of outputting, as a recognition result character string, a character string composed of characters recognized in the image recognition in step (C) for the corresponding character candidate region.

An imaging device for obtaining a captured image of a license plate;
An image processing device for performing character recognition of the license plate from the captured image,
The image processing apparatus includes the following steps:
(K) a step of cutting out from the captured image a lower case sequence image in which a lower case sequence of the license plate is copied;
(E) 'determining a character cutout position candidate that is a candidate for a boundary of a character region in which one character of the lowercase character string is copied for each of the lowercase character string images;
(F) associating the character cutout position candidates with the boundaries of the plurality of character regions, determining character candidate regions that are candidates for the plurality of character regions, and determining region arrangement candidates expressed as combinations of the character candidate regions Steps to determine,
(G) performing image recognition for each of the character candidate areas, and selecting an optimum area arrangement candidate from the area arrangement candidates based on the result of the image recognition; and (H) selecting the optimum area arrangement candidate. A license plate recognition device configured to execute a step of outputting, as at least a part of a recognition result, a character string made up of characters recognized in the image recognition of step (F) for the corresponding character candidate region.