JP5660294B2

JP5660294B2 - Character image identification device, character image identification method and program

Info

Publication number: JP5660294B2
Application number: JP2010182102A
Authority: JP
Inventors: 久保田　聡; 聡久保田; 木村　俊一; 俊一木村; 越　裕; 裕越
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2010-08-17
Filing date: 2010-08-17
Publication date: 2015-01-28
Anticipated expiration: 2030-08-17
Also published as: JP2012043077A

Description

本発明は、文字画像識別装置、文字画像識別方法およびプログラムに関する。 The present invention relates to a character image identification device, a character image identification method, and a program.

非特許文献１には、Haar-Like特徴と呼ばれる濃淡パターンを用いて、形状、サイズ、対象画像上でパターンを適用する位置が異なるとともに、各パターンの出力値を入力として所定の識別値を出力する複数の識別器（一般に弱識別器と呼ばれる）を構成し、これらの複数の弱識別器から顔の輝度の濃淡を表すのに適した弱識別器を選択して、それら複数の弱識別器をカスケード接続した識別器（一般に強識別器と呼ばれる）で顔識別を行う方法が開示されている。 Non-Patent Document 1 uses a shading pattern called a Haar-Like feature, and the shape, size, and position where the pattern is applied on the target image are different, and the output value of each pattern is input and a predetermined identification value is output. A plurality of weak classifiers (generally called weak classifiers) are selected, and a weak classifier suitable for representing the intensity of facial brightness is selected from the plurality of weak classifiers, and the plurality of weak classifiers are selected. Discloses a method of performing face recognition using a classifier (in general, referred to as a strong classifier) connected in cascade.

また顔識別以外のオブジェクト識別技術に関しては、特許文献１および特許文献２に開示されている方法がある。 Regarding object identification techniques other than face identification, there are methods disclosed in Patent Document 1 and Patent Document 2.

特許文献１には、パターンの認識判定において、判定対象から抽出したｎ次元特徴量に対してそれぞれ上限／下限閾値により規定される区間を設け、認識対象から抽出した特徴量が全ての次元の区間内に入るか否かに基づいて、認識対象のパターンが特定のカテゴリーに含まれるか否かを判定するパターン学習方法が開示されている。 In Patent Document 1, in pattern recognition determination, sections defined by upper / lower thresholds are provided for n-dimensional feature quantities extracted from a determination target, respectively, and feature quantities extracted from the recognition target are sections of all dimensions. A pattern learning method for determining whether or not a pattern to be recognized is included in a specific category based on whether or not it falls within the pattern is disclosed.

特許文献２には、パターン認識システムにおいて、入力パターンに対して一意な閾値を設定するのではなく、入力パターン毎に標準パターンとの類似度分布に基づいたしきい値を設定して、標準パターン群から入力パターンと整合するパターンを絞り込むようにしたパタン認識システムが開示されている。 In Patent Document 2, in the pattern recognition system, instead of setting a unique threshold value for an input pattern, a threshold value based on a similarity distribution with the standard pattern is set for each input pattern. A pattern recognition system is disclosed in which a pattern matching an input pattern is narrowed down from a group.

特開平０６−１２４３４９号公報Japanese Patent Laid-Open No. 06-124349 特開平１１−１２０２９１号公報Japanese Patent Laid-Open No. 11-120291

Paul Viola, Michael Jones, 「Rapid Object Detection using a Boosted Cascade of Simple Features.」IEEE CVPR, 2001.Paul Viola, Michael Jones, `` Rapid Object Detection using a Boosted Cascade of Simple Features. '' IEEE CVPR, 2001.

本発明の目的は、切り出された文字候補画像が文字画像か非文字画像であるかを短時間で識別することが可能な文字画像識別装置、文字画像識別方法およびプログラムを提供することである。 An object of the present invention is to provide a character image identification device, a character image identification method, and a program capable of identifying in a short time whether a cut character candidate image is a character image or a non-character image.

［文字画像識別装置］
請求項１に係る本発明は、処理対象の画像データから文字画像である可能性があると判定されて切出された文字候補画像から、それぞれ異なる複数の画像特性に基づく特徴量を順次抽出する複数段の抽出手段と、
前記複数段の抽出手段とそれぞれ対応して設けられ、対応する抽出手段により抽出された特徴量に基づいて当該文字候補画像が文字画像である可能性または非文字画像である可能性を示す識別値を算出する算出手段と、前記算出手段により算出された識別値をそれぞれ画像特性毎に予め設定されたしきい値と比較する比較手段とを備え、前記比較手段における比較結果に基づいて当該文字候補画像が文字画像であるか非文字画像であるかまたは、文字画像であるか非文字画像であるか不明な不明画像であるかを識別する複数段の識別手段と、
前記複数段の識別手段のうちの少なくとも１つの識別手段により、当該文字候補画像が文字画像または非文字画像であると識別された場合、当該文字候補画像を文字画像または非文字画像であると判定する判定手段とを有し、
前記複数段の抽出手段は、それぞれ、前段の識別手段において当該文字候補画像が不明画像であると識別された場合に、設定された画像特性に基づく特徴量を抽出する文字画像識別装置である。 [Character image identification device]
According to the first aspect of the present invention, feature amounts based on a plurality of different image characteristics are sequentially extracted from a character candidate image that is determined to be a character image from image data to be processed and cut out. A multi-stage extraction means;
An identification value provided corresponding to each of the plurality of stages of extraction means, and indicating the possibility that the character candidate image is a character image or a non-character image based on the feature amount extracted by the corresponding extraction means And a comparison unit that compares the identification value calculated by the calculation unit with a threshold value set in advance for each image characteristic, and based on the comparison result in the comparison unit, the character candidate A plurality of stages of identifying means for identifying whether the image is a character image or a non-character image, or a character image, a non-character image, or an unknown unknown image;
When the character candidate image is identified as a character image or a non-character image by at least one of the plurality of stages of identification means, the character candidate image is determined to be a character image or a non-character image. Determination means for
Each of the plurality of stages of extraction means is a character image identification apparatus that extracts a feature amount based on the set image characteristics when the previous-stage identification means identifies the character candidate image as an unknown image .

請求項２に係る本発明は、前記複数段の識別手段が、それぞれ、前段の識別手段における識別値と、現段の算出手段により算出された識別値を加算する加算手段をさらに備えた請求項１記載の文字画像識別装置である。 According to a second aspect of the present invention, the plurality of stages of identification means further comprise addition means for adding the identification value in the preceding stage identification means and the identification value calculated by the current stage calculation means, respectively. The character image identification device according to claim 1 .

請求項３に係る本発明は、前記算出手段が、当該特徴量に対する複数の文字画像および非文字画像の頻度分布に基づいて予め算出された識別関数を用いて、特徴量から識別値を算出する備えた請求項１または２記載の文字画像識別装置である。 According to a third aspect of the present invention, the calculation means calculates an identification value from the feature amount using an identification function calculated in advance based on the frequency distribution of a plurality of character images and non-character images for the feature amount. It is the character image identification device of Claim 1 or 2 provided.

請求項４に係る本発明は、前記複数段の識別手段が、文字候補画像が文字画像か非文字画像であるかを識別する識別能力が高い識別手段ほど前段に配置されている請求項１から３のいずれか１項記載の文字画像識別装置である。 According to a fourth aspect of the present invention, from the first aspect, the plurality of stages of identification means are arranged in a preceding stage as the identification means having a higher identification ability for identifying whether the character candidate image is a character image or a non-character image. 3 a character image identification apparatus according to any one of.

［文字画像識別方法］
請求項５に係る本発明は、処理対象の画像データから文字画像である可能性があると判定されて切出された文字候補画像から、それぞれ異なる複数の画像特性に基づく特徴量を抽出するステップと、
抽出された特徴量に基づいて当該文字候補画像が文字画像である可能性または非文字画像である可能性を示す識別値を算出するステップと、
算出された識別値をそれぞれ画像特性毎に予め設定されたしきい値と比較するステップと、
しきい値との比較結果に基づいて当該文字候補画像が文字画像であるか非文字画像であるかまたは、文字画像であるか非文字画像であるか不明な不明画像であるかを識別するステップと、
当該文字候補画像が文字画像または非文字画像であると識別された場合、当該文字候補画像を文字画像または非文字画像であると判定し、当該文字候補画像が不明画像であると識別された場合には、特徴量を抽出するステップ、識別値を算出するステップおよびしきい値と比較するステップ、文字候補画像を識別するステップを再度繰り返すステップとを有する文字画像識別方法である。 [Character image identification method]
The present invention according to claim 5 is a step of extracting feature amounts based on a plurality of different image characteristics from character candidate images that have been determined to be character images from image data to be processed and cut out. When,
Calculating an identification value indicating the possibility that the character candidate image is a character image or a non-character image based on the extracted feature amount; and
Comparing the calculated identification value with a preset threshold value for each image characteristic;
Identifying whether the character candidate image is a character image, a non-character image, or a character image, a non-character image, or an unknown image based on a comparison result with a threshold value When,
When the character candidate image is identified as a character image or a non-character image, it is determined that the character candidate image is a character image or a non-character image, and the character candidate image is identified as an unknown image Includes a step of extracting a feature amount, a step of calculating an identification value, a step of comparing with a threshold value, and a step of repeating the step of identifying a character candidate image again.

［プログラム］
請求項６に係る本発明は、処理対象の画像データから文字画像である可能性があると判定されて切出された文字候補画像から、それぞれ異なる複数の画像特性に基づく特徴量を抽出するステップと、
抽出された特徴量に基づいて当該文字候補画像が文字画像である可能性または非文字画像である可能性を示す識別値を算出するステップと、
算出された識別値をそれぞれ画像特性毎に予め設定されたしきい値と比較するステップと、
しきい値との比較結果に基づいて当該文字候補画像が文字画像であるか非文字画像であるかまたは、文字画像であるか非文字画像であるか不明な不明画像であるかを識別するステップと、
当該文字候補画像が文字画像または非文字画像であると識別された場合、当該文字候補画像を文字画像または非文字画像であると判定し、当該文字候補画像が不明画像であると識別された場合には、特徴量を抽出するステップ、識別値を算出するステップおよびしきい値と比較するステップ、文字候補画像を識別するステップを再度繰り返すステップとをコンピュータに実行させるためのプログラムである。 [program]
The present invention according to claim 6 is a step of extracting feature amounts based on a plurality of different image characteristics from character candidate images that are determined to be character images from image data to be processed and cut out. When,
Calculating an identification value indicating the possibility that the character candidate image is a character image or a non-character image based on the extracted feature amount; and
Comparing the calculated identification value with a preset threshold value for each image characteristic;
Identifying whether the character candidate image is a character image, a non-character image, or a character image, a non-character image, or an unknown image based on a comparison result with a threshold value When,
When the character candidate image is identified as a character image or a non-character image, it is determined that the character candidate image is a character image or a non-character image, and the character candidate image is identified as an unknown image Is a program for causing a computer to execute a step of extracting a feature amount, a step of calculating an identification value, a step of comparing with a threshold value, and a step of repeating a step of identifying a character candidate image again.

請求項１に係る本発明によれば、切り出された文字候補画像が文字画像か非文字画像であるかを、本構成を有しない場合と比較して、短時間で識別することが可能な文字画像識別装置を提供することができる。 According to the first aspect of the present invention, characters that can be identified in a short time compared with the case where the extracted character candidate image is a character image or a non-character image is not provided. An image identification device can be provided.

請求項２に係る本発明によれば、請求項１に係る本発明により得られる効果に加えて、前段までの識別結果を加味した識別処理を行うことが可能な文字画像識別装置を提供することができる。 According to the second aspect of the present invention, in addition to the effect obtained by the first aspect of the present invention, there is provided a character image identification device capable of performing identification processing taking into account the identification results up to the previous stage. Can do.

請求項３に係る本発明によれば、請求項１または２に係る本発明により得られる効果に加えて、識別関数による計算により容易に識別値を算出することが可能な文字画像識別装置を提供することができる。 According to the third aspect of the present invention, in addition to the effect obtained by the first or second aspect of the present invention, there is provided a character image identification device capable of easily calculating an identification value by calculation using an identification function. can do.

請求項４に係る本発明によれば、複数段の識別手段のうち早い段階で文字候補画像が文字画像か非文字画像かを判定することが可能な文字画像識別装置を提供することができる。 According to the fourth aspect of the present invention, it is possible to provide a character image identification device capable of determining whether a character candidate image is a character image or a non-character image at an early stage among a plurality of identification means.

請求項５に係る本発明によれば、切り出された文字候補画像が文字画像か非文字画像であるかを、本構成を有しない場合と比較して、短時間で識別することが可能な文字画像識別方法を提供することができる。 According to the fifth aspect of the present invention, it is possible to identify whether the extracted character candidate image is a character image or a non-character image in a short time compared to the case where this configuration is not provided. An image identification method can be provided.

請求項６に係る本発明によれば、切り出された文字候補画像が文字画像か非文字画像であるかを、本構成を有しない場合と比較して、短時間で識別することが可能なプログラムを提供することができる。

According to the sixth aspect of the present invention, a program capable of identifying in a short time whether the extracted character candidate image is a character image or a non-character image as compared with the case where the present configuration is not provided. Can be provided.

本発明の第１の実施形態における文字画像識別装置１０のハードウェア構成を示すブロック図である。It is a block diagram which shows the hardware constitutions of the character image identification device 10 in the 1st Embodiment of this invention. 本発明の第１の実施形態における文字画像識別装置１０の機能構成を示すブロック図である。It is a block diagram which shows the function structure of the character image identification device 10 in the 1st Embodiment of this invention. 画像サイズ規格化部２２における画像サイズ規格化の一例を説明した図である。It is a figure explaining an example of image size normalization in the image size normalization part. 特徴量抽出器３１₁〜３１_Nの具体的な一例を説明する図である。It is a diagram illustrating a specific example of the feature extractor 31 ₁ to 31 _N. 黒画素密度に対する文字画像ならびに非文字画像の頻度分布の具体例を表した図である。It is a figure showing the specific example of the frequency distribution of the character image with respect to black pixel density, and a non-character image. 黒画素周囲長に対する文字画像ならびに非文字画像の頻度分布の具体例を表した図である。It is a figure showing the specific example of the frequency distribution of the character image with respect to a black pixel perimeter, and a non-character image. 平均ラン長特徴量に対する文字画像ならびに非文字画像の頻度分布の具体例を表した図である。It is a figure showing the specific example of the frequency distribution of the character image with respect to an average run length feature-value, and a non-character image. 最大ラン長に対する文字画像ならびに非文字画像の頻度分布の具体例を表した図である。It is a figure showing the specific example of the frequency distribution of the character image with respect to the maximum run length, and a non-character image. 複雑度に対する文字画像ならびに非文字画像の頻度分布の具体例を表した図である。It is a figure showing the specific example of the frequency distribution of the character image with respect to complexity, and a non-character image. 連結成分数に対する文字画像ならびに非文字画像の頻度分布の具体例を表した図である。It is a figure showing the specific example of the frequency distribution of the character image with respect to the number of connected components, and a non-character image. 水平白黒変化点数に対する文字画像ならびに非文字画像の頻度分布の具体例を表した図である。It is a figure showing the specific example of the frequency distribution of the character image with respect to a horizontal black-and-white change score, and a non-character image. 垂直白黒変化点数に対する文字画像ならびに非文字画像の頻度分布の具体例を表した図である。It is a figure showing the specific example of the frequency distribution of the character image with respect to a vertical black-and-white change score, and a non-character image. 文字識別器３２₁〜３２_Nの構成を説明するための図である。It is a diagram for explaining the structure of a character discriminator 32 ₁ to 32 _N. 文字識別器３２₁〜３２_Nにおける識別値算出部４１₁〜４１_Nの動作を説明するための図である。It is a diagram for explaining the operation of the character discriminator 321 _to 323 identified value of _N calculating unit 41 ₁ to 41 _N. 識別関数の設定方法を説明するための図である。It is a figure for demonstrating the setting method of an identification function. 比較器４２₁〜４２_Nの動作を説明するための図である。It is a diagram for explaining the operation of the comparator 42 ₁ through 42 _N. 最終段において１つのしきい値ｔｈＮのみを用いる場合の比較器４２_Nの動作を説明するための図である。It is a figure for demonstrating operation | movement of the comparator 42 _N when only one threshold value thN is used in the last stage. しきい値の設定方法を説明するための図である。It is a figure for demonstrating the setting method of a threshold value. 別の頻度分布においてしきい値の設定方法を説明するための図である。It is a figure for demonstrating the setting method of a threshold value in another frequency distribution. さらに別の頻度分布においてしきい値の設定方法を説明するための図である。It is a figure for demonstrating the setting method of a threshold value in another frequency distribution. 本発明の第１の実施形態の文字画像識別装置１０における識別動作を説明するためのフローチャートである。It is a flowchart for demonstrating the identification operation | movement in the character image identification device 10 of the 1st Embodiment of this invention. 本発明の第２の実施形態における文字画像識別装置１０ａの機能構成を示すブロック図である。It is a block diagram which shows the function structure of the character image identification device 10a in the 2nd Embodiment of this invention. 本発明の第２の実施形態の文字画像識別装置１０ａにおける文字識別器１３２₁〜１３２_Nの構成を説明するための図である。It is a diagram for explaining the structure of a character classifier 132 ₁ to 132 _N in the second embodiment of the character image identification apparatus 10a of the present invention.

次に、本発明の実施の形態について図面を参照して詳細に説明する。
[第１の実施形態]
図１は本発明の第１の実施形態の文字画像識別装置１０のハードウェア構成を示す図である。 Next, embodiments of the present invention will be described in detail with reference to the drawings.
[First embodiment]
FIG. 1 is a diagram showing a hardware configuration of a character image identification device 10 according to the first embodiment of the present invention.

本実施形態の文字画像識別装置１０は、図１に示されるように、ＣＰＵ１１、メモリ１２、ハードディスクドライブ（ＨＤＤ）等の記憶装置１３、ネットワークを介して外部の装置等との間でデータの送信及び受信を行う通信インタフェース（ＩＦ）１４、タッチパネル又は液晶ディスプレイ並びにキーボードを含むユーザインタフェース（ＵＩ）装置１５を有する。これらの構成要素は、制御バス１６を介して互いに接続されている。 As shown in FIG. 1, the character image identification device 10 of the present embodiment transmits data to and from a CPU 11, a memory 12, a storage device 13 such as a hard disk drive (HDD), and an external device via a network. And a communication interface (IF) 14 that performs reception, and a user interface (UI) device 15 including a touch panel or a liquid crystal display and a keyboard. These components are connected to each other via a control bus 16.

ＣＰＵ１１は、メモリ１２または記憶装置１３に格納された文字画像識別プログラムに基づいて所定の処理を実行して、文字画像識別装置１０の動作を制御する。なお、本実施形態では、ＣＰＵ１１は、メモリ１２または記憶装置１３内に格納された文字画像識別装置プログラムを読み出して実行するものとして説明したが、当該プログラムをＣＤ−ＲＯＭ等の記憶媒体に格納してＣＰＵ１１に提供することも可能である。 The CPU 11 controls the operation of the character image identification device 10 by executing a predetermined process based on the character image identification program stored in the memory 12 or the storage device 13. In the present embodiment, the CPU 11 has been described as reading and executing the character image identification device program stored in the memory 12 or the storage device 13, but the program is stored in a storage medium such as a CD-ROM. It is also possible to provide it to the CPU 11.

図２は、上記の文字画像識別プログラムが実行されることにより実現される文字画像識別装置１０の機能構成を示すブロック図である。 FIG. 2 is a block diagram showing a functional configuration of the character image identification device 10 realized by executing the character image identification program.

本実施形態の文字画像識別装置１０は、図２に示されるように、文字候補画像切出部２１と、画像サイズ規格化部２２と、Ｎ個の特徴量抽出器３１₁〜３１_Nと、文字識別器３２₁〜３２_Nと、判定器２４とを備えている。 As shown in FIG. 2, the character image identification device 10 of the present embodiment includes a character candidate image cutout unit 21, an image size normalization unit 22, N feature quantity extractors 31 _{1 to} 31 _N , Character discriminators 32 _{1 to} 32 _N and a determiner 24 are provided.

以下、「特徴量抽出器３１₁〜３１_N」など、複数存在しうる構成部分のいずれかを特定せずに示す場合には、単に「特徴量抽出器３１」のように略記する場合がある。 Hereinafter, in the case where any one of a plurality of components that can exist such as “feature amount extractors 31 _{1 to} 31 _N ” is indicated without being specified, it may be simply abbreviated as “feature amount extractor 31”. .

本実施形態の文字画像識別装置１０は、２値化された画像データ、文字認識を目的とした文字切出し処理を前提として、文字画像らしき画像であるとして切出された画像（以下、文字候補画像と呼ぶ。）が文字画像か非文字画像かを識別する。 The character image identification device 10 of the present embodiment is based on binarized image data and an image cut out as an image that appears to be a character image on the premise of character extraction processing for character recognition (hereinafter referred to as a character candidate image). Is called a character image or a non-character image.

文字候補画像切出部２１は、処理対象の画像データから文字画像である可能性があると判定された領域の画像を文字候補画像として切出す。 The character candidate image cutout unit 21 cuts out an image of an area determined to be a character image from the image data to be processed as a character candidate image.

文字切出し技術に関しては、様々な方法が提案されており、本実施形態ではそのうちどれか１つの方法を用いればよい。例えば以下のような文献に開示されている技術が存在する。
特開平５−１１４０４７号公報
特開平４−１００１８９号公報
特開平４−９２９９２号公報
特開平４−６８４８１号公報
特開平９−５４８１４号公報
特開平９−１８５６８１号公報における段落[００２１]に記載の文字の境界候補抽出方式
特開平５−１２８３０８号公報における段落[０００５]に記載の文字切り出し位置決定方式
なお、当然ながら文字候補画像を切り出し可能な技術であれば、上記の文献に開示された方法以外の方法を用いてもよい。 Various methods have been proposed for the character cutting technique, and any one of them may be used in the present embodiment. For example, there are techniques disclosed in the following documents.
JP-A-5-114047 JP-A-4-100189 JP-A-4-92992 JP-A-4-68481 JP-A-9-54814 JP-A-9-185681 Paragraph [0021] Character boundary candidate extraction method Character cutting position determination method described in paragraph [0005] of Japanese Patent Laid-Open No. Hei 5-128308 It should be noted that, as a matter of course, any technique that can cut out a character candidate image can be used. Other methods may be used.

画像サイズ規格化部２２は、文字候補画像切出部２１により切出された文字候補画像の大きさを、特徴量抽出器３１₁〜３１_Nならびに文字識別期３２₁〜３２_Nの処理に必要な所定のサイズに規格化する。 The image size normalization unit 22 uses the size of the character candidate image cut out by the character candidate image cutout unit 21 for processing in the feature quantity extractors 31 _{1 to} 31 _N and the character identification periods 32 _{1 to} 32 _N. Standardize to a certain size.

図３は画像サイズ規格化部２２における画像サイズ規格化の一例を説明した図である。文書画像を２値化して文字切出し技術で切出される文字候補画像は、文書画像中に含まれる多様な文字サイズ、不正な文字切出し位置、さらには文書画像中に含まれる写真や線画領域の不正切出しなどの影響でさまざまな大きさとなる。例えば図３（Ａ）は異なる文字サイズ、図３（Ｂ）は不正な切出し位置、図３（Ｃ）は写真や線画領域の不正切出しの例である。画像サイズ規格化部２２はこのようなさまざまな大きさで入力される文字候補画像を、図３（Ａ）、図３（Ｂ）、図３（Ｃ）で示すように一様な大きさに規格化する。規格化するサイズは例えば６４×６４画素サイズなどである。もちろん正方形サイズに規格化するだけに限らず、同サイズに規格化するのであれば長方形になるように規格化してもよい。 FIG. 3 is a diagram for explaining an example of image size normalization in the image size normalization unit 22. Character candidate images that are extracted by binarizing the document image and extracted by the character extraction technology are various character sizes included in the document image, illegal character extraction positions, and illegal images and line drawing areas included in the document image. It becomes various sizes under the influence of cutting out. For example, FIG. 3A shows a different character size, FIG. 3B shows an example of an illegal cutout position, and FIG. 3C shows an example of an illegal cutout of a photograph or a line drawing area. The image size normalization unit 22 converts the character candidate images input in such various sizes into uniform sizes as shown in FIGS. 3 (A), 3 (B), and 3 (C). Standardize. The standardized size is, for example, a 64 × 64 pixel size. Of course, not only normalization to the square size, but also normalization to a rectangle may be used if normalization to the same size is performed.

特徴量抽出器３１₁〜３１_Nは、複数段に構成され、画像サイズ規格化部２２により規格化された文字候補画像から、それぞれ異なる複数の画像特性に基づく特徴量を順次抽出する。 The feature amount extractors 31 _{1 to} 31 _N are configured in a plurality of stages, and sequentially extract feature amounts based on a plurality of different image characteristics from the character candidate images normalized by the image size normalization unit 22.

なお、文字識別器３２₁〜３２_Nも複数段に構成されており、特徴量抽出器３１₁〜３１_Nと文字識別器３２₁〜３２_Nは、それぞれ１対１で対応して設けられている。そのため、１つの特徴量抽出器３１と１つの文字識別器３２の組により１つの段が構成されているものとして説明する。例えば、特徴量抽出器３１₁や文字識別器３２₁は、特徴量抽出器３１₂および文字識別器３２₂の前段に構成されており、特徴量抽出器３１₃や文字識別器３２₃は、特徴量抽出器３１₂および文字識別器３２₂の後段に構成されている。 The character identifiers 32 _{1 to} 32 _N are also configured in a plurality of stages, and the feature quantity extractors 31 _{1 to} 31 _N and the character identifiers 32 _{1 to} 32 _N are provided in a one-to-one correspondence. Yes. Therefore, description will be made assuming that one stage is constituted by a set of one feature quantity extractor 31 and one character classifier 32. For example, the feature quantity extractor 31 ₁ and the character classifier 32 ₁ are arranged in front of the feature quantity extractor 31 ₂ and the character classifier 32 ₂ , and the feature quantity extractor 31 ₃ and the character classifier 32 ₃ are The feature amount extractor 31 ₂ and the character identifier 32 ₂ are arranged at the subsequent stage.

図４は特徴量抽出器３１₁〜３１_Nの具体的な一例を説明する図である。図４に示すように、例えば文字識別器３２₁が識別判断に黒画素密度を用いる場合、その特徴量を抽出する特徴量抽出器３１₁は黒画素密度抽出器３１₁として機能し、文字候補画像から特徴量として下記の式（１）で示す黒画素密度Ｄｂを算出して文字識別器３２₁に出力する。 FIG. 4 is a diagram illustrating a specific example of the feature quantity extractors 31 _{1 to} 31 _N. As shown in FIG. 4, for example, when the character discriminator 32 ₁ uses the black pixel density for discrimination judgment, the feature amount extractor 31 ₁ that extracts the feature amount functions as the black pixel density extractor 31 ₁ , and character candidates A black pixel density Db represented by the following equation (1) is calculated from the image as a feature quantity and output to the character classifier 32 ₁ .

Ｄｂ＝文字候補画像中の黒画素数／文字候補画像面積・・・・・（１） Db = number of black pixels in the character candidate image / character candidate image area (1)

また、文字識別器３２₂が識別判断に黒画素周囲長を用いる場合、同様にその特徴量を抽出する特徴量抽出器３１₂は黒画素周囲長抽出器３１₂として機能し、文字候補画像から特徴量として黒画素周囲長Ｌａを算出して文字識別器３２₂に出力する。 Also, if the character discriminator 32 ₂ uses a black pixel perimeter identification decision, the feature quantity extractor 31 ₂ for extracting the characteristic amount similarly functions as a black pixel perimeter extractor 31 _2, from the character candidate image The black pixel perimeter La is calculated as the feature amount and output to the character discriminator 32 ₂ .

同様に、文字識別器３２₃〜３２₆が、それぞれ識別判断に０度方向黒画素平均ラン長、４５度方向黒画素平均ラン長、９０度方向黒画素平均ラン長、１３５度方向黒画素平均ラン長を用いる場合、特徴量抽出器３１₃〜３１₆は、それぞれ、０度方向黒画素平均ラン長抽出器３１₃、４５度方向黒画素平均ラン長抽出器３１₄、９０度方向黒画素平均ラン長抽出器３１₅、１３５度方向黒画素平均ラン長抽出器３１₆として機能する。 Similarly, the character classifiers 32 _{3 to} 32 _{6 use the} 0-degree direction black pixel average run length, the 45-degree direction black pixel average run length, the 90-degree direction black pixel average run length, and the 135-degree direction black pixel average, respectively, for identification determination. When the run length is used, the feature quantity extractors 31 _{3 to} 31 ₆ respectively include a 0 degree direction black pixel average run length extractor 31 ₃ , a 45 degree direction black pixel average run length extractor 31 ₄ , and a 90 degree direction black pixel. The average run length extractor 31 ₅ functions as a 135-degree direction black pixel average run length extractor 31 ₆ .

そして、この場合には、０度方向黒画素平均ラン長抽出器３１₃は特徴量として０度方向黒画素平均ラン長Ｌ₀を文字識別器３２₃に出力し、４５度方向黒画素平均ラン長抽出器３１₄は特徴量として４５度方向黒画素平均ラン長Ｌ₄₅を文字識別器３２₄に出力する。また、９０度方向黒画素平均ラン長抽出器３１₅は特徴量として９０度方向黒画素平均ラン長Ｌ₉₀を文字識別器３２₅に出力し、１３５度方向黒画素平均ラン長抽出器３１₆は特徴量として１３５度方向黒画素平均ラン長Ｌ₁₃₅を文字識別器３２₆に出力する。 In this case, the 0 degree direction black pixel average run length extractor 31 ₃ outputs the 0 degree direction black pixel average run length L ₀ to the character identifier 32 ₃ as the feature quantity, and the 45 degree direction black pixel average run length L ₃ is output. The length extractor 31 ₄ outputs the 45-degree direction black pixel average run length L ₄₅ as a feature quantity to the character identifier 32 ₄ . Further, 90-degree direction black pixel average run length extractor 31 ₅ outputs the 90-degree direction black pixel average run length L ₉₀ as the feature amount on the character discriminator 32 _5, 135-degree direction black pixel average run length extractor 31 ₆ outputs the 135-degree direction black pixel average run length L ₁₃₅ as features in the character discriminator 32 _6.

以上説明したように、特徴量抽出器３１は文字識別器３２が識別判断に用いる画像特性に基づく特徴量の数だけ構成されている。 As described above, the feature quantity extractors 31 are configured by the number of feature quantities based on the image characteristics used by the character classifier 32 for discrimination determination.

したがって、特徴量抽出器３１は図４に示したような特徴量を抽出するものだけに限定されるものではなく、入力される２値化された文字候補画像からさまざまな特徴量を抽出するよう構成することが可能である。例えば以下に示すような特徴量抽出器を適用することが可能である。 Therefore, the feature quantity extractor 31 is not limited to the one that extracts the feature quantity as shown in FIG. 4, but various feature quantities are extracted from the input binarized character candidate images. It is possible to configure. For example, a feature quantity extractor as shown below can be applied.

（１）最大ラン長抽出器（０度、４５度、９０度、１３５度の各方向）
（２）複雑度抽出器
（３）連結成分数抽出器５３
（４）０度方向（水平方向）黒白変化点数抽出器
（５）９０度方向（垂直方向）黒白変化点数抽出器 (1) Maximum run length extractor (0 degree, 45 degree, 90 degree, 135 degree directions)
(2) Complexity extractor (3) Connected component number extractor 53
(4) 0 degree direction (horizontal direction) black and white change point extractor (5) 90 degree direction (vertical direction) black and white change point extractor

ここで、（１）最大ラン長抽出器は、各方向独立の特徴量抽出器であり、各方向の最大ラン長を算出する。 Here, (1) the maximum run length extractor is a feature quantity extractor independent of each direction, and calculates the maximum run length in each direction.

（２）複雑度抽出器は、複雑度を算出する。複雑度Ｃは、例えば以下の式（２）で算出される。
Ｃ＝（周囲長）² ／黒画素数・・・・・（２） (2) The complexity extractor calculates the complexity. The complexity C is calculated by the following equation (2), for example.
C = (perimeter) ² / number of black pixels (2)

（３）連結成分抽出器は、文字候補画像中の独立した連結成分数を算出する。例えば、文字「あ」なら１、文字「い」なら２である。 (3) The connected component extractor calculates the number of independent connected components in the character candidate image. For example, 1 for the character “A” and 2 for the character “I”.

（４）０度方向黒白変化点抽出器は、例えば文字候補画像中における０度方向（水平方向あるいは主走査方向）の黒画素から白画素へ変わる回数の総和を算出する。 (4) The 0-degree direction black-and-white change point extractor calculates, for example, the total number of times of changing from black pixels to white pixels in the 0-degree direction (horizontal direction or main scanning direction) in the character candidate image.

（５）９０度方向黒白変化点抽出器は、例えば文字候補画像中における９０度方向（垂直方向あるいは副走査方向）の黒画素から白画素へ変わる回数の総和を算出する。 (5) The 90-degree direction black-and-white change point extractor calculates, for example, the total number of times of changing from black pixels to white pixels in the 90-degree direction (vertical direction or sub-scanning direction) in the character candidate image.

以上説明したように、特徴量抽出器３１は２値化された文字候補画像から複数の画像特性の異なる特徴量を抽出するが、抽出される各特徴量の値の範囲（値域）は異なるので、例えば抽出される実数値の特徴量ｘに対して、（０≦ｘ≦１２７）の範囲や（０≦ｘ≦１）の範囲で値の正規化を行い、値域を揃えて各特徴量抽出器３１により出力される特徴量を同様に扱えるようにする。 As described above, the feature quantity extractor 31 extracts a plurality of feature quantities having different image characteristics from the binarized character candidate images, but the range (value range) of the extracted feature quantities is different. For example, the extracted feature value x of the real value is normalized in the range of (0 ≦ x ≦ 127) or in the range of (0 ≦ x ≦ 1), and each feature amount is extracted by aligning the range. The feature amount output by the device 31 can be handled in the same manner.

ここまで特徴量抽出器３１で抽出される特徴量の具体例を説明してきたが、ここで特徴量抽出器３１においてどのような特徴量を文字識別のための特徴量として利用するかについて説明する。 The specific example of the feature amount extracted by the feature amount extractor 31 has been described so far. Here, what feature amount is used as the feature amount for character identification in the feature amount extractor 31 will be described. .

図５、図６、図７は、上記で説明した黒画素密度、黒画素周囲長、平均ラン長に対する文字画像ならびに非文字画像の頻度分布の具体例を表した図である。ここで図５、図６、図７において、横軸は各特徴量を（０≦ｘ≦１２７）の範囲で正規化した値を表し、縦軸は文字画像ならびに非文字画像の個数に相当する値であり、文字画像総数をＮ、非文字画像総数をＭとした場合に、文字画像から抽出した特徴量なら１／２Ｎ、非文字画像から抽出した特徴量なら１／２Ｍを個数の計数単位として表している。 5, 6, and 7 are diagrams illustrating specific examples of the frequency distribution of the character image and the non-character image with respect to the black pixel density, the black pixel perimeter, and the average run length described above. Here, in FIGS. 5, 6, and 7, the horizontal axis represents a value obtained by normalizing each feature amount in a range of (0 ≦ x ≦ 127), and the vertical axis corresponds to the number of character images and non-character images. If the total number of character images is N and the total number of non-character images is M, 1 / 2N if the feature amount is extracted from the character image, and 1 / 2M if the feature amount is extracted from the non-character image. It represents as.

図５、図６、図７を参照すると分かるように、黒画素密度、黒画素周囲長、平均ラン長のいずれの特徴量においても、文字画像と非文字画像とでは異なる頻度分布を表していることが分かる。つまり文字画像分布と非文字画像分布は異なる分布の山を有しており、これら特徴量に対して所定の閾値を用いることで文字、非文字を識別可能であることが分かる。 As can be seen from FIGS. 5, 6, and 7, the character image and the non-character image represent different frequency distributions in any of the feature amounts of black pixel density, black pixel perimeter, and average run length. I understand that. That is, it can be seen that the character image distribution and the non-character image distribution have different distribution peaks, and it is possible to identify characters and non-characters by using a predetermined threshold for these feature values.

さらに同様に、これまでに説明した別の特徴量である最大ラン長、複雑度、連結成分数、水平白黒変化点数、垂直白黒変化点数についても、図８〜図１２に文字ならびに非文字画像の頻度分布の具体例を示すように、文字画像と非文字画像とでは異なる頻度分布を表しているが分かる。 Similarly, the maximum run length, complexity, number of connected components, horizontal black-and-white change points, and vertical black-and-white change points, which are other feature quantities described so far, are also shown in FIGS. As shown in a specific example of the frequency distribution, it can be seen that the character image and the non-character image represent different frequency distributions.

つまり、文字候補画像が文字画像か非文字画像であるかを判定する際には、文字画像ならびに非文字画像の頻度分布が異なる分布を表すような特徴量を選択するようにして、それらの特徴量を抽出する複数の特徴量抽出器３１を構成するようにするとよい。 In other words, when determining whether a character candidate image is a character image or a non-character image, the feature amount is selected so that the frequency distribution of the character image and the non-character image represents different distributions. A plurality of feature quantity extractors 31 for extracting quantities may be configured.

また、各文字識別器３２₁〜３２_Nは、上述したように、それぞれ異なる画像特性に基づく特徴量に基づいて、文字候補画像が文字画像であるか非文字画像であるかの文字画像識別を行う。 Further, as described above, each of the character identifiers 32 _{1 to} 32 _N performs character image identification as to whether the character candidate image is a character image or a non-character image based on feature amounts based on different image characteristics. Do.

そして、判定器２４は、Ｎ個の文字識別器３２₁〜３２_Nのうちの少なくとも１つの文字識別器により、文字候補画像が文字画像または非文字画像であると識別された場合、その文字候補画像を文字画像または非文字画像であると最終的に判定する。 When the character candidate image is identified as a character image or a non-character image by at least one character identifier among the _N character identifiers 32 _{1 to} 32 _N , the determiner 24 selects the character candidate. The image is finally determined to be a character image or a non-character image.

次に、この文字識別器３２₁〜３２_Nの構成を図１３を参照して説明する。
図１３に示されるように、文字識別器３２₁は、識別値算出部４１₁と、比較器４２₁とから構成されている。また、文字識別器３２₂は、識別値算出部４１₂と、比較器４２₂と、加算器４３₂とから構成されている。同様に、文字識別器３２₃〜３２_Nは、それぞれ、識別値算出部４１₃〜４１_Nと、比較器４２₃〜４２_Nと、加算器４３₃と〜４３_Nとから構成されている。 Next, the configuration of the character discriminators 32 _{1 to} 32 _N will be described with reference to FIG.
As shown in FIG. 13, the character classifier 32 ₁ includes an identification value calculation unit 41 ₁ and a comparator 42 ₁ . The character identifier 32 ₂ includes an identification value calculator 41 ₂ , a comparator 42 _2, and an adder 43 ₂ . Similarly, the character discriminators 32 _{3 to} 32 _N are configured by discrimination value calculation units 41 _{3 to} 41 _N , comparators 42 _{3 to} 42 _N , and adders 43 ₃ to 43 _N , respectively.

識別値算出部４１₁〜４１_Nは、対応する特徴量抽出器３１₁〜３１_Nから抽出された特徴量に基づいて、識別値ｙ１〜ｙＮをそれぞれ算出して出力する。この識別値ｙ１〜ｙＮは、当該文字候補画像が文字画像である可能性または非文字画像である可能性を示す値であり、詳細については後述する。 The identification value calculation units 41 _{1 to} 41 _N calculate and output the identification values y1 to yN based on the feature amounts extracted from the corresponding feature amount extractors 31 _{1 to} 31 _N , respectively. The identification values y1 to yN are values indicating the possibility that the character candidate image is a character image or a non-character image, and details thereof will be described later.

加算器４３₃と〜４３_Nは、それぞれ、前段の文字識別器３２₁〜３２_N-1の比較器４２₁〜４２_N-1に入力される識別値Ｙ１〜Ｙ（Ｎ−１）と、同じ文字識別器３２₂〜３２_N内に設けられている現段の識別値算出部４１₂〜４１_Nにより算出された識別値ｙ２〜ｙＮとをそれぞれ加算して、識別値Ｙ２〜ＹＮとして比較器４２₂〜４２_Nに出力する。 Adder 43 ₃ and ~ 43 _N, respectively, and a previous character discriminator 32 _₁ ~32 _{_N-1} comparators 42 _₁ ~42 _{_N-1} identification value is input to Y1~Y (N-1), The identification values y2 to yN calculated by the identification value calculation units 41 _{2 to} 41 _N of the current stage provided in the same character identifiers 32 _{2 to} 32 _N are added and compared as identification values Y2 to YN. Outputs to devices 42 _{2 to} 42 _N.

なお、先頭の文字識別器３２₁においては、前段の文字識別器は存在しないため、識別値算出部４１₁において算出された識別値ｙ１がそのまま識別値Ｙ１として比較器４２₁に出力される。 In the _first character discriminator 32 ₁ , the preceding character discriminator does not exist, and therefore, the discrimination value y ₁ calculated by the discrimination value calculation unit 41 ₁ is directly output to the comparator 42 ₁ as the discrimination value Y ₁ .

つまり、ある比較器４２に入力される識別値は、それよりも前段における全ての識別値を累積した値となり、例えば、比較器４２₂に入力される識別値Ｙ２は、ｙ１とｙ２を加算した値であり、比較器４２₃に入力される識別値Ｙ３は、ｙ１、ｙ２、ｙ３を加算した値となっている。 That is, the identification value inputted to the comparator 42 with, it all the identification values becomes a value obtained by accumulating in front than, for example, the identification value Y2 which is input to the comparator 42 ₂ was added to y1 and y2 a value, identification value Y3 input to the comparator 42 ₃ is a value obtained by adding the y1, y2, y3.

そして、比較器４２₁〜４２_Nは、それぞれ、入力された識別値Ｙ１〜ＹＮを、それぞれ画像特性毎に予め設定されたしきい値と比較する。その結果、文字識別器３２₁〜３２_Nは、この比較器４２₁〜４２_Nにおける比較結果により、文字候補画像を３通りに識別する。つまり、文字識別器３２₁〜３２_Nは、文字候補画像が、文字画像か、非文字画像か、不明画像かを識別する。ここで文字画像もしくは非文字画像と識別された場合には、識別結果を出力して現文字候補画像に対する識別動作はここで終了する。なお、比較器４２₁〜４２_N-1において、文字候補画像が文字画像か非文字画像であるか不明であると判定された場合、その結果は次段の特徴量抽出器３２₂〜３２_Nに通知される。 Then, the comparators 42 _{1 to} 42 _N respectively compare the input identification values Y1 to YN with threshold values set in advance for each image characteristic. As a result, the character identifiers 32 _{1 to} 32 _N identify the character candidate images in three ways based on the comparison results in the comparators 42 _{1 to} 42 _N. That is, the character identifiers 32 _{1 to} 32 _N identify whether the character candidate image is a character image, a non-character image, or an unknown image. If the character image or non-character image is identified, the identification result is output and the identification operation for the current character candidate image ends here. When the comparators 42 _{1 to} 42 _{N-1 determine} that the character candidate image is a character image or a non-character image, it is determined that the result is a feature quantity extractor 32 _{2 to} 32 _{N in the} next stage. Will be notified.

そして、特徴量抽出器３２₂〜３２_Nは、前段の文字識別処理において文字候補画像が文字画像であるか否かが不明である場合に初めて、文字候補画像から設定された特徴量を抽出する。 The feature quantity extractors 32 _{2 to} 32 _N extract the feature quantity set from the character candidate image only when it is unclear whether the character candidate image is a character image in the preceding character identification process. .

つまり、文字識別器３２₁〜３２_Nは、順次文字候補画像の識別処理を行って文字候補画像が文字画像か否か、または不明であるかの識別を行い、現段での識別処理において不明である場合に次段での識別処理が行われる。 That is, the character discriminators 32 _{1 to} 32 _N sequentially identify the character candidate images to identify whether the character candidate images are character images or are unknown, and are unknown in the current identification process. If it is, the identification process in the next stage is performed.

ただし、最終段の文字識別器３２_Nでは、文字画像か非文字画像であるか不明と識別された画像に対して、文字画像識別装置１０の用途、要求精度などを考慮して文字画像と識別するように動作させてもよいし、非文字画像と識別するように動作させてもよい。 However, the character identifier 32 _N at the last stage identifies the character image or non-character image from the character image in consideration of the use of the character image identification device 10, the required accuracy, etc. You may operate | move so that it may perform, and you may operate | move so that it may identify with a non-character image.

このように各文字識別器３２₁〜３２_Nでは、前段の文字識別器３２において不明であると判定された場合にのみ、後段の文字識別器３２において識別動作が行われる。そして、文字識別器３２₁〜３２_Nでは、ある段階で文字画像か否かが確定すると、その時点で識別処理を終了する。つまり、本実施形態の文字画像識別装置１０では、入力された文字候補画像に対して、識別が容易な文字候補画像に関しては文字／非文字の判断をできるだけ前段の文字識別器３２で確定し、識別が難しい文字候補画像に関しては多くの文字識別器３２の判断に基づいて識別を行うように動作する。 As described above, in each of the character classifiers 32 _{1 to} 32 _N , the classification operation is performed in the subsequent-stage character classifier 32 only when it is determined that the previous-stage character classifier 32 is unknown. When the character classifiers 32 _{1 to} 32 _N determine whether or not the image is a character image at a certain stage, the identification process is terminated at that point. In other words, in the character image identification device 10 of the present embodiment, for the input character candidate image, for the character candidate image that is easy to identify, the character / non-character determination is determined by the character identifier 32 at the previous stage as much as possible. For character candidate images that are difficult to identify, the operation is performed so as to perform identification based on the judgment of many character classifiers 32.

次に、文字識別器３２₁〜３２_Nにおける識別値算出部４１₁〜４１_Nの動作を図１４を参照して説明する。
図１４は識別値算出部４１₁〜４１_Nの動作を模式的に表したものである。図１４に示すように、識別値算出部４１₁〜４１_Nは特徴量抽出器３１_i（ｉ＝１〜Ｎ）からの出力ｘｉを入力し、識別値算出部４１₁〜４１_N毎に設定された識別関数ｆ（ｘ）にしたがって識別値ｙｉを出力する。ここで識別関数ｆ（ｘ）は以下の式（３）で表される１入力１出力関数である。 Next, the operation of the identification value calculation units 41 _{1 to} 41 _{N in} the character identifiers 32 _{1 to} 32 _N will be described with reference to FIG.
FIG. 14 schematically shows the operation of the identification value calculation units 41 _{1 to} 41 _N. As illustrated in FIG. 14, the identification value calculation units 41 _{1 to} 41 _N receive the output xi from the feature quantity extractor 31 _i (i = 1 to N), and are set for each of the identification value calculation units 41 _{1 to} 41 _N. The discriminant value yi is output according to the discriminated discriminant function f (x). Here, the discriminant function f (x) is a one-input one-output function expressed by the following equation (3).

ｙ＝ｆ（ｘ）・・・・（３） y = f (x) (3)

つまり特徴量抽出器３１_iからの出力である特徴量ｘｉに対して、式（３）にしたがって一意に識別値ｙｉが出力される。 That is, the identification value yi is uniquely output according to the equation (3) for the feature quantity xi that is the output from the feature quantity extractor 31 _i .

ここである特定の特徴量抽出器３１から抽出される特徴量に対する文字画像ならびに非文字画像の分布が図１５（Ａ）で表されるような場合を考える。例えば、先に説明した図５〜図９および図１１に具体例を示した黒画素密度、黒画素周囲長、平均ラン長、最大ラン長、複雑度、および水平白黒変化点数の場合である。 Consider a case where the distribution of the character image and the non-character image corresponding to the feature amount extracted from the specific feature amount extractor 31 is represented in FIG. For example, this is the case of the black pixel density, the black pixel perimeter, the average run length, the maximum run length, the complexity, and the horizontal black-and-white change points shown in the specific examples in FIGS. 5 to 9 and 11 described above.

図１５（Ａ）に示すような分布はあらかじめ多数の文字画像および非文字画像に対して特定の特徴量を抽出して、抽出した特徴量に対する文字および非文字画像のヒストグラムを生成することで得られる。図１５（Ａ）で示す例では特徴量ｘは（０≦ｘ≦１２７）で値の正規化が行われている。このような分布が得られる前提において各識別値算出部４１_iに設定されている識別関数ｙ＝ｆ（ｘ）は以下の式（４）で表される関数で設定する。 A distribution as shown in FIG. 15A is obtained by extracting specific feature amounts from a large number of character images and non-character images in advance and generating histograms of characters and non-character images for the extracted feature amounts. It is done. In the example shown in FIG. 15A, the feature amount x is (0 ≦ x ≦ 127) and the value is normalized. On the premise that such a distribution is obtained, the discrimination function y = f (x) set in each discrimination value calculation unit 41 _i is set by a function represented by the following equation (4).

ｙ＝（文字画像分布（ｘ）−非文字画像分布（ｘ））÷（文字画像分布（ｘ）＋非文字画像分布（ｘ））・・・（４） y = (character image distribution (x) −non-character image distribution (x)) ÷ (character image distribution (x) + non-character image distribution (x)) (4)

なお、この式（４）において文字画像分布（ｘ）とは特徴量ｘに対応する文字画像の個数であり、非文字画像分布（ｘ）とは特徴量ｘに対応する非文字画像の個数である。 In this equation (4), the character image distribution (x) is the number of character images corresponding to the feature quantity x, and the non-character image distribution (x) is the number of non-character images corresponding to the feature quantity x. is there.

このようにして設定された識別関数ｆ（ｘ）を図示したものを図１５（Ｂ）に示す。図１５（Ｂ）で示すように式（４）で設定される識別関数ｙ＝ｆ（ｘ）は特徴量ｘ（０≦ｘ≦１２７）の範囲で、値域（−１≦ｙ≦１）を取る関数であり、図７において文字画像分布の下限値ｘ０、非文字画像分布の上限値ｘ１とすると、識別値ｙの値は下記の式のようになる。
ｙ＝１（ｘ＞ｘ１）
＝−１（ｘ＜ｘ０） FIG. 15B shows the discrimination function f (x) set in this way. As shown in FIG. 15B, the discriminant function y = f (x) set by the equation (4) is in the range of the feature quantity x (0 ≦ x ≦ 127), and the value range (−1 ≦ y ≦ 1). Assuming that the lower limit value x0 of the character image distribution and the upper limit value x1 of the non-character image distribution in FIG. 7, the value of the identification value y is as shown in the following equation.
y = 1 (x> x1)
= -1 (x <x0)

このように図１５（Ｂ）に示すような識別関数ｙ＝ｆ（ｘ）を設定することで特徴量抽出器３１_iからの出力される特徴量に対して一意の識別値ｙｉが得られる。 Thus, by setting the discrimination function y = f (x) as shown in FIG. 15B, a unique discrimination value yi is obtained for the feature quantity output from the feature quantity extractor 31 _i .

次に、識別値算出部４１_iから出力された識別値ｙｉに基づく識別処理の詳細について説明する。 Next, details of the identification process based on the identification value yi output from the identification value calculation unit 41 _i will be described.

比較器４２₁〜４２_Nでは、識別値算出部４１₁〜４１_Nから出力され加算器４３₂〜４３_Nにより足し合わされた識別値ｙ１〜ｙＮに対して複数のしきい値、具体的には２つのしきい値ｔｈ（Ｌ）、ｔｈ（Ｈ）を用いて文字候補画像が文字画像か、非文字画像か、あるいは文字画像が非文字画像であるかを識別できない不明画像なのかの３通りの判断を行う。なお２つのしきい値ｔｈ（Ｌ）、ｔｈ（Ｈ）は、文字候補画像の識別処理とは別にあらかじめ設定しておくことが望ましい。 In the comparators 42 _{1 to} 42 _N , a plurality of threshold values, specifically to the identification values y1 to yN output from the identification value calculation units 41 _{1 to} 41 _N and added by the adders 43 ₂ to 43 _N , specifically, There are three ways of determining whether a character candidate image is a character image, a non-character image, or a character image is a non-character image using two threshold values th (L) and th (H). Make a decision. The two threshold values th (L) and th (H) are preferably set in advance separately from the character candidate image identification process.

具体的には、図１６に示すように、比較器４２₁〜４２_Nには、それぞれ値の異なる２つのしきい値ｔｈ（Ｌ）、ｔｈ（Ｈ）が設定されており、ｔｈ（Ｌ）＜ｔｈ（Ｈ）となっている。そして、例えば比較器４２₁に設定されている２つのしきい値のうち値の大きな方をｔｈ１（Ｈ）として表記し、値の小さい方をｔｈ１（Ｌ）として表記する。 Specifically, as shown in FIG. 16, two threshold values th (L) and th (H) having different values are set in the comparators 42 _{1 to} 42 _N , respectively, and th (L) <Th (H). For example, the larger one of the two threshold values set in the comparator 42 ₁ is expressed as th1 (H), and the smaller value is expressed as th1 (L).

そして、比較器４２₁〜４２_Nは、入力された識別値Ｙｉと、２つのしきい値ｔｈｉ（Ｌ）、ｔｈｉ（Ｈ）とを比較することにより、文字候補画像が文字画像であるか非文字画像であるか、あるいは不明であるかを識別する。 Then, the comparators 42 _{1 to} 42 _N compare the input identification value Yi with the two threshold values thi (L) and thi (H) to determine whether the character candidate image is a character image. Identify whether it is a character image or unknown.

具体的には、比較器４２₁〜４２_Nは、入力された識別値Ｙｉと上述した２つのしきい値ｔｈｉ（Ｌ）、ｔｈｉ（Ｈ）を用いて以下のように文字候補画像を３タイプに識別する。（ｉ＝１〜Ｎ）
文字画像（ｔｈｉ（Ｈ）≦Ｙｉ）
非文字画像（Ｙｉ≦ｔｈｉ（Ｌ））
不明画像（ｔｈｉ（Ｌ）＜Ｙｉ＜ｔｈｉ（Ｈ）） Specifically, the comparators 42 _{1 to} 42 _N use the input identification value Yi and the two threshold values thi (L) and thi (H) described above to generate three types of character candidate images as follows. To identify. (I = 1 to N)
Character image (thi (H) ≦ Yi)
Non-character image (Yi ≦ thi (L))
Unknown image (thi (L) <Yi <thi (H))

なお、最終段の文字識別器３２_Nでの識別結果は「非文字」、「文字」の２通りになるように、図１７に示すように、比較器４２_Nにおいて用いられるしきい値を、しきい値ｔｈＮの１つだけ設定するように構成して「非文字」および「文字」のいずれかに必ず識別されるようにしてもよい。 As shown in FIG. 17, the threshold value used in the comparator 42 _N is set so that there are two classification results of “non-character” and “character” in the final-stage character classifier 32 _N. Only one threshold value thN may be set so as to be identified as either “non-character” or “character”.

次に、上述したしきい値ｔｈ（Ｌ）、ｔｈ（Ｈ）の設定方法について説明する。 Next, a method for setting the threshold values th (L) and th (H) will be described.

２つのしきい値ｔｈ（Ｌ）、ｔｈ（Ｈ）それぞれは各識別値算出部４１毎に設定された識別関数ｙ＝ｆ（ｘ）に基づいて決定される。図１８にしきい値の設定の具体的な一例を示す。図１８（Ｂ）に示す具体例では、特徴量ｘ２および特徴量ｘ３における識別関数ｙ＝ｆ（ｘ）の出力値を、それぞれしきい値ｔｈ（Ｌ）およびｔｈ（Ｈ）として設定している。ここで特徴量ｘ２および特徴量ｘ３の決定方法の具体例としては、例えば、文字画像分布における下位５％の個数を含む境界特徴量を特徴量ｘ２とし、非文字画像分布における上位５％の個数を含む境界特徴量を特徴量ｘ３とするような方法が挙げられる。ただし、このような方法に限定されるものではなく文字画像識別装置１０の用途、または要求精度などにより境界特徴量ｘ２およびｘ３の決定方法は変更されてもよい。 Each of the two threshold values th (L) and th (H) is determined based on an identification function y = f (x) set for each identification value calculation unit 41. FIG. 18 shows a specific example of threshold setting. In the specific example shown in FIG. 18B, the output values of the discrimination function y = f (x) in the feature quantity x2 and the feature quantity x3 are set as threshold values th (L) and th (H), respectively. . Here, as a specific example of the determination method of the feature amount x2 and the feature amount x3, for example, the boundary feature amount including the lower 5% number in the character image distribution is set as the feature amount x2, and the upper 5% number in the non-character image distribution. A method in which the boundary feature amount including the feature amount x3 is used. However, the method is not limited to such a method, and the determination method of the boundary feature amounts x2 and x3 may be changed depending on the use of the character image identification device 10 or the required accuracy.

また簡単な決定方法としては、下記のように特徴量ｘ２を文字画像分布の下限値、特徴量ｘ３を非文字画像分布の上限値としてもよい。
ｘ２＝ｘ０
ｘ３＝ｘ１ As a simple determination method, the feature amount x2 may be the lower limit value of the character image distribution and the feature amount x3 may be the upper limit value of the non-character image distribution as described below.
x2 = x0
x3 = x1

図１８に示したしきい値設定の具体的な一例は、ある特定の特徴量に対する文字画像および非文字画像の分布が図１８（Ａ）で表されるような場合を前提としての例であったが、特徴量に対する文字画像および非文字画像の分布は図１８（Ａ）で表されるような場合だけではない。 A specific example of the threshold setting shown in FIG. 18 is an example on the assumption that the distribution of character images and non-character images for a specific feature amount is represented in FIG. However, the distribution of the character image and the non-character image with respect to the feature amount is not limited to the case shown in FIG.

そのため、文字画像および非文字画像の分布が図１８（Ａ）とは異なる場合にしきい値を設定する具体的な方法を図１９を参照して説明する。
図１９（Ａ）は、例えば図１０に示した連結成分数を特徴量とした場合の、文字ならびに非文字画像の頻度分布の場合である。図１９（Ａ）では、文字画像分布が値の大きい特徴量の範囲に局在分布しており、さらに文字画像分布が非文字画像分布にほぼ包含されている。ここでｘ０は文字画像分布の下限値である。 Therefore, a specific method for setting the threshold when the distribution of the character image and the non-character image is different from that in FIG. 18A will be described with reference to FIG.
FIG. 19A shows the frequency distribution of characters and non-character images when the number of connected components shown in FIG. In FIG. 19A, the character image distribution is localized in the feature value range having a large value, and the character image distribution is substantially included in the non-character image distribution. Here, x0 is a lower limit value of the character image distribution.

図１９（Ｂ）は、図１９（Ａ）に示した文字および非文字画像分布に対して、上記の式（４）で表される関数により設定される識別関数と、設定されるしきい値ｔｈ（Ｌ）およびｔｈ（Ｈ）を図示したものである。図１９（Ｂ）に図示されたしきい値ｔｈ（Ｌ）およびｔｈ（Ｈ）の設定方法は、しきい値ｔｈ（Ｌ）においては、例えば文字画像分布における上位および下位５％の個数を含む境界特徴量ｘ１およびｘ２を決定し、その境界特徴量ｘ１、ｘ２に対する関数値をしきい値ｔｈ（Ｌ）およびしきい値ｔｈ（Ｈ）とする。また簡単な決定方法としては、下記のように特徴量ｘ２を文字画像分布の下限値、特徴量ｘ３を特徴量最大値としてもよい。
ｘ１＝ｘ０
ｘ２＝１２７ FIG. 19B shows an identification function set by the function represented by the above formula (4) and a threshold value set for the character and non-character image distribution shown in FIG. Th (L) and th (H) are illustrated. In the threshold value th (L) and th (H) setting method illustrated in FIG. 19B, the threshold value th (L) includes, for example, the number of upper and lower 5% in the character image distribution. The boundary feature amounts x1 and x2 are determined, and the function values for the boundary feature amounts x1 and x2 are set as a threshold value th (L) and a threshold value th (H). As a simple determination method, the feature amount x2 may be the lower limit value of the character image distribution, and the feature amount x3 may be the maximum feature amount value as described below.
x1 = x0
x2 = 127

さらに図２０（Ａ）に別の特徴量に対する具体例について説明する。図２０（Ａ）は、例えば図１２に具体例を示した垂直白黒変化点数を特徴量とした場合の、文字ならびに非文字画像の頻度分布の場合である。図２０（Ａ）では、文字画像分布と非文字画像分布は異なる分布であるが、非文字画像分布が値の小さい特徴量の範囲に局在分布している。ここでｘ０およびｘ１はそれぞれ非文字画像分布の上限値、文字画像分布の上限値である。 Further, a specific example for another feature amount will be described with reference to FIG. FIG. 20A shows a case of frequency distribution of characters and non-character images when, for example, the vertical black-and-white change score shown in the specific example in FIG. 12 is used as the feature amount. In FIG. 20A, the character image distribution and the non-character image distribution are different distributions, but the non-character image distribution is localized in the range of the feature amount having a small value. Here, x0 and x1 are the upper limit value of the non-character image distribution and the upper limit value of the character image distribution, respectively.

図２０（Ｂ）は、図２０（Ａ）に示した文字および非文字画像分布に対して、上記の式（４）で表される関数により設定される識別関数と、設定されるしきい値ｔｈ（Ｌ）およびｔｈ（Ｈ）を図示したものである。図２０（Ｂ）に図示されたしきい値ｔｈ（Ｌ）およびｔｈ（Ｈ）の設定方法は、しきい値ｔｈ（Ｌ）においては、例えば文字画像分布における下位５％の個数を含む境界特徴量ｘ２を、しきい値ｔｈ（Ｈ）においては、非文字画像分布における上位５％の個数を含む境界特徴量ｘ３を決定し、その境界特徴量ｘ２、ｘ３に対する関数値をしきい値ｔｈ（Ｌ）およびしきい値ｔｈ（Ｈ）とする。また簡単な決定方法としては、下記のように特徴量ｘ２を特徴量最小値、特徴量ｘ３を非文字分布上限値としてもよい。
ｘ２＝０
ｘ３＝ｘ０ FIG. 20B shows an identification function set by the function represented by the above formula (4) and a threshold value set for the character and non-character image distribution shown in FIG. Th (L) and th (H) are illustrated. In the threshold value th (L) and th (H) setting method illustrated in FIG. 20B, the threshold feature th (L) includes, for example, a boundary feature including the number of lower 5% in the character image distribution. When the amount x2 is the threshold th (H), the boundary feature amount x3 including the top 5% number in the non-character image distribution is determined, and the function value for the boundary feature amounts x2 and x3 is set to the threshold th ( L) and threshold value th (H). As a simple determination method, the feature quantity x2 may be the minimum feature quantity value and the feature quantity x3 may be the non-character distribution upper limit value as described below.
x2 = 0
x3 = x0

このように図１８（Ａ）で示した文字、非文字画像分布とは別の分布の場合である図１９（Ａ）および図２０（Ａ）の具体例においても図１８（Ｂ）と同様の設定方法でしきい値ｔｈ（Ｌ）およびｔｈ（Ｈ）の設定が可能である。つまりある特定の特徴量に対する文字、非文字分布がどのような分布であっても上記と同様な方法でしきい値ｔｈ（Ｌ）およびｔｈ（Ｈ）を設定できる。 Thus, the specific example of FIGS. 19A and 20A, which is a distribution different from the character and non-character image distribution shown in FIG. 18A, is the same as FIG. 18B. The thresholds th (L) and th (H) can be set by a setting method. That is, the threshold values th (L) and th (H) can be set in the same manner as described above, regardless of the distribution of characters and non-characters for a specific feature amount.

次に、本実施形態の文字画像識別装置１０における識別動作を図２１に示すフローチャートを参照して詳細に説明する。 Next, the identification operation in the character image identification apparatus 10 of this embodiment will be described in detail with reference to the flowchart shown in FIG.

先ず、特徴量抽出器３１₁では、文字候補画像から特徴量ｘ（１）を抽出する（ステップＳ１０１）。すると、文字識別器３２₁では、識別値算出部４１₁により、抽出された特徴量ｘ（１）に基づく識別値ｙ１が算出される（ステップＳ１０２）。 First, the feature quantity extractor 31 _1, extracts a feature x (1) from the character candidate image (step S101). Then, in the character classifier 32 ₁ , the identification value calculation unit 41 ₁ calculates an identification value y1 based on the extracted feature quantity x (1) (step S102).

すると、文字識別器３２₁では、比較器４２₁により、算出された識別値ｙ１と、予め設定されているしきい値ｔｈ１（Ｌ）、ｔｈ１（Ｈ）との比較が行われる（ステップＳ１０３、Ｓ１０４）。 Then, in the character discriminator 32 ₁ , the comparator 42 ₁ compares the calculated discriminant value y1 with preset threshold values th1 (L) and th1 (H) (step S103, S104).

この比較器４２₁における比較により、識別値ｙ１がｔｈ１（Ｌ）以下の場合には（ステップＳ１０３においてｙｅｓ）、判定結果は非文字であるとされ識別処理は終了する（ステップＳ２０３）。また、識別値ｙ１がｔｈ１（Ｈ）以上の場合には（ステップＳ１０４においてｙｅｓ）、判定結果は文字であるとされ識別処理は終了する（ステップＳ２０４）。 By comparison in the comparator 42 _1, the identification value y1 is th1 when (L) below (yes in step S103), the determination result is to be a non-character identification process ends (step S203). If the identification value y1 is greater than or equal to th1 (H) (yes in step S104), the determination result is assumed to be a character and the identification process ends (step S204).

識別値ｙ１がｔｈ１（Ｌ）よりも大きくｔｈ１（Ｈ）よりも小さいと判定された場合には、判定結果は不明であるとして、比較器４２1から次段の特徴量抽出器３２₂に対してその旨が通知される（ステップＳ１０３、Ｓ１０４においてもともにｎｏ）。 When it is determined that the identification value y1 is greater than th1 (L) and smaller than th1 (H), the determination result is unknown, and the comparator 421 applies to the next-stage feature quantity extractor 32 ₂ . This is notified (both in steps S103 and S104, no).

その結果、特徴量抽出器３１₂では、文字候補画像から特徴量ｘ（２）が抽出され（ステップＳ１０５）、文字識別器３２₂では、識別値算出部４１₂により、抽出された特徴量ｘ（２）に基づく識別値ｙ２が算出される。そして、加算器４３₂により識別値ｙ１（Ｙ１）と識別値ｙ２が加算されて識別値Ｙ２が算出される（ステップＳ１０６）。 As a result, the feature amount extractor 31 ₂ extracts the feature amount x (2) from the character candidate image (step S105), and the character identifier 32 ₂ extracts the feature amount x extracted by the identification value calculation unit 41 _2. An identification value y2 based on (2) is calculated. The adder 43 ₂ adds the identification value y1 (Y1) and the identification value y2 to calculate the identification value Y2 (step S106).

そして、文字識別器３２₂では、比較器４２₂により、算出された識別値Ｙ２と、予め設定されているしきい値ｔｈ２（Ｌ）、ｔｈ２（Ｈ）との比較が行われる（ステップＳ１０７、Ｓ１０８）。 In the character discriminator 32 ₂ , the comparator 42 ₂ compares the calculated discriminant value Y2 with preset threshold values th2 (L) and th2 (H) (step S107, S108).

この比較器４２₂における比較により、識別値Ｙ２がｔｈ２（Ｌ）以下の場合には（ステップＳ１０７においてｙｅｓ）、判定結果は非文字であるとされ識別処理は終了する（ステップＳ２０３）。また、識別値Ｙ２がｔｈ２（Ｈ）以上の場合には（ステップＳ１０８においてｙｅｓ）、判定結果は文字であるとされ識別処理は終了する（ステップＳ２０４）。 By comparison in the comparator 42 _2, the identification value Y2 is the case of th2 (L) or less (yes in step S107), the determination result is to be a non-character identification process ends (step S203). If the identification value Y2 is greater than or equal to th2 (H) (yes in step S108), the determination result is assumed to be a character and the identification process ends (step S204).

識別値Ｙ２がｔｈ２（Ｌ）よりも大きくｔｈ２（Ｈ）よりも小さいと判定された場合には、判定結果は不明であるとして、比較器４２₂から次段の特徴量抽出器３２₃に対してその旨が通知される（ステップＳ１０７、Ｓ１０８においてもともにno）。このようにしてステップＳ１０９以下においても同様の処理が繰り返される。 If the identification value Y2 is determined to th2 (L) less than the greater than th2 (H) is determined as the result is unknown, to the comparator 42 ₂ from the next stage feature quantity extractor 32 ₃ To that effect (both in steps S107 and S108, no). In this way, the same processing is repeated in step S109 and subsequent steps.

なおこの繰り返し処理において、文字識別器３２₂〜３２_Nには加算器４３₂〜４３_Nが設けられていることにより、現段での識別値と前段まで算出された識別値の和（例えば比較器４２₃ではＹ３＝ｙ１＋ｙ２＋ｙ３）としきい値との比較が行われることになる。 In this iterative process, the character identifiers 32 _{2 to} 32 _N are provided with adders 43 ₂ to 43 _N so that the sum of the identification value at the current stage and the identification value calculated up to the previous stage (for example, comparison) vessel 42 comparison ₃ in Y3 = y1 + y2 + y3) and the threshold value are to be made.

最後に、文字識別器３２_Nにおいて、算出された識別値ＹＮとしきい値ｔｈＮとが比較されて（ステップＳ２０２）、最終的に判定結果が「非文字」であるか（ステップＳ２０３）、「文字」であるか（ステップＳ２０４）が確定する。 Finally, the character discriminator 32 _N compares the calculated identification value YN with the threshold thN (step S202), and finally determines whether the determination result is “non-character” (step S203). "(Step S204).

次に、文字識別器３２₁〜３２_Nの構成順序について説明する。上記において説明してきたように、本実施形態における文字画像識別装置１０では、できる限り前段の文字識別器３２で「文字」、「非文字」と識別するように動作させるために、文字識別器３２₁〜３２_Nの構成順序は文字／非文字識別能力が高い順とする。つまり、識別能力が高い文字識別器３２ほど前段に配置されるような構成とする。ここで言う文字／非文字識別能力が高い文字識別器３２とは、設定された識別関数としきい値による誤識別率が最も低い文字識別器３２であり、誤識別率は例えば各文字識別器３２毎に文字／非文字を含んだ同じ文字候補画像群を識別させてその誤識別した画像数を計数し、この誤識別した画像数を総文字候補画像数で除することで得られる。 Next, the configuration order of the character identifiers 32 _{1 to} 32 _N will be described. As described above, in the character image identification device 10 in the present embodiment, the character identifier 32 is operated so that the character identifier 32 in the preceding stage can be identified as “character” and “non-character” as much as possible. The construction order of _{1 to} 32 _{N is} the order in which the character / non-character discrimination capability is high. That is, the character discriminator 32 having a higher discriminating ability is arranged in the preceding stage. The character discriminator 32 having a high character / non-character discriminating ability here is the character discriminator 32 having the lowest misidentification rate based on the set discriminating function and threshold value. It is obtained by identifying the same character candidate image group including characters / non-characters every time, counting the number of misidentified images, and dividing the number of misidentified images by the total number of character candidate images.

また文字識別器３２で用いるしきい値に関しても、より前段の文字識別器３２においては文字／非文字を高い精度で識別するために、たとえば図１８（Ｂ）でのしきい値設定の説明において文字画像分布の下限値、非文字画像の上限値を用いる場合（ｘ２＝ｘ０、ｘ３＝ｘ１）のように、より精度が高い識別が期待されるしきい値を設定するようにする。 Further, regarding the threshold value used in the character classifier 32, in order to identify the character / non-character with high accuracy in the preceding character classifier 32, for example, in the description of threshold setting in FIG. As in the case where the lower limit value of the character image distribution and the upper limit value of the non-character image are used (x2 = x0, x3 = x1), a threshold value that is expected to be identified with higher accuracy is set.

[第２の実施形態]
次に、本発明の第２の実施形態の文字画像識別装置について説明する。 [Second Embodiment]
Next, a character image identification apparatus according to a second embodiment of the present invention will be described.

上記の第１の実施形態では、文字識別器３２₂〜３２_Nにおける識別処理において、前段までの各文字識別器３２₁〜３２_N-1からの識別値と現段の文字識別器３２からの識別値との和を識別処理に用いていたが、本発明はこのような構成に限定されるものではない。 In the first embodiment, in the identification processing in the character classifiers 32 _{2 to} 32 _N , the identification values from the character classifiers 32 _{1 to} 32 _{N−1 up} to the previous stage and the character classifiers 32 from the current stage are used. Although the sum with the identification value is used for the identification processing, the present invention is not limited to such a configuration.

本実施形態の文字画像識別装置１０ａの構成を図２２、図２３を参照して説明する。
本実施形態の文字画像識別装置１０ａは、図２に示した第１の実施形態の文字画像識別装置１０に対して、文字識別器３２₁〜３２_Nを文字識別器１３２₁〜１３２_Nに置き換えた構成となっている。 The configuration of the character image identification device 10a of this embodiment will be described with reference to FIGS.
Character image identification apparatus 10a of the present embodiment is different from the character image identification apparatus 10 of the first embodiment shown in FIG. 2, replacing the character discriminator 32 ₁ to 32 _N in the character discriminator 132 ₁ to 132 _N It becomes the composition.

本実施形態における文字識別器１３２₁〜１３２_Nでは、図２３を参照すると分かるように、前段の識別値を順次加算していくための加算器が設けられていない。よって、本実施形態における文字識別器１３２₁〜１３２_Nでは、識別値算出部４１₁〜４１_Nにより算出された識別値ｙ１〜ｙＮと、予め設定されているしきい値ｔｈ（Ｌ）、ｔｈ（Ｈ）との比較が行われる。 In the character identifiers 132 _{1 to} 132 _N in the present embodiment, as can be seen with reference to FIG. 23, an adder for sequentially adding the identification values in the previous stage is not provided. Therefore, in the character classifiers 132 _{1 to} 132 _{N according to} the present embodiment, the identification values y1 to yN calculated by the identification value calculation units 41 _{1 to} 41 _N and the preset threshold values th (L) and th. Comparison with (H) is made.

本実施形態のように文字識別器１３２₁〜１３２_Nを構成した場合でも、文字識別処理の動作は第１の実施形態の動作と同様であり、図２１に示したフローチャートにおいて、ステップＳ１０６、Ｓ２０１等におけるＹ２、ＹＮ等をｙ２、ｙｎと置き換えるだけでよい。 Even when the character identifiers 132 _{1 to} 132 _N are configured as in the present embodiment, the operation of the character identification process is the same as the operation of the first embodiment, and steps S106 and S201 in the flowchart shown in FIG. It is only necessary to replace Y2, YN, etc. with y2, yn.

[変形例]
上記実施形態では、文字候補画像が文字画像であるか非文字画像であるかを識別する場合を用いて説明したが、本発明はこれに限定されるものではなく、顔識別処理のように識別対象画像がある特定のパターンと合致するか否かを識別するような場合でも同様に適用することができるものである。 [Modification]
In the above embodiment, the case where a character candidate image is identified as a character image or a non-character image has been described. However, the present invention is not limited to this, and identification is performed as in face identification processing. The present invention can be applied in the same manner even when identifying whether or not the target image matches a specific pattern.

１０文字画像識別装置
１１ＣＰＵ
１２メモリ
１３記憶装置
１４通信インタフェース（ＩＦ）
１５ユーザインタフェース（ＵＩ）装置
１６制御バス
２１文字候補画像切出部
２２画像サイズ規格化部
２４判定器
３１₁〜３１_N 特徴量抽出器
３２₁〜３２_N 文字識別器
４１₁〜４１_N 識別値算出部
４２₁〜４２_N 比較器
４３₂〜４３_N 加算器
１３２₁〜１３２_N 文字識別器 10 character image identification device 11 CPU
12 Memory 13 Storage Device 14 Communication Interface (IF)
15 User Interface (UI) device 16 controls the bus 21 character candidate image extraction unit 22 image size normalization section 24 judging unit 31 ₁ to 31 _N feature quantity extractor 32 ₁ to 32 _N character discriminators 41 ₁ to 41 _N identification value Calculation unit 42 _{1 to} 42 _N comparator 43 ₂ to 43 _N adder 132 _{1 to} 132 _N character identifier

Claims

A plurality of stages of extraction means for sequentially extracting feature amounts based on a plurality of different image characteristics from character candidate images that have been determined to be possibly character images from the image data to be processed;
An identification value provided corresponding to each of the plurality of stages of extraction means, and indicating the possibility that the character candidate image is a character image or a non-character image based on the feature amount extracted by the corresponding extraction means And a comparison unit that compares the identification value calculated by the calculation unit with a threshold value set in advance for each image characteristic, and based on the comparison result in the comparison unit, the character candidate A plurality of stages of identifying means for identifying whether the image is a character image or a non-character image, or a character image, a non-character image, or an unknown unknown image;
When the character candidate image is identified as a character image or a non-character image by at least one of the plurality of stages of identification means, the character candidate image is determined to be a character image or a non-character image. Determination means for
Each of the plurality of stages of extraction means is a character image identification device that extracts a feature amount based on the set image characteristics when the previous-stage identification means identifies the character candidate image as an unknown image .

It said plurality of stages of identification means, respectively, and the identification value in the preceding-stage identification means, character image identification apparatus according to claim 1, further comprising adding means for adding the calculated identification value by the calculation means in the current stage.

The calculating means, the plurality with respect to the feature amount using the discriminant function which is previously calculated based on the frequency distribution of the character image and the non-character image, according to claim 1 or 2, wherein comprising calculating the identification value from the feature Character image identification device.

The character according to any one of claims 1 to 3 , wherein the plurality of stages of identifying means are arranged in a preceding stage as an identifying means having a higher identification ability for identifying whether a character candidate image is a character image or a non-character image. Image identification device.

Extracting a feature amount based on a plurality of different image characteristics from a character candidate image that is determined to be a character image from image data to be processed and cut out;
Calculating an identification value indicating the possibility that the character candidate image is a character image or a non-character image based on the extracted feature amount; and
Comparing the calculated identification value with a preset threshold value for each image characteristic;
Identifying whether the character candidate image is a character image, a non-character image, or a character image, a non-character image, or an unknown image based on a comparison result with a threshold value When,
When the character candidate image is identified as a character image or a non-character image, it is determined that the character candidate image is a character image or a non-character image, and the character candidate image is identified as an unknown image The step of extracting the feature amount, the step of calculating the identification value, the step of comparing with the threshold value, the step of identifying the character candidate image again,
A character image identification method comprising:

Extracting a feature amount based on a plurality of different image characteristics from a character candidate image that is determined to be a character image from image data to be processed and cut out;
Calculating an identification value indicating the possibility that the character candidate image is a character image or a non-character image based on the extracted feature amount; and
Comparing the calculated identification value with a preset threshold value for each image characteristic;
Identifying whether the character candidate image is a character image, a non-character image, or a character image, a non-character image, or an unknown image based on a comparison result with a threshold value When,
When the character candidate image is identified as a character image or a non-character image, it is determined that the character candidate image is a character image or a non-character image, and the character candidate image is identified as an unknown image A program for causing a computer to execute a step of extracting a feature amount, a step of calculating an identification value, a step of comparing with a threshold value, and a step of repeating a step of identifying a character candidate image again.