JP5353482B2

JP5353482B2 - Pattern recognition dictionary generation device, pattern recognition device, and pattern recognition dictionary generation method

Info

Publication number: JP5353482B2
Application number: JP2009152887A
Authority: JP
Inventors: 利昇三好; 健永崎; 広新庄; 正一石井
Original assignee: Hitachi Information and Telecommunication Engineering Ltd
Current assignee: Hitachi Information and Telecommunication Engineering Ltd
Priority date: 2009-06-26
Filing date: 2009-06-26
Publication date: 2013-11-27
Anticipated expiration: 2029-06-26
Also published as: JP2011008634A

Description

本発明は、パターン認識装置に関し、特に認識の高速化のための大分類手法に関する。 The present invention relates to a pattern recognition apparatus, and more particularly to a large classification method for speeding up recognition.

パターン認識装置は、入力パターンを認識し、その所属カテゴリを判定する装置である。パターンとしては、画像データや音声データなどが挙げられる。例えば、文字認識の場合には、パターンは画像である。カテゴリは、識別対象となるパターンの類別である。例えば、文字認識において、数字認識の場合のカテゴリは「０」〜「９」の１０字種、漢字認識の場合のカテゴリは数千字種となる。 The pattern recognition device is a device that recognizes an input pattern and determines its category. Examples of the pattern include image data and audio data. For example, in the case of character recognition, the pattern is an image. The category is a classification of patterns to be identified. For example, in character recognition, the category for numeric recognition is 10 character types “0” to “9”, and the category for kanji recognition is several thousand character types.

パターン認識装置が文字認識装置の場合、文字認識装置は、入力画像中の文字が予め設定された字種（カテゴリ）のうち、どの字種（カテゴリ）に該当するかを判定し、判定結果を出力する。 When the pattern recognition device is a character recognition device, the character recognition device determines which character type (category) corresponds to a character type (category) among the character types (categories) set in advance in the input image, and determines the determination result. Output.

パターン認識装置が実行する処理は、学習フェーズと認識フェーズとから構成される。 The process executed by the pattern recognition apparatus is composed of a learning phase and a recognition phase.

学習フェーズでは、パターン認識装置が、学習用パターンＤＢ（データベース）を用いて特徴選択用辞書と識別用辞書とを作成する。 In the learning phase, the pattern recognition device creates a feature selection dictionary and an identification dictionary using a learning pattern DB (database).

具体的には、学習フェーズでは、大分類識別関数と詳細識別関数とが生成され、生成された詳細識別関数と大分類識別関数とが識別用辞書に格納される。大分類識別関数は、正解候補を大雑把に絞るために用いる関数である。詳細識別関数は、絞り込まれた正解候補の類似度を算出するための関数である。 Specifically, in the learning phase, a large classification function and a detailed classification function are generated, and the generated detailed classification function and large classification function are stored in the identification dictionary. The large classification identification function is a function used to roughly narrow down correct answer candidates. The detailed identification function is a function for calculating the similarity of the narrowed-down correct answer candidates.

詳細識別関数の生成アルゴリズムとしては、最近傍法、パーセプトロン、改良投影距離法、修正二次識別、一般学習ベクトル量子化、多項式ネットワーク、及びサポートベクトルマシンなどがある（例えば、非特許文献１、非特許文献２参照）。 As a detailed discriminant function generation algorithm, there are a nearest neighbor method, a perceptron, an improved projection distance method, a modified quadratic discrimination, a general learning vector quantization, a polynomial network, a support vector machine, etc. Patent Document 2).

大分類識別関数を生成するアルゴリズムとしては、詳細識別関数よりも精度は劣るが高速な識別関数を用いる方法と、識別するカテゴリの数を減らして大分類を行う方法とがある。 As an algorithm for generating a large classification discriminant function, there are a method using a high-speed discriminant function that is less accurate than a detailed discriminant function, and a method of performing large classification by reducing the number of categories to be identified.

高速な識別関数を用いる方法としては、例えば、線型識別関数などの計算量が少ない識別関数を大分類用に用いる方法である。前述した方法は、計算量が少ない識別関数を用いる点を除いて、詳細識別関数を生成するアルゴリズムと同様である。 As a method using a high-speed discriminant function, for example, a discriminant function having a small calculation amount such as a linear discriminant function is used for large classification. The method described above is the same as the algorithm for generating the detailed discriminant function, except that an discriminant function with a small amount of calculation is used.

識別するカテゴリの数を減らして大分類を行う方法は、認識対象のカテゴリをいくつかにまとめたり、分布をクラスタ化したりすることで行われる。例えば、アルファベットの認識で、特徴空間上で近い文字種（「Ｉ」と「ｌ」、又は「Ｋ」と「ｋ」など）を同一カテゴリとして扱うなどの方法がある。 A method of performing large classification by reducing the number of categories to be identified is performed by collecting some categories to be recognized or clustering distributions. For example, in alphabet recognition, there is a method of treating character types (“I” and “l” or “K” and “k”, etc.) that are close in the feature space as the same category.

より具体的には、大分類識別関数を生成するアルゴリズムとして、線型判別関数を用いた手法（例えば、非特許文献１参照）や、一般学習ベクトル量子化を用いたクラスタリング手法（例えば、特許文献１参照）などが使われる。 More specifically, as an algorithm for generating a large classification discriminant function, a method using a linear discriminant function (for example, see Non-Patent Document 1) or a clustering method using general learning vector quantization (for example, Patent Document 1). For example).

認識フェーズでは、パターン認識装置が、作成された特徴選択用辞書と識別用辞書とを用いて、入力パターンを認識する。認識フェーズでは、大分類識別処理と詳細識別処理とが実行される。 In the recognition phase, the pattern recognition device recognizes an input pattern using the created feature selection dictionary and identification dictionary. In the recognition phase, a large classification identification process and a detailed identification process are executed.

まず、大分類識別処理が実行される目的について説明する。 First, the purpose of executing the large classification identification process will be described.

大分類識別処理は、詳細識別関数を用いた詳細識別処理よりも粗い識別処理を高速に行い、全カテゴリの中から正解カテゴリとなる候補を絞るための処理である。 The large classification identification process is a process for performing a coarse identification process faster than the detailed identification process using the detailed identification function, and narrowing down candidates for the correct category from all categories.

詳細識別処理のみが実行される場合、パターン認識装置は、全カテゴリに対して処理を実行する必要がある。しかし、前述の方法では、例えば、数千カテゴリ以上を認識対象とする漢字認識などでは、処理時間が莫大となる。そのため、パターン認識装置は、大分類識別処理と、詳細識別処理との二段階の識別処理を実行することによって処理を高速化する。 When only the detailed identification process is executed, the pattern recognition apparatus needs to execute the process for all categories. However, in the above-described method, for example, the processing time becomes enormous in the recognition of kanji for which several thousand categories or more are to be recognized. Therefore, the pattern recognition apparatus speeds up the processing by executing a two-stage identification process including a large classification identification process and a detailed identification process.

大分類識別処理では、正解候補が大雑把に絞り込まれる。例えば、漢字認識では、大分類識別処理を実行することによって、正解候補が数十個から数百個程度に絞られる。詳細分類では、絞られた正解候補カテゴリの各々に対する類似度が、詳細識別関数を用いて算出される。 In the large classification identification process, correct answer candidates are roughly narrowed down. For example, in kanji recognition, the correct answer candidates are reduced to about several tens to several hundreds by executing the large classification identification process. In the detailed classification, the similarity to each of the narrowed correct answer candidate categories is calculated using a detailed identification function.

パターン認識では、認識処理の高速化のため、まず、高速に計算できる大分類識別関数を用いて、正解候補カテゴリを絞る粗い識別処理が実行される。その後、正解候補カテゴリに対して詳細識別関数を用いて、正解候補カテゴリの各々に対する入力パターンの類似度が算出され、最終的な認識結果が出力される。 In pattern recognition, in order to speed up the recognition process, first, a rough classification process for narrowing down the correct candidate categories is performed using a large classification function that can be calculated at high speed. Thereafter, using the detailed identification function for the correct candidate category, the similarity of the input pattern for each of the correct candidate categories is calculated, and the final recognition result is output.

従来の大分類識別処理及び詳細識別処理の方法として、テンプレート方式を用いた大分類方法がある。 As a method of conventional major classification identification processing and detailed identification processing, there is a major classification method using a template method.

前述の方法では大分類識別処理のために、予め学習用パターンＤＢを用いた学習によって、特徴空間上に１個又は複数個のカテゴリを代表する代表ベクトルを複数個用意しておく。 In the above-described method, a plurality of representative vectors representing one or a plurality of categories are prepared in advance in the feature space by learning using the learning pattern DB for the large classification identification process.

認識フェーズ時には、パターン認識装置が、代表ベクトルと入力パターンとの距離を算出し、入力パターンとの距離が近い代表ベクトルが代表するカテゴリ群に含まれるカテゴリを正解候補カテゴリとする。パターン認識装置は、前述のようにして算出された正解候補カテゴリの各々に対して、詳細識別処理を実行する。前述の場合、通常、詳細識別処理にもテンプレート方式の方法が用いられる。すなわち、予め、各カテゴリを代表する代表ベクトルを学習によって作成しておき、認識フェーズ時には、パターン認識装置が、入力パターンと代表ベクトルとの距離から入力パターンと各々のカテゴリとの類似度を算出する。 During the recognition phase, the pattern recognition apparatus calculates the distance between the representative vector and the input pattern, and sets the category included in the category group represented by the representative vector that is close to the input pattern as the correct candidate category. The pattern recognition apparatus performs detailed identification processing for each of the correct candidate categories calculated as described above. In the case described above, the template method is usually used for the detailed identification process. That is, representative vectors representing each category are created in advance by learning, and in the recognition phase, the pattern recognition device calculates the similarity between the input pattern and each category from the distance between the input pattern and the representative vector. .

特許第３４７５８８６号明細書Japanese Patent No. 3475886

Mohammed Cheriet, Nawwaf Kharma, Cheng lin Liu, and Ching Suen. Character Recognition Systems: A Guide for Students and Practitioners. Wiley-Interscience, 2007.Mohammed Cheriet, Nawwaf Kharma, Cheng lin Liu, and Ching Suen.Character Recognition Systems: A Guide for Students and Practitioners.Wiley-Interscience, 2007. 石井健一郎, 上田修功, 前田英作, 村瀬洋. パターン認識. オーム社出版局.Kenichiro Ishii, Nobuo Ueda, Eisaku Maeda, Hiroshi Murase. Pattern recognition. Ohm Publishing House. Liu, C. L., Sako, H., Fujisawa, H. Performance evaluation of pattern classifier for handwritten character recognition, International Journal of Document Analysis and Recognition, Vol.4, No.3, pp.191-204.Liu, C. L., Sako, H., Fujisawa, H. Performance evaluation of pattern classifier for handwritten character recognition, International Journal of Document Analysis and Recognition, Vol.4, No.3, pp.191-204.

しかし、テンプレート方式を用いた大分類識別処理は、精度上の問題がある。特に、手書き文字認識においてテンプレート方式を用いた大分類識別処理が実行された場合、テンプレート方式を用いた認識精度は、ニューラルネットワークやサポートベクトルマシンを用いた認識精度よりも低いことが実験によって確認されている（例えば、非特許文献３参照）。 However, the large classification identification process using the template method has a problem in accuracy. In particular, it is confirmed by experiments that the recognition accuracy using the template method is lower than the recognition accuracy using the neural network or the support vector machine when the large classification recognition process using the template method is executed in handwritten character recognition. (For example, refer nonpatent literature 3).

また、大分類識別処理のみにテンプレート方式を用い、詳細識別処理は他の学習アルゴリズムを用いる場合、学習アルゴリズム及び認識アルゴリズムが大分類識別処理と詳細識別処理とで異なることになるため、構成が煩雑になるという実装上の問題がある。 Further, when the template method is used only for the large classification identification process and the detailed identification process uses another learning algorithm, the learning algorithm and the recognition algorithm are different between the large classification identification process and the detailed identification process, so that the configuration is complicated. There is an implementation problem of becoming.

また、従来の大分類識別処理及び詳細識別処理の方法として、大分類識別処理では線型関数などの精度は劣るが計算量が少ない大分類識別関数を用い、詳細識別処理は精度が高い識別関数を用いる方法がある。前述の方法を用いた場合も、同様に、大分類識別関数の精度上の問題と、大分類識別処理及び詳細識別処理で学習アルゴリズムが二重になるという構成上の煩雑さの問題とがある。 In addition, as a method for conventional large classification identification processing and detailed identification processing, a large classification identification function is used which uses a large classification identification function which is inferior in accuracy such as a linear function but has a small amount of calculation. There is a method to use. Similarly, when the above method is used, there are a problem in accuracy of the large classification identification function and a problem in the construction that the learning algorithm is duplicated in the large classification identification processing and the detailed identification processing. .

本発明の代表的な一例を示せば、以下の通りである。プロセッサと、前記プロセッサに接続された記憶媒体とを備えるパターン認識用辞書生成装置であって、前記記憶媒体には、複数の学習用のパターンから構成される学習用パターンデータベースが格納され、前記パターン認識用辞書生成装置は、前記学習用パターンデータベースから、各々の前記学習用のパターンを一つのカテゴリとして取得するパターン入力部と、前記取得されたカテゴリ毎にｎ次元特徴を抽出する特徴抽出部と、前記抽出されたｎ次元特徴を用いて、前記ｎ次元以下の次元であるｍ次元特徴に前記ｎ次元特徴を変換する特徴選択関数を生成し、前記生成された特徴選択関数を特徴選択用辞書として前記記憶媒体に格納する特徴選択辞書生成部と、前記特徴選択関数を用いて、前記抽出されたｎ次元特徴を前記ｍ次元特徴に変換する特徴選択部と、前記変換されたｍ次元特徴を用いて、各カテゴリに対する認識対象のパターンの類似度を算出するためのｍ次元特徴空間上の詳細識別関数を生成し、前記生成された詳細識別関数を識別用辞書として前記記憶媒体に格納する識別関数生成部と、前記ｍ次元以下の次元であり、前記ｎ次元特徴空間の部分空間で、かつ、前記ｍ次元特徴空間の部分空間であるＬ次元特徴空間上のＬ次元特徴に前記ｍ次元特徴を変換する大分類特徴選択関数を生成し、前記詳細識別関数を前記Ｌ次元特徴空間上の関数として変換することによって前記Ｌ次元特徴空間上で各カテゴリに対する前記認証対象のパターンの類似度を算出するための大分類識別関数を生成し、前記生成された大分類特徴選択関数を前記特徴選択用辞書として前記記憶媒体に格納し、前記生成された大分類識別関数を前記識別用辞書として前記記憶媒体に格納する識別関数主要部分抽出部と、を備えることを特徴とする。 A typical example of the present invention is as follows. A pattern recognition dictionary generation device comprising a processor and a storage medium connected to the processor, wherein the storage medium stores a learning pattern database composed of a plurality of learning patterns, and the pattern The recognition dictionary generation device includes a pattern input unit that acquires each of the learning patterns as one category from the learning pattern database, and a feature extraction unit that extracts an n-dimensional feature for each of the acquired categories. , Using the extracted n-dimensional feature, generating a feature selection function for converting the n-dimensional feature into an m-dimensional feature that is a dimension less than or equal to the n-dimension, and using the generated feature selection function as a feature selection dictionary And using the feature selection function to store the extracted n-dimensional feature as the m-dimensional feature. Using the feature selection unit to convert and the converted m-dimensional feature, a detailed identification function on the m-dimensional feature space for calculating the similarity of the pattern to be recognized for each category is generated, and the generated A discriminant function generation unit that stores a detailed discriminant function as a discriminating dictionary in the storage medium, a dimension of m dimensions or less, a subspace of the n-dimensional feature space, and a subspace of the m-dimensional feature space A large classification feature selection function for converting the m-dimensional feature into an L-dimensional feature on a certain L-dimensional feature space is generated, and the detailed identification function is converted as a function on the L-dimensional feature space to thereby convert the L-dimensional feature space. A large classification identification function for calculating the similarity of the pattern to be authenticated with respect to each category is generated above, and the generated large classification feature selection function is used as the feature selection dictionary. Stored in 憶媒 body, characterized in that it comprises an identification function main portion extraction unit for storing in the storage medium a large classification identification function said generated as a dictionary for the identification.

Ｌ次元特徴空間上の関数として算出された大分類識別関数を詳細識別関数に用いることによって、パターン認識の処理の精度を保ち、かつ、高速化することができる。また、大分類識別関数は、詳細識別関数から生成されるため、大分類処理と詳細分類処理との学習アルゴリズムの構成を容易にすることができる。 By using the large classification discriminant function calculated as a function on the L-dimensional feature space as the detailed discriminant function, the accuracy of the pattern recognition process can be maintained and the speed can be increased. Further, since the major classification function is generated from the detailed classification function, the configuration of the learning algorithm for the major classification process and the detailed classification process can be facilitated.

本発明の第１の実施形態のパターン認識装置の構成の一例を示すブロック図である。It is a block diagram which shows an example of a structure of the pattern recognition apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態のパターン認識装置における学習フェーズの処理を実行するモジュールとＤＢ（データベース）の構成を説明するフローチャートである。It is a flowchart explaining the structure of the module and DB (database) which perform the process of the learning phase in the pattern recognition apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態のパターン認識装置における認識フェーズの処理を実行するモジュールとＤＢ（データベース）の構成を説明するフローチャートである。It is a flowchart explaining the structure of the module and DB (database) which perform the process of the recognition phase in the pattern recognition apparatus of the 1st Embodiment of this invention. 本発明の第１の実施形態の識別関数主要部分抽出部が実行する処理を説明するフローチャートである。It is a flowchart explaining the process which the discriminant function main part extraction part of the 1st Embodiment of this invention performs. 本発明の第１の実施形態の文字認識における特徴抽出部が実行する処理の一例を説明するフローチャートである。It is a flowchart explaining an example of the process which the feature extraction part in the character recognition of the 1st Embodiment of this invention performs. 従来のパターン認識装置における学習フェーズの処理を説明するフローチャートである。It is a flowchart explaining the process of the learning phase in the conventional pattern recognition apparatus. 従来のパターン認識装置における認識フェーズの処理を説明するフローチャートである。It is a flowchart explaining the process of the recognition phase in the conventional pattern recognition apparatus. 従来のパターン認識装置の一連の処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a series of processes of the conventional pattern recognition apparatus.

まず従来技術について説明する。 First, the prior art will be described.

従来の学習フェーズにおける処理を実行するモジュールとＤＢ（データベース）の構成図を図６に示し、従来の認識フェーズにおける処理を実行するモジュールとＤＢの構成図を図７に示す。学習フェーズと認識フェーズとをまとめると、図８に示す構成となる。 FIG. 6 shows a configuration diagram of modules and DBs (databases) that execute processing in the conventional learning phase, and FIG. 7 shows a configuration diagram of modules and DBs that execute processing in the conventional recognition phase. The learning phase and the recognition phase are summarized as shown in FIG.

図６は、従来のパターン認識装置における学習フェーズの処理を説明するフローチャートである。 FIG. 6 is a flowchart for explaining the learning phase processing in the conventional pattern recognition apparatus.

パターン入力部２０１は、学習用パターンＤＢ２０７からパターンを取得し、取得されたパターンを特徴抽出部２０２に出力する。 The pattern input unit 201 acquires a pattern from the learning pattern DB 207 and outputs the acquired pattern to the feature extraction unit 202.

特徴抽出部２０２は、入力されたパターンからｎ次元ベクトルを抽出する。以下、特徴抽出部２０２で抽出されるｎ次元ベクトルをｎ次元特徴とよぶ。 The feature extraction unit 202 extracts an n-dimensional vector from the input pattern. Hereinafter, the n-dimensional vector extracted by the feature extraction unit 202 is referred to as an n-dimensional feature.

抽出されたｎ次元特徴は、特徴選択辞書生成部２０３と特徴選択部２０４とに出力される。 The extracted n-dimensional features are output to the feature selection dictionary generation unit 203 and the feature selection unit 204.

特徴抽出部２０２が実行する処理によって、パターン認識の対象のデータが音声又は画像であっても、入力されたパターンはｎ次元ベクトルとして表現される。したがってパターン認識装置は、パターンの種類に関わらず同一の処理を適用することができる。 By the process executed by the feature extraction unit 202, the input pattern is expressed as an n-dimensional vector even if the data for pattern recognition is audio or image. Therefore, the pattern recognition apparatus can apply the same processing regardless of the type of pattern.

以降の特徴選択辞書生成部２０３は、特徴選択用辞書２０８を生成するために複数のパターンのｎ次元特徴を必要とする場合がある。この場合、ｎ次元特徴が必要となるたびにパターン入力部２０１及び特徴抽出部２０２が処理を実行し、必要なｎ次元特徴を抽出してもよい。 The subsequent feature selection dictionary generation unit 203 may require n-dimensional features of a plurality of patterns in order to generate the feature selection dictionary 208. In this case, every time an n-dimensional feature is required, the pattern input unit 201 and the feature extraction unit 202 may execute processing to extract the necessary n-dimensional feature.

また、パターン入力部２０１及び特徴抽出部２０２が予め全ての学習用パターンについて処理を実行してｎ次元特徴に変換し、当該ｎ次元特徴を外部記憶装置１０７（図１参照）などに格納しておき、特徴選択辞書生成部２０３が必要となるたびに外部記憶装置１０７からｎ次元特徴を取得する方法でもよい。 In addition, the pattern input unit 201 and the feature extraction unit 202 execute processing for all learning patterns in advance to convert them into n-dimensional features, and store the n-dimensional features in the external storage device 107 (see FIG. 1) or the like. Alternatively, the n-dimensional feature may be acquired from the external storage device 107 whenever the feature selection dictionary generation unit 203 is required.

特徴選択辞書生成部２０３は、ｎ次元特徴をｍ次元特徴（ｍ≦ｎ）に変換するための変換関数を生成し、生成された変換関数を特徴選択用辞書２０８に格納する。 The feature selection dictionary generation unit 203 generates a conversion function for converting n-dimensional features into m-dimensional features (m ≦ n), and stores the generated conversion functions in the feature selection dictionary 208.

例えば、ｎ次元特徴をｘ、ｍ次元特徴をｙと表すと、変換関数ｆは、ｙ＝ｆ（ｘ）と表される。変換を線型変換に限定した場合、ｍ×ｎ行列Ｙを用いて、ｙ＝Ｙｘと表すことができる。また、カテゴリ毎に異なる変換関数を用いてｎ次元特徴をｍ次元特徴に変換する場合もある。この場合、カテゴリｋのｍ次元特徴ｙｋは、カテゴリｋの変換関数ｆｋを用いて、ｙｋ＝ｆｋ（ｘ）と表される。この場合、特徴選択辞書生成部２０３は、変換関数ｆ又はカテゴリ毎の変換関数ｆｋを生成し、生成された変換関数ｆ又はｆｋを特徴選択用辞書２０８に格納する。 For example, when an n-dimensional feature is represented by x and an m-dimensional feature is represented by y, the conversion function f is represented by y = f (x). When the conversion is limited to linear conversion, y = Yx can be expressed using an m × n matrix Y. In some cases, n-dimensional features are converted to m-dimensional features using different conversion functions for each category. In this case, the m-dimensional feature yk of the category k is expressed as yk = fk (x) using the conversion function fk of the category k. In this case, the feature selection dictionary generation unit 203 generates a conversion function f or a conversion function fk for each category, and stores the generated conversion function f or fk in the feature selection dictionary 208.

変換関数を生成する方法としては、主成分分析法や線型判別法などを用いる方法が考えられる。特徴選択の目的は、ｎ次元特徴から識別処理において有効な成分を抽出し、ｎ次元特徴の次元数を削減することによって、認識処理の高速化と高精度化とを実現するためである。 As a method for generating the conversion function, a method using a principal component analysis method, a linear discrimination method, or the like can be considered. The purpose of feature selection is to realize high-speed and high-precision recognition processing by extracting effective components in identification processing from n-dimensional features and reducing the number of dimensions of n-dimensional features.

特徴選択部２０４は、特徴選択用辞書２０８に格納されている変換関数を用いて、ｎ次元特徴をｍ次元特徴に変換する。変換関数をｆとすると、ｎ次元特徴ｘに対してｍ次元特徴ｙは、ｙ＝ｆ（ｘ）と表される。また、カテゴリ毎に異なる変換を行う場合、カテゴリ毎のｍ次元特徴ｙｋは、ｙｋ＝ｆｋ（ｘ）と表される。変換されたｍ次元特徴は、識別用辞書生成部２０５に出力される。 The feature selection unit 204 uses the conversion function stored in the feature selection dictionary 208 to convert the n-dimensional feature into an m-dimensional feature. When the conversion function is f, the m-dimensional feature y is expressed as y = f (x) with respect to the n-dimensional feature x. When different conversion is performed for each category, the m-dimensional feature yk for each category is expressed as yk = fk (x). The converted m-dimensional feature is output to the identification dictionary generation unit 205.

以降の識別用辞書生成部２０５は、識別用辞書２０９を生成するために複数のパターンのｍ次元特徴を必要とする場合がある。この場合、ｍ次元特徴が必要となるたびにパターン入力部２０１、特徴抽出部２０２及び特徴選択部２０４が処理を実行し、識別用辞書生成部２０５が必要なｍ次元特徴を取得してもよい。 The subsequent identification dictionary generation unit 205 may require m-dimensional features of a plurality of patterns in order to generate the identification dictionary 209. In this case, each time an m-dimensional feature is required, the pattern input unit 201, the feature extraction unit 202, and the feature selection unit 204 may perform processing, and the identification dictionary generation unit 205 may acquire the required m-dimensional feature. .

また、パターン入力部２０１、特徴抽出部２０２及び特徴選択部２０４が予め全ての学習用パターンについて処理を実行してｍ次元特徴に変換し、当該ｍ次元特徴を外部記憶装置１０７（図１参照）などに格納しておき、識別用辞書生成部２０５は必要となるたびに外部記憶装置１０７からｍ次元特徴を取得する方法であってもよい。 Further, the pattern input unit 201, the feature extraction unit 202, and the feature selection unit 204 execute processing for all the learning patterns in advance to convert them into m-dimensional features, and the m-dimensional features are converted into the external storage device 107 (see FIG. 1). The identification dictionary generating unit 205 may acquire the m-dimensional feature from the external storage device 107 whenever necessary.

識別用辞書生成部２０５は、詳細識別関数を生成する処理と大分類識別関数を生成する処理との２つの処理から構成される。 The identification dictionary generation unit 205 includes two processes: a process for generating a detailed identification function and a process for generating a large classification identification function.

詳細識別関数を生成する処理では、学習用パターンＤＢ２０７に格納されたパターンから生成されたｍ次元特徴と、当該ｍ次元特徴が所属カテゴリを示すラベルの集合とを用いて、詳細識別関数が生成される。 In the process of generating the detailed identification function, the detailed identification function is generated using the m-dimensional feature generated from the pattern stored in the learning pattern DB 207 and the set of labels indicating the category to which the m-dimensional feature belongs. The

また、大分類識別関数を生成する処理では、同様に、学習用パターンＤＢ２０７に格納されたパターンから生成されたｍ次元特徴と、当該ｍ次元特徴が所属カテゴリを示すラベルの集合とを用いて、詳細識別関数が生成される。生成された詳細識別関数と大分類識別関数とは識別用辞書２０９に記憶される。 Similarly, in the process of generating the large classification identification function, using the m-dimensional feature generated from the pattern stored in the learning pattern DB 207 and a set of labels indicating the category to which the m-dimensional feature belongs, A detailed discriminant function is generated. The generated detailed identification function and large classification identification function are stored in the identification dictionary 209.

まず、詳細識別関数を生成する処理について説明する。当該処理では、学習用パターンＤＢ２０７を用いて詳細識別関数が生成され、生成された詳細識別関数は識別用辞書２０９に格納される。 First, processing for generating a detailed identification function will be described. In this process, a detailed identification function is generated using the learning pattern DB 207, and the generated detailed identification function is stored in the identification dictionary 209.

詳細識別関数はカテゴリ毎に存在する。カテゴリｋの詳細識別関数ｕｋは、パターンのカテゴリｋに対する類似度を算出するための関数である。パターンのカテゴリｋに対する類似度は、ｕｋ（ｙ）で算出される。また、カテゴリ毎に異なるｍ次元特徴ｙｋが抽出された場合、パターンのカテゴリｋに対する類似度は、ｕｋ（ｙｋ）で算出される。 A detailed identification function exists for each category. The detailed identification function uk of the category k is a function for calculating the similarity of the pattern to the category k. The similarity of the pattern to category k is calculated by uk (y). When different m-dimensional features yk are extracted for each category, the similarity of the pattern to category k is calculated as uk (yk).

詳細識別関数の生成アルゴリズムは、最近傍法、パーセプトロン、改良投影距離法、修正二次識別、一般学習ベクトル量子化、多項式ネットワーク、及びサポートベクトルマシンなどがある（例えば、非特許文献１、非特許文献２参照）。 The generation algorithm of the detailed discrimination function includes a nearest neighbor method, a perceptron, an improved projection distance method, a modified secondary discrimination, a general learning vector quantization, a polynomial network, and a support vector machine (for example, Non-Patent Document 1, Non-Patent Document 1). Reference 2).

次に、大分類識別関数を生成する処理について説明する。当該処理では、学習用パターンＤＢ２０７を用いて大分類識別関数が生成され、生成された大分類識別関数は識別用辞書２０９に格納される。 Next, processing for generating a large classification identification function will be described. In this processing, a large classification identification function is generated using the learning pattern DB 207, and the generated large classification identification function is stored in the identification dictionary 209.

まず、大分類識別処理を実行する目的について説明する。 First, the purpose of executing the large classification identification process will be described.

大分類識別関数は、詳細識別関数を用いた詳細識別処理よりも粗い識別処理を高速に行い、全カテゴリの中から正解カテゴリとなる候補を絞るためのものである。 The large classification identification function is for performing coarse identification processing faster than the detailed identification processing using the detailed identification function, and narrowing down candidates for the correct category from all categories.

詳細識別関数のみが実行される場合、パターン認識装置は、全カテゴリに対してパターンの各々のカテゴリに対する類似度を、詳細識別関数ｕｋ（ｙ）又はｕｋ（ｙｋ）を用いて算出しなければならない。 When only the detailed identification function is executed, the pattern recognition apparatus must calculate the similarity for each category of the pattern with respect to all categories using the detailed identification function uk (y) or uk (yk). .

しかし、前述した方法では、例えば、数千カテゴリ以上を認識対象とする漢字認識などでは、処理時間が莫大となる。そのため、大分類識別処理及び詳細識別処理の二段階の処理によって識別処理を高速化する。大分類識別処理では、正解候補が大雑把に絞り込まれる。 However, in the above-described method, for example, in kanji recognition for which several thousand categories or more are to be recognized, the processing time is enormous. For this reason, the speed of the identification process is increased by two stages of the large classification identification process and the detailed identification process. In the large classification identification process, correct answer candidates are roughly narrowed down.

高速な識別関数を用いる方法としては、例えば、線型識別関数などの計算量が少ない識別関数を大分類用に用いる方法である。前述した方法は、計算量が少ない識別関数を用いる点を除いて、詳細識別関数の生成アルゴリズムと同様である。 As a method using a high-speed discriminant function, for example, a discriminant function having a small calculation amount such as a linear discriminant function is used for large classification. The method described above is the same as the generation algorithm of the detailed discriminant function except that an discriminant function with a small amount of calculation is used.

識別するカテゴリの数を減らして大分類を行う方法は、複数のカテゴリをまとめて一つのカテゴリとして扱い識別対象となるカテゴリ数を減らしたり、分布をクラスタ化したりすることで行われる。例えば、アルファベットの認識で、特徴空間上で近い文字種（「Ｉ」と「ｌ」、又は「Ｋ」と「ｋ」など）を同一カテゴリとして扱うなどの方法がある。 A method of performing large classification by reducing the number of categories to be identified is performed by collecting a plurality of categories as one category, reducing the number of categories to be identified, or clustering the distribution. For example, in alphabet recognition, there is a method of treating character types (“I” and “l” or “K” and “k”, etc.) that are close in the feature space as the same category.

大分類識別関数には、詳細識別関数とは異なる特徴空間が用いられる場合がある。前述の場合、大分類識別処理用の特徴選択関数が生成され、生成された大分類識別処理用の特徴選択関数が特徴選択用辞書２０８に格納される。大分類識別処理に用いられる特徴をｚ、変換関数をｇと表すと、大分類識別処理に用いられる特徴はｚ＝ｇ（ｘ）と算出される。大分類カテゴリｃの大分類識別関数をｖｃと表すと、カテゴリｃに対する類似度は、大分類識別関数ｖｃ（ｚ）によって算出される。 A feature space different from the detailed discriminant function may be used for the major discriminant function. In the above case, the feature selection function for the large classification identification process is generated, and the generated feature selection function for the large classification identification process is stored in the feature selection dictionary 208. When the feature used for the large classification identification process is expressed as z and the conversion function is expressed as g, the characteristic used for the large classification identification process is calculated as z = g (x). When the large classification identification function of the large classification category c is represented as vc, the similarity to the category c is calculated by the large classification identification function vc (z).

図７は、従来のパターン認識装置における認識フェーズの処理を説明するフローチャートである。 FIG. 7 is a flowchart for explaining the process of the recognition phase in the conventional pattern recognition apparatus.

パターン入力部２０１は、認識対象パターンＤＢ３０４からパターンを取得し、取得されたパターンを特徴抽出部２０２に出力する。 The pattern input unit 201 acquires a pattern from the recognition target pattern DB 304 and outputs the acquired pattern to the feature extraction unit 202.

特徴抽出部２０２は、入力されたパターンからｎ次元特徴を抽出し、抽出されたｎ次元特徴を特徴選択部２０４に出力する。 The feature extraction unit 202 extracts n-dimensional features from the input pattern, and outputs the extracted n-dimensional features to the feature selection unit 204.

特徴選択部２０４は、特徴選択用辞書２０８に格納されている変換関数を用いて、ｎ次元特徴をｍ次元特徴に変換する。ｍ次元特徴ｙは、変換関数ｆを用いてｙ＝ｆ（ｘ）と表される。カテゴリ毎に異なる変換が行われる場合、カテゴリｋのｍ次元特徴ｙｋは、変換関数ｆｋを用いてｙｋ＝ｆｋ（ｘ）と表される。変換されたｍ次元特徴は、大分類識別部３０１に出力される。 The feature selection unit 204 uses the conversion function stored in the feature selection dictionary 208 to convert the n-dimensional feature into an m-dimensional feature. The m-dimensional feature y is expressed as y = f (x) using the conversion function f. When different conversion is performed for each category, the m-dimensional feature yk of the category k is expressed as yk = fk (x) using the conversion function fk. The converted m-dimensional feature is output to the large classification identifying unit 301.

また、大分類識別処理において、詳細識別処理とは異なる特徴ｚが用いられる場合、大分類用の特徴選択関数ｇを用いてｚ＝ｇ（ｘ）が算出され、大分類識別部３０１に出力される。カテゴリ毎に異なる特徴が用られる場合、大分類カテゴリｃ毎に特徴ｚｃ＝ｇｃ（ｘ）が算出され、算出された特徴ｚｃが大分類識別部３０１に出力される。 In addition, when a feature z different from the detailed identification process is used in the large classification identification process, z = g (x) is calculated using the large classification feature selection function g and is output to the large classification identification unit 301. The When a different feature is used for each category, the feature zc = gc (x) is calculated for each major classification category c, and the calculated feature zc is output to the major classification identification unit 301.

大分類識別部３０１は、識別用辞書２０９に格納されている大分類識別関数を用いて、入力されたパターンが所属する正解カテゴリの候補を算出し、ｍ次元特徴と正解カテゴリ候補とを詳細識別部３０２に出力する。 The large classification identifying unit 301 uses the large classification identification function stored in the identification dictionary 209 to calculate a correct category candidate to which the input pattern belongs, and performs detailed identification of the m-dimensional feature and the correct category candidate. The data is output to the unit 302.

詳細識別部３０２は、識別用辞書２０９に格納されている詳細識別関数を用いて、正解候補カテゴリに対する、入力されたパターンの類似度を算出し、算出された類似度を認識結果出力部３０３に出力する。 The detailed identification unit 302 uses the detailed identification function stored in the identification dictionary 209 to calculate the similarity of the input pattern with respect to the correct answer candidate category, and the calculated similarity to the recognition result output unit 303. Output.

認識結果出力部３０３は、正解候補カテゴリに対する類似度を用いて、最終的な認識結果を出力する。 The recognition result output unit 303 outputs the final recognition result using the similarity to the correct candidate category.

通常、最も類似度が高いカテゴリを認識結果として出力する。なお、類似度が二番目以降のカテゴリも、二位候補、三位候補、などとして出力する場合もある。 Usually, the category with the highest similarity is output as the recognition result. It should be noted that categories whose similarity is second or later may also be output as second candidates, third candidates, and the like.

また、類似度が指定閾値よりも小さい場合、どのカテゴリにも該当しないとして、棄却する場合がある。例えば、数字認識の場合に、漢字が入力された場合などが該当する。また、類似度の第一位と第二位との差が指定閾値よりも小さい場合には、どちらのカテゴリであるかを判定することが難しいとして、棄却する場合がある。例えば、アルファベットの「Ｉ」や「ｌ」と数字の「１」など、フォントによっては識別し難いため、棄却する。 If the similarity is smaller than the specified threshold value, it may be rejected as not corresponding to any category. For example, in the case of digit recognition, this applies to the case where a kanji is input. In addition, when the difference between the first and second similarities is smaller than the specified threshold, it may be rejected because it is difficult to determine which category. For example, the letters “I” and “l” and the number “1” are rejected because they are difficult to identify depending on the font.

ここで、「棄却する」は、入力されたパターンに対して該当するカテゴリが無い旨を示す情報を出力する等が考えられる。 Here, “reject” may include outputting information indicating that there is no corresponding category for the input pattern.

大分類識別部３０１と詳細識別部３０２とにおいて、大分類識別処理と詳細識別処理との二段階の識別処理を実行する目的は、識別処理の高速化である。まず、粗い識別処理によって大雑把に正解候補を絞り、絞られた正解候補に対して詳細な識別処理を実行する。 The purpose of executing the two-stage identification process of the large classification identification process and the detailed identification process in the large classification identification unit 301 and the detailed identification unit 302 is to speed up the identification process. First, the correct answer candidates are roughly narrowed down by rough identification processing, and detailed identification processing is executed on the narrowed correct answer candidates.

大分類識別処理に用いられる大分類識別関数は、高速に計算できること、及び大分類識別関数によって算出される正解候補カテゴリの中に正解カテゴリが高精度で含まれることが要求される。 The large classification identification function used in the large classification identification process is required to be able to be calculated at high speed, and that the correct answer category is included in the correct candidate category calculated by the large classification identification function with high accuracy.

大分類識別処理に用いられる大分類識別関数の作成には、線型判別関数を用いたもの（例えば、非特許文献１参照）や、一般学習ベクトル量子化を用いたクラスタリング手法（例えば、特許文献１参照）などが使われる。 In order to create a major classification function used in the major classification processing, a method using a linear discriminant function (see, for example, Non-Patent Document 1) or a clustering method using general learning vector quantization (for example, Patent Document 1). For example).

パターン認識では、認識処理の高速化のため、まず、高速に計算できる大分類識別関数によって、正解候補カテゴリを絞る粗い識別処理が実行される。その後、正解候補カテゴリに対して詳細識別関数を用いて、正解候補カテゴリの各々に対する入力パターンの類似度が算出され、最終的な認識結果が出力される。 In pattern recognition, in order to speed up recognition processing, first, rough identification processing for narrowing down correct candidate categories is performed by a large classification identification function that can be calculated at high speed. Thereafter, using the detailed identification function for the correct candidate category, the similarity of the input pattern for each of the correct candidate categories is calculated, and the final recognition result is output.

認識フェーズ時には、パターン認識装置は、代表ベクトルと入力パターンとの距離を算出し、入力パターンとの距離が近い代表ベクトルが代表するカテゴリ群に含まれるカテゴリを正解候補カテゴリとする。前述のようにして算出された正解候補カテゴリの各々に対して詳細識別処理が実行される。この場合、通常、詳細識別処理にもテンプレート方式の方法が用いられる。すなわち、予め、各カテゴリを代表する代表ベクトルを学習によって作成しておき、認識フェーズ時には、パターン認識装置が、入力パターンと代表ベクトルとの距離から入力パターンと各々のカテゴリとの類似度を算出する。 During the recognition phase, the pattern recognition apparatus calculates the distance between the representative vector and the input pattern, and sets the category included in the category group represented by the representative vector that is close to the input pattern as the correct candidate category. Detailed identification processing is executed for each of the correct candidate categories calculated as described above. In this case, a template method is usually used for the detailed identification process. That is, representative vectors representing each category are created in advance by learning, and in the recognition phase, the pattern recognition device calculates the similarity between the input pattern and each category from the distance between the input pattern and the representative vector. .

図８は、従来のパターン認識装置の一連の処理の流れを示すフローチャートである。各モジュールが実行する処理は、図６及び図７と同一であるため説明を省略する。 FIG. 8 is a flowchart showing a flow of a series of processes of the conventional pattern recognition apparatus. The processing executed by each module is the same as that shown in FIGS.

（第１の実施形態）
本発明のパターン認識装置の実施例について、図表を参照しながら説明する。 (First embodiment)
Embodiments of the pattern recognition apparatus of the present invention will be described with reference to the drawings.

図１は、本発明の第１の実施形態のパターン認識装置の構成の一例を示すブロック図である。 FIG. 1 is a block diagram showing an example of the configuration of the pattern recognition apparatus according to the first embodiment of the present invention.

パターン認識装置１０１は、入力装置１０２、表示装置１０３、パターン取得装置１０４、通信装置１０５、演算装置（ＣＰＵ）１０６、及び外部記憶装置１０７を備える。 The pattern recognition device 101 includes an input device 102, a display device 103, a pattern acquisition device 104, a communication device 105, a computing device (CPU) 106, and an external storage device 107.

入力装置１０２は、演算装置（ＣＰＵ）１０６で実行されるプログラムの制御、その他接続機器の制御のために実行されるコマンド等を入力するための装置である。入力装置１０２は、例えば、コマンド等を入力するためのキーボードやマウス等である。 The input device 102 is a device for inputting a command executed for controlling a program executed by the arithmetic unit (CPU) 106 and other connected devices. The input device 102 is, for example, a keyboard or a mouse for inputting a command or the like.

表示装置１０３は、処理内容を表示するディスプレイ等の装置である。 The display device 103 is a device such as a display that displays processing contents.

パターン取得装置１０４は、スキャナやマイク等のパターンを取得するための装置である。取得されたパターンは、外部記憶装置１０７等に格納されてもよい。 The pattern acquisition device 104 is a device for acquiring a pattern such as a scanner or a microphone. The acquired pattern may be stored in the external storage device 107 or the like.

通信装置１０５は、ＰＣやサーバ等の外部機器との間でデータのやりとりを行うために用いられる装置である。通信装置１０５は、外部機器から送信される実行コマンドの取得、外部機器から画像や音声などのパターンを取得する。また、通信装置１０５は、パターン認識装置１０１において実行される処理の内容を外部機器に送信する。 The communication device 105 is a device used for exchanging data with an external device such as a PC or a server. The communication device 105 acquires an execution command transmitted from an external device, and acquires a pattern such as an image or sound from the external device. In addition, the communication device 105 transmits the content of processing executed in the pattern recognition device 101 to an external device.

演算装置（ＣＰＵ）１０６は、外部記憶装置１０７に格納されるプログラム等を実行し、認識処理を実行する装置である。例えば、演算装置（ＣＰＵ）１０６は、学習用パターンＤＢ２０７（図２参照）を用いて特徴選択用辞書５０３（図２参照）及び識別用辞書５０４（図２参照）を作成し、また、特徴選択用辞書５０３（図２参照）と識別用辞書５０４（図２参照）を用いての認識対象パターンの認識処理等を実行する。 The arithmetic device (CPU) 106 is a device that executes a program stored in the external storage device 107 and executes recognition processing. For example, the arithmetic unit (CPU) 106 creates a feature selection dictionary 503 (see FIG. 2) and an identification dictionary 504 (see FIG. 2) using the learning pattern DB 207 (see FIG. 2), and also selects features. A recognition target pattern recognition process or the like is executed using the dictionary 503 (see FIG. 2) and the identification dictionary 504 (see FIG. 2).

外部記憶装置１０７は、ＨＤＤ及びメモリ等の外部記憶装置である。外部記憶装置１０７には、学習用パターンＤＢ２０７（図２参照）、認識対象パターンＤＢ３０４（図３参照）、特徴選択用辞書５０３（図２参照）及び識別用辞書５０４（図２参照）が格納される。また、外部記憶装置１０７は、演算装置（ＣＰＵ）１０６が処理を実行するためのプログラム（モジュール）を格納し、また、演算装置（ＣＰＵ）１０６が実行する処理の処理結果等を一時的に格納する。 The external storage device 107 is an external storage device such as an HDD and a memory. The external storage device 107 stores a learning pattern DB 207 (see FIG. 2), a recognition target pattern DB 304 (see FIG. 3), a feature selection dictionary 503 (see FIG. 2), and an identification dictionary 504 (see FIG. 2). The The external storage device 107 stores a program (module) for the processing unit (CPU) 106 to execute processing, and temporarily stores processing results of processing executed by the processing unit (CPU) 106. To do.

なお、パターン認識装置１０１は、入力装置１０２、表示装置１０３、パターン取得装置１０４、又は通信装置１０５を備えていなくてもよい。 The pattern recognition apparatus 101 may not include the input device 102, the display device 103, the pattern acquisition device 104, or the communication device 105.

パターン認識装置１０１が入力装置１０２を備えていない場合、通信装置１０５を用いて外部機器から処理の開始を指示する方法、又は、時刻指定等によって処理を自動的に実行する方法を用いることが考えられる。 When the pattern recognition device 101 does not include the input device 102, it is possible to use a method of instructing the start of processing from an external device using the communication device 105, or a method of automatically executing processing by time designation or the like. It is done.

パターン認識装置１０１が表示装置１０３を備えていない場合、通信装置１０５を用いて外部機器に処理結果を送信する方法、又は、外部記憶装置１０７に処理結果を格納する方法が考えられる。 When the pattern recognition device 101 does not include the display device 103, a method of transmitting the processing result to the external device using the communication device 105 or a method of storing the processing result in the external storage device 107 can be considered.

処理を実行するモジュールへの出力及び入力は、外部記憶装置１０７を介して行ってもよい。例えば、処理を実行するモジュールを処理部１及び処理部２とする場合、処理部１で実行された処理結果を処理部２が入力として受け取る場合、処理部１が処理結果を外部記憶装置１０７に格納しておき、処理部２が外部記憶装置１０７に格納されている処理結果を入力として取得してもよい。 Output and input to a module that executes processing may be performed via the external storage device 107. For example, when the processing module is the processing unit 1 and the processing unit 2 and the processing unit 2 receives the processing result executed by the processing unit 1 as an input, the processing unit 1 stores the processing result in the external storage device 107. Alternatively, the processing unit 2 may acquire the processing result stored in the external storage device 107 as an input.

ユーザは、入力装置１０２を用いて処理を実行するモジュールを制御する。また、処理の実行結果は、表示装置１０３を介して表示される。 The user uses the input device 102 to control a module that executes processing. In addition, the execution result of the process is displayed via the display device 103.

次に、本発明の実施形態におけるパターン認識装置１０１が実行する処理について説明する。 Next, processing executed by the pattern recognition apparatus 101 according to the embodiment of the present invention will be described.

パターン認識装置１０１が実行する処理は、学習フェーズと認識フェーズから構成される。学習フェーズでは、学習用パターンＤＢ２０７（図２参照）を用いて特徴選択用辞書５０３（図２参照）と識別用辞書５０４（図２参照）とが作成される。認識フェーズでは、特徴選択用辞書５０３（図２参照）と識別用辞書５０４（図２参照）とを用いて、入力パターンが認識される。 The process executed by the pattern recognition apparatus 101 includes a learning phase and a recognition phase. In the learning phase, a feature selection dictionary 503 (see FIG. 2) and an identification dictionary 504 (see FIG. 2) are created using the learning pattern DB 207 (see FIG. 2). In the recognition phase, the input pattern is recognized using the feature selection dictionary 503 (see FIG. 2) and the identification dictionary 504 (see FIG. 2).

図２は、本発明の第１の実施形態のパターン認識装置１０１における学習フェーズの処理を実行するモジュールとＤＢ（データベース）の構成を説明するフローチャートである。図３は、本発明の第１の実施形態のパターン認識装置１０１における認識フェーズの処理を実行するモジュールとＤＢ（データベース）の構成を説明するフローチャートである。 FIG. 2 is a flowchart illustrating the configuration of a module and a DB (database) that execute the learning phase process in the pattern recognition apparatus 101 according to the first embodiment of the present invention. FIG. 3 is a flowchart illustrating the configuration of a module and a DB (database) that execute processing in the recognition phase in the pattern recognition apparatus 101 according to the first embodiment of the present invention.

なお、パターン認識装置１０１は、学習フェーズにおける処理を実行する装置（認識辞書生成装置）と認識フェーズにおける処理を実行する装置（認識装置）とから構成されていてもよい。その場合、認識辞書生成装置は、図２に示すモジュールを備え、学習用パターンＤＢ２０７を用いて特徴選択用辞書５０３と識別用辞書５０４とを生成する。また、認識装置は、図３に示すモジュールを備え、認識辞書生成装置によって生成された特徴選択辞書５０３と識別用辞書５０４とを用いて、入力パターンを認識する。 Note that the pattern recognition apparatus 101 may include an apparatus (recognition dictionary generation apparatus) that executes processing in the learning phase and an apparatus (recognition apparatus) that executes processing in the recognition phase. In that case, the recognition dictionary generation apparatus includes the module shown in FIG. 2 and generates a feature selection dictionary 503 and an identification dictionary 504 using the learning pattern DB 207. Further, the recognition apparatus includes the module shown in FIG. 3 and recognizes an input pattern using the feature selection dictionary 503 and the identification dictionary 504 generated by the recognition dictionary generation apparatus.

本発明では、従来の課題を解決するため、学習フェーズにおいて、大分類用特徴変換関数と大分類識別関数とをそれぞれ特徴選択用辞書５０３と識別用辞書５０４とに出力する識別関数主要部分抽出部５０２を、パターン認識装置１０１が備えることに特徴がある。 In the present invention, in order to solve the conventional problem, in the learning phase, an identification function main part extraction unit that outputs a large classification feature conversion function and a large classification identification function to the feature selection dictionary 503 and the identification dictionary 504, respectively. The pattern recognition apparatus 101 includes 502.

また、本発明は、認識フェーズにおいて、学習フェーズで生成された特徴選択用辞書５０３を用いて大分類用の特徴を生成する大分類特徴選択部６０１、識別用辞書５０４を用いて大分類識別処理を実行する大分類識別部６０２、大分類識別処理で取得された正解候補カテゴリに対して、詳細な識別処理を実行する詳細識別部６０３を、パターン認識装置１０１が備えることに特徴がある。 Further, according to the present invention, in the recognition phase, a large classification feature selection unit 601 that generates a feature for large classification using the feature selection dictionary 503 generated in the learning phase, and a large classification identification process using the identification dictionary 504 The pattern recognition apparatus 101 includes a large classification identifying unit 602 that executes a detailed classification and a detailed identifying unit 603 that performs a detailed identification process on a correct candidate category acquired in the large classification identifying process.

より具体的には、本発明では、詳細分類識別関数を低次元の部分特徴空間に制限して生成された関数が大分類識別関数として用いられる。当該低次元の部分特徴空間は、詳細分類用の識別関数の振る舞いをよく記述するように選択される。したがって、本発明における大分類識別関数は、詳細分類識別関数を近似したものとみなすことができる。 More specifically, in the present invention, a function generated by limiting the detailed classification identification function to a low-dimensional partial feature space is used as the large classification identification function. The low-dimensional partial feature space is selected so as to well describe the behavior of the discriminant function for detailed classification. Therefore, the large classification discriminant function in the present invention can be regarded as an approximation of the detailed classification discriminant function.

本発明では、任意の学習アルゴリズムによって作成された詳細分類用の識別関数から大分類識別関数が作成されるため、パターン認識装置１０１は、精度が高い識別関数を用いることで、認識精度を向上させることができる。また、本発明における大分類識別関数は、詳細分類用の識別関数を部分特徴空間に制限したものであるため、大分類識別処理と詳細識別処理とで学習アルゴリズムが異なるという構成の煩雑さも回避できる。 In the present invention, since a large classification discriminant function is created from a discriminant function for detailed classification created by an arbitrary learning algorithm, the pattern recognition apparatus 101 improves recognition accuracy by using a discriminating function with high accuracy. be able to. In addition, since the classification function in the present invention is a classification function for detailed classification limited to a partial feature space, it is possible to avoid the complexity of the configuration in which the learning algorithm is different between the large classification identification process and the detailed identification process. .

以下では、各フェーズの処理について図２を用いて説明する。まず、学習フェーズについて説明する。 Below, the process of each phase is demonstrated using FIG. First, the learning phase will be described.

パターン認識装置１０１は、学習フェーズのモジュール及びＤＢとして、パターン入力部２０１、特徴抽出部２０２、特徴選択辞書生成部２０３、特徴選択部２０４、識別関数生成部５０１、識別関数主要部分抽出部５０２、学習用パターンＤＢ２０７、特徴選択用辞書５０３、及び識別用辞書５０４を備える。 The pattern recognition apparatus 101 includes a pattern input unit 201, a feature extraction unit 202, a feature selection dictionary generation unit 203, a feature selection unit 204, an identification function generation unit 501, an identification function main part extraction unit 502, as a learning phase module and DB. A learning pattern DB 207, a feature selection dictionary 503, and an identification dictionary 504 are provided.

学習フェーズでは、パターン認識装置１０１は、学習用パターンＤＢ２０７を用いて、認識フェーズで用いる特徴選択用辞書５０３と識別用辞書５０４とを生成する。 In the learning phase, the pattern recognition apparatus 101 uses the learning pattern DB 207 to generate a feature selection dictionary 503 and an identification dictionary 504 that are used in the recognition phase.

学習用パターンＤＢ２０７は、学習用に予め作成され、所属カテゴリを示す正解ラベルが付与されたパターンをの集合である。学習用のパターンは、パターン取得装置１０４などを用いて作成される。 The learning pattern DB 207 is a set of patterns that are created in advance for learning and are given a correct answer label indicating an affiliation category. The learning pattern is created using the pattern acquisition device 104 or the like.

パターンは、例えば、画像データや音声データなどである。パターンの個数は、通常数十個以上であり、数千万個となる場合もある。例えば、漢字認識の場合、数千万個以上のパターンが格納される学習用パターンＤＢ２０７が学習用に用いられる。 The pattern is, for example, image data or audio data. The number of patterns is usually several tens or more, and may be tens of millions. For example, in the case of kanji recognition, a learning pattern DB 207 storing tens of millions or more patterns is used for learning.

正解ラベルは、例えば、認識対象のカテゴリの各々に番号を対応させて表現してもよいし、文字認識の場合には、ＥＵＣコード、ＪＩＳコード又はＳＪＩＳコードなどの文字コードを用いてもよい。後述する処理では、処理中のパターンの正解ラベルが分かるように、パターンと正解ラベルとの対応関係が失われないようにしておく。例えば、パターンのヘッダ部分に、当該パターンが所属カテゴリを示すラベルを記録しておくなどとすればよい。 For example, the correct label may be expressed by associating a number with each recognition target category, or in the case of character recognition, a character code such as an EUC code, a JIS code, or an SJIS code may be used. In the processing to be described later, the correspondence between the pattern and the correct label is not lost so that the correct label of the pattern being processed can be understood. For example, a label indicating the category to which the pattern belongs may be recorded in the header portion of the pattern.

なお、学習用パターンＤＢ２０７、特徴選択用辞書５０３及び識別用辞書５０４は、外部記憶装置１０７によって実現される。 The learning pattern DB 207, the feature selection dictionary 503, and the identification dictionary 504 are realized by the external storage device 107.

パターン入力部２０１は、学習用パターンＤＢ２０７から学習用に用いるパターンを取得し、取得されたパターンを特徴抽出部２０２に出力する。 The pattern input unit 201 acquires a pattern used for learning from the learning pattern DB 207, and outputs the acquired pattern to the feature extraction unit 202.

特徴抽出部２０２は、パターン入力部２０１から入力された各パターンから認識に有効な成分としてｎ次元ベクトルを抽出する。このとき生成されるｎ次元ベクトルをｎ次元特徴とよぶ。これによって、各パターンは、ｎ次元特徴として表現される。抽出されたｎ次元特徴は、特徴選択辞書生成部２０３及び特徴選択部２０４に出力される。 The feature extraction unit 202 extracts an n-dimensional vector as a component effective for recognition from each pattern input from the pattern input unit 201. The n-dimensional vector generated at this time is called an n-dimensional feature. Thus, each pattern is expressed as an n-dimensional feature. The extracted n-dimensional features are output to the feature selection dictionary generation unit 203 and the feature selection unit 204.

以降の特徴選択辞書生成部２０３は、特徴選択用辞書５０３を生成するために複数のパターンのｎ次元特徴を必要とする場合がある。この場合、ｎ次元特徴が必要となるたびにパターン入力部２０１及び特徴抽出部２０２が処理を実行し、特徴選択辞書生成部２０３が必要なｎ次元特徴を取得してもよい。 The subsequent feature selection dictionary generation unit 203 may require n-dimensional features of a plurality of patterns in order to generate the feature selection dictionary 503. In this case, every time an n-dimensional feature is required, the pattern input unit 201 and the feature extraction unit 202 may execute processing, and the feature selection dictionary generation unit 203 may acquire the required n-dimensional feature.

また、パターン入力部２０１及び特徴抽出部２０２が予め全ての学習用パターンについて処理してｎ次元特徴に変換し、当該ｎ次元特徴を外部記憶装置１０７などに格納しておき、特徴選択辞書生成部２０３が必要となるたびに外部記憶装置１０７からｎ次元特徴を取得する方法であってもよい。 Further, the pattern input unit 201 and the feature extraction unit 202 process all the learning patterns in advance to convert them into n-dimensional features, store the n-dimensional features in the external storage device 107 or the like, and the feature selection dictionary generation unit A method of acquiring n-dimensional features from the external storage device 107 every time 203 is required may be used.

特徴抽出部２０２が処理を実行することによってパターンは、ｎ次元特徴として表現される。この処理によって、入力されたデータが音声であっても画像であっても、パターンはｎ次元ベクトル値として表現されるため、パターン認識装置１０１は、パターンの種類に関わらず同一の処理を適用することができる。 The pattern is expressed as an n-dimensional feature as the feature extraction unit 202 executes the process. With this processing, the pattern recognition apparatus 101 applies the same processing regardless of the type of the pattern because the pattern is expressed as an n-dimensional vector value regardless of whether the input data is sound or image. be able to.

ここで、例として、文字認識における特徴抽出部２０２の処理について説明する。 Here, as an example, processing of the feature extraction unit 202 in character recognition will be described.

図５は、本発明の第１の実施形態の文字認識における特徴抽出部２０２が実行する処理の一例を説明するフローチャートである。 FIG. 5 is a flowchart illustrating an example of processing executed by the feature extraction unit 202 in character recognition according to the first embodiment of this invention.

入力ステップ８０１では、特徴抽出部２０２が、パターン入力部２０１から出力された画像を取り込む。 In input step 801, the feature extraction unit 202 captures the image output from the pattern input unit 201.

前処理ステップ８０２では、特徴抽出部２０２が、取り込まれた入力画像に対してノイズ除去及びぼかし処理等を実行し、文字認識に障害となる要因となるノイズ及びかすれ等を除去する。例えば、ノイズ除去処理では、ある閾値以下の大きさの孤立点が除去される。 In pre-processing step 802, the feature extraction unit 202 performs noise removal and blurring processing on the input image that has been taken in, and removes noise, blurring, and the like that cause obstacles to character recognition. For example, in the noise removal process, an isolated point having a size equal to or smaller than a certain threshold is removed.

正規化ステップ８０３では、特徴抽出部２０２が、前処理が実行された画像の各々を予め指定された固定サイズの画像に変換する。当該処理によって、様々なサイズの入力画像の大きさを揃えることができ、画像サイズに依存する処理を統一することができる。 In the normalization step 803, the feature extraction unit 202 converts each of the preprocessed images into a fixed size image designated in advance. By this processing, the sizes of input images of various sizes can be made uniform, and processing depending on the image size can be unified.

正規化法には、線型正規化法、非線型正規化法、又はモーメント正規化法などがある（非特許文献１参照）。例えば、入力画像が２値画像の場合、線型正規化法では、文字部分を示す黒画素を囲む最小の矩形によって囲まれる部分画像を固定サイズの画像に拡大又は縮小することで正規化画像が生成される。 Examples of the normalization method include a linear normalization method, a non-linear normalization method, and a moment normalization method (see Non-Patent Document 1). For example, when the input image is a binary image, in the linear normalization method, a normalized image is generated by enlarging or reducing a partial image surrounded by a minimum rectangle surrounding a black pixel indicating a character portion to a fixed size image. Is done.

文字特徴抽出ステップ８０４では、特徴抽出部２０２が、正規化によって生成された画像をｎ次元特徴に変換する。 In the character feature extraction step 804, the feature extraction unit 202 converts the image generated by normalization into an n-dimensional feature.

一例として、最も簡単な画素特徴抽出について説明する。画素特徴抽出では、各画素の画素値を特徴として用いる。例えば、入力画像が縦横２０×２０のグレー画像で、各画素の画素値が０〜２５５の整数値で表現されている場合、画素数は４００となり、抽出される特徴は４００次元特徴となる。４００次元特徴の各成分は、入力画像の０〜２５５の画素値である。 As an example, the simplest pixel feature extraction will be described. In pixel feature extraction, the pixel value of each pixel is used as a feature. For example, when the input image is a 20 × 20 gray image, and the pixel value of each pixel is expressed by an integer value of 0 to 255, the number of pixels is 400, and the extracted features are 400-dimensional features. Each component of the 400-dimensional feature is a pixel value of 0 to 255 of the input image.

出力ステップ８０５では、特徴抽出部２０２が、変換されたｎ次元特徴を特徴選択辞書生成部２０３及び特徴選択部２０４に出力する。 In the output step 805, the feature extraction unit 202 outputs the converted n-dimensional feature to the feature selection dictionary generation unit 203 and the feature selection unit 204.

以上が、文字認識における特徴抽出部２０２の処理の一例である。 The above is an example of the process of the feature extraction unit 202 in character recognition.

図２の説明に戻る。 Returning to the description of FIG.

特徴選択辞書生成部２０３は、後述する特徴選択部２０４がｎ次元特徴からｍ次元特徴を抽出するために用いられる特徴選択用辞書を生成する。 The feature selection dictionary generation unit 203 generates a feature selection dictionary used by the feature selection unit 204 described later to extract m-dimensional features from n-dimensional features.

ここで、まず、ｎ次元特徴からｍ次元特徴を抽出する理由について説明する。 Here, first, the reason for extracting the m-dimensional feature from the n-dimensional feature will be described.

特徴抽出部２０２が抽出したｎ次元特徴からｍ次元特徴を抽出する目的は、認識処理の高精度化と高速化とを実現するためである。 The purpose of extracting the m-dimensional feature from the n-dimensional feature extracted by the feature extraction unit 202 is to realize high accuracy and high speed of the recognition process.

まず、高速化について説明する。パターン認識装置１０１が学習又は認識するの時に要する計算量は、少なくとも次元数のべき乗のオーダーとなる。 First, speeding up will be described. The amount of calculation required when the pattern recognition apparatus 101 learns or recognizes is at least an order of the power of the number of dimensions.

例えば、ｎ×ｎの共分散行列が用いられる計算においては、ｎの二乗のオーダーの計算量を要する。また、識別関数の次数がｓの場合、識別関数の計算にｓ乗のオーダーの計算量を要する。 For example, in a calculation in which an n × n covariance matrix is used, a calculation amount in the order of the square of n is required. Further, when the order of the discriminant function is s, calculation of the discriminant function requires a calculation amount in the order of s-th power.

したがって、計算量を抑えるためには、識別処理への寄与が低い特徴成分を削減し、次元数を削減することが必要となる。また、次元数が大きくなると、互いに相関の高い特徴が混入する割合が大きくなり、計算量に見合った効果が得られない。そのため、相関が高い特徴を統合することによって、計算量を削減することが有効である。 Therefore, in order to reduce the amount of calculation, it is necessary to reduce feature components that contribute less to the identification process and to reduce the number of dimensions. Further, as the number of dimensions increases, the ratio of features with high correlation to each other increases, and an effect commensurate with the amount of calculation cannot be obtained. Therefore, it is effective to reduce the amount of calculation by integrating features having high correlation.

次に、高精度化について説明する。パターン認識装置１０１が高次元空間において有限個の学習パターンから識別関数を生成する場合、次元数を高くすると精度の低下を招く。これは、次元数の増加に応じて識別関数の推定パラメータ数が増大し、有限個の学習パターンを用いたパラメータの推定が、統計的に信頼性の低くなるためである。したがって、識別処理への寄与が小さい特徴を取り除き、特徴の次元数を適度に小さくする方が、認識処理の高精度化に有効である。 Next, high accuracy will be described. When the pattern recognition apparatus 101 generates a discriminant function from a finite number of learning patterns in a high-dimensional space, an increase in the number of dimensions causes a decrease in accuracy. This is because the number of estimated parameters of the discriminant function increases as the number of dimensions increases, and parameter estimation using a finite number of learning patterns is statistically unreliable. Therefore, removing features that contribute little to the identification processing and appropriately reducing the number of dimensions of the features is effective in increasing the accuracy of the recognition processing.

前述した理由によって、ｎ次元特徴がｍ次元特徴に変換される。 For the reasons described above, n-dimensional features are converted to m-dimensional features.

特徴選択辞書生成部２０３は、特徴抽出部２０２によって生成された学習パターンのｎ次元特徴を用いて、ｎ次元特徴をｍ次元特徴に変換する変換関数を生成する。生成された変換関数は、特徴選択用辞書５０３に格納される。線型変換である場合変換関数は行列で表される。 The feature selection dictionary generation unit 203 uses the n-dimensional feature of the learning pattern generated by the feature extraction unit 202 to generate a conversion function that converts an n-dimensional feature into an m-dimensional feature. The generated conversion function is stored in the feature selection dictionary 503. In the case of linear conversion, the conversion function is represented by a matrix.

変換関数の生成には、例えば、主成分分析や線型判別法などが用いられる。ｍ次元特徴への変換は、カテゴリ毎に異なる関数で行われる場合もある。その場合、カテゴリ毎に変換関数が作成される。 For example, principal component analysis or linear discriminant method is used to generate the conversion function. The conversion into m-dimensional features may be performed by a function that is different for each category. In that case, a conversion function is created for each category.

ここで、一例として、主成分分析による特徴選択関数の生成について説明する。主成分分析では、まず、特徴選択辞書生成部２０３が学習パターンの分布の共分散行列を算出する。次に、特徴選択辞書生成部２０３は、共分散行列の固有値の大きい順にｍ個の固有ベクトルを選択し、ｎ次元特徴を各々の固有ベクトルに射影したｍ個の特徴を成分とするｍ次元特徴を選択する。ｍ個の固有ベクトルをｐｉ(ｉ＝１、…ｍ)とおくと、ｍ次元特徴ｙの第ｉ成分ｙｉは、ｘとｐｉとの内積ｙｉ＝ｘ・ｐｉによって与えられる。したがって、主成分分析を用いて生成される変換関数は、ｍ個の固有ベクトルを行ベクトルとするｍ×ｎ行列Ｙで表される。具体的には、ｎ次元特徴をｘ、ｍ次元特徴をｙと表すと、変換行列Ｙはｙ＝Ｙｘと表される。 Here, generation of a feature selection function by principal component analysis will be described as an example. In the principal component analysis, first, the feature selection dictionary generation unit 203 calculates a covariance matrix of the learning pattern distribution. Next, the feature selection dictionary generation unit 203 selects m eigenvectors in descending order of the eigenvalues of the covariance matrix, and selects m-dimensional features whose components are m features obtained by projecting n-dimensional features onto the respective eigenvectors. To do. When m eigenvectors are denoted by pi (i = 1,..., m), the i-th component yi of the m-dimensional feature y is given by the inner product yi = x · pi of x and pi. Therefore, the conversion function generated using principal component analysis is represented by an m × n matrix Y having m eigenvectors as row vectors. Specifically, when an n-dimensional feature is represented as x and an m-dimensional feature is represented as y, the transformation matrix Y is represented as y = Yx.

特徴選択部２０４は、特徴選択用辞書５０３に格納される変換関数を用いて、ｎ次元特徴をｍ次元特徴に変換する。変換後のｍ次元特徴は、識別関数生成部５０１に出力される。ｎ次元特徴をｘ＝（ｘ１，ｘ２，…，ｘｎ）、変換後のｍ次元特徴をｙ＝（ｙ１，ｙ２，…，ｙｍ）とおくと、変換関数は、以下のように表される。
ｙ１＝ｆ１（ｘ１,ｘ２,…,ｘｎ）
ｙ２＝ｆ２（ｘ１,ｘ２,…,ｘｎ）
・・・・・・・・・・・・・・・・・
ｙｍ＝ｆｍ（ｘ１,ｘ２,…,ｘｎ）
また、線型変換である場合、ｍ×ｎの変換行列Ｙを用いてｙ＝Ｙｘと表される。 The feature selection unit 204 converts an n-dimensional feature into an m-dimensional feature using a conversion function stored in the feature selection dictionary 503. The converted m-dimensional feature is output to the discriminant function generator 501. If the n-dimensional feature is x = (x1, x2,..., xn) and the m-dimensional feature after conversion is y = (y1, y2,..., ym), the conversion function is expressed as follows.
y1 = f1 (x1, x2,..., xn)
y2 = f2 (x1, x2,..., xn)
・・・・・・・・・・・・・・・・・
ym = fm (x1, x2,..., xn)
Further, in the case of linear conversion, y = Yx is expressed using an m × n conversion matrix Y.

また、カテゴリ毎に異なる変換が実行される場合、特徴選択部２０４は、カテゴリ毎に作成された変換関数を用いて、カテゴリｋのｍ次元特徴ｙｋ＝（ｙｋ１,ｙｋ２,…,ｙｋｍ）を算出する。カテゴリｋの変換関数は、以下のように表される。
ｙｋ１＝ｆｋ１（ｘ１,ｘ２,…,ｘｎ）
ｙｋ２＝ｆｋ２（ｘ１,ｘ２,…,ｘｎ）
・・・・・・・・・・・・・・・・・
ｙｋｍ＝ｆｋｍ（ｘ１,ｘ２,…,ｘｎ）
線型変換である場合、ｍ×ｎの変換行列Ｙｋを用いてｙｋ＝Ｙｋｘと表される。 When different conversion is performed for each category, the feature selection unit 204 calculates the m-dimensional feature yk = (yk1, yk2,..., Ykm) of the category k using the conversion function created for each category. To do. The conversion function of category k is expressed as follows.
yk1 = fk1 (x1, x2,..., xn)
yk2 = fk2 (x1, x2,..., xn)
・・・・・・・・・・・・・・・・・
ykm = fkm (x1, x2,..., xn)
In the case of linear conversion, yk = Ykx is expressed using an m × n conversion matrix Yk.

以降の識別関数生成部５０１及び識別関数主要部分抽出部５０２は、詳細識別関数及び大分類識別関数を生成するために複数のパターンのｍ次元特徴を必要とする場合がある。 The subsequent discriminant function generating unit 501 and discriminant function main part extracting unit 502 may require m-dimensional features of a plurality of patterns in order to generate a detailed discriminant function and a large class discriminant function.

この場合、ｍ次元特徴が必要となるたびにパターン入力部２０１、特徴抽出部２０２及び特徴選択部２０４が処理を実行して、識別関数生成部５０１及び識別関数主要部分抽出部５０２が必要なｍ次元特徴を取得してもよい。 In this case, each time an m-dimensional feature is required, the pattern input unit 201, the feature extraction unit 202, and the feature selection unit 204 execute processing, and the discrimination function generation unit 501 and the discrimination function main part extraction unit 502 require m. A dimensional feature may be acquired.

また、パターン入力部２０１、特徴抽出部２０２及び特徴選択部２０４が予め全ての学習用パターンについて処理を実行してｍ次元特徴に変換し、当該ｍ次元特徴を外部記憶装置１０７などに格納しておき、識別関数生成部５０１及び識別関数主要部分抽出部５０２が必要となるたびに外部記憶装置１０７から取得する方法であってもよい。 In addition, the pattern input unit 201, the feature extraction unit 202, and the feature selection unit 204 execute processing for all learning patterns in advance to convert them into m-dimensional features, and store the m-dimensional features in the external storage device 107 or the like. Alternatively, a method may be used in which the discriminant function generation unit 501 and the discriminant function main part extraction unit 502 are acquired from the external storage device 107 each time they are required.

識別関数生成部５０１は、ｍ次元特徴を取得し、カテゴリｋに対する認識対象のパターンの類似度を計算するための詳細識別関数ｕｋ（ｘ）を生成する。生成された詳細識別関数は、識別関数主要部分抽出部５０２に出力され、また、識別用辞書５０４に格納される。 The discriminant function generation unit 501 acquires m-dimensional features and generates a detailed discriminant function uk (x) for calculating the similarity of the recognition target pattern with respect to the category k. The generated detailed discriminant function is output to the discriminant function main part extracting unit 502 and stored in the discriminating dictionary 504.

詳細識別関数ｕｋ（ｘ）は、表記を統一するためにｘの関数として表記したが、実際には、ｍ次元特徴ｙ又はｙｋに依存する関数である。すなわち、任意の関数ｈを用いて詳細識別関数は、ｕｋ（ｘ）＝ｈｋ（ｙ）＝ｈｋ（ｆ（ｘ））、又は、ｕｋ（ｘ）＝ｈｋ（ｙｋ）＝ｈｋ（ｆｋ（ｘ））と表すことができる。パターンのカテゴリｋに対する詳細識別関数は、ｍ次元特徴ｙを用いて、ｕｋ（ｘ）＝ｈｋ（ｙ）と表される。 The detailed identification function uk (x) is expressed as a function of x in order to unify the notation, but is actually a function depending on the m-dimensional feature y or yk. That is, the detailed discrimination function using an arbitrary function h is uk (x) = hk (y) = hk (f (x)) or uk (x) = hk (yk) = hk (fk (x) )It can be expressed as. The detailed discriminant function for the category k of the pattern is expressed as uk (x) = hk (y) using the m-dimensional feature y.

特徴選択部２０４がカテゴリ毎に異なるｍ次元特徴ｙｋを生成した場合、識別関数生成部５０１は、詳細識別関数は、ｕｋ（ｘ）＝ｈｋ（ｙｋ）と表される。 When the feature selection unit 204 generates different m-dimensional features yk for each category, the detailed identification function of the identification function generation unit 501 is expressed as uk (x) = hk (yk).

詳細識別関数の生成に用いられるアルゴリズムには、最近傍法、パーセプトロン、改良投影距離法、修正二次識別、一般学習ベクトル量子化、多項式ネットワーク、又はサポートベクトルマシンなどがある。 Algorithms used to generate the detailed discriminant function include nearest neighbor method, perceptron, improved projection distance method, modified quadratic discrimination, general learning vector quantization, polynomial network, or support vector machine.

識別関数主要部分抽出部５０２は、詳細識別関数をＬ次元部分特徴空間に制限した大分類識別関数ｖｋを生成し、生成された大分類識別関数ｖｋを識別用辞書５０４に格納する。また、識別関数主要部分抽出部５０２は、ｎ次元特徴をＬ次元部分特徴空間上のＬ次元特徴ｚに変換するための大分類用特徴変換関数ｇを生成し、生成された大分類用特徴変換関数ｇを特徴選択用辞書５０３に出力する。 The discrimination function main part extraction unit 502 generates a major classification function vk in which the detailed discrimination function is limited to the L-dimensional partial feature space, and stores the generated major classification function vk in the identification dictionary 504. Further, the discriminant function main part extraction unit 502 generates a large classification feature conversion function g for converting an n-dimensional feature into an L-dimensional feature z in the L-dimensional partial feature space, and generates the generated major classification feature conversion. The function g is output to the feature selection dictionary 503.

なお、Ｌ次元部分空間は、ｎ次元特徴空間の部分空間でもあり、かつ、ｍ次元特徴空間の部分空間でもある。 Note that the L-dimensional subspace is a subspace of the n-dimensional feature space and also a subspace of the m-dimensional feature space.

本発明は、識別関数主要部分抽出部５０２に特徴がある。識別関数主要部分抽出部５０２での処理の詳細は、後述する。 The present invention is characterized by the discrimination function main part extraction unit 502. Details of the processing in the discrimination function main part extraction unit 502 will be described later.

以上が、学習フェーズの処理の説明である。次に、認識フェーズについて図３を用いて説明する。 The above is the description of the learning phase process. Next, the recognition phase will be described with reference to FIG.

パターン認識装置１０１は、認識フェーズのモジュールとＤＢとして、パターン入力部２０１、特徴抽出部２０２、大分類特徴選択部６０１、大分類識別部６０２、特徴選択部２０４、詳細識別部６０３、認識結果出力部３０３、認識対象パターンＤＢ３０４、特徴選択用辞書５０３、及び識別用辞書５０４を備える。 The pattern recognition apparatus 101 includes, as a recognition phase module and DB, a pattern input unit 201, a feature extraction unit 202, a large classification feature selection unit 601, a large classification identification unit 602, a feature selection unit 204, a detailed identification unit 603, and a recognition result output. A unit 303, a recognition target pattern DB 304, a feature selection dictionary 503, and an identification dictionary 504.

パターン入力部２０１は、認識対象パターンを取得し、取得された認識対象パターンを特徴抽出部２０２に出力する。認識対象パターンは、予め認識対象パターンＤＢ３０４に格納しておき、認識対象パターンＤＢ３０４から取り込んでもよいし、パターン取得装置１０４又は通信装置１０５から直接取り込んでもよい。なお、認識対象パターンＤＢ３０４は、例えば、外部記憶装置１０７等が考えられる。 The pattern input unit 201 acquires a recognition target pattern and outputs the acquired recognition target pattern to the feature extraction unit 202. The recognition target pattern may be stored in advance in the recognition target pattern DB 304 and may be taken in from the recognition target pattern DB 304 or may be taken in directly from the pattern acquisition device 104 or the communication device 105. Note that the recognition target pattern DB 304 may be the external storage device 107, for example.

特徴抽出部２０２は、パターン入力部２０１によって入力されたパターンからｎ次元特徴を抽出する。特徴抽出部２０２がパターンからｎ次元特徴を抽出する処理は、学習フェーズと同一である。抽出されたｎ次元特徴は、大分類特徴選択部６０１と特徴選択部２０４とに出力される。 The feature extraction unit 202 extracts n-dimensional features from the pattern input by the pattern input unit 201. The process in which the feature extraction unit 202 extracts n-dimensional features from the pattern is the same as the learning phase. The extracted n-dimensional features are output to the large classification feature selection unit 601 and the feature selection unit 204.

大分類特徴選択部６０１は、特徴選択用辞書５０３に格納されている大分類用変換関数ｇを用いて、ｎ次元特徴をＬ次元特徴ｚに変換する。ここで、ｎ次元特徴ｘと大分類用変換関数ｇとを用いると、Ｌ次元特徴ｚはｚ＝ｇ（ｘ）と表される。変換されたＬ次元特徴は、大分類識別部６０２に出力される。 The major classification feature selection unit 601 converts an n-dimensional feature into an L-dimensional feature z using the major classification conversion function g stored in the feature selection dictionary 503. Here, when the n-dimensional feature x and the large classification conversion function g are used, the L-dimensional feature z is expressed as z = g (x). The converted L-dimensional feature is output to the large classification identifying unit 602.

大分類識別部６０２は、識別用辞書５０４に格納されている大分類識別関数ｖｋを用いてパターンの各カテゴリに対する粗い類似度を算出する。さらに、大分類識別部６０２は、算出された類似度を用いて、類似度が高いカテゴリを正解候補カテゴリとして算出する。なお、正解候補カテゴリは、ユーザによって指定された数だけ算出される。例えば、数千カテゴリの字種を認識対象とする漢字認識の場合、正解候補は数十カテゴリ程度に絞られる。 The large classification identifying unit 602 uses the large classification identification function vk stored in the identification dictionary 504 to calculate a rough similarity for each category of the pattern. Further, the large classification identifying unit 602 calculates a category having a high similarity as a correct candidate category using the calculated similarity. Note that the correct answer candidate categories are calculated by the number designated by the user. For example, in the case of Kanji recognition for character types of several thousand categories, the correct answer candidates are limited to about several tens of categories.

大分類識別部６０２は、正解候補カテゴリを特徴選択部２０４に出力する。なお、大分類識別部６０２が実行する処理の詳細は、後述する。 The large classification identifying unit 602 outputs the correct answer candidate category to the feature selecting unit 204. Details of the processing executed by the large classification identifying unit 602 will be described later.

特徴選択部２０４は、特徴選択用辞書５０３に格納されている変換関数ｆを用いて、ｎ次元特徴をｍ次元特徴に変換する。ｎ次元特徴をｍ次元特徴に変換する処理は、学習フェーズと同一である。カテゴリ毎に異なるｍ次元特徴が抽出される場合、大分類識別部６０２が実行した処理で得られた正解候補カテゴリ毎にｍ次元特徴に変換される。変換されたｍ次元特徴は、詳細識別部６０３に出力される。また、正解候補カテゴリも詳細識別部６０３に出力される。 The feature selection unit 204 uses the conversion function f stored in the feature selection dictionary 503 to convert the n-dimensional feature into an m-dimensional feature. The process of converting n-dimensional features into m-dimensional features is the same as in the learning phase. When different m-dimensional features are extracted for each category, each correct candidate category obtained by the process executed by the large classification identifying unit 602 is converted into an m-dimensional feature. The converted m-dimensional feature is output to the detailed identification unit 603. The correct answer candidate category is also output to the detail identifying unit 603.

詳細識別部６０３は、詳細識別関数を用いて、正解候補カテゴリに対する認証対象パターンの類似度を算出する。算出された正解候補カテゴリに対する類似度は、認識結果出力部３０３に出力される。詳細識別部６０３が実行する処理の詳細は、後述する。 The detailed identification unit 603 calculates the degree of similarity of the authentication target pattern with respect to the correct candidate category using the detailed identification function. The calculated similarity to the correct candidate category is output to the recognition result output unit 303. Details of the processing executed by the detail identifying unit 603 will be described later.

認識結果出力部３０３は、算出された正解候補カテゴリに対する認証対象パターンの類似度を用いて、最終的な認識結果を出力する。通常、認識結果出力部３０３は、最も類似度が高いカテゴリを認識結果として出力する。 The recognition result output unit 303 outputs the final recognition result by using the similarity of the authentication target pattern with the calculated correct candidate category. Usually, the recognition result output unit 303 outputs the category having the highest similarity as the recognition result.

また、認識結果出力部３０３は、類似度が二番目以降のカテゴリも、二位候補、三位候補、などとして出力する場合もある。 In addition, the recognition result output unit 303 may also output the second and subsequent categories of similarity as second candidates, third candidates, and the like.

また、類似度が指定閾値よりも小さい場合、認識結果出力部３０３は、どのカテゴリにも該当しないとして、棄却する場合がある。例えば、数字認識の場合に、漢字が入力された場合などが該当する。 When the similarity is smaller than the specified threshold value, the recognition result output unit 303 may reject it as not corresponding to any category. For example, in the case of digit recognition, this applies to the case where a kanji is input.

また、類似度の第一位と第二位との差が指定閾値よりも小さい場合、認識結果出力部３０３は、どちらのカテゴリであるか判定することが難しいとして、棄却する場合がある。例えば、アルファベットの「Ｉ」や「ｌ」と数字の「１」など、フォントによっては識別することが難しいため、どのカテゴリであるかが判定できない。 In addition, when the difference between the first and second similarities is smaller than the specified threshold, the recognition result output unit 303 may reject it because it is difficult to determine which category it is. For example, it is difficult to discriminate depending on the font such as alphabet “I” or “l” and number “1”.

認識結果は、表示装置１０３に表示され、通信装置１０５を用いて外部に送信され、又は、外部記憶装置１０７に出力される。 The recognition result is displayed on the display device 103 and transmitted to the outside using the communication device 105 or output to the external storage device 107.

以下では、本発明の特徴である、識別関数主要部分抽出部５０２と、大分類特徴選択部６０１、大分類識別部６０２、及び詳細識別部６０３が実行する処理の詳細について説明する。 Hereinafter, the details of processing executed by the discrimination function main part extraction unit 502, the major classification feature selection unit 601, the major classification identification unit 602, and the detailed identification unit 603, which are features of the present invention, will be described.

まず、識別関数主要部分抽出部５０２が大分類識別関数ｖｋと大分類用特徴変換関数とを生成する方法について説明する。 First, a method in which the discriminant function main part extraction unit 502 generates a major classification function vk and a major classification feature conversion function will be described.

以下の説明では、特徴選択部２０４がカテゴリ毎に異なるｍ次元特徴ｙｋを抽出した場合について説明する。なお、特徴選択部２０４がカテゴリに依存しないｍ次元特徴ｙを抽出した場合、識別関数主要部分抽出部５０２は、ｙｋをｙに置き換えて同一の処理を実行すればよい。 In the following description, a case will be described in which the feature selection unit 204 extracts different m-dimensional features yk for each category. When the feature selection unit 204 extracts the m-dimensional feature y that does not depend on the category, the discriminant function main part extraction unit 502 may perform the same processing by replacing yk with y.

また、本実施形態における条件として、特徴選択部２０４は、線型関数による変換ｙｋ＝Ｙｋｘによりｎ次元特徴をｍ次元特徴に変換するものとし、詳細識別関数は［数１］に示すような二次以下の関数とする。 Further, as a condition in the present embodiment, the feature selection unit 204 converts an n-dimensional feature into an m-dimensional feature by a linear function conversion yk = Ykx, and the detailed identification function is a quadratic as shown in [Equation 1]. Let us assume the following function.

まず、識別関数主要部分抽出部５０２が実行する処理の概要を説明した後に、図４に示す各々の処理について説明する。 First, after describing the outline of the process executed by the discriminant function main part extraction unit 502, each process shown in FIG. 4 will be described.

識別関数主要部分抽出部５０２は、詳細識別関数ｕｋ（ｘ）をＬ次元部分特徴空間に制限した大分類識別関数ｖｋ（ｘ）を生成する。本実施形態では、大分類識別関数ｖｋ（ｘ）として、詳細識別関数ｕｋ（ｘ）をよく近似し、高速に計算できるものが生成される。 The discriminant function main part extraction unit 502 generates a large class discriminant function vk (x) in which the detailed discriminant function uk (x) is limited to the L-dimensional partial feature space. In the present embodiment, as the large classification discriminant function vk (x), the detailed discriminant function uk (x) is well approximated and can be calculated at high speed.

識別関数主要部分抽出部５０２は、まず、各々のカテゴリの詳細識別関数ｕｋ（ｘ）を特徴づける特徴軸を選定し、次に、それぞれの選定された特徴軸の重要度を設定する。 The discriminant function main part extraction unit 502 first selects feature axes that characterize the detailed discriminant function uk (x) of each category, and then sets the importance of each selected feature axis.

次に、識別関数主要部分抽出部５０２は、重要度に基づいて特徴軸を統合する主要軸を算出する。最後に、識別関数主要部分抽出部５０２は、詳細識別関数ｕｋ（ｘ）を主要軸に制限した関数を生成し、当該関数を大分類識別関数ｖｋとして出力する。 Next, the discriminant function main part extraction unit 502 calculates a main axis for integrating feature axes based on the importance. Finally, the discriminant function main part extraction unit 502 generates a function in which the detailed discriminant function uk (x) is limited to the main axis, and outputs the function as the large class discriminant function vk.

［数１］に示した詳細識別関数から分かるように、二次の識別関数の計算量は、特徴空間の次元数ｍの二乗のオーダーである（［数１］の項数は次元数ｍの二乗のオーダーである）。したがって、特徴空間の次元数を制限することによって、詳細識別関数の計算量を抑えることができる。 As can be seen from the detailed discriminant function shown in [Equation 1], the calculation amount of the secondary discriminant function is the order of the square of the dimension number m of the feature space (the number of terms in [Equation 1] is the dimension number m Square order). Therefore, by limiting the number of dimensions of the feature space, it is possible to reduce the amount of calculation of the detailed identification function.

詳細分類識別関数の依存領域をＬ次元部分特徴空間に制限された大分類識別関数は、詳細分類識別関数よりも高速に計算することができる。本実施形態では、詳細分類識別関数をｍ次元特徴空間のＬ次元部分特徴空間に制限された関数を大分類識別関数とする方法について説明する。 The large classification discriminating function in which the dependent region of the detailed classification discriminating function is limited to the L-dimensional partial feature space can be calculated faster than the detailed classification discriminating function. In the present embodiment, a method will be described in which a detailed classification discriminant function is a function that is limited to an L-dimensional partial feature space of an m-dimensional feature space as a major classification discriminant function.

まず、［数１］に示す詳細識別関数を変形する。ここで、行列Ｗｋ及びベクトルｗｋを、それぞれ［数２］及び［数３］で定義する。 First, the detailed identification function shown in [Formula 1] is modified. Here, the matrix Wk and the vector wk are defined by [Equation 2] and [Equation 3], respectively.

［数２］、［数３］及び［数４］を用いて、［数１］は［数５］のように変形される。 [Equation 1] is transformed into [Equation 5] using [Equation 2], [Equation 3], and [Equation 4].

Ｗｋは対称行列であるため、任意のユニタリ行列Ｐｋが存在し、ＷｋはＰｋを用いて［数６］に示すように対角化することができる。ここで、対角成分は、［数７］に示すように、対角成分の絶対値が大きい順に並ぶようなＰｋが用いられる。 Since Wk is a symmetric matrix, there is an arbitrary unitary matrix Pk, and Wk can be diagonalized using Pk as shown in [Formula 6]. Here, as the diagonal component, as shown in [Equation 7], Pk that is arranged in descending order of the absolute value of the diagonal component is used.

Ｐｋはユニタリ行列であるため、［数５］は［数８］のように変形できる。 Since Pk is a unitary matrix, [Equation 5] can be transformed into [Equation 8].

さらに、変換関数ｙｋ＝Ｙｋｘを用いると、ｕｋ（ｙｋ）をｘの関数として表現しなおすことができ、［数８］は［数９］に示すように表現される。 Further, when the conversion function yk = Ykx is used, uk (yk) can be re-expressed as a function of x, and [Equation 8] is expressed as shown in [Equation 9].

行列ＰｋＹｋの行ベクトルを［数１０］に示すようにｑｋｉ（ｉ＝１、…、ｍ）の転置とおき、また、Ｐｋｗｋを［数１１］のようにおくと、識別関数主要部分抽出部５０２は、［数９］は［数１２］のように変形される。 When the row vector of the matrix PkYk is replaced by qki (i = 1,..., M) as shown in [Equation 10] and Pkwk is given as [Equation 11], the discriminant function main part extraction unit 502 [Equation 9] is transformed into [Equation 12].

すなわち、Ｗｋを対角化する行列Ｐｋを求めることができれば、［数１］は、［数６］、［数１０］及び［数１１］を用いて、［数１２］のように変形される。なお、Ｗｋから行列Ｐｋを求めるには、対称行列Ｗｋの固有値問題を解けばよい。 That is, if the matrix Pk that diagonalizes Wk can be obtained, [Equation 1] is transformed into [Equation 12] using [Equation 6], [Equation 10], and [Equation 11]. . In order to obtain the matrix Pk from Wk, the eigenvalue problem of the symmetric matrix Wk may be solved.

本実施形態では、詳細識別関数ｕｋを特徴づける特徴軸としてｑｋｉが用いられる。 In the present embodiment, qki is used as a feature axis that characterizes the detailed discrimination function uk.

次に、各特徴軸ｑｋｉの重要度ｈｋｉが設定される。重要度の決め方には様々な方法が考えられるが、最も簡単な方法は、［数１７］に示すように特徴軸ｑｋｉを含む項の係数の大きさを重要度として決定する方法が考えられる。 Next, the importance degree hki of each feature axis qki is set. There are various methods for determining the importance, but the simplest method is to determine the magnitude of the coefficient of the term including the feature axis qki as the importance as shown in [Equation 17].

次に、選定された特徴軸を統合する主要軸の算出方法について説明する。 Next, a calculation method of a main axis that integrates selected feature axes will be described.

ここでは、統合軸としてｎ次元ベクトルａが選択された場合を考える。このとき、特徴軸ｑｋｉのベクトルａへの射影長は、内積ａ・ｑｋｉで与えられる。 Here, consider a case where the n-dimensional vector a is selected as the integrated axis. At this time, the projection length of the feature axis qki onto the vector a is given by the inner product a · qki.

内積の値が大きいほど、ベクトルａへの統合によって失われる特徴軸ｑｋｉの情報量が少ないと考えられる。したがって、［数１８］に示すように、内積の値に特徴軸の重要度をかけたものの合計が大きいほど、ベクトルａへの統合によって失われる特徴軸方向の情報量が少ないといえる。 It can be considered that as the value of the inner product is larger, the information amount of the feature axis qki lost by the integration into the vector a is smaller. Therefore, as shown in [Equation 18], it can be said that the larger the sum of the inner product value multiplied by the importance of the feature axis, the smaller the amount of information in the feature axis direction lost by the integration into the vector a.

ここで、Ｋはカテゴリ数である。本実施形態では、［数１８］に示すような量が大きいベクトルａが主要軸として選択される。 Here, K is the number of categories. In the present embodiment, a vector a having a large amount as shown in [Equation 18] is selected as the main axis.

さらに、［数１８］は、［数１９］のように変形される。 Furthermore, [Equation 18] is transformed into [Equation 19].

ここで、［数１９］の式変形の結果の括弧内を［数２０］のような行列Ｑと定義する。 Here, the parenthesis in the result of the equation modification of [Equation 19] is defined as a matrix Q like [Equation 20].

行列Ｑは、ｎ×ｎの対称行列である。したがって、行列Ｑの固有値を大きい順にｄ１≧ｄ２≧…ｄｎとし、各々の固有値に対応する正規化された固有ベクトルをｕ１、ｕ２、…ｕｎとした場合、ユニタリ行列Ｕを［数２２］のようにおくと、行列Ｑは［数２１］のように対角化できる。 The matrix Q is an n × n symmetric matrix. Accordingly, when the eigenvalues of the matrix Q are d1 ≧ d2 ≧... Dn in descending order and the normalized eigenvectors corresponding to the eigenvalues are u1, u2,... Un, the unitary matrix U is expressed as [Equation 22]. In other words, the matrix Q can be diagonalized as [Equation 21].

［数１９］は、［数２１］を用いて［数２３］のように変形される。 [Equation 19] is transformed into [Equation 23] using [Equation 21].

ここで、ｄｉは、複数の特徴軸を一つの主要軸に統合した場合の評価値となる。ｄｉの値が大きいほど、複数の特徴軸をより統合していることを示す。 Here, di is an evaluation value when a plurality of feature axes are integrated into one main axis. A larger value of di indicates that a plurality of feature axes are more integrated.

ｕ１、ｕ２、…ｕｎは正規直交系であるから、［数２３］から、［数１８］の値が最も大きくなる主要軸は、最大固有値に対応する固有ベクトルａ＝ｕ１となる。以下、二番目の主要軸は二番目の固有値に対応する固有ベクトルａ＝ｕ２となり、三番目の主要軸は三番目の固有値に対応する固有ベクトルａ＝ｕ３となり、また、Ｌ番目の主要軸は行列ＱのＬ番目に大きい固有値に対応する固有ベクトルｕＬとなる。 Since u1, u2,..., un are orthogonal systems, the principal axis with the largest value of [Equation 18] from [Equation 23] is the eigenvector a = u1 corresponding to the largest eigenvalue. Hereinafter, the second major axis is the eigenvector a = u2 corresponding to the second eigenvalue, the third major axis is the eigenvector a = u3 corresponding to the third eigenvalue, and the Lth major axis is the matrix Q The eigenvector uL corresponding to the L-th largest eigenvalue.

最後に、詳細識別関数を主要軸に制限した大分類識別関数を求める。ｎ次元特徴ｘをｊ番目の主要軸ｕｊに制限したベクトルは、［数２４］に示すように変形される。 Finally, a large class discriminant function in which the detailed discriminant function is limited to the main axis is obtained. A vector obtained by limiting the n-dimensional feature x to the j-th major axis uj is transformed as shown in [Equation 24].

［数１２］に示す詳細識別関数のベクトルｘを［数２４］に示す制限ベクトルで置き換えると、［数２５］のように変形される。 When the vector x of the detailed discriminant function shown in [Equation 12] is replaced with the restriction vector shown in [Equation 24], it is transformed as [Equation 25].

ここで、［数２６］、［数２７］、［数２８］及び［数２９］のようにおくと、［数２５］は、［数３０］のように変形される。 Here, when [Equation 26], [Equation 27], [Equation 28], and [Equation 29] are set, [Equation 25] is transformed into [Equation 30].

したがって、大分類識別関数は［数３０］のようになり、［数３１］を用いるとｚ＝ＵＬ（ｘ）と表される。 Therefore, the large classification discriminant function is expressed as [Equation 30] and is expressed as z = UL (x) using [Equation 31].

したがって、大分類用の特徴選択に用いられる変換関数は、［数３１］となる。 Therefore, the conversion function used for feature selection for large classification is [Equation 31].

以上で説明した式を用いて、識別関数主要部分抽出部５０２は、［数３０］に示すような大分類識別関数ｖｋと、［数３１］に示すような大分類用特徴変換関数ＵＬとを生成することができる。 Using the equations described above, the discriminant function main part extraction unit 502 obtains a major classification discriminant function vk as shown in [Equation 30] and a major classification feature conversion function UL as shown in [Equation 31]. Can be generated.

以下、識別関数主要部分抽出部５０２が実行する処理の詳細を説明する。 The details of the processing executed by the discriminant function main part extraction unit 502 will be described below.

図４は、本発明の第１の実施形態の識別関数主要部分抽出部５０２が実行する処理を説明するフローチャートである。 FIG. 4 is a flowchart illustrating processing executed by the discriminant function main part extraction unit 502 according to the first embodiment of this invention.

識別関数入力ステップ７０１では、識別関数主要部分抽出部５０２が、識別関数生成部５０１から詳細識別関数を取得する。 In the discrimination function input step 701, the discrimination function main part extraction unit 502 acquires a detailed discrimination function from the discrimination function generation unit 501.

特徴軸選定ステップ７０２では、識別関数主要部分抽出部５０２が、主要軸算出の基準となる特徴軸を選択する。 In the feature axis selection step 702, the discriminant function main part extraction unit 502 selects a feature axis that is a reference for calculating the main axis.

具体的には、識別関数主要部分抽出部５０２は、［数１］に示す詳細識別関数を［数１２］のようにに変形する。当該変形は、対称行列Ｗｋを［数２］のように定義し、Ｗｋの固有ベクトルを行ベクトルとする行列Ｐｋを求め、［数６］、［数１０］及び［数１１］を用いて得られる。 Specifically, the discriminant function main part extraction unit 502 transforms the detailed discriminant function shown in [Equation 1] into [Equation 12]. The modification is obtained by defining a symmetric matrix Wk as [Equation 2], obtaining a matrix Pk having an eigenvector of Wk as a row vector, and using [Equation 6], [Equation 10], and [Equation 11]. .

識別関数主要部分抽出部５０２は、ｑｋｉを特徴軸として選択する。 The discriminant function main part extraction unit 502 selects qki as a feature axis.

軸重要度設定ステップ７０３では、識別関数主要部分抽出部５０２が、特徴軸と詳細識別関数とを用いて各特徴軸の重要度を算出する。 In the axis importance setting step 703, the discrimination function main part extraction unit 502 calculates the importance of each feature axis using the feature axis and the detailed discrimination function.

最も簡単な特徴軸の重要度を算出方法としては、特徴軸ｑｋｉの重要度ｈｋｉを［数１７］のように定義する方法である。重要度の定義方法は様々考えられ、他のいくつかの例について後述する。 The simplest method of calculating the importance of the feature axis is a method of defining the importance hki of the feature axis qki as shown in [Equation 17]. There are various ways of defining the importance, and some other examples will be described later.

主要軸算出ステップ７０４では、識別関数主要部分抽出部５０２が、算出された重要度及び特徴軸を用いて特徴軸を統合し、Ｌ個の主要軸を算出する。 In the main axis calculation step 704, the discriminant function main part extracting unit 502 integrates the feature axes using the calculated importance and feature axes, and calculates L main axes.

具体的には、Ｌ個の主要軸は、［数２０］に示すｎ×ｎ行列Ｑの固有値が大きい上位Ｌ個の正規化された固有ベクトルｕ１、ｕ２、…、ｕＬとして算出される。 Specifically, the L major axes are calculated as upper L normalized eigenvectors u1, u2,..., UL having a large eigenvalue of the n × n matrix Q shown in [Equation 20].

大分類識別関数生成ステップ７０５では、識別関数主要部分抽出部５０２が、主要軸と詳細識別関数とを用いて、Ｌ個の固有ベクトルｕ１、ｕ２、…、ｕＬによって生成されるＬ次元特徴空間に詳細識別関数を制限した大分類識別関数を生成する。 In the large classification discriminant function generation step 705, the discriminant function main part extraction unit 502 uses the main axis and the detailed discriminant function to detail the L-dimensional feature space generated by the L eigenvectors u1, u2,. A large class discriminant function with a limited discriminant function is generated.

具体的には、大分類識別関数は、［数２６］、［数２７］、［数２８］及び［数２９］を用いて［数３０］のように与えられる。また、大分類用の特徴選択関数は、［数３１］のように与えられる。 Specifically, the large classification discriminant function is given as [Equation 30] using [Equation 26], [Equation 27], [Equation 28], and [Equation 29]. A feature selection function for large classification is given by [Equation 31].

出力ステップ７０６では、識別関数主要部分抽出部５０２が、大分類識別関数用の特徴選択関数［数３１］を特徴選択用辞書５０３に格納し、また、大分類識別関数［数３０］を識別用辞書５０４に格納する。 In the output step 706, the discriminant function main part extraction unit 502 stores the feature selection function [Equation 31] for the major classification discriminant function in the feature selection dictionary 503, and also identifies the major classification discriminant function [Equation 30]. Store in dictionary 504.

以上が、識別関数主要部分抽出部５０２が実行する処理の詳細の説明である。 The above is the detailed description of the processing executed by the discriminant function main part extraction unit 502.

次に、認識フェーズの大分類特徴選択部６０１、大分類識別部６０２、及び詳細識別部６０３が実行する処理について説明する。 Next, processing executed by the major classification feature selection unit 601, the major classification identification unit 602, and the detailed identification unit 603 in the recognition phase will be described.

大分類特徴選択部６０１は、特徴選択用辞書５０３に格納されている大分類識別関数用の特徴選択関数［数３１］を用いてＬ次元特徴ｚ［数２９］を抽出し、大分類識別部６０２に出力する。 The major classification feature selection unit 601 extracts the L-dimensional feature z [Equation 29] using the feature selection function [Equation 31] for the major classification identification function stored in the feature selection dictionary 503, and the major classification identification unit To 602.

大分類識別部６０２は、識別用辞書５０４に格納されている大分類識別関数［数３０］を用いて類似度を算出する。大分類識別部６０２は、算出された類似度に基づいて、類似度が高い順にユーザによって指定された個数分の正解候補カテゴリを算出する。算出された正解候補カテゴリは、特徴選択部２０４に出力される。 The large classification identifying unit 602 calculates the similarity using the large classification identification function [Equation 30] stored in the identification dictionary 504. Based on the calculated similarity, the large classification identifying unit 602 calculates correct candidate categories for the number specified by the user in descending order of similarity. The calculated correct candidate category is output to the feature selection unit 204.

詳細識別部６０３は、特徴選択用辞書５０３に格納されている詳細識別関数ｕｋ（ｘ）を用いて、正解候補カテゴリに対する類似度を算出し、算出された正解候補カテゴリに対する類似度を認識結果出力部３０３に出力する。 The detailed identification unit 603 calculates the similarity to the correct candidate category using the detailed identification function uk (x) stored in the feature selection dictionary 503, and outputs the similarity to the calculated correct candidate category as a recognition result. The data is output to the unit 303.

以下、軸重要度設定ステップ７０３において、特徴軸の重要度を設定する方法の例について説明する。 Hereinafter, an example of a method for setting the importance of the feature axis in the axis importance setting step 703 will be described.

例１
特徴軸ｑｋｉの重要度を、［数１２］の係数λｋｉｉとζｋｉの絶対値の大きさであるとして、［数１７］のように定義する。 Example 1
The importance of the feature axis qki is defined as [Equation 17], assuming that it is the magnitude of the absolute values of the coefficients λkii and ζki of [Equation 12].

例２
［数１２］において、係数λｋｉｉと係数ζｋｉとの次数の違いを考慮して、［数３２］のように特徴軸ｑｋｉの重要度を定義する。 Example 2
In [Equation 12], considering the difference in the degree between the coefficient λkii and the coefficient ζki, the importance of the feature axis qki is defined as in [Equation 32].

例３
特徴軸ｑｋｉの重要度を決める場合に、係数λｋｉｉとζｋｉだけでなく、ｑｋｉ方向のｘの変動量ｑｋｉ・ｘの大きさのスケールも考慮に入れる。変動量のスケールは、ｑｋｉ方向のｘの分散値によって算出される。 Example 3
When determining the importance of the feature axis qki, not only the coefficients λkii and ζki but also the scale of the amount of variation qki · x of x in the qki direction is taken into consideration. The scale of the fluctuation amount is calculated by the variance value of x in the qki direction.

学習パターンをＮ個とし、平均ベクトルｍを［数３３］とすると、ｘｉ（ｉ＝１、…、Ｎ）の分布の共分散行列Σは、［数３４］で与えられる。 Assuming that there are N learning patterns and the average vector m is [Expression 33], the covariance matrix Σ of the distribution of xi (i = 1,..., N) is given by [Expression 34].

ｑｋｉ方向のｘの分散値ｖｋｉは、［数３５］によって算出できる。 The variance value vki of x in the qki direction can be calculated by [Equation 35].

これよって、特徴軸ｑｋｉの重要度は、例えば、［数３６］で定義する。また、［数３７］又は［数３８］等で、特徴軸ｑｋｉの重要度を定義する方法も考えられる。 Thus, the importance of the feature axis qki is defined by [Equation 36], for example. Also, a method of defining the importance of the feature axis qki using [Equation 37] or [Equation 38] may be considered.

例４
識別関数［数１２］を、［数３９］に示すように特徴軸ｑｋｉに制限した部分の値の分散値を重要度として定義する方法が考えられる。分散値σｋｉは、［数４０］を用いて［数４１］のように与えられる。重要度は、ｈｋｉ＝σｋｉと定義する。 Example 4
As shown in [Equation 39], the discriminant function [Equation 12] may be defined by defining the variance value of the portion limited to the feature axis qki as the importance. The variance value σki is given as [Equation 41] using [Equation 40]. The importance is defined as hki = σki.

（第２の実施形態）
第１の実施形態では、［数１２］に示すように識別関数を二次以下の関数と仮定し、また、特徴選択関数を線型関数と仮定した。第２の実施形態では、特徴選択関数ｆ（ｘ）又はｆｋ（ｘ）は、線型関数に限定されない。なお、第２の実施形態についても、識別関数は二次以下の関数と仮定する。 (Second Embodiment)
In the first embodiment, as shown in [Equation 12], the discriminant function is assumed to be a function of second order or lower, and the feature selection function is assumed to be a linear function. In the second embodiment, the feature selection function f (x) or fk (x) is not limited to a linear function. In the second embodiment, it is assumed that the discriminant function is a function of second order or lower.

第２の実施形態におけるパターン認識装置１０１の構成、及び、各モジュールが実行する処理は、第１の実施形態と同一であるため説明を省略する。以下、第１の実施形態との差異を中心に説明する。 Since the configuration of the pattern recognition apparatus 101 in the second embodiment and the processing executed by each module are the same as those in the first embodiment, the description thereof is omitted. Hereinafter, the difference from the first embodiment will be mainly described.

第２の実施形態では、学習フェーズにおける識別関数主要部分抽出部５０２が実行する処理、並びに、認識フェーズにおける大分類特徴選択部６０１、大分類識別部６０２及び詳細識別部６０３のそれぞれが実行する処理が異なる。 In the second embodiment, the processing executed by the discrimination function main part extraction unit 502 in the learning phase, and the processing executed by each of the major classification feature selection unit 601, the major classification identification unit 602, and the detailed identification unit 603 in the recognition phase. Is different.

以下、第２の実施形態における識別関数主要部分抽出部５０２、大分類特徴選択部６０１、大分類識別部６０２及び詳細識別部６０３について説明する。 Hereinafter, the identification function main part extraction unit 502, the major classification feature selection unit 601, the major classification identification unit 602, and the detailed identification unit 603 according to the second embodiment will be described.

まず、学習フェーズにおける識別関数主要部分抽出部５０２の処理について説明する。 First, the process of the discriminant function main part extraction unit 502 in the learning phase will be described.

識別関数主要部分抽出部５０２が実行する処理は、図４と同一であるが、具体的な処理が異なる。 The processing executed by the discriminant function main part extraction unit 502 is the same as that shown in FIG. 4, but the specific processing is different.

特徴軸選定ステップ７０２では、識別関数主要部分抽出部５０２が、主要軸選択の基準となる特徴軸を選択する。 In a feature axis selection step 702, the discriminant function main part extraction unit 502 selects a feature axis that is a reference for main axis selection.

具体的には、識別関数主要部分抽出部５０２は、［数１］に示す詳細識別関数を［数４５］のようにに変形する。 Specifically, the discriminant function main part extraction unit 502 transforms the detailed discriminant function shown in [Equation 1] into [Equation 45].

当該変形は、対称行列Ｗｋを［数２］のように定義し、Ｗｋの固有ベクトルを行ベクトルとする行列Ｐｋを求め、［数６］、［数４４］及び［数１１］を用いて得られる。 The modification is obtained by defining the symmetric matrix Wk as [Equation 2], obtaining a matrix Pk having the eigenvector of Wk as a row vector, and using [Equation 6], [Equation 44], and [Equation 11]. .

識別関数主要部分抽出部５０２は、ｑｋｉを特徴軸として選択する。第１の実施形態では、ｑｋｉはｎ次元ベクトルであったが、第２の実施形態ではｍ次元ベクトルである。 The discriminant function main part extraction unit 502 selects qki as a feature axis. In the first embodiment, qki is an n-dimensional vector, but in the second embodiment, it is an m-dimensional vector.

軸重要度設定ステップ７０３では、識別関数主要部分抽出部５０２が、特徴軸と詳細識別関数とを用いて、各特徴軸の重要度を算出する。 In the axis importance setting step 703, the discrimination function main part extraction unit 502 calculates the importance of each feature axis using the feature axis and the detailed discrimination function.

重要度の算出方法は様々考えられるが、例えば、第１の実施形態で示した方法を用いることができる。この場合、ｘはｙで置き換えられる。 There are various methods for calculating the importance. For example, the method shown in the first embodiment can be used. In this case, x is replaced with y.

具体的には、Ｌ個の主要軸は、［数２０］に示すｍ×ｍ行列の固有値が大きい上位Ｌ個の正規化された固有ベクトルｕ１、ｕ２、…、ｕＬとして得られる。なお、第１の実施形態では、［数２０］に示す行列はｎ×ｎ行列であり、ｕｉはｎ次元ベクトルであったのに対し、第２の実施形態では、［数２０］に示す行列はｍ×ｍ行列であり、ｕｉはｍ次元ベクトルである。 Specifically, L major axes are obtained as upper L normalized eigenvectors u1, u2,..., UL having large eigenvalues of the m × m matrix shown in [Equation 20]. In the first embodiment, the matrix shown in [Equation 20] is an n × n matrix and ui is an n-dimensional vector, whereas in the second embodiment, the matrix shown in [Equation 20]. Is an m × m matrix and ui is an m-dimensional vector.

具体的には、大分類識別関数は、［数２６］、［数２７］、［数２８］及び［数４６］を用いて［数３０］のように与えられる。また、大分類識別関数用の特徴選択関数は、［数３１］のように与えられる。 Specifically, the large classification identification function is given as [Equation 30] using [Equation 26], [Equation 27], [Equation 28], and [Equation 46]. Further, the feature selection function for the major classification function is given as [Equation 31].

出力ステップ７０６では、識別関数主要部分抽出部５０２が、大分類用の特徴選択関数［数３１］を特徴選択用辞書５０３に格納し、また、大分類識別関数［数３０］を識別用辞書５０４に格納する。 In the output step 706, the discrimination function main part extraction unit 502 stores the feature selection function for large classification [Equation 31] in the feature selection dictionary 503, and also stores the large classification discrimination function [Equation 30] in the discrimination dictionary 504. To store.

次に、認識フェーズの大分類特徴選択部６０１、大分類識別部６０２及び詳細識別部６０３が実行する処理について説明する。 Next, processing executed by the major classification feature selection unit 601, major classification identification unit 602, and detailed identification unit 603 in the recognition phase will be described.

大分類特徴選択部６０１は、特徴選択用辞書５０３に格納されている大分類用の特徴選択関数［数３１］を用いてＬ次元特徴ｚ［数４６］を抽出し、大分類識別部６０２に出力する。 The major classification feature selection unit 601 extracts the L-dimensional feature z [Equation 46] using the major classification feature selection function [Equation 31] stored in the feature selection dictionary 503, and sends it to the major classification identification unit 602. Output.

大分類識別部６０２は、識別用辞書５０４に格納されている大分類識別関数［数３０］を用いて類似度を算出する。大分類識別部６０２は、算出された類似度ｖｋ（ｘ）に基づいて、類似度が高い順にユーザによって指定された個数分の正解候補カテゴリを算出する。算出された正解候補カテゴリは、特徴選択部２０４に出力される。 The large classification identifying unit 602 calculates the similarity using the large classification identification function [Equation 30] stored in the identification dictionary 504. Based on the calculated similarity vk (x), the large classification identifying unit 602 calculates correct candidate categories for the number specified by the user in descending order of similarity. The calculated correct candidate category is output to the feature selection unit 204.

詳細識別部６０３は、特徴選択用辞書５０３に格納されている詳細識別関数ｕｋ（ｘ）を用いて正解候補カテゴリに対する類似度を算出し、算出された正解候補カテゴリに対する類似度を認識結果出力部３０３に出力する。 The detailed identification unit 603 calculates the similarity to the correct candidate category using the detailed identification function uk (x) stored in the feature selection dictionary 503, and recognizes the calculated similarity to the correct candidate category as a recognition result output unit. It outputs to 303.

（第３の実施形態）
第１の実施形態では、［数１２］に示すように識別関数を二次以下の関数と仮定し、また、特徴選択関数を線型関数と仮定した。第３の実施形態では、識別関数が二次以下の関数に限定されない場合について説明する。なお、第３の実施形態についても、特徴選択関数は線型関数と仮定する。 (Third embodiment)
In the first embodiment, as shown in [Equation 12], the discriminant function is assumed to be a function of second order or lower, and the feature selection function is assumed to be a linear function. In the third embodiment, a case will be described in which the discriminant function is not limited to a function of second order or lower. Also in the third embodiment, the feature selection function is assumed to be a linear function.

第３の実施形態におけるパターン認識装置１０１の構成、及び、各モジュールが実行する処理は、第１の実施形態と同一であるため説明を省略する。以下、第１の実施形態との差異を中心に説明する。 Since the configuration of the pattern recognition apparatus 101 in the third embodiment and the processing executed by each module are the same as those in the first embodiment, description thereof is omitted. Hereinafter, the difference from the first embodiment will be mainly described.

第３の実施形態では、学習フェーズにおける識別関数主要部分抽出部５０２が実行する処理、並びに、認識フェーズにおける大分類特徴選択部６０１、大分類識別部６０２及び詳細識別部６０３のそれぞれが実行する処理が異なる。 In the third embodiment, processing executed by the identification function main part extraction unit 502 in the learning phase, and processing executed by each of the large classification feature selection unit 601, the large classification identification unit 602, and the detailed identification unit 603 in the recognition phase. Is different.

以下、第３の実施形態における識別関数主要部分抽出部５０２、大分類特徴選択部６０１、大分類識別部６０２及び詳細識別部６０３について説明する。 Hereinafter, the identification function main part extraction unit 502, the major classification feature selection unit 601, the major classification identification unit 602, and the detailed identification unit 603 according to the third embodiment will be described.

具体的には、識別関数主要部分抽出部５０２は、ユーザによって指定される個数だけランダムなｍ次元ベクトルを特徴軸として選択する。ここでは、特徴軸は、Ｍ個選択されるベクトルとし、ａｋｉ（ｉ＝１、…、Ｍ）と表す。 Specifically, the discriminant function main part extraction unit 502 selects m-dimensional vectors random as the number specified by the user as feature axes. Here, the feature axes are M selected vectors and are represented as aki (i = 1,..., M).

重要度の算出方法は、次のようなものが考えられる。以下、ｍ次元特徴がカテゴリｋに依存しない場合、ｍ次元特徴ｙｋ、及び変換関数Ｙの添え字を省いたものを考えればよい。 The following calculation methods can be considered. Hereinafter, in the case where the m-dimensional feature does not depend on the category k, it may be considered that the m-dimensional feature yk and the subscript of the conversion function Y are omitted.

まず、ｍ次元ベクトルａｋｉをｎ次元特徴空間での表現に直す。Ａｋを［数４７］のように定義し、ＡｋＹｋを［数４８］のようにおく。 First, the m-dimensional vector aki is converted into an expression in the n-dimensional feature space. Ak is defined as [Equation 47], and AkYk is set as [Equation 48].

このとき、ベクトルｑｋｉがａｋｉのｎ次元特徴空間での表現である。 At this time, the vector qki is an expression in the n-dimensional feature space of aki.

特徴軸ｑｋｉ方向の識別関数の値の分散値を重要度とする。［数４７］に示す関数ｕｋｉは、詳細識別関数をｑｋｉ軸に制限したものである。具体的には、Ｎ個の学習パターンをｘｉとおくと、この関数の値の分散値σｋｉは、［数４０］を用いて［数４１］によって算出される。これを重要度としてｈｋｉ＝σｋｉとすればよい。 The variance of discriminant function values in the characteristic axis qki direction is defined as the importance. The function uki shown in [Equation 47] is obtained by limiting the detailed identification function to the qki axis. Specifically, when N learning patterns are set to xi, the variance value σki of the value of this function is calculated by [Equation 41] using [Equation 40]. This may be set as hki = σki.

具体的には、Ｌ個の主要軸は、［数２０］に示すｎ×ｎ行列の固有値が大きい上位Ｌ個の固有ベクトルｕ１、ｕ２、…、ｕＬとして得られる。 Specifically, the L major axes are obtained as upper L eigenvectors u1, u2,..., UL having a large eigenvalue of the n × n matrix shown in [Equation 20].

大分類識別関数生成ステップ７０５では、識別関数主要部分抽出部５０２が、主要軸と詳細識別関数とを用いて、Ｌ個の固有ベクトルｕ１、ｕ２、…、ｕＬによって生成されるＬ次元特徴空間に詳細識別関数を制限した大分類識別関数として生成する。 In the large classification discriminant function generation step 705, the discriminant function main part extraction unit 502 uses the main axis and the detailed discriminant function to detail the L-dimensional feature space generated by the L eigenvectors u1, u2,. Generated as a large class discriminant function with limited discriminant function.

具体的には、大分類識別関数は、［数５０］に示すように生成される。 Specifically, the large classification identification function is generated as shown in [Equation 50].

また、大分類識別関数用の特徴選択関数は、［数３１］の行列ＵＬを用いて、［数５３］に示すような行列によって与えられる。 Further, the feature selection function for the large classification identification function is given by a matrix as shown in [Equation 53] using the matrix UL of [Equation 31].

出力ステップ７０６では、識別関数主要部分抽出部５０２が、大分類用の特徴選択関数［数５３］を特徴選択用辞書５０３に格納し、また、大分類識別関数［数５０］を識別用辞書５０４に格納する。 In the output step 706, the discriminant function main part extraction unit 502 stores the feature selection function [Equation 53] for large classification in the feature selection dictionary 503, and also stores the large classification discrimination function [Equation 50] in the discrimination dictionary 504. To store.

大分類特徴選択部６０１は、特徴選択用辞書５０３に格納されている大分類用の特徴選択関数［数５３］を用いてＬ次元特徴ｚ［数４６］を抽出し、大分類識別部６０２に出力する。 The major classification feature selection unit 601 extracts the L-dimensional feature z [Equation 46] using the major classification feature selection function [Equation 53] stored in the feature selection dictionary 503, and sends it to the major classification identification unit 602. Output.

大分類識別部６０２は、識別用辞書５０４に格納されている大分類識別関数［数５０］を用いて類似度を算出する。大分類識別部６０２は、算出された類似度ｖｋ（ｘ）に基づいて、類似度が高い順にユーザによって指定された個数分の正解候補カテゴリを算出する。算出された正解候補カテゴリは、特徴選択部２０４に出力される。 The large classification identifying unit 602 calculates the similarity using the large classification identification function [Equation 50] stored in the identification dictionary 504. Based on the calculated similarity vk (x), the large classification identifying unit 602 calculates correct candidate categories for the number specified by the user in descending order of similarity. The calculated correct candidate category is output to the feature selection unit 204.

詳細識別部６０３は、特徴選択用辞書５０３に格納されている識別関数ｕｋ（ｘ）を用いて正解候補カテゴリに対する類似度を算出し、算出された正解候補カテゴリに対する類似度を認識結果出力部３０３に出力する。 The detailed identification unit 603 calculates the similarity to the correct candidate category using the identification function uk (x) stored in the feature selection dictionary 503, and uses the recognition result output unit 303 to calculate the similarity to the calculated correct candidate category. Output to.

（第４の実施形態）
第１の実施形態では、［数１２］に示すように識別関数を二次以下の関数と仮定し、また、特徴選択関数を線型関数と仮定した。第４の実施形態では、識別関数が二次以下の関数に限定されない場合について説明する。さらに、第４の実施形態では、特徴選択関数は線型関数に限定されない。 (Fourth embodiment)
In the first embodiment, as shown in [Equation 12], the discriminant function is assumed to be a function of second order or lower, and the feature selection function is assumed to be a linear function. In the fourth embodiment, a case where the discriminant function is not limited to a function of second order or lower will be described. Furthermore, in the fourth embodiment, the feature selection function is not limited to a linear function.

第４の実施形態におけるパターン認識装置１０１の構成、及び、各モジュールが実行する処理は、第１の実施形態と同一であるため説明を省略する。以下、第１の実施形態との差異を中心に説明する。 Since the configuration of the pattern recognition apparatus 101 in the fourth embodiment and the processing executed by each module are the same as those in the first embodiment, the description thereof is omitted. Hereinafter, the difference from the first embodiment will be mainly described.

第４の実施形態では、学習フェーズにおける識別関数主要部分抽出部５０２が実行する処理、並びに、認識フェーズにおける大分類特徴選択部６０１、大分類識別部６０２及び詳細識別部６０３のそれぞれが実行する処理が異なる。 In the fourth embodiment, processing executed by the identification function main part extraction unit 502 in the learning phase, and processing executed by each of the large classification feature selection unit 601, the large classification identification unit 602, and the detailed identification unit 603 in the recognition phase. Is different.

以下、第４の実施形態における識別関数主要部分抽出部５０２、大分類特徴選択部６０１、大分類識別部６０２及び詳細識別部６０３について説明する。 Hereinafter, the identification function main part extraction unit 502, the major classification feature selection unit 601, the major classification identification unit 602, and the detailed identification unit 603 according to the fourth embodiment will be described.

具体的には、識別関数主要部分抽出部５０２が、ユーザによって指定される個数だけランダムなｍ次元ベクトルを特徴軸としてに選択する。ここでは、盗聴軸は、Ｍ個選択されるベクトルとし、ａｋｉ（ｉ＝１、…、Ｍ）と表す。 Specifically, the discriminant function main part extraction unit 502 selects m-dimensional vectors random as the number specified by the user as feature axes. Here, the wiretapping axis is a vector selected from M pieces, and is represented as aki (i = 1,..., M).

重要度の算出方法は次のようなものが考えられる。以下、ｍ次元特徴がカテゴリｋに依存しない場合、ｍ次元特徴ｙｋ、及び変換関数Ｙの添え字を省いたものを考えればよい。 The importance calculation method is as follows. Hereinafter, in the case where the m-dimensional feature does not depend on the category k, it may be considered that the m-dimensional feature yk and the subscript of the conversion function Y are omitted.

特徴軸ｑｋｉ方向の識別関数の値の分散値を重要度にとる。［数５１］に示す関数ｕｋｉは、詳細識別関数をｑｋｉ軸に制限したものである。 The variance of the discriminant function value in the characteristic axis qki direction is taken as the importance. The function uki shown in [Formula 51] is obtained by limiting the detailed identification function to the qki axis.

具体的には、Ｎ個の学習パターンをｘｉとおくと、この関数の値の分散値σｋｉは、［数５１］を用いて［数５２］によって算出される。これを重要度としてｈｋｉ＝σｋｉとすればよい。 Specifically, when N learning patterns are set to xi, the variance value σki of the value of this function is calculated by [Equation 52] using [Equation 51]. This may be set as hki = σki.

主要軸算出ステップ７０４では、識別関数主要部分抽出部５０２が、算出された重要度及び特徴軸を用いて特徴軸を統合し、Ｌ個の主要軸を選定する。 In the main axis calculation step 704, the discriminant function main part extraction unit 502 integrates the feature axes using the calculated importance and feature axes, and selects L main axes.

具体的には、Ｌ個の主要軸は、［数２０］に示すｍ×ｍ行列の固有値が大きい上位Ｌ個の固有ベクトルｕ１、ｕ２、…、ｕＬとして得られる。 Specifically, the L major axes are obtained as upper L eigenvectors u1, u2,..., UL having a large eigenvalue of the m × m matrix shown in [Equation 20].

その後、識別関数主要部分抽出部５０２は大分類識別関数生成ステップ７０５に進む。 Thereafter, the discrimination function main part extraction unit 502 proceeds to the large classification discrimination function generation step 705.

具体的には、大分類識別関数は、［数５２］のように与えられる。また、大分類識別関数用の特徴選択関数は、［数３１］の行列によって与えられる。 Specifically, the large classification discriminant function is given as [Equation 52]. The feature selection function for the major classification function is given by the matrix of [Equation 31].

出力ステップ７０６では、識別関数主要部分抽出部５０２が、大分類用の特徴選択関数［数３１］を特徴選択用辞書５０３に格納し、また、大分類識別関数［数５２］を識別用辞書５０４に格納する。 In the output step 706, the discriminant function main part extraction unit 502 stores the large classification feature selection function [Equation 31] in the feature selection dictionary 503 and the large classification discriminant function [Equation 52]. To store.

大分類識別部６０２は、識別用辞書５０４に格納されている大分類識別関数［数５２］用いて類似度を算出する。大分類識別部６０２は、算出された類似度ｖｋ（ｘ）に基づいて、類似度が高い順にユーザによって指定された個数分の正解候補カテゴリを算出する。算出された正解候補カテゴリは、特徴選択部２０４に出力される。 The large classification identifying unit 602 calculates the similarity using the large classification identification function [Equation 52] stored in the identification dictionary 504. Based on the calculated similarity vk (x), the large classification identifying unit 602 calculates correct candidate categories for the number specified by the user in descending order of similarity. The calculated correct candidate category is output to the feature selection unit 204.

詳細識別部６０３は、特徴選択用辞書５０３に格納されている識別関数ｕｋ（ｘ）を用いて正解候補カテゴリに対する類似度を算出し、算出された正解候補カテゴリに対する類似度を認識結果出力部に出力する。 The detailed identification unit 603 calculates the similarity to the correct candidate category using the identification function uk (x) stored in the feature selection dictionary 503, and uses the calculated similarity to the correct candidate category to the recognition result output unit. Output.

本発明の一形態によれば、大分類識別関数が詳細識別関数の制限関数として生成されるため、認識処理の精度を保ち、かつ、高速化できる。 According to one aspect of the present invention, since the large classification discriminant function is generated as the limit function of the detailed discriminant function, the accuracy of the recognition process can be maintained and the speed can be increased.

また、本発明における大分類識別関数は、詳細識別関数の制限関数として生成されるため、大分類識別処理と詳細識別処理とで別々の学習アルゴリズムを用いる必要がない。したがって、構成の煩雑さを回避できる。 In addition, since the large classification function in the present invention is generated as a limit function of the detailed classification function, it is not necessary to use different learning algorithms for the large classification process and the detailed classification process. Therefore, complexity of the configuration can be avoided.

また、本発明の一形態によれば、任意の学習アルゴリズムによって作成された詳細識別関数から大分類識別関数が生成されるため、精度が高い詳細識別関数を用いることで、認識精度を向上させることができる。 In addition, according to one aspect of the present invention, since a large classification discriminant function is generated from a detailed discriminant function created by an arbitrary learning algorithm, it is possible to improve recognition accuracy by using a high-precision discriminant function. Can do.

１０１パターン認識装置
１０２入力装置
１０３表示装置
１０４パターン取得装置
１０５通信装置
１０６演算装置（ＣＰＵ）
１０７外部記憶装置（ＨＤＤ、メモリ）
２０１パターン入力部
２０２特徴抽出部
２０３特徴選択辞書生成部
２０４特徴選択部
２０５識別用辞書生成部
２０７学習用パターンＤＢ
２０８特徴選択用辞書
２０９識別用辞書
３０１大分類識別部
３０２詳細識別部
３０３認識結果出力部
３０４認識対象パターンＤＢ
５０１識別関数生成部
５０２識別関数主要部分抽出部
５０３特徴選択用辞書
５０４識別用辞書
６０１大分類特徴選択部
６０２大分類識別部
６０３詳細識別部
７０１識別関数入力ステップ
７０２特徴軸選定ステップ
７０３軸重要度設定ステップ
７０４主要軸算出ステップ
７０５大分類識別関数生成ステップ
７０６出力ステップ
８０１入力ステップ
８０２前処理ステップ
８０３正規化ステップ
８０４文字特徴抽出ステップ
８０５出力ステップ 101 Pattern Recognition Device 102 Input Device 103 Display Device 104 Pattern Acquisition Device 105 Communication Device 106 Arithmetic Device (CPU)
107 External storage device (HDD, memory)
201 pattern input unit 202 feature extraction unit 203 feature selection dictionary generation unit 204 feature selection unit 205 identification dictionary generation unit 207 learning pattern DB
208 Feature Selection Dictionary 209 Identification Dictionary 301 Major Classification Identification Unit 302 Detailed Identification Unit 303 Recognition Result Output Unit 304 Recognition Target Pattern DB
501 Discrimination function generation unit 502 Discrimination function main part extraction unit 503 Feature selection dictionary 504 Discrimination dictionary 601 Major classification feature selection unit 602 Major classification identification unit 603 Detailed classification unit 701 Discrimination function input step 702 Feature axis selection step 703 Axis importance Setting step 704 Main axis calculation step 705 Major classification function generation step 706 Output step 801 Input step 802 Preprocessing step 803 Normalization step 804 Character feature extraction step 805 Output step

Claims

A dictionary recognition apparatus for pattern recognition comprising a processor and a storage medium connected to the processor,
The storage medium stores a learning pattern database composed of a plurality of learning patterns,
The pattern recognition dictionary generation device includes:
A pattern input unit for acquiring each of the learning patterns as one category from the learning pattern database;
A feature extraction unit that extracts n-dimensional features for each of the acquired categories;
Using the extracted n-dimensional feature, a feature selection function for converting the n-dimensional feature into an m-dimensional feature that is a dimension equal to or less than the n-dimension is generated, and the generated feature selection function is used as a feature selection dictionary. A feature selection dictionary generator for storing in the storage medium;
A feature selection unit that converts the extracted n-dimensional feature into the m-dimensional feature using the feature selection function;
Using the converted m-dimensional feature, a detailed identification function on the m-dimensional feature space for calculating the similarity of the recognition target pattern for each category is generated, and the generated detailed identification function is used as an identification dictionary. A discriminant function generator for storing in the storage medium as
Large classification that transforms the m-dimensional feature into an L-dimensional feature on the L-dimensional feature space that is a dimension less than the m-dimension, is a subspace of the n-dimensional feature space, and is a subspace of the m-dimensional feature space A large classification for generating a feature selection function and calculating the similarity of the pattern to be authenticated for each category on the L-dimensional feature space by converting the detailed identification function as a function on the L-dimensional feature space An identification function is generated, the generated major classification feature selection function is stored in the storage medium as the feature selection dictionary, and the generated major classification identification function is stored in the storage medium as the identification dictionary A function main part extractor;
A pattern recognition dictionary generating apparatus comprising:

The discriminant function main part extraction unit includes:
Performing a detailed identification function acquisition step of acquiring the detailed function;
Performing a feature axis selection step of selecting M feature axes from the n-dimensional feature space using the acquired detailed identification function;
Performing an axis importance determining step for calculating the importance of the feature axis;
Performing a main axis calculating step of calculating the L main axes by integrating the feature axes;
The large classification discriminant function generation step of generating the large classification discriminant function by converting the detailed discriminant function as a function on the L-dimensional feature space generated by the main axis is performed. The pattern recognition dictionary generation device described.

The feature selection function is a linear function, and the detailed discriminant function uk (x) for the n-dimensional feature x is a second-order or lower-order polynomial function expressed by Equation 1,
The feature axis selection step includes:
The discriminant function main part extraction unit uses the n-dimensional feature or the m-dimensional feature y to characterize the vector qki obtained by transforming the detailed discriminant function uk (x) as shown in Equation 2 or Equation 3. Including selecting as an axis,
The axis importance determination step is
The pattern according to claim 2, wherein the discriminant function main part extraction unit includes a step of setting the importance of the feature axis qki using the function hki of the coefficient λkii and the coefficient ζki in Expression 2 or 3. A recognition dictionary generator.

The main axis calculation step includes:
The discriminant function main part extraction unit calculating eigenvalues of the matrix Q shown in Formula 4 generated from the feature axis qki and the axis importance hki;
The discriminant function main part extraction unit selects L eigenvectors in descending order of the calculated eigenvalues among eigenvectors of the matrix Q, and calculates the selected eigenvectors as main axes. The pattern recognition dictionary generation device according to claim 3, further comprising:

The pattern recognition dictionary generation device includes an input unit for a user to operate the pattern recognition dictionary generation device,
In the feature axis selection step, the discriminant function main part extraction unit randomly selects M arbitrary vectors as the feature axes, or selects M vectors based on an instruction from the user. The pattern recognition dictionary generation device according to claim 2, further comprising a step of selecting as a pattern recognition dictionary.

In the axis importance determination step, the discriminant function main part extraction unit has transformed the detailed discriminant function for the N learning patterns xi as a function on the M-dimensional space generated by the feature axis qki. 3. The pattern recognition dictionary generation device according to claim 2, wherein a function d (xi) is calculated, and the axis importance h is calculated as a function of a variance value σ of the calculated function d (xi). .

A pattern recognition apparatus comprising a processor and a storage medium connected to the processor,
The storage medium is
A recognition target pattern database composed of a plurality of recognition target patterns, and
A feature selection function that converts an n-dimensional feature extracted from the recognition target pattern into an m-dimensional feature that is a dimension of n or less, and a subspace of the n-dimensional feature space that is a dimension of m or less And a feature selection dictionary that stores a large-class feature selection function for converting the m-dimensional feature into an L-dimensional feature on an L-dimensional feature space that is a subspace of the m-dimensional feature space;
A detailed discriminant function on the m-dimensional feature space for calculating the similarity of the recognition target pattern to the correct candidate pattern, and the detailed discriminant function converted as a function on the L-dimensional feature space, and the L-dimensional feature An identification dictionary that stores a large classification identification function for calculating the similarity of the recognition target pattern to each correct answer candidate in space;
Store
The pattern recognition device includes:
A pattern input unit for obtaining the recognition target pattern from the recognition target pattern database;
A feature extraction unit for extracting an n-dimensional feature of the acquired recognition target pattern;
A major classification feature selection unit that converts the extracted n-dimensional feature into the L-dimensional feature using the major classification feature selection function;
A large classification identifying unit that calculates the similarity of the pattern to be recognized with respect to the correct candidate pattern using the converted L-dimensional feature and the large classification identifying function, and selects one or more correct candidate patterns; ,
A feature selection unit that converts the n-dimensional feature into the m-dimensional feature using the feature selection function;
A detailed identification unit that calculates the similarity of the recognition target pattern to the selected correct candidate pattern using the detailed identification function and the converted m-dimensional feature;
A recognition result output unit that outputs a recognition result for the recognition target pattern based on the calculated similarity;
A pattern recognition apparatus comprising:

The major classification function is
Performing a detailed identification function acquisition step of acquiring the detailed function;
Performing a feature axis selection step of selecting M feature axes from the n-dimensional feature space using the acquired detailed identification function;
Performing an axis importance determining step for calculating the importance of the feature axis;
Performing a main axis calculating step of calculating the L main axes by integrating the feature axes;
The major axis is generated by executing a large classification function generating step for generating the large classification function by converting the detailed classification function as a function on the L-dimensional feature space generated by the main axis. The pattern recognition apparatus according to claim 7.

The feature selection function is a linear function, and the detailed discriminant function uk (x) for the n-dimensional feature x is a second-order or lower polynomial function shown in Equation 5,
In the feature axis selection step, using the n-dimensional feature or the m-dimensional feature y, the vector qki obtained by transforming the detailed identification function uk (x) as shown in Equation 6 or Equation 7 is used as the feature axis. Selected
9. The pattern recognition apparatus according to claim 8, wherein, in the axis importance determination step, the importance of the feature axis qki is set using the function hki of the coefficient λkii and the coefficient ζki in Expression 6 or 7.

In the principal axis calculation step, the eigenvalue of the matrix Q shown in Equation 8 generated from the feature axis qki and the axis importance hki is calculated, and among the eigenvectors of the matrix Q, the calculated eigenvalue is large The pattern recognition apparatus according to claim 9, wherein L eigenvectors are selected in order, and the selected eigenvectors are calculated as main axes.

In the feature axis selection step, M arbitrary vectors are randomly selected as the feature axes, or M vectors are selected as the feature axes based on a preset instruction. Item 9. The pattern recognition device according to Item 8.

In the axis importance determination step, the detailed discriminant function for the N patterns xi is calculated as a function d (xi) transformed as a function in the M-dimensional space generated by the feature axis qki, and is calculated. The pattern recognition apparatus according to claim 8, wherein the axis importance h is calculated as a function of a variance value σ of the function d (xi).

A pattern recognition apparatus comprising a processor and a storage medium connected to the processor,
The storage medium stores a learning pattern database composed of a plurality of learning patterns, and a recognition target pattern database composed of a plurality of recognition target patterns,
The pattern recognition dictionary generation device includes:
A pattern input unit that acquires each of the learning patterns as one category from the learning pattern database, or a pattern input unit that acquires the recognition target pattern from the recognition target pattern database;
Extracting a n-dimensional feature for each of the acquired categories, and extracting an n-dimensional feature of the acquired recognition target pattern;
Using the extracted n-dimensional feature, a feature selection function that converts the n-dimensional feature into an m-dimensional feature that is a dimension equal to or less than the n-dimensional feature is generated, and the generated feature selection function is used as a feature selection dictionary. A feature selection dictionary generating unit for storing in the storage medium as
A feature selection unit that converts the n-dimensional feature into the m-dimensional feature using the feature selection function;
Using the converted m-dimensional feature, a detailed identification function on an m-dimensional feature space for calculating a pattern similarity to each category is generated, and the generated detailed identification function is stored as an identification dictionary. A discriminant function generator to be stored in the medium;
A large-scale transform that converts the m-dimensional feature into an L-dimensional feature on the L-dimensional feature space that is a dimension less than the m-dimension, a subspace of the n-dimensional feature space, and a subspace of the m-dimensional feature space. Generating a classification feature selection function, converting the detailed identification function as a function on the L-dimensional feature space, and calculating a large classification identification function for calculating the similarity of the pattern to each category on the L-dimensional feature space Main part of the discriminant function for generating and storing the generated major classification feature selection function in the storage medium as the feature selection dictionary and storing the generated major classification discrimination function in the storage medium as the discrimination dictionary An extractor;
A major classification feature selection unit that converts the extracted n-dimensional feature into the L-dimensional feature using the major classification feature selection function;
Using the converted L-dimensional features and the large classification identification function, a similarity of the pattern to be recognized for each category is calculated, and a large classification identification unit that selects one or more correct candidate patterns;
A feature selection unit that converts the n-dimensional feature into the m-dimensional feature using the feature selection function;
A detailed identification unit that calculates the similarity of the pattern to be recognized with respect to the selected correct candidate pattern using the detailed identification function and the converted m-dimensional feature;
A recognition result output unit that outputs a recognition result for the recognition target pattern based on the calculated similarity;
A pattern recognition apparatus comprising:

The discriminant function main part extraction unit includes:
Performing a detailed identification function acquisition step of acquiring the detailed function;
Performing a feature axis selection step of selecting M feature axes from the n-dimensional feature space using the acquired detailed identification function;
Performing an axis importance determining step for calculating the importance of the feature axis;
Performing a main axis calculating step of calculating the L main axes by integrating the feature axes;
The large classification discrimination function generating step of generating the large classification identification function by converting the detailed identification function as a function on the L-dimensional feature space generated by the main axis is performed. The pattern recognition dictionary generation device described.

The feature selection function is a linear function, and the detailed discriminant function uk (x) for the n-dimensional feature x is a second-order or lower polynomial function shown in Equation 9,
The feature axis selection step includes:
The discriminant function main part extraction unit uses the n-dimensional feature or the m-dimensional feature y to characterize the vector qki obtained by transforming the detailed discriminant function uk (x) as shown in Equation 10 or Equation 11. Including selecting as an axis,
The axis importance determination step is
15. The pattern according to claim 14, wherein the discriminant function main part extracting unit includes a step of setting the importance of the feature axis qki using the function hki of the coefficient λkii and the coefficient ζki in Expression 10 or 11. Recognition device.

The main axis calculation step includes:
The discriminant function main part extraction unit calculating an eigenvalue of the matrix Q shown in Formula 12 generated from the feature axis qki and the axis importance hki;
The discriminant function main part extraction unit selects L eigenvectors in descending order of the calculated eigenvalues among eigenvectors of the matrix Q, and calculates the selected eigenvectors as main axes; The pattern recognition apparatus according to claim 15, comprising:

The pattern recognition dictionary generation device includes an input unit for a user to operate the pattern recognition dictionary generation device,
In the feature axis selection step, the discriminant function main part extraction unit randomly selects M arbitrary vectors as the feature axes, or M vectors as the feature axes based on an instruction from the user. The pattern recognition apparatus according to claim 14, further comprising a selecting step.

A pattern recognition dictionary generation method in a pattern recognition dictionary generation device comprising a processor and a storage medium connected to the processor,
The storage medium stores a learning pattern database composed of a plurality of learning patterns,
The method
The pattern recognition dictionary generation device acquires each learning pattern as one category from the learning pattern database;
A second step in which the pattern recognition dictionary generating device extracts an n-dimensional feature for each of the acquired categories;
The pattern recognition dictionary generation device generates a feature selection function that converts the n-dimensional feature into an m-dimensional feature that is a dimension equal to or less than the n-dimension using the extracted n-dimensional feature, and the generated A third step of storing a feature selection function in the storage medium as a feature selection dictionary;
A fourth step in which the dictionary recognition device for pattern recognition converts the extracted n-dimensional feature into the m-dimensional feature using the feature selection function;
The dictionary recognition device for pattern recognition generates a detailed identification function on an m-dimensional feature space for calculating a similarity of a pattern to be recognized for each category using the converted m-dimensional feature, and the generation A fifth step of storing the detailed identification function performed as an identification dictionary in the storage medium;
The pattern recognition dictionary generating device converts the dimension less than or equal to the m dimension into an L dimension feature on the L dimension feature space that is a subspace of the n dimension feature space and a subspace of the m dimension feature space. By generating a large-scale feature selection function for converting the m-dimensional feature and converting the detailed discriminant function as a function on the L-dimensional feature space, the similarity of the pattern to each category on the L-dimensional feature space is obtained. Generating a large classification identification function for calculation, storing the generated large classification feature selection function in the storage medium as the feature selection dictionary, and using the generated large classification identification function as the identification dictionary A sixth step of storing in a storage medium;
A pattern recognition dictionary generating method characterized by comprising:

The sixth step includes
The pattern recognition dictionary generating device acquires a detailed identification function for acquiring the detailed function;
A feature axis selection step in which the dictionary recognition device for pattern recognition selects M feature axes from the n-dimensional feature space using the acquired detailed identification function;
The pattern recognition dictionary generating device calculates an importance of the feature axis, an axis importance determining step;
A principal axis calculating step in which the pattern recognition dictionary generating device calculates the L principal axes by integrating the feature axes;
The pattern recognition dictionary generation device generates a large classification function by generating the large classification function by converting the detailed classification function as a function on the L-dimensional feature space generated by the main axis;
The pattern recognition dictionary generation method according to claim 19, further comprising:

The feature selection function is a linear function, and the detailed discriminant function uk (x) for the n-dimensional feature x is a second-order or lower polynomial function shown in Equation 13,
The feature axis selection step includes:
The pattern recognition dictionary generation device uses the n-dimensional feature or the m-dimensional feature y to transform the detailed identification function uk (x) as shown in Equation 14 or Equation 15 into a vector qki Including the step of selecting as
The axis importance determination step is
The pattern recognition dictionary generating apparatus includes the step of setting the importance of the feature axis qki using the function hki of the coefficient λkii and the coefficient ζki in the equation (14) or (15). Dictionary generation method.

The main axis calculation step includes:
The discriminant function main part extraction unit calculating an eigenvalue of the matrix Q shown in Formula 16 generated from the feature axis qki and the axis importance hki;
The discriminant function main part extraction unit selects L eigenvectors in descending order of the calculated eigenvalues among eigenvectors of the matrix Q, and calculates the selected eigenvectors as main axes. The pattern recognition dictionary generation method according to claim 21, further comprising:

The pattern recognition dictionary generation device includes an input unit for a user to operate the pattern recognition dictionary generation device,
In the feature axis selection step, the pattern recognition dictionary generating device randomly selects M arbitrary vectors as the feature axes, or selects M vectors as the feature axes based on an instruction from the user. 21. The pattern recognition dictionary generation method according to claim 20, further comprising the step of:

In the axis importance determination step, the discriminant function main part extraction unit has transformed the detailed discriminant function for the N learning patterns xi as a function on the M-dimensional space generated by the feature axis qki. 21. The method of claim 20, further comprising: calculating a function d (xi); and calculating the axis importance h as a function of a variance value σ of the calculated function d (xi). Pattern recognition dictionary generation method.