JP2010086466A

JP2010086466A - Data classification device and program

Info

Publication number: JP2010086466A
Application number: JP2008257508A
Authority: JP
Inventors: Hikari So; 暉曹; Takashi Naito; 貴志内藤; Yoshiki Ninomiya; 芳樹二宮
Original assignee: Toyota Central R&D Labs Inc
Current assignee: Toyota Central R&D Labs Inc
Priority date: 2008-10-02
Filing date: 2008-10-02
Publication date: 2010-04-15

Abstract

<P>PROBLEM TO BE SOLVED: To classify a data by reduced memories in a short calculation time. <P>SOLUTION: An SVM classification device 34 classifies the propriety as a face image, as to a pick-up image of a feature vector generated as a test data by a feature vector extracting part 32, using an SVM classification expression f(x) shown in Fig.11 obtained by learning, based on a quadratic function k(x, z) shown in Fig.11 approximated with a kernel function expressed by an exponential function, and based on a plurality of feature vectors prepared preliminarily as a training data, where x and z represent the feature vectors, γ represents a parameter in the kernel function, a, c and q represents coefficients determined by the approximation of the kernel function, x<SB>j</SB>represents a j-dimensional feature amount of the feature vector x, x<SB>k</SB>represents a k-dimensional feature amount of the feature vector x, v<SB>i</SB>represents a support vector, v<SB>ij</SB>represents a j-dimensional feature amount of the support vector v<SB>i</SB>, v<SB>ik</SB>represents a k-dimensional feature amount of the support vector v<SB>i</SB>, y<SB>i</SB>represents a label of the support vector v<SB>i</SB>, n is the number of the support vectors, d represents the dimension number of the feature vectors, and α<SB>1</SB>an b represent coefficients determined by the learning. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、データ分類装置及びプログラムに係り、特に、ＳＶＭ分類式を用いてテストデータを分類するデータ分類装置及びプログラムに関する。 The present invention relates to a data classification apparatus and program, and more particularly, to a data classification apparatus and program for classifying test data using an SVM classification formula.

従来より、ＳＶＭ分類器を使用して、テスト画像が顔画像であるか分類する方法が知られている（特許文献１）。
特開２００７−２３８０９０号公報 Conventionally, a method of classifying whether a test image is a face image using an SVM classifier is known (Patent Document 1).
JP 2007-238090 A

しかしながら、上記特許文献１の技術では、扱う問題が複雑な場合、ＳＶＭ分類器で使用されるサポートベクトルの数が非常に多くなるため、必要なメモリ量が膨大になると共に、計算時間が多くかかってしまう、という問題がある。 However, in the technique of the above-mentioned Patent Document 1, when the problem to be handled is complicated, the number of support vectors used in the SVM classifier becomes very large, so that a necessary amount of memory becomes enormous and a long calculation time is required. There is a problem that.

本発明は、上記の問題点を解決するためになされたもので、少ないメモリ量で、かつ、短い計算時間でデータを分類することができるデータ分類装置及びプログラムを提供することを目的とする。 The present invention has been made to solve the above problems, and an object of the present invention is to provide a data classification apparatus and program capable of classifying data with a small amount of memory and a short calculation time.

上記の目的を達成するために第１の発明に係るデータ分類装置は、指数関数で表わされるカーネル関数を近似した以下の式で表される二次関数ｋ（ｘ、ｚ）と、訓練データとして予め用意された複数の特徴ベクトルとに基づいて学習することにより得られた、以下の式で表されるＳＶＭ分類式ｆ（ｘ）を用いて、テストデータとして入力された特徴ベクトルを分類するものである。 In order to achieve the above object, a data classification apparatus according to the first invention includes a quadratic function k (x, z) represented by the following equation that approximates a kernel function represented by an exponential function, and training data: Classifying feature vectors input as test data using an SVM classification formula f (x) expressed by the following formula obtained by learning based on a plurality of feature vectors prepared in advance It is.

また、ｘ、ｚは特徴ベクトルであり、γはカーネル関数におけるパラメータであり、ａ，ｃ，ｑは、前記カーネル関数の近似により決定される係数である。ｘ_ｊは特徴ベクトルｘのｊ次元の特徴量であり、ｘ_ｋは特徴ベクトルｘのｋ次元の特徴量である。ｖ_ｉはサポートベクトルであり、ｖ_ｉｊはサポートベクトルｖ_ｉのｊ次元の特徴量であり、ｖ_ｉｋはサポートベクトルｖ_ｉのｋ次元の特徴量である。ｙ_ｉはサポートベクトルｖ_ｉのラベルであり、ｎはサポートベクトルの個数であり、ｄは前記特徴ベクトルの次元数である。α_ｉ，ｂは前記学習により決定される係数である。 Further, x and z are feature vectors, γ is a parameter in a kernel function, and a, c, and q are coefficients determined by approximation of the kernel function. x _j is a j-dimensional feature quantity of the feature vector x, and x _k is a k-dimensional feature quantity of the feature vector x. v _i is the support _{vector, v ij} is the feature quantity of j dimension of support vectors _{v _i,} _{v ik} is the characteristic of k-dimensional support vector _{v i.} y _i is the label of the support vector v _i , n is the number of support vectors, and d is the number of dimensions of the feature vector. α _i and b are coefficients determined by the learning.

第２の発明に係るプログラムは、コンピュータを、指数関数で表わされるカーネル関数を近似した以下の式で表される二次関数ｋ（ｘ、ｚ）と、訓練データとして予め用意された複数の特徴ベクトルとに基づいて学習することにより得られた、以下の式で表されるＳＶＭ分類式ｆ（ｘ）を用いて、テストデータとして入力された特徴ベクトルを分類する手段として機能させるためのプログラムである。 The program according to the second aspect of the present invention provides a computer having a quadratic function k (x, z) represented by the following formula approximating a kernel function represented by an exponential function and a plurality of features prepared in advance as training data: A program for functioning as a means for classifying feature vectors input as test data using the SVM classification formula f (x) represented by the following formula, obtained by learning based on vectors: is there.

第１の発明及び第２の発明によれば、指数関数で表わされるカーネル関数を近似した上記の式で表される二次関数ｋ（ｘ、ｚ）と、訓練データとして予め用意された複数の特徴ベクトルとに基づいて学習することにより得られた、上記の式で表されるＳＶＭ分類式ｆ（ｘ）を用いて、テストデータとして入力された特徴ベクトルを分類する。 According to the first invention and the second invention, a quadratic function k (x, z) represented by the above equation approximating a kernel function represented by an exponential function, and a plurality of prepared beforehand as training data The feature vectors input as test data are classified using the SVM classification formula f (x) expressed by the above formula obtained by learning based on the feature vectors.

このように、パラメータの少ないＳＶＭ分類式を用いてテストデータを分類することにより、少ないメモリ量で、かつ、短い計算時間でデータを分類することができる。 As described above, by classifying test data using the SVM classification formula having a small number of parameters, the data can be classified with a small amount of memory and a short calculation time.

第３の発明に係るデータ分類装置は、指数関数で表わされるカーネル関数を近似した以下の式で表される三次関数ｋ（ｘ、ｚ）と、訓練データとして予め用意された複数の特徴ベクトルとに基づいて学習することにより得られた、以下の式で表されるＳＶＭ分類式ｆ（ｘ）を用いて、テストデータとして入力された特徴ベクトルを分類するものである。 According to a third aspect of the present invention, there is provided a data classification device including a cubic function k (x, z) represented by the following expression approximating a kernel function represented by an exponential function, and a plurality of feature vectors prepared in advance as training data: The feature vectors input as test data are classified using the SVM classification formula f (x) expressed by the following formula obtained by learning based on the above.

また、ｘ、ｚは特徴ベクトルであり、γはカーネル関数におけるパラメータであり、ａ，ｃ，ｑ，ｈは、前記カーネル関数の近似により決定される係数である。ｘ_ｊは特徴ベクトルｘのｊ次元の特徴量であり、ｘ_ｋは特徴ベクトルｘのｋ次元の特徴量であり、ｘ_ｓは特徴ベクトルｘのｓ次元の特徴量である。ｖ_ｉはサポートベクトルであり、ｖ_ｉｊはサポートベクトルｖ_ｉのｊ次元の特徴量であり、ｖ_ｉｋはサポートベクトルｖ_ｉのｋ次元の特徴量であり、ｖ_ｉｓはサポートベクトルｖ_ｉのｓ次元の特徴量である。ｙ_ｉはサポートベクトルｖ_ｉのラベルであり、ｎはサポートベクトルの個数であり、ｄは前記特徴ベクトルの次元数である。α_ｉ，ｂは前記学習により決定される係数である。 Further, x and z are feature vectors, γ is a parameter in the kernel function, and a, c, q, and h are coefficients determined by approximation of the kernel function. x _j is a j-dimensional feature quantity of the feature vector x, x _k is a k-dimensional feature quantity of the feature vector x, and x _s is an s-dimensional feature quantity of the feature vector x. v _i is the support _{vector, v ij} is the feature quantity of j dimension of support vectors _{v _i,} _{v ik} is the characteristic of k-dimensional support vector _{v _i,} _{v IS} the s-dimensional support vector _{v i} It is a feature amount. y _i is the label of the support vector v _i , n is the number of support vectors, and d is the number of dimensions of the feature vector. α _i and b are coefficients determined by the learning.

第４の発明に係るプログラムは、コンピュータを、指数関数で表わされるカーネル関数を近似した以下の式で表される三次関数ｋ（ｘ、ｚ）と、訓練データとして予め用意された複数の特徴ベクトルとに基づいて学習することにより得られた、以下の式で表されるＳＶＭ分類式ｆ（ｘ）を用いて、テストデータとして入力された特徴ベクトルを分類する手段として機能させるためのプログラムである。 According to a fourth aspect of the present invention, there is provided a program comprising a computer, a cubic function k (x, z) represented by the following formula approximating a kernel function represented by an exponential function, and a plurality of feature vectors prepared in advance as training data: Is a program for functioning as a means for classifying feature vectors input as test data using the SVM classification formula f (x) represented by the following formula obtained by learning based on .

第３の発明及び第４の発明によれば、指数関数で表わされるカーネル関数を近似した上記の式で表される三次関数と、訓練データとして予め用意された複数の特徴ベクトルとに基づいて学習することにより得られた、上記の式で表されるＳＶＭ分類式を用いて、テストデータとして入力された特徴ベクトルを分類する。 According to the third and fourth aspects of the invention, learning is performed based on a cubic function represented by the above equation approximating a kernel function represented by an exponential function, and a plurality of feature vectors prepared in advance as training data. The feature vectors input as test data are classified using the SVM classification formula represented by the above formula obtained by the above.

上記の二次関数を、カーネル関数をティラー展開により近似した二次関数とすることができる。これによって、カーネル関数を二次関数に精度良く近似することができる。 The above quadratic function can be a quadratic function obtained by approximating a kernel function by Tiller expansion. As a result, the kernel function can be approximated to a quadratic function with high accuracy.

上記の三次関数を、カーネル関数をティラー展開により近似した三次関数とすることができる。これによって、カーネル関数を三次関数に精度良く近似することができる。 The above cubic function can be a cubic function obtained by approximating a kernel function by Tiller expansion. As a result, the kernel function can be approximated to the cubic function with high accuracy.

上記の二次関数を、指数関数から生成される複数のサンプリングデータに対して、近似することにより得られた二次関数とすることができる。これによって、カーネル関数を二次関数に精度良く近似することができる。 The above quadratic function can be a quadratic function obtained by approximating a plurality of sampling data generated from an exponential function. As a result, the kernel function can be approximated to a quadratic function with high accuracy.

また、上記の二次関数を、指数関数から生成される複数のサンプリングデータのうち、カーネル関数におけるパラメータγに複数の離散値の各々を適用した場合に取りうるサンプリングデータに対して、各々近似することにより得られた複数の二次関数とすることができる。 Further, the quadratic function is approximated to sampling data that can be obtained when each of a plurality of discrete values is applied to the parameter γ in the kernel function among the plurality of sampling data generated from the exponential function. Thus, a plurality of quadratic functions obtained can be obtained.

上記の三次関数を、指数関数から生成される複数のサンプリングデータに対して、近似することにより得られた三次関数とすることができる。これによって、カーネル関数を三次関数に精度良く近似することができる。 The above cubic function can be a cubic function obtained by approximating a plurality of sampling data generated from an exponential function. As a result, the kernel function can be approximated to the cubic function with high accuracy.

また、上記の三次関数を、指数関数から生成される複数のサンプリングデータのうち、カーネル関数におけるパラメータγに複数の離散値の各々を適用した場合に取りうるサンプリングデータに対して、各々近似することにより得られた複数の三次関数とすることができる。 Further, the above cubic function is approximated to sampling data that can be obtained when each of a plurality of discrete values is applied to a parameter γ in a kernel function among a plurality of sampling data generated from an exponential function. A plurality of cubic functions obtained by the above can be obtained.

上記の複数のサンプリングデータを、訓練データとして用意された複数の特徴ベクトルの内積値から得られる指数関数の指数部分の値の経験分布に応じて生成することができる。 The plurality of sampling data described above can be generated according to the empirical distribution of the value of the exponent part of the exponential function obtained from the inner product value of a plurality of feature vectors prepared as training data.

上記の特徴ベクトルとして、正規化された特徴ベクトルを用いることができる。 As the feature vector, a normalized feature vector can be used.

以上説明したように、本発明のデータ分類装置及びプログラムによれば、パラメータの少ないＳＶＭ分類式を用いてテストデータを分類することにより、少ないメモリ量で、かつ、短い計算時間でデータを分類することができる、という効果が得られる。 As described above, according to the data classification apparatus and program of the present invention, data is classified with a small memory amount and a short calculation time by classifying test data using an SVM classification formula with few parameters. The effect that it can be obtained.

以下、図面を参照して本発明の実施の形態を詳細に説明する。なお、本実施の形態では、撮像装置によって撮像された画像が、顔を表わす顔画像であるか分類する画像分類装置に本発明を適用した場合を例に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In the present embodiment, a case where the present invention is applied to an image classification apparatus that classifies whether an image captured by an imaging apparatus is a face image representing a face will be described as an example.

図１に示すように、第１の実施の形態に係る画像分類装置１０は、所定領域を撮像するように設けられた撮像装置１２と、撮像装置１２により撮像した画像が、顔を表わす顔画像であるか分類して、分類結果を表示装置１６に表示させるコンピュータ１４とを備えている。 As shown in FIG. 1, an image classification device 10 according to the first embodiment includes an imaging device 12 provided to capture a predetermined area, and a face image in which an image captured by the imaging device 12 represents a face. And a computer 14 for causing the display device 16 to display the classification result.

コンピュータ１４は、ＣＰＵ、後述するＳＶＭモデル学習処理ルーチン及び顔画像分類処理ルーチンのプログラムを記憶したＲＯＭ、データ等を記憶するＲＡＭ、及びこれらを接続するバスを含んで構成されている。このコンピュータ１４をハードウエアとソフトウエアとに基づいて定まる機能実現手段毎に分割した機能ブロックで説明すると、図１に示すように、サポートベクターマシン（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ、ＳＶＭ）のカーネル関数を二次関数に近似するカーネル関数近似部２０と、訓練データとして、顔画像の特徴ベクトル及び非顔画像の特徴ベクトルを多数記憶した訓練データ記憶部２２と、カーネル関数を近似した二次関数と訓練データとを用いて、ＳＶＭ分類式を学習するＳＶＭ学習部２４と、学習により得られたＳＶＭ分類式を式展開することにより式を変換するＳＶＭモデル変換部２６と、変換されたＳＶＭ分類式を記憶したＳＶＭモデル記憶部２８とを備えている。 The computer 14 includes a CPU, a ROM that stores programs for an SVM model learning processing routine and a face image classification processing routine, which will be described later, a RAM that stores data, and a bus that connects these. When the computer 14 is described in terms of functional blocks divided for each function realizing means determined based on hardware and software, as shown in FIG. 1, the kernel function of a support vector machine (Support Vector Machine, SVM) is quadratic. A kernel function approximation unit 20 that approximates a function, a training data storage unit 22 that stores a large number of feature vectors of face images and non-face images as training data, a quadratic function and training data that approximate a kernel function, The SVM learning unit 24 for learning the SVM classification formula, the SVM model conversion unit 26 for converting the formula by expanding the SVM classification formula obtained by learning, and the converted SVM classification formula are stored. And an SVM model storage unit 28.

ここで、ＳＶＭモデルを用いた分類手法について説明する。 Here, a classification method using the SVM model will be described.

ＳＶＭモデルを用いた分類手法では、訓練データの集合が与えられると、訓練データをより高次元の特徴空間に非線形写像し、この特徴空間において最大マージンを有する分離超平面を構築する。 In the classification method using the SVM model, when a set of training data is given, the training data is nonlinearly mapped to a higher-dimensional feature space, and a separation hyperplane having a maximum margin is constructed in the feature space.

ＳＶＭモデルでは、学習及び分類において必要な計算は、非線形写像空間での特徴φ（ｘ_１）やφ（ｘ_２）の内積算のみに関わるので、特徴φ（ｘ_１）を陽に計算する代わりに、カーネル関数Ｋ（ｘ_１，ｘ_２）＝φ（ｘ_１）^Ｔφ（ｘ_２）のように暗に計算する。 In the SVM model, the calculation required for learning and classification involves only the internal integration of the features φ (x ₁ ) and φ (x ₂ ) in the nonlinear mapping space, so that instead of calculating the features φ (x ₁ ) explicitly Then, the kernel function K (x ₁ , x ₂ ) = φ (x ₁ ) ^T φ (x ₂ ) is implicitly calculated.

カーネル関数としては多項式カーネル、ＲＢＦ（ＲａｄｉａｌＢａｓｉｓＦｕｎｃｔｉｏｎ）カーネル、シグモイドカーネルなどがよく使用される。ＳＶＭモデルにおいて、カーネル関数を計算するとき考慮しなければならないのは、分離超平面の近傍にあるベクトルのみであり、これらのベクトルをサポートベクトルと呼ぶ。一般のＳＶＭ分類式は以下の（１）式で表される。 As the kernel function, a polynomial kernel, an RBF (Radial Basis Function) kernel, a sigmoid kernel, or the like is often used. In the SVM model, only the vectors in the vicinity of the separation hyperplane need to be considered when calculating the kernel function, and these vectors are called support vectors. A general SVM classification formula is represented by the following formula (1).

ここで、ｖ_ｉはサポートベクトル、α_ｉはラグランジュ係数、ｙ_ｉは各サポートベクトルｖ_ｉのラベル（１，−１）を各々表す。 Here, v _i represents a support vector, α _i represents a Lagrange coefficient, and y _i represents a label (1, −1) of each support vector v _i .

本実施の形態では、カーネル関数としてＲＢＦカーネルを用いる。ＲＢＦカーネルを用いたＳＶＭによる分類性能は、一般に性能が高く安定的である。 In this embodiment, an RBF kernel is used as a kernel function. The classification performance by SVM using the RBF kernel is generally high performance and stable.

カーネル関数近似部２０は、ＲＢＦカーネルを、以下に説明するように二次多項式関数で近似する。以下では、普遍性を損失なく、特徴ベクトルがＬ２ノルムで正規化されている場合を仮定して説明する。 The kernel function approximation unit 20 approximates the RBF kernel with a second-order polynomial function as described below. In the following description, it is assumed that the feature vector is normalized by the L2 norm without loss of universality.

まず、ＲＢＦカーネルを、以下の（２）式のように展開できる。 First, the RBF kernel can be expanded as shown in the following equation (2).

ここで、γ（＞０）は、カーネルの幅と呼ばれるパラメータである。上記（２）式では、第一項の指数関数が常に固定値なので、無視してもカーネルに影響がない。つまり、ＲＢＦカーネルは、上記（２）式の第二項の指数関数で表される。 Here, γ (> 0) is a parameter called kernel width. In the above equation (2), since the exponential function of the first term is always a fixed value, even if ignored, the kernel is not affected. That is, the RBF kernel is represented by the exponential function of the second term of the above equation (2).

また、上記（２）式の第二項の指数関数で表されるＲＢＦカーネルを、以下の（３）式で表される二次多項式関数で近似する。 Further, the RBF kernel represented by the exponential function of the second term of the above equation (2) is approximated by a quadratic polynomial function represented by the following equation (3).

上記（３）式の係数ａ，ｃ，ｑについては、以下のように決定される。 The coefficients a, c, and q in the above equation (3) are determined as follows.

まず、特徴ベクトルｘとｚがＬ２ノルムで正規化されているため、内積値ｘ^Ｔｚは常に［０，１］の範囲である。従って、ＲＢＦカーネルの指数部分２γｘ^Ｔｚの値域が［０，２γ］となる。 First, since the feature vectors x and z are normalized by the L2 norm, the inner product value x ^T z is always in the range [0, 1]. Therefore, the range of the exponent part 2γx ^T z of the RBF kernel is [0, 2γ].

また、指数関数で表されるＲＢＦカーネルはティラー展開で近似することができる。パラメータγが小さい値をとる場合、ＲＢＦカーネルの指数部分２γｘ^Ｔｚの値域は小さい区間となるため、二次ティラー展開でＲＢＦカーネルを近似することができ、上記（３）式の係数について、ａ＝１，ｃ＝１，ｑ＝１／２と決定される。 In addition, the RBF kernel represented by an exponential function can be approximated by a tiller expansion. When the parameter γ takes a small value, since the range of the exponent part 2γx ^T z of the RBF kernel is a small interval, the RBF kernel can be approximated by the second-order Tiller expansion. = 1, c = 1, q = 1/2.

ＳＶＭ学習部２４は、以下に説明するように、γ、α_ｉ、ｂを学習して、ＳＶＭ分類式を決定する。 As will be described below, the SVM learning unit 24 learns γ, α _i , and b to determine the SVM classification formula.

まず、上記（１）式のＳＶＭ分類式ｆ（ｘ）のカーネル関数として、近似された上記（３）式の二次多項式関数を用いると共に、カーネル関数のパラメータγに複数の離散値の各々を適用して、γの各離散値に対して、訓練データの特徴ベクトルに基づいて、以下の（４）式を最大にするような、α_ｉを学習すると共に、学習したα_ｉを適用したときに任意のｉに対して以下の（５）式を満たすｂを選択することにより、ＳＶＭ分類式を各々決定する。 First, as the kernel function of the SVM classification formula f (x) of the above formula (1), an approximated second order polynomial function of the above formula (3) is used, and each of a plurality of discrete values is set as the kernel function parameter γ. applying, for each discrete value of gamma, based on the feature vector of the training data, the following equation (4) that maximizes and with learning the alpha _i, when applying learned alpha _i Each SVM classification formula is determined by selecting b satisfying the following formula (5) for any i.

ここで、Ｎは、訓練データの数であり、ｘ_ｉは、訓練データの特徴ベクトルであり、ｙ_ｉは特徴ベクトルｘ_ｉの教師ラベル（１、−１）である。また、Ｃは定数である。 Here, N is the number of training data, x _i is a feature vector of training data, and y _i is a teacher label (1, −1) of the feature vector x _i . C is a constant.

次に、各γの離散値に対して決定されたα_ｉ、ｂを用いたＳＶＭ分類式の各々について、訓練データの特徴ベクトルを適用した場合の計算結果と、訓練データの教師ラベルとを比較する。 Next, for each of the SVM classification formulas using α _i and b determined for the discrete values of γ, the calculation result when the feature vector of the training data is applied is compared with the teacher label of the training data. To do.

各γの離散値に対して得られた比較結果の各々から、最適なＳＶＭ分類式となるときのγを決定し、このγに対して決定されたα_ｉ、ｂと、決定されたγとを用いたＳＶＭ分類式を、学習結果として採用する。 From each of the comparison results obtained for the discrete values of each γ, γ to be an optimal SVM classification formula is determined, and α _i and b determined for this γ, The SVM classification formula using is used as a learning result.

ＳＶＭモデル変換部２６は、学習により得られたＳＶＭ分類式ｆ_２（ｘ）を、以下の（６）式に示すように数式展開することにより、パラメータの少ない式に変換して、ＳＶＭモデル記憶部２８に記憶しておく。 The SVM model conversion unit 26 converts the SVM classification expression f ₂ (x) obtained by learning into an expression with fewer parameters by expanding the expression as shown in the following expression (6), and stores the SVM model. Stored in the unit 28.

また、ｘ_ｊは特徴ベクトルｘのｊ次元の特徴量であり、ｘ_ｋは特徴ベクトルｘのｋ次元の特徴量である。ｖ_ｉはサポートベクトルであり、ｖ_ｉｊはサポートベクトルｖ_ｉのｊ次元の特徴量であり、ｖ_ｉｋはサポートベクトルｖ_ｉのｋ次元の特徴量である。ｙ_ｉはサポートベクトルｖ_ｉのラベルであり、ｎはサポートベクトルの個数であり、ｄは特徴ベクトルの次元数である。α_ｉ，ｂは学習により決定される係数である。 X _j is a j-dimensional feature quantity of the feature vector x, and x _k is a k-dimensional feature quantity of the feature vector x. v _i is the support _{vector, v ij} is the feature quantity of j dimension of support vectors _{v _i,} _{v ik} is the characteristic of k-dimensional support vector _{v i.} y _i is the label of the support vector v _i , n is the number of support vectors, and d is the number of dimensions of the feature vector. α _i and b are coefficients determined by learning.

上記（６）式のＳＶＭ分類式に使われるパラメータβ_ｊ、ω_ｊｋは、サポートベクトル間の重み付き加算結果から予め計算され、メモリ（図示省略）に予め記憶される。 The parameters β _j and ω _jk used in the above SVM classification formula (6) are calculated in advance from the weighted addition result between the support vectors and stored in advance in a memory (not shown).

また、コンピュータ１４は、撮像装置１２から出力される撮像画像を入力する画像入力部３０と、撮像画像から、テストデータとして、複数種類の特徴量から構成される特徴ベクトルを抽出する特徴ベクトル抽出部３２と、ＳＶＭモデル記憶部２８に記憶されたＳＶＭ分類式を用いて、テストデータに対して、顔画像であるか否かの分類を行って、分類結果を表示装置１６に表示させるＳＶＭ分類部３４とを備えている。 The computer 14 also includes an image input unit 30 that inputs a captured image output from the imaging device 12, and a feature vector extraction unit that extracts a feature vector composed of a plurality of types of feature amounts as test data from the captured image. 32 and the SVM classification unit stored in the SVM model storage unit 28, the test data is classified as to whether it is a face image, and the classification result is displayed on the display device 16. 34.

画像入力部３０は、例えば、Ａ／Ｄコンバータや１画面の画像データを記憶する画像メモリ等で構成される。 The image input unit 30 includes, for example, an A / D converter and an image memory that stores image data for one screen.

特徴ベクトル抽出部３２は、撮像画像から複数種類の特徴量を抽出して、テストデータとして、複数種類の特徴量からなる特徴ベクトルを生成する。なお、上記の訓練データの特徴ベクトルとテストデータの特徴ベクトルとは、同じ種類の特徴量から構成されるベクトルである。 The feature vector extraction unit 32 extracts a plurality of types of feature amounts from the captured image, and generates a feature vector including a plurality of types of feature amounts as test data. Note that the feature vector of the training data and the feature vector of the test data are vectors composed of the same type of feature quantity.

ＳＶＭ分類部３４は、テストデータの特徴ベクトルについて、上記（６）式で表されるＳＶＭ分類式を計算し、計算結果が正の値（０を含む）であれば、撮像画像が顔画像であると分類し、計算結果が負の値であれば、撮像画像が顔画像であると分類する。 The SVM classification unit 34 calculates the SVM classification formula expressed by the above formula (6) for the feature vector of the test data. If the calculation result is a positive value (including 0), the captured image is a face image. If the calculation result is a negative value, the captured image is classified as a face image.

次に、第１の実施の形態に係る画像分類装置１０の作用について説明する。 Next, the operation of the image classification device 10 according to the first embodiment will be described.

まず、訓練データとして、顔画像の特徴ベクトル及び非顔画像の特徴ベクトルが多数用意され、コンピュータ１４の訓練データ記憶部２２に教師ラベルと共に記憶される。また、コンピュータ１４において、図２に示すＳＶＭモデル学習処理ルーチンが実行される。 First, a large number of feature vectors of face images and feature vectors of non-face images are prepared as training data, and are stored in the training data storage unit 22 of the computer 14 together with a teacher label. Further, the SVM model learning processing routine shown in FIG.

まず、ステップ１００において、カーネル関数であるＲＢＦカーネルを二次ティラー展開により二次多項式関数で近似する。そして、ステップ１０２において、訓練データを読み込み、ステップ１０４において、訓練データを上記ステップ１００で近似した二次多項式関数に適用して学習することにより、ＳＶＭ分類式の係数を決定し、ＳＶＭ分類式を得る。 First, in step 100, the RBF kernel that is a kernel function is approximated by a quadratic polynomial function by quadratic tiller expansion. Then, in step 102, the training data is read. In step 104, the training data is applied to the quadratic polynomial function approximated in step 100 to learn, thereby determining the coefficient of the SVM classification formula. obtain.

次のステップ１０６では、上記ステップ１０４で得られたＳＶＭ分類式を数式展開することにより、上記（６）式で表される数式に変換して、ステップ１０８において、ＳＶＭモデル記憶部２８に、変換したＳＶＭ分類式を記憶して、ＳＶＭモデル学習処理ルーチンを終了する。 In the next step 106, the SVM classification formula obtained in the step 104 is converted into a mathematical formula represented by the above formula (6) by expanding the mathematical formula, and in the step 108, the SVM model storage unit 28 converts the formula into the mathematical formula. The SVM classification formula is stored, and the SVM model learning processing routine is terminated.

また、撮像装置１２によって所定領域の画像が撮像されると、コンピュータ１４において、図３に示す顔画像分類処理ルーチンが実行される。 Further, when an image of a predetermined area is picked up by the image pickup device 12, the face image classification processing routine shown in FIG.

まず、ステップ１２０において、撮像装置１２から撮像画像を取得し、ステップ１２２において、上記ステップ１２０で取得された撮像画像から複数種類の特徴量を抽出して、テストデータとしての特徴ベクトルを生成する。 First, in step 120, a captured image is acquired from the imaging device 12, and in step 122, a plurality of types of feature amounts are extracted from the captured image acquired in step 120, thereby generating a feature vector as test data.

そして、ステップ１２４において、ＳＶＭモデル記憶部２８に記憶されたＳＶＭ分類式に、上記ステップ１２２で生成された特徴ベクトルを適用して計算する。次のステップ１２６において、上記ステップ１２４で得られた計算値が正の値であるか否かを判定し、正の値であると、ステップ１２８において、撮像画像が顔画像であるとの分類結果を表示装置１６に表示させて、顔画像分類処理ルーチンを終了する。一方、上記ステップ１２６において、上記ステップ１２４で得られた計算値が負の値であると判定されると、ステップ１３０において、撮像画像が顔画像でないとの分類結果を表示装置１６に表示させて、顔画像分類処理ルーチンを終了する。
上記（６）式においてω_ｋｊ＝ω_ｊｋとなるので、上記顔画像分類処理ルーチンの計算において、ＳＶＭ分類式で使用する｛β_ｊ｝，｛ω_ｊｋ｝に関わる総メモリ量はｄ^２／２に比例する（ｄは特徴量の次元数を表す）。一方、近似しない元のＳＶＭモデルに関わる総メモリ量はｎｄ（サポートベクトルの数ｎ×特徴量の次元数ｄ）であり、本実施の形態の手法によるメモリ量の効率化率は２ｎ／ｄになる。ここで、一般的に、特徴量の次元数ｄはサポートベクトルの数ｎより少ないので、総メモリ量が削減される。例えば、特徴量の次元数ｄが１００であり、サポートベクトルの数ｎが４０００である場合には、上記（６）式を用いて計算することにより、変換する前のＳＶＭモデルを用いて計算に比べて、総メモリ量が約８０倍効率化される。また、計算時間についても、本実施の形態の手法によりメモリ量の効率化と同様に効率化することができる。 In step 124, the feature vector generated in step 122 is applied to the SVM classification formula stored in the SVM model storage unit 28 for calculation. In the next step 126, it is determined whether or not the calculated value obtained in step 124 is a positive value. If the calculated value is a positive value, in step 128, the classification result that the captured image is a face image is determined. Is displayed on the display device 16, and the face image classification processing routine is terminated. On the other hand, if it is determined in step 126 that the calculated value obtained in step 124 is a negative value, a classification result indicating that the captured image is not a face image is displayed on the display device 16 in step 130. Then, the face image classification processing routine ends.
Since the ω _kj = ω _jk in the equation (6), in the calculation of the face image classification processing routine, used in the SVM classifier formula {beta _j}, the total amount of memory related to {omega _jk} is ^d 2/2 (D represents the number of dimensions of the feature quantity). On the other hand, the total memory amount related to the original SVM model that is not approximated is nd (number of support vectors n × dimension number d of feature amount), and the efficiency of memory amount by the method of this embodiment is 2n / d. Become. Here, in general, the number of dimensions d of the feature amount is smaller than the number n of support vectors, so that the total memory amount is reduced. For example, when the feature quantity dimension d is 100 and the support vector number n is 4000, calculation is performed using the SVM model before conversion by calculating using the above equation (6). In comparison, the total memory capacity is about 80 times more efficient. Also, the calculation time can be improved by the method of the present embodiment in the same manner as the memory amount.

以上説明したように、第１の実施の形態に係る画像分類装置によれば、二次関数に近似したＲＢＦカーネルに基づくパラメータの少ないＳＶＭ分類式を用いて、テストデータを分類することにより、少ないメモリ量で、かつ、短い計算時間でデータを分類することができる。 As described above, according to the image classification device according to the first embodiment, the test data is classified by using the SVM classification formula having a small number of parameters based on the RBF kernel approximated to the quadratic function, thereby reducing the number of test data. Data can be classified by the amount of memory and in a short calculation time.

メモリ量や計算時間を削減するために、サポートベクトル数を減らす必要がないため、従来手法のＲＢＦカーネルを用いたＳＶＭモデルによる分類に比べ、分類性能を落とすことなく、高速で処理可能でありかつメモリ量を削減でき、低コスト化を実現できる。 Since it is not necessary to reduce the number of support vectors in order to reduce the amount of memory and calculation time, it can be processed at high speed without degrading the classification performance as compared with the classification based on the SVM model using the RBF kernel of the conventional method, and The amount of memory can be reduced and the cost can be reduced.

また、二次ティラー展開により、ＲＢＦカーネルを、二次関数で精度良く近似することができる。 Further, the RBF kernel can be approximated with a quadratic function with high accuracy by the quadratic tiller expansion.

次に、第２の実施の形態について説明する。なお、第２の実施の形態に係る画像分類装置の構成は、第１の実施の形態と同様の構成となるため、同一符号を付して説明を省略する。 Next, a second embodiment will be described. Note that the configuration of the image classification apparatus according to the second embodiment is the same as that of the first embodiment, and thus the same reference numerals are given and description thereof is omitted.

第２の実施の形態では、ＲＢＦカーネルを３次多項式関数に近似している点が第１の実施の形態と主に異なっている。 The second embodiment is mainly different from the first embodiment in that the RBF kernel is approximated to a cubic polynomial function.

第２の実施の形態に係る画像分類装置では、カーネル関数近似部２０によって、上記（２）式の第二項の指数関数で表されるＲＢＦカーネルを、以下の（７）式で表される三次次多項式関数で近似する。 In the image classification device according to the second embodiment, the kernel function approximating unit 20 represents the RBF kernel represented by the exponential function of the second term of the above equation (2) by the following equation (7). Approximate with cubic polynomial function.

上記（７）式の係数ａ，ｃ，ｑ，ｈについては、以下のように決定される。 The coefficients a, c, q, and h in the equation (7) are determined as follows.

パラメータγが小さい値をとる場合、ＲＢＦカーネルの指数部分２γｘ^Ｔｚの値域は小さい区間となるため、三次ティラー展開でＲＢＦカーネルを近似することができ、上記（７）式の係数について、ａ＝１，ｃ＝１，ｑ＝１／２、ｈ＝１／６と決定される。 When the parameter γ takes a small value, the range of the exponent part 2γx ^T z of the RBF kernel is a small interval, so that the RBF kernel can be approximated by the third-order Tiller expansion, and a = 1, c = 1, q = 1/2, and h = 1/6.

また、ＳＶＭ学習部２４は、上記（１）式のＳＶＭ分類式ｆ（ｘ）のカーネル関数として、近似された上記（７）式の三次多項式関数を用いると共に、カーネル関数のパラメータγに複数の離散値の各々を適用して、γの各離散値に対して、訓練データの特徴ベクトルに基づいて、上記の（４）式を最大化にするようなα_ｉ、を学習すると共に、上記の（５）式を満たすｂを選択して、ＳＶＭ分類式を各々決定する。 The SVM learning unit 24 uses the approximated third-order polynomial function of the equation (7) as the kernel function of the SVM classification equation f (x) of the equation (1), and uses a plurality of parameters γ for the kernel function. Applying each of the discrete values to learn α _i that maximizes the above equation (4) based on the feature vector of the training data for each discrete value of γ, (5) b satisfying the equation is selected to determine each of the SVM classification equations.

また、各γの離散値に対して決定されたα_ｉ、ｂを用いたＳＶＭ分類式の各々について、訓練データの特徴ベクトルを適用した場合の計算結果と、訓練データの教師ラベルとを比較し、各γの離散値に対して得られた比較結果の各々から、最適なＳＶＭ分類式となるときのγを決定し、このγに対して決定されたα_ｉ、ｂと、決定されたγとを用いたＳＶＭ分類式を、学習結果として採用する。 Further, for each SVM classification formula using α _i and b determined for each discrete value of γ, the calculation result when the feature vector of the training data is applied is compared with the teacher label of the training data. , Γ for determining the optimum SVM classification formula is determined from each of the comparison results obtained for the discrete values of γ, α _i and b determined for the γ, and determined γ The SVM classification formula using is used as a learning result.

ＳＶＭモデル変換部２６は、学習により得られたＳＶＭ分類式ｆ_３（ｘ）を、以下の（８）式に示すように数式展開することにより、パラメータの少ない式に変換して、ＳＶＭモデル記憶部２８に記憶しておく。 The SVM model conversion unit 26 converts the SVM classification expression f ₃ (x) obtained by learning into an expression with fewer parameters by expanding the expression as shown in the following expression (8), and stores the SVM model. Stored in the unit 28.

また、ｘ_ｊは特徴ベクトルｘのｊ次元の特徴量であり、ｘ_ｋは特徴ベクトルｘのｋ次元の特徴量であり、ｘ_ｓは特徴ベクトルｘのｓ次元の特徴量である。ｖ_ｉはサポートベクトルであり、ｖ_ｉｊはサポートベクトルｖ_ｉのｊ次元の特徴量であり、ｖ_ｉｋはサポートベクトルｖ_ｉのｋ次元の特徴量であり、ｖ_ｉｓはサポートベクトルｖ_ｉのｓ次元の特徴量である。ｙ_ｉはサポートベクトルｖ_ｉのラベルであり、ｎはサポートベクトルの個数であり、ｄは特徴ベクトルの次元数である。α_ｉ，ｂは前記学習により決定される係数である。 Further, x _j is a j-dimensional feature quantity of the feature vector x, x _k is a k-dimensional feature quantity of the feature vector x, and x _s is an s-dimensional feature quantity of the feature vector x. v _i is the support _{vector, v ij} is the feature quantity of j dimension of support vectors _{v _i,} _{v ik} is the characteristic of k-dimensional support vector _{v _i,} _{v IS} the s-dimensional support vector _{v i} It is a feature amount. y _i is the label of the support vector v _i , n is the number of support vectors, and d is the number of dimensions of the feature vector. α _i and b are coefficients determined by the learning.

上記（８）式のＳＶＭ分類式に使われるパラメータβ_ｊ、ω_ｊｋ、θ_ｊｋｓは、サポートベクトル間の重み付き加算結果から予め計算され、メモリ（図示省略）に予め記憶される。 The parameters β _j , ω _jk , θ _jks used in the SVM classification formula of the above formula (8) are calculated in advance from the weighted addition result between the support vectors and stored in advance in a memory (not shown).

なお、第２の実施の形態に係る画像分類装置の他の構成及び作用については、第１の実施の形態と同様であるため、説明を省略する。 Note that other configurations and operations of the image classification device according to the second embodiment are the same as those of the first embodiment, and thus the description thereof is omitted.

ここで、上記（８）式を用いた計算において、使用されるパラメータ｛β_ｊ｝，｛ω_ｊｋ｝，｛θ_ｊｋｓ｝に関わる総メモリ量はｄ（ｄ^２＋６ｄ）／２に比例するため、近似しない元のＳＶＭモデルと比較して、本実施の形態の手法によるメモリ量の効率化率は２ｎ／（ｄ^２＋６ｄ）になる。従って、特徴量の次元数が少ない場合（例えば、１００未満である場合）には、総メモリ量の効率化を実現することができる。また、計算時間についても、メモリ量の効率化と同様に効率化することができる。 Here, in the calculation using the above equation (8), the total memory amount related to the parameters {β _j }, {ω _jk }, {θ _jks } used is proportional to d (d ² + 6d) / 2. Compared with the original SVM model that is not approximated, the memory efficiency is 2n / (d ² + 6d) according to the method of the present embodiment. Therefore, when the number of dimensions of the feature quantity is small (for example, less than 100), the efficiency of the total memory quantity can be realized. Also, the calculation time can be improved in the same manner as the memory amount.

このように、三次関数に近似したＲＢＦカーネルに基づくパラメータの少ないＳＶＭ分類式を用いて、テストデータを分類することにより、少ないメモリ量で、かつ、短い計算時間でデータを分類することができる。また、三次ティラー展開により、ＲＢＦカーネルを、三次関数で精度良く近似することができる。 In this way, by classifying test data using the SVM classification formula having a small number of parameters based on the RBF kernel approximated to a cubic function, the data can be classified with a small memory amount and a short calculation time. In addition, by the cubic Tiller expansion, the RBF kernel can be approximated with a cubic function with high accuracy.

次に、第３の実施の形態について説明する。なお、第３の実施の形態に係る画像分類装置の構成は、第１の実施の形態と同様の構成となるため、同一符号を付して説明を省略する。 Next, a third embodiment will be described. Note that the configuration of the image classification apparatus according to the third embodiment is the same as that of the first embodiment, and thus the same reference numerals are given and description thereof is omitted.

第３の実施の形態では、ＲＢＦカーネルの指数関数から生成されるサンプリングデータに対して、フィッティングすることにより、ＲＢＦカーネルを近似した二次多項式関数を求めている点が、第１の実施の形態と主に異なっている。 In the third embodiment, a second-order polynomial function that approximates the RBF kernel is obtained by fitting the sampling data generated from the exponential function of the RBF kernel. And mainly different.

第３の実施の形態に係る画像分類装置では、カーネル関数近似部２０によって、ＲＢＦカーネルを、以下に説明するように、上記（３）式で表される二次多項式関数で近似する。ここでは、普遍性を損失なく、特徴ベクトルがＬ２ノルムで正規化されている場合を仮定して説明する。 In the image classification device according to the third embodiment, the kernel function approximating unit 20 approximates the RBF kernel with a quadratic polynomial function expressed by the above equation (3) as described below. Here, description will be made assuming that the feature vector is normalized by the L2 norm without loss of universality.

まず、ＲＢＦカーネルの指数部分２γｘ^Ｔｚの値域において、上記（２）式の第二項の指数関数から、図４（Ａ）に示すように、等間隔でサンプリングデータを生成し、均一的に分布したサンプリングデータの集合を生成する。 First, in the range of the exponent part 2γx ^T z of the RBF kernel, sampling data is generated at equal intervals from the exponential function of the second term of the above equation (2) as shown in FIG. Generate a set of distributed sampling data.

また、ＲＢＦカーネルのパラメータγにどの値が適用されるか未知なので、それぞれの離散値を適用した場合に取りうる範囲で、二次多項式関数に各々近似する。例えば、γに適用される離散値を０．５おきで、［０．５，１，１．５…］とすると、取りうる２γｘ^Ｔｚの値域は｛［０，１］，［０，２］，［０，３］…｝となる。各値域（区間）に含まれるサンプリングデータの集合に対して、図４（Ｂ）に示すように、最小二乗法によってフィッティングを行って、残差の二乗和を最小とするような二次多項式関数に近似するように上記（６）式の係数ａ，ｃ，ｑを決定して、パラメータγの各離散値に対する二次多項式関数を得る。なお、上記図４（Ｂ）では、γ＝１とした場合に取りうる区間でのフィッティングの様子を示している。 Further, since it is unknown which value is applied to the parameter γ of the RBF kernel, each value is approximated to a quadratic polynomial function within a range that can be taken when each discrete value is applied. For example, if the discrete values applied to γ are every 0.5 and [0.5, 1, 1.5...], The possible range of 2γx ^T z is {[0, 1], [0, 2 ], [0, 3] ...}. As shown in FIG. 4B, a quadratic polynomial function that performs fitting by a least square method on a set of sampling data included in each range (section) to minimize the sum of squares of the residuals. The coefficients a, c, and q of the above equation (6) are determined so as to approximate to ## EQU3 ## to obtain a second order polynomial function for each discrete value of the parameter γ. Note that FIG. 4B shows a fitting state in a section that can be taken when γ = 1.

ＳＶＭ学習部２４は、以下に説明するように、γ、αｉ、ｂを学習して、ＳＶＭ分類式を決定する。 As described below, the SVM learning unit 24 learns γ, αi, and b, and determines the SVM classification formula.

まず、上記（１）式のＳＶＭ分類式ｆ（ｘ）のカーネル関数として、γの離散値毎に近似された二次多項式関数を用いて、γの各離散値に対して、上記の（４）式を最大にするようなα_ｉを学習すると共に、上記の（５）式を満たすｂを選択して、ＳＶＭ分類式を各々決定する。 First, using the quadratic polynomial function approximated for each discrete value of γ as the kernel function of the SVM classification formula f (x) of the above equation (1), for each discrete value of γ, the above (4 ) Learning α _i that maximizes the expression, and selecting b satisfying the above expression (5) to determine the SVM classification expressions.

各γの離散値に対して得られた比較結果の各々から、最適なＳＶＭ分類式となるときのγを決定し、このγに対して決定されたα_ｉ、ｂと、このγに対して近似された二次多項式関数と、決定されたγとを用いたＳＶＭ分類式を、学習結果として採用する。 From each of the comparison results obtained for the discrete values of each γ, γ when an optimum SVM classification formula is obtained is determined, and α _i and b determined for this γ, and for this γ An SVM classification formula using the approximated second-order polynomial function and the determined γ is adopted as a learning result.

ＳＶＭモデル変換部２６は、上記第１の実施の形態と同様に、学習により得られたＳＶＭ分類式を数式展開して、上記（６）式に示すような数式に変換する。 Similar to the first embodiment, the SVM model conversion unit 26 develops mathematical expressions of the SVM classification expressions obtained by learning and converts them into mathematical expressions as shown in the above equation (6).

次に、第３の実施の形態に係るＳＶＭモデル学習処理ルーチンについて、図５を用いて説明する。なお、第１の実施の形態と同様の処理については、同一符号を付して詳細な説明を省略する。 Next, the SVM model learning process routine according to the third embodiment will be described with reference to FIG. In addition, about the process similar to 1st Embodiment, the same code | symbol is attached | subjected and detailed description is abbreviate | omitted.

まず、ステップ３００において、ＲＢＦカーネルの指数関数を用いて、等間隔にサンプリングデータを生成し、ステップ３０２において、パラメータγの各離散値に対して、対応する範囲のサンプリングデータに基づいて、ＲＢＦカーネルの指数関数を二次多項式関数で近似する。 First, in step 300, sampling data is generated at regular intervals using the exponential function of the RBF kernel. In step 302, the RBF kernel is calculated based on the sampling data in the corresponding range for each discrete value of the parameter γ. Is approximated by a quadratic polynomial function.

そして、ステップ１０２で、訓練データを読み込み、ステップ３０４において、訓練データを、上記ステップ３０２でγの離散値毎に近似した二次多項式関数に適用して学習することにより、ＲＢＦカーネルのパラメータγ及びＳＶＭ分類式の係数を決定し、ＳＶＭ分類式を得る。 Then, in step 102, the training data is read, and in step 304, the training data is applied to the quadratic polynomial function approximated for each discrete value of γ in step 302, thereby learning the parameters γ and RBF kernel. The coefficient of the SVM classification formula is determined, and the SVM classification formula is obtained.

次のステップ１０６では、上記ステップ３０４で得られたＳＶＭ分類式を数式展開することにより、上記（６）式で表される数式に変換して、ステップ１０８において、ＳＶＭモデル記憶部２８に、変換したＳＶＭ分類式を記憶させて、ＳＶＭモデル学習処理ルーチンを終了する。 In the next step 106, the SVM classification formula obtained in the above step 304 is converted into a mathematical formula represented by the above formula (6) by expanding the mathematical formula, and in the step 108, it is converted into the SVM model storage unit 28. The SVM classification formula is stored, and the SVM model learning processing routine is terminated.

また、撮像装置１２によって所定領域の画像が撮像されると、コンピュータ１４において、第１の実施の形態と同様に、顔画像分類処理ルーチンが実行される。 Further, when an image of a predetermined area is picked up by the image pickup device 12, a face image classification processing routine is executed in the computer 14 as in the first embodiment.

このように、二次関数に近似したＲＢＦカーネルに基づくパラメータの少ないＳＶＭ分類式を用いて、テストデータを分類することにより、少ないメモリ量で、かつ、短い計算時間でデータを分類することができる。 In this way, by classifying test data using the SVM classification formula having a small number of parameters based on the RBF kernel approximated to a quadratic function, the data can be classified with a small amount of memory and a short calculation time. .

また、サンプリングデータに対するフィッティングにより、ＲＢＦカーネルを、二次関数で精度良く近似することができる。 In addition, by fitting the sampling data, the RBF kernel can be approximated with a quadratic function with high accuracy.

次に、第４の実施の形態について説明する。なお、第４の実施の形態に係る画像分類装置の構成は、第１の実施の形態と同様の構成となるため、同一符号を付して説明を省略する。 Next, a fourth embodiment will be described. Note that the configuration of the image classification apparatus according to the fourth embodiment is the same as that of the first embodiment, and thus the same reference numerals are given and description thereof is omitted.

第４の実施の形態では、ＲＢＦカーネルを３次多項式関数に近似している点が第３の実施の形態と主に異なっている。 The fourth embodiment is mainly different from the third embodiment in that the RBF kernel is approximated to a cubic polynomial function.

第４の実施の形態に係る画像分類装置では、カーネル関数近似部２０によって、ＲＢＦカーネルを、以下に説明するように、上記（７）式で表される三次多項式関数で近似する。 In the image classification device according to the fourth embodiment, the kernel function approximating unit 20 approximates the RBF kernel with the cubic polynomial function expressed by the above equation (7) as described below.

まず、ＲＢＦカーネルの指数部分２γｘ^Ｔｚの値域において、上記（２）式の第二項の指数関数から、等間隔でサンプリングデータを生成し、均一的に分布したサンプリングデータの集合を生成する。 First, in the range of the exponent part 2γx ^T z of the RBF kernel, sampling data is generated at equal intervals from the exponent function of the second term of the above equation (2), and a set of uniformly distributed sampling data is generated.

また、ＲＢＦカーネルのパラメータγにそれぞれの離散値を適用した場合に取りうる範囲で、三次多項式関数に各々近似する。γの各離散値を適用した場合に取りうる２γｘ^Ｔｚの値域（区間）に含まれるサンプリングデータの集合に対して、最小二乗法によってフィッティングを行って、残差の二乗和を最小とするような三次多項式関数に近似するように上記（７）式の係数ａ，ｃ，ｑ，ｈを各々決定して、パラメータγの各離散値に対する三次多項式関数を各々得る。 In addition, each is approximated to a cubic polynomial function in a range that can be taken when each discrete value is applied to the parameter γ of the RBF kernel. By fitting the set of sampling data included in the range (section) of 2γx ^T z that can be obtained when each discrete value of γ is applied, fitting is performed by the method of least squares so as to minimize the sum of squares of the residuals. The coefficients a, c, q, and h in the above equation (7) are determined so as to approximate a cubic polynomial function to obtain a cubic polynomial function for each discrete value of the parameter γ.

ＳＶＭ学習部２４は、第３の実施の形態と同様に、γ、α_ｉ、ｂを学習して、ＳＶＭ分類式を決定する。 As in the third embodiment, the SVM learning unit 24 learns γ, α _i , and b and determines the SVM classification formula.

なお、第４の実施の形態に係る画像分類装置の他の構成及び作用については、第３の実施の形態と同様であるため、説明を省略する。 Note that the other configuration and operation of the image classification device according to the fourth embodiment are the same as those of the third embodiment, and thus the description thereof is omitted.

このように、三次関数に近似したＲＢＦカーネルに基づくパラメータの少ないＳＶＭ分類式を用いて、テストデータを分類することにより、少ないメモリ量で、かつ、短い計算時間でデータを分類することができる。 In this way, by classifying test data using the SVM classification formula having a small number of parameters based on the RBF kernel approximated to a cubic function, the data can be classified with a small memory amount and a short calculation time.

また、サンプリングデータに対するフィッティングにより、ＲＢＦカーネルを、三次関数で精度良く近似することができる。 In addition, by fitting the sampling data, the RBF kernel can be approximated with a cubic function with high accuracy.

次に、第５の実施の形態について説明する。なお、第５の実施の形態に係る画像分類装置の構成は、第１の実施の形態と同様の構成となるため、同一符号を付して説明を省略する。 Next, a fifth embodiment will be described. Note that the configuration of the image classification apparatus according to the fifth embodiment is the same as that of the first embodiment, and thus the same reference numerals are given and description thereof is omitted.

第５の実施の形態では、訓練データから算出される指数部分のヒストグラムに応じて、サンプリングデータを作成し、ＲＢＦカーネルの近似を行っている点が、第３の実施の形態と異なっている。 The fifth embodiment is different from the third embodiment in that sampling data is created in accordance with the histogram of the exponent part calculated from the training data and the RBF kernel is approximated.

第５の実施の形態に係る画像分類装置では、カーネル関数近似部２０によって、ＲＢＦカーネルを、以下に説明するように、上記（３）式で表される二次多項式関数で近似する。 In the image classification device according to the fifth embodiment, the kernel function approximating unit 20 approximates the RBF kernel with a quadratic polynomial function expressed by the above equation (3) as described below.

まず、図６に示すように、訓練データから、特徴ベクトル間の内積値ｘ^Ｔｚの経験分布（ヒストグラム）を算出し、算出されたヒストグラムから、指数部分２γｘ^Ｔｚの値の経験分布（ヒストグラム）を求める。そして、ＲＢＦカーネルの指数部分２γｘ^Ｔｚの値域において、上記（２）式の第二項の指数関数から、図７（Ａ）に示すように、指数部分２γｘ^Ｔｚの値のヒストグラムに応じて、サンプリングデータを生成し、サンプリングデータの集合を生成する。 First, as shown in FIG. 6, an empirical distribution (histogram) of inner product values x ^T z between feature vectors is calculated from the training data, and an empirical distribution (histogram) of the value of the exponent part 2γx ^T z is calculated from the calculated histogram. ) Then, in the range of the exponent part 2γx ^T z of the RBF kernel, from the exponent function of the second term of the above equation (2), as shown in FIG. 7A, according to the histogram of the value of the exponent part 2γx ^T z , Generate sampling data, and generate a set of sampling data.

また、ＲＢＦカーネルのパラメータγにそれぞれの離散値を適用した場合に取りうる範囲で、二次多項式関数に各々近似する。γの各離散値を適用した場合に取りうる２γｘ^Ｔｚの値域（区間）に含まれるサンプリングデータの集合に対して、図７（Ｂ）に示すように、最小二乗法によってフィッティングを行って、残差の二乗和を最小とするような二次多項式関数に近似するように上記（６）式の係数ａ，ｃ，ｑを決定して、パラメータγの各離散値に対する二次多項式関数を得る。なお、上記図７（Ｂ）では、γ＝１とした場合に取りうる指数部分の値の区間でのフィッティングの様子を示している。 In addition, each is approximated to a quadratic polynomial function within a range that can be taken when each discrete value is applied to the parameter γ of the RBF kernel. As shown in FIG. 7B, fitting is performed by a least square method to a set of sampling data included in the range (section) of 2γx ^T z that can be obtained when each discrete value of γ is applied, The coefficients a, c, q in the above equation (6) are determined so as to approximate a quadratic polynomial function that minimizes the sum of squares of the residuals, and a quadratic polynomial function for each discrete value of the parameter γ is obtained. . Note that FIG. 7B shows a state of fitting in the section of the exponent value that can be taken when γ = 1.

次に、第５の実施の形態に係るＳＶＭモデル学習処理ルーチンについて図８を用いて説明する。なお、第１の実施の形態及び第３の実施の形態と同様の処理については、同一符号を付して詳細な説明を省略する。 Next, an SVM model learning process routine according to the fifth embodiment will be described with reference to FIG. In addition, about the process similar to 1st Embodiment and 3rd Embodiment, the same code | symbol is attached | subjected and detailed description is abbreviate | omitted.

まず、ステップ１０２において、訓練データを読み込み、ステップ５００において、上記ステップ１０２で読み込んだ訓練データの特徴ベクトル間の内積値を各々演算して、内積値のヒストグラムを算出し、算出した内積値のヒストグラムから指数部分のヒストグラムを生成する。 First, in step 102, the training data is read. In step 500, the inner product values between the feature vectors of the training data read in step 102 are calculated to calculate the inner product value histogram, and the calculated inner product value histogram is calculated. To generate a histogram of the exponent part.

そして、ステップ５０２において、ＲＢＦカーネルの指数関数を用いて、上記ステップ５００で生成された指数部分のヒストグラムに応じて、複数のサンプリングデータを生成し、ステップ３０２において、パラメータγの各離散値に対して、対応する範囲のサンプリングデータに基づいて、指数関数で表わされるＲＢＦカーネルを二次多項式関数で各々近似する。 In step 502, a plurality of sampling data is generated according to the histogram of the exponent part generated in step 500 above using the exponential function of the RBF kernel. In step 302, for each discrete value of the parameter γ, Then, based on the sampling data in the corresponding range, the RBF kernel represented by the exponential function is approximated by a second-order polynomial function.

そして、ステップ３０４において、訓練データを、上記ステップ３０２でγの離散値毎にＲＢＦカーネルを近似した二次多項式関数に適用して学習することにより、ＲＢＦカーネルのパラメータγ及びＳＶＭ分類式の係数を決定し、ＳＶＭ分類式を得る。 In step 304, the training data is learned by applying it to the quadratic polynomial function approximating the RBF kernel for each discrete value of γ in step 302, so that the parameter γ of the RBF kernel and the coefficient of the SVM classification equation are obtained. To determine the SVM classification formula.

次のステップ１０６では、上記ステップ３０４で得られたＳＶＭ分類式を数式展開することにより、上記（６）式で表される数式に変換して、ステップ１０８において、ＳＶＭモデル記憶部２８に、変換したＳＶＭ分類式を記憶して、ＳＶＭモデル学習処理ルーチンを終了する。 In the next step 106, the SVM classification formula obtained in the above step 304 is converted into a mathematical formula represented by the above formula (6) by expanding the mathematical formula, and in the step 108, it is converted into the SVM model storage unit 28. The SVM classification formula is stored, and the SVM model learning processing routine is terminated.

また、経験分布に応じたサンプリングデータに対するフィッティングにより、ＲＢＦカーネルを、二次関数で精度良く近似することができる。 In addition, the RBF kernel can be approximated with a quadratic function with high accuracy by fitting the sampling data according to the experience distribution.

次に、第６の実施の形態について説明する。なお、第６の実施の形態に係る画像分類装置の構成は、第１の実施の形態と同様の構成となるため、同一符号を付して説明を省略する。 Next, a sixth embodiment will be described. Note that the configuration of the image classification apparatus according to the sixth embodiment is the same as that of the first embodiment, and thus the same reference numerals are given and description thereof is omitted.

第６の実施の形態では、ＲＢＦカーネルを３次多項式関数に近似している点が第５の実施の形態と主に異なっている。 The sixth embodiment is mainly different from the fifth embodiment in that the RBF kernel is approximated to a cubic polynomial function.

第６の実施の形態に係る画像分類装置では、カーネル関数近似部２０によって、ＲＢＦカーネルを、以下に説明するように、上記（７）式で表される三次多項式関数で近似する。 In the image classification device according to the sixth embodiment, the kernel function approximating unit 20 approximates the RBF kernel with the cubic polynomial function expressed by the above equation (7) as described below.

まず、訓練データから内積値ｘ^Ｔｚの経験分布（ヒストグラム）を算出し、算出されたヒストグラムから、指数部分２γｘ^Ｔｚの値の経験分布（ヒストグラム）を求める。そして、ＲＢＦカーネルの指数部分２γｘ^Ｔｚの値域において、上記（２）式の第二項の指数関数から、指数部分２γｘ^Ｔｚの値の経験分布に応じて、複数のサンプリングデータを生成し、サンプリングデータの集合を生成する。 First, the empirical distribution (histogram) of the inner product value x ^T z is calculated from the training data, and the empirical distribution (histogram) of the value of the exponent part 2γx ^T z is obtained from the calculated histogram. Then, in the range of the exponent part 2γx ^T z of the RBF kernel, a plurality of sampling data is generated according to the empirical distribution of the value of the exponent part 2γx ^T z from the exponent function of the second term of the above equation (2). Generate a collection of sampling data.

また、ＲＢＦカーネルのパラメータγにそれぞれの離散値を適用した場合に取りうる範囲で、三次多項式関数に各々近似する。γの各離散値を適用した場合に取りうる２γｘ^Ｔｚの値域（区間）に含まれるサンプリングデータの集合に対して、最小二乗法によってフィッティングを行って、残差の二乗和を最小とするような三次多項式関数に近似するように上記（７）式の係数ａ，ｃ，ｑ，ｈを決定して、パラメータγの各離散値に対する三次多項式関数を得る。 In addition, each is approximated to a cubic polynomial function in a range that can be taken when each discrete value is applied to the parameter γ of the RBF kernel. By fitting the set of sampling data included in the range (section) of 2γx ^T z that can be obtained when each discrete value of γ is applied, fitting is performed by the method of least squares so as to minimize the sum of squares of the residuals. The coefficients a, c, q, and h in the above equation (7) are determined so as to approximate a cubic polynomial function, and a cubic polynomial function for each discrete value of the parameter γ is obtained.

なお、第６の実施の形態に係る画像分類装置の他の構成及び作用については、第５の実施の形態と同様であるため、説明を省略する。 Note that other configurations and operations of the image classification device according to the sixth embodiment are the same as those of the fifth embodiment, and thus description thereof is omitted.

また、経験分布に応じたサンプリングデータに対するフィッティングにより、ＲＢＦカーネルを、三次関数で精度良く近似することができる。 In addition, the RBF kernel can be approximated with a cubic function with high accuracy by fitting the sampling data according to the experience distribution.

次に、特徴量の次元数とサポートベクトルの数とを変化させた場合の、上記の実施の形態に係る近似したＲＢＦカーネルを用いたＳＶＭによる分類と、従来の厳密なＲＢＦカーネルを用いたＳＶＭによる分類とにおけるメモリ量及び計算時間を比較した結果について説明する。 Next, when the dimension number of the feature quantity and the number of support vectors are changed, classification by SVM using the approximate RBF kernel according to the above embodiment, and SVM using the conventional strict RBF kernel The result of comparing the amount of memory and the calculation time in the classification according to will be described.

図９に示すように、厳密なＲＢＦカーネルを用いたＳＶＭによる分類におけるメモリ量及び計算時間に比べて、上記の実施の形態で説明した、二次多項式関数又は三次多項式関数で近似したＲＢＦカーネルを用いたＳＶＭによる分類では、計算時間とメモリ量とが低減されることがわかる。 As shown in FIG. 9, the RBF kernel approximated by the second-order polynomial function or the third-order polynomial function described in the above embodiment is compared with the memory amount and the calculation time in the classification by the SVM using the strict RBF kernel. It can be seen that the calculation time and the amount of memory are reduced by the classification by the used SVM.

また、歩行者識別実験の結果について説明する。撮像画像から抽出された３３６次元の画像特徴量からなる特徴ベクトルをテストデータとし、ＳＶＭ分類器を利用して画像が歩行者を表わしているかどうかを分類する。上記の実施の形態で説明したように、ＲＢＦカーネルを二次多項式関数で近似したＳＶＭによる分類を行った。また、比較例として、元の厳密なＲＢＦカーネル（サポートベクトルの数＝６６８５）を用いたＳＶＭによる分類を行った。 Moreover, the result of a pedestrian identification experiment is demonstrated. A feature vector made up of 336-dimensional image features extracted from the captured image is used as test data, and an SVM classifier is used to classify whether the image represents a pedestrian. As described in the above embodiment, the classification by SVM in which the RBF kernel is approximated by a quadratic polynomial function is performed. As a comparative example, classification by SVM using the original strict RBF kernel (number of support vectors = 6685) was performed.

図１０に示すように、ＲＢＦカーネルを二次多項式関数で近似した場合には、元の厳密なＲＢＦカーネルを用いた場合に比べて、計算時間を３０倍以上削減でき、メモリ量を約４０倍削減することができた。また、ＲＢＦカーネルを二次多項式関数で近似した場合には、厳密なＲＢＦカーネルを用いた場合に近い分類精度が得られた。 As shown in FIG. 10, when the RBF kernel is approximated by a quadratic polynomial function, the calculation time can be reduced by 30 times or more and the amount of memory is about 40 times compared to the case where the original exact RBF kernel is used. We were able to reduce it. Further, when the RBF kernel was approximated by a quadratic polynomial function, a classification accuracy close to that obtained when the strict RBF kernel was used was obtained.

上記の実施の形態では、撮像画像が顔画像であるか否かを分類する場合を例に説明したが、これに限定されるものではなく、他のテストデータを他の分類に分けるデータ分類装置に本発明を適用してもよい。 In the above embodiment, the case where the captured image is classified as a face image has been described as an example. However, the present invention is not limited to this, and the data classification apparatus divides other test data into other classifications. The present invention may be applied to.

また、本発明のプログラムを記憶媒体に格納して提供するようにしてもよい。 Further, the program of the present invention may be provided by being stored in a storage medium.

本発明の第１の実施の形態に係る画像分類装置を示すブロック図である。It is a block diagram which shows the image classification device which concerns on the 1st Embodiment of this invention. 本発明の第１の実施の形態に係る画像識別装置におけるＳＶＭモデル学習処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the SVM model learning process routine in the image identification device which concerns on the 1st Embodiment of this invention. 本発明の第１の実施の形態に係る画像識別装置における顔画像分類処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the face image classification process routine in the image identification device which concerns on the 1st Embodiment of this invention. （Ａ）均一的に生成したサンプリングデータの集合を示すグラフ、及び（Ｂ）サンプリングデータに対して近似した二次関数を示すグラフである。(A) A graph showing a set of uniformly generated sampling data, and (B) a graph showing a quadratic function approximated to sampling data. 本発明の第３の実施の形態に係る画像識別装置におけるＳＶＭモデル学習処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the SVM model learning process routine in the image identification device which concerns on the 3rd Embodiment of this invention. 訓練データの特徴ベクトル間の内積値の経験分布を示すグラフである。It is a graph which shows the experience distribution of the inner product value between the feature vectors of training data. （Ａ）経験分布に応じて生成したサンプリングデータの集合を示すグラフ、及び（Ｂ）サンプリングデータに対して近似した二次関数を示すグラフである。(A) A graph showing a set of sampling data generated according to the experience distribution, and (B) a graph showing a quadratic function approximated to the sampling data. 本発明の第５の実施の形態に係る画像識別装置におけるＳＶＭモデル学習処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the SVM model learning process routine in the image identification device which concerns on the 5th Embodiment of this invention. 特徴ベクトルの特徴量の次元数と、計算時間及びメモリ量との関係を示すグラフである。It is a graph which shows the relationship between the dimension number of the feature-value of a feature vector, calculation time, and memory amount. 厳密なＲＢＦカーネルを用いた場合、及び各近似手法で近似したＲＢＦカーネルを用いた場合における計算時間及びメモリ量を示す図である。It is a figure which shows the calculation time and memory amount at the time of using a strict RBF kernel and the case of using the RBF kernel approximated with each approximation method.

Explanation of symbols

１０画像分類装置
１４コンピュータ
２０カーネル関数近似部
２２訓練データ記憶部
２４ＳＶＭ学習部
２６ＳＶＭモデル変換部
２８ＳＶＭモデル記憶部
３２特徴ベクトル抽出部
３４ＳＶＭ分類部 DESCRIPTION OF SYMBOLS 10 Image classification apparatus 14 Computer 20 Kernel function approximation part 22 Training data storage part 24 SVM learning part 26 SVM model conversion part 28 SVM model storage part 32 Feature vector extraction part 34 SVM classification part

Claims

Obtained by learning based on a quadratic function k (x, z) represented by the following equation that approximates a kernel function represented by an exponential function and a plurality of feature vectors prepared in advance as training data A data classification device for classifying feature vectors input as test data using an SVM classification formula f (x) represented by the following formula.

Further, x and z are feature vectors, γ is a parameter in a kernel function, and a, c, and q are coefficients determined by approximation of the kernel function. x _j is a j-dimensional feature quantity of the feature vector x, and x _k is a k-dimensional feature quantity of the feature vector x. v _i is the support _{vector, v ij} is the feature quantity of j dimension of support vectors _{v _i,} _{v ik} is the characteristic of k-dimensional support vector _{v i.} y _i is the label of the support vector v _i , n is the number of support vectors, and d is the number of dimensions of the feature vector. α _i and b are coefficients determined by the learning.

Obtained by learning based on a cubic function k (x, z) represented by the following expression that approximates a kernel function represented by an exponential function, and a plurality of feature vectors prepared in advance as training data, A data classification device that classifies feature vectors input as test data using an SVM classification formula f (x) represented by the following formula.

Further, x and z are feature vectors, γ is a parameter in the kernel function, and a, c, q, and h are coefficients determined by approximation of the kernel function. x _j is a j-dimensional feature quantity of the feature vector x, x _k is a k-dimensional feature quantity of the feature vector x, and x _s is an s-dimensional feature quantity of the feature vector x. v _i is the support _{vector, v ij} is the feature quantity of j dimension of support vectors _{v _i,} _{v ik} is the characteristic of k-dimensional support vector _{v _i,} _{v IS} the s-dimensional support vector _{v i} It is a feature amount. y _i is the label of the support vector v _i , n is the number of support vectors, and d is the number of dimensions of the feature vector. α _i and b are coefficients determined by the learning.

The data classification apparatus according to claim 1, wherein the quadratic function is a quadratic function obtained by approximating the kernel function by Tiller expansion.

3. The data classification device according to claim 2, wherein the cubic function is a cubic function obtained by approximating the kernel function by Tiller expansion.

The data classification apparatus according to claim 1, wherein the quadratic function is a quadratic function obtained by approximating a plurality of sampling data generated from the exponential function.

The quadratic function is approximated to the sampling data that can be taken when each of a plurality of discrete values is applied to the parameter γ in the kernel function among the plurality of sampling data generated from the exponential function. The data classification device according to claim 5, wherein a plurality of quadratic functions obtained as described above are used.

The data classification device according to claim 2, wherein the cubic function is a cubic function obtained by approximating a plurality of sampling data generated from the exponential function.

Approximating the cubic function with respect to the sampling data that can be obtained when each of a plurality of discrete values is applied to the parameter γ in the kernel function among the plurality of sampling data generated from the exponential function. The data classification apparatus according to claim 7, wherein a plurality of cubic functions obtained by the step are used.

9. The method according to claim 5, wherein the plurality of sampling data is generated according to an empirical distribution of values of exponent parts of the exponential function obtained from inner product values of the plurality of feature vectors prepared as the training data. The data classification device according to claim 1.

The data classification device according to claim 1, wherein a normalized feature vector is used as the feature vector.

Computer
Obtained by learning based on a quadratic function k (x, z) represented by the following equation that approximates a kernel function represented by an exponential function and a plurality of feature vectors prepared in advance as training data A program for functioning as a means for classifying feature vectors input as test data using the SVM classification formula f (x) represented by the following formula.

Further, x and z are feature vectors, γ is a parameter in a kernel function, and a, c, and q are coefficients determined by approximation of the kernel function. x _k is a k-dimensional feature quantity of the feature vector x. v _i is the support _{vector, v ij} is the feature quantity of j dimension of support vectors _{v _i,} _{v ik} is the characteristic of k-dimensional support vector _{v i.} y _i is the label of the support vector v _i , n is the number of support vectors, and d is the number of dimensions of the feature vector. α _i and b are coefficients determined by the learning.

Computer
Obtained by learning based on a cubic function k (x, z) represented by the following expression that approximates a kernel function represented by an exponential function, and a plurality of feature vectors prepared in advance as training data, A program for functioning as a means for classifying feature vectors input as test data using the SVM classification formula f (x) represented by the following formula.

Further, x and z are feature vectors, γ is a parameter in the kernel function, and a, c, q, and h are coefficients determined by approximation of the kernel function. x _k is a k-dimensional feature quantity of the feature vector x, and x _s is an s-dimensional feature quantity of the feature vector x. v _i is the support _{vector, v ij} is the feature quantity of j dimension of support vectors _{v _i,} _{v ik} is the characteristic of k-dimensional support vector _{v _i,} _{v IS} the s-dimensional support vector _{v i} It is a feature amount. y _i is the label of the support vector v _i , n is the number of support vectors, and d is the number of dimensions of the feature vector. α _i and b are coefficients determined by the learning.