JP2009093335A

JP2009093335A - Identification method and program

Info

Publication number: JP2009093335A
Application number: JP2007262127A
Authority: JP
Inventors: Yasuo Kasai; 庸雄河西; Hirokazu Kasahara; 広和笠原
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 2007-10-05
Filing date: 2007-10-05
Publication date: 2009-04-30
Anticipated expiration: 2027-10-05
Also published as: JP4992646B2

Abstract

PROBLEM TO BE SOLVED: To provide an identification method and program for carrying out identification processing that matches a preference of a user. SOLUTION: This identification method identifies whether or not an identification object belongs to a certain class based on the comparison result of the value of an evaluation function to evaluate the identification object with a threshold. The identification method includes: an extraction step for extracting a sample belonging to the certain class and a sample not belonging to the certain class; a display step for displaying a mark between the sample belonging to the certain class and the sample not belonging to the certain class, and for displaying the mark between a different pair of the samples by moving a position of the mark in response to an instruction of a user; a setting change step for selecting the evaluation function corresponding to the position of the mark determined by the user from among a plurality of preliminarily prepared function evaluations; and an identification step for identifying whether or not the identification object belongs to the certain class based on the comparison result of the value of the evaluation function when the identification object is evaluated by using the selected evaluation function with the threshold. COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、識別方法及びプログラムに関する。 The present invention relates to an identification method and a program.

デジタルスチルカメラには撮影モードを設定するモード設定ダイヤルを持つものがある。ユーザがダイヤルで撮影モードを設定すると、デジタルスチルカメラは撮影モードに応じた撮影条件（露光時間等）を決定し、撮影を行う。撮影が行われると、デジタルスチルカメラは、画像ファイルを生成する。この画像ファイルには、撮影した画像の画像データに、撮影時の撮影条件等の付加データが付加されている。 Some digital still cameras have a mode setting dial for setting a shooting mode. When the user sets the shooting mode with the dial, the digital still camera determines shooting conditions (such as exposure time) according to the shooting mode and performs shooting. When shooting is performed, the digital still camera generates an image file. In this image file, additional data such as shooting conditions at the time of shooting is added to the image data of the shot image.

付加データを用いて画像データの示す画像のカテゴリ（クラス）を識別することも可能である。但し、この場合、識別可能なカテゴリは、付加データに記録されるデータの種類に限定されてしまう。そこで、画像データを解析し、画像データの示す画像のカテゴリを識別することも行われている（特許文献１、２参照）。
特開平１０−３０２０６７号公報特表２００６−５１１０００号公報 It is also possible to identify the category (class) of the image indicated by the image data using the additional data. However, in this case, the identifiable category is limited to the type of data recorded in the additional data. Therefore, image data is analyzed to identify the category of the image indicated by the image data (see Patent Documents 1 and 2).
Japanese Patent Laid-Open No. 10-302067 JP 2006-511000 Gazette

識別処理の結果が、ユーザの好みと合わないことがある。このような場合、ユーザの好みに合わせて識別処理の設定を変更することが望ましい。
本発明は、ユーザの好みに合わせた識別処理を行うことを目的とする。 The result of the identification process may not match the user's preference. In such a case, it is desirable to change the setting of the identification process according to the user's preference.
An object of this invention is to perform the identification process according to a user's liking.

上記目的を達成するための主たる発明は、識別対象を評価する評価関数の値と閾値との比較結果に基づいて、あるクラスに前記識別対象が属するか否かを識別する識別方法であって、前記あるクラスに属するサンプルと、前記あるクラスに属しないサンプルとを抽出する抽出ステップと、抽出された複数の前記サンプルを表示部に並べて表示すると共に、前記あるクラスに属するサンプルと、前記あるクラスに属しないサンプルとの間にマークを表示し、ユーザの指示に応じて前記マークの位置を移動することによって、別の前記サンプルと前記サンプルとの間に前記マークを表示する表示ステップと、予め用意された複数の前記評価関数の中から、前記ユーザの決定した前記マークの位置に応じた前記評価関数を選択する設定変更ステップと、選択された前記評価関数を用いて識別対象を評価したときの前記評価関数の値と、前記閾値との比較結果に基づいて、前記あるクラスに識別対象が属するか否かを識別する識別ステップと、を備えることを特徴とする。
本発明の他の特徴については、本明細書及び添付図面の記載により明らかにする。 A main invention for achieving the above object is an identification method for identifying whether or not the identification object belongs to a certain class based on a comparison result between a value of an evaluation function for evaluating the identification object and a threshold value. An extraction step for extracting a sample belonging to the certain class and a sample not belonging to the certain class, a plurality of the extracted samples arranged side by side on a display unit, a sample belonging to the certain class, and the certain class A display step of displaying the mark between another sample and the sample by displaying a mark between the samples not belonging to the sample and moving the position of the mark according to a user instruction; Setting change step of selecting the evaluation function corresponding to the position of the mark determined by the user from among the plurality of prepared evaluation functions An identification step for identifying whether or not the identification object belongs to the certain class based on a comparison result between the value of the evaluation function when the identification object is evaluated using the selected evaluation function and the threshold value And.
Other features of the present invention will become apparent from the description of the present specification and the accompanying drawings.

本明細書及び添付図面の記載により、少なくとも、以下の事項が明らかとなる。 At least the following matters will become clear from the description of the present specification and the accompanying drawings.

識別対象を評価する評価関数の値と閾値との比較結果に基づいて、あるクラスに前記識別対象が属するか否かを識別する識別方法であって、
前記あるクラスに属するサンプルと、前記あるクラスに属しないサンプルとを抽出する抽出ステップと、
抽出された複数の前記サンプルを表示部に並べて表示すると共に、前記あるクラスに属するサンプルと、前記あるクラスに属しないサンプルとの間にマークを表示し、ユーザの指示に応じて前記マークの位置を移動することによって、別の前記サンプルと前記サンプルとの間に前記マークを表示する表示ステップと、
予め用意された複数の前記評価関数の中から、前記ユーザの決定した前記マークの位置に応じた前記評価関数を選択する設定変更ステップと、
選択された前記評価関数を用いて識別対象を評価したときの前記評価関数の値と、前記閾値との比較結果に基づいて、前記あるクラスに識別対象が属するか否かを識別する識別ステップと、
を備えることを特徴とする識別方法が明らかになる。
このような識別方法によれば、ユーザの好みに合わせた識別処理を行うことができる。 An identification method for identifying whether or not the identification object belongs to a certain class based on a comparison result between a value of an evaluation function for evaluating the identification object and a threshold value,
An extraction step of extracting samples belonging to the certain class and samples not belonging to the certain class;
A plurality of the extracted samples are displayed side by side on a display unit, and a mark is displayed between a sample belonging to the certain class and a sample not belonging to the certain class, and the position of the mark is determined according to a user instruction A display step of displaying the mark between another sample by moving
A setting change step of selecting the evaluation function according to the position of the mark determined by the user from the plurality of evaluation functions prepared in advance;
An identification step for identifying whether or not the identification object belongs to the certain class based on a comparison result between the value of the evaluation function when the identification object is evaluated using the selected evaluation function and the threshold value; ,
An identification method characterized by comprising:
According to such an identification method, it is possible to perform identification processing according to the user's preference.

前記抽出ステップでは、前記あるクラスに属するサンプルと、前記あるクラスとは別のクラスに属するサンプルとが抽出され、前記設定変更ステップでは、前記あるクラスに識別対象が属するか否かを識別するための評価関数が選択されると共に、前記別のクラスに識別対象が属するか否かを識別するための評価関数が選択されることが望ましい。これにより、ユーザの好みに合わせた識別処理を行うことができる。 In the extraction step, a sample belonging to the certain class and a sample belonging to a class different from the certain class are extracted, and in the setting change step, whether or not an identification target belongs to the certain class is identified. It is preferable that an evaluation function for identifying whether or not an identification target belongs to the another class is selected. Thereby, the identification process according to a user's liking can be performed.

前記あるクラスに属するサンプルと、前記別のクラスに属するサンプルとを分離する超平面の法線に前記サンプルを投影し、前記法線上に投影された前記サンプルの位置に基づいて、抽出すべきサンプルを決定することが望ましい。若しくは、前記識別処理は、空間を分離する超平面に基づいて前記識別対象が前記あるクラスに属するか否かを識別するものであり、前記抽出ステップにおいて、前記超平面の法線に前記サンプルを投影し、前記法線上に投影された前記サンプルの位置に基づいて、抽出すべきサンプルを決定することが望ましい。これにより、確信度の高い順に、サンプルを抽出することができる。 A sample to be extracted based on the position of the sample projected onto the normal by projecting the sample onto a hyperplane normal that separates the sample belonging to the class and the sample belonging to the other class It is desirable to determine. Alternatively, the identification process is for identifying whether or not the identification object belongs to the certain class based on a hyperplane separating a space, and in the extraction step, the sample is set to a normal of the hyperplane. It is desirable to project and determine a sample to be extracted based on the position of the sample projected on the normal. Thereby, samples can be extracted in descending order of certainty.

識別対象を評価する評価関数の値と閾値との比較結果に基づいて、あるクラスに前記識別対象が属するか否かを識別する識別装置に、
前記あるクラスに属するサンプルと、前記あるクラスに属しないサンプルとを抽出させ、
抽出された複数の前記サンプルを表示部に並べて表示させると共に、前記あるクラスに属するサンプルと、前記あるクラスに属しないサンプルとの間にマークを表示させ、ユーザの指示に応じて前記マークの位置を移動させことによって、別の前記サンプルと前記サンプルとの間に前記マークを表示させ、
予め用意された複数の前記評価関数の中から、前記ユーザの決定した前記マークの位置に応じた前記評価関数を選択させ、
選択された前記評価関数を用いて識別対象を評価したときの前記評価関数の値と、前記閾値との比較結果に基づいて、前記あるクラスに識別対象が属するか否かを識別させる
ことを特徴とするプログラムが明らかになる。
このようなプログラムによれば、ユーザの好みに合わせた識別処理を識別装置に実現させることができる。 Based on the comparison result between the value of the evaluation function for evaluating the identification object and the threshold value, the identification device for identifying whether the identification object belongs to a certain class,
Samples belonging to the certain class and samples not belonging to the certain class are extracted,
A plurality of the extracted samples are displayed side by side on a display unit, and a mark is displayed between a sample belonging to the certain class and a sample not belonging to the certain class, and the position of the mark is determined according to a user instruction. To display the mark between another sample and the sample,
The evaluation function corresponding to the position of the mark determined by the user is selected from the plurality of evaluation functions prepared in advance,
Whether or not the identification object belongs to the certain class is identified based on a comparison result between the value of the evaluation function when the identification object is evaluated using the selected evaluation function and the threshold value. The program to become clear.
According to such a program, it is possible to cause the identification device to perform identification processing according to the user's preference.

＝＝＝全体説明＝＝＝
まず、識別処理の基本的な構成及び処理について説明する。この説明の後、本実施形態について詳しく説明する。 === Overall Description ===
First, the basic configuration and processing of identification processing will be described. After this description, this embodiment will be described in detail.

＜全体構成＞
図１は、画像処理システムの説明図である。この画像処理システムは、デジタルスチルカメラ２と、プリンタ４とを備える。 <Overall configuration>
FIG. 1 is an explanatory diagram of an image processing system. This image processing system includes a digital still camera 2 and a printer 4.

デジタルスチルカメラ２は、被写体をデジタルデバイス（ＣＣＤなど）に結像させることによりデジタル画像を取得するカメラである。デジタルスチルカメラ２には、モード設定ダイヤル２Ａが設けられている。ユーザは、ダイヤル２Ａによって、撮影条件に応じた撮影モードを設定することができる。例えば、ダイヤル２Ａによって「夜景」モードが設定されると、デジタルスチルカメラ２は、シャッター速度を遅くしたり、ＩＳＯ感度を高くしたりして、夜景撮影に適した撮影条件にて撮影を行う。 The digital still camera 2 is a camera that acquires a digital image by forming an image of a subject on a digital device (CCD or the like). The digital still camera 2 is provided with a mode setting dial 2A. The user can set the shooting mode according to the shooting conditions by using the dial 2A. For example, when the “night scene” mode is set by the dial 2A, the digital still camera 2 performs shooting under shooting conditions suitable for night scene shooting by decreasing the shutter speed or increasing the ISO sensitivity.

デジタルスチルカメラ２は、ファイルフォーマット規格に準拠して、撮影により生成した画像ファイルをメモリカード６に保存する。画像ファイルには、撮影した画像のデジタルデータ（画像データ）だけでなく、撮影時の撮影条件（撮影データ）等の付加データも保存される。 The digital still camera 2 stores an image file generated by photographing in the memory card 6 in accordance with the file format standard. In the image file, not only digital data (image data) of a captured image but also additional data such as a shooting condition (shooting data) at the time of shooting is stored.

プリンタ４は、画像データの示す画像を紙に印刷する印刷装置である。プリンタ４には、メモリカード６を挿入するスロット２１が設けられている。ユーザは、デジタルスチルカメラ２で撮影した後、デジタルスチルカメラ２からメモリカード６を取り出し、スロット２１にメモリカード６を挿入することができる。 The printer 4 is a printing device that prints an image indicated by image data on paper. The printer 4 is provided with a slot 21 into which the memory card 6 is inserted. The user can take a picture with the digital still camera 2, remove the memory card 6 from the digital still camera 2, and insert the memory card 6 into the slot 21.

パネル部１５は、表示部１６と、各種のボタンを有する入力部１７とを備える。このパネル部１５はユーザインターフェースとして機能する。表示部１６は、液晶ディスプレイにより構成される。表示部１６がタッチパネルであれば、表示部１６は入力部１７としても機能する。表示部１６には、プリンタ４の設定を行うための設定画面や、メモリカードから読み取った画像データの画像や、ユーザへの確認や警告のための画面などが表示される。なお、表示部１６が表示する各種の画面については、後述する。 The panel unit 15 includes a display unit 16 and an input unit 17 having various buttons. The panel unit 15 functions as a user interface. The display unit 16 is configured by a liquid crystal display. If the display unit 16 is a touch panel, the display unit 16 also functions as the input unit 17. The display unit 16 displays a setting screen for setting the printer 4, an image of image data read from the memory card, a screen for confirmation and warning to the user, and the like. Various screens displayed by the display unit 16 will be described later.

図２は、プリンタ４の構成の説明図である。プリンタ４は、印刷機構１０と、この印刷機構１０を制御するプリンタ側コントローラ２０とを備える。印刷機構１０は、インクを吐出するヘッド１１と、ヘッド１１を制御するヘッド制御部１２と、紙を搬送するため等のモータ１３と、センサ１４とを有する。プリンタ側コントローラ２０は、メモリカード６からデータを送受信するためのメモリ用スロット２１と、ＣＰＵ２２と、メモリ２３と、モータ１３を制御する制御ユニット２４と、駆動信号（駆動波形）を生成する駆動信号生成部２５とを有する。また、プリンタ側コントローラ２０は、パネル部１５を制御するパネル制御部２６も備えている。 FIG. 2 is an explanatory diagram of the configuration of the printer 4. The printer 4 includes a printing mechanism 10 and a printer-side controller 20 that controls the printing mechanism 10. The printing mechanism 10 includes a head 11 that ejects ink, a head control unit 12 that controls the head 11, a motor 13 for conveying paper, and a sensor 14. The printer-side controller 20 includes a memory slot 21 for transmitting and receiving data from the memory card 6, a CPU 22, a memory 23, a control unit 24 for controlling the motor 13, and a drive signal for generating a drive signal (drive waveform). And a generation unit 25. The printer-side controller 20 also includes a panel control unit 26 that controls the panel unit 15.

メモリカード６がスロット２１に挿入されると、プリンタ側コントローラ２０は、メモリカード６に保存されている画像ファイルを読み出してメモリ２３に記憶する。そして、プリンタ側コントローラ２０は、画像ファイルの画像データを、印刷機構１０で印刷するための印刷データに変換し、印刷データに基づいて印刷機構１０を制御し、紙に画像を印刷する。この一連の動作は、「ダイレクトプリント」と呼ばれている。 When the memory card 6 is inserted into the slot 21, the printer-side controller 20 reads out the image file stored in the memory card 6 and stores it in the memory 23. Then, the printer-side controller 20 converts the image data of the image file into print data for printing by the printing mechanism 10, controls the printing mechanism 10 based on the printing data, and prints an image on paper. This series of operations is called “direct printing”.

なお、「ダイレクトプリント」は、メモリカード６をスロット２１に挿入することによって行われるだけでなく、デジタルスチルカメラ２とプリンタ４とをケーブル（不図示）で接続することによっても可能である。 “Direct printing” is not only performed by inserting the memory card 6 into the slot 21, but also by connecting the digital still camera 2 and the printer 4 with a cable (not shown).

メモリカード６に記憶される画像ファイルは、画像データと、付加データとから構成されている。画像データは、複数の画素データから構成されている。画素データは、画素の色情報（階調値）を示すデータである。画素がマトリクス状に配置されることによって、画像が構成される。このため、画像データは、画像を示すデータである。付加データには、画像データの特性を示すデータや、撮影データや、サムネイル画像データ等が含まれる。 The image file stored in the memory card 6 is composed of image data and additional data. The image data is composed of a plurality of pixel data. The pixel data is data indicating pixel color information (gradation value). An image is formed by arranging the pixels in a matrix. Therefore, the image data is data indicating an image. The additional data includes data indicating the characteristics of image data, shooting data, thumbnail image data, and the like.

＜自動補正機能の概要＞
「人物」の写真を印刷するときには、肌色をきれいにしたいという要求がある。また、「風景」の写真を印刷するときには、空の青色を強調し、木や草の緑色を強調したいという要求がある。そこで、プリンタ４は、画像ファイルを分析して自動的に適した補正処理を行う自動補正機能を備えている。 <Outline of automatic correction function>
When printing a “person” photo, there is a demand to clean the skin tone. In addition, when printing a “landscape” photograph, there is a demand for emphasizing the blue of the sky and the green of trees and grass. Therefore, the printer 4 has an automatic correction function that analyzes the image file and automatically performs a suitable correction process.

図３は、プリンタ４の自動補正機能の説明図である。図中のプリンタ側コントローラ２０の各要素は、ソフトウェアとハードウェアによって実現される。 FIG. 3 is an explanatory diagram of the automatic correction function of the printer 4. Each element of the printer-side controller 20 in the figure is realized by software and hardware.

記憶部３１は、メモリ２３の一部の領域及びＣＰＵ２２によって実現される。メモリカード６から読み出された画像ファイルの全部又は一部は、記憶部３１の画像記憶部３１Ａに展開される。また、プリンタ側コントローラ２０の各要素の演算結果は、記憶部３１の結果記憶部３１Ｂに格納される。 The storage unit 31 is realized by a partial area of the memory 23 and the CPU 22. All or part of the image file read from the memory card 6 is developed in the image storage unit 31 A of the storage unit 31. In addition, the calculation result of each element of the printer-side controller 20 is stored in the result storage unit 31B of the storage unit 31.

顔識別部３２は、ＣＰＵ２２と、メモリ２３に記憶された顔識別プログラムとによって実現される。顔識別部３２は、画像記憶部３１Ａに記憶された画像データを分析し、顔の有無を識別する。顔識別部３２によって顔が有ると識別された場合、識別対象となる画像が「人物」のシーンに属すると識別される。この場合、シーン識別部３３によるシーン識別処理は行われない。顔識別部３２による顔識別処理は、既に広く行われている処理と同様なので、詳細な説明は省略する。 The face identification unit 32 is realized by the CPU 22 and a face identification program stored in the memory 23. The face identification unit 32 analyzes the image data stored in the image storage unit 31A and identifies the presence or absence of a face. When the face identifying unit 32 identifies that there is a face, the image to be identified is identified as belonging to the “person” scene. In this case, the scene identification process by the scene identification unit 33 is not performed. Since the face identification process by the face identification unit 32 is the same as a process that has already been widely performed, detailed description thereof is omitted.

シーン識別部３３は、ＣＰＵ２２と、メモリ２３に記憶されたシーン識別プログラムとによって実現される。シーン識別部３３は、画像記憶部３１Ａに記憶された画像ファイルを分析し、画像データの示す画像のシーンを識別する。顔識別部３２によって顔がないと識別された場合に、シーン識別部３３によるシーン識別処理が行われる。後述するように、シーン識別部３３は、識別対象となる画像が「風景」、「夕景」、「夜景」、「花」、「紅葉」、「その他」のいずれの画像であるかを識別する。 The scene identification unit 33 is realized by the CPU 22 and a scene identification program stored in the memory 23. The scene identification unit 33 analyzes the image file stored in the image storage unit 31A and identifies the scene of the image indicated by the image data. When the face identifying unit 32 identifies that there is no face, a scene identifying process by the scene identifying unit 33 is performed. As will be described later, the scene identifying unit 33 identifies whether the image to be identified is a “landscape”, “evening scene”, “night scene”, “flower”, “autumn leaves”, or “other” image. .

図４は、画像のシーンと補正内容との関係の説明図である。
画像補正部３４は、ＣＰＵ２２と、メモリ２３に記憶された画像補正プログラムとによって実現される。画像補正部３４は、記憶部３１の結果記憶部３１Ｂ（後述）に記憶されている識別結果（顔識別部３２やシーン識別部３３の識別結果）に基づいて、画像記憶部３１Ａの画像データを補正する。例えば、シーン識別部３３の識別結果が「風景」である場合には、青色を強調し、緑色を強調するような補正が行われる。なお、画像補正部３４は、シーンの識別結果だけでなく、画像ファイルの撮影データの内容も反映して、画像データを補正しても良い。例えば、露出補正がマイナスの場合、暗い雰囲気の画像を明るくしないように画像データを補正しても良い。 FIG. 4 is an explanatory diagram of a relationship between an image scene and correction contents.
The image correction unit 34 is realized by the CPU 22 and an image correction program stored in the memory 23. The image correction unit 34 converts the image data of the image storage unit 31A based on the identification results (identification results of the face identification unit 32 and the scene identification unit 33) stored in the result storage unit 31B (described later) of the storage unit 31. to correct. For example, when the identification result of the scene identification unit 33 is “landscape”, correction is performed so that blue is emphasized and green is emphasized. The image correction unit 34 may correct the image data by reflecting not only the scene identification result but also the contents of the image data of the image file. For example, when the exposure correction is negative, the image data may be corrected so as not to brighten the dark atmosphere image.

プリンタ制御部３５は、ＣＰＵ２２、駆動信号生成部２５、制御ユニット２４及びメモリ２３に記憶されたプリンタ制御プログラムによって、実現される。プリンタ制御部３５は、補正後の画像データを印刷データに変換し、印刷機構１０に画像を印刷させる。 The printer control unit 35 is realized by a printer control program stored in the CPU 22, the drive signal generation unit 25, the control unit 24, and the memory 23. The printer control unit 35 converts the corrected image data into print data, and causes the printing mechanism 10 to print the image.

＜シーン識別処理＞
図５は、シーン識別部３３によるシーン識別処理のフロー図である。図６は、シーン識別部３３の機能の説明図である。図中のシーン識別部３３の各要素は、ソフトウェアとハードウェアによって実現される。シーン識別部３３は、図６に示す特徴量取得部４０と、全体識別器５０と、部分識別器６０と、統合識別器７０とを備えている。 <Scene identification processing>
FIG. 5 is a flowchart of the scene identification process performed by the scene identification unit 33. FIG. 6 is an explanatory diagram of the function of the scene identification unit 33. Each element of the scene identification unit 33 in the figure is realized by software and hardware. The scene identification unit 33 includes a feature amount acquisition unit 40, an overall classifier 50, a partial classifier 60, and an integrated classifier 70 shown in FIG.

最初に、特徴量取得部４０が、記憶部３１の画像記憶部３１Ａに展開された画像データを分析し、部分特徴量を取得する（Ｓ１０１）。具体的には、特徴量取得部４０は、画像データを８×８の６４ブロックに分割し、各ブロックの色平均と分散を算出し、この色平均と分散を部分特徴量として取得する。なお、ここでは各画素はＹＣＣ色空間における階調値のデータをもっており、各ブロックごとに、Ｙの平均値、Ｃｂの平均値及びＣｒの平均値がそれぞれ算出され、Ｙの分散、Ｃｂの分散及びＣｒの分散がそれぞれ算出される。つまり、各ブロックごとに３つの色平均と３つの分散が部分特徴量として算出される。これらの色平均や分散は、各ブロックにおける部分画像の特徴を示すものである。なお、ＲＧＢ色空間における平均値や分散を算出しても良い。
ブロックごとに色平均と分散が算出されるので、特徴量取得部４０は、画像記憶部３１Ａには画像データの全てを展開せずに、ブロック分の画像データをブロック順に展開する。このため、画像記憶部３１Ａは、必ずしも画像ファイルの全てを展開できるだけの容量を備えていなくても良い。 First, the feature amount acquisition unit 40 analyzes the image data developed in the image storage unit 31A of the storage unit 31 and acquires partial feature amounts (S101). Specifically, the feature amount acquisition unit 40 divides the image data into 8 × 8 64 blocks, calculates the color average and variance of each block, and acquires the color average and variance as partial feature amounts. Here, each pixel has gradation value data in the YCC color space, and the average value of Y, the average value of Cb, and the average value of Cr are calculated for each block, and the variance of Y and the variance of Cb are calculated. And the variance of Cr are calculated respectively. That is, three color averages and three variances are calculated as partial feature amounts for each block. These color averages and variances indicate the characteristics of the partial images in each block. Note that an average value or variance in the RGB color space may be calculated.
Since the color average and variance are calculated for each block, the feature amount acquisition unit 40 expands the image data for the blocks in the block order without expanding all the image data in the image storage unit 31A. For this reason, the image storage unit 31A does not necessarily have a capacity sufficient to expand all of the image files.

次に、特徴量取得部４０が、全体特徴量を取得する（Ｓ１０２）。具体的には、特徴量取得部４０は、画像データの全体の色平均、分散、重心及び撮影情報を、全体特徴量として取得する。なお、これらの色平均や分散は、画像の全体の特徴を示すものである。画像データ全体の色平均、分散及び重心は、先に算出した部分特徴量を用いて算出される。このため、全体特徴量を算出する際に、画像データを再度展開する必要がないので、全体特徴量の算出速度が速くなる。全体識別処理（後述）は部分識別処理（後述）よりも先に行われるにも関わらず、全体特徴量が部分特徴量よりも後に求められるのは、このように算出速度を速めるためである。なお、撮影情報は、画像ファイルの撮影データから抽出される。具体的には、絞り値、シャッター速度、フラッシュ発光の有無などの情報が全体特徴量として用いられる。但し、画像ファイルの撮影データの全てが全体特徴量として用いられるわけではない。 Next, the feature amount acquisition unit 40 acquires the entire feature amount (S102). Specifically, the feature quantity acquisition unit 40 acquires the overall color average, variance, center of gravity, and shooting information of the image data as the overall feature quantity. Note that these color averages and variances indicate the overall characteristics of the image. The color average, variance, and center of gravity of the entire image data are calculated using the partial feature values calculated previously. For this reason, it is not necessary to re-expand the image data when calculating the entire feature amount, and the calculation speed of the entire feature amount is increased. Although the overall identification process (described later) is performed prior to the partial identification process (described later), the overall feature value is obtained after the partial feature value in order to increase the calculation speed. The shooting information is extracted from the shooting data of the image file. Specifically, information such as the aperture value, shutter speed, and the presence or absence of flash emission is used as the overall feature amount. However, not all shooting data of the image file is used as the entire feature amount.

次に、全体識別器５０が、全体識別処理を行う（Ｓ１０３）。全体識別処理とは、全体特徴量に基づいて、画像データの示す画像のシーンを識別（推定）する処理である。全体識別処理の詳細については、後述する。 Next, the overall classifier 50 performs overall identification processing (S103). The overall identification process is a process for identifying (estimating) an image scene indicated by image data based on the overall feature amount. Details of the overall identification process will be described later.

全体識別処理によってシーンの識別ができる場合（Ｓ１０４でＹＥＳ）、シーン識別部３３は、記憶部３１の結果記憶部３１Ｂに識別結果を記憶することによってシーンを決定し（Ｓ１０９）、シーン識別処理を終了する。つまり、全体識別処理によってシーンの識別ができた場合（Ｓ１０４でＹＥＳ）、部分識別処理や統合識別処理が省略される。これにより、シーン識別処理の速度が速くなる。 If the scene can be identified by the overall identification process (YES in S104), the scene identification unit 33 determines the scene by storing the identification result in the result storage unit 31B of the storage unit 31 (S109), and performs the scene identification process. finish. That is, when the scene can be identified by the overall identification process (YES in S104), the partial identification process and the integrated identification process are omitted. This increases the speed of the scene identification process.

全体識別処理によってシーンの識別ができない場合（Ｓ１０４でＮＯ）、次に部分識別器６０が、部分識別処理を行う（Ｓ１０５）。部分識別処理とは、部分特徴量に基づいて、画像データの示す画像全体のシーンを識別する処理である。部分識別処理の詳細については、後述する。 If the scene cannot be identified by the overall identification process (NO in S104), the partial classifier 60 performs the partial identification process (S105). The partial identification process is a process for identifying the scene of the entire image indicated by the image data based on the partial feature amount. Details of the partial identification processing will be described later.

部分識別処理によってシーンの識別ができる場合（Ｓ１０６でＹＥＳ）、シーン識別部３３は、記憶部３１の結果記憶部３１Ｂに識別結果を記憶することによってシーンを決定し（Ｓ１０９）、シーン識別処理を終了する。つまり、部分識別処理によってシーンの識別ができた場合（Ｓ１０６でＹＥＳ）、統合識別処理が省略される。これにより、シーン識別処理の速度が速くなる。 When the scene can be identified by the partial identification process (YES in S106), the scene identification unit 33 determines the scene by storing the identification result in the result storage unit 31B of the storage unit 31 (S109), and performs the scene identification process. finish. That is, when the scene can be identified by the partial identification process (YES in S106), the integrated identification process is omitted. This increases the speed of the scene identification process.

部分識別処理によってシーンの識別ができない場合（Ｓ１０６でＮＯ）、次に統合識別器７０が、統合識別処理を行う（Ｓ１０７）。統合識別処理の詳細については、後述する。 If the scene cannot be identified by the partial identification process (NO in S106), the integrated discriminator 70 performs the integrated identification process (S107). Details of the integrated identification process will be described later.

統合識別処理によってシーンの識別ができる場合（Ｓ１０８でＹＥＳ）、シーン識別部３３は、記憶部３１の結果記憶部３１Ｂに識別結果を記憶することによってシーンを決定し（Ｓ１０９）、シーン識別処理を終了する。一方、統合識別処理によってシーンの識別ができない場合（Ｓ１０８でＮＯ）、画像データの示す画像が「その他」のシーン（「風景」、「夕景」、「夜景」、「花」又は「紅葉」以外のシーン）である旨の識別結果を結果記憶部３１Ｂに記憶する（Ｓ１１０）。 When the scene can be identified by the integrated identification process (YES in S108), the scene identification unit 33 determines the scene by storing the identification result in the result storage unit 31B of the storage unit 31 (S109), and performs the scene identification process. finish. On the other hand, if the scene cannot be identified by the integrated identification process (NO in S108), the image indicated by the image data is “other” (other than “landscape”, “evening scene”, “night scene”, “flower” or “autumn leaves”. Is stored in the result storage unit 31B (S110).

＜全体識別処理＞
図７は、全体識別処理のフロー図である。ここでは図６も参照しながら全体識別処理について説明する。 <Overall identification process>
FIG. 7 is a flowchart of the overall identification process. Here, the overall identification process will be described with reference to FIG.

まず、全体識別器５０は、複数のサブ識別器５１の中から１つのサブ識別器５１を選択する（Ｓ２０１）。全体識別器５０には、識別対象となる画像（識別対象画像）が特定のシーンに属するか否かを識別するサブ識別器５１が５つ設けられている。５つのサブ識別器５１は、それぞれ風景、夕景、夜景、花、紅葉のシーンを識別する。ここでは、全体識別器５０は、風景→夕景→夜景→花→紅葉の順に、サブ識別器５１を選択する（なお、サブ識別器５１の選択順序については、後述する）。このため、最初には、識別対象画像が風景のシーンに属するか否かを識別するサブ識別器５１（風景識別器５１Ｌ）が選択される。 First, the overall classifier 50 selects one sub-classifier 51 from the plurality of sub-classifiers 51 (S201). The overall classifier 50 is provided with five sub-classifiers 51 for identifying whether an image to be identified (identification target image) belongs to a specific scene. The five sub classifiers 51 identify scenes of scenery, evening scene, night scene, flowers, and autumn leaves, respectively. Here, the overall classifier 50 selects the sub classifier 51 in the order of landscape → evening scene → night scene → flower → autumn leaves (the selection order of the sub classifier 51 will be described later). For this reason, first, the sub classifier 51 (landscape classifier 51L) for identifying whether or not the classification target image belongs to a landscape scene is selected.

次に、全体識別器５０は、識別対象テーブルを参照し、選択したサブ識別器５１を用いてシーンを識別すべきか否かを判断する（Ｓ２０２）。
図８は、識別対象テーブルの説明図である。この識別対象テーブルは、記憶部３１の結果記憶部３１Ｂに記憶される。識別対象テーブルは、最初の段階では全ての欄がゼロに設定される。Ｓ２０２の処理では、「否定」欄が参照され、ゼロであればＹＥＳと判断され、１であればＮＯと判断される。ここでは、全体識別器５０は、識別対象テーブルにおける「風景」欄の「否定」欄を参照し、ゼロであるのでＹＥＳと判断する。 Next, the overall classifier 50 refers to the classification target table and determines whether or not a scene should be identified using the selected sub-classifier 51 (S202).
FIG. 8 is an explanatory diagram of the identification target table. This identification target table is stored in the result storage unit 31B of the storage unit 31. In the identification target table, all fields are set to zero in the first stage. In the process of S202, the “No” column is referred to, and if it is zero, it is determined as YES, and if it is 1, it is determined as NO. Here, the overall classifier 50 refers to the “No” column in the “Scenery” column in the identification target table, and determines “YES” because it is zero.

次に、サブ識別器５１は、全体特徴量に基づいて、識別対象画像が特定のシーンに属する確率（確信度）を算出する（Ｓ２０３）。サブ識別器５１には、サポートベクタマシン（ＳＶＭ）による識別手法が用いられている。なお、サポートベクタマシンについては、後述する。識別対象画像が特定のシーンに属する場合、サブ識別器５１の判別式は、プラスの値になりやすい。識別対象画像が特定のシーンに属しない場合、サブ識別器５１の判別式は、マイナスの値になりやすい。また、判別式は、識別対象画像が特定のシーンに属する確信度が高いほど、大きい値になる。このため、判別式の値が大きければ、識別対象画像が特定のシーンに属する確率（確信度）が高くなり、判別式の値が小さければ、識別対象画像が特定のシーンに属する確率が低くなる。 Next, the sub classifier 51 calculates the probability (confidence) that the classification target image belongs to a specific scene based on the entire feature amount (S203). For the sub classifier 51, a classification method using a support vector machine (SVM) is used. The support vector machine will be described later. When the classification target image belongs to a specific scene, the discriminant of the sub classifier 51 tends to be a positive value. When the classification target image does not belong to a specific scene, the discriminant of the sub classifier 51 tends to be a negative value. Further, the discriminant becomes a larger value as the certainty that the identification target image belongs to the specific scene is higher. For this reason, if the discriminant value is large, the probability (confidence) that the identification target image belongs to a specific scene is high, and if the discriminant value is small, the probability that the classification target image belongs to a specific scene is low. .

次に、サブ識別器５１は、判別式の値が肯定閾値より大きいか否かを判断する（Ｓ２０４）。判別式の値が肯定閾値より大きければ、サブ識別器５１は、識別対象画像が特定のシーンに属すると判断することになる。 Next, the sub discriminator 51 determines whether or not the value of the discriminant is larger than the positive threshold (S204). If the value of the discriminant is larger than the positive threshold, the sub discriminator 51 determines that the classification target image belongs to a specific scene.

図９は、全体識別処理の肯定閾値の説明図である。同図において、横軸は肯定閾値を示し、縦軸はRecall又はPrecisionの確率を示す。図１０は、RecallとPrecisionの説明図である。判別式の値が肯定閾値以上の場合には識別結果はPositiveであり、判別式の値が肯定閾値以上でない場合には識別結果はNegativeである。 FIG. 9 is an explanatory diagram of an affirmative threshold value of the overall identification process. In the figure, the horizontal axis indicates an affirmative threshold, and the vertical axis indicates the probability of recall or precision. FIG. 10 is an explanatory diagram of Recall and Precision. If the discriminant value is greater than or equal to the positive threshold, the identification result is Positive. If the discriminant value is not greater than or equal to the positive threshold, the identification result is Negative.

Recallは、再現率や検出率を示すものである。Recallは、特定のシーンの画像の総数に対する、特定のシーンに属すると識別された画像の数の割合である。言い換えると、Recallは、特定のシーンの画像をサブ識別器５１に識別させたときに、サブ識別器５１がPositiveと識別する確率（特定のシーンの画像が特定のシーンに属すると識別される確率）を示すものである。例えば、風景画像を風景識別器５１Ｌに識別させたときに、風景のシーンに属すると風景識別器５１Ｌが識別する確率を示すものである。 Recall indicates the recall rate and detection rate. Recall is the ratio of the number of images identified as belonging to a specific scene to the total number of images of the specific scene. In other words, Recall is the probability that the sub-identifier 51 identifies the image as a positive when the image of the specific scene is identified by the sub-identifier 51 (the probability that the image of the specific scene belongs to the specific scene. ). For example, when a landscape image is identified by the landscape classifier 51L, it indicates the probability that the landscape classifier 51L identifies it as belonging to a landscape scene.

Precisionは、正答率や正解率を示すものである。Precisionは、Positiveと識別された画像の総数に対する、特定のシーンの画像の数の割合である。言い換えると、Precisionは、特定のシーンを識別するサブ識別器５１がPositiveと識別したときに、識別対象の画像が特定のシーンである確率を示すものである。例えば、風景識別器５１Ｌが風景のシーンに属すると識別したときに、その識別した画像が本当に風景画像である確率を示すものである。 Precision indicates the correct answer rate and the correct answer rate. Precision is the ratio of the number of images in a particular scene to the total number of images identified as Positive. In other words, Precision indicates the probability that the image to be identified is a specific scene when the sub-classifier 51 that identifies the specific scene identifies it as Positive. For example, when the landscape classifier 51L identifies that it belongs to a landscape scene, it indicates the probability that the identified image is really a landscape image.

図９から分かる通り、肯定閾値を大きくするほど、Precisionが大きくなる。このため、肯定閾値を大きくするほど、例えば風景のシーンに属すると識別された画像が風景画像である確率が高くなる。つまり、肯定閾値を大きくするほど、誤識別の確率が低くなる。
一方、肯定閾値を大きくするほど、Recallは小さくなる。この結果、例えば、風景画像を風景識別器５１Ｌで識別した場合であっても、風景のシーンに属すると正しく識別しにくくなる。ところで、識別対象画像が風景のシーンに属すると識別できれば（Ｓ２０４でＹＥＳ）、残りの別のシーン（夕景など）の識別を行わないようにして全体識別処理の速度を速めている。このため、肯定閾値を大きくするほど、全体識別処理の速度は低下することになる。また、全体識別処理によってシーンが識別できれば部分識別処理を行わないようにしてシーン識別処理の速度を速めているため（Ｓ１０４）、肯定閾値を大きくするほど、シーン識別処理の速度は低下することになる。
つまり、肯定閾値が小さすぎると誤識別の確率が高くなり、大きすぎると処理速度が低下することになる。ここでは、正答率（Precision）を９７．５％に設定するため、風景の肯定閾値は１．２７に設定されている。 As can be seen from FIG. 9, the greater the positive threshold, the greater the Precision. For this reason, the larger the positive threshold value, the higher the probability that an image identified as belonging to a landscape scene, for example, is a landscape image. That is, the greater the positive threshold, the lower the probability of misidentification.
On the other hand, the larger the positive threshold, the smaller the Recall. As a result, for example, even when a landscape image is identified by the landscape classifier 51L, it is difficult to correctly identify it as belonging to a landscape scene. By the way, if the image to be identified can be identified as belonging to a landscape scene (YES in S204), the speed of the overall identification process is increased so as not to identify other remaining scenes (such as sunsets). For this reason, the larger the positive threshold, the lower the overall identification processing speed. Further, if the scene can be identified by the overall identification process, the partial identification process is not performed and the speed of the scene identification process is increased (S104). Therefore, as the positive threshold is increased, the scene identification process speed decreases. Become.
That is, if the positive threshold is too small, the probability of misidentification increases, and if it is too large, the processing speed decreases. Here, in order to set the correct answer rate (Precision) to 97.5%, the landscape affirmation threshold is set to 1.27.

判別式の値が肯定閾値より大きければ（Ｓ２０４でＹＥＳ）、サブ識別器５１は、識別対象画像が特定のシーンに属すると判断し、肯定フラグを立てる（Ｓ２０５）。「肯定フラグを立てる」とは、図８の「肯定」欄を１にすることである。この場合、全体識別器５０は、次のサブ識別器５１による識別を行わずに、全体識別処理を終了する。例えば、風景画像であると識別できれば、夕景などの識別を行わずに、全体識別処理を終了する。この場合、次のサブ識別器５１による識別を省略しているので、全体識別処理の速度を速めることができる。 If the discriminant value is greater than the affirmative threshold value (YES in S204), the sub-classifier 51 determines that the classification target image belongs to a specific scene and sets an affirmative flag (S205). “Set an affirmative flag” means that the “affirmation” column in FIG. In this case, the overall discriminator 50 ends the overall discrimination process without performing discrimination by the next sub discriminator 51. For example, if the image can be identified as a landscape image, the entire identification process is terminated without identifying the sunset scene or the like. In this case, since the identification by the next sub-identifier 51 is omitted, the speed of the overall identification process can be increased.

判別式の値が肯定閾値より大きくなければ（Ｓ２０４でＮＯ）、サブ識別器５１は、識別対象画像が特定のシーンに属すると判断できず、次のＳ２０６の処理を行う。 If the value of the discriminant is not greater than the positive threshold (NO in S204), the sub discriminator 51 cannot determine that the classification target image belongs to a specific scene, and performs the next process of S206.

次に、サブ識別器５１は、判別式の値と否定閾値とを比較する（Ｓ２０６）。これにより、サブ識別器５１は、識別対象画像が所定のシーンに属しないかを判断する。このような判断としては、２種類ある。第１に、ある特定のシーンのサブ識別器５１の判別式の値が第１否定閾値より小さければ、その特定のシーンに識別対象画像が属しないと判断されることになる。例えば、風景識別器５１Ｌの判別式の値が第１否定閾値より小さければ、識別対象画像が風景のシーンに属しないと判断されることになる。第２に、ある特定のシーンのサブ識別器５１の判別式の値が第２否定閾値より大きければ、その特定のシーンとは別のシーンに識別対象画像が属しないと判断されることになる。例えば、風景識別器５１Ｌの判別式の値が第２否定閾値より大きければ、識別対象画像が夜景のシーンに属しないと判断されることになる。 Next, the sub discriminator 51 compares the discriminant value with a negative threshold value (S206). Thereby, the sub classifier 51 determines whether the classification target image does not belong to a predetermined scene. There are two types of such determinations. First, if the value of the discriminant of the sub-identifier 51 of a specific scene is smaller than the first negative threshold, it is determined that the classification target image does not belong to the specific scene. For example, if the discriminant value of the landscape classifier 51L is smaller than the first negative threshold, it is determined that the classification target image does not belong to a landscape scene. Second, if the value of the discriminant of the sub-identifier 51 of a specific scene is larger than the second negative threshold, it is determined that the classification target image does not belong to a scene different from the specific scene. . For example, if the discriminant value of the landscape classifier 51L is larger than the second negative threshold, it is determined that the classification target image does not belong to the night scene.

図１１は、第１否定閾値の説明図である。同図において、横軸は第１否定閾値を示し、縦軸は確率を示す。グラフの太線は、True Negative Recallのグラフであり、風景画像以外の画像を風景画像ではないと正しく識別する確率を示している。グラフの細線は、False Negative Recallのグラフであり、風景画像なのに風景画像ではないと誤って識別する確率を示している。 FIG. 11 is an explanatory diagram of the first negative threshold. In the figure, the horizontal axis indicates the first negative threshold, and the vertical axis indicates the probability. The bold line in the graph is a True Negative Recall graph, and indicates the probability of correctly identifying an image other than a landscape image as not a landscape image. The thin line in the graph is a False Negative Recall graph, which indicates the probability of erroneously identifying a landscape image that is not a landscape image.

図１１から分かる通り、第１否定閾値を小さくするほど、False Negative Recallが小さくなる。このため、第１否定閾値を小さくするほど、例えば風景のシーンに属しないと識別された画像が風景画像である確率が低くなる。つまり、誤識別の確率が低くなる。
一方、第１否定閾値を小さくするほど、True Negative Recallも小さくなる。この結果、風景画像以外の画像を風景画像ではないと識別しにくくなる。その一方、識別対象画像が特定シーンでないことを識別できれば、部分識別処理の際に、その特定シーンのサブ部分識別器６１による処理を省略してシーン識別処理速度を速めている（後述、図１４のＳ３０２）。このため、第１否定閾値を小さくするほど、シーン識別処理速度は低下する。
つまり、第１否定閾値が大きすぎると誤識別の確率が高くなり、小さすぎると処理速度が低下することになる。ここでは、False Negative Recallを２．５％に設定するため、第１否定閾値は−１．１０に設定されている。 As can be seen from FIG. 11, the smaller the first negative threshold is, the smaller False Negative Recall is. For this reason, the smaller the first negative threshold, the lower the probability that an image identified as not belonging to a landscape scene is a landscape image, for example. That is, the probability of misidentification is reduced.
On the other hand, the True Negative Recall decreases as the first negative threshold decreases. As a result, it is difficult to identify an image other than a landscape image unless it is a landscape image. On the other hand, if the identification target image can be identified as not being a specific scene, the process by the sub-partial classifier 61 for the specific scene is omitted during the partial identification process to increase the scene identification processing speed (described later in FIG. 14). S302). For this reason, the scene identification processing speed decreases as the first negative threshold is decreased.
That is, if the first negative threshold is too large, the probability of misidentification increases, and if it is too small, the processing speed decreases. Here, in order to set False Negative Recall to 2.5%, the first negative threshold is set to −1.10.

ところで、ある画像が風景のシーンに属する確率が高ければ、必然的にその画像が夜景のシーンに属する確率は低くなる。このため、風景識別器５１Ｌの判別式の値が大きい場合には、夜景ではないと識別できる場合がある。このような識別を行うために、第２否定閾値が設けられる。 By the way, if the probability that an image belongs to a landscape scene is high, the probability that the image belongs to a night scene is inevitably low. For this reason, when the discriminant value of the landscape discriminator 51L is large, it may be identified that the scene is not a night scene. In order to perform such identification, a second negative threshold is provided.

図１２は、第２否定閾値の説明図である。同図において、横軸は風景の判別式の値を示し、縦軸は確率を示す。同図には、図９のRecallとPrecisionのグラフとともに、夜景のRecallのグラフが点線で描かれている。この点線のグラフに注目すると、風景の判別式の値が−０．４５よりも大きければ、その画像が夜景画像である確率は２．５％である。言い換えると、風景の判別式の値が−０．４５より大きい場合にその画像が夜景画像でないと識別しても、誤識別の確率は２．５％にすぎない。そこで、ここでは、第２否定閾値が−０．４５に設定されている。 FIG. 12 is an explanatory diagram of the second negative threshold. In the figure, the horizontal axis indicates the value of the landscape discriminant, and the vertical axis indicates the probability. In the same figure, the Recall graph of the night view is drawn with a dotted line together with the Recall and Precision graph of FIG. If attention is paid to this dotted line graph, if the value of the discriminant of the landscape is larger than −0.45, the probability that the image is a night scene image is 2.5%. In other words, if the landscape discriminant value is greater than −0.45, even if the image is identified as not a night scene image, the probability of misidentification is only 2.5%. Therefore, here, the second negative threshold is set to −0.45.

そして、判別式の値が第１否定閾値より小さい場合、又は、判別式の値が第２否定閾値より大きい場合（Ｓ２０６でＹＥＳ）、サブ識別器５１は、識別対象画像が所定のシーンに属しないと判断し、否定フラグを立てる（Ｓ２０７）。「否定フラグを立てる」とは、図８の「否定」欄を１にすることである。例えば、第１否定閾値に基づいて識別対象画像が風景のシーンに属しないと判断された場合、「風景」欄の「否定」欄が１になる。また、第２否定閾値に基づいて識別対象画像が夜景のシーンに属しないと判断された場合、「夜景」欄の「否定」欄が１になる。 When the discriminant value is smaller than the first negative threshold value, or when the discriminant value is larger than the second negative threshold value (YES in S206), the sub-classifier 51 determines that the classification target image belongs to a predetermined scene. It is determined not to do so, and a negative flag is set (S207). “Set a negative flag” means to set the “No” column in FIG. For example, when it is determined that the image to be identified does not belong to a landscape scene based on the first negative threshold, the “denial” column in the “landscape” column is 1. Further, when it is determined that the identification target image does not belong to the night scene based on the second negative threshold, the “Negation” field in the “Night scene” field is “1”.

図１３Ａは、閾値テーブルの説明図である。この閾値テーブルは、記憶部３１に記憶されていても良いし、全体識別処理を実行させるためのプログラムの一部に組み込まれていても良い。閾値テーブルには、前述の肯定閾値や否定閾値に関するデータが格納されている。 FIG. 13A is an explanatory diagram of a threshold table. The threshold value table may be stored in the storage unit 31 or may be incorporated in a part of a program for executing the overall identification process. The threshold table stores data related to the affirmative threshold and the negative threshold described above.

図１３Ｂは、上記で説明した風景識別器５１Ｌにおける閾値の説明図である。風景識別器５１Ｌには、肯定閾値及び否定閾値が予め設定されている。肯定閾値として１．２７が設定されている。否定閾値には第１否定閾値と第２否定閾値とがある。第１否定閾値として−１．１０が設定されている。また、第２否定閾値として、風景以外の各シーンにそれぞれ値が設定されている。 FIG. 13B is an explanatory diagram of threshold values in the landscape classifier 51L described above. An affirmative threshold value and a negative threshold value are preset in the landscape discriminator 51L. 1.27 is set as the positive threshold. The negative threshold includes a first negative threshold and a second negative threshold. -1.10 is set as the first negative threshold. In addition, a value is set for each scene other than the landscape as the second negative threshold.

図１３Ｃは、上記で説明した風景識別器５１Ｌの処理の概要の説明図である。ここでは、説明の簡略化のため、第２否定閾値については夜景についてのみ説明する。風景識別器５１Ｌは、判別式の値が１．２７より大きければ（Ｓ２０４でＹＥＳ）、識別対象画像が風景のシーンに属すると判断する。また、判別式の値が１．２７以下であり（Ｓ２０４でＮＯ）、−０．４５より大きければ（Ｓ２０６でＹＥＳ）、風景識別器５１Ｌは、識別対象画像が夜景のシーンに属しないと判断する。また、判別式の値が−１．１０より小さければ（Ｓ２０６でＹＥＳ）、風景識別器５１Ｌは、識別対象画像が風景のシーンに属しないと判断する。なお、風景識別器５１Ｌは、夕景や花や紅葉についても、第２否定閾値に基づいて、識別対象画像がそのシーンに属しないかを判断する。但し、これらの第２否定閾値は肯定閾値よりも大きいため、識別対象画像がこれらのシーンに属しないことを風景識別器５１Ｌが判断することはない。 FIG. 13C is an explanatory diagram outlining the processing of the landscape classifier 51L described above. Here, for simplification of description, only the night view will be described for the second negative threshold. If the discriminant value is greater than 1.27 (YES in S204), the landscape classifier 51L determines that the classification target image belongs to a landscape scene. If the discriminant value is 1.27 or less (NO in S204) and is greater than −0.45 (YES in S206), the landscape classifier 51L determines that the classification target image does not belong to the night scene. To do. If the value of the discriminant is smaller than −1.10 (YES in S206), the landscape classifier 51L determines that the classification target image does not belong to a landscape scene. Note that the landscape discriminator 51L also determines whether the image to be identified does not belong to the scene based on the second negative threshold for sunset scenes, flowers, and autumn leaves. However, since these second negative threshold values are larger than the positive threshold values, the landscape discriminator 51L does not determine that the classification target image does not belong to these scenes.

Ｓ２０２においてＮＯの場合、Ｓ２０６でＮＯの場合、又はＳ２０７の処理を終えた場合、全体識別器５０は、次のサブ識別器５１の有無を判断する（Ｓ２０８）。ここでは風景識別器５１Ｌによる処理を終えた後なので、全体識別器５０は、Ｓ２０８において、次のサブ識別器５１（夕景識別器５１Ｓ）があると判断する。 In the case of NO in S202, in the case of NO in S206, or when the processing in S207 is completed, the overall discriminator 50 determines the presence or absence of the next sub discriminator 51 (S208). Here, since the process by the landscape classifier 51L is finished, the overall classifier 50 determines in S208 that there is a next sub-classifier 51 (evening scene classifier 51S).

そして、Ｓ２０５の処理を終えた場合（識別対象画像が特定のシーンに属すると判断された場合）、又は、Ｓ２０８において次のサブ識別器５１がないと判断された場合（識別対象画像が特定のシーンに属すると判断できなかった場合）、全体識別器５０は、全体識別処理を終了する。 Then, when the process of S205 is finished (when it is determined that the identification target image belongs to a specific scene), or when it is determined in S208 that there is no next sub-classifier 51 (the identification target image is a specific image). When it cannot be determined that the scene belongs to the scene), the overall discriminator 50 ends the overall discrimination process.

なお、既に説明した通り、全体識別処理が終了すると、シーン識別部３３は、全体識別処理によってシーンの識別ができたか否かを判断する（図５のＳ１０４）。このとき、シーン識別部３３は、図８の識別対象テーブルを参照し、「肯定」欄に１があるか否かを判断することになる。
全体識別処理によってシーンの識別ができた場合（Ｓ１０４でＹＥＳ）、部分識別処理や統合識別処理が省略される。これにより、シーン識別処理の速度が速くなる。 As already described, when the overall identification process is completed, the scene identification unit 33 determines whether or not the scene has been identified by the overall identification process (S104 in FIG. 5). At this time, the scene identification unit 33 refers to the identification target table in FIG. 8 and determines whether or not there is 1 in the “affirmation” column.
If the scene can be identified by the overall identification process (YES in S104), the partial identification process and the integrated identification process are omitted. This increases the speed of the scene identification process.

ところで、上記の説明には無いが、全体識別器５０は、サブ識別器５１によって判別式の値を算出したときには、判別式の値に対応するPrecisionを、確信度に関する情報として結果記憶部３１Ｂに記憶する。もちろん、判別式の値そのものを確信度に関する情報として記憶しても良い。 By the way, although not described above, the overall discriminator 50, when the sub discriminator 51 calculates the discriminant value, the Precision corresponding to the discriminant value is stored in the result storage unit 31B as information on the certainty factor. Remember. Of course, the discriminant value itself may be stored as information on the certainty factor.

＜部分識別処理＞
図１４は、部分識別処理のフロー図である。部分識別処理は、全体識別処理によってシーンの識別ができなかった場合（図５のＳ１０４でＮＯ）に行われる。以下に説明するように、部分識別処理は、分割された部分画像のシーンをそれぞれ識別することによって、画像全体のシーンを識別する処理である。ここでは図６も参照しながら部分識別処理について説明する。 <Partial identification processing>
FIG. 14 is a flowchart of the partial identification process. The partial identification process is performed when the scene cannot be identified by the overall identification process (NO in S104 of FIG. 5). As will be described below, the partial identification process is a process for identifying the scene of the entire image by identifying each scene of the divided partial images. Here, the partial identification process will be described with reference to FIG.

まず、部分識別器６０は、複数のサブ部分識別器６１の中から１つのサブ部分識別器６１を選択する（Ｓ３０１）。部分識別器６０には、サブ部分識別器６１が３つ設けられている。各サブ部分識別器６１は、８×８の６４ブロックに分割された部分画像がそれぞれ特定のシーンに属するか否かを識別する。ここでの３つのサブ部分識別器６１は、それぞれ夕景、花、紅葉のシーンを識別する。ここでは、部分識別器６０は、夕景→花→紅葉の順に、サブ部分識別器６１を選択する（なお、サブ部分識別器６１の選択順序については、後述する）。このため、最初には、部分画像が夕景のシーンに属するか否かを識別するサブ部分識別器６１（夕景部分識別器６１Ｓ）が選択される。 First, the partial classifier 60 selects one sub partial classifier 61 from the plurality of sub partial classifiers 61 (S301). The partial discriminator 60 is provided with three sub partial discriminators 61. Each sub partial discriminator 61 discriminates whether or not each partial image divided into 8 × 8 64 blocks belongs to a specific scene. Here, the three sub partial classifiers 61 identify the scenes of sunset, flowers, and autumn leaves, respectively. Here, the partial discriminator 60 selects the sub partial discriminator 61 in the order of sunset scene → flower → autumn leaves (the selection order of the sub partial discriminator 61 will be described later). Therefore, first, the sub partial classifier 61 (evening scene partial classifier 61S) for identifying whether or not the partial image belongs to the sunset scene is selected.

次に、部分識別器６０は、識別対象テーブル（図８）を参照し、選択したサブ部分識別器６１を用いてシーンを識別すべきか否かを判断する（Ｓ３０２）。ここでは、部分識別器６０は、識別対象テーブルにおける「夕景」欄の「否定」欄を参照し、ゼロであればＹＥＳと判断し、１であればＮＯと判断する。なお、全体識別処理の際に、夕景識別器５１Ｓが第１否定閾値により否定フラグを立てたとき、又は、他のサブ識別器５１が第２否定閾値により否定フラグを立てたとき、このＳ３０２でＮＯと判断される。仮にＮＯと判断されると夕景の部分識別処理は省略されることになるので、部分識別処理の速度が速くなる。但し、ここでは説明の都合上、ＹＥＳと判断されるものとする。 Next, the partial discriminator 60 refers to the discrimination target table (FIG. 8) and determines whether or not the scene should be discriminated using the selected sub partial discriminator 61 (S302). Here, the partial discriminator 60 refers to the “No” column of the “Evening Scene” column in the classification target table, and determines YES if it is zero, and NO if it is 1. When the evening scene classifier 51S sets a negative flag with the first negative threshold during the overall identification process or when another sub-classifier 51 sets a negative flag with the second negative threshold, in S302 It is judged as NO. If it is determined NO, the sunset partial identification process is omitted, and the partial identification process speed increases. However, for the convenience of explanation, it is assumed that YES is determined here.

次に、サブ部分識別器６１は、８×８の６４ブロックに分割された部分画像の中から、１つの部分画像を選択する（Ｓ３０３）。
図１５は、夕景部分識別器６１Ｓが選択する部分画像の順番の説明図である。部分画像から画像全体のシーンを識別するような場合、識別に用いられる部分画像は、被写体が存在する部分であることが望ましい。そこで、数千枚のサンプルの夕景画像を用意し、各夕景画像を８×８の６４ブロックに分割し、夕景部分画像（夕景の太陽と空の部分画像）を含むブロックを抽出し、抽出されたブロックの位置に基づいて各ブロックにおける夕景部分画像の存在確率を算出した。そして、存在確率の高いブロックから順番に、部分画像が選択される。なお、図に示す選択順序の情報は、プログラムの一部としてメモリ２３に格納されている。 Next, the sub partial discriminator 61 selects one partial image from the partial images divided into 8 × 8 64 blocks (S303).
FIG. 15 is an explanatory diagram of the order of partial images selected by the evening scene partial classifier 61S. When a scene of the entire image is identified from the partial image, it is desirable that the partial image used for identification is a portion where the subject exists. Therefore, thousands of samples of sunset scene images are prepared, each sunset scene image is divided into 64 blocks of 8 × 8, and blocks including sunset scene partial images (sun sunset sky and sky partial images) are extracted and extracted. Based on the position of each block, the existence probability of the sunset partial image in each block was calculated. Then, partial images are selected in order from the block having the highest existence probability. Note that the selection order information shown in the figure is stored in the memory 23 as part of the program.

なお、夕景画像の場合、画像の中央付近から上半分に夕景の空が広がっていることが多いため、中央付近から上半分のブロックにおいて存在確率が高くなる。また、夕景画像の場合、画像の下１／３では逆光で陰になり、部分画像単体では夕景か夜景か区別がつかないことが多いため、下１／３のブロックにおいて存在確率が低くなる。花画像の場合、花を中央付近に配置させる構図にすることが多いため、中央付近における花部分画像の存在確率が高くなる。 In the case of an evening scene image, since the sky of the evening scene often spreads from the vicinity of the center to the upper half, the existence probability increases in the upper half block from the vicinity of the center. In the case of an evening scene image, the lower 1/3 of the image is shaded by backlight, and the partial image alone often cannot be distinguished from the evening scene or the night scene, so the existence probability is lower in the lower 1/3 block. In the case of a flower image, since the composition is often such that a flower is arranged near the center, the probability of existence of a flower partial image near the center increases.

次に、サブ部分識別器６１は、選択された部分画像の部分特徴量に基づいて、その部分画像が特定のシーンに属するか否かを判断する（Ｓ３０４）。サブ部分識別器６１には、全体識別器５０のサブ識別器５１と同様に、サポートベクタマシン（ＳＶＭ）による判別手法が用いられている。なお、サポートベクタマシンについては、後述する。判別式の値が正の値であれば、部分画像が特定のシーンに属すると判断し、サブ部分識別器６１は正カウント値をインクリメントする。また、判別式の値が負の値であれば、部分画像が特定のシーンに属しないと判断し、サブ部分識別器６１は負カウント値をインクリメントする。 Next, the sub partial classifier 61 determines whether or not the partial image belongs to a specific scene based on the partial feature amount of the selected partial image (S304). Similar to the sub classifier 51 of the overall classifier 50, the sub partial classifier 61 uses a discrimination method using a support vector machine (SVM). The support vector machine will be described later. If the discriminant value is a positive value, it is determined that the partial image belongs to a specific scene, and the sub partial classifier 61 increments the positive count value. If the discriminant value is a negative value, it is determined that the partial image does not belong to a specific scene, and the sub partial discriminator 61 increments the negative count value.

次に、サブ部分識別器６１は、正カウント値が肯定閾値よりも大きい否かを判断する（Ｓ３０５）。なお、正カウント値は、特定のシーンに属すると判断された部分画像の数を示すものである。正カウント値が肯定閾値より大きければ（Ｓ３０５でＹＥＳ）、サブ部分識別器６１は、識別対象画像が特定のシーンに属すると判断し、肯定フラグを立てる（Ｓ３０６）。この場合、部分識別器６０は、次のサブ部分識別器６１による識別を行わずに、部分識別処理を終了する。例えば、夕景画像であると識別できれば、花や紅葉の識別を行わずに、部分識別処理を終了する。この場合、次のサブ部分識別器６１による識別を省略しているので、部分識別処理の速度を速めることができる。 Next, the sub partial discriminator 61 determines whether or not the positive count value is larger than the positive threshold value (S305). The positive count value indicates the number of partial images determined to belong to a specific scene. If the positive count value is larger than the affirmative threshold (YES in S305), the sub partial classifier 61 determines that the classification target image belongs to a specific scene, and sets an affirmative flag (S306). In this case, the partial discriminator 60 ends the partial discriminating process without performing discrimination by the next sub partial discriminator 61. For example, if the image can be identified as an evening scene image, the partial identification process is terminated without identifying flowers and autumn leaves. In this case, since the identification by the next sub partial classifier 61 is omitted, the speed of the partial classification process can be increased.

正カウント値が肯定閾値より大きくなければ（Ｓ３０５でＮＯ）、サブ部分識別器６１は、識別対象画像が特定のシーンに属すると判断できず、次のＳ３０７の処理を行う。 If the positive count value is not greater than the positive threshold value (NO in S305), the sub partial classifier 61 cannot determine that the classification target image belongs to a specific scene, and performs the next process of S307.

サブ部分識別器６１は、正カウント値と残りの部分画像数との和が肯定閾値よりも小さければ（Ｓ３０７でＹＥＳ）、Ｓ３０９の処理へ進む。正カウント値と残りの部分画像数との和が肯定閾値よりも小さい場合、残り全ての部分画像によって正カウント値がインクリメントされても正カウント値が肯定閾値より大きくなることがないので、Ｓ３０９に処理を進めることによって、残りの部分画像についてサポートベクタマシンによる識別を省略する。これにより、部分識別処理の速度を速めることができる。 If the sum of the positive count value and the number of remaining partial images is smaller than the positive threshold (YES in S307), the sub partial discriminator 61 proceeds to the process of S309. If the sum of the positive count value and the number of remaining partial images is smaller than the positive threshold value, the positive count value does not become larger than the positive threshold value even if the positive count value is incremented by all the remaining partial images. By proceeding with the process, the remaining partial images are not identified by the support vector machine. Thereby, the speed of the partial identification process can be increased.

サブ部分識別器６１がＳ３０７でＮＯと判断した場合、サブ部分識別器６１は、次の部分画像の有無を判断する（Ｓ３０８）。なお、ここでは、６４個に分割された部分画像の全てを順に選択していない。図１５において太枠で示された上位１０番目までの１０個の部分画像だけを順に選択している。このため、１０番目の部分画像の識別を終えれば、サブ部分識別器６１は、Ｓ３０８において次の部分画像はないと判断する。（この点を考慮して、Ｓ３０７の「残りの部分画像数」も決定される。）
図１６は、上位１０番目までの１０個の部分画像だけで夕景画像の識別をしたときのRecall及びPrecisionのグラフである。図に示すような肯定閾値を設定すれば、正答率（Precision）を８０％程度に設定でき、再現率（Recall）を９０％程度に設定でき、精度の高い識別が可能である。 If the sub partial discriminator 61 determines NO in S307, the sub partial discriminator 61 determines whether there is a next partial image (S308). Here, all of the partial images divided into 64 pieces are not selected in order. In FIG. 15, only the top 10 partial images indicated by thick frames are selected in order. Therefore, when the identification of the tenth partial image is completed, the sub partial classifier 61 determines in S308 that there is no next partial image. (In consideration of this point, the “number of remaining partial images” in S307 is also determined.)
FIG. 16 is a Recall and Precision graph when an evening scene image is identified using only the top 10 partial images. If an affirmative threshold as shown in the figure is set, the accuracy rate (Precision) can be set to about 80%, the recall rate (Recall) can be set to about 90%, and identification with high accuracy is possible.

部分識別処理では、１０個の部分画像だけで夕景画像の識別を行っている。このため、６４個の全ての部分画像を用いて夕景画像の識別を行うよりも、部分識別処理の速度を速めることができる。
また、部分識別処理では、夕景部分画像の存在確率の高い上位１０番目の部分画像を用いて夕景画像の識別を行っている。このため、存在確率を無視して抽出された１０個の部分画像を用いて夕景画像の識別を行うよりも、Recall及びPrecisionをともに高く設定することが可能になる。
また、部分識別処理では、夕景部分画像の存在確率の高い順に部分画像を選択している。この結果、早い段階でＳ３０５の判断がＹＥＳになりやすくなる。このため、本実施形態では、存在確率の高低を無視した順で部分画像を選択したときよりも、部分識別処理の速度を速めることができる。 In the partial identification process, the evening scene image is identified using only 10 partial images. For this reason, it is possible to increase the speed of the partial identification processing compared to the case where the evening scene image is identified using all 64 partial images.
In the partial identification process, the sunset scene image is identified using the top tenth partial image having a high existence probability of the sunset scene partial image. For this reason, it is possible to set both Recall and Precision higher than the identification of an evening scene image using 10 partial images extracted by ignoring the existence probability.
In the partial identification process, partial images are selected in descending order of the existence probability of the sunset partial image. As a result, the determination in S305 is likely to be YES at an early stage. For this reason, in the present embodiment, the speed of the partial identification process can be increased as compared with the case where the partial images are selected in the order in which the presence probability level is ignored.

Ｓ３０７においてＹＥＳと判断された場合、又は、Ｓ３０８において次の部分画像がないと判断された場合、サブ部分識別器６１は、負カウント値が否定閾値よりも大きいか否かを判断する（Ｓ３０９）。この否定閾値は、前述の全体識別処理における否定閾値（図７のＳ２０６）とほぼ同様の機能を果たすものなので、詳しい説明は省略する。Ｓ３０９でＹＥＳと判断された場合、図７のＳ２０７と同様に、否定フラグを立てる。 When it is determined YES in S307, or when it is determined that there is no next partial image in S308, the sub partial discriminator 61 determines whether or not the negative count value is larger than the negative threshold (S309). . Since this negative threshold performs substantially the same function as the negative threshold (S206 in FIG. 7) in the above-described overall identification process, detailed description thereof is omitted. When YES is determined in S309, a negative flag is set as in S207 of FIG.

Ｓ３０２においてＮＯの場合、Ｓ３０９でＮＯの場合、又はＳ３１０の処理を終えた場合、部分識別器６０は、次のサブ部分識別器６１の有無を判断する（Ｓ３１１）。夕景部分識別器６１Ｓによる処理を終えた後の場合、サブ部分識別器６１として花部分識別器６１Ｆや紅葉部分識別器６１Ｒがまだあるので、部分識別器６０は、Ｓ３１１において、次のサブ部分識別器６１があると判断する。 In the case of NO in S302, in the case of NO in S309, or when the process of S310 is completed, the partial discriminator 60 determines whether or not there is a next sub partial discriminator 61 (S311). In the case after the processing by the evening scene partial classifier 61S is finished, since the flower partial classifier 61F and the autumnal leaves partial classifier 61R are still present as the sub partial classifier 61, the partial classifier 60 determines the next sub partial classifier in S311. It is determined that there is a container 61.

そして、Ｓ３０６の処理を終えた場合（識別対象画像が特定のシーンに属すると判断された場合）、又は、Ｓ３１１において次のサブ部分識別器６１がないと判断された場合（識別対象画像が特定のシーンに属すると判断できなかった場合）、部分識別器６０は、部分識別処理を終了する。 Then, when the process of S306 is completed (when it is determined that the identification target image belongs to a specific scene), or when it is determined in S311 that there is no next sub partial classifier 61 (the identification target image is specified). If it cannot be determined that the scene belongs to the scene), the partial discriminator 60 ends the partial discrimination processing.

なお、既に説明した通り、部分識別処理が終了すると、シーン識別部３３は、部分識別処理によってシーンの識別ができたか否かを判断する（図５のＳ１０６）。このとき、シーン識別部３３は、図８の識別対象テーブルを参照し、「肯定」欄に１があるか否かを判断することになる。
部分識別処理によってシーンの識別ができた場合（Ｓ１０６でＹＥＳ）、統合識別処理が省略される。これにより、シーン識別処理の速度が速くなる。 As already described, when the partial identification process is completed, the scene identification unit 33 determines whether or not the scene has been identified by the partial identification process (S106 in FIG. 5). At this time, the scene identification unit 33 refers to the identification target table in FIG. 8 and determines whether or not there is 1 in the “affirmation” column.
When the scene can be identified by the partial identification process (YES in S106), the integrated identification process is omitted. This increases the speed of the scene identification process.

ところで、上記の説明では、夕景部分識別器６１Ｓは、１０個の部分画像を用いて夕景画像の識別を行っているが、識別に用いられる部分画像の数は１０個に限られるものではない。また、他のサブ部分識別器６１が、夕景部分識別器６１Ｓとは異なる数の部分画像を用いて画像を識別しても良い。ここでは、花部分識別器６１Ｆは２０個の部分画像を用いて花画像を識別し、また、紅葉部分識別器６１Ｒは１５個の部分画像を用いて紅葉画像を識別するものとする。 In the above description, the evening scene partial classifier 61S identifies evening scene images using ten partial images. However, the number of partial images used for identification is not limited to ten. Further, the other sub partial classifier 61 may identify an image using a different number of partial images from the sunset scene partial classifier 61S. Here, it is assumed that the flower portion identifier 61F identifies a flower image using 20 partial images, and the autumnal leaf portion identifier 61R identifies a autumnal leaf image using 15 partial images.

＜サポートベクタマシン＞
統合識別処理について説明する前に、全体識別処理のサブ識別器５１や部分識別処理のサブ部分識別器６１において用いられているサポートベクタマシン（ＳＶＭ）について説明する。 <Support vector machine>
Before describing the integrated identification process, the support vector machine (SVM) used in the sub-identifier 51 for the overall identification process and the sub-partial identifier 61 for the partial identification process will be described.

図１７Ａは、線形サポートベクタマシンによる判別の説明図である。ここでは、２つの特徴量ｘ１、ｘ２によって、学習用サンプルを２次元空間に示している。学習用サンプルは２つのクラスＡ、Ｂに分けられている。図中では、クラスＡに属するサンプルは丸で示されており、クラスＢに属するサンプルは四角で示されている。
学習用サンプルを用いた学習によって、２次元空間を２つに分ける境界が定義される。境界は、＜ｗ・ｘ＞＋ｂ＝０で定義される（なお、ｘ＝（ｘ１，ｘ２）であり、ｗは重みベクトルであり、＜ｗ・ｘ＞はｗとｘの内積である）。但し、境界は、マージンが最大になるように、学習用サンプルを用いた学習によって定義される。つまり、図の場合、境界は、太点線ではなく、太実線のようになる。
判別は、判別式ｆ（ｘ）＝＜ｗ・ｘ＞＋ｂを用いて行われる。ある入力ｘ（この入力ｘは学習用サンプルとは別である）について、ｆ（ｘ）＞０であればクラスＡに属すると判別され、ｆ（ｘ）＜０であればクラスＢに属すると判別される。 FIG. 17A is an explanatory diagram of determination by the linear support vector machine. Here, the learning sample is shown in a two-dimensional space by two feature amounts x1 and x2. The learning sample is divided into two classes A and B. In the figure, samples belonging to class A are indicated by circles, and samples belonging to class B are indicated by squares.
A boundary that divides the two-dimensional space into two is defined by learning using the learning sample. The boundary is defined by <w · x> + b = 0 (where x = (x1, x2), w is a weight vector, and <w · x> is an inner product of w and x). However, the boundary is defined by learning using a learning sample so that the margin is maximized. That is, in the case of the figure, the boundary is not a thick dotted line but a thick solid line.
The discrimination is performed using the discriminant f (x) = <w · x> + b. It is determined that a certain input x (this input x is different from the learning sample) belongs to class A if f (x)> 0, and belongs to class B if f (x) <0. Determined.

ここでは２次元空間を用いて説明しているが、これに限られない（つまり、特徴量は２以上でも良い）。この場合、境界は超平面で定義される。 Here, the description is made using a two-dimensional space, but the present invention is not limited to this (that is, the feature amount may be two or more). In this case, the boundary is defined by a hyperplane.

ところで、２つのクラスに線形関数で分離できないことがある。このような場合に線形サポートベクタマシンによる判別を行うと、判別結果の精度が低下する。そこで、入力空間の特徴量を非線形変換すれば、すなわち入力空間からある特徴空間へ非線形写像すれば、特徴空間において線形関数で分離することができるようになる。非線形サポートベクタマシンでは、これを利用している。 By the way, there are cases where the two classes cannot be separated by a linear function. In such a case, if the determination is performed by the linear support vector machine, the accuracy of the determination result is lowered. Therefore, if the feature quantity of the input space is nonlinearly transformed, that is, if the input space is nonlinearly mapped to a certain feature space, it can be separated by a linear function in the feature space. This is used in the nonlinear support vector machine.

図１７Ｂは、カーネル関数を用いた判別の説明図である。ここでは、２つの特徴量ｘ１、ｘ２によって、学習用サンプルを２次元空間に示している。図１７Ｂの入力空間からの非線形写像が図１７Ａのような特徴空間になれば、線形関数で２つのクラスに分離することが可能になる。この特徴空間においてマージンが最大になるように境界が定義されれば、特徴空間における境界の逆写像が、図１７Ｂに示す境界になる。この結果、図１７Ｂに示すように、境界は非線形になる。 FIG. 17B is an explanatory diagram of discrimination using a kernel function. Here, the learning sample is shown in a two-dimensional space by two feature amounts x1 and x2. If the nonlinear mapping from the input space of FIG. 17B becomes a feature space as shown in FIG. 17A, it can be separated into two classes by a linear function. If the boundary is defined so that the margin is maximized in this feature space, the inverse mapping of the boundary in the feature space becomes the boundary shown in FIG. 17B. As a result, the boundary becomes nonlinear as shown in FIG. 17B.

ここではガウスカーネルを利用することにより、判別式ｆ（ｘ）は次式のようになる（なお、Ｍは特徴量の数であり、Ｎは学習用サンプルの数（若しくは境界に寄与する学習用サンプルの数）であり、ｗ_ｉは重み係数であり、ｙ_ｊは学習用サンプルの特徴量であり、ｘ_ｊは入力ｘの特徴量である）。

Here, by using a Gaussian kernel, the discriminant f (x) becomes as follows (where M is the number of features, and N is the number of learning samples (or learning for contributing to the boundary) The number of samples), w _i is a weighting factor, y _j is the feature quantity of the learning sample, and x _j is the feature quantity of the input x).

ある入力ｘ（この入力ｘは学習用サンプルとは別である）について、ｆ（ｘ）＞０であればクラスＡに属すると判別され、ｆ（ｘ）＜０であればクラスＢに属すると判別される。また、判別式ｆ（ｘ）の値が大きい値になるほど、入力ｘ（この入力ｘは学習用サンプルとは別である）がクラスＡに属する確率が高くなる。逆に、判別式ｆ（ｘ）の値が小さい値になるほど、入力ｘ（この入力ｘは学習用サンプルとは別である）がクラスＡに属する確率が低くなる。 It is determined that a certain input x (this input x is different from the learning sample) belongs to class A if f (x)> 0, and belongs to class B if f (x) <0. Determined. Further, the larger the value of the discriminant f (x), the higher the probability that the input x (this input x is different from the learning sample) belongs to the class A. On the contrary, the smaller the value of the discriminant f (x), the lower the probability that the input x (this input x is different from the learning sample) belongs to the class A.

前述の全体識別処理のサブ識別器５１や部分識別処理のサブ部分識別器６１では、上記のサポートベクタマシンの判別式ｆ（ｘ）の値を用いている。サポートベクタマシンによる判別式ｆ（ｘ）の値の算出には、学習用サンプルの数が多くなると時間がかかる。このため、判別式ｆ（ｘ）の値を複数回算出する必要があるサブ部分識別器６１は、判別式ｆ（ｘ）の値を１回算出すれば済むサブ識別器５１よりも、処理時間がかかる。 In the sub-identifier 51 for the overall identification process and the sub-partial identifier 61 for the partial identification process, the value of the discriminant f (x) of the support vector machine is used. The calculation of the value of the discriminant f (x) by the support vector machine takes time as the number of learning samples increases. For this reason, the sub partial classifier 61 that needs to calculate the value of the discriminant f (x) a plurality of times requires more processing time than the sub classifier 51 that only needs to calculate the value of the discriminant f (x) once. It takes.

なお、学習用サンプルとは別に評価用サンプルが用意されている。前述のRecallやPrecisionのグラフは、評価用サンプルに対する識別結果に基づくものである。 An evaluation sample is prepared separately from the learning sample. The above Recall and Precision graphs are based on the identification results for the evaluation samples.

＜統合識別処理＞
前述の全体識別処理や部分識別処理では、サブ識別器５１やサブ部分識別器６１における肯定閾値を比較的高めに設定し、Precision（正解率）を高めに設定している。なぜならば、例えば全体識別器の風景識別器５１Ｌの正解率が低く設定されると、風景識別器５１Ｌが紅葉画像を風景画像であると誤識別してしまい、紅葉識別器５１Ｒによる識別を行う前に全体識別処理を終えてしまう事態が発生してしまうからである。ここではPrecision（正解率）が高めに設定されることにより、特定のシーンに属する画像が特定のシーンのサブ識別器５１（又はサブ部分識別器６１）に識別されるようになる（例えば紅葉画像が紅葉識別器５１Ｒ（又は紅葉部分識別器６１Ｒ）によって識別されるようになる）。 <Integrated identification processing>
In the above-described overall identification process and partial identification process, the positive threshold value in the sub-classifier 51 and the sub-classifier 61 is set relatively high, and the Precision (correct answer rate) is set high. This is because, for example, if the accuracy rate of the landscape classifier 51L of the overall classifier is set low, the landscape classifier 51L erroneously identifies the autumnal image as a landscape image, and before the autumnal leaves classifier 51R performs the classification. This is because a situation occurs in which the entire identification process ends. Here, by setting the Precision (accuracy rate) to be high, an image belonging to a specific scene is identified by the sub-identifier 51 (or sub-partial identifier 61) of the specific scene (for example, autumnal image) Is identified by the autumnal leaf discriminator 51R (or the autumnal leaf partial discriminator 61R).

但し、全体識別処理や部分識別処理のPrecision（正解率）を高めに設定すると、全体識別処理や部分識別処理ではシーンの識別ができなくなる可能性が高くなる。そこで、全体識別処理及び部分識別処理によってシーンの識別ができなかった場合、以下に説明する統合識別処理が行われる。 However, if the Precision (accuracy rate) of the overall identification process or the partial identification process is set to be high, there is a high possibility that the scene cannot be identified by the overall identification process or the partial identification process. Therefore, when the scene cannot be identified by the overall identification process and the partial identification process, the integrated identification process described below is performed.

図１８は、統合識別処理のフロー図である。以下に説明するように、統合識別処理は、全体識別処理の各サブ識別器５１の判別式の値に基づいて、最も確信度の高いシーンを選択する処理である。 FIG. 18 is a flowchart of the integrated identification process. As will be described below, the integrated identification process is a process of selecting a scene with the highest certainty factor based on the discriminant value of each sub-classifier 51 in the overall identification process.

まず、統合識別器７０は、５つのサブ識別器５１の判別式の値に基づいて、正となるシーンを抽出する（Ｓ４０１）。このとき、全体識別処理の際に各サブ識別器５１が算出した判別式の値が用いられる。 First, the integrated discriminator 70 extracts a positive scene based on the discriminant values of the five sub discriminators 51 (S401). At this time, the value of the discriminant calculated by each sub classifier 51 during the overall identification process is used.

次に、統合識別器７０は、判別式の値が正のシーンが存在するか否かを判断する（Ｓ４０２）。
判別式の値が正のシーンが存在する場合（Ｓ４０２でＹＥＳ）、最大値のシーンの欄に肯定フラグを立てて（Ｓ４０３）、統合識別処理を終了する。これにより、最大値のシーンに識別対象画像が属すると判断される。
一方、判別式の値が正であるシーンが存在しない場合（Ｓ４０２でＮＯ）、肯定フラグを立てずに、統合識別処理を終了する。これにより、図８の識別対象テーブルの肯定欄において、１のシーンが無いままの状態になる。つまり、識別対象画像が、どのシーンに属するか識別できなかったことになる。
Next, the integrated discriminator 70 determines whether or not a scene having a positive discriminant value exists (S402).
If there is a scene with a positive discriminant value (YES in S402), an affirmative flag is set in the maximum value scene column (S403), and the integrated identification process is terminated. Accordingly, it is determined that the identification target image belongs to the maximum value scene.
On the other hand, if there is no scene having a positive discriminant value (NO in S402), the integrated identification process is terminated without setting an affirmative flag. As a result, there is no scene in the affirmative column of the identification target table in FIG. That is, it cannot be identified to which scene the identification target image belongs.

なお、既に説明した通り、統合識別処理が終了すると、シーン識別部３３は、統合識別処理によってシーンの識別ができたか否かを判断する（図５のＳ１０８）。このとき、シーン識別部３３は、図８の識別対象テーブルを参照し、「肯定」欄に１があるか否かを判断することになる。Ｓ４０２でＮＯとの判断の場合、Ｓ１０８の判断もＮＯになる。 As already described, when the integrated identification process is completed, the scene identification unit 33 determines whether or not the scene has been identified by the integrated identification process (S108 in FIG. 5). At this time, the scene identification unit 33 refers to the identification target table in FIG. 8 and determines whether or not there is 1 in the “affirmation” column. If it is determined NO in S402, the determination in S108 is also NO.

＝＝＝第１実施形態＝＝＝
＜概要説明＞
ユーザの好みには個人差があるため、ある画像が「風景」に識別されることを好む人もいれば、「風景」に識別されないことを好む人もいる。そこで、本実施形態では、ユーザの好みを識別処理に反映させることを可能にしている。 === First Embodiment ===
<Overview>
Because there are individual differences in user preferences, some people prefer that certain images be identified as “landscape”, while others prefer not to be identified as “landscape”. Therefore, in this embodiment, it is possible to reflect the user's preference in the identification process.

図１９は、第１実施形態の設定画面の説明図である。この設定画面１６１は、プリンタ４の表示部１６に表示される画面である。設定画面１６１には、各シーンに対応して、それぞれ５個の画像が表示される。これらの画像は、いずれもサポートベクタマシン（ＳＶＭ）の学習用サンプルの画像である。ここでは、「風景」に対応して最上段に表示される５個の画像Ｌ１〜Ｌ５について説明する。 FIG. 19 is an explanatory diagram of a setting screen according to the first embodiment. The setting screen 161 is a screen displayed on the display unit 16 of the printer 4. On the setting screen 161, five images are displayed corresponding to each scene. These images are all learning sample images of the support vector machine (SVM). Here, the five images L1 to L5 displayed at the top corresponding to “landscape” will be described.

５個の画像のうちの右の画像ほど風景とは無関係な画像になるように、５個の画像Ｌ１〜Ｌ５が表示されるようになっている（後述）。そして、当初の設定では、３個の画像Ｌ１〜画像Ｌ３に対応する学習用サンプルは風景に属するとされており、２個の画像Ｌ４及び画像Ｌ５に対応する学習用サンプルは風景に属さないとされている。これに応じて、設定画面１６１の表示の当初は、風景に属する画像と属さない画像との境界を示すように、境界設定バー１６１Ａが画像Ｌ３と画像Ｌ４との間に表示される。 Five images L1 to L5 are displayed so that the right image of the five images becomes an image irrelevant to the landscape (described later). In the initial setting, the learning samples corresponding to the three images L1 to L3 belong to the landscape, and the learning samples corresponding to the two images L4 and L5 must not belong to the landscape. Has been. Accordingly, at the beginning of the display of the setting screen 161, a boundary setting bar 161A is displayed between the image L3 and the image L4 so as to indicate the boundary between the image belonging to the landscape and the image not belonging to the landscape.

この境界設定バー１６１Ａは、ユーザによってその位置を変更することが可能である。例えば、表示部１６に表示された画像Ｌ３が風景画像ではないとユーザが判断した場合、ユーザは、パネル部１７を操作して、５個の境界設定バー１６１Ａのうち風景に対応する境界設定バー１６１Ａを選択し、その境界設定バー１６１Ａを一つ左に移動して画像Ｌ２と画像Ｌ３との間にする。 The position of the boundary setting bar 161A can be changed by the user. For example, when the user determines that the image L3 displayed on the display unit 16 is not a landscape image, the user operates the panel unit 17 to select a boundary setting bar corresponding to the landscape among the five boundary setting bars 161A. 161A is selected, and the boundary setting bar 161A is moved to the left by a distance between the images L2 and L3.

そして、設定された境界設定バー１６１Ａの位置に応じて、サブ識別器５１の処理が変更される（後述）。この結果、画像Ｌ３の類似画像を風景識別器５１Ｌが識別したとき、当初の設定のままでは風景識別器５１Ｌは風景のシーンに属すると識別していたが、風景のシーンに属しないと識別できるようになる。つまり、ユーザの好みが識別処理に反映されるようになる。 Then, the processing of the sub classifier 51 is changed according to the set position of the boundary setting bar 161A (described later). As a result, when the landscape discriminator 51L identifies a similar image of the image L3, the landscape discriminator 51L has been identified as belonging to a landscape scene, but can be discriminated as not belonging to a landscape scene. It becomes like this. That is, the user's preference is reflected in the identification process.

以下の説明では、まず、プリンタ４のメモリ２３に記憶されているデータについて説明する。次に、設定画面１６１をどのように表示するのかについて説明する。その次に、設定画面１６１にて境界が設定された後、サブ識別器５１の処理がどのように変更されるのかについて説明する。 In the following description, first, data stored in the memory 23 of the printer 4 will be described. Next, how to display the setting screen 161 will be described. Next, how the processing of the sub classifier 51 is changed after the boundary is set on the setting screen 161 will be described.

＜メモリに格納されている学習用サンプルのデータ＞
まず、プリンタ４のメモリ２３に記憶されているデータについて説明する。以下に説明するように、メモリ２３には、図２０Ａに示すデータ群と、図２０Ｂの白ドットで示す学習用サンプルの画像データが記憶されている。 <Learning sample data stored in memory>
First, data stored in the memory 23 of the printer 4 will be described. As will be described below, the memory 23 stores a data group shown in FIG. 20A and learning sample image data indicated by white dots in FIG. 20B.

図２０Ａは、メモリ２３に記憶されている学習用サンプルのデータ群である。ここでは、風景識別器５１Ｌのサポートベクタマシンに用いられるデータ群が示されている。 FIG. 20A shows a data group of learning samples stored in the memory 23. Here, a data group used for the support vector machine of the landscape classifier 51L is shown.

図に示すように、学習用サンプルの画像そのものの情報（画像データ）が記憶されるのではなく、学習用サンプルの全体特徴量がメモリ２３に記憶されている。また、各学習用サンプルに対応付けて、重み係数ｗもメモリ２３に記憶されている。この重み係数ｗは、学習用サンプルの全体特徴量のデータ群を用いて算出することが可能であるが、ここでは、重み係数ｗは予め算出されて、メモリ２３に記憶されているものとする。前述の判別式ｆ（ｘ）の値は、このデータ群の全体特徴量ｙと重み係数ｗを用いて、前述の数１の式に基づいて算出される。なお、境界の決定に寄与しない学習用サンプルの重み係数はゼロとなるので、本来ならばその学習用サンプルの全体特徴量はメモリ２３に記憶する必要はないが、本実施形態では、全ての学習用サンプルの全体特徴量をメモリ２３に記憶しているものとする。 As shown in the figure, information (image data) of the learning sample image itself is not stored, but the entire feature amount of the learning sample is stored in the memory 23. Further, the weighting coefficient w is also stored in the memory 23 in association with each learning sample. The weighting factor w can be calculated using the data group of the entire feature amount of the learning sample. Here, it is assumed that the weighting factor w is calculated in advance and stored in the memory 23. . The value of the above-described discriminant f (x) is calculated based on the above-described equation 1 using the entire feature amount y and the weight coefficient w of this data group. Note that since the weighting coefficient of the learning sample that does not contribute to the determination of the boundary is zero, it is not necessary to store the entire feature amount of the learning sample in the memory 23. It is assumed that the entire feature amount of the sample for use is stored in the memory 23.

更に、本実施形態では、各学習用サンプルに対応付けて、風景のシーンに属するか否かを示す情報（属性情報）も記憶されている。風景のシーンに属するものには属性情報としてＰが設定され、属さないものには属性情報としてＮが設定される。後述する通り、この属性情報は、図１９の設定画面１６１を表示する際に用いられると共に、図１９の境界設定バー１６１Ａの設定に応じて変更される。 Furthermore, in the present embodiment, information (attribute information) indicating whether or not the scene belongs to a landscape scene is also stored in association with each learning sample. P is set as attribute information for items belonging to landscape scenes, and N is set as attribute information for items that do not belong. As will be described later, this attribute information is used when displaying the setting screen 161 of FIG. 19 and is changed according to the setting of the boundary setting bar 161A of FIG.

図２０Ｂは、各学習用サンプルの分布の説明図である。ここでは説明の簡略化のため、２つの特徴量によって、２次元空間に学習用サンプルが分布している。各ドットは、学習用サンプルの２次元空間上での位置をそれぞれ示している。 FIG. 20B is an explanatory diagram of the distribution of each learning sample. Here, for simplification of explanation, learning samples are distributed in a two-dimensional space by two feature amounts. Each dot indicates the position of the learning sample on the two-dimensional space.

各学習用サンプルは予めクラスタリングされており、図中では１３個のクラスタ（クラスタＡ〜クラスタＭ）にクラスタリングされている。ここでは、公知のｋ−ｍｅａｎｓ法により、クラスタリングがされている。ｋ−ｍｅａｎｓ法によるクラスタリングの手法は、以下の通りである。（１）まず、コンピュータは、クラスタの中心の位置が仮決めする。ここでは、１３個の中心の位置をランダムに仮決めする。（２）次に、コンピュータは、各学習用サンプルを、最も近い中心のクラスタに分類する。これにより、新しいクラスタが決定される。（３）次に、コンピュータは、各クラスタの学習用サンプルの特徴量の平均値を算出し、その平均値を新しいクラスタの中心の位置とする。（４）新しいクラスタの中心の位置が、元のクラスタの中心の位置と変化しなければクラスタリングを終了し、変化していれば（２）に戻る。 Each learning sample is clustered in advance, and is clustered into 13 clusters (cluster A to cluster M) in the figure. Here, clustering is performed by a known k-means method. A clustering technique based on the k-means method is as follows. (1) First, the computer temporarily determines the position of the center of the cluster. Here, the positions of the 13 centers are provisionally determined at random. (2) Next, the computer classifies each learning sample into the nearest central cluster. Thereby, a new cluster is determined. (3) Next, the computer calculates the average value of the feature values of the learning samples for each cluster, and sets the average value as the position of the center of the new cluster. (4) If the position of the center of the new cluster does not change from the position of the center of the original cluster, the clustering is terminated, and if it has changed, the process returns to (2).

なお、同じクラスタ内には、似た性質の学習用サンプルが属することになる。例えば、青空の画像の学習用サンプルによってクラスタＡが構成されたり、新緑の画像の学習用サンプルによってクラスタＢが構成されたりする。 Note that learning samples having similar properties belong to the same cluster. For example, a cluster A is configured by a learning sample for a blue sky image, or a cluster B is configured by a learning sample for a fresh green image.

図２０Ｂの白ドットは、各クラスタの中心の位置に最も近い学習用サンプルの位置を示している。この白ドットの学習用サンプルは、各クラスタを代表するサンプル（代表サンプル）となる。メモリ２３には、白ドットで示された代表サンプルの画像データが記憶されている。言い換えると、メモリ２３には、各クラスタを代表する画像の画像データが記憶される。後述する通り、この代表画像データは、図１９の設定画面１６１の表示に用いられる。 The white dot in FIG. 20B indicates the position of the learning sample closest to the center position of each cluster. The white dot learning sample is a sample representing each cluster (representative sample). The memory 23 stores image data of representative samples indicated by white dots. In other words, the memory 23 stores image data representing an image representing each cluster. As will be described later, this representative image data is used to display the setting screen 161 in FIG.

以上説明したように、プリンタ４のメモリ２３には、図２０Ａに示すデータ群と、図２０Ｂの白ドットで示す代表サンプルの画像データが記憶されている。なお、各学習用サンプルの属するクラスタを示すデータは、メモリ２３に格納されていても良いし、格納されていなくても良い。各学習用サンプルの属するクラスタを示すデータは、図２０Ａのデータ群を用いて求めることが可能だからである。 As described above, the memory 23 of the printer 4 stores the data group shown in FIG. 20A and the representative sample image data shown by white dots in FIG. 20B. Note that the data indicating the cluster to which each learning sample belongs may or may not be stored in the memory 23. This is because the data indicating the cluster to which each learning sample belongs can be obtained using the data group of FIG. 20A.

＜設定画面１６１を表示するまでの処理＞
次に、プリンタ側コントローラ２０が、図１９のような設定画面１６１をどのように表示するのかについて説明する。 <Processing until the setting screen 161 is displayed>
Next, how the printer-side controller 20 displays the setting screen 161 as shown in FIG. 19 will be described.

図２１Ａは、境界（ｆ（ｘ）＝０）の法線に代表サンプルを投影する様子の説明図である。ここでも説明の簡略化のため、２次元空間に代表サンプルが分布しているものとする。また、説明の簡略化のため、この２次元空間は、図１７Ａのように線形関数で分離可能な空間であるものとする。このため、風景画像のサンプルと、非風景画像のサンプルを分離する境界（ｆ（ｘ）＝０）は、直線で定義される。（なお、デフォルトの設定では、クラスタＡ〜Ｇに属する学習用サンプルは風景画像であり、クラスタＨ〜Ｍに属する学習用サンプルは非風景画像である。）
図中において、代表サンプルの２次元空間上の位置が白ドットで示されており、境界（ｆ（ｘ）＝０）が太線で示されている。なお、この境界は、設定変更前のデフォルトの境界である。 FIG. 21A is an explanatory diagram showing a state in which the representative sample is projected onto the normal line of the boundary (f (x) = 0). Also here, for simplification of explanation, it is assumed that representative samples are distributed in a two-dimensional space. For the sake of simplicity of explanation, this two-dimensional space is assumed to be a space that can be separated by a linear function as shown in FIG. 17A. Therefore, the boundary (f (x) = 0) that separates the landscape image sample and the non-landscape image sample is defined by a straight line. (In the default setting, the learning samples belonging to the clusters A to G are landscape images, and the learning samples belonging to the clusters H to M are non-landscape images.)
In the figure, the position of the representative sample in the two-dimensional space is indicated by a white dot, and the boundary (f (x) = 0) is indicated by a bold line. This boundary is a default boundary before the setting is changed.

プリンタ側コントローラ２０は、境界に対する法線を一つ定義し、この法線上に、代表サンプルを投影する。投影される位置は、代表サンプルを通り境界と平行な直線（境界が超平面であれば超平面）と、法線との交点である。このようにして、１３個の代表サンプルが、法線上に投影される。言い換えると、１３個の代表サンプルが一直線上に並ぶことになる。 The printer-side controller 20 defines one normal to the boundary and projects a representative sample on this normal. The projected position is the intersection of a normal line and a straight line that passes through the representative sample and is parallel to the boundary (or a hyperplane if the boundary is a hyperplane). In this way, 13 representative samples are projected onto the normal. In other words, 13 representative samples are arranged in a straight line.

図２１Ｂは、法線上に投影された代表サンプルの説明図である。風景画像の代表サンプルが図中の左側に位置するように、言い換えると、非風景画像の代表サンプルが図中の右側に位置するように、法線を水平にして、法線上に投影された代表サンプルの位置関係を示している。 FIG. 21B is an explanatory diagram of a representative sample projected on the normal line. A representative that is projected on the normal with the normal level horizontal so that the representative sample of the landscape image is located on the left side of the figure, in other words, the representative sample of the non-landscape image is located on the right side of the figure The positional relationship of the sample is shown.

次に、プリンタ側コントローラ２０は、法線上に５個の区間を定義する。図中には、第１区間〜第５区間が定義されている。各区間は、所定の長さになるように定義されている。また、図２１Ａの法線と境界（ｆ（ｘ）＝０）との交点の位置が、２つの区間の境界になるように、５個の区間が定義される。ここでは、図２１Ａの法線と境界（ｆ（ｘ）＝０）との交点の位置は、第３区間と第４区間の境界に相当する。なお、各区間には、複数の代表サンプルが存在することになる。 Next, the printer-side controller 20 defines five sections on the normal line. In the figure, a first section to a fifth section are defined. Each section is defined to have a predetermined length. Further, five sections are defined so that the position of the intersection of the normal line and the boundary (f (x) = 0) in FIG. 21A becomes the boundary between the two sections. Here, the position of the intersection of the normal line and the boundary (f (x) = 0) in FIG. 21A corresponds to the boundary between the third section and the fourth section. Note that there are a plurality of representative samples in each section.

次に、プリンタ側コントローラ２０は、各区間の中心に位置する代表サンプルの画像データを抽出する。ここでは、第１区間から、クラスタＣの代表サンプルの画像データが抽出される。同様に、第２区間、第３区間、第４区間、第５区間から、クラスタＥ、Ｆ、Ｈ、Ｌの代表サンプルの画像データがそれぞれ抽出される。このとき、デフォルトの設定において風景のシーンに属しているとされる代表サンプルが、第１区間〜第３区間から抽出される。また、デフォルトの設定において風景のシーンに属しないとされる代表サンプルが、第４区間及び第５区間から抽出される。抽出された画像データは、各区間を代表するものと考えられる。 Next, the printer-side controller 20 extracts image data of a representative sample located at the center of each section. Here, the image data of the representative sample of cluster C is extracted from the first section. Similarly, image data of representative samples of clusters E, F, H, and L are extracted from the second section, the third section, the fourth section, and the fifth section, respectively. At this time, representative samples that belong to a landscape scene in the default settings are extracted from the first to third sections. In addition, representative samples that do not belong to the landscape scene in the default setting are extracted from the fourth section and the fifth section. The extracted image data is considered to represent each section.

プリンタ側コントローラ２０は、抽出した画像データを用いて、設定画面１６１をプリンタ４の表示部１６に表示する。第１区間から抽出されたクラスタＣの代表サンプルの画像データは、図１９の画像Ｌ１の表示に用いられる。同様に、クラスタＥ、Ｆ、Ｈ、Ｌの代表サンプルの画像データは、それぞれ図１９の画像Ｌ２、Ｌ３、Ｌ４、Ｌ５の表示に用いられる。 The printer-side controller 20 displays a setting screen 161 on the display unit 16 of the printer 4 using the extracted image data. The image data of the representative sample of the cluster C extracted from the first section is used for displaying the image L1 in FIG. Similarly, the image data of the representative samples of the clusters E, F, H, and L are used for displaying the images L2, L3, L4, and L5 in FIG. 19, respectively.

また、図２１Ａの法線と境界（ｆ（ｘ）＝０）との交点の位置が第３区間と第４区間の境界に相当しているので、プリンタ側コントローラ２０は、図１９の画像Ｌ３（第３区間から抽出された代表サンプルの画像）と画像Ｌ４（第４区間から抽出された代表サンプルの画像）との間に境界設定バー１６１Ａを表示する。ところで、画像Ｌ１〜画像Ｌ３は風景画像であり、画像Ｌ４及び画像Ｌ５は非風景画像であるので、境界設定バー１６１Ａは、風景画像と非風景画像との間に表示されることになる。 Further, since the position of the intersection of the normal line in FIG. 21A and the boundary (f (x) = 0) corresponds to the boundary between the third section and the fourth section, the printer-side controller 20 causes the image L3 in FIG. A boundary setting bar 161A is displayed between (the representative sample image extracted from the third section) and the image L4 (the representative sample image extracted from the fourth section). By the way, since the images L1 to L3 are landscape images and the images L4 and L5 are non-landscape images, the boundary setting bar 161A is displayed between the landscape image and the non-landscape image.

本実施形態では、上記のように、境界の法線に代表サンプルの位置が投影され、法線上に投影された代表サンプルの位置に基づいて、抽出すべき代表サンプルが決定される。これにより、本実施形態では、判別式の値が大きいものほど左側になるように、５個の代表サンプルの画像が表示される。言い換えると、風景のシーンに属する確信度の高い順に左から並ぶように、５個の代表サンプルの画像を表示できる。 In the present embodiment, as described above, the position of the representative sample is projected onto the normal line of the boundary, and the representative sample to be extracted is determined based on the position of the representative sample projected onto the normal line. Thereby, in this embodiment, the image of five representative samples is displayed so that the larger the discriminant value is, the left side is. In other words, images of five representative samples can be displayed so that they are arranged from the left in descending order of certainty belonging to a landscape scene.

そして、上記のように図１９の設定画面１６１が表示されるので、境界設定バー１６１Ａの左側には、デフォルトの設定では風景画像の代表サンプルが表示される。また、境界設定バー１６１Ａの右側には、デフォルトの設定では非風景画像の代表サンプルが表示される。そして、５個の画像のうちの右の画像ほど風景とは無関係な画像になるように、５個の画像Ｌ１〜Ｌ５が表示される。また、境界設定バー１６１Ａの近くに表示される画像は、ユーザの好みによって風景画像か否かの判断が分かれやすい画像になる。 Then, since the setting screen 161 in FIG. 19 is displayed as described above, a representative sample of a landscape image is displayed on the left side of the boundary setting bar 161A by default. On the right side of the boundary setting bar 161A, a representative sample of a non-landscape image is displayed by default. Then, the five images L1 to L5 are displayed so that the right image of the five images becomes an image irrelevant to the landscape. In addition, an image displayed near the boundary setting bar 161A is an image in which it is easy to determine whether or not the image is a landscape image depending on the user's preference.

以上の説明では風景のシーンについて説明したが、他のシーンについても、プリンタ側コントローラ２０は、同様の処理を行う。これにより、プリンタ側コントローラ２０は、図１９の設定画面１６１の風景以外の部分も表示できる。 In the above description, a landscape scene has been described, but the printer controller 20 performs the same processing for other scenes. As a result, the printer-side controller 20 can also display portions other than the scenery on the setting screen 161 in FIG.

＜第１参考例＞
次に、図１９に示すように、ユーザが境界設定バー１６１Ａを一つ左に移動して画像Ｌ２と画像Ｌ３との間に設定した後の処理について説明する。 <First Reference Example>
Next, as shown in FIG. 19, a process after the user moves the boundary setting bar 161A to the left and sets it between the images L2 and L3 will be described.

境界設定バー１６１Ａの移動後、境界設定バー１６１Ａの右側において、境界設定バー１６１Ａと、デフォルトの設定では非風景画像である画像Ｌ４との間に、デフォルトの設定では風景画像である画像Ｌ３（クラスタＦの代表サンプルの画像であり、第３区間を代表する画像である）が位置する状態になる。ユーザが境界設定バー１６１Ａを画像Ｌ３と画像Ｌ４との間から画像Ｌ２と画像Ｌ３との間に移動したということは、図２１Ｂに示す第３区間に属するクラスタＦ、Ｇに属する学習用サンプルが風景画像ではなく非風景画像であるとユーザが考えていると想定できる。 After the boundary setting bar 161A is moved, on the right side of the boundary setting bar 161A, between the boundary setting bar 161A and the image L4 that is a non-landscape image by default, the image L3 (cluster) that is a landscape image by default is set. F is a representative sample image, which is an image representative of the third section). The fact that the user has moved the boundary setting bar 161A between the images L3 and L4 between the images L2 and L3 means that the learning samples belonging to the clusters F and G belonging to the third section shown in FIG. It can be assumed that the user thinks that the image is not a landscape image but a non-landscape image.

図２２Ａは、変更後のデータ群の説明図である。図２２Ｂは、変更後の境界の説明図である。以下、これらの図を用いて、設定変更後のプリンタ側コントローラ２０の処理について説明する。 FIG. 22A is an explanatory diagram of the data group after the change. FIG. 22B is an explanatory diagram of the boundary after the change. Hereinafter, the processing of the printer-side controller 20 after the setting change will be described with reference to these drawings.

まず、プリンタ側コントローラ２０は、クラスタＦ、Ｇに属する学習用サンプルの属性情報をＰからＮに変更する。例えば、仮に図２０Ａのサンプル番号３がクラスタＦ又はクラスタＧに属していれば、図２２Ａに示すように属性情報をＰからＮに変更する。 First, the printer-side controller 20 changes the learning sample attribute information belonging to the clusters F and G from P to N. For example, if sample number 3 in FIG. 20A belongs to cluster F or cluster G, the attribute information is changed from P to N as shown in FIG. 22A.

本参考例では、クラスタＦの代表サンプルの属性情報を変更するだけでなく、クラスタＦに属する学習用サンプルの全ての属性情報を変更している。これにより、ユーザによる一度の操作によって、風景に属さないことをユーザが望んだ画像と似た性質の学習用サンプルの属性情報を、一括して変更することができる。 In this reference example, not only the attribute information of the representative sample of the cluster F is changed, but also all the attribute information of the learning samples belonging to the cluster F are changed. Thereby, the attribute information of the learning sample having properties similar to the image that the user desires not to belong to the landscape can be collectively changed by a single operation by the user.

また、本参考例では、クラスタＦの代表サンプルの属性情報を変更するだけでなく、第３区間に属する学習用サンプルの全ての属性情報を変更している。これにより、ユーザによる一度の操作によって、風景に属さないことをユーザが望んだ画像と同程度に境界から離れた学習用サンプルの属性情報を、一括して変更することができる。 In this reference example, not only the attribute information of the representative sample of the cluster F is changed, but also all the attribute information of the learning samples belonging to the third section are changed. Thereby, it is possible to collectively change the attribute information of the learning sample that is separated from the boundary as much as the image that the user desires not to belong to the landscape by a single operation by the user.

次に、プリンタ側コントローラ２０は、全体特徴量と変更後の属性情報とに基づいてサポートベクタマシンの再学習を行い、図２２Ｂに示すように境界を変更する。言い換えると、プリンタ側コントローラ２０は、全体特徴量と変更後の属性情報とに基づいて再学習を行い、図２０Ａの重み係数ｗを図２２Ｂに示すように変更する。ここでは、変更後の境界をｆ´（ｘ）＝０と表記し、変更後の重み係数をｗ´と表記している。なお、再学習の演算処理は通常のサポートベクタマシンの学習の演算処理と同じなので、再学習の説明は省略する。 Next, the printer-side controller 20 performs relearning of the support vector machine based on the entire feature amount and the changed attribute information, and changes the boundary as shown in FIG. 22B. In other words, the printer-side controller 20 performs relearning based on the overall feature amount and the changed attribute information, and changes the weighting coefficient w in FIG. 20A as shown in FIG. 22B. Here, the changed boundary is expressed as f ′ (x) = 0, and the changed weighting coefficient is expressed as w ′. Note that the relearning calculation process is the same as the normal support vector machine learning calculation process, and a description of the relearning will be omitted.

重み係数ｗ（又はｗ´）は、境界の決定に寄与しなければゼロになる。このため、図２０Ａではゼロであった重み係数ｗが、変更によってゼロ以外の値を持つこともある。逆に、図２０Ａではゼロ以外の値を持っていた重み係数ｗが、変更によってゼロになることもある。デフォルトの設定では境界の決定に寄与しない学習用サンプルのデータまでをも図２０Ａのデータ群で記憶しているのは、このためである。 The weighting factor w (or w ′) becomes zero if it does not contribute to the determination of the boundary. For this reason, the weighting factor w, which was zero in FIG. 20A, may have a value other than zero due to a change. Conversely, the weighting factor w, which had a value other than zero in FIG. 20A, may become zero due to a change. This is why even the learning sample data that does not contribute to the determination of the boundary in the default setting is stored in the data group of FIG. 20A.

識別対象画像が風景のシーンに属するか否かを風景識別器５１Ｌが判断するとき、風景識別器５１Ｌは、図２２Ｂのデータ群の学習用サンプルの全体特徴量と変更後の重み係数ｗ´とに基づいて、前述の数１の判別式の値（変更後の判別式ｆ´（ｘ）の値）を算出する。なお、重み係数ｗ´がゼロの学習用サンプルは除外して、風景識別器５１Ｌは、前述の数１の判別式の値を算出する。これにより、全ての学習用サンプルを用いて判別式の値を求める場合よりも、演算速度が速くなる。 When the landscape discriminator 51L determines whether or not the classification target image belongs to a landscape scene, the landscape discriminator 51L includes the entire feature amount of the learning sample of the data group in FIG. 22B and the changed weight coefficient w ′. Based on the above, the value of the discriminant of the above formula 1 (the value of the discriminant f ′ (x) after the change) is calculated. Note that the learning classifier 51L calculates the value of the discriminant of Equation 1 described above, excluding learning samples whose weighting coefficient w ′ is zero. As a result, the calculation speed is faster than the case of obtaining the discriminant value using all the learning samples.

変更後の判別式ｆ´（ｘ）を用いることにより、ユーザの好みを反映した識別処理を行うことができる。例えば、仮に画像Ｌ３（図１９参照）が建物の画像だとすると、識別対象画像が建物の画像である場合に、その識別対象画像が風景のシーンに属すると判断され難くなる。言い換えると、仮にクラスタＦ（図２２Ｂ参照）が建物の画像の学習用サンプルから構成されていたとすると、識別対象画像が建物の画像である場合に、その識別対象画像が風景のシーンに属すると判断され難くなる。 By using the discriminant f ′ (x) after the change, it is possible to perform an identification process reflecting the user's preference. For example, if the image L3 (see FIG. 19) is a building image, it is difficult to determine that the identification target image belongs to a landscape scene when the identification target image is a building image. In other words, if the cluster F (see FIG. 22B) is composed of building image learning samples, when the identification target image is a building image, it is determined that the identification target image belongs to a landscape scene. It becomes difficult to be done.

本参考例では、ユーザの好みに合わせた設定変更を、容易に行うことができる。もし仮に、１個ずつ学習用サンプルの画像を表示し、表示された学習用サンプルの画像が風景か否かをユーザが１個ずつ決定することにすると、ユーザは何度も決定作業を行う必要があるので不便である。 In this reference example, it is possible to easily change settings according to user preferences. If the learning sample images are displayed one by one and the user decides whether the displayed learning sample images are landscapes one by one, the user needs to repeat the determination work many times. It is inconvenient because there is.

なお、上記の説明では、ユーザが境界設定バー１６１Ａを一つ左に移動した場合について説明した。これに対し、仮にユーザが境界設定バー１６１Ａを一つ右に移動した場合には、境界設定バー１６１Ａの左側において、境界設定バー１６１Ａと、デフォルトの設定では風景画像である画像Ｌ３との間に、デフォルトの設定では非風景画像である画像Ｌ４（クラスタＨの代表サンプルの画像であり、第４区間を代表する画像である）が位置する状態になる。このような場合には、プリンタ側コントローラ２０は、第４区間に属するクラスタＩ、Ｈ、Ｊに属する学習用サンプルの属性情報をＮからＰに変更し、全体特徴量と変更後の属性情報とに基づいてサポートベクタマシンの再学習を行い、境界を変更する。この場合にも、ユーザの好みを反映した識別処理を行うことができる。 In the above description, the case where the user moves the boundary setting bar 161A to the left has been described. On the other hand, if the user moves the boundary setting bar 161A one place to the right, on the left side of the boundary setting bar 161A, between the boundary setting bar 161A and the image L3 that is a landscape image in the default setting. In the default setting, a non-landscape image L4 (a representative sample image of the cluster H and an image representative of the fourth section) is located. In such a case, the printer-side controller 20 changes the attribute information of the learning sample belonging to the clusters I, H, and J belonging to the fourth section from N to P, and the overall feature amount and the changed attribute information Re-learn the support vector machine based on, and change the boundary. Also in this case, identification processing reflecting user preferences can be performed.

＜本実施形態における設定画面１６１にて境界が設定された後の処理＞
前述の第１参考例では、ユーザが図１９の境界設定バー１６１Ａの位置を変更したときに、サポートベクタマシンの再学習を行っていた。このような形態では、プリンタ側コントローラ２０が、学習処理を行うためのプログラムを実行する必要があり、また、再学習の処理時間も必要となる。そこで、本実施形態では、予め複数の判別式が用意されており、境界設定バー１６１Ａの位置に応じて判別式が選択されることによって、プリンタ側コントローラ２０が再学習を行わなくても済むようにしている。なお、判別式は、識別対象画像を評価するための評価関数に相当する。 <Processing after border is set on setting screen 161 in this embodiment>
In the first reference example described above, the support vector machine is relearned when the user changes the position of the boundary setting bar 161A in FIG. In such a form, the printer-side controller 20 needs to execute a program for performing learning processing, and also requires relearning processing time. Therefore, in the present embodiment, a plurality of discriminants are prepared in advance, and the discriminant is selected according to the position of the boundary setting bar 161A, so that the printer-side controller 20 does not have to re-learn. Yes. The discriminant corresponds to an evaluation function for evaluating the identification target image.

図２３は、本実施形態の学習用サンプルのデータ群である。第１参考例のデータ群（図２０Ａ参照）と比較すると、本実施形態では各学習用サンプル毎に複数の重み係数ｗが記憶されている。ここでは、第１重み係数〜第４重み係数の４種類が記憶されている。言い換えると、４種類の判別式がメモリ２３に記憶されている。 FIG. 23 shows a data group of learning samples of the present embodiment. Compared to the data group of the first reference example (see FIG. 20A), in the present embodiment, a plurality of weighting factors w are stored for each learning sample. Here, four types of first weight coefficient to fourth weight coefficient are stored. In other words, four types of discriminants are stored in the memory 23.

４種類の重み係数は、それぞれ境界設定バー１６１Ａの位置に対応付けられている。第１重み係数は、図１９の画像Ｌ１と画像Ｌ２との間に対応付けられている。同様に、第２重み係数、第３重み係数、第４重み係数は、それぞれ、図１９の画像Ｌ２と画像Ｌ３との間、画像Ｌ３と画像Ｌ４との間、画像Ｌ４と画像Ｌ５との間に対応付けられている。 Each of the four types of weighting factors is associated with the position of the boundary setting bar 161A. The first weighting coefficient is associated between the image L1 and the image L2 in FIG. Similarly, the second weight coefficient, the third weight coefficient, and the fourth weight coefficient are respectively between the image L2 and the image L3 in FIG. 19, between the image L3 and the image L4, and between the image L4 and the image L5. Is associated with.

ユーザが設定を変更する前では、プリンタ側コントローラ２０（風景識別器５１Ｌ）は、第３重み係数と全体特徴量とに基づいて、前述の数１の判別式の値を算出する。言い換えると、デフォルトの設定では第３重み係数を用いた判別式が選択され、この判別式によって風景識別器５１Ｌは識別処理を行う。なお、演算速度向上のため、重み係数ｗがゼロの学習用サンプルは除外して、判別式の値が算出される。第３重み係数は、第１参考例のデフォルトの重み係数ｗ（図２０Ａ参照）と同じ値である。 Before the user changes the setting, the printer-side controller 20 (landscape discriminator 51L) calculates the value of the discriminant of the above formula 1 based on the third weighting coefficient and the overall feature amount. In other words, in the default setting, a discriminant using the third weighting coefficient is selected, and the landscape discriminator 51L performs the discrimination process based on this discriminant. Note that the discriminant value is calculated by excluding learning samples with a weighting factor w of zero in order to improve the calculation speed. The third weighting factor is the same value as the default weighting factor w (see FIG. 20A) of the first reference example.

ユーザが境界設定バー１６１Ａを一つ左に移動して画像Ｌ２と画像Ｌ３との間に設定した場合、風景識別器５１Ｌは、第２重み係数と全体特徴量とに基づいて、前述の数１の判別式の値を算出する。言い換えると、ユーザが境界設定バー１６１Ａを一つ左に移動して画像Ｌ２と画像Ｌ３との間に設定した場合、第２重み係数を用いた判別式が選択され、この判別式によって風景識別器５１Ｌが識別処理を行う。なお、演算速度向上のため、重み係数ｗがゼロの学習用サンプルは除外して、判別式の値が算出される。第２重み係数は、第１参考例において再学習によって求められた重み係数ｗ´（図２２Ｂ参照）と同じである。 When the user moves the boundary setting bar 161A one place to the left and sets it between the image L2 and the image L3, the landscape discriminator 51L is based on the second weighting factor and the overall feature amount, and the above-described equation 1 The value of the discriminant is calculated. In other words, when the user moves the boundary setting bar 161A one place to the left and sets it between the images L2 and L3, a discriminant using the second weighting coefficient is selected, and the discriminant uses this discriminant to determine the landscape classifier. 51L performs the identification process. Note that the discriminant value is calculated by excluding learning samples with a weighting factor w of zero in order to improve the calculation speed. The second weighting coefficient is the same as the weighting coefficient w ′ (see FIG. 22B) obtained by relearning in the first reference example.

以上の説明では風景の学習用サンプルのデータ群について説明したが、他のシーンについても、同様のデータ群をメモリに記憶している。 In the above description, the landscape learning sample data group has been described, but the same data group is stored in the memory for other scenes.

本実施形態においても、第１参考例と同様に、ユーザの好みを反映した識別処理を行うことができる。更に、本実施形態では再学習を行わなくても良いので、サポートベクタマシンによる学習処理を行うプログラムを実行しなくても良い。また、再学習を行わなくても良いので、設定後のプリンタ側コントローラ２０の処理の負荷も軽減される。 Also in the present embodiment, as in the first reference example, identification processing reflecting user preferences can be performed. Furthermore, in this embodiment, it is not necessary to perform relearning, and therefore it is not necessary to execute a program for performing learning processing by a support vector machine. In addition, since it is not necessary to perform re-learning, the processing load of the printer-side controller 20 after setting is also reduced.

＝＝＝第２実施形態＝＝＝
＜概要説明＞
ユーザの好みには個人差があるため、ある画像が「風景」に識別されることを好む人もいれば、「夕景」に識別されることを好む人もいる。そこで、第２実施形態では、ユーザの好みを識別処理に反映させることを可能にしている。 === Second Embodiment ===
<Overview>
Since there are individual differences in user preferences, some people prefer to identify an image as “landscape”, while others prefer to be identified as “evening scene”. Therefore, in the second embodiment, it is possible to reflect user preferences in the identification process.

図２４は、第２実施形態の設定画面の説明図である。この設定画面１６３は、プリンタ４の表示部１６に表示される画面である。設定画面１６３には、各シーンに対応して、それぞれ５個の画像が表示される。これらの画像は、いずれもサポートベクタマシン（ＳＶＭ）の学習用サンプルの画像である。ここでは、「風景」及び「夕景」に対応して最上段に表示される５個の画像ＬＳ１〜ＬＳ５について説明する。 FIG. 24 is an explanatory diagram of a setting screen according to the second embodiment. The setting screen 163 is a screen displayed on the display unit 16 of the printer 4. The setting screen 163 displays five images corresponding to each scene. These images are all learning sample images of the support vector machine (SVM). Here, five images LS 1 to LS 5 displayed at the top corresponding to “landscape” and “evening scene” will be described.

５個の画像のうちの左側の画像ほど風景の特徴が濃く現れた画像になっており、右側の画像ほど夕景の特徴が濃く現れた画像になっている。言い換えると、５個の画像のうちの左から右へ順に、風景画像から夕景画像に移り変わるように、５個の画像ＬＳ１〜ＬＳ５が表示されるようになっている（後述）。そして、当初の設定では、３個の画像ＬＳ１〜画像ＬＳ３に対応する学習用サンプルは風景に属するとされており、２個の画像ＬＳ４及び画像ＬＳ５に対応する学習用サンプルは夕景に属するとされている。これに応じて、設定画面１６３の表示の当初は、風景に属する画像と夕景に属する画像との境界を示すように、境界設定バー１６３Ａが画像ＬＳ３と画像ＬＳ４との間に表示される。 Of the five images, the image on the left side is an image in which the landscape feature appears darker, and the image on the right side is an image in which the feature of the sunset scene appears more intensely. In other words, five images LS1 to LS5 are displayed so as to change from a landscape image to an evening scene image in order from left to right among the five images (described later). In the initial setting, the learning samples corresponding to the three images LS1 to LS3 belong to the landscape, and the learning samples corresponding to the two images LS4 and LS5 belong to the evening scene. ing. Accordingly, at the beginning of display of setting screen 163, boundary setting bar 163A is displayed between images LS3 and LS4 so as to indicate the boundary between the image belonging to the landscape and the image belonging to the sunset scene.

この境界設定バー１６３Ａは、ユーザによってその位置を変更することが可能である。例えば、表示部１６に表示された画像ＬＳ３が風景画像ではなく夕景画像であるとユーザが判断した場合、ユーザは、パネル部１７を操作して、５個の境界設定バー１６３Ａのうち一番上の境界設定バー１６３Ａを選択し、その境界設定バー１６３Ａを一つ左に移動して画像ＬＳ２と画像ＬＳ３との間にする。 The position of the boundary setting bar 163A can be changed by the user. For example, when the user determines that the image LS3 displayed on the display unit 16 is not a landscape image but an evening scene image, the user operates the panel unit 17 to select the top of the five boundary setting bars 163A. The border setting bar 163A is selected, and the border setting bar 163A is moved to the left to make it between the images LS2 and LS3.

そして、設定された境界設定バー１６３Ａの位置に応じて、サブ識別器５１の処理が変更される（後述）。この結果、画像ＬＳ３の類似画像を風景識別器５１Ｌが識別したとき、当初の設定のままでは風景識別器５１Ｌは風景のシーンに属すると識別していたが、風景のシーンに属しないと識別できるようになる。また、画像ＬＳ３の類似画像を夕景識別器５１Ｓが識別したとき、当初の設定のままでは夕景識別器５１Ｓは夕景のシーンに属しないと識別していたが、夕景のシーンに属すると識別できるようになる。つまり、ユーザの好みが識別処理に反映されるようになる。 Then, the processing of the sub classifier 51 is changed according to the set position of the boundary setting bar 163A (described later). As a result, when the landscape discriminator 51L identifies a similar image of the image LS3, the landscape discriminator 51L has been identified as belonging to a landscape scene, but can be discriminated as not belonging to a landscape scene. It becomes like this. Further, when the evening scene classifier 51S identifies a similar image of the image LS3, the evening scene classifier 51S identified that it did not belong to the sunset scene under the initial setting, but can be identified as belonging to the evening scene. become. That is, the user's preference is reflected in the identification process.

以下の説明では、まず、プリンタ４のメモリ２３に記憶されているデータについて説明する。次に、設定画面１６３をどのように表示するのかについて説明する。その次に、設定画面１６３にて境界が設定された後、サブ識別器５１の処理がどのように変更されるのかについて説明する。 In the following description, first, data stored in the memory 23 of the printer 4 will be described. Next, how to display the setting screen 163 will be described. Next, how the processing of the sub classifier 51 is changed after the boundary is set on the setting screen 163 will be described.

＜メモリに格納されている学習用サンプルのデータ＞
まず、プリンタ４のメモリ２３に記憶されているデータについて説明する。以下に説明するように、メモリ２３には、図２５Ａに示すデータ群と、図２５Ｂの白ドットで示す学習用サンプルの画像データが記憶されている。 <Learning sample data stored in memory>
First, data stored in the memory 23 of the printer 4 will be described. As will be described below, the memory 23 stores a data group shown in FIG. 25A and image data of learning samples indicated by white dots in FIG. 25B.

図２５Ａは、メモリ２３に記憶されている学習用サンプルのデータ群である。
図に示すように、学習用サンプルの画像そのものの情報（画像データ）が記憶されるのではなく、学習用サンプルの全体特徴量がメモリ２３に記憶されている。また、各学習用サンプルに対応付けて、シーン毎の重み係数ｗもメモリ２３に記憶されている。この重み係数ｗは、学習用サンプルの全体特徴量のデータ群を用いて算出することが可能であるが、ここでは、重み係数ｗは予め算出されて、メモリ２３に記憶されているものとする。前述の判別式ｆ（ｘ）の値は、このデータ群の全体特徴量ｙと重み係数ｗ（例えば風景識別器５１Ｌの判別式ｆ（ｘ）の場合には添え字Ｌの重み係数）を用いて、前述の数１の式に基づいて算出される。なお、境界の決定に寄与しない学習用サンプルの重み係数はゼロとなるので、本来ならばその学習用サンプルの全体特徴量はメモリ２３に記憶する必要はないが、本実施形態では、全ての学習用サンプルの全体特徴量をメモリ２３に記憶しているものとする。 FIG. 25A shows a data group of learning samples stored in the memory 23.
As shown in the figure, information (image data) of the learning sample image itself is not stored, but the entire feature amount of the learning sample is stored in the memory 23. Further, the weight coefficient w for each scene is also stored in the memory 23 in association with each learning sample. The weighting factor w can be calculated using the data group of the entire feature amount of the learning sample. Here, it is assumed that the weighting factor w is calculated in advance and stored in the memory 23. . As the value of the above-described discriminant f (x), the overall feature amount y and the weighting factor w (for example, the weighting factor of the subscript L in the case of the discriminant f (x) of the landscape discriminator 51L) are used. Thus, it is calculated based on the above equation (1). Note that since the weighting coefficient of the learning sample that does not contribute to the determination of the boundary is zero, it is not necessary to store the entire feature amount of the learning sample in the memory 23. It is assumed that the entire feature amount of the sample for use is stored in the memory 23.

更に、本実施形態では、各学習用サンプルに対応付けて、どのシーンに属するかを示す情報（属性情報）も記憶されている。後述する通り、この属性情報は、図２４の設定画面１６３を表示する際に用いられると共に、図２４の境界設定バー１６３Ａの設定に応じて変更される。 Furthermore, in the present embodiment, information (attribute information) indicating which scene belongs is also stored in association with each learning sample. As will be described later, this attribute information is used when displaying the setting screen 163 of FIG. 24 and is changed according to the setting of the boundary setting bar 163A of FIG.

図２５Ｂは、各学習用サンプルの分布の説明図である。ここでは説明の簡略化のため、２つの特徴量によって、２次元空間に学習用サンプルが分布している。各ドットは、学習用サンプルの２次元空間上での位置をそれぞれ示している。 FIG. 25B is an explanatory diagram of the distribution of each learning sample. Here, for simplification of explanation, learning samples are distributed in a two-dimensional space by two feature amounts. Each dot indicates the position of the learning sample on the two-dimensional space.

なお、同じクラスタ内には、似た性質の学習用サンプルが属することになる。例えば、青空の画像の学習用サンプルによってクラスタＡが構成されたり、新緑の画像の学習用サンプルによってクラスタＢが構成されたりする。なお、デフォルトの設定では、クラスタＡ〜Ｆに属する学習用サンプルは風景画像であり、クラスタＧ〜Ｋに属する学習用サンプルは夕景画像であり、クラスタＬ〜Ｍに属する学習用サンプルは夜景画像であるものとする（花画像及び紅葉画像の学習用サンプルについては不図示とする）。 Note that learning samples having similar properties belong to the same cluster. For example, a cluster A is configured by a learning sample for a blue sky image, or a cluster B is configured by a learning sample for a fresh green image. In the default setting, the learning samples belonging to the clusters A to F are landscape images, the learning samples belonging to the clusters G to K are evening scene images, and the learning samples belonging to the clusters L to M are night scene images. Suppose that there is a sample for learning a flower image and an autumnal image (not shown).

図２５Ｂの白ドットは、各クラスタの中心の位置に最も近い学習用サンプルの位置を示している。この白ドットの学習用サンプルは、各クラスタを代表するサンプル（代表サンプル）となる。メモリ２３には、白ドットで示された代表サンプルの画像データが記憶されている。言い換えると、メモリ２３には、各クラスタを代表する画像の画像データが記憶される。後述する通り、この代表画像データは、図２４の設定画面１６３の表示に用いられる。 The white dot in FIG. 25B indicates the position of the learning sample closest to the center position of each cluster. The white dot learning sample is a sample representing each cluster (representative sample). The memory 23 stores image data of representative samples indicated by white dots. In other words, the memory 23 stores image data representing an image representing each cluster. As will be described later, this representative image data is used to display the setting screen 163 in FIG.

以上説明したように、プリンタ４のメモリ２３には、図２５Ａに示すデータ群と、図２５Ｂの白ドットで示す代表サンプルの画像データが記憶されている。なお、各学習用サンプルの属するクラスタを示すデータは、メモリ２３に格納されていても良いし、格納されていなくても良い。各学習用サンプルの属するクラスタを示すデータは、図２５Ａのデータ群を用いて求めることが可能だからである。 As described above, the memory 23 of the printer 4 stores the data group shown in FIG. 25A and the image data of the representative sample shown by white dots in FIG. 25B. Note that the data indicating the cluster to which each learning sample belongs may or may not be stored in the memory 23. This is because the data indicating the cluster to which each learning sample belongs can be obtained using the data group of FIG. 25A.

＜設定画面１６３を表示するまでの処理＞
次に、プリンタ側コントローラ２０が、図２４のような設定画面１６３をどのように表示するのかについて説明する。ここでは主に、設定画面１６３の一番上の５個の画像ＬＳ１〜ＬＳ５がどのように表示されるかについて説明する。 <Processing until the setting screen 163 is displayed>
Next, how the printer-side controller 20 displays the setting screen 163 as shown in FIG. 24 will be described. Here, how the top five images LS1 to LS5 on the setting screen 163 are displayed will be mainly described.

図２６Ａは、風景画像と夕景画像とを分離する境界Ｆ_ls（ｘ）＝０の説明図である。図中には、風景と夕景の学習用サンプルだけが示されており、他のシーン（例えば夜景）の学習用サンプルは示されていない。また、説明の簡略化のため、この２次元空間は、図１７Ａのように線形関数で分離可能な空間であるものとする。このため、風景画像のサンプルと、夕景画像のサンプルを分離する境界Ｆ_ls（ｘ）＝０は、直線で定義される。プリンタ側コントローラ２０は、この境界Ｆ_ls（ｘ）＝０を、風景と夕景の学習用サンプルを用いた学習によって求める。この学習の演算処理は、通常のサポートベクタマシンの学習の演算処理と同じなので、説明は省略する。 FIG. 26A is an explanatory diagram of a boundary F_ls (x) = 0 that separates a landscape image and an evening scene image. In the figure, only the learning samples for the landscape and the sunset are shown, and the learning samples for other scenes (for example, the night view) are not shown. For the sake of simplicity of explanation, this two-dimensional space is assumed to be a space that can be separated by a linear function as shown in FIG. 17A. Therefore, the boundary F_ls (x) = 0 that separates the landscape image sample and the sunset image sample is defined by a straight line. The printer-side controller 20 obtains the boundary F_ls (x) = 0 by learning using a learning sample for landscape and sunset scenes. Since the learning calculation process is the same as the learning process of a normal support vector machine, a description thereof will be omitted.

図２６Ｂは、境界（Ｆ_ls（ｘ）＝０）の法線に代表サンプルを投影する様子の説明図である。図中において、代表サンプルの２次元空間上の位置が白ドットで示されている。プリンタ側コントローラ２０は、境界に対する法線を一つ定義し、この法線上に、代表サンプルを投影する。投影される位置は、代表サンプルを通り境界と平行な直線（境界が超平面であれば超平面）と、法線との交点である。このようにして、１１個の代表サンプルが、法線上に投影される。言い換えると、１１個の代表サンプルが一直線上に並ぶことになる。なお、法線に投影される代表サンプルは、風景画像及び夕景画像の代表サンプルであり、それ以外のシーン（例えば夜景）の代表サンプルは含まれない。 FIG. 26B is an explanatory diagram showing a state in which the representative sample is projected onto the normal line of the boundary (F_ls (x) = 0). In the figure, the positions of the representative samples in the two-dimensional space are indicated by white dots. The printer-side controller 20 defines one normal to the boundary and projects a representative sample on this normal. The projected position is the intersection of a normal line and a straight line that passes through the representative sample and is parallel to the boundary (or a hyperplane if the boundary is a hyperplane). In this way, 11 representative samples are projected on the normal. In other words, 11 representative samples are arranged in a straight line. Note that the representative sample projected onto the normal is a representative sample of a landscape image and an evening scene image, and does not include representative samples of other scenes (for example, a night view).

図２６Ｃは、法線上に投影された代表サンプルの説明図である。風景画像の代表サンプルが図中の左側に位置するように、言い換えると、非風景画像の代表サンプルが図中の右側に位置するように、法線を水平にして、法線上に投影された代表サンプルの位置関係を示している。 FIG. 26C is an explanatory diagram of a representative sample projected on the normal line. A representative that is projected on the normal with the normal level horizontal so that the representative sample of the landscape image is located on the left side of the figure, in other words, the representative sample of the non-landscape image is located on the right side of the figure The positional relationship of the sample is shown.

次に、プリンタ側コントローラ２０は、法線上に５個の区間を定義する。図中には、第１区間〜第５区間が定義されている。各区間は、所定の長さになるように定義されている。また、図２６Ｂの法線と境界（Ｆ_ls（ｘ）＝０）との交点の位置が、２つの区間の境界になるように、５個の区間が定義される。ここでは、図２６Ｂの法線と境界（Ｆ_ls（ｘ）＝０）との交点の位置は、第３区間と第４区間の境界に相当する。なお、各区間には、複数の代表サンプルが存在することになる。 Next, the printer-side controller 20 defines five sections on the normal line. In the figure, a first section to a fifth section are defined. Each section is defined to have a predetermined length. In addition, five sections are defined so that the position of the intersection between the normal line and the boundary (F_ls (x) = 0) in FIG. 26B is the boundary between the two sections. Here, the position of the intersection between the normal line and the boundary (F_ls (x) = 0) in FIG. 26B corresponds to the boundary between the third section and the fourth section. Note that there are a plurality of representative samples in each section.

次に、プリンタ側コントローラ２０は、各区間の中心に位置する代表サンプルの画像データを抽出する。ここでは、第１区間から、クラスタＢの代表サンプルの画像データが抽出される。同様に、第２区間、第３区間、第４区間、第５区間から、クラスタＤ、Ｅ、Ｊ、Ｉの代表サンプルの画像データがそれぞれ抽出される。このとき、デフォルトの設定において風景のシーンに属しているとされる代表サンプルが、第１区間〜第３区間から抽出される。また、デフォルトの設定において夕景のシーンに属しているとされる代表サンプルが、第４区間及び第５区間から抽出される。抽出された画像データは、各区間を代表するものと考えられる。 Next, the printer-side controller 20 extracts image data of a representative sample located at the center of each section. Here, the image data of the representative sample of cluster B is extracted from the first section. Similarly, image data of representative samples of clusters D, E, J, and I are extracted from the second section, the third section, the fourth section, and the fifth section, respectively. At this time, representative samples that belong to a landscape scene in the default settings are extracted from the first to third sections. In addition, representative samples that belong to the evening scene in the default setting are extracted from the fourth section and the fifth section. The extracted image data is considered to represent each section.

プリンタ側コントローラ２０は、抽出した画像データを用いて、設定画面１６３をプリンタ４の表示部１６に表示する。第１区間から抽出されたクラスタＢの代表サンプルの画像データは、図２４の画像ＬＳ１の表示に用いられる。同様に、クラスタＤ、Ｅ、Ｊ、Ｉの代表サンプルの画像データは、それぞれ図２４の画像ＬＳ２、ＬＳ３、ＬＳ４、ＬＳ５の表示に用いられる。 The printer-side controller 20 displays the setting screen 163 on the display unit 16 of the printer 4 using the extracted image data. The image data of the representative sample of cluster B extracted from the first section is used for displaying the image LS1 in FIG. Similarly, the image data of the representative samples of the clusters D, E, J, and I are used to display the images LS2, LS3, LS4, and LS5 in FIG.

また、図２６Ｂの法線と境界（Ｆ_ls（ｘ）＝０）との交点の位置が第３区間と第４区間の境界に相当しているので、プリンタ側コントローラ２０は、図２４の画像ＬＳ３（第３区間から抽出された代表サンプルの画像）と画像ＬＳ４（第４区間から抽出された代表サンプルの画像）との間に境界設定バー１６３Ａを表示する。ところで、画像ＬＳ１〜画像ＬＳ３は風景画像であり、画像ＬＳ４及び画像ＬＳ５は夕景画像であるので、境界設定バー１６３Ａは、風景画像と夕景画像との間に表示されることになる。 In addition, since the position of the intersection between the normal line and the boundary (F_ls (x) = 0) in FIG. 26B corresponds to the boundary between the third section and the fourth section, the printer-side controller 20 causes the image LS3 in FIG. A boundary setting bar 163A is displayed between (the representative sample image extracted from the third section) and the image LS4 (the representative sample image extracted from the fourth section). By the way, since the images LS1 to LS3 are landscape images and the images LS4 and LS5 are sunset images, the boundary setting bar 163A is displayed between the landscape images and the sunset images.

本実施形態では、上記のように、境界の法線に代表サンプルの位置が投影され、法線上に投影された代表サンプルの位置に基づいて、抽出すべき代表サンプルが決定される。これにより、本実施形態では、判別式Ｆ_ls（ｘ）の値が大きいものほど左側になるように、５個の代表サンプルの画像が表示される。言い換えると、風景のシーンに属する確信度の高い順に左から並ぶように、５個の代表サンプルの画像を表示できる。 In the present embodiment, as described above, the position of the representative sample is projected onto the normal line of the boundary, and the representative sample to be extracted is determined based on the position of the representative sample projected onto the normal line. Thereby, in this embodiment, the image of five representative samples is displayed so that the larger the value of the discriminant F_ls (x) is on the left side. In other words, images of five representative samples can be displayed so that they are arranged from the left in descending order of certainty belonging to a landscape scene.

そして、上記のように図２４の設定画面１６３が表示されるので、境界設定バー１６３Ａの左側には、デフォルトの設定では風景画像の代表サンプルが表示される。また、境界設定バー１６３Ａの右側には、デフォルトの設定では夕景画像の代表サンプルが表示される。そして、５個の画像のうちの左側の画像ほど風景の特徴が濃く現れた画像になっており、右側の画像ほど夕景の特徴が濃く現れた画像になっている。言い換えると、５個の画像のうちの左から右へ順に、風景画像から夕景画像に移り変わるように、５個の画像ＬＳ１〜ＬＳ５が表示されるようになっている。このため、境界設定バー１６３Ａの近くに表示される画像は、ユーザの好みによって風景画像か夕景画像かの判断が分かれやすい画像になる。 Then, since the setting screen 163 of FIG. 24 is displayed as described above, a representative sample of a landscape image is displayed on the left side of the boundary setting bar 163A by default. On the right side of the boundary setting bar 163A, a representative sample of the evening scene image is displayed by default. Of the five images, the image on the left side has a darker landscape feature, and the image on the right side has a darker sunset feature. In other words, the five images LS1 to LS5 are displayed so as to change from the landscape image to the sunset image in order from the left to the right among the five images. For this reason, the image displayed near the boundary setting bar 163A is an image in which it is easy to determine whether the image is a landscape image or a sunset image depending on the user's preference.

以上の説明では、設定画面１６３の一番上の５個の画像（風景画像と夕景画像）について説明したが、他のシーンについても、プリンタ側コントローラ２０は、同様の処理を行う。これにより、プリンタ側コントローラ２０は、図２４の設定画面１６３の画像ＬＳ１〜ＬＳ５以外の画像も表示できる。 In the above description, the top five images (landscape image and sunset image) on the setting screen 163 have been described, but the printer-side controller 20 performs the same processing for other scenes. Accordingly, the printer-side controller 20 can also display images other than the images LS1 to LS5 on the setting screen 163 in FIG.

＜第２参考例（その１）＞
次に、図２４に示すように、ユーザが境界設定バー１６３Ａを一つ左に移動して画像ＬＳ２と画像ＬＳ３との間に設定した後の処理について説明する。 <Second Reference Example (1)>
Next, as shown in FIG. 24, a process after the user moves the boundary setting bar 163A to the left and sets it between the images LS2 and LS3 will be described.

境界設定バー１６３Ａの移動後、境界設定バー１６３Ａの右側において、境界設定バー１６３Ａと、デフォルトの設定では夕景画像である画像ＬＳ４との間に、デフォルトの設定では風景画像である画像ＬＳ３（クラスタＥの代表サンプルの画像であり、第３区間を代表する画像である）が位置する状態になる。ユーザが境界設定バー１６３Ａを画像ＬＳ３と画像ＬＳ４との間から画像ＬＳ２と画像ＬＳ３との間に移動したということは、図２６Ｃに示す第３区間に属するクラスタＥ、Ｆに属する学習用サンプルは風景画像ではないとユーザが考えていると想定できる（ここでは、クラスタＥ、Ｆに属する学習用サンプルは夕景画像であるとユーザが考えていると想定できる）。 After the boundary setting bar 163A is moved, on the right side of the boundary setting bar 163A, between the boundary setting bar 163A and the image LS4 which is a sunset scene image by default setting, the image LS3 (cluster E which is a landscape image by default setting). Is a representative sample image and is an image representative of the third section). The fact that the user has moved the boundary setting bar 163A between the images LS3 and LS4 between the images LS2 and LS3 means that the learning samples belonging to the clusters E and F belonging to the third section shown in FIG. It can be assumed that the user thinks that it is not a landscape image (here, it can be assumed that the user thinks that the learning samples belonging to the clusters E and F are sunset images).

図２７Ａは、変更後のデータ群の説明図である。図２７Ｂは、変更後の境界の説明図である。以下、これらの図を用いて、設定変更後のプリンタ側コントローラ２０の処理について説明する。 FIG. 27A is an explanatory diagram of the data group after the change. FIG. 27B is an explanatory diagram of the boundary after the change. Hereinafter, the processing of the printer-side controller 20 after the setting change will be described with reference to these drawings.

まず、プリンタ側コントローラ２０は、クラスタＥ、Ｆに属する学習用サンプルの属性情報を風景から夕景に変更する。例えば、仮に図２５Ａのサンプル番号３がクラスタＥ又はクラスタＦに属していれば、図２７Ａに示すように属性情報を風景から夕景に変更する。 First, the printer-side controller 20 changes the attribute information of the learning samples belonging to the clusters E and F from landscape to sunset. For example, if sample number 3 in FIG. 25A belongs to cluster E or cluster F, the attribute information is changed from landscape to sunset as shown in FIG. 27A.

本参考例では、クラスタＥの代表サンプルの属性情報を変更するだけでなく、クラスタＥに属する学習用サンプルの全ての属性情報を変更している。これにより、ユーザによる一度の操作によって、風景に属さないことをユーザが望んだ画像と似た性質の学習用サンプルの属性情報を、一括して変更することができる。 In this reference example, not only the attribute information of the representative sample of the cluster E is changed, but also all the attribute information of the learning samples belonging to the cluster E are changed. Thereby, the attribute information of the learning sample having properties similar to the image that the user desires not to belong to the landscape can be collectively changed by a single operation by the user.

また、本参考例では、クラスタＥの代表サンプルの属性情報を変更するだけでなく、第３区間に属する学習用サンプル（例えばクラスタＦ）の全ての属性情報を変更している。これにより、ユーザによる一度の操作によって、風景に属さないことをユーザが望んだ画像と同程度に境界から離れた学習用サンプルの属性情報を、一括して変更することができる。 In this reference example, not only the attribute information of the representative sample of the cluster E is changed, but also all the attribute information of the learning sample (for example, the cluster F) belonging to the third section is changed. Thereby, it is possible to collectively change the attribute information of the learning sample that is separated from the boundary as much as the image that the user desires not to belong to the landscape by a single operation by the user.

次に、プリンタ側コントローラ２０は、全体特徴量と変更後の属性情報とに基づいてサポートベクタマシンの再学習を行い、図２７Ｂに示すように境界を変更する。言い換えると、プリンタ側コントローラ２０は、全体特徴量と変更後の属性情報とに基づいて再学習を行い、図２５Ａの重み係数ｗを図２７Ｂに示すように変更する。ここでは、変更後の境界をｆ´（ｘ）＝０と表記し、変更後の重み係数をｗ´と表記している。なお、再学習の演算処理は通常のサポートベクタマシンの学習の演算処理と同じなので、再学習の説明は省略する。 Next, the printer-side controller 20 performs relearning of the support vector machine based on the entire feature amount and the changed attribute information, and changes the boundary as shown in FIG. 27B. In other words, the printer-side controller 20 performs relearning based on the overall feature amount and the changed attribute information, and changes the weighting coefficient w in FIG. 25A as shown in FIG. 27B. Here, the changed boundary is expressed as f ′ (x) = 0, and the changed weighting coefficient is expressed as w ′. Note that the relearning calculation process is the same as the normal support vector machine learning calculation process, and a description of the relearning will be omitted.

なお、図２４に示すように、一番上の境界設定バー１６３Ａの位置が変更されると、風景の重み係数と夕景の重み係数とが変更されることになり、風景識別器５１Ｌの境界ｆ（ｘ）と夕景識別器５１Ｓの境界ｆ（ｘ）とが変更されることになる。風景識別器５１Ｌの境界ｆ（ｘ）を再学習により変更するときには、変更後の属性情報における風景と非風景（夕景・夜景・花・紅葉）とを分離できるように、学習用サンプルを用いた再学習が行われる。夕景識別器５１Ｓの境界ｆ（ｘ）を再学習により変更するときには、変更後の属性情報における夕景と非夕景（風景・夜景・花・紅葉）とを分離できるように、学習用サンプルを用いた再学習が行われる。 As shown in FIG. 24, when the position of the top boundary setting bar 163A is changed, the weighting factor of the landscape and the weighting factor of the sunset scene are changed, and the boundary f of the landscape discriminator 51L is changed. (X) and the boundary f (x) of the evening scene classifier 51S are changed. When the boundary f (x) of the landscape classifier 51L is changed by re-learning, a learning sample is used so that a landscape and a non-landscape (evening scene / night scene / flower / foliage) in the changed attribute information can be separated. Re-learning is performed. When the boundary f (x) of the evening scene classifier 51S is changed by re-learning, a learning sample is used so that the evening scene and the non-evening scene (landscape / night scene / flower / foliage) in the changed attribute information can be separated. Re-learning is performed.

重み係数ｗ（又はｗ´）は、境界の決定に寄与しなければゼロになる。このため、図２５Ａではゼロであった重み係数ｗが、変更によってゼロ以外の値を持つこともある。逆に、図２５Ａではゼロ以外の値を持っていた重み係数ｗが、変更によってゼロになることもある。デフォルトの設定では境界の決定に寄与しない学習用サンプルのデータまでをも図２５Ａのデータ群で記憶しているのは、このためである。 The weighting factor w (or w ′) becomes zero if it does not contribute to the determination of the boundary. For this reason, the weighting coefficient w, which was zero in FIG. 25A, may have a value other than zero due to the change. Conversely, the weighting factor w, which had a value other than zero in FIG. 25A, may become zero due to a change. This is why even the data of the learning sample that does not contribute to the boundary determination in the default setting is stored in the data group of FIG. 25A.

識別対象画像が風景のシーンに属するか否かを風景識別器５１Ｌが判断するとき、風景識別器５１Ｌは、図２７Ｂのデータ群の学習用サンプルの全体特徴量と変更後の風景の重み係数ｗ´（添え字Ｌの重み係数）とに基づいて、前述の数１の判別式の値（変更後の判別式ｆ´（ｘ）の値）を算出する。なお、重み係数ｗ´がゼロの学習用サンプルは除外して、風景識別器５１Ｌは、前述の数１の判別式の値を算出する。これにより、全ての学習用サンプルを用いて判別式の値を求める場合よりも、演算速度が速くなる。 When the landscape discriminator 51L determines whether or not the classification target image belongs to a landscape scene, the landscape discriminator 51L determines the overall feature amount of the learning sample of the data group in FIG. 27B and the changed landscape weight coefficient w. Based on ′ (the weighting factor of the subscript L), the value of the discriminant of the above formula 1 (the value of the discriminant f ′ (x) after change) is calculated. Note that the learning classifier 51L calculates the value of the discriminant of Equation 1 described above, excluding learning samples whose weighting coefficient w ′ is zero. As a result, the calculation speed is faster than the case of obtaining the discriminant value using all the learning samples.

変更後の判別式ｆ´（ｘ）を用いることにより、ユーザの好みを反映した識別処理を行うことができる。例えば、仮に画像ＬＳ３（図２４参照）が赤みのある風景の画像だとすると、識別対象画像が赤みのある風景の画像である場合に、その識別対象画像が風景のシーンに属すると判断され難くなる。言い換えると、仮にクラスタＥ（図２７Ｂ参照）が赤みのある風景の画像の学習用サンプルから構成されていたとすると、識別対象画像が赤みのある風景の画像である場合に、その識別対象画像が風景のシーンに属すると判断され難くなる。 By using the discriminant f ′ (x) after the change, it is possible to perform an identification process reflecting the user's preference. For example, if the image LS3 (see FIG. 24) is a red landscape image, it is difficult to determine that the identification target image belongs to a landscape scene when the identification target image is a red landscape image. In other words, if the cluster E (see FIG. 27B) is composed of a learning sample of a reddish landscape image, when the identification target image is a reddish landscape image, the identification target image is a landscape. It becomes difficult to be judged to belong to the scene.

本参考例では、ユーザの好みに合わせた設定変更を、容易に行うことができる。もし仮に、多数の学習用サンプルの画像を１個ずつ表示し、表示された学習用サンプルの画像のシーンをユーザが１個ずつ決定することにすると、ユーザは何度も決定作業を行う必要があるので不便である。 In this reference example, it is possible to easily change settings according to user preferences. If a large number of learning sample images are displayed one by one and the user decides the scenes of the displayed learning sample images one by one, the user needs to perform a determination process many times. It is inconvenient because there are.

なお、上記の説明では、ユーザが境界設定バー１６３Ａを一つ左に移動した場合について説明した。これに対し、仮にユーザが境界設定バー１６３Ａを一つ右に移動した場合には、境界設定バー１６３Ａの左側において、境界設定バー１６３Ａと、デフォルトの設定では風景画像である画像ＬＳ３との間に、デフォルトの設定では夕景画像である画像ＬＳ４（クラスタＪの代表サンプルの画像であり、第４区間を代表する画像である）が位置する状態になる。このような場合には、プリンタ側コントローラ２０は、第４区間に属するクラスタＪ、Ｈ、Ｇに属する学習用サンプルの属性情報を夕景から風景に変更し、全体特徴量と変更後の属性情報とに基づいてサポートベクタマシンの再学習を行い、境界を変更する。この場合にも、ユーザの好みを反映した識別処理を行うことができる。 In the above description, the case where the user moves the boundary setting bar 163A one place to the left has been described. On the other hand, if the user moves the boundary setting bar 163A one place to the right, the boundary setting bar 163A on the left side of the boundary setting bar 163A is between the boundary setting bar 163A and the image LS3 that is a landscape image in the default setting. In the default setting, an image LS4 (an image of a representative sample of cluster J and an image representative of the fourth section) is located. In such a case, the printer-side controller 20 changes the attribute information of the learning sample belonging to the clusters J, H, and G belonging to the fourth section from the sunset scene to the landscape, and the entire feature amount and the changed attribute information Re-learn the support vector machine based on, and change the boundary. Also in this case, identification processing reflecting user preferences can be performed.

＜第２参考例（その２）＞
上記の説明では、１個の境界設定バーの位置だけが変更された場合について説明しているが、次に、２個の境界設定バーの位置が変更された場合について説明する。 <Second Reference Example (2)>
In the above description, the case where only the position of one boundary setting bar is changed is described. Next, the case where the positions of two boundary setting bars are changed will be described.

図２８は、２個の境界設定バーの位置が変更される様子の説明図である。既に説明した処理によって設定画面１６３が表示部１６に表示された結果、最上段に表示される５個の画像ＬＳ１〜ＬＳ５（図２４参照）のうちのＬＳ３の位置に、クラスタＥの代表サンプルの画像が表示されている。また、同様の処理の結果、２段目に表示される５個の画像ＬＮ１〜ＬＮ５（図２４参照）のうちのＬＮ３の位置に、クラスタＥの代表サンプルの画像が表示されている。 FIG. 28 is an explanatory diagram showing how the positions of two boundary setting bars are changed. As a result of the setting screen 163 being displayed on the display unit 16 by the processing described above, the representative sample of the cluster E is located at the position of LS3 among the five images LS1 to LS5 (see FIG. 24) displayed at the top. An image is displayed. As a result of the same processing, the representative sample image of cluster E is displayed at the position of LN3 among the five images LN1 to LN5 (see FIG. 24) displayed in the second row.

図２８に示すように、ユーザが最上段の境界設定バー１６３Ａを一つ左に移動するとともに、２段目の境界設定バー１６３Ｂも一つ左に移動するとする。このような場合、最上段の境界設定バー１６３Ａの設定変更によって「画像Ｅが夕景画像である」とし、２段目の境界設定バー１６３Ｂの設定変更によって「画像Ｅが夜景画像である」としてしまうと、矛盾が生じてしまう。このように、境界設定バーの位置が変更された場合、変更前後の境界設定バーに挟まれた画像が「○○画像である」と扱うことにすると、矛盾が生じることがある。 As shown in FIG. 28, it is assumed that the user moves the uppermost boundary setting bar 163A to the left and also moves the second boundary setting bar 163B to the left. In such a case, “image E is an evening scene image” due to the setting change of the uppermost boundary setting bar 163A, and “image E is a night scene image” due to the setting change of the second boundary setting bar 163B. And there will be a contradiction. As described above, when the position of the boundary setting bar is changed, if the image sandwiched between the boundary setting bars before and after the change is treated as “a XX image”, a contradiction may occur.

そこで、図２８のように２個の境界設定バーの位置が変更された場合、最上段の境界設定バー１６３Ａの設定変更によって「画像Ｅは風景画像ではない」と考え、２段目の境界設定バー１６３Ｂの設定変更によって「画像Ｅは風景画像ではない」と考えれば、矛盾は生じない。このように、境界設定バーの位置が変更された場合、変更前後の境界設定バーに挟まれた画像は「○○画像ではない」と扱うこととする（なお、位置の変更された境界設定バーが１個の場合も同様である。）。 Therefore, when the positions of the two boundary setting bars are changed as shown in FIG. 28, it is considered that “image E is not a landscape image” by changing the setting of the uppermost boundary setting bar 163A. If it is considered that “image E is not a landscape image” by changing the setting of bar 163B, no contradiction occurs. In this way, when the position of the boundary setting bar is changed, the image sandwiched between the boundary setting bars before and after the change is treated as “not an image” (the boundary setting bar whose position has been changed). The same applies when there is one).

図２９Ａは、最上段の境界設定バー１６３Ａの位置変更結果の説明図である。最上段の境界設定バー１６３Ａが一つ左に移動した結果、変更前後の境界設定バーに挟まれた画像は「風景画像ではない」と扱うため、クラスタＥ、Ｆに属する学習用サンプルの属性情報が風景ではなくなる。
図２９Ｂは、２段目の境界設定バー１６３Ｂの位置変更結果の説明図である。２段目の境界設定バー１６３Ｂが一つ左に移動した結果、変更前後の境界設定バーに挟まれた画像は「風景画像ではない」と扱うため、クラスタＥに属する学習用サンプルの属性情報が風景ではなくなる。 FIG. 29A is an explanatory diagram of the position change result of the uppermost boundary setting bar 163A. As a result of the uppermost boundary setting bar 163A moving to the left, the image sandwiched between the boundary setting bars before and after the change is treated as “not a landscape image”, so the attribute information of the learning samples belonging to clusters E and F Is no longer a landscape.
FIG. 29B is an explanatory diagram of the result of changing the position of the second-stage boundary setting bar 163B. As a result of the movement of the second boundary setting bar 163B to the left, the image sandwiched between the boundary setting bars before and after the change is treated as “not a landscape image”, so the attribute information of the learning sample belonging to the cluster E is It is no longer a landscape.

次に、クラスタＥ、Ｆの属性情報をどのシーンにすべきかについて説明する。図２９Ｃは、２個の境界設定バーの位置変更結果の概念図である。
クラスタＦは、図２９Ａ〜図２９Ｃに示すとおり、最上段の境界設定バー１６３Ａの設定変更だけの影響を受け、２段目の境界設定バー１６３Ｂの設定変更の影響は受けていない。このため、プリンタ側コントローラ２０は、クラスタＦに属する学習用サンプルの属性情報を、夕景に変更する。 Next, which scene should be the attribute information of the clusters E and F will be described. FIG. 29C is a conceptual diagram of the result of changing the positions of two boundary setting bars.
As shown in FIGS. 29A to 29C, the cluster F is affected only by the setting change of the uppermost boundary setting bar 163A and is not affected by the setting change of the second boundary setting bar 163B. Therefore, the printer-side controller 20 changes the attribute information of the learning sample belonging to the cluster F to the evening scene.

クラスタＥは、図２９Ａ〜図２９Ｃに示すとおり、最上段の境界設定バー１６３Ａの設定変更の影響だけでなく、２段目の境界設定バー１６３Ｂの設定変更の影響も受ける。このため、クラスタＥに属する学習用サンプルの属性情報を、夕景にすべきか、夜景にすべきか、問題になる。そこで、まずプリンタ側コントローラ２０は、図２９Ｃの空間上においてクラスタＥの代表サンプルに最も近い代表サンプルであって、風景以外のシーンの代表サンプル（クラスタＧ〜Ｍの代表サンプル）を抽出する。ここでは、クラスタＬの代表サンプルが抽出される。そして、プリンタ側コントローラ２０は、抽出された代表サンプルの属性情報と同じ属性情報になるように、クラスタＥに属する学習用サンプルの属性情報を変更する。つまり、クラスタＥに属する学習用サンプルの属性情報は、夜景に変更される。 As shown in FIGS. 29A to 29C, the cluster E is affected not only by the setting change of the uppermost boundary setting bar 163A but also by the setting change of the second boundary setting bar 163B. For this reason, it becomes a problem whether the attribute information of the learning sample belonging to the cluster E should be a sunset scene or a night scene. Therefore, first, the printer-side controller 20 extracts a representative sample that is closest to the representative sample of the cluster E in the space of FIG. 29C and is a representative sample of a scene other than the landscape (representative samples of the clusters G to M). Here, a representative sample of cluster L is extracted. Then, the printer-side controller 20 changes the attribute information of the learning sample belonging to the cluster E so that the attribute information is the same as the attribute information of the extracted representative sample. That is, the attribute information of the learning sample belonging to the cluster E is changed to a night view.

なお、属性情報の変更後の処理は、既に説明した通りである。すなわち、全体特徴量と変更後の属性情報とに基づいてサポートベクタマシンの再学習を行い、重み係数ｗを変更することによって、判別式を変更する（境界を変更する）。 Note that the processing after the change of the attribute information is as already described. That is, the discriminant is changed (the boundary is changed) by re-learning the support vector machine based on the entire feature amount and the changed attribute information and changing the weighting coefficient w.

上記の処理によれば、２個の境界設定バーの位置が変更された場合においても、ユーザの設定に矛盾なく、ユーザの好みを反映した識別処理を行うことができる。 According to the above process, even when the positions of the two boundary setting bars are changed, the identification process reflecting the user's preference can be performed without contradiction to the user's setting.

＜本実施形態における設定画面１６３にて境界が設定された後の処理＞
前述の第２参考例では、ユーザが図２４の境界設定バー１６３Ａの位置を変更したときに、サポートベクタマシンの再学習を行っていた。このような形態では、プリンタ側コントローラ２０が、学習処理を行うためのプログラムを実行する必要があり、また、再学習の処理時間も必要となる。そこで、本実施形態では、予め複数の判別式が用意されており、境界設定バーの位置に応じて判別式が選択されることによって、プリンタ側コントローラ２０が再学習を行わなくても済むようにしている。なお、判別式は、識別対象画像を評価するための評価関数に相当する。 <Processing after border is set on setting screen 163 in this embodiment>
In the above-described second reference example, the support vector machine is relearned when the user changes the position of the boundary setting bar 163A in FIG. In such a form, the printer-side controller 20 needs to execute a program for performing learning processing, and also requires relearning processing time. Therefore, in this embodiment, a plurality of discriminants are prepared in advance, and the discriminant is selected according to the position of the boundary setting bar, so that the printer-side controller 20 does not have to re-learn. . The discriminant corresponds to an evaluation function for evaluating the identification target image.

図３０は、第２実施形態の学習用サンプルのデータ群である。ここでは、風景識別器５１Ｌのサポートベクタマシンに用いられるデータ群だけが示されている。第２参考例のデータ群（図２５Ａ参照）と比較すると、本実施形態では各学習用サンプルに対して、各シーンごとに複数の重み係数ｗが対応付けられて記憶されている。ここでは、風景のシーンに対して、第１重み係数、第２重み係数、第３重み係数・・・が記憶されている。 FIG. 30 shows a data group of learning samples according to the second embodiment. Here, only the data group used for the support vector machine of the landscape classifier 51L is shown. Compared to the data group of the second reference example (see FIG. 25A), in this embodiment, a plurality of weighting factors w are stored in association with each learning sample for each scene. Here, a first weighting coefficient, a second weighting coefficient, a third weighting coefficient,... Are stored for a landscape scene.

各重み係数は、それぞれ境界設定バー１６３Ａの位置に対応付けられている。例えば、デフォルトの状態（図２４の設定変更前の状態）では第１重み係数が対応付けられており、図２４の設定変更後の状態では第２重み係数が対応付けられている。 Each weighting factor is associated with the position of the boundary setting bar 163A. For example, the first weighting coefficient is associated in the default state (the state before the setting change in FIG. 24), and the second weighting coefficient is associated in the state after the setting change in FIG.

ユーザが設定を変更する前では、プリンタ側コントローラ２０（風景識別器５１Ｌ）は、第１重み係数と全体特徴量とに基づいて、前述の数１の判別式の値を算出する。言い換えると、デフォルトの設定では第１重み係数を用いた判別式が選択され、この判別式によって風景識別器５１Ｌは識別処理を行う。なお、演算速度向上のため、重み係数ｗがゼロの学習用サンプルは除外して、判別式の値が算出される。第１重み係数は、第２参考例のデフォルトの重み係数ｗ_L（図２５Ａ参照）と同じ値である。 Before the user changes the setting, the printer-side controller 20 (scenery discriminator 51L) calculates the value of the discriminant of the above formula 1 based on the first weighting factor and the overall feature amount. In other words, in the default setting, a discriminant using the first weighting factor is selected, and the landscape discriminator 51L performs a discrimination process based on this discriminant. Note that the discriminant value is calculated by excluding learning samples with a weighting factor w of zero in order to improve the calculation speed. The first weighting factor is the same value as the default weighting factor w_L (see FIG. 25A) of the second reference example.

ユーザが境界設定バー１６３Ａを一つ左に移動して画像ＬＳ２と画像ＬＳ３との間に設定した場合、風景識別器５１Ｌは、第２重み係数と全体特徴量とに基づいて、前述の数１の判別式の値を算出する。言い換えると、ユーザが境界設定バー１６３Ａを一つ左に移動して画像ＬＳ２と画像ＬＳ３との間に設定した場合、第２重み係数を用いた判別式が選択され、この判別式によって風景識別器５１Ｌが識別処理を行う。なお、演算速度向上のため、重み係数ｗがゼロの学習用サンプルは除外して、判別式の値が算出される。第２重み係数は、第２参考例において再学習によって求められた重み係数ｗ_L´（図２７Ｂ参照）と同じである。 When the user moves the boundary setting bar 163A one place to the left and sets it between the image LS2 and the image LS3, the landscape discriminator 51L is based on the second weighting factor and the overall feature amount, and the above equation 1 The value of the discriminant is calculated. In other words, when the user moves the boundary setting bar 163A one place to the left and sets it between the images LS2 and LS3, a discriminant using the second weighting coefficient is selected, and the discriminant uses this discriminant to determine the landscape classifier 51L performs the identification process. Note that the discriminant value is calculated by excluding learning samples with a weighting factor w of zero in order to improve the calculation speed. The second weighting factor is the same as the weighting factor w_L ′ (see FIG. 27B) obtained by relearning in the second reference example.

以上の説明では風景の学習用サンプルのデータ群について説明したが、他のシーンについても、同様のデータ群をメモリに記憶している。なお、ユーザが境界設定バー１６３Ａを一つ左に移動して画像ＬＳ２と画像ＬＳ３との間に設定した場合、風景識別器５１Ｌの判別式が変更されるだけでなく、夕景識別器５１Ｓの判別式も変更される（夕景を識別するための判別式が予め複数用意されており、境界設定バー１６３Ａの位置に応じた判別式が選択される）。 In the above description, the landscape learning sample data group has been described, but the same data group is stored in the memory for other scenes. When the user moves the boundary setting bar 163A to the left and sets it between the images LS2 and LS3, not only the discriminant of the landscape discriminator 51L is changed but also the discriminant of the sunset scene discriminator 51S. The formula is also changed (a plurality of discriminants for identifying the sunset scene are prepared in advance, and a discriminant according to the position of the boundary setting bar 163A is selected).

本実施形態においても、第２参考例と同様に、ユーザの好みを反映した識別処理を行うことができる。更に、本実施形態では再学習を行わなくても良いので、サポートベクタマシンによる学習処理を行うプログラムを実行しなくても良い。また、再学習を行わなくても良いので、設定後のプリンタ側コントローラ２０の処理の負荷も軽減される。 Also in the present embodiment, identification processing that reflects user preferences can be performed as in the second reference example. Furthermore, in this embodiment, it is not necessary to perform relearning, and therefore it is not necessary to execute a program for performing learning processing by a support vector machine. In addition, since it is not necessary to perform re-learning, the processing load of the printer-side controller 20 after setting is also reduced.

＝＝＝その他の実施の形態＝＝＝
一実施形態としてのプリンタ等を説明したが、上記の実施形態は、本発明の理解を容易にするためのものであり、本発明を限定して解釈するためのものではない。本発明は、その趣旨を逸脱することなく、変更、改良され得ると共に、本発明にはその等価物が含まれることは言うまでもない。特に、以下に述べる実施形態であっても、本発明に含まれるものである。 === Other Embodiments ===
Although a printer or the like as one embodiment has been described, the above embodiment is for facilitating understanding of the present invention, and is not intended to limit the present invention. The present invention can be changed and improved without departing from the gist thereof, and it is needless to say that the present invention includes equivalents thereof. In particular, the embodiments described below are also included in the present invention.

＜プリンタについて＞
前述の実施形態ではプリンタ４がシーン識別処理をしていたが、デジタルスチルカメラ２がシーン識別処理をしても良い。また、上記のシーン識別処理を行う画像識別装置は、プリンタ４やデジタルスチルカメラ２に限られるものではない。例えば、大量の画像ファイルを保存するフォトストレージのような画像識別装置が、上記のシーン識別処理を行っても良い。もちろん、パーソナルコンピュータやインターネット上に設置されたサーバーが、上記のシーン識別処理を行っても良い。
なお、シーン識別装置に上記のシーン識別処理を実行させるプログラムも、本発明の範疇である。 <About the printer>
In the above-described embodiment, the printer 4 performs the scene identification process, but the digital still camera 2 may perform the scene identification process. Further, the image identification device that performs the above-described scene identification processing is not limited to the printer 4 or the digital still camera 2. For example, an image identification device such as a photo storage that stores a large amount of image files may perform the scene identification process described above. Of course, a personal computer or a server installed on the Internet may perform the scene identification process.
Note that a program that causes the scene identification device to execute the above-described scene identification processing is also within the scope of the present invention.

＜サポートベクタマシンについて＞
前述のサブ識別器５１やサブ部分識別器６１には、サポートベクタマシン（ＳＶＭ）による識別手法が用いられている。しかし、識別対象画像が特定シーンに属するか否かの識別手法は、サポートベクタマシンを用いるものに限られるものではない。例えば、ニューラルネットワーク等のパターン認識を採用しても良い。 <About Support Vector Machine>
For the above-described sub classifier 51 and sub partial classifier 61, a classification method using a support vector machine (SVM) is used. However, the method for identifying whether or not the identification target image belongs to a specific scene is not limited to using a support vector machine. For example, pattern recognition such as a neural network may be employed.

＜シーンの識別について＞
前述の実施形態では、サブ識別器５１やサブ部分識別器６１は、画像データの示す画像が特定のシーンに属するか否かを識別している。しかし、シーンの識別に限られず、何らかのクラスに属するか否かを識別するもので良い。例えば、画像データの示す画像が特定のパターン形状か否かを識別しても良い。 <About scene identification>
In the above-described embodiment, the sub classifier 51 and the sub partial classifier 61 identify whether or not the image indicated by the image data belongs to a specific scene. However, the present invention is not limited to scene identification, and may identify whether it belongs to some class. For example, you may identify whether the image which image data shows is a specific pattern shape.

画像処理システムの説明図である。It is explanatory drawing of an image processing system. プリンタの構成の説明図である。2 is an explanatory diagram of a configuration of a printer. FIG. プリンタの自動補正機能の説明図である。It is explanatory drawing of the automatic correction function of a printer. 画像のシーンと補正内容との関係の説明図である。It is explanatory drawing of the relationship between the scene of an image, and the correction content. シーン識別部によるシーン識別処理のフロー図である。It is a flowchart of the scene identification process by a scene identification part. シーン識別部の機能の説明図である。It is explanatory drawing of the function of a scene identification part. 全体識別処理のフロー図である。It is a flowchart of a whole identification process. 識別対象テーブルの説明図である。It is explanatory drawing of an identification object table. 全体識別処理の肯定閾値の説明図である。It is explanatory drawing of the affirmation threshold value of the whole identification process. RecallとPrecisionの説明図である。It is explanatory drawing of Recall and Precision. 第１否定閾値の説明図である。It is explanatory drawing of a 1st negative threshold value. 第２否定閾値の説明図である。It is explanatory drawing of a 2nd negative threshold value. 図１３Ａは、閾値テーブルの説明図である。図１３Ｂは、風景識別器における閾値の説明図である。図１３Ｃは、風景識別器の処理の概要の説明図である。FIG. 13A is an explanatory diagram of a threshold table. FIG. 13B is an explanatory diagram of threshold values in the landscape classifier. FIG. 13C is an explanatory diagram of an outline of the process of the landscape classifier. 部分識別処理のフロー図である。It is a flowchart of a partial identification process. 夕景部分識別器が選択する部分画像の順番の説明図である。It is explanatory drawing of the order of the partial image which an evening scene partial identifier selects. 上位１０番目までの１０個の部分画像だけで夕景画像の識別をしたときのRecall及びPrecisionのグラフである。It is a Recall and Precision graph when the evening scene image is identified only by the top 10 partial images. 図１７Ａは、線形サポートベクタマシンによる判別の説明図である。図１７Ｂは、カーネル関数を用いた判別の説明図である。FIG. 17A is an explanatory diagram of determination by the linear support vector machine. FIG. 17B is an explanatory diagram of discrimination using a kernel function. 統合識別処理のフロー図である。It is a flowchart of an integrated identification process. 第１実施形態の設定画面の説明図である。It is explanatory drawing of the setting screen of 1st Embodiment. 図２０Ａは、メモリ２３に記憶されている第１参考例の学習用サンプルのデータ群である。図２０Ｂは、各学習用サンプルの分布の説明図である。FIG. 20A is a data group of learning samples of the first reference example stored in the memory 23. FIG. 20B is an explanatory diagram of the distribution of each learning sample. 図２１Ａは、境界（ｆ（ｘ）＝０）の法線に代表サンプルを投影する様子の説明図である。図２１Ｂは、法線上に投影された代表サンプルの説明図である。図２１Ｂは、法線上に投影された代表サンプルの説明図である。FIG. 21A is an explanatory diagram illustrating a state in which the representative sample is projected onto the normal line of the boundary (f (x) = 0). FIG. 21B is an explanatory diagram of a representative sample projected on the normal line. FIG. 21B is an explanatory diagram of a representative sample projected on the normal line. 図２２Ａは、変更後のデータ群の説明図である。図２２Ｂは、変更後の境界の説明図である。FIG. 22A is an explanatory diagram of the data group after the change. FIG. 22B is an explanatory diagram of the boundary after the change. 第１実施形態の学習用サンプルのデータ群である。It is a data group of the sample for learning of 1st Embodiment. 第２実施形態の設定画面の説明図である。It is explanatory drawing of the setting screen of 2nd Embodiment. 図２５Ａは、メモリ２３に記憶されている第２参考例の学習用サンプルのデータ群である。図２５Ｂは、各学習用サンプルの分布の説明図である。FIG. 25A is a data group of learning samples of the second reference example stored in the memory 23. FIG. 25B is an explanatory diagram of the distribution of each learning sample. 図２６Ａは、風景画像と夕景画像とを分離する境界Ｆ_ls（ｘ）＝０の説明図である。図２６Ｂは、境界（Ｆ_ls（ｘ）＝０）の法線に代表サンプルを投影する様子の説明図である。図２６Ｃは、法線上に投影された代表サンプルの説明図である。FIG. 26A is an explanatory diagram of a boundary F_ls (x) = 0 that separates a landscape image and an evening scene image. FIG. 26B is an explanatory diagram showing a state in which the representative sample is projected onto the normal line of the boundary (F_ls (x) = 0). FIG. 26C is an explanatory diagram of a representative sample projected on the normal line. 図２７Ａは、変更後のデータ群の説明図である。図２７Ｂは、変更後の境界の説明図である。FIG. 27A is an explanatory diagram of the data group after the change. FIG. 27B is an explanatory diagram of the boundary after the change. ２個の境界設定バーの位置が変更される様子の説明図である。It is explanatory drawing of a mode that the position of two boundary setting bars is changed. 図２９Ａは、最上段の境界設定バー１６３Ａの位置変更結果の説明図である。図２９Ｂは、２段目の境界設定バー１６３Ｂの位置変更結果の説明図である。図２９Ｃは、２個の境界設定バーの位置変更結果の概念図である。FIG. 29A is an explanatory diagram of the position change result of the uppermost boundary setting bar 163A. FIG. 29B is an explanatory diagram of the result of changing the position of the second-stage boundary setting bar 163B. FIG. 29C is a conceptual diagram of the result of changing the positions of two boundary setting bars. 第２実施形態の学習用サンプルのデータ群である。It is a data group of the sample for learning of 2nd Embodiment.

Explanation of symbols

２デジタルスチルカメラ、２Ａモード設定ダイヤル、
４プリンタ、６メモリカード、１０印刷機構、
１１ヘッド、１２ヘッド制御部、１３モータ、１４センサ、
１５パネル部、１６表示部、１７入力部、
２０プリンタ側コントローラ、２１スロット、２２ＣＰＵ、
２３メモリ、２４制御ユニット、２５駆動信号生成部、
３１記憶部、３１Ａ画像記憶部、３１Ｂ結果記憶部、
３２顔識別部、３３シーン識別部、３４画像補正部、
３５プリンタ制御部、４０特徴量取得部、５０全体識別器、
５１サブ識別器、５１Ｌ風景識別器、５１Ｓ夕景識別器、
５１Ｎ夜景識別器、５１Ｆ花識別器、５１Ｒ紅葉識別器、
６０部分識別器、６１サブ部分識別器、６１Ｓ夕景部分識別器、
６１Ｆ花部分識別器、６１Ｒ紅葉部分識別器、
７０統合識別器、１６１設定画面、１６１Ａ境界設定バー、
１６３設定画面、１６３Ａ境界設定バー、１６３Ｂ境界設定バー 2 Digital still camera, 2A mode setting dial,
4 printer, 6 memory card, 10 printing mechanism,
11 head, 12 head control unit, 13 motor, 14 sensor,
15 Panel section, 16 Display section, 17 Input section,
20 printer-side controller, 21 slots, 22 CPU,
23 memory, 24 control unit, 25 drive signal generator,
31 storage unit, 31A image storage unit, 31B result storage unit,
32 face identification unit, 33 scene identification unit, 34 image correction unit,
35 Printer control unit, 40 feature quantity acquisition unit, 50 overall classifier,
51 sub classifier, 51L landscape classifier, 51S evening scene classifier,
51N night view classifier, 51F flower classifier, 51R autumn leaves classifier,
60 partial classifiers, 61 sub partial classifiers, 61S evening scene partial classifiers,
61F Flower partial classifier, 61R Autumn colored partial classifier,
70 Integrated identifier, 161 setting screen, 161A border setting bar,
163 setting screen, 163A border setting bar, 163B border setting bar

Claims

An identification method for identifying whether or not the identification object belongs to a certain class based on a comparison result between a value of an evaluation function for evaluating the identification object and a threshold value,
An extraction step of extracting samples belonging to the certain class and samples not belonging to the certain class;
A plurality of extracted samples are displayed side by side on a display unit, and a mark is displayed between a sample belonging to the certain class and a sample not belonging to the certain class, and the position of the mark is determined according to a user instruction. A display step of displaying the mark between another sample by moving
A setting change step of selecting the evaluation function according to the position of the mark determined by the user from among the plurality of evaluation functions prepared in advance;
An identification step for identifying whether or not the identification object belongs to the certain class based on a comparison result between the value of the evaluation function when the identification object is evaluated using the selected evaluation function and the threshold value; ,
An identification method comprising:

The identification method according to claim 1,
In the extraction step, samples belonging to the certain class and samples belonging to a class different from the certain class are extracted,
In the setting change step, an evaluation function for identifying whether or not an identification target belongs to the certain class is selected, and an evaluation function for identifying whether or not the identification target belongs to the another class An identification method characterized by being selected.

The identification method according to claim 2,
A sample to be extracted based on the position of the sample projected onto the normal by projecting the sample onto a hyperplane normal that separates the sample belonging to the class and the sample belonging to the other class The identification method characterized by determining.

The identification method according to claim 1,
The identification process is to identify whether the identification object belongs to the certain class based on a hyperplane separating a space,
In the extraction step, the sample is projected onto a normal of the hyperplane, and the sample to be extracted is determined based on the position of the sample projected on the normal.

Based on the comparison result between the value of the evaluation function for evaluating the identification object and the threshold value, the identification device for identifying whether the identification object belongs to a certain class,
Samples belonging to the certain class and samples not belonging to the certain class are extracted,
A plurality of the extracted samples are displayed side by side on a display unit, and a mark is displayed between a sample belonging to the certain class and a sample not belonging to the certain class, and the position of the mark is determined according to a user instruction. To display the mark between another sample and the sample,
The evaluation function corresponding to the position of the mark determined by the user is selected from the plurality of evaluation functions prepared in advance,
Whether or not the identification object belongs to the certain class is identified based on a comparison result between the value of the evaluation function when the identification object is evaluated using the selected evaluation function and the threshold value. Program.