JP2008242751A

JP2008242751A - Image discrimination method, image discrimination device, and program

Info

Publication number: JP2008242751A
Application number: JP2007081697A
Authority: JP
Inventors: Kenji Fukazawa; 賢二深沢; Hirokazu Kasahara; 広和笠原
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 2007-03-27
Filing date: 2007-03-27
Publication date: 2008-10-09
Anticipated expiration: 2027-03-27
Also published as: JP4910821B2

Abstract

<P>PROBLEM TO BE SOLVED: To improve the processing speed of the discrimination processing of image data. <P>SOLUTION: This image discrimination method is provided to perform discrimination processing for successively and selectively discriminating whether or not an image shown by image data is belonging to a specific category for every category (A), to omit the discrimination processing which has not be performed according to the result of certain discrimination processing (B), and to discriminate the category of the image based on the result of the discrimination processing (C). According to the result of the previously performed discrimination processing, the order of selection of the discrimination processing to be performed later is determined. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、画像識別方法、画像識別装置及びプログラムに関する。 The present invention relates to an image identification method, an image identification apparatus, and a program.

デジタルスチルカメラには撮影モードを設定するモード設定ダイヤルを持つものがある。ユーザがダイヤルで撮影モードを設定すると、デジタルスチルカメラは撮影モードに応じた撮影条件（露光時間等）を決定し、撮影を行う。撮影が行われると、デジタルスチルカメラは、画像ファイルを生成する。この画像ファイルには、撮影した画像の画像データに、撮影時の撮影条件等の付加データが付加されている。 Some digital still cameras have a mode setting dial for setting a shooting mode. When the user sets the shooting mode with the dial, the digital still camera determines shooting conditions (such as exposure time) according to the shooting mode and performs shooting. When shooting is performed, the digital still camera generates an image file. In this image file, additional data such as shooting conditions at the time of shooting is added to the image data of the shot image.

付加データに応じて画像データに画像処理することが行われている。例えば、プリンタが画像ファイルに基づいて印刷を行うとき、付加データの示す撮影条件に応じて画像データを補正し、補正した画像データに従って印刷することが行われている（特許文献１参照）。 Image processing is performed on the image data according to the additional data. For example, when a printer performs printing based on an image file, the image data is corrected according to the shooting conditions indicated by the additional data, and printing is performed according to the corrected image data (see Patent Document 1).

また、画像データを解析し、画像データの示す画像のカテゴリを分類することも行われている（特許文献２、３参照）。
特開２００１−２３８１７７号公報特開平１０−３０２０６７号公報特表２００６−５１１０００号公報 In addition, image data is analyzed to classify image categories indicated by the image data (see Patent Documents 2 and 3).
JP 2001-238177 A Japanese Patent Laid-Open No. 10-302067 JP 2006-511000 Gazette

画像データの示す画像が特定のカテゴリに属するか否かを識別する識別器を複数組み合わせることによって、画像データを複数のカテゴリに分類することが行われている。このように複数の識別器を用いる場合、常に識別器の選択順序を同じにすると、識別処理の速度を速くすることができない。 The image data is classified into a plurality of categories by combining a plurality of discriminators for identifying whether or not the image indicated by the image data belongs to a specific category. When a plurality of discriminators are used in this way, the speed of the discrimination process cannot be increased if the selection order of the discriminators is always the same.

本発明は、画像データの識別処理の速度を向上させることを目的とする。 An object of the present invention is to improve the speed of image data identification processing.

上記目的を達成するための主たる発明は、画像データの示す画像が特定のカテゴリに属するか否かを識別する識別処理を、複数の前記カテゴリ毎に順に選択して行い、ある前記識別処理の結果に応じて、まだ行われていない前記識別処理が省略され、前記識別処理の結果に基づいて、前記画像のカテゴリを識別する画像識別方法において、先に行われる識別処理の結果に応じて、後に行われる識別処理の選択順序が決定されることを特徴とする。
本発明の他の特徴については、本明細書及び添付図面の記載により明らかにする。 The main invention for achieving the above object is to perform identification processing for identifying whether or not an image indicated by image data belongs to a specific category in order for each of the plurality of categories. The identification process that has not yet been performed is omitted, and in the image identification method for identifying the category of the image based on the result of the identification process, The selection order of identification processing to be performed is determined.
Other features of the present invention will become apparent from the description of the present specification and the accompanying drawings.

本明細書及び添付図面の記載により、少なくとも、以下の事項が明らかとなる。
画像データの示す画像が特定のカテゴリに属するか否かを識別する識別処理を、複数の前記カテゴリ毎に順に選択して行い、
ある前記識別処理の結果に応じて、まだ行われていない前記識別処理が省略され、
前記識別処理の結果に基づいて、前記画像のカテゴリを識別する
画像識別方法において、
先に行われる識別処理の結果に応じて、後に行われる識別処理の選択順序が決定される
ことを特徴とする画像識別方法が明らかになる。
このような画像識別方法によれば、処理時間を速くすることができる。 At least the following matters will become clear from the description of the present specification and the accompanying drawings.
Identification processing for identifying whether the image indicated by the image data belongs to a specific category is performed by selecting in order for each of the plurality of categories,
Depending on the result of the identification process, the identification process that has not yet been performed is omitted,
In the image identification method for identifying the category of the image based on the result of the identification process,
An image identification method is characterized in that the order of selection of identification processes to be performed later is determined according to the result of the identification process performed first.
According to such an image identification method, the processing time can be shortened.

また、ある前記識別処理においてその識別処理に対応する特定のカテゴリに前記画像が属することが識別されたとき、まだ行われていない前記識別処理が省略されることが望ましい。また、ある前記識別処理においてその識別処理とは別の識別処理に対応する特定のカテゴリに前記画像が属さないことが識別されたとき、前記別の識別処理が省略されることが望ましい。これにより、処理時間を速くすることができる。 In addition, when it is identified that the image belongs to a specific category corresponding to the identification process in the identification process, it is preferable that the identification process that has not been performed is omitted. In addition, when it is identified that the image does not belong to a specific category corresponding to an identification process different from the identification process in the identification process, it is preferable that the another identification process is omitted. As a result, the processing time can be increased.

また、前記識別処理では、前記画像が前記特定のカテゴリに属する確率に応じた値が算出され、その値に基づいて前記画像が前記特定のカテゴリに属するか否かが識別され、前記先に行われる識別処理で算出された前記値に応じて、前記後に行われる識別処理の選択順序が決定されることが望ましい。これにより、後に行われる識別処理の選択順序の決定が簡略になる。 In the identification process, a value corresponding to the probability that the image belongs to the specific category is calculated, and based on the value, it is identified whether the image belongs to the specific category. It is preferable that the selection order of the identification process performed later is determined according to the value calculated in the identification process. This simplifies determination of the selection order of identification processing performed later.

また、前記先に行われる識別処理における結果と、次に行われる識別処理とをそれぞれ関連付けたデータを用いて、前記選択順序が決定されることが望ましい。このようなデータを用いることによって、処理速度を速めることが実現できる。 In addition, it is preferable that the selection order is determined using data in which the result of the identification processing performed first and the identification processing performed next are associated with each other. By using such data, it is possible to increase the processing speed.

また、選択順序の異なる複数の候補について、前記後に行われる識別処理の処理時間と、前記前記先に行われる識別処理の前記結果の発生確率とに基づいて、前記画像のカテゴリを識別するまでの処理時間の期待値が算出され、前記複数の候補の中から前記期待値の短い候補が選択されることにより、前記選択順序が決定されることが望ましい。これにより、処理時間を速くすることができる。 In addition, for a plurality of candidates having different selection orders, until the category of the image is identified based on the processing time of the identification process performed later and the occurrence probability of the result of the identification process performed earlier. It is preferable that an expected value of processing time is calculated, and the selection order is determined by selecting a candidate with a short expected value from the plurality of candidates. As a result, the processing time can be increased.

また、ｎ−１個の識別処理を行う際の前記期待値の短い候補を決定する決定処理があり、ｎ個の識別処理を行う際の前記期待値の短い候補を決定するとき、最初に行う識別処理がそれぞれ異なるｎ個の候補についてそれぞれ前記決定処理を用いて前記期待値が算出され、ｎ個の候補の中から前記期待値の短い候補が選択されることにより、前記選択順序が決定されることが望ましい。これにより、簡易な手順によって、最適な選択順序を決定できる。 In addition, there is a determination process for determining a candidate with a short expected value when performing n-1 identification processes, and the determination process is performed first when determining a candidate with a short expected value when performing n identification processes. The expectation value is calculated using the determination process for each of n candidates having different identification processes, and the selection order is determined by selecting a candidate with a short expectation value from the n candidates. It is desirable. Thereby, the optimal selection order can be determined by a simple procedure.

コントローラを備える画像識別装置であって、
前記コントローラは、
画像データの示す画像が特定のカテゴリに属するか否かを識別する識別処理を、複数の前記カテゴリ毎に順に選択して行い、
ある前記識別処理の結果に応じて、まだ行われていない前記識別処理が省略され、
前記識別処理の結果に基づいて、前記画像のカテゴリを識別するとともに、
前記先に行われる識別処理で算出された前記値に応じて、前記後に行われる識別処理の選択順序を決定する
ことを特徴とする画像識別装置が明らかになる。 An image identification device comprising a controller,
The controller is
Identification processing for identifying whether the image indicated by the image data belongs to a specific category is performed by selecting in order for each of the plurality of categories,
Depending on the result of the identification process, the identification process that has not yet been performed is omitted,
Based on the result of the identification process, the category of the image is identified,
An image identification apparatus characterized by determining the selection order of the identification process performed later according to the value calculated in the identification process performed earlier.

画像識別装置に、
画像データの示す画像が特定のカテゴリに属するか否かを識別する識別処理を、複数の前記カテゴリ毎に順に選択して行わせ、
ある前記識別処理の結果に応じて、まだ行われていない前記識別処理が省略させ、
前記識別処理の結果に基づいて、前記画像のカテゴリを識別させる
プログラムにおいて、
前記画像識別装置に、先に行われる識別処理の結果に応じて、後に行われる識別処理の選択順序を決定させる
ことを特徴とするプログラムが明らかになる。 In the image identification device,
Identification processing for identifying whether an image indicated by image data belongs to a specific category is performed by selecting in order for each of the plurality of categories,
Depending on the result of the identification process, the identification process that has not yet been performed is omitted,
In a program for identifying the category of the image based on the result of the identification process,
A program that causes the image identification apparatus to determine the selection order of identification processing to be performed later according to the result of identification processing to be performed first becomes clear.

＝＝＝全体構成＝＝＝
図１は、画像処理システムの説明図である。この画像処理システムは、デジタルスチルカメラ２と、プリンタ４とを備える。 === Overall structure ===
FIG. 1 is an explanatory diagram of an image processing system. This image processing system includes a digital still camera 2 and a printer 4.

デジタルスチルカメラ２は、被写体をデジタルデバイス（ＣＣＤなど）に結像させることによりデジタル画像を取得するカメラである。デジタルスチルカメラ２には、モード設定ダイヤル２Ａが設けられている。ユーザは、ダイヤル２Ａによって、撮影条件に応じた撮影モードを設定することができる。例えば、ダイヤル２Ａによって「夜景」モードが設定されると、デジタルスチルカメラ２は、シャッター速度を遅くしたり、ＩＳＯ感度を高くしたりして、夜景撮影に適した撮影条件にて撮影を行う。
デジタルスチルカメラ２は、ファイルフォーマット規格に準拠して、撮影により生成した画像ファイルをメモリカード６に保存する。画像ファイルには、撮影した画像のデジタルデータ（画像データ）だけでなく、撮影時の撮影条件（撮影データ）等の付加データも保存される。 The digital still camera 2 is a camera that acquires a digital image by forming an image of a subject on a digital device (CCD or the like). The digital still camera 2 is provided with a mode setting dial 2A. The user can set the shooting mode according to the shooting conditions by using the dial 2A. For example, when the “night scene” mode is set by the dial 2A, the digital still camera 2 performs shooting under shooting conditions suitable for night scene shooting by decreasing the shutter speed or increasing the ISO sensitivity.
The digital still camera 2 stores an image file generated by photographing in the memory card 6 in accordance with the file format standard. In the image file, not only digital data (image data) of a captured image but also additional data such as a shooting condition (shooting data) at the time of shooting is stored.

プリンタ４は、画像データの示す画像を紙に印刷する印刷装置である。プリンタ４には、メモリカード６を挿入するスロット２１が設けられている。ユーザは、デジタルスチルカメラ２で撮影した後、デジタルスチルカメラ２からメモリカード６を取り出し、スロット２１にメモリカード６を挿入することができる。
図２は、プリンタ４の構成の説明図である。プリンタ４は、印刷機構１０と、この印刷機構１０を制御するプリンタ側コントローラ２０とを備える。印刷機構１０は、インクを吐出するヘッド１１と、ヘッド１１を制御するヘッド制御部１２と、紙を搬送するため等のモータ１３と、センサ１４とを有する。プリンタ側コントローラ２０は、メモリカード６からデータを送受信するためのメモリ用スロット２１と、ＣＰＵ２２と、メモリ２３と、モータ１３を制御する制御ユニット２４と、駆動信号（駆動波形）を生成する駆動信号生成部２５とを有する。 The printer 4 is a printing device that prints an image indicated by image data on paper. The printer 4 is provided with a slot 21 into which the memory card 6 is inserted. The user can take a picture with the digital still camera 2, remove the memory card 6 from the digital still camera 2, and insert the memory card 6 into the slot 21.
FIG. 2 is an explanatory diagram of the configuration of the printer 4. The printer 4 includes a printing mechanism 10 and a printer-side controller 20 that controls the printing mechanism 10. The printing mechanism 10 includes a head 11 that ejects ink, a head control unit 12 that controls the head 11, a motor 13 for conveying paper, and a sensor 14. The printer-side controller 20 includes a memory slot 21 for transmitting and receiving data from the memory card 6, a CPU 22, a memory 23, a control unit 24 for controlling the motor 13, and a drive signal for generating a drive signal (drive waveform). And a generation unit 25.

メモリカード６がスロット２１に挿入されると、プリンタ側コントローラ２０は、メモリカード６に保存されている画像ファイルを読み出してメモリ２３に記憶する。そして、プリンタ側コントローラ２０は、画像ファイルの画像データを、印刷機構１０で印刷するための印刷データに変換し、印刷データに基づいて印刷機構１０を制御し、紙に画像を印刷する。この一連の動作は、「ダイレクトプリント」と呼ばれている。
なお、「ダイレクトプリント」は、メモリカード６をスロット２１に挿入することによって行われるだけでなく、デジタルスチルカメラ２とプリンタ４とをケーブル（不図示）で接続することによっても可能である。 When the memory card 6 is inserted into the slot 21, the printer-side controller 20 reads out the image file stored in the memory card 6 and stores it in the memory 23. Then, the printer-side controller 20 converts the image data of the image file into print data for printing by the printing mechanism 10, controls the printing mechanism 10 based on the printing data, and prints an image on paper. This series of operations is called “direct printing”.
“Direct printing” is not only performed by inserting the memory card 6 into the slot 21, but also by connecting the digital still camera 2 and the printer 4 with a cable (not shown).

メモリカード６に記憶される画像ファイルは、画像データと、付加データとから構成されている。画像データは、複数の画素データから構成されている。画素データは、画素の色情報（階調値）を示すデータである。画素がマトリクス状に配置されることによって、画像が構成される。このため、画像データは、画像を示すデータである。付加データには、画像データの特性を示すデータや、撮影データや、サムネイル画像データ等が含まれる。 The image file stored in the memory card 6 is composed of image data and additional data. The image data is composed of a plurality of pixel data. The pixel data is data indicating pixel color information (gradation value). An image is formed by arranging the pixels in a matrix. Therefore, the image data is data indicating an image. The additional data includes data indicating the characteristics of image data, shooting data, thumbnail image data, and the like.

＝＝＝自動補正機能の概要＝＝＝
「人物」の写真を印刷するときには、肌色をきれいにしたいという要求がある。また、「風景」の写真を印刷するときには、空の青色を強調し、木や草の緑色を強調したいという要求がある。そこで、本実施形態のプリンタ４は、画像ファイルを分析して自動的に適した補正処理を行う自動補正機能を備えている。 === Outline of automatic correction function ===
When printing a “person” photo, there is a demand to clean the skin tone. In addition, when printing a “landscape” photograph, there is a demand for emphasizing the blue of the sky and the green of trees and grass. Therefore, the printer 4 of the present embodiment includes an automatic correction function that analyzes an image file and automatically performs a suitable correction process.

図３は、プリンタ４の自動補正機能の説明図である。図中のプリンタ側コントローラ２０の各要素は、ソフトウェアとハードウェアによって実現される。
記憶部３１は、メモリ２３の一部の領域及びＣＰＵ２２によって実現される。メモリカード６から読み出された画像ファイルの全部又は一部は、記憶部３１の画像記憶部３１Ａに展開される。また、プリンタ側コントローラ２０の各要素の演算結果は、記憶部３１の結果記憶部３１Ｂに格納される。 FIG. 3 is an explanatory diagram of the automatic correction function of the printer 4. Each element of the printer-side controller 20 in the figure is realized by software and hardware.
The storage unit 31 is realized by a partial area of the memory 23 and the CPU 22. All or part of the image file read from the memory card 6 is developed in the image storage unit 31 </ b> A of the storage unit 31. In addition, the calculation result of each element of the printer-side controller 20 is stored in the result storage unit 31B of the storage unit 31.

顔識別部３２は、ＣＰＵ２２と、メモリ２３に記憶された顔識別プログラムとによって実現される。顔識別部３２は、画像記憶部３１Ａに記憶された画像データを分析し、顔の有無を識別する。顔識別部３２によって顔が有ると識別された場合、識別対象となる画像が「人物」のシーンに属すると識別される。この場合、シーン識別部３３によるシーン識別処理は行われない。顔識別部３２による顔識別処理は、既に広く行われている処理と同様なので、詳細な説明は省略する。 The face identification unit 32 is realized by the CPU 22 and a face identification program stored in the memory 23. The face identification unit 32 analyzes the image data stored in the image storage unit 31A and identifies the presence or absence of a face. When the face identifying unit 32 identifies that there is a face, the image to be identified is identified as belonging to the “person” scene. In this case, the scene identification process by the scene identification unit 33 is not performed. Since the face identification process by the face identification unit 32 is the same as a process that has already been widely performed, detailed description thereof is omitted.

シーン識別部３３は、ＣＰＵ２２と、メモリ２３に記憶されたシーン識別プログラムとによって実現される。シーン識別部３３は、画像記憶部３１Ａに記憶された画像ファイルを分析し、画像データの示す画像のシーンを識別する。顔識別部３２によって顔がないと識別された場合に、シーン識別部３３によるシーン識別処理が行われる。後述するように、シーン識別部３３は、識別対象となる画像が「風景」、「夕景」、「夜景」、「花」、「紅葉」、「その他」のいずれの画像であるかを識別する。 The scene identification unit 33 is realized by the CPU 22 and a scene identification program stored in the memory 23. The scene identification unit 33 analyzes the image file stored in the image storage unit 31A and identifies the scene of the image indicated by the image data. When the face identifying unit 32 identifies that there is no face, a scene identifying process by the scene identifying unit 33 is performed. As will be described later, the scene identifying unit 33 identifies whether the image to be identified is a “landscape”, “evening scene”, “night scene”, “flower”, “autumn leaves”, or “other” image. .

図４は、画像のシーンと補正内容との関係の説明図である。
画像補正部３４は、ＣＰＵ２２と、メモリ２３に記憶された画像補正プログラムとによって実現される。画像補正部３４は、記憶部３１の結果記憶部３１Ｂ（後述）に記憶されている識別結果（顔識別部３２やシーン識別部３３の識別結果）に基づいて、画像記憶部３１Ａの画像データを補正する。例えば、シーン識別部３３の識別結果が「風景」である場合には、青色を強調し、緑色を強調するような補正が行われる。なお、画像補正部３４は、シーンの識別結果だけでなく、画像ファイルの撮影データの内容も反映して、画像データを補正しても良い。例えば、露出補正がマイナスの場合、暗い雰囲気の画像を明るくしないように画像データを補正しても良い。 FIG. 4 is an explanatory diagram of a relationship between an image scene and correction contents.
The image correction unit 34 is realized by the CPU 22 and an image correction program stored in the memory 23. The image correction unit 34 converts the image data of the image storage unit 31A based on the identification results (identification results of the face identification unit 32 and the scene identification unit 33) stored in the result storage unit 31B (described later) of the storage unit 31. to correct. For example, when the identification result of the scene identification unit 33 is “landscape”, correction is performed so that blue is emphasized and green is emphasized. The image correction unit 34 may correct the image data by reflecting not only the scene identification result but also the contents of the image data of the image file. For example, when the exposure correction is negative, the image data may be corrected so as not to brighten the dark atmosphere image.

プリンタ制御部３５は、ＣＰＵ２２、駆動信号生成部２５、制御ユニット２４及びメモリ２３に記憶されたプリンタ制御プログラムによって、実現される。プリンタ制御部３５は、補正後の画像データを印刷データに変換し、印刷機構１０に画像を印刷させる。 The printer control unit 35 is realized by a printer control program stored in the CPU 22, the drive signal generation unit 25, the control unit 24, and the memory 23. The printer control unit 35 converts the corrected image data into print data, and causes the printing mechanism 10 to print the image.

＝＝＝シーン識別処理＝＝＝
図５は、シーン識別部３３によるシーン識別処理のフロー図である。図６は、シーン識別部３３の機能の説明図である。図中のシーン識別部３３の各要素は、ソフトウェアとハードウェアによって実現される。シーン識別部３３は、図６に示す特徴量取得部４０と、全体識別器５０と、部分識別器６０と、統合識別器７０とを備えている。 === Scene Identification Processing ===
FIG. 5 is a flowchart of the scene identification process performed by the scene identification unit 33. FIG. 6 is an explanatory diagram of the function of the scene identification unit 33. Each element of the scene identification unit 33 in the figure is realized by software and hardware. The scene identification unit 33 includes a feature amount acquisition unit 40, an overall classifier 50, a partial classifier 60, and an integrated classifier 70 shown in FIG.

最初に、特徴量取得部４０が、記憶部３１の画像記憶部３１Ａに展開された画像データを分析し、部分特徴量を取得する（Ｓ１０１）。具体的には、特徴量取得部４０は、画像データを８×８の６４ブロックに分割し、各ブロックの色平均と分散を算出し、この色平均と分散を部分特徴量として取得する。なお、ここでは各画素はＹＣＣ色空間における階調値のデータをもっており、各ブロックごとに、Ｙの平均値、Ｃｂの平均値及びＣｒの平均値がそれぞれ算出され、Ｙの分散、Ｃｂの分散及びＣｒの分散がそれぞれ算出される。つまり、各ブロックごとに３つの色平均と３つの分散が部分特徴量として算出される。これらの色平均や分散は、各ブロックにおける部分画像の特徴を示すものである。なお、ＲＧＢ色空間における平均値や分散を算出しても良い。
ブロックごとに色平均と分散が算出されるので、特徴量取得部４０は、画像記憶部３１Ａには画像データの全てを展開せずに、ブロック分の画像データをブロック順に展開する。このため、画像記憶部３１Ａは、必ずしも画像ファイルの全てを展開できるだけの容量を備えていなくても良い。 First, the feature amount acquisition unit 40 analyzes the image data developed in the image storage unit 31A of the storage unit 31 and acquires partial feature amounts (S101). Specifically, the feature amount acquisition unit 40 divides the image data into 8 × 8 64 blocks, calculates the color average and variance of each block, and acquires the color average and variance as partial feature amounts. Here, each pixel has gradation value data in the YCC color space, and the average value of Y, the average value of Cb, and the average value of Cr are calculated for each block, and the variance of Y and the variance of Cb are calculated. And the variance of Cr are calculated respectively. That is, three color averages and three variances are calculated as partial feature amounts for each block. These color averages and variances indicate the characteristics of the partial images in each block. Note that an average value or variance in the RGB color space may be calculated.
Since the color average and variance are calculated for each block, the feature amount acquisition unit 40 expands the image data for the blocks in the block order without expanding all the image data in the image storage unit 31A. For this reason, the image storage unit 31A does not necessarily have a capacity sufficient to expand all of the image files.

次に、特徴量取得部４０が、全体特徴量を取得する（Ｓ１０２）。具体的には、特徴量取得部４０は、画像データの全体の色平均、分散、重心及び撮影情報を、全体特徴量として取得する。なお、これらの色平均や分散は、画像の全体の特徴を示すものである。画像データ全体の色平均、分散及び重心は、先に算出した部分特徴量を用いて算出される。このため、全体特徴量を算出する際に、画像データを再度展開する必要がないので、全体特徴量の算出速度が速くなる。全体識別処理（後述）は部分識別処理（後述）よりも先に行われるにも関わらず、全体特徴量が部分特徴量よりも後に求められるのは、このように算出速度を速めるためである。なお、撮影情報は、画像ファイルの撮影データから抽出される。具体的には、絞り値、シャッター速度、フラッシュ発光の有無などの情報が全体特徴量として用いられる。但し、画像ファイルの撮影データの全てが全体特徴量として用いられるわけではない。 Next, the feature amount acquisition unit 40 acquires the entire feature amount (S102). Specifically, the feature quantity acquisition unit 40 acquires the overall color average, variance, center of gravity, and shooting information of the image data as the overall feature quantity. Note that these color averages and variances indicate the overall characteristics of the image. The color average, variance, and center of gravity of the entire image data are calculated using the partial feature values calculated previously. For this reason, it is not necessary to re-expand the image data when calculating the entire feature amount, and the calculation speed of the entire feature amount is increased. Although the overall identification process (described later) is performed prior to the partial identification process (described later), the overall feature value is obtained after the partial feature value in order to increase the calculation speed. The shooting information is extracted from the shooting data of the image file. Specifically, information such as the aperture value, shutter speed, and the presence or absence of flash emission is used as the overall feature amount. However, not all shooting data of the image file is used as the entire feature amount.

次に、全体識別器５０が、全体識別処理を行う（Ｓ１０３）。全体識別処理とは、全体特徴量に基づいて、画像データの示す画像のシーンを識別（推定）する処理である。全体識別処理の詳細については、後述する。 Next, the overall classifier 50 performs overall identification processing (S103). The overall identification process is a process for identifying (estimating) an image scene indicated by image data based on the overall feature amount. Details of the overall identification process will be described later.

全体識別処理によってシーンの識別ができる場合（Ｓ１０４でＹＥＳ）、シーン識別部３３は、記憶部３１の結果記憶部３１Ｂに識別結果を記憶することによってシーンを決定し（Ｓ１０９）、シーン識別処理を終了する。つまり、全体識別処理によってシーンの識別ができた場合（Ｓ１０４でＹＥＳ）、部分識別処理や統合識別処理が省略される。これにより、シーン識別処理の速度が速くなる。
全体識別処理によってシーンの識別ができない場合（Ｓ１０４でＮＯ）、次に部分識別器６０が、部分識別処理を行う（Ｓ１０５）。部分識別処理とは、部分特徴量に基づいて、画像データの示す画像全体のシーンを識別する処理である。部分識別処理の詳細については、後述する。 If the scene can be identified by the overall identification process (YES in S104), the scene identification unit 33 determines the scene by storing the identification result in the result storage unit 31B of the storage unit 31 (S109), and performs the scene identification process. finish. That is, when the scene can be identified by the overall identification process (YES in S104), the partial identification process and the integrated identification process are omitted. This increases the speed of the scene identification process.
If the scene cannot be identified by the overall identification process (NO in S104), the partial classifier 60 performs the partial identification process (S105). The partial identification process is a process for identifying the scene of the entire image indicated by the image data based on the partial feature amount. Details of the partial identification processing will be described later.

部分識別処理によってシーンの識別ができる場合（Ｓ１０６でＹＥＳ）、シーン識別部３３は、記憶部３１の結果記憶部３１Ｂに識別結果を記憶することによってシーンを決定し（Ｓ１０９）、シーン識別処理を終了する。つまり、部分識別処理によってシーンの識別ができた場合（Ｓ１０６でＹＥＳ）、統合識別処理が省略される。これにより、シーン識別処理の速度が速くなる。
部分識別処理によってシーンの識別ができない場合（Ｓ１０６でＮＯ）、次に統合識別器７０が、統合識別処理を行う（Ｓ１０７）。統合識別処理の詳細については、後述する。 When the scene can be identified by the partial identification process (YES in S106), the scene identification unit 33 determines the scene by storing the identification result in the result storage unit 31B of the storage unit 31 (S109), and performs the scene identification process. finish. That is, when the scene can be identified by the partial identification process (YES in S106), the integrated identification process is omitted. This increases the speed of the scene identification process.
If the scene cannot be identified by the partial identification process (NO in S106), the integrated discriminator 70 performs the integrated identification process (S107). Details of the integrated identification process will be described later.

統合識別処理によってシーンの識別ができる場合（Ｓ１０８でＹＥＳ）、シーン識別部３３は、記憶部３１の結果記憶部３１Ｂに識別結果を記憶することによってシーンを決定し（Ｓ１０９）、シーン識別処理を終了する。一方、統合識別処理によってシーンの識別ができない場合（Ｓ１０８でＮＯ）、画像データの示す画像が「その他」のシーン（「風景」、「夕景」、「夜景」、「花」又は「紅葉」以外のシーン）である旨の識別結果を結果記憶部３１Ｂに記憶する（Ｓ１１０）。 When the scene can be identified by the integrated identification process (YES in S108), the scene identification unit 33 determines the scene by storing the identification result in the result storage unit 31B of the storage unit 31 (S109), and performs the scene identification process. finish. On the other hand, if the scene cannot be identified by the integrated identification process (NO in S108), the image indicated by the image data is “other” (other than “landscape”, “evening scene”, “night scene”, “flower” or “autumn leaves”. Is stored in the result storage unit 31B (S110).

＝＝＝全体識別処理＝＝＝
図７は、全体識別処理のフロー図である。ここでは図６も参照しながら全体識別処理について説明する。 === Overall identification processing ===
FIG. 7 is a flowchart of the overall identification process. Here, the overall identification process will be described with reference to FIG.

まず、全体識別器５０は、複数のサブ識別器５１の中から１つのサブ識別器５１を選択する（Ｓ２０１）。全体識別器５０には、識別対象となる画像（識別対象画像）が特定のシーンに属するか否かを識別するサブ識別器５１が５つ設けられている。５つのサブ識別器５１は、それぞれ風景、夕景、夜景、花、紅葉のシーンを識別する。ここでは、全体識別器５０は、風景→夕景→夜景→花→紅葉の順に、サブ識別器５１を選択する（なお、サブ識別器５１の選択順序については、後述する）。このため、最初には、識別対象画像が風景のシーンに属するか否かを識別するサブ識別器５１（風景識別器５１Ｌ）が選択される。 First, the overall classifier 50 selects one sub-classifier 51 from the plurality of sub-classifiers 51 (S201). The overall classifier 50 is provided with five sub-classifiers 51 for identifying whether an image to be identified (identification target image) belongs to a specific scene. The five sub classifiers 51 identify scenes of scenery, evening scene, night scene, flowers, and autumn leaves, respectively. Here, the overall classifier 50 selects the sub classifier 51 in the order of landscape → evening scene → night scene → flower → autumn leaves (the selection order of the sub classifier 51 will be described later). For this reason, first, the sub classifier 51 (landscape classifier 51L) for identifying whether or not the classification target image belongs to a landscape scene is selected.

次に、全体識別器５０は、識別対象テーブルを参照し、選択したサブ識別器５１を用いてシーンを識別すべきか否かを判断する（Ｓ２０２）。
図８は、識別対象テーブルの説明図である。この識別対象テーブルは、記憶部３１の結果記憶部３１Ｂに記憶される。識別対象テーブルは、最初の段階では全ての欄がゼロに設定される。Ｓ２０２の処理では、「否定」欄が参照され、ゼロであればＹＥＳと判断され、１であればＮＯと判断される。ここでは、全体識別器５０は、識別対象テーブルにおける「風景」欄の「否定」欄を参照し、ゼロであるのでＹＥＳと判断する。 Next, the overall classifier 50 refers to the classification target table and determines whether or not a scene should be identified using the selected sub-classifier 51 (S202).
FIG. 8 is an explanatory diagram of the identification target table. This identification target table is stored in the result storage unit 31B of the storage unit 31. In the identification target table, all fields are set to zero in the first stage. In the process of S202, the “No” column is referred to, and if it is zero, it is determined as YES, and if it is 1, it is determined as NO. Here, the overall classifier 50 refers to the “No” column in the “Scenery” column in the identification target table, and determines “YES” because it is zero.

次に、サブ識別器５１は、全体特徴量に基づいて、識別対象画像が特定のシーンに属する確率（確信度）を算出する（Ｓ２０３）。本実施形態のサブ識別器５１には、サポートベクタマシン（ＳＶＭ）による識別手法が用いられている。なお、サポートベクタマシンについては、後述する。識別対象画像が特定のシーンに属する場合、サブ識別器５１の判別式は、プラスの値になりやすい。識別対象画像が特定のシーンに属しない場合、サブ識別器５１の判別式は、マイナスの値になりやすい。また、判別式は、識別対象画像が特定のシーンに属する確信度が高いほど、大きい値になる。このため、判別式の値が大きければ、識別対象画像が特定のシーンに属する確率（確信度）が高くなり、判別式の値が小さければ、識別対象画像が特定のシーンに属する確率が低くなる。 Next, the sub classifier 51 calculates the probability (confidence) that the classification target image belongs to a specific scene based on the entire feature amount (S203). For the sub classifier 51 of this embodiment, a classification method using a support vector machine (SVM) is used. The support vector machine will be described later. When the classification target image belongs to a specific scene, the discriminant of the sub classifier 51 tends to be a positive value. When the classification target image does not belong to a specific scene, the discriminant of the sub classifier 51 tends to be a negative value. Further, the discriminant becomes a larger value as the certainty that the identification target image belongs to the specific scene is higher. For this reason, if the discriminant value is large, the probability (confidence) that the identification target image belongs to a specific scene is high, and if the discriminant value is small, the probability that the classification target image belongs to a specific scene is low. .

次に、サブ識別器５１は、判別式の値が肯定閾値より大きいか否かを判断する（Ｓ２０４）。判別式の値が肯定閾値より大きければ、サブ識別器５１は、識別対象画像が特定のシーンに属すると判断することになる。 Next, the sub discriminator 51 determines whether or not the value of the discriminant is larger than the positive threshold (S204). If the value of the discriminant is larger than the positive threshold, the sub discriminator 51 determines that the classification target image belongs to a specific scene.

図９は、全体識別処理の肯定閾値の説明図である。同図において、横軸は肯定閾値を示し、縦軸はRecall又はPrecisionの確率を示す。図１０は、RecallとPrecisionの説明図である。判別式の値が肯定閾値以上の場合には識別結果はPositiveであり、判別式の値が肯定閾値以上でない場合には識別結果はNegativeである。 FIG. 9 is an explanatory diagram of an affirmative threshold value of the overall identification process. In the figure, the horizontal axis indicates an affirmative threshold, and the vertical axis indicates the probability of recall or precision. FIG. 10 is an explanatory diagram of Recall and Precision. If the discriminant value is greater than or equal to the positive threshold, the identification result is Positive. If the discriminant value is not greater than or equal to the positive threshold, the identification result is Negative.

Recallは、再現率や検出率を示すものである。Recallは、特定のシーンの画像の総数に対する、特定のシーンに属すると識別された画像の数の割合である。言い換えると、Recallは、特定のシーンの画像をサブ識別器５１に識別させたときに、サブ識別器５１がPositiveと識別する確率（特定のシーンの画像が特定のシーンに属すると識別される確率）を示すものである。例えば、風景画像を風景識別器５１Ｌに識別させたときに、風景のシーンに属すると風景識別器５１Ｌが識別する確率を示すものである。 Recall indicates the recall rate and detection rate. Recall is the ratio of the number of images identified as belonging to a specific scene to the total number of images of the specific scene. In other words, Recall is the probability that the sub-identifier 51 identifies the image as a positive when the image of the specific scene is identified by the sub-identifier 51 (the probability that the image of the specific scene belongs to the specific scene. ). For example, when a landscape image is identified by the landscape classifier 51L, it indicates the probability that the landscape classifier 51L identifies it as belonging to a landscape scene.

Precisionは、正答率や正解率を示すものである。Precisionは、Positiveと識別された画像の総数に対する、特定のシーンの画像の数の割合である。言い換えると、Precisionは、特定のシーンを識別するサブ識別器５１がPositiveと識別したときに、識別対象の画像が特定のシーンである確率を示すものである。例えば、風景識別器５１Ｌが風景のシーンに属すると識別したときに、その識別した画像が本当に風景画像である確率を示すものである。 Precision indicates the correct answer rate and the correct answer rate. Precision is the ratio of the number of images in a particular scene to the total number of images identified as Positive. In other words, Precision indicates the probability that the image to be identified is a specific scene when the sub-classifier 51 that identifies the specific scene identifies it as Positive. For example, when the landscape classifier 51L identifies that it belongs to a landscape scene, it indicates the probability that the identified image is really a landscape image.

図９から分かる通り、肯定閾値を大きくするほど、Precisionが大きくなる。このため、肯定閾値を大きくするほど、例えば風景のシーンに属すると識別された画像が風景画像である確率が高くなる。つまり、肯定閾値を大きくするほど、誤識別の確率が低くなる。
一方、肯定閾値を大きくするほど、Recallは小さくなる。この結果、例えば、風景画像を風景識別器５１Ｌで識別した場合であっても、風景のシーンに属すると正しく識別しにくくなる。ところで、識別対象画像が風景のシーンに属すると識別できれば（Ｓ２０４でＹＥＳ）、残りの別のシーン（夕景など）の識別を行わないようにして全体識別処理の速度を速めている。このため、肯定閾値を大きくするほど、全体識別処理の速度は低下することになる。また、全体識別処理によってシーンが識別できれば部分識別処理を行わないようにしてシーン識別処理の速度を速めているため（Ｓ１０４）、肯定閾値を大きくするほど、シーン識別処理の速度は低下することになる。
つまり、肯定閾値が小さすぎると誤識別の確率が高くなり、大きすぎると処理速度が低下することになる。本実施形態では、正答率（Precision）を９７．５％に設定するため、風景の肯定閾値は１．２７に設定されている。 As can be seen from FIG. 9, the greater the positive threshold, the greater the Precision. For this reason, the larger the positive threshold value, the higher the probability that an image identified as belonging to a landscape scene, for example, is a landscape image. That is, the greater the positive threshold, the lower the probability of misidentification.
On the other hand, the larger the positive threshold, the smaller the Recall. As a result, for example, even when a landscape image is identified by the landscape classifier 51L, it is difficult to correctly identify it as belonging to a landscape scene. By the way, if the image to be identified can be identified as belonging to a landscape scene (YES in S204), the speed of the overall identification process is increased so as not to identify other remaining scenes (such as sunsets). For this reason, the larger the positive threshold, the lower the overall identification processing speed. Further, if the scene can be identified by the overall identification process, the partial identification process is not performed and the speed of the scene identification process is increased (S104). Therefore, as the positive threshold is increased, the scene identification process speed decreases. Become.
That is, if the positive threshold is too small, the probability of misidentification increases, and if it is too large, the processing speed decreases. In this embodiment, since the correct answer rate (Precision) is set to 97.5%, the landscape affirmation threshold is set to 1.27.

判別式の値が肯定閾値より大きければ（Ｓ２０４でＹＥＳ）、サブ識別器５１は、識別対象画像が特定のシーンに属すると判断し、肯定フラグを立てる（Ｓ２０５）。「肯定フラグを立てる」とは、図８の「肯定」欄を１にすることである。この場合、全体識別器５０は、次のサブ識別器５１による識別を行わずに、全体識別処理を終了する。例えば、風景画像であると識別できれば、夕景などの識別を行わずに、全体識別処理を終了する。この場合、次のサブ識別器５１による識別を省略しているので、全体識別処理の速度を速めることができる。 If the discriminant value is greater than the affirmative threshold value (YES in S204), the sub-classifier 51 determines that the classification target image belongs to a specific scene and sets an affirmative flag (S205). “Set an affirmative flag” means that the “affirmation” column in FIG. In this case, the overall discriminator 50 ends the overall discrimination process without performing discrimination by the next sub discriminator 51. For example, if the image can be identified as a landscape image, the entire identification process is terminated without identifying the sunset scene or the like. In this case, since the identification by the next sub-identifier 51 is omitted, the speed of the overall identification process can be increased.

判別式の値が肯定閾値より大きくなければ（Ｓ２０４でＮＯ）、サブ識別器５１は、識別対象画像が特定のシーンに属すると判断できず、次のＳ２０６の処理を行う。 If the value of the discriminant is not greater than the positive threshold (NO in S204), the sub discriminator 51 cannot determine that the classification target image belongs to a specific scene, and performs the next process of S206.

次に、サブ識別器５１は、判別式の値と否定閾値とを比較する（Ｓ２０６）。これにより、サブ識別器５１は、識別対象画像が所定のシーンに属しないかを判断する。このような判断としては、２種類ある。第１に、ある特定のシーンのサブ識別器５１の判別式の値が第１否定閾値より小さければ、その特定のシーンに識別対象画像が属しないと判断されることになる。例えば、風景識別器５１Ｌの判別式の値が第１否定閾値より小さければ、識別対象画像が風景のシーンに属しないと判断されることになる。第２に、ある特定のシーンのサブ識別器５１の判別式の値が第２否定閾値より大きければ、その特定のシーンとは別のシーンに識別対象画像が属しないと判断されることになる。例えば、風景識別器５１Ｌの判別式の値が第２否定閾値より大きければ、識別対象画像が夜景のシーンに属しないと判断されることになる。 Next, the sub discriminator 51 compares the discriminant value with a negative threshold value (S206). Thereby, the sub classifier 51 determines whether the classification target image does not belong to a predetermined scene. There are two types of such determinations. First, if the value of the discriminant of the sub-identifier 51 of a specific scene is smaller than the first negative threshold, it is determined that the classification target image does not belong to the specific scene. For example, if the discriminant value of the landscape classifier 51L is smaller than the first negative threshold, it is determined that the classification target image does not belong to a landscape scene. Second, if the value of the discriminant of the sub-identifier 51 of a specific scene is larger than the second negative threshold, it is determined that the classification target image does not belong to a scene different from the specific scene. . For example, if the discriminant value of the landscape classifier 51L is larger than the second negative threshold, it is determined that the classification target image does not belong to the night scene.

図１１は、第１否定閾値の説明図である。同図において、横軸は第１否定閾値を示し、縦軸は確率を示す。グラフの太線は、True Negative Recallのグラフであり、風景画像以外の画像を風景画像ではないと正しく識別する確率を示している。グラフの細線は、False Negative Recallのグラフであり、風景画像なのに風景画像ではないと誤って識別する確率を示している。 FIG. 11 is an explanatory diagram of the first negative threshold. In the figure, the horizontal axis indicates the first negative threshold, and the vertical axis indicates the probability. The bold line in the graph is a True Negative Recall graph, and indicates the probability of correctly identifying an image other than a landscape image as not a landscape image. The thin line in the graph is a False Negative Recall graph, which indicates the probability of erroneously identifying a landscape image that is not a landscape image.

図１１から分かる通り、第１否定閾値を小さくするほど、False Negative Recallが小さくなる。このため、第１否定閾値を小さくするほど、例えば風景のシーンに属しないと識別された画像が風景画像である確率が低くなる。つまり、誤識別の確率が低くなる。
一方、第１否定閾値を小さくするほど、True Negative Recallも小さくなる。この結果、風景画像以外の画像を風景画像ではないと識別しにくくなる。その一方、識別対象画像が特定シーンでないことを識別できれば、部分識別処理の際に、その特定シーンのサブ部分識別器６１による処理を省略してシーン識別処理速度を速めている（後述、図１４のＳ３０２）。このため、第１否定閾値を小さくするほど、シーン識別処理速度は低下する。
つまり、第１否定閾値が大きすぎると誤識別の確率が高くなり、小さすぎると処理速度が低下することになる。本実施形態では、False Negative Recallを２．５％に設定するため、第１否定閾値は−１．１０に設定されている。 As can be seen from FIG. 11, the smaller the first negative threshold is, the smaller False Negative Recall is. For this reason, the smaller the first negative threshold, the lower the probability that an image identified as not belonging to a landscape scene is a landscape image, for example. That is, the probability of misidentification is reduced.
On the other hand, the True Negative Recall decreases as the first negative threshold decreases. As a result, it is difficult to identify an image other than a landscape image unless it is a landscape image. On the other hand, if it is possible to identify that the identification target image is not a specific scene, the process by the sub partial classifier 61 for the specific scene is omitted during the partial identification process to increase the scene identification processing speed (described later in FIG. 14). S302). For this reason, the scene identification processing speed decreases as the first negative threshold is decreased.
That is, if the first negative threshold is too large, the probability of misidentification increases, and if it is too small, the processing speed decreases. In the present embodiment, in order to set False Negative Recall to 2.5%, the first negative threshold is set to -1.10.

ところで、ある画像が風景のシーンに属する確率が高ければ、必然的にその画像が夜景のシーンに属する確率は低くなる。このため、風景識別器５１Ｌの判別式の値が大きい場合には、夜景ではないと識別できる場合がある。このような識別を行うために、第２否定閾値が設けられる。 By the way, if the probability that an image belongs to a landscape scene is high, the probability that the image belongs to a night scene is inevitably low. For this reason, when the discriminant value of the landscape discriminator 51L is large, it may be identified that the scene is not a night scene. In order to perform such identification, a second negative threshold is provided.

図１２は、第２否定閾値の説明図である。同図において、横軸は風景の判別式の値を示し、縦軸は確率を示す。同図には、図９のRecallとPrecisionのグラフとともに、夜景のRecallのグラフが点線で描かれている。この点線のグラフに注目すると、風景の判別式の値が−０．４５よりも大きければ、その画像が夜景画像である確率は２．５％である。言い換えると、風景の判別式の値が−０．４５より大きい場合にその画像が夜景画像でないと識別しても、誤識別の確率は２．５％にすぎない。そこで、本実施形態では、第２否定閾値が−０．４５に設定されている。 FIG. 12 is an explanatory diagram of the second negative threshold. In the figure, the horizontal axis indicates the value of the landscape discriminant, and the vertical axis indicates the probability. In the same figure, the Recall graph of the night view is drawn with a dotted line together with the Recall and Precision graph of FIG. If attention is paid to this dotted line graph, if the value of the discriminant of the landscape is larger than −0.45, the probability that the image is a night scene image is 2.5%. In other words, if the landscape discriminant value is greater than −0.45, even if the image is identified as not a night scene image, the probability of misidentification is only 2.5%. Therefore, in the present embodiment, the second negative threshold is set to −0.45.

そして、判別式の値が第１否定閾値より小さい場合、又は、判別式の値が第２否定閾値より大きい場合（Ｓ２０６でＹＥＳ）、サブ識別器５１は、識別対象画像が所定のシーンに属しないと判断し、否定フラグを立てる（Ｓ２０７）。「否定フラグを立てる」とは、図８の「否定」欄を１にすることである。例えば、第１否定閾値に基づいて識別対象画像が風景のシーンに属しないと判断された場合、「風景」欄の「否定」欄が１になる。また、第２否定閾値に基づいて識別対象画像が夜景のシーンに属しないと判断された場合、「夜景」欄の「否定」欄が１になる。 When the discriminant value is smaller than the first negative threshold value, or when the discriminant value is larger than the second negative threshold value (YES in S206), the sub-classifier 51 determines that the classification target image belongs to a predetermined scene. It is determined not to do so, and a negative flag is set (S207). “Set a negative flag” means to set the “No” column in FIG. For example, when it is determined that the image to be identified does not belong to a landscape scene based on the first negative threshold, the “denial” column in the “landscape” column is 1. Further, when it is determined that the identification target image does not belong to the night scene based on the second negative threshold, the “Negation” field in the “Night scene” field is “1”.

図１３Ａは、閾値テーブルの説明図である。この閾値テーブルは、記憶部３１に記憶されていても良いし、全体識別処理を実行させるためのプログラムの一部に組み込まれていても良い。閾値テーブルには、前述の肯定閾値や否定閾値に関するデータが格納されている。 FIG. 13A is an explanatory diagram of a threshold table. The threshold value table may be stored in the storage unit 31 or may be incorporated in a part of a program for executing the overall identification process. The threshold table stores data related to the affirmative threshold and the negative threshold described above.

図１３Ｂは、上記で説明した風景識別器５１Ｌにおける閾値の説明図である。風景識別器５１Ｌには、肯定閾値及び否定閾値が予め設定されている。肯定閾値として１．２７が設定されている。否定閾値には第１否定閾値と第２否定閾値とがある。第１否定閾値として−１．１０が設定されている。また、第２否定閾値として、風景以外の各シーンにそれぞれ値が設定されている。 FIG. 13B is an explanatory diagram of threshold values in the landscape classifier 51L described above. An affirmative threshold value and a negative threshold value are preset in the landscape discriminator 51L. 1.27 is set as the positive threshold. The negative threshold includes a first negative threshold and a second negative threshold. -1.10 is set as the first negative threshold. In addition, a value is set for each scene other than the landscape as the second negative threshold.

図１３Ｃは、上記で説明した風景識別器５１Ｌの処理の概要の説明図である。ここでは、説明の簡略化のため、第２否定閾値については夜景についてのみ説明する。風景識別器５１Ｌは、判別式の値が１．２７より大きければ（Ｓ２０４でＹＥＳ）、識別対象画像が風景のシーンに属すると判断する。また、判別式の値が１．２７以下であり（Ｓ２０４でＮＯ）、−０．４５より大きければ（Ｓ２０６でＹＥＳ）、風景識別器５１Ｌは、識別対象画像が夜景のシーンに属しないと判断する。また、判別式の値が−１．１０より小さければ（Ｓ２０６でＹＥＳ）、風景識別器５１Ｌは、識別対象画像が風景のシーンに属しないと判断する。なお、風景識別器５１Ｌは、夕景や花や紅葉についても、第２否定閾値に基づいて、識別対象画像がそのシーンに属しないかを判断する。但し、これらの第２否定閾値は肯定閾値よりも大きいため、識別対象画像がこれらのシーンに属しないことを風景識別器５１Ｌが判断することはない。 FIG. 13C is an explanatory diagram outlining the processing of the landscape classifier 51L described above. Here, for simplification of description, only the night view will be described for the second negative threshold. If the discriminant value is greater than 1.27 (YES in S204), the landscape classifier 51L determines that the classification target image belongs to a landscape scene. If the discriminant value is 1.27 or less (NO in S204) and is greater than −0.45 (YES in S206), the landscape classifier 51L determines that the classification target image does not belong to the night scene. To do. If the value of the discriminant is smaller than −1.10 (YES in S206), the landscape classifier 51L determines that the classification target image does not belong to a landscape scene. Note that the landscape discriminator 51L also determines whether the image to be identified does not belong to the scene based on the second negative threshold for sunset scenes, flowers, and autumn leaves. However, since these second negative threshold values are larger than the positive threshold values, the landscape discriminator 51L does not determine that the classification target image does not belong to these scenes.

Ｓ２０２においてＮＯの場合、Ｓ２０６でＮＯの場合、又はＳ２０７の処理を終えた場合、全体識別器５０は、次のサブ識別器５１の有無を判断する（Ｓ２０８）。ここでは風景識別器５１Ｌによる処理を終えた後なので、全体識別器５０は、Ｓ２０８において、次のサブ識別器５１（夕景識別器５１Ｓ）があると判断する。 In the case of NO in S202, in the case of NO in S206, or when the processing in S207 is completed, the overall discriminator 50 determines the presence or absence of the next sub discriminator 51 (S208). Here, since the process by the landscape classifier 51L is finished, the overall classifier 50 determines in S208 that there is a next sub-classifier 51 (evening scene classifier 51S).

そして、Ｓ２０５の処理を終えた場合（識別対象画像が特定のシーンに属すると判断された場合）、又は、Ｓ２０８において次のサブ識別器５１がないと判断された場合（識別対象画像が特定のシーンに属すると判断できなかった場合）、全体識別器５０は、全体識別処理を終了する。 Then, when the process of S205 is finished (when it is determined that the identification target image belongs to a specific scene), or when it is determined in S208 that there is no next sub-classifier 51 (the identification target image is a specific image). When it cannot be determined that the scene belongs to the scene), the overall discriminator 50 ends the overall discrimination process.

なお、既に説明した通り、全体識別処理が終了すると、シーン識別部３３は、全体識別処理によってシーンの識別ができたか否かを判断する（図５のＳ１０４）。このとき、シーン識別部３３は、図８の識別対象テーブルを参照し、「肯定」欄に１があるか否かを判断することになる。 As already described, when the overall identification process is completed, the scene identification unit 33 determines whether or not the scene has been identified by the overall identification process (S104 in FIG. 5). At this time, the scene identification unit 33 refers to the identification target table in FIG. 8 and determines whether or not there is 1 in the “affirmation” column.

全体識別処理によってシーンの識別ができた場合（Ｓ１０４でＹＥＳ）、部分識別処理や統合識別処理が省略される。これにより、シーン識別処理の速度が速くなる。 If the scene can be identified by the overall identification process (YES in S104), the partial identification process and the integrated identification process are omitted. This increases the speed of the scene identification process.

ところで、上記の説明には無いが、全体識別器５０は、サブ識別器５１によって判別式の値を算出したときには、判別式の値に対応するPrecisionを、確信度に関する情報として結果記憶部３１Ｂに記憶する。もちろん、判別式の値そのものを確信度に関する情報として記憶しても良い。 By the way, although not described above, the overall discriminator 50, when the sub discriminator 51 calculates the discriminant value, the Precision corresponding to the discriminant value is stored in the result storage unit 31B as information on the certainty factor. Remember. Of course, the discriminant value itself may be stored as information on the certainty factor.

＝＝＝部分識別処理＝＝＝
図１４は、部分識別処理のフロー図である。部分識別処理は、全体識別処理によってシーンの識別ができなかった場合（図５のＳ１０４でＮＯ）に行われる。以下に説明するように、部分識別処理は、分割された部分画像のシーンをそれぞれ識別することによって、画像全体のシーンを識別する処理である。ここでは図６も参照しながら部分識別処理について説明する。 === Partial identification processing ===
FIG. 14 is a flowchart of the partial identification process. The partial identification process is performed when the scene cannot be identified by the overall identification process (NO in S104 of FIG. 5). As will be described below, the partial identification process is a process for identifying the scene of the entire image by identifying each scene of the divided partial images. Here, the partial identification process will be described with reference to FIG.

まず、部分識別器６０は、複数のサブ部分識別器６１の中から１つのサブ部分識別器６１を選択する（Ｓ３０１）。部分識別器６０には、サブ部分識別器６１が３つ設けられている。各サブ部分識別器６１は、８×８の６４ブロックに分割された部分画像がそれぞれ特定のシーンに属するか否かを識別する。ここでの３つのサブ部分識別器６１は、それぞれ夕景、花、紅葉のシーンを識別する。ここでは、部分識別器６０は、夕景→花→紅葉の順に、サブ部分識別器６１を選択する（なお、サブ部分識別器６１の選択順序については、後述する）。このため、最初には、部分画像が夕景のシーンに属するか否かを識別するサブ部分識別器６１（夕景部分識別器６１Ｓ）が選択される。 First, the partial classifier 60 selects one sub partial classifier 61 from the plurality of sub partial classifiers 61 (S301). The partial discriminator 60 is provided with three sub partial discriminators 61. Each sub partial discriminator 61 discriminates whether or not each partial image divided into 8 × 8 64 blocks belongs to a specific scene. Here, the three sub partial classifiers 61 identify the scenes of sunset, flowers, and autumn leaves, respectively. Here, the partial discriminator 60 selects the sub partial discriminator 61 in the order of sunset scene → flower → autumn leaves (the selection order of the sub partial discriminator 61 will be described later). Therefore, first, the sub partial classifier 61 (evening scene partial classifier 61S) for identifying whether or not the partial image belongs to the sunset scene is selected.

次に、部分識別器６０は、識別対象テーブル（図８）を参照し、選択したサブ部分識別器６１を用いてシーンを識別すべきか否かを判断する（Ｓ３０２）。ここでは、部分識別器６０は、識別対象テーブルにおける「夕景」欄の「否定」欄を参照し、ゼロであればＹＥＳと判断し、１であればＮＯと判断する。なお、全体識別処理の際に、夕景識別器５１Ｓが第１否定閾値により否定フラグを立てたとき、又は、他のサブ識別器５１が第２否定閾値により否定フラグを立てたとき、このＳ３０２でＮＯと判断される。仮にＮＯと判断されると夕景の部分識別処理は省略されることになるので、部分識別処理の速度が速くなる。但し、ここでは説明の都合上、ＹＥＳと判断されるものとする。 Next, the partial discriminator 60 refers to the discrimination target table (FIG. 8) and determines whether or not the scene should be discriminated using the selected sub partial discriminator 61 (S302). Here, the partial discriminator 60 refers to the “No” column of the “Evening Scene” column in the classification target table, and determines YES if it is zero, and NO if it is 1. When the evening scene classifier 51S sets a negative flag with the first negative threshold during the overall identification process or when another sub-classifier 51 sets a negative flag with the second negative threshold, in S302 It is judged as NO. If it is determined NO, the sunset partial identification process is omitted, and the partial identification process speed increases. However, for the convenience of explanation, it is assumed that YES is determined here.

次に、サブ部分識別器６１は、８×８の６４ブロックに分割された部分画像の中から、１つの部分画像を選択する（Ｓ３０３）。
図１５は、夕景部分識別器６１Ｓが選択する部分画像の順番の説明図である。部分画像から画像全体のシーンを識別するような場合、識別に用いられる部分画像は、被写体が存在する部分であることが望ましい。そこで、本実施形態では、数千枚のサンプルの夕景画像を用意し、各夕景画像を８×８の６４ブロックに分割し、夕景部分画像（夕景の太陽と空の部分画像）を含むブロックを抽出し、抽出されたブロックの位置に基づいて各ブロックにおける夕景部分画像の存在確率を算出した。そして、本実施形態では、存在確率の高いブロックから順番に、部分画像が選択される。なお、図に示す選択順序の情報は、プログラムの一部としてメモリ２３に格納されている。 Next, the sub partial discriminator 61 selects one partial image from the partial images divided into 8 × 8 64 blocks (S303).
FIG. 15 is an explanatory diagram of the order of partial images selected by the evening scene partial classifier 61S. When a scene of the entire image is identified from the partial image, it is desirable that the partial image used for identification is a portion where the subject exists. Therefore, in this embodiment, thousands of samples of sunset scene images are prepared, each sunset scene image is divided into 64 blocks of 8 × 8, and blocks including sunset scene partial images (sun and sky partial images of the sunset scene) are included. The presence probability of the sunset partial image in each block was calculated based on the extracted block position. And in this embodiment, a partial image is selected in an order from a block with a high existence probability. Note that the selection order information shown in the figure is stored in the memory 23 as part of the program.

なお、夕景画像の場合、画像の中央付近から上半分に夕景の空が広がっていることが多いため、中央付近から上半分のブロックにおいて存在確率が高くなる。また、夕景画像の場合、画像の下１／３では逆光で陰になり、部分画像単体では夕景か夜景か区別がつかないことが多いため、下１／３のブロックにおいて存在確率が低くなる。花画像の場合、花を中央付近に配置させる構図にすることが多いため、中央付近における花部分画像の存在確率が高くなる。 In the case of an evening scene image, since the sky of the evening scene often spreads from the vicinity of the center to the upper half, the existence probability increases in the upper half block from the vicinity of the center. In the case of an evening scene image, the lower 1/3 of the image is shaded by backlight, and the partial image alone often cannot be distinguished from the evening scene or the night scene, so the existence probability is lower in the lower 1/3 block. In the case of a flower image, since the composition is often such that a flower is arranged near the center, the probability of existence of a flower partial image near the center increases.

次に、サブ部分識別器６１は、選択された部分画像の部分特徴量に基づいて、その部分画像が特定のシーンに属するか否かを判断する（Ｓ３０４）。サブ部分識別器６１には、全体識別器５０のサブ識別器５１と同様に、サポートベクタマシン（ＳＶＭ）による判別手法が用いられている。なお、サポートベクタマシンについては、後述する。判別式の値が正の値であれば、部分画像が特定のシーンに属すると判断し、サブ部分識別器６１は正カウント値をインクリメントする。また、判別式の値が負の値であれば、部分画像が特定のシーンに属しないと判断し、サブ部分識別器６１は負カウント値をインクリメントする。 Next, the sub partial classifier 61 determines whether or not the partial image belongs to a specific scene based on the partial feature amount of the selected partial image (S304). Similar to the sub classifier 51 of the overall classifier 50, the sub partial classifier 61 uses a discrimination method using a support vector machine (SVM). The support vector machine will be described later. If the discriminant value is a positive value, it is determined that the partial image belongs to a specific scene, and the sub partial classifier 61 increments the positive count value. If the discriminant value is a negative value, it is determined that the partial image does not belong to a specific scene, and the sub partial discriminator 61 increments the negative count value.

次に、サブ部分識別器６１は、正カウント値が肯定閾値よりも大きい否かを判断する（Ｓ３０５）。なお、正カウント値は、特定のシーンに属すると判断された部分画像の数を示すものである。正カウント値が肯定閾値より大きければ（Ｓ３０５でＹＥＳ）、サブ部分識別器６１は、識別対象画像が特定のシーンに属すると判断し、肯定フラグを立てる（Ｓ３０６）。この場合、部分識別器６０は、次のサブ部分識別器６１による識別を行わずに、部分識別処理を終了する。例えば、夕景画像であると識別できれば、花や紅葉の識別を行わずに、部分識別処理を終了する。この場合、次のサブ部分識別器６１による識別を省略しているので、部分識別処理の速度を速めることができる。 Next, the sub partial discriminator 61 determines whether or not the positive count value is larger than the positive threshold value (S305). The positive count value indicates the number of partial images determined to belong to a specific scene. If the positive count value is larger than the affirmative threshold (YES in S305), the sub partial classifier 61 determines that the classification target image belongs to a specific scene, and sets an affirmative flag (S306). In this case, the partial discriminator 60 ends the partial discriminating process without performing discrimination by the next sub partial discriminator 61. For example, if the image can be identified as an evening scene image, the partial identification process is terminated without identifying flowers and autumn leaves. In this case, since the identification by the next sub partial classifier 61 is omitted, the speed of the partial classification process can be increased.

正カウント値が肯定閾値より大きくなければ（Ｓ３０５でＮＯ）、サブ部分識別器６１は、識別対象画像が特定のシーンに属すると判断できず、次のＳ３０７の処理を行う。 If the positive count value is not greater than the positive threshold value (NO in S305), the sub partial classifier 61 cannot determine that the classification target image belongs to a specific scene, and performs the next process of S307.

サブ部分識別器６１は、正カウント値と残りの部分画像数との和が肯定閾値よりも小さければ（Ｓ３０７でＹＥＳ）、Ｓ３０９の処理へ進む。正カウント値と残りの部分画像数との和が肯定閾値よりも小さい場合、残り全ての部分画像によって正カウント値がインクリメントされても正カウント値が肯定閾値より大きくなることがないので、Ｓ３０９に処理を進めることによって、残りの部分画像についてサポートベクタマシンによる識別を省略する。これにより、部分識別処理の速度を速めることができる。 If the sum of the positive count value and the number of remaining partial images is smaller than the positive threshold (YES in S307), the sub partial discriminator 61 proceeds to the process of S309. If the sum of the positive count value and the number of remaining partial images is smaller than the positive threshold value, the positive count value does not become larger than the positive threshold value even if the positive count value is incremented by all the remaining partial images. By proceeding with the process, the remaining partial images are not identified by the support vector machine. Thereby, the speed of the partial identification process can be increased.

サブ部分識別器６１がＳ３０７でＮＯと判断した場合、サブ部分識別器６１は、次の部分画像の有無を判断する（Ｓ３０８）。なお、本実施形態では、６４個に分割された部分画像の全てを順に選択していない。図１５において太枠で示された上位１０番目までの１０個の部分画像だけを順に選択している。このため、１０番目の部分画像の識別を終えれば、サブ部分識別器６１は、Ｓ３０８において次の部分画像はないと判断する。（この点を考慮して、Ｓ３０７の「残りの部分画像数」も決定される。）
図１６は、上位１０番目までの１０個の部分画像だけで夕景画像の識別をしたときのRecall及びPrecisionのグラフである。図に示すような肯定閾値を設定すれば、正答率（Precision）を８０％程度に設定でき、再現率（Recall）を９０％程度に設定でき、精度の高い識別が可能である。 If the sub partial discriminator 61 determines NO in S307, the sub partial discriminator 61 determines whether there is a next partial image (S308). In the present embodiment, not all of the partial images divided into 64 are selected in order. In FIG. 15, only the top 10 partial images indicated by thick frames are selected in order. Therefore, when the identification of the tenth partial image is completed, the sub partial classifier 61 determines in S308 that there is no next partial image. (In consideration of this point, the “number of remaining partial images” in S307 is also determined.)
FIG. 16 is a Recall and Precision graph when an evening scene image is identified using only the top 10 partial images. If an affirmative threshold as shown in the figure is set, the accuracy rate (Precision) can be set to about 80%, the recall rate (Recall) can be set to about 90%, and identification with high accuracy is possible.

本実施形態では、１０個の部分画像だけで夕景画像の識別を行っている。このため、本実施形態では、６４個の全ての部分画像を用いて夕景画像の識別を行うよりも、部分識別処理の速度を速めることができる。
また、本実施形態では、夕景部分画像の存在確率の高い上位１０番目の部分画像を用いて夕景画像の識別を行っている。このため、本実施形態では、存在確率を無視して抽出された１０個の部分画像を用いて夕景画像の識別を行うよりも、Recall及びPrecisionをともに高く設定することが可能になる。
また、本実施形態では、夕景部分画像の存在確率の高い順に部分画像を選択している。この結果、早い段階でＳ３０５の判断がＹＥＳになりやすくなる。このため、本実施形態では、存在確率の高低を無視した順で部分画像を選択したときよりも、部分識別処理の速度を速めることができる。 In this embodiment, the evening scene image is identified using only 10 partial images. For this reason, in the present embodiment, it is possible to increase the speed of the partial identification process compared to the case where the evening scene image is identified using all 64 partial images.
In this embodiment, the sunset scene image is identified using the top tenth partial image having a high existence probability of the sunset scene partial image. For this reason, in the present embodiment, it is possible to set both Recall and Precision higher than the identification of an evening scene image using 10 partial images extracted by ignoring the existence probability.
In this embodiment, the partial images are selected in descending order of the existence probability of the sunset partial image. As a result, the determination in S305 is likely to be YES at an early stage. For this reason, in the present embodiment, the speed of the partial identification process can be increased as compared with the case where the partial images are selected in the order in which the presence probability level is ignored.

Ｓ３０７においてＹＥＳと判断された場合、又は、Ｓ３０８において次の部分画像がないと判断された場合、サブ部分識別器６１は、負カウント値が否定閾値よりも大きいか否かを判断する（Ｓ３０９）。この否定閾値は、前述の全体識別処理における否定閾値（図７のＳ２０６）とほぼ同様の機能を果たすものなので、詳しい説明は省略する。Ｓ３０９でＹＥＳと判断された場合、図７のＳ２０７と同様に、否定フラグを立てる。 When it is determined YES in S307, or when it is determined that there is no next partial image in S308, the sub partial discriminator 61 determines whether or not the negative count value is larger than the negative threshold (S309). . Since this negative threshold performs substantially the same function as the negative threshold (S206 in FIG. 7) in the above-described overall identification process, detailed description thereof is omitted. When YES is determined in S309, a negative flag is set as in S207 of FIG.

Ｓ３０２においてＮＯの場合、Ｓ３０９でＮＯの場合、又はＳ３１０の処理を終えた場合、部分識別器６０は、次のサブ部分識別器６１の有無を判断する（Ｓ３１１）。夕景部分識別器６１Ｓによる処理を終えた後の場合、サブ部分識別器６１として花部分識別器６１Ｆや紅葉部分識別器６１Ｒがまだあるので、部分識別器６０は、Ｓ３１１において、次のサブ部分識別器６１があると判断する。 In the case of NO in S302, in the case of NO in S309, or when the process of S310 is completed, the partial discriminator 60 determines whether or not there is a next sub partial discriminator 61 (S311). In the case after the processing by the evening scene partial classifier 61S is finished, since the flower partial classifier 61F and the autumnal leaves partial classifier 61R are still present as the sub partial classifier 61, the partial classifier 60 determines the next sub partial classifier in S311. It is determined that there is a container 61.

そして、Ｓ３０６の処理を終えた場合（識別対象画像が特定のシーンに属すると判断された場合）、又は、Ｓ３１１において次のサブ部分識別器６１がないと判断された場合（識別対象画像が特定のシーンに属すると判断できなかった場合）、部分識別器６０は、部分識別処理を終了する。 Then, when the process of S306 is completed (when it is determined that the identification target image belongs to a specific scene), or when it is determined in S311 that there is no next sub partial classifier 61 (the identification target image is specified). If it cannot be determined that the scene belongs to the scene), the partial discriminator 60 ends the partial discrimination processing.

なお、既に説明した通り、部分識別処理が終了すると、シーン識別部３３は、部分識別処理によってシーンの識別ができたか否かを判断する（図５のＳ１０６）。このとき、シーン識別部３３は、図８の識別対象テーブルを参照し、「肯定」欄に１があるか否かを判断することになる。
部分識別処理によってシーンの識別ができた場合（Ｓ１０６でＹＥＳ）、統合識別処理が省略される。これにより、シーン識別処理の速度が速くなる。 As already described, when the partial identification process is completed, the scene identification unit 33 determines whether or not the scene has been identified by the partial identification process (S106 in FIG. 5). At this time, the scene identification unit 33 refers to the identification target table in FIG. 8 and determines whether or not there is 1 in the “affirmation” column.
When the scene can be identified by the partial identification process (YES in S106), the integrated identification process is omitted. This increases the speed of the scene identification process.

ところで、上記の説明では、夕景部分識別器６１Ｓは、１０個の部分画像を用いて夕景画像の識別を行っているが、識別に用いられる部分画像の数は１０個に限られるものではない。また、他のサブ部分識別器６１が、夕景部分識別器６１Ｓとは異なる数の部分画像を用いて画像を識別しても良い。本実施形態では、花部分識別器６１Ｆは２０個の部分画像を用いて花画像を識別し、また、紅葉部分識別器６１Ｒは１５個の部分画像を用いて紅葉画像を識別するものとする。 In the above description, the evening scene partial classifier 61S identifies evening scene images using ten partial images. However, the number of partial images used for identification is not limited to ten. Further, the other sub partial classifier 61 may identify an image using a different number of partial images from the sunset scene partial classifier 61S. In the present embodiment, it is assumed that the flower partial discriminator 61F identifies a flower image using 20 partial images, and the autumnal foliage partial discriminator 61R identifies a autumnal foliage image using 15 partial images.

＝＝＝サポートベクタマシン＝＝＝
統合識別処理について説明する前に、全体識別処理のサブ識別器５１や部分識別処理のサブ部分識別器６１において用いられているサポートベクタマシン（ＳＶＭ）について説明する。 === Support vector machine ===
Before describing the integrated identification process, the support vector machine (SVM) used in the sub-identifier 51 for the overall identification process and the sub-partial identifier 61 for the partial identification process will be described.

図１７Ａは、線形サポートベクタマシンによる判別の説明図である。ここでは、２つの特徴量ｘ１、ｘ２によって、学習用サンプルを２次元空間に示している。学習用サンプルは２つのクラスＡ、Ｂに分けられている。図中では、クラスＡに属するサンプルは丸で示されており、クラスＢに属するサンプルは四角で示されている。
学習用サンプルを用いた学習によって、２次元空間を２つに分ける境界が定義される。境界は、＜ｗ・ｘ＞＋ｂ＝０で定義される（なお、ｘ＝（ｘ１，ｘ２）であり、ｗは重みベクトルであり、＜ｗ・ｘ＞はｗとｘの内積である）。但し、境界は、マージンが最大になるように、学習用サンプルを用いた学習によって定義される。つまり、図の場合、境界は、太点線ではなく、太実線のようになる。
判別は、判別式ｆ（ｘ）＝＜ｗ・ｘ＞＋ｂを用いて行われる。ある入力ｘ（この入力ｘは学習用サンプルとは別である）について、ｆ（ｘ）＞０であればクラスＡに属すると判別され、ｆ（ｘ）＜０であればクラスＢに属すると判別される。 FIG. 17A is an explanatory diagram of determination by the linear support vector machine. Here, the learning sample is shown in a two-dimensional space by two feature amounts x1 and x2. The learning sample is divided into two classes A and B. In the figure, samples belonging to class A are indicated by circles, and samples belonging to class B are indicated by squares.
A boundary that divides the two-dimensional space into two is defined by learning using the learning sample. The boundary is defined by <w · x> + b = 0 (where x = (x1, x2), w is a weight vector, and <w · x> is an inner product of w and x). However, the boundary is defined by learning using a learning sample so that the margin is maximized. That is, in the case of the figure, the boundary is not a thick dotted line but a thick solid line.
The discrimination is performed using the discriminant f (x) = <w · x> + b. It is determined that a certain input x (this input x is different from the learning sample) belongs to class A if f (x)> 0, and belongs to class B if f (x) <0. Determined.

ここでは２次元空間を用いて説明しているが、これに限られない（つまり、特徴量は２以上でも良い）。この場合、境界は超平面で定義される。 Here, the description is made using a two-dimensional space, but the present invention is not limited to this (that is, the feature amount may be two or more). In this case, the boundary is defined by a hyperplane.

ところで、２つのクラスに線形関数で分離できないことがある。このような場合に線形サポートベクタマシンによる判別を行うと、判別結果の精度が低下する。そこで、入力空間の特徴量を非線形変換すれば、すなわち入力空間からある特徴空間へ非線形写像すれば、特徴空間において線形関数で分離することができるようになる。非線形サポートベクタマシンでは、これを利用している。 By the way, there are cases where the two classes cannot be separated by a linear function. In such a case, if the determination is performed by the linear support vector machine, the accuracy of the determination result is lowered. Therefore, if the feature quantity of the input space is nonlinearly transformed, that is, if the input space is nonlinearly mapped to a certain feature space, it can be separated by a linear function in the feature space. This is used in the nonlinear support vector machine.

図１７Ｂは、カーネル関数を用いた判別の説明図である。ここでは、２つの特徴量ｘ１、ｘ２によって、学習用サンプルを２次元空間に示している。図１７Ｂの入力空間からの非線形写像が図１７Ａのような特徴空間になれば、線形関数で２つのクラスに分離することが可能になる。この特徴空間においてマージンが最大になるように境界が定義されれば、特徴空間における境界の逆写像が、図１７Ｂに示す境界になる。この結果、図１７Ｂに示すように、境界は非線形になる。 FIG. 17B is an explanatory diagram of discrimination using a kernel function. Here, the learning sample is shown in a two-dimensional space by two feature amounts x1 and x2. If the nonlinear mapping from the input space of FIG. 17B becomes a feature space as shown in FIG. 17A, it can be separated into two classes by a linear function. If the boundary is defined so that the margin is maximized in this feature space, the inverse mapping of the boundary in the feature space becomes the boundary shown in FIG. 17B. As a result, the boundary becomes nonlinear as shown in FIG. 17B.

本実施形態ではガウスカーネルを利用することにより、判別式ｆ（ｘ）は次式のようになる（なお、Ｍは特徴量の数であり、Ｎは学習用サンプルの数（若しくは境界に寄与する学習用サンプルの数）であり、ｗ_ｉは重み係数であり、ｙ_ｊは学習用サンプルの特徴量であり、ｘ_ｊは入力ｘの特徴量である）。

In this embodiment, by using a Gaussian kernel, the discriminant f (x) becomes as follows (where M is the number of features and N is the number of learning samples (or contributes to the boundary): The number of learning samples), w _i is a weighting factor, y _j is the feature quantity of the learning sample, and x _j is the feature quantity of the input x).

ある入力ｘ（この入力ｘは学習用サンプルとは別である）について、ｆ（ｘ）＞０であればクラスＡに属すると判別され、ｆ（ｘ）＜０であればクラスＢに属すると判別される。また、判別式ｆ（ｘ）の値が大きい値になるほど、入力ｘ（この入力ｘは学習用サンプルとは別である）がクラスＡに属する確率が高くなる。逆に、判別式ｆ（ｘ）の値が小さい値になるほど、入力ｘ（この入力ｘは学習用サンプルとは別である）がクラスＡに属する確率が低くなる。 It is determined that a certain input x (this input x is different from the learning sample) belongs to class A if f (x)> 0, and belongs to class B if f (x) <0. Determined. Further, the larger the value of the discriminant f (x), the higher the probability that the input x (this input x is different from the learning sample) belongs to the class A. On the contrary, the smaller the value of the discriminant f (x), the lower the probability that the input x (this input x is different from the learning sample) belongs to the class A.

前述の全体識別処理のサブ識別器５１や部分識別処理のサブ部分識別器６１では、上記のサポートベクタマシンの判別式ｆ（ｘ）の値を用いている。サポートベクタマシンによる判別式ｆ（ｘ）の値の算出には、学習用サンプルの数（本実施形態では数万個）が多くなると時間がかかる。このため、判別式ｆ（ｘ）の値を複数回算出する必要があるサブ部分識別器６１は、判別式ｆ（ｘ）の値を１回算出すれば済むサブ識別器５１よりも、処理時間がかかる。 In the sub-identifier 51 for the overall identification process and the sub-partial identifier 61 for the partial identification process, the value of the discriminant f (x) of the support vector machine is used. The calculation of the value of the discriminant f (x) by the support vector machine takes time when the number of learning samples (in this embodiment, tens of thousands) increases. For this reason, the sub partial classifier 61 that needs to calculate the value of the discriminant f (x) a plurality of times requires more processing time than the sub classifier 51 that only needs to calculate the value of the discriminant f (x) once. It takes.

なお、学習用サンプルとは別に評価用サンプルが用意されている。前述のRecallやPrecisionのグラフは、評価用サンプルに対する識別結果に基づくものである。 An evaluation sample is prepared separately from the learning sample. The above Recall and Precision graphs are based on the identification results for the evaluation samples.

＝＝＝統合識別処理＝＝＝
前述の全体識別処理や部分識別処理では、サブ識別器５１やサブ部分識別器６１における肯定閾値を比較的高めに設定し、Precision（正解率）を高めに設定している。なぜならば、例えば全体識別器の風景識別器５１Ｌの正解率が低く設定されると、風景識別器５１Ｌが紅葉画像を風景画像であると誤識別してしまい、紅葉識別器５１Ｒによる識別を行う前に全体識別処理を終えてしまう事態が発生してしまうからである。本実施形態では、Precision（正解率）が高めに設定されることにより、特定のシーンに属する画像が特定のシーンのサブ識別器５１（又はサブ部分識別器６１）に識別されるようになる（例えば紅葉画像が紅葉識別器５１Ｒ（又は紅葉部分識別器６１Ｒ）によって識別されるようになる）。 === Integrated identification processing ===
In the above-described overall identification process and partial identification process, the positive threshold value in the sub-classifier 51 and the sub-classifier 61 is set relatively high, and the Precision (correct answer rate) is set high. This is because, for example, if the accuracy rate of the landscape classifier 51L of the overall classifier is set low, the scene classifier 51L erroneously identifies the autumnal image as a landscape image, and before the autumnal classifier 51R performs the classification. This is because a situation occurs in which the entire identification process ends. In the present embodiment, by setting the Precision (accuracy rate) high, an image belonging to a specific scene is identified by the sub-classifier 51 (or sub-partial classifier 61) of the specific scene ( For example, the autumnal leaves image is identified by the autumnal leaves discriminator 51R (or the autumnal leaf partial discriminator 61R).

但し、全体識別処理や部分識別処理のPrecision（正解率）を高めに設定すると、全体識別処理や部分識別処理ではシーンの識別ができなくなる可能性が高くなる。そこで、本実施形態では、全体識別処理及び部分識別処理によってシーンの識別ができなかった場合、以下に説明する統合識別処理が行われる。 However, if the Precision (accuracy rate) of the overall identification process or the partial identification process is set to be high, there is a high possibility that the scene cannot be identified by the overall identification process or the partial identification process. Therefore, in this embodiment, when the scene cannot be identified by the overall identification process and the partial identification process, the integrated identification process described below is performed.

図１８は、統合識別処理のフロー図である。以下に説明するように、統合識別処理は、全体識別処理の各サブ識別器５１の判別式の値に基づいて、最も確信度の高いシーンを選択する処理である。 FIG. 18 is a flowchart of the integrated identification process. As will be described below, the integrated identification process is a process of selecting a scene with the highest certainty factor based on the discriminant value of each sub-classifier 51 in the overall identification process.

まず、統合識別器７０は、５つのサブ識別器５１の判別式の値に基づいて、正となるシーンを抽出する（Ｓ４０１）。このとき、全体識別処理の際に各サブ識別器５１が算出した判別式の値が用いられる。 First, the integrated discriminator 70 extracts a positive scene based on the discriminant values of the five sub discriminators 51 (S401). At this time, the value of the discriminant calculated by each sub classifier 51 during the overall identification process is used.

次に、統合識別器７０は、判別式の値が正のシーンが存在するか否かを判断する（Ｓ４０２）。
判別式の値が正のシーンが存在する場合（Ｓ４０２でＹＥＳ）、最大値のシーンの欄に肯定フラグを立てて（Ｓ４０３）、統合識別処理を終了する。これにより、最大値のシーンに識別対象画像が属すると判断される。
一方、判別式の値が正であるシーンが存在しない場合（Ｓ４０２でＮＯ）、肯定フラグを立てずに、統合識別処理を終了する。これにより、図８の識別対象テーブルの肯定欄において、１のシーンが無いままの状態になる。つまり、識別対象画像が、どのシーンに属するか識別できなかったことになる。 Next, the integrated discriminator 70 determines whether or not a scene having a positive discriminant value exists (S402).
If there is a scene with a positive discriminant value (YES in S402), an affirmative flag is set in the maximum value scene column (S403), and the integrated identification process is terminated. Accordingly, it is determined that the identification target image belongs to the maximum value scene.
On the other hand, if there is no scene having a positive discriminant value (NO in S402), the integrated identification process is terminated without setting an affirmative flag. As a result, there is no scene in the affirmative column of the identification target table in FIG. That is, it cannot be identified to which scene the identification target image belongs.

なお、既に説明した通り、統合識別処理が終了すると、シーン識別部３３は、統合識別処理によってシーンの識別ができたか否かを判断する（図５のＳ１０８）。このとき、シーン識別部３３は、図８の識別対象テーブルを参照し、「肯定」欄に１があるか否かを判断することになる。Ｓ４０２でＮＯとの判断の場合、Ｓ１０８の判断もＮＯになる。 As already described, when the integrated identification process is completed, the scene identification unit 33 determines whether or not the scene has been identified by the integrated identification process (S108 in FIG. 5). At this time, the scene identification unit 33 refers to the identification target table in FIG. 8 and determines whether or not there is 1 in the “affirmation” column. If it is determined NO in S402, the determination in S108 is also NO.

＝＝＝参考例＝＝＝
＜概要＞
既に説明した通り、全体識別器５０は、５つのサブ識別器５１を有しており、全体識別処理の際に各サブ識別器５１を順に選択していくことになる。ここで、サブ識別器５１の選択順序は、全体識別処理の処理時間の期待値が最も短くなるようにすることが望ましい。 === Reference Example ===
<Overview>
As already described, the overall discriminator 50 has five sub discriminators 51, and each sub discriminator 51 is sequentially selected during the overall discriminating process. Here, it is desirable that the selection order of the sub-identifiers 51 is such that the expected value of the processing time of the overall identification process is the shortest.

そこで、本参考例では、全体識別処理の処理時間の期待値が短くなるように、５つのサブ識別器５１の選択順序を決定している。 Therefore, in this reference example, the selection order of the five sub classifiers 51 is determined so that the expected value of the processing time of the overall identification process is shortened.

図１９は、選択順序の決定フローの説明図である。この決定フローは、シーン識別プログラムを設計する際に、行われる。 FIG. 19 is an explanatory diagram of a selection order determination flow. This decision flow is performed when designing the scene identification program.

まず、最初に、サブ識別器５１の選択順序の全てが列挙される。ここでは、サブ識別器５１が５つであるので、１２０（＝５の階乗）通りの順序が列挙される（Ｓ５０１）。 First, all the selection orders of the sub discriminators 51 are listed. Here, since there are five sub discriminators 51, 120 (= 5 factorial) orders are listed (S501).

次に、各選択順序における処理時間の期待値を算出する（Ｓ５０２）。ここでは、１２０通りの選択順序のそれぞれについて、処理時間の期待値が算出される。処理時間の期待値の算出方法については、後で詳述する。 Next, an expected value of processing time in each selection order is calculated (S502). Here, the expected value of the processing time is calculated for each of the 120 selection orders. A method for calculating the expected value of the processing time will be described in detail later.

そして、１２０通りの処理時間の期待値の中から最短のものを抽出する（Ｓ５０３）。そして、処理時間の期待値が最短のときの選択順序を記憶する。 Then, the shortest of 120 expected processing time values is extracted (S503). Then, the selection order when the expected processing time is the shortest is stored.

＜処理時間の期待値の算出方法＞
図２０は、ある選択順序における処理時間の期待値を算出するフロー図である。
ここでは、風景→夕景→夜景→花→紅葉の順に、サブ識別器５１が選択される場合の処理時間の期待値について説明する。ｎ番目に選択されるサブ識別器５１のことを「第ｎサブ識別器」と呼ぶ。例えば、風景識別器５１Ｌは、第１サブ識別器である。また、第ｎサブ識別器５１が実行される確率（実行確率）をＰｎとする。例えば、夕景識別器５１Ｓの実行確率はＰ２である。また、第ｎサブ識別器５１の処理速度（識別対象画像が特定のシーンであるか否かを識別する処理速度）をＴｎとする。例えば、夜景識別器５１Ｎの処理速度はＴ３である。 <Calculation method of expected value of processing time>
FIG. 20 is a flowchart for calculating an expected value of processing time in a certain selection order.
Here, the expected value of the processing time when the sub discriminator 51 is selected in the order of landscape → evening scene → night scene → flower → autumn leaves will be described. The n-th selected sub classifier 51 is referred to as an “n-th sub classifier”. For example, the landscape classifier 51L is a first sub classifier. Further, the probability (execution probability) that the n-th sub classifier 51 is executed is assumed to be Pn. For example, the execution probability of the evening scene classifier 51S is P2. The processing speed of the n-th sub classifier 51 (processing speed for identifying whether or not the classification target image is a specific scene) is Tn. For example, the processing speed of the night scene classifier 51N is T3.

まず、各サブ識別器５１の実行確率Ｐｎ（ｎは１〜５）が算出される（Ｓ６０１）。最初に実行される第１サブ識別器５１（ここでは風景識別器５１Ｌ）の実行確率Ｐ１は、１になる。但し、第１サブ識別器５１の判別式の値が肯定閾値より大きい場合や、第１サブ識別器の判別式の値が第２否定閾値より大きい場合、第２サブ識別器以降の処理が省略されるので、第２〜第５サブ識別器５１の実行確率Ｐ２〜Ｐ５は、１よりも小さい値になる。各サブ識別器５１の実行確率の算出方法については、後で詳述する。 First, the execution probability Pn (n is 1 to 5) of each sub classifier 51 is calculated (S601). The execution probability P1 of the first sub classifier 51 (here, the scene classifier 51L) to be executed first is 1. However, when the value of the discriminant of the first sub classifier 51 is larger than the positive threshold or when the value of the discriminant of the first sub classifier is larger than the second negative threshold, the processing after the second sub classifier is omitted. Therefore, the execution probabilities P2 to P5 of the second to fifth sub discriminators 51 are smaller than 1. A method of calculating the execution probability of each sub classifier 51 will be described in detail later.

次に、各サブ識別器５１の処理時間の期待値が算出される（Ｓ６０２）。各サブ識別器５１の処理時間の期待値は、そのサブ識別器５１の実行確率Ｐｎと処理速度Ｔｎとを乗算した値である。例えば、花識別器５１Ｆの処理速度の期待値は、Ｔ４×Ｐ４と算出される。 Next, an expected value of processing time of each sub classifier 51 is calculated (S602). The expected value of the processing time of each sub classifier 51 is a value obtained by multiplying the execution probability Pn of the sub classifier 51 by the processing speed Tn. For example, the expected value of the processing speed of the flower discriminator 51F is calculated as T4 × P4.

図２１は、サブ識別器５１の処理時間の表である。図に示すように、各サブ識別器の処理時間はそれぞれ異なっている。これは、サポートベクタマシンによる判別式ｆ（ｘ）の値の算出では、サンプル数に応じて処理時間が異なるためである。なお、各サブ識別器の処理時間は、Ｓ６０２の処理の前に、予め求められている。 FIG. 21 is a table of processing times of the sub classifier 51. As shown in the figure, the processing time of each sub classifier is different. This is because the processing time varies depending on the number of samples in the calculation of the value of the discriminant f (x) by the support vector machine. Note that the processing time of each sub classifier is obtained in advance before the processing of S602.

次に、全体識別処理の処理時間の期待値Ｔｔが算出される（Ｓ６０３）。全体識別処理の処理時間の期待値Ｔｔは、各サブ識別器の処理時間の期待値の総和となる。つまり、全体識別処理の処理速度の期待値Ｔｔは、Ｔｔ＝（Ｔ１×Ｐ１）＋（Ｔ２×Ｐ２）＋（Ｔ３×Ｐ３）＋（Ｔ４×Ｐ４）＋（Ｔ５×Ｐ５）として算出される。 Next, an expected value Tt of the processing time of the overall identification process is calculated (S603). The expected value Tt of the processing time of the overall identification process is the sum of the expected values of the processing time of each sub classifier. That is, the expected value Tt of the overall identification processing speed is calculated as Tt = (T1 × P1) + (T2 × P2) + (T3 × P3) + (T4 × P4) + (T5 × P5).

＜Ｐ（ｉ，ｊ）の算出方法＞
各サブ識別器５１の実行確率Ｐｎの算出方法を説明する前に、まず、第ｉサブ識別器が実行された後に、第ｊサブ識別器が省略されずに実行される確率Ｐ（ｉ，ｊ）について説明する。 <Calculation method of P (i, j)>
Before describing the method of calculating the execution probability Pn of each sub-classifier 51, first, after the i-th sub-classifier is executed, the probability P (i, j executed without the j-th sub-classifier being omitted). ).

第ｊサブ識別器が省略される場合としては、第ｉサブ識別器によって識別対象画像が特定のシーンに属することが識別できた場合（第ｉサブ識別器の判別式の値が肯定閾値よりも大きい場合）、若しくは、第ｉサブ識別器によって第ｊサブ識別器のシーンの否定フラグが立つ場合（第ｉサブ識別器の判別式の値が第２否定閾値よりも大きい場合）がある。このため、Ｐ（ｉ，ｊ）は、判別式の値と、肯定閾値と第２否定閾値との関係から求めることができる。 As the case where the j-th sub classifier is omitted, it can be identified by the i-th sub classifier that the classification target image belongs to a specific scene (the discriminant value of the i-th sub classifier is greater than the positive threshold). In some cases, or when the negative flag of the scene of the j-th sub-classifier is set by the i-th sub-classifier (when the discriminant value of the i-th sub-classifier is larger than the second negative threshold). For this reason, P (i, j) can be obtained from the value of the discriminant and the relationship between the positive threshold and the second negative threshold.

具体的には、第ｉサブ識別器によって識別対象画像が特定のシーンに属することが識別できる確率をＰposｉ、第ｉサブ識別器によって第ｊサブ識別器のシーンの否定フラグが立つ確率をＰneg（ｉ，ｊ）とすると、Ｐ（ｉ，ｊ）は次式となる。 Specifically, the probability that the identification target image can be identified by the i-th sub-classifier belongs to a specific scene is Pposi, and the probability that the negative flag of the scene of the j-th sub-classifier is set by the i-th sub-classifier is Pneg ( i, j), P (i, j) is given by

Ｐ（ｉ，ｊ）＝１−max（Ｐposｉ、Ｐneg（ｉ，ｊ））
より具体的に説明するため、風景識別器５１Ｌが実行された後に夕景識別器５１Ｓが実行される確率Ｐ（１，２）について説明する。 P (i, j) = 1-max (Pposi, Pneg (i, j))
In order to explain more specifically, the probability P (1,2) that the evening scene classifier 51S is executed after the scenery classifier 51L is executed will be described.

図２２Ａは、風景識別器５１Ｌによって評価用サンプル画像を識別したときの判別式の値の確率分布である。図中には、点線等によって、肯定閾値や各シーンの第２否定閾値が示されている。 FIG. 22A shows a probability distribution of discriminant values when a sample image for evaluation is identified by the landscape classifier 51L. In the figure, a positive threshold and a second negative threshold for each scene are shown by dotted lines or the like.

図２２Ｂは、風景識別器５１Ｌが風景画像を識別できる確率Ｐpos１の説明図である。風景識別器５１が風景画像を識別できる確率Ｐpos１は、判別式の値が肯定閾値よりも大きい確率であるため、図中の斜線部分の領域の積分値として求められる。 FIG. 22B is an explanatory diagram of the probability Ppos1 with which the landscape classifier 51L can identify a landscape image. The probability Ppos1 by which the landscape discriminator 51 can identify a landscape image is a probability that the discriminant value is larger than the affirmative threshold, and is thus obtained as an integral value of the shaded area in the figure.

図２２Ｃは、風景識別器５１Ｌが夕景の否定フラグを立てる確率Ｐneg（１，２）の説明図である。風景識別器５１Ｌが夕景の否定フラグを立てる確率Ｐneg（１，２）は、風景識別器５１Ｌの判別式の値が夕景の第２否定閾値よりも大きい確率であるため、図中の斜線部分の領域の積分値として求められる。この場合、夕景の第２否定閾値は肯定閾値よりも大きい値であるため、確率Ｐneg（１，２）は確率Ｐpos１よりも小さい値になる。 FIG. 22C is an explanatory diagram of the probability Pneg (1, 2) that the landscape discriminator 51L sets the evening scene negative flag. The probability Pneg (1,2) that the landscape discriminator 51L sets the evening scene negation flag is a probability that the discriminant value of the landscape discriminator 51L is larger than the second negation threshold value of the evening scene. It is obtained as the integral value of the region. In this case, since the second negative threshold value of the evening scene is larger than the positive threshold value, the probability Pneg (1, 2) is smaller than the probability Ppos1.

よって、風景識別器５１Ｌが実行された後に夕景識別器５１Ｓが実行される確率Ｐ（１，２）は、Ｐ（１，２）＝１−Ｐpos１と求められる。 Therefore, the probability P (1,2) that the sunset scene classifier 51S is executed after the scenery classifier 51L is executed is obtained as P (1,2) = 1-Ppos1.

参考までに、風景識別器５１Ｌが実行された後に夜景識別器５１Ｎが実行される確率Ｐ（１，３）について説明する。
図２２Ｄは、風景識別器５１Ｌが夜景の否定フラグを立てる確率Ｐneg（１，３）の説明図である。風景識別器５１Ｌが夜景の否定フラグを立てる確率Ｐneg（１，３）は、風景識別器５１Ｌの判別式の値が夜景の第２否定閾値よりも大きい確率であるため、図中の斜線部分の領域の積分値として求められる。この場合、夜景の第２否定閾値は肯定閾値よりも小さい値であるため、確率Ｐneg（１，３）は確率Ｐpos１よりも大きい値になる。
よって、風景識別器５１Ｌが実行された後に夜景識別器５１Ｎが実行される確率Ｐ（１，３）は、Ｐ（１，３）＝１−Ｐneg（１，３）と求められる。 For reference, the probability P (1, 3) that the night scene classifier 51N is executed after the scene classifier 51L is executed will be described.
FIG. 22D is an explanatory diagram of the probability Pneg (1, 3) that the landscape classifier 51L sets the night scene negative flag. The probability Pneg (1, 3) that the landscape discriminator 51L sets the night scene negative flag is a probability that the discriminant value of the landscape discriminator 51L is larger than the second negative threshold of the night scene. It is obtained as the integral value of the region. In this case, since the second negative threshold of the night view is a value smaller than the positive threshold, the probability Pneg (1, 3) is a value larger than the probability Ppos1.
Therefore, the probability P (1,3) that the night scene classifier 51N is executed after the scene classifier 51L is executed is obtained as P (1,3) = 1-Pneg (1,3).

図２３は、第ｉサブ識別器が実行されたときに第ｊサブ識別器が実行される確率Ｐ（ｉ，ｊ）の表である。このように算出された確率Ｐ（ｉ，ｊ）を用いて、次に、各サブ識別器５１の実行確率Ｐｎが算出される。 FIG. 23 is a table of probabilities P (i, j) that the j-th sub classifier is executed when the i-th sub classifier is executed. Next, the execution probability Pn of each sub classifier 51 is calculated using the probability P (i, j) calculated in this way.

＜実行確率の算出方法＞
図２４は、ある選択順序における各サブ識別器５１の実行確率Ｐｎの算出するフロー図である。 <Execution probability calculation method>
FIG. 24 is a flowchart for calculating the execution probability Pn of each sub classifier 51 in a certain selection order.

まず、全てのサブ識別器５１の実行確率Ｐｎを１に初期化する（Ｓ７０１）。次に、ｉを１に設定し（Ｓ７０２）、第１サブ識別器の実行確率Ｐ１を算出する（Ｓ７０３）。第１サブ識別器は、最初に実行されるので、省略されることが無いため、実行確率Ｐ１は１になる。 First, the execution probability Pn of all the sub classifiers 51 is initialized to 1 (S701). Next, i is set to 1 (S702), and the execution probability P1 of the first sub classifier is calculated (S703). Since the first sub classifier is executed first, it is not omitted, so the execution probability P1 is 1.

次に、全てのサブ識別器５１の実行確率を算出したか否かを判断する（Ｓ７０４）。未だ第１サブ識別器５１の実行確率Ｐ１しか算出していないので、Ｓ７０４ではＮＯになる。 Next, it is determined whether or not the execution probabilities of all the sub classifiers 51 have been calculated (S704). Since only the execution probability P1 of the first sub classifier 51 has been calculated, NO is obtained in S704.

次に、未実行の第２〜第５サブ識別器の実行確率Ｐ２〜Ｐ５の値を更新する（Ｓ７０５）。ここでは、実行確率Ｐｎ（ｎは２〜５）は、Ｐ（１，ｎ）×Ｐ１に更新される。 Next, the values of the execution probabilities P2 to P5 of the second to fifth sub classifiers that have not been executed are updated (S705). Here, the execution probability Pn (n is 2 to 5) is updated to P (1, n) × P1.

その後、ｉを２に設定し（Ｓ７０６）、第２サブ識別器の実行確率Ｐ２を算出する（Ｓ７０３）。第２サブ識別器の実行確率Ｐ２は、Ｓ７０５において更新された値Ｐ（１，２）×Ｐ１になる。 Thereafter, i is set to 2 (S706), and the execution probability P2 of the second sub classifier is calculated (S703). The execution probability P2 of the second sub classifier becomes the value P (1,2) × P1 updated in S705.

次に、全てのサブ識別器５１の実行確率を算出したか否かを判断する（Ｓ７０４）。未だ第２サブ識別器５１の実行確率Ｐ２までしか算出していないので、Ｓ７０４ではＮＯになる。 Next, it is determined whether or not the execution probabilities of all the sub classifiers 51 have been calculated (S704). Since only the execution probability P2 of the second sub discriminator 51 has been calculated yet, NO is obtained in S704.

次に、未実行の第３〜第５サブ識別器の実行確率Ｐ３〜Ｐ５の値を更新する（Ｓ７０５）。ここでは、第２サブ識別器が実行された場合と省略された場合に分けて考える必要がある。まず、第２サブ識別器が実行された場合には、第ｎサブ識別器が実行される確率Ｐｎは、Ｐ２×Ｐ（２，ｎ）×Ｐｎになる。一方、第２サブ識別器が省略された場合には、第ｎサブ識別器が実行される確率Ｐｎは、（１−Ｐ２）×Ｐｎになる。従って、第ｎ識別器が実行される確率Ｐｎは、（Ｐ２×Ｐ（２，ｎ）＋１−Ｐ２）×Ｐｎに更新される。 Next, the values of the execution probabilities P3 to P5 of the unexecuted third to fifth sub classifiers are updated (S705). Here, it is necessary to consider separately when the second sub classifier is executed and when it is omitted. First, when the second sub-classifier is executed, the probability Pn that the n-th sub-classifier is executed is P2 × P (2, n) × Pn. On the other hand, when the second sub classifier is omitted, the probability Pn that the nth sub classifier is executed is (1−P2) × Pn. Accordingly, the probability Pn that the nth discriminator is executed is updated to (P2 × P (2, n) + 1−P2) × Pn.

このようにして、第３サブ識別器以降のサブ識別器についても、同様に実行確率Ｐｎを算出していく（Ｓ７０３〜Ｓ７０６）。 In this way, the execution probabilities Pn are similarly calculated for the sub-classifiers after the third sub-classifier (S703 to S706).

そして、全てのサブ識別器の実行確率Ｐｎが算出したら（Ｓ７０４でＹＥＳ）、処理を終了する。これにより、ある選択順序における各サブ識別器５１の実行確率Ｐｎが算出される。 When the execution probabilities Pn of all the sub classifiers are calculated (YES in S704), the process ends. Thereby, the execution probability Pn of each sub discriminator 51 in a certain selection order is calculated.

＜選択順序の決定後について＞
上記のようにして、１２０通りの選択順序の中から、処理時間の期待値が最短の選択順序が決定される。この結果、本参考例では、風景→夕景→夜景→花→紅葉の選択順序が決定される。そして、この決定に基づいて、全体識別器５０を実現するためのシーン識別プログラムが構成され、このプログラムがプリンタ４のプリンタ側コントローラ２０に搭載される。この結果、処理速度の速い全体識別処理を実現することができる。 <After determining the selection order>
As described above, the selection order with the shortest expected processing time value is determined from the 120 selection orders. As a result, in this reference example, the selection order of landscape → evening scene → night scene → flower → autumn leaves is determined. Based on this determination, a scene identification program for realizing the overall classifier 50 is configured, and this program is installed in the printer-side controller 20 of the printer 4. As a result, it is possible to realize an overall identification process with a high processing speed.

＝＝＝第１実施形態＝＝＝
＜概要＞
前述の参考例では、サブ識別器５１の選択順序が固定されている。このため、前述の参考例では、サブ識別器５１の選択順序が固定されていることを前提にして、処理時間の期待値が最短の選択順序が決定されている。 === First Embodiment ===
<Overview>
In the reference example described above, the selection order of the sub-identifiers 51 is fixed. For this reason, in the above-described reference example, the selection order with the shortest expected processing time is determined on the assumption that the selection order of the sub-identifiers 51 is fixed.

一方、第１実施形態では、先に実行されるサブ識別器の判別式の値に応じて、それ以降に実行されるサブ識別器の順序を変更している。このため、第１実施形態では、サブ識別器の順序が変更されることも考慮して、処理時間の期待値が最短の選択順序が決定される。 On the other hand, in the first embodiment, the order of the sub classifiers to be executed thereafter is changed according to the value of the discriminant of the sub classifier to be executed first. For this reason, in the first embodiment, considering the change of the order of the sub classifiers, the selection order having the shortest expected processing time is determined.

＜サブ識別器の選択順序について＞
最初に風景識別器５１Ｌが選択された場合、判別式の値に応じて、夜景が除外されない場合と、除外される場合とがある。前者の場合、夜景識別器５１Ｎを含む残り４個のサブ識別器５１の処理時間の期待値が最短になるような選択順序であることが望ましい。一方、後者の場合、夜景識別器５１Ｎを除く残り３個のサブ識別器５１の処理時間の期待値が最短になるような選択順序であることが望ましい。このように、先に実行されるサブ識別器の判別式の値に応じて、それ以降に実行されるサブ識別器の最適な順序が異なることがある。 <About the selection order of sub classifiers>
When the landscape discriminator 51L is selected first, there are cases where the night view is not excluded and cases where it is excluded depending on the value of the discriminant. In the former case, it is desirable that the selection order be such that the expected value of the processing time of the remaining four sub classifiers 51 including the night scene classifier 51N is the shortest. On the other hand, in the latter case, it is desirable that the selection order be such that the expected value of the processing time of the remaining three sub discriminators 51 excluding the night scene discriminator 51N is the shortest. As described above, the optimal order of the sub-classifiers executed thereafter may differ depending on the value of the discriminant of the sub-classifier executed first.

図２５は、順序決定の際に参照されるツリー構造のデータ（ツリーデータ）の概念図である。このツリーデータは、サブ識別器５１の選択順序を示すものであり、図７のＳ２０１の際に全体識別器５０に参照されるデータである。また、このツリーデータは、サブ識別器５１の判別式の値と、次に選択すべきサブ識別器５１とを関連付けたデータになっている。言い換えると、このツリーデータは、サブ識別器５１の判別式の値に応じて、次に選択すべきサブ識別器５１が分岐している。このツリーデータは、記憶部３１に記憶されていても良いし、全体識別処理を実行させるためのプログラムの一部に組み込まれていても良い。 FIG. 25 is a conceptual diagram of tree-structured data (tree data) referred to in order determination. This tree data indicates the selection order of the sub classifier 51, and is data referred to by the overall classifier 50 in S201 of FIG. The tree data is data in which the discriminant value of the sub classifier 51 is associated with the sub classifier 51 to be selected next. In other words, in the tree data, the sub classifier 51 to be selected next branches according to the discriminant value of the sub classifier 51. This tree data may be stored in the storage unit 31 or may be incorporated in a part of a program for executing the overall identification process.

以下、図７のＳ２０１の際に、全体識別器５０がツリーデータを用いてどのようにサブ識別器５１を選択するかについて、説明する。
まず、最初のＳ２０１の際に、全体識別器５０は、ツリーデータの１番目を参照し、風景識別器５１Ｌを選択する。なお、既に説明した通り、風景識別器５１Ｌの判別式の値が１．２７（肯定閾値）よりも大きければ（Ｓ２０４でＹＥＳ）、風景識別器５１Ｌは識別対象画像が風景のシーンに属することが判断でき、他のサブ識別器５１による処理が省略される。また、風景識別器５１Ｌの判別式の値が−０．４５より大きく、１．２７（肯定閾値）よりも小さければ（Ｓ２０６でＹＥＳ）、夜景の否定フラグが立ち、夜景識別器５１Ｎによる処理が省略されるようになる（夜景が除外される）。 Hereinafter, how the overall discriminator 50 selects the sub discriminator 51 using the tree data in S201 of FIG. 7 will be described.
First, in the first S201, the overall classifier 50 refers to the first tree data and selects the landscape classifier 51L. As already described, if the value of the discriminant of the landscape discriminator 51L is greater than 1.27 (positive threshold) (YES in S204), the landscape discriminator 51L may indicate that the classification target image belongs to a landscape scene. This can be determined, and the processing by the other sub classifier 51 is omitted. If the value of the discriminant of the landscape discriminator 51L is larger than −0.45 and smaller than 1.27 (positive threshold) (YES in S206), a night scene negative flag is set and processing by the night scene discriminator 51N is performed. Omitted (night view is excluded).

ところで、風景識別器５１Ｌによって夜景が除外されると、識別処理を実行することになる残りのサブ識別器５１は、夕景識別器５１Ｓと、花識別器５１Ｆと、紅葉識別器５１Ｒの３つになる。この３つのサブ識別器５１の選択順序について、夕景→花→紅葉の順よりも、紅葉→夕景→花の順の方が処理時間の期待値が短いならば、後者の選択順序にする方が望ましい。但し、前述の参考例によれば、紅葉識別器５１Ｒが、夕景識別器５１Ｓや花識別器５１Ｆよりも先に選択されることは起こらない。これに対し、第１実施形態では、判別式の値に応じて次に選択されるサブ識別器の種類が決まるため、風景識別器５１Ｌによって夜景が除外されると、次に紅葉識別器５１Ｒが選択されるようになる。 By the way, when the night scene is excluded by the landscape classifier 51L, the remaining sub-classifiers 51 that will perform the classification process are the evening scene classifier 51S, the flower classifier 51F, and the autumnal leaves classifier 51R. Become. Regarding the selection order of the three sub classifiers 51, if the expected value of the processing time is shorter in the order of autumnal leaves → sunset → flowers than in the order of evening scenes → flowers → autumn leaves, the latter selection order is preferred. desirable. However, according to the above-described reference example, the autumnal leaves discriminator 51R is not selected before the evening scene discriminator 51S or the flower discriminator 51F. On the other hand, in the first embodiment, since the type of the sub-classifier to be selected next is determined according to the value of the discriminant, when the night scene is excluded by the landscape classifier 51L, the autumn color classifier 51R Will be selected.

すなわち、風景識別器５１Ｌによる処理が終わった後の次のＳ２０１の際に、全体識別器５０は、ツリーデータの２番目を参照し、風景識別器５１Ｌの判別式の値が−０．４５（夜景に対応する第２否定閾値）よりも小さければ、全体識別器５０は、夕景識別器５１Ｓを選択する。一方、風景識別器５１Ｌの判別式の値が−０．４５より大きく、１．２７（肯定閾値）よりも小さければ、全体識別器５０は、紅葉識別器５１Ｒを選択する。このように、第１実施形態では、風景識別器５１Ｌの判別式の値に応じて、次に選択されるサブ識別器の種類が変わる。 That is, in the next S201 after the processing by the landscape classifier 51L is completed, the overall classifier 50 refers to the second of the tree data, and the discriminant value of the landscape classifier 51L is −0.45 ( If it is smaller than the second negative threshold corresponding to the night view), the overall discriminator 50 selects the evening scene discriminator 51S. On the other hand, if the discriminant value of the landscape discriminator 51L is larger than −0.45 and smaller than 1.27 (positive threshold), the overall discriminator 50 selects the autumn color discriminator 51R. Thus, in the first embodiment, the type of the sub-classifier to be selected next changes according to the discriminant value of the landscape classifier 51L.

これにより、第１実施形態では、風景識別器５１によって夜景が除外された後の処理時間の期待値が短くなる。この結果、全体識別処理の処理時間の期待値も短くなる。 Thereby, in 1st Embodiment, the expected value of the processing time after a night view is excluded by the landscape identification device 51 becomes short. As a result, the expected value of the processing time of the overall identification process is also shortened.

同様に、２番目の夕景識別器５１Ｓによって花のみが除外されると、識別処理を実行することになる残りのサブ識別器５１は、夜景識別器５１Ｎと紅葉識別器５１Ｒの２つになる。この２つのサブ識別器５１の順序について、夜景→紅葉の順よりも、紅葉→夜景の順の方が処理時間の期待値が短いならば、後者の選択順序にする方が望ましい。但し、前述の参考例によれば、紅葉識別器５１Ｒが夜景識別器５１Ｎよりも先に選択されることは起こらない。これに対し、第１実施形態では、判別式の値に応じて次に選択されるサブ識別器の種類が決まるため、２番目の夕景識別器５１Ｒによって花のみが除外されると、次に紅葉識別器５１Ｒが選択されるようになる。 Similarly, when only the flowers are excluded by the second evening scene discriminator 51S, the remaining sub discriminators 51 that will execute the discrimination process are the night scene discriminator 51N and the autumnal leaf discriminator 51R. Regarding the order of the two sub classifiers 51, if the expected value of the processing time is shorter in the order of autumn leaves → night view than in the order of night view → autumn leaves, the latter selection order is desirable. However, according to the above-described reference example, the autumnal leaves discriminator 51R is not selected before the night view discriminator 51N. On the other hand, in the first embodiment, since the type of the next sub-classifier to be selected is determined according to the value of the discriminant, if only the flower is excluded by the second evening scene classifier 51R, then the autumn leaves The discriminator 51R is selected.

すなわち、２番目に夕景識別器５１Ｓが選択された場合において、さらに次のＳ２０１の際に（夕景識別器５１Ｓによる処理が終わった後の次のＳ２０１の際に）、全体識別器５０は、ツリーデータの３番目を参照する。このとき、夕景識別器５１Ｓの判別式の値が−０．６６（花に対応する第２否定閾値）よりも小さければ、全体識別器５０は、夜景識別器５１Ｎを選択する。また、夕景識別器５１Ｓの判別式の値が−０．６６より大きく、−０．６２よりも小さければ、全体識別器５０は、紅葉識別器５１Ｒを選択する。このように、第１実施形態では、夕景判別器５１Ｌの判別式の値に応じて、次に選択されるサブ識別器５１の種類が変わる。 In other words, when the sunset scene classifier 51S is selected second, in the next S201 (in the next S201 after the processing by the sunset scene classifier 51S is finished), the overall classifier 50 Refers to the third of the data. At this time, if the value of the discriminant of the evening scene classifier 51S is smaller than −0.66 (second negative threshold corresponding to the flower), the overall classifier 50 selects the night scene classifier 51N. If the discriminant value of the evening scene classifier 51S is larger than −0.66 and smaller than −0.62, the overall classifier 50 selects the autumnal leaves classifier 51R. As described above, in the first embodiment, the type of the sub-classifier 51 to be selected next changes according to the value of the discriminant expression of the evening scene classifier 51L.

これにより、第１実施形態では、２番目の夕景識別器５１Ｓによって花が除外された後の処理時間の期待値が短くなる。この結果、全体識別処理の処理時間の期待値も短くなる。 Thereby, in 1st Embodiment, the expected value of the processing time after a flower is excluded by the 2nd evening scene identifier 51S becomes short. As a result, the expected value of the processing time of the overall identification process is also shortened.

以上説明したように、第１実施形態では、判別式の値に応じて次に選択されるサブ識別器の種類が決まる。そして、第１実施形態では、後述するように最適な選択順序になるようにツリーデータが構成されているため、全体識別処理の処理時間の期待値を短くすることができる。 As described above, in the first embodiment, the type of the sub-classifier to be selected next is determined according to the discriminant value. In the first embodiment, the tree data is configured to have an optimal selection order as will be described later, so that the expected value of the processing time of the overall identification process can be shortened.

＜ツリー構造のデータの作成方法について＞
図２５のツリーデータは、処理時間の期待値が最短になるような最適な選択順序で構成されている。最適な選択順序になるように構成されるのであればツリーデータはどのような手順で作成しても良いが、本実施形態では、以下に説明するように再帰的な手順によって、ツリーデータを構成するための最適順序を決定している。 <How to create tree structure data>
The tree data in FIG. 25 is configured in an optimal selection order that minimizes the expected processing time. The tree data can be created by any procedure as long as it is configured to have an optimal selection order, but in this embodiment, the tree data is constructed by a recursive procedure as described below. To determine the optimal order.

図２６は、再帰的最適順序決定のフロー図である。このフローは、シーン識別プログラムを設計する際に、行われる。 FIG. 26 is a flowchart of recursive optimal order determination. This flow is performed when designing the scene identification program.

仮に全体識別器５０のサブ識別器５１の数が０個の場合、全体識別処理の処理時間の期待値はゼロになる。また、仮に全体識別器５０のサブ識別器５１の数が１個の場合、全体識別処理の処理時間の期待値は、そのサブ識別器５１の処理時間そのものになる。そこで、まず、未実行のサブ識別器５１の数が１個かゼロかを判断し（Ｓ８０１）、ゼロの場合には期待値にゼロを設定し、１個の場合にはそのサブ識別器５１の処理時間そのものを期待値に設定し（Ｓ８１１）、処理を終了する。 If the number of sub discriminators 51 of the overall discriminator 50 is 0, the expected value of the processing time of the overall discriminating process is zero. Further, if the number of the sub classifiers 51 of the overall classifier 50 is one, the expected value of the processing time of the overall classification process is the processing time of the sub classifier 51 itself. Therefore, first, it is determined whether the number of unexecuted sub classifiers 51 is one or zero (S801). If zero, the expected value is set to zero, and if it is one, the sub classifier 51 is set. Is set to the expected value (S811), and the process is terminated.

図２７は、２個のサブ識別器のツリーの候補（ツリー候補）の説明図である。ここでは、ＡとＢの２個のサブ識別器があるとする。以下、図２６と図２７とを参照しながら、２個の識別器の最適順序の決定について説明する。 FIG. 27 is an explanatory diagram of tree candidates (tree candidates) for two sub classifiers. Here, it is assumed that there are two sub-classifiers A and B. Hereinafter, determination of the optimal order of the two discriminators will be described with reference to FIGS. 26 and 27. FIG.

サブ識別器５１の数が２個の場合（Ｓ８０１でＮＯ）、まず、いずれかのサブ識別器を選択する（Ｓ８０２）。ここでは、まず、サブ識別器Ａが選択されるものとする。 If the number of sub classifiers 51 is two (NO in S801), first, any one of the sub classifiers is selected (S802). Here, first, it is assumed that the sub classifier A is selected.

次に、発生する可能性のあるサブ識別器の組み合わせを全て作成する（Ｓ８０３）。ここでは、条件Ｘa1（例えば、サブ識別器Ａの判別式の値が第２否定閾値よりも小さいという条件）のとき、「未実行のサブ識別器がＢ」という組み合わせと、条件Ｙa1のとき、「未実行のサブ識別器がゼロ」という組み合わせの２つがあるとする。 Next, all combinations of sub-classifiers that may occur are created (S803). Here, when the condition Xa1 (for example, the condition that the discriminant value of the sub classifier A is smaller than the second negative threshold), the combination of “unexecuted sub classifier is B” and the condition Ya1, Assume that there are two combinations of “unexecuted sub classifiers are zero”.

次に、Ｓ８０４において、前者の組み合わせ（「未実行のサブ識別器がＢ」という組み合わせ）が選択されるとする。そして、Ｓ８０５において、このフローが再帰的に実行され、未実行のサブ識別器が１個なので、サブ識別器Ｂの処理時間が期待値として出力されて（Ｓ８１１）、処理が戻ってくる。そして、Ｓ８０６において、後者の組み合わせが残っているので、ＮＯの判断となる。 Next, in S804, it is assumed that the former combination (the combination “unexecuted sub-identifier is B”) is selected. In S805, this flow is recursively executed, and since there is one unexecuted sub classifier, the processing time of the sub classifier B is output as an expected value (S811), and the process returns. In S806, since the latter combination remains, the determination is NO.

次のＳ８０４において、後者の組み合わせ（「未実行のサブ識別器がゼロ」という組み合わせ）が選択される。そして、このフローが再帰的に実行され、未実行のサブ識別器がゼロなので、ゼロの期待値が出力されて（Ｓ８１１）、処理が戻ってくる。そして、Ｓ８０６において、全ての組み合わせが終了したので、ＹＥＳの判断となる。 In the next S804, the latter combination (the combination “unexecuted sub-identifier is zero”) is selected. This flow is executed recursively, and since the unexecuted sub classifier is zero, an expected value of zero is output (S811), and the process returns. In step S806, since all combinations have been completed, the determination is YES.

次に、ツリー候補の期待処理時間を算出する（Ｓ８０７）。ここで、期待処理時間は、ある条件の発生確率とその条件に対応する組み合わせの期待値とを乗算した値と、Ｓ８０２で選択されたサブ識別器の処理時間との和である。つまり、図２７の第１ツリー候補（図２７の左側のツリー候補）に従って全体識別処理が実行されたときの処理時間の期待値が算出される。 Next, the expected processing time of the tree candidate is calculated (S807). Here, the expected processing time is the sum of the value obtained by multiplying the occurrence probability of a certain condition by the expected value of the combination corresponding to the condition, and the processing time of the sub-identifier selected in S802. That is, the expected value of the processing time when the overall identification process is executed according to the first tree candidate in FIG. 27 (the left tree candidate in FIG. 27) is calculated.

次に、別の未実行のサブ識別器の有無を判断する（Ｓ８０８）。２つのサブ識別器のうち、未だサブ識別器Ｂが選択されていないので、ここでの判断は「ある」になる。そして、同様にＳ８０２〜Ｓ８０７の処理が行われることにより、図２７の第２ツリー候補（図２７の右側のツリー候補）に従って全体識別処理が実行されたときの処理時間の期待値が算出される。 Next, it is determined whether there is another unexecuted sub classifier (S808). Of the two sub-classifiers, sub-classifier B has not yet been selected, so the determination here is “Yes”. Similarly, the processing of S802 to S807 is performed, and the expected value of the processing time when the overall identification processing is executed according to the second tree candidate in FIG. 27 (the tree candidate on the right side in FIG. 27) is calculated. .

そして、期待処理時間が最短のツリー候補を選択する（Ｓ８０９）。ここでは、第１ツリー候補と第２ツリー候補の期待処理時間が比較された結果、第１ツリー候補が選択されるものとする。 Then, the tree candidate with the shortest expected processing time is selected (S809). Here, it is assumed that the first tree candidate is selected as a result of comparing the expected processing times of the first tree candidate and the second tree candidate.

以上の処理により、サブ識別器５１の数が２個の場合の最適順序が求められることになる。 With the above processing, the optimum order when the number of sub-classifiers 51 is two is obtained.

図２８は、３個のサブ識別器のツリーの候補（ツリー候補）の説明図である。ここでは、Ａ〜Ｃの３個のサブ識別器があるとする。以下、図２６と図２８とを参照しながら、３個の識別器の最適順序の決定について説明する。 FIG. 28 is an explanatory diagram of tree candidates (tree candidates) for three sub classifiers. Here, it is assumed that there are three sub-classifiers A to C. Hereinafter, determination of the optimal order of the three classifiers will be described with reference to FIGS.

サブ識別器５１の数が２個の場合（Ｓ８０１でＮＯ）、まず、いずれかのサブ識別器を選択する（Ｓ８０２）。ここでは、まず、サブ識別器Ａが選択されるものとする。そして、サブ識別器Ａにおいて、３つの条件（条件Ｘa1、条件Ｙa1、条件Ｚa1）の下で３つの組み合わせがあるとする。 If the number of sub classifiers 51 is two (NO in S801), first, any one of the sub classifiers is selected (S802). Here, first, it is assumed that the sub classifier A is selected. In the sub-identifier A, it is assumed that there are three combinations under three conditions (condition Xa1, condition Ya1, and condition Za1).

まず、Ｓ８０４において、条件Ｘa1における組み合わせ（「未実行のサブ識別器がＢとＣの２個」）が選択されるものとする。この場合、次のＳ８０５において、既に説明した通り、サブ識別器の数が２個の場合の最適順序決定が行われ、その最適順序における期待値が出力され（Ｓ８１０）、処理が戻ってくる。そして、同様に、残りの組み合わせ（条件Ｙa1における組み合わせと、条件Ｚa1における組み合わせ）の最適順序と期待値も算出される。 First, in S804, it is assumed that the combination in the condition Xa1 (“2 unexecuted sub classifiers B and C”) is selected. In this case, in the next step S805, as already described, the optimum order is determined when the number of sub classifiers is two, the expected value in the optimum order is output (S810), and the process returns. Similarly, the optimum order and expected value of the remaining combinations (the combination in the condition Ya1 and the combination in the condition Za1) are also calculated.

そして、Ｓ８０７において、図２８の第１ツリー候補に従って全体識別処理が実行されたときの処理時間の期待値が算出される。そして、同様に、残りのツリー候補（不図示）の処理時間の期待値が算出される。そして、期待処理時間が最短のツリー候補を選択する（Ｓ８０９）。 In S807, an expected value of the processing time when the overall identification process is executed according to the first tree candidate in FIG. 28 is calculated. Similarly, the expected value of the processing time of the remaining tree candidates (not shown) is calculated. Then, the tree candidate with the shortest expected processing time is selected (S809).

以上の処理により、サブ識別器５１の数が３個の場合の最適順序が求められることになる。なお、サブ識別器の数が４個以上の場合も、上記のような再帰的な手順により最適順序が求められる。 With the above processing, the optimum order when the number of sub-identifiers 51 is three is obtained. Even when the number of sub classifiers is four or more, the optimal order is obtained by the recursive procedure as described above.

このようにして、本実施形態では、最適な選択順序になるように、図２５のツリーデータが構成される。そして、全体識別器５０が、図７のＳ２０１の際にこのツリーデータを参照してサブ識別器５１を選択することにより、全体識別処理の処理時間を速めることができる。 In this way, in the present embodiment, the tree data of FIG. 25 is configured so as to have an optimal selection order. Then, the overall classifier 50 selects the sub-classifier 51 with reference to this tree data at S201 in FIG. 7, thereby speeding up the processing time of the overall identification process.

＝＝＝第２実施形態＝＝＝
前述の第１実施形態では、全体識別器５０による全体識別処理の処理時間が速くなるように、５個のサブ識別器５１の選択順序が決定されている。
但し、部分識別器６０による部分識別処理の処理時間が速くなるように、３個のサブ部分識別器の選択順序が決定されても良い。 === Second Embodiment ===
In the first embodiment described above, the selection order of the five sub classifiers 51 is determined so that the processing time of the total classifying process by the total classifier 50 is increased.
However, the selection order of the three sub partial classifiers may be determined so that the processing time of the partial classification processing by the partial classifier 60 is accelerated.

＝＝＝第３実施形態＝＝＝
前述の第１実施形態によれば、全体識別処理におけるサブ識別器５１の選択順序が図２５に示す順になるようにシーン識別プログラムが構成されており、このプログラムがプリンタ４のプリンタ側コントローラ２０に搭載されている。 === Third Embodiment ===
According to the first embodiment described above, the scene identification program is configured so that the selection order of the sub-identifiers 51 in the overall identification process is the order shown in FIG. 25, and this program is stored in the printer-side controller 20 of the printer 4. It is installed.

これに対し、図７のＳ２０１においてサブ識別器５１を選択する際に、例えば図２６の処理をプリンタ側コントローラ２０が実行するように、シーン識別プログラムを構成しても良い。 On the other hand, when selecting the sub discriminator 51 in S201 of FIG. 7, for example, the scene discrimination program may be configured such that the printer-side controller 20 executes the processing of FIG.

＝＝＝その他の実施の形態＝＝＝
一実施形態としてのプリンタ等を説明したが、上記の実施形態は、本発明の理解を容易にするためのものであり、本発明を限定して解釈するためのものではない。本発明は、その趣旨を逸脱することなく、変更、改良され得ると共に、本発明にはその等価物が含まれることは言うまでもない。特に、以下に述べる実施形態であっても、本発明に含まれるものである。 === Other Embodiments ===
Although a printer or the like as one embodiment has been described, the above embodiment is for facilitating understanding of the present invention, and is not intended to limit the present invention. The present invention can be changed and improved without departing from the gist thereof, and it is needless to say that the present invention includes equivalents thereof. In particular, the embodiments described below are also included in the present invention.

＜プリンタについて＞
前述の実施形態ではプリンタ４がシーン識別処理をしていたが、デジタルスチルカメラ２がシーン識別処理をしても良い。また、上記のシーン識別処理を行う画像識別装置は、プリンタ４やデジタルスチルカメラ２に限られるものではない。例えば、大量の画像ファイルを保存するフォトストレージのような画像識別装置が、上記のシーン識別処理を行っても良い。もちろん、パーソナルコンピュータやインターネット上に設置されたサーバーが、上記のシーン識別処理を行っても良い。 <About the printer>
In the above-described embodiment, the printer 4 performs the scene identification process, but the digital still camera 2 may perform the scene identification process. Further, the image identification device that performs the above-described scene identification processing is not limited to the printer 4 or the digital still camera 2. For example, an image identification device such as a photo storage that stores a large amount of image files may perform the scene identification process described above. Of course, a personal computer or a server installed on the Internet may perform the scene identification process.

＜画像ファイルについて＞
前述の画像ファイルはＥｘｉｆ形式であったが、画像ファイルフォーマットはこれに限られるものではない。また、前述の画像ファイルは静止画であるが、動画であっても良い。要するに、画像ファイルが画像データと付加データとを備えていれば、前述のようなシーン識別処理を行うことが可能である。 <About image files>
The image file described above is in the Exif format, but the image file format is not limited to this. Further, the above-described image file is a still image, but may be a moving image. In short, if the image file includes image data and additional data, the scene identification process as described above can be performed.

＜サポートベクタマシンについて＞
前述のサブ識別器５１やサブ部分識別器６１には、サポートベクタマシン（ＳＶＭ）による識別手法が用いられている。しかし、識別対象画像が特定シーンに属するか否かの識別手法は、サポートベクタマシンを用いるものに限られるものではない。例えば、ニューラルネットワーク等のパターン認識を採用しても良い。 <About Support Vector Machine>
For the above-described sub classifier 51 and sub partial classifier 61, a classification method using a support vector machine (SVM) is used. However, the method for identifying whether or not the identification target image belongs to a specific scene is not limited to using a support vector machine. For example, pattern recognition such as a neural network may be employed.

＜シーンの識別について＞
前述の実施形態では、サブ識別器５１やサブ部分識別器６１は、画像データの示す画像が特定のシーンに属するか否かを識別している。しかし、サブ識別器５１やサブ部分識別器６１は、特定のシーンに属するか否かを識別するものに限られず、画像データの示す画像が特定のカテゴリに属するか否かを分類できれば良い。このため、サブ識別器５１やサブ部分識別器６１は、例えば画像データの示す画像が特定のパターン形状か否かを識別しても良い。 <About scene identification>
In the above-described embodiment, the sub classifier 51 and the sub partial classifier 61 identify whether or not the image indicated by the image data belongs to a specific scene. However, the sub discriminator 51 and the sub partial discriminator 61 are not limited to identifying whether or not they belong to a specific scene, as long as they can classify whether or not an image indicated by image data belongs to a specific category. For this reason, the sub classifier 51 and the sub partial classifier 61 may identify whether the image indicated by the image data has a specific pattern shape, for example.

＜サブ識別器・サブ部分識別器の選択順序について＞
前述の実施形態では、全体識別器５０による全体識別処理の後、部分識別器６０による部分識別処理が行われていた（図６、図７参照）。つまり、前述の実施形態では、５個のサブ識別器５１が選択された後、３個のサブ部分識別器６１が選択されていた。但し、これに限られるものではない。 <Selection order of sub classifier / sub partial classifier>
In the above-described embodiment, after the overall identification process by the overall classifier 50, the partial identification process by the partial classifier 60 is performed (see FIGS. 6 and 7). That is, in the above-described embodiment, after the five sub classifiers 51 are selected, three sub partial classifiers 61 are selected. However, the present invention is not limited to this.

例えば、サブ識別器５１とサブ部分識別器６１の選択順序を混ぜても良い。この場合、シーン識別処理の処理時間の期待値が最短になるように、サブ識別器５１とサブ部分識別器６１の選択順序が決定される。但し、サブ部分識別器６１の処理時間はサブ識別器５１の処理時間よりも遅いので（サブ部分識別器６１は判別式の値を複数回算出するため）、最適な選択順序を算出しても、結局、５個のサブ識別器５１が選択された後に３個のサブ部分識別器６１が選択されることになるかもしれない。 For example, the selection order of the sub classifier 51 and the sub partial classifier 61 may be mixed. In this case, the selection order of the sub classifier 51 and the sub partial classifier 61 is determined so that the expected value of the processing time of the scene classification process is the shortest. However, since the processing time of the sub partial discriminator 61 is slower than the processing time of the sub discriminator 51 (because the sub partial discriminator 61 calculates the discriminant value a plurality of times), the optimum selection order can be calculated. Eventually, after the five sub classifiers 51 are selected, the three sub partial classifiers 61 may be selected.

＝＝＝まとめ＝＝＝
（１）前述の実施形態では、全体識別器５０は複数のサブ識別器５１を有している。サブ識別器５１は、画像データの示す画像が特定のシーンに属するか否かを識別する識別処理を行っている。そして、前述の実施形態では、このようなサブ識別器を複数組み合わせることによって、画像データの示す画像のシーン（カテゴリの一例）を分類している。
ここで、仮にシーン識別処理を行うたびに全てのサブ識別器５１による識別処理を行っていたのでは、シーン識別処理の処理速度が遅くなってしまう。そこで、前述の参考例や実施形態では、画像データの示す画像が特定のシーンに属することをサブ識別器５１が識別できたとき等の場合、別のサブ識別器５１による処理を省略し、シーン識別処理の処理速度を速くしている。例えば、全体識別器５０の風景識別器５１Ｌが風景画像を識別できれば（図７のＳ２０４でＹＥＳ）、全体識別器５０の夕景識別器５１Ｓによる処理が省略され、シーン識別処理の処理速度が速くなる。
ところで、このようにサブ識別器５１の処理が省略されることがある場合、５個のサブ識別器５１の順序に応じて、全体識別処理の処理時間の期待値が異なることになる。そこで、前述の参考例では、全体識別処理の処理時間の期待値が最短になる固定された順序にて、５個のサブ識別器５１を順に実行している。これにより、全体識別処理の処理時間が速くなる。 === Summary ===
(1) In the above-described embodiment, the overall classifier 50 has a plurality of sub-classifiers 51. The sub-identifier 51 performs identification processing for identifying whether or not the image indicated by the image data belongs to a specific scene. And in the above-mentioned embodiment, the scene (an example of a category) of the image which image data shows is classified by combining several such sub discriminators.
Here, if the identification processing by all the sub-classifiers 51 is performed every time the scene identification processing is performed, the processing speed of the scene identification processing becomes slow. Therefore, in the above-described reference examples and embodiments, when the sub-classifier 51 can identify that the image indicated by the image data belongs to a specific scene, the processing by another sub-classifier 51 is omitted, and the scene The processing speed of the identification process is increased. For example, if the landscape discriminator 51L of the overall discriminator 50 can discriminate a landscape image (YES in S204 in FIG. 7), the processing by the evening scene discriminator 51S of the overall discriminator 50 is omitted, and the processing speed of the scene discrimination processing is increased. .
By the way, when the process of the sub classifier 51 may be omitted in this way, the expected value of the processing time of the overall classification process varies depending on the order of the five sub classifiers 51. Therefore, in the above-described reference example, the five sub-identifiers 51 are sequentially executed in a fixed order in which the expected value of the processing time of the overall identification process is the shortest. As a result, the processing time of the entire identification process is increased.

但し、風景→夕景→夜景→花→紅葉という固定された順序にて５個のサブ識別器５１を順に実行した場合、紅葉識別器５１Ｒが、夕景識別器５１Ｓや花識別器５１Ｆよりも先に選択されることは起こらない。一方、風景識別器５１Ｌによって夜景が除外されたとき、夕景→花→紅葉の順よりも、紅葉→夕景→花の順の方が処理時間の期待値が短いならば、後者の選択順序にする方が望ましい。 However, when the five sub classifiers 51 are sequentially executed in a fixed order of landscape → evening scene → night scene → flower → autumn leaves, the autumnal scene classifier 51R precedes the evening scene classifier 51S and the flower classifier 51F. It does not happen to be selected. On the other hand, when the night scene is excluded by the landscape discriminator 51L, if the expected value of the processing time is shorter in the order of autumnal leaves → evening scene → flower than in the order of evening scene → flower → autumn leaves, the latter order is selected. Is preferable.

そこで、前述の実施形態では、先に実行されるサブ識別器の判別式の値に応じて、次に実行されるサブ識別器５１が決定される。例えば風景識別器５１Ｌの判別式の値が−０．４５（夜景に対応する第２否定閾値）よりも小さければ、全体識別器５０は、夕景識別器５１Ｓを選択する。一方、風景識別器５１Ｌの判別式の値が−０．４５より大きく、１．２７（肯定閾値）よりも小さければ、全体識別器５０は、紅葉識別器５１Ｒを選択する。これにより、第１実施形態では、選択順序が固定された参考例よりも、全体識別処理の処理速度を速めることができる。 Therefore, in the above-described embodiment, the sub classifier 51 to be executed next is determined according to the value of the discriminant of the sub classifier to be executed first. For example, if the discriminant value of the landscape discriminator 51L is smaller than −0.45 (second negative threshold corresponding to the night scene), the overall discriminator 50 selects the evening scene discriminator 51S. On the other hand, if the discriminant value of the landscape discriminator 51L is larger than −0.45 and smaller than 1.27 (positive threshold), the overall discriminator 50 selects the autumn color discriminator 51R. Thereby, in the first embodiment, the processing speed of the overall identification process can be increased as compared with the reference example in which the selection order is fixed.

（２）前述の実施形態では、例えば風景識別器５１Ｌにより識別対象画像が風景であることを識別できたとき（図１０のＳ２０４でＹＥＳ）、もはや他のサブ識別器５１による処理は不要になるので、例えば夕景識別器５１による処理を省略している。これにより、シーン識別処理の処理速度を速くしている。 (2) In the above-described embodiment, for example, when the landscape classifier 51L can identify that the classification target image is a landscape (YES in S204 in FIG. 10), the processing by the other sub-classifier 51 is no longer necessary. Therefore, for example, the processing by the evening scene classifier 51 is omitted. Thereby, the processing speed of the scene identification process is increased.

（３）前述の実施形態では、例えば風景識別器５１Ｌの判別式の値が第２否定閾値より大きければ、風景とは別のシーン（例えば夜景）のシーンに識別対象画像が属しないと判断され、その識別器（例えば夜景識別器５１Ｎ）による識別は省略される。これにより、識別処理の速度を速めることができる。 (3) In the above-described embodiment, for example, if the value of the discriminant of the landscape discriminator 51L is larger than the second negative threshold, it is determined that the classification target image does not belong to a scene different from the landscape (for example, a night view). The identification by the classifier (for example, night scene classifier 51N) is omitted. As a result, the speed of the identification process can be increased.

（４）前述のサブ識別器５１は、判別式の値（画像が特定のカテゴリに属する確率に応じた値）を算出し、その値に基づいて画像が特定のシーンに属するか否かを識別している。そして、前述の実施形態では、この判別式の値を利用して、次に実行するサブ識別器５１を決定している。これにより、次に実行するサブ識別器５１を決定するための指標を別途算出する必要はないので、次に実行するサブ識別器５１の決定が簡略になる。 (4) The sub-identifier 51 described above calculates a discriminant value (a value corresponding to the probability that the image belongs to a specific category), and identifies whether the image belongs to a specific scene based on the value. is doing. In the above-described embodiment, the discriminator 51 to be executed next is determined using the value of this discriminant. Thereby, there is no need to separately calculate an index for determining the sub classifier 51 to be executed next, so that the determination of the sub classifier 51 to be executed next is simplified.

（５）前述の実施形態では、全体識別器５０は、ツリーデータを用いて、次に実行するサブ識別器５１を決定する。ここで、ツリーデータは、図２５に示すように、サブ識別器５１の判別式の値と、次に選択すべきサブ識別器５１とをそれぞれ関連付けたデータである。このようなツリーデータを用いることによって、全体識別器５０は、先に実行されるサブ識別器の判別式の値に応じて、次に実行されるサブ識別器５１を決定できる。 (5) In the above-described embodiment, the overall classifier 50 determines the next sub-classifier 51 to be executed using the tree data. Here, the tree data is data in which the discriminant value of the sub classifier 51 is associated with the sub classifier 51 to be selected next, as shown in FIG. By using such tree data, the overall classifier 50 can determine the sub classifier 51 to be executed next in accordance with the value of the discriminant of the sub classifier to be executed first.

（６）前述の実施形態では、各ツリー候補の期待処理時間が算出され（図２６のＳ８０７参照）、全てのツリー候補の中から期待処理時間が最短のツリー候補が選択され（Ｓ８０９）、ツリーデータが作成される。これにより、全体識別処理の処理速度を速めることができる。 (6) In the above-described embodiment, the expected processing time of each tree candidate is calculated (see S807 in FIG. 26), and the tree candidate with the shortest expected processing time is selected from all tree candidates (S809). Data is created. Thereby, the processing speed of the whole identification process can be increased.

（７）前述の実施形態では、ｎ−１個のサブ識別器５１によって処理時間が最短になる選択順序を決定し、その処理時間を出力する決定フローが用意されている（図２６参照）。そして、ｎ個のサブ識別器５１によって処理時間が最短になる選択順序を決定する場合、まず１つのサブ識別器５１を選択し、残りのｎ−１個のサブ識別器５１によって処理時間が最短になる選択順序を再帰的処理により決定し、選択したサブ識別器５１が最初に実行されるようなツリー候補の期待処理時間を算出する（Ｓ８０７）。このようにして、ｎ個のツリー候補の期待処理時間が算出され、期待処理時間が最短になる選択順序が決定される（Ｓ８０９）。
これにより、簡易な手順によって、最適な選択順序を決定できる。 (7) In the above-described embodiment, a determination flow is prepared in which the n-1 sub discriminators 51 determine the selection order that minimizes the processing time and output the processing time (see FIG. 26). When determining the selection order in which the processing time is the shortest by the n sub discriminators 51, first, one sub discriminator 51 is selected, and the remaining n-1 sub discriminators 51 are the shortest in processing time. The selection order to become is determined by recursive processing, and the expected processing time of the tree candidate that the selected sub-identifier 51 is executed first is calculated (S807). In this way, the expected processing time of n tree candidates is calculated, and the selection order that minimizes the expected processing time is determined (S809).
Thereby, the optimal selection order can be determined by a simple procedure.

（８）前述のプリンタ（画像識別装置に相当）は、プリンタ側コントローラ２０を備えている（図２参照）。そして、このプリンタ側コントローラ２０は、画像データの示す画像が特定のカテゴリに属するか否かを識別するサブ識別器５１を複数組み合わせることによって、画像データの示す画像のシーンを分類している。また、プリンタ側コントローラ２０は、例えば風景識別器５１Ｌの識別結果に応じて、例えば夕景識別器５１Ｓによる処理を省略する。 (8) The above-described printer (corresponding to an image identification device) includes a printer-side controller 20 (see FIG. 2). The printer-side controller 20 classifies the scene of the image indicated by the image data by combining a plurality of sub-identifiers 51 that identify whether the image indicated by the image data belongs to a specific category. Further, the printer-side controller 20 omits, for example, the processing by the evening scene classifier 51S according to the classification result of the landscape classifier 51L.

そして、前述の実施形態では、プリンタ側コントローラ２０は、先に実行されるサブ識別器の判別式の値に応じて、次に実行されるサブ識別器５１を決定する。これにより、第１実施形態では、選択順序が固定された参考例よりも、全体識別処理の処理速度を速めることができる。 In the above-described embodiment, the printer-side controller 20 determines the sub classifier 51 to be executed next in accordance with the value of the discriminant of the sub classifier to be executed first. Thereby, in the first embodiment, the processing speed of the overall identification process can be increased as compared with the reference example in which the selection order is fixed.

（９）前述のメモリ２３には、図５、図７及び図１４の処理をプリンタ４に実行させるためのプログラムが記憶されている。すなわち、このプログラムは、画像データの示す画像が特定のカテゴリに属するか否かを識別する識別処理を、複数のカテゴリ毎に順に選択して行うコードと、ある識別処理の結果に応じて、まだ行われていない識別処理を省略するコードと、識別処理の結果に基づいて、画像のカテゴリを識別するコードと、を備えている。また、このプログラムには、画像識別装置に、先に行われる識別処理の結果に応じて、後に行われる識別処理の選択順序を決定させるコードも含まれている。 (9) The memory 23 stores a program for causing the printer 4 to execute the processes of FIGS. That is, according to the code for selecting the identification process for identifying whether or not the image indicated by the image data belongs to a specific category in order for each of a plurality of categories and the result of the identification process, A code that omits the identification process that is not performed, and a code that identifies the category of the image based on the result of the identification process. The program also includes a code for causing the image identification apparatus to determine the selection order of the identification process to be performed later according to the result of the identification process to be performed first.

画像処理システムの説明図である。It is explanatory drawing of an image processing system. プリンタの構成の説明図である。2 is an explanatory diagram of a configuration of a printer. FIG. プリンタの自動補正機能の説明図である。It is explanatory drawing of the automatic correction function of a printer. 画像のシーンと補正内容との関係の説明図である。It is explanatory drawing of the relationship between the scene of an image, and the correction content. シーン識別部によるシーン識別処理のフロー図である。It is a flowchart of the scene identification process by a scene identification part. シーン識別部の機能の説明図である。It is explanatory drawing of the function of a scene identification part. 全体識別処理のフロー図である。It is a flowchart of a whole identification process. 識別対象テーブルの説明図である。It is explanatory drawing of an identification object table. 全体識別処理の肯定閾値の説明図である。It is explanatory drawing of the affirmation threshold value of the whole identification process. RecallとPrecisionの説明図である。It is explanatory drawing of Recall and Precision. 第１否定閾値の説明図である。It is explanatory drawing of a 1st negative threshold value. 第２否定閾値の説明図である。It is explanatory drawing of a 2nd negative threshold value. 図１３Ａは、閾値テーブルの説明図である。図１３Ｂは、風景識別器における閾値の説明図である。図１３Ｃは、風景識別器の処理の概要の説明図である。FIG. 13A is an explanatory diagram of a threshold table. FIG. 13B is an explanatory diagram of threshold values in the landscape classifier. FIG. 13C is an explanatory diagram of an outline of the process of the landscape classifier. 部分識別処理のフロー図である。It is a flowchart of a partial identification process. 夕景部分識別器が選択する部分画像の順番の説明図である。It is explanatory drawing of the order of the partial image which an evening scene partial identifier selects. 上位１０番目までの１０個の部分画像だけで夕景画像の識別をしたときのRecall及びPrecisionのグラフである。It is a Recall and Precision graph when the evening scene image is identified only by the top 10 partial images. 図１７Ａは、線形サポートベクタマシンによる判別の説明図である。図１７Ｂは、カーネル関数を用いた判別の説明図である。FIG. 17A is an explanatory diagram of determination by the linear support vector machine. FIG. 17B is an explanatory diagram of discrimination using a kernel function. 統合識別処理のフロー図である。It is a flowchart of an integrated identification process. 参考例における選択順序の決定フローの説明図である。It is explanatory drawing of the determination flow of the selection order in a reference example. ある選択順序における処理時間の期待値を算出するフロー図である。It is a flowchart which calculates the expected value of the processing time in a certain selection order. サブ識別器５１の処理時間の表である。It is a table | surface of the processing time of the sub discrimination device 51. FIG. 風景識別器５１Ｌによって評価用サンプル画像を識別したときの判別式の値の確率分布である。This is a probability distribution of discriminant values when an evaluation sample image is identified by the landscape classifier 51L. 風景識別器５１Ｌが風景画像を識別できる確率Ｐpos１の説明図である。It is explanatory drawing of the probability Ppos1 in which the landscape identification device 51L can identify a landscape image. 風景識別器５１Ｌが夕景の否定フラグを立てる確率Ｐneg（１，２）の説明図である。It is explanatory drawing of the probability Pneg (1, 2) that the landscape discriminator 51L sets the evening scene negative flag. 風景識別器５１Ｌが夜景の否定フラグを立てる確率Ｐneg（１，３）の説明図である。It is explanatory drawing of probability Pneg (1, 3) that the landscape identification device 51L sets the negative flag of a night view. 第ｉサブ識別器が実行されたときに第ｊサブ識別器が実行される確率Ｐ（ｉ，ｊ）の表である。It is a table | surface of the probability P (i, j) by which a j-th sub discriminator is performed when an i-th sub discriminator is performed. ある選択順序における各サブ識別器５１の実行確率Ｐｎの算出するフロー図である。It is a flowchart which calculates the execution probability Pn of each sub discriminator 51 in a certain selection order. 第１実施形態における順序決定の際に参照されるツリー構造のデータ（ツリーデータ）の概念図である。It is a conceptual diagram of the data (tree data) of the tree structure referred in the case of order determination in 1st Embodiment. 第１実施形態における再帰的最適順序決定のフロー図である。It is a flowchart of recursive optimal order determination in 1st Embodiment. ２個のサブ識別器のツリーの候補（ツリー候補）の説明図である。It is explanatory drawing of the tree candidate (tree candidate) of two sub discriminators. ３個のサブ識別器のツリーの候補（ツリー候補）の説明図である。It is explanatory drawing of the tree candidate (tree candidate) of three sub classifiers.

Explanation of symbols

２デジタルスチルカメラ、２Ａモード設定ダイヤル、
４プリンタ、６メモリカード、
１０印刷機構、１１ヘッド、１２ヘッド制御部、１３モータ、１４センサ、
２０プリンタ側コントローラ、２１スロット、２２ＣＰＵ、２３メモリ、
２４制御ユニット、２５駆動信号生成部、
３１記憶部、３１Ａ画像記憶部、３１Ｂ結果記憶部、
３２顔識別部、３３シーン識別部、３４画像補正部、３５プリンタ制御部、
４０特徴量取得部、５０全体識別器、５１サブ識別器、５１Ｌ風景識別器、
５１Ｓ夕景識別器、５１Ｎ夜景識別器、５１Ｆ花識別器、５１Ｒ紅葉識別器、
６０部分識別器、６１サブ部分識別器、６１Ｓ夕景部分識別器、
６１Ｆ花部分識別器、６１Ｒ紅葉部分識別器、
７０統合識別器、 2 Digital still camera, 2A mode setting dial,
4 Printer, 6 Memory card,
10 printing mechanism, 11 head, 12 head control unit, 13 motor, 14 sensor,
20 printer-side controller, 21 slots, 22 CPU, 23 memory,
24 control unit, 25 drive signal generator,
31 storage unit, 31A image storage unit, 31B result storage unit,
32 face identification unit, 33 scene identification unit, 34 image correction unit, 35 printer control unit,
40 feature quantity acquisition unit, 50 global classifier, 51 sub classifier, 51L landscape classifier,
51S evening scene classifier, 51N night scene classifier, 51F flower classifier, 51R autumn leaves classifier,
60 partial classifiers, 61 sub partial classifiers, 61S evening scene partial classifiers,
61F Flower partial classifier, 61R Autumn colored partial classifier,
70 Integrated identifier,

Claims

Identification processing for identifying whether the image indicated by the image data belongs to a specific category is performed by selecting in order for each of the plurality of categories,
Depending on the result of the identification process, the identification process that has not yet been performed is omitted,
In the image identification method for identifying the category of the image based on the result of the identification process,
An image identification method, wherein a selection order of identification processes to be performed later is determined in accordance with a result of identification processes to be performed first.

The image identification method according to claim 1,
An image identification method characterized in that, when it is identified in a certain identification process that the image belongs to a specific category corresponding to the identification process, the identification process that has not yet been performed is omitted.

The image identification method according to claim 1 or 2,
An image identification method in which, when it is identified that the image does not belong to a specific category corresponding to an identification process different from the identification process in the identification process, the another identification process is omitted. .

The image identification method according to any one of claims 1 to 3,
In the identification process, a value corresponding to the probability that the image belongs to the specific category is calculated, and based on the value, it is identified whether the image belongs to the specific category,
An image identification method, wherein a selection order of identification processing to be performed later is determined according to the value calculated in the identification processing performed first.

The image identification method according to any one of claims 1 to 4,
The image identification method, wherein the selection order is determined by using data in which a result of the identification processing performed first and an identification processing performed next are associated with each other.

The image identification method according to any one of claims 1 to 5,
The processing time until the category of the image is identified based on the processing time of the identification processing performed later and the occurrence probability of the result of the identification processing performed earlier for a plurality of candidates having different selection orders Expected value of
The image identification method, wherein the selection order is determined by selecting a candidate having a short expected value from the plurality of candidates.

The image identification method according to claim 6, comprising:
There is a determination process for determining a candidate with a short expected value when performing n-1 identification processes,
When determining a candidate with a short expected value when performing n identification processes, the expected value is calculated using the determination process for each of n candidates that have different identification processes to be performed first. An image identification method, wherein the selection order is determined by selecting a candidate with a short expected value from candidates.

An image identification device comprising a controller,
The controller is
Identification processing for identifying whether the image indicated by the image data belongs to a specific category is performed by selecting in order for each of the plurality of categories,
Depending on the result of the identification process, the identification process that has not yet been performed is omitted,
Based on the result of the identification process, the category of the image is identified,
An image identification apparatus, wherein a selection order of identification processing performed later is determined according to the value calculated in the identification processing performed earlier.

In the image identification device,
Identification processing for identifying whether an image indicated by image data belongs to a specific category is performed by selecting in order for each of the plurality of categories,
Depending on the result of the identification process, the identification process that has not yet been performed is omitted,
In a program for identifying the category of the image based on the result of the identification process,
A program for causing the image identification apparatus to determine a selection order of identification processing to be performed later according to a result of identification processing to be performed first.