JP2022175451A

JP2022175451A - Imaging apparatus, control method for the same, and program

Info

Publication number: JP2022175451A
Application number: JP2021081834A
Authority: JP
Inventors: 俊司藤田; Shunji Fujita
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2021-05-13
Filing date: 2021-05-13
Publication date: 2022-11-25
Also published as: US20220368827A1; US11689800B2

Abstract

To provide an imaging apparatus capable of automatically capturing an image at a suitable timing predicted from the current image.SOLUTION: An imaging apparatus includes prediction means for generating a prediction image, in which an image at a second point in time future than a first point in time is predicted using an image at the first point in time output from imaging means, determination means for determining, with respect to the prediction image, a degree of suitability indicating the degree to which the prediction image is suitable as an image to be recorded, and control means for controlling the imaging means so as to capture an image when the time reaches the second point in time when the determined suitability is higher than a predetermined threshold.SELECTED DRAWING: Figure 4

Description

本発明は、撮像装置およびその制御方法ならびにプログラムに関する。 The present invention relates to an imaging apparatus, its control method, and program.

近年、機械学習を用いた画像予測技術の開発が盛んに行われている。被写体の行動シーンが撮影された大量の映像データを教師データとして学習させた推論モデルに対してカメラの撮影映像を入力することにより、被写体の未来の画像を予測することができる。特許文献１は、車載カメラの撮影画像を学習済みの畳み込みニューラルネットワークに入力し、カメラを備える車両がそのまま走行した場合に、運動物体が出現して衝突する可能性のある危険領域とその特徴を推定する技術を開示している。 In recent years, the development of image prediction techniques using machine learning has been actively carried out. A future image of the subject can be predicted by inputting the image captured by the camera to an inference model that has learned a large amount of video data in which the action scene of the subject is captured as training data. In Patent Document 1, an image captured by an in-vehicle camera is input to a trained convolutional neural network, and when a vehicle equipped with a camera continues to run, a moving object may appear and collide with a dangerous area and its characteristics. A technique for estimating is disclosed.

特開２０１７－１６２４３８号公報JP 2017-162438 A

ところで、カメラやスマートフォンなどの撮影アシスト機能に、被写体の未来の画像を予測する技術を適用することにより、より使い勝手を向上させた撮影アシスト機能が提供されることが期待されている。例えば、カメラやスマートフォンの撮影アシスト機能により、被写体の変化に応じた好適な撮影タイミングで自動的に撮影が実行されればユーザの使い勝手が向上することが考えられる。特許文献１では、将来の画像を予測する技術を利用しているものの、当該技術を撮影アシスト機能に用いることは考慮していなかった。 By the way, by applying technology for predicting the future image of a subject to the shooting assist function of cameras, smartphones, etc., it is expected that the shooting assist function with improved usability will be provided. For example, it is conceivable that user-friendliness will be improved if a photographing assist function of a camera or a smartphone automatically executes photographing at suitable photographing timing according to changes in the subject. Japanese Patent Application Laid-Open No. 2002-200000 uses a technique for predicting future images, but does not consider using the technique for the shooting assist function.

本発明は、上記課題に鑑みてなされ、その目的は、現在の画像から予測される好適なタイミングにおいて自動的に撮影することが可能な技術を実現することである。 SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and an object thereof is to realize a technique capable of automatically capturing an image at a suitable timing predicted from the current image.

この課題を解決するため、例えば本発明の撮像装置は以下の構成を備える。すなわち、
撮像手段から出力される第１時点の画像を用いて、前記第１時点より未来の第２時点の画像を予測した予測画像を生成する予測手段と、
前記予測画像について、前記予測画像が記録する画像として適している度合いを示す好適度を決定する決定手段と、
前記決定された好適度が所定の閾値より高い場合に、時刻が前記第２時点になったときに画像を撮像するように前記撮像手段を制御する制御手段と、を有することを特徴とする。 In order to solve this problem, for example, the imaging device of the present invention has the following configuration. i.e.
Prediction means for generating a predicted image by predicting an image at a second point in the future from the first point in time using an image at a first point in time output from an imaging means;
determination means for determining a degree of suitability of the predicted image as an image to be recorded;
and control means for controlling the imaging means to capture an image when the time reaches the second point in time when the determined suitability is higher than a predetermined threshold.

本発明によれば、現在の画像から予測される好適なタイミングにおいて自動的に撮影することが可能になる。 According to the present invention, it is possible to automatically shoot an image at a suitable timing predicted from the current image.

本発明に係る撮像装置の一例のとしてのカメラの構成例を示すブロック図1 is a block diagram showing a configuration example of a camera as an example of an imaging device according to the present invention; FIG. 本実施形態における画像予測部の構成例を示す図FIG. 4 is a diagram showing a configuration example of an image prediction unit according to the present embodiment; 本実施形態における好適決定部の構成例を示す図A diagram showing a configuration example of a suitable determination unit according to the present embodiment. 本実施形態におけるＬＶ画像生成から自動撮影までのデータの流れを示す図A diagram showing a data flow from LV image generation to automatic photographing in the present embodiment. 本実施形態における生成される予測画像と時刻の関係を示す図A diagram showing the relationship between predicted images generated in the present embodiment and time. 本実施形態における撮影予約管理テーブルの一例を示す図A diagram showing an example of a shooting reservation management table according to the present embodiment. 本実施形態における自動撮影処理の一連の動作を示すフローチャートFlowchart showing a series of operations of automatic photographing processing in the present embodiment 本実施形態における通知情報の表示画面例を示す図A diagram showing an example of a display screen of notification information according to the present embodiment.

以下、添付図面を参照して実施形態を詳しく説明する。なお、以下の実施形態は特許請求の範囲に係る発明を限定するものではない。実施形態には複数の特徴が記載されているが、これらの複数の特徴の全てが発明に必須のものとは限らず、また、複数の特徴は任意に組み合わせられてもよい。さらに、添付図面においては、同一若しくは同様の構成に同一の参照番号を付し、重複した説明は省略する。 Hereinafter, embodiments will be described in detail with reference to the accompanying drawings. In addition, the following embodiments do not limit the invention according to the scope of claims. Although multiple features are described in the embodiments, not all of these multiple features are essential to the invention, and multiple features may be combined arbitrarily. Furthermore, in the accompanying drawings, the same or similar configurations are denoted by the same reference numerals, and redundant description is omitted.

以下では撮像装置の一例として、機械学習モデルを用いて好適なタイミングで自動的に撮影することができるカメラを用いる例を説明する。しかし、本実施形態は、カメラに限らず、機械学習モデルを用いて好適なタイミングで自動的に撮影することが可能な他の任意の機器にも適用可能である。 In the following, as an example of an imaging device, an example using a camera capable of automatically capturing an image at a suitable timing using a machine learning model will be described. However, the present embodiment is applicable not only to cameras but also to any other device capable of automatically capturing images at suitable timings using machine learning models.

＜カメラの構成＞
本実施形態に係る撮像装置の一例としてのカメラ１００の機能構成例について、図１を参照して説明する。なお、図１に示す機能ブロックの１つ以上は、ＡＳＩＣやプログラマブルロジックアレイ（ＰＬＡ）などのハードウェアによって実現されてもよいし、ＣＰＵやＧＰＵ等のプログラマブルプロセッサがソフトウェアを実行することによって実現されてもよい。また、ソフトウェアとハードウェアの組み合わせによって実現されてもよい。従って、以下の説明において、異なる機能ブロックが動作主体として記載されている場合であっても、同じハードウェアが主体として実現されうる。 <Camera configuration>
An example of the functional configuration of a camera 100 as an example of an imaging device according to this embodiment will be described with reference to FIG. One or more of the functional blocks shown in FIG. 1 may be implemented by hardware such as an ASIC or programmable logic array (PLA), or implemented by a programmable processor such as a CPU or GPU executing software. may It may also be implemented by a combination of software and hardware. Therefore, in the following description, even when different functional blocks are described as main entities, the same hardware can be implemented as the main entities.

１０１は、制御部である。制御部１０１は、例えば１つ以上のＣＰＵなどのプロセッサを含み、ＲＯＭ１０２或いは記憶媒体１０８に格納されるプログラムを読み出して実行することにより、カメラ１００全体の動作を制御する。制御部１０１は更にＧＰＵを含んでもよい。また、制御部１０１は、撮影予約を行うためのハードウェアタイマ（タイマ機能ともいう）を備える。後述するように、制御部１０１は、好適決定部１１０からの出力に基づいて、画像予測部１０９から出力される予測画像が所定の条件を満たすかを判定したり、予測画像の未来時刻に撮影予約を設定したりする。 101 is a control unit. The control unit 101 includes processors such as one or more CPUs, for example, and controls the overall operation of the camera 100 by reading and executing programs stored in the ROM 102 or storage medium 108 . The control unit 101 may further include a GPU. The control unit 101 also includes a hardware timer (also referred to as a timer function) for making a shooting reservation. As will be described later, the control unit 101 determines whether the predicted image output from the image prediction unit 109 satisfies a predetermined condition based on the output from the suitable determination unit 110, and determines whether the predicted image will be captured at a future time. set up a reservation.

１０２は、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）であり、例えば不揮発性の半導体メモリで構成される。ＲＯＭ１０２は、制御部１０１によって実行される制御プログラムを格納する。 Reference numeral 102 denotes a ROM (Read Only Memory), which is composed of, for example, a non-volatile semiconductor memory. ROM 102 stores a control program executed by control unit 101 .

１０３は、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）であり、ＲＡＭ１０３は、例えば揮発性の半導体メモリで構成される。ＲＡＭ１０３は、制御部１０１がプログラムを実行するためのワークメモリとして、或いは、各種データの一時格納領域などとして使用される。 103 is a RAM (Random Access Memory), and the RAM 103 is composed of, for example, a volatile semiconductor memory. The RAM 103 is used as a work memory for the control unit 101 to execute programs, or as a temporary storage area for various data.

１０４は、撮像部であり、撮像部１０４は、例えば、撮像レンズ、絞り、シャッター、ＣＭＯＳセンサ等の撮像素子、およびＡ／Ｄ変換器等で構成される。撮像部１０４は、撮像レンズを介して入力されるアナログ信号をデジタルデータに変換して画像データを取得する。画像データは、ＲＡＭ１０３に格納される。 Reference numeral 104 denotes an imaging unit, and the imaging unit 104 includes, for example, an imaging lens, an aperture, a shutter, an imaging element such as a CMOS sensor, an A/D converter, and the like. The imaging unit 104 acquires image data by converting an analog signal input via the imaging lens into digital data. Image data is stored in the RAM 103 .

撮像部１０４は、カメラが撮影しているライブビュー画像（ＬＶ画像ともいう）と記録用の静止画データを生成する。ＬＶ画像は、撮影操作中に撮像部１０４から所定のフレームレートで順次出力され、リアルタイムに表示部１０６に表示されるＹＵＶ形式の画像データである。またＬＶ画像は、画像予測部１０９にも入力される。記録用の静止画データは、ユーザの撮影指示によって生成される高解像度の画像データであり、記録部１０７を介して記憶媒体１０８に記録される。記録用の静止画データは高解像度であるため、例えばＪＰＥＧ形式に圧縮される。 The imaging unit 104 generates a live view image (also referred to as an LV image) captured by the camera and still image data for recording. The LV image is YUV format image data that is sequentially output from the imaging unit 104 at a predetermined frame rate during a shooting operation and displayed on the display unit 106 in real time. The LV image is also input to the image prediction unit 109 . The still image data for recording is high-resolution image data generated by the user's shooting instruction, and is recorded in the storage medium 108 via the recording unit 107 . Since still image data for recording has high resolution, it is compressed into JPEG format, for example.

１０５は、操作部である。操作部１０５は、ユーザの操作を受け付け、入力された情報を制御部１０１に通知する。操作部１０５は、例えばタッチパネル、ボタン、スイッチ、及び十字キーの少なくともいずれかによって構成されてよい。 105 is an operation unit. The operation unit 105 receives user operations and notifies the control unit 101 of the input information. The operation unit 105 may be configured by at least one of a touch panel, buttons, switches, and a cross key, for example.

１０６は、表示部である。表示部１０６は、カメラ１００の状態に関する情報や、撮像部１０４によって生成されるライブビュー画像、記憶媒体１０８に記録されている画像データなどをユーザに表示する処理部である。表示部１０６は、例えば液晶パネル或いはＬＥＤ等によって構成される。 106 is a display unit. The display unit 106 is a processing unit that displays information regarding the state of the camera 100, live view images generated by the imaging unit 104, image data recorded in the storage medium 108, and the like to the user. The display unit 106 is configured by, for example, a liquid crystal panel or an LED.

１０７は、記録部である。記録部１０７は、記憶媒体１０８に対するデータの読み書きの制御を行う。記録部１０７は、記憶媒体１０８の初期化や、記憶媒体１０８とＲＡＭ１０３の間のデータ転送を制御する。 107 is a recording unit. The recording unit 107 controls reading and writing of data with respect to the storage medium 108 . The recording unit 107 controls initialization of the storage medium 108 and data transfer between the storage medium 108 and the RAM 103 .

１０８は、記憶媒体である。記憶媒体１０８は、大容量の記憶領域を備えるデバイスであり、例えばメモリーカード或いはフラッシュメモリ等で構成される。記憶媒体１０８は、制御部１０１によって実行される制御プログラムを格納してもよい。 108 is a storage medium. The storage medium 108 is a device having a large-capacity storage area, and is composed of, for example, a memory card or flash memory. Storage medium 108 may store a control program executed by control unit 101 .

１０９は、画像予測部である。画像予測部１０９は、画像予測モデルを構成する。画像予測部１０９は、当該画像予測モデルの処理を実行するための回路又はプロセッサによって実現されてもよいし、制御部１０１によって実行される、画像予測モデルの処理のためのソフトウェアによって実現されてもよい。画像予測部１０９は、画像を入力し、その画像の未来時刻における予測画像を生成する。 109 is an image prediction unit. The image prediction unit 109 configures an image prediction model. The image prediction unit 109 may be realized by a circuit or processor for executing the processing of the image prediction model, or may be realized by software for processing the image prediction model executed by the control unit 101. good. The image prediction unit 109 receives an image and generates a predicted image of the image at future time.

画像予測部１０９の具体的な構成例について、図２（Ａ）を参照して説明する。図２（Ａ）に示すように、画像予測部１０９は、複数の予測ユニット２０２が連結された構成を含む。画像予測部１０９は、画像２０１が入力されると画像予測モデルの処理を行い、複数の予測画像２０３を出力する。予測ユニット２０２は、ある時刻で撮影された入力画像の次の時刻に対応する予測画像を生成して出力する。予測画像は、ある時刻（第１時点）で撮影された画像を用いて、当該時刻の次の時刻のタイミング（第２時点）において被写体や背景がどのように撮影されるかを予測して生成される画像である。また、予測画像が入力された予測ユニット２０２は、さらに次の時刻の予測画像を生成する。このように画像予測部１０９は、連結された予測ユニット２０２の数と同じ数の複数の予測画像を生成する。 A specific configuration example of the image prediction unit 109 will be described with reference to FIG. As shown in FIG. 2A, the image prediction unit 109 includes a configuration in which a plurality of prediction units 202 are connected. When the image 201 is input, the image prediction unit 109 performs image prediction model processing and outputs a plurality of predicted images 203 . The prediction unit 202 generates and outputs a prediction image corresponding to the time next to the input image captured at a certain time. A predicted image is generated by using an image captured at a certain time (first time point) to predict how the subject and background will be captured at the next time (second time point). This is an image that will be Also, the prediction unit 202 to which the predicted image is input further generates a predicted image for the next time. In this way, the image prediction unit 109 generates a plurality of predicted images as many as the number of connected prediction units 202 .

図２（Ｂ）は、それぞれの予測ユニット２０２の具体的な構成例を示している。予測ユニット２０２は、例えばニューラルネットワークによって構成され、入力画像の特徴情報を抽出する畳み込みネットワーク２０４と、過去の状態を記憶して次の状態を推論する再帰型ニューラルネットワーク２０５とを含む。畳み込みネットワーク２０４は、画像認識用のディープラーニング技術として知られるＣＮＮ（ＣｏｎｖｏｌｕｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ）を構成するネットワークと同様であってよい。畳み込みネットワーク２０４は、フィルタを用いて画像の特徴点を凝縮する畳み込み処理と、重要な特徴点の情報を維持しながら画像データの情報量を縮小するプーリング処理を行うニューラルネットワークである。再帰型ニューラルネットワーク２０５は、図２（Ｂ）に示すように、再帰型ニューラルネットワーク２０５内の隠れ層（中間層ともいう）において状態記憶のための再帰結合をもつネットワークである。再帰型ニューラルネットワーク２０５は、再帰結合により時系列変化を伴う特徴を加味することができ、過去の画像に基づいて未来の予測画像を生成する。なお、再帰型ニューラルネットワーク２０５は、例えば長期記憶に適した再帰型ニューラルネットワークとして知られているＬＳＴＭ（ＬｏｎｇＳｈｏｒｔ－ＴｅｒｍＭｅｍｏｒｙ）を用いることができる。なお、ある時刻で撮影された画像に基づいて、当該時刻の次の時刻のタイミングにおける予測画像を生成可能な構成であれば、他の構成であってもよい。また、予測ユニット２０２のニューラルネットワークは、例えば、動画の第１時点の画像を入力画像とし、動画の次の時刻のタイミングである第２時点の画像を正解データとするような学習データを用いて学習させることができる。 FIG. 2B shows a specific configuration example of each prediction unit 202 . The prediction unit 202 is configured by, for example, a neural network, and includes a convolutional network 204 that extracts feature information of an input image, and a recurrent neural network 205 that stores past states and infers the next state. The convolutional network 204 may be similar to a network forming a CNN (Convolutional Neural Network) known as deep learning technology for image recognition. The convolutional network 204 is a neural network that performs convolutional processing for condensing image feature points using a filter and pooling processing for reducing the information amount of image data while maintaining important feature point information. The recursive neural network 205 is, as shown in FIG. 2B, a network having recursive connections for state storage in hidden layers (also called intermediate layers) within the recursive neural network 205 . The recursive neural network 205 can add features that accompany time-series changes by recursive connection, and generates future predicted images based on past images. For the recurrent neural network 205, for example, an LSTM (Long Short-Term Memory) known as a recurrent neural network suitable for long-term memory can be used. It should be noted that any other configuration may be used as long as it is capable of generating a predicted image at the next time based on an image captured at a certain time. In addition, the neural network of the prediction unit 202 uses learning data such that, for example, the image at the first time point of the moving image is the input image, and the image at the second time point, which is the timing of the next time point in the moving image, is the correct data. can be learned.

１１０は、好適決定部である。好適決定部１１０は、画像が入力されると、その画像に対する撮影記録の好適度を決定する。ここで好適度情報とは、入力画像が記録する画像（記録画像ともいう）として適しているかどうかの度合い、すなわちその記録画像がユーザに好まれるかどうかの度合いを示す情報である。好適決定部１１０は、撮影記録の好適度を決定する処理を実行するための回路又はプロセッサによって実現されてもよいし、制御部１０１によって実行される、当該好適度の決定処理のためのソフトウェアによって実現されてもよい。図３を参照して、好適決定部１１０の構成例について説明する。好適決定部１１０は、上述したＣＮＮを用いて構成されてよい。好適決定部１１０は、入力される画像３０１の特徴情報を抽出する畳み込みネットワーク３０２と、事前に好適度の高い大量の画像を学習データとして使って学習処理が実施された好適度決定ネットワーク３０３とを含む。畳み込みネットワーク３０２の構成は、上述した畳み込みネットワーク２０５と同様である。好適度決定ネットワーク３０３は、隠れ層に順伝搬型の全結合層を構成するネットワークであり、特徴情報に基づく好適度の分類を行った結果、出力層において好適度情報３０４を出力する。学習データは、例えば、フィギュアスケートの撮影では、ジャンプの頂点に達して選手の顔が正面を向いている画像に高い好適度を付与したデータ、ジャンプを始めている状態や顔が正面を向いていない画像に、より低い好適度を付与したデータが含まれてよい。学習データは、フィギュアスケートの例に限らず、様々なスポーツや様々な撮影シーンについて高い或いは低い好適度を付与した画像を含んでよい。 110 is a suitable determination unit. When an image is input, the suitability determination unit 110 determines the suitability of photographing and recording for the image. Here, the suitability information is information indicating the degree to which an input image is suitable as an image to be recorded (also referred to as a recorded image), that is, the degree to which the user likes the recorded image. The suitability determination unit 110 may be realized by a circuit or processor for executing processing for determining the suitability of shooting recording, or may be implemented by software for the suitability determination processing executed by the control unit 101. may be implemented. A configuration example of the suitable determination unit 110 will be described with reference to FIG. The suitable determining unit 110 may be configured using the CNN described above. The suitability determination unit 110 includes a convolution network 302 that extracts feature information of an input image 301, and a suitability determination network 303 that has undergone learning processing in advance using a large number of images with high suitability as learning data. include. The configuration of convolutional network 302 is similar to convolutional network 205 described above. The suitability determination network 303 is a network that configures a forward propagation type fully connected layer in the hidden layer, and outputs suitability information 304 in the output layer as a result of suitability classification based on feature information. For example, when photographing figure skating, data in which a high degree of suitability is assigned to an image in which the athlete's face is facing the front after reaching the top of the jump, data in which the athlete's face is facing the front at the beginning of the jump, and data in which the athlete's face is not facing the front. The image may contain data that gave it a lower preference rating. The learning data is not limited to the example of figure skating, and may include images given high or low suitability for various sports and various shooting scenes.

例えば、制御部１０１は、好適決定部１１０から出力された好適度情報に基づいて、記録画像として好適な予測画像であるかどうかを判定する。また、制御部１０１は、記録画像として好適であると判定された予測画像の未来時刻に撮影予約を設定する。 For example, based on the suitability information output from the suitability determining unit 110, the control unit 101 determines whether or not the predicted image is suitable as the recording image. Further, the control unit 101 sets a shooting reservation at the future time of the predicted image determined to be suitable as the recording image.

１１１は、上述した各処理部の間でやりとりされる制御信号を接続する内部バスである。 Reference numeral 111 denotes an internal bus that connects control signals exchanged between the processing units described above.

＜自動撮影処理の概要＞
次に、本実施形態に係る、ＬＶ画像が生成されてから自動撮影が行われるまでの処理（単に自動撮影処理ともいう）の概要について説明する。本実施形態に係る自動撮影処理は、一例として、撮影アシスト機能として用いられることが想定される。例えば、フィギュアスケートの撮影に自動撮影処理を適用する場合を考える。今からジャンプを飛ぼうとしている選手をＬＶ映像に含めると、ジャンプの頂点に到達して選手の顔が正面を向いているタイミング、すなわち撮影に好適なタイミングで自動的に撮影を行うことが可能になる。 <Overview of automatic shooting processing>
Next, an outline of the processing (simply referred to as automatic imaging processing) from the generation of the LV image to the execution of automatic imaging according to the present embodiment will be described. As an example, it is assumed that the automatic photographing process according to the present embodiment is used as a photographing assist function. For example, consider a case where automatic shooting processing is applied to figure skating shooting. If a player about to jump is included in the LV video, it is possible to automatically shoot at the timing when the player's face is facing the front after reaching the top of the jump, that is, at a suitable timing for shooting. become.

以下、図４を参照して、自動撮影処理の概要について説明する。撮像部１０４は、撮像部１０４が稼働している間に一定の時間間隔でＬＶ画像を生成して、生成したＬＶ画像を表示部１０６と画像予測部１０９に送信する。本実施形態では一例として、ＬＶ画像は、毎秒２０フレーム（５０ミリ秒間隔）のスピードで、解像度が８００×６００であるＹＵＶデータとして生成されるものとする。表示部１０６は、送信されるＬＶ画像をリアルタイムに表示する。 An outline of the automatic photographing process will be described below with reference to FIG. The imaging unit 104 generates LV images at regular time intervals while the imaging unit 104 is operating, and transmits the generated LV images to the display unit 106 and the image prediction unit 109 . As an example in this embodiment, the LV image is generated as YUV data with a resolution of 800×600 at a speed of 20 frames per second (50 millisecond intervals). The display unit 106 displays the transmitted LV image in real time.

画像予測部１０９は、入力されるＬＶ画像に基づいて、複数の未来時刻における予測画像を生成し、生成した予測画像を予測画像バッファ４０１に格納する。予測画像バッファ４０１は、ＲＡＭ１０３の一部に構成される予測画像を一時的に記憶するメモリ領域である。 The image prediction unit 109 generates predicted images at a plurality of future times based on the input LV images, and stores the generated predicted images in the predicted image buffer 401 . A predicted image buffer 401 is a memory area that temporarily stores a predicted image configured as part of the RAM 103 .

なお、本実施形態における画像予測部１０９は、一例として２０個の予測ユニット２０２で構成されるものとする。画像予測部１０９は、５０ミリ秒間隔で入力されるＬＶ画像を使って、最大１秒後（５０ミリ秒×２０）までの予測画像を生成する。例えば、画像予測部１０９が生成する予測画像と時刻の関係は、図５に示すようになる。 Note that the image prediction unit 109 in this embodiment is configured with 20 prediction units 202 as an example. The image prediction unit 109 uses LV images input at intervals of 50 milliseconds to generate predicted images up to 1 second later (50 milliseconds×20). For example, the relationship between the predicted image generated by the image prediction unit 109 and time is as shown in FIG.

画像予測部１０９は、時刻ｔ０において、ＬＶ画像ｆ_ｔ０を使った画像予測を実行し、未来時刻ｔ１～ｔ２０に対応する予測画像ｆ_ｔ１～ｆ_ｔ２０を生成する。続いて５０ミリ秒後の次の時刻ｔ１では、画像予測部１０９は、ＬＶ画像ｆ_ｔ１を使った画像予測を実行し、未来時刻ｔ２～ｔ２１に対応する予測画像ｆ_ｔ２～ｆ_ｔ２１を生成する。以降の時刻ｔ２、ｔ３も同様に、２０個の予測画像が生成される。 The image prediction unit 109 performs image prediction using the LV image f _t0 at time t0, and generates predicted images f _t1 to f _t20 corresponding to future times t1 to t20. Subsequently, at the next time t1 after 50 milliseconds, the image prediction unit 109 executes image prediction using the LV image f _t1 to generate predicted images f _t2 to f _t21 corresponding to future times t2 to t21. . Similarly, 20 predicted images are generated at subsequent times t2 and t3.

次に、制御部１０１は、予測画像バッファ４０１に格納された予測画像を好適決定部１１０に入力し、好適決定部１１０から好適度情報を取得する。制御部１０１は、好適決定部１１０から取得した好適度情報に基づいて、対応する予測画像に対して撮影記録を実行すべきかどうかを判定し、実行すべきであると判定した場合には、対応する未来時刻にあわせて撮影予約を設定する。具体的には、制御部１０１は、対応する未来時刻にあわせて撮像部１０４による撮影記録を実行するようにタイマ機能を実行し、撮影予約の対象の予測画像と好適度情報を撮影予約管理テーブルに記録する。撮影予約管理テーブルは、例えば、記憶媒体１０８に記憶され、図６に示すテーブルのように構成される。図６に示すＩＤは、それぞれの撮影予約を識別するための番号であり、撮影の実行順にも相当してもよい。図６に示す予測画像ＩＤは、撮影予約の対象となる予測画像を識別する情報であり、例えば、対応する時刻情報が分かるように表現される。時刻情報は、ＬＶ画像からの相対的な時刻でも良いし、絶対的な時刻であってもよい。図６に示す好適度は、撮影予約の対象である予測画像の好適度を示す情報であり、例えば１００段階のパーセント（％）で示される。好適度は、記録画像として適しているかどうかの度合いを示すものであれば、例えば０から１までの小数値など他の数値であってもよい。図６に示す例では、３つの撮影予約が設定されている状態を示す。 Next, the control unit 101 inputs the predicted image stored in the predicted image buffer 401 to the suitability determination unit 110 and acquires suitability information from the suitability determination unit 110 . Based on the suitability information acquired from the suitability determination unit 110, the control unit 101 determines whether or not the corresponding predicted image should be photographed and recorded. Set the shooting reservation according to the future time. Specifically, the control unit 101 executes a timer function so that the image capturing unit 104 performs shooting recording in accordance with the corresponding future time, and stores the predicted image of the shooting reservation target and the suitability information in the shooting reservation management table. to record. The photography reservation management table is stored in the storage medium 108, for example, and configured like the table shown in FIG. The ID shown in FIG. 6 is a number for identifying each photography reservation, and may correspond to the execution order of photography. The predicted image ID shown in FIG. 6 is information for identifying a predicted image to be scheduled for shooting, and is expressed, for example, so that the corresponding time information can be known. The time information may be relative time from the LV image or absolute time. The degree of suitability shown in FIG. 6 is information indicating the degree of suitability of the predicted image that is the target of the shooting reservation, and is indicated by percentage (%) in 100 steps, for example. The degree of suitability may be another numerical value such as a decimal value from 0 to 1 as long as it indicates the degree of suitability as a recording image. The example shown in FIG. 6 shows a state in which three shooting reservations are set.

なお、上述した画像予測から撮影予約までの一連の処理は、ＬＶ画像の生成周期（例えば５０ミリ秒）の時間内に終了するようにリアルタイム性を保証しながら実行される。 Note that the series of processes from image prediction to shooting reservation described above are executed while ensuring real-time performance so as to be completed within the LV image generation cycle (for example, 50 milliseconds).

＜自動撮影処理に係る一連の動作＞
次に、図７を参照して、自動撮影処理に係る一連の動作について説明する。なお、本処理は、特に明示する場合を除き、制御部１０１がＲＯＭ１０２或いは記憶媒体１０８に記憶されたプログラムをＲＡＭ１０３の作業用領域に展開、実行することにより実現される。また、本処理は、画像予測部１０９及び好適決定部１１０における機械学習モデル（ニューラルネットワーク）が学習済みの状態で実行される。更に、本処理は、例えば操作部１０５に対するユーザ操作により、カメラ１００の動作モードが撮影モードに遷移した時点で開始される。ここで撮影モードは、カメラ１００の動作モードの１つであり、ユーザが撮影記録ボタンを押すことによっていつでも静止画を撮影できる動作モードである。例えば、撮像機能を稼働せず、メモリーカードに記録された静止画を液晶モニタで再生表示させるような動作モードは、これに該当しない。 <A series of operations related to automatic shooting processing>
Next, with reference to FIG. 7, a series of operations related to automatic photographing processing will be described. Note that this process is realized by the control unit 101 expanding a program stored in the ROM 102 or the storage medium 108 into a working area of the RAM 103 and executing the program, unless otherwise specified. Further, this processing is executed in a state where the machine learning model (neural network) in the image prediction unit 109 and the suitable determination unit 110 has been trained. Further, this process is started when the operation mode of the camera 100 is changed to the shooting mode by the user's operation on the operation unit 105, for example. Here, the shooting mode is one of operation modes of the camera 100, and is an operation mode in which the user can shoot a still image at any time by pressing a shooting record button. For example, this does not apply to an operation mode in which a still image recorded in a memory card is reproduced and displayed on a liquid crystal monitor without operating the imaging function.

Ｓ７０１において、撮像部１０４は、ＬＶ画像を１フレーム生成して、画像予測部１０９に出力する。Ｓ７０２において、画像予測部１０９は、撮像部１０４から出力されたＬＶ画像に基づいて複数の未来時刻に対応する予測画像を生成し、生成した予測画像を予測画像バッファ４０１に格納する。画像予測部１０９は生成した画像を画像バッファ４０１に格納すると、制御部１０１に通知する。Ｓ７０３において、制御部１０１は、予測画像バッファ４０１から予測画像を１つ取り出して好適決定部１１０に入力し、好適決定部１１０から対応する好適度情報を取得する。 In S701 , the imaging unit 104 generates one frame of LV image and outputs it to the image prediction unit 109 . In S702 , the image prediction unit 109 generates predicted images corresponding to a plurality of future times based on the LV image output from the imaging unit 104 and stores the generated predicted images in the predicted image buffer 401 . When the image prediction unit 109 stores the generated image in the image buffer 401 , it notifies the control unit 101 . In S703 , the control unit 101 extracts one predicted image from the predicted image buffer 401 , inputs it to the suitable determination unit 110 , and acquires the corresponding suitability information from the suitable determination unit 110 .

Ｓ７０４において、制御部１０１は、好適決定部１１０から取得した好適度情報を用いて、撮影予約を実行すべきかを判定する。例えば、制御部１０１は、好適度が所定の閾値より大きい場合、対応する予測画像の撮影予約を実行すべきであると判定する。所定の閾値は、例えば７０％などでよい。制御部１０１は、好適度が所定の閾値より大きい場合、撮影予約を実行すべきと判定してＳ７０５に進み、好適度が所定の閾値以下である場合、撮影予約を実行すべきでないと判定してＳ７１１に進む。 In step S704 , the control unit 101 uses the suitability information acquired from the suitability determination unit 110 to determine whether to execute a shooting reservation. For example, when the degree of suitability is greater than a predetermined threshold, the control unit 101 determines that the shooting reservation for the corresponding predicted image should be executed. The predetermined threshold may be, for example, 70%. If the degree of suitability is greater than the predetermined threshold, the control unit 101 determines that the photographing reservation should be executed, and advances to step S705. to proceed to S711.

Ｓ７０５において、制御部１０１は、予約すべきと判定した未来時刻の近い時刻に、他の撮影予約が存在するかを判定する。例えば、制御部１０１は、記憶媒体１０８の撮影予約管理テーブルを参照して、Ｓ７０４にて予約すべきと判定した未来時刻に近い時刻に、既に他の撮影予約が設定されているかどうかを判定する。制御部１０１は、撮影予約が設定されていると判定した場合はＳ７０９に進み、撮影予約が設定されていないと判定した場合はＳ７０６に進む。ここで、近い時刻に既に他の撮影予約が設定されている状態とは、互いの時間間隔が所定の時間範囲より狭く、例えば画像一枚あたりの撮影記録処理に要する時間よりも短いために、両方の撮影予約を有効にすることができない状態を意味する。本実施形態では、一例として、画像一枚あたりの撮影記録処理に要する時間を２００ミリ秒とする。この場合、近い時刻に他の撮影予約が設定されているかどうかの判定は、前後２００ミリ秒に他の撮影予約が含まれるかどうかの判定となる。このように、撮影予約の重複を判定することにより、現に実行することが難しい撮影予約を設定してしまうことを回避することができる。 In step S705 , the control unit 101 determines whether there is another imaging reservation at a time close to the future time determined to be reserved. For example, the control unit 101 refers to the shooting reservation management table in the storage medium 108 and determines whether another shooting reservation has already been set at a time close to the future time determined to be reserved in S704. . If the control unit 101 determines that a shooting reservation has been set, the process advances to step S709, and if it determines that a shooting reservation has not been set, the process advances to step S706. Here, the state in which another shooting reservation is already set at a close time means that the time interval between them is narrower than a predetermined time range, for example, shorter than the time required for shooting and recording processing per image. This means that both shooting reservations cannot be validated. In the present embodiment, as an example, it is assumed that the time required for shooting and recording processing for one image is 200 milliseconds. In this case, the judgment as to whether or not another photographing reservation is set at a near time is a judgment as to whether another photographing reservation is included within 200 milliseconds before or after. In this way, by judging duplication of photographing reservations, it is possible to avoid setting a photographing reservation that is actually difficult to execute.

Ｓ７０６において、制御部１０１は、Ｓ７０４において予約すべきと判定された撮影予約を設定する。具体的には、対応する未来時刻にあわせて撮像部１０４による撮影記録を実行するようにタイマ機能を実行するとともに、撮影予約を行う予測画像と好適度の情報を、記憶媒体１０８の撮影予約管理テーブルに追加する。また、本ステップにおいて、制御部１０１は、撮像部１０４による被写体の捕捉が継続するようにユーザに注意を促すため、カメラ１００に撮影予約が設定されたこと（自動撮影を行っていること）を通知する情報を表示部１０６に表示する。例えば、自動撮影中であることを示す画面は、図８に示す画面８０１であってよい。８０１が画面全体であり、８０２は本ステップによって表示される通知情報である。通知情報８０２は、例えば、撮影予約が設定されたこと（自動撮影を行っていること）を示す情報と、自動撮影の期間及び自動的に撮影される画像の数を示す情報とを含む。通知情報８０２は、撮影予約管理テーブルの情報を用いて構成されてもよい。図８に示す通知情報８０２は、テキストで情報が表示される例を示しているが、テキストに限らずアイコンで示されてもよいし、通知情報が音声によって通知されてもよい。 In S706, the control unit 101 sets the imaging reservation determined to be reserved in S704. Specifically, the timer function is executed so that the imaging unit 104 performs shooting recording in accordance with the corresponding future time, and the predicted image for which the shooting reservation is made and information on the degree of suitability are stored in the storage medium 108 for shooting reservation management. Add to table. In addition, in this step, the control unit 101 notifies the user that the imaging unit 104 continues to capture the subject, so that the camera 100 is notified that a shooting reservation has been set (that automatic shooting is being performed). The information to be notified is displayed on the display unit 106 . For example, the screen indicating that automatic shooting is in progress may be the screen 801 shown in FIG. 801 is the entire screen, and 802 is notification information displayed by this step. The notification information 802 includes, for example, information indicating that a shooting reservation has been set (that automatic shooting is being performed), and information indicating the period of automatic shooting and the number of images that are automatically shot. The notification information 802 may be configured using information in the shooting reservation management table. The notification information 802 shown in FIG. 8 shows an example in which information is displayed in text, but not limited to text, it may be indicated by an icon, or the notification information may be notified by voice.

Ｓ７０７において、制御部１０１は、Ｓ７０２において予測画像バッファ４０１に格納された予測画像の全てが、Ｓ７０３によって取り出されたかどうかを判定する。制御部１０１は、予測画像バッファ４０１に格納された予測画像がすべて取り出されたと判定した場合、Ｓ７０８に進み、全て取り出されていないと判定した場合は再びＳ７０３に戻る。 In S707, the control unit 101 determines whether all the predicted images stored in the predicted image buffer 401 in S702 have been extracted in S703. If the control unit 101 determines that all the predicted images stored in the predicted image buffer 401 have been taken out, the process proceeds to S708, and if it is determined that all the predicted images have not been taken out, the process returns to S703 again.

Ｓ７０８において、制御部１０１は、ユーザによってカメラ１００の動作モードが撮影モードから他のモードに変更されたかどうかを判定する。他のモードに変更された場合は、本フローチャートを終了する。撮影モードのまま継続されている場合は、再びＳ７０１に戻る。 In S708, the control unit 101 determines whether the user has changed the operation mode of the camera 100 from the shooting mode to another mode. If the mode is changed to another mode, this flowchart ends. If the shooting mode continues, the process returns to S701.

Ｓ７０９において、制御部１０１は、近い時刻に撮影予約が存在した場合の処理を行う。具体的には、制御部１０１は、撮影予約管理テーブルを参照し、Ｓ７０４にて予約すべきと判定された予測画像の好適度が、Ｓ７０５にて近い時刻に撮影予約済みであることが検出された予測画像の好適度よりも高いかどうかを判定する。制御部１０１は、予約すべきと判定された新規の予測画像の好適度が、設定済みの予測画像の好適度より高いと判定した場合はＳ７１０に進み、そうでない場合は、Ｓ７０４にて予約すべきと判定された撮影予約の処理を中断し、Ｓ７０７に進む。 In step S709 , the control unit 101 performs processing when there is a shooting reservation at a near time. Specifically, the control unit 101 refers to the photographing reservation management table, and detects in S705 that the degree of suitability of the predicted image determined to be reserved in S704 indicates that the photographing has already been reserved in the near future. It is determined whether the degree of suitability is higher than the degree of suitability of the predicted image. If the control unit 101 determines that the new predicted image to be reserved has a higher degree of suitability than the set predicted image, the control unit 101 proceeds to S710. The processing of the photographing reservation determined to be necessary is interrupted, and the process advances to step S707.

Ｓ７１０において、制御部１０１は、Ｓ７０５にて検出された、設定済みの撮影予約を解除し、Ｓ７０４にて予約すべきと判定された撮影予約を新規に設定する。ここで、制御部１０１は、撮影予約を解除するために、対応するタイマ機能を停止すると共に、撮影予約管理テーブルからの対応する予約情報を削除する。撮影予約の設定処理の内容は、Ｓ７０６において説明した撮影予約の設定処理の内容と同様である。このように、近接する予測画像が存在する場合、より好適度の高い予測画像の撮影予約を優先することで、よりユーザにとって有益な画像を自動撮影によって取得することができる。 In S710, the control unit 101 cancels the already set imaging reservation detected in S705, and newly sets the imaging reservation determined to be reserved in S704. Here, in order to cancel the photography reservation, the control unit 101 stops the corresponding timer function and deletes the corresponding reservation information from the photography reservation management table. The content of the shooting reservation setting processing is the same as the content of the shooting reservation setting processing described in step S706. In this way, when there are predictive images close to each other, by prioritizing the shooting reservation of the predictive image with a higher degree of preference, it is possible to obtain an image that is more beneficial to the user by automatic shooting.

Ｓ７１１において、制御部１０１は、Ｓ７０４において撮影予約を実行すべきでないと判定された未来時刻と同じ時刻に、既に撮影予約が設定されているかどうかを判定する。制御部１０１は、記憶媒体１０８の撮影予約管理テーブルを参照して、Ｓ７０４において判定された未来時刻と同じ時刻に、既に撮影予約が設定されていると判定した場合はＳ７１２に進み、そうでない場合にはＳ７０７に進む。 In S711, the control unit 101 determines whether or not a shooting reservation has already been set at the same time as the future time when it was determined in S704 that the shooting reservation should not be executed. The control unit 101 refers to the shooting reservation management table of the storage medium 108, and if it determines that a shooting reservation has already been set at the same time as the future time determined in S704, proceeds to S712; to S707.

Ｓ７１２において、制御部１０１は、Ｓ７１１において検出した、既に設定されている撮影予約を解除する。撮影予約の解除処理の内容は、Ｓ７１０において説明した撮影予約の解除処理の内容と同様である。このとき、本ステップにおいて制御部１０１は、撮影予約を解除したことに伴い、Ｓ７０６において開始した通知情報の表示を更新する。或いは、撮影予約が解除されたことを通知する情報を表示部１０６に表示してもよい。撮影予約が無くなった場合（全ての撮影予約を解除した場合）、Ｓ７０６において開始した通知情報の表示を消去してもよい。Ｓ７１１及びＳ７１２により、以前のＬＶ画像の画像予測では撮影記録に好適であると判定されていた未来時刻であったが、その後のＬＶ画像の画像予測において好適ではないと判定された場合に、既存の撮影予約を解除することができる。その後、制御部１０１は、上述のようにＳ７０７及びＳ７０８を実行して、自動撮影処理の一連の動作を終了する。 In S712, the control unit 101 cancels the already set shooting reservation detected in S711. The content of the photography reservation cancellation processing is the same as the content of the photography reservation cancellation processing described in S710. At this time, in this step, the control unit 101 updates the display of the notification information started in S706 as the shooting reservation is canceled. Alternatively, the display unit 106 may display information notifying that the shooting reservation has been cancelled. When there are no more shooting reservations (when all shooting reservations are canceled), the display of the notification information started in S706 may be erased. Through S711 and S712, the future time was determined to be suitable for shooting recording in the image prediction of the previous LV image, but if it is determined not to be suitable in the image prediction of the subsequent LV image, the existing You can cancel the shooting reservation of After that, the control unit 101 executes S707 and S708 as described above, and ends the series of operations of the automatic photographing process.

なお、上述の実施形態では、例えばカメラ１００内に構成される好適決定部１１０の数が１つである場合を例に説明した。しかし、好適決定部１１０の数はこの限りではなく、カメラ１００は、複数の好適決定部１１０を制御可能に構成されてもよい。複数の好適決定部１１０を構成することによってシステム構成の規模は大きくなるものの、複数の予測画像に対する好適判定処理を同時並行で実行することが可能になり、全体の処理時間を短縮することが可能となる。なお、この方法を用いることで、画像予測部１０９から出力される予測画像の数を増やすことが可能となる。これにより、より遠い未来時刻の画像予測を実行することができるようになる。 In the above-described embodiment, for example, the case where the camera 100 includes one suitable determining unit 110 has been described as an example. However, the number of suitable determining units 110 is not limited to this, and the camera 100 may be configured to be able to control a plurality of suitable determining units 110 . Although the scale of the system configuration is increased by constructing a plurality of suitability determination units 110, it is possible to execute suitability determination processing for a plurality of predicted images in parallel, thereby shortening the overall processing time. becomes. By using this method, the number of predicted images output from the image prediction unit 109 can be increased. This makes it possible to perform image prediction for a farther future time.

また、上述の実施形態では、画像予測部１０９及び好適決定部１１０がニューラルネットワークによって構成される場合を例に説明した。計算量が多いニューラルネットワークを備える画像予測部１０９や好適決定部１１０を用いる場合に、処理負荷を下げるための仕組みを更に実施するようにしてもよい。例えば、制御部１０１によって１つの撮影予約が設定された後は、制御部１０１は、その撮影予約による撮影記録が実行されるまで、あるいは、その撮影予約が解除されるまで、新規の撮影予約を設定しないようにしてもよい。そして、制御部１０１は、既に設定された撮影予約の時刻の調整だけを実施するようにしてもよい。例えば、制御部１０１は、好適決定部１１０による好適度の判定処理を、画像予測部１０９が生成する全ての予測画像に対して実行するのではなく、撮影予約の時刻に近い時刻の予測画像に対してのみ実行すればよい。ここで撮影予約の時刻に近い時刻とは、例えば撮影予約の時刻の前後３００ミリ秒に含まれる時刻であってよい。このように、撮影予約の対象となる時刻を制限することにより、好適決定部１１０の処理負荷の軽減や、それに伴うカメラ１００全体の消費電力の低減が可能となる。 Further, in the above-described embodiment, the case where the image prediction unit 109 and the suitable determination unit 110 are configured by a neural network has been described as an example. When using the image prediction unit 109 or the suitable determination unit 110 having a neural network with a large amount of calculation, a mechanism for reducing the processing load may be further implemented. For example, after one shooting reservation is set by the control unit 101, the control unit 101 continues to set a new shooting reservation until shooting recording by the shooting reservation is executed or until the shooting reservation is cancelled. You may choose not to set it. Then, the control unit 101 may adjust only the time of the photography reservation that has already been set. For example, the control unit 101 does not execute the suitability determination processing by the suitability determination unit 110 on all the predicted images generated by the image prediction unit 109, but on the predicted image at a time close to the shooting reservation time. You only have to do it for Here, the time close to the shooting reservation time may be, for example, a time within 300 milliseconds before and after the shooting reservation time. In this way, by limiting the times for which photography reservations are made, it is possible to reduce the processing load of the suitable determination unit 110 and the power consumption of the camera 100 as a whole.

以上説明したように、本実施形態では、撮像部１０４から出力される第１時点の画像を用いて、第１時点より未来の第２時点の画像を予測した予測画像を生成する。そして、生成した予測画像について好適度（予測画像が記録する画像として適している度合い）を決定し、決定した好適度が所定の閾値より高い場合に、時刻が第２時点になったときに画像を撮像するようにした。これにより、現在の画像から予測される好適なタイミングにおいて自動的に撮影することが可能になる。換言すれば、画像予測モデルを用いて、現在のＬＶ画像から撮影に好適な未来の画像を予測して自動的に撮影するシステムを実現することが可能となる。また、本実施形態では、未来時刻に撮影予約を設定した後、その後生成されるＬＶ画像によってその時刻の撮影が好適でないと判定された場合は、撮影予約を解除するようにした。このようにすることで、一度撮影予約を設定した未来時刻を、ＬＶ画像生成の度に、つまり設定した未来時刻が現在時刻に近づくにつれて、正しい撮影タイミングとなるように調整することが可能となる。 As described above, in the present embodiment, an image at a first time point output from the imaging unit 104 is used to generate a predicted image by predicting an image at a second time point in the future from the first time point. Then, the degree of suitability (the degree of suitability of the predicted image as an image to be recorded) is determined for the generated predicted image. was taken. This makes it possible to automatically shoot at a suitable timing predicted from the current image. In other words, it is possible to realize a system that predicts a future image suitable for photographing from a current LV image using an image prediction model and automatically photographs the image. Further, in the present embodiment, after setting a photographing reservation at a future time, if it is determined that photographing at that time is not suitable based on an LV image generated thereafter, the photographing reservation is canceled. By doing so, it is possible to adjust the future time once the shooting reservation is set to the correct shooting timing each time the LV image is generated, that is, as the set future time approaches the current time. .

（その他の実施形態）
本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 (Other embodiments)
The present invention supplies a program that implements one or more functions of the above-described embodiments to a system or apparatus via a network or a storage medium, and one or more processors in the computer of the system or apparatus reads and executes the program. It can also be realized by processing to It can also be implemented by a circuit (for example, ASIC) that implements one or more functions.

発明は上記実施形態に制限されるものではなく、発明の精神及び範囲から離脱することなく、様々な変更及び変形が可能である。従って、発明の範囲を公にするために請求項を添付する。 The invention is not limited to the embodiments described above, and various modifications and variations are possible without departing from the spirit and scope of the invention. Accordingly, the claims are appended to make public the scope of the invention.

１０１…制御部、１０４…撮像部、１０５…操作部、１０６…表示部、１０８…記憶媒体、１０９…画像予測部、１１０…好適決定部 DESCRIPTION OF SYMBOLS 101... Control part 104... Imaging part 105... Operation part 106... Display part 108... Storage medium 109... Image prediction part 110... Suitable determination part

Claims

Prediction means for generating a predicted image by predicting an image at a second point in the future from the first point in time using an image at a first point in time output from an imaging means;
determination means for determining a degree of suitability of the predicted image as an image to be recorded;
and control means for controlling the imaging means so as to capture an image when the time reaches the second point in time when the determined suitability is higher than a predetermined threshold. Device.

2. The method according to claim 1, wherein said prediction means is configured by a machine learning model that generates said predicted image corresponding to a future point in time from said first point in time based on said image at said first point in time. Imaging device.

3. The imaging apparatus according to claim 1, wherein said determining means is configured by a machine learning model for determining said suitability of an input image.

4. The imaging apparatus according to any one of claims 1 to 3, wherein the image at the first time point is a live view image sequentially output from the imaging means.

The control means is characterized in that, when the determined degree of suitability is higher than the predetermined threshold, the control means sets a photography reservation for capturing an image when the time reaches the second time point. Item 5. The imaging device according to any one of Items 1 to 4.

6. The imaging apparatus according to claim 5, wherein, in response to the setting of the photographing reservation, the control unit causes the display unit to display information notifying that the photographing reservation has been set.

After setting a first photography reservation for capturing the image at the second time based on the image at the first time, the control means uses the image at the third time, which is later than the first time. 7. The method according to claim 5, wherein it is determined whether the degree of suitability of the generated predicted image is higher than the degree of suitability of the predicted image generated using the image at the first time point. The imaging device described.

When the suitability of the predicted image generated using the image at the third time point is higher than the suitability of the predicted image generated using the image at the first time point, 8. The imaging apparatus according to claim 7, wherein the first reservation for photographing is cancelled, and a second reservation for photographing based on the image at the third time is set.

After setting a first photography reservation for capturing the image at the second time based on the image at the first time, the control means uses the image at the third time, which is later than the first time. If the suitability of the generated predicted image at the second time point or within a predetermined time range from the second time point is equal to or less than the predetermined threshold value, the first photography reservation is canceled. The imaging device according to claim 5 or 6.

In response to the cancellation of the first photography reservation, the control means causes the display means to display information notifying that the photography reservation has been cancelled, or notifies that the photography reservation has been set. 10. The imaging device according to claim 8, wherein the information is changed.

10. The imaging apparatus according to claim 9, wherein said predetermined time range is the time required for an image to be output from said imaging means and recorded on a storage medium.

The prediction means outputs predicted images at a plurality of points in the future after the first point in time,
The control means captures an image at the time of the predicted image in which the degree of suitability among the degrees of suitability determined for the predicted images at the plurality of times is higher than the predetermined threshold. 12. The image pickup apparatus according to any one of claims 5 to 11, wherein the image pickup apparatus controls the

In response to the setting of the shooting reservation, the determining means determines the suitability only for the predicted images at points within a predetermined time range from the point at which the shooting reservation is set, among the predicted images at the plurality of points in time. 13. The imaging device according to claim 12, wherein:

14. The imaging apparatus according to any one of claims 1 to 13, further comprising recording means for recording an image captured at the second time point in a storage medium.

a prediction step of generating a predicted image obtained by predicting an image at a second point in the future from the first point in time using an image at a first point in time output from an imaging means;
a determining step of determining a degree of suitability of the predicted image as an image to be recorded;
and a control step of controlling the image capturing means to capture an image when the time reaches the second point in time when the determined suitability is higher than a predetermined threshold. How to control the device.

A program for causing a computer to function as each means of the imaging apparatus according to any one of claims 1 to 14.