JP2021114071A - Apparatus and method for specifying object to be edited, program, and recording medium

Info

Publication number: JP2021114071A
Application number: JP2020005823A
Authority: JP
Inventors: 寿之岩西; Toshiyuki Iwanishi; 和伸富田; Kazunobu Tomita; 恭平松浦; Kyohei Matsuura
Original assignee: NEC Solution Innovators Ltd
Current assignee: NEC Solution Innovators Ltd
Priority date: 2020-01-17
Filing date: 2020-01-17
Publication date: 2021-08-05

Abstract

To provide an editing object specifying apparatus with which an image automatically captured by a camera or the like can be easily edited in accordance with the purpose.

SOLUTION: An editing object specifying apparatus 1 includes image acquisition means 11, storage means 12, editing object specifying means 13, and scoring means 14. The image acquisition means 11 acquires images. The storage means 12 stores the images and attribute information of an editing object. The editing object specifying means 13 specifies an image including the editing object from among the images by collating the images with the attribute information of the editing object. The scoring means 14 combines a plurality of images including the editing object to generate a score, and selects an image whose score exceeds a threshold.

SELECTED DRAWING: Figure 1

Description

The present invention relates to an editing target specifying device, an editing target specifying method, a program, and a recording medium.

Various video editing devices that create a digest video from video shot by a camera have been proposed (see, for example, Patent Document 1).

Patent Document 1: JP-A-2019-110422

However, there is demand for a new technique that makes it easy to edit, according to purpose, images captured automatically by, for example, a camera.

Therefore, an object of the present invention is to provide an editing target specifying device and an editing target specifying method with which images captured automatically by, for example, a camera can be easily edited according to purpose.

In order to achieve the above object, the editing target specifying device of the present invention
includes image acquisition means, storage means, editing target specifying means, and scoring means, wherein
the image acquisition means acquires images,
the storage means stores the images and attribute information of the editing target,
the editing target specifying means collates the images with the attribute information of the editing target and identifies, from among the images, an image including the editing target, and
the scoring means combines a plurality of images including the editing target and scores them, and selects an image whose score exceeds a threshold.

The editing target specifying method of the present invention
includes an image acquisition step, a storage step, an editing target specifying step, and a scoring step, wherein
the image acquisition step acquires images,
the storage step stores the images and attribute information of the editing target,
the editing target specifying step collates the images with the attribute information of the editing target and identifies, from among the images, an image including the editing target, and
the scoring step combines a plurality of images including the editing target and scores them, and selects an image whose score exceeds a threshold.

According to the present invention, images captured automatically by, for example, a camera can be easily edited according to purpose.

FIG. 1 is a block diagram showing an example of the configuration of the editing target specifying device of Embodiment 1.
FIG. 2 is a block diagram showing an example of the hardware configuration of the editing target specifying device of Embodiment 1.
FIG. 3 is a flowchart showing an example of processing in the editing target specifying device of Embodiment 1.
FIG. 4 is a block diagram showing an example of the configuration of the editing target specifying device of Embodiment 2.
FIG. 5 is a block diagram showing an example of the hardware configuration of the editing target specifying device of Embodiment 2.
FIG. 6 is a flowchart showing an example of processing in the editing target specifying device of Embodiment 2.
FIG. 7 is a schematic diagram showing an example of how, in Embodiment 1, the editing target specifying means identifies an image including the editing target from among the images.
FIG. 8 is a schematic diagram showing an example of how, in Embodiment 1, the editing target specifying means divides a moving image at predetermined time intervals and identifies an image including the editing target from the moving image.
FIG. 9 is a schematic diagram showing an example of how, in Embodiment 1, the scoring means combines a plurality of images including the editing target and scores them, and selects an image whose score exceeds a threshold.
FIG. 10 is a schematic diagram showing an example of how, in Embodiment 1, the scoring means combines a plurality of images including the editing target and scores them, and selects an image whose score exceeds a threshold.
FIG. 11 is a schematic diagram showing an example of an image including the editing target and its label.

In the editing target specifying device of the present invention, for example, the image may be a moving image, and the editing target specifying means may divide the moving image at predetermined time intervals and identify an image including the editing target.

The editing target specifying device of the present invention may, for example, further include labeling means, wherein the labeling means attaches, to the image including the editing target, a label containing the attribute information of the editing target and information about the image, and the storage means stores the image including the editing target in association with the label.

In the editing target specifying device of the present invention, for example, the information about the image may be at least one selected from the group consisting of the shooting date and time, the shooting location, the weather, and the shooting state of the editing target.

In the editing target specifying device of the present invention, for example, the editing target may be a person, and the scoring means may perform the scoring using at least one selected from the group consisting of persons around the editing target, the shooting location of the image, and the shooting time of the image.

In the editing target specifying device of the present invention, for example, the editing target may be a person, and the attribute information of the editing target may be at least one selected from the group consisting of a face image, an ID (identification), a name, an age, and a gender.

In the editing target specifying method of the present invention, for example, the image may be a moving image, and the editing target specifying step may divide the moving image at predetermined time intervals and identify an image including the editing target.

The editing target specifying method of the present invention may, for example, further include a labeling step, wherein the labeling step attaches, to the image including the editing target, a label containing the attribute information of the editing target and information about the image, and the storage step stores the image including the editing target in association with the label.

In the editing target specifying method of the present invention, for example, the information about the image may be at least one selected from the group consisting of the shooting date and time, the shooting location, the weather, and the shooting state of the editing target.

In the editing target specifying method of the present invention, for example, the editing target may be a person, and the scoring step may perform the scoring using at least one selected from the group consisting of persons around the editing target, the shooting location of the image, and the shooting time of the image.

In the editing target specifying method of the present invention, for example, the editing target may be a person, and the attribute information of the editing target may be at least one selected from the group consisting of a face image, an ID, a name, an age, and a gender.

The program of the present invention is a program that can execute the editing target specifying method of the present invention on a computer.

The recording medium of the present invention is a computer-readable recording medium on which the program of the present invention is recorded.

Next, embodiments of the present invention will be described with reference to FIGS. 1 to 11. The present invention is in no way limited or restricted by the following embodiments. In FIGS. 1 to 11, identical parts are denoted by identical reference numerals. The descriptions of the embodiments can be incorporated into one another.

[Embodiment 1]
FIG. 1 is a block diagram showing an example of the configuration of the editing target specifying device 1 of the present embodiment. As shown in FIG. 1, the editing target specifying device 1 includes image acquisition means 11, storage means 12, editing target specifying means 13, and scoring means 14. The editing target specifying device of the present invention may be incorporated in, for example, a server. The editing target specifying device of the present invention may also be, for example, a personal computer (PC) on which the program of the present invention is installed.

FIG. 2 illustrates a block diagram of the hardware configuration of the editing target specifying device 1. As shown in FIG. 2, the editing target specifying device 1 includes, for example, a CPU (central processing unit) 101, a memory 102, a bus 103, the storage means 12, an input device 105, a display device 106, and a communication device 107. The parts of the editing target specifying device 1 are connected to one another via the bus 103 through, for example, their respective interfaces (I/F).

The CPU 101 controls the editing target specifying device 1 as a whole. In the editing target specifying device 1, the CPU 101 executes, for example, the program of the present invention and other programs, and reads and writes various kinds of information. Specifically, for example, the CPU 101 functions as the image acquisition means 11, the editing target specifying means 13, and the scoring means 14. In the present invention, another arithmetic unit such as a GPU may be used in place of the CPU.

The bus 103 can also be connected to, for example, external devices. Examples of the external devices include an external storage device (such as an external database) and a printer. The editing target specifying device 1 can be connected to a communication network through, for example, the communication device 107 connected to the bus 103, and can also be connected to the external devices via the communication network. The communication network is not particularly limited; a known network can be used, and it may be wired or wireless. Examples of the communication network include the Internet, the WWW (World Wide Web), a telephone line, a LAN (Local Area Network), and WiFi (Wireless Fidelity).

The memory 102 includes, for example, a main memory, which is also referred to as a main storage device. When the CPU 101 performs processing, the memory 102 reads, for example, various operation programs 104, such as the program of the present invention, stored in the storage means 12 described later, and the CPU 101 receives data from the memory 102 and executes the programs 104. The main memory is, for example, a RAM (random access memory). The memory 102 further includes, for example, a ROM (read-only memory).

The storage means 12 is, for example, what is called an auxiliary storage device, in contrast to the main memory (main storage device). As described above, the storage means 12 stores the operation programs 104, including the program of the present invention. The storage means 12 also stores the images and the attribute information of the editing target. The storage means 12 includes, for example, a storage medium and a drive that reads from and writes to the storage medium. The storage medium is not particularly limited and may be, for example, internal or external; examples include an HD (hard disk), an FD (floppy (registered trademark) disk), a CD-ROM, a CD-R, a CD-RW, an MO, a DVD, a flash memory, and a memory card. The drive is not particularly limited. The storage means 12 may be, for example, a hard disk drive (HDD) in which the storage medium and the drive are integrated.

The editing target specifying device 1 further includes, for example, the input device 105 and the display device 106. The input device 105 is, for example, a touch panel, a keyboard, or a mouse. Examples of the display device 106 include an LED display and a liquid crystal display.

In the editing target specifying device 1, the memory 102 and the storage means 12 can also store access information and log information from users, as well as information acquired from an external database (not shown).

Next, an example of the editing target specifying method of the present embodiment will be described with reference to the flowchart of FIG. 3. The editing target specifying method of the present embodiment is carried out, for example, as follows using the editing target specifying device 1 of FIG. 1. Note that the editing target specifying method of the present embodiment is not limited to use of the editing target specifying device 1 of FIG. 1.

First, the image acquisition means 11 acquires images (S11). The image acquisition means 11 may acquire the images by, for example, shooting with imaging means such as a camera mounted on the editing target specifying device 1. Alternatively, the images may be shot by imaging means provided separately from the editing target specifying device 1, such as a camera (for example, a fixed camera whose shooting direction can be controlled, a fixed camera whose shooting direction cannot be controlled, or a mobile camera mounted on a moving body such as a drone), and acquired by the image acquisition means 11 of the editing target specifying device 1 via the communication network. The images may be, for example, still images or moving images.

Next, the storage means 12 stores the images and the attribute information of the editing target (S12). The editing target is not particularly limited. Examples include persons such as participants in events such as sports days, school festivals, and weddings (for example, children, grandchildren, schoolchildren, pupils, students, grooms, brides, and attendees), concert performers, and audience members; animals such as racehorses in horse racing; automobiles such as race cars in car races; and portable shrines (mikoshi) at festivals. There may be one editing target (for example, one person, one animal, or one automobile) or a plurality of editing targets (for example, a plurality of persons, animals, or automobiles). Examples of the attribute information of the editing target include, if the editing target is a person, a face image, an ID, a name, an age, and a gender. Although FIG. 3 shows an example in which the attribute information of the editing target is stored in the storage step (S12) after the image acquisition step (S11), the present invention is not limited to this. In the present invention, the attribute information of the editing target may be stored in the storage step (S12) before the image acquisition step (S11), or the image acquisition step (S11) and the storing of the attribute information of the editing target in the storage step (S12) may be carried out at the same time.

Next, the editing target specifying means 13 collates the images with the attribute information of the editing target and identifies, from among the images, an image including the editing target (S13).

FIG. 7 shows an example of how the editing target specifying means 13 identifies an image including the editing target from among the images. As shown in FIG. 7, in this example the editing targets are schoolchildren taking part in a sports day. In this example, the editing target specifying means 13 collates the images stored in the storage means 12 with the attribute information of the editing target (in this example, face images of a plurality of children) and identifies an image including the editing target (in this example, an image of children crossing the finish line in a footrace). For this identification, for example, a known face recognition technique can be used. As illustrated in FIG. 7, the editing target specifying means 13 may further detect a plurality of face images (in this example, four, #1 to #4) and set, as a label for each, the ID of the child with a high degree of matching.
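The collation illustrated in FIG. 7 can be sketched roughly as follows. The sketch assumes that some face recognition model (not specified in this document) has already converted each detected face and each registered face image into a numeric feature vector. The cosine-similarity measure, the 0.8 cutoff, the function names, and the sample vectors are all hypothetical choices made for illustration and are not the method of the invention.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def label_faces(detected, registry, min_similarity=0.8):
    """Map each detected face number to the best-matching registered ID.

    detected: {face_number: feature_vector} for faces found in one image.
    registry: {child_id: feature_vector} built from the stored face images.
    A face whose best similarity is below min_similarity is labeled None.
    """
    labels = {}
    for face_no, vec in detected.items():
        best_id, best_sim = None, min_similarity
        for child_id, ref in registry.items():
            sim = cosine_similarity(vec, ref)
            if sim >= best_sim:
                best_id, best_sim = child_id, sim
        labels[face_no] = best_id
    return labels


# Hypothetical feature vectors (real ones would come from a face model).
registry = {"ID-a": [1.0, 0.0, 0.0], "ID-b": [0.0, 1.0, 0.0]}
detected = {1: [0.9, 0.1, 0.0], 2: [0.1, 0.95, 0.0], 3: [0.0, 0.0, 1.0]}
labels = label_faces(detected, registry)
print(labels)
```

Face #3 in this made-up example matches no registered child and stays unlabeled, which corresponds to the "face image whose ID is unknown" case discussed for the scoring step.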

When the image is a moving image, the editing target specifying means 13 may, for example, divide the moving image at predetermined time intervals and identify an image including the editing target, as shown in FIG. 8. The predetermined time is not particularly limited and is, for example, 1 second or 1 minute.
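As a minimal sketch of dividing a moving image at predetermined time intervals, the following assumes that per-frame recognition results are already available as (timestamp in seconds, ID) pairs. The function names and the sample data are hypothetical.

```python
def split_into_segments(duration_s, interval_s):
    """Divide [0, duration_s) into consecutive segments of interval_s seconds.

    The last segment is shortened if the duration is not a multiple of the
    interval.
    """
    if interval_s <= 0:
        raise ValueError("interval_s must be positive")
    segments = []
    start = 0.0
    while start < duration_s:
        end = min(start + interval_s, duration_s)
        segments.append((start, end))
        start = end
    return segments


def segments_with_target(detections, duration_s, interval_s, target_id):
    """Return the segments in which target_id was detected at least once.

    detections: list of (timestamp_s, detected_id) pairs (assumed input).
    """
    hits = []
    for start, end in split_into_segments(duration_s, interval_s):
        if any(start <= t < end and pid == target_id for t, pid in detections):
            hits.append((start, end))
    return hits


# Hypothetical 5-second clip divided every 2 seconds.
detections = [(0.5, "ID-a"), (2.1, "ID-b"), (4.5, "ID-a")]
segments = split_into_segments(5.0, 2.0)
hits = segments_with_target(detections, 5.0, 2.0, "ID-a")
print(segments)
print(hits)
```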

Next, the scoring means 14 combines a plurality of images including the editing target and scores them, and selects an image whose score exceeds a threshold (S14).

FIGS. 9 and 10 show an example of how the scoring means 14 combines a plurality of images including the editing target and scores them, and selects an image whose score exceeds a threshold. As shown in FIG. 9, in this example three images including the editing target are combined: an image of children crossing the finish line in a footrace (image 1), an image of children performing group gymnastics (image 2), and an image of children in a tug of war (image 3). As illustrated in FIG. 9, in images 1 to 3 the editing target specifying means 13 has extracted the face images of a plurality of children (in this example, six children a to f) and has set, as labels, the IDs of the children with a high degree of matching (ID-a to ID-f). The scoring means 14 performs the scoring, for example, as follows. First, as illustrated in FIG. 9, the scoring means 14 tabulates the combinations of children a to f included in images 1 to 3 and finds that an image including child a is highly likely to also include children c and f; if image 2 contains face images whose IDs are unknown, it estimates that they are likely to be the face images of children c and f.

Next, as illustrated in FIG. 10, for images 1 to 3, the probability that the child on the horizontal axis is included given that the child on the vertical axis is included is scored, and an image whose score exceeds a threshold is selected. By selecting images with high scores in this way, more images including the intended editing target can be selected than when the editing target is identified by face recognition alone.
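The table of FIG. 10 can be read as a table of conditional co-occurrence frequencies. The following sketch computes such a table under that reading. The membership of each image, the 0.5 threshold, and the function names are hypothetical and are not taken from FIG. 9 or FIG. 10; the text does not spell out how scores are mapped onto the final image selection, so the sketch stops at listing the likely companions of a given child.

```python
from itertools import permutations


def cooccurrence_scores(images):
    """Score P(other appears | given appears) over a set of labeled images.

    images: {image_name: set of IDs identified in that image}.
    Returns {(given, other): fraction of images containing `given`
    that also contain `other`}.
    """
    ids = sorted(set().union(*images.values()))
    scores = {}
    for given, other in permutations(ids, 2):
        with_given = [m for m in images.values() if given in m]
        both = sum(1 for m in with_given if other in m)
        scores[(given, other)] = both / len(with_given)
    return scores


def likely_companions(scores, given, threshold):
    """IDs whose co-occurrence score with `given` exceeds the threshold."""
    return sorted(o for (g, o), s in scores.items() if g == given and s > threshold)


# Hypothetical labels; in image2 only children a and b were identified.
images = {
    "image1": {"a", "c", "f"},
    "image2": {"a", "b"},
    "image3": {"a", "c", "d", "f"},
}
scores = cooccurrence_scores(images)
candidates = likely_companions(scores, "a", threshold=0.5)
print(candidates)
```

With this made-up data, children c and f each appear in two of the three images that contain child a, so they are the candidates for an unidentified face next to child a in image 2.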

In addition to the face image, the editing target specifying means 13 can also estimate the editing target from, for example, the shooting location of the image or the shooting time of the image (for example, a shooting time based on timing such as the progress of the program).
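One way to picture estimation from the shooting time is a lookup against the event program. The schedule, the participant lists, and the function name below are hypothetical; the document does not describe how the program information is held.

```python
from datetime import time

# Hypothetical sports day program: (start, end, event, participant IDs).
SCHEDULE = [
    (time(9, 0), time(9, 30), "footrace", {"ID-a", "ID-c"}),
    (time(9, 30), time(10, 0), "tug of war", {"ID-b", "ID-d"}),
]


def candidates_at(shot_time, schedule=SCHEDULE):
    """Return (event, participant IDs) for the slot containing shot_time.

    Returns (None, set()) when the time falls outside every slot.
    """
    for start, end, event, participants in schedule:
        if start <= shot_time < end:
            return event, participants
    return None, set()


event, participants = candidates_at(time(9, 15))
print(event, sorted(participants))
```

Such a candidate list could narrow down the IDs that a face recognition step has to consider for an image shot at that time.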

The editing target specifying device 1 of the present embodiment may, for example, output the identified image including the editing target to an image editing device. The image editing device is not particularly limited. With the image editing device, for example, images including the editing target can be edited into various modes such as an album, a document, a story, or a diary. When the image is a moving image, it is also possible, for example, to switch the moving image according to the person who is speaking. In the case of a sports day, it is also possible to produce a single moving image that combines the persons taking part in an event with their supporters.

According to the present embodiment, the editing target can be identified from images captured automatically by a camera or the like, so that, for example, the images can be easily edited according to purpose.

[Embodiment 2]
Next, an embodiment in which the editing target specifying device 1 further includes labeling means 15 will be described. As shown in FIGS. 4 and 5, the editing target specifying device 1 of the present embodiment further includes the labeling means 15. Otherwise, the configuration is the same as in Embodiment 1.

Next, an example of processing in the editing target specifying device of the present embodiment will be described with reference to the block diagram of FIG. 4 and the flowchart of FIG. 6.

First, in the same manner as in Embodiment 1, the image acquisition means 11 acquires images (S11), the storage means 12 stores the images and the attribute information of the editing target (S12a), the editing target specifying means 13 identifies an image including the editing target (S13), and the scoring means 14 performs the scoring (S14).

Next, the labeling means 15 attaches, to the image including the editing target, a label containing the attribute information of the editing target and information about the image (S15). That is, the label is paired with the image including the editing target. The information about the image is not particularly limited; examples include the shooting date and time, the shooting location, the weather, and the shooting state of the editing target (for example, whether the image is out of focus, whether the editing target is included in whole or only in part, the quality of the composition, and, if the editing target is a person, the facial expression). FIG. 11 shows an example of an image including the editing target and its label. The storage means 12 then further stores the image including the editing target in association with the label (S12b). The storage means 12 may, for example, store the image and the label separately, or may store the label embedded in the image data as extended information. When the image is a moving image, the label may be attached to each of the divided images.
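A label of this kind can be pictured as a small record stored alongside the image. The sketch below illustrates the "store separately" option as a JSON record that links an image file name to its label. The field names, the JSON layout, and the sample values are hypothetical and are not taken from FIG. 11.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class Label:
    """Hypothetical label: attribute info of the targets plus image info."""
    target_ids: list                 # IDs of the editing targets in the image
    shot_at: str                     # shooting date and time (ISO 8601)
    location: str                    # shooting location
    weather: str
    shooting_state: dict = field(default_factory=dict)  # e.g. focus, framing


def to_sidecar_json(image_name, label):
    """Serialize the image/label association as one JSON record."""
    record = {"image": image_name, "label": asdict(label)}
    return json.dumps(record, ensure_ascii=False, sort_keys=True)


label = Label(
    target_ids=["ID-a"],
    shot_at="2020-01-17T09:15:00",
    location="schoolyard",
    weather="sunny",
    shooting_state={"in_focus": True, "whole_body": False},
)
sidecar = to_sidecar_json("img_0001.jpg", label)
print(sidecar)
```

Embedding the label in the image data as extended information, the other option mentioned above, would need an image metadata library and is not shown here.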

The editing target specifying device 1 of the present embodiment may, for example, output the labeled image including the editing target (hereinafter also referred to as a "labeled image") to an image editing device. The image editing device includes, for example, storage means, labeled image extraction means, and editing means.

Editing of images using the image editing device will be described with an example.

First, the storage means stores the labeled images.

Next, the labeled image extraction means extracts, from among the labeled images, images that meet a predetermined condition. Examples of the predetermined condition for extracting the images include, if the editing target is a person, the presence of a specific person and a predetermined story. The specific person may be, for example, one person or a group of a plurality of persons. Examples of the predetermined story include various modes such as an album style, a document style, a story style, and a diary style. The story in each of these modes can be generated automatically using, for example, AI (Artificial Intelligence).

For example, if the editing target is a child taking part in a sports day, the labeled image extraction means extracts an image including child a crossing the finish line in a footrace, an image including child a performing group gymnastics, an image of child a in a tug of war, and so on. Instead of images including child a, the labeled image extraction means may extract images including a plurality of persons (for example, the three children a, b, and c).
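Extraction by the "presence of a specific person" condition amounts to filtering the labeled images by the IDs in their labels. The following is a minimal sketch; the record layout (a dictionary with `image` and `target_ids` keys), the function name, and the sample data are hypothetical.

```python
def extract_labeled_images(labeled_images, required_ids):
    """Return the labeled images whose labels contain every required ID.

    labeled_images: list of dicts with "image" and "target_ids" keys.
    required_ids: one person (a single-element list) or a group of persons.
    """
    required = set(required_ids)
    return [item for item in labeled_images if required <= set(item["target_ids"])]


# Hypothetical labeled images from a sports day.
labeled_images = [
    {"image": "footrace.jpg", "target_ids": ["ID-a", "ID-c"]},
    {"image": "gymnastics.jpg", "target_ids": ["ID-a", "ID-b", "ID-c"]},
    {"image": "tug_of_war.jpg", "target_ids": ["ID-b"]},
]
only_a = extract_labeled_images(labeled_images, ["ID-a"])
group = extract_labeled_images(labeled_images, ["ID-a", "ID-b", "ID-c"])
print([item["image"] for item in only_a])
print([item["image"] for item in group])
```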

Depending on the predetermined story, the labeled image extraction means may also extract images of the event as a whole (here, the athletic meet), such as images of people other than the specific person (in the athletic meet example, family members cheering for child a) and images of scenery.
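The extraction conditions described above can be illustrated with a minimal sketch. It assumes each labeled image carries the IDs of the people identified in it and a free-form scene tag; the `LabeledImage` class, its field names, and the `extract` function are hypothetical and are not taken from the specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LabeledImage:
    """Hypothetical record for a labeled image; the field names are assumptions."""
    path: str
    people: frozenset   # IDs of the people identified in the image
    scene: str = ""     # free-form tag such as "footrace" or "scenery"


def extract(images, specific_people, context_scenes=frozenset()):
    """Return images showing every specific person, plus optional context images."""
    wanted = frozenset(specific_people)
    selected = []
    for img in images:
        shows_all = wanted <= img.people          # the person, or the whole group, appears
        is_context = img.scene in context_scenes  # e.g. scenery requested by a story mode
        if shows_all or is_context:
            selected.append(img)
    return selected
```

Passing `{"a"}` selects every image that includes child a, while passing `{"a", "b", "c"}` keeps only images in which all three appear. The `context_scenes` argument stands in for a story mode that also wants event-wide images such as scenery.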

Next, the editing means edits the extracted images based on a predetermined condition. Examples of the predetermined condition for editing include presenting the extraction results as a list, and arranging the images in chronological order in a book format centered on still images or in a movie format that includes moving images; the proportion of scenery images may also be increased. When the images are moving images, it is also possible, for example, to switch between moving images according to which person is speaking.
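As a rough illustration of the editing conditions, the following sketch orders extracted items for a list, book, or movie layout. The `Item` class, the layout names, and the decision to treat "centered on still images" as "still images only" are simplifying assumptions made for this example, not details given in the specification.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Item:
    """Hypothetical extracted item; the field names are assumptions."""
    path: str
    taken_at: datetime
    is_video: bool = False


def arrange(items, layout):
    """Order extracted items for one of three illustrative layouts."""
    if layout == "list":
        return list(items)  # plain list of extraction results, order unchanged
    ordered = sorted(items, key=lambda it: it.taken_at)  # chronological order
    if layout == "book":
        # Simplification: "centered on still images" is treated as stills only.
        return [it for it in ordered if not it.is_video]
    if layout == "movie":
        return ordered  # still images and moving images together
    raise ValueError(f"unknown layout: {layout!r}")
```

A fuller editor would also need rules for adjusting the proportion of scenery and for switching clips by speaker, which are left out here.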

[Embodiment 3]
The program of the present embodiment is a program that can execute the method of each of the above embodiments on a computer. The program may also be recorded, for example, on a computer-readable recording medium. The recording medium is not particularly limited; examples include a read-only memory (ROM), a hard disk (HD), an optical disc, and a floppy (registered trademark) disk (FD).

<Appendices>
Some or all of the above embodiments may be described as in the following appendices, but are not limited to them.
(Appendix 1)
An editing target specifying device comprising an image acquisition means, a storage means, an editing target specifying means, and a scoring means, wherein
the image acquisition means acquires images,
the storage means stores the images and attribute information of an editing target,
the editing target specifying means collates the images with the attribute information of the editing target and identifies, from among the images, images that include the editing target, and
the scoring means combines a plurality of the images that include the editing target to generate a score, and selects an image whose score exceeds a threshold.
(Appendix 2)
The editing target specifying device according to Appendix 1, wherein
the image is a moving image, and
the editing target specifying means divides the moving image at predetermined time intervals and identifies images that include the editing target.
(Appendix 3)
The editing target specifying device according to Appendix 1 or 2, further comprising a labeling means, wherein
the labeling means attaches, to the image including the editing target, a label containing the attribute information of the editing target and information about the image, and
the storage means stores the image including the editing target in association with the label.
(Appendix 4)
The editing target specifying device according to any one of Appendices 1 to 3, wherein
the information about the image is at least one selected from the group consisting of the shooting date and time, the shooting location, the weather, and the shooting state of the editing target.
(Appendix 5)
The editing target specifying device according to any one of Appendices 1 to 4, wherein
the editing target is a person, and
the scoring means performs the scoring using at least one selected from the group consisting of people around the editing target, the shooting location of the image, and the shooting time of the image.
(Appendix 6)
The editing target specifying device according to any one of Appendices 1 to 5, wherein
the editing target is a person, and
the attribute information of the editing target is at least one selected from the group consisting of a face image, an ID, a name, an age, and a gender.
(Appendix 7)
An editing target specifying method comprising an image acquisition step, a storage step, an editing target specifying step, and a scoring step, wherein
the image acquisition step acquires images,
the storage step stores the images and attribute information of an editing target,
the editing target specifying step collates the images with the attribute information of the editing target and identifies, from among the images, images that include the editing target, and
the scoring step combines a plurality of the images that include the editing target to generate a score, and selects an image whose score exceeds a threshold.
(Appendix 8)
The editing target specifying method according to Appendix 7, wherein
the image is a moving image, and
the editing target specifying step divides the moving image at predetermined time intervals and identifies images that include the editing target.
(Appendix 9)
The editing target specifying method according to Appendix 7 or 8, further comprising a labeling step, wherein
the labeling step attaches, to the image including the editing target, a label containing the attribute information of the editing target and information about the image, and
the storage step stores the image including the editing target in association with the label.
(Appendix 10)
The editing target specifying method according to any one of Appendices 7 to 9, wherein
the information about the image is at least one selected from the group consisting of the shooting date and time, the shooting location, the weather, and the shooting state of the editing target.
(Appendix 11)
The editing target specifying method according to any one of Appendices 7 to 10, wherein
the editing target is a person, and
the scoring step performs the scoring using at least one selected from the group consisting of people around the editing target, the shooting location of the image, and the shooting time of the image.
(Appendix 12)
The editing target specifying method according to any one of Appendices 7 to 11, wherein
the editing target is a person, and
the attribute information of the editing target is at least one selected from the group consisting of a face image, an ID, a name, an age, and a gender.
(Appendix 13)
A program capable of causing a computer to execute the editing target specifying method according to any one of Appendices 7 to 12.
(Appendix 14)
A computer-readable recording medium on which the program described in Appendix 13 is recorded.
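The appendices state that several images including the editing target are combined into a score and that an image whose score exceeds a threshold is selected, without fixing a scoring formula. The sketch below is one possible reading, offered only as an illustration: a moving image is assumed to have been split into time slices at predetermined intervals, `hits[i]` records whether the target was identified in slice `i`, and each window of consecutive slices is scored by the fraction of slices containing the target. The function name, the window size, and the scoring rule are all assumptions.

```python
def select_by_window_score(hits, window=3, threshold=0.5):
    """Select slice indices from windows whose hit fraction exceeds the threshold.

    hits[i] is True when the editing target was identified in the i-th time slice.
    The scoring rule (fraction of hits per window) is an illustrative assumption.
    """
    selected = set()
    for start in range(len(hits) - window + 1):
        group = range(start, start + window)           # images combined for one score
        score = sum(hits[i] for i in group) / window   # fraction of slices with the target
        if score > threshold:
            selected.update(i for i in group if hits[i])
    return sorted(selected)
```

Under this rule an isolated detection is dropped, whereas detections that recur within a short span are kept. A real scoring means could instead weight the people around the target, the shooting location, or the shooting time, as the appendices on scoring suggest.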

The present invention has been described above with reference to the embodiments, but it is not limited to those embodiments. Various changes that a person skilled in the art can understand may be made to the configuration and details of the present invention within its scope.

According to the present invention, an editing target can be identified from images automatically captured by a camera or the like, which is particularly useful at events such as athletic meets.

1 Editing target specifying device
11 Image acquisition means
12 Storage means
13 Editing target specifying means
14 Scoring means
15 Labeling means

Claims

An editing target specifying device comprising an image acquisition means, a storage means, an editing target specifying means, and a scoring means, wherein
the image acquisition means acquires images,
the storage means stores the images and attribute information of an editing target,
the editing target specifying means collates the images with the attribute information of the editing target and identifies, from among the images, images that include the editing target, and
the scoring means combines a plurality of the images that include the editing target to generate a score, and selects an image whose score exceeds a threshold.

The editing target specifying device according to claim 1, wherein
the image is a moving image, and
the editing target specifying means divides the moving image at predetermined time intervals and identifies images that include the editing target.

The editing target specifying device according to claim 1 or 2, further comprising a labeling means, wherein
the labeling means attaches, to the image including the editing target, a label containing the attribute information of the editing target and information about the image, and
the storage means stores the image including the editing target in association with the label.

The editing target specifying device according to any one of claims 1 to 3, wherein
the information about the image is at least one selected from the group consisting of the shooting date and time, the shooting location, the weather, and the shooting state of the editing target.

The editing target specifying device according to any one of claims 1 to 4, wherein
the editing target is a person, and
the scoring means performs the scoring using at least one selected from the group consisting of people around the editing target, the shooting location of the image, and the shooting time of the image.

The editing target specifying device according to any one of claims 1 to 5, wherein
the editing target is a person, and
the attribute information of the editing target is at least one selected from the group consisting of a face image, an ID, a name, an age, and a gender.

An editing target specifying method comprising an image acquisition step, a storage step, an editing target specifying step, and a scoring step, wherein
the image acquisition step acquires images,
the storage step stores the images and attribute information of an editing target,
the editing target specifying step collates the images with the attribute information of the editing target and identifies, from among the images, images that include the editing target, and
the scoring step combines a plurality of the images that include the editing target to generate a score, and selects an image whose score exceeds a threshold.

A program capable of causing a computer to execute the editing target specifying method according to claim 7.

A computer-readable recording medium on which the program according to claim 8 is recorded.