JP7158564B2

JP7158564B2 - Information processing device, server device, user device and information processing system

Info

Publication number: JP7158564B2
Application number: JP2021508748A
Authority: JP
Inventors: 彰田中; 誠村▲崎▼; 充弘小形; 広樹石塚; 昇悟池田; 翔七尾
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2019-03-27
Filing date: 2019-12-18
Publication date: 2022-10-21
Anticipated expiration: 2039-12-18
Also published as: US20220182733A1; WO2020194925A1; US11800198B2; JPWO2020194925A1

Description

本発明は、情報処理装置、サーバ装置、ユーザ装置及び情報処理システムに関する。 The present invention relates to an information processing device, a server device, a user device and an information processing system.

特許文献１には、表示部に表示されるコンテンツをキーワードに変換し、ユーザの行動履歴に基づいてキーワードを絞り込み、絞り込んだキーワードに関する情報を表示させるためのアイコンを生成する技術が開示されている。また、特許文献２には、ユーザの行動履歴に基づいて、ユーザの関心のある物体の特徴量を特定し、特定した特徴量を用いて、表示される物体の画像を絞り込み、絞り込んだ物体の画像に対してコメントを行う技術が開示されている。 Patent Literature 1 discloses a technique of converting content displayed on a display unit into keywords, narrowing down the keywords based on a user's action history, and generating an icon for displaying information about the narrowed down keywords. . In addition, in Patent Document 2, based on the user's action history, a feature amount of an object of interest to the user is specified, and using the specified feature amount, the image of the object to be displayed is narrowed down, and the narrowed down object image is displayed. A technique for commenting on an image has been disclosed.

特開２０１５―１５４１９５号公報JP 2015-154195 A 特開２０１４－１６８８２号公報JP 2014-16882 A

従来の技術では、キーワードの絞り込み処理と物体の画像の絞り込み処理の各々にかかる処理負荷をどのように分散するかについて開示されていなかった。 The prior art does not disclose how to distribute the processing load applied to each of the keyword narrowing process and the object image narrowing process.

以上の課題を解決するために、本発明の好適な態様に係る情報処理装置は、ユーザの行動履歴を示す行動情報と第１の絞り込みの程度とに基づいて、複数の物体の画像を動画から特定し、特定された複数の物体の画像の各々についてコメントの対象の候補となる候補キーワードを生成するキーワード生成部と、前記行動情報と第２の絞り込みの程度に基づいて、コメントの対象となる一又は複数の対象キーワードを前記キーワード生成部によって生成された複数の候補キーワードから特定する特定部と、前記一又は複数の対象キーワードの各々について、当該対象キーワードに関連するコメントを生成するコメント生成部と、前記第１の絞り込みの程度と前記第２の絞り込みの程度とを、前記キーワード生成部及び前記特定部の処理に関する処理情報に応じて調整する調整部とを備える。 In order to solve the above problems, an information processing apparatus according to a preferred aspect of the present invention extracts images of a plurality of objects from a moving image based on action information indicating a user's action history and the degree of first narrowing down. A keyword generation unit that identifies and generates candidate keywords that are candidates for comments for each of the identified images of a plurality of objects; A specifying unit that specifies one or more target keywords from a plurality of candidate keywords generated by the keyword generating unit, and a comment generating unit that generates comments related to the target keywords for each of the one or more target keywords. and an adjustment unit that adjusts the degree of first narrowing down and the degree of second narrowing down according to processing information related to the processing of the keyword generating unit and the specifying unit.

また、以上の課題を解決するために、本発明の好適な態様に係る情報処理システムは、ユーザが管理するユーザ装置と、サーバ装置とを備える情報処理システムであって、前記ユーザ装置は、前記ユーザの行動履歴を示す行動情報と第１の絞り込みの程度とに基づいて、複数の物体の画像を動画から特定し、特定された複数の画像の各々についてコメントの対象の候補となる候補キーワードを生成するキーワード生成部と、前記行動情報、前記キーワード生成部によって生成された複数の候補キーワード、及び前記キーワード生成部の処理に関する処理情報を前記サーバ装置へ送信し、前記サーバ装置から送信されるコメントを受信する第１通信装置と、前記コメントを表示装置に表示させる表示制御部と、を備え、前記サーバ装置は、前記ユーザ装置から送信される前記行動情報、前記複数の候補キーワード及び前記キーワード生成部の処理に関する処理情報を受信し、前記コメントを前記ユーザ装置へ送信する第２通信装置と、前記行動情報と第２の絞り込みの程度とに基づいて、コメントの対象となる一又は複数の対象キーワードを前記複数の候補キーワードから特定する特定部と、前記一又は複数の対象キーワードの各々について、当該対象キーワードに関連するコメントを前記コメントとして生成するコメント生成部と、前記第１の絞り込みの程度と前記第２の絞り込みの程度とを、前記キーワード生成部の処理に関する処理情報及び前記特定部の処理に関する処理情報に応じて調整する調整部とを備える。 Further, in order to solve the above problems, an information processing system according to a preferred aspect of the present invention is an information processing system comprising a user device managed by a user and a server device, wherein the user device comprises the A plurality of images of objects are identified from the moving image based on the action information indicating the user's action history and the degree of the first narrowing down, and candidate keywords that are candidates for comments are determined for each of the plurality of identified images. A comment transmitted from the server device by transmitting a keyword generation unit to be generated, the action information, a plurality of candidate keywords generated by the keyword generation unit, and processing information related to the processing of the keyword generation unit to the server device. and a display control unit for displaying the comment on a display device, wherein the server device receives the action information, the plurality of candidate keywords and the keyword generation transmitted from the user device a second communication device that receives processing information about processing of a part and transmits the comment to the user device; a specifying unit that specifies a keyword from the plurality of candidate keywords; a comment generating unit that generates, as the comment, a comment related to the target keyword for each of the one or more target keywords; and the degree of the first narrowing down. and the degree of the second narrowing down according to processing information about the processing of the keyword generating unit and processing information about the processing of the specifying unit.

本発明に係る情報処理装置又は情報処理システムによれば、複数の物体の画像を特定する絞り込みの処理負荷と複数の候補キーワードから対象キーワードを特定する絞り込みの処理負荷とを処理能力に応じて調整することができる。 According to the information processing apparatus or information processing system according to the present invention, the processing load of narrowing down to identify a plurality of object images and the processing load of narrowing down to identify a target keyword from a plurality of candidate keywords are adjusted according to the processing power. can do.

本発明の第１実施形態に係るサービスシステムの全体構成を示すブロック図である。It is a block diagram showing the whole service system composition concerning a 1st embodiment of the present invention. 同実施形態に用いるサーバ装置のハードウェア構成を例示するブロック図である。It is a block diagram which illustrates the hardware constitutions of the server apparatus used for the same embodiment. 同実施形態に用いるサーバ装置の機能を示す機能ブロック図である。3 is a functional block diagram showing functions of a server device used in the same embodiment; FIG. 同実施形態における物体の画像の一例を示す説明図である。It is an explanatory view showing an example of an image of an object in the same embodiment. 同実施形態に用いるサーバ装置の動作を示すフローチャートである。It is a flowchart which shows operation|movement of the server apparatus used for the same embodiment. 第２実施形態に用いるユーザ装置のハードウェア構成を例示するブロック図である。FIG. 11 is a block diagram illustrating the hardware configuration of a user device used in the second embodiment; FIG. 第２実施形態に用いるユーザ装置の機能を示す機能ブロック図である。FIG. 8 is a functional block diagram showing functions of a user device used in the second embodiment; 同実施形態に用いるユーザ装置の動作を示すフローチャートである。It is a flowchart which shows operation|movement of the user apparatus used for the same embodiment. 第３実施形態に用いるサーバ装置のハードウェア構成を例示するブロック図である。FIG. 12 is a block diagram illustrating the hardware configuration of a server device used in the third embodiment; FIG. 同実施形態に用いるサーバ装置の機能を示す機能ブロック図である。3 is a functional block diagram showing functions of a server device used in the same embodiment; FIG. 同実施形態に用いるユーザ装置のハードウェア構成を例示するブロック図である。It is a block diagram which illustrates the hardware constitutions of the user's apparatus used for the same embodiment. 同実施形態に用いるユーザ装置の機能を示す機能ブロック図である。3 is a functional block diagram showing functions of a user device used in the same embodiment; FIG. 同実施形態に用いるサーバ装置及びユーザ装置の各々の動作を示すフローチャートである。It is a flowchart which shows each operation|movement of a server apparatus and a user apparatus which are used for the same embodiment.

[１．第１実施形態]
［１．１．サービスシステムの構成］
図１は、本発明の第１実施形態に係るサービスシステムの全体構成を示すブロック図である。図１に示されるサービスシステム１は、動画の配信サービスを提供する。例えば、動画の配信サービスは、映画又は地上波デジタル放送のコンテンツ等を提供する。[1. First Embodiment]
[1.1. Service system configuration]
FIG. 1 is a block diagram showing the overall configuration of a service system according to the first embodiment of the invention. A service system 1 shown in FIG. 1 provides a video distribution service. For example, a moving image distribution service provides content of movies or terrestrial digital broadcasting.

図１に例示するように、サービスシステム１は、ユーザＵ_1～ユーザＵ_mが管理するユーザ装置２０_1～２０_m（ｍは１以上の整数）と、ネットワークＮＷと、サーバ装置１０とを備える。ネットワークＮＷは移動体通信網又はインターネット等を含む電気通信回線である。以下の説明では、同種の要素を区別しない場合には、ユーザ装置２０、ユーザＵのように、参照符号のうちの共通番号だけを使用する。 As illustrated in FIG. 1, the service system 1 includes user devices 20_1 to 20_m (m is an integer equal to or greater than 1) managed by users U_1 to U_m, a network NW, and a server device . The network NW is a telecommunications line including a mobile communication network or the Internet. In the following description, only common numbers among reference numerals are used, such as user device 20 and user U, when similar elements are not distinguished.

ユーザ装置２０は、各種の情報を処理する情報処理装置である。ユーザ装置２０は、例えば、スマートフォン又はタブレット端末等の可搬型の情報処理装置である。但し、ユーザ装置２０としては、任意の情報処理装置を採用することができる。ユーザ装置２０は、例えば、パーソナルコンピュータ等の端末型の情報処理装置であってもよい。 The user device 20 is an information processing device that processes various types of information. The user device 20 is, for example, a portable information processing device such as a smart phone or a tablet terminal. However, any information processing device can be adopted as the user device 20 . The user device 20 may be, for example, a terminal-type information processing device such as a personal computer.

ユーザ装置２０は、ネットワークＮＷを介して、サーバ装置１０と通信可能である。ユーザ装置２０は、サーバ装置１０から送信される画像信号Ｓａを受信して当該画像信号Ｓａに応じた画像を表示したり、或いは画像信号Ｓａをテレビジョン受像機３０に送信してテレビジョン受像機３０に画像を表示させることができる。また、ユーザ装置２０は、ネットワークＮＷを介して、行動情報をサーバ装置１０に送信する。行動情報は、ユーザの行動履歴を示す。行動情報は、ユーザの位置を示す位置情報、ユーザの物品又はサービスの購買に関する購買情報、ｗｅｂの閲覧に関する閲覧情報、及び動画又は音楽の再生に関する再生情報の各々に、時間情報を対応付けた情報である。 The user device 20 can communicate with the server device 10 via the network NW. The user device 20 receives the image signal Sa transmitted from the server device 10 and displays an image corresponding to the image signal Sa, or transmits the image signal Sa to the television receiver 30 and 30 can be caused to display an image. Also, the user device 20 transmits behavior information to the server device 10 via the network NW. The action information indicates the user's action history. The behavior information is information in which time information is associated with each of position information indicating the user's position, purchase information regarding the user's purchase of goods or services, browsing information regarding web browsing, and playback information regarding video or music playback. is.

サーバ装置１０は、動画を示す画像信号Ｓａをユーザ装置２０へ送信する動画配信機能と、コメント生成機能とを有する情報処理装置である。コメント生成機能とは、画像信号Ｓａの示す動画の一画面の画像に含まれる複数の物体の画像からユーザＵの関心のある物体の画像を当該ユーザの行動履歴に基づいて絞り込み、絞り込んだ物体の画像についてのコメントを生成する機能である。 The server device 10 is an information processing device having a moving image distribution function of transmitting an image signal Sa representing a moving image to the user device 20 and a comment generating function. The comment generating function narrows down images of objects of interest to the user U based on the action history of the user from among a plurality of images of objects included in one screen image of the moving image indicated by the image signal Sa, and selects the images of the narrowed down objects. It is a function to generate comments about images.

［１．２．サーバ装置１０の構成］
図２は、サーバ装置１０のハードウェア構成を例示するブロック図である。サーバ装置１０は、処理装置１１Ａ、記憶装置１２Ａ、通信装置１４Ａ、及びバス１９を具備するコンピュータシステムにより実現される。処理装置１１Ａ、記憶装置１２Ａ、及び通信装置１４Ａは、情報を通信するためのバス１９で接続される。バス１９は、単一のバスで構成されてもよいし、装置間で異なるバスで構成されてもよい。なお、サーバ装置１０の各要素は、単数又は複数の機器で構成され、サーバ装置１０の一部の要素は省略されてもよい。[1.2. Configuration of server device 10]
FIG. 2 is a block diagram illustrating the hardware configuration of the server device 10. As shown in FIG. The server device 10 is implemented by a computer system comprising a processing device 11A, a storage device 12A, a communication device 14A, and a bus 19. FIG. The processing device 11A, storage device 12A, and communication device 14A are connected by a bus 19 for communicating information. The bus 19 may be composed of a single bus, or may be composed of different buses between devices. Note that each element of the server device 10 may be composed of one or more devices, and some elements of the server device 10 may be omitted.

処理装置１１Ａは、サーバ装置１０の全体を制御するプロセッサであり、例えば単数又は複数のチップで構成される。処理装置１１Ａは、例えば、周辺装置とのインタフェース、演算装置及びレジスタ等を含む中央処理装置（ＣＰＵ：Central Processing Unit）で構成される。なお、処理装置１１Ａの機能の一部又は全部は、ＤＳＰ（Digital Signal Processor）、ＡＳＩＣ（Application Specific Integrated Circuit）、ＰＬＤ（Programmable Logic Device）、ＦＰＧＡ（Field Programmable Gate Array）等のハードウェアで実現されてもよい。処理装置１１Ａは、各種の処理を並列的又は逐次的に実行する。 The processing device 11A is a processor that controls the entire server device 10, and is composed of, for example, one or more chips. The processing device 11A is composed of, for example, a central processing unit (CPU) including an interface with peripheral devices, an arithmetic device, registers, and the like. Some or all of the functions of the processing device 11A are realized by hardware such as a DSP (Digital Signal Processor), an ASIC (Application Specific Integrated Circuit), a PLD (Programmable Logic Device), and an FPGA (Field Programmable Gate Array). may The processing device 11A executes various processes in parallel or sequentially.

記憶装置１２Ａは、処理装置１１Ａが読取可能な記録媒体である。記憶装置１２Ａは、処理装置１１Ａが実行する制御プログラムＰＲａを含む複数のプログラム、特徴量テーブルＴＢＬａ、コメントテーブルＴＢＬｂ、サービスデータＤＳ、及び処理装置１１Ａが使用する各種のデータを記憶する。記憶装置１２Ａは、例えば、ＲＯＭ（Read Only Memory）、ＥＰＲＯＭ（Erasable Programmable ROM）、ＥＥＰＲＯＭ（Electrically Erasable Programmable ROM）、ＲＡＭ（Random Access Memory）等の記憶回路の１種類以上で構成される。 The storage device 12A is a recording medium readable by the processing device 11A. The storage device 12A stores a plurality of programs including a control program PRa executed by the processing device 11A, a feature amount table TBLa, a comment table TBLb, service data DS, and various data used by the processing device 11A. The storage device 12A is composed of, for example, one or more types of storage circuits such as ROM (Read Only Memory), EPROM (Erasable Programmable ROM), EEPROM (Electrically Erasable Programmable ROM), and RAM (Random Access Memory).

特徴量テーブルＴＢＬａは、物体の種類を示す単語と特徴量とを対応付けて記憶する。物体の種類を示す単語は、例えば、マルチーズ(Maltese dogs)、ワイン等の名詞である。特徴量は物体の形状に関する第１の特徴量及び物体の色に関する第２の特徴量を含む。例えば、マルチーズという単語に対応付けて特徴量テーブルＴＢＬａに記憶されている特徴量は、マルチーズの形状に関する第１の特徴量とマルチーズの色に関する第２の特徴量を含む。また、液体のように特定の形状を有さない物体については、特徴量テーブルＴＢＬａには当該物体の種類を示す単語に対応付けて、当該物体を収納する容器の形状に関する第１の特徴量と当該物体の色に関する第２の特徴量が記憶されている。例えば、ワインという単語に対応付けて特徴量テーブルＴＢＬａに記憶されている特徴量は、ワインボトルの形状に関する第１の特徴量とワインの色に関する第２の特徴量を含む。 The feature amount table TBLa associates and stores words indicating types of objects and feature amounts. Words indicating types of objects are, for example, nouns such as Maltese dogs and wine. The feature amount includes a first feature amount regarding the shape of the object and a second feature amount regarding the color of the object. For example, the feature quantity stored in the feature quantity table TBLa in association with the word maltese includes a first feature quantity relating to the shape of the maltese and a second feature quantity relating to the color of the maltese. For an object that does not have a specific shape, such as a liquid, the feature quantity table TBLa associates a word indicating the type of the object with a first feature value related to the shape of the container that stores the object. A second feature quantity relating to the color of the object is stored. For example, the feature quantity stored in the feature quantity table TBLa in association with the word wine includes a first feature quantity relating to the shape of the wine bottle and a second feature quantity relating to the color of the wine.

サービスデータＤＳは、ネットワークＮＷを介してサーバ装置１０からユーザ装置２０にストリーミングにより配信される動画を表す動画データである。サービスデータＤＳは、圧縮されており、Ｉフレーム(I-frame: Intra-coded frame)、Ｐフレーム(P-frame: Predicted Frame)、及びＢフレーム(B-frame: Bi-directional Predicted Frame)を有する。Ｉフレームは非圧縮のフレームである。Ｐフレーム及びＢフレームの各々は差分のフレームである。処理装置１１Ａは、サービスデータＤＳを伸長して画像信号Ｓａを生成する。 The service data DS is moving image data representing a moving image distributed by streaming from the server device 10 to the user device 20 via the network NW. The service data DS is compressed and has an I frame (I-frame: Intra-coded frame), a P frame (P-frame: Predicted Frame), and a B frame (B-frame: Bi-directional Predicted Frame). . An I-frame is an uncompressed frame. Each of the P-frames and B-frames is a differential frame. The processing device 11A expands the service data DS to generate the image signal Sa.

通信装置１４Ａの一例としては、例えばネットワークデバイス、ネットワークコントローラ、ネットワークカード又は通信モジュールが挙げられる。通信装置１４Ａは、処理装置１１Ａによる制御の下、ネットワークＮＷを介してユーザ装置２０と通信する。通信装置１４Ａは、画像信号Ｓａをユーザ装置２０へ送信する。また、通信装置１４Ａは、ユーザ装置２０から送信される行動情報を受信し、コメント生成機能により生成したコメントをユーザ装置２０へ送信する。 Examples of the communication device 14A include, for example, network devices, network controllers, network cards, or communication modules. The communication device 14A communicates with the user device 20 via the network NW under the control of the processing device 11A. The communication device 14A transmits the image signal Sa to the user device 20. FIG. The communication device 14A also receives action information transmitted from the user device 20 and transmits comments generated by the comment generation function to the user device 20 .

［１．３．サーバ装置１０の機能］
図３は、サーバ装置１０の機能を示す機能ブロック図である。処理装置１１Ａは記憶装置１２Ａから制御プログラムＰＲａを読み取り実行することによって、キーワード生成部１１０Ａ、特定部１２０、コメント生成部１３０、及び調整部１４０として機能する。[1.3. Function of server device 10]
FIG. 3 is a functional block diagram showing functions of the server device 10. As shown in FIG. The processing device 11A functions as a keyword generation unit 110A, a specification unit 120, a comment generation unit 130, and an adjustment unit 140 by reading and executing the control program PRa from the storage device 12A.

キーワード生成部１１０Ａは、ネットワークＮＷを介してユーザ装置２０から受信した行動情報と第１の絞り込みの程度とに基づいて、ユーザの関心のある複数の物体の画像を動画から特定する。 Keyword generation unit 110A identifies images of a plurality of objects of interest to the user from the moving image based on the behavior information received from user device 20 via network NW and the degree of first narrowing down.

キーワード生成部１１０Ａには、調整部１４０から第１制御情報ＳＤａが与えられる。第１制御情報ＳＤａは、第１の絞り込みの程度を示す情報の一例である。第１制御情報ＳＤａは、キーワード生成部１１０Ａにおける絞り込みの程度を指定する。第１制御情報ＳＤａは、例えばレベル１～３を指定する。レベル１は絞り込みの程度「低」を意味する。レベル２は絞り込みの程度「中」を意味する。レベル３は絞り込みの程度「高」を意味する。絞り込みの程度が「低」とは、行動情報の示す行動履歴を広く浅く解析して絞り込みを行うことを意味する。絞り込みの程度が「高」とは、行動情報の示す行動履歴を深く解析して絞り込みを行うことを意味する。第１制御情報ＳＤａによりレベル１が指定された場合、絞り込みの程度が低いので、キーワード生成部１１０Ａにより生成される候補キーワードＫＷの数は多くなる。一方、第１制御情報ＳＤａによりレベル３が指定された場合、絞り込みの程度が高いので、キーワード生成部１１０Ａにより絞り込まれる候補キーワードＫＷの数は少なくなる。絞り込みの程度が高いほど行動履歴を深く解析するので、キーワード生成部１１０Ａの処理負荷は高くなる。 The first control information SDa is provided from the adjuster 140 to the keyword generator 110A. The first control information SDa is an example of information indicating the degree of first narrowing down. The first control information SDa designates the degree of narrowing down in the keyword generator 110A. The first control information SDa designates levels 1 to 3, for example. Level 1 means "low" degree of refinement. Level 2 means "medium" degree of refinement. Level 3 means "high" degree of narrowing down. A "low" degree of narrowing down means that narrowing down is performed by broadly and shallowly analyzing the action history indicated by the action information. A "high" degree of narrowing down means that the action history indicated by the action information is deeply analyzed and narrowed down. When level 1 is specified by the first control information SDa, the degree of narrowing down is low, so the number of candidate keywords KW generated by the keyword generation unit 110A increases. On the other hand, when level 3 is specified by the first control information SDa, the degree of narrowing down is high, so the number of candidate keywords KW narrowed down by the keyword generation unit 110A is small. The higher the degree of narrowing down, the deeper the analysis of the action history, so the processing load on the keyword generation unit 110A increases.

図３に示すように、キーワード生成部１１０Ａは、決定部１１１、特徴量生成部１１２、抽出部１１３Ａ、及び変換部１１４を備える。 As shown in FIG. 3, the keyword generation unit 110A includes a determination unit 111, a feature amount generation unit 112, an extraction unit 113A, and a conversion unit 114.

決定部１１１は、物体に対するユーザの関心の程度を評価した結果と第１制御情報ＳＤａとに基づいて、動画から抽出する物体の種類を決定する。物体に対するユーザの関心の程度の評価には評価関数が用いられる。評価関数は、所定期間（例えば、過去１ヶ月）の行動情報の示す時間情報、位置情報、購買情報、閲覧情報、及び再生情報を変数とし、ユーザの関心がある物体についての関心の程度を評価値として出力する。第１制御情報ＳＤａの示す絞り込みの程度が低いほど多くの物体の種類が決定される。
具体的には、決定部１１１は、絞り込みの程度に応じた閾値と評価値とを比較することによって、動画から抽出する物体の種類を決定する。例えば、再生情報がサッカーの試合の動画が再生されたことを示すものとする。また、再生されたサッカーの試合は、サッカーチームＡとサッカーチームＢとの対戦であったものとする。また、購買情報がサッカーチームＡに属する特定のサッカー選手のユニフォームの購入を示すものとする。さらに、「サッカー」に関する物体についての評価値をＸ１、「サッカーチームＡ」に関する物体についての評価値をＸ２、特定のサッカー選手に関する物体についての評価値をＸ３とする。この場合、「サッカー」の概念が最も広く、「特定のサッカー選手」の概念が最も狭い。そして、「サッカーチームＡ」の概念は、「サッカー」の概念と「特定のサッカー選手」の概念の中間にある。すなわち、ユーザの関心は、「特定のサッカー選手」、「サッカーチームＡ」、「サッカー」の順に高いと言える。評価値は、Ｘ３＞Ｘ２＞Ｘ１となる。
ここで、絞り込みの程度がレベル１に対応する閾値がＲ１であるとする。絞り込みの程度がレベル２に対応する閾値がＲ２であるとする。絞り込みの程度がレベル３に対応する閾値がＲ３であるとする。さらに、Ｘ３＞Ｒ３＞Ｘ２＞Ｒ２＞Ｘ１＞Ｒ１であるとする。
絞り込みの程度がレベル１の場合、決定部１１１は動画から抽出する物体の種類として「サッカーに関する物体」を決定する。絞り込みの程度がレベル２の場合、決定部１１１は動画から抽出する物体の種類として「サッカーチームＡに関する物体」を決定する。絞り込みの程度がレベル３の場合、決定部１１１は動画から抽出する物体の種類として「特定のサッカー選手に関する物体」を決定する。決定部１１１は、絞り込みの程度に応じた閾値と評価値とを比較することによって、動画から抽出する物体の種類を決定する。The determination unit 111 determines the type of object to be extracted from the moving image based on the result of evaluating the user's degree of interest in the object and the first control information SDa. An evaluation function is used to evaluate the user's degree of interest in the object. The evaluation function uses time information, position information, purchase information, browsing information, and playback information indicating behavior information for a predetermined period (for example, the past month) as variables, and evaluates the degree of interest of the user in the object of interest. Output as a value. More types of objects are determined as the degree of narrowing indicated by the first control information SDa is lower.
Specifically, the determination unit 111 determines the type of object to be extracted from the moving image by comparing the threshold corresponding to the degree of narrowing down and the evaluation value. For example, the playback information may indicate that a video of a soccer game was played. It is also assumed that the reproduced soccer match was a match between soccer team A and soccer team B. Also assume that the purchase information indicates the purchase of a uniform for a particular soccer player belonging to soccer team A. Further, let the evaluation value for the object related to "soccer" be X1, the evaluation value for the object related to "soccer team A" be X2, and the evaluation value for the object related to a specific soccer player be X3. In this case, the concept of "soccer" is the broadest and the concept of "a particular soccer player" is the narrowest. And the concept of "soccer team A" is intermediate between the concept of "soccer" and the concept of "a particular soccer player". That is, it can be said that the user's interest is higher in the order of "specific soccer player", "soccer team A", and "soccer". The evaluation values are X3>X2>X1.
Here, it is assumed that the threshold corresponding to level 1 of the degree of narrowing down is R1. Assume that the threshold corresponding to level 2 of the degree of narrowing down is R2. Assume that the threshold corresponding to level 3 of the degree of narrowing down is R3. Furthermore, assume that X3>R3>X2>R2>X1>R1.
When the degree of narrowing down is level 1, the determining unit 111 determines “objects related to soccer” as the type of object to be extracted from the moving image. When the degree of narrowing down is level 2, the determining unit 111 determines “objects related to soccer team A” as the type of object to be extracted from the moving image. When the degree of narrowing down is level 3, the determining unit 111 determines “objects related to a specific soccer player” as the type of object to be extracted from the moving image. The determining unit 111 determines the type of object to be extracted from the moving image by comparing the threshold corresponding to the degree of narrowing down and the evaluation value.

特徴量生成部１１２は、決定部１１１により決定された物体の種類について、当該種類の物体の特徴量を生成する。特徴量生成部１１２は、決定部１１１により決定された物体の種類に対応する特徴量を特徴量テーブルＴＢＬａから読み出すことで、物体の特徴量を生成する。 For the type of object determined by the determination unit 111, the feature amount generation unit 112 generates a feature amount of the object of the type. The feature quantity generation unit 112 reads the feature quantity corresponding to the type of the object determined by the determination unit 111 from the feature quantity table TBLa to generate the feature quantity of the object.

抽出部１１３Ａは、特徴量生成部１１２により生成された特徴量を有する物体の画像をサービスデータＤＳから抽出する。抽出部１１３Ａは、サービスデータＤＳに含まれるＩフレーム、Ｐフレーム及びＢフレームのうち、Ｉフレームの画像から物体の画像を抽出する。１つのＩフレームの画像には、多数の物体の画像が存在する。Ｉフレームの画像が、図４に示されるものである場合、抽出部１１３Ａは、例えば画像ＯＢ１～ＯＢ５を抽出する。 113 A of extraction parts extract the image of the object which has the feature-value produced|generated by the feature-value production|generation part 112 from the service data DS. The extraction unit 113A extracts the image of the object from the I-frame image among the I-frame, P-frame, and B-frame included in the service data DS. There are many object images in one I-frame image. When the I-frame images are those shown in FIG. 4, the extraction unit 113A extracts images OB1 to OB5, for example.

変換部１１４は、抽出部１１３Ａによって抽出された複数の物体の画像ＯＢの各々についてコメントの対象の候補となる候補キーワードＫＷを生成する。具体的には、変換部１１４は、複数の物体の画像ＯＢの各々を候補キーワードＫＷに変換する。変換部１１４は、例えば、機械学習により学習された画像認識モデルを用いて、物体の画像ＯＢを候補キーワードＫＷに変換する。例えば、図４に示される画像ＯＢ１は「ワイン」に、画像ＯＢ２は「ワイングラス」に、画像ＯＢ３は「時計」に、画像ＯＢ４は「キャンドル」に、画像ＯＢ５は「洋食」に夫々変換される。 The conversion unit 114 generates a candidate keyword KW as a comment target candidate for each of the plurality of object images OB extracted by the extraction unit 113A. Specifically, the conversion unit 114 converts each of the plurality of object images OB into candidate keywords KW. The conversion unit 114 converts the image OB of the object into the candidate keyword KW using, for example, an image recognition model learned by machine learning. For example, image OB1 shown in FIG. 4 is converted to "wine", image OB2 to "wine glass", image OB3 to "clock", image OB4 to "candle", and image OB5 to "Western food". be.

特定部１２０は、行動情報を用いた絞り込みにより、キーワード生成部１１０Ａよって生成された複数の候補キーワードＫＷからコメントの対象となる一又は複数の対象キーワードＫＸを特定する。前述したように、行動情報には、時間情報、位置情報、購買情報、閲覧情報、及び再生情報が含まれる。特定部１２０は、複数の候補キーワードＫＷの各々について、時間情報、位置情報、購買情報、閲覧情報、及び再生情報を変数として評価関数により評価値を算出する。特定部１２０は、複数の候補キーワードＫＷを評価値の高い順にランク付けする。次いで、特定部１２０は、コメントの対象となる一又は複数の対象キーワードＫＸを、複数の候補キーワードＫＷの各々のランクと第２制御情報ＳＤｂとに基づいて特定する。第２制御情報ＳＤｂは、第２の絞り込みの程度を示す情報の一例である。 The identifying unit 120 identifies one or more target keywords KX to be commented from among the plurality of candidate keywords KW generated by the keyword generating unit 110A by narrowing down using the behavior information. As described above, the action information includes time information, location information, purchase information, browsing information, and playback information. The specifying unit 120 calculates an evaluation value for each of the plurality of candidate keywords KW using an evaluation function using time information, position information, purchase information, browsing information, and reproduction information as variables. The identifying unit 120 ranks the plurality of candidate keywords KW in descending order of evaluation value. Next, the specifying unit 120 specifies one or a plurality of target keywords KX to be commented on based on the rank of each of the plurality of candidate keywords KW and the second control information SDb. The second control information SDb is an example of information indicating the degree of second narrowing down.

第２制御情報ＳＤｂは調整部１４０により生成され、調整部１４０から特定部１２０に与えられる。第２制御情報ＳＤｂは、第１制御情報ＳＤａと同様に、例えばレベル１～３を指定する。レベル１、レベル２及びレベル３の意味は、前述した第１制御情報ＳＤａにおけるレベル１、レベル２及びレベル３の各々の意味と同じである。 The second control information SDb is generated by the adjusting section 140 and provided from the adjusting section 140 to the specifying section 120 . The second control information SDb designates levels 1 to 3, for example, like the first control information SDa. The meanings of level 1, level 2 and level 3 are the same as those of level 1, level 2 and level 3 in the first control information SDa described above.

特定部１２０は、第２制御情報ＳＤｂの示す絞り込みの程度が低いほど、多くの対象キーワードＫＸを特定する。第２制御情報ＳＤｂによりレベル１が指定された場合、絞り込みの程度が低いので、特定部１２０における絞り込みにより得られる対象キーワードＫＸの数は多くなる。一方、第２制御情報ＳＤｂによりレベル３が指定された場合、絞り込みの程度が高いので、特定部１２０における絞り込みにより得られる対象キーワードＫＸの数は少なくなる。絞り込みの程度が高いほど、特定部１２０の処理負荷は高くなる。 The identification unit 120 identifies more target keywords KX as the degree of narrowing indicated by the second control information SDb is lower. When level 1 is specified by the second control information SDb, the degree of narrowing down is low, so the number of target keywords KX obtained by narrowing down in the specifying unit 120 increases. On the other hand, when level 3 is specified by the second control information SDb, the degree of narrowing down is high, so the number of target keywords KX obtained by narrowing down in the specifying unit 120 is small. As the degree of narrowing down increases, the processing load on the identification unit 120 increases.

コメント生成部１３０は、一又は複数の対象キーワードＫＸの各々について、関連するコメントを生成する。コメントとは、対象キーワードＫＸについての説明又は解説の意味である。また、コメントはレコメンドを含む概念である。このため、対象キーワードＫＸに関連してユーザＵに購入を勧める商品及び商品を取り扱う店舗に関する情報がコメントに含まれる。コメント生成部１３０は、キーワードに対応付けてコメントを記憶したコメントテーブルＴＢＬｂから対象キーワードＫＸに対応するコメントを読み出すことによって、コメントを生成する。あるいは、コメント生成部１３０は、ネットワークＮＷに接続される検索サイトにアクセスして対象キーワードＫＸに関連する情報を取得し、取得した情報をコメントとして用いてもよい。コメント生成部１３０により生成されたコメントは、例えば電子メール等で通信装置１４Ａによってユーザ装置２０へ送信される。 The comment generating unit 130 generates related comments for each of one or more target keywords KX. A comment means an explanation or commentary on the target keyword KX. A comment is a concept including recommendations. For this reason, the comment includes information about the product that the user U is recommended to purchase and the store that sells the product in relation to the target keyword KX. The comment generation unit 130 generates a comment by reading a comment corresponding to the target keyword KX from the comment table TBLb storing comments in association with keywords. Alternatively, the comment generation unit 130 may access a search site connected to the network NW, acquire information related to the target keyword KX, and use the acquired information as a comment. The comment generated by the comment generator 130 is transmitted to the user device 20 by the communication device 14A, for example, by e-mail.

調整部１４０は、キーワード生成部１１０Ａにおいて物体の種類を絞り込む程度（第１の絞り込みの程度）と、特定部１２０において複数の候補キーワードＫＷから一又は複数の対象キーワードＫＸを絞り込む程度（第２の絞り込みの程度）とを、キーワード生成部１１０Ａの処理及び特定部１２０の処理に関する処理情報に応じて調整する。処理情報は、キーワード生成部１１０Ａの処理能力及び特定部１２０の処理能力を示す情報である。キーワード生成部１１０Ａの処理能力及び特定部１２０の処理能力は、ＣＰＵの処理能力、或いはキーワード生成部１１０Ａの機能と特定部１２０の機能とに夫々割り当てられるＣＰＵのリソースに応じて変化する。 The adjustment unit 140 adjusts the degree to which the keyword generation unit 110A narrows down the types of objects (first degree of narrowing down) and the degree to which the specifying unit 120 narrows down one or more target keywords KX from a plurality of candidate keywords KW (second degree of narrowing down). degree of narrowing down) is adjusted according to processing information relating to the processing of the keyword generation unit 110A and the processing of the identification unit 120. FIG. The processing information is information indicating the processing capability of the keyword generation unit 110A and the processing capability of the identification unit 120. FIG. The processing capacity of the keyword generating section 110A and the processing capacity of the specifying section 120 change according to the processing capacity of the CPU or the resources of the CPU allocated to the functions of the keyword generating section 110A and the specifying section 120, respectively.

調整部１４０は、ＣＰＵの処理能力が予め定められた閾値を上回っている場合、又は特定部１２０の機能に割り当てられるリソースがキーワード生成部１１０Ａの機能に割り当てられるリソースよりも多い場合には、特定部１２０の処理能力が高いと判定する。リソースの大小は、例えば、コア数及びスレッド数の少なくとも一方の大小によって定まる。例えば、キーワード生成部１１０Ａに割り当てられるコア数が「１」且つスレッド数が「１」であり、特定部１２０に割り当てられるコア数が「１」且つスレッド数が「２」であることを想定する。この場合、特定部１２０の処理能力は、キーワード生成部１１０Ａの処理能力よりも高いと判定する。
特定部１２０の処理能力が高いと判定した場合、調整部１４０は、キーワード生成部１１０Ａにおける絞り込みの程度を低くし、特定部１２０における絞り込みの程度を高くする。具体的には、調整部１４０は、レベル１を示す第１制御情報ＳＤａと、レベル３を示す第２制御情報ＳＤｂとを生成する。レベル１を示す第１制御情報ＳＤａが与えられることで、キーワード生成部１１０Ａの処理負荷は低くなる。一方、レベル３を示す第２制御情報ＳＤｂが与えられることで、特定部１２０の処理負荷は高くなる。If the processing power of the CPU exceeds a predetermined threshold value, or if the resources allocated to the function of the identification unit 120 are greater than the resources allocated to the function of the keyword generation unit 110A, the adjustment unit 140 performs the identification It is determined that the processing capability of the unit 120 is high. The size of resources is determined, for example, by at least one of the number of cores and the number of threads. For example, assume that the number of cores and the number of threads allocated to the keyword generation unit 110A is "1" and the number of threads is "1", and the number of cores and the number of threads allocated to the identification unit 120 is "1" and "2". . In this case, it is determined that the processing capability of the identification unit 120 is higher than the processing capability of the keyword generation unit 110A.
When determining that the processing capability of the identifying unit 120 is high, the adjusting unit 140 decreases the degree of narrowing down in the keyword generating unit 110A and increases the degree of narrowing down in the identifying unit 120 . Specifically, adjustment section 140 generates first control information SDa indicating level 1 and second control information SDb indicating level 3. FIG. By providing the first control information SDa indicating level 1, the processing load of the keyword generation unit 110A is reduced. On the other hand, when the second control information SDb indicating level 3 is given, the processing load on the identifying unit 120 increases.

特定部１２０の処理能力が低いと判定した場合には、調整部１４０は、キーワード生成部１１０Ａにおける絞り込みの程度を高くし、特定部１２０における絞り込みの程度を低くする。具体的には、調整部１４０は、レベル３を示す第１制御情報ＳＤａと、レベル１を示す第２制御情報ＳＤｂとを生成する。レベル３を示す第１制御情報ＳＤａを与えられることで、キーワード生成部１１０Ａの処理負荷は高くなる。一方、レベル３を示す第２制御情報ＳＤｂを与えられることで、特定部１２０の処理負荷は低くなる。 When determining that the processing capability of the identifying unit 120 is low, the adjusting unit 140 increases the degree of narrowing down in the keyword generating unit 110A and decreases the degree of narrowing down in the identifying unit 120 . Specifically, adjustment section 140 generates first control information SDa indicating level 3 and second control information SDb indicating level 1 . Given the first control information SDa indicating level 3, the processing load on the keyword generation unit 110A increases. On the other hand, given the second control information SDb indicating level 3, the processing load on the identification unit 120 is reduced.

特定部１２０の処理能力が高い場合にキーワード生成部１１０Ａにおける絞り込みの程度を低くし、特定部１２０における絞り込みの程度を高くする理由は次の通りである。キーワード生成部１１０Ａは、ユーザＵの関心のある物体を、抽出部１１３Ａにおいて画像を解析することによって抽出する。しかし、画像処理を用いた物体の抽出の精度は低い。従って、特定部１２０の処理能力が高い場合には、キーワード生成部１１０Ａにおける絞り込みの程度を低くする。絞り込みの程度を低くすることによって、絞り込みの程度が高い場合と比較して、キーワード生成部１１０Ａで生成される候補キーワードＫＷの数は増加する。しかし、特定部１２０の処理能力が高いので、特定部１２０は、増加した候補キーワードＫＷの中から、対象キーワードＫＸを絞り込むことができる。
一方、特定部１２０の処理能力が低い場合、処理能力を上回る数の候補キーワードＫＷが生成されると、特定部１２０の処理に遅延が発生する。この結果、適切なタイミングでコメントを生成することが困難になる。このような場合には、候補キーワードＫＷの精度を犠牲にしてキーワード生成部１１０Ａが生成する候補キーワードＫＷの数を減少させることが望ましい。
調整部１４０は、キーワード生成部１１０Ａの処理能力及び特定部１２０の処理能力に応じて、キーワード生成部１１０Ａにおいて物体の種類を絞り込む程度と、特定部１２０において複数の候補キーワードＫＷから一又は複数の対象キーワードＫＸを絞り込む程度とを、調整する。この結果、特定部１２０コメントの的確性が向上する。また、処理の遅延が抑制されることによって、適切なタイミングでコメントが生成される。
さらに、キーワード生成部１１０Ａの処理能力と、特定部１２０の処理能力とは、動的に変更されることがある。処理能力が動的に変更される例としては、処理装置１１Ａの処理負荷の増加に伴い、キーワード生成部１１０Ａに割り当てられるリソースと特定部１２０に割り当てられるリソースが減少する場合である。このような場合、調整部１４０は、キーワード生成部１１０Ａの処理能力及び特定部１２０の処理能力に応じて、キーワード生成部１１０Ａの絞り込みの程度と、特定部１２０の絞り込みの程度とを、調整する。そのため、コメントの的確性が向上する。コメントが適切なタイミングで生成される。The reason for lowering the degree of narrowing down in the keyword generation unit 110A and increasing the degree of narrowing down in the specifying unit 120 when the processing capability of the specifying unit 120 is high is as follows. 110 A of keyword production|generation parts extract the object which the user U is interested in by analyzing an image in 113 A of extraction parts. However, the accuracy of object extraction using image processing is low. Therefore, when the processing capability of the identification unit 120 is high, the degree of narrowing down in the keyword generation unit 110A is lowered. By lowering the degree of narrowing down, the number of candidate keywords KW generated by the keyword generation unit 110A increases compared to when the degree of narrowing down is high. However, since the identifying unit 120 has high processing capability, the identifying unit 120 can narrow down the target keyword KX from the increased candidate keywords KW.
On the other hand, if the processing capability of the identifying unit 120 is low, the processing of the identifying unit 120 will be delayed if the number of candidate keywords KW that exceeds the processing capability is generated. As a result, it becomes difficult to generate comments at appropriate times. In such a case, it is desirable to reduce the number of candidate keywords KW generated by the keyword generator 110A at the expense of the accuracy of the candidate keywords KW.
The adjustment unit 140 adjusts the extent to which the keyword generation unit 110A narrows down the types of objects in the keyword generation unit 110A and the identification unit 120 selects one or more from a plurality of candidate keywords KW according to the processing capabilities of the keyword generation unit 110A and the identification unit 120. The extent to which the target keyword KX is narrowed down is adjusted. As a result, the accuracy of the specific part 120 comment improves. In addition, comments are generated at appropriate timing by suppressing processing delays.
Furthermore, the processing capacity of the keyword generating section 110A and the processing capacity of the identifying section 120 may be dynamically changed. An example in which the processing capacity is dynamically changed is when the resources allocated to the keyword generation unit 110A and the resources allocated to the identification unit 120 decrease as the processing load of the processing device 11A increases. In such a case, the adjusting unit 140 adjusts the degree of narrowing down of the keyword generating unit 110A and the degree of narrowing down of the identifying unit 120 according to the processing ability of the keyword generating unit 110A and the processing ability of the identifying unit 120. . Therefore, the accuracy of comments is improved. Comments are generated in a timely manner.

［１．４．サーバ装置１０の動作］
次に、サーバ装置１０の動作について説明する。図５は、サーバ装置１０の動作を示すフローチャートである。[1.4. Operation of server device 10]
Next, operations of the server device 10 will be described. FIG. 5 is a flow chart showing the operation of the server device 10. As shown in FIG.

まず、処理装置１１Ａは、処理情報に応じて、第１制御情報ＳＤａと第２制御情報ＳＤｂとを生成する（ステップＳ１）。サーバ装置１０は、ユーザ装置２０に比較して処理能力の高いＣＰＵを有し、当該ＣＰＵの処理能力は所定の閾値を上回っている。ステップＳ１において処理装置１１Ａは、特定部１２０の能力が高いと判定し、レベル１を示す第１制御情報ＳＤａと、レベル３を示す第２制御情報ＳＤｂと、を生成する。 First, the processing device 11A generates first control information SDa and second control information SDb according to processing information (step S1). The server device 10 has a CPU with higher processing power than the user device 20, and the processing power of the CPU exceeds a predetermined threshold. In step S1, the processing device 11A determines that the capability of the identifying unit 120 is high, and generates first control information SDa indicating level 1 and second control information SDb indicating level 3. FIG.

次に、処理装置１１Ａは、行動情報とステップＳ１にて生成した第１制御情報ＳＤａに応じた絞り込みの程度とに基づいて、物体の種類を決定する。本動作例のステップＳ１において生成された第１制御情報ＳＤａはレベル１を示すので、ステップＳ２における物体の絞り込みの程度は低くなる。 Next, the processing device 11A determines the type of object based on the action information and the degree of narrowing down according to the first control information SDa generated in step S1. Since the first control information SDa generated in step S1 of this operation example indicates level 1, the degree of object narrowing down in step S2 is low.

次に、処理装置１１Ａは、ステップＳ２にて決定した物体の種類に対応する特徴量を特徴量テーブルＴＢＬａから読み出す（ステップＳ３）。 Next, the processing device 11A reads the feature quantity corresponding to the type of object determined in step S2 from the feature quantity table TBLa (step S3).

次に、処理装置１１Ａは、ステップＳ３にて読み出した特徴量に基づいて、サービスデータＤＳの示す動画のＩフレームから物体の画像を抽出する（ステップＳ４Ａ）。１フレームの画像には、複数のオブジェクト画像が存在するのが通常である。このため、処理装置１１Ａは、ステップＳ４Ａの処理において複数の物体の画像を抽出する。 Next, the processing device 11A extracts the image of the object from the I frame of the moving image indicated by the service data DS based on the feature amount read out in step S3 (step S4A). A single frame image usually includes a plurality of object images. Therefore, the processing device 11A extracts images of a plurality of objects in the process of step S4A.

次に、処理装置１１Ａは、ステップＳ４Ａにて抽出した複数の物体の画像の各々を候補キーワードＫＷに変換する（ステップＳ５）。 Next, the processing device 11A converts each of the plurality of object images extracted in step S4A into candidate keywords KW (step S5).

次に、処理装置１１Ａは、行動情報と第２制御情報ＳＤｂとに基づいて、ステップＳ５の処理により得られた複数の候補キーワードＫＷのうちから、対象キーワードＫＸを特定する（ステップＳ６）。本動作例のステップＳ１において生成された第２制御情報ＳＤｂはレベル３を示すので、ステップＳ６では行動情報を深く解析する絞り込みが行われる。 Next, the processing device 11A identifies the target keyword KX from among the plurality of candidate keywords KW obtained by the process of step S5 based on the action information and the second control information SDb (step S6). Since the second control information SDb generated in step S1 of this operation example indicates level 3, the action information is narrowed down to be deeply analyzed in step S6.

次に、処理装置１１Ａは、対象キーワードＫＸに関連するコメントを生成する（ステップＳ７）。ステップＳ７の処理において、処理装置１１Ａは対象キーワードＫＸに対応するコメントをコメントテーブルＴＢＬｂから読み出すことによってコメントを生成する。処理装置１１Ａは、生成したコメントを電子メール等でユーザ装置２０へ送信する。 Next, the processing device 11A generates a comment related to the target keyword KX (step S7). In the processing of step S7, the processing device 11A generates a comment by reading the comment corresponding to the target keyword KX from the comment table TBLb. The processing device 11A transmits the generated comment to the user device 20 by e-mail or the like.

また、処理装置１１Ａは、ステップＳ１の処理において調整部１４０として機能し、ステップＳ２からステップＳ５の処理においてキーワード生成部１１０Ａとして機能する。より詳細には、処理装置１１Ａは、ステップＳ２の処理において決定部１１１として機能し、ステップＳ３の処理において特徴量生成部１１２として機能し、ステップＳ４Ａの処理において抽出部１１３Ａとして機能し、ステップＳ５の処理において変換部１１４として機能する。さらに、処理装置１１Ａは、ステップＳ６の処理において特定部１２０として機能し、ステップＳ７の処理においてコメント生成部１３０として機能する。 Further, the processing device 11A functions as the adjustment unit 140 in the processing of step S1, and functions as the keyword generation unit 110A in the processing of steps S2 to S5. More specifically, the processing device 11A functions as the determination unit 111 in the process of step S2, functions as the feature amount generation unit 112 in the process of step S3, functions as the extraction unit 113A in the process of step S4A, and functions as the extraction unit 113A in the process of step S5. It functions as the conversion unit 114 in the processing of . Furthermore, the processing device 11A functions as the specifying unit 120 in the processing of step S6, and functions as the comment generating unit 130 in the processing of step S7.

本実施形態のサーバ装置１０は、コメント生成機能を有する情報処理装置、すなわち本発明の情報処理装置、の一例である。サーバ装置１０は、ユーザの行動履歴を示す行動情報に基づいて、複数の物体に各々対応する複数の画像を動画から絞り込み、絞りこまれた複数の画像の各々についてコメントの対象の候補となる候補キーワードを生成するキーワード生成部１１０Ａを備える。また、サーバ装置１０は、行動情報に基づいて、キーワード生成部１１０Ａによって生成された複数の候補キーワードを絞り込むことにより、コメントの対象となる一又は複数の対象キーワードを特定する特定部１２０を備える。さらに、サーバ装置１０は、一又は複数の対象キーワードの各々について、当該対象キーワードに関連するコメントを生成するコメント生成部１３０と、キーワード生成部１１０Ａにおいて複数の画像を絞り込む程度と、特定部１２０において複数の候補キーワードを絞り込む程度とを、キーワード生成部１１０Ａ及び特定部１２０の処理に関する処理情報に応じて調整する調整部１４０と、を備える。 The server device 10 of this embodiment is an example of an information processing device having a comment generation function, that is, an information processing device of the present invention. The server device 10 narrows down a plurality of images each corresponding to a plurality of objects from the moving image based on action information indicating the user's action history, and candidates for commenting on each of the plurality of narrowed down images. A keyword generation unit 110A for generating keywords is provided. The server device 10 also includes a specifying unit 120 that specifies one or more target keywords to be commented on by narrowing down the plurality of candidate keywords generated by the keyword generating unit 110A based on the behavior information. Furthermore, for each of one or a plurality of target keywords, the server device 10 includes a comment generation unit 130 that generates a comment related to the target keyword, an extent to which a plurality of images are narrowed down in the keyword generation unit 110A, and and an adjustment unit 140 that adjusts the degree to which a plurality of candidate keywords are narrowed down according to processing information related to the processing of the keyword generation unit 110A and the identification unit 120 .

この態様によれば、特定部１２０の処理能力に応じてキーワード生成部１１０Ａにおける絞り込みの程度と特定部１２０における絞り込みの程度とを調整することができる。上記実施形態のサーバ装置１０では、特定部１２０の処理能力が高いため、キーワード生成部１１０Ａにおける絞り込みの程度を低く、特定部１２０における絞り込みの程度を高く調整することで、コメントの的確性を向上させることができる。 According to this aspect, the degree of narrowing down in the keyword generation unit 110</b>A and the degree of narrowing down in the specifying unit 120 can be adjusted according to the processing capability of the specifying unit 120 . In the server device 10 of the above-described embodiment, since the processing capability of the identification unit 120 is high, the accuracy of comments is improved by adjusting the degree of narrowing down in the keyword generation unit 110A to be low and the degree of narrowing down in the identification unit 120 to be high. can be made

［２．第２実施形態］
第２実施形態のサービスシステム１では、サーバ装置１０は動画配信機能のみを有し、ユーザ装置２０がコメント生成機能を有する。[2. Second Embodiment]
In the service system 1 of the second embodiment, the server device 10 has only the moving image distribution function, and the user device 20 has the comment generation function.

図６は、第２実施形態のユーザ装置２０のハードウェア構成例を示す図である。ユーザ装置２０は、処理装置１１Ｂ、記憶装置１２Ｂ、通信装置１４Ｂ、表示制御部１５、及びバス１９を具備する。ユーザ装置２０は、処理装置１１Ｂ、記憶装置１２Ｂ、通信装置１４Ｂ、表示制御部１５、及びバス１９の他に、表示装置とスピーカとを含む出力装置、タッチパネル等の入力装置、近距離無線通信装置及びＧＰＳ装置を含んでもよい。近距離無線通信装置とは、近距離無線通信によって他の装置と通信する機器である。近距離無線通信には、例えばＢｌｕｅｔｏｏｔｈ（登録商標）、ＺｉｇＢｅｅ（登録商標）、又は、ＷｉＦｉ（登録商標）等が挙げられる。ＧＰＳ装置とは、複数の衛星からの電波を受信し、受信した電波から位置情報を生成する機器である。 FIG. 6 is a diagram showing a hardware configuration example of the user device 20 of the second embodiment. The user device 20 includes a processing device 11 B, a storage device 12 B, a communication device 14 B, a display control section 15 and a bus 19 . In addition to the processing device 11B, the storage device 12B, the communication device 14B, the display control unit 15, and the bus 19, the user device 20 includes an output device including a display device and a speaker, an input device such as a touch panel, and a short-range wireless communication device. and GPS devices. A short-range wireless communication device is a device that communicates with another device by short-range wireless communication. Short-range wireless communication includes, for example, Bluetooth (registered trademark), ZigBee (registered trademark), or WiFi (registered trademark). A GPS device is a device that receives radio waves from a plurality of satellites and generates position information from the received radio waves.

処理装置１１Ｂ、記憶装置１２Ｂ、及び通信装置１４Ｂの各々は、第１実施形態における処理装置１１Ａ、記憶装置１２Ａ、及び通信装置１４Ａの各々に対応する。記憶装置１２Ｂは、制御プログラムＰＲａに代えて制御プログラムＰＲｂを記憶している点と、行動情報を記憶している点と、サービスデータＤＳを記憶していない点と、で記憶装置１２Ａと相違する。通信装置１４Ｂは、処理装置１１Ｂによる制御の下、サーバ装置１０と通信する点で通信装置１４Ａと相違する。通信装置１４Ｂは、サーバ装置１０からネットワークＮＷを介して送信されてくる画像信号Ｓａを受信する。 Each of the processing device 11B, the storage device 12B, and the communication device 14B corresponds to each of the processing device 11A, the storage device 12A, and the communication device 14A in the first embodiment. The storage device 12B differs from the storage device 12A in that it stores the control program PRb instead of the control program PRa, that it stores action information, and that it does not store the service data DS. . The communication device 14B differs from the communication device 14A in that it communicates with the server device 10 under the control of the processing device 11B. The communication device 14B receives the image signal Sa transmitted from the server device 10 via the network NW.

表示制御部１５は、表示装置又はテレビジョン受像機３０の作動制御を行う。表示制御部１５は、画像信号Ｓａの示す動画を表示装置又はテレビジョン受像機３０に表示させる。また、表示制御部１５は、コメント生成機能により生成したコメントの画像を動画に対するオーバレイ用の画像として生成し、動画のフレームに当該オーバレイ用の画像を重ねて表示装置又はテレビジョン受像機３０に表示させる。なお、コメントの表示は、動画の視聴中にリアルタイムで行われる態様には限定されず、動画の視聴終了後に行われてもよい。 The display control unit 15 controls the operation of the display device or the television receiver 30 . The display control unit 15 causes the display device or the television receiver 30 to display the moving image indicated by the image signal Sa. In addition, the display control unit 15 generates an image of the comment generated by the comment generation function as an overlay image for the moving image, and displays the overlay image on the frame of the moving image on the display device or the television receiver 30. Let It should be noted that the comment display is not limited to being performed in real time while viewing the moving image, and may be performed after viewing the moving image.

図７は第２実施形態の処理装置１１Ｂの機能を示す機能ブロック図である。処理装置１１Ｂは記憶装置１２Ｂから制御プログラムＰＲｂを読み取り実行することによって、キーワード生成部１１０Ｂ、特定部１２０、コメント生成部１３０、及び調整部１４０として機能する。また、第２実施形態の処理装置１１Ｂにおいては、記憶装置１２Ｂに記憶されている行動情報がキーワード生成部１１０Ｂ及び特定部１２０に与えられる。 FIG. 7 is a functional block diagram showing functions of the processing device 11B of the second embodiment. The processing device 11B functions as a keyword generation unit 110B, a specification unit 120, a comment generation unit 130, and an adjustment unit 140 by reading and executing the control program PRb from the storage device 12B. Further, in the processing device 11B of the second embodiment, the action information stored in the storage device 12B is provided to the keyword generating section 110B and the specifying section 120. FIG.

図７に示されるようにキーワード生成部１１０Ｂは、決定部１１１、特徴量生成部１１２、抽出部１１３Ｂ、及び変換部１１４を備える。抽出部１１３Ｂには、ネットワークＮＷを介して受信した画像信号Ｓａが供給される。画像信号Ｓａは複数のフレームから構成される。画像信号ＳａはサービスデータＤＳを伸長して得られた信号であるから、画像信号Ｓａの表す各フレームには、Ｉフレーム、Ｐフレーム及びＢフレームの区別はない。抽出部１１３Ｂは、画像信号Ｓａの表す各フレームの画像から決定部１１１により決定された種類の物体の画像を抽出する点で第１実施形態の抽出部１１３Ａと相違する。 As shown in FIG. 7, the keyword generation unit 110B includes a determination unit 111, a feature amount generation unit 112, an extraction unit 113B, and a conversion unit 114. The image signal Sa received via the network NW is supplied to the extraction unit 113B. The image signal Sa is composed of a plurality of frames. Since the image signal Sa is a signal obtained by decompressing the service data DS, each frame represented by the image signal Sa does not distinguish between I-frames, P-frames and B-frames. The extraction unit 113B is different from the extraction unit 113A of the first embodiment in that it extracts an image of the type of object determined by the determination unit 111 from the image of each frame represented by the image signal Sa.

次に、第２実施形態のユーザ装置２０の動作について説明する。図８は、第２実施形態のユーザ装置２０の動作を示すフローチャートである。第２実施形態のユーザ装置２０において処理装置１１Ｂは、図７に示すステップＳ１、Ｓ２、Ｓ３、Ｓ４Ｂ、Ｓ５、Ｓ６及びＳ７の各処理をこの順に実行する。処理装置１１Ｂは、ステップＳ１の処理において調整部１４０として機能する。ただし、ユーザ装置２０は、サーバ装置１０に比較して処理能力の低いＣＰＵを有するため、ステップＳ１において処理装置１１Ｂは、特定部１２０の処理能力は低いと判定し、レベル３を示す第１制御情報ＳＤａと、レベル１を示す第２制御情報ＳＤｂと、を生成する。 Next, operation of the user device 20 of the second embodiment will be described. FIG. 8 is a flow chart showing the operation of the user device 20 of the second embodiment. In the user device 20 of the second embodiment, the processing device 11B executes steps S1, S2, S3, S4B, S5, S6 and S7 shown in FIG. 7 in this order. The processing device 11B functions as the adjusting section 140 in the process of step S1. However, since the user device 20 has a CPU with lower processing power than the server device 10, the processing device 11B determines that the processing power of the identification unit 120 is low in step S1, and the first control indicating level 3 Information SDa and second control information SDb indicating level 1 are generated.

ステップＳ２からステップＳ５の処理においてキーワード生成部１１０Ｂとして機能する。より詳細には、処理装置１１Ｂは、ステップＳ２の処理において決定部１１１として機能し、ステップＳ３の処理において特徴量生成部１１２として機能し、ステップＳ４Ｂの処理において抽出部１１３Ｂとして機能し、ステップＳ５の処理において変換部１１４として機能する。本動作例では、ステップＳ１にて生成される第１制御情報ＳＤａはレベル３を示すため、ステップＳ２における絞り込みの程度は高くなる。また、ステップＳ４Ｂの処理では、処理装置１１Ｂは、ステップＳ３にて読み出した特徴量に基づいて、画像信号Ｓａの示す動画の複数のフレームの各々から物体の画像を抽出する。１フレームの画像には複数の物体の画像が存在するのが通常であるため、処理装置１１Ｂは、ステップＳ４Ｂの処理において複数の物体の画像を抽出する。 It functions as the keyword generator 110B in the processing from step S2 to step S5. More specifically, the processing device 11B functions as the determination unit 111 in the process of step S2, functions as the feature amount generation unit 112 in the process of step S3, functions as the extraction unit 113B in the process of step S4B, and functions as the extraction unit 113B in the process of step S5. It functions as the conversion unit 114 in the processing of . In this operation example, the first control information SDa generated in step S1 indicates level 3, so the degree of narrowing down in step S2 is high. Further, in the process of step S4B, the processing device 11B extracts an image of the object from each of the plurality of frames of the moving image indicated by the image signal Sa, based on the feature amount read out in step S3. Since images of a plurality of objects usually exist in one frame image, the processing device 11B extracts images of a plurality of objects in the process of step S4B.

さらに、処理装置１１Ｂは、ステップＳ６の処理において特定部１２０として機能し、ステップＳ７の処理おいてコメント生成部１３０として機能する。処理装置１１Ｂは、ステップＳ７の処理により生成したコメントを、表示制御部１５を用いて、表示装置又はテレビジョン受像機３０に表示させる。本動作例では、ステップＳ１にて生成される第２制御情報ＳＤｂはレベル１を示すため、ステップＳ６における絞り込みの程度は低くなり、特定部１２０の処理負荷は低くなる。 Furthermore, the processing device 11B functions as the specifying unit 120 in the process of step S6, and functions as the comment generation unit 130 in the process of step S7. The processing device 11B causes the display device or the television receiver 30 to display the comment generated by the processing of step S7 using the display control unit 15 . In this operation example, the second control information SDb generated in step S1 indicates level 1, so the degree of narrowing down in step S6 is low, and the processing load on the identification unit 120 is low.

本実施形態のユーザ装置２０は、コメント生成機能を有する情報処理装置、すなわち本発明の情報処理装置、の一例である。ユーザ装置２０は、ユーザの行動履歴を示す行動情報と第１の絞り込みの程度とに基づいて、複数の物体の画像を動画から特定し、特定された複数の画像の各々についてコメントの対象の候補となる候補キーワードを生成するキーワード生成部１１０Ｂを備える。また、ユーザ装置２０は、行動情報と第２の絞り込みの程度とに基づいて、コメントの対象となる一又は複数の対象キーワードをキーワード生成部１１０Ｂによって生成された複数の候補キーワードから特定する特定部１２０を備える。さらに、ユーザ装置２０は、一又は複数の対象キーワードの各々について、当該対象キーワードに関連するコメントを生成するコメント生成部１３０と、キーワード生成部１１０Ｂにおいて複数の画像を絞り込む程度(第１の絞り込みの程度)と、特定部１２０において複数の候補キーワードを絞り込む程度(第２の絞り込みの程度)とを、キーワード生成部１１０Ｂ及び特定部１２０の処理に関する処理情報に応じて調整する調整部１４０と、を備える。 The user device 20 of this embodiment is an example of an information processing device having a comment generation function, that is, an information processing device of the present invention. The user device 20 identifies images of a plurality of objects from the moving image based on the behavior information indicating the behavior history of the user and the degree of the first narrowing down, and candidates for comments on each of the identified images. A keyword generation unit 110B that generates a candidate keyword that becomes Also, the user device 20 is a specifying unit that specifies one or a plurality of target keywords to be commented from a plurality of candidate keywords generated by the keyword generating unit 110B, based on the behavior information and the degree of the second narrowing down. 120. Furthermore, for each of one or a plurality of target keywords, the user device 20 includes the comment generation unit 130 that generates comments related to the target keyword, and the keyword generation unit 110B that narrows down the plurality of images (first narrowing down). degree) and the degree of narrowing down a plurality of candidate keywords in the identifying unit 120 (second degree of narrowing down) according to processing information about the processing of the keyword generating unit 110B and the identifying unit 120; Prepare.

この態様によれば、特定部１２０の処理能力に応じてキーワード生成部１１０Ｂにおける絞り込みの程度と特定部１２０における絞り込みの程度とを調整することができる。第２実施形態のユーザ装置２０では、特定部１２０の処理能力が低いため、キーワード生成部１１０Ｂにおける絞り込みの程度は高く、特定部１２０における絞り込みの程度は低く調整される。 According to this aspect, the degree of narrowing down in the keyword generation unit 110</b>B and the degree of narrowing down in the specifying unit 120 can be adjusted according to the processing capability of the specifying unit 120 . In the user device 20 of the second embodiment, since the processing capability of the specifying unit 120 is low, the degree of narrowing down in the keyword generating unit 110B is adjusted to be high, and the degree of narrowing down in the specifying unit 120 is adjusted to be low.

［３．第３実施形態］
図９は、第３実施形態のサービスシステム１に含まれるサーバ装置１０のハードウェア構成例を示す図である。サーバ装置１０は、処理装置１１Ｃ、記憶装置１２Ｃ、通信装置１４Ｃ（第２通信装置）、及びバス１９を具備する。処理装置１１Ｃ、記憶装置１２Ｃ、及び通信装置１４Ｃの各々は、第１実施形態における処理装置１１Ａ、記憶装置１２Ａ、及び通信装置１４Ａの各々に対応する。記憶装置１２Ｃは、制御プログラムＰＲａに代えて制御プログラムＰＲｃを記憶している点と、特徴量テーブルＴＢＬａを記憶していない点と、で記憶装置１２Ａと相違する。通信装置１４Ｃは、処理装置１１Ｃによる制御の下、画像信号Ｓａとコメントとをユーザ装置２０に送信する点では通信装置１４Ａと共通である。しかし、通信装置１４Ｃは、キーワード生成部１１０Ｂにおける処理に関する処理情報及び行動情報をユーザ装置２０から受信する点と、第１制御情報ＳＤａをユーザ装置２０へ送信する点と、で通信装置１４Ａと相違する。[3. Third Embodiment]
FIG. 9 is a diagram showing a hardware configuration example of the server device 10 included in the service system 1 of the third embodiment. The server device 10 comprises a processing device 11C, a storage device 12C, a communication device 14C (second communication device), and a bus 19 . Each of the processing device 11C, the storage device 12C, and the communication device 14C corresponds to each of the processing device 11A, the storage device 12A, and the communication device 14A in the first embodiment. The storage device 12C differs from the storage device 12A in that the control program PRc is stored in place of the control program PRa and the feature amount table TBLa is not stored. The communication device 14C is common to the communication device 14A in that it transmits the image signal Sa and the comment to the user device 20 under the control of the processing device 11C. However, the communication device 14C differs from the communication device 14A in that it receives from the user device 20 processing information and action information relating to the processing in the keyword generation unit 110B, and in that it transmits the first control information SDa to the user device 20. do.

図１０は、第３実施形態のサーバ装置１０の機能を示す機能ブロック図である。処理装置１１Ｃは記憶装置１２Ｃから制御プログラムＰＲｃを読み取り実行することによって、特定部１２０、コメント生成部１３０、及び調整部１４０として機能する。また、第３実施形態のサーバ装置１０は、ネットワークＮＷを介してユーザ装置２０から送信されてくる処理情報が調整部１４０に与えられる点と、調整部１４０により生成された第１制御情報ＳＤａがネットワークＮＷを介してユーザ装置２０へ送信される点と、で第１実施形態のサーバ装置１０と相違する。 FIG. 10 is a functional block diagram showing functions of the server device 10 of the third embodiment. The processing device 11C functions as a specifying unit 120, a comment generating unit 130, and an adjusting unit 140 by reading and executing the control program PRc from the storage device 12C. Further, in the server device 10 of the third embodiment, the processing information transmitted from the user device 20 via the network NW is given to the adjustment unit 140, and the first control information SDa generated by the adjustment unit 140 is It differs from the server device 10 of the first embodiment in that it is transmitted to the user device 20 via the network NW.

図１１は、第３実施形態のサービスシステム１に含まれるユーザ装置２０のハードウェア構成例を示す図である。ユーザ装置２０は、処理装置１１Ｄ、記憶装置１２Ｄ、通信装置（第１通信装置）１４Ｄ、表示制御部１５、及びバス１９を具備する。処理装置１１Ｄ、記憶装置１２Ｄ、及び通信装置１４Ｄの各々は、第２実施形態における処理装置１１Ｂ、記憶装置１２Ｂ、及び通信装置１４Ｂの各々に対応する。記憶装置１２Ｄは、制御プログラムＰＲｂに代えて制御プログラムＰＲｄを記憶している点と、コメントテーブルＴＢＬｂを記憶していない点と、で記憶装置１２Ｂと相違する。通信装置１４Ｄは、処理装置１１Ｄによる制御の下、画像信号Ｓａをサーバ装置１０から受信する点では通信装置１４Ｂと共通である。しかし、通信装置１４Ｄは、キーワード生成部１１０Ｂにおける処理に関する処理情報、行動情報及び複数の候補キーワードＫＷをサーバ装置１０へ送信する点と、第１制御情報ＳＤａとコメントとをサーバ装置１０から受信する点と、で通信装置１４Ｂと相違する。 FIG. 11 is a diagram showing a hardware configuration example of the user device 20 included in the service system 1 of the third embodiment. The user device 20 includes a processing device 11D, a storage device 12D, a communication device (first communication device) 14D, a display control section 15, and a bus 19. FIG. Each of the processing device 11D, the storage device 12D, and the communication device 14D corresponds to each of the processing device 11B, the storage device 12B, and the communication device 14B in the second embodiment. The storage device 12D differs from the storage device 12B in that it stores the control program PRd instead of the control program PRb and that it does not store the comment table TBLb. The communication device 14D is common to the communication device 14B in that it receives the image signal Sa from the server device 10 under the control of the processing device 11D. However, the communication device 14D transmits to the server device 10 processing information, action information, and a plurality of candidate keywords KW relating to the processing in the keyword generation unit 110B, and receives first control information SDa and comments from the server device 10. It differs from the communication device 14B in one point.

図１２は、第３実施形態のユーザ装置２０の機能を示す機能ブロック図である。処理装置１１Ｄは記憶装置１２Ｄから制御プログラムＰＲｄを読み取り実行することによって、キーワード生成部１１０Ｂとして機能する。また、第３実施形態のユーザ装置２０は、ネットワークＮＷを介してサーバ装置１０に処理情報を送信する点と、ネットワークを介してサーバ装置１０から送信されてくる第１制御情報ＳＤａがキーワード生成部１１０Ｂに与えられる点と、で第２実施形態のユーザ装置２０と相違する。 FIG. 12 is a functional block diagram showing functions of the user device 20 of the third embodiment. The processing device 11D functions as a keyword generator 110B by reading and executing the control program PRd from the storage device 12D. Further, the user device 20 of the third embodiment transmits processing information to the server device 10 via the network NW, and the first control information SDa transmitted from the server device 10 via the network is a keyword generator. 110B is different from the user device 20 of the second embodiment.

次に、第３実施形態のサーバ装置１０及びユーザ装置２０の動作について説明する。図１３は、第３実施形態のサーバ装置１０及びユーザ装置２０の各々の動作を示すフローチャートである。第３実施形態のユーザ装置２０において処理装置１１Ｄは、図７に示すステップＳ１＿１、Ｓ２、Ｓ３、Ｓ４Ｂ、及びＳ５の各処理をこの順に実行する。一方、第３実施形態のサーバ装置１０において処理装置１１Ｃは、図１２に示すステップＳ１、Ｓ６、及びＳ７の各処理をこの順に実行する。 Next, operations of the server device 10 and the user device 20 of the third embodiment will be described. FIG. 13 is a flow chart showing operations of the server device 10 and the user device 20 of the third embodiment. In the user device 20 of the third embodiment, the processing device 11D executes steps S1_1, S2, S3, S4B, and S5 shown in FIG. 7 in this order. On the other hand, in the server device 10 of the third embodiment, the processing device 11C executes steps S1, S6, and S7 shown in FIG. 12 in this order.

ステップＳ１＿１の処理では、処理装置１１Ｄは、キーワード生成部１１０Ｂの処理に関する処理情報と記憶装置１２Ｄに記憶されている行動情報とを、ネットワークＮＷを介してサーバ装置１０へ送信する。 In the process of step S1_1, the processing device 11D transmits the processing information regarding the processing of the keyword generating section 110B and the action information stored in the storage device 12D to the server device 10 via the network NW.

サーバ装置１０では、行動情報及び処理情報の受信を契機としてステップＳ１の処理が実行される。ステップＳ１の処理において、処理装置１１Ｃは、調整部１４０として機能する。ユーザ装置２０から受信した処理情報はキーワード生成部１１０Ｂの処理能力を示す。前述したように、ユーザ装置２０の処理能力はサーバ装置１０の処理能力に比較して低く、ユーザ装置２０においてキーワード生成部１１０Ｂに割り当て可能なリソースもサーバ装置１０において特定部１２０の機能に割り当て可能なリソースよりも少ない。ステップＳ１において処理装置１１Ｄは、特定部１２０の処理能力が高いと判定し、レベル１を示す第１制御情報ＳＤａと、レベル３を示す第２制御情報ＳＤｂと、を生成する。ステップＳ１の処理により生成された第１制御情報ＳＤａはネットワークＮＷを介してユーザ装置２０へ送信される。 In the server device 10, the processing of step S1 is executed with the reception of the action information and the processing information as a trigger. In the process of step S1, the processing device 11C functions as the adjusting section 140. FIG. The processing information received from the user device 20 indicates the processing capability of the keyword generator 110B. As described above, the processing capability of the user device 20 is lower than the processing capability of the server device 10, and resources that can be allocated to the keyword generation unit 110B in the user device 20 can also be allocated to the function of the identification unit 120 in the server device 10. resources. In step S1, the processing device 11D determines that the processing capability of the identifying unit 120 is high, and generates first control information SDa indicating level 1 and second control information SDb indicating level 3. FIG. The first control information SDa generated by the process of step S1 is transmitted to the user device 20 via the network NW.

ユーザ装置２０では、サーバ装置１０から送信されてくる第１制御情報ＳＤａの受信を契機としてステップＳ２以降の処理が実行される。ステップＳ２からステップＳ５の処理において、処理装置１１Ｄは、キーワード生成部１１０Ｂとして機能する。より詳細には、処理装置１１Ｄは、ステップＳ２の処理において決定部１１１として機能し、ステップＳ３の処理において特徴量生成部１１２として機能し、ステップＳ４Ｂの処理において抽出部１１３Ｂとして機能し、ステップＳ５の処理において変換部１１４として機能する。ステップＳ５の処理により生成された候補キーワードＫＷは、ネットワークＮＷを介してサーバ装置１０へ送信される。本動作例においてサーバ装置１０からユーザ装置２０へ送信される第１制御情報ＳＤａはレベル１を示すため、ステップＳ２における絞り込みの程度は低く、キーワード生成部１１０Ｂの処理負荷は低くなる。 In the user device 20, the processing from step S2 onward is executed with the reception of the first control information SDa transmitted from the server device 10 as a trigger. In the processing from step S2 to step S5, the processing device 11D functions as the keyword generation section 110B. More specifically, the processing device 11D functions as the determination unit 111 in the processing of step S2, functions as the feature amount generation unit 112 in the processing of step S3, functions as the extraction unit 113B in the processing of step S4B, and functions as the extraction unit 113B in the processing of step S4B. It functions as the conversion unit 114 in the processing of . The candidate keyword KW generated by the process of step S5 is transmitted to the server device 10 via the network NW. In this operation example, since the first control information SDa transmitted from the server device 10 to the user device 20 indicates level 1, the degree of narrowing down in step S2 is low, and the processing load on the keyword generation unit 110B is low.

サーバ装置１０では、ユーザ装置２０から送信されてくる候補キーワードＫＷの受信を契機としてステップＳ６以降の処理が実行される。処理装置１１Ｃは、ステップＳ６の処理において特定部１２０として機能し、ステップＳ７の処理おいてコメント生成部１３０として機能する。ステップＳ７の処理により生成されたコメントは電子メール等でサーバ装置１０からユーザ装置２０へ送信される。ユーザ装置２０は、電子メール等によりコメントを受信すると、当該コメントの画像を表示装置又はテレビジョン受像機に表示させる。本動作例では、ステップＳ１においてレベル３を示す第２制御情報ＳＤｂが生成されるため、ステップＳ６における絞り込みの程度は高く、特定部１２０の処理負荷は高くなる。 In the server device 10, when the candidate keyword KW transmitted from the user device 20 is received, the process from step S6 onwards is executed. 11 C of processing apparatuses function as the specific part 120 in the process of step S6, and function as the comment production|generation part 130 in the process of step S7. The comment generated by the process of step S7 is transmitted from the server device 10 to the user device 20 by e-mail or the like. When receiving a comment by e-mail or the like, the user device 20 displays an image of the comment on a display device or a television receiver. In this operation example, since the second control information SDb indicating level 3 is generated in step S1, the degree of narrowing down in step S6 is high, and the processing load on the identification unit 120 is high.

本実施形態のサービスシステム１は、コメント生成機能を有する情報処理システム、すなわち本発明の情報処理システム、の一例であり、ユーザが管理するユーザ装置２０と、サーバ装置１０とを備える。ユーザ装置２０は、ユーザの行動履歴を示す行動情報と第１の絞り込みの程度とに基づいて、複数の物体の画像を動画から特定し、特定された複数の画像の各々についてコメントの対象の候補となる候補キーワードを生成するキーワード生成部１１０Ｂと、行動情報、キーワード生成部１１０Ｂによって生成された複数の候補キーワード、及びキーワード生成部１１０Ｂの処理に関する処理情報をサーバ装置１０へ送信し、サーバ装置１０から送信されるコメントを受信する通信装置１４Ｄ（第１通信装置）と、コメントを表示装置に表示させる表示制御部１５と、を備える。サーバ装置１０は、ユーザ装置２０から送信される行動情報、キーワード生成部１１０Ｂによって生成された複数の候補キーワード及びキーワード生成部１１０Ｂの処理に関する処理情報を受信し、コメントをユーザ装置２０へ送信する通信装置１４Ｃ（第２通信装置）と、行動情報と第２の絞り込みの程度とに基づいて、コメントの対象となる一又は複数の対象キーワードを複数の候補キーワードから特定する特定部１２０と、一又は複数の対象キーワードの各々について、関連するコメントを生成するコメント生成部１３０と、キーワード生成部１１０Ｂにおいて複数の画像を絞り込む程度(第１の絞り込みの程度)と、特定部１２０において複数の候補キーワードを絞り込む程度(第２の絞り込みの程度)と、をキーワード生成部１１０Ｂの処理に関する処理情報及び特定部１２０の処理に関する処理情報に応じて調整する調整部１４０とを備える。 The service system 1 of this embodiment is an example of an information processing system having a comment generation function, that is, an information processing system of the present invention, and includes a user device 20 managed by a user and a server device 10 . The user device 20 identifies images of a plurality of objects from the moving image based on the behavior information indicating the behavior history of the user and the degree of the first narrowing down, and candidates for comments on each of the identified images. A keyword generation unit 110B that generates a candidate keyword that becomes, action information, a plurality of candidate keywords generated by the keyword generation unit 110B, and processing information related to the processing of the keyword generation unit 110B are transmitted to the server device 10, and the server device 10 and a communication device 14D (first communication device) that receives comments transmitted from and a display control unit 15 that displays the comments on the display device. The server device 10 receives action information transmitted from the user device 20, a plurality of candidate keywords generated by the keyword generation unit 110B, and processing information related to the processing of the keyword generation unit 110B, and transmits comments to the user device 20. a device 14C (second communication device); a specifying unit 120 that specifies one or more target keywords to be commented from a plurality of candidate keywords based on the behavior information and the degree of the second narrowing down; For each of a plurality of target keywords, a comment generating unit 130 that generates related comments, a degree of narrowing down a plurality of images in the keyword generating unit 110B (a first degree of narrowing down), and a plurality of candidate keywords in the specifying unit 120 are selected. The adjusting unit 140 adjusts the degree of narrowing down (second degree of narrowing down) according to the processing information regarding the processing of the keyword generating unit 110B and the processing information regarding the processing of the identifying unit 120. FIG.

この態様によれば、ユーザ装置２０が有するキーワード生成部１１０Ｂの処理能力とサーバ装置１０が有する特定部１２０の処理能力とに応じてキーワード生成部１１０Ｂにおける絞り込みの程度と特定部１２０における絞り込みの程度とを調整することができる。第３実施形態のサービスシステム１では、キーワード生成部１１０Ｂの処理能力は特定部１２０の処理能力よりも低いため、キーワード生成部１１０Ｂにおける絞り込みの程度は低く、特定部１２０における絞り込みの程度は高く調整され、コメントの的確性を向上させることができる。 According to this aspect, the degree of narrowing down in the keyword generating unit 110B and the degree of narrowing down in the identifying unit 120 are according to the processing ability of the keyword generating unit 110B of the user device 20 and the processing ability of the identifying unit 120 of the server device 10. and can be adjusted. In the service system 1 of the third embodiment, the keyword generation unit 110B has a lower processing capacity than the identification unit 120, so the degree of narrowing down in the keyword generation unit 110B is low, and the degree of narrowing down in the identification unit 120 is adjusted to be high. and can improve the relevance of comments.

[４．変形例]
本発明は、以上に例示した各実施形態に限定されない。具体的な変形の態様を以下に例示する。以下の例示から任意に選択された２以上の態様を併合してもよい。[4. Modification]
The present invention is not limited to the embodiments exemplified above. Specific modification modes are exemplified below. Two or more aspects arbitrarily selected from the following examples may be combined.

（１）上述した第１実施形態において抽出部１１３ＡがサービスデータＤＳから物体の画像を抽出するフレームは以下のフレームであってもよい。
第１に、抽出部１１３Ａは、画像内の複数のフレームのうち、視聴率の高いフレームで物体の画像を抽出してもよい。この場合、抽出部１１３Ａは、視聴率を外部装置からリアルタイムで取得すればよい。具体的には、抽出部１１３Ａは、取得した視聴率が所定の視聴率を超えたフレームで物体の画像の抽出を実行する。視聴率が高いフレームは、他のフレームと比較してユーザＵの関心が他の高いと推定される。従って、ユーザＵの関心が高いフレームの画像から物体の画像が抽出されるので、ユーザＵに有益なコメントを生成できる。
第２に、抽出部１１３Ａは、ユーザＵの音声信号をユーザ装置２０から受信し、音声信号に基づいて、ユーザＵの反応が良いフレームからオブジェクト画像を抽出してもよい。例えば、ユーザＵが歓声をあげたフレームからオブジェクト画像を抽出してもよい。
第３に、抽出部１１３Ａは、番組情報に基づいて番組の主題となるフレームでオブジェクト画像を抽出してもよい。例えば、抽出部１１３Ａは、サービスデータＤＳを解析し、番組の主題となるフレームを特定してもよい。この場合、抽出部１１３Ａは、ネットワークＮＷを介して外部装置から番組情報を取得すればよい。
同様に第２実施形態及び第３実施形態において抽出部１１３Ｂが画像信号Ｓａから物体の画像を抽出する対象とするフレームも、視聴率の高いフレーム、ユーザＵの反応が良いフレーム、又は番組の主題となるフレームであってもよい。(1) In the above-described first embodiment, the frames from which the extraction unit 113A extracts the image of the object from the service data DS may be the following frames.
First, the extraction unit 113A may extract the image of the object in a frame with a high audience rating among the plurality of frames in the image. In this case, the extraction unit 113A may acquire the audience rating from the external device in real time. Specifically, the extraction unit 113A extracts an image of an object from a frame in which the obtained audience rating exceeds a predetermined audience rating. A frame with a high audience rating is presumed to be of high interest to the user U compared to other frames. Therefore, since the image of the object is extracted from the image of the frame in which the user U is highly interested, a comment beneficial to the user U can be generated.
Secondly, the extraction unit 113A may receive the audio signal of the user U from the user device 20, and extract the object image from the frame to which the user U responds well based on the audio signal. For example, an object image may be extracted from a frame in which the user U cheers.
Thirdly, the extraction unit 113A may extract an object image in a frame that is the theme of the program based on the program information. For example, the extraction unit 113A may analyze the service data DS and identify frames that are the subject of the program. In this case, the extraction unit 113A may acquire program information from the external device via the network NW.
Similarly, in the second and third embodiments, the extraction unit 113B extracts the image of the object from the image signal Sa. It may be a frame that becomes

（２）調整部１４０は、処理能力の替わりに又は処理能力に加えて動画の品質に応じて、決定部１１１における絞り込みの程度（第１の絞り込みの程度）と、特定部１２０における絞り込みの程度（第２の絞り込みの程度）とを調整してもよい。動画の品質は処理情報に含まれる。動画の品質の具体例としては、動画のフレームレート、又は動画の各フレームにおける解像度が挙げられる。処理情報が動画の品質を含む場合、調整部１４０は、動画の品質が高い場合、動画の品質が低い場合と比較して、キーワード生成部において複数の画像を絞り込む程度を小さくさせ、特定部において複数の候補キーワードを絞り込む程度を大きくさせる。動画の品質が高い場合には、キーワード生成部の処理負荷が、動画の品質が低い場合と比較して、高くなるからである。 (2) The adjustment unit 140 adjusts the degree of narrowing down (first degree of narrowing down) in the determining unit 111 and the degree of narrowing down in the specifying unit 120 according to the quality of the moving image instead of or in addition to the processing power. (degree of second narrowing down) may be adjusted. Video quality is included in the processing information. A specific example of the quality of the moving image is the frame rate of the moving image or the resolution of each frame of the moving image. When the processing information includes the quality of the moving image, the adjustment unit 140 reduces the extent to which the plurality of images are narrowed down in the keyword generation unit compared to when the quality of the moving image is high and when the quality of the moving image is low. Increase the degree of narrowing down multiple candidate keywords. This is because the processing load on the keyword generation unit is higher when the quality of the moving image is high than when the quality of the moving image is low.

（３）上記１実施形態では、行動情報に基づいて特定された物体の画像から変換されるキーワードをそのまま候補キーワードとした。しかし、キーワード生成部１１０Ａにおける絞り込みでは、行動情報に基づいて特定された物体の画像から変換されるキーワードの上位概念を当該キーワードに替えて又は当該キーワードに加えて候補キーワードとし、特定部１２０における絞り込みで下位概念に絞り込んでもよい。例えば、行動情報に基づいて特定された物体の画像から変換されるキーワードが「ＳＵＶ」（Sport Utility Vehicle）であった場合、キーワード生成部１１０Ａにおける絞り込みでは「ＳＵＶ」の上位概念である「車」を、「ＳＵＶ」と共に又は「ＳＵＶ」に代えて候補キーワードとし、特定部１２０における絞り込みで対象キーワードを「ＳＵＶ」に絞り込めばよい。また、行動情報に基づいて特定された物体の画像から変換されるキーワードが「薔薇」であった場合、キーワード生成部１１０Ａにおける絞り込みでは「薔薇」の上位概念である「花」を、「薔薇」と共に又は「薔薇」に代えて候補キーワードとし、特定部１２０における絞り込みで対象キーワードを「薔薇」に絞り込めばよい。なお、上位概念化の程度については、キーワード生成部１１０Ａの処理能力に応じて調整すればよい。また、キーワード生成部１１０Ｂにおける絞り込みについても同様に上位概念化が採用されてもよい。 (3) In the above embodiment, the keyword converted from the image of the object specified based on the action information is directly used as the candidate keyword. However, in the narrowing down by the keyword generating unit 110A, the broader concept of the keyword converted from the image of the object identified based on the action information is used as a candidate keyword instead of or in addition to the keyword, and the narrowing down in the identifying unit 120 is performed. can be narrowed down to subordinate concepts. For example, if the keyword converted from the image of the object specified based on the action information is "SUV" (Sport Utility Vehicle), the narrowing down by the keyword generation unit 110A will be "car" which is a superordinate concept of "SUV". is used as a candidate keyword together with or instead of "SUV", and the target keyword is narrowed down to "SUV" by narrowing down in the specifying unit 120. FIG. Further, when the keyword converted from the image of the object specified based on the action information is "rose", the keyword generation unit 110A narrows down "flower", which is a superordinate concept of "rose", to "rose". The target keyword may be narrowed down to "rose" by using a candidate keyword together with or instead of "rose" and narrowing down in the specifying unit 120. FIG. It should be noted that the degree of hyper-conceptualization may be adjusted according to the processing capability of the keyword generating section 110A. In addition, for the narrowing down in the keyword generation unit 110B, the hypernymization may be similarly adopted.

（４）上記第１実施形態及び第３実施形態では、コメントは電子メール等によりサーバ装置１０からユーザ装置２０へ送信された。しかし、コメントがオーバレイされた画像信号Ｓａがサーバ装置１０からユーザ装置２０へ送信されてもよい。また、コメントの表示はリアルタイムでの表示には限定されず、動画の視聴が終了した後に行われてもよい。 (4) In the first and third embodiments, comments are sent from the server device 10 to the user device 20 by e-mail or the like. However, the image signal Sa overlaid with the comment may be transmitted from the server device 10 to the user device 20 . Moreover, the display of the comment is not limited to real-time display, and may be performed after the viewing of the moving image is finished.

（５）本発明の情報処理装置の一例として、上記第１実施形態ではサーバ装置１０が挙げられ、上記第２実施形態ではユーザ装置２０が挙げられていた。しかし、本発明の情報処理装置は、ユーザの行動履歴を示す行動情報と第１の絞り込みの程度とに基づいて、複数の物体の画像を動画から特定し、特定された複数の画像の各々についてコメントの対象の候補となる候補キーワードを生成するキーワード生成部と、行動情報と第２の絞り込みの程度とに基づいて、コメントの対象となる一又は複数の対象キーワードをキーワード生成部によって生成された複数の候補キーワードから特定する特定部と、一又は複数の対象キーワードの各々について、当該対象キーワードに関連するコメントを生成するコメント生成部と、キーワード生成部において複数の画像を絞り込む程度(第１の絞り込みの程度)と、特定部において複数の候補キーワードを絞り込む程度(第２の絞り込みの程度)とを、前記キーワード生成部及び前記特定部の処理に関する処理情報に応じて調整する調整部とを備えていればよく、サーバ装置又はユーザ装置には限定されない。 (5) As an example of the information processing apparatus of the present invention, the server device 10 was mentioned in the first embodiment, and the user device 20 was mentioned in the second embodiment. However, the information processing apparatus of the present invention identifies images of a plurality of objects from a moving image based on behavior information indicating a user's behavior history and the degree of first narrowing down, and for each of the identified plurality of images, A keyword generation unit that generates candidate keywords that are candidates for comments, and one or more target keywords that are targets for comments are generated by the keyword generation unit based on the behavior information and the degree of second narrowing down. A specifying unit that specifies from a plurality of candidate keywords, a comment generating unit that generates comments related to each of one or more target keywords, and a degree of narrowing down a plurality of images in the keyword generating unit (first degree of narrowing down) and degree of narrowing down the plurality of candidate keywords in the specifying unit (second degree of narrowing down) according to processing information relating to the processing of the keyword generating unit and the specifying unit. It is not limited to the server device or the user device.

例えば、動画配信サーバから送信される画像信号Ｓａをユーザ装置へ中継する中継装置（スイッチングハブ、ルータ又はゲートウェイ等）が、キーワード生成部、特定部、コメント生成部及び調整部を備えていてもよい。この中継装置によれば、動画配信サーバから送信された画像信号Ｓａに、コメント生成部により生成したコメントの画像をオーバレイしてユーザ装置へ転送することができる。また、この中継装置によれば、特定部の処理能力に応じてキーワード生成部における絞り込みの程度と特定部における絞り込みの程度とを調整することができる。 For example, a relay device (switching hub, router, gateway, etc.) that relays the image signal Sa transmitted from the video distribution server to the user device may include a keyword generation unit, a specification unit, a comment generation unit, and an adjustment unit. . According to this relay device, the image signal Sa transmitted from the moving image distribution server can be overlaid with the image of the comment generated by the comment generation unit and transferred to the user device. Further, according to this relay device, it is possible to adjust the degree of narrowing down in the keyword generating unit and the degree of narrowing down in the identifying unit according to the processing capability of the identifying unit.

また、コメント生成機能を有するサーバ装置、すなわち本発明のサーバ装置は、上記キーワード生成部、特定部、コメント生成部及び調整部を有する情報処理装置と、ユーザが管理するユーザ装置から送信される行動情報を受信し、コメントをユーザ装置へ送信する通信装置と、を備えていればよい。同様に、コメント生成機能を有するユーザ装置、すなわち本発明のユーザ装置は、上記キーワード生成部、特定部、コメント生成部及び調整部を有する情報処理装置と、コメントを表示装置に表示させる表示制御部と、を備えていればよい。 A server device having a comment generation function, that is, the server device of the present invention includes an information processing device having the keyword generation unit, the specification unit, the comment generation unit, and the adjustment unit, and an action transmitted from the user device managed by the user. a communication device for receiving information and transmitting comments to the user device. Similarly, a user device having a comment generation function, that is, the user device of the present invention includes an information processing device having the keyword generation unit, the specification unit, the comment generation unit, and the adjustment unit, and a display control unit for displaying the comment on the display device. and .

（６）上述した各実施形態の説明に用いたブロック図は、機能単位のブロックを示している。これらの機能ブロック（構成部）は、ハードウェア及び／又はソフトウェアの任意の組み合わせによって実現される。また、各機能ブロックの実現手段は特に限定されない。すなわち、各機能ブロックは、物理的及び／又は論理的に結合した１つの装置により実現されてもよいし、物理的及び／又は論理的に分離した２つ以上の装置を直接的及び／又は間接的に(例えば、有線及び／又は無線)で接続し、これら複数の装置により実現されてもよい。例えば、決定部１１１の機能はネットワークＮＷを介して接続される他のサーバ装置から提供されてもよい。同様に、特徴量生成部１１２の機能もネットワークＮＷを介して接続される他のサーバ装置から提供されてもよく、特徴量テーブルＴＢＬａも他のサーバ装置に設けられてもよい。
また、上述した各実施形態の説明に用いた「装置」という文言は、回路、デバイス又はユニット等の他の用語に読替えてもよい。(6) The block diagrams used to describe each of the above-described embodiments show blocks for each function. These functional blocks (components) are implemented by any combination of hardware and/or software. Further, means for realizing each functional block is not particularly limited. That is, each functional block may be implemented by one device physically and/or logically coupled, or may be implemented by two or more physically and/or logically separated devices directly and/or indirectly. These multiple devices may be connected together (eg, wired and/or wirelessly). For example, the function of the determination unit 111 may be provided by another server device connected via the network NW. Similarly, the function of the feature quantity generation unit 112 may also be provided by another server device connected via the network NW, and the feature quantity table TBLa may also be provided in another server device.
Also, the term "apparatus" used in the description of each of the above-described embodiments may be replaced with other terms such as circuit, device, or unit.

（７）上述した各実施形態における処理手順、シーケンス、フローチャート等は、矛盾の無い限り、順序を入れ替えてもよい。例えば、本明細書で説明した方法については、例示的な順序で様々なステップの要素を提示しており、提示した特定の順序に限定されない。 (7) As long as there is no contradiction, the order of the processing procedures, sequences, flowcharts, etc. in each of the above-described embodiments may be changed. For example, the methods described herein present elements of the various steps in a sample order, and are not limited to the specific order presented.

（８）上述した各実施形態において、入出力された情報等は特定の場所(例えば、メモリ)に保存されてもよいし、管理テーブルで管理してもよい。入出力される情報等は、上書き、更新、又は追記され得る。出力された情報等は削除されてもよい。入力された情報等は他の装置へ送信されてもよい。 (8) In each of the above-described embodiments, input/output information and the like may be stored in a specific location (for example, memory) or managed in a management table. Input/output information and the like can be overwritten, updated, or appended. The output information and the like may be deleted. The entered information and the like may be transmitted to another device.

（９）上述した各実施形態において、判定は、１ビットで表される値（０か１か）によって行われてもよいし、真偽値（Boolean：true又はfalse）によって行われてもよいし、数値の比較（例えば、所定の値との比較）によって行われてもよい。 (9) In each of the above-described embodiments, the determination may be made by a value represented by 1 bit (0 or 1), or by a true/false value (Boolean: true or false). and may be performed by numerical comparison (eg, comparison with a predetermined value).

（１０）上述した第１実施形態における記憶装置１２Ａは、処理装置１１Ａが読取可能な記録媒体であり、ＲＯＭ及びＲＡＭ等を例示したが、フレキシブルディスク、光磁気ディスク(例えば、コンパクトディスク、デジタル多用途ディスク、Ｂｌｕ－ｒａｙ（登録商標）ディスク)、スマートカード、フラッシュメモリデバイス(例えば、カード、スティック、キードライブ)、ＣＤ－ＲＯＭ（Compact Disc－ＲＯＭ）、レジスタ、リムーバブルディスク、ハードディスク、フロッピー（登録商標）ディスク、磁気ストリップ、データベース、サーバその他の適切な記憶媒体である。第２実施形態における記憶装置１２Ｂ，第３実施形態における記憶装置１２Ｃ及び記憶装置１２Ｄも、記憶装置１２Ａと同様である。また、プログラムは、ネットワークＮＷから送信されても良い。また、プログラムは、電気通信回線を介して通信網から送信されても良い。 (10) The storage device 12A in the first embodiment described above is a recording medium readable by the processing device 11A. Applications discs, Blu-ray (registered trademark) discs), smart cards, flash memory devices (e.g. cards, sticks, key drives), CD-ROMs (Compact Disc-ROMs), registers, removable discs, hard disks, floppies (registered (trademark) disks, magnetic strips, databases, servers, or other suitable storage media. The storage device 12B in the second embodiment, and the storage devices 12C and 12D in the third embodiment are similar to the storage device 12A. Also, the program may be transmitted from the network NW. Also, the program may be transmitted from a communication network via an electric communication line.

（１１）上述した各実施形態は、ＬＴＥ（Long Term Evolution）、ＬＴＥ－Ａ（LTE-Advanced）、ＳＵＰＥＲ３Ｇ、ＩＭＴ－Ａｄｖａｎｃｅｄ、４Ｇ、５Ｇ、ＦＲＡ（Future Radio Access）、Ｗ－ＣＤＭＡ（登録商標）、ＧＳＭ（登録商標）、ＣＤＭＡ２０００、ＵＭＢ（Ultra Mobile Broadband）、ＩＥＥＥ８０２．１１（Ｗｉ－Ｆｉ）、ＩＥＥＥ８０２．１６（ＷｉＭＡＸ）、ＩＥＥＥ８０２．２０、ＵＷＢ（Ultra-WideBand）、Ｂｌｕｅｔｏｏｔｈ（登録商標）、その他の適切なシステムを利用するシステム及び／又はこれらに基づいて拡張された次世代システムに適用されてもよい。 (11) Each of the above-described embodiments is LTE (Long Term Evolution), LTE-A (LTE-Advanced), SUPER 3G, IMT-Advanced, 4G, 5G, FRA (Future Radio Access), W-CDMA (registered trademark) ), GSM (registered trademark), CDMA2000, UMB (Ultra Mobile Broadband), IEEE 802.11 (Wi-Fi), IEEE 802.16 (WiMAX), IEEE 802.20, UWB (Ultra-WideBand), Bluetooth (registered Trademarks), other suitable systems, and/or future generation systems enhanced based on these.

（１２）上述した各実施形態において、説明した情報及び信号等は、様々な異なる技術のいずれかを使用して表されてもよい。例えば、上述の説明全体に渡って言及され得るデータ、命令、コマンド、情報、信号、ビット、シンボル、チップ等は、電圧、電流、電磁波、磁界若しくは磁性粒子、光場若しくは光子、又はこれらの任意の組み合わせによって表されてもよい。
なお、本明細書で説明した用語及び／又は本明細書の理解に必要な用語については、同一の又は類似する意味を有する用語と置き換えてもよい。(12) In each of the embodiments described above, the information, signals, etc. described may be represented using any of a variety of different technologies. For example, data, instructions, commands, information, signals, bits, symbols, chips, etc. that may be referred to throughout the above description may refer to voltages, currents, electromagnetic waves, magnetic fields or magnetic particles, light fields or photons, or any of these. may be represented by a combination of
The terms explained in this specification and/or terms necessary for understanding this specification may be replaced with terms having the same or similar meanings.

（１３）図３、図７、図１０、及び図１２に例示された各機能は、ハードウェア及びソフトウェアの任意の組合せによって実現される。また、各機能は、単体の装置によって実現されてもよいし、相互に別体で構成された２個以上の装置によって実現されてもよい。 (13) Each function illustrated in FIGS. 3, 7, 10, and 12 is implemented by any combination of hardware and software. Also, each function may be implemented by a single device, or may be implemented by two or more devices configured separately from each other.

（１４）上述した各実施形態で例示したプログラムは、ソフトウェア、ファームウェア、ミドルウェア、マイクロコード又はハードウェア記述言語と呼ばれるか、他の名称によって呼ばれるかを問わず、命令、命令セット、コード、コードセグメント、プログラムコード、サブプログラム、ソフトウェアモジュール、アプリケーション、ソフトウェアアプリケーション、ソフトウェアパッケージ、ルーチン、サブルーチン、オブジェクト、実行可能ファイル、実行スレッド、手順又は機能等を意味するよう広く解釈されるべきである。
また、ソフトウェア、命令等は、伝送媒体を介して送受信されてもよい。例えば、ソフトウェアが、同軸ケーブル、光ファイバケーブル、ツイストペア及びデジタル加入者回線（ＤＳＬ）等の有線技術及び／又は赤外線、無線及びマイクロ波等の無線技術を使用してウェブサイト、サーバ、又は他のリモートソースから送信される場合、これらの有線技術及び／又は無線技術は、伝送媒体の定義内に含まれる。(14) The programs exemplified in each of the above embodiments, whether referred to as software, firmware, middleware, microcode, hardware description language, or by any other name, may include instructions, instruction sets, code, code segments. , program code, subprograms, software modules, applications, software applications, software packages, routines, subroutines, objects, executable files, threads of execution, procedures or functions, or the like.
Software, instructions, etc. may also be sent and received over a transmission medium. For example, the software may use wired technologies such as coaxial cable, fiber optic cable, twisted pair and Digital Subscriber Line (DSL) and/or wireless technologies such as infrared, radio and microwave to create websites, servers, or other When transmitted from a remote source, these wired and/or wireless technologies are included within the definition of transmission media.

（１５）上述した各実施形態において、「システム」及び「ネットワーク」という用語は、互換的に使用される。 (15) In each of the embodiments described above, the terms "system" and "network" are used interchangeably.

（１６）上述した各実施形態において、情報、パラメータ等は、絶対値で表されてもよいし、所定の値からの相対値で表されてもよいし、対応する別の情報で表されてもよい。 (16) In each of the above-described embodiments, information, parameters, etc. may be represented by absolute values, may be represented by relative values from a predetermined value, or may be represented by corresponding separate information. good too.

（１７）上述した各実施形態において、ユーザ装置２０は、移動局である場合が含まれる。移動局は、当業者によって、加入者局、モバイルユニット、加入者ユニット、ワイヤレスユニット、リモートユニット、モバイルデバイス、ワイヤレスデバイス、ワイヤレス通信デバイス、リモートデバイス、モバイル加入者局、アクセス端末、モバイル端末、ワイヤレス端末、リモート端末、ハンドセット、ユーザエージェント、モバイルクライアント、クライアント、又はいくつかの他の適切な用語で呼ばれる場合もある。 (17) In each of the embodiments described above, the user equipment 20 may be a mobile station. A mobile station is defined by those skilled in the art as a subscriber station, mobile unit, subscriber unit, wireless unit, remote unit, mobile device, wireless device, wireless communication device, remote device, mobile subscriber station, access terminal, mobile terminal, wireless It may also be called a terminal, remote terminal, handset, user agent, mobile client, client, or some other suitable term.

（１８）上述した各実施形態において、「接続された(connected)」という用語、又はこれらのあらゆる変形は、２又はそれ以上の要素間の直接的又は間接的なあらゆる接続又は結合を意味し、互いに「接続」された２つの要素間に１又はそれ以上の中間要素が存在することを含むことができる。要素間の接続は、物理的なものであっても、論理的なものであっても、或いはこれらの組み合わせであってもよい。本明細書で使用する場合、２つの要素は、１又はそれ以上の電線、ケーブル及び／又はプリント電気接続を使用することにより、並びにいくつかの非限定的かつ非包括的な例として、無線周波数領域、マイクロ波領域及び光（可視及び不可視の両方）領域の波長を有する電磁エネルギー等の電磁エネルギーを使用することにより、互いに「接続」されると考えることができる。 (18) in each of the above embodiments, the term "connected" or any variation thereof means any direct or indirect connection or coupling between two or more elements; It can include the presence of one or more intermediate elements between two elements that are "connected" to each other. Connections between elements may be physical, logical, or a combination thereof. As used herein, two elements are referred to by the use of one or more wires, cables and/or printed electrical connections and, as some non-limiting and non-exhaustive examples, radio frequency They can be considered to be "connected" to each other through the use of electromagnetic energy, such as electromagnetic energy having wavelengths in the microwave, light (both visible and invisible) regions.

（１９）上述した各実施形態において、「に基づいて」という記載は、別段に明記されていない限り、「のみに基づいて」を意味しない。言い換えれば、「に基づいて」という記載は、「のみに基づいて」と「に少なくとも基づいて」の両方を意味する。 (19) In each of the embodiments described above, the phrase "based on" does not mean "based only on," unless expressly specified otherwise. In other words, the phrase "based on" means both "based only on" and "based at least on."

（２０）本明細書で使用する「第１」、「第２」等の呼称を使用した要素へのいかなる参照も、それらの要素の量又は順序を全般的に限定するものではない。これらの呼称は、２つ以上の要素間を区別する便利な方法として本明細書で使用され得る。従って、第１及び第２の要素への参照は、２つの要素のみがそこで採用され得ること、又は何らかの形で第１の要素が第２の要素に先行しなければならないことを意味しない。 (20) Any reference to elements using the "first," "second," etc. designations used herein does not generally limit the quantity or order of those elements. These designations may be used herein as a convenient method of distinguishing between two or more elements. Thus, references to first and second elements do not imply that only two elements may be employed therein, or that the first element must precede the second element in any way.

（２１）上述した各実施形態において「含む(ｉｎｃｌｕｄｉｎｇ)」、「含んでいる（ｃｏｍｐｒｉｓｉｎｇ）」、及びそれらの変形が、本明細書あるいは特許請求の範囲で使用されている限り、これら用語は、用語「備える」と同様に、包括的であることが意図される。さらに、本明細書あるいは特許請求の範囲において使用されている用語「又は（or）」は、排他的論理和ではないことが意図される。 (21) To the extent that "including," "comprising," and variations thereof are used in each of the above-described embodiments in the specification or claims, these terms include: Like the term "comprising," it is intended to be inclusive. Furthermore, the term "or" as used in this specification or the claims is not intended to be an exclusive OR.

（２２）本願の全体において、例えば、英語におけるa、an及びtheのように、翻訳によって冠詞が追加された場合、これらの冠詞は、文脈から明らかにそうではないことが示されていなければ、複数を含む。 (22) Throughout this application, where articles have been added by translation, e.g. a, an and the in English, these articles shall be used unless the context clearly indicates otherwise. Including multiple.

（２３）本発明が本明細書中に説明した実施形態に限定されないことは当業者にとって明白である。本発明は、特許請求の範囲の記載に基づいて定まる本発明の趣旨及び範囲を逸脱することなく修正及び変更態様として実施できる。従って、本明細書の記載は、例示的な説明を目的とし、本発明に対して何ら制限的な意味を有さない。また、本明細書に例示した態様から選択された複数の態様を組み合わせてもよい。 (23) It will be clear to those skilled in the art that the present invention is not limited to the embodiments described herein. The present invention can be implemented as modifications and changes without departing from the spirit and scope of the present invention determined based on the description of the claims. Accordingly, the description herein is for illustrative purposes only and is not meant to be limiting in any way. Also, a plurality of aspects selected from the aspects exemplified in this specification may be combined.

１…サービスシステム、１０…サーバ装置、１１Ａ、１１Ｂ、１１Ｃ，１１Ｄ…処理装置、２０…ユーザ装置、１２Ａ、１２Ｂ…記憶装置、１４Ａ、１４Ｂ、１４Ｃ，１４Ｄ…通信装置、１５…表示部、１９…バス、１１０Ａ、１１０Ｂ…キーワード生成部、１１１…決定部、１１２…特徴量生成部、１１３Ａ、１１３Ｂ…抽出部、１１４…変換部、１２０…特定部、１３０…コメント生成部、１４０…調整部、ＰＲａ、ＰＲｂ、ＰＲｃ、ＰＲｄ…制御プログラム、ＴＢＬａ…特徴量テーブル、ＴＢＬｂ…コメントテーブル、ＤＳ…サービスデータ、ＫＷ…候補キーワード、ＫＸ…対象キーワード。

Reference Signs List 1 service system 10 server device 11A, 11B, 11C, 11D processing device 20 user device 12A, 12B storage device 14A, 14B, 14C, 14D communication device 15 display unit 19 Bus 110A, 110B Keyword generation unit 111 Determination unit 112 Feature amount generation unit 113A, 113B Extraction unit 114 Conversion unit 120 Specification unit 130 Comment generation unit 140 Adjustment unit , PRa, PRb, PRc, PRd... control program, TBLa... feature amount table, TBLb... comment table, DS... service data, KW... candidate keyword, KX... target keyword.

Claims

A plurality of images of objects are specified from a moving image based on action information indicating a user's action history and a degree of first narrowing down, and candidates for commenting on each of the specified images of a plurality of objects. a keyword generation unit that generates keywords;
a specifying unit that specifies one or more target keywords to be commented from a plurality of candidate keywords generated by the keyword generating unit, based on the behavior information and the degree of second narrowing down;
a comment generation unit that generates a comment related to the target keyword for each of the one or more target keywords;
an adjusting unit that adjusts the degree of first narrowing down and the degree of second narrowing down according to processing information related to processing of the keyword generating unit and the specifying unit;
Information processing device.

2. The information processing apparatus according to claim 1, wherein said processing information includes processing capability of said specifying unit.

The adjustment unit
When the processing power of the specific unit is high, compared to when the processing power is low, the degree of the first narrowing down is lowered and the degree of the second narrowing down is raised.
The information processing apparatus according to claim 2.

2. The information processing apparatus according to claim 1, wherein the processing information includes information regarding quality of the moving image.

The adjustment unit
When the quality of the moving image is high, compared to when the quality of the moving image is low, the degree of the first narrowing down is lowered and the degree of the second narrowing down is raised.
The information processing apparatus according to claim 4.

The adjustment unit outputs control information indicating the degree of the first narrowing down to the keyword generation unit,
The keyword generation unit is
a determination unit that determines the type of object to be extracted from the moving image based on the result of evaluating the user's degree of interest in the object based on the behavior information and the control information;
a feature quantity generating unit for generating a feature quantity of an image of the object of the type determined by the determining unit;
an extraction unit that extracts an image of an object having the feature amount from the moving image;
with
The keyword generation unit is
a conversion unit that converts the image of the object extracted by the extraction unit into the candidate keyword;
The information processing apparatus according to any one of claims 1 to 5.

an information processing apparatus according to any one of claims 1 to 6;
a communication device that receives the action information transmitted from a user device managed by the user and transmits the comment to the user device;
A server device comprising

an information processing apparatus according to any one of claims 1 to 6;
a display control unit for displaying the comment on a display device;
A user equipment comprising:

An information processing system comprising a user device managed by a user and a server device,
The user device
Based on the action information indicating the action history of the user and the degree of first narrowing down, a plurality of images respectively corresponding to the plurality of objects are specified from the moving image, and a comment is made for each of the specified images of the plurality of objects. a keyword generating unit that generates candidate keywords that are candidates for the target;
A first communication device that transmits the action information, a plurality of candidate keywords generated by the keyword generation unit, and processing information related to processing of the keyword generation unit to the server device, and receives comments transmitted from the server device. When,
A display control unit for displaying the comment on a display device,
The server device
a second communication device that receives the action information, the plurality of candidate keywords, and processing information related to processing of the keyword generation unit transmitted from the user device, and transmits the comment to the user device;
a specifying unit that specifies one or more target keywords to be commented on from the plurality of candidate keywords based on the behavior information and the degree of second narrowing down;
a comment generating unit that generates, for each of the one or more target keywords, a comment related to the target keyword as a comment to be transmitted to the user device;
a degree of narrowing down the plurality of images in the keyword generating unit; an adjustment unit that adjusts according to information and processing information related to processing of the identification unit;
Information processing system.