JP7255585B2

JP7255585B2 - Information processing device, information processing method, and program

Info

Publication number: JP7255585B2
Application number: JP2020505682A
Authority: JP
Inventors: 光平西村
Original assignee: Sony Corp; Sony Group Corp
Current assignee: Sony Corp; Sony Group Corp
Priority date: 2018-03-16
Filing date: 2019-02-08
Publication date: 2023-04-11
Anticipated expiration: 2039-02-08
Also published as: JPWO2019176398A1; WO2019176398A1

Description

本開示は、情報処理装置、情報処理方法、および、プログラムに関する。 The present disclosure relates to an information processing device, an information processing method, and a program.

従来、検索技術や機械学習技術の領域では、テキストや画像、表形式データ、時系列データ等の様々なデータを、そのデータの特徴をよく表すＮ次元ベクトルで代替すること（特徴抽出）が広く行われている。 Conventionally, in the fields of search technology and machine learning technology, replacing various data such as text, images, tabular data, and time-series data with N-dimensional vectors that well represent the characteristics of the data (feature extraction) is widely used. It is done.

ベクトル化の例として、自然言語処理の界隈では、語彙数Ｎを次元数とし、出現した単語のみ値を持つベクトルを用いて文章を代表するＢｏＷ（ＢａｇｏｆＷｏｒｄｓ）と呼ばれる手法が一般的に使われている。また、画像処理の界隈では、局所バイナリパターン（ｌｏｃａｌｂｉｎａｒｙｐａｔｔｅｒｎ：ＬＢＰ）などの局所特徴をコードワードと見なしたＢｏＶＷ（ＢａｇｏｆＶｉｓｕａｌＷｏｒｄｓ）といった技法の他、データを入力とし、特徴ベクトルを出力する深層学習モデルも多数考案されている。表形式データは、カテゴリを１－ｈｏｔな数次元ベクトルに変換する処理や、各整数値や実数値を正規化する処理によって１つのベクトルに変換される。 As an example of vectorization, in the realm of natural language processing, a method called BoW (Bag of Words) is commonly used in which the number of words N is the number of dimensions, and a vector having a value only for words that appear is used to represent sentences. It is In addition, in the area of image processing, in addition to techniques such as BoVW (Bag of Visual Words), in which local features such as local binary patterns (LBP) are regarded as codewords, data is input and feature vectors are output. Many deep learning models have also been devised. Tabular data is converted into one vector by a process of converting a category into a 1-hot number-dimensional vector and a process of normalizing each integer value and real value.

このようなベクトルで張られた空間は特徴空間と呼ばれ、検索技術や機械学習技術の素性として用いられる。今後特徴空間の需要はさらに増大し、特徴空間を複数扱い、横断的に利用したり、切り替えたりといった需要が想定される。 A space spanned by such vectors is called a feature space, and is used as features in search technology and machine learning technology. In the future, the demand for feature spaces will increase further, and it is expected that there will be demand for handling multiple feature spaces, cross-sectionally using them, and switching between them.

また、異なるモダリティ（入出力形式）を統一的に扱う横断的な特徴空間も存在する。例えば下記非特許文献１では、テキストと画像を意味的な空間にマッピングする技術（マルチモーダルをシングルスペースにマッピングする技術）が開示されている。 There is also a transversal feature space that handles different modalities (input/output formats) in a unified manner. For example, Non-Patent Document 1 below discloses a technique of mapping text and images to a semantic space (a technique of mapping multimodal to a single space).

"A New Approach to Cross-Modal Multimedia Retrieval" N. Rasiwasia, J. Costa Pereira, E. Coviello, G. Doyle, G. Lanckriet, R.Levy and N. Vasconcelos. Proceedings of the 18th ACM international conference on Multimedia, Pages 251-260, Florence, Italy - Oct. 2010."A New Approach to Cross-Modal Multimedia Retrieval" N. Rasiwasia, J. Costa Pereira, E. Coviello, G. Doyle, G. Lanckriet, R.Levy and N. Vasconcelos. Proceedings of the 18th ACM international conference on Multimedia, Pages 251-260, Florence, Italy - Oct. 2010.

ここで、特徴抽出を行う際には、特徴抽出器に入れるまでのデータの加工（前処理）が重要であり、決まったプロセスを踏む必要があるが、これらは特徴抽出器毎に異なるため、特徴空間を複数扱う場合は同じデータを特徴抽出器毎に異なるプロセスでそれぞれ前処理を行わなければならず、冗長であった。 Here, when performing feature extraction, it is important to process (preprocess) the data before entering it into the feature extractor, and it is necessary to go through a fixed process. When dealing with a plurality of feature spaces, the same data must be preprocessed by different processes for each feature extractor, which is redundant.

そこで、本開示では、複数の特徴空間を扱うシステムの利便性をより向上させることが可能な情報処理装置、情報処理方法、および、プログラムを提案する。 Therefore, the present disclosure proposes an information processing device, an information processing method, and a program that can further improve the convenience of a system that handles a plurality of feature spaces.

本開示によれば、登録要求情報に含まれる登録オブジェクトを複数の特徴抽出部に共通する一意の第１の識別情報と関連付けて記憶部に記憶する制御と、前記登録オブジェクトを前記登録オブジェクトのモダリティの定義に従って変換し、登録用の変換データを生成する制御と、前記第１の識別情報と前記登録用の変換データを、前記モダリティに対応する複数の特徴抽出器に出力する制御と、を行う制御部を備える、情報処理装置を提案する。 According to the present disclosure, control for storing a registered object included in registration request information in a storage unit in association with unique first identification information common to a plurality of feature extraction units, and modality of the registered object for the registered object and control to generate conversion data for registration, and control to output the first identification information and the conversion data for registration to a plurality of feature extractors corresponding to the modality. An information processing apparatus including a control unit is proposed.

本開示によれば、プロセッサが、登録要求情報に含まれる登録オブジェクトを複数の特徴抽出部に共通する一意の第１の識別情報と関連付けて記憶部に記憶する制御と、前記登録オブジェクトを前記登録オブジェクトのモダリティの定義に従って変換し、登録用の変換データを生成する制御と、前記第１の識別情報と前記登録用の変換データを、前記モダリティに対応する複数の特徴抽出器に出力する制御と、を行うことを含む、情報処理方法を提案する。 According to the present disclosure, a processor controls to store a registered object included in registration request information in a storage unit in association with unique first identification information common to a plurality of feature extraction units; Control for converting according to the definition of the modality of the object to generate conversion data for registration, and control for outputting the first identification information and the conversion data for registration to a plurality of feature extractors corresponding to the modalities. We propose a method for processing information, including:

本開示によれば、コンピュータを、登録要求情報に含まれる登録オブジェクトを複数の特徴抽出部に共通する一意の第１の識別情報と関連付けて記憶部に記憶する制御と、前記登録オブジェクトを前記登録オブジェクトのモダリティの定義に従って変換し、登録用の変換データを生成する制御と、前記第１の識別情報と前記登録用の変換データを、前記モダリティに対応する複数の特徴抽出器に出力する制御と、を行う制御部として機能させるための、プログラムを提案する。 According to the present disclosure, a computer controls to store a registered object included in registration request information in a storage unit in association with unique first identification information common to a plurality of feature extraction units; Control for converting according to the definition of the modality of the object to generate conversion data for registration, and control for outputting the first identification information and the conversion data for registration to a plurality of feature extractors corresponding to the modalities. We propose a program for functioning as a control unit that performs

以上説明したように本開示によれば、複数の特徴空間を扱うシステムの利便性をより向上させることが可能となる。 As described above, according to the present disclosure, it is possible to further improve the convenience of a system that handles multiple feature spaces.

なお、上記の効果は必ずしも限定的なものではなく、上記の効果とともに、または上記の効果に代えて、本明細書に示されたいずれかの効果、または本明細書から把握され得る他の効果が奏されてもよい。 In addition, the above effects are not necessarily limited, and in addition to the above effects or instead of the above effects, any of the effects shown in this specification, or other effects that can be grasped from this specification may be played.

本開示の一実施形態による情報処理システムの概要について説明する図である。1 is a diagram describing an overview of an information processing system according to an embodiment of the present disclosure; FIG. 本実施形態による情報処理システムの主な機能ブロックにおける処理内容の一例を示す図である。It is a figure which shows an example of the processing content in the main functional blocks of the information processing system by this embodiment. 本実施形態による情報処理システムの他のシステム構成例の一例を示す図である。It is a figure which shows an example of the other system configuration example of the information processing system by this embodiment. 本実施形態による情報処理システムにおけるオブジェクトの登録処理の流れの一例を示すシーケンス図である。FIG. 7 is a sequence diagram showing an example of the flow of object registration processing in the information processing system according to the present embodiment; 本実施形態による情報処理システムにおける検索処理の流れの一例を示すシーケンス図である。FIG. 4 is a sequence diagram showing an example of the flow of search processing in the information processing system according to the embodiment; 本実施形態における検索画面の一例を示す図である。It is a figure which shows an example of the search screen in this embodiment. 本実施形態による第１の応用例における特徴抽出器のモジュール化について説明する図である。It is a figure explaining modularization of the feature extractor in the 1st application example by this embodiment. 本実施形態による第１の応用例の登録処理の一例を示すシーケンス図である。FIG. 12 is a sequence diagram showing an example of registration processing of the first application example according to the present embodiment; 本実施形態による第１の応用例の検索処理の一例を示すシーケンス図である。FIG. 10 is a sequence diagram showing an example of search processing of the first application example according to the present embodiment; 本実施形態による第２の応用例における検索画面の一例を示す図である。FIG. 11 is a diagram showing an example of a search screen in a second application example according to the embodiment; 本実施形態による第２の応用例における検索画面の他の例を示す図である。FIG. 10 is a diagram showing another example of a search screen in the second application example according to the embodiment; 本実施形態による第２の応用例の検索処理の一例を示すシーケンス図である。FIG. 12 is a sequence diagram showing an example of search processing in a second application example according to the present embodiment; 本実施形態による第３の応用例のサジェストシステムの構成の一例を示す機能ブロック図である。It is a functional block diagram which shows an example of a structure of the suggestion system of the 3rd application example by this embodiment. 本実施形態による第３の応用例のサジェストシステムにおける検索処理の流れの一例を示すシーケンス図である。FIG. 12 is a sequence diagram showing an example of the flow of search processing in the suggestion system of the third application example according to the present embodiment; 本実施形態による第３の応用例におけるアプリケーションから取得する操作情報と要求情報の一例を示す図である。FIG. 14 is a diagram showing an example of operation information and request information acquired from an application in a third application example according to the embodiment;

以下に添付図面を参照しながら、本開示の好適な実施の形態について詳細に説明する。なお、本明細書及び図面において、実質的に同一の機能構成を有する構成要素については、同一の符号を付することにより重複説明を省略する。 Preferred embodiments of the present disclosure will be described in detail below with reference to the accompanying drawings. In the present specification and drawings, constituent elements having substantially the same functional configuration are denoted by the same reference numerals, thereby omitting redundant description.

また、説明は以下の順序で行うものとする。
１．本開示の一実施形態による情報処理システムの概要
２．構成
２－１．情報処理装置１０の構成
２－２．特徴管理サーバ２０の構成
３．動作処理
３－１．登録処理
３－２．検索処理
４．応用例
４－１．第１の応用例：モダリティの包含関係の定義
４－２．第２の応用例：検索結果のマージ
４－３．第３の応用例：サジェストシステム
５．まとめAlso, the description shall be given in the following order.
1. Overview of an information processing system according to an embodiment of the present disclosure2. Configuration 2-1. Configuration of information processing apparatus 2-2. Configuration of feature management server 20 3 . Operation processing 3-1. Registration process 3-2. Search processing 4 . Application example 4-1. First Application Example: Definition of Inclusive Relationship of Modalities 4-2. Second Application Example: Merging Search Results 4-3. Third Application Example: Suggestion System5. summary

＜＜１．本開示の一実施形態による情報処理システムの概要＞＞
図１は、本開示の一実施形態による情報処理システムの概要について説明する図である。図１に示すように、本実施形態による情報処理システムは、情報処理装置１０と、特徴管理サーバ２０とを有する構成となっている。<<1. Outline of information processing system according to an embodiment of the present disclosure>>
FIG. 1 is a diagram illustrating an overview of an information processing system according to an embodiment of the present disclosure. As shown in FIG. 1, the information processing system according to this embodiment has a configuration including an information processing device 10 and a feature management server 20 .

特徴管理サーバ２０は、テキストや画像、表形式データ、時系列データ等の様々なデータ（以下、「オブジェクト」と称す）を、そのデータの特徴をよく表すＮ次元ベクトルで代替する処理（特徴抽出処理）を行う特徴抽出部２０２（特徴抽出器の一例）を有し、かかるベクトルで張られた空間（特徴空間）を管理している。本明細書では、特徴抽出部２０２と特徴空間は１対１の関係にある。特徴管理サーバ２０は、図１に示すように複数存在していてもよく、それぞれ１の特徴抽出部２０２を有する構成となっている（すなわち、各特徴管理サーバ２０が、それぞれ１の特徴空間を管理していると言える）。 The feature management server 20 replaces various data such as texts, images, tabular data, and time-series data (hereinafter referred to as "objects") with N-dimensional vectors that well represent the features of the data (feature extraction). processing), and manages a space (feature space) spanned by such vectors. In this specification, there is a one-to-one relationship between the feature extraction unit 202 and the feature space. A plurality of feature management servers 20 may exist as shown in FIG. 1, each having one feature extraction unit 202. said to be in control).

ここで、一般的な特徴空間の扱いについて以下説明する。特徴空間では、オブジェクト間の類似度や関係の表現ができ、例えば概念での検索および推薦が可能となる。また、特徴空間では、他の機械学習技術の精度を向上させることができ、例えば、認識系、データ解析等で素性として利用することが可能となる。また、特徴空間では、異なるモダリティを統一的に扱うことができ、例えば、テキストも手書きもノートも１つの特徴空間で扱うことが可能となる。本明細書において「モダリティ」とは、データの入出力形式であって、例えば下記に挙げるように多岐に渡るモダリティが想定される。
・テキスト－単語、文章、ＨＴＭＬ（HyperText Markup Language）など
・メディア－ＲＧＢ画像、深度画像、ベクタ画像、動画、音声など
・複合文書－オフィス文書、ＰＤＦ、Ｗｅｂページ、電子メールなど
・メタデータ－ユーザ、日付など
・センサデータ－現在位置、加速度、心拍数など
・アプリケーションデータ－起動ログ、処理中のファイル情報など
このようなモダリティに対して特徴空間を定義でき、自由に拡張可能となる。Here, the handling of a general feature space will be described below. In the feature space, it is possible to express similarities and relationships between objects, and for example, to search and recommend concepts. In addition, in the feature space, the accuracy of other machine learning techniques can be improved, and for example, it can be used as features in recognition systems, data analysis, and the like. Also, in the feature space, different modalities can be handled in a unified manner, and for example, text, handwriting, and notes can be handled in one feature space. In this specification, the term "modality" refers to a data input/output format, and includes a wide variety of modalities, such as those listed below.
・Text - words, sentences, HTML (HyperText Markup Language), etc. ・Media - RGB images, depth images, vector images, videos, audio, etc. ・Compound documents - office documents, PDFs, web pages, emails, etc. ・Metadata - users , date, etc. Sensor data - current position, acceleration, heart rate, etc. Application data - startup log, file information being processed, etc. A feature space can be defined for these modalities, and can be expanded freely.

また、複数の特徴空間を横断的に扱うことも可能である。例えば、テキスト、ノート、およびＷｅｂページを意味の視点で処理することが可能な第１の特徴空間と、手書きと画像を形の視点で処理することが可能な第２の特徴空間とがある場合に、これらを所定の関数を通してマッピングしてもよい。これにより、例えば、テキストで画像や手書きを検索したり、今見ているＷｅｂページに関連したノートを探したり、今書いているノートに近いタグを自動的に付与するシステムをすぐに生成することが可能となる。 Also, it is possible to cross-handle multiple feature spaces. For example, if there is a first feature space that can process text, notes, and web pages from a semantic perspective, and a second feature space that can process handwriting and images from a shape perspective. , these may be mapped through a predetermined function. As a result, for example, it is possible to quickly create a system that searches for images and handwriting in text, searches for notes related to the web page you are currently viewing, and automatically adds tags similar to the notes you are currently writing. It becomes possible.

また、複数の特徴空間を横断的に扱う方法として、サードパーティによる拡張（プラグインによる拡張）も想定される。より具体的には、他者でも同じ特徴空間に別のモダリティを関連付けたり、同じモダリティから別の特徴空間を構成することが可能である。また、これらを両方行って拡張することも可能である（例えば、「論文」や「判例」のモダリティを関連付けると共に、症例毎のセンサデータを入れることで、論文や判例に近い症例を探すことができる）。また、他者が扱う特徴空間と混ぜて利用することもできる（例えば、テキストや画像から商品を探す）。 Further, extension by a third party (extension by plug-in) is also envisioned as a method of cross-cutting multiple feature spaces. More specifically, others can associate different modalities with the same feature space, or construct different feature spaces from the same modality. It is also possible to expand by doing both (for example, by associating the modalities of "papers" and "judicial precedents" and inserting sensor data for each case, it is possible to search for cases that are similar to the papers and judicial precedents. can). It can also be used in combination with feature spaces handled by others (for example, searching for products from text or images).

このようにして、複数のマルチモーダル空間中のあらゆるデータを横断検索することが可能となる。 In this way, it is possible to traverse any data in multiple multimodal spaces.

（背景）
ここで、このような複数の特徴空間を横断的に検索するシステムの構築に関し、以下のような問題が考えられる。(background)
Here, the following problems are conceivable with regard to the construction of such a system for cross-searching a plurality of feature spaces.

まず、特徴抽出を行う際には、特徴抽出器に入れるまでのデータの加工（前処理）が重要であり、決まったプロセスを踏む必要がある。前処理の決まったプロセスとしては、例えば画像を例に取ると、
・ＪＰＥＧもしくはＰＮＧデータのみ受け付ける
・ＲＧＢ３チャンネル、２５６ｘ２５６ｐｘに統一する
・アスペクト比が１：１でない場合に短辺を引き伸ばす
・平滑化等のフィルタ処理を掛ける
といった処理が行われる。First, when performing feature extraction, it is important to process (pre-process) data before entering it into the feature extractor, and it is necessary to go through a fixed process. As a fixed process of preprocessing, for example, taking an image,
・Only JPEG or PNG data is accepted. ・Uniform to RGB 3 channels, 256x256px. ・If the aspect ratio is not 1:1, the short side is stretched.

また、テキストの前処理の例としては、
・クリーニング処理（テキスト中のノイズを除去。例えばＷｅｂテキストの場合、ＨＴＭＬタグやＪａｖａＳｃｒｉｐｔ（登録商標）のソースコードなど）
・文章の単語分割
・単語の正規化
・ストップワード除去
などが挙げられる。Also, as an example of text preprocessing,
・Cleaning processing (remove noise in the text. For example, in the case of web text, HTML tags, JavaScript (registered trademark) source code, etc.)
・Sentence word segmentation, word normalization, stop word removal, etc.

しかしながら、これらは特徴抽出器毎に異なるため、特徴空間を複数扱う場合、同じデータを特徴抽出器毎に異なるプロセスでそれぞれ前処理を行わなければならず、冗長であった。 However, since these are different for each feature extractor, when handling a plurality of feature spaces, the same data must be preprocessed by different processes for each feature extractor, which is redundant.

そこで、本実施形態では、複数の特徴抽出器における前処理を共通化することで、複数の特徴空間を扱うシステムの利便性をより向上させることを可能とする。 Therefore, in the present embodiment, by sharing preprocessing in a plurality of feature extractors, it is possible to further improve the convenience of a system that handles a plurality of feature spaces.

具体的には、モダリティ毎に所定のデータ形式へ変換するルールを定義付け、情報処理装置１０においてオブジェクトのモダリティの定義に従って変換した変換データ（以下、「entity」とも称す）を、当該モダリティに対応する１以上の特徴管理サーバ２０（特徴抽出器の一例である特徴抽出部２０２を有するサーバ装置）に出力する。 Specifically, rules for conversion into a predetermined data format are defined for each modality, and conversion data (hereinafter also referred to as "entity") converted according to the definition of the modality of the object in the information processing apparatus 10 is converted to correspond to the modality. output to one or more feature management servers 20 (a server device having a feature extraction unit 202, which is an example of a feature extractor).

また、一般的なプログラミングの型として扱えない一部のデータに関し、モダリティの定義を規定しておくことで、処理の難しいデータも読み込める上、データの前処理を統一することが可能となる。 In addition, by prescribing modality definitions for some data that cannot be handled as general programming types, it is possible to read data that is difficult to process, and to unify data preprocessing.

例えば以下のようなデータでの需要が想定される。
（パターン１：形式間の変換／抽出）
・ベクタデータ（通常テキストファイル）を他の画像と同様に扱うためにレンダリング（描画）する
・ＰＤＦ、オフィス文書等をテキストとして扱うためにテキストを抜き出す
・ｉＣａｌｅｎｄａｒ（スケジュールの標準フォーマット）から予定の名前と日程、参加者のみを抽出する
（パターン２：形式の統一）
手書きデータなどは、各社が異なるフォーマットを提唱しており、標準のフォーマットが存在しない。そういったデータの共通項を取り出し、処理可能な形式に変換する。
（パターン３：特殊なデータの読み込み）
血圧や心拍数等、まだあまりデジタルで扱われていないデータは、テキスト等の一般的なデータで記述される場合が多い。それらを読み込み、処理しやすい（例えば整数列などの）データ形式に変換する。
（パターン４：データの取得）
・ＵＲＬから画像を取得する
・ＩＤを入力とし、外部の特定のデータベースからデータを引き出すFor example, the demand for the following data is assumed.
(Pattern 1: conversion/extraction between formats)
・Rendering (drawing) vector data (usually text files) to handle it like other images ・Extracting text from PDFs, office documents, etc. to handle them as text ・Name of appointment from iCalendar (standard format for schedules) , dates, and participants only (Pattern 2: Format unification)
Each company has proposed different formats for handwritten data, and there is no standard format. It extracts the common denominator of such data and transforms it into a processable format.
(Pattern 3: Reading special data)
Data such as blood pressure and heart rate, which are not handled digitally yet, are often described as general data such as text. Read them and convert them into a data format that is easier to process (eg, an integer string).
(Pattern 4: Acquisition of data)
・Acquire an image from a URL ・With an ID as an input, extract data from a specific external database

また、複数の特徴空間を扱って検索データベース（以下、検索ＤＢ）への登録を行う場合、特徴空間毎にデータを保存すると、同じデータが複数のデータベースに登録されてしまい、冗長である。そこで検索ＤＢとは別に、データを格納し、ＩＤからデータを取り出せるデータベースを用意し、検索ＤＢにはＩＤのみを格納することが考えられる。この際、ユーザがデータと共に任意にＩＤを入力するようにすると、以下のような問題が生じる可能性があり、ユーザ側でケアしなければならない。
・同じデータに対して複数のＩＤを関連付けることが可能であること
・複数のデータに対して同じＩＤを関連付けることが可能であること
・データを一部の検索ＤＢのみに保存する場合、異なる検索ＤＢに保存されているＩＤが同一のデータを指す保証がない（検索結果の比較やまとめができない）
・結果として、ＩＤから元のデータを取り出せる保証がされないAlso, when handling a plurality of feature spaces and registering them in a search database (hereinafter referred to as a search DB), if data is saved for each feature space, the same data will be registered in a plurality of databases, which is redundant. Therefore, it is conceivable to store data separately from the search DB and prepare a database from which data can be retrieved from IDs, and to store only IDs in the search DB. At this time, if the user arbitrarily inputs the ID together with the data, the following problems may arise, and the user must take care of them.
・It is possible to associate multiple IDs with the same data. ・It is possible to associate the same ID with multiple data. There is no guarantee that the IDs stored in the DB point to the same data (search results cannot be compared or summarized)
・As a result, there is no guarantee that the original data can be extracted from the ID.

そこで、本実施形態では、情報処理装置１０において、上記前処理の共通化と共に、登録オブジェクトの変換データに複数の特徴抽出器に共通する一意の（ユニークな）ＩＤ（識別情報）を付与すると共に、登録オブジェクトを保存することで上記問題を解決し、複数の特徴空間を扱うシステムの利便性をさらに向上させることを可能とする。 Therefore, in the present embodiment, in the information processing apparatus 10, in addition to standardization of the preprocessing, a unique ID (identification information) common to a plurality of feature extractors is assigned to the transformation data of the registered object. , the above problem can be solved by saving registered objects, and the convenience of a system that handles a plurality of feature spaces can be further improved.

以上説明したように、本実施形態による情報処理システムでは、オブジェクトのモダリティごとに統一的に変換すると共に、複数の特徴空間に共通する一意のＩＤでオブジェクトを管理することで、複数の特徴空間を扱うシステムの利便性をより向上させることを可能とする。 As described above, in the information processing system according to the present embodiment, a plurality of feature spaces are transformed by uniformly transforming objects for each modality and managing objects with unique IDs common to a plurality of feature spaces. It is possible to further improve the convenience of the system to be handled.

このような本実施形態による情報処理システムに含まれる情報処理装置１０および特徴管理サーバ２０の構成について、以下説明する。 The configurations of the information processing apparatus 10 and the feature management server 20 included in the information processing system according to this embodiment will be described below.

＜＜２．構成＞＞
＜２－１．情報処理装置１０の構成＞
図１に示すように、本実施形態による情報処理装置１０は、制御部１００、通信部１２０、出力部１３０、および記憶部１４０を有する。情報処理装置１０は、例えばユーザに利用されるスマートフォン、タブレット端末、またはＰＣ等のローカル端末である。<<2. Configuration>>
<2-1. Configuration of Information Processing Device 10>
As shown in FIG. 1, the information processing apparatus 10 according to this embodiment has a control unit 100, a communication unit 120, an output unit 130, and a storage unit 140. FIG. The information processing device 10 is, for example, a local terminal such as a smart phone, a tablet terminal, or a PC used by a user.

（制御部１００）
制御部１００は、演算処理装置および制御装置として機能し、各種プログラムに従って情報処理装置１０内の動作全般を制御する。制御部１００は、例えばＣＰＵ（Central Processing Unit）、マイクロプロセッサ等の電子回路によって実現される。また、制御部１００は、使用するプログラムや演算パラメータ等を記憶するＲＯＭ（Read Only Memory）、及び適宜変化するパラメータ等を一時記憶するＲＡＭ（Random Access Memory）を含んでいてもよい。(control unit 100)
The control unit 100 functions as an arithmetic processing device and a control device, and controls general operations within the information processing device 10 according to various programs. The control unit 100 is realized by an electronic circuit such as a CPU (Central Processing Unit), a microprocessor, or the like. The control unit 100 may also include a ROM (Read Only Memory) that stores programs to be used, calculation parameters, and the like, and a RAM (Random Access Memory) that temporarily stores parameters that change as appropriate.

また、本実施形態による制御部１００は、特徴空間管理部１０１（Feature Space Manager）およびモダリティ管理部１０２（Modality Manager）としても機能する。 The control unit 100 according to this embodiment also functions as a feature space manager 101 and a modality manager 102 .

特徴空間管理部１０１は、扱う特徴空間の管理（１以上の特徴管理サーバ２０のＩＤの取得など）や、オブジェクトの登録処理、検索処理等を行う。ここで、図２に、本実施形態による情報処理システムの主な機能ブロック（特徴空間管理部１０１、モダリティ管理部１０２、および特徴管理部２０１）における処理内容の一例を示す。図２に示すように、例えば本実施形態による特徴空間管理部１０１は、下記のような処理を行い得る。
・Get Manager(space ID: string): Feature Manager
・Register Manager(manager: Feature Manager)
・Get Vector(space ID: string, obj: object, modality: string): vector
・Register Object(obj: object, modality: string, space ID: string=ANY)
・Search(query: object, query Modality: string, target Modality: string=ANY): Search ResultThe feature space management unit 101 manages the feature space to be handled (acquisition of IDs of one or more feature management servers 20, etc.), object registration processing, search processing, and the like. Here, FIG. 2 shows an example of processing contents in the main functional blocks (feature space management unit 101, modality management unit 102, and feature management unit 201) of the information processing system according to this embodiment. As shown in FIG. 2, for example, the feature space management unit 101 according to this embodiment can perform the following processing.
・Get Manager (space ID: string): Feature Manager
・Register Manager (manager: Feature Manager)
・GetVector(space ID: string, obj: object, modality: string): vector
・Register Object (obj: object, modality: string, space ID: string=ANY)
・Search(query: object, query Modality: string, target Modality: string=ANY): Search Result

より具体的には、例えば特徴空間管理部１０１は、登録要求情報が入力された際、登録要求情報に含まれるオブジェクトおよびモダリティをモダリティ管理部１０２に出力し、モダリティ管理部１０２においてモダリティの定義に従って変換された変換データ（entity）およびＩＤを取得し、当該entityおよびＩＤを特徴管理サーバ２０に出力する。この際、特徴空間管理部１０１は、オブジェクトのモダリティに対応する１以上の特徴管理サーバ２０に出力し得る。「モダリティに対応する特徴管理サーバ２０」とは、当該モダリティを扱うことが可能な特徴管理サーバ２０である。特徴空間管理部１０１は、各特徴空間がどのような情報を扱っているかを特徴管理サーバ２０のＩＤ（space ID: stringなど）から把握することが可能である。従って、例えば特徴空間管理部１０１は、modality: "Text"の場合に、各space IDを参照し、"Text"を扱う特徴空間を管理している特徴管理サーバ２０を特定することが可能となる。若しくは、特徴空間管理部１０１は、全ての特徴管理サーバ２０に出力するようにしてもよい（この場合、特徴管理サーバ２０側で適宜処理可否が判断され得る）。 More specifically, for example, when the registration request information is input, the feature space management unit 101 outputs the object and modality included in the registration request information to the modality management unit 102, and the modality management unit 102 modality modality definition. The converted data (entity) and the ID are acquired, and the entity and the ID are output to the feature management server 20 . At this time, the feature space management unit 101 can output to one or more feature management servers 20 corresponding to the modality of the object. The “feature management server 20 corresponding to the modality” is the feature management server 20 capable of handling the modality. The feature space management unit 101 can grasp what kind of information each feature space handles from the ID (space ID: string, etc.) of the feature management server 20 . Therefore, for example, in the case of modality: "Text", the feature space management unit 101 can refer to each space ID and specify the feature management server 20 that manages the feature space that handles "Text". . Alternatively, the feature space management unit 101 may output to all the feature management servers 20 (in this case, the feature management server 20 side can appropriately determine whether or not processing is possible).

また、特徴空間管理部１０１は、検索要求情報が入力された際、検索要求に含まれるオブジェクトおよびモダリティをモダリティ管理部１０２に出力し、モダリティ管理部１０２においてモダリティの定義に従って変換された変換データ（entity）を取得し、当該entityを特徴管理サーバ２０に出力する。この際、特徴空間管理部１０１は、オブジェクトのモダリティに対応する１以上の特徴管理サーバ２０に出力し得る。また、特徴空間管理部１０１は、検索要求に検索条件としてspace IDが含まれていた場合、指定されたspace IDに対応する特徴管理サーバ２０に出力するようにしてもよい。若しくは、特徴空間管理部１０１は、全ての特徴管理サーバ２０に出力するようにしてもよい（この場合、特徴管理サーバ２０側で適宜処理可否が判断され得る）。 Further, when the search request information is input, the feature space management unit 101 outputs the object and modality included in the search request to the modality management unit 102, and the converted data ( entity) and outputs the entity to the feature management server 20 . At this time, the feature space management unit 101 can output to one or more feature management servers 20 corresponding to the modality of the object. Also, when a space ID is included as a search condition in a search request, the feature space management unit 101 may output to the feature management server 20 corresponding to the designated space ID. Alternatively, the feature space management unit 101 may output to all the feature management servers 20 (in this case, the feature management server 20 side can appropriately determine whether or not processing is possible).

そして、特徴空間管理部１０１は、特徴管理サーバ２０において検索された１以上のＩＤに基づいて、記憶部１４０から対応するオブジェクト（すなわち元のデータ）を取り出し、検索結果として検索要求元に出力する。なお、検索条件には、追加条件としてフィルター情報がさらに含まれていてもよい。フィルター情報としては、例えば、検索ＤＢ（検索対象の特徴空間に相当）の指定や、検索数の指定等が挙げられる。特徴空間管理部１０１は、例えば検索数が指定されている場合、各検索結果の類似度（各検索結果の、検索オブジェクトの変換データ（entity）の特徴量との類似度合いを示す類似度）に基づいて、上位所定数の検索結果を検索要求元に出力するようにしてもよい。 Based on one or more IDs retrieved by the feature management server 20, the feature space management unit 101 retrieves the corresponding object (that is, the original data) from the storage unit 140, and outputs it as a search result to the search requester. . Note that the search condition may further include filter information as an additional condition. Examples of filter information include specification of a search DB (corresponding to a feature space to be searched) and specification of the number of searches. For example, when the number of searches is specified, the feature space management unit 101 calculates the similarity of each search result (similarity indicating the similarity of each search result to the feature amount of the transformation data (entity) of the search object). Based on this, a predetermined number of top search results may be output to the search request source.

モダリティ管理部１０２は、モダリティの管理を行う。例えば図２にも示すように、モダリティ管理部１０２は下記のような処理を行い得る。
・Get Modalities(): string[]
・Register Modality(modality: string, definer: Modality Definer)
・Create(obj: object, modality: string): entityThe modality management unit 102 manages modalities. For example, as also shown in FIG. 2, the modality management unit 102 can perform the following processing.
・Get Modalities(): string[]
・Register Modality (modality: string, definer: Modality Definer)
・Create(obj: object, modality: string): entity

より具体的には、例えばモダリティ管理部１０２は、モダリティの定義（データ）を登録したり、モダリティ定義部１０３により、特徴空間管理部１０１から入力されたオブジェクト（登録オブジェクト）を当該オブジェクトのモダリティの定義に従って所定のデータ形式に変換し、変換データ（entity）を生成したりする。entityは、特徴抽出部２０２に直接渡すことのできる、整形されたデータである。また、モダリティ定義部１０３は、生成したentityに、複数の特徴空間に共通する一意のＩＤを付与し、entityおよびＩＤを、特徴空間管理部１０１に出力する。さらに、モダリティ定義部１０３は、オブジェクト（登録オブジェクト）と、当該登録オブジェクトのentityに付与したＩＤを関連付けて記憶部１４０に記憶する。かかるＩＤは、同じモダリティを扱う複数の特徴空間にまたがって一意な文字列である。また、同じentityに対しては同じ値を返すよう、ハッシュ値等を用いてもよい。 More specifically, for example, the modality management unit 102 registers the modality definition (data), and the modality definition unit 103 registers the object (registered object) input from the feature space management unit 101 as the modality of the object. It converts to a predetermined data format according to the definition and generates converted data (entity). An entity is shaped data that can be passed directly to the feature extractor 202 . Further, modality definition section 103 assigns a unique ID common to a plurality of feature spaces to the generated entity, and outputs the entity and the ID to feature space management section 101 . Further, the modality definition unit 103 associates the object (registered object) with the ID assigned to the entity of the registered object and stores them in the storage unit 140 . Such an ID is a unique character string across multiple feature spaces that deal with the same modality. Also, a hash value or the like may be used so that the same value is returned for the same entity.

また、モダリティの定義のデータは、入力データ(obj)として受け取れる形式の定義データ（例えば、ファイル名(string)、OpenCVのMat形式）（複数可）と、出力データ(entity)の形式の定義データ（１つのみ）（例えば、char[3][256][256]）とを有する。モダリティの定義データは、例えば、データ形式毎や、上述したパターン毎（形式間の変換、形式の統一、特殊なデータの読み込み等）に存在し、記憶部１４０に記憶されている。モダリティ管理部１０２は、かかるモダリティの定義データを用いて、登録オブジェクト（入力データ）を変換し、変換データ（entity）を出力する。また、モダリティ定義部１０３は、ＩＤに対応するデータを必要に応じて保存するか、保存されていることを確認する機能と、ＩＤに対応するデータを取り出す機能と、ＩＤに対応するデータを削除する機能を有する。モダリティ定義部１０３は、モダリティ毎に存在する。 In addition, the modality definition data consists of definition data in a format that can be received as input data (obj) (e.g., file name (string), OpenCV Mat format) (multiple possible) and definition data in the format of output data (entity) (only one) (eg char[3][256][256]). The modality definition data exists, for example, for each data format or for each pattern described above (conversion between formats, unification of formats, reading of special data, etc.), and is stored in the storage unit 140 . The modality management unit 102 converts the registered object (input data) using the modality definition data, and outputs the converted data (entity). In addition, the modality definition unit 103 has a function of saving data corresponding to the ID as required, a function of confirming that the data is saved, a function of retrieving data corresponding to the ID, and a function of deleting the data corresponding to the ID. It has the function to A modality definition unit 103 exists for each modality.

（入力部１１０）
入力部１１０は、ユーザによる操作指示を受け付ける操作入力部や、ユーザのよる音声指示を受け付ける音声入力部など、ユーザによる指示内容を受け付ける機能を有し、その指示内容を制御部１００に出力する。操作入力部は、タッチセンサ、圧力センサ、若しくは近接センサであってもよい。あるいは、入力部１１０は、ボタン、スイッチ、およびレバーなど、物理的構成であってもよい。(Input unit 110)
The input unit 110 has a function of receiving instructions from the user, such as an operation input unit that receives operation instructions from the user and a voice input unit that receives voice instructions from the user, and outputs the instructions to the control unit 100. The operation input unit may be a touch sensor, pressure sensor, or proximity sensor. Alternatively, input 110 may be a physical structure such as buttons, switches, and levers.

（通信部１２０）
通信部１２０は、有線または無線により外部装置と接続し、外部装置とデータの送受信を行う。例えば通信部１２０は、有線／無線ＬＡＮ（Local Area Network）、またはＷｉ－Ｆｉ（登録商標）、Ｂｌｕｅｔｏｏｔｈ（登録商標）、携帯通信網（ＬＴＥ：Long Term Evolution、３Ｇ（第３世代の移動体通信方式））等により、ネットワーク（不図示）に接続し、ネットワークを介して特徴管理サーバ２０とデータの送受信を行い得る。(Communication unit 120)
The communication unit 120 connects to an external device by wire or wirelessly, and transmits and receives data to and from the external device. For example, the communication unit 120 can be a wired/wireless LAN (Local Area Network), Wi-Fi (registered trademark), Bluetooth (registered trademark), mobile communication network (LTE: Long Term Evolution, 3G (third generation mobile communication method)), etc., to transmit and receive data to and from the feature management server 20 via the network (not shown).

（出力部１３０）
出力部１３０は、表示部および音声出力部等、ユーザへの情報提示（出力）を行う機能を有する。例えば出力部１３０は、制御部１００の制御に従って、検索画面を出力したり、検索結果を出力したりする。(Output unit 130)
The output unit 130 has a function of presenting (outputting) information to the user, such as a display unit and an audio output unit. For example, the output unit 130 outputs a search screen or a search result under the control of the control unit 100 .

（記憶部１４０）
記憶部１４０は、制御部１００の処理に用いられるプログラムや演算パラメータ等を記憶するＲＯＭ（Read Only Memory）、および適宜変化するパラメータ等を一時記憶するＲＡＭ（Random Access Memory）により実現される。(storage unit 140)
The storage unit 140 is implemented by a ROM (Read Only Memory) that stores programs, calculation parameters, and the like used in the processing of the control unit 100, and a RAM (Random Access Memory) that temporarily stores parameters that change as appropriate.

例えば、記憶部１４０には、特徴空間の管理情報、モダリティの定義情報、および、一意のＩＤが付与されたオブジェクト（実体データ）が格納される。 For example, the storage unit 140 stores feature space management information, modality definition information, and objects (entity data) assigned unique IDs.

以上、本実施形態による情報処理装置１０の構成について具体的に説明した。なお情報処理装置１０の構成は、図１に示す例に限定されない。例えば、情報処理装置１０の制御部１００による各処理を複数の装置で実行するようにしてもよいし、ネットワーク上のサーバで実行するようにしてもよい。 The configuration of the information processing apparatus 10 according to the present embodiment has been specifically described above. Note that the configuration of the information processing apparatus 10 is not limited to the example shown in FIG. For example, each process by the control unit 100 of the information processing apparatus 10 may be executed by a plurality of apparatuses, or may be executed by a server on a network.

＜２－２．特徴管理サーバ２０の構成＞
図２に示すように、特徴管理サーバ２０は、制御部２００、通信部２１０、および特徴量データベース２２０を有する。なお、本実施形態において、特徴管理サーバ２０は１の特徴抽出部２０２を有するため、各特徴管理サーバ２０はそれぞれ１の特徴空間を管理していると言えるが、本開示は、これに限定されず、例えば特徴管理サーバ２０が複数の特徴抽出部２０２を有していれば、複数の特徴空間を管理することも可能である。<2-2. Configuration of Feature Management Server 20>
As shown in FIG. 2 , the feature management server 20 has a control section 200 , a communication section 210 and a feature amount database 220 . In the present embodiment, since each feature management server 20 has one feature extraction unit 202, it can be said that each feature management server 20 manages one feature space, but the present disclosure is limited to this. Instead, for example, if the feature management server 20 has a plurality of feature extraction units 202, it is possible to manage a plurality of feature spaces.

（制御部２００）
制御部２００は、演算処理装置および制御装置として機能し、各種プログラムに従って特徴管理サーバ２０内の動作全般を制御する。制御部２００は、例えばＣＰＵ（Central Processing Unit）、マイクロプロセッサ等の電子回路によって実現される。また、制御部２００は、使用するプログラムや演算パラメータ等を記憶するＲＯＭ（Read Only Memory）、及び適宜変化するパラメータ等を一時記憶するＲＡＭ（Random Access Memory）を含んでいてもよい。(control unit 200)
The control unit 200 functions as an arithmetic processing device and a control device, and controls overall operations within the feature management server 20 according to various programs. The control unit 200 is realized by an electronic circuit such as a CPU (Central Processing Unit), a microprocessor, or the like. The control unit 200 may also include a ROM (Read Only Memory) for storing programs to be used, calculation parameters, and the like, and a RAM (Random Access Memory) for temporarily storing parameters and the like that change as appropriate.

また、本実施形態による制御部２００は、特徴管理部２０１としても機能する。特徴管理部２０１は、情報処理装置１０から送信されたentityに対し、特徴抽出部２０２により特徴抽出（例えば、Ｎ次元ベクトルへの代替処理）を行い、抽出した特徴量を、当該entityのＩＤに関連付けて、特徴量データベース２２０へ登録する処理を行う。特徴量抽出のアルゴリズムは既存の技術を用いることが可能であり、ここでは特に限定しない。また、特徴抽出部２０２は、異なるモダリティを統一的に扱うことが可能である。特徴抽出部２０２で抽出した特徴量は、特徴量データベース２２０に登録されるが、ここで、特徴量データベース２２０は、モダリティ毎に存在する。特徴管理部２０１は、特徴抽出部２０２で抽出した特徴量を、元データ（情報処理装置１０から送信されたentity）のモダリティに対応する特徴量データベース２２０に、上記ＩＤと関連付けて登録する。例えば、色特徴という視点で異なるモダリティ（例えば手書き（Strokes）と画像（Image）等）から特徴量を抽出することができる特徴抽出部２０２ａ（色特徴の特徴空間ａ）が存在したとする。この場合、特徴抽出部２０２ａにより抽出された特徴量は、手書きデータから抽出した場合は手書きデータに対応する特徴量データベース２２０－１に格納され、画像データから抽出した場合は画像データに対応する特徴量データベース２２０－２に格納される。 The control unit 200 according to this embodiment also functions as a feature management unit 201 . The feature management unit 201 performs feature extraction (for example, substitution processing to an N-dimensional vector) by the feature extraction unit 202 for the entity transmitted from the information processing apparatus 10, and assigns the extracted feature quantity to the ID of the entity. A process of associating and registering in the feature amount database 220 is performed. Algorithms for feature quantity extraction can use existing techniques, and are not particularly limited here. Also, the feature extraction unit 202 can handle different modalities in a unified manner. The feature amount extracted by the feature extraction unit 202 is registered in the feature amount database 220. Here, the feature amount database 220 exists for each modality. The feature management unit 201 registers the feature amount extracted by the feature extraction unit 202 in the feature amount database 220 corresponding to the modality of the original data (the entity transmitted from the information processing apparatus 10) in association with the ID. For example, assume that there is a feature extraction unit 202a (feature space a of color features) capable of extracting feature amounts from different modalities (for example, handwriting (Strokes) and image (Image)) from the viewpoint of color features. In this case, the feature amount extracted by the feature extraction unit 202a is stored in the feature amount database 220-1 corresponding to the handwritten data when extracted from the handwritten data, and is stored in the feature amount database 220-1 corresponding to the handwritten data, and when extracted from the image data, the feature amount corresponding to the image data. Stored in quantity database 220-2.

また、特徴管理部２０１は、特徴空間管理部１０１からの要求に応じて、特徴空間を用いた検索処理（類似検索）を行うことも可能である。特徴量データベース２２０はモダリティ毎に存在するため、特徴管理部２０１は、ターゲットモダリティ（検索対象のモダリティ）に対応する特徴量データベース２２０を用いて類似検索を行えばよい。例えば特徴管理部２０１は、図２に示すように、下記のような処理を行い得る。
・Get Space ID(): string
・Register Database(modality: string, database: Feature Database)
・Get Vector(entity: object, modality: string): vector
・Add(id: string, entity: object, modality: string)
・Most Similar(query: object, modality: string, target Modality: string): Search Result[]Further, the feature management unit 201 can also perform search processing (similarity search) using the feature space in response to a request from the feature space management unit 101 . Since the feature amount database 220 exists for each modality, the feature management unit 201 may perform similarity search using the feature amount database 220 corresponding to the target modality (modality to be searched). For example, the feature management unit 201 can perform the following processing as shown in FIG.
・GetSpaceID(): string
・Register Database (modality: string, database: Feature Database)
・GetVector(entity: object, modality: string): vector
・Add(id: string, entity: object, modality: string)
・Most Similar(query: object, modality: string, target Modality: string): Search Result[]

（通信部２１０）
通信部２１０は、有線または無線により外部装置と接続し、外部装置とデータの送受信を行う。例えば通信部２１０は、有線／無線ＬＡＮ（Local Area Network）、またはＷｉ－Ｆｉ（登録商標）、Ｂｌｕｅｔｏｏｔｈ（登録商標）、携帯通信網（ＬＴＥ：Long Term Evolution、３Ｇ（第３世代の移動体通信方式））等により、ネットワーク（不図示）に接続し、ネットワークを介して情報処理装置１０とデータの送受信を行い得る。(Communication unit 210)
The communication unit 210 connects to an external device by wire or wirelessly, and transmits and receives data to and from the external device. For example, the communication unit 210 is a wired/wireless LAN (Local Area Network), Wi-Fi (registered trademark), Bluetooth (registered trademark), mobile communication network (LTE: Long Term Evolution, 3G (third generation mobile communication method)) or the like to connect to a network (not shown) and transmit/receive data to/from the information processing apparatus 10 via the network.

（特徴量データベース２２０）
特徴量データベース２２０は、特徴抽出部２０２により抽出された特徴量を蓄積する。各特徴量には、モダリティ管理部１０２により付与された一意のＩＤが関連付けられる。特徴量データベース２２０は、上述したように、モダリティ毎に存在する。(Feature database 220)
The feature amount database 220 accumulates the feature amounts extracted by the feature extraction unit 202 . A unique ID assigned by the modality management unit 102 is associated with each feature amount. The feature quantity database 220 exists for each modality as described above.

また、特徴量データベース２２０は、特徴管理サーバ２０が有する記憶部（不図示）に記憶される。特徴管理サーバ２０の記憶部は、制御部２００の処理に用いられるプログラムや演算パラメータ等を記憶するＲＯＭ、および適宜変化するパラメータ等を一時記憶するＲＡＭにより実現される。 Also, the feature amount database 220 is stored in a storage unit (not shown) of the feature management server 20 . The storage unit of the feature management server 20 is implemented by a ROM that stores programs, calculation parameters, and the like used in the processing of the control unit 200, and a RAM that temporarily stores parameters that change as appropriate.

以上、本実施形態による特徴管理サーバ２０の構成について具体的に説明した。なお図１に示す特徴管理サーバ２０の構成は一例であって、本実施形態はこれに限定されない。例えば特徴管理サーバ２０の少なくとも一部の構成が外部装置にあってもよい。 The configuration of the feature management server 20 according to this embodiment has been specifically described above. Note that the configuration of the feature management server 20 shown in FIG. 1 is an example, and the present embodiment is not limited to this. For example, at least part of the configuration of the feature management server 20 may be in an external device.

ここで、図３に、本実施例による情報処理システムの他の構成例の一例を示す。図３に示すように、例えば特徴抽出部２４０と特徴量データベース２５０を別のサーバ（特徴管理サーバ２４およびデータベースサーバ２５）でそれぞれ管理するようにしてもよい。 Here, FIG. 3 shows an example of another configuration example of the information processing system according to this embodiment. As shown in FIG. 3, for example, the feature extraction unit 240 and the feature amount database 250 may be managed by separate servers (the feature management server 24 and the database server 25).

＜＜３．動作処理＞＞
続いて、本実施形態による情報処理システムの動作処理について図面を用いて具体的に説明する。<<3. Operation processing >>
Next, operation processing of the information processing system according to this embodiment will be specifically described with reference to the drawings.

＜３－１．登録処理＞
図４は、本実施形態による情報処理システムにおけるオブジェクトの登録処理の流れの一例を示すシーケンス図である。<3-1. Registration process>
FIG. 4 is a sequence diagram showing an example of the flow of object registration processing in the information processing system according to this embodiment.

図４に示すように、まず、情報処理装置１０の特徴空間管理部１０１は、ユーザの操作入力等に基づいて登録要求を取得すると（ステップＳ１０３）、登録要求に含まれるオブジェクト（obj）と当該オブジェクトのモダリティ（mdl）の情報と共に、モダリティ管理部１０２に対して、（変換データ（entity）の）生成依頼を行う（ステップＳ１０６）。 As shown in FIG. 4, first, when the feature space management unit 101 of the information processing apparatus 10 acquires a registration request based on a user's operation input or the like (step S103), the object (obj) included in the registration request and the relevant Along with the information of the modality (mdl) of the object, the modality management unit 102 is requested to generate (converted data (entity)) (step S106).

次に、モダリティ管理部１０２は、モダリティ定義部１０３により、変換データ（entity）の生成、一意のＩＤの付与、およびobjと一意のＩＤの保存処理を行う（ステップＳ１０９）。具体的には、モダリティ定義部１０３は、モダリティの定義に従って、オブジェクトを所定の形式のデータに変換する処理（共通化した前処理）を行う。処理の具体例として、例えば以下のような例が挙げられる。
・静止画の場合：ＪＰＥＧデータをchar[3][256][256]（多次元配列）に変換し、平滑化処理を実施する。
・音声の場合：mp3データをshort型の任意長配列として読み取る。
・テキストの場合：ＨＴＭＬタグを除去し、全ての大文字を小文字に変換する（形式は変換しない）。
・手書きの場合：点列データを読み取り、char[3][256][256]の黒画像に太さ３の白線で描画する。Next, the modality management unit 102 causes the modality definition unit 103 to generate conversion data (entity), assign a unique ID, and store obj and the unique ID (step S109). Specifically, the modality definition unit 103 performs processing (common preprocessing) for converting an object into data in a predetermined format according to the modality definition. Specific examples of processing include the following examples.
・For still images: JPEG data is converted to char[3][256][256] (multidimensional array) and smoothed.
・For audio: Read mp3 data as a short type arbitrary length array.
- For text: Remove HTML tags and convert all uppercase letters to lowercase (do not convert format).
・In the case of handwriting: read the point sequence data and draw a white line with a thickness of 3 on a black image of char[3][256][256].

次いで、特徴空間管理部１０１は、モダリティ管理部１０２から、少なくともＩＤおよびentityを取得する（ステップＳ１１２）。また、モダリティ管理部１０２からは、IDとobjを保存した旨が通知されてもよい。 Next, the feature space manager 101 acquires at least the ID and entity from the modality manager 102 (step S112). Also, the modality management unit 102 may notify that the ID and obj have been saved.

次に、特徴空間管理部１０１は、取得したＩＤおよびentityに基づいて、対応する全ての特徴空間（Feature Space）に対してデータの追加（登録）要求を出力する（ステップＳ１１５）。追加要求には、ＩＤ、entity、モダリティ（mdl）が含まれる。対応する特徴空間とは、当該entityのモダリティを扱い得る特徴空間（特徴管理サーバ２０）である。なお、当該entityのモダリティを扱い得る特徴空間が複数ある場合は、特徴空間毎に、ステップＳ１１５～Ｓ１２１に示す処理を繰り返す。 Next, the feature space management unit 101 outputs a data addition (registration) request to all corresponding feature spaces based on the acquired ID and entity (step S115). The add request includes ID, entity and modality (mdl). The corresponding feature space is the feature space (feature management server 20) that can handle the modality of the entity. Note that if there are a plurality of feature spaces that can handle the modality of the entity, the processing shown in steps S115 to S121 is repeated for each feature space.

次いで、特徴管理部２０１は、特徴抽出部２０２により、特徴量の抽出を行う（ステップＳ１１８）。 Next, the feature management unit 201 causes the feature extraction unit 202 to extract feature amounts (step S118).

そして、特徴管理部２０１は、抽出された特徴量を、上記取得した一意のＩＤと共に、特徴量データベース２２０に追加（登録）する（ステップＳ１２１）。この際、特徴管理部２０１は、抽出元のentityのモダリティに対応する特徴量データベース２２０に登録する。 Then, the feature management unit 201 adds (registers) the extracted feature amount to the feature amount database 220 together with the acquired unique ID (step S121). At this time, the feature management unit 201 registers in the feature amount database 220 corresponding to the modality of the extraction source entity.

以上、本実施形態による登録処理について具体的に説明した。このように各特徴空間にデータを入力する前に、モダリティ管理部１０２において、モダリティ毎に所定の変換処理を行うことで、同じデータを特徴抽出器毎に異なるプロセスでそれぞれ前処理を行うといった手間が省け、複数の特徴空間を扱うシステムの利便性を向上させることができる。また、モダリティ管理部１０２と特徴抽出部２０２を個別に管理することで、各機能の責任が軽くなる（例えばエラー時の原因を特定し易くなる）。 The registration processing according to the present embodiment has been specifically described above. In this way, before inputting data into each feature space, the modality management unit 102 performs a predetermined conversion process for each modality. can be omitted, and the convenience of a system that handles multiple feature spaces can be improved. Also, by managing the modality management unit 102 and the feature extraction unit 202 separately, the responsibility of each function is reduced (for example, it becomes easier to identify the cause of an error).

また、例えば、ＩＤと特徴ベクトルを保管できる一般的なデータベースを提供すれば、特徴抽出器のみの開発で検索システムが完成し、システムの可用性も上がる。また、検索ＤＢ（すなわち、特徴量データベース２２０）では、ＩＤと特徴量のみ登録し、元データ（データの実体）はモダリティ定義部１０３により別で管理するため、同じデータを複数のデータベースに登録して冗長となることを回避することができる。また、複数の特徴空間にまたがって一意なＩＤを同じデータに付与することで、同じデータに対して複数のＩＤを関連付けてしまうことや、複数のデータに対して同じＩＤを関連付けてしまうこと等を回避することができる。 Also, for example, if a general database capable of storing IDs and feature vectors is provided, a search system can be completed by developing only a feature extractor, and the availability of the system can be improved. Also, in the search DB (that is, the feature amount database 220), only IDs and feature amounts are registered, and the original data (the substance of the data) is managed separately by the modality definition unit 103. Therefore, the same data can be registered in a plurality of databases. verbosity can be avoided. In addition, by assigning a unique ID to the same data across multiple feature spaces, it is possible to associate multiple IDs with the same data, or associate the same ID with multiple data. can be avoided.

また、特徴管理部２０１は、モダリティ管理部１０２で元データが保存されている場合のみ特徴量データベース２２０に登録するようにしてもよい。これにより、後述する検索結果取得の際に特徴空間管理部１０１がＩＤから元のデータを取り出せる保証がなされる。 Also, the feature manager 201 may register in the feature quantity database 220 only when the original data is stored in the modality manager 102 . This guarantees that the feature space management unit 101 can retrieve the original data from the ID when obtaining a search result, which will be described later.

また、所定のデータ形式への前処理が別で管理されるため、特徴空間（検索ＤＢシステム）の開発側としては、入力形式を気にすることなく特徴抽出部２０２の開発を行うことができる。 In addition, since the preprocessing to a predetermined data format is managed separately, the feature space (search DB system) development side can develop the feature extraction unit 202 without worrying about the input format. .

また、本実施形態による登録処理は、図４に示す例に限定されない。例えば、上記ステップＳ１１５では、モダリティに対応する特徴空間（特徴管理サーバ２０）に追加指示を行う旨を説明したが、本実施形態はこれに限定されず、特徴空間管理部１０１は、全ての特徴空間（特徴管理サーバ２０）に追加指示を行ってもよい。この場合、特徴空間（特徴管理サーバ２０）側で、モダリティに基づき、処理可能なentityであるか否かを判断し得る。 Further, the registration process according to this embodiment is not limited to the example shown in FIG. For example, in the above step S115, it has been explained that an additional instruction is given to the feature space (feature management server 20) corresponding to the modality, but the present embodiment is not limited to this, and the feature space management unit 101 allows all feature An additional instruction may be given to the space (feature management server 20). In this case, the feature space (feature management server 20) side can determine whether or not the entity can be processed based on the modality.

＜３－２．検索処理＞
続いて、上述したように特徴空間を構築する本実施形態による情報処理システムにおける検索処理について、図５を参照して説明する。図５は、本実施形態による情報処理システムにおける検索処理の流れの一例を示すシーケンス図である。<3-2. Search processing>
Next, search processing in the information processing system according to this embodiment, which constructs the feature space as described above, will be described with reference to FIG. FIG. 5 is a sequence diagram showing an example of the flow of search processing in the information processing system according to this embodiment.

図５に示すように、まず、情報処理装置１０の特徴空間管理部１０１は、ユーザの操作入力等に基づいて検索要求を取得する（ステップＳ１３３）。検索要求には、オブジェクト（obj）と、当該オブジェクトのモダリティ（mdl1）と、検索対象のモダリティを示すターゲットモダリティ（mdl2）とが含まれる。 As shown in FIG. 5, first, the feature space management unit 101 of the information processing apparatus 10 acquires a search request based on user's operation input or the like (step S133). The search request includes the object (obj), the modality (mdl1) of the object, and the target modality (mdl2) indicating the modality to be searched.

次いで、特徴空間管理部１０１は、検索要求に含まれるオブジェクト（obj）と当該オブジェクトのモダリティ（mdl1）の情報と共に、モダリティ管理部１０２に対して、（変換データ（entity）の）生成依頼を行う（ステップＳ１３６）。 Next, the feature space management unit 101 requests the modality management unit 102 to generate (converted data (entity)) together with the information of the object (obj) and the modality (mdl1) of the object included in the search request. (Step S136).

次に、モダリティ管理部１０２は、モダリティ定義部１０３により、変換データ（entity）の生成、および一意のＩＤの付与を行う（ステップＳ１３９）。具体的には、モダリティ定義部１０３は、モダリティの定義に従って、オブジェクトを所定の形式のデータに変換する処理（共通化した前処理）を行う。 Next, the modality management unit 102 causes the modality definition unit 103 to generate conversion data (entity) and assign a unique ID (step S139). Specifically, the modality definition unit 103 performs processing (common preprocessing) for converting an object into data in a predetermined format according to the modality definition.

次いで、特徴空間管理部１０１は、モダリティ管理部１０２から、ＩＤおよびentityを取得する（ステップＳ１４２）。 Next, the feature space manager 101 acquires the ID and entity from the modality manager 102 (step S142).

次に、特徴空間管理部１０１は、取得したentityに基づいて、対応する全ての特徴空間（Feature Space）に対して検索要求を出力する（ステップＳ１４５）。検索要求には、entity、mdl1（元データのモダリティ）、mdl2（ターゲットモダリティ）が含まれる。対応する特徴空間とは、mdl1およびmdl2を扱い得る特徴空間（特徴管理サーバ２０）である。なお、mdl1およびmdl2を扱い得る特徴空間が複数ある場合は、特徴空間毎に、ステップＳ１４５～Ｓ１５７に示す処理を繰り返す。 Next, the feature space management unit 101 outputs a search request to all corresponding feature spaces based on the acquired entity (step S145). The search request includes entity, mdl1 (original data modality), and mdl2 (target modality). The corresponding feature space is the feature space (feature management server 20) that can handle mdl1 and mdl2. Note that if there are a plurality of feature spaces that can handle mdl1 and mdl2, the processing shown in steps S145 to S157 is repeated for each feature space.

次いで、特徴管理部２０１は、特徴抽出部２０２により、特徴量の抽出を行う（ステップＳ１４８）。 Next, the feature management unit 201 causes the feature extraction unit 202 to extract feature amounts (step S148).

続いて、特徴管理部２０１は、抽出された特徴量に基づいて、特徴量データベース２２０から、類似している特徴量の検索を行う（ステップＳ１５１）。この際、特徴管理部２０１は、要求されたターゲットモダリティ（mdl2）に対応する特徴量データベース２２０から検索する。特徴量データベース２２０では、特徴量に、上記一意のＩＤが関連付けられており、特徴管理部２０１は、検索要求されたentityの特徴量と類似する特徴量を特徴量データベース２２０から検索し、類似する特徴量に関連付けられたＩＤと、当該特徴量の類似度：sim（検索要求されたentityの特徴量との類似度であって、例えばＮ次元ベクトルの距離）を取得する。なお、ターゲットモダリティ（mdl2）が複数ある場合は、特徴量データベース２２０毎に、ステップＳ１５１～Ｓ１５４に示す処理を繰り返す。 Subsequently, the feature management unit 201 searches for similar feature amounts from the feature amount database 220 based on the extracted feature amounts (step S151). At this time, the feature manager 201 searches the feature amount database 220 corresponding to the requested target modality (mdl2). In the feature amount database 220, the feature amount is associated with the above-mentioned unique ID. An ID associated with a feature amount and the similarity of the feature amount: sim (similarity between the feature amount of the requested entity and, for example, the distance of an N-dimensional vector) are obtained. Note that if there are a plurality of target modalities (mdl2), the processing shown in steps S151 to S154 is repeated for each feature amount database 220. FIG.

次に、特徴空間管理部１０１は、特徴管理部２０１から、検索結果（検索した特徴量のＩＤ、検索したモダリティ：mdl、および検索した特徴量の類似度：sim）を取得する（ステップＳ１５７）。検索結果には、複数のID、mdl、およびsimが含まれていてもよい。検索結果として単数が求められている場合、特徴空間管理部１０１は、例えば類似度が最も高い特徴量のＩＤを特定する。検索結果として所定数が求められている場合、特徴空間管理部１０１は、例えば類似度に基づいて上位所定数の特徴量のＩＤを特定する。 Next, the feature space management unit 101 acquires the search result (the ID of the retrieved feature amount, the retrieved modality: mdl, and the similarity of the retrieved feature amount: sim) from the feature management unit 201 (step S157). . Search results may include multiple ids, mdl, and sim. When a singular number is obtained as a search result, the feature space management unit 101 identifies, for example, the ID of the feature quantity with the highest similarity. When a predetermined number are obtained as search results, the feature space management unit 101 identifies IDs of a predetermined number of top feature amounts based on similarity, for example.

次いで、特徴空間管理部１０１は、特定したＩＤおよび対応するモダリティの情報と共に、モダリティ管理部１０２に対して元データの要求を行う（ステップＳ１６０）。 Next, the feature space management unit 101 requests the modality management unit 102 for the original data together with the identified ID and the corresponding modality information (step S160).

次に、モダリティ管理部１０２は、モダリティ定義部１０３により、ＩＤに関連付けられた元データ（すなわち、オブジェクト）を取得し（ステップＳ１６３）、特徴空間管理部１０１に出力する（ステップＳ１６６）。かかるステップＳ１６０～Ｓ１６６に示す処理は、出力する検索結果数分行い得る。元データの取得ができなかった場合、特徴空間管理部１０１は、レコード（ID,mdl,sim）の削除を行う。 Next, the modality management unit 102 acquires the original data (that is, the object) associated with the ID by the modality definition unit 103 (step S163), and outputs it to the feature space management unit 101 (step S166). The processing shown in steps S160 to S166 can be performed for the number of search results to be output. If the original data cannot be acquired, the feature space management unit 101 deletes the record (ID, mdl, sim).

そして、特徴空間管理部１０１は、検索結果（オブジェクト、モダリティ、および類似度）を、検索要求元に出力する（ステップＳ１６９）。例えば特徴空間管理部１０１は、検索結果を示す画面を、出力部１３０で表示し、ユーザに提示してもよい。 Then, the feature space management unit 101 outputs the search result (object, modality, and similarity) to the search request source (step S169). For example, the feature space management unit 101 may display a screen showing search results on the output unit 130 and present it to the user.

以上、本実施形態による検索処理について具体的に説明した。このように、本実施形態により構築した特徴空間を、異なるモダリティを扱う複数の特徴空間における横断的な検索に利用することができる。 The search processing according to the present embodiment has been specifically described above. In this way, the feature space constructed according to this embodiment can be used for cross-sectional searches in a plurality of feature spaces dealing with different modalities.

上記最後の例に示したように、検索に用いる特徴空間（検索ＤＢ）を特定してもよい。特定した特徴空間は、space IDとして、上記ステップＳ１１３の検索要求に含まれる。例えば、「○○社が作成した検索ＤＢを利用したい」、「○○検索サイトを利用したい」等が想定される。 As shown in the last example above, the feature space (search DB) used for searching may be specified. The specified feature space is included in the search request in step S113 as a space ID. For example, "I want to use the search DB created by XX company", "I want to use the XX search site", etc. are assumed.

ここで、図６に、本実施形態における検索画面の一例を示す。図６に示す検索画面３０は、例えば出力部１３０で提示される。ユーザは、検索オブジェクト３０１を入力し、検索対象を選択し（モダリティに相当。例えば、「写真」、「イラスト」、「書類」等）、何が似ているものを検索したいのかその特徴量を選択し（例えば、「形」、「色」、「意味」等であって、各特徴空間に相当する。例えば、形の特徴に基づいて構築された特徴空間、色の特徴に基づいて構築された特徴空間等である）、検索ボタン３０２を選択すると、検索結果として取得された検索オブジェクト３０１に似ているオブジェクトが提示される。例えば検索対象として「イラスト」を選択し、特徴量として「形」（space ID1）、「色」（space ID2）、および「意味」（space ID3）を選択した場合、検索結果として、検索オブジェクト３０１と、形、色、および／または意味が似ているイラストが取得され（例えば、形の特徴に基づいて構築された特徴空間を扱う特徴管理サーバ２０が保有する「モダリティ：イラスト」の特徴量データベース２２０から検索され）、提示される。特徴量の検索条件のand/orはユーザが任意に選択できるようにしてもよいし、orをデフォルトにしてもよい。 Here, FIG. 6 shows an example of the search screen in this embodiment. The search screen 30 shown in FIG. 6 is presented by the output unit 130, for example. A user inputs a search object 301, selects a search target (corresponding to a modality; for example, "photograph", "illustration", "document", etc.), and specifies the feature amount of what is similar to what he/she wants to search. Select (e.g., 'shape', 'color', 'meaning', etc., corresponding to each feature space. For example, feature space constructed based on shape features, color feature ), and when a search button 302 is selected, an object similar to the search object 301 obtained as a search result is presented. For example, if "illustration" is selected as a search target, and "shape" (space ID1), "color" (space ID2), and "meaning" (space ID3) are selected as feature quantities, a search object 301 , illustrations similar in shape, color, and/or meaning are obtained (for example, the feature amount database of "modality: illustration" held by the feature management server 20 that handles feature spaces constructed based on the features of the shape 220) and presented. The user may arbitrarily select and/or of the search condition of the feature amount, or the default may be set to or.

＜＜４．応用例＞＞
続いて、本実施形態による情報処理システムの応用例について説明する。<<4. Application example >>
Next, an application example of the information processing system according to this embodiment will be described.

＜４－１．第１の応用例：モダリティの包含関係の定義＞
まず、第１の応用例として、モダリティの包含関係の定義について説明する。本実施形態によるモダリティ定義部１０３は、モダリティ同士に親子関係（包含関係）を定義してもよい。具体例として下記が挙げられる。<4-1. First Application Example: Definition of Inclusive Relationship of Modalities>
First, as a first application example, the definition of the inclusion relationship of modalities will be described. The modality definition unit 103 according to this embodiment may define a parent-child relationship (inclusive relationship) between modalities. Specific examples include the following.

・モダリティ「ＲＧＢ画像」は、モダリティ「グレースケール画像」を子として持つ
・モダリティ「メール」は、モダリティ「テキスト」、「ユーザ」、「日付」を子として持つ・The modality “RGB image” has the modality “grayscale image” as a child. ・The modality “mail” has the modalities “text”, “user”, and “date” as children.

本応用例によれば、新規にモダリティを定義する際、子モダリティを定義すれば、容易に既存の特徴空間に組込むことができる。また、新しいモダリティに対しても、複数の特徴抽出器（特徴空間、特徴抽出部２０２）を組み合わせることによって特徴抽出を行うことが可能となる。 According to this application example, when defining a new modality, if a child modality is defined, it can be easily incorporated into an existing feature space. Also, for new modalities, feature extraction can be performed by combining a plurality of feature extractors (feature space, feature extraction unit 202).

例えば、既存モダリティを含むモダリティを定義するユースケースが想定される。より具体的には、“テキスト”というモダリティが既に存在し、“テキスト”を扱う特徴空間Ａ（特徴抽出部２０２Ａ）があるとする。ここに、テキスト（本文）とユーザ（送信者）を含む“メール”というモダリティと、“メール”を扱える特徴空間Ｂ（特徴抽出部２０２Ｂ）を追加した場合を想定する。この場合、第１の効果として、「同時に複数の特徴空間に登録できる」ということが挙げられる。すなわち、メールからはテキストが取得できるため、同じＩＤとオブジェクトのペアで、特徴空間Ｂだけでなく、特徴空間Ａにも登録できる。すなわち、特徴空間Ａには、テキストだけに着目した場合の特徴量が、特徴空間Ｂには、テキストとユーザに着目した場合の特徴量が格納される。また、他のテキストと横断的に検索が可能となる（この場合、ＩＤは、同じモダリティだけではなく、全てのモダリティにまたがって一意なＩＤを付与する必要がある）。 For example, consider a use case that defines modalities that include existing modalities. More specifically, it is assumed that the modality "text" already exists and there is a feature space A (feature extraction unit 202A) that handles "text". It is assumed here that a modality "email" including text (text) and a user (sender) and a feature space B (feature extraction unit 202B) capable of handling "email" are added. In this case, the first effect is that "it can be registered in a plurality of feature spaces at the same time". That is, since the text can be obtained from the mail, the same ID/object pair can be registered not only in the feature space B but also in the feature space A. That is, the feature space A stores the feature amount when focusing only on the text, and the feature space B stores the feature amount when focusing on the text and the user. In addition, it is possible to search across other texts (in this case, it is necessary to assign a unique ID across all modalities, not just the same modality).

また、第２の効果として、「既存の特徴抽出器（特徴抽出部２０２）を再利用することで、容易に新規の特徴抽出器（特徴抽出部２０２）を構築できる」ということが挙げられる。すなわち、特徴空間Ｂは、特徴空間Ａを利用してテキストの特徴抽出を行うことができるため、特徴空間Ｂではユーザの特徴抽出のみを行えばよく、実装が容易となる。また、既存の特徴抽出器をモジュール化して利用することも可能である。図７は、特徴抽出器のモジュール化について説明する図である。 A second effect is that "a new feature extractor (feature extraction unit 202) can be easily constructed by reusing an existing feature extractor (feature extraction unit 202)." That is, since the feature space B can use the feature space A to extract the feature of the text, only the feature of the user needs to be extracted in the feature space B, which facilitates implementation. It is also possible to use an existing feature extractor as a module. FIG. 7 is a diagram explaining modularization of the feature extractor.

図７左に示すように、例えばメールの特徴抽出を行う際には、メールと包含関係が定義されている文章、およびユーザといったモダリティをそれぞれ扱うことが可能な各特徴抽出器を用いて各モダリティの特徴量を抽出することで、図７右に示すように、文章特徴量（内容）、およびユーザ特徴量（送信者）を含むメール特徴量を取得することが可能となる。 As shown on the left side of FIG. By extracting the feature amount of , it is possible to obtain the mail feature amount including the text feature amount (content) and the user feature amount (sender), as shown in the right side of FIG.

以下、本応用例における登録処理と検索処理について、図８および図９を参照してそれぞれ順次説明する。 Registration processing and search processing in this application example will be sequentially described below with reference to FIGS. 8 and 9, respectively.

（モダリティの包含関係の定義を考慮した登録処理）
図８は、本実施形態による第１の応用例の登録処理の一例を示すシーケンス図である。(Registration processing considering definition of modality inclusion relationship)
FIG. 8 is a sequence diagram showing an example of registration processing of the first application example according to this embodiment.

図８に示すように、まず、情報処理装置１０の特徴空間管理部１０１は、ユーザの操作入力等に基づいて登録要求を取得すると（ステップＳ２０３）、登録要求に含まれるオブジェクト（obj）と当該オブジェクトのモダリティ（mdl）の情報（例えば、「Mail」）と共に、モダリティ管理部１０２に対して、（entityの）生成依頼を行う（ステップＳ２０６）。 As shown in FIG. 8, first, when the feature space management unit 101 of the information processing apparatus 10 acquires a registration request based on a user's operation input or the like (step S203), the object (obj) included in the registration request and the relevant Along with the information of the modality (mdl) of the object (for example, "Mail"), the modality management unit 102 is requested to generate (entity) (step S206).

次に、モダリティ管理部１０２は、モダリティ定義部１０３により、変換データ（entity）の生成、一意のＩＤの付与、およびobjと一意のＩＤの保存処理を行うと共に、objのモダリティと包含関係を有するモダリティ（sub mdl）の定義に基づいて、sub entityの生成を行う（ステップＳ２０９）。例えばオブジェクトのモダリティが「Mail」であって、これと包含関係を有するモダリティ（sub mdl）が「Text」の場合、モダリティ定義部１０３は、「Text」の定義に従って、メールデータ（obj）のうちテキストのデータを所定のデータ形式に変換し、sub entityとして生成する処理を行う。 Next, the modality management unit 102 uses the modality definition unit 103 to generate conversion data (entity), assign a unique ID, and store obj and the unique ID. A sub entity is generated based on the definition of the modality (sub mdl) (step S209). For example, if the modality of the object is "Mail" and the modality (sub mdl) having an inclusion relationship with it is "Text", the modality definition unit 103, according to the definition of "Text", It converts text data into a predetermined data format and performs processing to generate it as a sub entity.

次いで、特徴空間管理部１０１は、モダリティ管理部１０２から、少なくともＩＤ、entity、およびsub entityを取得する（ステップＳ２１２）。また、モダリティ管理部１０２からは、IDとobjを保存した旨が通知されてもよい。 Next, the feature space manager 101 acquires at least the ID, entity, and sub entity from the modality manager 102 (step S212). Also, the modality management unit 102 may notify that the ID and obj have been saved.

次に、特徴空間管理部１０１は、取得したＩＤおよびentity（例えばMail Entity）に基づいて、対応する全ての特徴空間（例えば特徴空間Ｂ）に対してデータの追加（登録）要求を出力する（ステップＳ２１５）。続くステップＳ２１８～Ｓ２２１に示す特徴量の抽出に関する処理については、図４に示すステップＳ１１８～Ｓ１２１と同様であるため、詳細な説明は省略するが、例えばメールを扱う特徴空間Ｂには、メールの特徴量（Mail Vector）を登録するが、この際、メールの特徴量のうち、テキスト（Text Vector）については、次に説明するテキストを扱う特徴空間ＡのGet Vector（ステップＳ２２７）を利用するようにしてもよい。 Next, the feature space management unit 101 outputs a data addition (registration) request to all corresponding feature spaces (eg, feature space B) based on the acquired ID and entity (eg, Mail Entity) ( step S215). The processing related to extraction of the feature amount shown in subsequent steps S218 to S221 is the same as steps S118 to S121 shown in FIG. 4, so a detailed description will be omitted. The feature quantity (Mail Vector) is registered. At this time, for the text (Text Vector) among the features of the mail, the Get Vector (step S227) of the feature space A that handles the text, which will be described below, is used. can be

特徴空間管理部１０１は、同ＩＤおよびsub entity（例えばText Entity）について、対応する全ての特徴空間（例えば特徴空間Ａ）に対してデータの追加（登録）要求を出力する（ステップＳ２２４～２３０）。特徴空間Ａは、テキストのみに対応した特徴空間であり、Text Entityから抽出したテキストの特徴量（Text Vector）が登録される。 The feature space management unit 101 outputs a data addition (registration) request to all feature spaces (for example, feature space A) corresponding to the same ID and sub entity (for example, Text Entity) (steps S224 to 230). . Feature space A is a feature space that corresponds only to text, and a text feature quantity (Text Vector) extracted from Text Entity is registered.

このように、本変形例では、特徴量の抽出において、包含関係を有する特徴空間を利用することができると共に、当該特徴空間にも特徴量を登録することが可能となる。 As described above, in this modified example, it is possible to use a feature space having an inclusive relationship in extracting feature amounts, and to register feature amounts in the feature space as well.

（モダリティの包含関係の定義を考慮した検索処理）
続いて、本変形例による検索処理について図９を参照して説明する。図９は、本実施形態による第１の応用例の検索処理の一例を示すシーケンス図である。(Search processing considering definition of modality inclusion relationship)
Next, search processing according to this modification will be described with reference to FIG. FIG. 9 is a sequence diagram showing an example of search processing of the first application example according to this embodiment.

図９に示すように、まず、情報処理装置１０の特徴空間管理部１０１は、ユーザの操作入力等に基づいて検索要求を取得する（ステップＳ２４３）。検索要求には、オブジェクト（obj）と、当該オブジェクトのモダリティ（mdl1）と、検索対象のモダリティを示すターゲットモダリティ（mdl2）とが含まれる。ここでは、例えばmdl1＝Mail、mdl2=Textとする。 As shown in FIG. 9, first, the feature space management unit 101 of the information processing apparatus 10 acquires a search request based on the user's operation input (step S243). The search request includes the object (obj), the modality (mdl1) of the object, and the target modality (mdl2) indicating the modality to be searched. Here, for example, mdl1=Mail and mdl2=Text.

次いで、特徴空間管理部１０１は、検索要求に含まれるオブジェクト（obj）と当該オブジェクトのモダリティ（mdl1）の情報と共に、モダリティ管理部１０２に対して、（entityの）生成依頼を行う（ステップＳ２４６）。 Next, the feature space management unit 101 requests the modality management unit 102 to generate (entity) together with the information of the object (obj) and the modality (mdl1) of the object included in the search request (step S246). .

次に、モダリティ管理部１０２は、モダリティ定義部１０３により、変換データ（entity）の生成、および一意のＩＤの付与を行うと共に、objのモダリティと包含関係を有するモダリティ（sub mdl）の定義に基づいて、sub entityの生成を行う（ステップＳ２４９）。例えばオブジェクトのモダリティが「Mail」であって、これと包含関係を有するモダリティ（sub mdl）が「Text」の場合、モダリティ定義部１０３は、「Text」の定義に従って、メールデータ（obj）のうちテキストのデータを所定のデータ形式に変換し、sub entityとして生成する処理を行う。 Next, the modality management unit 102 causes the modality definition unit 103 to generate conversion data (entity), assign a unique ID, and based on the definition of a modality (sub mdl) having an inclusion relationship with the modality of obj. Then, a sub entity is generated (step S249). For example, if the modality of the object is "Mail" and the modality (sub mdl) having an inclusion relationship with it is "Text", the modality definition unit 103, according to the definition of "Text", It converts text data into a predetermined data format and performs processing to generate it as a sub entity.

次いで、特徴空間管理部１０１は、モダリティ管理部１０２から、ＩＤ、entity、およびsub entityを取得する（ステップＳ２５２）。 Next, the feature space manager 101 acquires the ID, entity, and sub entity from the modality manager 102 (step S252).

次に、特徴空間管理部１０１は、取得したentity（例えばMail Entity）に基づいて、対応する全ての特徴空間（Feature Space）に対して検索要求を出力する（ステップＳ２５５）。検索要求には、entity、mdl1（元データのモダリティ、例えばMail）、mdl2（ターゲットモダリティ、例えばText）が含まれる。対応する特徴空間とは、mdl1およびmdl2を扱い得る特徴空間（例えば、メールとテキスト双方に対応した特徴空間）である。 Next, the feature space management unit 101 outputs a search request to all corresponding feature spaces based on the acquired entity (for example, Mail Entity) (step S255). The search request includes entity, mdl1 (original data modality, such as Mail), and mdl2 (target modality, such as Text). A corresponding feature space is a feature space that can handle mdl1 and mdl2 (for example, a feature space that supports both mail and text).

続くステップＳ２５８～Ｓ２６７に示す特徴量の抽出に関する処理については、図５に示すステップＳ１４８～Ｓ１５７と同様であるため、詳細な説明は省略する。ここで、対応する特徴空間が存在しない場合も想定される。例えば、メールを扱う特徴空間Ｂと、テキストを扱う特徴空間Ａが存在する場合、いずれも上記メールとテキストの双方を扱う特徴空間ではないため、検索結果は返されないが、次に説明するsub mdlを用いた場合には、特徴空間Ａから検索結果が返され得る。 Processing related to the extraction of the feature quantity shown in subsequent steps S258 to S267 is the same as steps S148 to S157 shown in FIG. 5, so detailed description thereof will be omitted. Here, it is assumed that there is no corresponding feature space. For example, if there are a feature space B that handles email and a feature space A that handles text, neither feature space handles both email and text, so no search results are returned. , the search results may be returned from feature space A.

特徴空間管理部１０１は、同ＩＤおよびsub entity（例えばText Entity）に基づいて、対応する全ての特徴空間（Feature Space）に対して検索要求を出力する（ステップＳ２７０）。検索要求には、sub entity（例えばText Entity）、mdl1（sub mdl、例えばText）、mdl2（ターゲットモダリティ、例えばText）が含まれる。対応する特徴空間とは、mdl1およびmdl2を扱い得る特徴空間、ここではmdl1およびmdl2が同じ「Text」であるため、テキストに対応した特徴空間Ａが相当する。特徴空間Ａにおいて検索が行われ（ステップＳ２７３～Ｓ２７９）、特徴空間管理部１０１は、特徴管理部２０１から、検索結果を取得する（ステップＳ２８２）。 The feature space management unit 101 outputs a search request to all corresponding feature spaces based on the same ID and sub entity (for example, Text Entity) (step S270). The search request includes sub entity (eg Text Entity), mdl1 (sub mdl, eg Text), mdl2 (target modality, eg Text). The corresponding feature space corresponds to a feature space that can handle mdl1 and mdl2, here, since mdl1 and mdl2 are the same "Text", feature space A corresponding to text corresponds. A search is performed in the feature space A (steps S273 to S279), and the feature space management unit 101 acquires search results from the feature management unit 201 (step S282).

続いて、特徴空間管理部１０１は、上述した図５に示すステップＳ１６０～Ｓ１６９と同様に、取得したＩＤおよび対応するモダリティ（例えば、Text）と共に、モダリティ管理部１０２に対して元データの要求を行い（ステップＳ２８５）、モダリティ定義部１０３によりＩＤに基づいて取得されたオブジェクトが（ステップＳ２８８）、モダリティ管理部１０２から特徴空間管理部１０１に出力される（ステップＳ２９１）。 Subsequently, the feature space management unit 101 requests the modality management unit 102 for the original data together with the acquired ID and the corresponding modality (for example, Text) in the same manner as in steps S160 to S169 shown in FIG. (step S285), the object acquired by the modality definition unit 103 based on the ID (step S288) is output from the modality management unit 102 to the feature space management unit 101 (step S291).

そして、特徴空間管理部１０１は、検索結果（オブジェクト、モダリティ、および類似度）を、検索要求元に出力する（ステップＳ２９４）。 Then, the feature space management unit 101 outputs the search result (object, modality, and similarity) to the search request source (step S294).

以上、本応用例によるモダリティの包含関係を考慮した検索処理について具体的に説明した。 The search processing in consideration of the inclusion relationship of modalities according to this application example has been specifically described above.

＜４－２．第２の応用例：検索結果のマージ＞
次に、第２の応用例として、検索結果のマージについて説明する。特徴空間管理部１０１は、各特徴抽出器からの検索結果の類似度と重み付けに基づいて、検索結果を再評価した上で、検索要求元に最終的な検索結果を出力することが可能である。重み付けとは、例えば特徴空間の重み付けである。かかる重み付けは、検索要求元（例えばユーザ）が任意に設定することも可能である。<4-2. Second Application Example: Merging of Search Results>
Next, merging of search results will be described as a second application example. The feature space management unit 101 can re-evaluate the search results based on the similarity and weighting of the search results from each feature extractor, and then output the final search results to the search requester. . Weighting is, for example, weighting of feature space. Such weighting can be arbitrarily set by the search requester (for example, the user).

図１０は、本応用例における検索画面の一例を示す図である。図１０に示すように、検索画面３２には、検索オブジェクト３２１と、検索対象の選択領域３２２と、何が似ているものを検索したいのかその特徴量を選択する領域３２３と、検索ボタン３２６が表示されている。特徴量を選択する領域３２３では、スライドバー３２４を操作して、選択した特徴量の重み付けを設定することが可能である。例えば、「形特徴」と「色特徴」のうち「色特徴」を優先したい場合は、スライドバー３２４の操作部３２５を「色特徴」の方に動かす。これにより、例えばシステム側で、以下のように重み付け（w）を設定する。ここで、色の特徴空間：space1、形の特徴空間：space2とする。
w( weights)＝{ space1: 0.8, space2: 0.2}FIG. 10 is a diagram showing an example of a search screen in this application example. As shown in FIG. 10, the search screen 32 includes a search object 321, a search target selection area 322, an area 323 for selecting a similar feature amount to search for, and a search button 326. is displayed. In the area 323 for selecting the feature amount, it is possible to operate the slide bar 324 to set the weighting of the selected feature amount. For example, if it is desired to give priority to the "color feature" out of the "shape feature" and the "color feature", the operation section 325 of the slide bar 324 is moved to the "color feature". Accordingly, for example, the system side sets the weighting (w) as follows. Here, the color feature space: space1 and the shape feature space: space2.
w(weights)＝{space1: 0.8, space2: 0.2}

この場合、図１０に示すように、「色特徴」を優先した検索結果（色が似ているイラストが優先された検索結果）が表示される。 In this case, as shown in FIG. 10, a search result giving priority to "color characteristics" (search results giving priority to illustrations with similar colors) is displayed.

なお、特徴空間の重み付けの設定は、図１０に示す例に限定されず、例えば検索結果からユーザが選択したものに基づいて重み付けを設定し、再度検索結果を提示するようにしてもよい。図１１に一例を示す。例えば、図１１の検索画面３４に提示された検索結果のうち、イラスト３４１が選択されると、システムは、イラスト３４１が、ユーザの意図に近い結果であったとして、イラスト３４１を検索結果として出力した特徴空間（特徴抽出器、すなわち特徴抽出部２０２）を優先するよう重み付けを設定し、再度検索結果を提示するようにしてもよい。 Note that the setting of the weighting of the feature space is not limited to the example shown in FIG. 10. For example, the weighting may be set based on what the user selects from the search results, and the search results may be presented again. An example is shown in FIG. For example, when the illustration 341 is selected from among the search results presented on the search screen 34 of FIG. 11, the system outputs the illustration 341 as the search result, assuming that the illustration 341 is a result close to the user's intention. A weighting may be set so as to give priority to the feature space (feature extractor, that is, the feature extraction unit 202), and the search result may be presented again.

（動作処理）
次に、本応用例の動作処理について図１２を参照して説明する。図１２は、第２の応用例の検索処理の一例を示すシーケンス図である。(operation processing)
Next, operation processing of this application example will be described with reference to FIG. FIG. 12 is a sequence diagram showing an example of search processing of the second application example.

図１２に示すように、まず、情報処理装置１０の特徴空間管理部１０１は、ユーザの操作入力等に基づいて検索要求を取得する（ステップＳ３０３）。検索要求には、オブジェクト（obj）と、当該オブジェクトのモダリティ（mdl1）と、検索対象のモダリティを示すターゲットモダリティ（mdl2）と、特徴空間の重み付け（ｗ）が含まれる。 As shown in FIG. 12, first, the feature space management unit 101 of the information processing apparatus 10 acquires a search request based on the user's operation input (step S303). The search request includes the object (obj), the modality (mdl1) of the object, the target modality (mdl2) indicating the modality to be searched, and the weighting (w) of the feature space.

続くステップＳ３１５～Ｓ３２１では、上述した図５のステップＳ１４５～１５７に示す処理と同様の検索処理が行われるため、ここでの詳細な説明は省略する。なお、ステップＳ３１８では、図５のステップＳ１４８～Ｓ１５４に示す処理と同様の処理が行われるが、詳細な図示は省略している。 In subsequent steps S315 to S321, search processing similar to the processing shown in steps S145 to S157 in FIG. 5 described above is performed, so detailed description thereof will be omitted here. In step S318, the same processing as that shown in steps S148 to S154 in FIG. 5 is performed, but detailed illustration is omitted.

次に、特徴空間管理部１０１は、検索結果の類似度と重み付けに応じて、検索結果の順位付け（再評価）を行う（ステップＳ３２４）。具体的には、例えば特徴空間管理部１０１は、検索結果の類似度と、当該検索結果を出力した特徴空間（特徴抽出器、すなわち特徴抽出部２０２）の重みとを乗算し、新たな類似度を算出した上で、再評価を行い得る。下記表１に、再評価の一例を示す。ここで、w (weights)= {space1: 0.8, space2: 0.2}とする。 Next, the feature space management unit 101 ranks (re-evaluates) the search results according to the degree of similarity and weighting of the search results (step S324). Specifically, for example, the feature space management unit 101 multiplies the similarity of the search result by the weight of the feature space (feature extractor, that is, the feature extraction unit 202) that outputs the search result, and obtains a new similarity can be calculated and re-evaluated. Table 1 below shows an example of re-evaluation. Here, w (weights) = {space1: 0.8, space2: 0.2}.

上記表１に示すように、例えば検索結果であるオブジェクトＡが、第１の特徴空間(space1)から検索された際の類似度（sim(space1)：0.9）に第１の特徴空間の重み（space1:0.8）を乗算した値と、第２の特徴空間(space2)から検索された際の類似度（sim(space2)：0.3）に第２の特徴空間の重み（space2:0.2）を乗算した値とを加算した値（sim(new)：0.78）が、新たな類似度として算出される。同じデータに関連付くＩＤが複数の特徴空間に登録されている場合も考えられるためである。また、検索結果が１つの特徴空間からのみ検索された場合も想定される。この場合、上記表１のオブジェクトＣの例のように、例えばオブジェクトＣが、第２の特徴空間(space2)から検索された際の類似度（sim(space2)：0.9）に第２の特徴空間の重み（space2:0.2）を乗算した値（sim(new)：0.18）が、新たな類似度として算出される。 As shown in Table 1 above, for example, the similarity (sim(space1): 0.9) when object A, which is the retrieval result, is retrieved from the first feature space (space1) and the weight of the first feature space ( space1: 0.8) and the similarity when searched from the second feature space (space2) (sim(space2): 0.3) multiplied by the weight of the second feature space (space2: 0.2) value (sim(new): 0.78) is calculated as a new degree of similarity. This is because IDs associated with the same data may be registered in multiple feature spaces. It is also assumed that search results are obtained from only one feature space. In this case, as in the example of object C in Table 1 above, for example, when object C is retrieved from the second feature space (space2), the similarity (sim(space2): 0.9) to the second feature space A value (sim(new): 0.18) obtained by multiplying the weight (space2: 0.2) is calculated as a new degree of similarity.

特徴空間管理部１０１は、新たな類似度に基づいて、例えば上位所定数の検索結果（ＩＤ）を特定する。 Based on the new degree of similarity, the feature space management unit 101 identifies, for example, a predetermined number of top search results (IDs).

次いで、特徴空間管理部１０１は、上述した図５に示すステップＳ１６０～Ｓ１６９と同様に、特定したＩＤおよび対応するモダリティと共に、モダリティ管理部１０２に対して元データの要求を行い（ステップＳ３２７）、モダリティ定義部１０３によりＩＤに基づいて取得されたオブジェクトが（ステップＳ３３０）、モダリティ管理部１０２から特徴空間管理部１０１に出力される（ステップＳ３３３）。 Next, the feature space management unit 101 requests original data from the modality management unit 102 together with the specified ID and the corresponding modality, similarly to steps S160 to S169 shown in FIG. 5 (step S327). The object acquired based on the ID by the modality definition unit 103 (step S330) is output from the modality management unit 102 to the feature space management unit 101 (step S333).

そして、特徴空間管理部１０１は、検索結果（オブジェクト、モダリティ、および類似度）を、検索要求元に出力する（ステップＳ３３６）。 Then, the feature space management unit 101 outputs the search result (object, modality, and similarity) to the search request source (step S336).

以上、本応用例による検索結果のマージについて具体的に説明した。 The merging of search results according to this application example has been specifically described above.

＜４－３．第３の応用例：サジェストシステム＞
次に、第３の応用例として、サジェストシステムについて図１３～図１５を参照して説明する。サジェストシステムは、複数のアプリケーションが動作するシステム上で用いることで、各アプリケーションにおけるユーザの操作情報（閲覧しているコンテンツや、操作しているコンテンツ等）に基づいて、状況に合ったコンテンツを検索し、ユーザに提案することを可能とする。<4-3. Third Application Example: Suggestion System>
Next, as a third application example, a suggestion system will be described with reference to FIGS. 13 to 15. FIG. The suggestion system is used on a system running multiple applications to search for content that matches the situation based on the user's operation information (content being viewed, content being operated, etc.) in each application. and make it possible to propose to the user.

例えば、ユーザが色々なアプリケーションを用いて旅行計画を立てている場合を想定する。ユーザが、Ｗｅｂブラウザで観光地を探し、地図アプリで現地の地図を検索し、さらにノートアプリに計画をまとめている場合、サジェストシステムは、これらの複数アプリケーションの利用状況に応じて、需要に合ったコンテンツ（Ｗｅｂページやテキスト、画像など）を提案することが可能となる。 For example, assume that the user is planning a trip using various applications. If the user searches for tourist spots on a web browser, searches for local maps on a map app, and organizes plans on a note app, the suggestion system will be able to match demand according to the usage of these multiple applications. It is possible to propose content (web pages, text, images, etc.) that is unique to the user.

（構成例）
図１３は、本システムの構成の一例を示す機能ブロック図である。図１３に示すように、例えばサジェストシステムは、情報処理装置１０ｘにより実現され得る。情報処理装置１０ｘは、１以上のアプリ１０５と、情報収集部１０６と、サジェスト部１０７と、特徴空間管理部１０１ｘと、モダリティ管理部１０２ｘと、として機能する。これらは、情報処理装置１０の制御部１００により実施され得る。(Configuration example)
FIG. 13 is a functional block diagram showing an example of the configuration of this system. As shown in FIG. 13, for example, the suggestion system can be realized by the information processing device 10x. The information processing device 10x functions as one or more applications 105, an information collection unit 106, a suggestion unit 107, a feature space management unit 101x, and a modality management unit 102x. These can be performed by the control unit 100 of the information processing device 10 .

アプリ１０５は、Ｗｅｂブラウザ、地図アプリケーション、ノートアプリケーション等の、各種アプリケーションプログラムである。 The application 105 is various application programs such as a web browser, a map application, and a notebook application.

情報収集部１０６は、各アプリ１０５の動作を監視し、各アプリ１０５におけるユーザ操作情報（すなわちアプリケーションの利用状況）を収集、蓄積する機能を有する。また、情報収集部１０６には、ＯＳ（Operating System）を利用してもよい。 The information collection unit 106 has a function of monitoring the operation of each application 105 and collecting and accumulating user operation information (i.e. application usage status) in each application 105 . Also, an OS (Operating System) may be used for the information collecting unit 106 .

サジェスト部１０７は、情報収集部１０６により収集された操作情報に基づいて検索要求を生成し、特徴空間管理部１０１ｘに対して検索要求を行う。例えばサジェスト部１０７は、情報収集部１０６から各アプリ１０５から取得した閲覧中／編集中のコンテンツのモダリティ(mdl1)と内容(obj)、および必要なコンテンツのモダリティ(mdl2)の要求に基づいて、検索要求を生成し得る。各アプリ１０５から取得されるコンテンツのモダリティと必要なコンテンツのモダリティの要求は、例えば以下のような例が想定される。
・Ｗｅｂブラウザ…閲覧：Ｗｅｂページ、要求：Ｗｅｂページ
・地図アプリ…閲覧：住所、要求：なし
・ノートアプリ…編集：テキスト／画像、要求：テキスト／画像The suggestion unit 107 generates a search request based on the operation information collected by the information collection unit 106, and issues the search request to the feature space management unit 101x. For example, the suggestion unit 107, based on the modality (mdl1) and content (obj) of the content being viewed/edited acquired from each application 105 from the information collection unit 106, and the modality (mdl2) of the required content, A search request can be generated. The modalities of content acquired from each application 105 and the modalities of required content are assumed to be, for example, as follows.
・Web browser…Browsing: Web page, Request: Web page ・Map application…Browsing: Address, Request: None ・Note application…Editing: Text/image, Request: Text/image

特徴空間管理部１０１ｘは、サジェスト部１０７からの要求に応じて、１以上の特徴空間を用いた検索処理を行う。検索処理は、上述した実施形態と同様であり、まず特徴空間管理部１０１ｘがモダリティ管理部１０２ｘによりobjを変換処理したentityを取得し、entity、mdl1（例えばＷｅｂページ、住所、テキスト）、およびmdl2（例えばＷｅｂページ、画像）に基づいて、特徴管理サーバ２０に対して検索要求を行う。そして、特徴空間管理部１０１ｘは、検索結果をサジェスト部１０７に出力する。 In response to a request from the suggestion unit 107, the feature space management unit 101x performs search processing using one or more feature spaces. The search processing is the same as in the above-described embodiment. First, the feature space management unit 101x obtains the entity obtained by converting obj by the modality management unit 102x, and obtains the entity, mdl1 (for example, Web page, address, text), and mdl2. (for example, web page, image), a search request is made to the feature management server 20 . The feature space management unit 101 x then outputs the search result to the suggestion unit 107 .

モダリティ管理部１０２ｘは、図１を参照して説明したモダリティ管理部１０２と同様の機能を有し、モダリティ定義部１０３により、objを、mdl1のモダリティの定義に従って所定のデータ形式に変換する処理を行い、生成したentityを特徴空間管理部１０１ｘに出力する。 The modality management unit 102x has the same function as the modality management unit 102 described with reference to FIG. and outputs the generated entity to the feature space management unit 101x.

以上、本応用例によるサジェストシステムを実行する情報処理装置１０ｘの構成の一例について具体的に説明した。 An example of the configuration of the information processing device 10x that executes the suggestion system according to the application example has been specifically described above.

（動作処理）
続いて、本応用例によるサジェストシステムの動作処理について図１４を参照して説明する。図１４は、本応用例のサジェストシステムにおける検索処理の流れの一例を示すシーケンス図である。(operation processing)
Next, operation processing of the suggestion system according to this application example will be described with reference to FIG. 14 . FIG. 14 is a sequence diagram showing an example of the flow of search processing in the suggestion system of this application example.

図１４に示すように、まず、１以上のアプリ１０５は、ユーザにより操作が行われると（ステップＳ４０３）、扱っているコンテンツの送信(post；obj,mdl1)と、必要なコンテンツの要求(request；mdl2)を、情報収集部１０６に対して行う（ステップＳ４０６）。postの一例としては、例えば、「金閣寺」のＷｅｂページ、「京都市北区・・・１－２－３」という住所、および旅行関連のテキスト等が挙げられる。また、requestとしては、例えば、Ｗｅｂページ、画像が挙げられる。 As shown in FIG. 14, first, when one or more applications 105 are operated by a user (step S403), the content being handled is sent (post; obj, mdl1) and the required content is requested (request ;mdl2) to the information collection unit 106 (step S406). Examples of posts include a web page of “Kinkakuji”, an address of “1-2-3 Kita-ku, Kyoto”, and travel-related texts. Also, the request includes, for example, a web page and an image.

次いで、情報収集部１０６は、収集した情報（post、request）を、サジェスト部１０７に出力する（ステップＳ４１２）。 Next, the information collection unit 106 outputs the collected information (post, request) to the suggestion unit 107 (step S412).

次に、サジェスト部１０７は、特徴空間管理部１０１ｘに対し、検索要求を行う（ステップＳ４１５）。検索要求には、postに含まれるコンテンツがobj、そのモダリティがmdl1、また、requestに含まれるコンテンツのモダリティがmdl2として含まれる。 Next, the suggestion unit 107 makes a search request to the feature space management unit 101x (step S415). The search request includes the content included in the post as obj, its modality as mdl1, and the modality of the content included in the request as mdl2.

次いで、特徴空間管理部１０１ｘにおいて検索処理が実行される（ステップＳ４１８）。ステップＳ４１８では、図５のステップＳ１３６～Ｓ１６６と同様の処理（objとmdl1からentityの生成、entityとmdl1とmdl2に基づく検索、検索結果のIDからオブジェクトの取得）が行われるが、ここでの詳細な説明は省略する。 Next, search processing is executed in the feature space management unit 101x (step S418). In step S418, the same processing as steps S136 to S166 in FIG. 5 (generating an entity from obj and mdl1, searching based on entity, mdl1 and mdl2, and obtaining an object from the ID of the search result) is performed. Detailed description is omitted.

次に、サジェスト部１０７は、特徴空間管理部１０１ｘから検索結果を取得する（ステップＳ４２１）。 Next, the suggestion unit 107 acquires search results from the feature space management unit 101x (step S421).

次いで、サジェスト部１０７は、検索結果の類似度と重み付け（Ｗ）に応じて、検索結果の順位付け（再評価）を行ってもよい（ステップＳ４２４）。例えば、サジェスト部１０７は、入出力毎に下記表２のような重みを設定しておき、類似度に掛け合わせてランキングしてもよい。なお、本応用例において、かかる再評価はスキップされてもよい。 Next, the suggestion unit 107 may rank (re-evaluate) the search results according to the degree of similarity and weighting (W) of the search results (step S424). For example, the suggestion unit 107 may set weights as shown in Table 2 below for each input and output, and rank them by multiplying the degrees of similarity. Note that in this application example, such re-evaluation may be skipped.

そして、サジェスト部１０７は、検索結果を示す表示画面の作成を行い（ステップＳ４２７）、ユーザに提示する（ステップＳ４３０）。 Then, the suggestion unit 107 creates a display screen showing the search results (step S427) and presents it to the user (step S430).

また、サジェスト部１０７は、ユーザから利用状況のフィードバックを得た場合は、上記ステップ４２４で用いた重み付け（Ｗ）を更新等してもよい（ステップＳ４３３）。 In addition, the suggestion unit 107 may update the weighting (W) used in step 424 (step S433) when feedback on the usage status is obtained from the user.

なお、サジェスト部１０７によるユーザへのサジェストやユーザからのフィードバックの取得は、アプリ１０５を介して行うようにしてもよい。 Note that the suggestion to the user by the suggestion unit 107 and the acquisition of feedback from the user may be performed via the application 105 .

ここで、図１５に、本応用例によるアプリケーションから取得する操作情報と要求情報の一例を示す。本システムでは、各アプリケーションから、図１５の左に示すような操作情報と、図１５の右に示すような要求情報を取得し、操作情報に基づいて、要求された情報をサジェストする。 Here, FIG. 15 shows an example of operation information and request information acquired from the application according to this application example. This system acquires operation information shown on the left side of FIG. 15 and request information shown on the right side of FIG. 15 from each application, and suggests requested information based on the operation information.

＜＜５．まとめ＞＞
上述したように、本開示の実施形態による情報処理システムでは、複数の特徴空間を扱うシステムの利便性をより向上させることが可能となる。<<5. Summary>>
As described above, in the information processing system according to the embodiment of the present disclosure, it is possible to further improve the convenience of a system that handles multiple feature spaces.

以上、添付図面を参照しながら本開示の好適な実施形態について詳細に説明したが、本技術はかかる例に限定されない。本開示の技術分野における通常の知識を有する者であれば、特許請求の範囲に記載された技術的思想の範疇内において、各種の変更例または修正例に想到し得ることは明らかであり、これらについても、当然に本開示の技術的範囲に属するものと了解される。 Although the preferred embodiments of the present disclosure have been described in detail above with reference to the accompanying drawings, the present technology is not limited to such examples. It is obvious that those who have ordinary knowledge in the technical field of the present disclosure can conceive of various modifications or modifications within the scope of the technical idea described in the claims. is naturally within the technical scope of the present disclosure.

例えば、上述した情報処理装置１０、または特徴管理サーバ２０に内蔵されるＣＰＵ、ＲＯＭ、およびＲＡＭ等のハードウェアに、情報処理装置１０、または特徴管理サーバ２０の機能を発揮させるためのコンピュータプログラムも作成可能である。また、当該コンピュータプログラムを記憶させたコンピュータ読み取り可能な記憶媒体も提供される。 For example, a computer program for causing hardware such as a CPU, ROM, and RAM incorporated in the information processing apparatus 10 or the feature management server 20 described above to exhibit the functions of the information processing apparatus 10 or the feature management server 20. can be created. A computer-readable storage medium storing the computer program is also provided.

また、本明細書に記載された効果は、あくまで説明的または例示的なものであって限定的ではない。つまり、本開示に係る技術は、上記の効果とともに、または上記の効果に代えて、本明細書の記載から当業者には明らかな他の効果を奏しうる。 Also, the effects described herein are merely illustrative or exemplary, and are not limiting. In other words, the technology according to the present disclosure can produce other effects that are obvious to those skilled in the art from the description of this specification, in addition to or instead of the above effects.

なお、本技術は以下のような構成も取ることができる。
（１）
登録要求情報に含まれる登録オブジェクトを複数の特徴抽出部に共通する一意の第１の識別情報と関連付けて記憶部に記憶する制御と、
前記登録オブジェクトを前記登録オブジェクトのモダリティの定義に従って変換し、登録用の変換データを生成する制御と、
前記第１の識別情報と前記登録用の変換データを、前記モダリティに対応する複数の特徴抽出器に出力する制御と、
を行う制御部を備える、情報処理装置。
（２）
前記モダリティの定義は、モダリティに対応する所定のデータ形式への変換ルールである、前記（１）に記載の情報処理装置。
（３）
前記制御部は、
検索要求に含まれる検索オブジェクトを前記検索オブジェクトのモダリティの定義に従って変換し、検索用の変換データを生成する制御と、
前記検索用の変換データを、前記検索オブジェクトのモダリティと前記検索要求に含まれるターゲットモダリティとに対応する前記特徴抽出器に出力する制御と、
を行う、前記（１）または（２）に記載の情報処理装置。
（４）
前記制御部は、
１以上の前記特徴抽出器において前記検索用の変換データに基づいて検索された第２の識別情報を取得し、
前記第２の識別情報に基づいて、前記記憶部から対応するオブジェクトを取得し、検索結果として出力する、前記（３）に記載の情報処理装置。
（５）
前記制御部は、
前記特徴抽出器から、前記検索用の変換データから抽出された特徴と類似する特徴に関連付けられた前記第２の識別情報と共に、前記特徴の類似度合いを示す類似度を取得する、前記（４）に記載の情報処理装置。
（６）
前記検索要求には、検索条件としてフィルター情報がさらに含まれる、前記（４）または（５）に記載の情報処理装置。
（７）
前記制御部は、
前記登録要求情報が入力された際、前記登録オブジェクトのモダリティと親子関係を有するサブモダリティの定義に従って、前記登録オブジェクトを変換して登録用のサブ変換データを生成する制御と、
前記第１の識別情報と前記サブ変換データを、前記サブモダリティに対応する１以上の特徴抽出器に出力する制御と、
をさらに行う、前記（１）～（６）のいずれか１項に記載の情報処理装置。
（８）
前記制御部は、
前記検索要求が入力された際、前記検索オブジェクトのモダリティと親子関係を有するサブモダリティの定義に従って、前記検索オブジェクトのうち前記サブモダリティに対応するデータを変換して検索用のサブ変換データを生成する制御と、
前記検索用のサブ変換データを、前記サブモダリティおよび前記ターゲットモダリティに対応する１以上の前記特徴抽出器に出力する制御と、
をさらに行う、前記（３）～（６）のいずれか１項に記載の情報処理装置。
（９）
前記制御部は、
前記検索要求に基づいて前記特徴抽出器から取得した前記第２の識別情報および類似度と、前記特徴抽出器の重み付けに基づいて、複数の前記第２の識別情報を順位付けする制御と、
上位所定数の前記第２の識別情報を前記検索結果として出力する制御と、
をさらに行う、前記（４）～（６）のいずれか１項に記載の情報処理装置。
（１０）
前記制御部は、
１以上のアプリケーションから出力されたユーザの操作情報を含む情報に基づいて、前記ユーザに提案するコンテンツを検索する前記検索要求を生成し、
前記特徴抽出器から取得した１以上の前記第２の識別情報を、前記検索結果として出力する、前記（４）～（６）のいずれか１項に記載の情報処理装置。
（１１）
前記制御部は、
前記情報に含まれる、前記アプリケーションで扱われているコンテンツを前記検索オブジェクトとし、
前記コンテンツのモダリティを、前記検索オブジェクトのモダリティとし、
前記アプリケーションで要求されているコンテンツのモダリティを、前記ターゲットモダリティとして、前記検索要求を生成する、前記（１０）に記載の情報処理装置。
（１２）
前記情報処理装置は、
前記第１の識別情報と前記登録用の変換データを、前記特徴抽出器を有する特徴管理サーバに送信する通信部をさらに備える、前記（１）～（１１）のいずれか１項に記載の情報処理装置。
（１３）
プロセッサが、
登録要求情報に含まれる登録オブジェクトを複数の特徴抽出部に共通する一意の第１の識別情報と関連付けて記憶部に記憶する制御と、
前記登録オブジェクトを前記登録オブジェクトのモダリティの定義に従って変換し、登録用の変換データを生成する制御と、
前記第１の識別情報と前記登録用の変換データを、前記モダリティに対応する複数の特徴抽出器に出力する制御と、
を行うことを含む、情報処理方法。
（１４）
コンピュータを、
登録要求情報に含まれる登録オブジェクトを複数の特徴抽出部に共通する一意の第１の識別情報と関連付けて記憶部に記憶する制御と、
前記登録オブジェクトを前記登録オブジェクトのモダリティの定義に従って変換し、登録用の変換データを生成する制御と、
前記第１の識別情報と前記登録用の変換データを、前記モダリティに対応する複数の特徴抽出器に出力する制御と、
を行う制御部として機能させるための、プログラム。Note that the present technology can also take the following configuration.
(1)
control to associate a registration object included in the registration request information with unique first identification information common to a plurality of feature extraction units and store the registration object in a storage unit;
a control for converting the registration object according to the modality definition of the registration object to generate conversion data for registration;
Control for outputting the first identification information and the conversion data for registration to a plurality of feature extractors corresponding to the modalities;
An information processing device comprising a control unit that performs
(2)
The information processing apparatus according to (1), wherein the definition of the modality is a conversion rule into a predetermined data format corresponding to the modality.
(3)
The control unit
control for converting a search object included in a search request in accordance with the modality definition of the search object to generate conversion data for the search;
control for outputting the transformation data for retrieval to the feature extractor corresponding to the modality of the retrieval object and the target modality included in the retrieval request;
The information processing apparatus according to (1) or (2) above.
(4)
The control unit
Acquiring second identification information retrieved based on the transformation data for retrieval in one or more of the feature extractors;
The information processing apparatus according to (3), wherein the corresponding object is acquired from the storage unit based on the second identification information and output as a search result.
(5)
The control unit
Acquiring from the feature extractor the second identification information associated with the feature similar to the feature extracted from the conversion data for search and the degree of similarity indicating the degree of similarity of the feature; (4) The information processing device according to .
(6)
The information processing apparatus according to (4) or (5), wherein the search request further includes filter information as a search condition.
(7)
The control unit
control for generating sub-converted data for registration by converting the registered object according to the definition of a submodality having a parent-child relationship with the modality of the registered object when the registration request information is input;
Control for outputting the first identification information and the sub-transformed data to one or more feature extractors corresponding to the sub-modalities;
The information processing apparatus according to any one of (1) to (6), further performing
(8)
The control unit
When the search request is input, data corresponding to the submodality of the search object is converted to generate sub-converted data for search according to the definition of the submodality having a parent-child relationship with the modality of the search object. control and
Control for outputting the sub-transformed data for searching to one or more of the feature extractors corresponding to the submodality and the target modality;
The information processing apparatus according to any one of (3) to (6), further performing
(9)
The control unit
Control for ranking the plurality of pieces of second identification information based on the second identification information and the degree of similarity obtained from the feature extractor based on the search request and the weighting of the feature extractor;
Control for outputting the second identification information of a predetermined number of higher ranks as the search result;
The information processing apparatus according to any one of (4) to (6), further performing
(10)
The control unit
generating the search request for searching for content to be proposed to the user based on information including user operation information output from one or more applications;
The information processing apparatus according to any one of (4) to (6), wherein the one or more pieces of second identification information acquired from the feature extractor are output as the search result.
(11)
The control unit
The content handled by the application, which is included in the information, is defined as the search object;
Let the modality of the content be the modality of the search object,
The information processing apparatus according to (10), wherein the search request is generated with the modality of the content requested by the application as the target modality.
(12)
The information processing device is
The information according to any one of (1) to (11) above, further comprising a communication unit that transmits the first identification information and the conversion data for registration to a feature management server having the feature extractor. processing equipment.
(13)
the processor
control to associate a registration object included in the registration request information with unique first identification information common to a plurality of feature extraction units and store the registration object in a storage unit;
a control for converting the registration object according to the modality definition of the registration object to generate conversion data for registration;
Control for outputting the first identification information and the conversion data for registration to a plurality of feature extractors corresponding to the modalities;
A method of processing information, including performing
(14)
the computer,
control to associate a registration object included in the registration request information with unique first identification information common to a plurality of feature extraction units and store the registration object in a storage unit;
a control for converting the registration object according to the modality definition of the registration object to generate conversion data for registration;
Control for outputting the first identification information and the conversion data for registration to a plurality of feature extractors corresponding to the modalities;
A program that functions as a control unit that performs

１０、１０ｘ情報処理装置
２０、２４特徴管理サーバ
２５データベースサーバ
１００制御部
１０１、１０１ｘ特徴空間管理部
１０２、１０２ｘモダリティ管理部
１０３モダリティ定義部
１０５アプリ
１０６情報収集部
１０７サジェスト部
１１０入力部
１２０通信部
１３０出力部
１４０記憶部
２００制御部
２０１特徴管理部
２０２特徴抽出部
２１０通信部
２２０特徴量データベース
２４０特徴抽出部
２５０特徴量データベース10, 10x information processing device 20, 24 feature management server 25 database server 100 control unit 101, 101x feature space management unit 102, 102x modality management unit 103 modality definition unit 105 application 106 information collection unit 107 suggestion unit 110 input unit 120 communication unit 130 output unit 140 storage unit 200 control unit 201 feature management unit 202 feature extraction unit 210 communication unit 220 feature amount database 240 feature extraction unit 250 feature amount database

Claims

In an information processing device connected to a plurality of feature extractors,
Control for storing the registration object included in the registration request information in a storage unit in association with unique first identification information common to a plurality of feature extractors corresponding to modalities among the plurality of feature extractors ;
a control for converting the registration object according to the modality definition of the registration object to generate conversion data for registration;
Control for outputting the first identification information and the conversion data for registration to a plurality of feature extractors corresponding to the modality among the plurality of feature extractors;
An information processing device comprising a control unit that performs

2. The information processing apparatus according to claim 1, wherein said modality definition is a conversion rule into a predetermined data format corresponding to the modality.

The control unit
control for converting a search object included in a search request in accordance with the modality definition of the search object to generate conversion data for the search;
control for outputting the transformation data for retrieval to the feature extractor corresponding to the modality of the retrieval object and the target modality included in the retrieval request;
The information processing apparatus according to claim 1, wherein

The control unit
Acquiring second identification information retrieved based on the transformation data for retrieval in one or more of the feature extractors;
4. The information processing apparatus according to claim 3, wherein a corresponding object is acquired from said storage unit based on said second identification information and output as a search result.

The control unit
5. The method according to claim 4, wherein from the feature extractor, the second identification information associated with the feature similar to the feature extracted from the conversion data for search and the similarity indicating the degree of similarity of the feature are obtained. The information processing device described.

5. The information processing apparatus according to claim 4, wherein said search request further includes filter information as a search condition.

The control unit
control for generating sub-converted data for registration by converting the registered object according to the definition of a submodality having a parent-child relationship with the modality of the registered object when the registration request information is input;
Control for outputting the first identification information and the sub-transformed data to one or more feature extractors corresponding to the sub-modalities;
The information processing apparatus according to claim 1, further comprising:

The control unit
When the search request is input, data corresponding to the submodality of the search object is converted to generate sub-converted data for search according to the definition of the submodality having a parent-child relationship with the modality of the search object. control and
Control for outputting the sub-transformed data for searching to one or more of the feature extractors corresponding to the submodality and the target modality;
4. The information processing apparatus according to claim 3, further comprising:

The control unit
Control for ranking the plurality of pieces of second identification information based on the second identification information and the degree of similarity obtained from the feature extractor based on the search request and the weighting of the feature extractor;
Control for outputting the second identification information of a predetermined number of higher ranks as the search result;
5. The information processing apparatus according to claim 4, further comprising:

The control unit
generating the search request for searching for content to be proposed to the user based on information including user operation information output from one or more applications;
5. The information processing apparatus according to claim 4, wherein said one or more pieces of said second identification information acquired from said feature extractor are output as said search result.

The control unit
The content handled by the application, which is included in the information, is defined as the search object;
Let the modality of the content be the modality of the search object,
11. The information processing apparatus according to claim 10, wherein the search request is generated with the modality of the content requested by the application as the target modality.

The information processing device is
2. The information processing apparatus according to claim 1, further comprising a communication unit that transmits said first identification information and said conversion data for registration to a feature management server having said feature extractor.

In an information processing method for an information processing device connected to a plurality of feature extractors,
the processor
Control for storing the registration object included in the registration request information in a storage unit in association with unique first identification information common to a plurality of feature extractors corresponding to modalities among the plurality of feature extractors ;
a control for converting the registration object according to the modality definition of the registration object to generate conversion data for registration;
Control for outputting the first identification information and the conversion data for registration to a plurality of feature extractors corresponding to the modality among the plurality of feature extractors;
A method of processing information, including performing

In a computer program connected to a plurality of feature extractors,
said computer,
Control for storing the registration object included in the registration request information in a storage unit in association with unique first identification information common to a plurality of feature extractors corresponding to modalities among the plurality of feature extractors ;
a control for converting the registration object according to the modality definition of the registration object to generate conversion data for registration;
Control for outputting the first identification information and the conversion data for registration to a plurality of feature extractors corresponding to the modality among the plurality of feature extractors;
A program that functions as a control unit that performs