JP2024027034A

JP2024027034A - Information processing apparatus, information processing method, and information processing program

Info

Publication number: JP2024027034A
Application number: JP2022129735A
Authority: JP
Inventors: 雅二郎岩崎; Masajiro Iwasaki
Original assignee: Ly Corp
Current assignee: Ly Corp
Priority date: 2022-08-16
Filing date: 2022-08-16
Publication date: 2024-02-29

Abstract

To create groups into which objects are classified.SOLUTION: An information processing apparatus according to the present application has an acquisition unit and a generation unit. The acquisition unit acquires object information indicating a plurality of objects to be subjected to data search. The generation unit generates, based on the object information acquired by the acquisition unit, first group information indicating a plurality of first groups that are a first number of groups into which the plurality of objects are classified, and second group information indicating a plurality of second groups that are groups different in classification from the plurality of first groups and are a second number of groups larger than the first number into which the plurality of objects are classified.SELECTED DRAWING: Figure 44

Description

本発明は、情報処理装置、情報処理方法、及び情報処理プログラムに関する。 The present invention relates to an information processing device, an information processing method, and an information processing program.

従来、種々の情報を探索（検索）する技術が提供されている。例えば、検索対象に対応するノードがエッジにより連結されたグラフを生成し、生成したグラフを用いて検索を行う技術が提供されている。また、このような技術は、例えば画像検索等に用いられる。 Conventionally, techniques for searching (searching) various types of information have been provided. For example, there is a technology that generates a graph in which nodes corresponding to a search target are connected by edges, and performs a search using the generated graph. Further, such a technique is used for, for example, image search.

特許第６２９３３３５号公報Patent No. 6293335 特許第６３００９８２号公報Patent No. 6300982 特許第６３１１０００号公報Patent No. 6311000

岩崎雅二郎 "木構造型インデックスを利用した近似k最近傍グラフによる近傍検索", 情報処理学会論文誌, 2011/2, Vol. 52, No. 2. pp.817-828.Masajiro Iwasaki "Neighborhood search using approximate k-nearest neighbor graph using tree-structured index", Information Processing Society of Japan Transactions, 2011/2, Vol. 52, No. 2. pp.817-828.

しかしながら、上記の従来技術では、検索処理の効率化の点では改善の余地がある。例えば、上記の従来技術では、グラフのエッジ数等を調整したりして、効率的な検索ができるグラフに変更することにより、検索処理の時間削減等の効率化を行っているが、グラフの変更により検索処理の効率化を図るには限界がある。そのため、例えば検索処理の高速化等、効率的な検索処理に利用可能な情報を生成するために、検索対象となるオブジェクトを分類するグループの概念を導入し、オブジェクトを分類するグループを生成することが望まれている。 However, with the above-mentioned conventional technology, there is room for improvement in terms of improving the efficiency of search processing. For example, in the above-mentioned conventional technology, by adjusting the number of edges of the graph and changing the graph to one that allows efficient searching, efficiency is achieved by reducing search processing time. There is a limit to how much efficiency can be achieved in search processing through changes. Therefore, in order to generate information that can be used for efficient search processing, such as speeding up search processing, the concept of groups for classifying objects to be searched is introduced, and groups for classifying objects are generated. is desired.

本願は、上記に鑑みてなされたものであって、オブジェクトを分類するグループを生成する情報処理装置、情報処理方法、及び情報処理プログラムを提供することを目的とする。 The present application has been made in view of the above, and aims to provide an information processing device, an information processing method, and an information processing program that generate groups for classifying objects.

本願に係る情報処理装置は、データ検索の対象となる複数のオブジェクトを示すオブジェクト情報を取得する取得部と、前記取得部により取得された前記オブジェクト情報に基づいて、前記複数のオブジェクトを分類する第１数のグループである複数の第１グループを示す第１グループ情報と、前記複数の第１グループとは異なる分類となるグループであって、前記複数のオブジェクトを分類する前記第１数よりも多い第２数のグループである複数の第２グループを示す第２グループ情報とを生成する生成部と、を備えたことを特徴とする。 The information processing device according to the present application includes an acquisition unit that acquires object information indicating a plurality of objects that are targets of data search, and a second object that classifies the plurality of objects based on the object information acquired by the acquisition unit. first group information indicating a plurality of first groups that are one number group; and groups that have a different classification from the plurality of first groups and are larger than the first number for classifying the plurality of objects. A generation unit that generates second group information indicating a plurality of second groups that are a second number of groups.

実施形態の一態様によれば、オブジェクトを分類するグループを生成することができるという効果を奏する。 According to one aspect of the embodiment, it is possible to generate groups for classifying objects.

図１は、第１の実施形態に係る情報処理の一例を示す図である。FIG. 1 is a diagram illustrating an example of information processing according to the first embodiment. 図２は、第１の実施形態に係るデータの一例を示す図である。FIG. 2 is a diagram showing an example of data according to the first embodiment. 図３は、第１の実施形態に係るデータの一例を示す図である。FIG. 3 is a diagram showing an example of data according to the first embodiment. 図４は、第１の実施形態に係る情報処理システムの構成例を示す図である。FIG. 4 is a diagram illustrating a configuration example of an information processing system according to the first embodiment. 図５は、第１の実施形態に係る情報処理装置の構成例を示す図である。FIG. 5 is a diagram illustrating a configuration example of an information processing apparatus according to the first embodiment. 図６は、第１の実施形態に係るオブジェクト情報記憶部の一例を示す図である。FIG. 6 is a diagram illustrating an example of an object information storage unit according to the first embodiment. 図７は、第１の実施形態に係るグラフ情報記憶部の一例を示す図である。FIG. 7 is a diagram illustrating an example of a graph information storage unit according to the first embodiment. 図８は、第１の実施形態に係る量子化情報記憶部の一例を示す図である。FIG. 8 is a diagram illustrating an example of a quantization information storage unit according to the first embodiment. 図９は、第１の実施形態に係るコードブック情報記憶部の一例を示す図である。FIG. 9 is a diagram illustrating an example of a codebook information storage unit according to the first embodiment. 図１０は、第１の実施形態に係る情報処理の一例を示すフローチャートである。FIG. 10 is a flowchart illustrating an example of information processing according to the first embodiment. 図１１は、第１の実施形態に係る検索処理の一例を示すフローチャートである。FIG. 11 is a flowchart illustrating an example of search processing according to the first embodiment. 図１２は、第２の実施形態に係る情報処理の一例を示す図である。FIG. 12 is a diagram illustrating an example of information processing according to the second embodiment. 図１３は、第２の実施形態に係る情報処理装置の構成例を示す図である。FIG. 13 is a diagram illustrating a configuration example of an information processing apparatus according to the second embodiment. 図１４は、第２の実施形態に係る量子化情報記憶部の一例を示す図である。FIG. 14 is a diagram illustrating an example of a quantization information storage unit according to the second embodiment. 図１５は、第２の実施形態に係るブロブ情報記憶部の一例を示す図である。FIG. 15 is a diagram illustrating an example of a blob information storage unit according to the second embodiment. 図１６は、第２の実施形態に係る検索処理の一例を示すフローチャートである。FIG. 16 is a flowchart illustrating an example of search processing according to the second embodiment. 図１７は、変形例に係る情報処理の一例を示す図である。FIG. 17 is a diagram illustrating an example of information processing according to a modification. 図１８は、変形例に係るグラフ情報記憶部の一例を示す図である。FIG. 18 is a diagram illustrating an example of a graph information storage unit according to a modification. 図１９は、変形例に係る検索処理の一例を示すフローチャートである。FIG. 19 is a flowchart illustrating an example of a search process according to a modification. 図２０は、変形例に係る情報処理の他の一例を示す図である。FIG. 20 is a diagram illustrating another example of information processing according to the modification. 図２１は、ブロブ情報の生成処理の一例を示す図である。FIG. 21 is a diagram illustrating an example of blob information generation processing. 図２２は、変形例に係る情報処理の一例を示すフローチャートである。FIG. 22 is a flowchart illustrating an example of information processing according to a modification. 図２３は、第３の実施形態に係る情報処理の一例を示す図である。FIG. 23 is a diagram illustrating an example of information processing according to the third embodiment. 図２４は、第３の実施形態に係る情報処理装置の構成例を示す図である。FIG. 24 is a diagram illustrating a configuration example of an information processing apparatus according to the third embodiment. 図２５は、第３の実施形態に係るブロブ連結グラフ情報記憶部の一例を示す図である。FIG. 25 is a diagram illustrating an example of a blob connection graph information storage unit according to the third embodiment. 図２６は、第３の実施形態に係る情報処理の一例を示すフローチャートである。FIG. 26 is a flowchart illustrating an example of information processing according to the third embodiment. 図２７は、第３の実施形態に係る検索処理の一例を示すフローチャートである。FIG. 27 is a flowchart illustrating an example of search processing according to the third embodiment. 図２８は、第３の実施形態に係る検索処理の一例を示すフローチャートである。FIG. 28 is a flowchart illustrating an example of search processing according to the third embodiment. 図２９は、変形例に係るブロブ連結グラフ情報の一例を示す図である。FIG. 29 is a diagram illustrating an example of blob connection graph information according to a modification. 図３０は、変形例に係るブロブ情報記憶部の一例を示す図である。FIG. 30 is a diagram illustrating an example of a blob information storage unit according to a modification. 図３１は、変形例に係るブロブ連結グラフ情報記憶部の一例を示す図である。FIG. 31 is a diagram illustrating an example of a blob connection graph information storage unit according to a modification. 図３２は、第４の実施形態に係る情報処理の一例を示す図である。FIG. 32 is a diagram illustrating an example of information processing according to the fourth embodiment. 図３３は、第４の実施形態に係る情報処理装置の構成例を示す図である。FIG. 33 is a diagram illustrating a configuration example of an information processing apparatus according to the fourth embodiment. 図３４は、第４の実施形態に係る量子化に関する概念図である。FIG. 34 is a conceptual diagram regarding quantization according to the fourth embodiment. 図３５は、第４の実施形態に係る転置インデックスの一例を示す図である。FIG. 35 is a diagram illustrating an example of a transposed index according to the fourth embodiment. 図３６は、第４の実施形態に係る転置インデックスの具体例を示す図である。FIG. 36 is a diagram showing a specific example of a transposed index according to the fourth embodiment. 図３７は、第４の実施形態に係る情報処理の一例を示すフローチャートである。FIG. 37 is a flowchart illustrating an example of information processing according to the fourth embodiment. 図３８は、第４の実施形態に係る情報処理の一例を示すフローチャートである。FIG. 38 is a flowchart illustrating an example of information processing according to the fourth embodiment. 図３９は、第５の実施形態に係る情報処理の一例を示す図である。FIG. 39 is a diagram illustrating an example of information processing according to the fifth embodiment. 図４０は、第５の実施形態に係る情報処理の一例を示す図である。FIG. 40 is a diagram illustrating an example of information processing according to the fifth embodiment. 図４１は、第５の実施形態に係る情報処理装置の構成例を示す図である。FIG. 41 is a diagram illustrating a configuration example of an information processing apparatus according to the fifth embodiment. 図４２は、第５の実施形態に係る情報処理の一例を示すフローチャートである。FIG. 42 is a flowchart illustrating an example of information processing according to the fifth embodiment. 図４３は、第５の実施形態に係る検索処理の一例を示すフローチャートである。FIG. 43 is a flowchart illustrating an example of search processing according to the fifth embodiment. 図４４は、第６の実施形態に係る情報処理の一例を示す図である。FIG. 44 is a diagram illustrating an example of information processing according to the sixth embodiment. 図４５は、第６の実施形態に係る情報処理装置の構成例を示す図である。FIG. 45 is a diagram illustrating a configuration example of an information processing apparatus according to the sixth embodiment. 図４６は、第６の実施形態に係る転置インデックスの一例を示す図である。FIG. 46 is a diagram illustrating an example of a transposed index according to the sixth embodiment. 図４７は、第６の実施形態に係る情報処理の一例を示すフローチャートである。FIG. 47 is a flowchart illustrating an example of information processing according to the sixth embodiment. 図４８は、第６の実施形態に係る検索処理の一例を示すフローチャートである。FIG. 48 is a flowchart illustrating an example of search processing according to the sixth embodiment. 図４９は、第１グループ及び第２グループの生成の一例を示す図である。FIG. 49 is a diagram illustrating an example of generation of the first group and the second group. 図５０は、第１グループ及び第２グループの生成の一例を示す図である。FIG. 50 is a diagram illustrating an example of generation of the first group and the second group. 図５１は、第１グループ及び第２グループの生成の一例を示す図である。FIG. 51 is a diagram illustrating an example of generation of the first group and the second group. 図５２は、第６の実施形態に係る情報処理の一例を示すフローチャートである。FIG. 52 is a flowchart illustrating an example of information processing according to the sixth embodiment. 図５３は、情報処理装置の機能を実現するコンピュータの一例を示すハードウェア構成図である。FIG. 53 is a hardware configuration diagram showing an example of a computer that implements the functions of the information processing device.

以下に、本願に係る情報処理装置、情報処理方法、及び情報処理プログラムを実施するための形態（以下、「実施形態」と呼ぶ）について図面を参照しつつ詳細に説明する。なお、この実施形態により本願に係る情報処理装置、情報処理方法、及び情報処理プログラムが限定されるものではない。また、以下の各実施形態において同一の部位には同一の符号を付し、重複する説明は省略される。 DESCRIPTION OF THE PREFERRED EMBODIMENTS An information processing apparatus, an information processing method, and an embodiment of an information processing program (hereinafter referred to as "embodiments") according to the present application will be described in detail below with reference to the drawings. Note that the information processing apparatus, information processing method, and information processing program according to the present application are not limited to this embodiment. Further, in each of the embodiments below, the same parts are given the same reference numerals, and redundant explanations will be omitted.

（実施形態）
〔１．第１の実施形態〕
〔１－１．情報処理〕
図１を用いて、第１の実施形態に係る情報処理の一例について説明する。図１は、第１の実施形態に係る情報処理の一例を示す図である。情報処理装置１００は、データ検索の対象となる複数のオブジェクトをグラフ構造化したグラフインデックス（単に「グラフ」ともいう）を用いた検索処理を実行する。図１では、情報処理装置１００がデータ検索の対象であるオブジェクトがベクトル化された各ベクトルに対応するノードがエッジで連結されたグラフを用いて近傍検索を行う場合の検索処理の一部を示す。情報処理装置１００は、グラフの各ノードを検索対象のオブジェクトとして、グラフを辿って与えられた検索クエリ（ベクトル）の近傍のノードを探索する。情報処理装置１００は、検索処理により、抽出する近傍のノードの数として指定された所定数（以下「検索数」ともいう）のノードを、検索クエリの近傍のノードとして抽出する。以下では、画像情報をデータ検索の対象とした場合を一例として説明するが、データ検索の対象は、動画情報や音声情報等の種々の対象であってもよい。 (Embodiment)
[1. First embodiment]
[1-1. Information processing〕
An example of information processing according to the first embodiment will be described using FIG. 1. FIG. 1 is a diagram illustrating an example of information processing according to the first embodiment. The information processing apparatus 100 executes a search process using a graph index (also simply referred to as a "graph") that is a graph structure of a plurality of objects that are data search targets. FIG. 1 shows part of a search process when the information processing device 100 performs a neighborhood search using a graph in which nodes corresponding to vectors of the object to be searched are connected by edges. . The information processing apparatus 100 uses each node of the graph as an object to be searched, and searches for nodes near a given search query (vector) by tracing the graph. Through the search process, the information processing apparatus 100 extracts a predetermined number of nodes (hereinafter also referred to as "search number") designated as the number of nearby nodes to be extracted as nodes in the vicinity of the search query. In the following, a case where image information is the object of data search will be described as an example, but the object of data search may be various objects such as video information and audio information.

情報処理装置１００は、数百万～数億等の単位の膨大な画像情報に対応するノードを対象に検索処理を行うが、図面においてはその一部（図１ではノードＮ１等の数十個）のみを図示する。例えば、情報処理装置１００は、図１中の空間情報ＳＰ１に示すように、ノードＮ１、Ｎ７、Ｎ９等に示すような複数のノード（ベクトル）に関する情報を取得する。このように「ノードＮ＊（＊は任意の数値）」と記載した場合、そのノードはノードＩＤ「Ｎ＊」により識別されるノードであることを示す。例えば、「ノードＮ１」と記載した場合、そのノードはノードＩＤ「Ｎ１」により識別されるノードである。空間情報ＳＰ１中の白丸（〇）の各々が各ノードを示す。 The information processing device 100 performs search processing for nodes corresponding to a huge amount of image information in units of millions to hundreds of millions, but in the drawing, only a portion of them (several tens of nodes such as node N1 in FIG. 1) are shown. ) are shown. For example, the information processing apparatus 100 acquires information regarding a plurality of nodes (vectors) such as nodes N1, N7, N9, etc., as shown in spatial information SP1 in FIG. When "node N* (* is an arbitrary numerical value)" is written in this way, it indicates that the node is identified by the node ID "N*". For example, when "node N1" is written, the node is identified by the node ID "N1". Each white circle (◯) in the spatial information SP1 indicates each node.

図１中の空間情報ＳＰ１では、主に説明に関係するノードに符号を付すが、符号が付されていない白丸（〇）の各々もノードであり、図示するノード以外にも多数のノードが含まれる。また、各ノードは、各オブジェクト（検索対象）に対応する。例えば、画像から抽出された複数の局所特徴量のそれぞれがオブジェクトであってもよい。また、例えば、オブジェクト間の距離が定義された種々のデータがオブジェクトであってもよい。 In the spatial information SP1 in FIG. 1, codes are mainly attached to nodes related to the explanation, but each white circle (○) without a code is also a node, and many nodes are included in addition to the nodes shown. It will be done. Further, each node corresponds to each object (search target). For example, each of the plurality of local features extracted from the image may be an object. Further, for example, various data in which distances between objects are defined may be objects.

例えば図１中の空間情報ＳＰ１は、ユークリッド空間であってもよい。例えば、空間情報ＳＰ１は、オブジェクトのベクトルの次元数に対応し、１００次元や１０００次元等の多次元空間であるものとする。なお、図１では、空間情報ＳＰ１に示すように直積量子化によりベクトル（空間）が複数の部分領域（部分空間）に４分割された状態を概念的に図示するが、直積量子化の点については後述する。 For example, the spatial information SP1 in FIG. 1 may be a Euclidean space. For example, it is assumed that the spatial information SP1 corresponds to the number of dimensions of the vector of the object, and is a multidimensional space such as 100 dimensions or 1000 dimensions. Note that FIG. 1 conceptually illustrates a state in which a vector (space) is divided into four subregions (subspaces) by direct product quantization as shown in spatial information SP1, but regarding the point of direct product quantization, will be described later.

空間情報ＳＰ１中のノードである白丸（〇）間を接続する点線がノード間を連結するエッジを示す。図１の例では、説明を簡単にするために、ノード間が無向（双方向）エッジ（以下単に「エッジ」ともいう）により連結される場合を示す。なお、ここでいう無向エッジとは、連結されたノード間を双方向にデータを辿ることができるエッジを意味する。例えば、ノードＮ１を示す白丸（〇）とノードＮ４を示す白丸（〇）との間は点線で接続されており、ノードＮ１とノードＮ４との間はエッジで連結されていることを示す。すなわち、空間情報ＳＰ１に示すグラフではノードＮ１とノードＮ４との間を双方向に辿ることが可能となる。具体的には、空間情報ＳＰ１ではノードＮ１からノードＮ４へ辿ることができ、かつノードＮ４からノードＮ１へ辿ることができる。 A dotted line connecting white circles (◯) that are nodes in the spatial information SP1 indicates an edge connecting the nodes. In the example of FIG. 1, to simplify the explanation, a case is shown in which nodes are connected by undirected (bidirectional) edges (hereinafter also simply referred to as "edges"). Note that the undirected edge here means an edge that allows data to be traced in both directions between connected nodes. For example, the white circle (◯) indicating the node N1 and the white circle (◯) indicating the node N4 are connected by a dotted line, indicating that the nodes N1 and N4 are connected by an edge. That is, in the graph shown in the spatial information SP1, it is possible to trace the path between the node N1 and the node N4 in both directions. Specifically, the spatial information SP1 can be traced from the node N1 to the node N4, and from the node N4 to the node N1.

なお、図１の例では、図示の関係上、図示したノード間を連結するエッジのみを図示するが、図示したエッジ以外にも多数のエッジが含まれる。このように、図１中の空間情報ＳＰ１では、エッジの一部のみを図示するが、例えばｋ近傍グラフ（k-nearest neighbor graph）であるものとする。なお、空間情報ＳＰ１は、種々のグラフであってもよい。 Note that in the example of FIG. 1, for convenience of illustration, only edges connecting the illustrated nodes are illustrated, but many edges other than the illustrated edges are included. In this way, the spatial information SP1 in FIG. 1 shows only a portion of the edges, but it is assumed that the spatial information SP1 in FIG. 1 is, for example, a k-nearest neighbor graph. Note that the spatial information SP1 may be various types of graphs.

また、グラフのエッジは、無向エッジに限らず、有向エッジであってもよい。有向エッジの場合、有向エッジの参照元となっているノードから参照先のノードへのみ辿ることができる。例えば、２つのノード間が、一方を参照元とし他方を参照先とする第１エッジ、及び一方を参照先とし他方を参照元とする第２エッジの２つの有向エッジで連結されている場合、その２つのノード間が無向（双方向）エッジで連結されて状態と同じ状態である。 Furthermore, the edges of the graph are not limited to undirected edges, but may be directed edges. In the case of a directed edge, it is possible to trace only from the node that is the reference source of the directed edge to the reference destination node. For example, if two nodes are connected by two directed edges: a first edge where one is the reference source and the other is the reference destination, and a second edge where one is the reference destination and the other is the reference source. , the two nodes are connected by an undirected (bidirectional) edge and have the same state.

ここから、図１を用いて検索処理について説明する。図１の例では、情報処理装置１００は、空間情報ＳＰ１に示すようなグラフＧＲ１を取得済みであるものとする。なお、情報処理装置１００は、種々の従来技術を適宜用いてグラフＧＲ１を生成してもよい。まず、検索処理の説明に先立ってベクトル（空間）の直積量子化について説明する。 The search process will now be described using FIG. 1. In the example of FIG. 1, it is assumed that the information processing apparatus 100 has already acquired a graph GR1 as shown in the spatial information SP1. Note that the information processing apparatus 100 may generate the graph GR1 using various conventional techniques as appropriate. First, prior to explaining the search process, vector (space) product quantization will be explained.

図１では、空間情報ＳＰ１に示すように直積量子化によりベクトル（空間）が４つの部分空間ＡＲ１１～ＡＲ１４に分割された状態を概念的に図示する。図１に示す部分空間ＡＲ１１～ＡＲ１４は、各ノード（オブジェクト）のベクトル間の距離等の説明のための概念的な図で示すが、各部分空間ＡＲ１１～ＡＲ１４は多次元空間となる。例えば、図１に示す部分空間ＡＲ１１は、平面上に図示するため２次元の態様にて図示されるが、例えば１００次元や１０００次元等の多次元空間であるものとする。 FIG. 1 conceptually illustrates a state in which a vector (space) is divided into four subspaces AR11 to AR14 by direct product quantization as shown in spatial information SP1. The subspaces AR11 to AR14 shown in FIG. 1 are shown as conceptual diagrams for explaining distances between vectors of nodes (objects), etc., but each of the subspaces AR11 to AR14 is a multidimensional space. For example, the subspace AR11 shown in FIG. 1 is illustrated in a two-dimensional manner because it is illustrated on a plane, but it is assumed that it is a multidimensional space, such as a 100-dimensional or 1000-dimensional space.

各部分空間ＡＲ１１～ＡＲ１４の各々は、４分割されるベクトルの各分割（部分ベクトル）に対応する空間を示す。なお、例えば１００次元のベクトルを１００分割しても良い。例えば、部分空間ＡＲ１１は、ベクトルが４分割された４つの部分ベクトル（「サブベクトル」ともいう）のうち、先頭の部分ベクトル（「第１サブベクトル」ともいう）に対応する次元の空間（「第１サブ空間」ともいう）を示す。部分空間ＡＲ１１は、ベクトルの先頭の分割位置に対応する部分空間（第１サブ空間）を示す。検索クエリＱＥ１の場合、部分空間ＡＲ１１は、先頭の部分ベクトル（第１サブベクトル）である第１部分クエリＱＥ１－１に対応する空間である。 Each of the partial spaces AR11 to AR14 indicates a space corresponding to each division (partial vector) of the vector divided into four. Note that, for example, a 100-dimensional vector may be divided into 100 parts. For example, the subspace AR11 is a dimension space (" (also referred to as "first subspace"). Subspace AR11 indicates a subspace (first subspace) corresponding to the first division position of the vector. In the case of search query QE1, subspace AR11 is a space corresponding to first subquery QE1-1, which is the first subvector (first subvector).

また、部分空間ＡＲ１２は、ベクトルが４分割された４つの部分ベクトルのうち、先頭から２番目の部分ベクトル（「第２サブベクトル」ともいう）に対応する次元の空間（「第２サブ空間」ともいう）を示す。部分空間ＡＲ１２は、ベクトルの先頭から２番目の分割位置に対応する部分空間（第２サブ空間）を示す。検索クエリＱＥ１の場合、部分空間ＡＲ１２は、先頭から２番目の部分ベクトル（第２サブベクトル）である第２部分クエリＱＥ１－２に対応する空間である。 Furthermore, the subspace AR12 is a dimensional space ("second subspace") corresponding to the second subvector from the beginning (also referred to as "second subvector") among the four subvectors obtained by dividing the vector into four. ). Subspace AR12 indicates a subspace (second subspace) corresponding to the second division position from the beginning of the vector. In the case of search query QE1, subspace AR12 is a space corresponding to second subquery QE1-2, which is the second subvector (second subvector) from the beginning.

また、部分空間ＡＲ１３は、ベクトルが４分割された４つの部分ベクトルのうち、先頭から３番目の部分ベクトル（「第３サブベクトル」ともいう）に対応する次元の空間（「第３サブ空間」ともいう）を示す。部分空間ＡＲ１３は、ベクトルの先頭から３番目の分割位置に対応する部分空間（第３サブ空間）を示す。検索クエリＱＥ１の場合、部分空間ＡＲ１３は、先頭から３番目の部分ベクトル（第３サブベクトル）である第３部分クエリＱＥ１－３に対応する空間である。 Further, the subspace AR13 is a dimensional space ("third subspace") corresponding to the third subvector from the beginning (also referred to as "third subvector") among the four subvectors obtained by dividing the vector into four. ). Subspace AR13 indicates a subspace (third subspace) corresponding to the third division position from the beginning of the vector. In the case of search query QE1, subspace AR13 is a space corresponding to third partial query QE1-3, which is the third partial vector (third subvector) from the beginning.

また、部分空間ＡＲ１４は、ベクトルが４分割された４つの部分ベクトルのうち、先頭から４番目（すなわち最後尾）の部分ベクトル（「第４サブベクトル」ともいう）に対応する次元の空間（「第４サブ空間」ともいう）を示す。部分空間ＡＲ１４は、ベクトルの先頭から４番目の分割位置に対応する部分空間（第４サブ空間）を示す。検索クエリＱＥ１の場合、部分空間ＡＲ１４は、先頭から４番目の部分ベクトル（第４サブベクトル）である第４部分クエリＱＥ１－４に対応する空間である。 Furthermore, the subspace AR14 is a dimension space (" 4th subspace). Subspace AR14 indicates a subspace (fourth subspace) corresponding to the fourth division position from the beginning of the vector. In the case of search query QE1, subspace AR14 is a space corresponding to fourth subquery QE1-4, which is the fourth subvector (fourth subvector) from the beginning.

なお、図１の例では、部分空間ＡＲ１１～ＡＲ１４を類似の形状で示すが、各部分空間ＡＲ１１～ＡＲ１４の形状は異なってもよいし、また各部分空間ＡＲ１１～ＡＲ１４における領域の分割態様も異なってもよい。また、ノード間のエッジによる接続関係は部分空間ＡＲ１１～ＡＲ１４で共通であるものとする。すなわち、グラフＧＲ１は、空間情報ＳＰ１に対応するグラフであり、部分空間ＡＲ１１～ＡＲ１４で共通である。 In the example of FIG. 1, the subspaces AR11 to AR14 are shown with similar shapes, but the shapes of the subspaces AR11 to AR14 may be different, and the manner in which the regions in each of the subspaces AR11 to AR14 are divided may also be different. It's okay. Further, it is assumed that the connection relationship between nodes by edges is common to subspaces AR11 to AR14. That is, the graph GR1 is a graph corresponding to the spatial information SP1, and is common to the subspaces AR11 to AR14.

また、図１の例では、分割された部分ベクトル（サブベクトル）ごとにルックアップテーブルが生成される。例えば、第１サブベクトル、第２サブベクトル、第３サブベクトル、及び第４サブベクトルの４つのサブベクトルごとにクラスタリングされ、個別にルックアップテーブル（コードブック情報）が生成される。 Furthermore, in the example of FIG. 1, a lookup table is generated for each divided partial vector (subvector). For example, clustering is performed for each of four subvectors: a first subvector, a second subvector, a third subvector, and a fourth subvector, and a lookup table (codebook information) is generated individually.

図１では、第１サブベクトルについては、部分空間ＡＲ１１に示すように、コードブックＣＤ１１～ＣＤ１９に対応する９個のグループにクラスタリングされ、コードブックＣＤ１１～ＣＤ１９の各々に対応するベクトルが代表ベクトル（セントロイド）として算出される。例えば、ノードＮ７、Ｎ９等の第１サブベクトルはコードブックＣＤ１１に対応するグループにクラスタリングされることを示す。この場合、ノードＮ７、Ｎ９等の第１サブベクトルは距離の計算（算出）において、第１サブベクトルに対応するコードブック情報（「第１コードブック情報」ともいう）を用いて、コードブックＣＤ１１のベクトルにベクトル量子化される。 In FIG. 1, the first subvectors are clustered into nine groups corresponding to codebooks CD11 to CD19, as shown in subspace AR11, and the vectors corresponding to each of codebooks CD11 to CD19 are representative vectors ( centroid). For example, the first subvectors such as nodes N7 and N9 indicate that they are clustered into a group corresponding to codebook CD11. In this case, the first subvectors such as nodes N7 and N9 use the codebook information (also referred to as "first codebook information") corresponding to the first subvector in the distance calculation (calculation). is vector quantized into a vector.

また、第２サブベクトルについては、コードブックＣＤ２１～ＣＤ２４等（図２、図９参照）に対応する複数のグループにクラスタリングされ、コードブックＣＤ２１～ＣＤ２４等の各々に対応するベクトルが代表ベクトル（セントロイド）として算出される。この場合、各ノードの第２サブベクトルは距離の計算（算出）において、第２サブベクトルに対応するコードブック情報（「第２コードブック情報」ともいう）を用いて、対応するコードブックのベクトルにベクトル量子化される。 Furthermore, the second sub-vectors are clustered into a plurality of groups corresponding to codebooks CD21 to CD24, etc. (see FIGS. 2 and 9), and the vectors corresponding to each of codebooks CD21 to CD24, etc. are representative vectors (centre Lloyd's). In this case, the second subvector of each node is calculated using the codebook information (also referred to as "second codebook information") corresponding to the second subvector in distance calculation. vector quantized.

第３サブベクトルについては、コードブックＣＤ３１～ＣＤ３４等（図２、図９参照）に対応する複数のグループにクラスタリングされ、コードブックＣＤ３１～ＣＤ３４等の各々に対応するベクトルが代表ベクトル（セントロイド）として算出される。この場合、各ノードの第３サブベクトルは距離の計算（算出）において、第３サブベクトルに対応するコードブック情報（「第３コードブック情報」ともいう）を用いて、対応するコードブックのベクトルにベクトル量子化される。 The third sub-vectors are clustered into multiple groups corresponding to codebooks CD31 to CD34, etc. (see FIGS. 2 and 9), and the vectors corresponding to each of codebooks CD31 to CD34, etc. are representative vectors (centroids). It is calculated as In this case, the third subvector of each node is calculated using the codebook information (also referred to as "third codebook information") corresponding to the third subvector in distance calculation. vector quantized.

第４サブベクトルについては、コードブックＣＤ４１～ＣＤ４４等（図２、図９参照）に対応する複数のグループにクラスタリングされ、コードブックＣＤ２４～ＣＤ４４等の各々に対応するベクトルが代表ベクトル（セントロイド）として算出される。この場合、各ノードの第４サブベクトルは距離の計算（算出）において、第４サブベクトルに対応するコードブック情報（「第４コードブック情報」ともいう）を用いて、対応するコードブックのベクトルにベクトル量子化される。 The fourth sub-vectors are clustered into multiple groups corresponding to codebooks CD41 to CD44, etc. (see FIGS. 2 and 9), and the vectors corresponding to each of codebooks CD24 to CD44, etc. are representative vectors (centroids). It is calculated as In this case, the fourth subvector of each node is calculated using the codebook information (also referred to as "fourth codebook information") corresponding to the fourth subvector in distance calculation. vector quantized.

なお、代表ベクトルを求める処理等、コードブック情報の生成は、種々の技術を適宜用いて生成される。コードブック情報の生成については従来技術であるため詳細な説明を省略する。また、ルックアップテーブルについては上記に限らず、例えば、分割された部分ベクトル（サブベクトル）全体で１つのルックアップテーブル（コードブック情報）を用いてもよいが、この点については後述する。 Note that the codebook information, such as the process of determining the representative vector, is generated using various techniques as appropriate. Since generation of codebook information is a conventional technique, detailed explanation will be omitted. Further, the lookup table is not limited to the above, and for example, one lookup table (codebook information) may be used for all divided partial vectors (subvectors), but this point will be described later.

ここから、検索クエリＱＥ１を対象とする検索処理を説明する。まず、情報処理装置１００は、検索クエリＱＥ１を取得する（ステップＳ１１）。例えば、情報処理装置１００は、ユーザが利用する端末装置１０（図４参照）から検索クエリＱＥ１を取得する。 From here, a search process targeting the search query QE1 will be explained. First, the information processing device 100 obtains the search query QE1 (step S11). For example, the information processing device 100 acquires the search query QE1 from the terminal device 10 (see FIG. 4) used by the user.

そして、情報処理装置１００は、検索クエリＱＥ１であるベクトルを４分割する。すなわち、情報処理装置１００は、検索クエリＱＥ１を４つの部分クエリに分割する。図１では、情報処理装置１００は、検索クエリＱＥ１を、第１サブベクトルである第１部分クエリＱＥ１－１、第２サブベクトルである第２部分クエリＱＥ１－２、第３サブベクトルである第３部分クエリＱＥ１－３、及び第４サブベクトルである第４部分クエリＱＥ１－４との４つのサブベクトルに分割する。具体的には、情報処理装置１００は、「４５，２３，２…」を第１部分クエリＱＥ１－１とし、「１２７，３４，５…」を第２部分クエリＱＥ１－２とし、「２０，９８，１１０…」を第３部分クエリＱＥ１－３とし、「１２，４５，４…」を第４部分クエリＱＥ１－４とする。 Then, the information processing device 100 divides the vector that is the search query QE1 into four. That is, the information processing device 100 divides the search query QE1 into four partial queries. In FIG. 1, the information processing device 100 divides the search query QE1 into a first partial query QE1-1 which is a first subvector, a second partial query QE1-2 which is a second subvector, and a second partial query QE1-2 which is a third subvector. It is divided into four subvectors: 3 partial queries QE1-3 and a fourth partial query QE1-4, which is a fourth subvector. Specifically, the information processing device 100 sets "45, 23, 2..." as a first partial query QE1-1, "127, 34, 5..." as a second partial query QE1-2, and sets "20, 98, 110...'' is the third partial query QE1-3, and ``12, 45, 4...'' is the fourth partial query QE1-4.

そして、情報処理装置１００は、検索クエリＱＥ１の各サブベクトルと、そのサブベクトルに対応するコードブックのベクトルとの間の距離を算出する。情報処理装置１００は、図２のコードブック情報ＴＢ１～ＴＢ４に示すように、検索クエリＱＥ１の各サブベクトルと各コードブックのベクトルとの間の距離（差分）を算出する。図２は、第１の実施形態に係るデータの一例を示す図である。 Then, the information processing device 100 calculates the distance between each subvector of the search query QE1 and the vector of the codebook corresponding to that subvector. The information processing device 100 calculates the distance (difference) between each subvector of the search query QE1 and the vector of each codebook, as shown in codebook information TB1 to TB4 in FIG. 2. FIG. 2 is a diagram showing an example of data according to the first embodiment.

コードブック情報ＴＢ１は、第１サブベクトルに対応する第１コードブック情報を示し、情報処理装置１００は、第１サブベクトルに対応するコードブックＣＤ１１～ＣＤ１９の各々のベクトルと、第１部分クエリＱＥ１－１との間の距離を算出する。図２では、情報処理装置１００は、コードブック情報ＴＢ１に示すように、コードブックＣＤ１１と第１部分クエリＱＥ１－１との間の距離を距離ＤＳ１１と算出する。同様に、情報処理装置１００は、コードブックＣＤ１２～ＣＤ１４等の各々と第１部分クエリＱＥ１－１との間の距離を距離ＤＳ１２～ＤＳ１４等と算出する。 The codebook information TB1 indicates first codebook information corresponding to the first subvector, and the information processing apparatus 100 stores each vector of the codebooks CD11 to CD19 corresponding to the first subvector and the first partial query QE1. -1. In FIG. 2, the information processing device 100 calculates the distance between the codebook CD11 and the first partial query QE1-1 as a distance DS11, as shown in codebook information TB1. Similarly, the information processing device 100 calculates the distances between each of the codebooks CD12 to CD14, etc. and the first partial query QE1-1 as distances DS12 to DS14, etc.

同様に、コードブック情報ＴＢ２～ＴＢ４の各々は、第２～第４サブベクトルの各々に対応する第２～第４コードブック情報を示す。情報処理装置１００は、コードブック情報ＴＢ２に示すように、コードブックＣＤ２１～ＣＤ２４等の各々と第２部分クエリＱＥ１－２との間の距離を距離ＤＳ２１～ＤＳ２４等と算出する。情報処理装置１００は、コードブック情報ＴＢ３に示すように、コードブックＣＤ３１～ＣＤ３４等の各々と第３部分クエリＱＥ１－３との間の距離を距離ＤＳ３１～ＤＳ３４等と算出する。情報処理装置１００は、コードブック情報ＴＢ４に示すように、コードブックＣＤ４１～ＣＤ４４等の各々と第４部分クエリＱＥ１－４との間の距離を距離ＤＳ４１～ＤＳ４４等と算出する。 Similarly, each of codebook information TB2 to TB4 indicates second to fourth codebook information corresponding to each of the second to fourth subvectors. The information processing device 100 calculates the distances between each of the codebooks CD21 to CD24, etc. and the second partial query QE1-2 as distances DS21 to DS24, etc., as shown in the codebook information TB2. The information processing device 100 calculates the distances between each of the codebooks CD31 to CD34, etc. and the third partial query QE1-3 as distances DS31 to DS34, etc., as shown in the codebook information TB3. The information processing device 100 calculates the distances between each of the codebooks CD41 to CD44, etc. and the fourth partial query QE1-4 as distances DS41 to DS44, etc., as shown in codebook information TB4.

なお、図２では説明のため抽象的に示すが、距離ＤＳ１１～ＤＳ４４等は具体的な値であるものとする。距離は浮動小数点で表される値であるが、後述のＳＩＭＤ（Single Instruction, Multiple Data）による高速化のためにscale-offset-compression（「https://www.unidata.ucar.edu/blogs/developer/entry/compression_by_scaling_and_offfset」等参照）により１バイト整数型に圧縮してもよい。情報処理装置１００は、各コードブックとクエリとの間の距離算出を、検索クエリＱＥ１を取得したタイミングで行ってもよいし、その情報が必要になったタイミングで行ってもよい。情報処理装置１００は、検索処理において、コードブック情報ＴＢ１～ＴＢ４をルックアップテーブルとして用いて、各ノードと検索クエリＱＥ１との間の距離、すなわち近似距離（「第１距離」ともいう）を算出する。このように、情報処理装置１００は、検索処理において、グラフを探索する際には、各ノードと検索クエリとの間の真の距離（「第２距離」ともいう）ではなく、近似距離（第１距離）を計算し処理を行うことにより、効率的な検索処理を可能にすることができる。 Although the distances DS11 to DS44 and the like are shown abstractly in FIG. 2 for the sake of explanation, it is assumed that the distances DS11 to DS44, etc. are concrete values. The distance is a value expressed as a floating point number, but scale-offset-compression (https://www.unidata.ucar.edu/blogs/ developer/entry/compression_by_scaling_and_offfset" etc.) may be used to compress it into a 1-byte integer type. The information processing apparatus 100 may calculate the distance between each codebook and the query at the timing when the search query QE1 is acquired, or at the timing when the information becomes necessary. In the search process, the information processing device 100 uses the codebook information TB1 to TB4 as a lookup table to calculate the distance between each node and the search query QE1, that is, the approximate distance (also referred to as "first distance"). do. In this way, when searching a graph in the search process, the information processing device 100 uses approximate distances (also referred to as "second distances") rather than true distances (also referred to as "second distances") between each node and the search query. 1 distance) and perform the processing, it is possible to perform efficient search processing.

情報処理装置１００は、検索クエリＱＥ１を対象とする検索処理を実行する（ステップＳ１２）。情報処理装置１００は、検索クエリＱＥ１を対象として、グラフＧＲ１を用いた図１１に示すような検索処理を行うことにより、検索クエリＱＥ１の検索結果を得る。図１１に示す検索処理についての詳細は後述する。情報処理装置１００は、検索クエリＱＥ１を対象として検索処理を行うことにより、検索数のノードを検索クエリＱＥ１の近傍のノードとして抽出する。 The information processing device 100 executes a search process targeting the search query QE1 (step S12). The information processing device 100 obtains a search result for the search query QE1 by performing a search process as shown in FIG. 11 using the graph GR1 for the search query QE1. Details of the search process shown in FIG. 11 will be described later. The information processing device 100 performs a search process on the search query QE1, thereby extracting the searched nodes as nodes in the vicinity of the search query QE1.

情報処理装置１００は、検索クエリＱＥ１を対象とする検索処理において、各ノードのうち、所定のノードをグラフＧＲ１の検索の開始点（起点）となるノード（以下「起点ノード」ともいう）として選択する。例えば、情報処理装置１００は、起点ノードを、木構造のインデックス等の所定のインデックスを用いて選択する。図１の例では、情報処理装置１００は、起点ノードとして、ノードＮ７を選択するものとする。なお、図１では説明を簡単にするために、所定のインデックスを用いてノードＮ７のみを起点ノードとして選択する場合を示すが、情報処理装置１００は、複数のノードを起点ノードとして選択してもよいし、ランダム等の様々な方法により起点ノードを選択してもよい。 In the search process targeting the search query QE1, the information processing device 100 selects a predetermined node from among the nodes as a node (hereinafter also referred to as "starting point node") that becomes the starting point (starting point) of the search for the graph GR1. do. For example, the information processing apparatus 100 selects the starting node using a predetermined index such as a tree-structured index. In the example of FIG. 1, it is assumed that the information processing apparatus 100 selects node N7 as the starting point node. Note that although FIG. 1 shows a case where only node N7 is selected as the starting point node using a predetermined index to simplify the explanation, the information processing apparatus 100 can also select multiple nodes as the starting point node. Alternatively, the starting node may be selected by various methods such as random.

ここで、情報処理装置１００は、検索クエリＱＥ１を対象とする検索処理において、一のノードからのエッジが連結されたノード（以下「接続ノード」ともいう）と検索クエリとの距離を並列化して算出する（ステップＳ１３）。図１では、情報処理装置１００は、一括処理情報ＬＴ１に示すように、ノードＮ７からのエッジが連結されたノード（接続ノード）であるノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５等については、検索クエリＱＥ１との距離を並列化して算出する。 Here, in the search process targeting the search query QE1, the information processing device 100 parallelizes the distance between the search query and a node to which edges from one node are connected (hereinafter also referred to as "connection node"). Calculate (step S13). In FIG. 1, the information processing device 100 uses the search query QE1 for nodes N9, N12, N54, N85, etc., which are nodes (connection nodes) to which edges from the node N7 are connected, as shown in the batch processing information LT1. Calculate the distance between the two in parallel.

例えば、ノードＮ９は、ノード情報ＩＮＦ１に示すように、第１サブベクトルがコードブックＣＤ１２、第２サブベクトルがコードブックＣＤ２３、第３サブベクトルがコードブックＣＤ３５、及び第４サブベクトルがコードブックＣＤ４７に対応付けられていることを示す。なお、変数CDのサイズはコードブックのサイズにより決定されるので、例えばコードブックサイズが16であれば、変数CDは4ビットで良いことになり、大幅なデータの圧縮が可能である。そのため、情報処理装置１００は、コードブックＣＤ１２の距離ＤＳ１２、コードブックＣＤ２３の距離ＤＳ２３、コードブックＣＤ３５の距離ＤＳ３５、及びコードブックＣＤ４７の距離ＤＳ４７を用いて、ノードＮ９と、検索クエリＱＥ１との距離を算出する。例えば、情報処理装置１００は、コードブックＣＤ１２の距離ＤＳ１２、コードブックＣＤ２３の距離ＤＳ２３、コードブックＣＤ３５の距離ＤＳ３５、及びコードブックＣＤ４７の距離ＤＳ４７の合計を、ノードＮ９と検索クエリＱＥ１との距離として算出する。 For example, in node N9, as shown in node information INF1, the first subvector is codebook CD12, the second subvector is codebook CD23, the third subvector is codebook CD35, and the fourth subvector is codebook CD47. Indicates that it is associated with. Note that the size of the variable CD is determined by the size of the codebook, so if the codebook size is 16, for example, the variable CD only needs to be 4 bits, making it possible to significantly compress data. Therefore, the information processing device 100 uses the distance DS12 of the codebook CD12, the distance DS23 of the codebook CD23, the distance DS35 of the codebook CD35, and the distance DS47 of the codebook CD47 to determine the distance between the node N9 and the search query QE1. Calculate. For example, the information processing device 100 calculates the sum of the distance DS12 of the codebook CD12, the distance DS23 of the codebook CD23, the distance DS35 of the codebook CD35, and the distance DS47 of the codebook CD47 as the distance between the node N9 and the search query QE1. calculate.

例えば、ノードＮ１２は、ノード情報ＩＮＦ２に示すように、第１サブベクトルがコードブックＣＤ１４、第２サブベクトルがコードブックＣＤ２９、第３サブベクトルがコードブックＣＤ３１、及び第４サブベクトルがコードブックＣＤ４５に対応付けられていることを示す。例えば、情報処理装置１００は、コードブックＣＤ１４の距離ＤＳ１４、コードブックＣＤ２９の距離ＤＳ２９、コードブックＣＤ３１の距離ＤＳ３１、及びコードブックＣＤ４５の距離ＤＳ４５の合計を、ノードＮ１２と検索クエリＱＥ１との距離として算出する。同様に、情報処理装置１００は、ノードＮ５４のノード情報ＩＮＦ３を用いて、ノードＮ５４と検索クエリＱＥ１との距離を算出し、ノードＮ８５のノード情報ＩＮＦ４を用いて、ノードＮ８５と検索クエリＱＥ１との距離を算出する。 For example, in the node N12, as shown in the node information INF2, the first subvector is codebook CD14, the second subvector is codebook CD29, the third subvector is codebook CD31, and the fourth subvector is codebook CD45. Indicates that it is associated with. For example, the information processing device 100 calculates the sum of the distance DS14 of the codebook CD14, the distance DS29 of the codebook CD29, the distance DS31 of the codebook CD31, and the distance DS45 of the codebook CD45 as the distance between the node N12 and the search query QE1. calculate. Similarly, the information processing device 100 uses the node information INF3 of the node N54 to calculate the distance between the node N54 and the search query QE1, and uses the node information INF4 of the node N85 to calculate the distance between the node N85 and the search query QE1. Calculate distance.

例えば、情報処理装置１００は、ＳＩＭＤの演算に関する並列化により、ノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５の各々と検索クエリＱＥ１との距離を一括して算出する。これにより、情報処理装置１００は、距離計算を並列化することにより、距離計算を高速化することでき、効率的な検索処理を可能にすることができる。 For example, the information processing device 100 calculates the distance between each of the nodes N9, N12, N54, and N85 and the search query QE1 at once by parallelizing SIMD calculations. Thereby, the information processing device 100 can speed up the distance calculation by parallelizing the distance calculation, and can perform efficient search processing.

なお、図１では説明を簡単にするために、４つのノードを対象として距離を並列化して算出する場合を示すが、並列化される数は、情報処理装置１００の仕様に基づいて決定される。例えば、情報処理装置１００がＳＩＭＤにより一括して処理できる数（「一括処理可能単位」ともいう）が「４」である場合、図１と同様の処理となるが、ＳＩＭＤにより一括して処理できる数（一括処理可能単位）が「１６」である場合、情報処理装置１００は、１６個のノードを対象として距離を並列化して算出する。また、一括処理可能単位が「３２」である場合、情報処理装置１００は、３２個のノードを対象として距離を並列化して算出する。このように、情報処理装置１００は、ＳＩＭＤにより一括して処理できる数（一括処理可能単位）に対応する数のノードの距離計算を一括して行うことにより、効率的な検索処理を可能にすることができる。 Note that, in order to simplify the explanation, FIG. 1 shows a case where the distance is computed in parallel for four nodes, but the number to be computed in parallel is determined based on the specifications of the information processing device 100. . For example, if the number that the information processing device 100 can process at once using SIMD (also referred to as “unit that can be processed at once”) is “4”, the processing will be similar to that shown in FIG. 1, but it will be possible to process at once using SIMD. When the number (unit that can be collectively processed) is "16", the information processing apparatus 100 calculates distances in parallel for 16 nodes. Further, when the batch processable unit is "32", the information processing apparatus 100 calculates distances in parallel for 32 nodes. In this way, the information processing device 100 enables efficient search processing by collectively calculating the distances of the number of nodes corresponding to the number that can be processed at once by SIMD (batch processable unit). be able to.

情報処理装置１００は、上記のように複数のノードの距離計算を並列化して行いながら、検索クエリＱＥ１を対象として、グラフＧＲ１を用いた図１１に示すような検索処理を行うことにより、検索数のノードを検索クエリＱＥ１の近傍のノードとして抽出する。 The information processing device 100 calculates the number of searches by performing a search process as shown in FIG. The node is extracted as a node in the vicinity of the search query QE1.

上述のように、情報処理装置１００は、ベクトル量子化された各ノードのベクトルと検索クエリとの間の近似距離（第１距離）を算出して、第１距離を用いて検索処理を行う事により、効率的な検索処理を可能にすることができる。また、情報処理装置１００は、並列化可能な数のノードの距離計算を一括して行うことにより、効率的な検索処理を可能にすることができる。上記のように、情報処理装置１００は、グラフのノードの複数の接続ノードとの近似距離を一括して計算することにより、限定されたメモリ領域（ルックアップテーブル）を繰り返し利用する（メモリキャッシュにのる）ので高速化が可能である。 As described above, the information processing device 100 calculates the approximate distance (first distance) between the vector quantized vector of each node and the search query, and performs the search process using the first distance. This enables efficient search processing. Furthermore, the information processing apparatus 100 can perform efficient search processing by performing distance calculations for a number of nodes that can be parallelized at once. As described above, the information processing device 100 repeatedly uses a limited memory area (lookup table) (memory cache) by collectively calculating approximate distances between nodes of the graph and multiple connected nodes. speed).

また、図１の例では、情報処理装置１００は、直積量子化により分割したベクトルを用いて検索処理を行うことにより、ベクトルを分割せずに検索処理を行う場合に比べて、より効率的な検索処理を可能にすることができる。例えば、情報処理装置１００は、直積量子化により短いベクトル間の距離を算出することとなり、距離計算の対象となるベクトルのサイズを小さくできる。また、情報処理装置１００は、距離計算時に利用するルックアップテーブルによりアクセスするメモリ空間を、分割されたサブベクトルのルックアップテーブルに限定することができる。情報処理装置１００は、直積量子化により検索精度の低下を抑制しつつ、ベクトルサイズを削減することができる。なお、図１では、直積量子化が行われた場合の処理を説明したが、直積量子化が行われた場合は一例に過ぎず、情報処理装置１００は、直積量子化を行われてないベクトルを用いて検索処理を行ってもよい。 In addition, in the example of FIG. 1, the information processing device 100 performs search processing using vectors divided by direct product quantization, thereby achieving more efficient search processing than when performing search processing without dividing vectors. Search processing can be enabled. For example, the information processing device 100 calculates distances between short vectors by direct product quantization, and can reduce the size of vectors that are targets of distance calculation. Further, the information processing apparatus 100 can limit the memory space accessed by the lookup table used during distance calculation to the lookup table of divided subvectors. The information processing apparatus 100 can reduce the vector size while suppressing a decrease in search accuracy by using Cartesian product quantization. In addition, in FIG. 1, the processing in the case where Cartesian product quantization is performed has been described, but the case where Cartesian product quantization is performed is only an example, and the information processing device 100 can process vectors that have not been subjected to Cartesian product quantization. The search process may be performed using .

〔１－１－１．その他〕
上述した処理は一例に過ぎず、情報処理装置１００は、効率的な検索処理の為に様々な情報や手法を用いて、検索処理を行ってもよい。この点について、各事項について詳述する。 [1-1-1. others〕
The above-described processing is merely an example, and the information processing apparatus 100 may perform search processing using various information and methods for efficient search processing. Regarding this point, each item will be explained in detail.

（エッジ（接続ノード）の格納態様）
各ノードに連結されるエッジ（接続ノード）の情報について、ノードでのエッジ（接続ノード）の格納態様（格納方法）には、例えば以下の第１格納態様及び第２格納態様の２種類が考えられるため、格納態様については、利用形態によって選択されてもよい。 (Storage mode of edges (connection nodes))
Regarding the information of edges (connection nodes) connected to each node, there are two types of storage modes (storage methods) for edges (connection nodes) in nodes: the following first storage mode and second storage mode. Therefore, the storage mode may be selected depending on the usage mode.

例えば、第１格納態様としては、各ノードには接続ノード（オブジェクト）のＩＤを対応付けて格納し、別のテーブルに直積量子化されたオブジェクトの情報を格納してもよい。例えば、図７及び図８に示すデータの格納態様が第１格納態様に対応する。第１格納態様の場合、メモリ使用量を削減できる。 For example, as a first storage mode, each node may be stored in association with the ID of the connected node (object), and information on objects subjected to product quantization may be stored in another table. For example, the data storage format shown in FIGS. 7 and 8 corresponds to the first storage format. In the case of the first storage mode, memory usage can be reduced.

また、例えば、第２格納態様としては、各ノードに直に量子化されたオブジェクトを持つ。第２格納態様の場合、あるノードの接続ノード（オブジェクト）が量子化された情報を各ノードに対応付けて記憶する。例えば、ノードＮ７の接続ノードであるノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５の各々の量子化された情報（図２中のノード情報ＩＮＦ１～ＩＮＦ４）がノードＮ７に対応付けて記憶されるデータの格納態様が第２格納態様に対応する。第２格納態様の場合、検索時にシーケンシャルにオブジェクトをアクセスするため速度低下が発生を抑制することができる。 Furthermore, for example, as a second storage mode, each node has a quantized object directly. In the case of the second storage mode, information obtained by quantizing connection nodes (objects) of a certain node is stored in association with each node. For example, a data storage mode in which quantized information (node information INF1 to INF4 in FIG. 2) of nodes N9, N12, N54, and N85, which are connected nodes of node N7, is stored in association with node N7. corresponds to the second storage mode. In the case of the second storage mode, since objects are accessed sequentially during a search, a decrease in speed can be suppressed.

（ルックアップテーブル）
上述した例では、分割したサブベクトルごとに個別にクラスタリングし、個々にルックアップテーブル（コードブック情報）を生成する場合を示したが、この場合に限らず、ルックアップテーブルについては任意の態様であってもよい。 (lookup table)
In the above example, each divided subvector is individually clustered and a lookup table (codebook information) is generated individually, but the lookup table is not limited to this case. There may be.

例えば、全てのサブベクトルに対してクラスタリングし、一つのルックアップテーブルが生成されてもよい。上述した例では、例えば、第１サブベクトル、第２サブベクトル、第３サブベクトル、及び第４サブベクトルの４つのサブベクトル全体を対象にクラスタリングし、一つのルックアップテーブルが生成されてもよい。 For example, all subvectors may be clustered and one lookup table may be generated. In the above example, for example, one lookup table may be generated by clustering all four subvectors: the first subvector, the second subvector, the third subvector, and the fourth subvector. .

また、個々のサブベクトルを部分的にマージして複数のクラスごとに、ルックアップテーブルが生成されてもよい。例えば、分散が類似しているサブベクトルをマージして複数のクラスを形成し、複数のクラスごとに、ルックアップテーブルが生成されてもよい。上述した例では、例えば、第１サブベクトル及び第３サブベクトルの２つのサブベクトの分散が類似し、第２サブベクトル及び第４サブベクトルの２つのサブベクトの分散が類似している場合、第１サブベクトル及び第３サブベクトルをマージして第１クラスとし、第２サブベクトル及び第４サブベクトルをマージして第２クラスとしてもよい。この場合、第１クラスのルックアップテーブル（コードブック情報）と、第２クラスのルックアップテーブル（コードブック情報）との２つのルックアップテーブルが生成されてもよい。 Further, lookup tables may be generated for each of a plurality of classes by partially merging individual subvectors. For example, subvectors with similar variances may be merged to form multiple classes, and a lookup table may be generated for each of the multiple classes. In the above example, for example, if the variances of two subvectors, the first subvector and the third subvector, are similar, and the variances of the two subvectors, the second subvector and the fourth subvector, are similar, then the first The sub-vector and the third sub-vector may be merged to form the first class, and the second sub-vector and the fourth sub-vector may be merged to form the second class. In this case, two lookup tables, a first class lookup table (codebook information) and a second class lookup table (codebook information), may be generated.

（転置）
上述したデータの保持は一例に過ぎず、情報処理装置１００は、効率的なデータ参照等が可能となるように、種々の態様によりデータを保持してもよい。例えば、情報処理装置１００は、転置したデータを保持してもよい。この点について、図３を用いて説明する。図３は、第１の実施形態に係るデータの一例を示す図である。 (transposed)
The data retention described above is merely an example, and the information processing apparatus 100 may retain data in various ways so as to enable efficient data reference. For example, the information processing device 100 may hold transposed data. This point will be explained using FIG. 3. FIG. 3 is a diagram showing an example of data according to the first embodiment.

例えば、ノード情報ＩＮＦ１～ＩＮＦ４に示すようにデータを保持し、ノード情報ＩＮＦ１～ＩＮＦ４を順次処理した場合、図２のステップＳ１４に示すように、ルックアップテーブルであるコードブック情報ＴＢ１～ＴＢ４を繰り返し参照することとなる。 For example, if data is held as shown in node information INF1 to INF4 and node information INF1 to INF4 are sequentially processed, codebook information TB1 to TB4, which is a lookup table, is repeatedly processed as shown in step S14 in FIG. Please refer to it.

そこで、ステップＳ１５に示すように、ノード情報ＩＮＦ１～ＩＮＦ４を転置した転置データＴＲ１～ＴＲ４が生成される。例えば、情報処理装置１００は、ノード情報ＩＮＦ１～ＩＮＦ４を転置した転置データＴＲ１～ＴＲ４を生成する。 Therefore, as shown in step S15, transposed data TR1 to TR4 are generated by transposing the node information INF1 to INF4. For example, the information processing device 100 generates transposed data TR1 to TR4 by transposing node information INF1 to INF4.

図３では、ノード情報ＩＮＦ１～ＩＮＦ４のうち、ノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５の第１サブベクトルに対応するコードブックの一覧である第１データＴＲ１が生成される。具体的には、ノードＮ９の第１サブベクトルに対応するコードブックＣＤ１２、ノードＮ１２の第１サブベクトルに対応するコードブックＣＤ１４、ノードＮ５４の第１サブベクトルに対応するコードブックＣＤ１３、及びノードＮ８５の第１サブベクトルに対応するコードブックＣＤ１８の一覧である第１データＴＲ１が生成される。 In FIG. 3, first data TR1 is generated which is a list of codebooks corresponding to the first subvectors of nodes N9, N12, N54, and N85 among the node information INF1 to INF4. Specifically, codebook CD12 corresponding to the first subvector of node N9, codebook CD14 corresponding to the first subvector of node N12, codebook CD13 corresponding to the first subvector of node N54, and node N85. First data TR1, which is a list of codebook CD18 corresponding to the first subvector of , is generated.

同様に、ノード情報ＩＮＦ１～ＩＮＦ４のうち、ノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５の第２サブベクトルに対応するコードブックの一覧である第２データＴＲ２が生成される。また、ノード情報ＩＮＦ１～ＩＮＦ４のうち、ノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５の第３サブベクトルに対応するコードブックの一覧である第３データＴＲ３が生成される。ノード情報ＩＮＦ１～ＩＮＦ４のうち、ノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５の第４サブベクトルに対応するコードブックの一覧である第４データＴＲ４が生成される。 Similarly, second data TR2 is generated, which is a list of codebooks corresponding to the second subvectors of nodes N9, N12, N54, and N85 among the node information INF1 to INF4. Also, third data TR3 is generated, which is a list of codebooks corresponding to the third subvectors of nodes N9, N12, N54, and N85 among the node information INF1 to INF4. Fourth data TR4 is generated, which is a list of codebooks corresponding to the fourth subvectors of nodes N9, N12, N54, and N85 among the node information INF1 to INF4.

なお、各転置データＴＲ１～ＴＲ４の一覧のうち、何番目のデータがどのノードに対応するかの対応付けを示す対応付情報が生成される。図３の例では、各転置データＴＲ１～ＴＲ４の一覧のうち、１番目（最初）のデータがノードＮ９に対応し、２番目のデータがノードＮ１２に対応し、３番目のデータがノードＮ５４に対応し、４番目（最後）のデータがノードＮ８５に対応することを示す対応付情報が生成される。情報処理装置１００は、対応付情報を参照することにより、各転置データＴＲ１～ＴＲ４の一覧の各データが、どのノードに対応するかを特定することができる。例えば、情報処理装置１００は、対応付情報を生成する。 Note that correspondence information is generated that indicates the correspondence between which data in the list of each transposed data TR1 to TR4 corresponds to which node. In the example of FIG. 3, among the list of transposed data TR1 to TR4, the first data corresponds to node N9, the second data corresponds to node N12, and the third data corresponds to node N54. Correspondingly, association information indicating that the fourth (last) data corresponds to node N85 is generated. By referring to the correspondence information, the information processing device 100 can specify which node each data in the list of transposed data TR1 to TR4 corresponds to. For example, the information processing device 100 generates association information.

情報処理装置１００は、ノード情報ＩＮＦ１～ＩＮＦ４を転置した転置データＴＲ１～ＴＲ４を用いて処理を行う。例えば、情報処理装置１００は、転置データＴＲ１を用いてルックアップテーブルを参照する場合、コードブック情報ＴＢ１のみを参照することとなり、１つのコードブック情報ＴＢ１のみを参照することとなる。同様に、情報処理装置１００は、転置データＴＲ２を用いてルックアップテーブルを参照する場合、コードブック情報ＴＢ２のみを参照することとなり、１つのコードブック情報ＴＢ２のみを参照することとなる。転置データＴＲ３、ＴＲ４についても同様に１つのコードブック情報のみを参照することとなる。これにより、情報処理装置１００は、効率的にデータを参照することができるため、効率的な検索処理を可能にすることができる。 The information processing device 100 performs processing using transposed data TR1 to TR4 obtained by transposing node information INF1 to INF4. For example, when the information processing device 100 refers to the lookup table using the transposed data TR1, it refers only to the codebook information TB1, and refers to only one codebook information TB1. Similarly, when the information processing device 100 refers to the lookup table using the transposed data TR2, it refers only to the codebook information TB2, and refers to only one codebook information TB2. Similarly, only one codebook information is referred to for the transposed data TR3 and TR4. As a result, the information processing apparatus 100 can efficiently refer to data, thereby enabling efficient search processing.

このように、一括距離計算時にサブクラス（サブベクトル等）ごとのルックアップテーブルの参照するように、ノードの近傍オブジェクト（接続ノード）のデータを転置し、サブクラスごとにまとめ上げた順番でデータをノードにもつことで、参照の局所性がさらに高まり、高速化が可能となる。 In this way, in order to refer to the lookup table for each subclass (subvector, etc.) during batch distance calculation, the data of the neighboring objects (connected nodes) of the node is transposed, and the data is sent to the node in the order in which it is compiled for each subclass. By having both, the locality of reference is further increased and speeding up is possible.

なお、図３では説明を簡単にするために、４つのノードを対象として転置データを生成する場合を示すが、転置データを生成する単位（ノードの数）は、情報処理装置１００の仕様に基づいて決定される。例えば、情報処理装置１００の一括処理可能単位が「４」である場合、図３と同様の処理となるが、一括処理可能単位が「１６」である場合、１６個のノードを一つの単位として転置データが生成される。また、一括処理可能単位が「３２」である場合、３２個のノードを一つの単位として転置データが生成される。このように、情報処理装置１００がＳＩＭＤにより一括して処理できる数（一括処理可能単位）に応じて生成される転置データを用いることで、情報処理装置１００は、効率的にデータを参照することができるため、効率的な検索処理を可能にすることができる。 Note that, in order to simplify the explanation, FIG. 3 shows a case where transposed data is generated for four nodes, but the unit (number of nodes) in which transposed data is generated is based on the specifications of the information processing device 100. Determined by For example, if the batch processable unit of the information processing device 100 is "4", the process is similar to that shown in FIG. 3, but if the batch processable unit is "16", 16 nodes are treated as one unit. Transposed data is generated. Further, when the batch processable unit is "32", transposed data is generated with 32 nodes as one unit. In this way, by using the transposed data that is generated according to the number that the information processing apparatus 100 can process at once using SIMD (units that can be processed at once), the information processing apparatus 100 can refer to data efficiently. This makes it possible to perform efficient search processing.

例えば、上述のような検索処理時の時間の多くは距離計算であるが、距離計算は上記のようなＳＩＭＤによる並列計算により高速化を図ることができる。このように距離計算を並列化した場合、オブジェクトデータをフェッチする時間が検索処理を占めることになる。このフェッチ時間を削減することができれば、さらなる高速化が実現される。なお、データをフェッチする時間を削減するにはプリフェッチを行う方法があるが、限界がある。 For example, much of the time during the above-mentioned search process is spent on distance calculation, but distance calculation can be sped up by parallel calculation using SIMD as described above. When distance calculation is parallelized in this way, the time required to fetch object data occupies the search process. If this fetch time can be reduced, further speeding up can be achieved. Note that there is a method of prefetching to reduce the time required to fetch data, but there is a limit.

一方、情報処理装置１００は、上述のように参照の局所性を高め、効率的にデータを参照することを可能にすることにより、データのフェッチを抑制し、さらなる高速化を実現することができる。 On the other hand, the information processing device 100 can suppress data fetching and achieve further speedup by increasing the locality of reference and making it possible to efficiently refer to data as described above. .

（検索結果）
なお、上述した第２距離（真の距離）による検索結果を返す場合には、以下の第１の方法及び第２の方法の２つの方法が考えられる。 (search results)
Note that when returning search results based on the second distance (true distance) described above, the following two methods, the first method and the second method, can be considered.

例えば、第１の方法としては、検索数を指定された検索数より多く探索し、探索終了時に、真の距離計算を行って距離でソートし、指定された検索数のオブジェクトを検索結果としてもよい。この場合、情報処理装置１００は、指定された検索数（「第１数」ともいう）よりも多い数（「拡張検索数」ともいう）のノードを抽出するように、拡張検索数（「第２数」ともいう）を検索数として設定し、図１１に示すような検索処理を行うことにより、拡張検索数のノード、すなわち指定された検索数よりも多い数のノードを近傍候補ノードとして抽出する。そして、情報処理装置１００は、近傍候補ノードを対象として、第２距離（真の距離）を算出し、近傍候補ノードのうち、第２距離が短い方から第１数のノードを検索クエリの近傍のノードとして抽出する。 For example, the first method is to search for more than the specified number of searches, and at the end of the search, calculate the true distance, sort by distance, and select the specified number of objects as search results. good. In this case, the information processing device 100 extracts nodes whose number (also referred to as "extended search number") is greater than the specified number of searches (also referred to as "first number"). By setting the number of searches (also referred to as ``number 2'') as the search number and performing the search process as shown in Figure 11, nodes with the extended search number, that is, nodes whose number is greater than the specified number of searches, are extracted as nearby candidate nodes. do. Then, the information processing device 100 calculates a second distance (true distance) for the neighboring candidate nodes, and selects the first number of nodes from among the neighboring candidate nodes, starting from the one with the shortest second distance, in the vicinity of the search query. Extract as a node.

また、例えば、第２の方法としては、探索中に、第１距離（近似距離）に基づいて、ノード（オブジェクト）が探索範囲内または検索範囲内に入った場合のみに、第２距離（真の距離）を計算し、近似距離を真の距離で置き換えることで、真の距離による検索結果を返してもよい。 For example, as a second method, only when a node (object) is within the search range or within the search range based on the first distance (approximate distance), the second distance (true The search result based on the true distance may be returned by calculating the approximate distance (distance) and replacing the approximate distance with the true distance.

ここでいう、検索範囲は、図１１中の「ｒ」により規定される範囲であり、探索範囲は、図１１中の検索範囲係数「ε」を用いた「ｒ（１＋ε）」により規定される範囲である。例えば、検索範囲に入った場合に第２距離（真の距離）を計算する場合、情報処理装置１００は、ノードの第１距離（近似距離）が「ｒ」以下となった場合に、そのノードの第２距離（真の距離）を算出する。また、探索範囲に入った場合に第２距離（真の距離）を計算する場合、情報処理装置１００は、ノードの第１距離（近似距離）が「ｒ（１＋ε）」以下となった場合に、そのノードの第２距離（真の距離）を算出する。 The search range here is the range defined by "r" in FIG. 11, and the search range is defined by "r(1+ε)" using the search range coefficient "ε" in FIG. range. For example, when calculating the second distance (true distance) when the node enters the search range, the information processing device 100 calculates whether the node A second distance (true distance) is calculated. Further, when calculating the second distance (true distance) when the node enters the search range, the information processing device 100 calculates the second distance (true distance) when the first distance (approximate distance) of the node is equal to or less than “r(1+ε)”. , calculate the second distance (true distance) of that node.

例えば、精度を高める場合、探索範囲に入った場合を条件としてもよい。また、処理の高速化を求める場合、検索範囲に入った場合を条件としてもよい。なお、上記は一例に過ぎず、いずれを条件とするかは、検索範囲係数「ε」の値や処理の目的などに応じて適宜設定されてもよい。 For example, to improve accuracy, the condition may be set to be within the search range. Furthermore, when speeding up the processing is desired, the condition may be set to be within the search range. Note that the above is just an example, and which condition is to be used may be set as appropriate depending on the value of the search range coefficient "ε", the purpose of processing, etc.

（ベクトル量子化）
なお、情報処理装置１００は、特許文献３に示すように、２段階のベクトル量子化を行ってもよい。そして、情報処理装置１００は、下記の式（１）により、検索クエリと各ノードとの距離を算出してもよい。 (vector quantization)
Note that the information processing device 100 may perform two-stage vector quantization as shown in Patent Document 3. Then, the information processing device 100 may calculate the distance between the search query and each node using the following equation (1).

ここで、上記式（１）中の左辺の値は、例えば、検索クエリとノードとの間の二乗距離を示す。また、例えば、上記式（１）中の「ｘ」は、クエリに対応する。また、例えば、上記式（１）中の「ｙ」は、ノードに対応する。また、例えば、上記式（１）の右辺中の「ｑ_ｃ（ｙ）」は、「ｙ」の代表ベクトル（セントロイド）を示す。例えば、情報処理装置１００は、上記式（１）中の「ｙ」について、ノードのベクトルデータを有しない場合は、各ノードが属する部分領域のセントロイドの数値を用いてもよい。また、例えば、「ｙ－ｑ_ｃ（ｙ）」は、残差ベクトルを示す。また、例えば、上記式（１）の右辺中の「ｑ_ｐ」は、所定の量子化器（関数）を示す。 Here, the value on the left side of the above equation (1) indicates, for example, the square distance between the search query and the node. Further, for example, "x" in the above formula (1) corresponds to a query. Further, for example, "y" in the above formula (1) corresponds to a node. Further, for example, "q _c (y)" on the right side of the above equation (1) indicates a representative vector (centroid) of "y". For example, when the information processing apparatus 100 does not have node vector data for "y" in the above equation (1), the information processing apparatus 100 may use the numerical value of the centroid of the partial area to which each node belongs. Further, for example, "yq _c (y)" indicates a residual vector. Further, for example, “q _p ” on the right side of the above equation (1) indicates a predetermined quantizer (function).

また、例えば、上記式（１）の右辺中の「ｊ」は、分割された空間の数であってもよい。例えば、図１の例では、上記式（１）の右辺中の「ｊ」は、分割された空間の数「４」であってもよい。また、例えば、上記式（１）の右辺中の「ｕ_ｊ（）」は、括弧中のベクトル間の部分残差ベクトルを示す。例えば、情報処理装置１００は、上記式（１）を用いて、各部分空間におけるクエリとノードとの間の二乗距離を算出し、合算することにより、クエリとノードとの距離を算出してもよい。 Further, for example, "j" on the right side of the above equation (1) may be the number of divided spaces. For example, in the example of FIG. 1, "j" on the right side of the above equation (1) may be the number of divided spaces "4". Further, for example, "u _j ()" on the right side of the above equation (1) indicates a partial residual vector between the vectors in parentheses. For example, the information processing device 100 may calculate the distance between the query and the node by calculating the squared distance between the query and the node in each subspace using the above equation (1), and summing the squared distance between the query and the node. good.

〔１－２．情報処理システムの構成〕
図４に示すように、情報処理システム１は、端末装置１０と、情報提供装置５０と、情報処理装置１００とが含まれる。端末装置１０と、情報提供装置５０と、情報処理装置１００とは所定のネットワークＮを介して、有線または無線により通信可能に接続される。図４は、第１の実施形態に係る情報処理システムの構成例を示す図である。なお、図４に示した情報処理システム１には、複数台の端末装置１０や、複数台の情報提供装置５０や、複数台の情報処理装置１００が含まれてもよい。 [1-2. Information processing system configuration]
As shown in FIG. 4, the information processing system 1 includes a terminal device 10, an information providing device 50, and an information processing device 100. The terminal device 10, the information providing device 50, and the information processing device 100 are connected via a predetermined network N so that they can communicate by wire or wirelessly. FIG. 4 is a diagram illustrating a configuration example of an information processing system according to the first embodiment. Note that the information processing system 1 shown in FIG. 4 may include multiple terminal devices 10, multiple information providing devices 50, and multiple information processing devices 100.

端末装置１０は、ユーザによって利用される情報処理装置である。端末装置１０は、ユーザによる種々の操作を受け付ける。なお、以下では、端末装置１０をユーザと表記する場合がある。すなわち、以下では、ユーザを端末装置１０と読み替えることもできる。なお、上述した端末装置１０は、例えば、スマートフォンや、タブレット型端末や、ノート型ＰＣ（Personal Computer）や、デスクトップＰＣや、携帯電話機や、ＰＤＡ（Personal Digital Assistant）等により実現される。 The terminal device 10 is an information processing device used by a user. The terminal device 10 accepts various operations by the user. Note that below, the terminal device 10 may be referred to as a user. That is, in the following, the user can also be read as the terminal device 10. Note that the above-described terminal device 10 is realized by, for example, a smartphone, a tablet terminal, a notebook PC (Personal Computer), a desktop PC, a mobile phone, a PDA (Personal Digital Assistant), or the like.

情報提供装置５０は、ユーザ等に種々の情報提供を行うための情報が格納された情報処理装置である。例えば、情報提供装置５０は、ウェブサーバ等の種々の外部装置から収集した文字情報等に基づくオブジェクトＩＤが格納される。例えば、情報提供装置５０は、ユーザ等に画像検索サービスを提供する情報処理装置である。例えば、情報提供装置５０は、画像検索サービスを提供するための各情報が格納される。例えば、情報提供装置５０は、画像検索サービスの対象となる画像に対応するベクトル情報を情報処理装置１００に提供する。また、情報提供装置５０は、クエリを情報処理装置１００に送信することにより、情報処理装置１００からクエリに対応する画像を示すオブジェクトＩＤ等を受信する。 The information providing device 50 is an information processing device that stores information for providing various information to users and the like. For example, the information providing device 50 stores object IDs based on character information etc. collected from various external devices such as web servers. For example, the information providing device 50 is an information processing device that provides image search services to users and the like. For example, the information providing device 50 stores various pieces of information for providing an image search service. For example, the information providing device 50 provides the information processing device 100 with vector information corresponding to an image that is a target of an image search service. Furthermore, by transmitting a query to the information processing device 100, the information providing device 50 receives an object ID and the like indicating an image corresponding to the query from the information processing device 100.

情報処理装置１００は、検索サービスを提供するコンピュータである。情報処理装置１００は、検索クエリに対応する指定された検索数のオブジェクトを検索結果として提供する。情報処理装置１００は、検索対象となるノード（オブジェクト）がエッジで連結されたグラフを用いて、検索クエリの近傍のノード（オブジェクト）を検索結果として提供する。情報処理装置１００は、複数のオブジェクトの各々に対応するノード群がエッジにより連結されたグラフを用いて、検索クエリの近傍のノードを検索する検索処理において、所定の基準により選択された複数のノードと検索クエリとの距離を、ベクトル量子化がされた複数のノードのベクトル情報を用いて算出する。 The information processing device 100 is a computer that provides a search service. The information processing apparatus 100 provides a specified number of search objects corresponding to the search query as search results. The information processing apparatus 100 uses a graph in which search target nodes (objects) are connected by edges to provide nodes (objects) near a search query as search results. The information processing device 100 uses a graph in which a group of nodes corresponding to each of a plurality of objects are connected by edges to search for nodes in the vicinity of a search query. The distance between the search query and the search query is calculated using vector information of a plurality of nodes subjected to vector quantization.

例えば、情報処理装置１００は、端末装置１０からクエリ（検索クエリ）を受信すると、検索クエリに類似する対象（オブジェクト）を検索し、検索結果を端末装置に提供する。また、例えば、情報処理装置１００が端末装置に提供するデータは、画像情報等のデータ自体であってもよいし、ＵＲＬ（Uniform Resource Locator）等の対応するデータを参照するための情報であってもよい。また、検索クエリや検索対象（オブジェクト）は、画像、音声、テキストデータなど、如何なる種類のデータであってもよい。 For example, upon receiving a query (search query) from the terminal device 10, the information processing device 100 searches for a target (object) similar to the search query, and provides the search result to the terminal device. Further, for example, the data that the information processing device 100 provides to the terminal device may be data itself such as image information, or information for referring to corresponding data such as a URL (Uniform Resource Locator). Good too. Further, the search query or search target (object) may be any type of data such as image, audio, text data, etc.

〔１－３．情報処理装置の構成〕
次に、図５を用いて、第１の実施形態に係る情報処理装置１００の構成について説明する。図５は、第１の実施形態に係る情報処理装置１００の構成例を示す図である。図５に示すように、情報処理装置１００は、通信部１１０と、記憶部１２０と、制御部１３０とを有する。なお、情報処理装置１００は、情報処理装置１００の管理者等から各種操作を受け付ける入力部（例えば、キーボードやマウス等）や、各種情報を表示するための表示部（例えば、液晶ディスプレイ等）を有してもよい。 [1-3. Configuration of information processing device]
Next, the configuration of the information processing apparatus 100 according to the first embodiment will be described using FIG. 5. FIG. 5 is a diagram illustrating a configuration example of the information processing device 100 according to the first embodiment. As shown in FIG. 5, the information processing device 100 includes a communication section 110, a storage section 120, and a control section 130. Note that the information processing device 100 includes an input unit (for example, a keyboard, a mouse, etc.) that accepts various operations from an administrator of the information processing device 100, and a display unit (for example, a liquid crystal display, etc.) for displaying various information. May have.

（通信部１１０）
通信部１１０は、例えば、ＮＩＣ（Network Interface Card）等によって実現される。そして、通信部１１０は、ネットワーク（例えば図４中のネットワークＮ）と有線または無線で接続され、端末装置１０や情報提供装置５０との間で情報の送受信を行う。 (Communication Department 110)
The communication unit 110 is realized by, for example, a NIC (Network Interface Card). The communication unit 110 is connected to a network (for example, network N in FIG. 4) by wire or wirelessly, and transmits and receives information to and from the terminal device 10 and the information providing device 50.

（記憶部１２０）
記憶部１２０は、例えば、ＲＡＭ（Random Access Memory）、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。第１の実施形態に係る記憶部１２０は、図５に示すように、オブジェクト情報記憶部１２１と、グラフ情報記憶部１２２と、量子化情報記憶部１２３と、コードブック情報記憶部１２４とを有する。 (Storage unit 120)
The storage unit 120 is realized by, for example, a semiconductor memory element such as a RAM (Random Access Memory) or a flash memory, or a storage device such as a hard disk or an optical disk. As shown in FIG. 5, the storage unit 120 according to the first embodiment includes an object information storage unit 121, a graph information storage unit 122, a quantization information storage unit 123, and a codebook information storage unit 124. .

（オブジェクト情報記憶部１２１）
第１の実施形態に係るオブジェクト情報記憶部１２１は、オブジェクトに関する各種情報を記憶する。例えば、オブジェクト情報記憶部１２１は、オブジェクトＩＤやベクトルデータを記憶する。図６は、第１の実施形態に係るオブジェクト情報記憶部の一例を示す図である。図６に示すオブジェクト情報記憶部１２１は、「オブジェクトＩＤ」、「ベクトル情報」といった項目が含まれる。 (Object information storage unit 121)
The object information storage unit 121 according to the first embodiment stores various information regarding objects. For example, the object information storage unit 121 stores object IDs and vector data. FIG. 6 is a diagram illustrating an example of an object information storage unit according to the first embodiment. The object information storage unit 121 shown in FIG. 6 includes items such as "object ID" and "vector information."

「オブジェクトＩＤ」は、オブジェクトを識別するための識別情報を示す。また、「ベクトル情報」は、オブジェクトＩＤにより識別されるオブジェクトに対応するベクトル情報を示す。すなわち、図６の例では、オブジェクトを識別するオブジェクトＩＤに対して、オブジェクトに対応するベクトルデータ（ベクトル情報）が対応付けられて登録されている。 "Object ID" indicates identification information for identifying an object. Moreover, "vector information" indicates vector information corresponding to the object identified by the object ID. That is, in the example of FIG. 6, vector data (vector information) corresponding to the object is registered in association with an object ID that identifies the object.

例えば、図６の例では、オブジェクトＩＤ「ＯＢ１」により識別されるオブジェクト（対象）は、「１０，２４，５１，２・・・」の多次元のベクトル情報が対応付けられることを示す。 For example, the example in FIG. 6 indicates that the object (target) identified by the object ID "OB1" is associated with multidimensional vector information of "10, 24, 51, 2...".

なお、オブジェクト情報記憶部１２１は、上記に限らず、目的に応じて種々の情報を記憶してもよい。 Note that the object information storage unit 121 is not limited to the above, and may store various types of information depending on the purpose.

（グラフ情報記憶部１２２）
第１の実施形態に係るグラフ情報記憶部１２２は、グラフに関する各種情報を記憶する。例えば、グラフ情報記憶部１２２は、生成したグラフを記憶する。図７は、第１の実施形態に係るグラフ情報記憶部の一例を示す図である。図７に示すグラフ情報記憶部１２２は、「ノードＩＤ」、「オブジェクトＩＤ」、および「接続ノード情報」といった項目を有する。 (Graph information storage unit 122)
The graph information storage unit 122 according to the first embodiment stores various information regarding graphs. For example, the graph information storage unit 122 stores the generated graph. FIG. 7 is a diagram illustrating an example of a graph information storage unit according to the first embodiment. The graph information storage unit 122 shown in FIG. 7 has items such as "node ID", "object ID", and "connected node information".

「ノードＩＤ」は、グラフにおける各ノード（対象）を識別するための識別情報を示す。また、「オブジェクトＩＤ」は、オブジェクトを識別するための識別情報を示す。なお、ノードＩＤとオブジェクトＩＤが共通である場合、「ノードＩＤ」にオブジェクトＩＤが記憶され、グラフ情報記憶部１２２に「オブジェクトＩＤ」の項目は含まれてなくてもよい。例えば、オブジェクトＩＤとノードＩＤとして用いる場合、「ノードＩＤ」にオブジェクトＩＤが記憶され、グラフ情報記憶部１２２に「オブジェクトＩＤ」の項目は含まれてなくてもよい。 "Node ID" indicates identification information for identifying each node (object) in the graph. Moreover, "object ID" indicates identification information for identifying an object. Note that if the node ID and object ID are the same, the object ID is stored in the "node ID" and the graph information storage unit 122 does not need to include the item "object ID." For example, when used as an object ID and a node ID, the object ID may be stored in the "node ID" and the graph information storage unit 122 may not include the "object ID" item.

また、「接続ノード情報」は、対応するノードから辿ることができるノード（参照先のノード）に関する情報を示す。例えば、「接続ノード情報」には、「参照先」といった情報が含まれる。「参照先」は、エッジにより連結され、そのノードから辿ることができる参照先（ノード）を識別するための情報を示す。すなわち、図７の例では、ノードを識別するノードＩＤ（オブジェクトＩＤ）に対して、そのノードからエッジにより辿ることができる参照先（ノード）が対応付けられて登録されている。なお、「接続ノード情報」には、参照先に接続されるエッジを識別するための情報（エッジＩＤ）等が含まれてもよい。 Further, "connected node information" indicates information regarding a node (reference destination node) that can be traced from the corresponding node. For example, "connected node information" includes information such as "reference destination." “Reference destination” indicates information for identifying a reference destination (node) that is connected by an edge and can be traced from that node. That is, in the example of FIG. 7, a reference destination (node) that can be traced from the node by an edge is registered in association with a node ID (object ID) that identifies the node. Note that the "connected node information" may include information (edge ID) for identifying an edge connected to a reference destination.

図７の例では、ノードＩＤ「Ｎ１」により識別されるノード（ノードＮ１）は、オブジェクトＩＤ「ＯＢ１」により識別されるオブジェクト（対象）に対応することを示す。また、ノードＮ１からは、ノードＩＤ「Ｎ４」により識別されるノード（ノードＮ４）にエッジが連結されており、ノードＮ１からノードＮ４へ辿ることができることを示す。 The example of FIG. 7 indicates that the node (node N1) identified by the node ID "N1" corresponds to the object (target) identified by the object ID "OB1". Further, an edge is connected from the node N1 to the node (node N4) identified by the node ID "N4", indicating that it is possible to trace from the node N1 to the node N4.

また、ノードＩＤ「Ｎ２」により識別されるノード（ノードＮ２）は、オブジェクトＩＤ「ＯＢ２」により識別されるオブジェクト（対象）に対応することを示す。また、ノードＮ２からは、ノードＩＤ「Ｎ６」により識別されるノード（ノードＮ６）にエッジが連結されており、ノードＮ２からノードＮ６へ辿ることができることを示す。 Further, the node (node N2) identified by the node ID "N2" corresponds to the object (target) identified by the object ID "OB2". Further, an edge is connected from the node N2 to the node (node N6) identified by the node ID "N6", indicating that it is possible to trace from the node N2 to the node N6.

また、ノードＩＤ「Ｎ７」により識別されるノード（ノードＮ７）は、オブジェクトＩＤ「ＯＢ７」により識別されるオブジェクト（対象）に対応することを示す。また、ノードＮ７からは、ノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５にエッジが連結されており、ノードＮ２からノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５の各々へ辿ることができることを示す。 Further, the node (node N7) identified by the node ID "N7" corresponds to the object (target) identified by the object ID "OB7". Furthermore, edges are connected from node N7 to nodes N9, N12, N54, and N85, indicating that each of nodes N9, N12, N54, and N85 can be traced from node N2.

なお、グラフ情報記憶部１２２は、上記に限らず、目的に応じて種々の情報を記憶してもよい。例えば、グラフ情報記憶部１２２は、各ノード（ベクトル）間を連結するエッジの長さが記憶されてもよい。すなわち、グラフ情報記憶部１２２は、各ノード（ベクトル）間の距離を示す情報が記憶されてもよい。なお、グラフ情報記憶部１２２は、上記に限らず、種々のデータ構造によりグラフ情報を記憶してもよい。 Note that the graph information storage unit 122 is not limited to the above, and may store various information depending on the purpose. For example, the graph information storage unit 122 may store the length of an edge connecting each node (vector). That is, the graph information storage unit 122 may store information indicating the distance between each node (vector). Note that the graph information storage unit 122 is not limited to the above, and may store graph information using various data structures.

また、グラフは、クエリを入力とし、グラフ中のエッジを辿ることによりノードを探索し、クエリに類似するノードを抽出し出力するプログラムモジュールを含んでもよい。すなわち、グラフは、グラフを用いて検索処理を行うプログラムモジュールとしての利用が想定されるものであってもよい。例えば、グラフＧＲ１は、クエリとしてベクトルデータが入力された場合に、そのベクトルデータに類似するベクトルデータに対応するノードをグラフ中から抽出し、出力するプログラムであってもよい。例えば、グラフＧＲ１は、クエリ画像に対応する類似画像を検索するプログラムモジュールとして利用されるデータであってもよい。例えば、グラフＧＲ１は、入力されたクエリに基づいて、グラフにおいてそのクエリに類似するノードを抽出し、出力するよう、コンピュータを機能させる。 The graph may also include a program module that receives a query as input, searches for nodes by tracing edges in the graph, and extracts and outputs nodes similar to the query. That is, the graph may be assumed to be used as a program module that performs search processing using the graph. For example, the graph GR1 may be a program that, when vector data is input as a query, extracts nodes corresponding to vector data similar to the vector data from the graph and outputs them. For example, the graph GR1 may be data used as a program module that searches for similar images corresponding to a query image. For example, the graph GR1 causes a computer to function based on an input query to extract and output nodes similar to the query in the graph.

（量子化情報記憶部１２３）
第１の実施形態に係る量子化情報記憶部１２３は、割当処理に関する各種情報を記憶する。図８は、第１の実施形態に係る量子化情報記憶部の一例を示す図である。図８の例では、量子化情報記憶部１２３には、「ノードＩＤ」、「オブジェクトＩＤ」、および「量子化情報」といった項目を有する。 (Quantization information storage unit 123)
The quantization information storage unit 123 according to the first embodiment stores various information regarding allocation processing. FIG. 8 is a diagram illustrating an example of a quantization information storage unit according to the first embodiment. In the example of FIG. 8, the quantization information storage unit 123 has items such as "node ID", "object ID", and "quantization information".

「ノードＩＤ」は、グラフにおける各ノード（対象）を識別するための識別情報を示す。また、「オブジェクトＩＤ」は、オブジェクトを識別するための識別情報を示す。なお、ノードＩＤとオブジェクトＩＤが共通である場合、「ノードＩＤ」にオブジェクトＩＤが記憶され、量子化情報記憶部１２３に「オブジェクトＩＤ」の項目は含まれてなくてもよい。 "Node ID" indicates identification information for identifying each node (object) in the graph. Moreover, "object ID" indicates identification information for identifying an object. Note that when the node ID and the object ID are common, the object ID is stored in the "node ID", and the quantization information storage unit 123 does not need to include the item "object ID".

また、「量子化情報」は、各ノード（オブジェクト）の量子化されたベクトルの情報を示す。例えば、「量子化情報」には、「要素」、「コードブックＩＤ」といった情報が含まれる。「要素」は、対応するオブジェクトのベクトルにおける配置を示す。図８の例では、「要素」には、「＃１」、「＃２」、「＃３」、「＃４」が含まれる場合を示す。この場合、各ノード（オブジェクト）のベクトルは４分割され、各分割された部分ベクトルがコードブックにより量子化されることを示す。なお、分割数は４に限らず、例えば分割数が６の場合、「要素」には、「＃１」、「＃２」、「＃３」、「＃４」、「＃５」、「＃６」が含まれる。「コードブックＩＤ」は、各要素（部分ベクトル）に対応するコードブックを識別するための情報を示す。 Moreover, "quantization information" indicates information on quantized vectors of each node (object). For example, "quantization information" includes information such as "element" and "codebook ID." "Element" indicates the arrangement of the corresponding object in the vector. In the example of FIG. 8, the "element" includes "#1", "#2", "#3", and "#4". In this case, the vector of each node (object) is divided into four parts, and each divided partial vector is quantized by the codebook. Note that the number of divisions is not limited to 4. For example, if the number of divisions is 6, "element" may include "#1", "#2", "#3", "#4", "#5", " #6” is included. "Codebook ID" indicates information for identifying the codebook corresponding to each element (partial vector).

図８の例では、ノードＮ９（オブジェクトＯＢ９）のベクトルは、４分割された部分ベクトルのうち、先頭の部分ベクトルがコードブックＩＤ「ＣＤ１２」により識別されるコードブック（コードブックＣＤ１２）により量子化されることを示す。また、ノードＮ９（オブジェクトＯＢ９）のベクトルは、４分割された部分ベクトルのうち、先頭から２番目の部分ベクトルがコードブックＩＤ「ＣＤ２３」により識別されるコードブック（コードブックＣＤ２３）により量子化されることを示す。 In the example of FIG. 8, the vector of node N9 (object OB9) is quantized by the codebook (codebook CD12) whose first partial vector among the four divided partial vectors is identified by the codebook ID "CD12". Indicates that the In addition, in the vector of node N9 (object OB9), among the four divided partial vectors, the second partial vector from the beginning is quantized by the codebook (codebook CD23) identified by the codebook ID "CD23". to show that

また、ノードＮ９（オブジェクトＯＢ９）のベクトルは、４分割された部分ベクトルのうち、先頭から３番目の部分ベクトルがコードブックＩＤ「ＣＤ３５」により識別されるコードブック（コードブックＣＤ３５）により量子化されることを示す。また、ノードＮ９（オブジェクトＯＢ９）のベクトルは、４分割された部分ベクトルのうち、先頭から４番目（すなわち最後尾）の部分ベクトルがコードブックＩＤ「ＣＤ４７」により識別されるコードブック（コードブックＣＤ４７）により量子化されることを示す。 In addition, in the vector of node N9 (object OB9), among the four divided partial vectors, the third partial vector from the beginning is quantized by the codebook (codebook CD35) identified by the codebook ID "CD35". to show that In addition, the vector of node N9 (object OB9) has a codebook (codebook CD47) in which the fourth (i.e., the last) partial vector from the beginning among the four divided partial vectors is identified by the codebook ID "CD47". ) indicates that it is quantized.

なお、量子化情報記憶部１２３は、上記に限らず、目的に応じて種々の情報を記憶してもよい。 Note that the quantization information storage unit 123 is not limited to the above, and may store various information depending on the purpose.

（コードブック情報記憶部１２４）
第１の実施形態に係るコードブック情報記憶部１２４は、コードブックに関する各種情報を記憶する。例えば、コードブック情報記憶部１２４は、コードブックＩＤや各コードブックのベクトル情報を記憶する。図９は、第１の実施形態に係るコードブック情報記憶部の一例を示す図である。図９の例では、コードブック情報記憶部１２４は、各コードブックとベクトルとの対応付けを示すルックアップテーブルを記憶する場合を一例として示す。 (Codebook information storage unit 124)
The codebook information storage unit 124 according to the first embodiment stores various information regarding the codebook. For example, the codebook information storage unit 124 stores codebook IDs and vector information of each codebook. FIG. 9 is a diagram illustrating an example of a codebook information storage unit according to the first embodiment. In the example of FIG. 9, the codebook information storage unit 124 stores a lookup table that indicates the association between each codebook and a vector.

コードブック情報記憶部１２４は、４分割された部分ベクトルのうち、先頭の部分ベクトルを量子化するために用いるコードブック情報ＴＢ１、４分割された部分ベクトルのうち、先頭から２番目の部分ベクトルを量子化するために用いるコードブック情報ＴＢ２を記憶する。また、コードブック情報記憶部１２４は、４分割された部分ベクトルのうち、先頭から３番目の部分ベクトルを量子化するために用いるコードブック情報ＴＢ３、４分割された部分ベクトルのうち、先頭から４番目（すなわち最後尾）の部分ベクトルを量子化するために用いるコードブック情報ＴＢ４等を記憶する。 The codebook information storage unit 124 stores codebook information TB1 used for quantizing the first partial vector among the four divided partial vectors, and the second partial vector from the beginning among the four divided partial vectors. Codebook information TB2 used for quantization is stored. The codebook information storage unit 124 also stores codebook information TB3 used for quantizing the third partial vector from the beginning among the partial vectors divided into four; Codebook information TB4 and the like used for quantizing the th (ie, last) partial vector is stored.

図９の例では、コードブック情報ＴＢ１には、コードブックＩＤ「ＣＤ１１」により識別されるコードブック（コードブックＣＤ１１）やコードブックＩＤ「ＣＤ１２」により識別されるコードブック（コードブックＣＤ１２）等のコードブック情報を記憶する。例えば、コードブックＣＤ１１は、「５，１３・・・」の多次元のベクトル情報が対応付けられることを示す。また、コードブックＣＤ１２は、「２７，５１・・・」の多次元のベクトル情報が対応付けられることを示す。 In the example of FIG. 9, the codebook information TB1 includes the codebook identified by the codebook ID "CD11" (codebook CD11), the codebook identified by the codebook ID "CD12" (codebook CD12), etc. Store codebook information. For example, codebook CD11 indicates that multidimensional vector information of "5, 13..." is associated. Further, codebook CD12 indicates that multidimensional vector information of "27, 51..." is associated.

なお、コードブック情報記憶部１２４は、上記に限らず、目的に応じて種々の情報を記憶してもよい。例えば、コードブック情報記憶部１２４は、各コードブックと検索クエリとの差分（距離）を示す情報を記憶してもよい。 Note that the codebook information storage unit 124 is not limited to the above, and may store various information depending on the purpose. For example, the codebook information storage unit 124 may store information indicating the difference (distance) between each codebook and the search query.

（制御部１３０）
図５の説明に戻って、制御部１３０は、コントローラ（controller）であり、例えば、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）やＧＰＵ（Graphics Processing Unit）等によって、情報処理装置１００内部の記憶装置に記憶されている各種プログラム（情報処理プログラムの一例に相当）がＲＡＭを作業領域として実行されることにより実現される。また、制御部１３０は、コントローラであり、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路により実現される。 (Control unit 130)
Returning to the explanation of FIG. 5, the control unit 130 is a controller, and for example, controls the internal processing of the information processing apparatus 100 by a CPU (Central Processing Unit), an MPU (Micro Processing Unit), a GPU (Graphics Processing Unit), etc. This is realized by executing various programs (corresponding to an example of an information processing program) stored in the storage device of , using the RAM as a work area. Further, the control unit 130 is a controller, and is realized by, for example, an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

図５に示すように、制御部１３０は、取得部１３１と、生成部１３２と、検索処理部１３３と、提供部１３４とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部１３０の内部構成は、図５に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。 As shown in FIG. 5, the control unit 130 includes an acquisition unit 131, a generation unit 132, a search processing unit 133, and a provision unit 134, and realizes or executes information processing functions and operations described below. do. Note that the internal configuration of the control unit 130 is not limited to the configuration shown in FIG. 5, and may be any other configuration as long as it performs information processing to be described later.

（取得部１３１）
取得部１３１は、各種情報を取得する。例えば、取得部１３１は、記憶部１２０から各種情報を取得する。例えば、取得部１３１は、オブジェクト情報記憶部１２１や、グラフ情報記憶部１２２や、量子化情報記憶部１２３や、コードブック情報記憶部１２４等から各種情報を取得する。また、取得部１３１は、各種情報を外部の情報処理装置から取得する。取得部１３１は、端末装置１０や情報提供装置５０から各種情報を取得する。 (Acquisition unit 131)
The acquisition unit 131 acquires various information. For example, the acquisition unit 131 acquires various information from the storage unit 120. For example, the acquisition unit 131 acquires various information from the object information storage unit 121, the graph information storage unit 122, the quantization information storage unit 123, the codebook information storage unit 124, and the like. Further, the acquisition unit 131 acquires various information from an external information processing device. The acquisition unit 131 acquires various information from the terminal device 10 and the information providing device 50.

取得部１３１は、グラフを取得する。例えば、情報処理装置１００は、図１中の空間情報ＳＰ１を取得してもよい。例えば、情報処理装置１００は、情報提供装置５０等の外部装置からグラフを取得してもよい。 The acquisition unit 131 acquires a graph. For example, the information processing device 100 may acquire spatial information SP1 in FIG. For example, the information processing device 100 may acquire a graph from an external device such as the information providing device 50.

取得部１３１は、データ検索の対象となる複数のオブジェクトに対する検索クエリを取得する。例えば、取得部１３１は、検索クエリＱＥ１に関する情報を取得する。例えば、取得部１３１は、画像検索に関する検索クエリを取得する。例えば、取得部１３１は、利用する端末装置１０からクエリを取得する。例えば、取得部１３１は、利用する端末装置１０からクエリを受け付けた情報提供装置５０からクエリを取得する。 The acquisition unit 131 acquires search queries for a plurality of objects that are data search targets. For example, the acquisition unit 131 acquires information regarding the search query QE1. For example, the acquisition unit 131 acquires a search query related to image search. For example, the acquisition unit 131 acquires a query from the terminal device 10 to be used. For example, the acquisition unit 131 acquires a query from the information providing device 50 that has received the query from the terminal device 10 to be used.

（生成部１３２）
生成部１３２は、各種情報を生成する。例えば、生成部１３２は、記憶部１２０に記憶された情報（データ）から各種情報（データ）を生成する。例えば、生成部１３２は、オブジェクト情報記憶部１２１や、グラフ情報記憶部１２２や、量子化情報記憶部１２３や、コードブック情報記憶部１２４等に記憶された情報（データ）から各種情報を生成する。 (Generation unit 132)
The generation unit 132 generates various information. For example, the generation unit 132 generates various information (data) from the information (data) stored in the storage unit 120. For example, the generation unit 132 generates various information from information (data) stored in the object information storage unit 121, the graph information storage unit 122, the quantization information storage unit 123, the codebook information storage unit 124, etc. .

生成部１３２は、グラフ情報記憶部１２２に示すようなグラフを生成してもよい。例えば、生成部１３２は、空間情報ＳＰ１を生成する。また、生成部１３２は、量子化情報記憶部１２３に示すようなベクトル量子化に関する情報を生成してもよい。例えば、生成部１３２は、ノードＮ１（オブジェクトＯＢ１）等の各オブジェクトがベクトル量子化された情報を生成する。また、生成部１３２は、コードブック情報記憶部１２４に示すようなコードブックに関する情報を生成してもよい。生成部１３２は、コードブックのルックアップテーブルを生成してもよい。例えば、生成部１３２は、コードブック情報ＴＢ１～ＴＢ４のような、コードブックに関する情報を生成する。なお、情報処理装置１００がグラフ情報記憶部１２２、量子化情報記憶部１２３、コードブック情報記憶部１２４に示す情報を、情報提供装置５０等の外部装置から取得する場合、情報処理装置１００は、生成部１３２を有しなくてもよい。 The generation unit 132 may generate a graph as shown in the graph information storage unit 122. For example, the generation unit 132 generates spatial information SP1. Further, the generation unit 132 may generate information related to vector quantization as shown in the quantization information storage unit 123. For example, the generation unit 132 generates information in which each object such as the node N1 (object OB1) is vector quantized. Further, the generation unit 132 may generate information regarding the codebook as shown in the codebook information storage unit 124. The generation unit 132 may generate a codebook lookup table. For example, the generation unit 132 generates information regarding codebooks, such as codebook information TB1 to TB4. Note that when the information processing device 100 acquires the information shown in the graph information storage unit 122, quantization information storage unit 123, and codebook information storage unit 124 from an external device such as the information providing device 50, the information processing device 100 The generation unit 132 may not be included.

（検索処理部１３３）
検索処理部１３３は、オブジェクトに関する検索サービスを提供する。検索処理部１３３は、各種情報を探索する。検索処理部１３３は、各種情報を検索する。例えば、検索処理部１３３は、グラフを探索することにより、オブジェクトを検索する。例えば、検索処理部１３３は、取得部１３１により取得されたクエリが取得された場合、グラフを探索することにより、クエリに類似するオブジェクトを検索する。例えば、検索処理部１３３は、グラフを探索することにより、クエリに類似するオブジェクトを抽出する。例えば、検索処理部１３３は、図１１に示すような処理手順に基づいて、グラフを探索することにより、クエリに類似するオブジェクトを抽出する。なお、情報処理装置１００は、検索サービスを提供しない場合、検索処理部１３３を有しなくてもよい。 (Search processing unit 133)
The search processing unit 133 provides search services regarding objects. The search processing unit 133 searches for various information. The search processing unit 133 searches various information. For example, the search processing unit 133 searches for objects by searching a graph. For example, when the query acquired by the acquisition unit 131 is acquired, the search processing unit 133 searches for an object similar to the query by searching a graph. For example, the search processing unit 133 extracts objects similar to the query by searching the graph. For example, the search processing unit 133 extracts objects similar to the query by searching the graph based on the processing procedure shown in FIG. Note that the information processing device 100 does not need to include the search processing unit 133 when not providing a search service.

検索処理部１３３は、検索処理において各種情報を選択する。検索処理部１３３は、検索処理において各種情報を抽出する。検索処理部１３３は、検索処理において各種情報を判定する。検索処理部１３３は、検索処理において各種情報を決定する。検索処理部１３３は、検索処理において各種情報を変更する。検索処理部１３３は、検索処理において各種情報を更新する。 The search processing unit 133 selects various types of information in the search process. The search processing unit 133 extracts various information in the search process. The search processing unit 133 determines various information in the search process. The search processing unit 133 determines various information in the search process. The search processing unit 133 changes various information in the search process. The search processing unit 133 updates various information during search processing.

検索処理部１３３は、複数のオブジェクトの各々に対応するノード群がエッジにより連結されたグラフを用いて、検索クエリの近傍のノードを検索する検索処理において、所定の基準により選択された複数のノードと検索クエリとの距離を、ベクトル量子化がされた複数のノードのベクトル情報を用いて算出する。検索処理部１３３は、複数のノードと検索クエリとの距離の算出を並列化して一括で行う。検索処理部１３３は、情報処理装置１００の仕様に基づいて決定される一括処理数の複数のノードと検索クエリとの距離の算出を並列処理する。 The search processing unit 133 uses a graph in which a group of nodes corresponding to each of a plurality of objects are connected by edges to search for nodes in the vicinity of a search query. The distance between the search query and the search query is calculated using vector information of a plurality of nodes subjected to vector quantization. The search processing unit 133 parallelizes and collectively calculates distances between a plurality of nodes and a search query. The search processing unit 133 performs parallel processing to calculate distances between a search query and a plurality of nodes whose number is determined based on the specifications of the information processing device 100.

検索処理部１３３は、複数のノードの各々が対応する代表ベクトルに対応付けられたコードブックを用いて、複数のノードと検索クエリとの距離を算出する。検索処理部１３３は、一のノードからのエッジが連結された複数のノードと検索クエリとの距離を算出する。検索処理部１３３は、複数のノードと複数のノードの各々が対応付けられた代表ベクトルを示す参照用情報が一のノードに対応付けて記憶されたノード情報を用いて、複数のノードと検索クエリとの距離を算出する。 The search processing unit 133 calculates distances between the plurality of nodes and the search query using a codebook in which each of the plurality of nodes is associated with a corresponding representative vector. The search processing unit 133 calculates the distance between the search query and a plurality of nodes in which edges from one node are connected. The search processing unit 133 searches the plurality of nodes and the search query using node information stored in association with one node and reference information indicating a representative vector with which each of the plurality of nodes is associated. Calculate the distance to.

検索処理部１３３は、直積量子化により、各々が複数の部分ベクトルに分割された複数のノードのベクトル情報を用いて、複数のノードと検索クエリとの距離を算出する。検索処理部１３３は、複数のノードの各々が分割された複数の部分ベクトルの分割位置ごとのベクトルに対応する代表ベクトルに対応付けられた複数のコードブックを用いて、複数のノードと検索クエリとの距離を算出する。検索処理部１３３は、複数のノードと複数のノードの各々の複数の部分ベクトルの各々が対応付けられた代表ベクトルを示す参照用情報が一のノードに対応付けて記憶されたノード情報を用いて、複数のノードと検索クエリとの距離を算出する。 The search processing unit 133 calculates distances between a plurality of nodes and a search query using vector information of a plurality of nodes, each of which is divided into a plurality of partial vectors by Cartesian product quantization. The search processing unit 133 uses a plurality of codebooks that are associated with representative vectors corresponding to vectors for each division position of a plurality of partial vectors into which each of the plurality of nodes is divided, to combine the plurality of nodes with the search query. Calculate the distance. The search processing unit 133 uses the node information stored in such a manner that the plurality of nodes and reference information indicating representative vectors associated with each of the plurality of partial vectors of each of the plurality of nodes are associated with one node. , calculate the distance between multiple nodes and the search query.

検索処理部１３３は、複数のノードの各々に、各ノードの複数の部分ベクトルの各々が対応付けられた代表ベクトルの一覧が対応付けられた参照用情報を用いて、複数のノードと検索クエリとの距離を算出する。検索処理部１３３は、複数のノードの各々の複数の部分ベクトルの各々が対応する代表ベクトルが、分割位置ごとに一覧で並ぶ転置情報と、複数のノードの各々が一覧で対応する位置を示す対応付情報とを含む参照用情報を用いて、複数のノードと検索クエリとの距離を算出する。検索処理部１３３は、複数のノードの各々の複数の部分ベクトルの各々が対応する代表ベクトルに対応付けられた一のコードブックを用いて、複数のノードと検索クエリとの距離を算出する。 The search processing unit 133 uses reference information in which each of the plurality of nodes is associated with a list of representative vectors to which each of the plurality of partial vectors of each node is associated, and the search processing unit 133 associates the plurality of nodes with the search query. Calculate the distance. The search processing unit 133 includes transposed information in which representative vectors to which each of a plurality of partial vectors of each of a plurality of nodes corresponds are arranged in a list for each division position, and a correspondence indicating a position to which each of a plurality of nodes corresponds in a list. The distance between the plurality of nodes and the search query is calculated using the reference information including the attached information. The search processing unit 133 calculates distances between the plurality of nodes and the search query using one codebook in which each of the plurality of partial vectors of each of the plurality of nodes is associated with a corresponding representative vector.

検索処理部１３３は、検索処理において処理対象となったノードのうち、所定のノードを対象として、ベクトル量子化がされた距離である第１距離とは異なり、ベクトル量子化がされていない第２距離を算出する。検索処理部１３３は、検索処理において検索クエリの近傍のノードとして抽出するノードの第１数よりも多い数である第２数のノードを近傍候補ノードとして抽出し、近傍候補ノードを対象として第２距離を算出する。検索処理部１３３は、近傍候補ノードのうち、第２距離が短い方から第１数のノードを検索クエリの近傍のノードとして抽出する。 The search processing unit 133 targets a predetermined node among the nodes to be processed in the search process. Calculate distance. The search processing unit 133 extracts a second number of nodes, which is a larger number than the first number of nodes to be extracted as neighboring nodes of the search query, as neighboring candidate nodes in the search process, and extracts a second number of nodes as neighboring candidate nodes. Calculate distance. The search processing unit 133 extracts a first number of nodes from among the nearby candidate nodes having the shorter second distance as nodes in the vicinity of the search query.

検索処理部１３３は、第１距離が所定の閾値以内であるノードを対象として第２距離を算出する。検索処理部１３３は、近傍のノードとして抽出する対象範囲を示す検索範囲内のノードを対象として第２距離を算出する。検索処理部１３３は、検索処理の対象範囲を示す探索範囲内のノードを対象として第２距離を算出する。 The search processing unit 133 calculates a second distance for nodes whose first distance is within a predetermined threshold. The search processing unit 133 calculates a second distance for nodes within the search range indicating the target range to be extracted as nearby nodes. The search processing unit 133 calculates a second distance for nodes within a search range indicating a target range of the search process.

（提供部１３４）
提供部１３４は、各種情報を提供する。例えば、提供部１３４は、端末装置１０や情報提供装置５０に各種情報を送信する。例えば、提供部１３４は、検索クエリに対応するオブジェクトＩＤを検索結果として提供する。提供部１３４は、検索結果を端末装置１０へ送信する。提供部１３４は、検索処理部１３３により検索されたオブジェクトＩＤを、検索クエリに対応する検索結果として端末装置１０へ提供する。 (Providing unit 134)
The providing unit 134 provides various information. For example, the providing unit 134 transmits various information to the terminal device 10 and the information providing device 50. For example, the providing unit 134 provides an object ID corresponding to a search query as a search result. The providing unit 134 transmits the search results to the terminal device 10. The providing unit 134 provides the object ID searched by the search processing unit 133 to the terminal device 10 as a search result corresponding to the search query.

また、提供部１３４は、検索処理部１３３により検索されたオブジェクトＩＤを情報提供装置５０へ提供してもよい。例えば、提供部１３４は、検索処理部１３３が検索により抽出したオブジェクトＩＤを情報提供装置５０へ提供する。提供部１３４は、検索処理部１３３により抽出されたオブジェクトＩＤをクエリに対応するベクトルを示す情報として情報提供装置５０に提供する。 Further, the providing unit 134 may provide the object ID searched by the search processing unit 133 to the information providing device 50. For example, the providing unit 134 provides the information providing device 50 with the object ID extracted by the search processing unit 133 through the search. The providing unit 134 provides the object ID extracted by the search processing unit 133 to the information providing device 50 as information indicating a vector corresponding to the query.

〔１－４．情報処理のフロー〕
次に、図１０を用いて、第１の実施形態に係る情報処理システム１による情報処理の手順について説明する。図１０は、第１の実施形態に係る情報処理の一例を示すフローチャートである。 [1-4. Information processing flow]
Next, the procedure of information processing by the information processing system 1 according to the first embodiment will be described using FIG. 10. FIG. 10 is a flowchart illustrating an example of information processing according to the first embodiment.

図１０に示すように、情報処理装置１００は、データ検索の対象となる複数のオブジェクトに対する検索クエリを取得する（ステップＳ１０１）。図１の例では、情報処理装置１００は、検索クエリＱＥ１を取得する。 As shown in FIG. 10, the information processing apparatus 100 obtains search queries for a plurality of objects that are data search targets (step S101). In the example of FIG. 1, the information processing device 100 obtains the search query QE1.

そして、情報処理装置１００は、複数のオブジェクトの各々に対応するノード群がエッジにより連結されたグラフを用いて、検索クエリの近傍のノードを検索する検索処理において、所定の基準により選択された複数のノードと検索クエリとの距離を、ベクトル量子化がされた複数のノードのベクトル情報を用いて算出する（ステップＳ１０２）。図１の例では、情報処理装置１００は、複数のノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５と前記検索クエリＱＥ１との距離を、ベクトル量子化がされた複数のノードＮ９、Ｎ１２、Ｎ５４、Ｎ８５のベクトル情報を用いて算出する。 The information processing apparatus 100 uses a graph in which a group of nodes corresponding to each of a plurality of objects are connected by edges to search for nodes near a search query. The distance between the node and the search query is calculated using vector information of a plurality of nodes subjected to vector quantization (step S102). In the example of FIG. 1, the information processing device 100 calculates the distances between the plurality of nodes N9, N12, N54, N85 and the search query QE1 by vector quantizing the distances between the plurality of nodes N9, N12, N54, N85. Calculate using information.

〔１－５．検索処理例〕
ここで、第１の実施形態に係る検索処理の一例について、図１１を一例として説明する。図１１は、第１の実施形態に係る検索処理の一例を示すフローチャートである。以下に説明する検索処理は、情報処理装置１００の検索処理部１３３によって行われる。また、以下でいうオブジェクトは、ノードと読み替えてもよい。なお、以下では、情報処理装置１００（検索処理部１３３）が検索処理を行う。なお、検索サービスを提供しない場合、情報処理装置１００は検索処理部１３３を有しなくてもよい。以下で説明する処理の検索クエリは、追加ノードや対象ノードやユーザが指定したオブジェクト等であってもよい。 [1-5. Search processing example]
Here, an example of the search process according to the first embodiment will be described using FIG. 11 as an example. FIG. 11 is a flowchart illustrating an example of search processing according to the first embodiment. The search processing described below is performed by the search processing unit 133 of the information processing device 100. Further, the object referred to below may be read as a node. Note that in the following, the information processing device 100 (search processing unit 133) performs search processing. Note that if the search service is not provided, the information processing device 100 does not need to include the search processing unit 133. The search query for the process described below may be an additional node, a target node, an object specified by the user, or the like.

ここでは、近傍オブジェクト集合Ｎ（Ｇ，ｙ）は、ノードｙに付与されているエッジにより関連付けられている近傍のオブジェクトの集合である。「Ｇ」は、所定のグラフデータ（例えば、空間情報ＳＰ１に示すグラフＧＲ１等）であってもよい。例えば、情報処理装置１００は、ｋ近傍検索処理を実行する。 Here, the neighboring object set N(G, y) is a set of neighboring objects that are related by edges attached to node y. "G" may be predetermined graph data (for example, graph GR1 shown in spatial information SP1). For example, the information processing device 100 executes k-nearest neighbor search processing.

例えば、情報処理装置１００は、超球の半径ｒを∞（無限大）に設定し（ステップＳ３００）、既存のオブジェクト集合から部分集合Ｓを抽出する（ステップＳ３０１）。例えば、情報処理装置１００は、ルートノードとして選択されたオブジェクト（ノード）を部分集合Ｓとして抽出してもよい。また、例えば、超球とは、検索範囲を示す仮想的な球である。なお、ステップＳ３０１において抽出されたオブジェクト集合Ｓに含まれるオブジェクトは、同時に検索結果のオブジェクト集合Ｒの初期集合にも含められる。 For example, the information processing device 100 sets the radius r of the hypersphere to ∞ (infinity) (step S300), and extracts a subset S from an existing object set (step S301). For example, the information processing apparatus 100 may extract the object (node) selected as the root node as the subset S. Further, for example, a hypersphere is a virtual sphere that indicates a search range. Note that the objects included in the object set S extracted in step S301 are also included in the initial set of the object set R as a search result.

次に、情報処理装置１００は、オブジェクト集合Ｓに含まれるオブジェクトの中で、検索クエリオブジェクトをｙとするとオブジェクトｙとの距離が最も短いオブジェクトを抽出し、オブジェクトｓとする（ステップＳ３０２）。例えば、情報処理装置１００は、ルートノードとして選択されたオブジェクト（ノード）のみがＳの要素の場合には、結果的にルートノードがオブジェクトｓとして抽出される。次に、情報処理装置１００は、オブジェクトｓをオブジェクト集合Ｓから除外する（ステップＳ３０３）。 Next, among the objects included in the object set S, the information processing apparatus 100 extracts the object having the shortest distance from the object y, assuming that the search query object is y, and sets it as the object s (step S302). For example, when the only object (node) selected as the root node is an element of S, the information processing apparatus 100 extracts the root node as the object s. Next, the information processing device 100 excludes the object s from the object set S (step S303).

次に、情報処理装置１００は、オブジェクトｓとオブジェクトｙとの距離ｄ（ｓ，ｙ）がｒ（１＋ε）を超えるか否かを判定する（ステップＳ３０４）。ここで、εは拡張要素であり、ｒ（１＋ε）は、探索範囲（この範囲内のノードのみを探索する。検索範囲よりも大きくすることで精度を高めることができる）の半径を示す値である。オブジェクトｓとオブジェクトｙとの距離ｄ（ｓ，ｙ）がｒ（１＋ε）を超える場合（ステップＳ３０４：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｒをオブジェクトｙの近傍オブジェクト集合として出力し（ステップＳ３０５）、処理を終了する。 Next, the information processing apparatus 100 determines whether the distance d(s, y) between the object s and the object y exceeds r(1+ε) (step S304). Here, ε is an expansion element, and r(1+ε) is a value indicating the radius of the search range (only nodes within this range are searched. Accuracy can be improved by making it larger than the search range). be. If the distance d(s, y) between object s and object y exceeds r(1+ε) (step S304: Yes), the information processing device 100 outputs the object set R as a neighboring object set of object y (step S305), the process ends.

オブジェクトｓと検索クエリオブジェクトｙとの距離ｄ（ｓ，ｙ）がｒ（１＋ε）を超えない場合（ステップＳ３０４：Ｎｏ）、情報処理装置１００は、オブジェクトｓの近傍オブジェクト集合Ｎ（Ｇ，ｓ）の要素であるオブジェクトの中からオブジェクト集合Ｃに含まれないオブジェクトを一つ選択し、選択したオブジェクトｕを、オブジェクト集合Ｃに格納する（ステップＳ３０６）。オブジェクト集合Ｃは、重複検索を回避するために便宜上設けられるものであり、処理開始時には空集合に設定される。 If the distance d(s, y) between the object s and the search query object y does not exceed r(1+ε) (step S304: No), the information processing device 100 determines the neighboring object set N(G, s) of the object s. One object that is not included in object set C is selected from among the objects that are elements of , and the selected object u is stored in object set C (step S306). The object set C is provided for convenience in order to avoid duplicate searches, and is set to be an empty set at the start of processing.

次に、情報処理装置１００は、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ（１＋ε）以下であるか否かを判定する（ステップＳ３０７）。オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ（１＋ε）以下である場合（ステップＳ３０７：Ｙｅｓ）、情報処理装置１００は、オブジェクトｕをオブジェクト集合Ｓに追加する（ステップＳ３０８）。また、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ（１＋ε）以下ではない場合（ステップＳ３０７：Ｎｏ）、情報処理装置１００は、ステップＳ３０９の判定（処理）を行う。 Next, the information processing device 100 determines whether the distance d(u, y) between the object u and the object y is less than or equal to r(1+ε) (step S307). If the distance d(u,y) between the object u and the object y is less than or equal to r(1+ε) (step S307: Yes), the information processing device 100 adds the object u to the object set S (step S308). Further, if the distance d(u,y) between the object u and the object y is not less than or equal to r(1+ε) (step S307: No), the information processing apparatus 100 performs the determination (process) of step S309.

次に、情報処理装置１００は、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ以下であるか否かを判定する（ステップＳ３０９）。オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒを超える場合（ステップＳ３０９：Ｎｏ）、情報処理装置１００は、ステップＳ３１５の判定（処理）を行う。すなわち、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ以下ではない場合、情報処理装置１００は、ステップＳ３１５の判定（処理）を行う。 Next, the information processing device 100 determines whether the distance d(u, y) between the object u and the object y is less than or equal to r (step S309). If the distance d(u, y) between object u and object y exceeds r (step S309: No), the information processing apparatus 100 performs the determination (process) of step S315. That is, if the distance d(u, y) between object u and object y is not less than or equal to r, the information processing apparatus 100 performs the determination (process) of step S315.

オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ以下である場合（ステップＳ３０９：Ｙｅｓ）、情報処理装置１００は、オブジェクトｕをオブジェクト集合Ｒに追加する（ステップＳ３１０）。そして、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクト数がｋｓを超えるか否かを判定する（ステップＳ３１１）。所定数ｋｓは、任意に定められる自然数である。例えば、ｋｓは、検索数や抽出対象数であってもよい。また、例えば、範囲検索等において抽出するオブジェクト数の上限を設けない場合、ｋｓは、無限大に設定されてもよい。例えば、ｋｓ＝４等であってもよい。オブジェクト集合Ｒに含まれるオブジェクト数がｋｓを超えない場合（ステップＳ３１１：Ｎｏ）、情報処理装置１００は、ステップＳ３１３の判定（処理）を行う。 If the distance d(u,y) between the object u and the object y is less than or equal to r (step S309: Yes), the information processing device 100 adds the object u to the object set R (step S310). Then, the information processing device 100 determines whether the number of objects included in the object set R exceeds ks (step S311). The predetermined number ks is an arbitrarily determined natural number. For example, ks may be the number of searches or the number of extraction targets. Further, for example, when there is no upper limit on the number of objects to be extracted in a range search or the like, ks may be set to infinity. For example, ks may be 4, etc. If the number of objects included in the object set R does not exceed ks (step S311: No), the information processing device 100 performs the determination (process) of step S313.

オブジェクト集合Ｒに含まれるオブジェクト数がｋｓを超える場合（ステップＳ３１１：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクトの中でオブジェクトｙとの距離が最も長い（遠い）オブジェクトを、オブジェクト集合Ｒから除外する（ステップＳ３１２）。 If the number of objects included in the object set R exceeds ks (step S311: Yes), the information processing device 100 selects the object that is the longest (farthest) from the object y among the objects included in the object set R. Exclude from object set R (step S312).

次に、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクト数がｋｓと一致するか否かを判定する（ステップＳ３１３）。オブジェクト集合Ｒに含まれるオブジェクト数がｋｓと一致しない場合（ステップＳ３１３：Ｎｏ）、情報処理装置１００は、ステップＳ３１５の判定（処理）を行う。また、オブジェクト集合Ｒに含まれるオブジェクト数がｋｓと一致する場合（ステップＳ３１３：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクトの中でオブジェクトｙとの距離が最も長い（遠い）オブジェクトと、オブジェクトｙとの距離を、新たなｒに設定する（ステップＳ３１４）。 Next, the information processing device 100 determines whether the number of objects included in the object set R matches ks (step S313). If the number of objects included in the object set R does not match ks (step S313: No), the information processing device 100 performs the determination (process) of step S315. Further, when the number of objects included in the object set R matches ks (step S313: Yes), the information processing device 100 determines that the distance to the object y is the longest (farthest) among the objects included in the object set R. The distance between the object and object y is set to a new r (step S314).

そして、情報処理装置１００は、オブジェクトｓの近傍オブジェクト集合Ｎ（Ｇ，ｓ）の要素であるオブジェクトから全てのオブジェクトを選択してオブジェクト集合Ｃに格納し終えたか否かを判定する（ステップＳ３１５）。オブジェクトｓの近傍オブジェクト集合Ｎ（Ｇ，ｓ）の要素であるオブジェクトから全てのオブジェクトを選択してオブジェクト集合Ｃに格納し終えていない場合（ステップＳ３１５：Ｎｏ）、情報処理装置１００は、ステップＳ３０６に戻って処理を繰り返す。 Then, the information processing device 100 determines whether all objects that are elements of the neighboring object set N(G, s) of the object s have been selected and stored in the object set C (step S315). . If all objects that are elements of the neighboring object set N(G, s) of the object s have not been selected and stored in the object set C (step S315: No), the information processing device 100 selects the objects that are elements of the neighboring object set N(G, s) of the object s and stores them in the object set C (step S315: No). Return to and repeat the process.

オブジェクトｓの近傍オブジェクト集合Ｎ（Ｇ，ｓ）の要素であるオブジェクトから全てのオブジェクトを選択してオブジェクト集合Ｃに格納し終えた場合（ステップＳ３１５：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｓが空集合であるか否かを判定する（ステップＳ３１６）。オブジェクト集合Ｓが空集合でない場合（ステップＳ３１６：Ｎｏ）、情報処理装置１００は、ステップＳ３０２に戻って処理を繰り返す。また、オブジェクト集合Ｓが空集合である場合（ステップＳ３１６：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｒを出力し、処理を終了する（ステップＳ３１７）。例えば、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクト（ノード）を追加ノード（入力オブジェクトｙ）に対応する近傍ノードとして選択してもよい。例えば、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクト（ノード）を対象ノード（入力オブジェクトｙ）に対応する近傍ノードとして抽出（選択）してもよい。また、例えば、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクト（ノード）を検索クエリ（入力オブジェクトｙ）に対応する検索結果として、検索を行った端末装置等へ提供してもよい。 When all objects that are elements of the object set N(G, s) in the vicinity of the object s have been selected and stored in the object set C (step S315: Yes), the information processing device 100 It is determined whether or not is an empty set (step S316). If the object set S is not an empty set (step S316: No), the information processing device 100 returns to step S302 and repeats the process. Furthermore, if the object set S is an empty set (step S316: Yes), the information processing device 100 outputs the object set R and ends the process (step S317). For example, the information processing apparatus 100 may select an object (node) included in the object set R as a neighboring node corresponding to the additional node (input object y). For example, the information processing apparatus 100 may extract (select) objects (nodes) included in the object set R as neighboring nodes corresponding to the target node (input object y). Further, for example, the information processing device 100 may provide the objects (nodes) included in the object set R as the search results corresponding to the search query (input object y) to the terminal device or the like that performed the search.

〔２．第２の実施形態〕
ここから、第２の実施形態について説明する。第２の実施形態は、クラスタリングなどにより近傍オブジェクトをグルーピングし、各グループ（以下「ブロブ」ともいう）を一括して距離計算を行う対象とする。すなわち、第２の実施形態では、ブロブ単位で一括距離計算を行う。なお、第１の実施形態と同様の点については、適宜説明を省略する。第２の実施形態においては、情報処理システム１は、情報処理装置１００に代えて、情報処理装置１００Ａを有する。 [2. Second embodiment]
The second embodiment will now be described. In the second embodiment, neighboring objects are grouped by clustering or the like, and each group (hereinafter also referred to as a "blob") is subjected to distance calculation at once. That is, in the second embodiment, batch distance calculation is performed in units of blobs. Note that descriptions of points similar to those in the first embodiment will be omitted as appropriate. In the second embodiment, the information processing system 1 includes an information processing device 100A instead of the information processing device 100.

〔２－１．情報処理〕
まず、図１２を用いて、第２の実施形態に係る情報処理の概要を説明する。図１２は、第２の実施形態に係る情報処理の一例を示す図である。 [2-1. Information processing〕
First, an overview of information processing according to the second embodiment will be explained using FIG. 12. FIG. 12 is a diagram illustrating an example of information processing according to the second embodiment.

図１２の例では、情報処理装置１００Ａは、空間情報ＳＰ２１に示すようなグラフＧＲ２１を取得済みであるものとする。例えば図１中の空間情報ＳＰ２１は、ユークリッド空間であってもよい。例えば、空間情報ＳＰ２１は、オブジェクトのベクトルの次元数に対応し、１００次元や１０００次元等の多次元空間であるものとする。なお、図１２の例では、説明を簡単にするために直積量子化が行われていない図を基に説明するが、図１と同様にベクトル（空間）の直積量子化が行われてもよい。 In the example of FIG. 12, it is assumed that the information processing device 100A has already acquired a graph GR21 as shown in the spatial information SP21. For example, the spatial information SP21 in FIG. 1 may be a Euclidean space. For example, it is assumed that the spatial information SP21 corresponds to the number of dimensions of the vector of the object, and is a multidimensional space such as 100 dimensions or 1000 dimensions. Note that the example of FIG. 12 will be explained based on a diagram in which direct product quantization is not performed to simplify the explanation, but vector (space) may be subjected to direct product quantization as in FIG. 1. .

まず、図１２で示す各情報について説明する。図１２の空間情報ＳＰ２１中のノードである白丸（〇）間を接続する点線がノード間を連結するエッジを示す。図１２では、図１と同様に無向エッジを例として示すが、グラフＧＲ２１のエッジは、無向エッジに限らず、有向エッジであってもよい。なお、ノートやエッジについては図１と同様であるため詳細な説明は省略する。 First, each piece of information shown in FIG. 12 will be explained. Dotted lines connecting white circles (◯) that are nodes in the spatial information SP21 in FIG. 12 indicate edges connecting the nodes. In FIG. 12, an undirected edge is shown as an example as in FIG. 1, but the edges of the graph GR21 are not limited to undirected edges and may be directed edges. Note that notes and edges are the same as those in FIG. 1, so detailed explanations will be omitted.

図１２の空間情報ＳＰ２１において、直線で囲まれた領域はクラスタリングなどで近傍のオブジェクトがまとめ上げられた（グループ化された）ものであり、領域をブロブと称する。以下で示す例ではブロブが一括距離計算の処理単位となる。図１２は、ブロブＢＬ１～ＢＬ１０の１０個の領域（ブロブ）にクラスタリングされた場合を示す。例えば、ブロブＢＬ１は、ノードＮ７、Ｎ９、Ｎ８５、Ｎ１２６等の複数のノードが属するブロブであることを示す。なお、ブロブＢＬ１～ＢＬ１０中の黒い点は、各ブロブのセントロイド（代表ベクトル）を示す。 In the spatial information SP21 of FIG. 12, an area surrounded by straight lines is a region in which nearby objects are grouped together (grouped) by clustering or the like, and the area is called a blob. In the example shown below, blobs are the processing unit for batch distance calculation. FIG. 12 shows a case where clustering is performed into ten regions (blobs) of blobs BL1 to BL10. For example, blob BL1 is a blob to which multiple nodes such as nodes N7, N9, N85, and N126 belong. Note that the black dots in the blobs BL1 to BL10 indicate the centroid (representative vector) of each blob.

なお、ブロブＢＬ１～ＢＬ１０は、クラスタリング等に関する種々の手法を適宜用いて生成される。例えば、ブロブに分類するためのクラスタリングは、ｋ－ｍｅａｎｓクラスタリングでもよい。また、例えば、ブロブに分類するためのクラスタリングは、ｋ－ｍｅａｎｓでの一回のイテレーション（アサイン）で各クラスタの中心座標（平均）で得られたセントロイドを次のイテレーションに使うのではなく、その代わりに、各セントロイドに最も近いオブジェクトにセントロイドを置き換えてから、次のイテレーションを行ってもよい。この場合、ｋ－ｍｅａｎｓで得られる最終的なクラスタのセントロイドは既存のオブジェクト、つまり、ノードとなる。これにより、クラスタ（ブロブ）と各ノードのエッジ（近傍ノード）が一致する傾向が高まり、検索性能をさらに向上させることができる。 Note that the blobs BL1 to BL10 are generated using various methods related to clustering and the like as appropriate. For example, clustering for classifying into blobs may be k-means clustering. Also, for example, clustering for classification into blobs does not use the centroid obtained at the center coordinates (average) of each cluster in one iteration (assignment) with k-means for the next iteration. Alternatively, the centroids may be replaced with the object closest to each centroid before the next iteration. In this case, the centroid of the final cluster obtained by k-means is an existing object, that is, a node. This increases the tendency for clusters (blobs) and edges (neighboring nodes) of each node to match, making it possible to further improve search performance.

図１２の例では、情報処理装置１００Ａは、空間情報ＳＰ２１に示すように、複数のノード（オブジェクト）をクラスタリングすることにより、分類した複数のブロブＢＬ１～ＢＬ１０の情報を用いて、検索処理を行う。 In the example of FIG. 12, the information processing device 100A performs a search process using information on a plurality of classified blobs BL1 to BL10 by clustering a plurality of nodes (objects), as shown in the spatial information SP21. .

ここから、検索クエリＱＥ２を対象とする検索処理を説明する。まず、情報処理装置１００Ａは、検索クエリＱＥ２を取得する（ステップＳ２１）。例えば、情報処理装置１００Ａは、ユーザが利用する端末装置１０（図４参照）から検索クエリＱＥ２を取得する。 From here, a search process targeting the search query QE2 will be explained. First, the information processing device 100A obtains the search query QE2 (step S21). For example, the information processing device 100A obtains the search query QE2 from the terminal device 10 (see FIG. 4) used by the user.

そして、情報処理装置１００Ａは、検索クエリＱＥ２のベクトルと、コードブックのベクトルとの間の距離を算出する。図示は省略するが直積量子化を行っていない場合、情報処理装置１００Ａは、検索クエリＱＥ２のベクトルと、ベクトル全体を量子化するためのコードブックのベクトルとの間の距離（差分）を算出する。そして、情報処理装置１００Ａは、検索クエリＱＥ２のベクトルと各コードブックのベクトルとの間の距離（差分）を、各コードブックを識別するための情報に対応付けてコードブック情報ＴＢ２１として保持する。なお、直積量子化を行っている場合、情報処理装置１００Ａは、図１と同様に、図２のコードブック情報ＴＢ１～ＴＢ４に示すように、検索クエリＱＥ２の各サブベクトルと各コードブックのベクトルとの間の距離（差分）を算出する。なお、検索クエリのベクトルとコードブックのベクトルとの距離算出は、図１、図２等と同様であるため詳細な説明は省略する。 Then, the information processing device 100A calculates the distance between the vector of the search query QE2 and the vector of the codebook. Although illustration is omitted, when cross product quantization is not performed, the information processing device 100A calculates the distance (difference) between the vector of the search query QE2 and the vector of the codebook for quantizing the entire vector. . The information processing device 100A then stores the distance (difference) between the vector of the search query QE2 and the vector of each codebook as codebook information TB21 in association with information for identifying each codebook. Note that when performing Cartesian product quantization, the information processing device 100A stores each subvector of the search query QE2 and the vector of each codebook, as shown in codebook information TB1 to TB4 of FIG. 2, as in FIG. Calculate the distance (difference) between Note that the calculation of the distance between the vector of the search query and the vector of the codebook is the same as in FIGS. 1, 2, etc., and therefore detailed explanation will be omitted.

情報処理装置１００Ａは、検索クエリＱＥ２を対象とする検索処理を実行する（ステップＳ２２）。情報処理装置１００Ａは、検索クエリＱＥ２を対象として、グラフＧＲ２１を用いた図１６に示すような検索処理を行うことにより、検索クエリＱＥ２の検索結果を得る。図１６に示す検索処理についての詳細は後述する。情報処理装置１００Ａは、検索クエリＱＥ２を対象として検索処理を行うことにより、検索数のノードを検索クエリＱＥ２の近傍のノードとして抽出する。 The information processing device 100A executes a search process targeting the search query QE2 (step S22). The information processing device 100A obtains a search result for the search query QE2 by performing a search process as shown in FIG. 16 using the graph GR21 for the search query QE2. Details of the search process shown in FIG. 16 will be described later. The information processing device 100A performs a search process on the search query QE2, thereby extracting the searched nodes as nodes in the vicinity of the search query QE2.

情報処理装置１００Ａは、検索クエリＱＥ２を対象とする検索処理において、各ノードのうち、所定のノードをグラフＧＲ２１の検索の起点となるノード（起点ノード）として選択する。図１２の例では、情報処理装置１００Ａは、起点ノードとして、ノードＮ９を選択するものとする。図１２の例では、情報処理装置１００Ａは、例えば、ノードＮ９を処理対象として検索処理を開始する。 In the search process targeting the search query QE2, the information processing device 100A selects a predetermined node from among the nodes as the node (starting point node) that becomes the starting point for the search of the graph GR21. In the example of FIG. 12, it is assumed that the information processing apparatus 100A selects node N9 as the starting point node. In the example of FIG. 12, the information processing device 100A starts the search process with node N9 as the processing target, for example.

ここで、情報処理装置１００Ａは、検索クエリＱＥ２を対象とする検索処理において、処理対象となるノードが属するブロブに属する複数のノードと検索クエリとの距離を並列化して算出する（ステップＳ２３）。図１２では、情報処理装置１００Ａは、一括処理情報ＬＴ２に示すように、処理対象となるノードＮ９が属するブロブＢＬ１に属するノードであるノードＮ７、Ｎ９、Ｎ８５、Ｎ１２６等については、検索クエリＱＥ２との距離を並列化して算出する。 Here, in the search processing targeting the search query QE2, the information processing device 100A parallelizes and calculates distances between the search query and a plurality of nodes belonging to the blob to which the processing target node belongs (step S23). In FIG. 12, as shown in the batch processing information LT2, the information processing device 100A uses the search query QE2 for nodes N7, N9, N85, N126, etc. that belong to the blob BL1 to which the processing target node N9 belongs. Calculate the distance in parallel.

情報処理装置１００Ａは、コードブック情報ＴＢ２１が示す各コードブックと検索クエリＱＥ２との間の距離を用いて、ノードＮ７、Ｎ９、Ｎ８５、Ｎ１２６の各々と検索クエリＱＥ２との距離を算出する。例えば、情報処理装置１００Ａは、コードブック情報ＴＢ２１を参照して、ノードＮ７のベクトルに対応するコードブックの距離を、ノードＮ７と検索クエリＱＥ２との距離として算出する。なお、直積量子化が行われている場合の距離算出は、図１、図２等と同様であるため詳細な説明は省略する。 The information processing device 100A uses the distance between each codebook indicated by the codebook information TB21 and the search query QE2 to calculate the distance between each of the nodes N7, N9, N85, and N126 and the search query QE2. For example, the information processing device 100A refers to the codebook information TB21 and calculates the distance of the codebook corresponding to the vector of the node N7 as the distance between the node N7 and the search query QE2. Note that distance calculation when cross-product quantization is performed is the same as in FIGS. 1, 2, etc., and detailed description thereof will be omitted.

なお、一括処理可能単位については、情報処理装置１００の場合と同様に、情報処理装置１００Ａの仕様に基づいて決定される。例えば、情報処理装置１００ＡがＳＩＭＤにより一括して処理できる数（一括処理可能単位）が「４」である場合、図１と同様に、４個のノードを対象として距離を並列化して算出する。例えば、情報処理装置１００Ａは、ＳＩＭＤの演算に関する並列化により、ノードＮ７、Ｎ９、Ｎ８５、Ｎ１２６の各々と検索クエリＱＥ２との距離を一括して算出する。これにより、情報処理装置１００Ａは、距離計算を並列化することにより、距離計算を高速化することでき、効率的な検索処理を可能にすることができる。また、情報処理装置１００Ａは、グラフＧＲ２１を用いて、ノードＮ７からのエッジが接続されたノード（例えばノードＮ１２、Ｎ６４）を辿り、それらのノードが属するブロブ（例えばブロブＢＬ２、ＢＬ３）を対象とした一括距離計算を行って、検索処理を行う。なお、情報処理装置１００Ａは、検索処理において、同じブロブが繰り返し一括距離計算の対象となることを抑制するが詳細は後述する。 Note that, as in the case of the information processing apparatus 100, the units that can be collectively processed are determined based on the specifications of the information processing apparatus 100A. For example, if the number that the information processing device 100A can process at once using SIMD (unit that can be processed at once) is "4", the distances are computed in parallel for four nodes, as in FIG. 1. For example, the information processing device 100A collectively calculates the distance between each of the nodes N7, N9, N85, and N126 and the search query QE2 by parallelizing SIMD calculations. Thereby, the information processing device 100A can speed up the distance calculation by parallelizing the distance calculation, and can perform efficient search processing. In addition, the information processing device 100A uses the graph GR21 to trace nodes (for example, nodes N12, N64) to which edges from the node N7 are connected, and target blobs (for example, blobs BL2, BL3) to which these nodes belong. The search process is performed by performing batch distance calculation. Note that in the search process, the information processing device 100A suppresses the same blob from being repeatedly subjected to batch distance calculation, but details will be described later.

なお、図１２では説明を簡単にするために、４つのノードを対象として距離を並列化して算出する場合を示すが、並列化される数は、情報処理装置１００Ａの仕様に基づいて決定される。この点についても図１、図２等と同様であるため、詳細な説明は省略する。また、情報処理装置１００Ａは、図３に示す例と同様に、転置データを用いてもよい。 Note that, in order to simplify the explanation, FIG. 12 shows a case where distances are computed in parallel for four nodes, but the number of parallelized nodes is determined based on the specifications of the information processing device 100A. . This point is also similar to FIGS. 1, 2, etc., so detailed explanation will be omitted. Further, the information processing device 100A may use transposed data similarly to the example shown in FIG.

上述のように、情報処理装置１００Ａは、ベクトル量子化された各ノードのベクトルと検索クエリとの間の近似距離（第２距離）を算出して、第２距離を用いて検索処理を行う事により、効率的な検索処理を可能にすることができる。また、情報処理装置１００Ａは、並列化可能な数のノードの距離計算を一括して行うことにより、効率的な検索処理を可能にすることができる。上記のように、情報処理装置１００Ａは、ブロブを一括処理の単位として一括して計算することにより、効率的な検索処理を可能にすることができる。 As described above, the information processing device 100A calculates the approximate distance (second distance) between the vector quantized vector of each node and the search query, and performs the search process using the second distance. This enables efficient search processing. Further, the information processing device 100A can perform efficient search processing by collectively performing distance calculations for a number of nodes that can be parallelized. As described above, the information processing apparatus 100A can perform efficient search processing by collectively calculating blobs as a unit of batch processing.

情報処理装置１００Ａは、検索処理において、グラフＧＲ２１を辿り処理対象となるノードを対象に上述した処理を行うことにより、検索数のノードを検索クエリＱＥ２の近傍のノードとして抽出する。例えば、情報処理装置１００Ａは、情報処理装置１００と同様に、上述した第１の方法及び第２の方法のいずれかにより検索結果を得る。 In the search process, the information processing device 100A traces the graph GR21 and performs the above-described process on the nodes to be processed, thereby extracting the searched nodes as nodes in the vicinity of the search query QE2. For example, like the information processing device 100, the information processing device 100A obtains search results using either the first method or the second method described above.

上述した処理は一例に過ぎず、情報処理装置１００Ａは、効率的な検索処理の為に様々な情報や手法を用いて、検索処理を行ってもよい。この点について、各事項について詳述する。例えば、情報処理装置１００Ａは、各ブロブに属するノードの数が、情報処理装置１００がＳＩＭＤにより一括して処理できる数（一括処理可能単位）になるように、クラスタリングを行い、ブロブを生成してもよい。 The above-described processing is merely an example, and the information processing device 100A may perform search processing using various information and methods for efficient search processing. Regarding this point, each item will be explained in detail. For example, the information processing device 100A performs clustering and generates blobs so that the number of nodes belonging to each blob becomes the number that the information processing device 100 can process at once using SIMD (batch processable unit). Good too.

また、情報処理装置１００Ａは、ブロブに関する情報を用いて、重複した処理が行われることを抑制してもよい。情報処理装置１００Ａは、ブロブ単位にそのブロブの距離計算を行ったかを示すブロブ距離計算フラグ（以下単に「フラグ」ともいう）、および各オブジェクトがどのブロブに属するかを示すテーブルを有してもよい。この場合、情報処理装置１００Ａは、フラグにより各ブロブが処理済であるか否かを管理してもよい。例えば、情報処理装置１００Ａは、近傍ノードを逐一処理する直前にノードが属するブロブをテーブルにより特定し、そのブロブが一括距離計算済みかをフラグで判断し、未計算の場合には、一括計算を行い、個々のノードの処理を行ってもよい。この点については、図１５や図１６においても説明する。これにより、情報処理装置１００Ａは、重複した処理が行われることを抑制することができる。 Further, the information processing device 100A may use information regarding blobs to suppress redundant processing. The information processing device 100A may have a blob distance calculation flag (hereinafter also simply referred to as a "flag") that indicates whether distance calculation has been performed for each blob, and a table that indicates which blob each object belongs to. good. In this case, the information processing device 100A may manage whether each blob has been processed using a flag. For example, immediately before processing neighboring nodes one by one, the information processing device 100A identifies the blob to which the node belongs based on a table, determines whether the blob has already been subjected to batch distance calculation using a flag, and if the distance has not yet been calculated, performs batch calculation. You may also perform processing on individual nodes. This point will also be explained with reference to FIGS. 15 and 16. Thereby, the information processing device 100A can prevent duplicate processing from being performed.

〔２－２．情報処理装置の構成〕
次に、図１３を用いて、第２の実施形態に係る情報処理装置１００Ａの構成について説明する。図１３は、第２の実施形態に係る情報処理装置の構成例を示す図である。図１３に示すように、情報処理装置１００Ａは、通信部１１０と、記憶部１２０Ａと、制御部１３０Ａとを有する。なお、情報処理装置１００Ａにおいて、情報処理装置１００と同様の点は適宜説明を省略する。 [2-2. Configuration of information processing device]
Next, the configuration of the information processing apparatus 100A according to the second embodiment will be described using FIG. 13. FIG. 13 is a diagram illustrating a configuration example of an information processing apparatus according to the second embodiment. As shown in FIG. 13, the information processing device 100A includes a communication section 110, a storage section 120A, and a control section 130A. Note that in the information processing device 100A, descriptions of the same points as the information processing device 100 will be omitted as appropriate.

（記憶部１２０Ａ）
記憶部１２０Ａは、例えば、ＲＡＭ、フラッシュメモリ等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。第２の実施形態に係る記憶部１２０Ａは、図１３に示すように、オブジェクト情報記憶部１２１と、グラフ情報記憶部１２２と、量子化情報記憶部１２３Ａと、コードブック情報記憶部１２４と、ブロブ情報記憶部１２５とを有する。 (Storage unit 120A)
The storage unit 120A is realized by, for example, a semiconductor memory device such as a RAM or a flash memory, or a storage device such as a hard disk or an optical disk. As shown in FIG. 13, the storage unit 120A according to the second embodiment includes an object information storage unit 121, a graph information storage unit 122, a quantization information storage unit 123A, a codebook information storage unit 124, and a blob information storage unit 121. It has an information storage section 125.

（量子化情報記憶部１２３Ａ）
第２の実施形態に係る量子化情報記憶部１２３Ａは、割当処理に関する各種情報を記憶する。図１４は、第２の実施形態に係る量子化情報記憶部の一例を示す図である。図１４の例では、量子化情報記憶部１２３Ａには、「ノードＩＤ」、「オブジェクトＩＤ」、「ブロブＩＤ」、および「量子化情報」といった項目を有する。 (Quantization information storage unit 123A)
The quantization information storage unit 123A according to the second embodiment stores various information regarding allocation processing. FIG. 14 is a diagram illustrating an example of a quantization information storage unit according to the second embodiment. In the example of FIG. 14, the quantization information storage unit 123A has items such as "node ID", "object ID", "blob ID", and "quantization information".

「ノードＩＤ」は、グラフにおける各ノード（対象）を識別するための識別情報を示す。また、「オブジェクトＩＤ」は、オブジェクトを識別するための識別情報を示す。なお、ノードＩＤとオブジェクトＩＤが共通である場合、「ノードＩＤ」にオブジェクトＩＤが記憶され、量子化情報記憶部１２３Ａに「オブジェクトＩＤ」の項目は含まれてなくてもよい。 "Node ID" indicates identification information for identifying each node (object) in the graph. Moreover, "object ID" indicates identification information for identifying an object. Note that when the node ID and object ID are common, the object ID is stored in the "node ID", and the quantization information storage unit 123A does not need to include the item "object ID".

「ブロブＩＤ」は、ノード（オブジェクト）が属するブロブを識別するための情報を示す。 “Blob ID” indicates information for identifying a blob to which a node (object) belongs.

また、「量子化情報」は、各ノード（オブジェクト）の量子化されたベクトルの情報を示す。例えば、「量子化情報」には、「要素」、「コードブックＩＤ」といった情報が含まれる。「要素」は、対応するオブジェクトのベクトルにおける配置を示す。図１４の例では、「要素」には、「＃１」、「＃２」、「＃３」、「＃４」が含まれる場合を示す。この場合、各ノード（オブジェクト）のベクトルは４分割され、各分割された部分ベクトルがコードブックにより量子化されることを示す。 Moreover, "quantization information" indicates information on quantized vectors of each node (object). For example, "quantization information" includes information such as "element" and "codebook ID." "Element" indicates the arrangement of the corresponding object in the vector. In the example of FIG. 14, the "element" includes "#1", "#2", "#3", and "#4". In this case, the vector of each node (object) is divided into four parts, and each divided partial vector is quantized by the codebook.

図１４の例では、ノードＮ１（オブジェクトＯＢ１）は、ブロブＩＤ「ＢＬ９」により識別されるブロブ（ブロブＢＬ９）に属することを示す。例えば、直積量子化によりベクトルが４分割される場合、ノードＮ１の量子化情報には、図８に示すノードＮ１の量子化情報示す４つのコードブックと同様の情報が格納される。また、例えば、直積量子化が行われない場合、ノードＮ１の量子化情報には、ノードＮ１のベクトルを量子化するための、１つのコードブックを示す情報が記憶される。 The example in FIG. 14 indicates that the node N1 (object OB1) belongs to the blob (blob BL9) identified by the blob ID "BL9". For example, when a vector is divided into four by Cartesian product quantization, the quantization information of node N1 stores the same information as the four codebooks indicating the quantization information of node N1 shown in FIG. Further, for example, when cross-product quantization is not performed, the quantization information of the node N1 stores information indicating one codebook for quantizing the vector of the node N1.

なお、量子化情報記憶部１２３Ａは、上記に限らず、目的に応じて種々の情報を記憶してもよい。 Note that the quantization information storage section 123A is not limited to the above, and may store various information depending on the purpose.

（ブロブ情報記憶部１２５）
第２の実施形態に係るブロブ情報記憶部１２５は、ブロブに関する情報を記憶する。例えば、ブロブ情報記憶部１２５は、各ブロブに対応付けられたオブジェクトを識別する各種情報を記憶する。図１５は、第２の実施形態に係るブロブ情報記憶部の一例を示す図である。図１５の例では、ブロブ情報記憶部１２５は、「ブロブＩＤ」、「ノードＩＤ」、「ベクトル情報」といった項目が含まれる。 (Blob information storage unit 125)
The blob information storage unit 125 according to the second embodiment stores information regarding blobs. For example, the blob information storage unit 125 stores various information that identifies objects associated with each blob. FIG. 15 is a diagram illustrating an example of a blob information storage unit according to the second embodiment. In the example of FIG. 15, the blob information storage unit 125 includes items such as "blob ID", "node ID", and "vector information".

「ブロブＩＤ」は、ブロブを識別するための識別情報を示す。また、「ノードＩＤ」は、ブロブＩＤにより識別されるブロブに対応付けられたノード（オブジェクト）を示す。「ベクトル情報」は、ブロブのベクトル情報を示す。例えば、「ベクトル情報」は、ブロブのセントロイドに対応するベクトルを示す。 “Blob ID” indicates identification information for identifying a blob. Further, "node ID" indicates a node (object) associated with the blob identified by the blob ID. "Vector information" indicates vector information of a blob. For example, "vector information" indicates the vector corresponding to the centroid of the blob.

図１５に示す例では、ブロブＩＤ「ＢＬ１」により識別されるブロブ（ブロブＢＬ１）に対応付けられたノード（オブジェクト）は、ノードＮ７、Ｎ９、Ｎ８５、Ｎ１２６等であることを示す。ブロブＢＬ１は、「５１，４，１０２，３３・・・」の多次元のベクトル情報が対応付けられることを示す。 The example shown in FIG. 15 indicates that the nodes (objects) associated with the blob (blob BL1) identified by the blob ID "BL1" are nodes N7, N9, N85, N126, etc. Blob BL1 indicates that multidimensional vector information of "51, 4, 102, 33..." is associated with it.

また、ブロブＩＤ「ＢＬ９」により識別されるブロブ（ブロブＢＬ９）に対応付けられたノード（オブジェクト）は、ノードＮ１、Ｎ４、Ｎ５等であることを示す。ブロブＢＬ９は、「１２，５５，１２，６・・・」の多次元のベクトル情報が対応付けられることを示す。 Further, the nodes (objects) associated with the blob (blob BL9) identified by the blob ID "BL9" are nodes N1, N4, N5, etc. Blob BL9 indicates that multidimensional vector information of "12, 55, 12, 6..." is associated with it.

なお、ブロブ情報記憶部１２５は、上記に限らず、目的に応じて種々の情報を記憶してもよい。ブロブ情報記憶部１２５は、各ブロブが距離計算の処理対象となったか否かを示すフラグを記憶してもよい。例えば、ブロブ情報記憶部１２５は、距離計算の対象として処理されていないことを示す値（「未処理フラグ値」ともいう）と、距離計算の対象として処理されたことを示す値（「処理済フラグ値」ともいう）とのいずれかを各ブロブのフラグの値に設定する。例えば、ブロブ情報記憶部１２５は、検索処理開始時に各ブロブのフラグの値を、そのブロブを対象としての距離計算が未処理であることを示す値（例えば０）に設定し、距離計算の対象となったブロブのフラグの値を、距離計算が処理済みであることを示す値（例えば１）に変更する。 Note that the blob information storage unit 125 is not limited to the above, and may store various types of information depending on the purpose. The blob information storage unit 125 may store a flag indicating whether each blob is a target of distance calculation processing. For example, the blob information storage unit 125 stores a value indicating that the blob has not been processed as a target for distance calculation (also referred to as "unprocessed flag value") and a value indicating that it has been processed as a target for distance calculation ("processed flag value"). (also referred to as "flag value") as the value of each blob's flag. For example, at the start of the search process, the blob information storage unit 125 sets the value of the flag of each blob to a value (for example, 0) indicating that the distance calculation for that blob has not been processed, and The value of the flag of the blob that has become , is changed to a value (for example, 1) that indicates that distance calculation has been completed.

例えば、情報処理装置１００Ａは、処理対象となったブロブのフラグの値を参照して、そのブロブについて一括距離計算の処理を実行するか否かを判定する。情報処理装置１００Ａは、各ブロブのフラグの値を参照し、そのブロブのフラグの値が未処理フラグ値である場合は、そのブロブが一括距離計算の処理前であると判定して、そのブロブに属するノードの一括距離計算を実行する。一方、情報処理装置１００Ａは、そのブロブのフラグの値が処理済フラグ値である場合は、そのブロブが一括距離計算の処理済みであると判定して、そのブロブに属するノードの一括距離計算を行わない。 For example, the information processing device 100A refers to the value of the flag of the blob that is the processing target and determines whether to perform the batch distance calculation process for that blob. The information processing device 100A refers to the flag value of each blob, and if the blob's flag value is an unprocessed flag value, the information processing device 100A determines that the blob has not yet been processed in the batch distance calculation, and processes the blob. Perform bulk distance calculation of nodes belonging to . On the other hand, if the value of the flag of the blob is the processed flag value, the information processing device 100A determines that the blob has been subjected to the batch distance calculation, and performs the batch distance calculation of the nodes belonging to the blob. Not performed.

（制御部１３０Ａ）
図１３の説明に戻って、制御部１３０Ａは、コントローラ（controller）であり、例えば、ＣＰＵやＭＰＵやＧＰＵ等によって、情報処理装置１００Ａ内部の記憶装置に記憶されている各種プログラム（情報処理プログラムの一例に相当）がＲＡＭを作業領域として実行されることにより実現される。また、制御部１３０Ａは、コントローラであり、例えば、ＡＳＩＣやＦＰＧＡ等の集積回路により実現される。 (Control unit 130A)
Returning to the explanation of FIG. 13, the control unit 130A is a controller, and includes various programs (information processing programs) stored in a storage device inside the information processing device 100A, for example, by a CPU, MPU, GPU, etc. (corresponding to one example) is realized by being executed using RAM as a work area. Further, the control unit 130A is a controller, and is realized by, for example, an integrated circuit such as an ASIC or an FPGA.

図１３に示すように、制御部１３０Ａは、取得部１３１と、生成部１３２Ａと、検索処理部１３３Ａと、提供部１３４とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部１３０Ａの内部構成は、図１３に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。 As shown in FIG. 13, the control unit 130A includes an acquisition unit 131, a generation unit 132A, a search processing unit 133A, and a provision unit 134, and realizes or executes information processing functions and operations described below. do. Note that the internal configuration of the control unit 130A is not limited to the configuration shown in FIG. 13, and may be any other configuration as long as it performs information processing to be described later.

（生成部１３２Ａ）
生成部１３２Ａは、生成部１３２と同様に各種情報を生成する。 (Generation unit 132A)
The generation unit 132A generates various information similarly to the generation unit 132.

生成部１３２Ａは、量子化情報記憶部１２３に示すようなベクトル量子化に関する情報を生成してもよい。例えば、生成部１３２は、ノードＮ１（オブジェクトＯＢ１）がブロブＢＬ９に属することを示す情報を生成する。また、生成部１３２は、ブロブ情報記憶部１２５に示すようなブロブに関する情報を生成してもよい。例えば、生成部１３２は、ブロブＢＬ１には、ノードＮ７、Ｎ９、Ｎ８５、Ｎ１２６等が属することを示す情報を生成する。なお、情報処理装置１００がグラフ情報記憶部１２２、量子化情報記憶部１２３Ａ、コードブック情報記憶部１２４、ブロブ情報記憶部１２５に示す情報を、情報提供装置５０等の外部装置から取得する場合、情報処理装置１００は、生成部１３２Ａを有しなくてもよい。 The generation unit 132A may generate information related to vector quantization as shown in the quantization information storage unit 123. For example, the generation unit 132 generates information indicating that the node N1 (object OB1) belongs to the blob BL9. Further, the generation unit 132 may generate information regarding blobs as shown in the blob information storage unit 125. For example, the generation unit 132 generates information indicating that nodes N7, N9, N85, N126, etc. belong to blob BL1. Note that when the information processing device 100 acquires the information shown in the graph information storage unit 122, quantization information storage unit 123A, codebook information storage unit 124, and blob information storage unit 125 from an external device such as the information providing device 50, The information processing device 100 does not need to include the generation unit 132A.

（検索処理部１３３Ａ）
検索処理部１３３Ａは、検索処理部１３３と同様に検索処理に関する各種処理を行う。 (Search processing unit 133A)
Similar to the search processing unit 133, the search processing unit 133A performs various processes related to search processing.

検索処理部１３３Ａは、複数のオブジェクトの各々に対応するノード群を対象として、検索クエリの近傍のノードを検索する検索処理において、複数のオブジェクトを分類した複数のブロブの情報を用いて、一のブロブに属する複数のノードと検索クエリとの距離を、ベクトル量子化された複数のノードのベクトル情報を用いて算出する。検索処理部１３３Ａは、複数のノードと検索クエリとの距離の算出を並列化して一括で行う。検索処理部１３３Ａは、情報処理装置１００Ａの仕様に基づいて決定される一括処理数の複数のノードと検索クエリとの距離の算出を並列処理する。 The search processing unit 133A uses information on a plurality of blobs in which a plurality of objects are classified in a search process for searching for nodes near a search query in a group of nodes corresponding to each of a plurality of objects. Distances between a plurality of nodes belonging to a blob and a search query are calculated using vector quantized vector information of a plurality of nodes. The search processing unit 133A parallelizes and collectively calculates distances between a plurality of nodes and a search query. The search processing unit 133A performs parallel processing to calculate the distance between the search query and a plurality of nodes whose number is determined based on the specifications of the information processing device 100A.

検索処理部１３３Ａは、検索クエリが該当するブロブと隣接する一のブロブに属する複数のノードと検索クエリとの距離を算出する。検索処理部１３３Ａは、複数のノードがエッジにより連結されたグラフを用いて、検索クエリの近傍のノードを検索する検索処理において、複数のノードと検索クエリとの距離を算出する。検索処理部１３３Ａは、検索処理において処理対象となる対象ノードからのエッジが連結された接続ノードが属する一のブロブに属する複数のノードと検索クエリとの距離を算出する。 The search processing unit 133A calculates the distance between the search query and a plurality of nodes belonging to one blob adjacent to the blob to which the search query applies. The search processing unit 133A calculates distances between a plurality of nodes and a search query in a search process that searches for nodes in the vicinity of a search query using a graph in which a plurality of nodes are connected by edges. The search processing unit 133A calculates the distance between the search query and a plurality of nodes belonging to one blob to which a connection node to which edges from the target node to be processed belong in the search process.

検索処理部１３３Ａは、対象ノードが属するブロブ以外の一のブロブに属する複数のノードと検索クエリとの距離を算出する。検索処理部１３３Ａは、複数のノードの各々から、複数のノードの各々が属するブロブ以外の他のブロブへ連結されたブロブ用グラフを用いて、検索クエリの近傍のノードを検索する検索処理において、複数のノードと検索クエリとの距離を算出する。検索処理部１３３Ａは、複数のノードがエッジにより連結された変換前のグラフにおいて、複数のノードのうち一のノードからエッジが連結されたノードが属するブロブであって、一のノードが属するブロブ以外のブロブへ、一のノードからエッジを連結することにより生成された変換後のグラフであるブロブ用グラフを用いて、検索クエリの近傍のノードを検索する検索処理において、複数のノードと検索クエリとの距離を算出する。 The search processing unit 133A calculates the distance between the search query and a plurality of nodes belonging to a blob other than the blob to which the target node belongs. In a search process in which the search processing unit 133A searches for nodes near the search query using a blob graph connected from each of the plurality of nodes to other blobs other than the blob to which each of the plurality of nodes belongs, Calculate the distance between multiple nodes and a search query. In the graph before conversion in which a plurality of nodes are connected by edges, the search processing unit 133A searches for a blob to which a node to which an edge is connected from one of the plurality of nodes belongs, other than the blob to which the one node belongs. In a search process that searches for nodes near a search query using a blob graph, which is a converted graph generated by connecting edges from one node to a blob, multiple nodes and a search query are Calculate the distance.

検索処理部１３３Ａは、検索処理において処理対象となる対象ノードからのエッジが連結されたブロブである一のブロブに属する複数のノードと検索クエリとの距離を算出する。検索処理部１３３Ａは、既に処理対象となったブロブである処理済みブロブを示す情報を用いて、一のブロブを処理対象とするかを判定する。検索処理部１３３Ａは、一のブロブが処理済みブロブである場合、一のブロブを処理対象としないと判定する。 The search processing unit 133A calculates the distance between the search query and a plurality of nodes belonging to one blob, which is a blob in which edges from the target node to be processed are connected in the search process. The search processing unit 133A uses information indicating a processed blob, which is a blob that has already been a processing target, to determine whether one blob is to be processed. If the one blob is a processed blob, the search processing unit 133A determines that the one blob is not to be processed.

検索処理部１３３Ａは、検索処理において処理対象となったノードのうち、所定のノードを対象として、ベクトル量子化がされた距離である第１距離とは異なり、ベクトル量子化がされていない第２距離を算出する。検索処理部１３３Ａは、検索処理において検索クエリの近傍のノードとして抽出するノードの第１数よりも多い数である第２数のノードを近傍候補ノードとして抽出し、近傍候補ノードを対象として第２距離を算出する。検索処理部１３３Ａは、近傍候補ノードのうち、第２距離が短い方から第１数のノードを検索クエリの近傍のノードとして抽出する。 The search processing unit 133A targets a predetermined node among the nodes to be processed in the search process. Calculate distance. In the search process, the search processing unit 133A extracts a second number of nodes, which is a larger number than the first number of nodes to be extracted as nodes in the vicinity of the search query, as neighboring candidate nodes, and selects a second number of nodes as neighboring candidate nodes. Calculate distance. The search processing unit 133A extracts a first number of nodes from among the nearby candidate nodes having the shorter second distance as nodes in the vicinity of the search query.

検索処理部１３３Ａは、第１距離が所定の閾値以内であるノードを対象として第２距離を算出する。検索処理部１３３Ａは、近傍のノードとして抽出する対象範囲を示す検索範囲内のノードを対象として第２距離を算出する。検索処理部１３３Ａは、検索処理の対象範囲を示す探索範囲内のノードを対象として第２距離を算出する。 The search processing unit 133A calculates a second distance for nodes whose first distance is within a predetermined threshold. The search processing unit 133A calculates a second distance for nodes within the search range indicating the target range to be extracted as nearby nodes. The search processing unit 133A calculates a second distance for nodes within a search range indicating a target range of the search process.

〔２－３．検索処理例〕
ここで、第２の実施形態に係る検索処理の一例について、図１６を一例として説明する。図１６は、第２の実施形態に係る検索処理の一例を示すフローチャートである。以下に説明する検索処理は、情報処理装置１００Ａの検索処理部１３３Ａによって行われる。なお、図１１等、第１の実施形態と同様の点については適宜説明を省略する。例えば、図１６において、図１１と同様の点は同じステップ番号を付すことにより、適宜説明を省略する。 [2-3. Search processing example]
Here, an example of the search process according to the second embodiment will be described using FIG. 16 as an example. FIG. 16 is a flowchart illustrating an example of search processing according to the second embodiment. The search processing described below is performed by the search processing unit 133A of the information processing device 100A. Note that descriptions of points similar to those of the first embodiment, such as FIG. 11, will be omitted as appropriate. For example, in FIG. 16, the same steps as those in FIG. 11 are given the same step numbers, and the description thereof will be omitted as appropriate.

ここでは、近傍オブジェクト集合Ｎ（Ｇ，ｙ）は、ノードｙに付与されているエッジにより関連付けられている近傍のオブジェクトの集合である。「Ｇ」は、所定のグラフデータ（例えば、空間情報ＳＰ２１に示すグラフＧＲ２１等）であってもよい。例えば、情報処理装置１００Ａは、ｋ近傍検索処理を実行する。 Here, the neighboring object set N(G, y) is a set of neighboring objects that are related by edges attached to node y. "G" may be predetermined graph data (for example, graph GR21 shown in spatial information SP21, etc.). For example, the information processing device 100A executes k-nearest neighbor search processing.

例えば、図１６に示す検索処理は、ステップＳ３０４の後にステップＳ３０４ａに示す処理を行う点で、図１１に示す検索処理と相違する。このように、図１６に示す検索処理は、通常のグラフの探索で、まだ、アクセスしていないブロブに到達した場合にそのブロブ内のオブジェクト（ノード）とクエリとの距離を一括して計算し、各ノードの処理を行う。 For example, the search process shown in FIG. 16 differs from the search process shown in FIG. 11 in that the process shown in step S304a is performed after step S304. In this way, the search process shown in FIG. 16 is a normal graph search, and when a blob that has not been accessed yet is reached, the distance between the object (node) in that blob and the query is calculated all at once. , performs processing on each node.

具体的には、図１６の検索処理においては、情報処理装置１００Ａは、オブジェクトｓが属するブロブがまだ距離計算していない場合には、以下のすべて（図１６中のステップＳ３０４ａ中の「－」に続けて示す４つの処理。以下「第１の処理」～「第４の処理」とする）を実行する（ステップＳ３０４ａ）。ステップＳ３０４ａにおいて、情報処理装置１００Ａは、そのブロブ内のオブジェクトを一括距離計算する処理（第１の処理）を実行する。また、ステップＳ３０４ａにおいて、情報処理装置１００Ａは、ブロブの距離計算フラグをセットする処理（第２の処理）を実行する。また、ステップＳ３０４ａにおいて、情報処理装置１００Ａは、一括距離計算を行ったブロブ内のすべてのオブジェクトをＣに格納する処理（第３の処理）を実行する。また、ステップＳ３０４ａにおいて、情報処理装置１００Ａは、ブロブのすべてのオブジェクトをｕとして逐一ステップＳ３０７からステップＳ３１４までの処理を行う処理（第４の処理）を実行する。その後、情報処理装置１００Ａは、ステップＳ３０６以降の処理を行う。 Specifically, in the search process of FIG. 16, if the distance of the blob to which the object s belongs has not yet been calculated, the information processing device 100A performs all of the following (“-” in step S304a in FIG. 16). The following four processes (hereinafter referred to as "first process" to "fourth process") are executed (step S304a). In step S304a, the information processing device 100A executes a process (first process) of calculating the distances of objects within the blob at once. Furthermore, in step S304a, the information processing device 100A executes a process (second process) of setting a blob distance calculation flag. Furthermore, in step S304a, the information processing apparatus 100A executes a process (third process) of storing all objects in the blob for which the batch distance calculation has been performed in C. Furthermore, in step S304a, the information processing apparatus 100A executes a process (fourth process) in which all objects of the blob are set as u and processes from step S307 to step S314 are performed one by one. After that, the information processing device 100A performs the processing from step S306 onwards.

〔２－４．変形例〕
ここから、変形例について説明する。第２の実施形態に係る変形例においては、ブロブの概念を含むグラフを用いてもよい。例えば、第２の実施形態に係る変形例においては、ノードからのエッジによる参照先がブロブであるグラフを用いてもよい。なお、第１の実施形態や第２の実施形態と同様の点については、適宜説明を省略する。変形例に係る情報処理装置１００Ａは、グラフ情報記憶部１２２に代えて、グラフ情報記憶部１２２Ａを有する。 [2-4. Modified example]
From here, modified examples will be explained. In a modification of the second embodiment, a graph including the concept of blob may be used. For example, in a modification of the second embodiment, a graph may be used in which a blob is referenced by an edge from a node. Note that descriptions of points similar to those in the first embodiment and the second embodiment will be omitted as appropriate. The information processing device 100A according to the modification includes a graph information storage section 122A instead of the graph information storage section 122.

〔２－４－１．情報処理〕
まず、図１７を用いて、変形例に係る情報処理の概要を説明する。図１７は、変形例に係る情報処理の一例を示す図である。 [2-4-1. Information processing〕
First, an overview of information processing according to the modified example will be explained using FIG. 17. FIG. 17 is a diagram illustrating an example of information processing according to a modification.

情報処理装置１００Ａは、ノード間をエッジで連結したグラフＧＲ２１を、ノードからのエッジによる参照先がブロブであるグラフＧＲ３１に変換する（ステップＳ３１）。すなわち、情報処理装置１００Ａは、図１７に示すように、通常のグラフであるグラフＧＲ２１から、ブロブの概念が導入されたグラフ（ブロブ用グラフ）であるグラフＧＲ３１を生成する。 The information processing device 100A converts the graph GR21 in which nodes are connected by edges into a graph GR31 in which a blob is referenced by an edge from a node (step S31). That is, as shown in FIG. 17, the information processing device 100A generates a graph GR31, which is a graph in which the concept of a blob is introduced (blob graph), from a graph GR21, which is a normal graph.

図１７の例では、情報処理装置１００Ａは、空間情報ＳＰ３１に示すように、ノードからブロブへの有向エッジを含むグラフＧＲ３１を生成する。なお、グラフＧＲ３１においては、各ノード間のエッジについては削除され、グラフＧＲ３１には、ノードからブロブへの有向エッジが含まれ、ノード間のエッジは含まれない。図１７に示す矢印線は、矢元のノードから矢先のブロブへの有向エッジを示す。すなわち、図１７に示す矢印線は、矢元のノードを参照元とし、矢先のブロブを参照先とする有向エッジを示す。例えば、図１７では、ノードＮ１からは、ブロブＢＬ８及びブロブＢＬ１０の２つのブロブへのエッジが連結されることを示す。 In the example of FIG. 17, the information processing device 100A generates a graph GR31 including directed edges from nodes to blobs, as shown in spatial information SP31. Note that in the graph GR31, edges between nodes are deleted, and the graph GR31 includes directed edges from nodes to blobs, but does not include edges between nodes. The arrow line shown in FIG. 17 indicates a directed edge from the node at the base of the arrow to the blob at the base of the arrow. That is, the arrow line shown in FIG. 17 indicates a directed edge with the node at the base of the arrow as the reference source and the blob at the tip of the arrow as the reference destination. For example, FIG. 17 shows that edges from node N1 to two blobs, blob BL8 and blob BL10, are connected.

例えば、情報処理装置１００Ａは、各ノードのエッジを接続ノード（近傍ノード）へのエッジから接続ノードが属するブロブへのエッジに変換する。なお、情報処理装置１００Ａは、各ノード自身が属するブロブへのエッジは生成しない。図１７の例では、情報処理装置１００Ａは、ノードＮ９とノードＮ１２６は同じブロブＢＬ１に属するため、ノードＮ９からブロブＢＬ１へのエッジ、及びノードＮ１２６からブロブＢＬ１へのエッジは生成しない。一方、図１７の例では、情報処理装置１００Ａは、ノードＮ９とノードＮ１８は異なるブロブＢＬ１、ＢＬ４に属するため、ノードＮ９からブロブＢＬ４へのエッジ、及びノードＮ１８からブロブＢＬ１へのエッジを生成する。 For example, the information processing device 100A converts the edge of each node from an edge to a connection node (nearby node) to an edge to a blob to which the connection node belongs. Note that the information processing device 100A does not generate edges to the blob to which each node itself belongs. In the example of FIG. 17, since the node N9 and the node N126 belong to the same blob BL1, the information processing device 100A does not generate an edge from the node N9 to the blob BL1 and an edge from the node N126 to the blob BL1. On the other hand, in the example of FIG. 17, since the node N9 and the node N18 belong to different blobs BL1 and BL4, the information processing device 100A generates an edge from the node N9 to the blob BL4 and an edge from the node N18 to the blob BL1. .

図１７の例では、情報処理装置１００Ａは、空間情報ＳＰ３１に示すように、ノード（オブジェクト）からブロブへの有向エッジを含むグラフＧＲ３１を用いて、検索処理を行う。例えば、情報処理装置１００Ａは、検索クエリＱＥ２を対象として、グラフＧＲ２１を用いた図１９に示すような検索処理を行うことにより、検索クエリＱＥ２の検索結果を得る。図１９に示す検索処理についての詳細は後述する。なお、図１７での検索処理は、ブロブの概念を用いて検索処理を行う点で図１２と共通し、エッジに関連する処理の点以外は、図１２に示す検索処理と同様である。 In the example of FIG. 17, the information processing device 100A performs a search process using a graph GR31 including directed edges from nodes (objects) to blobs, as shown in spatial information SP31. For example, the information processing device 100A obtains the search result of the search query QE2 by performing a search process as shown in FIG. 19 using the graph GR21 for the search query QE2. Details of the search process shown in FIG. 19 will be described later. Note that the search process in FIG. 17 is similar to FIG. 12 in that the search process is performed using the concept of a blob, and is the same as the search process shown in FIG. 12 except for the process related to edges.

〔２－４－２．グラフ〕
次に、図１８を用いて、変形例に係るグラフの概要を説明する。図１８は、変形例に係るグラフ情報記憶部の一例を示す図である。例えば、図１８に示すグラフ情報記憶部１２２Ａは、ブロブの概念が導入されたグラフ（ブロブ用グラフ）を記憶する。図１８に示すグラフ情報記憶部１２２Ａは、「ノードＩＤ」、「オブジェクトＩＤ」、および「ブロブ情報」といった項目を有する。このように、変形例に係るグラフ情報記憶部１２２Ａは、「接続ノード情報」に代えて、「ブロブ情報」を有する点で図７のグラフ情報記憶部１２２と相違する。なお、図７のグラフ情報記憶部１２２と同様の点については適宜説明を省略する。 [2-4-2. graph〕
Next, an outline of a graph according to a modified example will be explained using FIG. 18. FIG. 18 is a diagram illustrating an example of a graph information storage unit according to a modification. For example, the graph information storage unit 122A shown in FIG. 18 stores a graph (blob graph) in which the concept of blob is introduced. The graph information storage unit 122A shown in FIG. 18 has items such as "node ID", "object ID", and "blob information". In this way, the graph information storage unit 122A according to the modified example differs from the graph information storage unit 122 of FIG. 7 in that it includes “blob information” instead of “connection node information”. Note that descriptions of the same points as those of the graph information storage unit 122 in FIG. 7 will be omitted as appropriate.

また、「ブロブ情報」は、対応するノードから辿ることができるブロブ（参照先のブロブ）に関する情報を示す。例えば、「ブロブ情報」には、「参照先」といった情報が含まれる。「参照先」は、エッジにより連結され、そのノードから辿ることができる参照先（ブロブ）を識別するための情報を示す。すなわち、図１８の例では、ノードを識別するノードＩＤ（オブジェクトＩＤ）に対して、そのノードからエッジにより辿ることができる参照先（ブロブ）が対応付けられて登録されている。なお、「ブロブ情報」には、参照先に接続されるエッジを識別するための情報（エッジＩＤ）等が含まれてもよい。 Moreover, "blob information" indicates information regarding a blob (a referenced blob) that can be traced from a corresponding node. For example, "blob information" includes information such as "reference destination." “Reference destination” indicates information for identifying a reference destination (blob) that is connected by an edge and can be traced from that node. That is, in the example of FIG. 18, a reference destination (blob) that can be traced from the node by an edge is registered in association with a node ID (object ID) that identifies the node. Note that the "blob information" may include information (edge ID) for identifying an edge connected to a reference destination.

図１８の例では、ノードＩＤ「Ｎ１」により識別されるノード（ノードＮ１）は、オブジェクトＩＤ「ＯＢ１」により識別されるオブジェクト（対象）に対応することを示す。また、ノードＮ１からは、ブロブＩＤ「ＢＬ８」により識別されるブロブ（ブロブＢＬ８）にエッジが連結されており、ノードＮ１からブロブＢＬ８へ辿ることができることを示す。また、ノードＮ１からは、ブロブＩＤ「ＢＬ１０」により識別されるブロブ（ブロブＢＬ１０）にエッジが連結されており、ノードＮ１からブロブＢＬ１０へ辿ることができることを示す。 The example in FIG. 18 indicates that the node (node N1) identified by the node ID "N1" corresponds to the object (target) identified by the object ID "OB1". Furthermore, an edge is connected from the node N1 to the blob (blob BL8) identified by the blob ID "BL8", indicating that it is possible to trace from the node N1 to the blob BL8. Further, an edge is connected from the node N1 to the blob (blob BL10) identified by the blob ID "BL10", indicating that the blob BL10 can be traced from the node N1.

また、ノードＩＤ「Ｎ２」により識別されるノード（ノードＮ２）は、オブジェクトＩＤ「ＯＢ２」により識別されるオブジェクト（対象）に対応することを示す。また、ノードＮ２からは、ブロブＩＤ「ＢＬ５」により識別されるブロブ（ブロブＢＬ５）にエッジが連結されており、ノードＮ１からブロブＢＬ５へ辿ることができることを示す。 Further, the node (node N2) identified by the node ID "N2" corresponds to the object (target) identified by the object ID "OB2". Furthermore, an edge is connected from the node N2 to the blob (blob BL5) identified by the blob ID "BL5", indicating that the blob BL5 can be traced from the node N1.

なお、グラフ情報記憶部１２２Ａは、上記に限らず、目的に応じて種々の情報を記憶してもよい。 Note that the graph information storage unit 122A is not limited to the above, and may store various information depending on the purpose.

また、グラフは、クエリを入力とし、グラフ中のエッジを辿ることによりノードを探索し、クエリに類似するノードを抽出し出力するプログラムモジュールを含んでもよい。すなわち、グラフは、グラフを用いて検索処理を行うプログラムモジュールとしての利用が想定されるものであってもよい。例えば、グラフＧＲ３１は、クエリとしてベクトルデータが入力された場合に、そのベクトルデータに類似するベクトルデータに対応するノードをグラフ中から抽出し、出力するプログラムであってもよい。例えば、グラフＧＲ３１は、クエリ画像に対応する類似画像を検索するプログラムモジュールとして利用されるデータであってもよい。例えば、グラフＧＲ３１は、入力されたクエリに基づいて、グラフにおいてそのクエリに類似するノードを抽出し、出力するよう、コンピュータを機能させる。 The graph may also include a program module that receives a query as input, searches for nodes by tracing edges in the graph, and extracts and outputs nodes similar to the query. That is, the graph may be assumed to be used as a program module that performs search processing using the graph. For example, the graph GR31 may be a program that, when vector data is input as a query, extracts nodes corresponding to vector data similar to the vector data from the graph and outputs them. For example, the graph GR31 may be data used as a program module that searches for similar images corresponding to a query image. For example, based on an input query, the graph GR31 causes the computer to extract and output nodes similar to the query in the graph.

〔２－４－３．検索処理例〕
ここから、変形例に係る検索処理の一例について、図１９を一例として説明する。図１９は、変形例に係る検索処理の一例を示すフローチャートである。以下に説明する検索処理は、情報処理装置１００Ａの検索処理部１３３Ａによって行われる。なお、図１１、図１６等、第１の実施形態または第２の実施形態と同様の点については適宜説明を省略する。例えば、図１９において、図１１と同様の点は同じステップ番号を付すことにより、適宜説明を省略する。 [2-4-3. Search processing example]
An example of the search process according to the modified example will now be described using FIG. 19 as an example. FIG. 19 is a flowchart illustrating an example of a search process according to a modification. The search processing described below is performed by the search processing unit 133A of the information processing device 100A. Note that descriptions of points similar to those of the first embodiment or the second embodiment, such as FIGS. 11 and 16, will be omitted as appropriate. For example, in FIG. 19, the same steps as those in FIG. 11 are given the same step numbers, and the description thereof will be omitted as appropriate.

図１９では、集合Ｎ（Ｇ，ｙ）は、ノードｙに付与されているエッジにより関連付けられているブロブの集合である点で、図１１及び図１６の検索処理と相違する。図１９では、Ｎ（Ｇ、ｓ）およびＣはブロブ集合となる。すなわち、図１１での「近傍オブジェクト集合Ｎ」は、図１９では、「ブロブ集合Ｎ」と読み替えられ、図１１での「オブジェクト集合Ｃ」は、図１９では、「ブロブ集合Ｃ」と読み替えられる。また、上記のブロブへの変更に関連する処理における「オブジェクト」の文言は、適宜「ブロブ」と読み替えられる。「Ｇ」は、ブロブの概念が導入されたグラフ（ブロブ用グラフ）データ（例えば、空間情報ＳＰ３１に示すグラフＧＲ３１等）であってもよい。例えば、情報処理装置１００Ａは、ｋ近傍検索処理を実行する。 In FIG. 19, the set N(G, y) is different from the search processing in FIGS. 11 and 16 in that it is a set of blobs related by edges attached to node y. In FIG. 19, N(G, s) and C are blob sets. That is, "nearby object set N" in FIG. 11 is replaced with "blob set N" in FIG. 19, and "object set C" in FIG. 11 is replaced with "blob set C" in FIG. . Furthermore, the word "object" in the process related to the change to a blob can be read as "blob" as appropriate. "G" may be graph (blob graph) data (for example, graph GR31 shown in spatial information SP31) into which the concept of blob has been introduced. For example, the information processing device 100A executes k-nearest neighbor search processing.

例えば、図１９に示す検索処理は、図１１からステップＳ３０６の処理が以下のように変更される。図１９の検索処理においては、情報処理装置１００Ａは、Ｎ（Ｇ、ｓ）の中からブロブ集合Ｃに含まれない、ブロブを一つ選択し、選択したブロブをブロブ集合Ｃに格納し、ブロブ内のオブジェクトを一括計算し、その集合をＢ（「対象ブロブ内オブジェクト集合Ｂ」ともいう）とする（ステップＳ３０６）。 For example, in the search process shown in FIG. 19, the process in step S306 from FIG. 11 is changed as follows. In the search process in FIG. 19, the information processing device 100A selects one blob from N(G, s) that is not included in the blob set C, stores the selected blob in the blob set C, and stores the selected blob in the blob set C. The objects within the blob are collectively calculated, and the set thereof is set as B (also referred to as "object set B within the target blob") (step S306).

また、図１９に示す検索処理は、ステップＳ３０６の後にステップＳ３０６ａに示す処理を行う点で、図１１に示す検索処理と相違する。具体的には、図１９の検索処理においては、情報処理装置１００Ａは、対象ブロブ内オブジェクト集合Ｂからオブジェクトｕを一つ選択する（ステップＳ３０６ａ）。その後、情報処理装置１００Ａは、ステップＳ３０７以降の処理を行う。 Furthermore, the search process shown in FIG. 19 differs from the search process shown in FIG. 11 in that the process shown in step S306a is performed after step S306. Specifically, in the search process of FIG. 19, the information processing device 100A selects one object u from the target blob object set B (step S306a). After that, the information processing device 100A performs the processing from step S307 onwards.

また、図１９に示す検索処理は、ステップＳ３１５の前にステップＳ３１４ａに示す処理を行う点で、図１１に示す検索処理と相違する。具体的には、図１９の検索処理においては、情報処理装置１００Ａは、対象ブロブ内オブジェクト集合Ｂから全て選択したか否かを判定する（ステップＳ３１４ａ）。 The search process shown in FIG. 19 is different from the search process shown in FIG. 11 in that the process shown in step S314a is performed before step S315. Specifically, in the search process of FIG. 19, the information processing device 100A determines whether all objects have been selected from object set B in the target blob (step S314a).

対象ブロブ内オブジェクト集合Ｂから全て選択し終えた場合（ステップＳ３１４ａ：Ｙｅｓ）、情報処理装置１００Ａは、ステップＳ３１５の処理を行う。一方、対象ブロブ内オブジェクト集合Ｂから全て選択し終えていない場合（ステップＳ３１４ａ：Ｎｏ）、情報処理装置１００Ａは、ステップＳ３０６ａに戻って処理を繰り返す。 If all objects have been selected from object set B within the target blob (step S314a: Yes), the information processing device 100A performs the process of step S315. On the other hand, if all objects have not been selected from object set B within the target blob (step S314a: No), the information processing device 100A returns to step S306a and repeats the process.

〔２－４－４．生成処理の他の例〕
上述した例では、各ノードのエッジを接続ノードへのエッジから接続ノードが属するブロブへのエッジに変換してブロブ用グラフを生成する場合を示したが、情報処理装置１００Ａは、様々な情報を適宜用いて、ブロブ用グラフを生成してもよい。例えば、情報処理装置１００Ａは、複数のオブジェクトを検索対象とする近傍検索インデックス（単に「インデックス」ともいう）を用いて、ブロブ用グラフを生成してもよい。なお、上述した内容と同様の点については適宜説明を省略する。 [2-4-4. Other examples of generation processing]
In the above example, a case was shown in which a blob graph is generated by converting the edge of each node from an edge to a connected node to an edge to a blob to which the connected node belongs, but the information processing device 100A converts various information. It may be used as appropriate to generate a blob graph. For example, the information processing device 100A may generate a blob graph using a neighborhood search index (also simply referred to as an "index") that searches for multiple objects. Note that descriptions of points similar to those described above will be omitted as appropriate.

ブロブ用グラフの生成の一例について図２０を用いて説明する。図２０は、変形例に係る情報処理の他の一例を示す図である。なお、以下では、インデックスの一例としてグラフを用いる場合を一例として説明するが、インデックスは、複数のオブジェクトを検索対象とするものであればどのようなものであってもよい。例えば、インデックスは、ハッシュテーブル等のハッシュに関する記述を利用したインデックス（ハッシュインデックス）、木構造を有するインデックス（ツリーインデックス）等であってもよい。 An example of generating a blob graph will be described using FIG. 20. FIG. 20 is a diagram illustrating another example of information processing according to the modification. In addition, although the case where a graph is used as an example of an index is demonstrated below as an example, the index may be any type of index as long as it searches for a plurality of objects. For example, the index may be an index using a hash-related description such as a hash table (hash index), an index having a tree structure (tree index), or the like.

また、オブジェクトまたはそのオブジェクトに対応するノード（「オブジェクトノード」ともいう）を示す情報を第１情報と記載し、複数のオブジェクトを分類するために用いる情報を第２情報と記載する場合がある。また、ブロブを示す情報を第３情報と記載し、オブジェクトノードをエッジで連結したグラフを第４情報と記載する場合がある。例えば、図２０中のグラフＧＲ２１は、オブジェクトノードであるノードＮ１等がエッジで連結されており、第４情報に対応する。また、図１７中のグラフＧＲ３１及び図２０中のグラフＧＲ３２は、第４情報とは異なり、オブジェクトノードとブロブとがエッジが連結されたブロブ用グラフ（「グループグラフ」ともいう）である。すなわち、グループグラフは、オブジェクトノードからブロブへエッジが連結され、オブジェクトノードを参照元とし、ブロブ（グループ）を参照先とするグラフである。 Further, information indicating an object or a node corresponding to the object (also referred to as an "object node") may be described as first information, and information used to classify a plurality of objects may be described as second information. Further, information indicating a blob may be described as third information, and a graph in which object nodes are connected by edges may be described as fourth information. For example, the graph GR21 in FIG. 20 has node N1, which is an object node, connected by edges, and corresponds to the fourth information. Also, unlike the fourth information, graph GR31 in FIG. 17 and graph GR32 in FIG. 20 are blob graphs (also referred to as "group graphs") in which object nodes and blobs are connected at edges. In other words, the group graph is a graph in which edges are connected from object nodes to blobs, the object nodes are reference sources, and the blobs (groups) are reference destinations.

情報処理装置１００Ａは、オブジェクトノード間がエッジで連結したグラフＧＲ２１をインデックス（第４情報）として用いて、オブジェクトノードからのエッジによる参照先がブロブとなるグループグラフであるグラフＧＲ３２を生成する（ステップＳ４１）。図２０では、情報処理装置１００Ａは、空間情報ＳＰ３２に示すように、ノードからブロブへの有向エッジを含むグラフＧＲ３２を生成する。 The information processing device 100A uses the graph GR21 in which object nodes are connected by edges as an index (fourth information) to generate a graph GR32 that is a group graph in which blobs are referenced by edges from object nodes (step S41). In FIG. 20, the information processing device 100A generates a graph GR32 including directed edges from nodes to blobs, as shown in spatial information SP32.

例えば、情報処理装置１００Ａは、複数のオブジェクトから一のオブジェクトを選択し、グラフＧＲ２１を用いて一のオブジェクトの近傍オブジェクトを検索し、一のオブジェクトに対応する一のオブジェクトノードから、近傍オブジェクトが属する他のグループへエッジを連結する連結処理により、グラフＧＲ３２を生成する。例えば、情報処理装置１００Ａは、複数のオブジェクトからランダムに一のオブジェクトを選択し、選択した一のオブジェクトを対象に連結処理を行い、グラフＧＲ３２を生成する。 For example, the information processing device 100A selects one object from a plurality of objects, searches for neighboring objects of the one object using the graph GR21, and determines which neighboring objects belong from one object node corresponding to the one object. A graph GR32 is generated by a connection process that connects edges to other groups. For example, the information processing device 100A randomly selects one object from a plurality of objects, performs a concatenation process on the selected object, and generates the graph GR32.

例えば、情報処理装置１００Ａは、以下のような処理（１－１）～（１－４）を行い、グラフＧＲ３２を生成する。なお、オブジェクトは、オブジェクトに対応するノードと読み替えてもよい。 For example, the information processing device 100A performs the following processes (1-1) to (1-4) to generate the graph GR32. Note that the object may be read as a node corresponding to the object.

（１－１）：ランダムでオブジェクトＮＸを取得（選択）
（１－２）：オブジェクトＮＸの近傍オブジェクトを定数ｋ件検索
（１－３）：各近傍オブジェクトの属するブロブを取得
（１－４）：オブジェクトＮＸから取得したブロブへのエッジを生成 (1-1): Obtain object NX randomly (selection)
(1-2): Search for a constant number of k neighboring objects of object NX (1-3): Obtain the blob to which each neighboring object belongs (1-4): Generate an edge to the blob obtained from object NX

例えば処理（１－１）では、情報処理装置１００Ａは、オブジェクト情報記憶部１２１に記憶された複数のオブジェクトからランダムに一のオブジェクトＮＸを取得する。 For example, in process (1-1), the information processing device 100A randomly acquires one object NX from a plurality of objects stored in the object information storage unit 121.

例えば処理（１－２）では、情報処理装置１００Ａは、図１１に示すような検索処理を、オブジェクトＮＸを対象（クエリ）とし、グラフＧＲ２１を用いて行うことにより、ｋ件（個）のオブジェクトをオブジェクトＮＸの近傍オブジェクトとして抽出する。 For example, in process (1-2), the information processing device 100A performs a search process as shown in FIG. is extracted as a neighboring object of object NX.

例えば処理（１－３）では、情報処理装置１００Ａは、ブロブ情報記憶部１２５に記憶されたブロブの情報を参照し、オブジェクトＮＸの近傍オブジェクト（ノード）が対応付けられたブロブを取得する。 For example, in process (1-3), the information processing device 100A refers to blob information stored in the blob information storage unit 125 and obtains blobs associated with neighboring objects (nodes) of object NX.

例えば処理（１－４）では、情報処理装置１００Ａは、オブジェクトＮＸから取得したブロブへのエッジを生成する。例えば、情報処理装置１００Ａは、量子化情報記憶部１２３Ａ中のオブジェクトＮＸに取得したブロブを示す情報を対応付けることにより、グラフＧＲ３２を生成する。 For example, in process (1-4), the information processing device 100A generates an edge to the blob acquired from the object NX. For example, the information processing device 100A generates the graph GR32 by associating information indicating the acquired blob with the object NX in the quantization information storage unit 123A.

例えば、情報処理装置１００Ａは、上記の処理（１－１）～（１－４）を、処理対象となるオブジェクトが無くなるまで繰り返し行い、グラフＧＲ３２を生成する。なお、空間情報ＳＰ２１に示すブロブを検索により生成している場合、ブロブの生成の際の検索結果を流用して、グラフＧＲ３２を生成してもよい。なお、ブロブを検索により生成する例については後述する。 For example, the information processing device 100A repeatedly performs the above processes (1-1) to (1-4) until there are no more objects to be processed, and generates the graph GR32. Note that when the blob shown in the spatial information SP21 is generated by a search, the graph GR32 may be generated by using the search results when the blob is generated. Note that an example of generating blobs by search will be described later.

例えば、情報処理装置１００Ａは、空間情報ＳＰ３２に示すように、ノード（オブジェクト）からブロブへの有向エッジを含むグラフＧＲ３２を用いて、検索処理を行う。例えば、情報処理装置１００Ａは、検索クエリＱＥ２を対象として、グラフＧＲ３２を用いた図１９に示すような検索処理を行うことにより、検索クエリＱＥ２の検索結果を得る。 For example, the information processing device 100A performs a search process using a graph GR32 including directed edges from nodes (objects) to blobs, as shown in spatial information SP32. For example, the information processing device 100A obtains the search result of the search query QE2 by performing a search process as shown in FIG. 19 using the graph GR32 for the search query QE2.

また、上述した例では、クラスタリングによりブロブを生成する場合を示したが、ブロブは、クラスタリングに限らず、様々な方法により生成されてもよい。例えば、情報処理装置１００Ａは、インデックス等の第２情報を用いて、ブロブを示す情報（第３情報）を生成してもよい。 Further, in the above example, a case was shown in which blobs were generated by clustering, but blobs are not limited to clustering, and may be generated by various methods. For example, the information processing device 100A may generate information indicating a blob (third information) using second information such as an index.

情報処理装置１００Ａは、図２１に示すような、グラフＧＲ２１をインデックスとして用いて、ブロブを示す第３情報を生成する。図２１は、ブロブ情報の生成処理の一例を示す図である。図２１の空間情報ＳＰ２０は、空間情報ＳＰ２１中のブロブの情報が生成される前の状態を示し、図２１のグラフＧＲ２１は、図２０のグラフＧＲ２１と同様のグラフである。なお、図２１に示す処理は、後述する第１方法の一部を示すものであるが、詳細は後述する。 The information processing device 100A uses the graph GR21 as an index as shown in FIG. 21 to generate third information indicating a blob. FIG. 21 is a diagram illustrating an example of blob information generation processing. Spatial information SP20 in FIG. 21 shows a state before the blob information in spatial information SP21 is generated, and graph GR21 in FIG. 21 is a graph similar to graph GR21 in FIG. 20. Note that the process shown in FIG. 21 shows a part of the first method described later, and the details will be described later.

例えば、情報処理装置１００Ａは、グラフＧＲ２１を用いた検索により、複数のオブジェクトを複数のブロブに分類する第３情報を生成する。例えば、情報処理装置１００Ａは、グラフＧＲ２１中の複数のノードから一のノード（「処理対象ノード」ともいう）を選択し、処理対象ノードにエッジで連結されたノードである近傍ノードのうち少なくとも一部のノードである分類対象ノード、及び処理対象ノードの各々に対応するオブジェクト群を一のグループに分類する分類処理により、第３情報を生成する。例えば、情報処理装置１００Ａは、複数のノードから所定の処理により処理対象ノードを選択し、選択した処理対象ノードを対象に分類処理を行い、第３情報を生成する。 For example, the information processing device 100A generates third information that classifies a plurality of objects into a plurality of blobs by searching using the graph GR21. For example, the information processing device 100A selects one node (also referred to as a "processing target node") from a plurality of nodes in the graph GR21, and selects at least one of the neighboring nodes that are nodes connected to the processing target node by an edge. The third information is generated by a classification process of classifying a group of objects corresponding to each of the classification target node, which is a node of the section, and the processing target node into one group. For example, the information processing device 100A selects a processing target node from a plurality of nodes by a predetermined process, performs a classification process on the selected processing target node, and generates third information.

例えば、情報処理装置１００Ａは、ノードにエッジで連結された近傍ノード（接続ノード）の数に基づいて処理対象ノードを選択してもよい。例えば、情報処理装置１００Ａは、近傍ノードの数が大きい方から順に処理対象ノードを選択してもよい。近傍ノードの数が大きい方から順に処理対象ノードを選択し、ブロブを示す第３情報を生成する方法を「第１方法」ともいう。また、例えば、情報処理装置１００Ａは、近傍ノード（接続ノード）の数が小さい方から順に処理対象ノードを選択してもよい。近傍ノードの数が小さい方から順に処理対象ノードを選択し、ブロブを生成する方法を「第２方法」ともいう。また、例えば、情報処理装置１００Ａは、ランダムに処理対象ノードを選択してもよい。ランダムに処理対象ノードを選択し、ブロブを示す第３情報を生成する方法を「第３方法」ともいう。以下では、第１方法、第２方法、及び第３方法の各々の処理手順の概要を説明する。 For example, the information processing device 100A may select a processing target node based on the number of neighboring nodes (connection nodes) connected to the node by edges. For example, the information processing apparatus 100A may select processing target nodes in descending order of the number of neighboring nodes. A method of selecting processing target nodes in order of increasing number of neighboring nodes and generating third information indicating a blob is also referred to as a "first method." Further, for example, the information processing apparatus 100A may select processing target nodes in descending order of the number of neighboring nodes (connected nodes). A method of selecting processing target nodes in order of decreasing number of neighboring nodes and generating blobs is also referred to as a "second method." Further, for example, the information processing apparatus 100A may randomly select a processing target node. A method of randomly selecting processing target nodes and generating third information indicating a blob is also referred to as a "third method." Below, the outline of each processing procedure of the first method, second method, and third method will be explained.

〔２－４－４－１．第１方法〕
まず、第１方法について説明する。例えば、第１方法の場合、情報処理装置１００Ａは、以下のような処理（２－１）～（２－３）を行い、ブロブを示す第３情報を生成する。 [2-4-4-1. 1st method]
First, the first method will be explained. For example, in the case of the first method, the information processing device 100A performs the following processes (2-1) to (2-3) to generate third information indicating a blob.

（２－１）：各ノードの近傍ノード数（エッジ数）が大きいものからノードＮＤを取得
（２－２）：ノードＮＤの近傍ノード及びノードＮＤを一のブロブとする
（２－３）：ブロブとしたノード及びそのノードのエッジを削除 (2-1): Obtain the node ND from the one with the largest number of neighboring nodes (number of edges) for each node (2-2): Set the neighboring nodes of the node ND and the node ND as one blob (2-3): Delete blob nodes and edges of those nodes

例えば、情報処理装置１００Ａは、処理（２－１）～（２－３）を、すべてのノードにおいて実施する。 For example, the information processing device 100A performs processes (2-1) to (2-3) on all nodes.

例えば、処理（２－１）では、情報処理装置１００Ａは、グラフＧＲ２１を参照し、一のノードＮＤを順次取得する。情報処理装置１００Ａは、近傍ノードの数が大きい方から順に並べられたノードの一覧情報を用いて、一のノードＮＤを取得してもよい。 For example, in process (2-1), the information processing device 100A refers to the graph GR21 and sequentially acquires one node ND. The information processing device 100A may obtain one node ND using list information of nodes arranged in descending order of the number of neighboring nodes.

例えば処理（２－２）では、情報処理装置１００Ａは、処理対象となっているグラフ（グラフＧＲ２１等）を参照し、ノードＮＤの近傍ノード及びノードＮＤを一のブロブとする情報を生成する。なお、情報処理装置１００Ａは、すべての近傍ノードをブロブとするのではなく定数Ｋ以下の近傍ノードをブロブとしても良い。例えば、情報処理装置１００Ａは、ノードＮＤの全近傍ノードをノードＮＤとともにブロブとするのではなく、近傍ノードのうちＫ個の近傍ノード及びノードＮＤを一のブロブとしてもよい。 For example, in process (2-2), the information processing device 100A refers to the graph to be processed (graph GR21, etc.) and generates information regarding the neighboring nodes of the node ND and the node ND as one blob. Note that the information processing device 100A may set neighboring nodes that are less than or equal to a constant K as blobs instead of setting all neighboring nodes as blobs. For example, instead of making all the neighboring nodes of the node ND into a blob together with the node ND, the information processing device 100A may make K neighboring nodes and the node ND among the neighboring nodes into one blob.

例えば処理（２－３）では、情報処理装置１００Ａは、ノードが重複して選択されないように、処理（２－２）においてブロブとしたノード及びそのノードのエッジを削除する。例えば、情報処理装置１００Ａは、ブロブとしたノードが他のノードの近傍ノードであれば、その（エッジおよび）ノードを削除する。なお、上記の方法は一例に過ぎず、処理（２－３）は、エッジ等を削除する方法に限らず、ブロブを生成可能であればどのような方法であってもよい。例えば、情報処理装置１００Ａは、既にブロブに属しているノードのリストを保持して、除外するノードを管理する方法により、ブロブの生成を行ってもよい。例えば、情報処理装置１００Ａは、ブロブとしたノードに所定のフラグ（削除フラグ等）を立てることにより、既にブロブに属しているノードを管理してもよい。このように、フラグを用いる場合、情報処理装置１００Ａは、処理の際に各ノードのフラグを参照して、そのノードを処理対象とするかを判定する。 For example, in process (2-3), the information processing device 100A deletes the node that was made into a blob in process (2-2) and the edges of that node so that nodes are not selected redundantly. For example, the information processing device 100A deletes the blob (edge and) node if the node is a neighboring node of another node. Note that the above method is only an example, and the process (2-3) is not limited to the method of deleting edges and the like, but may be any method as long as it is possible to generate blobs. For example, the information processing device 100A may generate a blob by maintaining a list of nodes that already belong to the blob and managing nodes to be excluded. For example, the information processing device 100A may manage nodes that already belong to a blob by setting a predetermined flag (such as a deletion flag) on the node that is a blob. In this way, when using flags, the information processing apparatus 100A refers to the flags of each node during processing and determines whether that node is to be processed.

上述した第１方法の具体例について図２１を用いて説明する。図２１では、情報処理装置１００Ａは、グラフＧＲ２１のうち、近傍ノードの数が最大の４個であるノードＮ７を取得する（ステップＳ５１）。図２１では、グラフＧＲ２１のうちノードＮ１も近傍ノードの数が４個であるが、情報処理装置１００Ａは、ノードＮ７を取得するものとする。なお、近傍ノードの数が同数のノードが複数ある場合、情報処理装置１００Ａは、その複数のノードからランダムにノードを取得してもよい。 A specific example of the first method described above will be explained using FIG. 21. In FIG. 21, the information processing device 100A acquires the node N7, which has the maximum number of neighboring nodes of four, from the graph GR21 (step S51). In FIG. 21, the node N1 in the graph GR21 also has four neighboring nodes, but the information processing device 100A acquires the node N7. Note that when there are multiple nodes with the same number of neighboring nodes, the information processing device 100A may randomly acquire nodes from the multiple nodes.

そして、情報処理装置１００Ａは、ノードＮ７の近傍ノードであるノードＮ９、Ｎ１２、Ｎ５４、Ｎ１２６の４個のノード及びノードＮ７の５個のノードを一のブロブＢＬ１１とする（ステップＳ５２）。情報処理装置１００Ａは、ノードＮ７、Ｎ９、Ｎ１２、Ｎ５４、Ｎ１２６の５個のノードが一のブロブＢＬ１１に属することを示す第３情報ＢＬＴ１を生成する。そして、情報処理装置１００Ａは、ノードＮ７、Ｎ９、Ｎ１２、Ｎ５４、Ｎ１２６の５個のノード及び各ノードのエッジをグラフＧＲ２１から削除する。これにより、空間情報ＳＰ２０－１に示すように、グラフＧＲ２１はグラフＧＲ２１－１に更新される。 Then, the information processing device 100A sets four nodes N9, N12, N54, and N126, which are neighboring nodes of the node N7, and five nodes including the node N7, as one blob BL11 (step S52). The information processing device 100A generates third information BLT1 indicating that five nodes N7, N9, N12, N54, and N126 belong to one blob BL11. The information processing device 100A then deletes the five nodes N7, N9, N12, N54, and N126 and the edges of each node from the graph GR21. As a result, the graph GR21 is updated to the graph GR21-1, as shown in the spatial information SP20-1.

そして、情報処理装置１００Ａは、グラフＧＲ２１のうち、近傍ノードの数が４個であるノードＮ１を取得する（ステップＳ５３）。そして、情報処理装置１００Ａは、ノードＮ１の近傍ノードであるノードＮ４、Ｎ５、Ｎ８８、Ｎ９９の４個のノード及びノードＮ１の５個のノードを一のブロブＢＬ１２とする（ステップＳ５４）。情報処理装置１００Ａは、ノードＮ１、Ｎ４、Ｎ５、Ｎ８８、Ｎ９９の５個のノードが一のブロブＢＬ１２に属することを示す第３情報ＢＬＴ２を生成する。そして、情報処理装置１００Ａは、ノードＮ１、Ｎ４、Ｎ５、Ｎ８８、Ｎ９９の５個のノード及び各ノードのエッジをグラフＧＲ２１－１から削除する。 Then, the information processing device 100A obtains the node N1 having four neighboring nodes from the graph GR21 (step S53). Then, the information processing device 100A sets four nodes N4, N5, N88, and N99, which are neighboring nodes of the node N1, and five nodes including the node N1, as one blob BL12 (step S54). The information processing device 100A generates third information BLT2 indicating that five nodes N1, N4, N5, N88, and N99 belong to one blob BL12. Then, the information processing device 100A deletes the five nodes N1, N4, N5, N88, and N99 and the edges of each node from the graph GR21-1.

情報処理装置１００Ａは、上述した処理を処理対象となるノードが無くなるまで繰り返し、複数のブロブを示す第３情報を生成する。 The information processing device 100A repeats the above-described processing until there are no more nodes to be processed, and generates third information indicating a plurality of blobs.

〔２－４－４－２．第２方法〕
次に、第２方法について説明する。例えば、第２方法の場合、情報処理装置１００Ａは、以下のような処理（３－１）～（３－３）を行い、ブロブを示す第３情報を生成する。 [2-4-4-2. Second method]
Next, the second method will be explained. For example, in the case of the second method, the information processing device 100A performs the following processes (3-1) to (3-3) to generate third information indicating a blob.

（３－１）：各ノードの近傍ノード数（エッジ数）が小さいものからノードＮＤを取得
（３－２）：ノードＮＤの近傍ノード及びノードＮＤを一のブロブとする
（３－３）：ブロブとしたノード及びそのノードのエッジを削除 (3-1): Obtain nodes ND from the smallest number of neighboring nodes (number of edges) for each node (3-2): Set neighboring nodes of node ND and node ND as one blob (3-3): Delete blob nodes and edges of those nodes

例えば、情報処理装置１００Ａは、処理（３－１）～（３－３）を、すべてのノードにおいて実施する。 For example, the information processing device 100A performs processes (3-1) to (3-3) on all nodes.

例えば、処理（３－１）では、情報処理装置１００Ａは、グラフＧＲ２１を参照し、一のノードＮＤを順次取得する。情報処理装置１００Ａは、近傍ノードの数が小さい方から順に並べられたノードの一覧情報を用いて、一のノードＮＤを取得してもよい。 For example, in process (3-1), the information processing device 100A refers to the graph GR21 and sequentially acquires one node ND. The information processing device 100A may obtain one node ND using list information of nodes arranged in descending order of the number of neighboring nodes.

例えば処理（３－２）では、情報処理装置１００Ａは、処理対象となっているグラフ（グラフＧＲ２１等）を参照し、ノードＮＤの近傍ノード及びノードＮＤを一のブロブとする情報を生成する。なお、情報処理装置１００Ａは、すべての近傍ノードをブロブとするのではなく定数Ｋ以下の近傍ノードをブロブとしても良い。例えば、情報処理装置１００Ａは、ノードＮＤの全近傍ノードをノードＮＤとともにブロブとするのではなく、近傍ノードのうちＫ個の近傍ノード及びノードＮＤを一のブロブとしてもよい。 For example, in the process (3-2), the information processing device 100A refers to the graph to be processed (graph GR21, etc.) and generates information regarding the neighboring nodes of the node ND and the node ND as one blob. Note that the information processing device 100A may set neighboring nodes that are less than or equal to a constant K as blobs instead of setting all neighboring nodes as blobs. For example, instead of making all the neighboring nodes of the node ND into a blob together with the node ND, the information processing device 100A may make K neighboring nodes and the node ND among the neighboring nodes into one blob.

例えば処理（３－３）では、情報処理装置１００Ａは、ノードが重複して選択されないように、処理（３－２）においてブロブとしたノード及びそのノードのエッジを削除する。例えば、情報処理装置１００Ａは、ブロブとしたノードが他のノードの近傍ノードであれば、その（エッジおよび）ノードを削除する。なお、第２方法は、ノードの取得順が近傍ノード数（エッジ数）が小さい方から順である点以外は、第１方法と同様であるため、具体例等についての説明は省略する。例えば、処理（３－３）は、第１方法と同様に、エッジ等を削除する方法に限らず、ブロブを生成可能であればどのような方法であってもよい。例えば、情報処理装置１００Ａは、第１方法と同様に、既にブロブに属しているノードのリストを保持して、除外するノードを管理する方法により、ブロブの生成を行ってもよい。 For example, in process (3-3), the information processing apparatus 100A deletes the node that was made into a blob in process (3-2) and the edges of that node so that nodes are not selected twice. For example, the information processing device 100A deletes the blob (edge and) node if the node is a neighboring node of another node. Note that the second method is the same as the first method except that the order of node acquisition is from the smallest number of neighboring nodes (number of edges), so a description of specific examples and the like will be omitted. For example, like the first method, the process (3-3) is not limited to the method of deleting edges and the like, but may be any method as long as it is possible to generate blobs. For example, similar to the first method, the information processing device 100A may generate a blob by maintaining a list of nodes that already belong to the blob and managing nodes to be excluded.

〔２－４－４－３．第３方法〕
次に、第３方法について説明する。例えば、第３方法の場合、情報処理装置１００Ａは、以下のような処理（４－１）～（４－３）を行い、ブロブを示す第３情報を生成する。 [2-4-4-3. Third method]
Next, a third method will be explained. For example, in the case of the third method, the information processing device 100A performs the following processes (4-1) to (4-3) to generate third information indicating a blob.

（４－１）：ランダムにノードＮＤを順次取得
（４－２）：ノードＮＤの近傍ノード及びノードＮＤを一のブロブとする
（４－３）：ブロブとしたノード及びそのノードのエッジを削除 (4-1): Randomly acquire nodes ND in sequence (4-2): Make neighboring nodes of node ND and node ND one blob (4-3): Delete the blob node and the edges of that node

例えば、情報処理装置１００Ａは、処理（４－１）～（４－３）を、すべてのノードにおいて実施する。なお、第３方法は、ノードの取得順がランダムである点以外は、第１方法及び第２方法と同様であり詳細な説明は省略する。例えば、情報処理装置１００Ａは、すべて同じエッジ数であればランダム順（第３方法）で第３情報の生成を実行してもよい。 For example, the information processing device 100A performs processes (4-1) to (4-3) on all nodes. Note that the third method is similar to the first method and the second method, except that the order of node acquisition is random, and detailed description thereof will be omitted. For example, the information processing device 100A may generate the third information in random order (third method) if the number of edges is the same.

〔２－４－４－４．第４方法〕
なお、上述した第１方法～第３方法は一例に過ぎず、情報処理装置１００Ａは、様々な処理によりブロブを生成してもよい。例えば、情報処理装置１００Ａは、複数のオブジェクトを検索対象とする近傍検索インデックス（インデックス）を用いた検索によりブロブを生成してもよい。インデックスを用いた検索によりブロブを生成する方法を「第４方法」ともいう。 [2-4-4-4. Fourth method]
Note that the first to third methods described above are merely examples, and the information processing apparatus 100A may generate blobs by various processes. For example, the information processing device 100A may generate a blob through a search using a neighborhood search index that searches for multiple objects. The method of generating blobs by searching using an index is also referred to as the "fourth method."

以下、第４方法について説明する。なお、上述した内容と同様の点については適宜説明を省略する。以下では、インデックスの一例としてグラフを用いる場合を一例として説明するが、上述したようにインデックスは、複数のオブジェクトを検索対象とするものであればどのようなものであってもよい。例えば、インデックスは、ハッシュインデックス、ツリーインデックス等、どのようなインデックスであってもよい。 The fourth method will be explained below. Note that descriptions of points similar to those described above will be omitted as appropriate. In the following, a case will be described in which a graph is used as an example of the index, but as described above, the index may be of any type as long as it searches for a plurality of objects. For example, the index may be any index such as a hash index or a tree index.

ここから、図２１に示すようなグラフＧＲ２１をインデックスとして用いた検索により、情報処理装置１００Ａがブロブを示す第３情報を生成する場合を説明する。例えば、情報処理装置１００Ａは、以下のような処理（５－１）～（５－４）を行い、グラフＧＲ３２を生成する。 From here, a case will be described in which the information processing device 100A generates third information indicating a blob by searching using the graph GR21 as an index as shown in FIG. 21. For example, the information processing device 100A performs the following processes (5-1) to (5-4) to generate the graph GR32.

（５－１）：ランダムでオブジェクトＮＸを取得（選択）
（５－２）：オブジェクトＮＸの近傍オブジェクトを定数ｋ件検索
（５－３）：オブジェクトＮＸの近傍オブジェクト及びオブジェクトＮＸを一のブロブとする
（５－４）：重複処理回避用の処理を実行 (5-1): Obtain object NX randomly (selection)
(5-2): Search for a constant number of k neighboring objects of object NX (5-3): Make neighboring objects of object NX and object NX one blob (5-4): Execute processing to avoid duplicate processing

例えば処理（５－１）では、情報処理装置１００Ａは、グラフＧＲ２１からランダムに一のオブジェクトＮＸを取得する。 For example, in process (5-1), the information processing device 100A randomly acquires one object NX from the graph GR21.

例えば処理（５－２）では、情報処理装置１００Ａは、図１１に示すような検索処理を、オブジェクトＮＸを対象（クエリ）とし、グラフＧＲ２１を用いて行うことにより、ｋ件（個）のオブジェクトをオブジェクトＮＸの近傍オブジェクトとして抽出する。なお、情報処理装置１００Ａは、すべてのオブジェクトをスキャン（検索）して、オブジェクトＮＸの近傍オブジェクトを抽出してもよい。 For example, in process (5-2), the information processing device 100A performs a search process as shown in FIG. 11 using the graph GR21 with the object NX as the target (query). is extracted as a neighboring object of object NX. Note that the information processing device 100A may scan (search) all the objects and extract neighboring objects of the object NX.

例えば処理（５－３）では、情報処理装置１００Ａは、オブジェクトＮＸの近傍オブジェクトの近傍ノード及びオブジェクトＮＸを一のブロブとする情報を生成する。 For example, in process (5-3), the information processing device 100A generates information that makes the neighboring nodes of the neighboring objects of the object NX and the object NX one blob.

例えば処理（５－４）では、情報処理装置１００Ａは、一のオブジェクトが重複して検索されたりする等により複数のブロブに属することとならないように、重複処理回避用の処理を実行する。例えば、情報処理装置１００Ａは、ブロブとしたオブジェクトをインデックスから削除する。例えば、情報処理装置１００Ａは、ブロブとしたオブジェクトをグラフＧＲ２１から削除する。 For example, in process (5-4), the information processing device 100A executes a process for avoiding duplicate processing so that one object does not belong to multiple blobs due to duplicate searches or the like. For example, the information processing device 100A deletes the blob object from the index. For example, the information processing device 100A deletes the blob object from the graph GR21.

なお、情報処理装置１００Ａは、上記に限らず、一のオブジェクトが複数のブロブに属することとならなければ、どのような方法により重複処理回避用の処理を行ってもよい。例えば、情報処理装置１００Ａは、ブロブとしたオブジェクトに所定のフラグ（削除フラグ等）を立てることにより、既にブロブに属しているオブジェクトが処理対象となることを抑制してもよい。このように、フラグを用いる場合、情報処理装置１００Ａは、処理の際に各オブジェクトのフラグを参照して、そのオブジェクトを処理対象とするかを判定する。例えば、情報処理装置１００Ａは、上述したグループグラフの生成に検索結果を利用する場合には、フラグを用いた方法により重複処理回避用の処理を行ってもよい。 Note that the information processing apparatus 100A is not limited to the above method, and may perform processing for avoiding duplicate processing using any method as long as one object does not belong to multiple blobs. For example, the information processing device 100A may set a predetermined flag (such as a deletion flag) on an object that has been made into a blob, thereby preventing objects that already belong to the blob from becoming processing targets. In this way, when using flags, the information processing apparatus 100A refers to the flag of each object during processing and determines whether the object is to be processed. For example, when the information processing device 100A uses the search results to generate the group graph described above, the information processing device 100A may perform processing to avoid duplicate processing using a method using a flag.

〔２－４－５．変形例に係る情報処理装置〕
第２の実施形態の変形例に係る情報処理装置１００Ａにおいて取得部１３１は、以下の処理も行う。取得部１３１は、データ検索の対象となる複数のオブジェクトを示す第１情報と、複数のオブジェクトを分類するために用いる第２情報とを取得する。取得部１３１は、複数のオブジェクトを検索対象とするインデックスである第２情報を取得する。取得部１３１は、複数のオブジェクトの各々に対応する複数のノードがエッジにより連結されたグラフである第２情報を取得する。取得部１３１は、複数のオブジェクトを検索対象とするインデックスである第４情報を取得する。取得部１３１は、複数のオブジェクトの各々に対応する複数のオブジェクトノードがエッジにより連結されたグラフである第４情報を取得する。 [2-4-5. Information processing device according to modification]
In the information processing device 100A according to the modification of the second embodiment, the acquisition unit 131 also performs the following processing. The acquisition unit 131 acquires first information indicating a plurality of objects to be searched for data, and second information used to classify the plurality of objects. The acquisition unit 131 acquires second information that is an index that searches for a plurality of objects. The acquisition unit 131 acquires second information that is a graph in which a plurality of nodes corresponding to each of a plurality of objects are connected by edges. The acquisition unit 131 acquires fourth information that is an index that searches for multiple objects. The acquisition unit 131 acquires fourth information that is a graph in which a plurality of object nodes corresponding to each of a plurality of objects are connected by edges.

また、第２の実施形態の変形例に係る情報処理装置１００Ａにおいて生成部１３２Ａは、以下の処理も行う。生成部１３２Ａは、取得部１３１により取得された第２情報を用いて、第１情報が示す複数のオブジェクトを分類し、複数のオブジェクトを対象とする検索処理において一括処理を行うために用いられる複数のグループを示す第３情報を生成する。生成部１３２Ａは、第２情報を用いて、複数のオブジェクトを複数のグループに分類する第３情報を生成する。生成部１３２Ａは、第２情報を用いた検索により、複数のオブジェクトを複数のグループに分類する第３情報を生成する。 Furthermore, in the information processing apparatus 100A according to the modification of the second embodiment, the generation unit 132A also performs the following processing. The generation unit 132A uses the second information acquired by the acquisition unit 131 to classify the plurality of objects indicated by the first information, and classifies the plurality of objects used for batch processing in the search process targeting the plurality of objects. Third information indicating the group is generated. The generation unit 132A uses the second information to generate third information that classifies the plurality of objects into a plurality of groups. The generation unit 132A generates third information for classifying a plurality of objects into a plurality of groups by searching using the second information.

生成部１３２Ａは、グラフ中の複数のノードから一のノードを選択し、一のノードにエッジで連結されたノードである近傍ノードのうち少なくとも一部のノードである分類対象ノード、及び一のノードの各々に対応するオブジェクト群を一のグループに分類する分類処理により、第３情報を生成する。生成部１３２Ａは、一のノードの近傍ノードのうち所定の数のノードである分類対象ノード、及び一のノードの各々に対応するオブジェクト群を一のグループに分類する分類処理により、第３情報を生成する。生成部１３２Ａは、一のノードの近傍ノードの全てである分類対象ノード、及び一のノードの各々に対応するオブジェクト群を一のグループに分類する分類処理により、第３情報を生成する。 The generation unit 132A selects one node from a plurality of nodes in the graph, and generates classification target nodes that are at least some nodes among neighboring nodes that are nodes connected to the one node by edges, and the one node. The third information is generated by a classification process that classifies the object group corresponding to each of the objects into one group. The generation unit 132A generates the third information through a classification process of classifying into one group a group of objects corresponding to each of the classification target nodes, which are a predetermined number of nodes among the nodes in the vicinity of one node, and one node. generate. The generation unit 132A generates the third information through a classification process of classifying the classification target node, which is all of the neighboring nodes of one node, and the object group corresponding to each of the one node into one group.

生成部１３２Ａは、複数のノードの各々にエッジで連結されたノードである近傍ノードの数に基づいて、複数のノードから一のノードを選択し、一のノードの分類対象ノード及び一のノードの各々に対応するオブジェクト群を一のグループに分類する分類処理により、第３情報を生成する。生成部１３２Ａは、近傍ノードの数が大きい方から順に一のノードを選択し、一のノードの分類対象ノード及び一のノードの各々に対応するオブジェクト群を一のグループに分類する分類処理により、第３情報を生成する。生成部１３２Ａは、近傍ノードの数が小さい方から順に一のノードを選択し、一のノードの分類対象ノード及び一のノードの各々に対応するオブジェクト群を一のグループに分類する分類処理により、第３情報を生成する。 The generation unit 132A selects one node from the plurality of nodes based on the number of neighboring nodes that are nodes connected to each of the plurality of nodes by edges, and selects one node from the plurality of nodes, and selects a classification target node of the one node and a node of the one node. The third information is generated by a classification process of classifying the corresponding object groups into one group. The generation unit 132A selects one node in descending order of the number of neighboring nodes, and performs a classification process of classifying the classification target node of one node and the object group corresponding to each of the one node into one group. Generate third information. The generation unit 132A selects one node in descending order of the number of neighboring nodes, and performs a classification process of classifying the classification target node of one node and the object group corresponding to each of the one node into one group. Generate third information.

生成部１３２Ａは、複数のノードからランダムに一のノードを選択し、一のノードの分類対象ノード及び一のノードの各々に対応するオブジェクト群を一のグループに分類する分類処理により、第３情報を生成する。生成部１３２Ａは、一のグループに分類された一のノードの分類対象ノード及び一のノードである処理済みノード群をグラフから除外し、処理済みノード群を除外した後のグラフを用いて、分類処理を繰り返すことにより、第３情報を生成する。生成部１３２Ａは、処理済みノード群を除外した後のグラフを用いて、分類処理を複数のオブジェクトがいずれかのグループに分類されるまで繰り返すことにより、第３情報を生成する。生成部１３２Ａは、各々に属するオブジェクトが相互に排他的な複数のグループを示す第３情報を生成する。 The generation unit 132A randomly selects one node from the plurality of nodes, and generates third information by performing a classification process of classifying the classification target node of the one node and the object group corresponding to each of the one node into one group. generate. The generation unit 132A excludes the classification target node of the one node classified into the one group and the processed node group that is the one node from the graph, and performs classification using the graph after excluding the processed node group. Third information is generated by repeating the process. The generation unit 132A generates third information by repeating the classification process using the graph after excluding the processed node group until a plurality of objects are classified into any group. The generation unit 132A generates third information indicating a plurality of groups to which objects each belong to are mutually exclusive.

生成部１３２Ａは、第４情報を用いて、複数のオブジェクトのうち少なくとも一部のオブジェクトに対応するノードであるオブジェクトノードから、複数のグループのうち、オブジェクトノードが属するグループ以外の他のグループへエッジが連結されたグループグラフを生成する。生成部１３２Ａは、複数のオブジェクトから一のオブジェクトを選択し、第４情報を用いて複数のオブジェクトを対象に、一のオブジェクトの近傍オブジェクトを検索し、一のオブジェクトに対応する一のオブジェクトノードから、近傍オブジェクトが属する他のグループへエッジを連結する連結処理により、グループグラフを生成する。生成部１３２Ａは、複数のオブジェクトからランダムに一のオブジェクトを選択し、一のオブジェクトの近傍オブジェクトを検索し、一のオブジェクトに対応する一のオブジェクトノードから、近傍オブジェクトが属する他のグループへエッジを連結する連結処理により、グループグラフを生成する。生成部１３２Ａは、第４情報を用いて、グループグラフを生成する。 Using the fourth information, the generation unit 132A generates an edge from an object node, which is a node corresponding to at least some of the plurality of objects, to a group other than the group to which the object node belongs among the plurality of groups. Generates a connected group graph. The generation unit 132A selects one object from the plurality of objects, searches for neighboring objects of the one object among the plurality of objects using the fourth information, and selects one object from the one object node corresponding to the one object. , a group graph is generated by a connection process that connects edges to other groups to which neighboring objects belong. The generation unit 132A randomly selects one object from a plurality of objects, searches for neighboring objects of the one object, and creates an edge from one object node corresponding to the one object to another group to which the neighboring object belongs. A group graph is generated by the concatenation process. The generation unit 132A generates a group graph using the fourth information.

〔２－４－６．情報処理のフロー〕
次に、図２２を用いて、変形例に係る情報処理の手順について説明する。図２２は、変形例に係る情報処理の一例を示すフローチャートである。 [2-4-6. Information processing flow]
Next, an information processing procedure according to a modified example will be described using FIG. 22. FIG. 22 is a flowchart illustrating an example of information processing according to a modification.

図２２に示すように、情報処理装置１００Ａは、データ検索の対象となる複数のオブジェクトを示す第１情報を取得する（ステップＳ４０１）。例えば、情報処理装置１００Ａは、オブジェクト情報記憶部１２１から複数のオブジェクトを示す第１情報を取得する。 As shown in FIG. 22, the information processing device 100A acquires first information indicating a plurality of objects that are data search targets (step S401). For example, the information processing device 100A acquires first information indicating a plurality of objects from the object information storage unit 121.

また、情報処理装置１００Ａは、複数のオブジェクトを分類するために用いる第２情報を取得する（ステップＳ４０２）。例えば、情報処理装置１００Ａは、グラフ情報記憶部１２２Ａからグラフを第２情報として取得する。 The information processing device 100A also acquires second information used to classify a plurality of objects (step S402). For example, the information processing device 100A obtains a graph as the second information from the graph information storage unit 122A.

そして、情報処理装置１００Ａは、第２情報を用いて、第１情報が示す複数のオブジェクトを分類し、複数のオブジェクトを対象とする検索処理において一括処理を行うために用いられる複数のグループを示す第３情報を生成する（ステップＳ４０３）。例えば、情報処理装置１００Ａは、各々に属するオブジェクトが相互に排他的な複数のブロブを示す第３情報を生成する。 The information processing device 100A uses the second information to classify the plurality of objects indicated by the first information, and indicates a plurality of groups used for performing batch processing in a search process targeting a plurality of objects. Third information is generated (step S403). For example, the information processing device 100A generates third information indicating a plurality of blobs in which objects belonging to each blob are mutually exclusive.

〔３．第３の実施形態〕
上述した例では、オブジェクトに対応するノード（オブジェクトノード）間をエッジで連結したグラフ（「オブジェクトグラフ」ともいう）、オブジェクトノードとグループ（ブロブ）間をエッジで連結したグラフ（ブロブ用グラフ）の２種類のグラフを用いる説明したが、情報処理システム１は様々な種類のグラフを用いてもよい。例えば、情報処理システム１は、オブジェクトグラフ及びブロブ用グラフとは異なり、グループ（ブロブ）間を連結したグラフ（以下「グループ連結グラフ」または「ブロブ連結グラフ」ともいう）を生成し、ブロブ連結グラフを検索に用いてもよい。 [3. Third embodiment]
In the above example, there are two graphs: a graph (also called an "object graph") in which nodes corresponding to objects (object nodes) are connected by edges, and a graph (blob graph) in which object nodes and groups (blobs) are connected by edges. Although the description uses two types of graphs, the information processing system 1 may use various types of graphs. For example, unlike object graphs and blob graphs, the information processing system 1 generates a graph that connects groups (blobs) (hereinafter also referred to as a "group connected graph" or "blob connected graph"), and generates a blob connected graph. may be used for the search.

この点について、以下、第３の実施形態として説明する。第３の実施形態においては、情報処理システム１は、情報処理装置１００または情報処理装置１００Ａに代えて、情報処理装置１００Ｂを有する。なお、第１の実施形態及び第２の実施形態等において上述した内容と同様の点については適宜説明を省略する。 This point will be described below as a third embodiment. In the third embodiment, the information processing system 1 includes an information processing device 100B instead of the information processing device 100 or the information processing device 100A. Note that descriptions of points similar to those described above in the first embodiment, second embodiment, etc. will be omitted as appropriate.

情報処理装置１００Ｂは、インデックス（インデックス情報）を用いて、一のオブジェクトが属する第１グループ（第１ブロブ）ら、一のオブジェクトの近傍オブジェクトが属する第２グループ（第２ブロブ）へエッジが連結されたグループ連結グラフを生成する。以下では、インデックスの一例としてグラフを用いる場合を一例として説明するが、上述したようにインデックスは、複数のオブジェクトを検索対象とするものであればどのようなものであってもよい。例えば、インデックスは、ハッシュインデックス、ツリーインデックス等、どのようなインデックスであってもよい。 The information processing device 100B uses the index (index information) to connect edges from a first group (first blob) to which one object belongs to a second group (second blob) to which neighboring objects of one object belong. generate a group connected graph. In the following, a case will be described in which a graph is used as an example of the index, but as described above, the index may be of any type as long as it searches for a plurality of objects. For example, the index may be any index such as a hash index or a tree index.

〔３－１．情報処理〕
まず、図２３を用いて、第３の実施形態に係る情報処理の概要を説明する。図２３は、第３の実施形態に係る情報処理の一例を示す図である。 [3-1. Information processing〕
First, an overview of information processing according to the third embodiment will be explained using FIG. 23. FIG. 23 is a diagram illustrating an example of information processing according to the third embodiment.

図２３の例では、情報処理装置１００Ｂは、空間情報ＳＰ２１に示すようなグラフＧＲ２１を取得済みであるものとする。なお、情報処理装置１００Ｂは、グラフ生成に関する様々な技術を適宜用いて、グラフＧＲ２１を生成してもよい。グラフＧＲ２１は図１２等で示すグラフＧＲ２１と同様であるため説明を省略する。 In the example of FIG. 23, it is assumed that the information processing apparatus 100B has already acquired a graph GR21 as shown in the spatial information SP21. Note that the information processing device 100B may generate the graph GR21 using various techniques related to graph generation as appropriate. Since the graph GR21 is similar to the graph GR21 shown in FIG. 12 etc., the explanation will be omitted.

情報処理装置１００Ｂは、任意の方法により、ブロブＢＬ１～ＢＬ１０等を示す情報を生成する。情報処理装置１００Ｂは、ｋ－ｍｅａｎｓ等の任意のクラスタリングにより各ノードをブロブＢＬ１～ＢＬ１０のいずれかに分類してもよい。なお、情報処理装置１００Ｂは、ブロブを生成可能であれば、どのような方法によりブロブを生成してもよい。例えば、情報処理装置１００Ｂは、上述した第１方法～第４方法のいずれかの方法によりブロブＢＬ１～ＢＬ１０等を示す情報を生成してもよい。 The information processing device 100B generates information indicating the blobs BL1 to BL10, etc. using an arbitrary method. The information processing device 100B may classify each node into one of the blobs BL1 to BL10 using arbitrary clustering such as k-means. Note that the information processing device 100B may use any method to generate blobs as long as it is capable of generating blobs. For example, the information processing device 100B may generate information indicating the blobs BL1 to BL10, etc. using any of the first to fourth methods described above.

情報処理装置１００Ｂは、空間情報ＳＰ２１に示すようなグラフＧＲ２１及びブロブの情報を用いて、ブロブ連結グラフを生成する（ステップＳ６１）。図２３の例では、情報処理装置１００Ｂは、空間情報ＳＰ４１に示すように、ブロブ（第１ブロブ）と他のブロブ（第２ブロブ）との間をエッジで連結したブロブ連結グラフであるグラフＧＲ４１を生成する。図２３に示す矢印線は、矢元のブロブ（第１ブロブ）から矢先のブロブ（第２ブロブ）への有向エッジを示す。すなわち、図２３に示す矢印線は、矢元のブロブを参照元とし、矢先のブロブを参照先とする有向エッジを示す。例えば、図２３では、ブロブＢＬ１からは、ブロブＢＬ２、ブロブＢＬ３、ブロブＢＬ４、ブロブＢＬ５及びブロブＢＬ８の５つのブロブへのエッジが連結されることを示す。 The information processing device 100B generates a blob connection graph using the graph GR21 as shown in the spatial information SP21 and the blob information (step S61). In the example of FIG. 23, the information processing device 100B uses a graph GR41, which is a blob connection graph in which a blob (first blob) and another blob (second blob) are connected by edges, as shown in spatial information SP41. generate. The arrow line shown in FIG. 23 indicates a directed edge from the blob at the base of the arrow (first blob) to the blob at the tip (second blob). That is, the arrow line shown in FIG. 23 indicates a directed edge that uses the blob at the beginning of the arrow as a reference source and the blob at the tip of the arrow as a reference destination. For example, FIG. 23 shows that edges from blob BL1 to five blobs, blob BL2, blob BL3, blob BL4, blob BL5, and blob BL8, are connected.

例えば、情報処理装置１００Ｂは、グラフＧＲ２１を検索し、検索結果を基にグラフＧＲ４１を生成する。例えば、情報処理装置１００Ｂは、以下のような処理（６－１）～（６－４）を行い、ブロブ連結グラフであるグラフＧＲ４１を生成する。なお、以下のオブジェクトは、オブジェクトに対応するノードと読み替えてもよい。 For example, the information processing device 100B searches for graph GR21 and generates graph GR41 based on the search results. For example, the information processing device 100B performs the following processes (6-1) to (6-4) to generate a graph GR41 that is a blob connection graph. Note that the following objects may be replaced with nodes corresponding to the objects.

（６－１）：ランダムでオブジェクトＮＸを取得（選択）
（６－２）：オブジェクトＮＸの近傍オブジェクトを定数ｋ件検索
（６－３）：各近傍オブジェクトの属するブロブを取得
（６－４）：オブジェクトＮＸが属するブロブから取得したブロブへのエッジを生成 (6-1): Obtain object NX randomly (selection)
(6-2): Search constant k neighboring objects of object NX (6-3): Obtain the blob to which each neighboring object belongs (6-4): Generate an edge from the blob to which object NX belongs to the obtained blob

例えば処理（６－１）では、情報処理装置１００Ｂは、オブジェクト情報記憶部１２１に記憶された複数のオブジェクトからランダムに一のオブジェクトＮＸを取得する。 For example, in process (6-1), the information processing device 100B randomly acquires one object NX from a plurality of objects stored in the object information storage unit 121.

例えば処理（６－２）では、情報処理装置１００Ｂは、図１１に示すような検索処理を、オブジェクトＮＸを対象（クエリ）とし、グラフＧＲ２１を用いて行うことにより、ｋ件（個）のオブジェクトをオブジェクトＮＸの近傍オブジェクトとして抽出する。 For example, in process (6-2), the information processing device 100B performs a search process as shown in FIG. is extracted as a neighboring object of object NX.

例えば処理（６－３）では、情報処理装置１００Ｂは、ブロブ情報記憶部１２５に記憶されたブロブの情報を参照し、オブジェクトＮＸの近傍オブジェクト（ノード）が対応付けられたブロブを取得する。 For example, in process (6-3), the information processing device 100B refers to the blob information stored in the blob information storage unit 125 and obtains a blob to which a neighboring object (node) of the object NX is associated.

例えば処理（６－４）では、情報処理装置１００Ｂは、オブジェクトＮＸが属するブロブから取得したブロブへのエッジを生成する。情報処理装置１００Ｂは、オブジェクトＮＸが属するブロブからすべての近傍オブジェクトの属するブロブへのエッジを生成する。例えば、情報処理装置１００Ｂは、オブジェクトＮＸが属するブロブの参照先として取得したブロブを示す情報（ブロブＩＤ）を、オブジェクトＮＸが属するブロブを示す情報（ブロブＩＤ）に対応付けて、ブロブ連結グラフ情報記憶部１２６に登録することにより、グラフＧＲ４１を生成する。 For example, in process (6-4), the information processing device 100B generates an edge from the blob to which the object NX belongs to the acquired blob. The information processing device 100B generates edges from the blob to which the object NX belongs to the blobs to which all neighboring objects belong. For example, the information processing device 100B associates the information (blob ID) indicating the blob acquired as a reference destination of the blob to which the object NX belongs with the information (blob ID) indicating the blob to which the object NX belongs, and generates blob connection graph information. By registering in the storage unit 126, a graph GR41 is generated.

例えば、情報処理装置１００Ｂは、上記の処理（６－１）～（６－４）を、処理対象となるオブジェクトが無くなるまで繰り返し行い、グラフＧＲ４１を生成する。なお、情報処理装置１００Ｂは、ブロブＸの参照先としてブロブＹが登録済みの場合、ブロブＸに属するオブジェクトの近傍オブジェクトとして、ブロブＹに属するオブジェクトが再度検索（抽出）されても、登録をスキップして各ブロブの参照先に同じブロブが重複して登録されることを抑制してもよい。 For example, the information processing device 100B repeatedly performs the above processes (6-1) to (6-4) until there are no more objects to be processed, and generates the graph GR41. Note that if blob Y has already been registered as a reference destination of blob The same blob may be prevented from being registered redundantly in the reference destination of each blob.

例えば、情報処理装置１００Ｂは、空間情報ＳＰ４１に示すように、ブロブ間をエッジで連結するグラフＧＲ４１を用いて検索処理を行う。例えば、情報処理装置１００Ｂは、検索クエリＱＥ２を対象として、ブロブ連結グラフであるグラフＧＲ４１を用いた図２７及び図２８に示すような検索処理を行うことにより、検索クエリＱＥ２の検索結果を得る。 For example, the information processing device 100B performs a search process using a graph GR41 that connects blobs with edges, as shown in spatial information SP41. For example, the information processing device 100B obtains the search results for the search query QE2 by performing search processing as shown in FIGS. 27 and 28 using the graph GR41, which is a blob connection graph, for the search query QE2.

上述したブロブ連結グラフの生成方法は一例に過ぎず、情報処理装置１００Ｂは、様々な方法によりブロブ連結グラフを生成してもよい。例えば、情報処理装置１００Ｂは、グラフを検索することなく、ブロブ連結グラフを生成してもよい。空間情報ＳＰ２１に示すブロブを検索により生成している場合、ブロブの生成の際の検索結果を流用して、グラフＧＲ４１等のブロブ連結グラフを生成してもよい。 The above-described method for generating a blob connected graph is merely an example, and the information processing apparatus 100B may generate a blob connected graph using various methods. For example, the information processing device 100B may generate a blob connection graph without searching for a graph. When the blob shown in the spatial information SP21 is generated by a search, a blob connection graph such as the graph GR41 may be generated by using the search results at the time of blob generation.

例えば、情報処理装置１００Ｂは、グラフＧＲ２１でのオブジェクトノードの接続関係を、オブジェクトノードが属するブロブの接続関係に変換して、ブロブ連結グラフを生成してもよい。例えば、ブロブＢＬ９に属するノードＮ１からはブロブＢＬ８に属するノードＮ８８及びブロブＢＬ１０に属するノードにエッジが連結されており、ブロブＢＬ９に属するノードＮ４からはブロブＢＬ７に属するノードにエッジが連結されている。そのため、情報処理装置１００Ｂは、ブロブＢＬ９から、ブロブＢＬ７、ブロブＢＬ８及びブロブＢＬ１０の３つのブロブへのエッジが連結されたブロブ連結グラフを生成する。このように、情報処理装置１００Ｂは、オブジェクトノード間を連結するエッジを基に、ブロブ間をエッジで連結するブロブ連結グラフを生成してもよい。 For example, the information processing device 100B may generate a blob connection graph by converting the connection relationship of object nodes in the graph GR21 into the connection relationship of blobs to which the object nodes belong. For example, an edge is connected from a node N1 belonging to blob BL9 to a node N88 belonging to blob BL8 and a node belonging to blob BL10, and an edge is connected from node N4 belonging to blob BL9 to a node belonging to blob BL7. . Therefore, the information processing device 100B generates a blob connection graph in which edges from blob BL9 to three blobs, blob BL7, blob BL8, and blob BL10, are connected. In this way, the information processing apparatus 100B may generate a blob connection graph that connects blobs with edges, based on edges that connect object nodes.

〔３－２．情報処理装置の構成〕
次に、図２４を用いて、第３の実施形態に係る情報処理装置１００Ｂの構成について説明する。図２４は、第３の実施形態に係る情報処理装置の構成例を示す図である。図２４に示すように、情報処理装置１００Ｂは、通信部１１０と、記憶部１２０Ｂと、制御部１３０Ｂとを有する。なお、情報処理装置１００Ｂにおいて、情報処理装置１００または情報処理装置１００Ａと同様の点は適宜説明を省略する。 [3-2. Configuration of information processing device]
Next, the configuration of the information processing apparatus 100B according to the third embodiment will be described using FIG. 24. FIG. 24 is a diagram illustrating a configuration example of an information processing apparatus according to the third embodiment. As shown in FIG. 24, the information processing device 100B includes a communication section 110, a storage section 120B, and a control section 130B. Note that in the information processing device 100B, descriptions of the same points as the information processing device 100 or the information processing device 100A will be omitted as appropriate.

（記憶部１２０Ｂ）
記憶部１２０Ｂは、例えば、ＲＡＭ、フラッシュメモリ等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。第３の実施形態に係る記憶部１２０Ｂは、図２４に示すように、オブジェクト情報記憶部１２１と、グラフ情報記憶部１２２と、量子化情報記憶部１２３と、コードブック情報記憶部１２４と、ブロブ情報記憶部１２５と、ブロブ連結グラフ情報記憶部１２６とを有する。 (Storage unit 120B)
The storage unit 120B is realized by, for example, a semiconductor memory device such as a RAM or a flash memory, or a storage device such as a hard disk or an optical disk. As shown in FIG. 24, the storage unit 120B according to the third embodiment includes an object information storage unit 121, a graph information storage unit 122, a quantization information storage unit 123, a codebook information storage unit 124, and a blob information storage unit 121. It has an information storage section 125 and a blob connection graph information storage section 126.

（ブロブ連結グラフ情報記憶部１２６）
第３の実施形態に係るブロブ連結グラフ情報記憶部１２６は、ブロブ連結グラフに関する各種情報を記憶する。例えば、ブロブ連結グラフ情報記憶部１２６は、生成したブロブ連結グラフを記憶する。図２５は、第３の実施形態に係るブロブ連結グラフ情報記憶部の一例を示す図である。図２５に示すブロブ連結グラフ情報記憶部１２６は、「ブロブＩＤ」および「接続ブロブ情報」といった項目を有する。 (Blob connection graph information storage unit 126)
The blob connection graph information storage unit 126 according to the third embodiment stores various information regarding the blob connection graph. For example, the blob connection graph information storage unit 126 stores the generated blob connection graph. FIG. 25 is a diagram illustrating an example of a blob connection graph information storage unit according to the third embodiment. The blob connection graph information storage unit 126 shown in FIG. 25 has items such as "blob ID" and "connection blob information."

「ブロブＩＤ」は、グラフにおける各ブロブ（グループ）を識別するための識別情報を示す。「接続ブロブ情報」は、対応するブロブから辿ることができるブロブ（参照先のブロブ）に関する情報を示す。例えば、「接続ブロブ情報」には、「参照先」といった情報が含まれる。「参照先」は、エッジにより連結され、そのブロブから辿ることができる参照先（ブロブ）を識別するための情報を示す。すなわち、図２５の例では、ブロブを識別するブロブＩＤに対して、そのブロブからエッジにより辿ることができる参照先（ブロブ）が対応付けられて登録されている。なお、「接続ブロブ情報」には、参照先に接続されるエッジを識別するための情報（エッジＩＤ）等が含まれてもよい。 “Blob ID” indicates identification information for identifying each blob (group) in the graph. “Connection blob information” indicates information regarding a blob (reference destination blob) that can be traced from a corresponding blob. For example, "connection blob information" includes information such as "reference destination." “Reference destination” indicates information for identifying a reference destination (blob) that is connected by an edge and can be traced from the blob. That is, in the example of FIG. 25, a reference destination (blob) that can be traced from the blob by an edge is registered in association with a blob ID that identifies the blob. Note that the "connection blob information" may include information (edge ID) for identifying an edge connected to a reference destination.

図２５の例では、ブロブＩＤ「ＢＬ１」により識別されるブロブ（ブロブＢＬ１）からは、ブロブＩＤ「ＢＬ２」、「ＢＬ３」、「ＢＬ４」、「ＢＬ５」、「ＢＬ８」の各々により識別される５つのブロブにエッジが連結されていることを示す。すなわち、ブロブＢＬ１からは、ブロブＢＬ２、ＢＬ３、ＢＬ４、ＢＬ５、ＢＬ８の５つのブロブの各々へ辿ることができることを示す。 In the example of FIG. 25, the blob (blob BL1) identified by the blob ID "BL1" is identified by each of the blob IDs "BL2", "BL3", "BL4", "BL5", and "BL8". Shows that edges are connected to five blobs. That is, it shows that each of the five blobs BL2, BL3, BL4, BL5, and BL8 can be traced from the blob BL1.

なお、ブロブ連結グラフ情報記憶部１２６は、上記に限らず、目的に応じて種々の情報を記憶してもよい。また、ブロブ連結グラフは、クエリを入力とし、ブロブ連結グラフ中のエッジを辿ることによりオブジェクト（オブジェクトノード）を探索し、クエリに類似するオブジェクトを抽出し出力するプログラムモジュールを含んでもよい。すなわち、ブロブ連結グラフは、ブロブ連結グラフを用いて検索処理を行うプログラムモジュールとしての利用が想定されるものであってもよい。例えば、ブロブ連結グラフであるグラフＧＲ４１は、クエリとしてベクトルデータが入力された場合に、そのベクトルデータに類似するベクトルデータに対応するオブジェクトをブロブ連結グラフにより抽出し、出力するプログラムであってもよい。例えば、グラフＧＲ４１は、クエリ画像に対応する類似画像を検索するプログラムモジュールとして利用されるデータであってもよい。例えば、グラフＧＲ４１は、入力されたクエリに基づいて、ブロブ連結グラフにおいてそのクエリに類似するオブジェクトを抽出し、出力するよう、コンピュータを機能させる。 Note that the blob connection graph information storage unit 126 is not limited to the above, and may store various information depending on the purpose. The blob connection graph may also include a program module that receives a query as input, searches for objects (object nodes) by tracing edges in the blob connection graph, and extracts and outputs objects similar to the query. That is, the blob connection graph may be assumed to be used as a program module that performs search processing using the blob connection graph. For example, the graph GR41, which is a blob connection graph, may be a program that, when vector data is input as a query, extracts and outputs objects corresponding to vector data similar to the vector data using a blob connection graph. . For example, the graph GR41 may be data used as a program module that searches for similar images corresponding to a query image. For example, based on an input query, the graph GR41 causes the computer to extract and output objects similar to the query in the blob connection graph.

（制御部１３０Ｂ）
図２４の説明に戻って、制御部１３０Ｂは、コントローラ（controller）であり、例えば、ＣＰＵやＭＰＵやＧＰＵ等によって、情報処理装置１００Ｂ内部の記憶装置に記憶されている各種プログラム（情報処理プログラムの一例に相当）がＲＡＭを作業領域として実行されることにより実現される。また、制御部１３０Ｂは、コントローラであり、例えば、ＡＳＩＣやＦＰＧＡ等の集積回路により実現される。 (Control unit 130B)
Returning to the explanation of FIG. 24, the control unit 130B is a controller, and includes various programs (information processing programs) stored in a storage device inside the information processing device 100B by, for example, a CPU, MPU, GPU, etc. (corresponding to one example) is realized by being executed using RAM as a work area. Further, the control unit 130B is a controller, and is realized by, for example, an integrated circuit such as an ASIC or an FPGA.

図２４に示すように、制御部１３０Ｂは、取得部１３１と、生成部１３２Ｂと、検索処理部１３３Ｂと、提供部１３４とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部１３０Ｂの内部構成は、図２４に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。 As shown in FIG. 24, the control unit 130B includes an acquisition unit 131, a generation unit 132B, a search processing unit 133B, and a provision unit 134, and realizes or executes information processing functions and operations described below. do. Note that the internal configuration of the control unit 130B is not limited to the configuration shown in FIG. 24, and may be any other configuration as long as it performs information processing to be described later.

第３の実施形態に係る取得部１３１は、データ検索の対象となる複数のオブジェクトが分類された複数のブロブを示すブロブ情報と、複数のオブジェクトを検索対象とするインデックスを示すインデックス情報とを取得する。取得部１３１は、複数のオブジェクトの各々に対応する複数のノードがエッジにより連結されたグラフを示すインデックス情報を取得する。取得部１３１は、クラスタリング処理により複数のオブジェクトが分類された複数のブロブを示すブロブ情報を取得する。取得部１３１は、各々に属するオブジェクトが相互に排他的な複数のブロブを示すブロブ情報を取得する。取得部１３１は、クエリを取得する。 The acquisition unit 131 according to the third embodiment acquires blob information indicating a plurality of blobs in which a plurality of objects to be searched are classified, and index information indicating an index for searching a plurality of objects. do. The acquisition unit 131 acquires index information indicating a graph in which a plurality of nodes corresponding to each of a plurality of objects are connected by edges. The acquisition unit 131 acquires blob information indicating a plurality of blobs in which a plurality of objects are classified by clustering processing. The acquisition unit 131 acquires blob information indicating a plurality of blobs, each of which has mutually exclusive objects. The acquisition unit 131 acquires a query.

（生成部１３２Ｂ）
生成部１３２Ｂは、生成部１３２または生成部１３２Ａと同様に各種情報を生成する。 (Generation unit 132B)
The generation unit 132B generates various types of information in the same way as the generation unit 132 or the generation unit 132A.

生成部１３２Ｂは、取得部１３１により取得されたインデックス情報を用いて、複数のオブジェクトのうち一のオブジェクトが属する第１ブロブから、一のオブジェクトの近傍オブジェクトが属するブロブである第２ブロブへエッジが連結されたブロブ連結グラフを生成する。生成部１３２Ｂは、複数のオブジェクトから一のオブジェクトを選択し、インデックス情報を用いて複数のオブジェクトを対象に、一のオブジェクトの前近傍オブジェクトを検索し、一のオブジェクトが属する第１ブロブから、近傍オブジェクトが属する第２ブロブへエッジを連結することにより、ブロブ連結グラフを生成する。 The generation unit 132B uses the index information acquired by the acquisition unit 131 to generate an edge from a first blob to which one of the plurality of objects belongs to a second blob to which a neighboring object of the first object belongs. Generate a connected blob connectivity graph. The generation unit 132B selects one object from the plurality of objects, searches for the previous neighboring objects of the one object among the plurality of objects using the index information, and selects the neighboring objects from the first blob to which the one object belongs. A blob connectivity graph is generated by connecting edges to the second blob to which the object belongs.

生成部１３２Ｂは、複数のオブジェクトからランダムに一のオブジェクトを選択し、一のオブジェクトの近傍オブジェクトを検索し、一のオブジェクトが属する第１ブロブから、近傍オブジェクトが属する第２ブロブへエッジを連結することにより、ブロブ連結グラフを生成する。生成部１３２Ｂは、グラフを用いて第１ブロブから第２ブロブへエッジが連結されたブロブ連結グラフを生成する。 The generation unit 132B randomly selects one object from a plurality of objects, searches for neighboring objects of the one object, and connects edges from the first blob to which the one object belongs to the second blob to which the neighboring object belongs. By doing this, a blob connected graph is generated. The generation unit 132B uses the graph to generate a blob connection graph in which edges are connected from the first blob to the second blob.

生成部１３２Ｂは、グラフを用いて、一のオブジェクトの近傍オブジェクトを検索し、一のオブジェクトが属する第１ブロブから、近傍オブジェクトが属する第２ブロブへエッジを連結することにより、ブロブ連結グラフを生成する。生成部１３２Ｂは、第１ブロブから、グラフにおいて一のオブジェクトに対応する一のノードにエッジで連結されたノードである近傍ノードに対応する近傍オブジェクトが属する第２ブロブへエッジを連結することにより、ブロブ連結グラフを生成する。 The generation unit 132B uses the graph to search for neighboring objects of one object, and generates a blob connection graph by connecting edges from the first blob to which the one object belongs to the second blob to which the neighboring object belongs. do. The generation unit 132B connects an edge from the first blob to a second blob to which a neighboring object corresponding to a neighboring node, which is a node connected by an edge to one node corresponding to one object in the graph, belongs. Generate a blob connected graph.

生成部１３２Ｂは、第１ブロブから、第１ブロブの代表点である第１代表点と、第２ブロブの代表点である第２代表点との間をエッジで連結することにより、ブロブ連結グラフを生成する。生成部１３２Ｂは、第１ブロブのセントロイドである第１代表点と、第２ブロブのセントロイドである第２代表点との間をエッジで連結することにより、ブロブ連結グラフを生成する。生成部１３２Ｂは、第１ブロブに属するオブジェクトを基に算出される中心点である第１代表点と、第２ブロブに属するオブジェクトを基に算出される中心点である第２代表点との間をエッジで連結することにより、ブロブ連結グラフを生成する。 The generation unit 132B generates a blob connection graph by connecting, from the first blob, a first representative point that is a representative point of the first blob and a second representative point that is a representative point of a second blob. generate. The generation unit 132B generates a blob connection graph by connecting the first representative point, which is the centroid of the first blob, and the second representative point, which is the centroid of the second blob, with an edge. The generation unit 132B generates an image between a first representative point, which is a center point calculated based on an object belonging to the first blob, and a second representative point, which is a center point calculated based on an object belonging to a second blob. A blob-connected graph is generated by connecting the blobs with edges.

（検索処理部１３３Ｂ）
検索処理部１３３Ｂは、検索処理部１３３または検索処理部１３３Ａと同様に検索処理に関する各種処理を行う。 (Search processing unit 133B)
The search processing unit 133B performs various processes related to search processing in the same way as the search processing unit 133 or the search processing unit 133A.

検索処理部１３３Ｂは、生成部１３２Ｂにより生成されたブロブ連結グラフを用いた検索処理を行う。検索処理部１３３Ｂは、ブロブ連結グラフを用いて、検索クエリの近傍オブジェクトを検索する検索処理を行う。検索処理部１３３Ｂは、ブロブ連結グラフにおいてブロブ間を連結するエッジを辿ることにより、検索クエリの近傍オブジェクトを検索する検索処理を行う。 The search processing unit 133B performs a search process using the blob connection graph generated by the generation unit 132B. The search processing unit 133B performs a search process to search for nearby objects of the search query using the blob connection graph. The search processing unit 133B performs a search process to search for nearby objects of the search query by tracing edges connecting blobs in the blob connection graph.

検索処理部１３３Ｂは、検索処理において、複数のブロブを示すブロブ情報を用いて、一のブロブに属するオブジェクトである対象オブジェクトと検索クエリとの距離を、ベクトル量子化された対象オブジェクトのベクトル情報を用いて算出する。検索処理部１３３Ｂは、対象オブジェクトと検索クエリとの距離の算出を並列化して一括で行う。検索処理部１３３Ｂは、情報処理装置の仕様に基づいて決定される一括処理数の対象オブジェクトと検索クエリとの距離の算出を並列処理する。 In the search process, the search processing unit 133B uses blob information indicating a plurality of blobs to calculate the distance between the target object, which is an object belonging to one blob, and the search query, and the vector information of the target object that has been vector quantized. Calculate using The search processing unit 133B parallelizes and collectively calculates the distance between the target object and the search query. The search processing unit 133B performs parallel processing to calculate the distance between the search query and the target object of the batch processing number determined based on the specifications of the information processing device.

〔３－３．情報処理のフロー〕
次に、図２６を用いて、第３の実施形態に係る情報処理の手順について説明する。図２６は、第３の実施形態に係る情報処理の一例を示すフローチャートである。 [3-3. Information processing flow]
Next, the information processing procedure according to the third embodiment will be described using FIG. 26. FIG. 26 is a flowchart illustrating an example of information processing according to the third embodiment.

図２６に示すように、情報処理装置１００Ｂは、データ検索の対象となる複数のオブジェクトが分類された複数のグループを示すグループ情報を取得する（ステップＳ５０１）。例えば、情報処理装置１００Ｂは、データ検索の対象となる複数のオブジェクトに対応するノード（オブジェクトノード）が分類された複数のブロブを示すブロブ情報を取得する。 As shown in FIG. 26, the information processing apparatus 100B acquires group information indicating a plurality of groups into which a plurality of objects to be data searched are classified (step S501). For example, the information processing device 100B acquires blob information indicating a plurality of blobs in which nodes (object nodes) corresponding to a plurality of objects to be data searched are classified.

情報処理装置１００Ｂは、複数のオブジェクトを検索対象とするインデックスを示すインデックス情報を取得する（ステップＳ５０２）。例えば、情報処理装置１００Ｂは、複数のオブジェクトに対応するノード（オブジェクトノード）がエッジで連結されたグラフをインデックスとして取得する。 The information processing apparatus 100B acquires index information indicating an index that searches for a plurality of objects (step S502). For example, the information processing device 100B obtains, as an index, a graph in which nodes (object nodes) corresponding to a plurality of objects are connected by edges.

情報処理装置１００Ｂは、インデックス情報を用いて、複数のオブジェクトのうち一のオブジェクトが属する第１グループから、一のオブジェクトの近傍オブジェクトが属するグループである第２グループへエッジが連結されたグループ連結グラフを生成する（ステップＳ５０３）。例えば、情報処理装置１００Ｂは、インデックス情報を用いて、複数のオブジェクトのうち一のオブジェクトが属する第１ブロブから、一のオブジェクトの近傍オブジェクトが属するブロブである第２ブロブへエッジが連結されたブロブ連結グラフを生成する。 The information processing device 100B uses index information to create a group connection graph in which edges are connected from a first group to which one of the plurality of objects belongs to a second group to which a neighboring object of the one object belongs. is generated (step S503). For example, the information processing device 100B uses index information to create a blob whose edges are connected from a first blob to which one of the plurality of objects belongs to a second blob which is a blob to which a neighboring object of the one object belongs. Generate a connected graph.

〔３－４．検索処理例〕
ここで、第３の実施形態に係る検索処理の一例について、図２７及び図２８を一例として説明する。図２７及び図２８は、第３の実施形態に係る検索処理の一例を示すフローチャートである。以下に説明する検索処理は、情報処理装置１００Ｂの検索処理部１３３Ｂによって行われる。なお、第１の実施形態や第２の実施形態で説明した検索処理と同様の点については適宜説明を省略する。 [3-4. Search processing example]
Here, an example of the search process according to the third embodiment will be described using FIGS. 27 and 28 as examples. 27 and 28 are flowcharts illustrating an example of search processing according to the third embodiment. The search processing described below is performed by the search processing unit 133B of the information processing device 100B. Note that descriptions of points similar to the search processing described in the first embodiment and the second embodiment will be omitted as appropriate.

図２７及び図２８では、Ｎ（Ｇ、ｓ）およびＣはブロブの集合（ブロブ集合）となる。また、「Ｇ」は、ブロブ間がエッジで連結されたグラフ（ブロブ連結グラフ）データ（例えば、空間情報ＳＰ４１に示すグラフＧＲ４１等）であってもよい。例えば、情報処理装置１００Ｂは、ｋ近傍検索処理を実行する。 In FIGS. 27 and 28, N(G, s) and C are a set of blobs (blob set). Further, “G” may be graph data (blob connected graph) in which blobs are connected by edges (for example, graph GR41 shown in spatial information SP41). For example, the information processing device 100B executes k-nearest neighbor search processing.

まず、図２７に示す処理（メイン処理）について説明する。例えば、情報処理装置１００Ｂは、超球の半径ｒを∞（無限大）に設定し、ブロブ集合Ｃを空集合（Φ）に設定する（ステップＳ６０１）。そして、情報処理装置１００Ｂは、既存のブロブ集合（全てのブロブ）から部分ブロブ集合Ｂを抽出する（ステップＳ６０２）。例えば、情報処理装置１００Ｂは、ルートノードとして選択されたオブジェクト（ノード）が属するブロブを部分ブロブ集合Ｂとして抽出してもよい。また、例えば、情報処理装置１００Ｂは、ランダムにブロブを部分ブロブ集合Ｂとして抽出してもよい。 First, the process (main process) shown in FIG. 27 will be described. For example, the information processing device 100B sets the radius r of the hypersphere to ∞ (infinity) and sets the blob set C to the empty set (Φ) (step S601). Then, the information processing device 100B extracts a partial blob set B from the existing blob set (all blobs) (step S602). For example, the information processing device 100B may extract, as the partial blob set B, the blob to which the object (node) selected as the root node belongs. Further, for example, the information processing device 100B may randomly extract blobs as the partial blob set B.

そして、情報処理装置１００Ｂは、判定処理を実行する（ステップＳ６０３）。情報処理装置１００Ｂは、図２８に示すステップＳ７０１～Ｓ７１７の判定処理を実行する。図２８に示す判定処理の詳細については後述する。 The information processing device 100B then executes determination processing (step S603). The information processing device 100B executes the determination processing of steps S701 to S717 shown in FIG. 28. Details of the determination process shown in FIG. 28 will be described later.

そして、情報処理装置１００Ｂは、ブロブ集合Ｓに部分ブロブ集合Ｂを設定する（ステップＳ６０４）。 Then, the information processing device 100B sets partial blob set B in blob set S (step S604).

情報処理装置１００Ｂは、ブロブ集合Ｓに含まれるブロブの中で最小のクエリ距離ｄのブロブｓを選択する（ステップＳ６０５）。なお、ここでいう「クエリ距離」とは、ブロブ内の全てのオブジェクトとクエリオブジェクト（クエリ）間の中で最も短い距離である。例えば、情報処理装置１００Ｂは、クエリ（検索クエリ）となるオブジェクトをｙとすると、ブロブ集合Ｓのうち、オブジェクトｙとのクエリ距離が最も短いブロブｓを選択する。なお、クエリ距離は、上記に限らず、例えばクエリとブロブの代表点（セントロイド等）との間の距離であってもよい。そして、情報処理装置１００Ｂは、ブロブ集合Ｓからブロブｓを除外する（ステップＳ６０６）。 The information processing device 100B selects the blob s with the minimum query distance d among the blobs included in the blob set S (step S605). Note that the "query distance" here is the shortest distance between all objects in the blob and the query object (query). For example, when the object serving as a query (search query) is y, the information processing device 100B selects the blob s having the shortest query distance to the object y from the blob set S. Note that the query distance is not limited to the above, and may be, for example, the distance between the query and a representative point (centroid, etc.) of the blob. The information processing device 100B then excludes the blob s from the blob set S (step S606).

そして、情報処理装置１００Ｂは、ブロブｓのクエリ距離ｄがｒ（１＋ε）を超えるか否かを判定する（ステップＳ６０７）。ブロブｓのクエリ距離ｄがｒ（１＋ε）を超える場合（ステップＳ６０７：Ｙｅｓ）、情報処理装置１００Ｂは、オブジェクト集合Ｒをオブジェクトｙの近傍オブジェクト集合として出力し（ステップＳ６０８）、処理を終了する。 Then, the information processing device 100B determines whether the query distance d of the blob s exceeds r(1+ε) (step S607). If the query distance d of the blob s exceeds r(1+ε) (step S607: Yes), the information processing device 100B outputs the object set R as a neighboring object set of the object y (step S608), and ends the process.

ブロブｓのクエリ距離ｄがｒ（１＋ε）を超えない場合（ステップＳ６０７：Ｎｏ）、情報処理装置１００Ｂは、ブロブｓがブロブ集合Ｃに含まれるかを判定する（ステップＳ６０９）。 If the query distance d of the blob s does not exceed r(1+ε) (step S607: No), the information processing device 100B determines whether the blob s is included in the blob set C (step S609).

ブロブｓがブロブ集合Ｃに含まれる場合（ステップＳ６０９：Ｙｅｓ）、情報処理装置１００Ｂは、ステップＳ６０５に戻って処理を繰り返す。 If the blob s is included in the blob set C (step S609: Yes), the information processing device 100B returns to step S605 and repeats the process.

ブロブｓがブロブ集合Ｃに含まれない場合（ステップＳ６０９：Ｎｏ）、情報処理装置１００Ｂは、ブロブ集合Ｃにブロブｓを追加する（ステップＳ６１０）。 If the blob s is not included in the blob set C (step S609: No), the information processing device 100B adds the blob s to the blob set C (step S610).

そして、情報処理装置１００Ｂは、部分ブロブ集合Ｂにブロブｓの近傍ブロブ集合Ｎ（Ｇ，ｓ）を設定する（ステップＳ６１１）。近傍ブロブ集合Ｎ（Ｇ，ｓ）は、例えばブロブｓに関連付けられているブロブ（近傍のブロブ）の集合である。例えば、近傍ブロブ集合Ｎ（Ｇ，ｓ）は、ブロブｓからのエッジが連結されているブロブ（近傍のブロブ）の集合である。そして、情報処理装置１００Ｂは、判定処理を実行する（ステップＳ６１２）。詳細は後述するが、例えば、情報処理装置１００Ｂは、上述したステップＳ６０３と同様に、図２８に示すステップＳ７０１～Ｓ７１７の判定処理を実行する。 Then, the information processing device 100B sets a neighboring blob set N(G, s) of the blob s in the partial blob set B (step S611). The neighboring blob set N(G, s) is, for example, a set of blobs (nearby blobs) associated with blob s. For example, the neighboring blob set N(G, s) is a set of blobs (nearby blobs) to which edges from blob s are connected. The information processing device 100B then executes determination processing (step S612). Although details will be described later, for example, the information processing device 100B executes the determination processing of steps S701 to S717 shown in FIG. 28, similar to step S603 described above.

そして、情報処理装置１００Ｂは、ブロブ集合Ｓが空集合（Φ）であるか否かを判定する（ステップＳ６１３）。ブロブ集合Ｓが空集合でない場合（ステップＳ６１３：Ｎｏ）、情報処理装置１００Ｂは、ステップＳ６０５に戻って処理を繰り返す。また、ブロブ集合Ｓが空集合である場合（ステップＳ６１３：Ｙｅｓ）、情報処理装置１００Ｂは、オブジェクト集合Ｒを出力し、処理を終了する（ステップＳ６１４）。例えば、情報処理装置１００Ｂは、オブジェクト集合Ｒに含まれるオブジェクト（ノード）を追加ノード（入力オブジェクトｙ）に対応する近傍ノードとして選択してもよい。例えば、情報処理装置１００Ｂは、オブジェクト集合Ｒに含まれるオブジェクト（ノード）を対象ノード（入力オブジェクトｙ）に対応する近傍ノードとして抽出（選択）してもよい。また、例えば、情報処理装置１００Ｂは、オブジェクト集合Ｒに含まれるオブジェクト（ノード）を検索クエリ（入力オブジェクトｙ）に対応する検索結果として、検索を行った端末装置等へ提供してもよい。 Then, the information processing device 100B determines whether the blob set S is an empty set (Φ) (step S613). If the blob set S is not an empty set (step S613: No), the information processing device 100B returns to step S605 and repeats the process. Furthermore, if the blob set S is an empty set (step S613: Yes), the information processing device 100B outputs the object set R and ends the process (step S614). For example, the information processing device 100B may select an object (node) included in the object set R as a neighboring node corresponding to the additional node (input object y). For example, the information processing device 100B may extract (select) objects (nodes) included in the object set R as neighboring nodes corresponding to the target node (input object y). Further, for example, the information processing device 100B may provide the objects (nodes) included in the object set R as the search results corresponding to the search query (input object y) to the terminal device or the like that performed the search.

ここから、図２８に示す処理（判定処理）について説明する。まず、情報処理装置１００Ｂは、ブロブ集合Ｔに部分ブロブ集合Ｂを設定する（ステップＳ７０１）。 From here, the process (determination process) shown in FIG. 28 will be explained. First, the information processing device 100B sets a partial blob set B in the blob set T (step S701).

そして、情報処理装置１００Ｂは、ブロブ集合Ｔからブロブｂを一つ取得し、ブロブ集合Ｔからブロブｂを削除する（ステップＳ７０２）。そして、情報処理装置１００Ｂは、判定処理に用いる変数であるｍｉｎを∞（無限大）に設定する（ステップＳ７０３）。 Then, the information processing device 100B obtains one blob b from the blob set T, and deletes blob b from the blob set T (step S702). Then, the information processing device 100B sets min, which is a variable used in the determination process, to ∞ (infinity) (step S703).

そして、情報処理装置１００Ｂは、ブロブｂからオブジェクトｕを一つ取得し、ブロブｂからオブジェクトｕを削除する（ステップＳ７０４）。 Then, the information processing device 100B acquires one object u from blob b, and deletes object u from blob b (step S704).

そして、情報処理装置１００Ｂは、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｍｉｎ未満か否かを判定する（ステップＳ７０５）。 Then, the information processing device 100B determines whether the distance d(u, y) between the object u and the object y is less than min (step S705).

オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｍｉｎ未満である場合（ステップＳ７０５：Ｙｅｓ）、情報処理装置１００Ｂは、ｍｉｎの値を距離ｄ（ｕ，ｙ）に設定（更新）し（ステップＳ７０６）、ステップＳ７０７の処理を行う。 If the distance d(u,y) between the object u and the object y is less than min (step S705: Yes), the information processing device 100B sets (updates) the value of min to the distance d(u,y). (Step S706), and the process of Step S707 is performed.

オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｍｉｎ未満ではない場合（ステップＳ７０５：Ｎｏ）、情報処理装置１００Ｂは、ステップＳ７０６の処理を行わず、ステップＳ７０７の処理を行う。 If the distance d(u, y) between object u and object y is not less than min (step S705: No), the information processing device 100B does not perform the process of step S706, but performs the process of step S707.

情報処理装置１００Ｂは、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ（１＋ε）以下であるか否かを判定する（ステップＳ７０７）。 The information processing device 100B determines whether the distance d(u, y) between the object u and the object y is less than or equal to r(1+ε) (step S707).

オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ（１＋ε）以下である場合（ステップＳ７０７：Ｙｅｓ）、情報処理装置１００Ｂは、ブロブｂをブロブ集合Ｓに追加し（ステップＳ７０８）、ステップＳ７０９の処理を行う。 If the distance d(u,y) between the object u and the object y is less than or equal to r(1+ε) (step S707: Yes), the information processing device 100B adds the blob b to the blob set S (step S708), The process of step S709 is performed.

オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ（１＋ε）以下ではない場合（ステップＳ７０７：Ｎｏ）、情報処理装置１００Ｂは、ステップＳ７０８の処理を行わず、ステップＳ７０９の処理を行う。 If the distance d(u, y) between object u and object y is not less than or equal to r(1+ε) (step S707: No), the information processing device 100B performs the process of step S709 without performing the process of step S708. .

情報処理装置１００Ｂは、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ以下であるか否かを判定する（ステップＳ７０９）。 The information processing device 100B determines whether the distance d(u, y) between the object u and the object y is less than or equal to r (step S709).

オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ以下である場合（ステップＳ７０９：Ｙｅｓ）、情報処理装置１００Ｂは、オブジェクトｕをオブジェクト集合Ｒに追加し（ステップＳ７１０）、ステップＳ７１１の処理を行う。 If the distance d(u,y) between the object u and the object y is less than or equal to r (step S709: Yes), the information processing device 100B adds the object u to the object set R (step S710) and performs the steps in step S711. Perform processing.

オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ以下ではない場合（ステップＳ７０９：Ｎｏ）、情報処理装置１００Ｂは、ステップＳ７１０の処理を行わず、ステップＳ７１１の処理を行う。 If the distance d(u, y) between the object u and the object y is not less than or equal to r (step S709: No), the information processing device 100B performs the process of step S711 without performing the process of step S710.

情報処理装置１００Ｂは、オブジェクト集合Ｒに含まれるオブジェクト数がｋｓを超えるか否かを判定する（ステップＳ７１１）。所定数ｋｓは、任意に定められる自然数である。例えば、ｋｓは、検索数や抽出対象数であってもよい。また、例えば、範囲検索等において抽出するオブジェクト数の上限を設けない場合、ｋｓは、無限大に設定されてもよい。例えば、ｋｓ＝４等であってもよい。 The information processing device 100B determines whether the number of objects included in the object set R exceeds ks (step S711). The predetermined number ks is an arbitrarily determined natural number. For example, ks may be the number of searches or the number of extraction targets. Further, for example, when there is no upper limit on the number of objects to be extracted in a range search or the like, ks may be set to infinity. For example, ks may be 4, etc.

オブジェクト集合Ｒに含まれるオブジェクト数がｋｓを超える場合（ステップＳ７１１：Ｙｅｓ）、情報処理装置１００Ｂは、オブジェクト集合Ｒに含まれるオブジェクトの中でオブジェクトｙから最も遠いオブジェクトを、オブジェクト集合Ｒから除外し（ステップＳ７１２）、ステップＳ７１３の処理を行う。 If the number of objects included in the object set R exceeds ks (step S711: Yes), the information processing device 100B excludes from the object set R the object that is farthest from the object y among the objects included in the object set R. (Step S712), and performs the process of Step S713.

オブジェクト集合Ｒに含まれるオブジェクト数がｋｓを超えない場合（ステップＳ７１１：Ｎｏ）、情報処理装置１００Ｂは、ステップＳ７１２の処理を行わず、ステップＳ７１３の処理を行う。 If the number of objects included in the object set R does not exceed ks (step S711: No), the information processing device 100B performs the process of step S713 without performing the process of step S712.

情報処理装置１００Ｂは、オブジェクト集合Ｒに含まれるオブジェクト数がｋｓと一致するか否かを判定する（ステップＳ７１３）。 The information processing device 100B determines whether the number of objects included in the object set R matches ks (step S713).

オブジェクト集合Ｒに含まれるオブジェクト数がｋｓと一致する場合（ステップＳ７１３：Ｙｅｓ）、情報処理装置１００Ｂは、オブジェクト集合Ｒに含まれるオブジェクトの中でオブジェクトｙから最も遠いオブジェクトと、オブジェクトｙとの距離をｒに設定し（ステップＳ７１４）、ステップＳ７１５の処理を行う。 If the number of objects included in the object set R matches ks (step S713: Yes), the information processing device 100B determines the distance between the object y and the object that is farthest from the object y among the objects included in the object set R. is set to r (step S714), and the process of step S715 is performed.

オブジェクト集合Ｒに含まれるオブジェクト数がｋｓと一致しない場合（ステップＳ７１３：Ｎｏ）、情報処理装置１００Ｂは、ステップＳ７１４の処理を行わず、ステップＳ７１５の処理を行う。 If the number of objects included in the object set R does not match ks (step S713: No), the information processing device 100B performs the process of step S715 without performing the process of step S714.

情報処理装置１００Ｂは、ブロブｂが空集合（Φ）であるか否かを判定する（ステップＳ７１５）。ブロブｂが空集合でない場合（ステップＳ７１５：Ｎｏ）、情報処理装置１００Ｂは、ステップＳ７０４に戻って処理を繰り返す。 The information processing device 100B determines whether blob b is an empty set (Φ) (step S715). If blob b is not an empty set (step S715: No), the information processing device 100B returns to step S704 and repeats the process.

また、ブロブｂが空集合である場合（ステップＳ７１５：Ｙｅｓ）、情報処理装置１００Ｂは、ｍｉｎを部分ブロブ集合Ｂ内のブロブｂのブロブのクエリ距離とする（ステップＳ７１６）。例えば、情報処理装置１００Ｂは、ブロブｂの全てのオブジェクトとクエリとの間の距離のうち最も短い距離をクエリ距離としてｍｉｎに設定する。 Further, when blob b is an empty set (step S715: Yes), the information processing device 100B sets min as the blob query distance of blob b in partial blob set B (step S716). For example, the information processing device 100B sets the shortest distance among the distances between all objects of blob b and the query to min as the query distance.

情報処理装置１００Ｂは、ブロブ集合Ｔが空集合（Φ）であるか否かを判定する（ステップＳ７１７）。ブロブ集合Ｔが空集合でない場合（ステップＳ７１７：Ｎｏ）、情報処理装置１００Ｂは、ステップＳ７０２に戻って処理を繰り返す。 The information processing device 100B determines whether the blob set T is an empty set (Φ) (step S717). If the blob set T is not an empty set (step S717: No), the information processing device 100B returns to step S702 and repeats the process.

また、ブロブ集合Ｔが空集合である場合（ステップＳ７１７：Ｙｅｓ）、判定処理を終了する。 Further, if the blob set T is an empty set (step S717: Yes), the determination process is ended.

〔３－５．変形例〕
ブロブ連結グラフは、上述した例に限らず様々な態様により構成されてもよい。例えば、ブロブ連結グラフは、各グループ（ブロブ）の代表点間がエッジで連結されたグラフであってもよい。この点について、以下、第３の実施形態に係る変形例として説明する。なお、第１の実施形態や第２の実施形態や第３の実施形態と同様の点については、適宜説明を省略する。第３の実施形態の変形例に係る情報処理装置１００Ｂは、ブロブ情報記憶部１２５及びブロブ連結グラフ情報記憶部１２６に代えて、ブロブ情報記憶部１２５Ａ及びブロブ連結グラフ情報記憶部１２６Ａを有する。 [3-5. Modified example]
The blob connection graph is not limited to the example described above, and may be configured in various ways. For example, the blob connected graph may be a graph in which representative points of each group (blob) are connected by edges. This point will be explained below as a modification of the third embodiment. Note that descriptions of points similar to the first embodiment, second embodiment, and third embodiment will be omitted as appropriate. The information processing device 100B according to the modification of the third embodiment includes a blob information storage section 125A and a blob connection graph information storage section 126A instead of the blob information storage section 125 and the blob connection graph information storage section 126.

〔３－５－１．情報処理〕
まず、図２９を用いて、変形例に係る情報処理の概要を説明する。図２９は、変形例に係るブロブ連結グラフ情報の一例を示す図である。図２９の例ではブロブがクラスタリングにより生成されている場合を示す。 [3-5-1. Information processing〕
First, an overview of information processing according to a modified example will be explained using FIG. 29. FIG. 29 is a diagram illustrating an example of blob connection graph information according to a modification. The example in FIG. 29 shows a case where blobs are generated by clustering.

図２９の例では、情報処理装置１００Ｂは、各ブロブのセントロイドを連結したブロブ連結グラフを生成する。情報処理装置１００Ｂは、空間情報ＳＰ４１に示すように、各ブロブＢＬ１～ＢＬ１０の各々に対応するセントロイドＣ１～Ｃ１０をエッジで連結したブロブ連結グラフであるグラフＧＲ４２を生成する。情報処理装置１００Ｂは、グラフ生成に関する種々の従来技術を適宜用いてグラフＧＲ４２を生成する。 In the example of FIG. 29, the information processing device 100B generates a blob connection graph that connects the centroids of each blob. The information processing device 100B generates a graph GR42, which is a blob connection graph in which centroids C1 to C10 corresponding to each of the blobs BL1 to BL10 are connected by edges, as shown in the spatial information SP41. The information processing device 100B generates the graph GR42 using various conventional techniques related to graph generation.

例えば、情報処理装置１００Ｂは、ブロブＢＬ１のセントロイドＣ１がブロブＢＬ２のセントロイドＣ２、ブロブＢＬ３のセントロイドＣ３、ブロブＢＬ４のセントロイドＣ４、ブロブＢＬ７のセントロイドＣ７、及びブロブＢＬ８のセントロイドＣ８の５つのセントロイドにエッジで連結されたグラフＧＲ４２を生成する。また、例えば、情報処理装置１００Ｂは、ブロブＢＬ２のセントロイドＣ２が、ブロブＢＬ１のセントロイドＣ１、ブロブＢＬ３のセントロイドＣ３、及びブロブＢＬ４のセントロイドＣ４の３つのセントロイドにエッジで連結されたグラフＧＲ４２を生成する。 For example, in the information processing device 100B, the centroid C1 of the blob BL1 is the centroid C2 of the blob BL2, the centroid C3 of the blob BL3, the centroid C4 of the blob BL4, the centroid C7 of the blob BL7, and the centroid C8 of the blob BL8. A graph GR42 is generated which is connected to the five centroids of by edges. For example, the information processing device 100B connects the centroid C2 of the blob BL2 to three centroids, the centroid C1 of the blob BL1, the centroid C3 of the blob BL3, and the centroid C4 of the blob BL4, by edges. Generate graph GR42.

なお、情報処理装置１００Ｂは、どのような方法によりグラフＧＲ４２を生成してもよい。情報処理装置１００Ｂは、様々なインデックスを用いて、グラフＧＲ４２を生成してもよい。例えば、情報処理装置１００Ｂは、オブジェクトノードが連結されたグラフＧＲ２１を用いて、グラフＧＲ４２を生成してもよい。この場合、情報処理装置１００Ｂは、図２３中のグラフＧＲ４１と同様の処理によりグラフＧＲ４２を生成できるため、詳細な説明は省略する。例えば、情報処理装置１００Ｂは、図２３に示す例と同様の処理により、グラフＧＲ４２を生成してもよい。なお、グラフＧＲ４２は無向（双方向）エッジを一例として示すが、有向エッジであってもよい。 Note that the information processing device 100B may generate the graph GR42 using any method. The information processing device 100B may generate the graph GR42 using various indexes. For example, the information processing device 100B may generate the graph GR42 using the graph GR21 in which object nodes are connected. In this case, the information processing apparatus 100B can generate the graph GR42 by the same process as the graph GR41 in FIG. 23, so detailed explanation will be omitted. For example, the information processing device 100B may generate the graph GR42 by a process similar to the example shown in FIG. 23. Although the graph GR42 shows an undirected (bidirectional) edge as an example, it may be a directed edge.

このように、変形例に係る情報処理装置１００Ｂは、セントロイドのグラフをブロブ連結グラフとして生成する。そして、情報処理装置１００Ｂは、生成したブロブ連結グラフを用いてセントロイドで通常の検索を行い、距離計算はブロブ単位で行う。このように、情報処理装置１００Ｂは、ブロブをセントロイドで代表させることで、簡便な処理とすることができる。なお、情報処理装置１００Ｂは、ブロブがクラスタリングではない場合には、各ブロブに属するオブジェクトを基に中心点を算出し、算出した中心点を代表点として用いてもよい。 In this way, the information processing device 100B according to the modification generates a centroid graph as a blob connection graph. Then, the information processing device 100B performs a normal search using the centroid using the generated blob connection graph, and performs distance calculation on a blob-by-blob basis. In this way, the information processing apparatus 100B can simplify processing by representing blobs with centroids. Note that if the blobs are not clustered, the information processing device 100B may calculate the center point based on the objects belonging to each blob, and use the calculated center point as the representative point.

図２９の例では、情報処理装置１００Ｂは、空間情報ＳＰ４２に示すように、ブロブの代表点であるセントロイド間がエッジで連結されたグラフＧＲ４２を用いて、検索処理を行う。例えば、情報処理装置１００Ｂは、検索クエリＱＥ２を対象として、グラフＧＲ２１を用いた図２７及び図２８に示すような検索処理を行うことにより、検索クエリＱＥ２の検索結果を得る。この点については、上述した例と同様であり詳細な説明は省略する。 In the example of FIG. 29, the information processing device 100B performs a search process using a graph GR42 in which centroids, which are representative points of blobs, are connected by edges, as shown in spatial information SP42. For example, the information processing device 100B obtains the search results for the search query QE2 by performing search processing as shown in FIGS. 27 and 28 using the graph GR21 for the search query QE2. This point is similar to the example described above, and detailed explanation will be omitted.

〔３－５－２．情報〕
次に、図３０及び図３１を用いて、変形例に係る情報の概要を説明する。図３０は、変形例に係るブロブ情報記憶部の一例を示す図である。図３１は、変形例に係るブロブ連結グラフ情報記憶部の一例を示す図である。 [3-5-2. information〕
Next, an outline of the information related to the modified example will be explained using FIGS. 30 and 31. FIG. 30 is a diagram illustrating an example of a blob information storage unit according to a modification. FIG. 31 is a diagram illustrating an example of a blob connection graph information storage unit according to a modification.

まず、図３０に示す変形例に係るブロブ情報記憶部１２５Ａについて説明する。図３０に示す変形例に係るブロブ情報記憶部１２５Ａは、「ブロブＩＤ」、「ノードＩＤ」、「ベクトル情報」、および「セントロイドＩＤ」といった項目が含まれる。このように、変形例に係るブロブ情報記憶部１２５Ａは、「セントロイドＩＤ」を有する点で図１５のブロブ情報記憶部１２５と相違する。なお、図１５のブロブ情報記憶部１２５と同様の点については適宜説明を省略する。 First, a blob information storage unit 125A according to a modification shown in FIG. 30 will be described. The blob information storage unit 125A according to the modification shown in FIG. 30 includes items such as "blob ID", "node ID", "vector information", and "centroid ID". In this way, the blob information storage unit 125A according to the modified example is different from the blob information storage unit 125 of FIG. 15 in that it has a "centroid ID." Note that descriptions of the same points as those of the blob information storage unit 125 in FIG. 15 will be omitted as appropriate.

「セントロイドＩＤ」は、各ブロブのセントロイドを識別するための識別情報を示す。「ベクトル情報」は、ブロブの代表点であるセントロイドのベクトル情報を示す。 "Centroid ID" indicates identification information for identifying the centroid of each blob. "Vector information" indicates vector information of a centroid, which is a representative point of a blob.

図３０に示す例では、ブロブＩＤ「ＢＬ１」により識別されるブロブＢＬ１のセントロイドは、セントロイドＩＤ「Ｃ１」により識別されるセントロイド（セントロイドＣ１）であることを示す。ブロブＩＤ「ＢＬ９」により識別されるブロブＢＬ９のセントロイドは、セントロイドＩＤ「Ｃ９」により識別されるセントロイド（セントロイドＣ９）であることを示す。 The example shown in FIG. 30 indicates that the centroid of blob BL1 identified by blob ID "BL1" is the centroid (centroid C1) identified by centroid ID "C1". The centroid of blob BL9 identified by blob ID "BL9" indicates that it is the centroid (centroid C9) identified by centroid ID "C9".

なお、ブロブ情報記憶部１２５Ａは、上記に限らず、目的に応じて種々の情報を記憶してもよい。 Note that the blob information storage unit 125A is not limited to the above, and may store various information depending on the purpose.

次に、図３１に示す変形例に係るブロブ連結グラフ情報記憶部１２６Ａについて説明する。図３０に示す変形例に係るブロブ連結グラフ情報記憶部１２６Ａは、「セントロイドＩＤ」および「接続セントロイド情報」といった項目を有する。なお、図２５のブロブ連結グラフ情報記憶部１２６と同様の点については適宜説明を省略する。 Next, a blob connection graph information storage unit 126A according to a modification shown in FIG. 31 will be described. The blob connection graph information storage unit 126A according to the modification shown in FIG. 30 has items such as "centroid ID" and "connection centroid information." Note that descriptions of points similar to those of the blob connected graph information storage unit 126 in FIG. 25 will be omitted as appropriate.

「セントロイドＩＤ」は、グラフにおける各ブロブの代表点であるセントロイドを識別するための識別情報を示す。「接続セントロイド情報」は、対応するセントロイドから辿ることができるセントロイド（参照先のセントロイド）に関する情報を示す。例えば、「接続セントロイド情報」には、「参照先」といった情報が含まれる。「参照先」は、エッジにより連結され、そのセントロイドから辿ることができる参照先（セントロイド）を識別するための情報を示す。すなわち、図３１の例では、セントロイドを識別するセントロイドＩＤに対して、そのセントロイドからエッジにより辿ることができる参照先（セントロイド）が対応付けられて登録されている。なお、「接続セントロイド情報」には、参照先に接続されるエッジを識別するための情報（エッジＩＤ）等が含まれてもよい。 "Centroid ID" indicates identification information for identifying a centroid that is a representative point of each blob in the graph. “Connection centroid information” indicates information regarding a centroid (reference destination centroid) that can be traced from a corresponding centroid. For example, "connection centroid information" includes information such as "reference destination." “Reference destination” indicates information for identifying a reference destination (centroid) that is connected by an edge and can be traced from the centroid. That is, in the example of FIG. 31, a reference destination (centroid) that can be traced from the centroid by an edge is registered in association with a centroid ID that identifies the centroid. Note that the "connection centroid information" may include information (edge ID) for identifying an edge connected to a reference destination.

図３１の例では、セントロイドＩＤ「Ｃ１」により識別されるセントロイド（セントロイドＣ１）からは、セントロイドＩＤ「Ｃ２」、「Ｃ３」、「Ｃ４」、「Ｃ７」、「Ｃ８」の各々により識別される５つのセントロイドにエッジが連結されていることを示す。すなわち、セントロイドＣ１からは、セントロイドＣ２、Ｃ３、Ｃ４、Ｃ７、Ｃ８の５つのセントロイドの各々へ辿ることができることを示す。 In the example of FIG. 31, from the centroid (centroid C1) identified by the centroid ID "C1", each of the centroid IDs "C2", "C3", "C4", "C7", and "C8" The edge is connected to the five centroids identified by . In other words, it is shown that each of the five centroids C2, C3, C4, C7, and C8 can be traced from the centroid C1.

なお、ブロブ連結グラフ情報記憶部１２６Ａは、上記に限らず、目的に応じて種々の情報を記憶してもよい。また、ブロブ連結グラフは、クエリを入力とし、ブロブ連結グラフ中のエッジを辿ることによりオブジェクト（オブジェクトノード）を探索し、クエリに類似するオブジェクトを抽出し出力するプログラムモジュールを含んでもよい。すなわち、ブロブ連結グラフは、ブロブ連結グラフを用いて検索処理を行うプログラムモジュールとしての利用が想定されるものであってもよい。例えば、ブロブ連結グラフであるグラフＧＲ４２は、クエリとしてベクトルデータが入力された場合に、そのベクトルデータに類似するベクトルデータに対応するオブジェクトをブロブ連結グラフにより抽出し、出力するプログラムであってもよい。例えば、グラフＧＲ４２は、クエリ画像に対応する類似画像を検索するプログラムモジュールとして利用されるデータであってもよい。例えば、グラフＧＲ４２は、入力されたクエリに基づいて、ブロブ連結グラフにおいてそのクエリに類似するオブジェクトを抽出し、出力するよう、コンピュータを機能させる。 Note that the blob connection graph information storage unit 126A is not limited to the above, and may store various information depending on the purpose. The blob connection graph may also include a program module that receives a query as input, searches for objects (object nodes) by tracing edges in the blob connection graph, and extracts and outputs objects similar to the query. That is, the blob connection graph may be assumed to be used as a program module that performs search processing using the blob connection graph. For example, the graph GR42, which is a blob connection graph, may be a program that, when vector data is input as a query, extracts and outputs objects corresponding to vector data similar to the vector data using a blob connection graph. . For example, the graph GR42 may be data used as a program module that searches for similar images corresponding to a query image. For example, based on an input query, the graph GR 42 causes the computer to extract and output objects similar to the query in the blob connection graph.

上述したように、各実施形態及び変形例において、情報処理システム１は、以下のような処理を実行する。例えば、情報処理システム１は、ベクトルのクラスタリングでブロブを生成する。例えば、情報処理システム１は、ブロブのセントロイドをノードとしてグラフインデックスを生成する。 As described above, in each embodiment and modification, the information processing system 1 executes the following processing. For example, the information processing system 1 generates blobs by clustering vectors. For example, the information processing system 1 generates a graph index using the centroid of a blob as a node.

例えば、情報処理システム１は、任意のベクトルを一定数取得して、直積量子化のクラスタリングを行う。つまり、情報処理システム１は、部分ベクトルに分割して、部分ベクトル単位にクラスタリングを行い、コードブックを生成する。 For example, the information processing system 1 acquires a certain number of arbitrary vectors and performs clustering using Cartesian product quantization. That is, the information processing system 1 divides the vector into partial vectors, performs clustering for each partial vector, and generates a codebook.

例えば、情報処理システム１は、ブロブ単位に属するオブジェクトを直積量子化し、転置インデックスを生成する。例えば、情報処理システム１は、オブジェクトからブロブへのエッジを含むグラフ（ブロブ用グラフ）を生成する。例えば、情報処理システム１は、一般的な直積量子化の手法を用により近傍オブジェクトを検索する。ここでいう一般的な直積量子化の手法とは、例えば、オブジェクトとブロブのすべてのセントロイドとの距離を算出し上位一定数のブロブを取得した後に、ブロブ内のオブジェクトの距離を直積量子化で距離を求めて近傍オブジェクトを取得することであってもよい。なお、情報処理システム１は、上述したように、セントロイドのグラフを生成している場合はグラフを用いて一定数のブロブを検索して取得することが可能である。または、情報処理システム１は、上記した、オブジェクトからブロブへのエッジを含むグラフ（ブロブ用グラフ）ではなく、ブロブ間をエッジで連結したグラフ（ブロブ連結グラフ）を生成してもよい。 For example, the information processing system 1 performs product quantization on objects belonging to blob units and generates transposed indexes. For example, the information processing system 1 generates a graph (blob graph) including edges from objects to blobs. For example, the information processing system 1 searches for nearby objects using a general product quantization method. The general method of direct product quantization mentioned here is, for example, after calculating the distances between objects and all centroids of blobs and obtaining a certain number of top blobs, the distances of objects within the blobs are subjected to direct product quantization. It may also be possible to obtain nearby objects by calculating the distance. Note that, as described above, when the information processing system 1 generates a centroid graph, it is possible to search for and obtain a certain number of blobs using the graph. Alternatively, the information processing system 1 may generate a graph in which blobs are connected by edges (blob connection graph) instead of the above-described graph including edges from objects to blobs (blob graph).

〔４．第４の実施形態〕
上述した例では、オブジェクトを分類するグループの一例であるブロブを用いる場合を示したが、オブジェクトを分類するグループはブロブのみに限らず、オブジェクトを分類する様々なグラフが用いられてもよい。例えば、ブロブ（第１グループ）以外にもオブジェクトを量子化する単位となる第２グループであるクラスタが用いられてもよい。 [4. Fourth embodiment]
In the above example, a case is shown in which blobs, which are an example of groups for classifying objects, are used, but the groups for classifying objects are not limited to blobs, and various graphs for classifying objects may be used. For example, in addition to blobs (the first group), clusters, which are the second group that serve as units for quantizing objects, may be used.

この点について、以下、第４の実施形態として説明する。第４の実施形態においては、情報処理システム１は、情報処理装置１００、情報処理装置１００Ａまたは情報処理装置１００Ｂに代えて、情報処理装置１００Ｃを有する。なお、第１の実施形態、第２の実施形態及び第３の実施形態等において上述した内容と同様の点については適宜説明を省略する。 This point will be described below as a fourth embodiment. In the fourth embodiment, the information processing system 1 includes an information processing device 100C instead of the information processing device 100, the information processing device 100A, or the information processing device 100B. Note that descriptions of points similar to those described above in the first embodiment, second embodiment, third embodiment, etc. will be omitted as appropriate.

情報処理装置１００Ｃは、複数のオブジェクトを分類する複数のブロブを示すブロブ情報（第１グループ情報）と、ブロブとは異なる分類となるグループであるクラスタを示すクラスタ情報（第２グループ情報）とを生成する。 The information processing device 100C stores blob information (first group information) indicating a plurality of blobs that classify a plurality of objects, and cluster information (second group information) indicating a cluster, which is a group classified differently from the blobs. generate.

〔４－１．情報処理〕
まず、図３２を用いて、第４の実施形態に係る情報処理の概要を説明する。図３２は、第４の実施形態に係る情報処理の一例を示す図である。なお、図３２では、情報処理装置１００Ｃがオブジェクトを複数のブロブに分類し、そのブロブを基にクラスタに分類する場合を示すが、クラスタに分類した後にブロブに分類してもよいが、この点については後述する。 [4-1. Information processing〕
First, an overview of information processing according to the fourth embodiment will be explained using FIG. 32. FIG. 32 is a diagram illustrating an example of information processing according to the fourth embodiment. Note that although FIG. 32 shows a case where the information processing device 100C classifies objects into a plurality of blobs and classifies them into clusters based on the blobs, it is also possible to classify objects into blobs after classifying them into clusters. This will be discussed later.

まず、情報処理装置１００Ｃは、複数のオブジェクトを複数のブロブに分類するブロブ情報を生成する（ステップＳ７１）。図３２の例では、情報処理装置１００Ｃは、任意の方法により、複数のオブジェクトをブロブＢＬ１～ＢＬ１０等に分類するブロブ情報を生成する。情報処理装置１００Ｃは、ｋ－ｍｅａｎｓ等の任意のクラスタリングにより各ノードをブロブＢＬ１～ＢＬ１０のいずれかに分類してもよい。なお、情報処理装置１００Ｃは、ブロブを生成可能であれば、どのような方法によりブロブを生成してもよい。例えば、情報処理装置１００Ｃは、グラフ等のインデックスを用いて、ブロブを生成してもよい。例えば、情報処理装置１００Ｃは、上述した第１方法～第４方法等に任意のいずれかの方法によりブロブＢＬ１～ＢＬ１０等を示す情報を生成してもよい。 First, the information processing device 100C generates blob information that classifies a plurality of objects into a plurality of blobs (step S71). In the example of FIG. 32, the information processing device 100C generates blob information that classifies a plurality of objects into blobs BL1 to BL10, etc., using an arbitrary method. The information processing device 100C may classify each node into one of the blobs BL1 to BL10 using arbitrary clustering such as k-means. Note that the information processing device 100C may use any method to generate blobs as long as it is capable of generating blobs. For example, the information processing device 100C may generate a blob using an index such as a graph. For example, the information processing device 100C may generate information indicating the blobs BL1 to BL10, etc. using any of the first to fourth methods described above.

なお、図３２では、情報処理装置１００Ｃは、空間情報ＳＰ４１に示すように、ブロブ連結グラフであるグラフＧＲ４１を生成するが、グラフＧＲ４１を生成しなくてもよい。例えば、情報処理装置１００Ｃは、グラフＧＲ４１を生成する場合、図２３と同様の情報をもちいた処理によりグラフＧＲ４１を生成するが、図２３での説明と同様であるため詳細な説明を省略する。 Note that in FIG. 32, the information processing device 100C generates the graph GR41, which is a blob connection graph, as shown in the spatial information SP41, but it is not necessary to generate the graph GR41. For example, when generating the graph GR41, the information processing device 100C generates the graph GR41 by processing using the same information as in FIG. 23, but since it is the same as the explanation in FIG. 23, detailed explanation will be omitted.

そして、情報処理装置１００Ｃは、複数のブロブを複数のクラスタに分類するクラスタ情報を生成する（ステップＳ７２）。例えば、情報処理装置１００Ｃは、任意の方法により、ブロブＢＬ１～ＢＬ１０等の複数のブロブを分類するクラスタを示すクラスタ情報を生成する。情報処理装置１００Ｃは、ｋ－ｍｅａｎｓ等の任意のクラスタリングによりブロブＢＬ１～ＢＬ１０等の複数のクラスタのいずれかに分類してもよい。なお、情報処理装置１００Ｃは、ブロブを生成可能であれば、どのような方法によりブロブを生成してもよい。例えば、情報処理装置１００Ｃは、グラフ等のインデックスを用いて、ブロブを生成してもよい。 The information processing device 100C then generates cluster information for classifying the plurality of blobs into a plurality of clusters (step S72). For example, the information processing device 100C generates cluster information indicating a cluster for classifying a plurality of blobs such as blobs BL1 to BL10 using an arbitrary method. The information processing device 100C may classify the blobs into any one of a plurality of clusters such as blobs BL1 to BL10 using arbitrary clustering such as k-means. Note that the information processing device 100C may use any method to generate blobs as long as it is capable of generating blobs. For example, the information processing device 100C may generate a blob using an index such as a graph.

図３２の例では、情報処理装置１００Ｃは、ブロブＢＬ１～ＢＬ１０等のクラスタＣＬ１～ＣＬ４等に分類するクラスタ情報を生成する。例えば、情報処理装置１００Ｃは、ブロブＢＬ１及びブロブＢＬ７をクラスタＣＬ１に分類する。すなわち、情報処理装置１００Ｃは、ブロブＢＬ１に属するオブジェクト、及びブロブＢＬ７に属するオブジェクトをクラスタＣＬ１に分類する。なお、図３２及び図３４では、ブロブの境界線がクラスタの境界線上にあるように図示したが、分割の方法によっては一致しない場合があり、また方法によっては空間が境界線で分割されるとも限らない。 In the example of FIG. 32, the information processing device 100C generates cluster information that classifies blobs BL1 to BL10 and the like into clusters CL1 to CL4. For example, the information processing device 100C classifies blob BL1 and blob BL7 into cluster CL1. That is, the information processing device 100C classifies objects belonging to blob BL1 and objects belonging to blob BL7 into cluster CL1. Although the blob boundaries are shown to be on the cluster boundaries in FIGS. 32 and 34, they may not match depending on the dividing method, and depending on the dividing method, the space may be divided by the boundary lines. Not exclusively.

また、情報処理装置１００Ｃは、ブロブＢＬ２及びブロブＢＬ４をクラスタＣＬ２に分類する。すなわち、情報処理装置１００Ｃは、ブロブＢＬ２に属するオブジェクト、及びブロブＢＬ４に属するオブジェクトをクラスタＣＬ２に分類する。 Further, the information processing device 100C classifies the blob BL2 and the blob BL4 into a cluster CL2. That is, the information processing device 100C classifies objects belonging to blob BL2 and objects belonging to blob BL4 into cluster CL2.

また、情報処理装置１００Ｃは、ブロブＢＬ３、ブロブＢＬ８及びブロブＢＬ１０をクラスタＣＬ３に分類する。すなわち、情報処理装置１００Ｃは、ブロブＢＬ３に属するオブジェクト、ブロブＢＬ８に属するオブジェクト、及びブロブＢＬ１０に属するオブジェクトをクラスタＣＬ３に分類する。 Furthermore, the information processing device 100C classifies blob BL3, blob BL8, and blob BL10 into cluster CL3. That is, the information processing device 100C classifies objects belonging to blob BL3, objects belonging to blob BL8, and objects belonging to blob BL10 into cluster CL3.

また、情報処理装置１００Ｃは、ブロブＢＬ５、ブロブＢＬ６及びブロブＢＬ９をクラスタＣＬ４に分類する。すなわち、情報処理装置１００Ｃは、ブロブＢＬ５に属するオブジェクト、ブロブＢＬ６に属するオブジェクト、及びブロブＢＬ９に属するオブジェクトをクラスタＣＬ４に分類する。 Furthermore, the information processing device 100C classifies blob BL5, blob BL6, and blob BL9 into cluster CL4. That is, the information processing device 100C classifies objects belonging to blob BL5, objects belonging to blob BL6, and objects belonging to blob BL9 into cluster CL4.

例えば、情報処理装置１００Ｃは、クラスタ情報を用いて量子化に関する処理を実行する。情報処理装置１００Ｃは、クラスタ情報を用いて複数のオブジェクトを対象とする量子化に関する処理を行う。例えば、情報処理装置１００Ｃは、クラスタＣＬ１～ＣＬ４の各々に属するオブジェクトごとに量子化を行うが、この点については後述する。 For example, the information processing device 100C uses cluster information to perform processing related to quantization. The information processing device 100C performs processing related to quantization targeting a plurality of objects using cluster information. For example, the information processing device 100C performs quantization for each object belonging to each of the clusters CL1 to CL4, but this point will be described later.

上記のように、情報処理装置１００Ｃは、データ検索の対象となる複数のオブジェクトを分類する複数のブロブを示すブロブ情報と、複数のブロブとは異なる分類となる複数のクラスタを示すクラスタ情報とを生成する。これにより、情報処理装置１００Ｃは、複数のオブジェクトを分類する２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 As described above, the information processing device 100C stores blob information indicating a plurality of blobs that classify a plurality of objects that are data search targets, and cluster information indicating a plurality of clusters that are classified differently from the plurality of blobs. generate. Thereby, the information processing apparatus 100C can generate two types of groups for classifying a plurality of objects, and can generate groups for classifying objects.

〔４－１－１．他の処理例〕
なお、図３２に示した処理は一例に過ぎず、情報処理装置１００Ｃは、ブロブ情報及びクラスタ情報を生成可能であれば、どのような態様の処理により生成を行ってもよい。 [4-1-1. Other processing examples]
Note that the process shown in FIG. 32 is only an example, and the information processing apparatus 100C may generate blob information and cluster information using any type of process as long as it is capable of generating the blob information and cluster information.

例えば、情報処理装置１００Ｃは、クラスタに分類するクラスタ情報を生成した後に、ブロブに分類するブロブ情報を生成してもよい。この場合、情報処理装置１００Ｃは、複数のオブジェクトをクラスタに分類するクラスタリングにより、各オブジェクトを複数のクラスタのいずれかの分類するクラスタ情報を生成してもよい。例えば、情報処理装置１００Ｃは、ｋ－ｍｅａｎｓ等の任意のクラスタリングにより各オブジェクトをクラスタＣＬ１～ＣＬ４のいずれかに分類してもよい。 For example, the information processing device 100C may generate cluster information for classifying into clusters, and then generate blob information for classifying into blobs. In this case, the information processing apparatus 100C may generate cluster information for classifying each object into one of the plurality of clusters by performing clustering in which the plurality of objects are classified into clusters. For example, the information processing device 100C may classify each object into one of clusters CL1 to CL4 using arbitrary clustering such as k-means.

そして、情報処理装置１００Ｃは、クラスタＣＬ１～ＣＬ４の少なくとも１つを分割し、二以上のブロブを生成することにより、ブロブを示すブロブ情報を生成してもよい。例えば、情報処理装置１００Ｃは、各クラスタに属するオブジェクトをｋ－ｍｅａｎｓ等の任意のクラスタリングによってクラスタリングすることにより、各クラスタを分割しブロブを生成する。例えば、情報処理装置１００Ｃは、クラスタＣＬ１を２つのブロブに分割することにより、ブロブＢＬ１及びブロブＢＬ７を生成してもよい。例えば、情報処理装置１００Ｃは、クラスタＣＬ１に属しているオブジェクトをクラスタ数２でｋ－ｍｅａｎｓによりクラスタリングして、クラスタＣＬ１を分割しブロブＢＬ１及びブロブＢＬ７を生成する。 The information processing device 100C may generate blob information indicating the blob by dividing at least one of the clusters CL1 to CL4 and generating two or more blobs. For example, the information processing device 100C divides each cluster and generates blobs by clustering objects belonging to each cluster using arbitrary clustering such as k-means. For example, the information processing device 100C may generate blob BL1 and blob BL7 by dividing cluster CL1 into two blobs. For example, the information processing device 100C clusters objects belonging to the cluster CL1 using k-means with the number of clusters being 2, and divides the cluster CL1 to generate blob BL1 and blob BL7.

例えば、情報処理装置１００Ｃは、クラスタＣＬ２を２つのブロブに分割することにより、ブロブＢＬ２及びブロブＢＬ４を生成してもよい。例えば、情報処理装置１００Ｃは、クラスタＣＬ３を３つのブロブに分割することにより、ブロブＢＬ３、ブロブＢＬ８及びブロブＢＬ１０を生成してもよい。例えば、情報処理装置１００Ｃは、クラスタＣＬ４を３つのブロブに分割することにより、ブロブＢＬ５、ブロブＢＬ６及びブロブＢＬ９を生成してもよい。 For example, the information processing device 100C may generate blob BL2 and blob BL4 by dividing cluster CL2 into two blobs. For example, the information processing device 100C may generate blob BL3, blob BL8, and blob BL10 by dividing cluster CL3 into three blobs. For example, the information processing device 100C may generate blob BL5, blob BL6, and blob BL9 by dividing cluster CL4 into three blobs.

〔４－２．情報処理装置の構成〕
次に、図３３を用いて、第４の実施形態に係る情報処理装置１００Ｃの構成について説明する。図３３は、第４の実施形態に係る情報処理装置の構成例を示す図である。図３３に示すように、情報処理装置１００Ｃは、通信部１１０と、記憶部１２０Ｃと、制御部１３０Ｂとを有する。なお、情報処理装置１００Ｃにおいて、情報処理装置１００、情報処理装置１００Ａまたは情報処理装置１００Ｂと同様の点は適宜説明を省略する。 [4-2. Configuration of information processing device]
Next, the configuration of an information processing apparatus 100C according to the fourth embodiment will be described using FIG. 33. FIG. 33 is a diagram illustrating a configuration example of an information processing apparatus according to the fourth embodiment. As shown in FIG. 33, the information processing device 100C includes a communication section 110, a storage section 120C, and a control section 130B. Note that in the information processing device 100C, descriptions of the same points as the information processing device 100, the information processing device 100A, or the information processing device 100B will be omitted as appropriate.

（記憶部１２０Ｃ）
記憶部１２０Ｃは、例えば、ＲＡＭ、フラッシュメモリ等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。第４の実施形態に係る記憶部１２０Ｃは、図３３に示すように、オブジェクト情報記憶部１２１と、グラフ情報記憶部１２２と、量子化情報記憶部１２３と、コードブック情報記憶部１２４と、ブロブ情報記憶部１２５と、ブロブ連結グラフ情報記憶部１２６と、クラスタ情報記憶部１２７とを有する。 (Storage unit 120C)
The storage unit 120C is realized by, for example, a semiconductor memory device such as a RAM or a flash memory, or a storage device such as a hard disk or an optical disk. As shown in FIG. 33, the storage unit 120C according to the fourth embodiment includes an object information storage unit 121, a graph information storage unit 122, a quantization information storage unit 123, a codebook information storage unit 124, and a blob information storage unit 120. It has an information storage section 125, a blob connection graph information storage section 126, and a cluster information storage section 127.

第４の実施形態に係るクラスタ情報記憶部１２７は、第２グループであるクラスタに関する情報を記憶する。クラスタ情報記憶部１２７は、量子化の単位となるクラスタに関する情報を記憶する。クラスタ情報記憶部１２７は、各クラスタを識別する識別情報（クラスタＩＤ等）に、そのクラスタに属する第１グループであるブロブを示す情報（ブロブＩＤ等）を対応付けて記憶してもよい。また、クラスタ情報記憶部１２７は、各クラスタを識別する識別情報（クラスタＩＤ等）に、そのクラスタに属するオブジェクトを示す情報（オブジェクトＩＤ等）を対応付けて記憶してもよい。また、クラスタ情報記憶部１２７は、各クラスタを識別する識別情報（クラスタＩＤ等）に、そのクラスタのセントロイド等の基点を特定するための情報（基点ＩＤ等）及びそのベクトル等を対応付けて記憶してもよい。 The cluster information storage unit 127 according to the fourth embodiment stores information regarding clusters that are the second group. The cluster information storage unit 127 stores information regarding clusters, which are units of quantization. The cluster information storage unit 127 may store identification information for identifying each cluster (cluster ID, etc.) in association with information (blob ID, etc.) indicating a blob that is the first group belonging to the cluster. Further, the cluster information storage unit 127 may store identification information (such as a cluster ID) for identifying each cluster in association with information (such as an object ID) indicating an object belonging to the cluster. The cluster information storage unit 127 also associates identification information (cluster ID, etc.) for identifying each cluster with information (base point ID, etc.) for specifying a base point such as a centroid of that cluster, its vector, etc. You may memorize it.

（制御部１３０Ｃ）
制御部１３０Ｃは、コントローラ（controller）であり、例えば、ＣＰＵやＭＰＵやＧＰＵ等によって、情報処理装置１００Ｃ内部の記憶装置に記憶されている各種プログラム（情報処理プログラムの一例に相当）がＲＡＭを作業領域として実行されることにより実現される。また、制御部１３０Ｃは、コントローラであり、例えば、ＡＳＩＣやＦＰＧＡ等の集積回路により実現される。 (Control unit 130C)
The control unit 130C is a controller, and allows various programs (corresponding to an example of an information processing program) stored in a storage device inside the information processing device 100C to work on the RAM by, for example, a CPU, MPU, GPU, etc. This is achieved by executing it as a region. Further, the control unit 130C is a controller, and is realized by, for example, an integrated circuit such as an ASIC or an FPGA.

図３３に示すように、制御部１３０Ｃは、取得部１３１と、生成部１３２Ｃと、検索処理部１３３Ｃと、提供部１３４とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部１３０Ｃの内部構成は、図３３に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。 As shown in FIG. 33, the control unit 130C includes an acquisition unit 131, a generation unit 132C, a search processing unit 133C, and a provision unit 134, and realizes or executes information processing functions and operations described below. do. Note that the internal configuration of the control unit 130C is not limited to the configuration shown in FIG. 33, and may be any other configuration as long as it performs information processing to be described later.

第４の実施形態に係る取得部１３１は、データ検索の対象となる複数のオブジェクトを示すオブジェクト情報を取得する。取得部１３１は、複数のオブジェクトを検索対象とするインデックスを示すインデックス情報を取得する。 The acquisition unit 131 according to the fourth embodiment acquires object information indicating a plurality of objects that are data search targets. The acquisition unit 131 acquires index information indicating an index that searches for a plurality of objects.

取得部１３１は、データ検索の対象となる複数のオブジェクトを分類する複数のブロブと、複数のブロブとは異なる分類となるグループであって、複数のオブジェクトのうち、複数のブロブにおいて同じブロブに属するオブジェクトを同じグループに分類するグループである複数のクラスタとを示す処理用情報を取得する。取得部１３１は、複数のオブジェクトを分類する第１数のブロブと、複数のオブジェクトを第１数よりも少ない第２数に分類するクラスタとを示す処理用情報を取得する。取得部１３１は、複数のブロブのうち二以上のブロブの各々に属するオブジェクト群が一のクラスタに分類された複数のクラスタを示す処理用情報を取得する。 The acquisition unit 131 includes a plurality of blobs that classify a plurality of objects that are data search targets, and a group that is classified differently from the plurality of blobs, and among the plurality of objects, the plurality of blobs belong to the same blob. Processing information indicating a plurality of clusters that classify objects into the same group is obtained. The acquisition unit 131 acquires processing information indicating a first number of blobs for classifying a plurality of objects and clusters for classifying a plurality of objects into a second number smaller than the first number. The acquisition unit 131 acquires processing information indicating a plurality of clusters in which object groups belonging to each of two or more blobs among the plurality of blobs are classified into one cluster.

取得部１３１は、複数のオブジェクトを対象とする検索処理における一括処理である第１処理を実行するために用いられる複数のブロブを示す処理用情報を取得する。取得部１３１は、複数のオブジェクトを対象とする量子化に関する処理である第２処理を行うために用いられる複数のクラスタを示す処理用情報を取得する。 The acquisition unit 131 acquires processing information indicating a plurality of blobs used to execute a first process which is a batch process in a search process targeting a plurality of objects. The acquisition unit 131 acquires processing information indicating a plurality of clusters used to perform second processing, which is processing related to quantization targeting a plurality of objects.

取得部１３１は、複数のブロブの各々を識別する複数の第１識別情報と、複数のクラスタの各々を識別する複数の第２識別情報とを対応付けた処理用情報を取得する。取得部１３１は、一の第２識別情報が二以上の第１識別情報に対応付けられた処理用情報を取得する。 The acquisition unit 131 acquires processing information that associates a plurality of first identification information that identifies each of a plurality of blobs with a plurality of second identification information that identifies each of a plurality of clusters. The acquisition unit 131 acquires processing information in which one piece of second identification information is associated with two or more pieces of first identification information.

取得部１３１は、複数のオブジェクトの各々が属するブロブの第１識別情報に対応付けられた処理用情報を取得する。取得部１３１は、複数のオブジェクトの各々が量子化された量子化後オブジェクト情報が第１識別情報に対応付けられた処理用情報を取得する。取得部１３１は、直積量子化により、複数のオブジェクトの各々に対応するベクトルを分割した各部分ベクトルを量子化するベクトル量子化により生成された量子化後オブジェクト情報が、第１識別情報に対応付けられた処理用情報を取得する。取得部１３１は、量子化に対応する各セントロイドの各部分領域に対応する各部分セントロイドに基づいて生成された量子化後オブジェクト情報が、第１識別情報に対応付けられた処理用情報を取得する。取得部１３１は、量子化に対応する各セントロイドと、当該各セントロイドの各部分領域内に対応する各部分セントロイドと、により生成される残差ベクトルに基づいて生成された量子化後オブジェクト情報が、第１識別情報に対応付けられた処理用情報を取得する。 The acquisition unit 131 acquires processing information associated with first identification information of a blob to which each of the plurality of objects belongs. The acquisition unit 131 acquires processing information in which quantized object information obtained by quantizing each of a plurality of objects is associated with first identification information. The acquisition unit 131 associates the quantized object information generated by vector quantization, which quantizes each partial vector obtained by dividing the vector corresponding to each of the plurality of objects, with the first identification information by direct product quantization. Get the information for processing. The acquisition unit 131 allows the quantized object information generated based on each partial centroid corresponding to each partial region of each centroid corresponding to quantization to obtain processing information associated with the first identification information. get. The acquisition unit 131 generates a quantized object based on a residual vector generated by each centroid corresponding to quantization and each partial centroid corresponding in each partial region of each centroid. The information acquires processing information associated with the first identification information.

（生成部１３２Ｃ）
生成部１３２Ｃは、生成部１３２、生成部１３２Ａまたは生成部１３２Ｂと同様に各種情報を生成する。 (Generation unit 132C)
The generation unit 132C generates various information in the same way as the generation unit 132, the generation unit 132A, or the generation unit 132B.

生成部１３２Ｃは、取得部１３１により取得されたオブジェクト情報に基づいて、複数のオブジェクトを分類する複数のブロブを示すブロブ情報を生成する。生成部１３２Ｃは、複数のブロブとは異なる分類となるグループであって、複数のオブジェクトのうち、複数のブロブにおいて同じブロブに属するオブジェクトを同じグループに分類するグループである複数のクラスタを示すクラスタ情報を生成する。 The generation unit 132C generates blob information indicating a plurality of blobs that classify a plurality of objects based on the object information acquired by the acquisition unit 131. The generating unit 132C generates cluster information indicating a plurality of clusters, which are groups that are classified differently from a plurality of blobs, and which are groups that classify objects belonging to the same blob among a plurality of blobs into the same group. generate.

生成部１３２Ｃは、複数のオブジェクトを第１数のブロブに分類するブロブ情報と、複数のオブジェクトを第１数よりも少ない第２数のクラスタに分類するクラスタ情報とを生成する。生成部１３２Ｃは、複数のブロブのうち二以上のブロブの各々に属するオブジェクト群が一のクラスタに分類された複数のクラスタを示すクラスタ情報を生成する。 The generating unit 132C generates blob information that classifies a plurality of objects into a first number of blobs, and cluster information that classifies a plurality of objects into a second number of clusters that are smaller than the first number. The generating unit 132C generates cluster information indicating a plurality of clusters in which a group of objects belonging to each of two or more blobs among the plurality of blobs is classified into one cluster.

生成部１３２Ｃは、複数のオブジェクトを対象とする検索処理において一括処理を行うために用いられる複数のブロブを示すブロブ情報を生成する。生成部１３２Ｃは、複数のオブジェクトに関する距離の算出を一括して行うために用いられる複数のブロブを示すブロブ情報を生成する。生成部１３２Ｃは、検索処理で用いられる検索クエリとオブジェクトとの距離の算出を並列処理するために用いられる複数のブロブを示すブロブ情報を生成する。 The generation unit 132C generates blob information indicating a plurality of blobs used for performing batch processing in a search process targeting a plurality of objects. The generation unit 132C generates blob information indicating a plurality of blobs used to collectively calculate distances regarding a plurality of objects. The generation unit 132C generates blob information indicating a plurality of blobs used to calculate the distance between the search query and the object used in the search process in parallel.

生成部１３２Ｃは、複数のオブジェクトを対象とする量子化に関する処理を行うために用いられる複数のクラスタを示すクラスタ情報を生成する。生成部１３２Ｃは、量子化の単位となる複数のクラスタを示すクラスタ情報を生成する。生成部１３２Ｃは、複数のオブジェクトの各々に対応するベクトルを量子化するために用いられる複数のクラスタを示すクラスタ情報を生成する。 The generation unit 132C generates cluster information indicating a plurality of clusters used to perform processing related to quantization targeting a plurality of objects. The generating unit 132C generates cluster information indicating a plurality of clusters that are units of quantization. The generating unit 132C generates cluster information indicating a plurality of clusters used to quantize vectors corresponding to each of a plurality of objects.

生成部１３２Ｃは、複数のブロブの分類後に、複数のクラスタの分類を行う。生成部１３２Ｃは、複数のオブジェクトをクラスタリングすることにより複数のブロブを示すブロブ情報を生成する。生成部１３２Ｃは、インデックス情報を用いて、複数のブロブを示すブロブ情報を生成する。生成部１３２Ｃは、ブロブを示すブロブ情報を用いて、複数のクラスタを示すクラスタ情報を生成する。生成部１３２Ｃは、複数のブロブをクラスタリングすることにより複数のクラスタを示すクラスタ情報を生成する。 After classifying the plurality of blobs, the generation unit 132C performs classification of the plurality of clusters. The generation unit 132C generates blob information indicating a plurality of blobs by clustering a plurality of objects. The generation unit 132C uses the index information to generate blob information indicating a plurality of blobs. The generation unit 132C generates cluster information indicating a plurality of clusters using blob information indicating blobs. The generation unit 132C generates cluster information indicating a plurality of clusters by clustering a plurality of blobs.

生成部１３２Ｃは、複数のクラスタの分類後に、複数のブロブの分類を行う。生成部１３２Ｃは、複数のオブジェクトをクラスタリングすることにより複数のクラスタを示すクラスタ情報を生成する。生成部１３２Ｃは、クラスタを示すクラスタ情報を用いて、複数のブロブを示すブロブ情報を生成する。生成部１３２Ｃは、複数のクラスタのうち、少なくとも一のクラスタを分割し、二以上のブロブを生成することにより、複数のブロブを示すブロブ情報を生成する。生成部１３２Ｃは、各第２グループに属するオブジェクトをクラスタリングすることにより、各第２グループを二以上の第１グループに分割することにより、複数の第１グループを示す第１グループ情報を生成する。 After classifying the plurality of clusters, the generation unit 132C performs classification of the plurality of blobs. The generating unit 132C generates cluster information indicating a plurality of clusters by clustering a plurality of objects. The generating unit 132C generates blob information indicating a plurality of blobs using cluster information indicating a cluster. The generating unit 132C generates blob information indicating a plurality of blobs by dividing at least one cluster among the plurality of clusters and generating two or more blobs. The generation unit 132C generates first group information indicating a plurality of first groups by clustering objects belonging to each second group and dividing each second group into two or more first groups.

（検索処理部１３３Ｃ）
検索処理部１３３Ｃは、ブロブに属するオブジェクトを対象とする第１処理、及びクラスタに属するオブジェクトを対象とする第２処理を実行する処理部として機能する。検索処理部１３３Ｃは、検索処理部１３３、検索処理部１３３Ａまたは検索処理部１３３Ｂと同様に検索処理に関する各種処理を行う。検索処理部１３３Ｃは、生成部１３２Ｃにより生成された情報を用いた処理を行う。 (Search processing unit 133C)
The search processing unit 133C functions as a processing unit that executes a first process that targets objects belonging to a blob and a second process that targets objects that belong to a cluster. The search processing unit 133C performs various processes related to search processing in the same way as the search processing unit 133, the search processing unit 133A, or the search processing unit 133B. The search processing unit 133C performs processing using the information generated by the generation unit 132C.

検索処理部１３３Ｃは、取得部１３１により取得された処理用情報を用いて、ブロブに属するオブジェクトを対象とする第１処理と、クラスタに属するオブジェクトを対象とする第２処理とを実行する。検索処理部１３３Ｃは、一のクラスタに属するオブジェクト群を対象に第２処理を実行する。検索処理部１３３Ｃは、一のブロブに属するオブジェクト群を対象に第１処理を実行する。 The search processing unit 133C uses the processing information acquired by the acquisition unit 131 to execute a first process targeting objects belonging to a blob and a second process targeting objects belonging to a cluster. The search processing unit 133C executes the second process on a group of objects belonging to one cluster. The search processing unit 133C performs a first process on a group of objects belonging to one blob.

検索処理部１３３Ｃは、一のブロブに属するオブジェクト群を対象に距離の算出を一括して行う第１処理を実行する。検索処理部１３３Ｃは、一のブロブに属するオブジェクト群の各々と、検索処理で用いられる検索クエリとの距離の算出の並列処理である第１処理を実行する。 The search processing unit 133C executes a first process of collectively calculating distances for a group of objects belonging to one blob. The search processing unit 133C executes a first process that is a parallel process of calculating distances between each object group belonging to one blob and a search query used in the search process.

検索処理部１３３Ｃは、一のクラスタに属するオブジェクト群を対象に第２処理を実行する。検索処理部１３３Ｃは、一のクラスタに属するオブジェクト群を対象に量子化である第２処理を実行する。検索処理部１３３Ｃは、一のクラスタに属するオブジェクト群の各々に対応するベクトルを量子化する第２処理を実行する。 The search processing unit 133C executes the second process on a group of objects belonging to one cluster. The search processing unit 133C executes second processing, which is quantization, on a group of objects belonging to one cluster. The search processing unit 133C executes a second process of quantizing the vectors corresponding to each of the object groups belonging to one cluster.

検索処理部１３３Ｃは、一のクラスタに二以上のブロブが属する場合、当該二以上のブロブに属するオブジェクト群を対象として第２処理を実行する。検索処理部１３３Ｃは、一のクラスタに二以上のブロブが属する場合、当該二以上のブロブに属する全オブジェクトを対象として量子化を実行する。 When two or more blobs belong to one cluster, the search processing unit 133C executes the second process on a group of objects that belong to the two or more blobs. When two or more blobs belong to one cluster, the search processing unit 133C performs quantization on all objects belonging to the two or more blobs.

検索処理部１３３Ｃは、処理用情報を参照して、第１処理と第２処理とを実行する。検索処理部１３３Ｃは、処理用情報を参照して処理対象となるオブジェクトを特定する。検索処理部１３３Ｃは、複数のオブジェクトを対象とする検索処理において、二以上の第１識別情報のうち、一の第１識別情報のブロブを対象として第１処理を行うとともに、一の第２識別情報のクラスタに関して検索処理において参照する参照用情報を生成した後、二以上の第１識別情報のうち、一の第１識別情報以外の他の第１識別情報の他のブロブを対象として第１処理を行い際は参照用情報を生成しない。 The search processing unit 133C refers to the processing information and executes the first process and the second process. The search processing unit 133C refers to the processing information to identify the object to be processed. In a search process targeting a plurality of objects, the search processing unit 133C performs a first process on one blob of first identification information among two or more pieces of first identification information, and also performs a first process on one blob of first identification information. After generating the reference information to be referred to in the search process regarding the information cluster, the first identification information is generated for other blobs of the first identification information other than one of the two or more first identification information. No reference information is generated during processing.

検索処理部１３３Ｃは、ブロブ連結グラフを用いた検索処理を行う。検索処理部１３３Ｃは、ブロブ連結グラフを用いて、検索クエリの近傍オブジェクトを検索する検索処理を行う。検索処理部１３３Ｃは、ブロブ連結グラフにおいてブロブ間を連結するエッジを辿ることにより、検索クエリの近傍オブジェクトを検索する検索処理を行う。 The search processing unit 133C performs search processing using a blob connection graph. The search processing unit 133C performs a search process to search for nearby objects of the search query using the blob connection graph. The search processing unit 133C performs a search process to search for nearby objects of the search query by tracing edges connecting blobs in the blob connection graph.

検索処理部１３３Ｃは、検索処理において、複数のブロブを示すブロブ情報を用いて、一のブロブに属するオブジェクトである対象オブジェクトと検索クエリとの距離を、ベクトル量子化された対象オブジェクトのベクトル情報を用いて算出する。検索処理部１３３Ｃは、対象オブジェクトと検索クエリとの距離の算出を並列化して一括で行う。検索処理部１３３Ｃは、情報処理装置の仕様に基づいて決定される一括処理数の対象オブジェクトと検索クエリとの距離の算出を並列処理する。 In the search process, the search processing unit 133C uses blob information indicating a plurality of blobs to calculate the distance between the target object, which is an object belonging to one blob, and the search query, and the vector information of the target object that has been vector quantized. Calculate using The search processing unit 133C parallelizes and collectively calculates the distance between the target object and the search query. The search processing unit 133C performs parallel processing to calculate the distance between the search query and the target object of the batch processing number determined based on the specifications of the information processing device.

検索処理部１３３Ｃは、ブロブ連結グラフを用い、図２７及び図２８に示す処理により検索処理を行う。例えば、検索処理部１３３Ｃは、空間情報ＳＰ５１に示すように、ブロブ間をエッジで連結するグラフＧＲ４１を用いて検索処理を行う。例えば、検索処理部１３３Ｃは、検索クエリＱＥ３を対象として、ブロブ連結グラフであるグラフＧＲ４１を用いた図２７及び図２８に示すような検索処理を行うことにより、検索クエリＱＥ３の検索結果を得る。例えば、検索処理部１３３Ｃは、ブロブ連結グラフを辿ることにより、処理対象となったブロブについて第１処理を実行する。 The search processing unit 133C uses the blob connection graph to perform search processing as shown in FIGS. 27 and 28. For example, the search processing unit 133C performs a search process using a graph GR41 that connects blobs with edges, as shown in the spatial information SP51. For example, the search processing unit 133C obtains the search results for the search query QE3 by performing search processing as shown in FIGS. 27 and 28 using the graph GR41, which is a blob connection graph, for the search query QE3. For example, the search processing unit 133C executes the first process on the blob that is the processing target by tracing the blob connection graph.

検索処理部１３３Ｃは、処理対象となったブロブが属するクラスタについて、参照用情報が未生成である場合、参照用情報を生成する。例えば、検索処理部１３３Ｃは、距離のルックアップテーブル（ＬＵＴ）を参照用情報として生成する。例えば、検索処理部１３３Ｃは、検索処理において、クラスタ単位に距離のルックアップテーブルを生成する。すなわち、検索処理部１３３Ｃは、検索処理において、ブロブ単位ではなく、クラスタ単位で、距離のルックアップテーブルを生成する。これにより、情報処理装置１００Ｃは、検索時間の増大を抑制することができる。 The search processing unit 133C generates reference information if reference information has not yet been generated for the cluster to which the processing target blob belongs. For example, the search processing unit 133C generates a distance lookup table (LUT) as reference information. For example, the search processing unit 133C generates a distance lookup table for each cluster in the search process. That is, in the search processing, the search processing unit 133C generates a lookup table of distances not in units of blobs but in units of clusters. Thereby, the information processing device 100C can suppress an increase in search time.

検索処理部１３３Ｃは、生成した参照用情報を用いて、処理対象となったブロブに属するオブジェクトについて一括して距離計算を実行する。例えば、検索処理部１３３Ｃは、処理対象となったブロブが属するクラスタについて、参照用情報が生成済みである場合、参照用情報を生成しない。そして、検索処理部１３３Ｃは、既に生成済みである参照用情報を用いて、処理対象となったブロブに属するオブジェクトについて一括して距離計算を実行する。 The search processing unit 133C uses the generated reference information to collectively calculate distances for objects belonging to the blob that is the processing target. For example, the search processing unit 133C does not generate reference information if reference information has already been generated for the cluster to which the blob targeted for processing belongs. Then, the search processing unit 133C uses the reference information that has already been generated to perform distance calculations all at once for objects belonging to the blob that is the processing target.

〔４－３．処理及び情報例〕
ここから、第４の実施形態における処理や処理に用いる情報等の例について説明する。 [4-3. Examples of processing and information]
From here, examples of processing, information used for processing, etc. in the fourth embodiment will be described.

まず、図３４を用いて量子化に関する概念について説明する。図３４は、第４の実施形態に係る量子化に関する概念図である。 First, the concept regarding quantization will be explained using FIG. 34. FIG. 34 is a conceptual diagram regarding quantization according to the fourth embodiment.

例えば、基点ＣＬ１は、クラスタＣＬ１の量子化に用いられる基点を示す。この場合、基点ＣＬ１は、ブロブＢＬ１及びブロブＢＬ７を含むクラスタＣＬ１のセントロイドを示す。また、例えば、基点ＣＬ２は、クラスタＣＬ２の量子化に用いられる基点を示す。この場合、基点ＣＬ２は、ブロブＢＬ２及びブロブＢＬ４を含むクラスタＣＬ２のセントロイドを示す。 For example, base point CL1 indicates a base point used for quantization of cluster CL1. In this case, base point CL1 indicates the centroid of cluster CL1 including blob BL1 and blob BL7. Further, for example, base point CL2 indicates a base point used for quantization of cluster CL2. In this case, base point CL2 indicates the centroid of cluster CL2 including blob BL2 and blob BL4.

例えば、基点ＣＬ３は、クラスタＣＬ３の量子化に用いられる基点を示す。この場合、基点ＣＬ３は、ブロブＢＬ３、ブロブＢＬ８及びブロブＢＬ１０を含むクラスタＣＬ３のセントロイドを示す。また、例えば、基点ＣＬ４は、クラスタＣＬ４の量子化に用いられる基点を示す。この場合、基点ＣＬ４は、ブロブＢＬ５、ブロブＢＬ６及びブロブＢＬ９を含むクラスタＣＬ４のセントロイドを示す。 For example, base point CL3 indicates a base point used for quantization of cluster CL3. In this case, base point CL3 indicates the centroid of cluster CL3 including blob BL3, blob BL8, and blob BL10. Further, for example, base point CL4 indicates a base point used for quantization of cluster CL4. In this case, base point CL4 indicates the centroid of cluster CL4 including blob BL5, blob BL6, and blob BL9.

この場合、情報処理装置１００Ｃは、４つのクラスタＣＬ１～ＣＬ４の各々の中で残差ベクトルを求める。そして、情報処理装置１００Ｃは、４つのクラスタＣＬ１～ＣＬ４の各々の中で求めた残差ベクトルを用いて、量子化を行う。例えば、情報処理装置１００Ｃは、クラスタＣＬ１に属するオブジェクトについては基点ＣＬ１からの残差ベクトルを用いて、量子化を行う。残差ベクトルを用いた量子化についての詳細な説明は省略するが、情報処理装置１００Ｃは、例えば特許文献３に示す技術などを適宜用いて、残差ベクトルを用いた量子化を行う。 In this case, the information processing device 100C finds residual vectors in each of the four clusters CL1 to CL4. The information processing device 100C then performs quantization using the residual vectors found in each of the four clusters CL1 to CL4. For example, the information processing device 100C performs quantization on objects belonging to the cluster CL1 using the residual vector from the base point CL1. Although a detailed explanation of quantization using residual vectors will be omitted, the information processing device 100C performs quantization using residual vectors, for example, appropriately using the technique disclosed in Patent Document 3.

なお、図３４では図を用いて説明するために、ベクトルを部分ベクトルに分割していない状態を示すが、情報処理装置１００Ｃは、直積量子化により分割された部分空間ごとに基点を導出し、導出した基点を基に残差ベクトルを用いて、量子化を行ってもよい。例えば、情報処理装置１００Ｃは、ベクトルを４つの部分ベクトルに分割して直積量子化を行う場合、各部分ベクトルに対応する部分空間ごとに基点があり、その基点からの残差ベクトルを用いて、量子化を行う。 Note that although FIG. 34 shows a state in which the vector is not divided into subvectors for the sake of illustration, the information processing device 100C derives a base point for each subspace divided by direct product quantization, Quantization may be performed using a residual vector based on the derived base point. For example, when performing Cartesian product quantization by dividing a vector into four partial vectors, the information processing device 100C has a base point for each subspace corresponding to each partial vector, and uses the residual vector from the base point to Perform quantization.

以下、図３４を参照しつつ、情報処理装置１００Ｃによるオブジェクトの量子化及び量子化による処理についての一例を説明する。図３４では、クエリオブジェクト（検索クエリ）が検索クエリＱＥ３である場合を一例として説明する。なお、以下で説明するオブジェクト（ベクトル）の量子化や直積量子化等において、特許文献３に開示された内容と同様の点については、詳細な説明を適宜省略する。 Hereinafter, an example of object quantization and quantization processing by the information processing apparatus 100C will be described with reference to FIG. 34. In FIG. 34, a case where the query object (search query) is search query QE3 will be described as an example. Note that in the quantization of objects (vectors), direct product quantization, etc. described below, detailed explanations of points similar to those disclosed in Patent Document 3 will be omitted as appropriate.

図３４の空間情報ＳＰ５２に示すように、検索クエリＱＥ３と、ノードＮ７等のクラスタＣＬ１に属する各オブジェクトに対応する各ベクトルとの距離は異なるが、情報処理装置１００Ｃは、ノードＮ７等のクラスタＣＬ１に属する各オブジェクトに対応するベクトルを用いない。この場合、検索クエリＱＥ３と各ベクトルとの距離は、例えば検索クエリＱＥ３とクラスタＣＬ１の基点ＣＬ１に対応するベクトルとの距離とみなされることとなる。すなわち、クラスタＣＬ１に属する各オブジェクトに対応するベクトルは、基点ＣＬ１に対応するベクトルに量子化される。具体的には、情報処理装置１００Ｃは、基点ＣＬ１からの各ベクトルの差分である残差ベクトルを求めて、その残差ベクトルを直積量子化（さらに部分空間に分割して量子化）する。 As shown in the spatial information SP52 of FIG. 34, although the distances between the search query QE3 and each vector corresponding to each object belonging to the cluster CL1 such as the node N7 are different, the information processing apparatus 100C do not use vectors corresponding to each object belonging to . In this case, the distance between the search query QE3 and each vector is considered to be, for example, the distance between the search query QE3 and the vector corresponding to the base point CL1 of the cluster CL1. That is, the vectors corresponding to each object belonging to cluster CL1 are quantized into vectors corresponding to base point CL1. Specifically, the information processing device 100C obtains a residual vector that is the difference between each vector from the base point CL1, and performs direct product quantization (further divides into subspaces and quantizes) the residual vector.

例えば、情報処理装置１００Ｃは、各クラスタをさらに複数の部分領域に分割し、複数の部分領域に対応する部分セントロイドをコードブックとして生成する。例えば、情報処理装置１００Ｃは、各クラスタに関する残差ベクトルの情報に基づいて、各クラスタが分割された複数の部分領域に対応する部分セントロイドを生成してもよい。 For example, the information processing device 100C further divides each cluster into a plurality of partial regions, and generates partial centroids corresponding to the plurality of partial regions as a codebook. For example, the information processing device 100C may generate partial centroids corresponding to a plurality of partial regions into which each cluster is divided, based on residual vector information regarding each cluster.

例えば、情報処理装置１００Ｃは、各クラスタに含まれるオブジェクトの数に基づいて、各クラスタの部分領域の数を決定してもよいし、所定の設定値に基づいて部分領域の数を決定してもよい。例えば、情報処理装置１００Ｃは、クラスタリングに関する種々の従来技術を適宜用いて、部分領域を決定してもよい。例えば、情報処理装置１００Ｃは、クラスタＣＬ１の各部分領域に含まれるオブジェクトとクラスタＣＬ１の基点ＣＬ１から残差ベクトルを算出し、その残差ベクトルから部分領域の部分セントロイドを生成する。例えば、情報処理装置１００Ｃは、クラスタの基点と、各部分領域の部分セントロイドとの残差ベクトルの情報を用いる。例えば、情報処理装置１００Ｃは、クラスタＣＬ１の基点ＣＬ１と、その各部分領域の部分セントロイドとの残差ベクトの情報を用いる。例えば、情報処理装置１００Ｃは、各オブジェクトに対応付けてそのオブジェクトが属する部分領域の部分セントロイドの情報を記憶する。 For example, the information processing device 100C may determine the number of partial regions of each cluster based on the number of objects included in each cluster, or may determine the number of partial regions based on a predetermined setting value. Good too. For example, the information processing device 100C may determine the partial area using various conventional techniques related to clustering as appropriate. For example, the information processing device 100C calculates a residual vector from the object included in each partial region of the cluster CL1 and the base point CL1 of the cluster CL1, and generates a partial centroid of the partial region from the residual vector. For example, the information processing device 100C uses information on the residual vector between the base point of the cluster and the partial centroid of each partial region. For example, the information processing device 100C uses information on the residual vector between the base point CL1 of the cluster CL1 and the partial centroid of each partial region thereof. For example, the information processing device 100C stores, in association with each object, information on the partial centroid of the partial area to which the object belongs.

例えば、情報処理装置１００Ｃは、上述した情報を用いることにより、各オブジェクトの位置をそのベクトルが属する部分領域のセントロイドの位置に量子化することができる。これにより、情報処理装置１００Ｃは、検索クエリＱＥ３と各オブジェクトとの距離を、検索クエリＱＥ３と各オブジェクトが属する部分領域の部分セントロイドに対応するベクトルとの距離まで細部化することができる。 For example, by using the above-mentioned information, the information processing device 100C can quantize the position of each object to the position of the centroid of the partial area to which the vector belongs. Thereby, the information processing device 100C can refine the distance between the search query QE3 and each object to the distance between the search query QE3 and the vector corresponding to the partial centroid of the partial area to which each object belongs.

例えば、情報処理装置１００Ｃは、各クラスタの基点のベクトルと、部分セントロイドに関する残差ベクトルとを示す情報とを用いる。これにより、情報処理装置１００Ｃは、例えば検索クエリＱＥ３に対して、部分セントロイドまでの距離を算出することができる。したがって、情報処理装置１００Ｃは、例えば検索クエリＱＥ３と各オブジェクトとの距離を、クラスタの基点よりもさらに近似した部分セントロイドまでの距離とすることができる。 For example, the information processing device 100C uses information indicating the base point vector of each cluster and the residual vector regarding the partial centroid. Thereby, the information processing device 100C can calculate the distance to the partial centroid for, for example, the search query QE3. Therefore, the information processing device 100C can, for example, set the distance between the search query QE3 and each object as the distance to a partial centroid that is even closer than the base point of the cluster.

なお、情報処理装置１００Ｃは、各オブジェクトのベクトルを複数の部分ベクトルに分割して処理してもよい。すなわち、情報処理装置１００Ｃは、いわゆる直積量子化に関する技術を用いて、処理を行ってもよい。この点については、上述した第１の実施形態等と同様であるため、重複する説明は適宜省略する。 Note that the information processing device 100C may divide the vector of each object into a plurality of partial vectors for processing. That is, the information processing device 100C may perform processing using a technique related to so-called direct product quantization. Since this point is similar to the first embodiment and the like described above, overlapping explanation will be omitted as appropriate.

例えば、図３４の場合、情報処理装置１００Ｃは、検索クエリＱＥ３のベクトルを複数の部分ベクトル（「検索クエリＱＥ３の部分ベクトル」ともいう）に分割する。この場合、空間情報ＳＰ５２も複数の部分空間（「空間情報ＳＰ５２の部分空間」ともいう）に分割される。例えば、情報処理装置１００Ｃは、図１と同様に検索クエリＱＥ３のベクトルを４分割してもよい。この場合、空間情報ＳＰ５２も４つの部分空間に分割され、各クラスタが複数の部分空間に分割される。 For example, in the case of FIG. 34, the information processing device 100C divides the vector of search query QE3 into a plurality of partial vectors (also referred to as "partial vectors of search query QE3"). In this case, the spatial information SP52 is also divided into a plurality of subspaces (also referred to as "subspaces of the spatial information SP52"). For example, the information processing device 100C may divide the vector of the search query QE3 into four as in FIG. In this case, the spatial information SP52 is also divided into four subspaces, and each cluster is divided into a plurality of subspaces.

例えば、情報処理装置１００Ｃは、４つの部分空間において各オブジェクトが属する領域に対応する部分セントロイド（コードブック）を示す情報を、各オブジェクトに対応付けて記憶する。例えば、情報処理装置１００Ｃは、各オブジェクトに対応する４つのコードブックと検索クエリＱＥ３との距離を合計することにより、各オブジェクトと検索クエリＱＥ３との距離を算出する。 For example, the information processing device 100C stores, in association with each object, information indicating a partial centroid (codebook) corresponding to a region to which each object belongs in the four partial spaces. For example, the information processing device 100C calculates the distance between each object and the search query QE3 by summing the distances between the four codebooks corresponding to each object and the search query QE3.

次に、図３５及び図３６を用いて、転置インデックスの概要を説明する。図３５は、第４の実施形態に係る転置インデックスの一例を示す図である。図３６は、第４の実施形態に係る転置インデックスの具体例を示す図である。 Next, an overview of the transposed index will be explained using FIGS. 35 and 36. FIG. 35 is a diagram illustrating an example of a transposed index according to the fourth embodiment. FIG. 36 is a diagram showing a specific example of a transposed index according to the fourth embodiment.

まず、図３５に示す転置インデックスＩＮＤ１について説明する。図３５に示す転置インデックスＩＮＤ１は、「ブロブＩＤ」、「クラスタＩＤ」、「量子化オブジェクト数」、「量子化オブジェクト＃１」、「量子化オブジェクト＃２」、「量子化オブジェクト＃３」、および「量子化オブジェクト＃４」等といった項目が含まれる。なお、図３５では、「量子化オブジェクト＃１」～「量子化オブジェクト＃４」までの４つを図示するが、転置インデックスＩＮＤ１には、「量子化オブジェクト＃５」、「量子化オブジェクト＃６」等、各ブロブに属するオブジェクト数に対応する数の項目が含まれる。 First, the transposed index IND1 shown in FIG. 35 will be explained. The transposed index IND1 shown in FIG. 35 includes "blob ID", "cluster ID", "quantization object number", "quantization object #1", "quantization object #2", "quantization object #3", It includes items such as "quantization object #4" and the like. In addition, in FIG. 35, four objects from "quantization object #1" to "quantization object #4" are illustrated, but "quantization object #5" and "quantization object #6" are shown in the transposed index IND1. ”, etc., the number of items corresponding to the number of objects belonging to each blob is included.

「ブロブＩＤ」は、第１グループであるブロブを識別するための識別情報を示す。「クラスタＩＤ」は、第２グループであるクラスタを識別するための識別情報を示す。「量子化オブジェクト数」は、ブロブに属するオブジェクト数を示す。「量子化オブジェクト＃１」～「量子化オブジェクト＃４」は、量子化されたオブジェクトを示す。例えば、「量子化オブジェクト＃１」～「量子化オブジェクト＃４」には、オブジェクトが直積量子化され、各部分ベクトルに対応するコードブックを示す情報の一覧が記憶される。 “Blob ID” indicates identification information for identifying blobs that are the first group. "Cluster ID" indicates identification information for identifying the cluster that is the second group. "Number of quantized objects" indicates the number of objects belonging to the blob. “Quantized object #1” to “quantized object #4” indicate quantized objects. For example, "quantization object #1" to "quantization object #4" store a list of information indicating the codebook corresponding to each partial vector after the object has been subjected to product quantization.

図３５の例では、ブロブＩＤ「１」であるブロブは、クラスタＩＤ「２」であるクラスタに属し、属するオブジェクト数が４つであることを示す。また、ブロブＩＤ「１」であるブロブについて、「量子化オブジェクト＃１」には「４、２、８、１０」が格納され、そのオブジェクトが４つの部分ベクトルで構成され、それぞれコードブック「４」、「２」、「８」、「１０」で量子化されていることを示す。例えば、ブロブＩＤ「１」の「量子化オブジェクト＃１」に対応するオブジェクトについては、４つ分割された部分ベクトルのうち、最初の部分ベクトルがコードブック「４」で量子化され、２番目の部分ベクトルがコードブック「２」で量子化され、３番目の部分ベクトルがコードブック「８」で量子化され、最後の部分ベクトルがコードブック「１０」で量子化されることを示す。 In the example of FIG. 35, the blob with the blob ID "1" belongs to the cluster with the cluster ID "2", indicating that the number of belonging objects is four. Furthermore, for the blob with blob ID "1", "4, 2, 8, 10" are stored in "quantization object #1", and the object is composed of four partial vectors, each of which has a codebook "4". ”, “2”, “8”, and “10”. For example, for the object corresponding to "quantization object #1" with blob ID "1", among the four subvectors divided, the first subvector is quantized with codebook "4", and the second subvector is quantized with codebook "4". It shows that the subvectors are quantized with codebook "2", the third subvector is quantized with codebook "8", and the last subvector is quantized with codebook "10".

なお、転置インデックスＩＮＤ１は、上記に限らず、目的に応じて種々の情報を記憶してもよい。例えば、転置インデックスＩＮＤ１は、各量子化オブジェクトを示す情報（オブジェクトＩＤ等）を各量子化オブジェクトに対応付けて記憶してもよい。 Note that the transposed index IND1 is not limited to the above, and may store various information depending on the purpose. For example, the transposed index IND1 may store information indicating each quantization object (object ID, etc.) in association with each quantization object.

次に、図３６に示す転置インデックスＩＮＤ２について説明する。図３６は、図３２に等に示す例と対応する転置インデックスの具体例を示す。なお、転置インデックスＩＮＤ１と同様の点については適宜説明を省略する。図３６に示す転置インデックスＩＮＤ２は、「ブロブＩＤ」、「クラスタＩＤ」、「量子化オブジェクト数」、「量子化オブジェクト＃１」等といった項目が含まれる。なお、図３６では、「量子化オブジェクト＃１」のみを図示するが、転置インデックスＩＮＤ１と同様に各ブロブに属するオブジェクト数に対応する数の項目が含まれる。 Next, the transposed index IND2 shown in FIG. 36 will be explained. FIG. 36 shows a specific example of a transposed index corresponding to the example shown in FIG. Note that descriptions of points similar to the transposed index IND1 will be omitted as appropriate. The transposed index IND2 shown in FIG. 36 includes items such as "blob ID", "cluster ID", "quantization object number", and "quantization object #1". Note that although only "quantized object #1" is illustrated in FIG. 36, it includes a number of items corresponding to the number of objects belonging to each blob, similar to the transposed index IND1.

転置インデックスＩＮＤ２では、図３２に示す空間情報ＳＰ５１と同様に、ブロブＩＤ「ＢＬ１」により識別されるブロブＢＬ１は、クラスタＩＤ「ＣＬ１」であるクラスタＣＬ１に属することを示す。また、ブロブＢＬ１に属するオブジェクト数は、「ＮＭ１」であることを示す。なお、図３６では「ＮＭ１」といった抽象的な符号で示すが、「量子化オブジェクト数」には、転置インデックスＩＮＤ１と同様に具体的な数を示す値（例えば１０や１００等）が記憶される。 The transposed index IND2 indicates that the blob BL1 identified by the blob ID "BL1" belongs to the cluster CL1 having the cluster ID "CL1", similar to the spatial information SP51 shown in FIG. 32. Further, the number of objects belonging to blob BL1 is “NM1”. Although it is indicated by an abstract code such as "NM1" in FIG. 36, the "quantization object number" stores a value indicating a specific number (for example, 10, 100, etc.) similarly to the transposed index IND1. .

また、ブロブＢＬ１は、「量子化オブジェクト＃１」には「ＣＤ５１、ＣＤ６４、ＣＤ７７、ＣＤ８２」が格納され、そのオブジェクトが４つの部分ベクトルで構成され、それぞれコードブック「ＣＤ５１」、「ＣＤ６４」、「ＣＤ７７」、「ＣＤ８２」で量子化されていることを示す。 In addition, in the blob BL1, "quantization object #1" stores "CD51, CD64, CD77, CD82", and the object is composed of four partial vectors, codebooks "CD51", "CD64", Indicates that the data is quantized as "CD77" or "CD82".

また、転置インデックスＩＮＤ２では、図３２に示す空間情報ＳＰ５１と同様に、ブロブＩＤ「ＢＬ２」により識別されるブロブＢＬ２は、クラスタＩＤ「ＣＬ２」であるクラスタＣＬ２に属することを示す。また、転置インデックスＩＮＤ２では、図３２に示す空間情報ＳＰ５１と同様に、ブロブＩＤ「ＢＬ７」により識別されるブロブＢＬ７は、クラスタＩＤ「ＣＬ１」であるクラスタＣＬ１に属することを示す。 Further, the transposed index IND2 indicates that the blob BL2 identified by the blob ID "BL2" belongs to the cluster CL2 having the cluster ID "CL2", similar to the spatial information SP51 shown in FIG. 32. Further, the transposed index IND2 indicates that the blob BL7 identified by the blob ID "BL7" belongs to the cluster CL1 having the cluster ID "CL1", similarly to the spatial information SP51 shown in FIG. 32.

なお、転置インデックスＩＮＤ２は、上記に限らず、目的に応じて種々の情報を記憶してもよい。例えば、転置インデックスＩＮＤ２は、各量子化オブジェクトを示す情報（オブジェクトＩＤ等）を各量子化オブジェクトに対応付けて記憶してもよい。このように、転置インデックスには、クラスタを識別する情報（クラスタＩＤ）が付与されており、情報処理装置１００Ｃは、転置インデックスを用いて残差ベクトルを算出する。例えば、残差ベクトルはインデックスの生成および検索時に用いられるが、特許文献３等の従来技術における処理と同様であるため詳細な説明を適宜省略する。 Note that the transposed index IND2 is not limited to the above, and may store various information depending on the purpose. For example, the transposed index IND2 may store information indicating each quantization object (object ID, etc.) in association with each quantization object. In this way, information for identifying clusters (cluster ID) is attached to the transposed index, and the information processing device 100C calculates the residual vector using the transposed index. For example, the residual vector is used during index generation and search, but since the process is similar to that in conventional techniques such as Patent Document 3, detailed explanation will be omitted as appropriate.

〔４－４．情報処理のフロー〕
次に、図３７及び図３８を用いて、第４の実施形態に係る情報処理の手順について説明する。図３７及び図３８は、第４の実施形態に係る情報処理の一例を示すフローチャートである。 [4-4. Information processing flow]
Next, the information processing procedure according to the fourth embodiment will be described using FIGS. 37 and 38. 37 and 38 are flowcharts illustrating an example of information processing according to the fourth embodiment.

まず、図３７について説明する。例えば、図３７は、グループの生成処理の一例を示すフローチャートである。図３７に示すように、情報処理装置１００Ｃは、データ検索の対象となる複数のオブジェクトを示すオブジェクト情報を取得する（ステップＳ６０１）。例えば、情報処理装置１００Ｃは、オブジェクト情報記憶部１２１から複数のオブジェクトを示すオブジェクト情報を取得する。 First, FIG. 37 will be explained. For example, FIG. 37 is a flowchart illustrating an example of group generation processing. As shown in FIG. 37, the information processing apparatus 100C acquires object information indicating a plurality of objects that are data search targets (step S601). For example, the information processing device 100C acquires object information indicating a plurality of objects from the object information storage unit 121.

情報処理装置１００Ｃは、オブジェクト情報に基づいて、複数のオブジェクトを分類する複数の第１グループを示す第１グループ情報を生成する（ステップＳ６０２）。例えば、情報処理装置１００Ｃは、オブジェクト情報に基づいて、複数のオブジェクトを分類する複数のブロブを示すブロブ情報を生成する。 The information processing device 100C generates first group information indicating a plurality of first groups into which a plurality of objects are classified based on the object information (step S602). For example, the information processing device 100C generates blob information indicating a plurality of blobs that classify a plurality of objects based on the object information.

情報処理装置１００Ｃは、複数の第１グループとは異なる分類となるグループであって、複数のオブジェクトのうち、複数の第１グループにおいて同じ第１グループに属するオブジェクトを同じグループに分類するグループである複数の第２グループを示す第２グループ情報を生成する（ステップＳ６０３）。例えば、情報処理装置１００Ｃは、少なくとも１つのクラスタが２つ以上のブロブを含む複数のクラスタを示すクラスタ情報を生成する。なお、情報処理装置１００Ｃは、ブロブと一対一に対応するクラスタを示すクラスタ情報を生成してもよい。 The information processing device 100C is a group that has a different classification from the plurality of first groups, and is a group in which objects belonging to the same first group in the plurality of first groups among the plurality of objects are classified into the same group. Second group information indicating a plurality of second groups is generated (step S603). For example, the information processing device 100C generates cluster information indicating a plurality of clusters in which at least one cluster includes two or more blobs. Note that the information processing apparatus 100C may generate cluster information indicating clusters that correspond one-to-one to blobs.

次に、図３８について説明する。例えば、図３８は、グループを用いた処理の一例を示すフローチャートである。図３８に示すように、情報処理装置１００Ｃは、データ検索の対象となる複数のオブジェクトを分類する複数の第１グループと、複数の第１グループとは異なる分類となるグループであって、複数のオブジェクトのうち、複数の第１グループにおいて同じ第１グループに属するオブジェクトを同じグループに分類するグループである複数の第２グループとを示す処理用情報を取得する（ステップＳ７０１）。例えば、情報処理装置１００Ｃは、ブロブ情報記憶部１２５に記憶されたブロブ情報と、クラスタ情報記憶部１２７に記憶されたクラスタ情報とを処理用情報として取得する。例えば、情報処理装置１００Ｃは、転置インデックスＩＮＤ１または転置インデックスＩＮＤ２を取得する。 Next, FIG. 38 will be explained. For example, FIG. 38 is a flowchart illustrating an example of processing using groups. As shown in FIG. 38, the information processing device 100C has a plurality of first groups that classify a plurality of objects that are data search targets, and a plurality of groups that are classified differently from the plurality of first groups. Among the objects, processing information indicating a plurality of second groups, which are groups in which objects belonging to the same first group are classified into the same group, is acquired (step S701). For example, the information processing device 100C obtains blob information stored in the blob information storage unit 125 and cluster information stored in the cluster information storage unit 127 as processing information. For example, the information processing device 100C obtains the transposed index IND1 or the transposed index IND2.

情報処理装置１００Ｃは、処理用情報を用いて、第１グループに属するオブジェクトを対象とする第１処理と、第２グループに属するオブジェクトを対象とする第２処理とを実行する（ステップＳ７０２）。例えば、情報処理装置１００Ｃは、ブロブに属するオブジェクトを対象とする一括処理を実行し、クラスタに属するオブジェクトを対象とする量子化を実行する。 The information processing apparatus 100C uses the processing information to execute a first process targeting objects belonging to the first group and a second process targeting objects belonging to the second group (step S702). For example, the information processing device 100C performs batch processing on objects belonging to a blob, and performs quantization on objects belonging to a cluster.

〔５．第５の実施形態〕
なお、上述したようなブロブ間を連結したブロブ連結グラフ（「ブロブグラフ」ともいう）を用いた検索処理を行う際に、複数の観点での検索範囲が用いられてもよい。この点について、以下、第５の実施形態として説明する。第５の実施形態においては、情報処理システム１は、情報処理装置１００、情報処理装置１００Ａ、情報処理装置１００Ｂまたは情報処理装置１００Ｃに代えて、情報処理装置１００Ｄを有する。なお、第１の実施形態、第２の実施形態、第３の実施形態及び第４の実施形態等において上述した内容と同様の点については適宜説明を省略する。例えば、以下の説明でのブロブＢＬ＊（＊は任意の数）は、上述したブロブＢＬ＊（＊は任意の数）と同様にオブジェクトをグルーピングしたグループであるため、詳細な説明は省略する。また、以下の説明でのセントロイドＣ＊（＊は任意の数）は、上述したセントロイドＣ＊（＊は任意の数）と同様であるため、詳細な説明は省略する。以下の説明でのクラスタＣＬ＊（＊は任意の数）は、上述したクラスタＣＬ＊（＊は任意の数）と同様にブロブを分類するクラスタであるため、詳細な説明は省略する。 [5. Fifth embodiment]
Note that when performing a search process using a blob connection graph (also referred to as a "blob graph") in which blobs are connected as described above, a search range from a plurality of viewpoints may be used. This point will be described below as a fifth embodiment. In the fifth embodiment, the information processing system 1 includes an information processing device 100D instead of the information processing device 100, the information processing device 100A, the information processing device 100B, or the information processing device 100C. Note that descriptions of points similar to those described above in the first embodiment, second embodiment, third embodiment, fourth embodiment, etc. will be omitted as appropriate. For example, the blob BL* (* is an arbitrary number) in the following description is a group of objects in the same way as the blob BL* (* is an arbitrary number) described above, so detailed explanation will be omitted. Further, since the centroid C* (* is an arbitrary number) in the following explanation is the same as the centroid C* (* is an arbitrary number) described above, detailed explanation will be omitted. Cluster CL* (* is an arbitrary number) in the following explanation is a cluster for classifying blobs in the same way as cluster CL* (* is an arbitrary number) described above, so detailed explanation will be omitted.

なお、図３９及び図４０では、クラスタの区別を実線の太い線で示す点及びブロブ間の連結をブロブＢＬのセントロイドＣ間を繋ぐ点線で示す点以外の表現態様については、図３２と同様である。例えば、実線のうち、太い線がクラスタ及びブロブの分割対応を示し、細い線がブロブの分割対応を示す。例えば、図３９及び図４０では、実線のうち細い線はクラスタ内でのブロブの分割対応を示す。また、図３９及び図４０は、セントロイドＣＮ１１～ＣＮ１４は、各クラスタＣＬ１１～ＣＬ１４の各々のセントロイドを示す。クラスタＣＬ１１には、ブロブＢＬ１１、ＢＬ１２、ＢＬ１３の３つのブロブＢＬが含まれる。ブロブＢＬ１１には、ノードＮ３１、Ｎ２０５等が含まれる。クラスタＣＬ１２には、ブロブＢＬ１４、ＢＬ１５の２つのブロブＢＬが含まれる。クラスタＣＬ１３には、ブロブＢＬ１６、ＢＬ１７、ＢＬ１８の３つのブロブＢＬが含まれる。クラスタＣＬ１４には、ブロブＢＬ１９、ＢＬ２０の２つのブロブＢＬが含まれる。なお、図３９及び図４０に示すクラスタＣＬ、ブロブＢＬ及びノードＮ等は一部であり、処理対象となるクラスタＣＬ、ブロブＢＬ及びノードＮには、図示したもの以外にも多数のクラスタＣＬ、ブロブＢＬ及びノードＮが含まれる。また、ブロブＢＬ及びクラスタＣＬの符号は、図３９のみで図示し、図４０では図示を省略するが、図３９と図４０とは同じ空間情報（空間情報ＳＰ６１）を示す。 In addition, in FIGS. 39 and 40, the expressions are the same as in FIG. 32 except for the points where the clusters are distinguished by solid thick lines and the connections between blobs are shown by dotted lines connecting centroids C of blobs BL. It is. For example, among the solid lines, thick lines indicate correspondence between clusters and blobs, and thin lines indicate correspondence between divisions of blobs. For example, in FIGS. 39 and 40, thin solid lines indicate blob division correspondence within a cluster. Furthermore, in FIGS. 39 and 40, centroids CN11 to CN14 indicate the centroids of each of the clusters CL11 to CL14. The cluster CL11 includes three blobs BL, BL11, BL12, and BL13. Blob BL11 includes nodes N31, N205, and the like. The cluster CL12 includes two blobs BL, blobs BL14 and BL15. Cluster CL13 includes three blobs BL: blobs BL16, BL17, and BL18. The cluster CL14 includes two blobs BL, blobs BL19 and BL20. Note that the clusters CL, blobs BL, nodes N, etc. shown in FIGS. 39 and 40 are only a part, and the clusters CL, blobs BL, and nodes N to be processed include many clusters CL, A blob BL and a node N are included. Further, although the symbols of the blob BL and the cluster CL are illustrated only in FIG. 39 and omitted from illustration in FIG. 40, FIG. 39 and FIG. 40 indicate the same spatial information (spatial information SP61).

情報処理装置１００Ｄは、データ検索の対象となる複数のオブジェクトが分類された複数のグループ間がエッジにより連結されたグループ連結グラフであるブロブグラフを用いた検索処理を実行する。また、情報処理装置１００Ｄは、グループであるブロブの検索に対応する範囲（以下「第１検索範囲」ともいう）と、複数のオブジェクトの検索に対応する範囲（以下「第２検索範囲」ともいう）とを用いて検索処理を実行する。これにより、情報処理装置１００Ｄは、複数のオブジェクトから、検索クエリに対応する近傍のオブジェクトを抽出する。 The information processing device 100D executes a search process using a blob graph, which is a group-connected graph in which a plurality of groups into which a plurality of objects to be data searched are classified are connected by edges. The information processing device 100D also has a range corresponding to a search for a blob as a group (hereinafter also referred to as a "first search range") and a range corresponding to a search for a plurality of objects (hereinafter also referred to as a "second search range"). ) to execute the search process. Thereby, the information processing device 100D extracts a nearby object corresponding to the search query from the plurality of objects.

〔５－１．情報処理〕
まず、図３９及び図４０を用いて、第５の実施形態に係る情報処理の概要を説明する。図３９及び図４０は、第５の実施形態に係る情報処理の一例を示す図である。具体的には、図３９は、情報処理装置１００Ｄが実行する検索処理の全体概要を示す。また、図４０は、情報処理装置１００Ｄが実行する検索処理における第１検索範囲及び第２検索範囲の変化を示す。このように、図３９及び図４０では、第１検索範囲及び第２検索範囲の変化及び検索処理の概要を説明し、具体的な検索処理の流れについては図４３で示す。 [5-1. Information processing〕
First, an overview of information processing according to the fifth embodiment will be explained using FIGS. 39 and 40. 39 and 40 are diagrams illustrating an example of information processing according to the fifth embodiment. Specifically, FIG. 39 shows an overall outline of the search process executed by the information processing device 100D. Further, FIG. 40 shows changes in the first search range and the second search range in the search process executed by the information processing device 100D. In this way, FIGS. 39 and 40 will explain the changes in the first search range and the second search range and the outline of the search process, and the specific flow of the search process will be shown in FIG. 43.

図３９中の空間情報ＳＰ６１に示すグラフＧＲ６１は、図３２中の空間情報ＳＰ５１に示すグラフＧＲ４１と同様に、ブロブ間を連結したブロブグラフである。例えば、グラフＧＲ６１でブロブＢＬのセントロイドＣ間を連結した点線（エッジ）は、グラフＧＲ４１でブロブＢＬ間を相互に連結した矢印線（エッジ）と同義である。図３９のグラフＧＲ６１は、ブロブＢＬ１１～ＢＬ２０の部分を図示したものである。例えば、情報処理装置１００Ｄは、上述した方法によりブロブＢＬ１１～ＢＬ１０等を示す情報を生成してもよいし、他の装置からブロブＢＬ１１～ＢＬ１０等を示す情報を取得してもよい。なお、図４０中の空間情報ＳＰ６１－１～ＳＰ６１－４は、第１検索範囲及び第２検索範囲の変化の説明の便宜上異なる符号を付すが、図３９の空間情報ＳＰ６１と同様であり、区別せず説明する場合、空間情報ＳＰ６１と記載する場合がある。 Graph GR61 shown in spatial information SP61 in FIG. 39 is a blob graph in which blobs are connected, similar to graph GR41 shown in spatial information SP51 in FIG. For example, a dotted line (edge) connecting centroids C of blobs BL in graph GR61 is synonymous with an arrow line (edge) connecting blobs BL with each other in graph GR41. Graph GR61 in FIG. 39 illustrates the blobs BL11 to BL20. For example, the information processing device 100D may generate information indicating the blobs BL11 to BL10, etc. using the method described above, or may acquire information indicating the blobs BL11 to BL10, etc. from another device. Note that the spatial information SP61-1 to SP61-4 in FIG. 40 are given different symbols for convenience of explanation of changes in the first search range and the second search range, but they are the same as the spatial information SP61 in FIG. 39 and cannot be distinguished. If the information is not explained, it may be described as spatial information SP61.

なお、グラフＧＲ６１は、グラフＧＲ４１のうち各ブロブＢＬ１１～ＢＬ２０の部分を図示した状態であってもよい。この場合、空間情報ＳＰ５１および空間情報ＳＰ６１は、同じ複数のオブジェクトを対象とし、グラフＧＲ６１とグラフＧＲ４１とは同じグラフのうちの異なる一部を図示したものである。例えば、図３２中のグラフＧＲ４１は、ブロブＢＬ１～ＢＬ１０の部分を図示し、グラフＧＲ６１は、ブロブＢＬ１１～ＢＬ２０の部分を図示したものであってもよい。また、グラフＧＲ６１は、図２９中のグラフＧＲ４２のように各ブロブのセントロイド間を連結したグラフであってもよい。すなわち、グラフＧＲ６１は、後述する検索処理が実行可能であれば、どのような態様でブロブ間が連結されたグラフであってもよい。 Note that the graph GR61 may be in a state where each of the blobs BL11 to BL20 of the graph GR41 is illustrated. In this case, spatial information SP51 and spatial information SP61 target the same plurality of objects, and graph GR61 and graph GR41 illustrate different parts of the same graph. For example, graph GR41 in FIG. 32 may illustrate a portion of blobs BL1 to BL10, and graph GR61 may illustrate a portion of blobs BL11 to BL20. Further, the graph GR61 may be a graph in which the centroids of each blob are connected like the graph GR42 in FIG. 29. That is, the graph GR61 may be a graph in which blobs are connected in any manner as long as the search process described below can be executed.

図３９では、クエリが検索クエリＱＥ１１であり、近傍のオブジェクトとして抽出するオブジェクトの数（「近傍オブジェクト抽出数」ともいう）が「５」である場合の処理を一例として示す。検索クエリＱＥ１１を中心とする５つの円のうち、一点鎖線の円が第１検索範囲の一例であるブロブ検索範囲ＳＢを示す。なお、図３９では、ブロブ検索範囲ＳＢを示す３つの一点鎖線の円のうち、最も半径の大きい円のみに符号「ＳＢ」を付すが、３つの一点鎖線の円が検索処理において変化するブロブ検索範囲ＳＢを示す。 FIG. 39 shows, as an example, a process in which the query is search query QE11 and the number of objects extracted as nearby objects (also referred to as "number of extracted nearby objects") is "5". Among the five circles centered on the search query QE11, the circle with a dash-dot line indicates the blob search range SB, which is an example of the first search range. In addition, in FIG. 39, only the circle with the largest radius among the three dot-dashed circles indicating the blob search range SB is given the symbol "SB", but the three dot-dashed circles indicate blob searches that change during the search process. Range SB is shown.

例えば、ブロブ検索範囲ＳＢは、検索クエリＱＥ１１を中心とする半径（ブロブ検索半径）により規定される第１検索範囲である。図３９中の一点鎖線の矢印線で示すように、ブロブ検索範囲ＳＢは、検索処理が進むにつれて、その範囲（半径）が広くなるが、この点については図４０を用いて後述する。なお、図４０で示すブロブ検索範囲ＳＢ１、ＳＢ２、ＳＢ３は、図３９中の３つのブロブ検索範囲ＳＢの各々に対応する。ブロブ検索範囲ＳＢ１、ＳＢ２、ＳＢ３を区別せず説明する場合、ブロブ検索範囲ＳＢと記載する場合がある。 For example, the blob search range SB is a first search range defined by a radius (blob search radius) centered on the search query QE11. As shown by the dashed-dotted arrow line in FIG. 39, the range (radius) of the blob search range SB becomes wider as the search process progresses, but this point will be described later using FIG. 40. Note that blob search ranges SB1, SB2, and SB3 shown in FIG. 40 correspond to each of the three blob search ranges SB in FIG. 39. When describing the blob search ranges SB1, SB2, and SB3 without distinguishing them, they may be referred to as blob search range SB.

また、図３９では、検索クエリＱＥ１１を中心とする５つの円のうち、二点鎖線の円が第１検索範囲の一例であるオブジェクト検索範囲ＳＯを示す。なお、図３９では、オブジェクト検索範囲ＳＯを示す２つの二点鎖線の円のうち、外側の円のみに符号「ＳＯ」を付すが、２つの二点鎖線の円が検索処理において変化するオブジェクト検索範囲ＳＯを示す。 Furthermore, in FIG. 39, among the five circles centered on the search query QE11, the circle indicated by the two-dot chain line indicates the object search range SO, which is an example of the first search range. In addition, in FIG. 39, only the outer circle of the two dashed-dotted circles indicating the object search range SO is marked with the symbol "SO"; Indicates range SO.

例えば、オブジェクト検索範囲ＳＯは、検索クエリＱＥ１１を中心とする半径（オブジェクト検索半径）により規定される第２検索範囲である。図３９中の二点鎖線の矢印線で示すように、オブジェクト検索範囲ＳＯは、検索処理が進むにつれて、その範囲（半径）が小さくなるが、この点については図４０を用いて後述する。なお、図４０で示すオブジェクト検索範囲ＳＯ１、ＳＯ２は、図３９中の２つのオブジェクト検索範囲ＳＯの各々に対応する。オブジェクト検索範囲ＳＯ１、ＳＯ２を区別せず説明する場合、オブジェクト検索範囲ＳＯと記載する場合がある。 For example, the object search range SO is a second search range defined by a radius (object search radius) centered on the search query QE11. As shown by the two-dot chain arrow line in FIG. 39, the range (radius) of the object search range SO becomes smaller as the search process progresses, but this point will be described later using FIG. 40. Note that object search ranges SO1 and SO2 shown in FIG. 40 correspond to each of the two object search ranges SO in FIG. 39. When describing the object search ranges SO1 and SO2 without distinguishing them, they may be referred to as object search range SO.

まず、情報処理装置１００Ｄは、空間情報ＳＰ６１－１に示す状態から検索処理を開始する。そして、情報処理装置１００Ｄは、空間情報ＳＰ６１－２に示すように、ブロブ検索範囲ＳＢをブロブ検索範囲ＳＢ１に決定し、オブジェクト検索範囲ＳＯを無限大「∞」に決定する。すなわち、図４０では、情報処理装置１００Ｄは、ブロブ検索範囲ＳＢを、最近傍のブロブＢＬ２０と検索クエリＱＥ１１との間の距離に基づく半径で規定されるブロブ検索範囲ＳＢ１に決定する。また、図４０では、情報処理装置１００Ｄは、ブロブＢＬ２０の属するノードＮ２、Ｎ６、Ｎ８の各々（に対応するオブジェクト）の３つを、検索クエリＱＥ１１に対応する近傍のオブジェクトの候補として抽出し、近傍オブジェクト候補に追加する。なお、近傍オブジェクト候補は３つであり、近傍オブジェクト抽出数「５」以下であるため、オブジェクト検索範囲ＳＯを無限大「∞」が維持される。 First, the information processing device 100D starts the search process from the state shown in the spatial information SP61-1. Then, the information processing device 100D determines the blob search range SB to be the blob search range SB1, and the object search range SO to infinity "∞", as shown in the spatial information SP61-2. That is, in FIG. 40, the information processing apparatus 100D determines the blob search range SB to be the blob search range SB1 defined by the radius based on the distance between the nearest blob BL20 and the search query QE11. In addition, in FIG. 40, the information processing device 100D extracts three nodes (objects corresponding to) nodes N2, N6, and N8 to which the blob BL20 belongs as candidates for nearby objects corresponding to the search query QE11, Add to nearby object candidates. Note that since there are three nearby object candidates and the number of extracted nearby objects is "5" or less, the object search range SO is maintained at infinity "∞".

そして、情報処理装置１００Ｄは、空間情報ＳＰ６１－３に示すように、ブロブ検索範囲ＳＢをブロブ検索範囲ＳＢ２に決定し、オブジェクト検索範囲ＳＯをオブジェクト検索範囲ＳＯ１に決定する。すなわち、図４０では、情報処理装置１００Ｄは、ブロブ検索範囲ＳＢを、ブロブＢＬ２０の次に近傍のブロブＢＬ１１と検索クエリＱＥ１１との間の距離に基づく半径で規定されるブロブ検索範囲ＳＢ２に決定する。また、図４０では、情報処理装置１００Ｄは、ブロブＢＬ１１の属するノードＮ３１、Ｎ２０５、Ｎ３１２の各々（に対応するオブジェクト）の３つを、検索クエリＱＥ１１に対応する近傍のオブジェクトの候補として抽出し、近傍オブジェクト候補に追加する。これにより、近傍オブジェクト候補にはノードＮ２、Ｎ６、Ｎ８、Ｎ３１、Ｎ２０５、Ｎ３１２の６個が含まれ、近傍オブジェクト抽出数「５」を超える。そのため、情報処理装置１００Ｄは、近傍オブジェクト候補のうち検索クエリＱＥ１１から距離が近い方から５個を残し、残りを除外する。この場合、情報処理装置１００Ｄは、６個のノードのうち、検索クエリＱＥ１１から距離が最も遠いノードＮ３１２を除外し、近傍オブジェクト候補の数を５個にする。すなわち、情報処理装置１００Ｄは、近傍オブジェクト候補をノードＮ２、Ｎ６、Ｎ８、Ｎ３１、Ｎ２０５の５個にする。近傍オブジェクト候補の数が近傍オブジェクト抽出数「５」に達したため、情報処理装置１００Ｄは、オブジェクト検索範囲ＳＯを、近傍オブジェクト候補のうち検索クエリＱＥ１１から最も遠いノードＮ２０５と検索クエリＱＥ１１との間の距離に基づく半径で規定されるオブジェクト検索範囲ＳＯ１に決定する。 Then, the information processing device 100D determines the blob search range SB to be the blob search range SB2 and the object search range SO to the object search range SO1, as shown in the spatial information SP61-3. That is, in FIG. 40, the information processing device 100D determines the blob search range SB to be the blob search range SB2 defined by the radius based on the distance between the next nearest blob BL11 of the blob BL20 and the search query QE11. . In addition, in FIG. 40, the information processing device 100D extracts three nodes (objects corresponding to) nodes N31, N205, and N312 to which the blob BL11 belongs as candidates for nearby objects corresponding to the search query QE11, Add to nearby object candidates. As a result, the neighboring object candidates include six nodes N2, N6, N8, N31, N205, and N312, which exceeds the number of extracted neighboring objects "5". Therefore, the information processing device 100D leaves five of the nearby object candidates that are closest to the search query QE11, and excludes the rest. In this case, the information processing device 100D excludes the node N312 which is the farthest from the search query QE11 among the six nodes, and sets the number of nearby object candidates to five. That is, the information processing device 100D sets five nearby object candidates: nodes N2, N6, N8, N31, and N205. Since the number of neighboring object candidates has reached the number of neighboring object extractions "5", the information processing device 100D sets the object search range SO to the area between the node N205, which is the farthest from the search query QE11 among the neighboring object candidates, and the search query QE11. The object search range SO1 defined by the radius based on the distance is determined.

そして、情報処理装置１００Ｄは、空間情報ＳＰ６１－４に示すように、ブロブ検索範囲ＳＢをブロブ検索範囲ＳＢ３に決定し、オブジェクト検索範囲ＳＯをオブジェクト検索範囲ＳＯ２に決定する。すなわち、図４０では、情報処理装置１００Ｄは、ブロブ検索範囲ＳＢを、ブロブＢＬ１１の次に近傍のブロブＢＬ１９と検索クエリＱＥ１１との間の距離に基づく半径で規定されるブロブ検索範囲ＳＢ３に決定する。また、図４０では、情報処理装置１００Ｄは、ブロブＢＬ１９の属するノードＮ３、Ｎ１１、Ｎ１１４の各々（に対応するオブジェクト）の３つを、検索クエリＱＥ１１に対応する近傍のオブジェクトの候補として抽出し、近傍オブジェクト候補に追加する。これにより、近傍オブジェクト候補にはノードＮ２、Ｎ６、Ｎ８、Ｎ３１、Ｎ２０５、Ｎ３、Ｎ１１、Ｎ１１４の８個が含まれ、近傍オブジェクト抽出数「５」を超える。そのため、情報処理装置１００Ｄは、近傍オブジェクト候補のうち検索クエリＱＥ１１から距離が近い方から５個を残し、残りを除外する。この場合、情報処理装置１００Ｄは、検索クエリＱＥ１１から距離が遠い方から３個のノードＮ２０５、Ｎ３、Ｎ１１４を除外し、近傍オブジェクト候補の数を５個にする。すなわち、情報処理装置１００Ｄは、近傍オブジェクト候補をノードＮ２、Ｎ６、Ｎ８、Ｎ１１、Ｎ３１の５個にする。そして、情報処理装置１００Ｄは、オブジェクト検索範囲ＳＯを、近傍オブジェクト候補のうち検索クエリＱＥ１１から最も遠いノードＮ１１と検索クエリＱＥ１１との間の距離に基づく半径で規定されるオブジェクト検索範囲ＳＯ２に決定する。 Then, the information processing device 100D determines the blob search range SB to be the blob search range SB3 and the object search range SO to the object search range SO2, as shown in the spatial information SP61-4. That is, in FIG. 40, the information processing device 100D determines the blob search range SB to be the blob search range SB3 defined by the radius based on the distance between the next nearest blob BL19 of the blob BL11 and the search query QE11. . In addition, in FIG. 40, the information processing device 100D extracts three nodes (objects corresponding to) nodes N3, N11, and N114 to which the blob BL19 belongs as candidates for nearby objects corresponding to the search query QE11, Add to nearby object candidates. As a result, the neighboring object candidates include eight nodes N2, N6, N8, N31, N205, N3, N11, and N114, which exceeds the number of extracted neighboring objects "5". Therefore, the information processing device 100D leaves five of the nearby object candidates that are closest to the search query QE11, and excludes the rest. In this case, the information processing device 100D excludes the three nodes N205, N3, and N114 from the farthest distance from the search query QE11, and sets the number of nearby object candidates to five. That is, the information processing device 100D sets five nearby object candidates: nodes N2, N6, N8, N11, and N31. Then, the information processing device 100D determines the object search range SO to be an object search range SO2 defined by a radius based on the distance between the search query QE11 and the node N11 which is farthest from the search query QE11 among the neighboring object candidates. .

情報処理装置１００Ｄは、上述のような処理を後述する終了基準＃１～＃３に当てはまるまで繰り返す。例えば、情報処理装置１００Ｄは、終了基準＃１～＃３に当てはまる場合、その時点で近傍オブジェクト候補に含まれるオブジェクトを検索クエリＱＥ１１の近傍のオブジェクトに決定する。そして、情報処理装置１００Ｄは、検索クエリＱＥ１１の近傍のオブジェクトに決定したオブジェクトを示す情報を、検索クエリＱＥ１１を指定した検索の要求元等に提供する。 The information processing device 100D repeats the above-described processing until termination criteria #1 to #3, which will be described later, are met. For example, if termination criteria #1 to #3 are met, the information processing device 100D determines the objects included in the nearby object candidates at that time to be the objects near the search query QE11. Then, the information processing device 100D provides information indicating the object determined as an object in the vicinity of the search query QE11 to the requestor of the search that specified the search query QE11.

上記のように、情報処理装置１００Ｄは、第１検索範囲及び第２検索範囲といった複数の観点での検索範囲を用いて検索処理を行い、各検索範囲を検索処理の状況に応じて変化させることにより、グラフを用いて柔軟な検索処理を行うことができる。 As described above, the information processing device 100D performs a search process using search ranges from multiple viewpoints, such as the first search range and the second search range, and changes each search range according to the situation of the search process. This makes it possible to perform flexible search processing using graphs.

ここで、例えばブロブグラフでは、大規模なブロブグラフは無駄なブロブの参照によって検索速度が低下する場合がある。また、従来、直積量子化（Product Quantization：PQ）ではグラフを検索し近傍のブロブを特定したのちに特定されたブロブについて近似距離を算出する方法があるが、この場合も無駄なブロブ参照によって検索速度が低下する。 Here, for example, in a blob graph, search speed may decrease due to unnecessary blob references in a large-scale blob graph. In addition, conventional product quantization (PQ) involves searching a graph, identifying nearby blobs, and then calculating approximate distances for the identified blobs, but in this case too, the search is performed using unnecessary blob references. Speed decreases.

一方で、情報処理装置１００Ｄは、ブロブグラフを探索し近傍順に順位が確定できたブロブに対して逐一ブロブ内のオブジェクトの近似距離を算出し、所定の終了基準を満たした時点で検索を終了することで、無駄なオブジェクト探索を削減できる。 On the other hand, the information processing device 100D searches the blob graph, calculates approximate distances of objects within the blobs for each blob whose rank has been determined in order of proximity, and terminates the search when a predetermined termination criterion is met. This can reduce unnecessary object searches.

〔５－２．情報処理装置の構成〕
次に、図４１を用いて、第５の実施形態に係る情報処理装置１００Ｄの構成について説明する。図４１は、第５の実施形態に係る情報処理装置の構成例を示す図である。図４１に示すように、情報処理装置１００Ｄは、通信部１１０と、記憶部１２０Ｃと、制御部１３０Ｂとを有する。なお、情報処理装置１００Ｄにおいて、情報処理装置１００、情報処理装置１００Ａ、情報処理装置１００Ｂまたは情報処理装置１００Ｃと同様の点は適宜説明を省略する。 [5-2. Configuration of information processing device]
Next, the configuration of the information processing device 100D according to the fifth embodiment will be described using FIG. 41. FIG. 41 is a diagram illustrating a configuration example of an information processing apparatus according to the fifth embodiment. As shown in FIG. 41, the information processing device 100D includes a communication section 110, a storage section 120C, and a control section 130B. Note that in the information processing device 100D, descriptions of the same points as the information processing device 100, the information processing device 100A, the information processing device 100B, or the information processing device 100C will be omitted as appropriate.

（記憶部１２０Ｃ）
記憶部１２０Ｃは、例えば、ＲＡＭ、フラッシュメモリ等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。第５の実施形態に係る記憶部１２０Ｃは、図４１に示すように、オブジェクト情報記憶部１２１と、グラフ情報記憶部１２２と、量子化情報記憶部１２３と、コードブック情報記憶部１２４と、ブロブ情報記憶部１２５と、ブロブ連結グラフ情報記憶部１２６と、クラスタ情報記憶部１２７とを有する。 (Storage unit 120C)
The storage unit 120C is realized by, for example, a semiconductor memory device such as a RAM or a flash memory, or a storage device such as a hard disk or an optical disk. As shown in FIG. 41, the storage unit 120C according to the fifth embodiment includes an object information storage unit 121, a graph information storage unit 122, a quantization information storage unit 123, a codebook information storage unit 124, and a blob information storage unit 121. It has an information storage section 125, a blob connection graph information storage section 126, and a cluster information storage section 127.

（制御部１３０Ｄ）
制御部１３０Ｄは、コントローラ（controller）であり、例えば、ＣＰＵやＭＰＵやＧＰＵ等によって、情報処理装置１００Ｄ内部の記憶装置に記憶されている各種プログラム（情報処理プログラムの一例に相当）がＲＡＭを作業領域として実行されることにより実現される。また、制御部１３０Ｄは、コントローラであり、例えば、ＡＳＩＣやＦＰＧＡ等の集積回路により実現される。 (Control unit 130D)
The control unit 130D is a controller, and allows various programs (corresponding to an example of an information processing program) stored in a storage device inside the information processing device 100D to work on the RAM by, for example, a CPU, MPU, GPU, etc. This is achieved by executing it as a region. Further, the control unit 130D is a controller, and is realized by, for example, an integrated circuit such as an ASIC or an FPGA.

図４１に示すように、制御部１３０Ｄは、取得部１３１と、生成部１３２Ｃと、検索処理部１３３Ｄと、提供部１３４とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部１３０Ｄの内部構成は、図４１に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。 As shown in FIG. 41, the control unit 130D includes an acquisition unit 131, a generation unit 132C, a search processing unit 133D, and a provision unit 134, and realizes or executes information processing functions and operations described below. do. Note that the internal configuration of the control unit 130D is not limited to the configuration shown in FIG. 41, and may be any other configuration as long as it performs information processing to be described later.

第５の実施形態に係る取得部１３１は、記憶部１２０Ｃから各種情報を取得する。例えば、第５の実施形態に係る取得部１３１は、第４の実施形態に係る取得部１３１と同様に、生成部１３２Ｃが用いる各種情報を取得する。 The acquisition unit 131 according to the fifth embodiment acquires various information from the storage unit 120C. For example, the acquisition unit 131 according to the fifth embodiment acquires various information used by the generation unit 132C, similarly to the acquisition unit 131 according to the fourth embodiment.

取得部１３１は、データ検索の対象となる複数のオブジェクトが分類された複数のグループ間がエッジにより連結されたグループ連結グラフを取得する。取得部１３１は、複数のオブジェクトに対する検索クエリを取得する。 The acquisition unit 131 acquires a group connection graph in which a plurality of groups into which a plurality of objects to be data searched are classified are connected by edges. The acquisition unit 131 acquires search queries for multiple objects.

（検索処理部１３３Ｄ）
検索処理部１３３Ｄは、図４３に示す検索処理を実行する。例えば、検索処理部１３３Ｄは、検索処理部１３３、検索処理部１３３Ａ、検索処理部１３３Ｂまたは検索処理部１３３ＢＣと同様に検索処理に関する各種処理を行う。検索処理部１３３Ｄは、記憶部１２０Ｃに記憶された情報を用いた処理を行う。検索処理部１３３Ｄは、生成部１３２Ｃにより生成された情報を用いた処理を行う。 (Search processing unit 133D)
The search processing unit 133D executes the search process shown in FIG. 43. For example, the search processing unit 133D performs various processes related to search processing in the same way as the search processing unit 133, the search processing unit 133A, the search processing unit 133B, or the search processing unit 133BC. The search processing unit 133D performs processing using the information stored in the storage unit 120C. The search processing unit 133D performs processing using the information generated by the generation unit 132C.

検索処理部１３３Ｄは、複数のグループの検索に対応する範囲を示す第１検索範囲と、複数のオブジェクトの検索に対応する範囲を示す第２検索範囲とを用いて、グループ連結グラフを探索することにより、複数のオブジェクトから、検索クエリに対応する近傍のオブジェクトを抽出する検索処理を実行する。検索処理部１３３Ｄは、第１検索範囲と、第１検索範囲よりも範囲が広い第２検索範囲とを用いて、検索処理を実行する。 The search processing unit 133D searches the group connection graph using a first search range indicating a range corresponding to a search of a plurality of groups and a second search range indicating a range corresponding to a search of a plurality of objects. A search process is executed to extract nearby objects corresponding to the search query from a plurality of objects. The search processing unit 133D executes a search process using a first search range and a second search range that is wider than the first search range.

検索処理部１３３Ｄは、検索処理が進むにつれて範囲が広がる第１検索範囲を用いて、検索処理を実行する。検索処理部１３３Ｄは、複数のグループのうち、検索処理の対象とされた一のグループに基づいて決定される第１検索範囲を用いて、検索処理を実行する。検索処理部１３３Ｄは、検索処理中の一のグループに基づいて決定される第１検索範囲を用いて、検索処理を実行する。検索処理部１３３Ｄは、検索クエリと、一のグループに含まれるオブジェクトのうち検索クエリに最も近いオブジェクト、または一のグループのセントロイドとの間の距離に基づく第１検索範囲を用いて、検索処理を実行する。 The search processing unit 133D executes the search process using a first search range that expands as the search process progresses. The search processing unit 133D executes the search process using a first search range determined based on one group targeted for the search process among the plurality of groups. The search processing unit 133D executes the search process using the first search range determined based on the one group being searched. The search processing unit 133D performs search processing using a first search range based on the distance between the search query and the object closest to the search query among the objects included in one group, or the centroid of one group. Execute.

検索処理部１３３Ｄは、検索処理が進むにつれて範囲が狭まる第２検索範囲を用いて、検索処理を実行する。検索処理部１３３Ｄは、複数のオブジェクトのうち、検索処理の対象とされた一のオブジェクトに基づいて決定される第２検索範囲を用いて、検索処理を実行する。検索処理部１３３Ｄは、検索処理中において、検索クエリに対応する近傍のオブジェクトの候補として抽出されたオブジェクトのうち、検索クエリから最も遠いオブジェクトである一のオブジェクトに基づいて決定される第２検索範囲を用いて、検索処理を実行する。検索処理部１３３Ｄは、検索クエリと、一のオブジェクトとの間の距離に基づく第２検索範囲を用いて、検索処理を実行する。 The search processing unit 133D executes the search process using a second search range that becomes narrower as the search process progresses. The search processing unit 133D executes the search process using a second search range determined based on one object targeted for the search process among the plurality of objects. During the search process, the search processing unit 133D determines a second search range based on one object that is the farthest object from the search query among the objects extracted as candidates for nearby objects corresponding to the search query. Execute the search process using . The search processing unit 133D executes a search process using a second search range based on the distance between the search query and one object.

検索処理部１３３Ｄは、所定の終了条件を満たした場合、検索処理を終了する。検索処理部１３３Ｄは、検索処理の対象としたグループの数が条件を満たした場合、検索処理を終了する。検索処理部１３３Ｄは、検索処理の対象としたグループの数が所定の閾値を超えた場合、検索処理を終了する。 The search processing unit 133D ends the search process when a predetermined end condition is met. The search processing unit 133D ends the search process when the number of groups targeted for the search process satisfies the condition. The search processing unit 133D ends the search process when the number of groups targeted for the search process exceeds a predetermined threshold.

検索処理部１３３Ｄは、所定のグループに関する距離が第２検索範囲を超えた場合、検索処理を終了する。検索処理部１３３Ｄは、検索処理中の所定のグループに関する距離が第２検索範囲を超えた場合、検索処理を終了する。検索処理部１３３Ｄは、所定のグループに含まれるオブジェクトのうち検索クエリに最も近いオブジェクトと、検索クエリとの間の距離が第２検索範囲を超えた場合、検索処理を終了する。 The search processing unit 133D ends the search process when the distance related to the predetermined group exceeds the second search range. The search processing unit 133D ends the search process when the distance related to the predetermined group being searched exceeds the second search range. The search processing unit 133D ends the search process when the distance between the object closest to the search query among the objects included in the predetermined group and the search query exceeds the second search range.

検索処理部１３３Ｄは、第１検索範囲と第２検索範囲との大小関係が条件を満たす場合、検索処理を終了する。検索処理部１３３Ｄは、第１検索範囲が第２検索範囲を超えた場合、検索処理を終了する。 The search processing unit 133D ends the search process when the size relationship between the first search range and the second search range satisfies the condition. The search processing unit 133D ends the search process when the first search range exceeds the second search range.

〔５－３．情報処理のフロー〕
次に、図４２を用いて、第５の実施形態に係る情報処理の手順について説明する。図４２は、第５の実施形態に係る情報処理の一例を示すフローチャートである。 [5-3. Information processing flow]
Next, the information processing procedure according to the fifth embodiment will be described using FIG. 42. FIG. 42 is a flowchart illustrating an example of information processing according to the fifth embodiment.

まず、図４２について説明する。例えば、図４２は、グループの生成処理の一例を示すフローチャートである。図４２に示すように、情報処理装置１００Ｄは、データ検索の対象となる複数のオブジェクトが分類された複数のグループ間がエッジにより連結されたグループ連結グラフを取得する（ステップＳ８０１）。例えば、情報処理装置１００Ｄは、ブロブ連結グラフ情報記憶部１２６からブロブ連結グラフをグループ連結グラフとして取得する。 First, FIG. 42 will be explained. For example, FIG. 42 is a flowchart illustrating an example of group generation processing. As shown in FIG. 42, the information processing apparatus 100D obtains a group connection graph in which a plurality of groups into which a plurality of objects to be data searched are classified are connected by edges (step S801). For example, the information processing device 100D obtains a blob connection graph from the blob connection graph information storage unit 126 as a group connection graph.

情報処理装置１００Ｄは、複数のオブジェクトに対する検索クエリを取得する（ステップＳ８０２）。例えば、情報処理装置１００Ｄは、ユーザが利用する端末装置１０（図４参照）から検索クエリを取得する。 The information processing device 100D obtains search queries for a plurality of objects (step S802). For example, the information processing device 100D obtains a search query from the terminal device 10 (see FIG. 4) used by the user.

情報処理装置１００Ｄは、複数のグループの検索に対応する範囲を示す第１検索範囲と、複数のオブジェクトの検索に対応する範囲を示す第２検索範囲とを用いて、グループ連結グラフを探索することにより、複数のオブジェクトから、検索クエリに対応する近傍のオブジェクトを抽出する検索処理を実行する（ステップＳ８０３）。例えば、情報処理装置１００Ｄは、第１検索範囲であるブロブ検索範囲と、第２検索範囲であるオブジェクト検索範囲とを用いて、図４３に示す検索処理を実行する。 The information processing device 100D searches for a group connection graph using a first search range indicating a range corresponding to a search of a plurality of groups and a second search range indicating a range corresponding to a search of a plurality of objects. Accordingly, a search process is executed to extract a nearby object corresponding to the search query from a plurality of objects (step S803). For example, the information processing device 100D executes the search process shown in FIG. 43 using a blob search range that is a first search range and an object search range that is a second search range.

〔５－４．検索処理例〕
ここで、第５の実施形態に係る検索処理の一例について、図４３を一例として説明する。図４３は、第５の実施形態に係る検索処理の一例を示すフローチャートである。以下に説明する検索処理は、情報処理装置１００Ｄの検索処理部１３３Ｄによって行われる。なお、第１の実施形態、第２の実施形態、第３の実施形態及び第４の実施形態で説明した検索処理と同様の点については適宜説明を省略する。 [5-4. Search processing example]
Here, an example of the search process according to the fifth embodiment will be described using FIG. 43 as an example. FIG. 43 is a flowchart illustrating an example of search processing according to the fifth embodiment. The search processing described below is performed by the search processing unit 133D of the information processing device 100D. Note that descriptions of points similar to the search processing described in the first embodiment, second embodiment, third embodiment, and fourth embodiment will be omitted as appropriate.

情報処理装置１００Ｄは、探索ブロブ起点集合を取得し、個々のオブジェクトとクエリとの距離を計算する（ステップＳ９０１）。例えば、情報処理装置１００Ｄは、探索の起点となるブロブを示す探索ブロブ起点集合を取得し、探索ブロブ起点集合のブロブに含まれる各オブジェクトとクエリとの距離を計算する。例えば、情報処理装置１００Ｄは、ランダムに探索ブロブ起点集合を選択してもよいし、例えばブロブをリーフとする木構造のインデックス等の所定のインデックスを用いて、探索ブロブ起点集合を選択してもよい。例えば、情報処理装置１００Ｄは、上述した近似距離の算出により、探索ブロブ起点集合のブロブに含まれる各オブジェクトとクエリとの近似距離を計算する。 The information processing device 100D obtains a search blob starting point set and calculates the distance between each object and the query (step S901). For example, the information processing device 100D obtains a search blob starting point set indicating blobs that are search starting points, and calculates the distance between each object included in the blobs in the search blob starting point set and the query. For example, the information processing device 100D may select a search blob origin set at random, or may select a search blob origin set using a predetermined index, such as an index of a tree structure in which blobs are leaves. good. For example, the information processing device 100D calculates the approximate distance between each object included in the blob of the search blob origin set and the query by calculating the approximate distance described above.

情報処理装置１００Ｄは、探索ブロブ起点集合のクエリの最近傍ブロブを現最近傍ブロブＢ（以下単に「ブロブＢ」ともいう）とする（ステップＳ９０２）。例えば、情報処理装置１００Ｄは、探索ブロブ起点集合のうち、クエリに最も近いブロブを現最近傍ブロブＢに決定する。 The information processing device 100D sets the nearest neighbor blob of the query of the search blob origin set as the current nearest neighbor blob B (hereinafter also simply referred to as "blob B") (step S902). For example, the information processing device 100D determines the blob closest to the query as the current nearest neighbor blob B from the search blob origin set.

情報処理装置１００Ｄは、各種パラメータを設定する（ステップＳ９０３）。図４３では、情報処理装置１００Ｄは、ブロブ検索数を「１」に設定する。また、情報処理装置１００Ｄは、第２検索範囲（オブジェクト検索範囲）を示すオブジェクト検索半径Ｒを「∞（無限大）」に設定する。例えば、オブジェクト検索半径Ｒは、第２検索範囲を規定するクエリｑを中心とする半径である。また、情報処理装置１００Ｄは、第２探索範囲の一例であるオブジェクト探索半径Ｅを「∞（無限大）」に設定する。 The information processing device 100D sets various parameters (step S903). In FIG. 43, the information processing apparatus 100D sets the number of blob searches to "1". Further, the information processing device 100D sets the object search radius R indicating the second search range (object search range) to “∞ (infinity)”. For example, the object search radius R is a radius centered on the query q that defines the second search range. Further, the information processing device 100D sets the object search radius E, which is an example of the second search range, to “∞ (infinity)”.

また、情報処理装置１００Ｄは、第１検索範囲（ブロブ検索範囲）の一例であるブロブ検索半径Ｒｂを「ｄ（ｑ,Ｂ）」に設定する。例えば、ブロブ検索半径Ｒｂは、第１検索範囲を規定するクエリｑを中心とする半径である。例えば、情報処理装置１００Ｄは、ブロブ検索半径Ｒｂを、クエリｑと現最近傍ブロブＢとの間の距離ｄ（ｑ,Ｂ）に設定する。例えば、情報処理装置１００Ｄは、ブロブ検索半径Ｒｂを、クエリｑと、現最近傍ブロブＢに含まれるオブジェクトのうちクエリｑに最も近いオブジェクトとの間の距離ｄ（ｑ,Ｂ）に設定する。なお、ブロブ検索半径Ｒｂは、クエリｑと現最近傍ブロブＢのセントロイドとの間の距離であってもよい。また、情報処理装置１００Ｄは、第１探索範囲（ブロブ検索範囲）を示すブロブ探索半径Ｅｂを「Ｒｂ×（１＋ｅｂ）」に設定する。ここで、パラメータｅｂは、ブロブ探索半径拡大率（「第１検索範囲係数」ともいう）であり、例えば０．１等の任意の値が設定される。例えば、情報処理装置１００Ｄは、ブロブ探索半径Ｅｂを、ブロブ検索半径Ｒｂに１＋ｅｂが乗算した値を設定する。 Further, the information processing device 100D sets a blob search radius Rb, which is an example of a first search range (blob search range), to “d(q,B)”. For example, the blob search radius Rb is a radius centered on the query q that defines the first search range. For example, the information processing device 100D sets the blob search radius Rb to the distance d(q,B) between the query q and the current nearest neighbor blob B. For example, the information processing device 100D sets the blob search radius Rb to the distance d(q,B) between the query q and the object closest to the query q among the objects included in the current nearest neighbor blob B. Note that the blob search radius Rb may be the distance between the query q and the centroid of the current nearest neighbor blob B. Further, the information processing device 100D sets the blob search radius Eb indicating the first search range (blob search range) to “Rb×(1+eb)”. Here, the parameter eb is a blob search radius expansion rate (also referred to as a "first search range coefficient"), and is set to an arbitrary value such as 0.1, for example. For example, the information processing device 100D sets the blob search radius Eb to a value obtained by multiplying the blob search radius Rb by 1+eb.

情報処理装置１００Ｄは、ブロブＢ以外の探索ブロブ起点集合を除外ブロブ集合に追加する（ステップＳ９０４）。例えば、情報処理装置１００Ｄは、探索ブロブ起点集合のうち、現最近傍ブロブＢを除くブロブを除外ブロブ集合に追加する。 The information processing device 100D adds search blob origin sets other than blob B to the excluded blob set (step S904). For example, the information processing device 100D adds blobs excluding the current nearest neighbor blob B from the search blob origin set to the excluded blob set.

情報処理装置１００Ｄは、探索ブロブ起点集合を未探索ブロブ集合及び距離算出済み集合に追加する（ステップＳ９０５）。例えば、情報処理装置１００Ｄは、探索ブロブ起点集合のうち、現最近傍ブロブＢを含む全ブロブを、未だ探索の処理対象とされていないことを示す未探索ブロブ集合に追加する。例えば、情報処理装置１００Ｄは、探索ブロブ起点集合のうち、現最近傍ブロブＢを含む全ブロブを、そのブロブ内のオブジェクトについて距離算出処理済みであることを示す距離算出済み集合に追加する。 The information processing device 100D adds the search blob origin set to the unsearched blob set and the distance calculated set (step S905). For example, the information processing device 100D adds all blobs including the current nearest neighbor blob B from the search blob origin set to an unsearched blob set indicating that they have not yet been searched. For example, the information processing device 100D adds all blobs including the current nearest neighbor blob B from the search blob origin set to a distance calculation completed set indicating that distance calculation processing has been completed for objects within the blobs.

情報処理装置１００Ｄは、未探索ブロブ集合が空であるかどうかを判定する（ステップＳ９０６）。情報処理装置１００Ｄは、未探索ブロブ集合が空ではない場合（ステップＳ９０６：Ｎｏ）、未探索ブロブ集合から最もクエリに近いブロブＴを取得する（ステップＳ９０７）。これにより、未探索ブロブ集合からは未探索ブロブ集合が除かれる。例えば、情報処理装置１００Ｄは、未探索ブロブ集合のうち、クエリｑとの間の距離が最も短いブロブをブロブＴとして取得する。例えば、クエリｑとの間の距離は、クエリｑと、ブロブに含まれるオブジェクトのうちクエリｑに最も近いオブジェクトとの間の距離である。なお、クエリｑとの間の距離は、クエリｑとブロブのセントロイドとの間の距離であってもよい。 The information processing device 100D determines whether the unsearched blob set is empty (step S906). If the unsearched blob set is not empty (step S906: No), the information processing device 100D acquires the blob T closest to the query from the unsearched blob set (step S907). As a result, the unsearched blob set is removed from the unsearched blob set. For example, the information processing device 100D obtains, as the blob T, the blob with the shortest distance to the query q from among the set of unsearched blobs. For example, the distance between query q is the distance between query q and the object closest to query q among the objects included in the blob. Note that the distance between the query q and the blob may be the distance between the query q and the centroid of the blob.

情報処理装置１００Ｄは、ブロブＴがブロブ探索範囲内にあるかどうかを判定する（ステップＳ９０８）。例えば、情報処理装置１００Ｄは、ブロブＴのクエリｑとの間の距離がブロブ探索半径Ｅｂ以下であるかどうかを判定する。 The information processing device 100D determines whether the blob T is within the blob search range (step S908). For example, the information processing device 100D determines whether the distance between the blob T and the query q is less than or equal to the blob search radius Eb.

情報処理装置１００Ｄは、ブロブＴがブロブ探索範囲内にある場合（ステップＳ９０８：Ｙｅｓ）、ブロブＴのエッジで接続されているブロブを全て処理したかどうかを判定する（ステップＳ９０９）。例えば、情報処理装置１００Ｄは、ブロブＴとの間がエッジで接続されているブロブ（「接続ブロブ」ともいう）についての処理有無を示す情報（「処理有無情報」）を用いて、ブロブＴの接続ブロブを全て処理したかどうかを判定する。例えば、情報処理装置１００Ｄは、ブロブＴの接続ブロブの処理有無をフラグ等の情報により管理してもよい。例えば、情報処理装置１００Ｄは、記憶部１２０Ｃに記憶された処理有無情報を処理状況に応じて更新し、記憶部１２０Ｃ中の処理有無情報を参照して、ブロブＴの接続ブロブを全て処理したかどうかを判定してもよい。 If the blob T is within the blob search range (step S908: Yes), the information processing device 100D determines whether all blobs connected by the edges of the blob T have been processed (step S909). For example, the information processing device 100D uses information indicating whether or not to process a blob (also referred to as a "connection blob") that is connected to the blob T by an edge ("processing/non-processing information"). Determine whether all connection blobs have been processed. For example, the information processing device 100D may manage whether or not a connected blob of the blob T is processed using information such as a flag. For example, the information processing device 100D updates the processing presence/absence information stored in the storage unit 120C according to the processing status, and refers to the processing presence/absence information in the storage unit 120C to determine whether all connected blobs of the blob T have been processed. You may decide whether

情報処理装置１００Ｄは、ブロブＴのエッジで接続されているブロブを全て処理していない場合（ステップＳ９０９：Ｎｏ）、ブロブＴからエッジで接続されているブロブＪを取得する（ステップＳ９１０）。例えば、情報処理装置１００Ｄは、ブロブＴとの間がエッジで接続されているブロブのうち、ブロブＴのエッジで接続されているブロブとしての処理が未処理であるブロブ（「未処理接続ブロブ」ともいう）をブロブＪとして取得する。例えば、情報処理装置１００Ｄは、処理有無情報を参照し、ブロブＴとの間がエッジで接続されているブロブのうち、未処理接続ブロブをブロブＪとして取得し、ブロブＪとして取得したブロブについて処理有無情報を処理済みに更新する。 If the information processing device 100D has not processed all the blobs connected by the edges of the blob T (step S909: No), the information processing device 100D acquires the blob J connected by the edges from the blob T (step S910). For example, the information processing device 100D selects a blob that is unprocessed as a blob that is connected to the blob T by an edge (an "unprocessed connected blob") among blobs that are connected to the blob T by an edge. ) is obtained as blob J. For example, the information processing device 100D refers to the processing presence/absence information, acquires an unprocessed connected blob as a blob J among the blobs connected to the blob T by an edge, and processes the blob acquired as the blob J. Update the presence/absence information to processed.

情報処理装置１００Ｄは、ブロブＪが距離算出済み集合に存在するかどうかを判定する（ステップＳ９１１）。情報処理装置１００Ｄは、ブロブＪが距離算出済み集合に存在する場合（ステップＳ９１１：Ｙｅｓ）、ステップＳ９０９に戻って処理を行う。 The information processing device 100D determines whether blob J exists in the distance calculation completed set (step S911). If the blob J exists in the distance-calculated set (step S911: Yes), the information processing device 100D returns to step S909 and performs the process.

また、情報処理装置１００Ｄは、ブロブＪが距離算出済み集合に存在しない場合（ステップＳ９１１：Ｎｏ）、ブロブＪを距離算出済み集合に追加し（ステップＳ９１２）、ブロブＪを未探索ブロブ集合に追加する（ステップＳ９１３）。そして、情報処理装置１００Ｄは、ブロブＪのセントロイドとクエリの距離Ｄｅを算出する（ステップＳ９１４）。 Further, if blob J does not exist in the distance calculated set (step S911: No), the information processing device 100D adds blob J to the distance calculated set (step S912), and adds blob J to the unsearched blob set. (Step S913). The information processing device 100D then calculates the distance De between the centroid of the blob J and the query (step S914).

情報処理装置１００Ｄは、距離Ｄｅがブロブ検索半径Ｒｂよりも小さいかどうかを判定する（ステップＳ９１５）。情報処理装置１００Ｄは、距離Ｄｅがブロブ検索半径Ｒｂよりも小さくない場合（ステップＳ９１５：Ｎｏ）、ブロブＪを除外ブロブ集合に追加し（ステップＳ９１６）、ステップＳ９０９に戻って処理を行う。 The information processing device 100D determines whether the distance De is smaller than the blob search radius Rb (step S915). If the distance De is not smaller than the blob search radius Rb (step S915: No), the information processing device 100D adds blob J to the excluded blob set (step S916), and returns to step S909 for processing.

また、情報処理装置１００Ｄは、距離Ｄｅがブロブ検索半径Ｒｂよりも小さい場合（ステップＳ９１５：Ｙｅｓ）、現最近傍ブロブＢを除外ブロブ集合に追加し、ブロブＪを現最近傍ブロブＢとする（ステップＳ９１７）。このように、情報処理装置１００Ｄは、距離Ｄｅがブロブ検索半径Ｒｂよりも小さい場合、現最近傍ブロブＢをブロブＪに更新する。 Further, if the distance De is smaller than the blob search radius Rb (step S915: Yes), the information processing device 100D adds the current nearest neighbor blob B to the excluded blob set and sets the blob J as the current nearest neighbor blob B ( Step S917). In this way, the information processing device 100D updates the current nearest neighbor blob B to the blob J when the distance De is smaller than the blob search radius Rb.

そして、情報処理装置１００Ｄは、ブロブＪに更新された現最近傍ブロブＢを基に、ブロブに関するパラメータを更新する（ステップＳ９１８）。図４３では、情報処理装置１００Ｄは、ブロブＪに更新された現最近傍ブロブＢを基にブロブ検索半径Ｒｂを「ｄ（ｑ,Ｂ）」に更新する。また、情報処理装置１００Ｄは、更新されたブロブ検索半径Ｒｂを基に、ブロブ探索半径Ｅｂを「Ｒｂ×（１＋ｅｂ）」に更新する。そして、情報処理装置１００Ｄは、ステップＳ９０９に戻って処理を行う。 Then, the information processing device 100D updates parameters related to the blob based on the current nearest neighbor blob B that has been updated to the blob J (step S918). In FIG. 43, the information processing device 100D updates the blob search radius Rb to “d(q,B)” based on the current nearest neighbor blob B updated to the blob J. Furthermore, the information processing device 100D updates the blob search radius Eb to “Rb×(1+eb)” based on the updated blob search radius Rb. The information processing apparatus 100D then returns to step S909 and performs the process.

情報処理装置１００Ｄは、ブロブＴのエッジで接続されているブロブを全て処理した場合（ステップＳ９０９：Ｙｅｓ）、ステップＳ９０６に戻って処理を行う。情報処理装置１００Ｄは、未探索ブロブ集合が空である場合（ステップＳ９０６：Ｙｅｓ）、ステップＳ９２０の処理を行う。また、情報処理装置１００Ｄは、ブロブＴがブロブ探索範囲内にない場合（ステップＳ９０８：Ｎｏ）、ブロブＴを未探索ブロブに戻して（ステップＳ９１９）、ステップＳ９２０の処理を行う。 When the information processing device 100D has processed all the blobs connected by the edges of the blob T (step S909: Yes), the information processing device 100D returns to step S906 and performs the processing. If the unsearched blob set is empty (step S906: Yes), the information processing device 100D performs the process of step S920. Further, if the blob T is not within the blob search range (step S908: No), the information processing device 100D returns the blob T to an unsearched blob (step S919), and performs the process of step S920.

ステップＳ９２０では、情報処理装置１００Ｄは、ブロブ検索数に１加える（ステップＳ９２０）。そして、情報処理装置１００Ｄは、ブロブＢ内の全てのオブジェクトの近似距離を算出し、最短距離を当該ブロブＢのブロブ距離とする（ステップＳ９２１）。例えば、情報処理装置１００Ｄは、ブロブＢ内の全オブジェクトについてクエリｑとの間の近似距離を算出し、クエリｑと、現最近傍ブロブＢに含まれるオブジェクトのうちクエリｑに最も近いオブジェクトとの間の距離を、ブロブＢのブロブ距離に設定する。なお、ブロブＢのブロブ距離は、ステップＳ９２４での終了判定の処理に用いられる。 In step S920, the information processing device 100D adds 1 to the number of blob searches (step S920). Then, the information processing device 100D calculates the approximate distances of all objects in the blob B, and sets the shortest distance as the blob distance of the blob B (step S921). For example, the information processing device 100D calculates approximate distances between all objects in the blob B and the query q, and calculates the distance between the query q and the object closest to the query q among the objects included in the current nearest neighbor blob B. Set the distance between them to the blob distance of blob B. Note that the blob distance of blob B is used in the end determination process in step S924.

情報処理装置１００Ｄは、ブロブＢ内の全てのオブジェクトを検索結果集合に追加、ソートし、指定された検索数上位のみを検索結果集合とする（ステップＳ９２２）。例えば、情報処理装置１００Ｄは、クエリｑの近傍のオブジェクトの候補を示す検索結果集合に、ブロブＢ内の全てのオブジェクトを追加する。そして、情報処理装置１００Ｄは、ブロブＢ内のオブジェクトを追加後の検索結果集合内のオブジェクトを、クエリｑからの距離が短い方から順にソートすることにより検索結果集合を更新し、更新した検索結果集合のうち指定された検索数（例えば５、１０等）よりも下位のオブジェクトを検索結果集合から除外する。例えば、情報処理装置１００Ｄは、ブロブＢ内のオブジェクトを追加後の検索結果集合内のオブジェクトのうち、クエリｑからの距離が短い方から検索数（例えば５、１０等）までのオブジェクト（すなわち、検索数の個数のオブジェクト）を残し、その他を除外する。 The information processing device 100D adds all the objects in the blob B to the search result set, sorts them, and sets only the designated top search result set as the search result set (step S922). For example, the information processing device 100D adds all objects in the blob B to a search result set indicating object candidates near the query q. The information processing device 100D then updates the search result set by sorting the objects in the search result set after adding the objects in blob B in descending order of distance from the query q, and updates the search result set with the updated search results. Objects lower than a specified number of searches (for example, 5, 10, etc.) in the set are excluded from the search result set. For example, the information processing device 100D selects objects in the search result set after adding the objects in blob B, starting from the one with the shortest distance from the query q, up to the number of searches (for example, 5, 10, etc.) (i.e., Keep the number of searched objects) and exclude the others.

そして、情報処理装置１００Ｄは、更新された検索結果集合を基に、オブジェクトに関するパラメータを更新する（ステップＳ９２３）。図４３では、情報処理装置１００Ｄは、更新された検索結果集合を基に、オブジェクト検索半径Ｒを、検索結果集合内のオブジェクトの最大距離に更新する。例えば、情報処理装置１００Ｄは、オブジェクト検索半径Ｒを、検索結果集合において内のクエリｑから最も遠いオブジェクトの距離に更新する。また、情報処理装置１００Ｄは、更新された検索結果集合を基に、オブジェクト探索半径Ｅを「Ｒ×（１＋ｅｏ）」に設定する。ここで、パラメータｅｏは、オブジェクト探索半径拡大率（「第２検索範囲係数」ともいう）であり、例えば０．０５等の任意の値が設定される。例えば、パラメータｅｏは、上述した検索範囲係数「ε」に対応する。例えば、情報処理装置１００Ｄは、オブジェクト探索半径Ｅを、オブジェクト検索半径Ｒに１＋ｅｏが乗算した値を設定する。パラメータｅｏとパラメータｅｂとは同じ値であってもよい。 Then, the information processing device 100D updates parameters related to the object based on the updated search result set (step S923). In FIG. 43, the information processing device 100D updates the object search radius R to the maximum distance of objects in the search result set based on the updated search result set. For example, the information processing device 100D updates the object search radius R to the distance of the object furthest from the query q in the search result set. Furthermore, the information processing device 100D sets the object search radius E to “R×(1+eo)” based on the updated search result set. Here, the parameter eo is an object search radius expansion rate (also referred to as a "second search range coefficient"), and is set to an arbitrary value such as 0.05, for example. For example, the parameter eo corresponds to the above-mentioned search range coefficient "ε". For example, the information processing device 100D sets the object search radius E to a value obtained by multiplying the object search radius R by 1+eo. Parameter eo and parameter eb may have the same value.

そして、情報処理装置１００Ｄは、終了基準＃１～＃３に当てはまるかどうかを判定する（ステップＳ９２４）。情報処理装置１００Ｄは、処理状況が各終了条件に基づく終了基準＃１～＃３のいずれかに当てはまるかどうかを判定する。終了基準＃１は、ブロブの検索上限数を超えたら終了とする。例えば、終了基準＃１は、パラメータであるブロブ検索数が検索上限数（例えば５、１０等の任意の値）を超えることを条件とする基準である。 Then, the information processing device 100D determines whether termination criteria #1 to #3 are met (step S924). The information processing device 100D determines whether the processing status meets any of termination criteria #1 to #3 based on each termination condition. Termination criterion #1 is that the search is terminated when the upper limit number of blobs is exceeded. For example, termination criterion #1 is a criterion that requires that the number of blob searches, which is a parameter, exceeds the upper limit number of searches (for example, an arbitrary value such as 5, 10, etc.).

終了基準＃２は、ブロブ距離がオブジェクトの探索半径を超えたら終了とする。例えば、終了基準＃２は、ブロブ距離がオブジェクト探索半径Ｅを超えることを条件とする基準である。終了基準＃３は、ブロブの探索半径がオブジェクトの探索半径を超えたら終了とする。例えば、終了基準＃３は、ブロブ探索半径Ｅｂがオブジェクト探索半径Ｅを超えることを条件とする基準である。なお、上記の終了基準＃１～＃３は一例に過ぎず、様々な終了基準が用いられてもよい。また、複数の終了基準が組み合わせて用いられてもよい。 Termination criterion #2 is to terminate when the blob distance exceeds the search radius of the object. For example, termination criterion #2 is a criterion that requires the blob distance to exceed the object search radius E. Termination criterion #3 is that the search is terminated when the blob search radius exceeds the object search radius. For example, termination criterion #3 is a criterion that requires the blob search radius Eb to exceed the object search radius E. Note that the above termination criteria #1 to #3 are merely examples, and various termination criteria may be used. Further, a plurality of termination criteria may be used in combination.

情報処理装置１００Ｄは、終了基準＃１～＃３に当てはまらない場合（ステップＳ９２４：Ｎｏ）、除外ブロブ集合から最短距離のブロブを取得し、現最近傍ブロブＢとする（ステップＳ９２５）。例えば、情報処理装置１００Ｄは、除外ブロブ集合のうち、クエリｑとの間の距離が最も短いブロブを現最近傍ブロブＢとして取得する。これにより、除外ブロブ集合からはその時点での最短距離のブロブが除かれる。例えば、情報処理装置１００Ｄは、除外ブロブ集合のうち、クエリｑとの間の距離が最も短いブロブに現最近傍ブロブＢを更新する。 If the termination criteria #1 to #3 are not applicable (step S924: No), the information processing device 100D obtains the blob with the shortest distance from the excluded blob set and sets it as the current nearest neighbor blob B (step S925). For example, the information processing device 100D obtains the blob with the shortest distance from the query q as the current nearest neighbor blob B from the set of excluded blobs. As a result, the blob with the shortest distance at that time is removed from the excluded blob set. For example, the information processing device 100D updates the current nearest neighbor blob B to the blob with the shortest distance to the query q among the set of excluded blobs.

そして、情報処理装置１００Ｄは、最短距離のブロブに更新された現最近傍ブロブＢを基に、ブロブに関するパラメータを更新する（ステップＳ９２６）。図４３では、情報処理装置１００Ｄは、最短距離のブロブに更新された現最近傍ブロブＢを基にブロブ検索半径Ｒｂを「ｄ（ｑ,Ｂ）」に更新する。また、情報処理装置１００Ｄは、更新されたブロブ検索半径Ｒｂを基に、ブロブ探索半径Ｅｂを「Ｒｂ×（１＋ｅｂ）」に更新する。そして、情報処理装置１００Ｄは、ステップＳ９０６に戻って処理を行う。 Then, the information processing device 100D updates parameters regarding the blob based on the current nearest neighbor blob B that has been updated to the blob with the shortest distance (step S926). In FIG. 43, the information processing device 100D updates the blob search radius Rb to "d(q,B)" based on the current nearest neighbor blob B that has been updated to the blob with the shortest distance. Furthermore, the information processing device 100D updates the blob search radius Eb to “Rb×(1+eb)” based on the updated blob search radius Rb. The information processing apparatus 100D then returns to step S906 and performs the process.

また、情報処理装置１００Ｄは、終了基準＃１～＃３に当てはまる場合（ステップＳ９２４：Ｙｅｓ）、処理を終了する。例えば、情報処理装置１００Ｄは、終了基準＃１～＃３に当てはまる場合、その時点で検索結果集合に含まれるオブジェクトをクエリｑの近傍のオブジェクトに決定する。そして、情報処理装置１００Ｄは、クエリｑの近傍のオブジェクトに決定したオブジェクトを示す情報を、クエリｑを指定した検索の要求元等に提供する。 Further, if the termination criteria #1 to #3 are met (step S924: Yes), the information processing device 100D terminates the process. For example, if the termination criteria #1 to #3 apply, the information processing device 100D determines the objects included in the search result set at that time to be objects in the vicinity of the query q. Then, the information processing device 100D provides information indicating the objects determined as objects in the vicinity of the query q to the requestor of the search that specified the query q.

〔６．第６の実施形態〕
上述した例では、オブジェクトを複数のブロブにクラスタリング（分類）し、各ブロブをさらに複数の量子化クラスタ（単に「クラスタ」ともいう）にクラスタリング（分類）する処理を行う場合を示した。例えば、第４の実施形態及び第５の実施形態では、第１グループ（ブロブ）の個数（第１数）がオブジェクトを量子化する単位となる第２グループ（クラスタ）の個数（第２数）より多い場合を示したが、第２グループ（クラスタ）の個数（第２数）が第１グループ（ブロブ）の個数（第１数）より多くてもよい。例えば、第６の実施形態に係る情報処理装置１００Ｅは、第２グループ（クラスタ）の分類をさらに第１グループ（ブロブ）に分類する階層的なクラスタリング（「階層クラスタリング」ともいう）により第１グループ及び第２グループを生成してもよい。すなわち、第２グループ（クラスタ）の方が第１グループ（ブロブ）よりも上位の階層であってもよい。なお、第４の実施形態及び第５の実施形態で示したように、第１グループ（ブロブ）の方が第２グループ（クラスタ）よりも上位の階層であってもよい。 [6. Sixth embodiment]
In the example described above, a case is shown in which an object is clustered (classified) into a plurality of blobs, and each blob is further clustered (classified) into a plurality of quantized clusters (also simply referred to as "clusters"). For example, in the fourth embodiment and the fifth embodiment, the number (first number) of the first group (blob) is the number (second number) of the second group (cluster) that is the unit for quantizing objects. Although the case where the number of clusters is larger is shown, the number of second groups (clusters) (second number) may be larger than the number of first groups (blobs) (first number). For example, the information processing apparatus 100E according to the sixth embodiment performs hierarchical clustering (also referred to as "hierarchical clustering") in which the classification of the second group (cluster) is further classified into the first group (blob). and a second group may be generated. That is, the second group (cluster) may be in a higher hierarchy than the first group (blob). Note that, as shown in the fourth embodiment and the fifth embodiment, the first group (blob) may be in a higher hierarchy than the second group (cluster).

この点について、以下、第６の実施形態として説明する。第６の実施形態においては、情報処理システム１は、情報処理装置１００、情報処理装置１００Ａ、情報処理装置１００Ｂ、情報処理装置１００Ｃまたは情報処理装置１００Ｄに代えて、情報処理装置１００Ｅを有する。なお、第１の実施形態、第２の実施形態、第３の実施形態、第４の実施形態及び第５の実施形態等において上述した内容と同様の点については適宜説明を省略する。 This point will be described below as a sixth embodiment. In the sixth embodiment, the information processing system 1 includes an information processing device 100E instead of the information processing device 100, the information processing device 100A, the information processing device 100B, the information processing device 100C, or the information processing device 100D. Note that descriptions of points similar to those described above in the first embodiment, second embodiment, third embodiment, fourth embodiment, fifth embodiment, etc. will be omitted as appropriate.

情報処理装置１００Ｅは、複数のオブジェクトを分類する複数のブロブを示すブロブ情報（第１グループ情報）と、ブロブとは異なる分類となるグループであるクラスタを示すクラスタ情報（第２グループ情報）とを生成する。以下では、複数のオブジェクトを第１グループ（ブロブ）に分類する分類処理を第１分類処理と記載し、複数のオブジェクトを第２グループ（クラスタ）に分類する分類処理を第２分類処理と記載する場合がある。 The information processing device 100E stores blob information (first group information) indicating a plurality of blobs that classify a plurality of objects, and cluster information (second group information) indicating a cluster, which is a group classified differently from the blobs. generate. In the following, a classification process for classifying multiple objects into a first group (blob) will be referred to as a first classification process, and a classification process for classifying multiple objects into a second group (cluster) will be referred to as a second classification process. There are cases.

〔６－１．情報処理〕
まず、図４４を用いて、第６の実施形態に係る情報処理の概要を説明する。図４４は、第６の実施形態に係る情報処理の一例を示す図である。具体的には、図４４は、第６の実施形態における第１グループ及び第２グループの第１の生成例を示す図である。なお、以下では、情報処理装置１００Ｅが第２分類処理を実行した後、第１分類処理を実行する場合を示すが、所望の第１グループ及び第２グループが生成可能であれば、第１分類処理を実行した後、第２分類処理を実行してもよい。 [6-1. Information processing〕
First, an overview of information processing according to the sixth embodiment will be explained using FIG. 44. FIG. 44 is a diagram illustrating an example of information processing according to the sixth embodiment. Specifically, FIG. 44 is a diagram showing a first generation example of the first group and the second group in the sixth embodiment. Note that the following describes a case where the information processing device 100E executes the first classification process after executing the second classification process, but if the desired first group and second group can be generated, the first classification After performing the process, a second classification process may be performed.

情報処理装置１００Ｅによる情報処理の説明に先立って、図４４の図示態様について簡単に説明する。図４４では、ブロブの区別を実線の太い線で示すとともに、ブロブ間の連結をブロブＢＬのセントロイドＣ間を繋ぐ点線で示す。このように、各ブロブＢＬのセントロイドを区別せずに説明する場合、「セントロイドＣ」と記載する。例えば、実線のうち、太い線がブロブに対応する分割を示し、細い線がクラスタに対応する分割を示す。例えば、図４４では、実線のうち細い線はブロブ内でのクラスタに対応する分割を示す。 Prior to explaining information processing by the information processing device 100E, the illustrated aspect of FIG. 44 will be briefly explained. In FIG. 44, the distinction between blobs is shown by solid thick lines, and the connections between blobs are shown by dotted lines connecting centroids C of blobs BL. In this way, when explaining the centroid of each blob BL without distinguishing it, it will be described as "centroid C." For example, among the solid lines, thick lines indicate divisions corresponding to blobs, and thin lines indicate divisions corresponding to clusters. For example, in FIG. 44, thin solid lines indicate divisions corresponding to clusters within a blob.

図４４は、セントロイドＣＮ２１～ＣＮ３０は、各クラスタＣＬ２１～ＣＬ３０の各々のセントロイドを示す。なお、各クラスタＣＬのセントロイドを区別せずに説明する場合、「セントロイドＣＮ」と記載する。また、図４４は、セントロイドＣ２１～Ｃ２４は、各ブロブＢＬ２１～ＢＬ２４の各々のセントロイドを示す。ブロブＢＬ２１には、クラスタＣＬ２１、ＣＬ２２、ＣＬ２３の３つのクラスタＣＬが含まれる。クラスタＣＬ２１には、ノードＮ３１、Ｎ２０５等が含まれる。ブロブＢＬ２２には、クラスタＣＬ２４、ＣＬ２５の２つのクラスタＣＬが含まれる。ブロブＢＬ２３には、クラスタＣＬ２６、ＣＬ２７、ＣＬ２８の３つのクラスタＣＬが含まれる。ブロブＢＬ２４には、クラスタＣＬ２９、ＣＬ３０の２つのクラスタＣＬが含まれる。なお、図４４に示すブロブＢＬ、クラスタＣＬ及びノードＮ等は一部であり、処理対象となるブロブＢＬ、クラスタＣＬ及びノードＮには、図示したもの以外にも多数のブロブＢＬ、クラスタＣＬ及びノードＮが含まれる。なお、空間情報ＳＰ７１に示すグラフＧＲ７１を用いた具体的な検索処理の流れについては図４８で示す。 In FIG. 44, centroids CN21 to CN30 indicate the centroids of each cluster CL21 to CL30. In addition, when explaining the centroid of each cluster CL without distinguishing it, it is written as "centroid CN." Further, in FIG. 44, centroids C21 to C24 indicate the centroids of each of the blobs BL21 to BL24. Blob BL21 includes three clusters CL21, CL22, and CL23. Cluster CL21 includes nodes N31, N205, etc. The blob BL22 includes two clusters CL24 and CL25. Blob BL23 includes three clusters CL26, CL27, and CL28. The blob BL24 includes two clusters CL29 and CL30. Note that the blob BL, cluster CL, node N, etc. shown in FIG. Node N is included. Note that the flow of a specific search process using the graph GR71 shown in the spatial information SP71 is shown in FIG.

情報処理装置１００Ｅによる第１グループ及び第２グループの生成処理の一例について説明する。まず、情報処理装置１００Ｅは、第２分類処理を実行し、複数のオブジェクトを複数のブロブに分類するブロブ情報を生成する。例えば、情報処理装置１００Ｅは、任意の方法により、複数のオブジェクトを分類するブロブを示すブロブ情報を生成する。情報処理装置１００Ｅは、ｋ－ｍｅａｎｓ等の任意のクラスタリングにより複数のオブジェクトを複数のブロブに分類する第２分類処理を実行する。 An example of the first group and second group generation processing performed by the information processing device 100E will be described. First, the information processing device 100E executes a second classification process to generate blob information for classifying a plurality of objects into a plurality of blobs. For example, the information processing device 100E generates blob information indicating a blob that classifies a plurality of objects using an arbitrary method. The information processing device 100E executes a second classification process of classifying a plurality of objects into a plurality of blobs using arbitrary clustering such as k-means.

図４４の例では、情報処理装置１００Ｅは、複数のオブジェクトをブロブＢＬ２１～ＢＬ２４等の複数のブロブに分類する。なお、情報処理装置１００Ｅは、複数のオブジェクトを生成可能であれば、どのような方法によりクラスタを生成してもよい。例えば、情報処理装置１００Ｅは、グラフ（例えばグラフＧＲ７１）等のインデックスを用いて、ブロブ情報を生成してもよい。 In the example of FIG. 44, the information processing device 100E classifies a plurality of objects into a plurality of blobs such as blobs BL21 to BL24. Note that the information processing apparatus 100E may use any method to generate clusters as long as it can generate a plurality of objects. For example, the information processing device 100E may generate blob information using an index of a graph (for example, graph GR71).

また、情報処理装置１００Ｅは、複数のオブジェクトを複数のクラスタに分類するクラスタ情報を生成する（ステップＳ７１）。情報処理装置１００Ｅは、任意の方法により、複数のオブジェクトをクラスタＣＬ２１～ＣＬ３０等に分類するクラスタ情報を生成する。例えば、情報処理装置１００Ｅは、第２分類処理を実行により生成した情報を用いて、複数のオブジェクトを複数のクラスタに分類するクラスタ情報を生成する。図４４の例では、情報処理装置１００Ｅは、ブロブＢＬ２１～ＢＬ２４等の複数のブロブの各々を複数のクラスタに分類する第１分類処理を実行することにより、複数のオブジェクトを複数のクラスタに分類するクラスタ情報を生成する。情報処理装置１００Ｅは、ブロブＢＬ２１～ＢＬ２４等の複数のブロブの各々に含まれる複数のオブジェクトごとに複数のクラスタに分類することにより、複数のオブジェクトを複数のクラスタに分類するクラスタ情報を生成する。 Further, the information processing device 100E generates cluster information for classifying a plurality of objects into a plurality of clusters (step S71). The information processing device 100E generates cluster information for classifying a plurality of objects into clusters CL21 to CL30, etc., using an arbitrary method. For example, the information processing device 100E uses information generated by executing the second classification process to generate cluster information for classifying a plurality of objects into a plurality of clusters. In the example of FIG. 44, the information processing device 100E classifies a plurality of objects into a plurality of clusters by performing a first classification process of classifying each of a plurality of blobs, such as blobs BL21 to BL24, into a plurality of clusters. Generate cluster information. The information processing device 100E generates cluster information for classifying a plurality of objects into a plurality of clusters by classifying a plurality of objects included in each of a plurality of blobs such as blobs BL21 to BL24 into a plurality of clusters.

なお、上記は一例に過ぎず、情報処理装置１００Ｅは、クラスタを生成可能であれば、どのような方法によりクラスタを生成してもよい。情報処理装置１００Ｅは、ｋ－ｍｅａｎｓ等の任意のクラスタリングにより各ノード（オブジェクト）をクラスタＣＬ２１～ＣＬ３０のいずれかに分類してもよい。例えば、情報処理装置１００Ｅは、グラフ等のインデックスを用いて、クラスタを生成してもよい。 Note that the above is just an example, and the information processing apparatus 100E may use any method to generate clusters as long as it can generate clusters. The information processing device 100E may classify each node (object) into one of the clusters CL21 to CL30 using arbitrary clustering such as k-means. For example, the information processing apparatus 100E may generate clusters using an index such as a graph.

図４４の例では、情報処理装置１００Ｅは、ブロブＢＬ２１～ＢＬ２４等をさらに分割したクラスタＣＬ２１～ＣＬ３０等のクラスタ情報を生成する。例えば、情報処理装置１００Ｅは、ブロブＢＬ２１～ＢＬ２４ごとに、各ブロブに属するオブジェクトをクラスタリングすることにより、各オブジェクトをクラスタＣＬ２１～ＣＬ３０のいずれかに分類してもよい。 In the example of FIG. 44, the information processing device 100E generates cluster information such as clusters CL21 to CL30, which are obtained by further dividing blobs BL21 to BL24, etc. For example, the information processing device 100E may classify each object into one of the clusters CL21 to CL30 by clustering the objects belonging to each blob BL21 to BL24.

図４４の例では、情報処理装置１００Ｅは、ブロブＢＬ２１をクラスタＣＬ２１、クラスタＣＬ２２及びクラスタＣＬ２３に分割する。すなわち、情報処理装置１００Ｅは、ブロブＢＬ２１属するオブジェクトを、クラスタＣＬ２１に属するオブジェクト、クラスタＣＬ２２に属するオブジェクト及びクラスタＣＬ２３に属するオブジェクトに分類する。なお、図４４では、クラスタの境界線がブロブの境界線上にあるように図示したが、分割の方法によっては一致しない場合があり、また方法によっては空間が境界線で分割されるとも限らない。また、図４４は、生成処理を説明するためにオブジェクト、セントロイド等を概念的に示すものであり、セントロイド等の位置は正確な位置を示すものではない。 In the example of FIG. 44, the information processing device 100E divides the blob BL21 into a cluster CL21, a cluster CL22, and a cluster CL23. That is, the information processing device 100E classifies objects belonging to the blob BL21 into objects belonging to the cluster CL21, objects belonging to the cluster CL22, and objects belonging to the cluster CL23. Note that although the cluster boundaries are shown to be on the blob boundaries in FIG. 44, they may not match depending on the dividing method, and the space may not always be divided by the boundary lines depending on the dividing method. Further, FIG. 44 conceptually shows objects, centroids, etc. in order to explain the generation process, and the positions of centroids, etc. do not indicate accurate positions.

また、情報処理装置１００Ｅは、ブロブＢＬ２２をクラスタＣＬ２４及びクラスタＣＬ２５に分割する。例えば、情報処理装置１００Ｅは、ブロブＢＬ２３をクラスタＣＬ２６、ＣＬ２７及びクラスタＣＬ２８に分割する。情報処理装置１００Ｅは、ブロブＢＬ２４をクラスタＣＬ２９及びクラスタＣＬ３０に分割する。このように、情報処理装置１００Ｅは、第１グループの情報を用いて、第２グループの情報を生成する。このように、情報処理装置１００Ｅは、第１分類処理により生成した第１グループ情報を用いて、第２分類処理を行う。 Furthermore, the information processing device 100E divides the blob BL22 into a cluster CL24 and a cluster CL25. For example, the information processing device 100E divides the blob BL23 into clusters CL26, CL27, and cluster CL28. The information processing device 100E divides the blob BL24 into a cluster CL29 and a cluster CL30. In this way, the information processing device 100E uses the first group of information to generate the second group of information. In this way, the information processing device 100E performs the second classification process using the first group information generated by the first classification process.

なお、図４４では、情報処理装置１００Ｅは、空間情報ＳＰ７１に示すように、ブロブ連結グラフであるグラフＧＲ７１を生成するが、グラフＧＲ７１を生成しなくてもよい。例えば、情報処理装置１００Ｅは、グラフＧＲ７１を生成する場合、図２３及び図２９等と同様の情報を用いた処理によりグラフＧＲ７１を生成するが、図２３及び図２９等での説明と同様であるため詳細な説明を省略する。 Note that in FIG. 44, the information processing device 100E generates the graph GR71, which is a blob connection graph, as shown in the spatial information SP71, but it is not necessary to generate the graph GR71. For example, when generating the graph GR71, the information processing device 100E generates the graph GR71 by processing using information similar to that in FIGS. 23, 29, etc. Therefore, detailed explanation will be omitted.

例えば、情報処理装置１００Ｅは、ブロブ情報を用いて量子化に関する処理を実行する。情報処理装置１００Ｅは、ブロブ情報を用いて複数のオブジェクトを対象とする量子化に関する処理を行う。例えば、情報処理装置１００Ｅは、クラスタＣＬ２１～ＣＬ３０の各々に属するオブジェクトごとに量子化を行うが、この点については後述する。 For example, the information processing device 100E executes processing related to quantization using blob information. The information processing device 100E performs processing related to quantization targeting a plurality of objects using blob information. For example, the information processing device 100E performs quantization for each object belonging to each of the clusters CL21 to CL30, but this point will be described later.

上記のように、情報処理装置１００Ｅは、データ検索の対象となる複数のオブジェクトを分類する複数のクラスタを示すクラスタ情報と、複数のクラスタとは異なる分類となる複数のブロブを示すブロブ情報とを生成する。具体的には、情報処理装置１００Ｅは、第１数の第１グループ（ブロブ）と第１数よりも多い第２数の第２グループ（クラスタ）とを生成する。 As described above, the information processing device 100E stores cluster information indicating a plurality of clusters that classify a plurality of objects that are data search targets, and blob information indicating a plurality of blobs that are classified differently from the plurality of clusters. generate. Specifically, the information processing device 100E generates a first number of first groups (blobs) and a second number of second groups (clusters) that is larger than the first number.

これにより、情報処理装置１００Ｅは、複数のオブジェクトを分類する２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。例えば、一つの量子化クラスタ（第２グループ）に複数のブロブ（第１グループ）が包含されるデータ構造の場合に高い検索性能が出ない場合がある。このような場合であっても、情報処理装置１００Ｅは、第１数の第１グループと第１数よりも多い第２数の第２グループとを生成することにより、適切なグループの関係にして検索性能を向上させることが可能となる。 Thereby, the information processing apparatus 100E can generate two types of groups for classifying a plurality of objects, and can generate groups for classifying objects. For example, in the case of a data structure in which a plurality of blobs (first group) are included in one quantization cluster (second group), high search performance may not be achieved. Even in such a case, the information processing device 100E maintains an appropriate group relationship by generating a first group with a first number and a second group with a second number larger than the first number. It becomes possible to improve search performance.

〔６－１－１．他の処理例〕
なお、図４４に示した処理は一例に過ぎず、情報処理装置１００Ｅは、ブロブ情報及びクラスタ情報を生成可能であれば、どのような態様の処理により生成を行ってもよい。 [6-1-1. Other processing examples]
Note that the process shown in FIG. 44 is only an example, and the information processing apparatus 100E may perform any type of process to generate blob information and cluster information as long as it is capable of generating the blob information and cluster information.

例えば、情報処理装置１００Ｅは、クラスタに分類するクラスタ情報を生成した後に、ブロブに分類するブロブ情報を生成してもよい。この場合、情報処理装置１００Ｅは、複数のオブジェクトをクラスタに分類するクラスタリングにより、各オブジェクトを複数のクラスタのいずれかの分類するクラスタ情報を生成してもよい。例えば、情報処理装置１００Ｅは、ｋ－ｍｅａｎｓ等の任意のクラスタリングにより各オブジェクトをクラスタＣＬ２１～ＣＬ３０のいずれかに分類してもよい。 For example, the information processing device 100E may generate cluster information for classifying into clusters and then generate blob information for classifying into blobs. In this case, the information processing apparatus 100E may generate cluster information for classifying each object into one of the plurality of clusters by performing clustering in which the plurality of objects are classified into clusters. For example, the information processing device 100E may classify each object into one of the clusters CL21 to CL30 using arbitrary clustering such as k-means.

そして、情報処理装置１００Ｅは、クラスタＣＬ２１～ＣＬ３０等のブロブＢＬ２１～ＢＬ２４等に分類するブロブ情報を生成する。情報処理装置１００Ｅは、ｋ－ｍｅａｎｓ等の任意のクラスタリングによりクラスタＣＬ２１～ＣＬ３０等を複数のブロブのいずれかに分類してもよい。なお、情報処理装置１００Ｅは、ブロブを生成可能であれば、どのような方法によりブロブを生成してもよい。例えば、情報処理装置１００Ｅは、グラフ等のインデックスを用いて、ブロブを生成してもよい。 Then, the information processing device 100E generates blob information that is classified into blobs BL21 to BL24, etc., such as clusters CL21 to CL30. The information processing device 100E may classify clusters CL21 to CL30, etc. into any of a plurality of blobs using arbitrary clustering such as k-means. Note that the information processing device 100E may use any method to generate blobs as long as it is capable of generating blobs. For example, the information processing device 100E may generate a blob using an index such as a graph.

例えば、情報処理装置１００Ｅは、クラスタＣＬ２１、クラスタＣＬ２２及びクラスタＣＬ２３をブロブＢＬ２１に分類する。また、情報処理装置１００Ｅは、クラスタＣＬ２４及びクラスタＣＬ２５をブロブＢＬ２２に分類する。例えば、情報処理装置１００Ｅは、クラスタＣＬ２６、ＣＬ２７及びクラスタＣＬ２８をブロブＢＬ２３に分類する。情報処理装置１００Ｅは、クラスタＣＬ２９及びクラスタＣＬ３０をブロブＢＬ２４に分類する。これにより、情報処理装置１００Ｅは、第２グループの情報を用いて、第１グループの情報を生成する。このように、情報処理装置１００Ｅは、第２分類処理により生成した第２グループ情報を用いて、第１分類処理を行ってもよい。 For example, the information processing device 100E classifies cluster CL21, cluster CL22, and cluster CL23 into blob BL21. Furthermore, the information processing device 100E classifies cluster CL24 and cluster CL25 into blob BL22. For example, the information processing device 100E classifies clusters CL26, CL27, and cluster CL28 into blob BL23. The information processing device 100E classifies cluster CL29 and cluster CL30 into blob BL24. Thereby, the information processing device 100E generates the first group of information using the second group of information. In this way, the information processing device 100E may perform the first classification process using the second group information generated by the second classification process.

〔６－２．情報処理装置の構成〕
次に、図４５を用いて、第６の実施形態に係る情報処理装置１００Ｅの構成について説明する。図４５は、第６の実施形態に係る情報処理装置の構成例を示す図である。図４５に示すように、情報処理装置１００Ｅは、通信部１１０と、記憶部１２０Ｃと、制御部１３０Ｂとを有する。なお、情報処理装置１００Ｅにおいて、情報処理装置１００、情報処理装置１００Ａ、情報処理装置１００Ｂ、情報処理装置１００Ｃまたは情報処理装置１００Ｄと同様の点は適宜説明を省略する。 [6-2. Configuration of information processing device]
Next, the configuration of the information processing apparatus 100E according to the sixth embodiment will be described using FIG. 45. FIG. 45 is a diagram illustrating a configuration example of an information processing apparatus according to the sixth embodiment. As shown in FIG. 45, the information processing device 100E includes a communication section 110, a storage section 120C, and a control section 130B. Note that in the information processing device 100E, descriptions of the same points as the information processing device 100, the information processing device 100A, the information processing device 100B, the information processing device 100C, or the information processing device 100D will be omitted as appropriate.

（記憶部１２０Ｃ）
記憶部１２０Ｃは、例えば、ＲＡＭ、フラッシュメモリ等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。第６の実施形態に係る記憶部１２０Ｃは、図４５に示すように、オブジェクト情報記憶部１２１と、グラフ情報記憶部１２２と、量子化情報記憶部１２３と、コードブック情報記憶部１２４と、ブロブ情報記憶部１２５と、ブロブ連結グラフ情報記憶部１２６と、クラスタ情報記憶部１２７とを有する。 (Storage unit 120C)
The storage unit 120C is realized by, for example, a semiconductor memory device such as a RAM or a flash memory, or a storage device such as a hard disk or an optical disk. As shown in FIG. 45, the storage unit 120C according to the sixth embodiment includes an object information storage unit 121, a graph information storage unit 122, a quantization information storage unit 123, a codebook information storage unit 124, and a blob information storage unit 121. It has an information storage section 125, a blob connection graph information storage section 126, and a cluster information storage section 127.

（制御部１３０Ｅ）
制御部１３０Ｅは、コントローラ（controller）であり、例えば、ＣＰＵやＭＰＵやＧＰＵ等によって、情報処理装置１００Ｅ内部の記憶装置に記憶されている各種プログラム（情報処理プログラムの一例に相当）がＲＡＭを作業領域として実行されることにより実現される。また、制御部１３０Ｅは、コントローラであり、例えば、ＡＳＩＣやＦＰＧＡ等の集積回路により実現される。 (Control unit 130E)
The control unit 130E is a controller, and allows various programs (corresponding to an example of an information processing program) stored in a storage device inside the information processing device 100E to operate the RAM by, for example, a CPU, MPU, GPU, etc. This is achieved by executing it as a region. Further, the control unit 130E is a controller, and is realized by, for example, an integrated circuit such as an ASIC or an FPGA.

図４５に示すように、制御部１３０Ｅは、取得部１３１と、生成部１３２Ｅと、検索処理部１３３Ｅと、提供部１３４とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部１３０Ｅの内部構成は、図４５に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。 As shown in FIG. 45, the control unit 130E includes an acquisition unit 131, a generation unit 132E, a search processing unit 133E, and a provision unit 134, and realizes or executes information processing functions and operations described below. do. Note that the internal configuration of the control unit 130E is not limited to the configuration shown in FIG. 45, and may be any other configuration as long as it performs information processing to be described later.

第６の実施形態に係る取得部１３１は、データ検索の対象となる複数のオブジェクトを示すオブジェクト情報を取得する。取得部１３１は、複数のオブジェクトを検索対象とするインデックスを示すインデックス情報を取得する。 The acquisition unit 131 according to the sixth embodiment acquires object information indicating a plurality of objects that are data search targets. The acquisition unit 131 acquires index information indicating an index that searches for a plurality of objects.

第６の実施形態に係る取得部１３１は、第１の実施形態～第５の実施形態に係る取得部１３１と同様に各種情報を取得する。例えば、取得部１３１は、データ検索の対象となる複数のオブジェクトを示すオブジェクト情報を取得する。例えば、取得部１３１は、複数のオブジェクトを検索対象とするインデックスを示すインデックス情報を取得する。 The acquisition unit 131 according to the sixth embodiment acquires various information in the same way as the acquisition unit 131 according to the first to fifth embodiments. For example, the acquisition unit 131 acquires object information indicating a plurality of objects that are data search targets. For example, the acquisition unit 131 acquires index information indicating an index that searches for multiple objects.

（生成部１３２Ｅ）
生成部１３２Ｅは、生成部１３２、生成部１３２Ａ、生成部１３２Ｂまたは生成部１３２Ｃと同様に各種情報を生成する。 (Generation unit 132E)
The generating unit 132E generates various information in the same way as the generating unit 132, 132A, 132B, or 132C.

生成部１３２Ｅは、取得部１３１により取得されたオブジェクト情報に基づいて、複数のオブジェクトを分類する第１数のグループである複数の第１グループを示す第１グループ情報と、複数の第１グループとは異なる分類となるグループであって、複数のオブジェクトを分類する第１数よりも多い第２数のグループである複数の第２グループを示す第２グループ情報とを生成する。 Based on the object information acquired by the acquisition unit 131, the generation unit 132E generates first group information indicating a plurality of first groups, which are a first number of groups into which a plurality of objects are classified, and a plurality of first groups. generates second group information indicating a plurality of second groups, which are groups of different classifications and are a second number of groups that is larger than the first number for classifying the plurality of objects.

生成部１３２Ｅは、複数のオブジェクトのうち、複数の第２グループにおいて同じ第２グループに属するオブジェクトを同じ第１グループに分類するグループである第１グループ情報を生成する。生成部１３２Ｅは、複数の第２グループのうち二以上の第２グループの各々に属するオブジェクト群が一の第１グループに分類された複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information that is a group that classifies objects belonging to the same second group among the plurality of second groups into the same first group. The generation unit 132E generates first group information indicating a plurality of first groups in which objects belonging to each of two or more second groups among the plurality of second groups are classified into one first group.

生成部１３２Ｅは、複数のオブジェクトを対象とする検索処理において一括処理を行うために用いられる複数の第１グループを示す第１グループ情報を生成する。生成部１３２Ｅは、複数のオブジェクトに関する距離の算出を一括して行うために用いられる複数の第１グループを示す第１グループ情報を生成する。生成部１３２Ｅは、検索処理で用いられる検索クエリとオブジェクトとの距離の算出を並列処理するために用いられる複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information indicating a plurality of first groups used for performing batch processing in a search process targeting a plurality of objects. The generation unit 132E generates first group information indicating a plurality of first groups used to collectively calculate distances regarding a plurality of objects. The generation unit 132E generates first group information indicating a plurality of first groups used to calculate the distance between the search query and the object used in the search process in parallel.

生成部１３２Ｅは、複数のオブジェクトを対象とする量子化に関する処理を行うために用いられる複数の第２グループを示す第２グループ情報を生成する。生成部１３２Ｅは、量子化の単位となる複数の第２グループを示す第２グループ情報を生成する。生成部１３２Ｅは、複数のオブジェクトの各々に対応するベクトルを量子化するために用いられる複数の第２グループを示す第２グループ情報を生成する。 The generation unit 132E generates second group information indicating a plurality of second groups used to perform processing related to quantization targeting a plurality of objects. The generation unit 132E generates second group information indicating a plurality of second groups that are units of quantization. The generation unit 132E generates second group information indicating a plurality of second groups used to quantize vectors corresponding to each of the plurality of objects.

生成部１３２Ｅは、複数の第２グループの分類後に、複数の第１グループの分類を行う。生成部１３２Ｅは、複数のオブジェクトをクラスタリングすることにより複数の第２グループを示す第２グループ情報を生成する。生成部１３２Ｅは、インデックス情報を用いて、複数の第２グループを示す第２グループ情報を生成する。生成部１３２Ｅは、第２グループを示す第２グループ情報を用いて、複数の第１グループを示す第１グループ情報を生成する。 After classifying the plurality of second groups, the generation unit 132E performs classification of the plurality of first groups. The generation unit 132E generates second group information indicating a plurality of second groups by clustering a plurality of objects. The generation unit 132E uses the index information to generate second group information indicating a plurality of second groups. The generation unit 132E generates first group information indicating a plurality of first groups using second group information indicating a second group.

生成部１３２Ｅは、複数の第２グループをクラスタリングすることにより複数の第１グループを示す第１グループ情報を生成する。生成部１３２Ｅは、複数の第１グループの分類後に、複数の第２グループの分類を行う。生成部１３２Ｅは、複数のオブジェクトをクラスタリングすることにより複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information indicating a plurality of first groups by clustering a plurality of second groups. After classifying the plurality of first groups, the generation unit 132E performs classification of the plurality of second groups. The generation unit 132E generates first group information indicating a plurality of first groups by clustering a plurality of objects.

生成部１３２Ｅは、第１グループを示す第１グループ情報を用いて、複数の第２グループを示す第２グループ情報を生成する。生成部１３２Ｅは、複数の第１グループのうち、少なくとも一の第１グループを分割し、二以上の第２グループを生成することにより、複数の第２グループを示す第２グループ情報を生成する。生成部１３２Ｅは、各第１グループに属するオブジェクトをクラスタリングし、各第１グループを二以上の第２グループに分割することにより、複数の第２グループを示す第２グループ情報を生成する。 The generation unit 132E generates second group information indicating a plurality of second groups using first group information indicating the first group. The generating unit 132E generates second group information indicating the plurality of second groups by dividing at least one first group among the plurality of first groups and generating two or more second groups. The generation unit 132E generates second group information indicating a plurality of second groups by clustering objects belonging to each first group and dividing each first group into two or more second groups.

生成部１３２Ｅは、取得部１３１により取得されたオブジェクト情報に基づいて、複数のオブジェクトを複数の第１グループに分類する分類処理である第１分類処理と、複数のオブジェクトを複数の第１グループとは異なる分類となるグループである複数の第２グループに分類する分類処理である第２分類処理とのうち、いずれか一方の分類処理を実行し、一方の分類処理後に他方の分類処理を一方の分類処理の結果を用いて実行することにより、複数のオブジェクトを分類する複数の第１グループを示す第１グループ情報と、複数のオブジェクトを分類する複数の第２グループを示す第２グループ情報とを生成する。 The generation unit 132E performs a first classification process, which is a classification process of classifying a plurality of objects into a plurality of first groups, based on the object information acquired by the acquisition unit 131, and a first classification process that classifies a plurality of objects into a plurality of first groups. are groups with different classifications. Execute one of the second classification processes, which is a classification process to classify into a plurality of second groups, and after one classification process, perform the other classification process. By executing the classification process using the results, first group information indicating a plurality of first groups for classifying a plurality of objects and second group information indicating a plurality of second groups for classifying a plurality of objects are generated. generate.

生成部１３２Ｅは、複数のオブジェクトのうち、複数の第２グループにおいて同じ第２グループに属するオブジェクトを同じ第１グループに分類するグループである第１グループ情報を生成する。生成部１３２Ｅは、少なくとも一の第２グループに属するオブジェクト群のうち一のオブジェクトが属する一の第１グループとは異なる他の第１グループに他のオブジェクトが分類された複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information that is a group that classifies objects belonging to the same second group among the plurality of second groups into the same first group. The generating unit 132E indicates a plurality of first groups in which other objects are classified into another first group different from the one first group to which one object belongs among the objects belonging to at least one second group. Generate first group information.

生成部１３２Ｅは、一方の分類処理である第２分類処理後に、他方の分類処理である第１分類処理を行うことにより、第１グループ情報と第２グループ情報とを生成する。生成部１３２Ｅは、複数のオブジェクトをクラスタリングする第２分類処理により複数の第２グループを示す第２グループ情報を生成する。生成部１３２Ｅは、インデックス情報を用いた第２分類処理により、第２グループ情報を生成する。 The generation unit 132E generates the first group information and the second group information by performing the first classification process, which is the other classification process, after the second classification process, which is one of the classification processes. The generation unit 132E generates second group information indicating a plurality of second groups by a second classification process that clusters a plurality of objects. The generation unit 132E generates second group information by a second classification process using index information.

生成部１３２Ｅは、第２分類処理により生成した第２グループ情報を用いた第１分類処理により、第１グループ情報を生成する。生成部１３２Ｅは、複数のオブジェクトをクラスタリングしたクラスタリング結果を用いた第１分類処理により複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information through a first classification process using the second group information generated through the second classification process. The generation unit 132E generates first group information indicating a plurality of first groups by performing a first classification process using a clustering result obtained by clustering a plurality of objects.

生成部１３２Ｅは、複数の第２グループの各々に対応する第２セントロイドを用いた第１分類処理により、第１グループ情報を生成する。生成部１３２Ｅは、第２グループの各々を、第１グループのうち、属するオブジェクトが最大の第１グループである候補第１グループに属させる第１分類処理により、第１グループ情報を生成する。 The generation unit 132E generates first group information through a first classification process using second centroids corresponding to each of the plurality of second groups. The generation unit 132E generates first group information by performing a first classification process that causes each of the second groups to belong to a candidate first group, which is the first group to which the largest object belongs among the first groups.

生成部１３２Ｅは、第２グループの各々を複数に分割した部分グループに対応する部分セントロイドを用いた第１分類処理により、第１グループ情報を生成する。生成部１３２Ｅは、第２グループの各々を、第１グループのうち、属する部分セントロイドが最大の第１グループである候補第１グループに属させる第１分類処理により、第１グループ情報を生成する。 The generation unit 132E generates first group information by performing a first classification process using partial centroids corresponding to partial groups obtained by dividing each of the second groups into a plurality of parts. The generation unit 132E generates first group information through a first classification process that causes each of the second groups to belong to a candidate first group, which is the first group to which the partial centroid to which it belongs is the largest among the first groups. .

生成部１３２Ｅは、一の第２グループに候補第１グループが複数ある場合、複数の候補第１グループのうち、一の第２グループに対応する第２セントロイドにセントロイドの最も近い第１グループに一の第２グループを属させる第１分類処理により、第１グループ情報を生成する。 When there are a plurality of candidate first groups in one second group, the generation unit 132E selects the first group whose centroid is closest to the second centroid corresponding to one second group among the plurality of candidate first groups. First group information is generated by a first classification process that assigns one second group to the second group.

（検索処理部１３３Ｅ）
検索処理部１３３Ｅは、図４８に示す検索処理を実行する。例えば、検索処理部１３３Ｅは、検索処理部１３３、検索処理部１３３Ａ、検索処理部１３３Ｂ、検索処理部１３３Ｃ、または検索処理部１３３Ｄと同様に検索処理に関する各種処理を行う。検索処理部１３３Ｅは、記憶部１２０Ｃに記憶された情報を用いた処理を行う。検索処理部１３３Ｅは、生成部１３２Ｅにより生成された情報を用いた処理を行う。 (Search processing unit 133E)
The search processing unit 133E executes the search process shown in FIG. 48. For example, the search processing section 133E performs various processes related to search processing in the same way as the search processing section 133, the search processing section 133A, the search processing section 133B, the search processing section 133C, or the search processing section 133D. The search processing unit 133E performs processing using the information stored in the storage unit 120C. The search processing unit 133E performs processing using the information generated by the generation unit 132E.

例えば、検索処理部１３３Ｅは、複数のグループの検索に対応する範囲を示す第１検索範囲と、複数のオブジェクトの検索に対応する範囲を示す第２検索範囲とを用いて、グループ連結グラフを探索することにより、複数のオブジェクトから、検索クエリに対応する近傍のオブジェクトを抽出する検索処理を実行する。検索処理部１３３Ｅは、第１検索範囲と、第１検索範囲よりも範囲が広い第２検索範囲とを用いて、検索処理を実行する。なお、検索処理部１３３Ｅが実行する検索処理は、第２グループの第２数が第１グループの第１数よりも多い情報を対象とする点以外、検索処理部１３３Ｄと同様であるため、詳細な説明は省略する。 For example, the search processing unit 133E searches for a group connection graph using a first search range that indicates a range that corresponds to a search for multiple groups, and a second search range that indicates a range that corresponds to a search for multiple objects. By doing so, a search process is executed to extract nearby objects corresponding to the search query from a plurality of objects. The search processing unit 133E executes a search process using a first search range and a second search range that is wider than the first search range. Note that the search processing executed by the search processing unit 133E is similar to that of the search processing unit 133D, except that the second number of the second group is larger than the first number of the first group, so the details will not be explained. Further explanation will be omitted.

〔６－３．転置インデックス例〕
ここから、図４６を用いて、第６の実施形態における転置インデックスの例について説明する。図４６は、第６の実施形態に係る転置インデックスの一例を示す図である。なお、図４６及び図３６で示した転置インデックスと同様の点については適宜説明を省略する。 [6-3. Transposed index example]
An example of a transposed index in the sixth embodiment will now be described using FIG. 46. FIG. 46 is a diagram illustrating an example of a transposed index according to the sixth embodiment. Note that descriptions of points similar to the transposed index shown in FIGS. 46 and 36 will be omitted as appropriate.

まず、図４６に示す転置インデックスＩＮＤ３について説明する。図４６に示す転置インデックスＩＮＤ３は、「ブロブＩＤ」、「クラスタ数」、「クラスタ＃１」、「クラスタ＃２」等といった項目が含まれる。なお、図４６では「クラスタ＃１」、「クラスタ＃２」の２つを図示するが、転置インデックスＩＮＤ３には、各ブロブ（第１グループ）に属するクラスタ（第２グループ）の数に対応する数の項目が含まれる。以下、「クラスタ＃１」、「クラスタ＃２」等を総称する場合、「クラスタ」と記載する場合がある。 First, the transposed index IND3 shown in FIG. 46 will be explained. The transposed index IND3 shown in FIG. 46 includes items such as "blob ID", "number of clusters", "cluster #1", and "cluster #2". Note that although two clusters, “cluster #1” and “cluster #2” are illustrated in FIG. 46, the transposed index IND3 corresponds to the number of clusters (second group) belonging to each blob (first group). Contains several items. Hereinafter, when "cluster #1", "cluster #2", etc. are collectively referred to, they may be written as "cluster".

「クラスタ＃１」、「クラスタ＃２」等の「クラスタ」には、「クラスタＩＤ」、「量子化オブジェクト数」、「量子化オブジェクト」等といった項目が含まれる。なお、図４６では、「量子化オブジェクト」に配列（リスト）のような形式で複数の量子化オブジェクトに関すル情報が記憶される場合を示すが、図３５等の「量子化オブジェクト＃１」、「量子化オブジェクト＃２」、「量子化オブジェクト＃３」、および「量子化オブジェクト＃４」等のように個別の項目が含まれてもよい。 “Clusters” such as “cluster #1” and “cluster #2” include items such as “cluster ID”, “quantization object number”, and “quantization object”. Note that although FIG. 46 shows a case where the "quantization object" stores information regarding a plurality of quantization objects in a format such as an array (list), "quantization object #1" in FIG. , "Quantization Object #2," "Quantization Object #3," and "Quantization Object #4" may also be included.

「ブロブＩＤ」は、第１グループであるブロブを識別するための識別情報を示す。「クラスタ数」は、対応するブロブに属するクラスタの数を示す。「クラスタ」は、対応するブロブに属するクラスタに関する情報を示す。 “Blob ID” indicates identification information for identifying blobs that are the first group. "Number of clusters" indicates the number of clusters belonging to the corresponding blob. “Cluster” indicates information regarding clusters belonging to the corresponding blob.

「クラスタＩＤ」は、第２グループであるクラスタを識別するための識別情報を示す。「量子化オブジェクト数」は、クラスタに属するオブジェクト数を示す。「量子化オブジェク」は、量子化されたオブジェクトを示す。例えば、「量子化オブジェクト」には、オブジェクトが直積量子化され、各部分ベクトルに対応するコードブックを示す情報の一覧が記憶される。 "Cluster ID" indicates identification information for identifying the cluster that is the second group. "Number of quantized objects" indicates the number of objects belonging to a cluster. "Quantized object" indicates a quantized object. For example, in "quantization object", an object is subjected to product quantization, and a list of information indicating the codebook corresponding to each partial vector is stored.

図４６の例では、ブロブＩＤ「１」であるブロブには、４つのクラスタが属することを示す。ブロブＩＤ「１」であるブロブには、クラスタＩＤ「２」であるクラスタ、クラスタＩＤ「５」であるクラスタ等が属することを示す。 The example in FIG. 46 shows that four clusters belong to the blob with blob ID "1". This indicates that a cluster with cluster ID "2", a cluster with cluster ID "5", etc. belong to the blob with blob ID "1".

また、クラスタＩＤ「２」であるクラスタに属するオブジェクト数が４つであることを示す。クラスタＩＤ「２」であるクラスタに属するオブジェクトを示す情報がクラスタＩＤ「２」に対応付けられる。例えば、クラスタＩＤ「２」であるクラスタに属するオブジェクトが順番に並べられた一覧リストを示す情報がクラスタＩＤ「２」に対応付けられる。 It also shows that the number of objects belonging to the cluster with cluster ID "2" is four. Information indicating an object belonging to the cluster with cluster ID "2" is associated with cluster ID "2". For example, information indicating a list in which objects belonging to the cluster with cluster ID "2" are arranged in order is associated with cluster ID "2".

また、クラスタＩＤ「２」であるクラスタについて、「量子化オブジェクト」には「４、２、８、１０」が格納され、そのオブジェクトが４つの部分ベクトルで構成され、それぞれコードブック「４」、「２」、「８」、「１０」で量子化されていることを示す。例えば、クラスタＩＤ「２」であるクラスタに属するオブジェクトのうち１つ目のオブジェクトについては、４つ分割された部分ベクトルのうち、最初の部分ベクトルがコードブック「４」で量子化され、２番目の部分ベクトルがコードブック「２」で量子化され、３番目の部分ベクトルがコードブック「８」で量子化され、最後の部分ベクトルがコードブック「１０」で量子化されることを示す。 Furthermore, for the cluster with cluster ID "2", "4, 2, 8, 10" are stored in the "quantization object", and the object is composed of four partial vectors, each with codebook "4", It shows that it is quantized with "2", "8", and "10". For example, for the first object belonging to the cluster with cluster ID "2", the first partial vector among the four divided partial vectors is quantized with codebook "4", and the second The subvector of is quantized with codebook "2", the third subvector is quantized with codebook "8", and the last subvector is quantized with codebook "10".

また、クラスタＩＤ「２」であるクラスタについて、「量子化オブジェクト」には「５、６、２、０」が格納され、そのオブジェクトが４つの部分ベクトルで構成され、それぞれコードブック「５」、「６」、「２」、「０」で量子化されていることを示す。例えば、クラスタＩＤ「２」であるクラスタに属するオブジェクトのうち２つ目のオブジェクトについては、４つ分割された部分ベクトルのうち、最初の部分ベクトルがコードブック「５」で量子化され、２番目の部分ベクトルがコードブック「６」で量子化され、３番目の部分ベクトルがコードブック「２」で量子化され、最後の部分ベクトルがコードブック「０」で量子化されることを示す。 Furthermore, for the cluster with the cluster ID "2", "5, 6, 2, 0" are stored in the "quantization object", and the object is composed of four partial vectors, each of which has the codebook "5", It shows that it is quantized with "6", "2", and "0". For example, for the second object belonging to the cluster with cluster ID "2", the first partial vector of the four divided partial vectors is quantized with codebook "5", and the second The subvector is quantized with codebook "6", the third subvector is quantized with codebook "2", and the last subvector is quantized with codebook "0".

なお、転置インデックスＩＮＤ３は、上記に限らず、目的に応じて種々の情報を記憶してもよい。例えば、転置インデックスＩＮＤ３は、各量子化オブジェクトを示す情報（オブジェクトＩＤ等）を各量子化オブジェクトに対応付けて記憶してもよい。 Note that the transposed index IND3 is not limited to the above, and may store various information depending on the purpose. For example, the transposed index IND3 may store information indicating each quantization object (object ID, etc.) in association with each quantization object.

上述したように、転置インデックスには、クラスタを識別する情報（クラスタＩＤ）が付与されており、情報処理装置１００Ｅは、転置インデックスを用いて残差ベクトルを算出する。例えば、残差ベクトルはインデックスの生成および検索時に用いられるが、特許文献３等の従来技術における処理と同様であるため詳細な説明を適宜省略する。 As described above, information for identifying clusters (cluster ID) is attached to the transposed index, and the information processing device 100E calculates the residual vector using the transposed index. For example, the residual vector is used during index generation and search, but since the process is similar to that in conventional techniques such as Patent Document 3, detailed explanation will be omitted as appropriate.

〔６－４．情報処理のフロー〕
次に、図４７を用いて、第６の実施形態に係る情報処理の手順について説明する。図４７は、第６の実施形態に係る情報処理の一例を示すフローチャートである。 [6-4. Information processing flow]
Next, the information processing procedure according to the sixth embodiment will be explained using FIG. 47. FIG. 47 is a flowchart illustrating an example of information processing according to the sixth embodiment.

まず、図４７について説明する。例えば、図４７は、グループの生成処理の一例を示すフローチャートである。図４７に示すように、情報処理装置１００Ｅは、データ検索の対象となる複数のオブジェクトを示すオブジェクト情報を取得する（ステップＳ１００１）。例えば、情報処理装置１００Ｅは、オブジェクト情報記憶部１２１から複数のオブジェクトを示すオブジェクト情報を取得する。 First, FIG. 47 will be explained. For example, FIG. 47 is a flowchart illustrating an example of group generation processing. As shown in FIG. 47, the information processing apparatus 100E acquires object information indicating a plurality of objects that are data search targets (step S1001). For example, the information processing device 100E obtains object information indicating a plurality of objects from the object information storage unit 121.

情報処理装置１００Ｅは、オブジェクト情報に基づいて、複数のオブジェクトを分類する第１数のグループである複数の第１グループを示す第１グループ情報を生成する（ステップＳ１００２）。例えば、情報処理装置１００Ｅは、オブジェクト情報に基づいて、複数のオブジェクトを分類する複数のブロブを示すブロブ情報を生成する。 The information processing device 100E generates first group information indicating a plurality of first groups, which are a first number of groups into which a plurality of objects are classified, based on the object information (step S1002). For example, the information processing device 100E generates blob information indicating a plurality of blobs that classify a plurality of objects based on the object information.

情報処理装置１００Ｅは、複数の第１グループとは異なる分類となるグループであって、複数のオブジェクトを分類する第１数よりも多い第２数のグループである複数の第２グループを示す第２グループ情報とを生成する（ステップＳ１００３）。例えば、情報処理装置１００Ｅは、第１グループの第１数よりも多い第２数の第２グループを生成する。 The information processing device 100E is configured to display a plurality of second groups that are classified into a group different from the plurality of first groups and have a second number larger than the first number for classifying the plurality of objects. Group information is generated (step S1003). For example, the information processing device 100E generates second groups with a second number greater than the first number of first groups.

〔６－５．検索処理例〕
ここで、第６の実施形態に係る検索処理の一例について、図４８を一例として説明する。図４８は、第６の実施形態に係る検索処理の一例を示すフローチャートである。以下に説明する検索処理は、情報処理装置１００Ｅの検索処理部１３３Ｅによって行われる。図４８中のステップＳ１１０１～Ｓ１１２８の各々の処理は、図４３中のステップＳ９０１～Ｓ９２８の各々に対応する。そのため、ステップＳ１１０１～Ｓ１１２８の各々の処理において、ステップＳ９０１～Ｓ９２８の処理と同様の点については適宜説明を省略する。例えば、図４８の処理と図４３の処理とは、ステップＳ１１２１がステップＳ９２１での処理に一部変更がある点以外、図４８の処理と図４３の処理とは同様である。 [6-5. Search processing example]
Here, an example of the search process according to the sixth embodiment will be described using FIG. 48 as an example. FIG. 48 is a flowchart illustrating an example of search processing according to the sixth embodiment. The search processing described below is performed by the search processing unit 133E of the information processing device 100E. Each process of steps S1101 to S1128 in FIG. 48 corresponds to each of steps S901 to S928 in FIG. 43. Therefore, in each process of steps S1101 to S1128, descriptions of points similar to those of steps S901 to S928 will be omitted as appropriate. For example, the process in FIG. 48 and the process in FIG. 43 are the same as the process in FIG. 43 except that step S1121 is partially changed from the process in step S921.

情報処理装置１００Ｅは、探索ブロブ起点集合を取得し、個々のオブジェクトとクエリとの距離を計算する（ステップＳ１１０１）。情報処理装置１００Ｅは、探索ブロブ起点集合のクエリの最近傍ブロブを現最近傍ブロブＢ（ブロブＢ）とする（ステップＳ１１０２）。情報処理装置１００Ｅは、各種パラメータを設定する（ステップＳ１１０３）。 The information processing device 100E obtains the search blob starting point set and calculates the distance between each object and the query (step S1101). The information processing device 100E sets the nearest neighbor blob of the query of the search blob origin set as the current nearest neighbor blob B (blob B) (step S1102). The information processing device 100E sets various parameters (step S1103).

情報処理装置１００Ｅは、ブロブＢ以外の探索ブロブ起点集合を除外ブロブ集合に追加する（ステップＳ１１０４）。情報処理装置１００Ｅは、探索ブロブ起点集合を未探索ブロブ集合及び距離算出済み集合に追加する（ステップＳ１１０５）。 The information processing device 100E adds search blob origin sets other than blob B to the excluded blob set (step S1104). The information processing device 100E adds the search blob origin set to the unsearched blob set and the distance calculated set (step S1105).

情報処理装置１００Ｅは、未探索ブロブ集合が空であるかどうかを判定する（ステップＳ１１０６）。情報処理装置１００Ｅは、未探索ブロブ集合が空ではない場合（ステップＳ１１０６：Ｎｏ）、未探索ブロブ集合から最もクエリに近いブロブＴを取得する（ステップＳ１１０７）。 The information processing device 100E determines whether the unsearched blob set is empty (step S1106). If the unsearched blob set is not empty (step S1106: No), the information processing device 100E acquires the blob T closest to the query from the unsearched blob set (step S1107).

情報処理装置１００Ｅは、ブロブＴがブロブ探索範囲内にあるかどうかを判定する（ステップＳ１１０８）。情報処理装置１００Ｅは、ブロブＴがブロブ探索範囲内にある場合（ステップＳ１１０８：Ｙｅｓ）、ブロブＴのエッジで接続されているブロブを全て処理したかどうかを判定する（ステップＳ１１０９）。例えば、情報処理装置１００Ｅは、ブロブＴとの間がエッジで接続されているブロブ（接続ブロブ）についての処理有無を示す情報（処理有無情報）を用いて、ブロブＴの接続ブロブを全て処理したかどうかを判定する。例えば、情報処理装置１００Ｅは、ブロブＴの接続ブロブの処理有無をフラグ等の情報により管理してもよい。例えば、情報処理装置１００Ｅは、記憶部１２０Ｃに記憶された処理有無情報を処理状況に応じて更新し、記憶部１２０Ｃ中の処理有無情報を参照して、ブロブＴの接続ブロブを全て処理したかどうかを判定してもよい。 The information processing device 100E determines whether the blob T is within the blob search range (step S1108). If the blob T is within the blob search range (step S1108: Yes), the information processing device 100E determines whether all blobs connected by the edges of the blob T have been processed (step S1109). For example, the information processing device 100E processes all connected blobs of blob T using information (processing presence/absence information) indicating whether or not to process blobs (connection blobs) that are connected to blob T by an edge. Determine whether or not. For example, the information processing device 100E may manage whether or not a connected blob of the blob T is processed using information such as a flag. For example, the information processing device 100E updates the processing presence/absence information stored in the storage unit 120C according to the processing status, and refers to the processing presence/absence information in the storage unit 120C to determine whether all connected blobs of the blob T have been processed. You may decide whether

情報処理装置１００Ｅは、ブロブＴのエッジで接続されているブロブを全て処理していない場合（ステップＳ１１０９：Ｎｏ）、ブロブＴからエッジで接続されているブロブＪを取得する（ステップＳ１１１０）。 If the information processing device 100E has not processed all the blobs connected by the edges of the blob T (step S1109: No), the information processing device 100E acquires the blob J connected by the edges from the blob T (step S1110).

情報処理装置１００Ｅは、ブロブＪが距離算出済み集合に存在するかどうかを判定する（ステップＳ１１１１）。情報処理装置１００Ｅは、ブロブＪが距離算出済み集合に存在する場合（ステップＳ１１１１：Ｙｅｓ）、ステップＳ１１０９に戻って処理を行う。 The information processing device 100E determines whether blob J exists in the distance calculation completed set (step S1111). If the blob J exists in the distance calculation completed set (step S1111: Yes), the information processing device 100E returns to step S1109 and performs the process.

また、情報処理装置１００Ｅは、ブロブＪが距離算出済み集合に存在しない場合（ステップＳ１１１１：Ｎｏ）、ブロブＪを距離算出済み集合に追加し（ステップＳ１１１２）、ブロブＪを未探索ブロブ集合に追加する（ステップＳ１１１３）。そして、情報処理装置１００Ｅは、ブロブＪのセントロイドとクエリの距離Ｄｅを算出する（ステップＳ１１１４）。 Furthermore, if blob J does not exist in the distance calculated set (step S1111: No), the information processing device 100E adds blob J to the distance calculated set (step S1112), and adds blob J to the unsearched blob set. (Step S1113). The information processing device 100E then calculates the distance De between the centroid of the blob J and the query (step S1114).

情報処理装置１００Ｅは、距離Ｄｅがブロブ検索半径Ｒｂよりも小さいかどうかを判定する（ステップＳ１１１５）。情報処理装置１００Ｅは、距離Ｄｅがブロブ検索半径Ｒｂよりも小さくない場合（ステップＳ１１１５：Ｎｏ）、ブロブＪを除外ブロブ集合に追加し（ステップＳ１１１６）、ステップＳ１１０９に戻って処理を行う。 The information processing device 100E determines whether the distance De is smaller than the blob search radius Rb (step S1115). If the distance De is not smaller than the blob search radius Rb (step S1115: No), the information processing device 100E adds blob J to the excluded blob set (step S1116), and returns to step S1109 for processing.

また、情報処理装置１００Ｅは、距離Ｄｅがブロブ検索半径Ｒｂよりも小さい場合（ステップＳ１１１５：Ｙｅｓ）、現最近傍ブロブＢを除外ブロブ集合に追加し、ブロブＪを現最近傍ブロブＢとする（ステップＳ１１１７）。このように、情報処理装置１００Ｅは、距離Ｄｅがブロブ検索半径Ｒｂよりも小さい場合、現最近傍ブロブＢをブロブＪに更新する。 Further, if the distance De is smaller than the blob search radius Rb (step S1115: Yes), the information processing device 100E adds the current nearest neighbor blob B to the excluded blob set and sets the blob J as the current nearest neighbor blob B ( Step S1117). In this way, the information processing device 100E updates the current nearest neighbor blob B to the blob J when the distance De is smaller than the blob search radius Rb.

そして、情報処理装置１００Ｅは、ブロブＪに更新された現最近傍ブロブＢを基に、ブロブに関するパラメータを更新する（ステップＳ１１１８）。図４８では、情報処理装置１００Ｅは、ブロブＪに更新された現最近傍ブロブＢを基にブロブ検索半径Ｒｂを「ｄ（ｑ,Ｂ）」に更新する。また、情報処理装置１００Ｅは、更新されたブロブ検索半径Ｒｂを基に、ブロブ探索半径Ｅｂを「Ｒｂ×（１＋ｅｂ）」に更新する。そして、情報処理装置１００Ｅは、ステップＳ１１０９に戻って処理を行う。 The information processing device 100E then updates the parameters regarding the blob based on the current nearest neighbor blob B that has been updated to the blob J (step S1118). In FIG. 48, the information processing device 100E updates the blob search radius Rb to "d(q,B)" based on the current nearest neighbor blob B updated to blob J. Furthermore, the information processing device 100E updates the blob search radius Eb to “Rb×(1+eb)” based on the updated blob search radius Rb. The information processing apparatus 100E then returns to step S1109 and performs the process.

情報処理装置１００Ｅは、ブロブＴのエッジで接続されているブロブを全て処理した場合（ステップＳ１１０９：Ｙｅｓ）、ステップＳ１１０６に戻って処理を行う。情報処理装置１００Ｅは、未探索ブロブ集合が空である場合（ステップＳ１１０６：Ｙｅｓ）、ステップＳ１１２０の処理を行う。また、情報処理装置１００Ｅは、ブロブＴがブロブ探索範囲内にない場合（ステップＳ１１０８：Ｎｏ）、ブロブＴを未探索ブロブに戻して（ステップＳ１１１９）、ステップＳ１１２０の処理を行う。 When the information processing device 100E has processed all the blobs connected by the edges of the blob T (step S1109: Yes), the information processing device 100E returns to step S1106 and performs the processing. If the unsearched blob set is empty (step S1106: Yes), the information processing device 100E performs the process of step S1120. Further, if the blob T is not within the blob search range (step S1108: No), the information processing device 100E returns the blob T to an unsearched blob (step S1119), and performs the process of step S1120.

ステップＳ１１２０では、情報処理装置１００Ｅは、ブロブ検索数に１加える（ステップＳ１１２０）。そして、情報処理装置１００Ｅは、ブロブＢ内の全てのクラスタごとにクラスタ内の全てのオブジェクトの近似距離を算出し、最短距離を当該ブロブＢのブロブ距離とする（ステップＳ１１２１）。このように、図４８では、情報処理装置１００Ｅは、ブロブＢに属するクラスタの各々についてそのクラスタ内の全てのオブジェクトの近似距離を算出し、近似距離を算出した全てのオブジェクトの距離のうち、最短距離を当該ブロブＢのブロブ距離とする。 In step S1120, the information processing device 100E adds 1 to the number of blob searches (step S1120). Then, the information processing device 100E calculates approximate distances of all objects in the cluster for each cluster in the blob B, and sets the shortest distance as the blob distance of the blob B (step S1121). In this way, in FIG. 48, the information processing device 100E calculates approximate distances of all objects in the cluster for each cluster belonging to blob B, and calculates the shortest distance among all the objects for which the approximate distances have been calculated. Let the distance be the blob distance of the blob B.

情報処理装置１００Ｅは、ブロブＢ内の全てのオブジェクトを検索結果集合に追加、ソートし、指定された検索数上位のみを検索結果集合とする（ステップＳ１１２２）。そして、情報処理装置１００Ｅは、更新された検索結果集合を基に、オブジェクトに関するパラメータを更新する（ステップＳ１１２３）。そして、情報処理装置１００Ｅは、終了基準＃１～＃３に当てはまるかどうかを判定する（ステップＳ１１２４）。 The information processing device 100E adds all the objects in the blob B to the search result set, sorts them, and sets only the designated top search result set as the search result set (step S1122). Then, the information processing device 100E updates parameters related to the object based on the updated search result set (step S1123). The information processing device 100E then determines whether termination criteria #1 to #3 are met (step S1124).

情報処理装置１００Ｅは、終了基準＃１～＃３に当てはまらない場合（ステップＳ１１２４：Ｎｏ）、除外ブロブ集合から最短距離のブロブを取得し、現最近傍ブロブＢとする（ステップＳ１１２５）。 If the termination criteria #1 to #3 are not applicable (step S1124: No), the information processing device 100E obtains the blob with the shortest distance from the excluded blob set and sets it as the current nearest neighbor blob B (step S1125).

そして、情報処理装置１００Ｅは、最短距離のブロブに更新された現最近傍ブロブＢを基に、ブロブに関するパラメータを更新する（ステップＳ１１２６）。図４８では、情報処理装置１００Ｅは、最短距離のブロブに更新された現最近傍ブロブＢを基にブロブ検索半径Ｒｂを「ｄ（ｑ,Ｂ）」に更新する。また、情報処理装置１００Ｅは、更新されたブロブ検索半径Ｒｂを基に、ブロブ探索半径Ｅｂを「Ｒｂ×（１＋ｅｂ）」に更新する。そして、情報処理装置１００Ｅは、ステップＳ１１０６に戻って処理を行う。 Then, the information processing device 100E updates parameters regarding the blob based on the current nearest neighbor blob B that has been updated to the blob with the shortest distance (step S1126). In FIG. 48, the information processing device 100E updates the blob search radius Rb to "d(q,B)" based on the current nearest neighbor blob B that has been updated to the blob with the shortest distance. Furthermore, the information processing device 100E updates the blob search radius Eb to “Rb×(1+eb)” based on the updated blob search radius Rb. The information processing apparatus 100E then returns to step S1106 and performs the process.

また、情報処理装置１００Ｅは、終了基準＃１～＃３に当てはまる場合（ステップＳ１１２４：Ｙｅｓ）、処理を終了する。例えば、情報処理装置１００Ｅは、終了基準＃１～＃３に当てはまる場合、その時点で検索結果集合に含まれるオブジェクトをクエリｑの近傍のオブジェクトに決定する。そして、情報処理装置１００Ｅは、クエリｑの近傍のオブジェクトに決定したオブジェクトを示す情報を、クエリｑを指定した検索の要求元等に提供する。 Further, if the termination criteria #1 to #3 are met (step S1124: Yes), the information processing device 100E terminates the process. For example, if the termination criteria #1 to #3 apply, the information processing device 100E determines the objects included in the search result set at that time to be objects in the vicinity of the query q. Then, the information processing device 100E provides information indicating the objects determined to be objects in the vicinity of the query q to the requestor of the search that specified the query q.

〔６－６．グループの生成例〕
なお、上述した第１の生成例は、グループ生成の一例に過ぎず、情報処理装置１００Ｅは、様々な態様のグループを生成してもよい。図４４では、情報処理装置１００Ｅは、階層クラスタリングにより、第１グループをさらに分割したグループを第２グループとして生成（クラスタリング）する場合を一例として説明したが、情報処理装置１００Ｅは、階層クラスタリングに限らず様々な処理により第１グループ及び第２グループを生成してもよい。この点の例示である第２～４の生成例ついて、図４９～図５１を用いて説明する。図４９～図５１は、第１グループ及び第２グループの生成の一例を示す図である。なお、上述した処理と同様の点については適宜説明を省略する。 [6-6. Group generation example]
Note that the first generation example described above is only an example of group generation, and the information processing apparatus 100E may generate groups in various ways. In FIG. 44, the information processing apparatus 100E uses hierarchical clustering to generate (clustering) a group by further dividing the first group as the second group. However, the information processing apparatus 100E is limited to hierarchical clustering. The first group and the second group may be generated by various processes. The second to fourth generation examples illustrating this point will be explained using FIGS. 49 to 51. FIGS. 49 to 51 are diagrams showing examples of generation of the first group and the second group. Note that descriptions of points similar to those described above will be omitted as appropriate.

〔６－６－１．第２の生成例〕
まず、図４９を用いて、第２の生成例について説明する。図４９は、第１グループ及び第２グループの生成の一例を示す図である。具体的には、図４９は、第６の実施形態における第１グループ及び第２グループの第２の生成例を示す図である。 [6-6-1. Second generation example]
First, a second generation example will be described using FIG. 49. FIG. 49 is a diagram illustrating an example of generation of the first group and the second group. Specifically, FIG. 49 is a diagram showing a second generation example of the first group and the second group in the sixth embodiment.

図４９での図示態様について簡単に説明する。図４９では、クラスタの境界を実線で示し、ブロブの境界を一点鎖線で示す。また、図４９では、各クラスタがどのブロブに属するかを、クラスタＣＬのセントロイドＣＮとブロブＢＬのセントロイドＣとの間を点線で繋ぐことにより示す。例えば、クラスタＣＬ３４のセントロイドＣＮ３４及びクラスタＣＬ３５のセントロイドＣＮ３５がブロブＢＬ３３のセントロイドＣ３３と繋がれているため、図４９では、クラスタＣＬ３４及びクラスタＣＬ３５がブロブＢＬ３３に属する場合を示す。なお、図４９は、生成処理を説明するためにオブジェクト、セントロイド等を概念的に示すものであり、セントロイド等の位置は正確な位置を示すものではない。 The illustrated aspect in FIG. 49 will be briefly described. In FIG. 49, the boundaries of clusters are shown by solid lines, and the boundaries of blobs are shown by dashed lines. Further, in FIG. 49, which blob each cluster belongs to is shown by connecting the centroid CN of the cluster CL and the centroid C of the blob BL with a dotted line. For example, since centroid CN34 of cluster CL34 and centroid CN35 of cluster CL35 are connected to centroid C33 of blob BL33, FIG. 49 shows a case where cluster CL34 and cluster CL35 belong to blob BL33. Note that FIG. 49 conceptually shows objects, centroids, etc. in order to explain the generation process, and the positions of centroids, etc. do not indicate exact positions.

ここから、情報処理装置１００Ｅによる第２の生成例に関する情報処理について説明する。情報処理装置１００Ｅは、複数のオブジェクトを複数のクラスタに分類するクラスタ情報を生成する。図４９の例では、情報処理装置１００Ｅは、空間情報ＳＰ７２に示すように、任意の方法により、複数のオブジェクトをクラスタＣＬ３１～ＣＬ４０等に分類するクラスタ情報を生成する。情報処理装置１００Ｅは、ｋ－ｍｅａｎｓ等の任意のクラスタリングにより各ノードをクラスタＣＬ３１～ＣＬ４０のいずれかに分類してもよい。なお、情報処理装置１００Ｅは、クラスタを生成可能であれば、どのような方法によりクラスタを生成してもよい。 From here, information processing regarding the second generation example by the information processing apparatus 100E will be described. The information processing device 100E generates cluster information that classifies multiple objects into multiple clusters. In the example of FIG. 49, the information processing device 100E generates cluster information for classifying a plurality of objects into clusters CL31 to CL40, etc., using an arbitrary method, as shown in spatial information SP72. The information processing device 100E may classify each node into one of clusters CL31 to CL40 using arbitrary clustering such as k-means. Note that the information processing apparatus 100E may use any method to generate clusters as long as the clusters can be generated.

そして、情報処理装置１００Ｅは、複数のクラスタを複数のブロブに分類するブロブ情報を生成する。例えば、情報処理装置１００Ｅは、クラスタＣＬ３１～ＣＬ４０等の各々のセントロイドＣＮ３１～ＣＮ４０等を用いて、クラスタＣＬ３１～ＣＬ４０等を分類するブロブを示すブロブ情報を生成する。このように、情報処理装置１００Ｅは、複数の第２グループの各々に対応する第２セントロイドを用いた第１分類処理により、第１グループ情報を生成する。例えば、情報処理装置１００Ｅは、ｋ－ｍｅａｎｓ等の任意のクラスタリングによりクラスタＣＬ３１～ＣＬ４０等の各々のセントロイドＣＮ３１～ＣＮ４０等を複数のブロブのいずれかに分類する。例えば、情報処理装置１００Ｅは、クラスタＣＬの第２数よりも少ない第１数のセントロイドＣＮを生成し、生成したセントロイドＣＮにブロブＢＬのセントロイドＣＮを対応付けることにより、クラスタＣＬをブロブＢＬのいずれかに分類する。 The information processing device 100E then generates blob information that classifies the multiple clusters into multiple blobs. For example, the information processing device 100E uses the centroids CN31 to CN40 of each of the clusters CL31 to CL40 to generate blob information indicating a blob that classifies the clusters CL31 to CL40. In this way, the information processing device 100E generates first group information through the first classification process using the second centroids corresponding to each of the plurality of second groups. For example, the information processing device 100E classifies each of the centroids CN31 to CN40 of the clusters CL31 to CL40 into one of a plurality of blobs using arbitrary clustering such as k-means. For example, the information processing device 100E generates a first number of centroids CN that is smaller than the second number of clusters CL, and associates the generated centroid CN with the centroid CN of the blob BL, thereby converting the cluster CL into a blob BL. Classify into one of the following.

図４９の例では、情報処理装置１００Ｅは、セントロイドＣＮ３１～ＣＮ４０等の各々をブロブＢＬ３１～ＢＬ３４のセントロイドＣ３１～Ｃ３４等のいずれかに割り当てることにより、クラスタＣＬ３１～ＣＬ４０等のブロブＢＬ３１～ＢＬ３４等に分類するブロブ情報を生成する。上述した処理によりクラスタＣＬの各セントロイドＣＮは各ブロブＢＬに割り当てられる。 In the example of FIG. 49, the information processing device 100E assigns each of the centroids CN31 to CN40, etc. to any of the centroids C31 to C34, etc. of the blobs BL31 to BL34. Generate blob information that is classified into etc. Through the process described above, each centroid CN of the cluster CL is assigned to each blob BL.

例えば、情報処理装置１００Ｅは、セントロイドＣＮ３１、セントロイドＣＮ３８及びセントロイドＣＮ４０をセントロイドＣ３１に割り当てることにより、クラスタＣＬ３１、クラスタＣＬ３８及びクラスタＣＬ４０をブロブＢＬ３１に分類する。また、情報処理装置１００Ｅは、セントロイドＣＮ３２及びセントロイドＣＮ３３をセントロイドＣ３２に割り当てることにより、クラスタＣＬ３２及びクラスタＣＬ３３をブロブＢＬ３２に分類する。また、情報処理装置１００Ｅは、セントロイドＣＮ３４及びセントロイドＣＮ３５をセントロイドＣ３３に割り当てることにより、クラスタＣＬ３４及びクラスタＣＬ３５をブロブＢＬ３３に分類する。また、情報処理装置１００Ｅは、セントロイドＣＮ３６、セントロイドＣＮ３７及びセントロイドＣＮ３９をセントロイドＣ３４に割り当てることにより、クラスタＣＬ３６、クラスタＣＬ３７及びクラスタＣＬ３９をブロブＢＬ３４に分類する。 For example, the information processing device 100E classifies cluster CL31, cluster CL38, and cluster CL40 into blob BL31 by assigning centroid CN31, centroid CN38, and centroid CN40 to centroid C31. Furthermore, the information processing device 100E classifies cluster CL32 and cluster CL33 into blob BL32 by assigning centroid CN32 and centroid CN33 to centroid C32. Furthermore, the information processing device 100E classifies cluster CL34 and cluster CL35 into blob BL33 by assigning centroid CN34 and centroid CN35 to centroid C33. Furthermore, the information processing device 100E classifies cluster CL36, cluster CL37, and cluster CL39 into blob BL34 by assigning centroid CN36, centroid CN37, and centroid CN39 to centroid C34.

なお、図４９では、図示は省略するが、情報処理装置１００Ｅは、ブロブＢＬ３１～ＢＬ３４等の複数ブロブ間を連結するブロブ連結グラフを生成する。例えば、情報処理装置１００Ｅは、図２３、図２９及び図４４等と同様の情報を用いた処理によりブロブ連結グラフを生成するが詳細な説明を省略する。 Although not shown in FIG. 49, the information processing device 100E generates a blob connection graph that connects a plurality of blobs such as blobs BL31 to BL34. For example, the information processing apparatus 100E generates a blob connection graph by processing using information similar to that shown in FIGS. 23, 29, 44, etc., but detailed description thereof will be omitted.

例えば、情報処理装置１００Ｅは、ブロブ情報を用いて量子化に関する処理を実行する。情報処理装置１００Ｅは、ブロブ情報を用いて複数のオブジェクトを対象とする量子化に関する処理を行う。例えば、情報処理装置１００Ｅは、ブロブＢＬ３１～ＢＬ３４の各々に属するオブジェクトごとに量子化を行うが、この点については上述した処理と同様の点については適宜説明を省略する。 For example, the information processing device 100E executes processing related to quantization using blob information. The information processing device 100E performs processing related to quantization targeting a plurality of objects using blob information. For example, the information processing device 100E performs quantization for each object belonging to each of the blobs BL31 to BL34, but a description of the same processing as described above will be omitted as appropriate.

上記のように、情報処理装置１００Ｅは、データ検索の対象となる複数のオブジェクトを分類する複数のクラスタを示すクラスタ情報と、複数のクラスタとは異なる分類となる複数のブロブを示すブロブ情報とを生成する。これにより、情報処理装置１００Ｅは、複数のオブジェクトを分類する２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 As described above, the information processing device 100E stores cluster information indicating a plurality of clusters that classify a plurality of objects that are data search targets, and blob information indicating a plurality of blobs that are classified differently from the plurality of clusters. generate. Thereby, the information processing apparatus 100E can generate two types of groups for classifying a plurality of objects, and can generate groups for classifying objects.

〔６－６－２．第３の生成例〕
次に、図５０を用いて、第３の生成例について説明する。図５０は、第１グループ及び第２グループの生成の一例を示す図である。具体的には、図５０は、第６の実施形態における第１グループ及び第２グループの第３の生成例を示す図である。なお、上述した処理と同様の点については適宜説明を省略する。 [6-6-2. Third generation example]
Next, a third generation example will be described using FIG. 50. FIG. 50 is a diagram illustrating an example of generation of the first group and the second group. Specifically, FIG. 50 is a diagram showing a third generation example of the first group and the second group in the sixth embodiment. Note that descriptions of points similar to those described above will be omitted as appropriate.

第３の生成例では、情報処理装置１００Ｅは、第２グループの各々を、複数の第１グループのうち、属するオブジェクトが最大の第１グループである候補第１グループに属させる第１分類処理により、第１グループ情報を生成する。例えば、情報処理装置１００Ｅは、クラスタＣＬの各々を、複数のブロブＢＬのうち、属するオブジェクトが最大のブロブＢＬである候補ブロブに属させる第１分類処理により、第１グループ情報を生成する。例えば、情報処理装置１００Ｅは、図５０の空間情報ＳＰ７３に示すように、クラスタＣＬの第２数よりも少ない第１数のセントロイドＣＮを生成し、生成したセントロイドＣＮにブロブＢＬのセントロイドＣＮを対応付けることにより、クラスタＣＬをブロブＢＬのいずれかに分類する。なお、図５０は、生成処理を説明するためにオブジェクト、セントロイド等を概念的に示すものであり、セントロイド等の位置は正確な位置を示すものではない。 In the third generation example, the information processing device 100E performs a first classification process that causes each of the second groups to belong to a candidate first group to which the object belongs to the largest first group among the plurality of first groups. , generates first group information. For example, the information processing device 100E generates the first group information by performing a first classification process in which each of the clusters CL belongs to a candidate blob to which the object belongs to the largest blob BL among the plurality of blobs BL. For example, as shown in spatial information SP73 in FIG. 50, the information processing device 100E generates a first number of centroids CN that is smaller than the second number of clusters CL, and uses the generated centroid CN as the centroid of the blob BL. By associating the CNs, the cluster CL is classified into one of the blobs BL. Note that FIG. 50 conceptually shows objects, centroids, etc. in order to explain the generation process, and the positions of centroids, etc. do not indicate exact positions.

情報処理装置１００Ｅは、任意の手法によりクラスタＣＬを分類する。例えば、情報処理装置１００Ｅは、クラスタＣＬのセントロイドＣＮが最も近いブロブＢＬのセントロイドＣに、そのクラスタＣＬを割り当ててもよい。例えば、情報処理装置１００Ｅは、全てのオブジェクトをクラスタリングしてブロブＢＬを生成し、各量子化クラスタ（クラスタＣＬ）に属するオブジェクトの数が最も多いブロブＢＬに量子化クラスタ（クラスタＣＬ）を割り当てる。例えば、情報処理装置１００Ｅは、一のクラスタＣＬに属するオブジェクトについて、最も距離が短いオブジェクトの数が最大のセントロイドＣに対応するブロブＢＬを候補ブロブに決定する。例えば、情報処理装置１００Ｅは、個々のブロブＢＬの最大数が同一の場合には、セントロイドが近い方のブロブＢＬにクラスタＣＬを割り当ててもよい。図５０では、クラスタＣＬ３１については、ノードＮ９、Ｎ１２６の各々に対応する２つのオブジェクトがセントロイドＣ３３に最も近い。一方、クラスタＣＬ３１については、ノードＮ７に対応する１つのオブジェクトがセントロイドＣ３２に最も近く、ノードＮ８５に対応する１つのオブジェクトがセントロイドＣ３１に最も近い。すなわち、クラスタＣＬ３１については、セントロイドＣ３３が最も距離が短いオブジェクトの数が最大のセントロイドＣである。 The information processing device 100E classifies the clusters CL using an arbitrary method. For example, the information processing device 100E may assign the cluster CL to the centroid C of the blob BL to which the centroid CN of the cluster CL is closest. For example, the information processing device 100E clusters all objects to generate blobs BL, and assigns a quantization cluster (cluster CL) to a blob BL having the largest number of objects belonging to each quantization cluster (cluster CL). For example, the information processing device 100E determines, as a candidate blob, the blob BL corresponding to the centroid C with the largest number of objects having the shortest distance for objects belonging to one cluster CL. For example, if the maximum number of individual blobs BL is the same, the information processing device 100E may assign the cluster CL to the blob BL with a closer centroid. In FIG. 50, for cluster CL31, the two objects corresponding to each of nodes N9 and N126 are closest to centroid C33. On the other hand, for cluster CL31, one object corresponding to node N7 is closest to centroid C32, and one object corresponding to node N85 is closest to centroid C31. That is, for cluster CL31, centroid C33 is the centroid C that has the largest number of objects with the shortest distance.

そのため、図５０では、情報処理装置１００Ｅは、クラスタＣＬ３１については、セントロイドＣ３３に対応するブロブＢＬ３３を候補ブロブに決定する。クラスタＣＬ３１の候補ブロブがブロブＢＬ３３の１つであるため、情報処理装置１００Ｅは、クラスタＣＬ３１のセントロイドＣＮ３１をブロブＢＬ３３のセントロイドＣ３３に割り当てることにより、クラスタＣＬ３１をブロブＢＬ３３に分類する。図５０では、他のクラスタＣＮ３２～ＣＮ４０等のクラスタリング結果は、図４９と同様となるため詳細な説明は省略する。 Therefore, in FIG. 50, the information processing device 100E determines the blob BL33 corresponding to the centroid C33 as the candidate blob for the cluster CL31. Since the candidate blob of the cluster CL31 is one of the blobs BL33, the information processing device 100E classifies the cluster CL31 into the blob BL33 by assigning the centroid CN31 of the cluster CL31 to the centroid C33 of the blob BL33. In FIG. 50, the clustering results for other clusters CN32 to CN40, etc. are similar to those in FIG. 49, so detailed explanations will be omitted.

なお、情報処理装置１００Ｅは、一の第２グループに候補第１グループが複数ある場合、複数の候補第１グループのうち、一の第２グループに対応する第２セントロイドにセントロイドの最も近い第１グループに一の第２グループを属させる。例えば、情報処理装置１００Ｅは、一のクラスタＣＬに候補ブロブが複数ある場合、複数の候補ブロブのうち、一のクラスタＣＬに対応するセントロイドＣＮにセントロイドＣの最も近いブロブＢＬに属させる。図５０では、情報処理装置１００Ｅは、例えばクラスタＣＬ３３の候補ブロブがブロブＢＬ３１、ＢＬ３２の２つである場合、ブロブＢＬ３１、ＢＬ３２のうち、クラスタＣＬ３３に対応するセントロイドＣＮ３３にセントロイドＣの最も近いブロブＢＬ３２にクラスタＣＬ３３を属させる。 Note that when there are a plurality of candidate first groups in one second group, the information processing device 100E selects a centroid whose centroid is closest to a second centroid corresponding to one second group among the plurality of candidate first groups. A second group is made to belong to the first group. For example, when there are multiple candidate blobs in one cluster CL, the information processing apparatus 100E causes the centroid CN corresponding to one cluster CL to belong to the blob BL closest to the centroid C among the multiple candidate blobs. In FIG. 50, for example, when the candidate blobs of cluster CL33 are two blobs BL31 and BL32, the information processing apparatus 100E determines that among blobs BL31 and BL32, centroid C is closest to centroid CN33 corresponding to cluster CL33. Cluster CL33 is made to belong to blob BL32.

上記は一例に過ぎず、情報処理装置１００Ｅは、様々な処理により生成処理を行ってもよい。例えば、情報処理装置１００Ｅは、全てのオブジェクトではなく、各量子化用クラスタ（クラスタＣＬ）から同一数ランダムに抽出したオブジェクト、または、クラスタＣＬのオブジェクト数の比率に合わせてランダムに抽出したオブジェクトを、クラスタリングしてブロブＢＬを生成してもよい。そして、情報処理装置１００Ｅは、各量子化クラスタ（クラスタＣＬ）に属するオブジェクトの数が最も多いブロブに量子化クラスタを割り当てる。 The above is just an example, and the information processing device 100E may perform the generation process using various processes. For example, the information processing device 100E may randomly extract the same number of objects from each quantization cluster (cluster CL), or randomly extract objects according to the ratio of the number of objects in the clusters CL, instead of all objects. , clustering may be performed to generate the blob BL. Then, the information processing device 100E assigns a quantization cluster to the blob having the largest number of objects belonging to each quantization cluster (cluster CL).

〔６－６－３．第４の生成例〕
次に、図５１を用いて、第４の生成例について説明する。図５１は、第１グループ及び第２グループの生成の一例を示す図である。具体的には、図５１は、第６の実施形態における第１グループ及び第２グループの第４の生成例を示す図である。なお、上述した処理と同様の点については適宜説明を省略する。 [6-6-3. Fourth generation example]
Next, a fourth generation example will be described using FIG. 51. FIG. 51 is a diagram illustrating an example of generation of the first group and the second group. Specifically, FIG. 51 is a diagram showing a fourth generation example of the first group and the second group in the sixth embodiment. Note that descriptions of points similar to those described above will be omitted as appropriate.

第４の生成例では、情報処理装置１００Ｅは、第２グループの各々を複数に分割した部分グループに対応する部分セントロイドを用いた第１分類処理により、第１グループ情報を生成する。具体的には、情報処理装置１００Ｅは、第２グループの各々を、第１グループのうち、属する部分セントロイドが最大の第１グループである候補第１グループに属させる第１分類処理により、第１グループ情報を生成する。 In the fourth generation example, the information processing device 100E generates first group information by a first classification process using partial centroids corresponding to partial groups obtained by dividing each of the second groups into a plurality of groups. Specifically, the information processing apparatus 100E performs a first classification process that causes each of the second groups to belong to a candidate first group, which is the first group to which the partial centroid to which it belongs is the largest among the first groups. 1 group information is generated.

例えば、情報処理装置１００Ｅは、クラスタＣＬの各々を複数の部分グループに分割し、各クラスタＣＬについて、複数のブロブＢＬのうち、属する部分セントロイドが最大のブロブＢＬである候補ブロブに属させる第１分類処理により、第１グループ情報を生成する。情報処理装置１００Ｅは、部分クラスタ（部分グループ）のセントロイド（部分セントロイド）の情報を用いて、複数のブロブＢＬを生成する。情報処理装置１００Ｅは、部分グループの部分セントロイドをクラスタリングすることにより複数のブロブＢＬを生成する。このように、情報処理装置１００Ｅは、各量子化用クラスタ（クラスタＣＬ）にさらに階層クラスタリングを行い、クラスタ（部分グループ）のセントロイドを生成する。 For example, the information processing device 100E divides each of the clusters CL into a plurality of subgroups, and for each cluster CL, out of the plurality of blobs BL, the information processing device 100E makes the candidate blob belong to the blob BL with the largest partial centroid. First group information is generated by the first classification process. The information processing device 100E generates a plurality of blobs BL using information on centroids (partial centroids) of partial clusters (partial groups). The information processing device 100E generates a plurality of blobs BL by clustering the partial centroids of the partial groups. In this way, the information processing device 100E further performs hierarchical clustering on each quantization cluster (cluster CL) to generate a centroid of the cluster (subgroup).

なお、図５１では、空間情報ＳＰ７４に示すように、各クラスタＣＬの部分グループへの分割をクラスタＣＬ内の二点鎖線により示す。また、図５１では、説明に必要な部分のみクラスタＣＬを複数の部分グループに対応する部分セントロイドを図示する。具体的には、図５１では、クラスタＣＬ３１が２つの部分グループに分割され、一方の部分グループ（図５１では下側）に対応する部分セントロイドＰＣ１１、及び他方の部分グループ（図５１では上側）に対応する部分セントロイドＰＣ１２のみを示す。部分セントロイドＰＣ１１及び部分セントロイドＰＣ１２等の部分セントロイドを区別せずに説明する場合、部分セントロイドＰＣと記載する場合がある。なお、図５１では説明のため各クラスタＣＬが２つの部分グループに分割される例を示すが、各クラスタＣＬの部分グループの数（分割数）は２に限らず、任意の数であってもよい。例えば、各クラスタＣＬは、そのクラスタＣＬの属するオブジェクトの数に応じて決定されてもよい。例えば、各クラスタＣＬは、そのクラスタＣＬの属するオブジェクトの数多い程、多い部分グループに分割されてもよい。 In addition, in FIG. 51, as shown in the spatial information SP74, the division of each cluster CL into subgroups is indicated by a chain double-dashed line within the cluster CL. Further, in FIG. 51, partial centroids corresponding to a plurality of partial groups of the cluster CL are illustrated only in a portion necessary for explanation. Specifically, in FIG. 51, the cluster CL31 is divided into two subgroups, the partial centroid PC11 corresponding to one subgroup (lower side in FIG. 51), and the partial centroid PC11 corresponding to the other subgroup (upper side in FIG. 51). Only the partial centroid PC12 corresponding to is shown. When partial centroids such as partial centroid PC11 and partial centroid PC12 are described without distinction, they may be referred to as partial centroid PC. Note that although FIG. 51 shows an example in which each cluster CL is divided into two subgroups for explanation, the number of subgroups (number of divisions) of each cluster CL is not limited to two, but may be any number. good. For example, each cluster CL may be determined according to the number of objects to which the cluster CL belongs. For example, each cluster CL may be divided into as many subgroups as the number of objects to which the cluster CL belongs.

例えば、情報処理装置１００Ｅは、一のクラスタＣＬに属する部分セントロイドＰＣについて、最も距離が短い部分セントロイドＰＣの数が最大のセントロイドＣに対応するブロブＢＬを候補ブロブに決定する。図５１では、クラスタＣＬ３１については、部分セントロイドＰＣ１１がセントロイドＣ３２に最も近く、部分セントロイドＰＣ１２がセントロイドＣ３３に最も近い。すなわち、クラスタＣＬ３１については、セントロイドＣ３２及びセントロイドＣ３３の２つのセントロイドＣが最も距離が短い部分セントロイドＰＣの数が最大のセントロイドＣである。なお、上記は一例に過ぎず、情報処理装置１００Ｅは、任意の手法により各クラスタをブロブに割り当ててもよい。情報処理装置１００Ｅは、各クラスタをクラスタリングして部分クラスタを生成し、そのセントロイドをクラスタリングしてブロブを生成するので、ブロブとクラスタの関係は必然的に分かっている。そのため、情報処理装置１００Ｅは、ブロブの生成により得られたブロブとクラスタの関係を示す情報を基に、各クラスタをブロブに割り当ててもよい。 For example, for partial centroid PCs belonging to one cluster CL, the information processing apparatus 100E determines, as a candidate blob, a blob BL corresponding to a centroid C having the largest number of partial centroid PCs with the shortest distance. In FIG. 51, for cluster CL31, partial centroid PC11 is closest to centroid C32, and partial centroid PC12 is closest to centroid C33. That is, for cluster CL31, two centroids C, centroid C32 and centroid C33, are the centroids C with the shortest distance and the largest number of partial centroids PC. Note that the above is just an example, and the information processing apparatus 100E may allocate each cluster to a blob using any method. Since the information processing device 100E clusters each cluster to generate partial clusters and clusters the centroids thereof to generate blobs, the relationship between blobs and clusters is necessarily known. Therefore, the information processing device 100E may assign each cluster to a blob based on information indicating the relationship between the blob and the cluster obtained by generating the blob.

このように、図５１では、クラスタＣＬ３１に候補ブロブが複数あるため、情報処理装置１００Ｅは、複数の候補ブロブのうち、クラスタＣＬ３１に対応するセントロイドＣＮにセントロイドＣの最も近いブロブＢＬに属させる。図５１では、情報処理装置１００Ｅは、ブロブＢＬ３２、ＢＬ３３のうち、クラスタＣＬ３１に対応するセントロイドＣＮ３１にセントロイドＣの最も近いブロブＢＬ３３にクラスタＣＬ３１を属させる。図５１では、他のクラスタＣＮ３２～ＣＮ４０等のクラスタリング結果も、図５０と同様となるため詳細な説明は省略する。 In this way, in FIG. 51, since there are multiple candidate blobs in the cluster CL31, the information processing device 100E selects a blob BL whose centroid C is closest to the centroid CN corresponding to the cluster CL31 among the multiple candidate blobs. let In FIG. 51, the information processing apparatus 100E makes the cluster CL31 belong to the blob BL33 whose centroid C is closest to the centroid CN31 corresponding to the cluster CL31 among the blobs BL32 and BL33. In FIG. 51, the clustering results for other clusters CN32 to CN40, etc. are also similar to those in FIG. 50, so a detailed explanation will be omitted.

なお、情報処理装置１００Ｅは、一の第２グループに候補第１グループが１つである場合、一の第２グループを属させる第１グループの決定については、その候補第１グループとすればよいため、詳細な説明は省略する。 Note that when there is one candidate first group in one second group, the information processing device 100E may determine the first group to which one second group belongs to the candidate first group. Therefore, detailed explanation will be omitted.

〔６－６－４．情報処理のフロー〕
次に、図５２を用いて、第６の実施形態に係る情報処理の手順について説明する。図５２は、第６の実施形態に係る情報処理の一例を示すフローチャートである。 [6-6-4. Information processing flow]
Next, the information processing procedure according to the sixth embodiment will be described using FIG. 52. FIG. 52 is a flowchart illustrating an example of information processing according to the sixth embodiment.

図５２に示すように、情報処理装置１００Ｅは、データ検索の対象となる複数のオブジェクトを示すオブジェクト情報を取得する（ステップＳ１２０１）。例えば、情報処理装置１００Ｅは、オブジェクト情報記憶部１２１から複数のオブジェクトを示すオブジェクト情報を取得する。 As shown in FIG. 52, the information processing apparatus 100E acquires object information indicating a plurality of objects that are data search targets (step S1201). For example, the information processing device 100E obtains object information indicating a plurality of objects from the object information storage unit 121.

情報処理装置１００Ｅは、複数のオブジェクトを複数の第１グループに分類する分類処理である第１分類処理と、複数のオブジェクトを複数の第１グループとは異なる分類となるグループである複数の第２グループに分類する分類処理である第２分類処理とのうち、いずれか一方の分類処理を実行する（ステップＳ１２０２）。例えば、情報処理装置１００Ｅは、複数の第２グループに分類する分類処理である第２分類処理を実行する。 The information processing device 100E performs a first classification process that classifies a plurality of objects into a plurality of first groups, and a plurality of second classification processes that classify a plurality of objects into a plurality of second groups that are different from the plurality of first groups. One of the second classification processes, which is a classification process for classifying into groups, is executed (step S1202). For example, the information processing device 100E executes a second classification process that is a classification process of classifying into a plurality of second groups.

情報処理装置１００Ｅは、一方の分類処理後に他方の分類処理を一方の分類処理の結果を用いて実行することにより、複数のオブジェクトを分類する複数の第１グループを示す第１グループ情報と、複数のオブジェクトを分類する複数の第２グループを示す第２グループ情報とを生成する（ステップＳ１２０３）。例えば、情報処理装置１００Ｅは、２分類処理後に、他方の分類処理である第１分類処理を行うことにより、第１グループ情報と第２グループ情報とを生成する。 The information processing device 100E performs one classification process and then performs the other classification process using the result of one classification process, thereby obtaining first group information indicating a plurality of first groups into which a plurality of objects are classified, and a plurality of first group information. second group information indicating a plurality of second groups for classifying the objects (step S1203). For example, the information processing device 100E generates first group information and second group information by performing a first classification process, which is the other classification process, after performing two classification processes.

〔７．効果〕
上述してきたように、第６の実施形態に係る情報処理装置１００Ｅは、取得部１３１と、生成部１３２Ｅとを有する。取得部１３１は、データ検索の対象となる複数のオブジェクトを示すオブジェクト情報を取得する。生成部１３２Ｅは、取得部１３１により取得されたオブジェクト情報に基づいて、複数のオブジェクトを分類する第１数のグループである複数の第１グループを示す第１グループ情報と、複数の第１グループとは異なる分類となるグループであって、複数のオブジェクトを分類する第１数よりも多い第２数のグループである複数の第２グループを示す第２グループ情報とを生成する。 [7. effect〕
As described above, the information processing apparatus 100E according to the sixth embodiment includes the acquisition section 131 and the generation section 132E. The acquisition unit 131 acquires object information indicating a plurality of objects that are data search targets. Based on the object information acquired by the acquisition unit 131, the generation unit 132E generates first group information indicating a plurality of first groups, which are a first number of groups into which a plurality of objects are classified, and a plurality of first groups. generates second group information indicating a plurality of second groups, which are groups of different classifications and are a second number of groups that is larger than the first number for classifying the plurality of objects.

このように、第６の実施形態に係る情報処理装置１００Ｅは、複数のオブジェクトを分類する第１数の第１グループと、複数の第１グループとは異なる分類となるグループであって、第１数よりも多い第２数の第２グループとの２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing apparatus 100E according to the sixth embodiment has a first number of first groups for classifying a plurality of objects, and a group that is classified differently from the plurality of first groups, It is possible to generate two types of groups, the second group having the second number, which is larger than the number, and the group for classifying objects.

生成部１３２Ｅは、複数のオブジェクトのうち、複数の第２グループにおいて同じ第２グループに属するオブジェクトを同じ第１グループに分類するグループである第１グループ情報を生成する。 The generation unit 132E generates first group information that is a group that classifies objects belonging to the same second group among the plurality of second groups into the same first group.

このように、情報処理装置１００Ｅは、包含関係のある２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing apparatus 100E can generate two types of groups that have an inclusive relationship, and can generate groups that classify objects.

生成部１３２Ｅは、複数の第２グループのうち二以上の第２グループの各々に属するオブジェクト群が一の第１グループに分類された複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information indicating a plurality of first groups in which objects belonging to each of two or more second groups among the plurality of second groups are classified into one first group.

生成部１３２Ｅは、複数のオブジェクトを対象とする検索処理において一括処理を行うために用いられる複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information indicating a plurality of first groups used for performing batch processing in a search process targeting a plurality of objects.

このように、情報処理装置１００Ｅは、複数のオブジェクトを対象とする検索処理において一括処理を行うために用いられる複数の第１グループを含む２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing device 100E can generate two types of groups including a plurality of first groups used for performing batch processing in a search process targeting a plurality of objects, and can classify objects. Groups can be created.

生成部１３２Ｅは、複数のオブジェクトに関する距離の算出を一括して行うために用いられる複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information indicating a plurality of first groups used to collectively calculate distances regarding a plurality of objects.

このように、情報処理装置１００Ｅは、距離の算出を一括して行うために用いられる第１グループを含む２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing apparatus 100E can generate two types of groups including the first group used to collectively calculate distances, and can generate groups for classifying objects.

生成部１３２Ｅは、検索処理で用いられる検索クエリとオブジェクトとの距離の算出を並列処理するために用いられる複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information indicating a plurality of first groups used to calculate the distance between the search query and the object used in the search process in parallel.

このように、情報処理装置１００Ｅは、検索処理で用いられる検索クエリとオブジェクトとの距離の算出を並列処理するために用いられる第１グループを含む２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing device 100E can generate two types of groups including the first group used for parallel processing of calculating the distance between the search query and the object used in the search process, and It is possible to generate groups for classification.

生成部１３２Ｅは、複数のオブジェクトを対象とする量子化に関する処理を行うために用いられる複数の第２グループを示す第２グループ情報を生成する。 The generation unit 132E generates second group information indicating a plurality of second groups used to perform processing related to quantization targeting a plurality of objects.

このように、情報処理装置１００Ｅは、複数のオブジェクトを対象とする量子化に関する処理を行うために用いられる第２グループを含む２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing device 100E can generate two types of groups including the second group used to perform processing related to quantization for a plurality of objects, and can generate a group for classifying objects. can do.

生成部１３２Ｅは、量子化の単位となる複数の第２グループを示す第２グループ情報を生成する。 The generation unit 132E generates second group information indicating a plurality of second groups that are units of quantization.

このように、情報処理装置１００Ｅは、量子化の単位となる第２グループを含む２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing apparatus 100E can generate two types of groups including the second group that is the unit of quantization, and can generate groups for classifying objects.

生成部１３２Ｅは、複数のオブジェクトの各々に対応するベクトルを量子化するために用いられる複数の第２グループを示す第２グループ情報を生成する。 The generation unit 132E generates second group information indicating a plurality of second groups used to quantize vectors corresponding to each of the plurality of objects.

このように、情報処理装置１００Ｅは、複数のオブジェクトの各々に対応するベクトルを量子化するために用いられる第２グループを含む２種類のグループを生成することができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing device 100E can generate two types of groups including the second group used to quantize vectors corresponding to each of a plurality of objects, and generate a group for classifying objects. can do.

生成部１３２Ｅは、複数の第２グループの分類後に、複数の第１グループの分類を行う。 After classifying the plurality of second groups, the generation unit 132E performs classification of the plurality of first groups.

このように、情報処理装置１００Ｅは、第２グループの分類、第１グループの分類の順でグループの情報を生成する処理を行うことで、オブジェクトを分類するグループを生成することができる。 In this way, the information processing apparatus 100E can generate groups for classifying objects by performing the process of generating group information in the order of the second group classification and the first group classification.

生成部１３２Ｅは、複数のオブジェクトをクラスタリングすることにより複数の第２グループを示す第２グループ情報を生成する。 The generation unit 132E generates second group information indicating a plurality of second groups by clustering a plurality of objects.

このように、情報処理装置１００Ｅは、複数のオブジェクトをクラスタリングすることにより複数の第２グループの分類を行うことで、オブジェクトを分類するグループを生成することができる。 In this way, the information processing apparatus 100E can generate groups into which objects are classified by classifying the plurality of second groups by clustering the plurality of objects.

取得部１３１は、複数のオブジェクトを検索対象とするインデックスを示すインデックス情報を取得する。生成部１３２Ｅは、インデックス情報を用いて、複数の第２グループを示す第２グループ情報を生成する。 The acquisition unit 131 acquires index information indicating an index that searches for a plurality of objects. The generation unit 132E uses the index information to generate second group information indicating a plurality of second groups.

このように、情報処理装置１００Ｅは、インデックスを用いて複数の第２グループの分類を行う、オブジェクトを分類するグループを生成することができる。 In this way, the information processing apparatus 100E can generate groups for classifying objects by classifying the plurality of second groups using the index.

生成部１３２Ｅは、第２グループを示す第２グループ情報を用いて、複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information indicating a plurality of first groups using second group information indicating a second group.

このように、情報処理装置１００Ｅは、第２グループ情報を用いて、複数の第１グループを示す第１グループ情報を生成することで、第２グループの分類を基に第１グループの分類ができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing device 100E can classify the first group based on the classification of the second group by generating first group information indicating a plurality of first groups using the second group information. , it is possible to generate groups to classify objects.

生成部１３２Ｅは、複数の第２グループをクラスタリングすることにより複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information indicating a plurality of first groups by clustering a plurality of second groups.

このように、情報処理装置１００Ｅは、第２グループをクラスタリングして複数の第１グループを示す第１グループ情報を生成することで、第２グループの分類を基に第１グループの分類ができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing device 100E can classify the first group based on the classification of the second group by clustering the second group and generating first group information indicating a plurality of first groups, Groups can be created to categorize objects.

生成部１３２Ｅは、複数の第１グループの分類後に、複数の第２グループの分類を行う。 After classifying the plurality of first groups, the generation unit 132E performs classification of the plurality of second groups.

このように、情報処理装置１００Ｅは、第１グループの分類、第２グループの分類の順でグループの情報を生成する処理を行うことで、オブジェクトを分類するグループを生成することができる。 In this way, the information processing apparatus 100E can generate groups for classifying objects by performing a process of generating group information in the order of first group classification and second group classification.

生成部１３２Ｅは、複数のオブジェクトをクラスタリングすることにより複数の第１グループを示す第１グループ情報を生成する。 The generation unit 132E generates first group information indicating a plurality of first groups by clustering a plurality of objects.

このように、情報処理装置１００Ｅは、複数のオブジェクトをクラスタリングすることにより複数の第１グループの分類を行うことで、オブジェクトを分類するグループを生成することができる。 In this way, the information processing apparatus 100E can generate groups into which objects are classified by classifying the plurality of first groups by clustering the plurality of objects.

生成部１３２Ｅは、第１グループを示す第１グループ情報を用いて、複数の第２グループを示す第２グループ情報を生成する。 The generation unit 132E generates second group information indicating a plurality of second groups using first group information indicating the first group.

このように、情報処理装置１００Ｅは、第１グループ情報を用いて、複数の第２グループを示す第２グループ情報を生成することで、第１グループの分類を基に第２グループの分類ができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing device 100E can classify the second group based on the classification of the first group by generating second group information indicating a plurality of second groups using the first group information. , it is possible to generate groups to classify objects.

生成部１３２Ｅは、複数の第１グループのうち、少なくとも一の第１グループを分割し、二以上の第２グループを生成することにより、複数の第２グループを示す第２グループ情報を生成する。 The generating unit 132E generates second group information indicating the plurality of second groups by dividing at least one first group among the plurality of first groups and generating two or more second groups.

このように、情報処理装置１００Ｅは、第１グループを分割して複数の第２グループを示す第２グループ情報を生成することで、第１グループの分類を基に第２グループの分類ができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing device 100E can classify the second group based on the classification of the first group by dividing the first group and generating second group information indicating a plurality of second groups. Groups can be created to categorize objects.

生成部１３２Ｅは、各第１グループに属するオブジェクトをクラスタリングし、各第１グループを二以上の第２グループに分割することにより、複数の第２グループを示す第２グループ情報を生成する。 The generation unit 132E generates second group information indicating a plurality of second groups by clustering objects belonging to each first group and dividing each first group into two or more second groups.

このように、情報処理装置１００Ｅは、各第１グループに属するオブジェクトをクラスタリングし、各第１グループを二以上の第２グループに分割して複数の第２グループを示す第２グループ情報を生成することで、第１グループの分類を基に第２グループの分類ができ、オブジェクトを分類するグループを生成することができる。 In this way, the information processing device 100E clusters objects belonging to each first group, divides each first group into two or more second groups, and generates second group information indicating a plurality of second groups. As a result, the second group can be classified based on the first group classification, and groups for classifying objects can be generated.

〔８．ハードウェア構成〕
上述してきた各実施形態に係る情報処理装置１００、１００Ａ、１００Ｂ、１００Ｃ、１００Ｄ、１００Ｅは、例えば図５３に示すような構成のコンピュータ１０００によって実現される。図５３は、情報処理装置の機能を実現するコンピュータの一例を示すハードウェア構成図である。コンピュータ１０００は、ＣＰＵ１１００、ＲＡＭ１２００、ＲＯＭ（Read Only Memory）１３００、ＨＤＤ（Hard Disk Drive）１４００、通信インターフェイス（Ｉ／Ｆ）１５００、入出力インターフェイス（Ｉ／Ｆ）１６００、及びメディアインターフェイス（Ｉ／Ｆ）１７００を有する。 [8. Hardware configuration]
The information processing apparatuses 100, 100A, 100B, 100C, 100D, and 100E according to each of the embodiments described above are realized by, for example, a computer 1000 having a configuration as shown in FIG. 53. FIG. 53 is a hardware configuration diagram showing an example of a computer that implements the functions of the information processing device. The computer 1000 includes a CPU 1100, a RAM 1200, a ROM (Read Only Memory) 1300, an HDD (Hard Disk Drive) 1400, a communication interface (I/F) 1500, an input/output interface (I/F) 1600, and a media interface (I/F). ) has 1700.

ＣＰＵ１１００は、ＲＯＭ１３００またはＨＤＤ１４００に格納されたプログラムに基づいて動作し、各部の制御を行う。ＲＯＭ１３００は、コンピュータ１０００の起動時にＣＰＵ１１００によって実行されるブートプログラムや、コンピュータ１０００のハードウェアに依存するプログラム等を格納する。 CPU 1100 operates based on a program stored in ROM 1300 or HDD 1400, and controls each section. The ROM 1300 stores a boot program executed by the CPU 1100 when the computer 1000 is started, programs depending on the hardware of the computer 1000, and the like.

ＨＤＤ１４００は、ＣＰＵ１１００によって実行されるプログラム、及び、かかるプログラムによって使用されるデータ等を格納する。通信インターフェイス１５００は、ネットワークＮを介して他の機器からデータを受信してＣＰＵ１１００へ送り、ＣＰＵ１１００が生成したデータをネットワークＮを介して他の機器へ送信する。 The HDD 1400 stores programs executed by the CPU 1100, data used by the programs, and the like. Communication interface 1500 receives data from other devices via network N and sends it to CPU 1100, and sends data generated by CPU 1100 to other devices via network N.

ＣＰＵ１１００は、入出力インターフェイス１６００を介して、ディスプレイやプリンタ等の出力装置、及び、キーボードやマウス等の入力装置を制御する。ＣＰＵ１１００は、入出力インターフェイス１６００を介して、入力装置からデータを取得する。また、ＣＰＵ１１００は、生成したデータを入出力インターフェイス１６００を介して出力装置へ出力する。 The CPU 1100 controls output devices such as a display and a printer, and input devices such as a keyboard and mouse via an input/output interface 1600. CPU 1100 obtains data from an input device via input/output interface 1600. Further, CPU 1100 outputs the generated data to an output device via input/output interface 1600.

メディアインターフェイス１７００は、記録媒体１８００に格納されたプログラムまたはデータを読み取り、ＲＡＭ１２００を介してＣＰＵ１１００に提供する。ＣＰＵ１１００は、かかるプログラムを、メディアインターフェイス１７００を介して記録媒体１８００からＲＡＭ１２００上にロードし、ロードしたプログラムを実行する。記録媒体１８００は、例えばＤＶＤ（Digital Versatile Disc）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto-Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等である。 Media interface 1700 reads programs or data stored in recording medium 1800 and provides them to CPU 1100 via RAM 1200. CPU 1100 loads this program from recording medium 1800 onto RAM 1200 via media interface 1700, and executes the loaded program. The recording medium 1800 is, for example, an optical recording medium such as a DVD (Digital Versatile Disc) or a PD (Phase change rewritable disk), a magneto-optical recording medium such as an MO (Magneto-Optical disk), a tape medium, a magnetic recording medium, or a semiconductor memory. etc.

例えば、コンピュータ１０００が第１の実施形態に係る情報処理装置１００として機能する場合、コンピュータ１０００のＣＰＵ１１００は、ＲＡＭ１２００上にロードされたプログラムを実行することにより、制御部１３０の機能を実現する。コンピュータ１０００のＣＰＵ１１００は、これらのプログラムを記録媒体１８００から読み取って実行するが、他の例として、他の装置からネットワークＮを介してこれらのプログラムを取得してもよい。 For example, when the computer 1000 functions as the information processing device 100 according to the first embodiment, the CPU 1100 of the computer 1000 realizes the functions of the control unit 130 by executing a program loaded onto the RAM 1200. The CPU 1100 of the computer 1000 reads these programs from the recording medium 1800 and executes them, but as another example, these programs may be acquired from another device via the network N.

以上、本願の実施形態のいくつかを図面に基づいて詳細に説明したが、これらは例示であり、発明の開示の行に記載の態様を始めとして、当業者の知識に基づいて種々の変形、改良を施した他の形態で本発明を実施することが可能である。 As mentioned above, some embodiments of the present application have been described in detail based on the drawings, but these are merely examples, and various modifications and variations can be made based on the knowledge of those skilled in the art, including the embodiments described in the disclosure section of the invention. It is possible to carry out the invention in other forms with modifications.

〔９．その他〕
また、上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文書中や図面中で示した処理手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。例えば、各図に示した各種情報は、図示した情報に限られない。 [9. others〕
Furthermore, among the processes described in the above embodiments, all or part of the processes described as being performed automatically can be performed manually, or the processes described as being performed manually can be performed manually. All or part of the process can also be performed automatically using known methods. In addition, information including the processing procedures, specific names, and various data and parameters shown in the above documents and drawings may be changed arbitrarily, unless otherwise specified. For example, the various information shown in each figure is not limited to the illustrated information.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 Furthermore, each component of each device shown in the drawings is functionally conceptual, and does not necessarily need to be physically configured as shown in the drawings. In other words, the specific form of distributing and integrating each device is not limited to what is shown in the diagram, and all or part of the devices can be functionally or physically distributed or integrated in arbitrary units depending on various loads and usage conditions. Can be integrated and configured.

また、上述してきた各実施形態に記載された各処理は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。 Furthermore, the processes described in the embodiments described above can be combined as appropriate within a range that does not conflict with the process contents.

また、上述してきた「部（section、module、unit）」は、「手段」や「回路」などに読み替えることができる。例えば、取得部は、取得手段や取得回路に読み替えることができる。 Further, the above-mentioned "section, module, unit" can be read as "means", "circuit", etc. For example, the acquisition unit can be read as an acquisition means or an acquisition circuit.

１情報処理システム
１０端末装置
５０情報提供装置
１００、１００Ａ、１００Ｂ、１００Ｃ、１００Ｄ、１００Ｅ情報処理装置
１２０、１２０Ａ、１２０Ｂ、１２０Ｃ記憶部
１２１オブジェクト情報記憶部
１２２グラフ情報記憶部
１２３、１２３Ａ量子化情報記憶部
１２４コードブック情報記憶部
１２５、１２５Ａブロブ情報記憶部
１２６、１２６Ａブロブ連結グラフ情報記憶部
１３０、１３０Ａ、１３０Ｂ、１３０Ｃ、１３０Ｄ、１３０Ｅ制御部
１３１取得部
１３２、１３２Ａ、１３２Ｂ、１３２Ｃ、１３２Ｅ生成部（処理部）
１３３、１３３Ａ、１３３Ｂ、１３３Ｃ、１３３Ｄ、１３３Ｅ検索処理部
１３４提供部 1 Information processing system 10 Terminal device 50 Information providing device 100, 100A, 100B, 100C, 100D, 100E Information processing device 120, 120A, 120B, 120C Storage unit 121 Object information storage unit 122 Graph information storage unit 123, 123A Quantization information Storage unit 124 Codebook information storage unit 125, 125A Blob information storage unit 126, 126A Blob connected graph information storage unit 130, 130A, 130B, 130C, 130D, 130E Control unit 131 Acquisition unit 132, 132A, 132B, 132C, 132E Generation Department (processing department)
133, 133A, 133B, 133C, 133D, 133E Search processing unit 134 Providing unit

Claims

an acquisition unit that acquires object information indicating multiple objects to be searched for;
First group information indicating a plurality of first groups that are a first number of groups for classifying the plurality of objects based on the object information acquired by the acquisition unit is different from the plurality of first groups. a generation unit that generates second group information indicating a plurality of second groups, which are groups that are classified and are a second number of groups that are larger than the first number of groups that classify the plurality of objects;
An information processing device comprising:

The generation unit is
The first group information is generated that is a group that classifies objects belonging to the same second group among the plurality of second groups into the same first group among the plurality of objects. information processing equipment.

The generation unit is
Generating the first group information indicating the plurality of first groups in which a group of objects belonging to each of two or more second groups among the plurality of second groups is classified into one first group. The information processing device according to claim 1.

The generation unit is
The information processing apparatus according to claim 1, wherein the information processing apparatus generates the first group information indicating the plurality of first groups used for performing batch processing in a search process targeting the plurality of objects.

The generation unit is
The information processing apparatus according to claim 4, wherein the information processing apparatus generates the first group information indicating the plurality of first groups used to collectively calculate distances regarding the plurality of objects.

The generation unit is
5. The method according to claim 4, further comprising: generating the first group information indicating the plurality of first groups used for parallel processing calculation of a distance between a search query and an object used in the search process. Information processing device.

The generation unit is
The information processing apparatus according to claim 1, wherein the information processing apparatus generates the second group information indicating the plurality of second groups used to perform processing related to quantization for the plurality of objects.

The generation unit is
The information processing apparatus according to claim 7, wherein the information processing apparatus generates the second group information indicating the plurality of second groups that are the units of the quantization.

The generation unit is
The information processing apparatus according to claim 7, wherein the second group information indicating the plurality of second groups used for quantizing the vectors corresponding to each of the plurality of objects is generated.

The generation unit is
The information processing apparatus according to claim 1, wherein the plurality of first groups are classified after the plurality of second groups are classified.

The generation unit is
The information processing apparatus according to claim 10, wherein the second group information indicating the plurality of second groups is generated by clustering the plurality of objects.

The acquisition unit includes:
obtaining index information indicating an index for searching the plurality of objects;
The generation unit is
The information processing apparatus according to claim 10, wherein the second group information indicating the plurality of second groups is generated using the index information.

The generation unit is
The information processing apparatus according to claim 10, wherein the first group information indicating the plurality of first groups is generated using the second group information indicating the second group.

The generation unit is
The information processing apparatus according to claim 10, wherein the first group information indicating the plurality of first groups is generated by clustering the plurality of second groups.

The generation unit is
The information processing apparatus according to claim 10, wherein the plurality of second groups are classified after the plurality of first groups are classified.

The generation unit is
The information processing apparatus according to claim 15, wherein the first group information indicating the plurality of first groups is generated by clustering the plurality of objects.

The generation unit is
The information processing apparatus according to claim 15, wherein the second group information indicating the plurality of second groups is generated using the first group information indicating the first group.

The generation unit is
The second group information indicating the plurality of second groups is generated by dividing at least one first group among the plurality of first groups and generating two or more second groups. The information processing device according to claim 15.

The generation unit is
The second group information indicating the plurality of second groups is generated by clustering objects belonging to each first group and dividing each first group into two or more second groups. The information processing device according to claim 15.

An information processing method performed by a computer, the method comprising:
an acquisition step of acquiring object information indicating a plurality of objects to be searched for;
First group information indicating a plurality of first groups that are a first number of groups for classifying the plurality of objects based on the object information acquired in the acquisition step is different from the plurality of first groups. a generation step of generating second group information indicating a plurality of second groups, which are groups to be classified and are a second number of groups larger than the first number for classifying the plurality of objects;
An information processing method characterized by comprising:

an acquisition procedure for acquiring object information indicating multiple objects to be searched for;
First group information indicating a plurality of first groups that are a first number of groups for classifying the plurality of objects based on the object information acquired by the acquisition procedure is different from the plurality of first groups. a generation procedure for generating second group information indicating a plurality of second groups, which are groups that are classified and are a second number of groups that are larger than the first number of groups that classify the plurality of objects;
An information processing program that causes a computer to execute.