JP7122120B2

JP7122120B2 - Information processing device, information processing method, and information processing program

Info

Publication number: JP7122120B2
Application number: JP2018033936A
Authority: JP
Inventors: 博基谷口; 雅二郎岩崎
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2018-02-27
Filing date: 2018-02-27
Publication date: 2022-08-19
Anticipated expiration: 2038-02-27
Also published as: JP2019149057A

Description

本発明は、情報処理装置、情報処理方法、及び情報処理プログラムに関する。 The present invention relates to an information processing device, an information processing method, and an information processing program.

従来、種々の情報を抽出する技術が提供されている。例えば、他のカテゴリーに属する概念間の相対的な関係を考慮した上で、概念間の類似性を推定する技術が提供されている。 Conventionally, techniques for extracting various information have been provided. For example, techniques are provided for estimating similarities between concepts in consideration of relative relationships between concepts belonging to other categories.

特開２００７－２１３１５１号公報Japanese Patent Application Laid-Open No. 2007-213151

岩崎雅二郎 "木構造型インデックスを利用した近似k最近傍グラフによる近傍検索", 情報処理学会論文誌, 2011/2, Vol. 52, No. 2. pp.817-828.Masajiro Iwasaki, "Neighborhood Retrieval Using Approximate k-Nearest Neighbor Graph Using Tree Structured Index", Transactions of Information Processing Society of Japan, 2011/2, Vol. 52, No. 2. pp.817-828.

しかしながら、上記の従来技術では、所望の対象と所定の対応関係を有する対象を適切に抽出することが難しい場合がある。例えば、他のカテゴリーに属する概念間の類似性を推定するだけでは、所望の情報を抽出することが難しい。例えば、同じカテゴリーに属する情報の類似性を推定することができるとは限らず、このような場合、所望の情報を抽出することが難しい。 However, with the conventional technology described above, it may be difficult to appropriately extract an object having a predetermined correspondence relationship with a desired object. For example, it is difficult to extract desired information only by estimating similarities between concepts belonging to other categories. For example, it is not always possible to estimate the similarity of information belonging to the same category, and in such cases, it is difficult to extract desired information.

本願は、上記に鑑みてなされたものであって、所望の対象と所定の対応関係を有する対象を適切に抽出する情報処理装置、情報処理方法、及び情報処理プログラムを提供することを目的とする。 The present application has been made in view of the above, and aims to provide an information processing apparatus, an information processing method, and an information processing program for appropriately extracting an object having a predetermined correspondence relationship with a desired object. .

本願に係る情報処理装置は、人を要素とする複数の対象の各々に対応する複数のノードが、前記複数の対象の類似性に応じて連結されたグラフ情報と、一の対象に関する対象情報を取得する取得部と、前記取得部により取得された前記グラフ情報の前記複数のノードのうち、所定の基準に基づいて決定された前記グラフ情報の検索の起点となる起点ノードを起点として、前記グラフ情報を検索することにより、前記複数の対象のうち、前記一の対象と所定の対応関係を有する対象である対応対象を抽出する検索部と、を備えたことを特徴とする。 An information processing apparatus according to the present application includes graph information in which a plurality of nodes corresponding to each of a plurality of objects having people as elements are connected according to the similarity of the plurality of objects, and object information related to one object. an acquiring unit for acquiring, from among the plurality of nodes of the graph information acquired by the acquiring unit, a starting node determined based on a predetermined criterion as a starting point for searching the graph information; and a search unit that extracts, from among the plurality of targets, a corresponding target that has a predetermined correspondence relationship with the one target by searching for information.

実施形態の一態様によれば、所望の対象と所定の対応関係を有する対象を適切に抽出することができるという効果を奏する。 According to one aspect of the embodiment, it is possible to appropriately extract an object having a predetermined correspondence relationship with a desired object.

図１は、実施形態に係る情報処理の一例を示す図である。FIG. 1 is a diagram illustrating an example of information processing according to an embodiment. 図２は、実施形態に係る情報処理システムの構成例を示す図である。FIG. 2 is a diagram illustrating a configuration example of an information processing system according to the embodiment; 図３は、実施形態に係る情報処理装置の構成例を示す図である。FIG. 3 is a diagram illustrating a configuration example of an information processing apparatus according to the embodiment; 図４は、実施形態に係る対象情報記憶部の一例を示す図である。4 is a diagram illustrating an example of a target information storage unit according to the embodiment; FIG. 図５は、実施形態に係るインデックス情報記憶部の一例を示す図である。FIG. 5 is a diagram illustrating an example of an index information storage unit according to the embodiment; 図６は、実施形態に係るグラフ情報記憶部の一例を示す図である。6 is a diagram illustrating an example of a graph information storage unit according to the embodiment; FIG. 図７は、実施形態に係るモデル情報記憶部の一例を示す図である。7 is a diagram illustrating an example of a model information storage unit according to the embodiment; FIG. 図８は、実施形態に係る情報処理の一例を示すフローチャートである。FIG. 8 is a flowchart illustrating an example of information processing according to the embodiment. 図９は、実施形態に係る生成処理の一例を示すフローチャートである。FIG. 9 is a flowchart illustrating an example of generation processing according to the embodiment. 図１０は、実施形態に係る特徴量の抽出の一例を示す図である。FIG. 10 is a diagram illustrating an example of feature amount extraction according to the embodiment. 図１１は、グラフ情報を用いた検索処理の一例を示すフローチャートである。FIG. 11 is a flowchart illustrating an example of search processing using graph information. 図１２は、グループのグラフ情報を概念的に示す図である。FIG. 12 is a diagram conceptually showing group graph information. 図１３は、情報処理装置の機能を実現するコンピュータの一例を示すハードウェア構成図である。FIG. 13 is a hardware configuration diagram showing an example of a computer that implements the functions of the information processing apparatus.

以下に、本願に係る情報処理装置、情報処理方法、及び情報処理プログラムを実施するための形態（以下、「実施形態」と呼ぶ）について図面を参照しつつ詳細に説明する。なお、この実施形態により本願に係る情報処理装置、情報処理方法、及び情報処理プログラムが限定されるものではない。また、以下の各実施形態において同一の部位には同一の符号を付し、重複する説明は省略される。 Hereinafter, modes for implementing an information processing apparatus, an information processing method, and an information processing program (hereinafter referred to as "embodiments") according to the present application will be described in detail with reference to the drawings. The information processing apparatus, information processing method, and information processing program according to the present application are not limited to this embodiment. Also, in each of the following embodiments, the same parts are denoted by the same reference numerals, and overlapping descriptions are omitted.

（実施形態）
〔１．情報処理〕
図１を用いて、実施形態に係る情報処理の一例について説明する。図１は、実施形態に係る情報処理の一例を示す図である。図１では、情報処理装置１００が人を要素とする対象の一例として、複数の個人（一人の人）をグラフ構造化したグラフデータ（グラフ情報）を検索することにより、類似する対象（以下、「類似対象」ともいう）を抽出する場合を示す。なお、対象は、個人（一人の人）に限らず、人を要素とするものであれば、種々の物であってもよい。図１では、情報処理装置１００は、各対象に対応するベクトルデータ（「ベクトル情報」や、単に「ベクトル」ともいう）を用いて対象をグラフ構造化したグラフ情報を用いる場合を一例として説明する。 (embodiment)
[1. information processing]
An example of information processing according to the embodiment will be described with reference to FIG. FIG. 1 is a diagram illustrating an example of information processing according to an embodiment. In FIG. 1, the information processing apparatus 100 retrieves graph data (graph information) in which a plurality of individuals (one person) are graph-structured, as an example of an object whose element is a person. (also referred to as "similar target") is extracted. The object is not limited to an individual (one person), and may be various objects as long as they are human elements. In FIG. 1, the information processing apparatus 100 uses vector data (also referred to as “vector information” or simply “vector”) corresponding to each object to use graph information in which objects are graph-structured. .

なお、情報処理装置１００が用いる情報は、ベクトルに限らず、人を要素とする各対象の類似性を表現可能な情報であれば、どのような形式の情報であってもよい。例えば、情報処理装置１００は、各対象に対応する所定のデータや値を用いて対象をグラフ構造化したグラフ情報を用いてもよい。例えば、情報処理装置１００は、人を要素とする各対象から生成された所定の数値（例えば２進数の値や１６進数の値）を用いて対象をグラフ構造化したグラフ情報を用いてもよい。例えば、ベクトルに代えて、データ間の距離（類似度）が定義されていれば任意の形態のデータであっても良い。なお、図１の例では、人を要素とする対象の各々が、個人（一人の人）である場合を示すが、対象は人を要素とするものであれば、個人に限らず、各対象が複数人を含むグループ（ユーザ群）であってもよい。すなわち、対象（オブジェクト）は、人を要素とするものであれば、どのような対象であってもよい。 The information used by the information processing apparatus 100 is not limited to vectors, and may be information of any format as long as it is information capable of expressing the similarity of each object whose element is a person. For example, the information processing apparatus 100 may use graph information in which objects are graph-structured using predetermined data and values corresponding to each object. For example, the information processing apparatus 100 may use graph information in which an object is graph-structured using a predetermined numerical value (for example, a binary value or a hexadecimal value) generated from each object whose element is a person. . For example, instead of vectors, any form of data may be used as long as the distance (similarity) between data is defined. In the example of FIG. 1, each of the objects whose elements are people is an individual (one person). may be a group (user group) including a plurality of people. That is, the target (object) may be any target as long as it has a person as an element.

〔１－１．グラフ情報について〕
また、情報処理装置１００は、図１中のグラフ情報ＧＲ１１に示すように、各ベクトル（ノード）が有向エッジにより連結されたグラフデータを用いて情報処理を行う。なお、図１中のグラフ情報ＧＲ１１に示すようなグラフ情報は、情報処理装置１００が生成してもよいし、情報処理装置１００は、図１中のグラフ情報ＧＲ１１に示すようなグラフ情報を情報提供装置５０（図２参照）等の他の外部装置から取得してもよい。 [1-1. About graph information]
The information processing apparatus 100 also performs information processing using graph data in which vectors (nodes) are connected by directed edges, as indicated by graph information GR11 in FIG. Graph information as indicated by graph information GR11 in FIG. It may be acquired from another external device such as the providing device 50 (see FIG. 2).

また、ここでいう、有向エッジとは、一方向にしかデータを辿れないエッジを意味する。以下では、エッジにより辿る元、すなわち始点となるノードを参照元とし、エッジにより辿る先、すなわち終点となるノードを参照先とする。例えば、所定のノード「Ａ」から所定のノード「Ｂ」に連結される有向エッジとは、参照元をノード「Ａ」とし、参照先をノード「Ｂ」とするエッジであることを示す。なお、各ノードを連結するエッジは、有向エッジに限らず、種々のエッジであってもよい。例えば、各ノードを連結するエッジは、ノードを連結する方向のないエッジであってもよい。例えば、各ノードを連結するエッジは、相互に参照可能なエッジであってもよい。例えば、各ノードを連結するエッジは、全て双方向エッジであってもよい。 Also, the term "directed edge" used herein means an edge that allows data to be traced in only one direction. In the following description, the source traced by the edge, that is, the node that is the starting point, is the reference source, and the destination that is traced by the edge, that is, the node that is the end point, is the reference destination. For example, a directed edge connecting a predetermined node "A" to a predetermined node "B" indicates an edge having a reference source of node "A" and a reference destination of node "B". Edges connecting nodes are not limited to directed edges, and may be various edges. For example, the edge connecting each node may be a directionless edge connecting the nodes. For example, the edges connecting each node may be mutually referable edges. For example, the edges connecting each node may all be bidirectional edges.

例えば、このようにノード「Ａ」を参照元とするエッジをノード「Ａ」の出力エッジという。また、例えば、このようにノード「Ｂ」を参照先とするエッジをノード「Ｂ」の入力エッジという。すなわち、ここでいう出力エッジ及び入力エッジとは、一の有向エッジをその有向エッジが連結する２つのノードのうち、いずれのノードを中心として捉えるかの相違であり、一の有向エッジが出力エッジ及び入力エッジになる。すなわち、出力エッジ及び入力エッジは、相対的な概念であって、一の有向エッジについて、参照元となるノードを中心として捉えた場合に出力エッジとなり、参照先となるノードを中心として捉えた場合に入力エッジとなる。なお、本実施形態においては、エッジについては、出力エッジや入力エッジ等の有向エッジを対象とするため、以下では、有向エッジを単に「エッジ」と記載する場合がある。 For example, an edge having node "A" as a reference in this way is called an output edge of node "A". Further, for example, an edge having the node "B" as a reference in this way is referred to as an input edge of the node "B". In other words, the term "output edge" and "input edge" as used herein refers to the difference in which of the two nodes to which one directed edge connects is regarded as the center. are the output and input edges. In other words, an output edge and an input edge are relative concepts. For one directed edge, it is an output edge when the reference node is regarded as the center, and the reference node is regarded as the center. is an input edge. Note that, in the present embodiment, directed edges such as output edges and input edges are targeted for edges, and therefore directed edges may be simply referred to as "edges" below.

例えば、情報処理装置１００は、数百万～数億等の単位の膨大な対象（個人やグループ）に対応するノードを対象に処理を行うが、図面においてはその一部のみを図示する。図１の例では、説明を簡単にするために、８個のノードを図示して処理の概要を説明する。例えば、情報処理装置１００は、図１中のグラフ情報ＧＲ１１に示すように、ノードＮ１、Ｎ２、Ｎ３等に示すような複数のノード（ベクトル）を含むグラフ情報を取得する。また、図１の例では、グラフ情報ＧＲ１１における各ノードは、そのノードとの間の距離が近い方から所定数のノードへのエッジ（出力エッジ）が連結される。例えば、所定数は、目的や用途等に応じて、２や５や１０や１００等の種々の値であってもよい。例えば、所定数が２である場合、ノードＮ１からは、ノードＮ１からの距離が最も近いノード及び２番目に距離が近い２つのノードに出力エッジが連結される。なお、類似度を示す指標としての距離は、ベクトル（Ｎ次元ベクトル）間の距離として適用可能であれば、どのような距離であってもよく、例えば、ユークリッド距離やマハラノビス距離やコサイン距離等の種々の距離が用いられてもよい。 For example, the information processing apparatus 100 processes nodes corresponding to a huge number of objects (individuals and groups) in units of millions to hundreds of millions, but only some of them are shown in the drawings. In the example of FIG. 1, to simplify the explanation, eight nodes are illustrated to explain the outline of the processing. For example, the information processing apparatus 100 acquires graph information including a plurality of nodes (vectors) such as nodes N1, N2, N3, etc., as indicated by graph information GR11 in FIG. Also, in the example of FIG. 1, each node in the graph information GR11 is connected to a predetermined number of edges (output edges) from the node closest to the node. For example, the predetermined number may be various values such as 2, 5, 10, 100, etc., depending on the purpose and application. For example, if the predetermined number is 2, output edges from the node N1 are connected to the node closest to the node N1 and the two nodes closest to the node N1. Note that the distance as an index indicating similarity may be any distance as long as it can be applied as the distance between vectors (N-dimensional vectors). Various distances may be used.

また、このように「ノードＮ＊（＊は任意の数値）」と記載した場合、そのノードはノードＩＤ「Ｎ＊」により識別されるノードであることを示す。例えば、「ノードＮ１」と記載した場合、そのノードはノードＩＤ「Ｎ１」により識別されるノードである。 Also, when "node N* (* is an arbitrary number)" is described in this way, it indicates that the node is identified by the node ID "N*". For example, when "node N1" is described, the node is identified by the node ID "N1".

また、図１中のグラフ情報ＧＲ１１では、ノードＮ１０は、ノードＮ７へ向かう有向エッジであるエッジＥ７が連結される。すなわち、ノードＮ１０は、ノードＮ７とエッジＥ７により連結される。このように「エッジＥ＊（＊は任意の数値）」と記載した場合、そのエッジはエッジＩＤ「Ｅ＊」により識別されるエッジであることを示す。例えば、「エッジＥ１１」と記載した場合、そのエッジはエッジＩＤ「Ｅ１１」により識別されるエッジである。例えば、ノードＮ１０を参照元とし、ノードＮ７を参照先として連結されるエッジＥ７により、ノードＮ１０からノードＮ７に辿ることが可能となる。この場合、有向エッジであるエッジＥ７は、ノードＮ１０を中心として識別される場合、出力エッジとなり、ノードＮ７を中心として識別される場合、入力エッジとなる。また、図１のグラフ情報ＧＲ１１中の双方向矢印は、両方のノードから他方のノードへの有向エッジが連結されることを示す。例えば、グラフ情報ＧＲ１１中のノードＮ２とノードＮ４５１との間の双方向矢印は、ノードＮ２からノードＮ４５１へ向かう有向エッジと、ノードＮ４５１からノードＮ２へ向かう有向エッジとの２つのエッジが連結されることを示す。 In graph information GR11 in FIG. 1, node N10 is connected to edge E7, which is a directed edge toward node N7. That is, node N10 is connected to node N7 by edge E7. When "edge E* (* is an arbitrary numerical value)" is described in this way, it indicates that the edge is identified by the edge ID "E*". For example, when "edge E11" is described, the edge is identified by the edge ID "E11". For example, an edge E7 connecting the node N10 as a reference source and the node N7 as a reference destination enables tracing from the node N10 to the node N7. In this case, the directed edge E7 is an output edge when identified with the node N10 as the center, and an input edge when identified with the node N7 as the center. Also, the bidirectional arrows in the graph information GR11 of FIG. 1 indicate that directed edges from both nodes to the other node are connected. For example, a two-way arrow between node N2 and node N451 in graph information GR11 connects two edges: a directed edge from node N2 to node N451 and a directed edge from node N451 to node N2. indicates that

また、図１中のグラフ情報ＧＲ１１は、ユークリッド空間であってもよい。また、図１に示すグラフ情報ＧＲ１１は、各ベクトル間の距離等の説明のための概念的な図であり、グラフ情報ＧＲ１１は、多次元空間である。例えば、図１に示すグラフ情報ＧＲ１１は、平面上に図示するため２次元の態様にて図示されるが、例えば１００次元や１０００次元等の多次元空間であるものとする。 Further, the graph information GR11 in FIG. 1 may be Euclidean space. Graph information GR11 shown in FIG. 1 is a conceptual diagram for explaining distances between vectors, and the graph information GR11 is a multidimensional space. For example, the graph information GR11 shown in FIG. 1 is illustrated in a two-dimensional manner because it is illustrated on a plane, but it is assumed to be a multi-dimensional space such as 100-dimensional space or 1000-dimensional space.

ここで、ベクトルデータ間の距離は、対象の類似性を示し、距離が近いほど類似している。本実施形態においては、グラフ情報ＧＲ１１における各ノードの距離を対応する各オブジェクト間の類似度とする。例えば、各ノードに対応する対象（個人）の類似性が、グラフ情報ＧＲ１１内におけるノード間の距離として写像されているものとする。例えば、各ノードに対応する概念間の類似度が各ノード間の距離に写像されているものとする。ここで、図１に示す例においては、グラフ情報ＧＲ１１における各ノード間の距離が短いオブジェクト同士の類似度が高く、グラフ情報ＧＲ１１における各ノード間の距離が長いオブジェクト同士の類似度が低い。例えば、図１中のグラフ情報ＧＲ１１において、ノードＩＤ「Ｎ３５」により識別されるノードと、ノードＩＤ「Ｎ６９３」により識別されるノードとは近接している、すなわち距離が短い。そのため、ノードＩＤ「Ｎ３５」により識別されるノードに対応するオブジェクトと、ノードＩＤ「Ｎ６９３」により識別されるノードに対応するオブジェクトとは類似度が高いことを示す。 Here, the distance between vector data indicates the similarity of objects, and the closer the distance, the more similar. In this embodiment, the distance of each node in the graph information GR11 is used as the degree of similarity between corresponding objects. For example, it is assumed that the similarity of the object (individual) corresponding to each node is mapped as the distance between nodes in the graph information GR11. For example, it is assumed that the similarity between concepts corresponding to each node is mapped to the distance between each node. Here, in the example shown in FIG. 1, the similarity between objects with short distances between nodes in the graph information GR11 is high, and the similarity between objects with long distances between nodes in the graph information GR11 is low. For example, in the graph information GR11 in FIG. 1, the node identified by the node ID "N35" and the node identified by the node ID "N693" are close to each other, that is, the distance between them is short. Therefore, it indicates that the object corresponding to the node identified by the node ID "N35" and the object corresponding to the node identified by the node ID "N693" have a high degree of similarity.

また、例えば、図１中のグラフ情報ＧＲ１１において、ノードＩＤ「Ｎ７」により識別されるノードと、ノードＩＤ「Ｎ２」により識別されるノードとは遠隔にある、すなわち距離が長い。そのため、ノードＩＤ「Ｎ７」により識別されるノードに対応するオブジェクトと、ノードＩＤ「Ｎ２」により識別されるノードに対応するオブジェクトとは類似度が低いことを示す。 Further, for example, in the graph information GR11 in FIG. 1, the node identified by the node ID "N7" and the node identified by the node ID "N2" are remote, that is, the distance is long. Therefore, the similarity between the object corresponding to the node identified by the node ID "N7" and the object corresponding to the node identified by the node ID "N2" is low.

〔１－２．ベクトルの生成例〕
また、ここでいう、各ノード（ベクトル）は、各オブジェクト（対象）に対応する。対象情報が様々な対象（個人やグループ）の状態を示す数値データであれば、それをベクトルデータとして、そのまま扱うことも可能である。しかし、対象情報が、テキスト、画像、または、音声であったりする場合には、ベクトルデータへの変換が必要となる。そこで、図１の例では、各対象（個人）の対象情報（個人情報）から抽出された特徴量により生成された多次元（Ｎ次元）のベクトルがオブジェクトであってもよい。 [1-2. Vector generation example]
Each node (vector) here corresponds to each object (target). If the target information is numerical data indicating the states of various targets (individuals and groups), it can be treated as it is as vector data. However, if the target information is text, image, or voice, conversion to vector data is required. Therefore, in the example of FIG. 1, the object may be a multi-dimensional (N-dimensional) vector generated from the feature amount extracted from the target information (personal information) of each target (individual).

例えば、情報処理装置１００は、対象情報の特徴を抽出するモデルを用いて各対象の対象情報からＮ次元ベクトルを生成してもよい。図１の例では、情報処理装置１００は、モデル情報記憶部１２４（図７参照）に示すように、モデルＩＤ「Ｍ１」により識別されるモデル（モデルＭ１）を用いて、各対象の対象情報からベクトルを生成する。上記のように、「モデルＭ＊（＊は任意の数値）」と記載した場合、そのモデルはモデルＩＤ「Ｍ＊」により識別されるモデルであることを示す。例えば、「モデルＭ１」と記載した場合、そのモデルはモデルＩＤ「Ｍ１」により識別されるモデルである。また、モデル情報記憶部１２４に示すように、モデルＭ１は用途「特徴抽出（個人）」、すなわち個人のデータ（対象情報）からの特徴抽出のために用いられるモデルであり、その具体的なモデルデータが「モデルデータＭＤＴ１」であることを示す。 For example, the information processing apparatus 100 may generate an N-dimensional vector from the target information of each target using a model for extracting features of the target information. In the example of FIG. 1, the information processing apparatus 100 uses the model (model M1) identified by the model ID "M1" as shown in the model information storage unit 124 (see FIG. 7) to obtain the target information of each target. Generate a vector from . As described above, when "model M* (* is an arbitrary numerical value)" is described, it indicates that the model is identified by the model ID "M*". For example, when "model M1" is described, the model is identified by the model ID "M1". Further, as shown in the model information storage unit 124, the model M1 is a model used for use "feature extraction (individual)", that is, for feature extraction from personal data (target information). It indicates that the data is "model data MDT1".

例えば、情報処理装置１００は、モデルＭ１に任意の対象（個人）の対象情報を入力することにより、モデルＭ１中の各要素（ニューロン）の値を演算し、入力した対象情報と同様の情報を出力する。例えば、情報処理装置１００は、中間層の各要素（ニューロン）の値を特徴量として抽出し、各対象（個人）に対応するＮ次元のベクトルデータを生成してもよい。 For example, by inputting target information of an arbitrary target (individual) into the model M1, the information processing apparatus 100 calculates the value of each element (neuron) in the model M1, and obtains information similar to the input target information. Output. For example, the information processing apparatus 100 may extract the value of each element (neuron) of the intermediate layer as a feature amount, and generate N-dimensional vector data corresponding to each target (individual).

ここで、図１０を用いて、各対象に対応するベクトルデータの生成の一例を示す。図１０は、実施形態に係る特徴量の抽出の一例を示す図である。図１０は、モデルＭ１の概念図である。なお、図１０では、各要素（ニューロン）の各接続関係を示す線の図示を省略する。図１０に示すように、モデルＭ１は、入力層ＩＬと、中間層ＣＬと、出力層ＯＬとを含む。例えば、モデルＭ１の入力層ＩＬは、対象情報が入力される層である。また、出力層ＯＬは、入力層ＩＬへの入力に応じて、入力された対象情報と同様の情報を出力される層である。 Here, an example of generation of vector data corresponding to each target is shown using FIG. FIG. 10 is a diagram illustrating an example of feature amount extraction according to the embodiment. FIG. 10 is a conceptual diagram of the model M1. Note that in FIG. 10, illustration of lines indicating respective connection relationships of respective elements (neurons) is omitted. As shown in FIG. 10, the model M1 includes an input layer IL, an intermediate layer CL, and an output layer OL. For example, the input layer IL of the model M1 is a layer into which target information is input. Also, the output layer OL is a layer that outputs information similar to the input target information in response to input to the input layer IL.

また、例えば、中間層ＣＬの中央部の最も圧縮された圧縮層ＲＰは、入力された対象情報の特徴を表現する層である。例えば、モデルＭ１の中間層ＣＬにおいて、入力層ＩＬから圧縮層ＲＰまでの間は、エンコードの処理を行う部分に対応する。モデルＭ１の中間層ＣＬにおいて、入力層ＩＬから圧縮層ＲＰまでの間は、入力された対象情報の特徴を圧縮する処理を行う部分に対応する。例えば、モデルＭ１の中間層ＣＬにおいて、圧縮層ＲＰから出力層ＯＬまでの間は、デコードの処理を行う部分に対応する。モデルＭ１の中間層ＣＬにおいて、圧縮層ＲＰから出力層ＯＬまでの間は、圧縮された対象情報を復元する処理を行う部分に対応する。 Also, for example, the most compressed layer RP in the central portion of the intermediate layer CL is a layer that expresses the characteristics of the input target information. For example, in the intermediate layer CL of the model M1, the portion from the input layer IL to the compression layer RP corresponds to the portion where the encoding process is performed. In the intermediate layer CL of the model M1, the portion from the input layer IL to the compression layer RP corresponds to a portion for performing processing for compressing the features of the input target information. For example, in the intermediate layer CL of the model M1, the portion from the compression layer RP to the output layer OL corresponds to a portion that performs decoding processing. In the intermediate layer CL of the model M1, the portion from the compression layer RP to the output layer OL corresponds to a portion that performs processing for restoring compressed target information.

例えば、情報処理装置１００は、圧縮層ＲＰに含まれるニューロンＮＬ１やニューロンＮＬ２等の情報をベクトルに用いてもよい。例えば、情報処理装置１００は、ある対象（個人やグループ）の対象情報が入力された場合に、算出されるニューロンＮＬ１に対応する値ＶＥ１やニューロンＮＬ２に対応する値ＶＥ２をベクトルの要素（一の次元の値）として抽出してもよい。例えば、情報処理装置１００は、ある対象の対象情報が入力された場合に、算出されるニューロンＮＬ１に対応する値ＶＥ１をその対象のベクトルの１次元目の要素として抽出してもよい。また、例えば、情報処理装置１００は、ある対象の対象情報が入力された場合に、算出されるニューロンＮＬ２に対応する値ＶＥ２をその対象のベクトルの２次元目の要素として抽出してもよい。このように、情報処理装置１００は、各対象の対象情報をモデルＭ１に入力することにより、各対象に対応するベクトルを生成してもよい。なお、情報処理装置１００は、各対象に対応するベクトルを情報提供装置５０等の他の外部装置から取得してもよい。なお、情報処理装置１００は、ベクトルの各要素として、各ニューロンに対応する値自体を用いてもよいし、各ニューロンに対応する値に所定の係数を乗算した値を用いてもよい。また、図１の例では説明を簡単にするために、ベクトルの各要素（値）が整数である場合を示すが、ベクトルの各要素（値）は、小数点以下の数値を含む実数であってもよい。 For example, the information processing apparatus 100 may use the information of the neuron NL1, the neuron NL2, etc. included in the compression layer RP as the vector. For example, when target information of a certain target (individual or group) is input, the information processing apparatus 100 converts the calculated value VE1 corresponding to the neuron NL1 and the calculated value VE2 corresponding to the neuron NL2 to the elements of the vector (one dimension value). For example, when target information of a certain target is input, the information processing apparatus 100 may extract the calculated value VE1 corresponding to the neuron NL1 as the first-dimensional element of the target vector. Further, for example, when target information of a certain target is input, the information processing apparatus 100 may extract the calculated value VE2 corresponding to the neuron NL2 as the second-dimensional element of the target vector. In this way, the information processing apparatus 100 may generate a vector corresponding to each target by inputting target information of each target into the model M1. Note that the information processing apparatus 100 may acquire vectors corresponding to each target from another external apparatus such as the information providing apparatus 50 . Note that the information processing apparatus 100 may use the value itself corresponding to each neuron, or may use a value obtained by multiplying the value corresponding to each neuron by a predetermined coefficient, as each element of the vector. In the example of FIG. 1, each element (value) of the vector is an integer to simplify the explanation. good too.

なお、情報処理装置１００は、圧縮層ＲＰの要素（ニューロン）に限らず、中間層ＣＬ中の他の要素（ニューロン）の情報をベクトルに用いてもよい。例えば、情報処理装置１００は、エンコード部分のニューロンＮＬ３やデコード部分のニューロンＮＬ４等の情報をベクトルに用いてもよい。例えば、情報処理装置１００は、ある対象の対象情報が入力された場合に、算出されるニューロンＮＬ３に対応する値ＶＥ３やニューロンＮＬ４に対応する値ＶＥ４をベクトルの要素（一の次元の値）として抽出してもよい。なお、上記は、一例であり、情報処理装置１００は、オートエンコーダに限らず、種々のモデルを用いて、対象情報からの特徴抽出を行ってもよい。例えば、スタートアップメンバ等の所定の目的に適した対象と適さない対象を識別するように学習させたモデルを生成して、その中間層をベクトルデータとして抽出しても良い。また、例えば、トリプレットロス（triplet loss）といった類似性を学習する方法によりモデルを生成しても良い。また、情報処理装置１００は、モデルを用いずに、特徴抽出を行ってもよい。例えば、情報処理装置１００は、情報処理装置１００の管理者等が設定して特徴（素性）に対応する情報を対象情報から抽出し、ベクトルを生成してもよい。例えば、情報処理装置１００は、対象（個人）の年齢、性別等の属性情報や顔や全体を撮像した画像や音声や行動等の特徴（素性）に対応する情報を対象情報から抽出し、ベクトルを生成してもよい。 Note that the information processing apparatus 100 may use information of not only the elements (neurons) of the compression layer RP but also other elements (neurons) in the intermediate layer CL for the vectors. For example, the information processing apparatus 100 may use the information of the encoding part neuron NL3, the decoding part neuron NL4, or the like for the vector. For example, when target information of a certain target is input, the information processing apparatus 100 uses the calculated value VE3 corresponding to the neuron NL3 and the calculated value VE4 corresponding to the neuron NL4 as vector elements (one-dimensional values). may be extracted. Note that the above is just an example, and the information processing apparatus 100 may perform feature extraction from target information using various models, not limited to autoencoders. For example, it is possible to generate a model that has been trained to distinguish targets such as start-up members that are suitable for a given purpose and targets that are not suitable, and extract the intermediate layer as vector data. Alternatively, a model may be generated by a similarity learning method such as triplet loss. Also, the information processing apparatus 100 may perform feature extraction without using a model. For example, the information processing apparatus 100 may extract information corresponding to features (features) set by an administrator or the like of the information processing apparatus 100 from the target information and generate a vector. For example, the information processing apparatus 100 extracts, from the target information, attribute information such as the age and gender of a target (individual), and information corresponding to features (features) such as images, voices, and behaviors of the face and the whole image, and extracts vector may be generated.

また、例えば、情報処理装置１００は、情報提供装置５０等の他の外部装置からモデルＭ１を取得してもよい。なお、情報処理装置１００は、対象情報記憶部１２１（図４参照）に記憶された各対象の対象情報ＴＤＴ１、ＴＤＴ２、ＴＤＴ４５１等を入力として、モデルＭ１を生成してもよい。例えば、情報処理装置１００は、各対象の対象情報ＴＤＴ１、ＴＤＴ２、ＴＤＴ４５１等を入力として、モデルＭ１を生成してもよい。例えば、対象情報ＴＤＴ１、ＴＤＴ２、ＴＤＴ４５１等は、対象（個人）の属性情報や画像や音声や行動等を含んでもよい。例えば、対象情報ＴＤＴ１、ＴＤＴ２、ＴＤＴ４５１等の対象情報は、対応する個人のデモグラフィック属性情報やサイコグラフィック属性情報等のユーザに関する種々の属性情報を含んでもよい。例えば、対象情報ＴＤＴ１、ＴＤＴ２、ＴＤＴ４５１等の対象情報は、対応する個人の年齢、性別、住所、興味関心等の情報を含んでもよい。また、例えば、対象情報は、対応する個人を撮像した画像情報を含んでもよい。例えば、対象情報は、対応する個人の顔や全体（全身）を撮像した画像情報を含んでもよい。例えば、対象情報は、対応する個人の発話等の音声情報を含んでもよい。例えば、対象情報は、対応する個人の行動に関する種々の行動情報を含んでもよい。例えば、対象情報は、対応する個人の検索やコンテンツ閲覧や購買等の行動情報を含んでもよい。なお、対象情報ＴＤＴ１、ＴＤＴ２、ＴＤＴ４５１等の対象情報は、対象（個人）の経歴等の業務に関する経歴情報（文字情報）のみであってもよい。 Further, for example, the information processing device 100 may acquire the model M1 from another external device such as the information providing device 50 . The information processing apparatus 100 may generate the model M1 by inputting the target information TDT1, TDT2, TDT451, etc. of each target stored in the target information storage unit 121 (see FIG. 4). For example, the information processing apparatus 100 may generate the model M1 by inputting the target information TDT1, TDT2, TDT451, etc. of each target. For example, the target information TDT1, TDT2, TDT451, etc. may include attribute information, images, sounds, actions, and the like of the target (individual). For example, target information such as target information TDT1, TDT2, and TDT451 may include various attribute information about the user, such as corresponding individual demographic attribute information and psychographic attribute information. For example, target information such as target information TDT1, TDT2, and TDT451 may include information such as the age, gender, address, and interests of the corresponding individual. Also, for example, the target information may include image information obtained by imaging the corresponding individual. For example, the target information may include image information obtained by imaging the corresponding individual's face or entire body (whole body). For example, the target information may include audio information such as the corresponding individual's speech. For example, the subject information may include various behavioral information regarding the behavior of the corresponding individual. For example, the target information may include action information such as search, content viewing, and purchase of the corresponding individual. Note that the target information such as the target information TDT1, TDT2, and TDT451 may be only career information (character information) related to work such as the career of the target (individual).

情報処理装置１００は、対象の対象情報が入力される入力層と、出力層と、入力層から出力層までのいずれかの層であって出力層以外の層に属する第１要素と、第１要素と第１要素の重みとに基づいて値が算出される第２要素と、を含み、入力層に入力された対象情報に対し、出力層以外の各層に属する各要素を第１要素として、第１要素と第１要素の重みとに基づく演算を行うことにより、入力層に入力される情報と同様の情報を出力層から出力するモデルＭ１を生成してもよい。なお、情報処理装置１００は、ベクトル生成に用いるモデルを情報提供装置５０等の他の外部装置から取得してもよい。 The information processing apparatus 100 includes an input layer to which target information of a target is input, an output layer, a first element belonging to any layer from the input layer to the output layer other than the output layer, and a first layer. and a second element whose value is calculated based on the weight of the element and the first element. By performing an operation based on the first element and the weight of the first element, the model M1 may be generated that outputs from the output layer the same information as the information input to the input layer. Note that the information processing apparatus 100 may acquire a model used for vector generation from another external apparatus such as the information providing apparatus 50 or the like.

〔１－３．処理例〕
ここから、情報処理装置１００は、ユーザから一の対象（個人）の対象情報を取得し、一の対象に類似する対象（個人）に関する情報提供を行う場合を一例として説明する。図１の例では、端末装置１０は、所定の管理者であるユーザＵ１により利用される。例えば、ユーザＵ１は、企業における組織編制等の業務行うユーザであってもよい。図１の例では、ユーザＵ１が対象である一の個人（以下、「個人Ｘ」とする）がどの個人（人物）に類似するかの情報を所望する場合を示す。 [1-3. Processing example]
From here, a case where the information processing apparatus 100 acquires target information of one target (individual) from a user and provides information on a target (individual) similar to the one target will be described as an example. In the example of FIG. 1, the terminal device 10 is used by a user U1 who is a predetermined administrator. For example, the user U1 may be a user who conducts business such as organizational organization in a company. The example of FIG. 1 shows a case where user U1 desires information about which individual (person) a target individual (hereinafter referred to as “individual X”) is similar to.

まず、個人Ｘの対象情報を取得した端末装置１０は、個人Ｘの対象情報ＴＤＴ１１を情報処理装置１００へ送信する。図１の例では、企業におけるプロジェクトのスタートアップのチーム編成（組織編制）を行うユーザＵ１が、個人Ｘに類似する個人をチームのメンバとして加えることを希望するものとする。図１の例では、個人Ｘは、他の企業においてプロジェクトで重要は貢献を行った個人であるものとする。なお、個人Ｘは、ユーザＵ１が属する企業の個人であってもよい。例えば、個人ＸはユーザＵ１であってもよい。なお、個人ＸがユーザＵ１である場合、ユーザＵ１は、ユーザＵ１自身の個人情報（対象情報）を情報処理装置１００へ送信する。例えば、ユーザＵ１は、個人Ｘの対象情報ＴＤＴ１１を収集した後、個人Ｘの対象情報ＴＤＴ１１を情報処理装置１００へ送信するように端末装置１０を操作する。例えば、端末装置１０は、個人Ｘの個人Ｘの対象情報ＴＤＴ１１を所定の情報処理装置から取得し、取得した対象情報ＴＤＴ１１を情報処理装置１００へ送信する。図１の例では、対象情報ＴＤＴ１１等は、対象（個人）の年齢、性別等の属性情報や顔や全体を撮像した画像や音声や行動等に関する情報を含む。なお、対象情報ＴＤＴ１１等は、対象（個人）の経歴等の業務に関する経歴情報（文字情報）のみであってもよい。 First, the terminal device 10 that has acquired the target information of the individual X transmits the target information TDT11 of the individual X to the information processing device 100 . In the example of FIG. 1, a user U1 who organizes a team (organizational organization) for a project startup in a company wishes to add an individual similar to the individual X as a member of the team. In the example of FIG. 1, individual X is assumed to be an individual who has made significant contributions to projects in other companies. In addition, the individual X may be an individual of the company to which the user U1 belongs. For example, person X may be user U1. Note that when the individual X is the user U1, the user U1 transmits personal information (target information) of the user U1 to the information processing apparatus 100 . For example, after the user U1 collects the target information TDT11 of the person X, the user U1 operates the terminal device 10 so as to transmit the target information TDT11 of the person X to the information processing apparatus 100 . For example, the terminal device 10 acquires the target information TDT11 of the individual X of the individual X from a predetermined information processing device, and transmits the acquired target information TDT11 to the information processing device 100 . In the example of FIG. 1, the target information TDT11 and the like include attribute information such as the age and sex of the target (individual), and information related to images, voices, actions, and the like obtained by imaging the face and the whole. Note that the target information TDT11 and the like may be only career information (character information) related to work such as the career of the target (individual).

そして、情報処理装置１００は、一の対象に関する対象情報（個人情報）を取得する（ステップＳ１１）。図１の例では、情報処理装置１００は、端末装置１０から個人Ｘに関する対象情報ＴＤＴ１１を取得する。 Then, the information processing apparatus 100 acquires target information (personal information) regarding one target (step S11). In the example of FIG. 1 , the information processing device 100 acquires target information TDT11 regarding the person X from the terminal device 10 .

そして、情報処理装置１００は、一の対象に対応する対象情報からグラフ情報の探索に用いるベクトルを生成する。図１の例では、情報処理装置１００は、処理群ＰＳ１１に示すような処理により、個人Ｘに対応するベクトルを生成する。情報処理装置１００は、個人Ｘに関する対象情報ＴＤＴ１１をモデルＭ１に入力する（ステップＳ１２）。具体的には、情報処理装置１００は、個人Ｘの対象情報ＴＤＴ１１をモデルＭ１に入力する。そして、情報処理装置１００は、対象情報ＴＤＴ１１の入力後のモデルＭ１中の情報を用いて、ベクトルを生成する（ステップＳ１３）。例えば、情報処理装置１００は、対象情報ＴＤＴ１１が入力されたモデルＭ１中の各要素を用いて、ベクトルデータを生成する。 Then, the information processing apparatus 100 generates a vector used for searching graph information from target information corresponding to one target. In the example of FIG. 1, the information processing apparatus 100 generates a vector corresponding to the individual X by performing the processing shown in the processing group PS11. The information processing device 100 inputs the target information TDT11 regarding the individual X to the model M1 (step S12). Specifically, the information processing apparatus 100 inputs the target information TDT11 of the individual X to the model M1. Then, the information processing apparatus 100 generates a vector using information in the model M1 after input of the target information TDT11 (step S13). For example, the information processing apparatus 100 generates vector data using each element in the model M1 to which the target information TDT11 is input.

図１の例では、情報処理装置１００は、対象情報ＴＤＴ１１が入力されたモデルＭ１中の各要素の値を用いて、ベクトルデータＶＤ１１（単に「ベクトルＶＤ１１」ともいう）を生成する。例えば、情報処理装置１００は、個人Ｘの対象情報ＴＤＴ１１が入力された場合における、モデルＭ１のニューロンＮＬ１に対応する値ＶＥ１（図１０参照）やニューロンＮＬ２に対応する値ＶＥ２（図１０参照）を用いて、ベクトルを生成する。例えば、情報処理装置１００は、個人Ｘの対象情報ＴＤＴ１１が入力された場合に、算出されるニューロンＮＬ１に対応する値ＶＥ１をベクトルＶＤ１１の１次元目の要素として抽出してもよい。また、例えば、情報処理装置１００は、ある対象の対象情報が入力された場合に、算出されるニューロンＮＬ２に対応する値ＶＥ２をベクトルＶＤ１１の２次元目の要素として、ベクトルＶＤ１１を生成する。図１の例では、情報処理装置１００は、１次元目の要素が「３５」であり、２次元目の要素が「６３」であるようなベクトルＶＤ１１を生成する。 In the example of FIG. 1, the information processing apparatus 100 generates vector data VD11 (also simply referred to as "vector VD11") using the values of the elements in the model M1 to which the target information TDT11 is input. For example, the information processing apparatus 100 calculates a value VE1 (see FIG. 10) corresponding to the neuron NL1 and a value VE2 (see FIG. 10) corresponding to the neuron NL2 of the model M1 when the target information TDT11 of the person X is input. to generate a vector. For example, the information processing apparatus 100 may extract the calculated value VE1 corresponding to the neuron NL1 as the first-dimensional element of the vector VD11 when the target information TDT11 of the individual X is input. Further, for example, when target information of a certain target is input, the information processing apparatus 100 generates a vector VD11 by using the calculated value VE2 corresponding to the neuron NL2 as the second-dimensional element of the vector VD11. In the example of FIG. 1, the information processing apparatus 100 generates a vector VD11 whose first-dimensional element is "35" and whose second-dimensional element is "63."

そして、情報処理装置１００は、個人Ｘに類似する対象（類似対象）を検索する（ステップＳ１４）。例えば、情報処理装置１００は、非特許文献１に開示されるような近傍検索の技術等の種々の従来技術を適宜用いて、個人Ｘの類似対象を検索してもよい。例えば、情報処理装置１００は、類似性が高いノード間がエッジで連結されたグラフ情報ＧＲ１１を用いる。 Then, the information processing device 100 searches for a target (similar target) similar to the individual X (step S14). For example, the information processing apparatus 100 may search for a similar target of the person X by appropriately using various conventional techniques such as the neighborhood search technique disclosed in Non-Patent Document 1. For example, the information processing apparatus 100 uses graph information GR11 in which nodes with high similarity are connected by edges.

図１の例では、情報処理装置１００は、情報群ＩＮＦ１１に示すように、グラフ情報ＧＲ１１やインデックス情報ＩＮＤ１１を用いて個人Ｘの類似対象を検索する。例えば、情報処理装置１００は、グラフ情報記憶部１２３（図６参照）から対象（個人）に関するグラフ情報ＧＲ１１を取得する。また、例えば、情報処理装置１００は、インデックス情報記憶部１２２（図５参照）から、グラフ情報ＧＲ１１における検索の起点となるノード（以下、「起点ベクトル」ともいう）の決定に用いるインデックス情報ＩＮＤ１１を取得する。なお、インデックス情報ＩＮＤ１１は、情報処理装置１００が生成してもよいし、情報処理装置１００は、インデックス情報ＩＮＤ１１を情報提供装置５０等の他の外部装置から取得してもよい。 In the example of FIG. 1, the information processing apparatus 100 searches for a similar target of the person X using the graph information GR11 and the index information IND11, as shown in the information group INF11. For example, the information processing apparatus 100 acquires graph information GR11 regarding an object (individual) from the graph information storage unit 123 (see FIG. 6). Further, for example, the information processing apparatus 100 stores index information IND11 used for determining a node (hereinafter also referred to as "origin vector") serving as a starting point for searching in the graph information GR11 from the index information storage unit 122 (see FIG. 5). get. Note that the index information IND11 may be generated by the information processing device 100, or the information processing device 100 may acquire the index information IND11 from another external device such as the information providing device 50 or the like.

そして、情報処理装置１００は、一の対象（クエリ）に対応する起点ベクトルを決定（特定）するために、インデックス情報ＩＮＤ１１を用いる。図１の例では、情報処理装置１００は、個人ＸのベクトルＶＤ１１に対応する起点ベクトルを決定（特定）するために、インデックス情報ＩＮＤ１１を用いる。すなわち、情報処理装置１００は、ベクトルＶＤ１１とインデックス情報ＩＮＤ１１とを用いて、グラフ情報ＧＲ１１における起点ベクトルを決定する。 Then, the information processing apparatus 100 uses the index information IND11 to determine (specify) the origin vector corresponding to one target (query). In the example of FIG. 1, the information processing apparatus 100 uses the index information IND11 to determine (identify) the origin vector corresponding to the person X's vector VD11. That is, the information processing apparatus 100 uses the vector VD11 and the index information IND11 to determine the origin vector in the graph information GR11.

図１中のインデックス情報ＩＮＤ１１は、図５中のインデックス情報記憶部１２２に示す階層構造を有する。例えば、インデックス情報ＩＮＤ１１は、ルートＲＴの直下に位置する第１階層のノード（ベクトル）が、節点ＶＴ１、ＶＴ２等であることを示す。また、例えば、インデックス情報ＩＮＤ１１は、節点ＶＴ２の直下の第２階層のノードが、節点ＶＴ２－１～ＶＴ２－４（図示せず）であることを示す。また、例えば、インデックス情報ＩＮＤ１１は、節点ＶＴ２－２の直下の第３階層のノードが、ノードＮ３５、ノードＮ４５１、ノードＮ６９３、すなわちグラフ情報ＧＲ１１中のノード（ベクトル）であることを示す。 The index information IND11 in FIG. 1 has a hierarchical structure shown in the index information storage section 122 in FIG. For example, the index information IND11 indicates that the nodes (vectors) in the first layer located directly below the root RT are the nodes VT1, VT2, and the like. Also, for example, the index information IND11 indicates that the nodes of the second layer immediately below the node VT2 are the nodes VT2-1 to VT2-4 (not shown). Also, for example, the index information IND11 indicates that the nodes of the third layer immediately below the node VT2-2 are the node N35, the node N451, and the node N693, that is, nodes (vectors) in the graph information GR11.

例えば、情報処理装置１００は、図１中のインデックス情報ＩＮＤ１１に示すような木構造型のインデックス情報を用いて、グラフ情報ＧＲ１１における起点ベクトルを決定する（ステップＳ１５）。図１の例では、情報処理装置１００は、ベクトルＶＤ１１を生成した後、インデックス情報ＩＮＤ１１を上から下へ辿ることにより、インデックス情報ＩＮＤ１１の近傍候補となる起点ベクトルを特定することにより、効率的に検索クエリ（一の対象）に対応する起点ベクトルを決定することができる。 For example, the information processing apparatus 100 determines the origin vector in the graph information GR11 using tree-structured index information as indicated by the index information IND11 in FIG. 1 (step S15). In the example of FIG. 1, after generating the vector VD11, the information processing apparatus 100 traces the index information IND11 from top to bottom to identify the origin vector that is a candidate for the vicinity of the index information IND11. An origin vector corresponding to a search query (one subject) can be determined.

例えば、情報処理装置１００は、インデックス情報ＩＮＤ１１をルートＲＴからリーフノード（グラフ情報ＧＲ１１中のノード（ベクトル））まで辿ることにより、ベクトルＶＤ１１に対応する起点ベクトルを決定してもよい。図１の例では、例えば、情報処理装置１００は、インデックス情報ＩＮＤ１１をルートＲＴからノードＮ４５１まで辿ることにより、ノードＮ４５１を起点ベクトルとして決定する。例えば、情報処理装置１００は、木構造に関する種々の従来技術を適宜用いて、インデックス情報ＩＮＤ１１をルートＲＴからリーフノードまで辿ることにより、辿りついたリーフノードを起点ベクトルとして決定してもよい。例えば、情報処理装置１００は、ベクトルＶＤ１１との類似度に基づいて、インデックス情報ＩＮＤ１１を下へ辿ることにより、起点ベクトルを決定してもよい。例えば、情報処理装置１００は、ルートＲＴから節点ＶＴ１、ＶＴ２等のいずれの節点に辿るかを、ベクトルＶＤ１１と節点ＶＴ１、ＶＴ２との類似度に基づいて決定してもよい。例えば、情報処理装置１００は、ルートＲＴから節点ＶＴ１、ＶＴ２等のうち、ベクトルＶＤ１１との類似度が最も高い節点ＶＴ２へ辿ると決定してもよい。また、例えば、情報処理装置１００は、節点ＶＴ２から節点ＶＴ２－１～ＶＴ２－４等のうち、ベクトルＶＤ１１との類似度が最も高い節点ＶＴ２－２へ辿ると決定してもよい。また、例えば、情報処理装置１００は、節点ＶＴ２－２からノードＮ３５、Ｎ４５１、Ｎ６９３等のうち、ベクトルＶＤ１１との類似度が最も高い節点ノードＮ４５１へ辿ると決定してもよい。なお、図１の例では、説明を簡単にするために、起点ベクトルを１つ決定する場合を示すが、情報処理装置１００は、複数個の起点ベクトルを決定してもよい。例えば、情報処理装置１００は、ノードＮ４５１、Ｎ３５、Ｎ６９３、Ｎ２等の複数個のベクトル（ノード）を起点ベクトルとして決定してもよい。なお、インデックス情報ＩＮＤ１１を用いずに、検索開始時にグラフ情報ＧＲ１１からランダムに１つ以上のノードを選択し、それを起点ベクトルとしてもよいし、または、予め指定された１つ以上のノードを起点ベクトルとしても良い。 For example, the information processing apparatus 100 may determine the origin vector corresponding to the vector VD11 by tracing the index information IND11 from the root RT to leaf nodes (nodes (vectors) in the graph information GR11). In the example of FIG. 1, for example, the information processing apparatus 100 determines the node N451 as the origin vector by tracing the index information IND11 from the root RT to the node N451. For example, the information processing apparatus 100 may determine the reached leaf node as the origin vector by tracing the index information IND11 from the root RT to the leaf node using various conventional techniques related to the tree structure. For example, the information processing apparatus 100 may determine the origin vector by tracing down the index information IND11 based on the degree of similarity with the vector VD11. For example, the information processing apparatus 100 may determine which of the nodes VT1, VT2, etc. is followed from the root RT based on the degree of similarity between the vector VD11 and the nodes VT1, VT2. For example, the information processing apparatus 100 may determine to trace from the root RT to the node VT2, which has the highest degree of similarity with the vector VD11, among the nodes VT1, VT2, and the like. Further, for example, the information processing apparatus 100 may determine to trace from the node VT2 to the node VT2-2 having the highest degree of similarity with the vector VD11 among the nodes VT2-1 to VT2-4. Further, for example, the information processing apparatus 100 may determine to trace from the node VT2-2 to the node N451, which has the highest degree of similarity with the vector VD11 among the nodes N35, N451, N693, and the like. Note that the example of FIG. 1 shows a case where one origin vector is determined for the sake of simplicity of explanation, but the information processing apparatus 100 may determine a plurality of origin vectors. For example, the information processing apparatus 100 may determine a plurality of vectors (nodes) such as nodes N451, N35, N693, and N2 as origin vectors. Instead of using the index information IND11, one or more nodes may be randomly selected from the graph information GR11 at the start of the search and used as the starting vector, or one or more nodes specified in advance may be used as the starting point. It can also be a vector.

そして、情報処理装置１００は、グラフ情報ＧＲ１１を検索することにより、個人Ｘの類似対象を抽出する（ステップＳ１６）。例えば、情報処理装置１００は、ノードＮ４５１の近傍に位置するノードを類似対象として抽出する。例えば、情報処理装置１００は、ノードＮ４５１からの距離が近いノードを類似対象として抽出する。例えば、情報処理装置１００は、ノードＮ４５１を起点として、エッジを辿ることにより、ノードＮ４５１から到達可能なノードを類似対象として抽出する。例えば、情報処理装置１００は、所定数（例えば、２個や１０個等）のノードを類似対象として抽出する。例えば、情報処理装置１００は、図１１に示すような検索処理により、個人Ｘの類似対象を抽出してもよいが、詳細は後述する。図１の例では、情報処理装置１００は、ノードＮ４５１を起点として、グラフ情報ＧＲ１１を探索することにより、ノードＮ４５１やノードＮ３５を類似対象として抽出する。 Then, the information processing apparatus 100 extracts a similar target of the individual X by searching the graph information GR11 (step S16). For example, the information processing apparatus 100 extracts nodes located near the node N451 as similar targets. For example, the information processing apparatus 100 extracts a node that is close to the node N451 as a similar target. For example, the information processing apparatus 100 extracts nodes reachable from the node N451 as similar objects by tracing edges starting from the node N451. For example, the information processing apparatus 100 extracts a predetermined number (for example, 2 or 10) of nodes as similar targets. For example, the information processing apparatus 100 may extract a similar target of the person X through a search process as shown in FIG. 11, which will be described later in detail. In the example of FIG. 1, the information processing apparatus 100 extracts the node N451 and the node N35 as similar objects by searching the graph information GR11 with the node N451 as a starting point.

そして、情報処理装置１００は、抽出した類似対象に関する情報を提供する（ステップＳ１７）。図１の例では、情報処理装置１００は、ノードＮ４５１に対応する個人＃４５１や、ノードＮ３５に対応する個人＃３５を個人Ｘに類似する対象（類似個人）としてユーザＵ１が利用する端末装置１０に提供する。 Then, the information processing apparatus 100 provides information on the extracted similar target (step S17). In the example of FIG. 1, the information processing apparatus 100 treats the individual #451 corresponding to the node N451 and the individual #35 corresponding to the node N35 as targets (similar individuals) similar to the individual X. provide to

上述したように、情報処理装置１００は、グラフ情報ＧＲ１１やインデックス情報ＩＮＤ１１を用いて、ユーザＵ１から取得した個人Ｘの類似対象を抽出する。例えば、情報処理装置１００は、インデックス情報ＩＮＤ１１を用いて、個人Ｘの類似対象を抽出する際のグラフ情報ＧＲ１１における起点ベクトルを決定する。そして、情報処理装置１００は、決定した起点ベクトルを起点としてグラフ情報ＧＲ１１を探索することにより、個人Ｘの類似対象を抽出する。これにより、情報処理装置１００は、所望の対象と所定の対応関係を有する対象を適切に抽出することができる。 As described above, the information processing apparatus 100 uses the graph information GR11 and the index information IND11 to extract similar objects of the individual X acquired from the user U1. For example, the information processing apparatus 100 uses the index information IND11 to determine the origin vector in the graph information GR11 when extracting the similar target of the person X. FIG. Then, the information processing apparatus 100 extracts a similar target of the individual X by searching the graph information GR11 with the determined starting vector as a starting point. Accordingly, the information processing apparatus 100 can appropriately extract targets having a predetermined correspondence relationship with a desired target.

例えば、情報処理装置１００は、個人Ｘと類似する対象（類似個人）が、例えば個人Ｘと一見関係が無いように見える場合であっても、グラフ情報を用いることで、個人Ｘの類似個人として抽出でき、所望の対象と所定の対応関係を有する対象を適切に抽出することができる。例えば、ディープラーニングといった手法で、事業やスタートアップメンバ等の所定の目的に適した対象を識別するという方法もあるが、所定の目的に適するかに影響する多種多様な要素が存在する場合においては、各要素の事例が少ないので、学習ができずに識別精度を向上させることが難しいという問題がある。例えば、ディープラーニングといった学習手法で生成したモデルを用いて、そのモデルに対象の情報を入力し、出力されたスコアからその対象が所定の目的に適するかを識別するという方法もあるが、多種多様な要素が存在する場合においては、各要素の事例が少ないため、モデルの精度を向上させることが難しく、識別精度を向上させることが難しい。一方で、情報処理装置１００は、各要素から抽出した特徴を基に生成した複数のベクトルのグラフ構造化したグラフ情報を用いて検索を行うことにより、ある対象に類似する類似対象を適切に抽出することができる。そのため、情報処理装置１００は、ある対象がどのような目的に適した対象に類似するかを精度よく識別することができる。 For example, even if an object (similar individual) similar to individual X does not seem to be related to individual X at first glance, the information processing apparatus 100 uses the graph information to identify it as a similar individual to individual X. It is possible to appropriately extract an object that can be extracted and that has a predetermined correspondence with a desired object. For example, there is a method of identifying targets suitable for a given purpose, such as a business or startup member, using a method such as deep learning, but if there are a wide variety of factors that affect whether it is suitable for a given purpose, Since there are few examples of each element, there is a problem that learning cannot be performed and it is difficult to improve the identification accuracy. For example, there is a method of using a model generated by a learning method such as deep learning, inputting target information into the model, and identifying whether the target is suitable for a given purpose from the output score. When such elements exist, it is difficult to improve the accuracy of the model because there are few examples of each element, and it is difficult to improve the accuracy of identification. On the other hand, the information processing apparatus 100 appropriately extracts similar objects similar to a certain object by performing a search using graph information in which a plurality of vectors are graph-structured and generated based on features extracted from each element. can do. Therefore, the information processing apparatus 100 can accurately identify for what purpose a certain object is similar to a suitable object.

〔１－４．対象について〕
図１の例では、人を要素とする対象の一例として、個人（一人の人）である場合を示したが、人を要素とする対象は、個人に限らず、各対象が複数人を要素として含む対象であってもよい。例えば、情報処理装置１００は、複数人を要素とする含むグループ（ユーザ群）を対象として、類似対象を抽出してもよい。この場合、情報処理装置１００は、一のグループを対象として、類似するグループ（類似グループ）を抽出してもよい。例えば、情報処理装置１００は、一の組織を対象として、類似する組織（類似組織）を抽出してもよい。例えば、情報処理装置１００は、一の企業を対象として、類似する企業（類似企業）を抽出してもよい。 [1-4. About the target]
In the example of FIG. 1, an individual (single person) is shown as an example of an object whose element is a person. It may be a target to be included as For example, the information processing apparatus 100 may extract a similar target from a group (user group) including a plurality of persons as a target. In this case, the information processing apparatus 100 may extract similar groups (similar groups) from one group. For example, the information processing apparatus 100 may extract similar tissues (similar tissues) from one tissue. For example, the information processing apparatus 100 may extract similar companies (similar companies) for one company.

上記のように対象がグループの場合、例えば、対象情報は、対象（グループ）に含まれる各個人の年齢、性別等の属性情報や顔や全体を撮像した画像や音声や行動等を含んでもよい。例えば、対象情報は、対応する対象（グループ）に含まれる各個人のデモグラフィック属性情報やサイコグラフィック属性情報等のユーザに関する種々の属性情報を含んでもよい。例えば、対象情報は、対応する対象（グループ）に含まれる各個人の年齢、性別、住所、興味関心等の情報を含んでもよい。また、例えば、対象情報は、対応する対象（グループ）に含まれる各個人を撮像した画像情報を含んでもよい。例えば、対象情報は、対応する個人の顔や全体（全身）を撮像した画像情報を含んでもよい。例えば、対象情報は、対応する対象（グループ）に含まれる各個人の発話等の音声情報を含んでもよい。例えば、対象情報は、対応する対象（グループ）に含まれる各個人の行動に関する種々の行動情報を含んでもよい。例えば、対象情報は、対応する対象（グループ）に含まれる各個人の検索やコンテンツ閲覧や購買等の行動情報を含んでもよい。なお、対象情報は、対象（グループ）に含まれる各個人の経歴等の業務に関する経歴情報（文字情報）のみであってもよい。 When the target is a group as described above, for example, the target information may include attribute information such as age and gender of each individual included in the target (group), and images, voices, and behaviors of the face and the whole image. . For example, the target information may include various user attribute information such as demographic attribute information and psychographic attribute information for each individual included in the corresponding target (group). For example, the target information may include information such as age, gender, address, and interests of each individual included in the corresponding target (group). Further, for example, the target information may include image information obtained by imaging each individual included in the corresponding target (group). For example, the target information may include image information obtained by imaging the corresponding individual's face or entire body (whole body). For example, the target information may include audio information such as utterances of each individual included in the corresponding target (group). For example, the target information may include various behavioral information regarding the behavior of each individual included in the corresponding target (group). For example, the target information may include action information such as search, content viewing, and purchase of each individual included in the corresponding target (group). Note that the target information may be only career information (character information) related to work such as the career of each individual included in the target (group).

例えば、情報処理装置１００は、グラフ情報ＧＲ１１と同様に、グループを対象とするモデルＭ２を用いて生成されたベクトルをグラフ構造化したグラフ情報を用いて、類似対象を抽出してもよい。例えば、情報処理装置１００は、各グループをノードが類似性に応じてエッジにより連結されたグラフ情報を用いて、類似対象を抽出してもよい。例えば、情報処理装置１００は、図１２に示すようなグラフ情報ＧＲ２１を用いて、グループを対象として、類似情報を抽出してもよい。図１２は、グループのグラフ情報を概念的に示す図である。 For example, similar to the graph information GR11, the information processing apparatus 100 may extract similar targets using graph information obtained by structuring vectors generated using the model M2 targeting groups. For example, the information processing apparatus 100 may extract similar targets using graph information in which nodes of each group are connected by edges according to similarity. For example, the information processing apparatus 100 may use graph information GR21 as shown in FIG. 12 to extract similarity information for groups. FIG. 12 is a diagram conceptually showing group graph information.

例えば、情報処理装置１００は、複数人を要素として含む一のグループ（グループＹ）を対象として、グループＹに関する対象情報を取得した場合、そのグループＹに類似するグループ(類似対象)の情報をグラフ情報ＧＲ２１から抽出してもよい。例えば、情報処理装置１００は、グループＹに関する対象情報（対象情報ＹＤＴ）を取得する。例えば、情報処理装置１００は、複数人を要素として含むグループを対象とするグラフ情報ＧＲ２１をグラフ情報記憶部１２３から取得する。例えば、情報処理装置１００は、複数人を要素として含むグループの各々に対応する複数のノードが、複数のグループの類似性に応じて連結されたグラフ情報ＧＲ２１を取得する。そして、情報処理装置１００は、モデル情報記憶部１２４に記憶されたモデルＭ２を用いて、グループＹの対象情報ＹＤＴからベクトルＹＶＤを生成する。 For example, when the information processing apparatus 100 acquires target information about a group (group Y) that includes a plurality of people as elements, the information processing apparatus 100 displays information on a group (similar target) similar to the group Y as a graph. It may be extracted from the information GR21. For example, the information processing apparatus 100 acquires target information about group Y (target information YDT). For example, the information processing apparatus 100 acquires from the graph information storage unit 123 graph information GR21 for a group including multiple persons as elements. For example, the information processing apparatus 100 acquires graph information GR21 in which a plurality of nodes corresponding to each group including a plurality of persons as elements are connected according to the similarity of the plurality of groups. Then, the information processing apparatus 100 generates a vector YVD from the target information YDT of the group Y using the model M2 stored in the model information storage unit 124 .

そして、情報処理装置１００は、生成したベクトルＹＶＤとインデックス情報を用いて起点ベクトルをノードＮＧＰ５（図１２中のグループＧＰ５に対応）に決定する。そして、情報処理装置１００は、起点ベクトルであるノードＮＧＰ５を起点としてグラフ情報ＧＲ２１を検索することにより、グループＹの類似対象（類似グループ）を抽出する。例えば、情報処理装置１００は、ノードＮＧＰ５を起点として、グラフ情報ＧＲ２１を探索することにより、所定数（例えば２個など）のノードを類似対象として抽出する。図１２の例では、情報処理装置１００は、類似対象範囲ＳＡＲ１１に含まれるグループＧＰ１（ノードＮＧＰ１）やグループＧＰ５（ノードＮＧＰ５）を類似対象として抽出する。 Then, the information processing apparatus 100 determines the origin vector to be the node NGP5 (corresponding to the group GP5 in FIG. 12) using the generated vector YVD and the index information. Then, the information processing apparatus 100 extracts similar objects (similar groups) of the group Y by searching the graph information GR21 with the node NGP5, which is the starting vector, as the starting point. For example, the information processing apparatus 100 extracts a predetermined number (for example, two) of nodes as similar objects by searching the graph information GR21 starting from the node NGP5. In the example of FIG. 12, the information processing apparatus 100 extracts the group GP1 (node NGP1) and the group GP5 (node NGP5) included in the similar target range SAR11 as similar targets.

そして、情報処理装置１００は、抽出した類似対象に関する情報を提供する。例えば、情報処理装置１００は、類似対象として抽出されたノードに対応するグループＧＰ１やグループＧＰ５等をグループＹに類似するグループとしてユーザＵ１が利用する端末装置１０に提供する。 Then, the information processing apparatus 100 provides information about the extracted similar objects. For example, the information processing apparatus 100 provides the terminal device 10 used by the user U1 as groups similar to the group Y, such as the group GP1 and the group GP5 corresponding to the nodes extracted as similar targets.

〔１－５．インデックス情報〕
図１の例に示すインデックス情報（インデックスデータ）は一例であり、情報処理装置１００は、種々のインデックス情報を用いて、グラフ情報を検索してもよい。また、例えば、情報処理装置１００は、検索時に用いるインデックスデータを生成してもよい。例えば、情報処理装置１００は、高次元ベクトルを検索する検索インデックスをインデックスデータとして生成する。ここでいう高次元ベクトルとは、例えば、数百次元から数千次元のベクトルであってもよいし、それ以上の次元のベクトルであってもよい。 [1-5. Index information]
The index information (index data) shown in the example of FIG. 1 is an example, and the information processing apparatus 100 may search graph information using various types of index information. Further, for example, the information processing apparatus 100 may generate index data used for searching. For example, the information processing apparatus 100 generates a search index for searching high-dimensional vectors as index data. The high-dimensional vector referred to here may be, for example, a vector with several hundred to several thousand dimensions, or a vector with more dimensions.

例えば、情報処理装置１００は、図１に示すようなツリー構造（木構造）に関する検索インデックスをインデックスデータとして生成してもよい。例えば、情報処理装置１００は、ｋｄ木（k-dimensional tree）に関する検索インデックスをインデックスデータとして生成してもよい。例えば、情報処理装置１００は、ＶＰ木（Vantage-Point tree）に関する検索インデックスをインデックスデータとして生成してもよい。 For example, the information processing apparatus 100 may generate a search index related to a tree structure (tree structure) as shown in FIG. 1 as index data. For example, the information processing apparatus 100 may generate a search index for a kd-tree (k-dimensional tree) as index data. For example, the information processing apparatus 100 may generate a search index for a VP tree (Vantage-Point tree) as index data.

また、例えば、情報処理装置１００は、その他の木構造を有するインデックスデータとして生成してもよい。例えば、情報処理装置１００は、木構造のインデックスデータのリーフがグラフデータに接続する種々のインデックスデータを生成してもよい。例えば、情報処理装置１００は、木構造のインデックスデータのリーフがグラフデータ中のノードに対応する種々のインデックスデータを生成してもよい。また、情報処理装置１００は、このようなインデックスデータを用いて検索を行う場合、インデックスデータを辿って到達したリーフ（ノード）からグラフデータを探索してもよい。 Further, for example, the information processing apparatus 100 may generate index data having another tree structure. For example, the information processing apparatus 100 may generate various types of index data in which leaves of tree-structured index data are connected to graph data. For example, the information processing apparatus 100 may generate various types of index data in which leaves of the tree-structured index data correspond to nodes in the graph data. When performing a search using such index data, the information processing apparatus 100 may search for graph data from a leaf (node) reached by tracing the index data.

なお、上述したようなインデックスデータは一例であり、情報処理装置１００は、グラフデータ中のクエリを高速に特定することが可能であれば、どのようなデータ構造のインデックスデータを生成してもよい。例えば、情報処理装置１００は、クエリに対応するグラフ情報中のノードを高速に特定することが可能であれば、バイナリ空間分割に関する技術等の種々の従来技術を適宜用いて、インデックスデータを生成してもよい。例えば、情報処理装置１００は、高次元ベクトルの検索に対応可能なインデックスであれば、どのようなデータ構造のインデックスデータを生成してもよい。例えば、情報処理装置１００は、非特許文献１に記載されるようなグラフ型の検索インデックスに関する情報をインデックス情報として用いてもよい。情報処理装置１００は、上述のようなインデックスデータとグラフデータとを用いることにより、所定の対象に関するより効率的な検索を可能にすることができる。 Note that the index data as described above is merely an example, and the information processing apparatus 100 may generate index data having any data structure as long as the query in the graph data can be identified at high speed. . For example, the information processing apparatus 100 generates index data by appropriately using various conventional techniques such as a technique related to binary space division, as long as it is possible to identify a node in graph information corresponding to a query at high speed. may For example, the information processing apparatus 100 may generate index data having any data structure as long as the index is compatible with high-dimensional vector searches. For example, the information processing apparatus 100 may use information about a graph-type search index as described in Non-Patent Document 1 as index information. The information processing apparatus 100 can use the index data and the graph data as described above to enable a more efficient search for a predetermined target.

〔１－６．用途について〕
なお、情報処理装置１００は、類似対象の抽出を種々の目的について行ってもよい。例えば、図１のように、チーム編成を行う場合、ハイパフォーマンス（高成績）を出した他のチームで重要な貢献をした個人に類似する類似対象を抽出してもよい。そして、情報処理装置１００は、目的に応じて抽出した類似対象の情報を用いて、目的に対応するサービスを提供してもよい。例えば、情報処理装置１００は、人材を外部からスカウトする、いわゆるヘッドハンティングを目的として、類似対象の抽出を行ってもよい。例えば、類似対象の抽出の目的が、お見合いや婚活等のマッチングを目的として、類似対象の抽出を行ってもよい。 [1-6. Usage]
Note that the information processing apparatus 100 may extract similar objects for various purposes. For example, as shown in FIG. 1, when forming a team, similar objects that are similar to individuals who have made important contributions to other teams that have achieved high performance (high results) may be extracted. Then, the information processing apparatus 100 may use the similar target information extracted according to the purpose to provide a service corresponding to the purpose. For example, the information processing apparatus 100 may extract similar targets for the purpose of so-called headhunting, in which personnel are scouted from the outside. For example, similar objects may be extracted for the purpose of matching such as matchmaking or matchmaking.

また、情報処理装置１００は、抽出した情報に基づいて、クエリとなる一の対象についての判定を行ってもよい。例えば、情報処理装置１００は、一のチーム（グループ）を対象として、抽出された類似グループにハイパフォーマンスを出したとチーム（グループ）が含まれる場合、一のチームが成功する可能性が高いチームであると判定してもよい。例えば、情報処理装置１００は、一の企業（グループ）を対象として、抽出された類似グループに急成長した企業（グループ）が含まれる場合、一の企業が成功する可能性が高い企業であると判定してもよい。このような情報を用いて、情報処理装置１００は、スタートアップメンバの選定や、投資判断を行ってもよい。 Further, the information processing apparatus 100 may make a determination about one target to be a query based on the extracted information. For example, the information processing apparatus 100 selects one team (group) as a target, and if the extracted similar groups include a team (group) that exhibits high performance, the one team is a team that is highly likely to succeed. It may be determined that there is For example, the information processing apparatus 100 targets one company (group), and if the extracted similar group includes a rapidly growing company (group), the information processing apparatus 100 determines that the one company is a company with a high probability of success. You can judge. Using such information, the information processing apparatus 100 may select startup members and make investment decisions.

また、情報処理装置１００は、各企業等の組織ごとに生成されたグラフ情報を用いて、類似する類似対象を抽出してもよい。例えば、情報処理装置１００は、Ａ企業のグラフ情報を用いて、Ｂ企業の個人ＺＺＺに類似する類似個人をＡ企業内の個人から抽出してもよい。そして、情報処理装置１００は、抽出した情報をＡ企業からＢ企業がヘッドハンティングを行うために用いてもよい。 Further, the information processing apparatus 100 may extract similar similar targets using graph information generated for each organization such as each company. For example, the information processing apparatus 100 may extract similar individuals similar to individual ZZZ of B company from individuals in A company using graph information of A company. Then, the information processing apparatus 100 may use the extracted information to conduct headhunting from the A company to the B company.

また、情報処理装置１００は、情報提供を受けるユーザ自身の情報を対象情報として、類似対象の抽出を行ってもよい。この場合、情報処理装置１００は、ユーザがどのような人物に似ているかを示す情報をユーザに提供することができる。例えば、情報処理装置１００は、ユーザが過去に成功した人物のうち、どの人物に似ているかを示す情報をユーザに提供することができる。例えば、情報処理装置１００は、ユーザと似た境遇にある人物示す情報をユーザに提供することができる。また、この場合、情報処理装置１００は、ユーザと似た境遇にある人物からのアドバイスをユーザに提供してもよい。 Further, the information processing apparatus 100 may extract a similar target using information of the user himself/herself who receives information as target information. In this case, the information processing apparatus 100 can provide the user with information indicating what kind of person the user resembles. For example, the information processing apparatus 100 can provide the user with information indicating which person the user resembles among successful persons in the past. For example, the information processing apparatus 100 can provide the user with information indicating a person in a situation similar to that of the user. Also, in this case, the information processing apparatus 100 may provide the user with advice from a person who is in a situation similar to that of the user.

また、情報処理装置１００は、病気の人を見つける診断等を目的として、類似対象の抽出を行ってもよい。情報処理装置１００は、抽出した情報に基づいて、クエリとなる個人についての判定を行ってもよい。例えば、情報処理装置１００は、一の個人を対象として、抽出された類似個人に所定の疾病（「疾病ＤＳＸ」とする）を有する個人が含まれる場合、一の個人が疾病ＤＳＸになる可能性が高いと判定してもよい。 Further, the information processing apparatus 100 may extract similar objects for the purpose of diagnosing a sick person. The information processing apparatus 100 may determine the individual serving as the query based on the extracted information. For example, the information processing apparatus 100 targets one individual, and if the extracted similar individuals include an individual with a predetermined disease (referred to as “disease DSX”), the information processing apparatus 100 determines the possibility that the one individual will have the disease DSX. can be determined to be high.

〔１－６－１．用途に応じた対象情報〕
なお、図１の例では、対象情報に対象（個人）の属性情報や画像や音声や行動等の種々の情報が含まれる場合を一例として説明したが、情報処理装置１００は、類似対象の抽出を行う目的に応じた対象情報を用いてもよい。すなわち、情報処理装置１００が類似対象の抽出を行う目的に適していると想定される情報が、対象情報として用いられてもよい。 [1-6-1. Target information according to use]
In the example of FIG. 1, the case where the target information includes attribute information of the target (individual) and various information such as images, voices, and actions has been described as an example. You may use the target information according to the purpose of performing. That is, information assumed to be suitable for the purpose of extracting similar targets by the information processing apparatus 100 may be used as the target information.

例えば、類似対象の抽出の目的が、仕事等のビジネス（経済行為）に関する類似対象の抽出であれば、情報処理装置１００は、その対象のビジネスに関する情報を対象情報として用いてもよい。 For example, if the purpose of extracting similar targets is to extract similar targets related to business (economic activity) such as work, the information processing apparatus 100 may use information related to the target business as target information.

例えば、情報処理装置１００がヘッドハンティングを目的とした類似対象の抽出を行う場合、対象情報は、各個人の職務や実務経験や過去の実績等を含む経歴情報であってもよい。この場合、情報処理装置１００は、ユーザが所望する経歴を示す一の経歴情報がクエリとして取得された場合、経歴情報の類似性に基づいて各ノードが連結されたグラフ情報を用いて類似対象を抽出する。 For example, when the information processing apparatus 100 extracts similar targets for the purpose of headhunting, the target information may be biographical information including job duties, work experience, past achievements, and the like of each individual. In this case, when one piece of history information indicating a history desired by the user is acquired as a query, the information processing apparatus 100 uses graph information in which each node is connected based on the similarity of the history information to find a similar target. Extract.

また、例えば、類似対象の抽出の目的が、マッチングに関する類似対象の抽出であれば、情報処理装置１００は、その対象の容姿に関する情報（画像情報）やプロフィール情報を対象情報として用いてもよい。 Further, for example, if the purpose of extracting a similar target is to extract a similar target related to matching, the information processing apparatus 100 may use information (image information) relating to the appearance of the target and profile information as target information.

例えば、情報処理装置１００がお見合いを目的とした類似対象の抽出を行う場合、対象情報は、各個人の画像情報やプロフィール情報等の個人情報であってもよい。この場合、情報処理装置１００は、ユーザが所望する相手を示す一の個人情報がクエリとして取得された場合、個人情報の類似性に基づいて各ノードが連結されたグラフ情報を用いて類似対象を抽出する。 For example, when the information processing apparatus 100 extracts similar targets for the purpose of matching, the target information may be personal information such as image information and profile information of each individual. In this case, when one piece of personal information indicating a partner desired by the user is acquired as a query, the information processing apparatus 100 identifies a similar target using graph information in which each node is connected based on the similarity of the personal information. Extract.

また、例えば、類似対象の抽出の目的が、診断等に関する類似対象の抽出であれば、情報処理装置１００は、その対象のカルテ情報を対象情報として用いてもよい。 Further, for example, if the purpose of extracting a similar target is to extract a similar target related to diagnosis or the like, the information processing apparatus 100 may use medical chart information of the target as target information.

例えば、情報処理装置１００が疾病ＤＳＸの診断を目的とした類似対象の抽出を行う場合、対象情報は、各個人のカルテ情報であってもよい。この場合、情報処理装置１００は、診断が所望される個人のカルテ情報がクエリとして取得された場合、カルテ情報の類似性に基づいて各ノードが連結されたグラフ情報を用いて類似対象を抽出する。そして、情報処理装置１００は、抽出された類似個人に疾病ＤＳＸを有する個人が含まれる場合、一の個人が疾病ＤＳＸになる可能性が高いと判定してもよい。 For example, when the information processing apparatus 100 extracts similar objects for the purpose of diagnosing disease DSX, the object information may be medical record information of each individual. In this case, when medical record information of an individual whose diagnosis is desired is acquired as a query, the information processing apparatus 100 extracts a similar target using graph information in which each node is connected based on the similarity of the medical record information. . Then, when an individual having DSX is included in the extracted similar individuals, the information processing apparatus 100 may determine that one individual is highly likely to have DSX.

〔１－６－２．非類似対象の抽出〕
なお、図１の例では、類似性が高いノード間がエッジで連結されたグラフ情報ＧＲ１１を用いた場合を示したが、グラフ情報は種々の基準に基づいて各ノード間が連結されたグラフ情報であってもよい。情報処理装置１００は、複数の対象のうち、類似性が低いノード間がエッジで連結されたグラフ情報（「非類似連結グラフ」とする）を用いてもよい。 [1-6-2. Extraction of dissimilar objects]
In the example of FIG. 1, the graph information GR11 in which nodes with high similarity are connected by edges is used. may be The information processing apparatus 100 may use graph information (referred to as a “dissimilar connection graph”) in which nodes with low similarity among a plurality of objects are connected by edges.

例えば、情報処理装置１００は、非類似連結グラフを用いて、一の対象と非類似の対象を抽出してもよい。非類似連結グラフにおける複数の対象のうち、一の対象と非類似の対象を対応対象として抽出してもよい。この場合、情報処理装置１００は、一の対象と類似性が低いという対応関係を有する対応対象（非類似対象）を抽出する。情報処理装置１００は、非類似連結グラフにおける起点ノードからエッジが連結されたノードを抽出することにより、クエリ（一の対象）との類似性が低いノードを抽出することができる。すなわち、情報処理装置１００は、グラフ情報における類似関係が低い（距離が遠い）ユーザを探すことができる。例えば、情報処理装置１００は、グラフ構造の距離関数を逆転させて、一のユーザから遠い人を探すことができる。例えば、情報処理装置１００は、グラフ構造の距離関数の逆関数を用いることにより、一のユーザから遠い人を探すことができる。 For example, the information processing apparatus 100 may extract a target that is dissimilar to one target using a dissimilar connection graph. A target that is dissimilar to one of the plurality of targets in the dissimilar connection graph may be extracted as a corresponding target. In this case, the information processing apparatus 100 extracts a corresponding target (non-similar target) having a correspondence relationship of low similarity with one target. The information processing apparatus 100 can extract a node with low similarity to a query (one target) by extracting a node to which an edge is connected from a starting node in a dissimilar connection graph. That is, the information processing apparatus 100 can search for a user with a low similarity relationship (long distance) in the graph information. For example, the information processing apparatus 100 can search for a person far from one user by inverting the distance function of the graph structure. For example, the information processing apparatus 100 can search for a person far from one user by using the inverse function of the distance function of the graph structure.

例えば、情報処理装置１００は、非類似連結グラフを用いて抽出した一の個人（ユーザ）と非類似の個人（ユーザ）に対して、所定のマーケティングを行ってもよい。例えば、情報処理装置１００は、口コミ等の投稿を行わせたい所定の情報を提供した一のユーザと非類似のユーザ（非類似ユーザ）に対して、所定の情報を非類似ユーザに提供することにより、一のユーザや非類似ユーザ等の幅広いユーザが投稿を行う可能性を高めることができる。所定のマーケティングを行ってもよい。このように、情報処理装置１００は、グラフ情報における広範囲のユーザに対して所定の情報を提供することにより、幅広いユーザに対して所定の情報の拡散を行わせることができる。例えば、情報処理装置１００は、グラフ情報のベクトル空間における広範囲のユーザに対して所定の情報を提供することにより、幅広いユーザに対してバズマーケティングを行うことができる。これにより、情報処理装置１００は、バズマーケティング等の所望のマーケティングを適切に実行することができる。 For example, the information processing apparatus 100 may perform predetermined marketing to individuals (users) who are dissimilar to one individual (user) extracted using a dissimilar connection graph. For example, the information processing apparatus 100 provides predetermined information to a user (dissimilar user) who is dissimilar to one user who has provided predetermined information to be posted, such as word of mouth, to the dissimilar user. Accordingly, it is possible to increase the possibility that a wide range of users, such as one user and dissimilar users, will post. Predetermined marketing may be carried out. In this way, the information processing apparatus 100 can spread predetermined information to a wide range of users by providing predetermined information to a wide range of users in the graph information. For example, the information processing apparatus 100 can perform buzz marketing to a wide range of users by providing predetermined information to a wide range of users in the vector space of graph information. Thereby, the information processing apparatus 100 can appropriately perform desired marketing such as buzz marketing.

〔２．情報処理システムの構成〕
図２に示すように、情報処理システム１は、端末装置１０と、情報提供装置５０と、情報処理装置１００とが含まれる。端末装置１０と、情報提供装置５０と、情報処理装置１００とは所定のネットワークＮを介して、有線または無線により通信可能に接続される。図２は、実施形態に係る情報処理システムの構成例を示す図である。なお、図２に示した情報処理システム１には、複数台の端末装置１０や、複数台の情報提供装置５０や、複数台の情報処理装置１００が含まれてもよい。 [2. Configuration of information processing system]
As shown in FIG. 2 , the information processing system 1 includes a terminal device 10 , an information providing device 50 and an information processing device 100 . The terminal device 10, the information providing device 50, and the information processing device 100 are connected via a predetermined network N so as to be communicable by wire or wirelessly. FIG. 2 is a diagram illustrating a configuration example of an information processing system according to the embodiment; The information processing system 1 shown in FIG. 2 may include multiple terminal devices 10 , multiple information providing devices 50 , and multiple information processing devices 100 .

端末装置１０は、ユーザによって利用される情報処理装置である。端末装置１０は、ユーザによる種々の操作を受け付ける。なお、以下では、端末装置１０をユーザと表記する場合がある。すなわち、以下では、ユーザを端末装置１０と読み替えることもできる。なお、上述した端末装置１０は、例えば、スマートフォンや、タブレット型端末や、ノート型ＰＣ（Personal Computer）や、デスクトップＰＣや、携帯電話機や、ＰＤＡ（Personal Digital Assistant）等により実現される。例えば、端末装置１０は、所定のサーバシステムを管理者が利用する情報処理装置であってもよい。例えば、端末装置１０は、対象情報を収集し、対象情報を情報処理装置１００へ送信する。 The terminal device 10 is an information processing device used by a user. The terminal device 10 accepts various operations by the user. In addition, below, the terminal device 10 may be described as a user. That is, hereinafter, the user can also be read as the terminal device 10 . The terminal device 10 described above is implemented by, for example, a smart phone, a tablet terminal, a notebook PC (Personal Computer), a desktop PC, a mobile phone, a PDA (Personal Digital Assistant), or the like. For example, the terminal device 10 may be an information processing device in which a predetermined server system is used by an administrator. For example, the terminal device 10 collects target information and transmits the target information to the information processing device 100 .

情報処理装置１００は、起点ベクトルを起点としてグラフ情報を検索することにより、複数の対象のうち、一の対象に類似する対象である類似対象を抽出する情報処理装置である。例えば、情報処理装置１００は、一の対象に関する対象情報を取得し、対象情報と、グラフ情報の検索の起点となる起点ベクトルに関する情報とに基づいて、起点ベクトルを決定する。 The information processing apparatus 100 is an information processing apparatus that extracts a similar target that is similar to one target from among a plurality of targets by searching graph information using an origin vector as a starting point. For example, the information processing apparatus 100 acquires target information about one target, and determines a starting vector based on the target information and information about a starting vector that is a starting point for searching graph information.

情報処理装置１００は、ユーザ等に種々の情報提供を行うための情報が格納された情報処理装置である。例えば、情報処理装置１００は、端末装置１０から一の対象の対象情報（以下、「クエリ情報」や「クエリ」ともいう）を取得すると、クエリに類似する対象（ベクトル情報等）を検索し、検索結果を端末装置１０に提供する。図１の例では、情報処理装置１００は、端末装置１０から一の対象の対象情報を取得すると、一の対象に類似する対象を検索し、検索結果を類似の対象として端末装置１０に提供する。また、例えば、情報処理装置１００が端末装置１０に提供するデータは、対象の名称や対象の対象情報自体であってもよいし、ＵＲＬ（Uniform Resource Locator）等の対応するデータを参照するための情報であってもよい。例えば、情報処理装置１００が端末装置１０に提供するデータが対象の名称である場合、氏名等の個人名や企業名等のグループ名であってもよい。 The information processing device 100 is an information processing device that stores information for providing various information to users and the like. For example, when the information processing device 100 acquires target information of one target (hereinafter also referred to as “query information” or “query”) from the terminal device 10, the information processing device 100 searches for a target (vector information etc.) similar to the query, A search result is provided to the terminal device 10 . In the example of FIG. 1, when the information processing apparatus 100 acquires the target information of one target from the terminal device 10, the information processing device 100 searches for a target similar to the one target, and provides the search result to the terminal device 10 as a similar target. . Further, for example, the data provided by the information processing device 100 to the terminal device 10 may be a target name or target information itself, or may be a URL (Uniform Resource Locator) for referring to corresponding data. It may be information. For example, when the data provided by the information processing device 100 to the terminal device 10 is a target name, it may be an individual name such as a name or a group name such as a company name.

情報提供装置５０は、情報処理装置１００に種々の情報提供を行うための情報が格納された情報処理装置である。例えば、情報提供装置５０は、ウェブサーバ等の種々の外部装置から収集した対象情報等が格納されてもよい。例えば、情報提供装置５０は、グラフ情報やインデックス情報やモデル等の種々の情報を情報処理装置１００に提供する情報処理装置である。 The information providing device 50 is an information processing device that stores information for providing various information to the information processing device 100 . For example, the information providing device 50 may store target information and the like collected from various external devices such as a web server. For example, the information providing device 50 is an information processing device that provides various information such as graph information, index information, and models to the information processing device 100 .

〔３．情報処理装置の構成〕
次に、図３を用いて、実施形態に係る情報処理装置１００の構成について説明する。図３は、実施形態に係る情報処理装置の構成例を示す図である。図３に示すように、情報処理装置１００は、通信部１１０と、記憶部１２０と、制御部１３０とを有する。なお、情報処理装置１００は、情報処理装置１００の管理者等から各種操作を受け付ける入力部（例えば、キーボードやマウス等）や、各種情報を表示するための表示部（例えば、液晶ディスプレイ等）を有してもよい。 [3. Configuration of Information Processing Device]
Next, the configuration of the information processing apparatus 100 according to the embodiment will be described using FIG. FIG. 3 is a diagram illustrating a configuration example of an information processing apparatus according to the embodiment; As shown in FIG. 3, the information processing apparatus 100 has a communication section 110, a storage section 120, and a control section . The information processing apparatus 100 includes an input unit (for example, a keyboard, a mouse, etc.) that receives various operations from an administrator of the information processing apparatus 100, and a display unit (for example, a liquid crystal display, etc.) for displaying various information. may have.

（通信部１１０）
通信部１１０は、例えば、ＮＩＣ（Network Interface Card）等によって実現される。そして、通信部１１０は、ネットワーク（例えば図２中のネットワークＮ）と有線または無線で接続され、端末装置１０や情報提供装置５０との間で情報の送受信を行う。 (Communication unit 110)
The communication unit 110 is realized by, for example, a NIC (Network Interface Card) or the like. The communication unit 110 is connected to a network (for example, network N in FIG. 2) by wire or wirelessly, and transmits and receives information to and from the terminal device 10 and the information providing device 50 .

（記憶部１２０）
記憶部１２０は、例えば、ＲＡＭ（Random Access Memory）、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。実施形態に係る記憶部１２０は、図３に示すように、対象情報記憶部１２１と、インデックス情報記憶部１２２と、グラフ情報記憶部１２３と、モデル情報記憶部１２４とを有する。 (storage unit 120)
The storage unit 120 is realized by, for example, a semiconductor memory device such as a RAM (Random Access Memory) or a flash memory, or a storage device such as a hard disk or an optical disk. The storage unit 120 according to the embodiment includes a target information storage unit 121, an index information storage unit 122, a graph information storage unit 123, and a model information storage unit 124, as shown in FIG.

（対象情報記憶部１２１）
実施形態に係る対象情報記憶部１２１は、対象（オブジェクト）に関する各種情報を記憶する。例えば、対象情報記憶部１２１は、対象ＩＤやベクトルデータを記憶する。図４の例では、対象（個人）に関する対象情報（個人情報）を記憶する。図４は、実施形態に係る対象情報記憶部の一例を示す図である。図４に示す対象情報記憶部１２１は、「対象ＩＤ」、「対象」、「対象情報」、「ベクトル情報」といった項目が含まれる。 (Target information storage unit 121)
The target information storage unit 121 according to the embodiment stores various types of information about targets (objects). For example, the target information storage unit 121 stores target IDs and vector data. In the example of FIG. 4, target information (personal information) relating to the target (individual) is stored. 4 is a diagram illustrating an example of a target information storage unit according to the embodiment; FIG. The target information storage unit 121 shown in FIG. 4 includes items such as "target ID", "target", "target information", and "vector information".

「対象ＩＤ」は、対象（オブジェクト）を識別するための識別情報を示す。また、「対象」は、対象ＩＤにより識別される対象（個人）の具体的な名称や内容等を示す。なお、図４の例では、対象を「個人＃１」といった抽象的な符号で示すが、括弧内の「Ａさん」等のように、個人を特定するための文字情報等が含まれてもよい。 "Target ID" indicates identification information for identifying a target (object). "Target" indicates the specific name, content, etc. of the target (individual) identified by the target ID. In the example of FIG. 4, the target is indicated by an abstract code such as "individual #1". good.

「対象情報」は、対象ＩＤにより識別される対象（個人）に関する情報を示す。なお、図４の例では、対象情報を「ＴＤＴ１」といった抽象的な符号で示すが、各対象情報は、対象ＩＤにより識別される対象（個人）の年齢、性別等の属性情報や年齢、性別等の属性情報や顔や全体を撮像した画像や音声や行動等の種々の対象（個人）に関する情報を含んでもよい。「ベクトル情報」とは、対象ＩＤにより識別される対象（オブジェクト）に対応するベクトル情報を示す。すなわち、図４の例では、対象（オブジェクト）を識別する対象ＩＤに対して、オブジェクトに対応するベクトルデータ（ベクトル情報）が対応付けられて登録されている。 "Target information" indicates information about a target (individual) identified by the target ID. In the example of FIG. 4, the target information is indicated by an abstract code such as "TDT1", but each target information is attribute information such as age and gender of the target (individual) identified by the target ID It may also include attribute information, such as face and whole image, voice, behavior, and other information related to various objects (individuals). "Vector information" indicates vector information corresponding to a target (object) identified by the target ID. That is, in the example of FIG. 4, vector data (vector information) corresponding to an object is registered in association with an object ID that identifies the object.

例えば、図４の例では、対象ＩＤ「ＴＧ１」により識別される対象（オブジェクト）は、「１０，２４，５４，２．．．」の多次元（Ｎ次元）のベクトル情報が対応付けられることを示す。例えば、個人＃１については、モデルＭ１等により、個人＃１の特徴を示す「１０，２４，５４，２．．．」の多次元（Ｎ次元）のベクトル情報が対象情報ＴＤＴ１から抽出されたことを示す。 For example, in the example of FIG. 4, the target (object) identified by the target ID "TG1" is associated with multidimensional (N-dimensional) vector information of "10, 24, 54, 2...". indicates For example, for individual #1, multidimensional (N-dimensional) vector information of “10, 24, 54, 2 . indicates that

なお、対象情報記憶部１２１は、上記に限らず、目的に応じて種々の情報を記憶してもよい。 It should be noted that the target information storage unit 121 may store various types of information, not limited to the above, depending on the purpose.

（インデックス情報記憶部１２２）
実施形態に係るインデックス情報記憶部１２２は、インデックスに関する各種情報を記憶する。図５は、実施形態に係るインデックス情報記憶部の一例を示す図である。具体的には、図５の例では、インデックス情報記憶部１２２は、ツリー構造のインデックス情報を示す。図５の例では、インデックス情報記憶部１２２は、「ルート階層」、「第１階層」、「第２階層」、「第３階層」等といった項目が含まれる。なお、「第１階層」～「第３階層」に限らず、インデックスの階層数に応じて、「第４階層」、「第５階層」、「第６階層」等が含まれてもよい。 (Index information storage unit 122)
The index information storage unit 122 according to the embodiment stores various information regarding indexes. FIG. 5 is a diagram illustrating an example of an index information storage unit according to the embodiment; Specifically, in the example of FIG. 5, the index information storage unit 122 indicates tree-structured index information. In the example of FIG. 5, the index information storage unit 122 includes items such as "root layer", "first layer", "second layer", and "third layer". It should be noted that not only "first layer" to "third layer" but "fourth layer", "fifth layer", "sixth layer", etc. may be included according to the number of layers of the index.

「ルート階層」は、インデックスを用いた起点ノードの決定の開始点となるルート（最上位）の階層を示す。「第１階層」は、インデックスの第１階層に属するノード（節点またはグラフ情報中のベクトル）を識別（特定）する情報が格納される。「第１階層」に格納されるノードは、インデックスの根（ルート）に直接結ばれる階層に対応するノードとなる。 The “root hierarchy” indicates the root (top) hierarchy that is the starting point for determining the origin node using the index. "First layer" stores information for identifying (specifying) a node (node or vector in graph information) belonging to the first layer of the index. A node stored in the "first layer" is a node corresponding to a layer directly connected to the root of the index.

「第２階層」は、インデックスの第２階層に属するノード（節点またはグラフ情報中のベクトル）を識別（特定）する情報が格納される。「第２階層」に格納されるノードは、第１階層のノードに結ばれる直下の階層に対応するノードとなる。「第３階層」は、インデックスの第３階層に属するノード（節点またはグラフ情報中のベクトル）を識別（特定）する情報が格納される。「第３階層」に格納されるノードは、第２階層のノードに結ばれる直下の階層に対応するノードとなる。 "Second layer" stores information that identifies (specifies) a node (node or vector in graph information) belonging to the second layer of the index. A node stored in the "second layer" is a node corresponding to the immediately lower layer connected to a node in the first layer. "Third layer" stores information for identifying (specifying) a node (node or vector in graph information) belonging to the third layer of the index. A node stored in the "third layer" is a node corresponding to the immediately lower layer linked to a node in the second layer.

例えば、図５に示す例においては、インデックス情報記憶部１２２には、図１中のインデックス情報ＩＮＤ１１に対応する情報が記憶される。例えば、インデックス情報記憶部１２２は、第１階層のノードが、節点ＶＴ１～ＶＴ３等であることを示す。また、各節点の下の括弧内の数値は、各節点に対応するベクトルの値を示す。 For example, in the example shown in FIG. 5, the index information storage unit 122 stores information corresponding to the index information IND11 in FIG. For example, the index information storage unit 122 indicates that the nodes of the first layer are the nodes VT1 to VT3. Numerical values in parentheses under each node indicate vector values corresponding to each node.

また、例えば、インデックス情報記憶部１２２は、節点ＶＴ２の直下の第２階層のノードが、節点ＶＴ２－１～ＶＴ２－４であることを示す。また、例えば、インデックス情報記憶部１２２は、節点ＶＴ２－２の直下の第３階層のノードが、ノードＮ３５、ノードＮ４５１、ノードＮ６９３のグラフ情報ＧＲ１１中のノード（ベクトル）であることを示す。 Also, for example, the index information storage unit 122 indicates that the nodes of the second layer immediately below the node VT2 are the nodes VT2-1 to VT2-4. Also, for example, the index information storage unit 122 indicates that the nodes in the third layer immediately below the node VT2-2 are the nodes (vectors) in the graph information GR11 of the nodes N35, N451, and N693.

なお、インデックス情報記憶部１２２は、上記に限らず、目的に応じて種々の情報を記憶してもよい。 It should be noted that the index information storage unit 122 may store various types of information, not limited to the above, depending on the purpose.

（グラフ情報記憶部１２３）
実施形態に係るグラフ情報記憶部１２３は、グラフ情報に関する各種情報を記憶する。図６は、実施形態に係るグラフ情報記憶部の一例を示す図である。図６の例では、グラフ情報記憶部１２３は、「ノードＩＤ」、「対象ＩＤ」、および「エッジ情報」といった項目を有する。また、「エッジ情報」には、「エッジＩＤ」や「参照先」といった情報が含まれる。 (Graph information storage unit 123)
The graph information storage unit 123 according to the embodiment stores various types of information regarding graph information. 6 is a diagram illustrating an example of a graph information storage unit according to the embodiment; FIG. In the example of FIG. 6, the graph information storage unit 123 has items such as "node ID", "target ID", and "edge information". "Edge information" includes information such as "edge ID" and "reference destination".

「ノードＩＤ」は、グラフデータにおける各ノード（対象）を識別するための識別情報を示す。また、「対象ＩＤ」は、対象（オブジェクト）を識別するための識別情報を示す。 “Node ID” indicates identification information for identifying each node (object) in graph data. "Target ID" indicates identification information for identifying a target (object).

また、「エッジ情報」は、対応するノードに接続されるエッジに関する情報を示す。図６の例では、「エッジ情報」は、エッジが有向エッジである場合を示し、対応するノードから出力される出力エッジに関する情報を示す。また、「エッジＩＤ」は、ノード間を連結するエッジを識別するための識別情報を示す。また、「参照先」は、エッジにより連結された参照先（ノード）を示す情報を示す。すなわち、図６の例では、ノードを識別するノードＩＤに対して、そのノードに対応するオブジェクト（対象）を識別する情報やそのノードからの有向エッジ（出力エッジ）が連結される参照先（ノード）が対応付けられて登録されている。 "Edge information" indicates information about edges connected to the corresponding node. In the example of FIG. 6, "edge information" indicates the case where the edge is a directed edge, and indicates information regarding the output edge output from the corresponding node. "Edge ID" indicates identification information for identifying an edge connecting nodes. "Reference destination" indicates information indicating a reference destination (node) connected by an edge. That is, in the example of FIG. 6, for a node ID that identifies a node, a reference destination ( node) are associated and registered.

例えば、図６の例では、ノードＩＤ「Ｎ１」により識別されるノード（ベクトル）は、対象ＩＤ「ＴＧ１」により識別される対象（オブジェクト）に対応することを示す。また、ノードＩＤ「Ｎ１」により識別されるノードからは、エッジＩＤ「Ｅ１１」により識別されるエッジが、ノードＩＤ「Ｎ２５」により識別されるノード（ベクトル）に連結されることを示す。すなわち、図６の例では、ノードＩＤ「Ｎ１」により識別されるノード（ベクトル）からはノードＩＤ「Ｎ２５」により識別されるノード（ベクトル）に辿ることができることを示す。 For example, the example of FIG. 6 indicates that the node (vector) identified by the node ID "N1" corresponds to the target (object) identified by the target ID "TG1". It also indicates that the edge identified by the edge ID "E11" is connected to the node (vector) identified by the node ID "N25" from the node identified by the node ID "N1". That is, the example of FIG. 6 indicates that the node (vector) identified by the node ID "N1" can be traced to the node (vector) identified by the node ID "N25".

なお、グラフ情報記憶部１２３は、上記に限らず、目的に応じて種々の情報を記憶してもよい。例えば、グラフ情報記憶部１２３は、各ノード（ベクトル）間を連結するエッジの長さが記憶されてもよい。すなわち、グラフ情報記憶部１２３は、各ノード（ベクトル）間の距離を示す情報が記憶されてもよい。 It should be noted that the graph information storage unit 123 may store various types of information, not limited to the above, depending on the purpose. For example, the graph information storage unit 123 may store lengths of edges connecting nodes (vectors). That is, the graph information storage unit 123 may store information indicating distances between nodes (vectors).

（モデル情報記憶部１２４）
実施形態に係るモデル情報記憶部１２４は、モデルに関する情報を記憶する。例えば、モデル情報記憶部１２４は、生成処理により生成されたモデル情報（モデルデータ）を記憶する。図７は、実施形態に係るモデル情報記憶部の一例を示す図である。図７に示すモデル情報記憶部１２４は、「モデルＩＤ」、「用途」、「モデルデータ」といった項目が含まれる。なお、図７では、モデルＭ１、Ｍ２のみを図示するが、Ｍ２１、Ｍ２２等、各用途（予測の対象）に応じて多数のモデル情報が記憶されてもよい。 (Model information storage unit 124)
The model information storage unit 124 according to the embodiment stores information about models. For example, the model information storage unit 124 stores model information (model data) generated by the generation process. 7 is a diagram illustrating an example of a model information storage unit according to the embodiment; FIG. The model information storage unit 124 shown in FIG. 7 includes items such as "model ID", "usage", and "model data". Although only the models M1 and M2 are illustrated in FIG. 7, a large number of model information such as M21 and M22 may be stored according to each application (prediction target).

「モデルＩＤ」は、モデルを識別するための識別情報を示す。例えば、モデルＩＤ「Ｍ１」により識別されるモデルは、図１の例に示したモデルＭ１に対応する。「用途」は、対応するモデルの用途を示す。また、「モデルデータ」は、対応付けられた対応するモデルのデータを示す。例えば、「モデルデータ」には、各層におけるノードと、各ノードが採用する関数と、ノードの接続関係と、ノード間の接続に対して設定される接続係数とを含む情報が含まれる。 "Model ID" indicates identification information for identifying a model. For example, the model identified by model ID "M1" corresponds to model M1 shown in the example of FIG. "Use" indicates the use of the corresponding model. Also, "model data" indicates the data of the associated corresponding model. For example, "model data" includes information including nodes in each layer, functions adopted by each node, connection relations of nodes, and connection coefficients set for connections between nodes.

例えば、図７に示す例において、モデルＩＤ「Ｍ１」により識別されるモデル（モデルＭ１）は、用途が「特徴抽出（個人）」であり、入力された対象情報からの特徴の抽出に用いられることを示す。また、モデルＭ１のモデルデータは、モデルデータＭＤＴ１であることを示す。 For example, in the example shown in FIG. 7, the model (model M1) identified by the model ID "M1" is used for "feature extraction (individual)" and is used to extract features from input target information. indicates that It also indicates that the model data of the model M1 is the model data MDT1.

モデルＭ１（モデルデータＭＤＴ１）は、対象（個人）の対象情報（個人情報）が入力される入力層と、出力層と、入力層から出力層までのいずれかの層であって出力層以外の層に属する第１要素と、第１要素と第１要素の重みとに基づいて値が算出される第２要素と、を含み、入力層に入力された対象情報に対し、出力層以外の各層に属する各要素を第１要素として、第１要素と第１要素の重みとに基づく演算を行うことにより、入力層に入力される情報と同様の情報を出力層から出力するよう、コンピュータを機能させるためのモデルである。 The model M1 (model data MDT1) includes an input layer in which target information (personal information) of a target (individual) is input, an output layer, and any layer from the input layer to the output layer, except for the output layer. Each layer other than the output layer includes a first element belonging to the layer and a second element whose value is calculated based on the weight of the first element and the first element, and for the target information input to the input layer with each element belonging to the first element as the first element, and by performing an operation based on the first element and the weight of the first element, the computer functions to output the same information as the information input to the input layer from the output layer It is a model for

また、モデルＭ１、Ｍ２等がＤＮＮ（Deep Neural Network）等、１つまたは複数の中間層を有するニューラルネットワークで実現されるとする。この場合、例えば、モデルＭ１、Ｍ２が含む第１要素は、入力層または中間層が有するいずれかのノードに対応する。また、第２要素は、第１要素と対応するノードから値が伝達されるノードである次段のノードに対応する。また、第１要素の重みは、第１要素と対応するノードから第２要素と対応するノードに伝達される値に対して考慮される重みである接続係数に対応する。 It is also assumed that the models M1, M2, etc. are implemented by a neural network having one or more intermediate layers, such as a DNN (Deep Neural Network). In this case, for example, the first element included in the models M1 and M2 corresponds to any node of the input layer or the intermediate layer. Also, the second element corresponds to the next node, which is a node to which the value is transmitted from the node corresponding to the first element. Also, the weight of the first element corresponds to the connection coefficient, which is the weight considered for the value transmitted from the node corresponding to the first element to the node corresponding to the second element.

ここで、モデルＭ２１、Ｍ２２等が「ｙ＝ａ１＊ｘ１＋ａ２＊ｘ２＋・・・＋ａｉ＊ｘｉ」で示す回帰モデルで実現されるとする。この場合、例えば、モデルＭ２１、Ｍ２２が含む第１要素は、ｘ１やｘ２等といった入力データ（ｘｉ）に対応する。また、第１要素の重みは、ｘｉに対応する係数ａｉに対応する。ここで、回帰モデルは、入力層と出力層とを有する単純パーセプトロンと見做すことができる。各モデルを単純パーセプトロンと見做した場合、第１要素は、入力層が有するいずれかのノードに対応し、第２要素は、出力層が有するノードと見做すことができる。 Here, it is assumed that the models M21, M22, etc. are realized by a regression model represented by "y=a1*x1+a2*x2+...+ai*xi". In this case, for example, the first elements included in the models M21 and M22 correspond to input data (xi) such as x1 and x2. Also, the weight of the first element corresponds to the coefficient ai corresponding to xi. Here, the regression model can be viewed as a simple perceptron with an input layer and an output layer. When each model is regarded as a simple perceptron, the first element can be regarded as a node of the input layer, and the second element can be regarded as a node of the output layer.

なお、モデル情報記憶部１２４は、上記に限らず、目的に応じて種々のモデル情報を記憶してもよい。 It should be noted that the model information storage unit 124 may store various types of model information, not limited to the above, depending on the purpose.

（制御部１３０）
図３の説明に戻って、制御部１３０は、コントローラ（controller）であり、例えば、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等によって、情報処理装置１００内部の記憶装置に記憶されている各種プログラム（情報処理プログラムの一例に相当）がＲＡＭを作業領域として実行されることにより実現される。また、制御部１３０は、コントローラであり、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路により実現される。制御部１３０は、モデル情報記憶部１２４に記憶されているモデルＭ１、Ｍ２等に従った情報処理により、対象の対象情報が入力される入力層と、出力層と、入力層から出力層までのいずれかの層であって出力層以外の層に属する第１要素と、第１要素と第１要素の重みとに基づいて値が算出される第２要素と、を含み、入力層に入力された対象情報に対し、出力層以外の各層に属する各要素を第１要素として、第１要素と第１要素の重みとに基づく演算を行うことにより、入力層に入力される情報と同様の情報を出力層から出力する。 (control unit 130)
Returning to the description of FIG. 3, the control unit 130 is a controller, and is stored in a storage device inside the information processing apparatus 100 by, for example, a CPU (Central Processing Unit) or an MPU (Micro Processing Unit). Various programs (corresponding to an example of an information processing program) are executed by using the RAM as a work area. Also, the control unit 130 is a controller, and is implemented by an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array). The control unit 130 performs information processing in accordance with the models M1, M2, etc. stored in the model information storage unit 124 to create an input layer to which target information of a target is input, an output layer, and layers from the input layer to the output layer. A first element belonging to any layer other than the output layer and a second element whose value is calculated based on the first element and the weight of the first element, and is input to the input layer The same information as the information input to the input layer is obtained by performing an operation based on the first element and the weight of the first element, with each element belonging to each layer other than the output layer as the first element. is output from the output layer.

図３に示すように、制御部１３０は、取得部１３１と、生成部１３２と、決定部１３３と、検索部１３４と、提供部１３５とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部１３０の内部構成は、図３に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。 As shown in FIG. 3, the control unit 130 includes an acquisition unit 131, a generation unit 132, a determination unit 133, a search unit 134, and a provision unit 135, and has information processing functions and actions described below. realize or perform Note that the internal configuration of the control unit 130 is not limited to the configuration shown in FIG. 3, and may be another configuration as long as it performs information processing described later.

（取得部１３１）
取得部１３１は、各種情報を取得する。例えば、取得部１３１は、記憶部１２０から各種情報を取得する。例えば、取得部１３１は、対象情報記憶部１２１や、インデックス情報記憶部１２２や、グラフ情報記憶部１２３や、モデル情報記憶部１２４等から各種情報を取得する。また、取得部１３１は、各種情報を外部の情報処理装置から取得する。取得部１３１は、端末装置１０や情報提供装置５０から各種情報を取得する。 (Acquisition unit 131)
Acquisition unit 131 acquires various types of information. For example, the acquisition unit 131 acquires various information from the storage unit 120 . For example, the acquisition unit 131 acquires various information from the target information storage unit 121, the index information storage unit 122, the graph information storage unit 123, the model information storage unit 124, and the like. The acquisition unit 131 also acquires various types of information from an external information processing device. The acquisition unit 131 acquires various types of information from the terminal device 10 and the information providing device 50 .

取得部１３１は、人を要素とする複数の対象の各々に対応する複数のノードが、複数の対象の類似性に応じて連結されたグラフ情報と、一の対象に関する対象情報を取得する。取得部１３１は、複数の対象の各々の特徴を示す複数のベクトルが類似性に応じて連結されたグラフ情報を取得する。取得部１３１は、複数の対象の各々に対応する複数のベクトルが類似性に応じて連結されたグラフ情報と、一の対象に関する対象情報を取得する。取得部１３１は、所定のモデルを用いて複数の対象の各々から抽出された特徴量を要素とする複数のベクトルが、類似性に応じて連結されたグラフ情報を取得する。取得部１３１は、複数の対象に関する情報を所定のモデルに入力することにより、抽出される複数の対象の各々の特徴量を要素とする複数のベクトルが、類似性に応じて連結されたグラフ情報を取得する。 The acquiring unit 131 acquires graph information in which a plurality of nodes corresponding to each of a plurality of targets having people as elements are connected according to the similarity of the plurality of targets, and target information about one target. The acquiring unit 131 acquires graph information in which a plurality of vectors representing characteristics of each of a plurality of targets are connected according to similarity. The acquiring unit 131 acquires graph information in which a plurality of vectors corresponding to each of a plurality of targets are connected according to similarity, and target information regarding one target. The acquisition unit 131 acquires graph information in which a plurality of vectors, each element of which is a feature amount extracted from each of a plurality of targets using a predetermined model, are connected according to similarity. By inputting information about a plurality of targets into a predetermined model, the acquiring unit 131 obtains graph information in which a plurality of vectors whose elements are the feature amounts of each of the extracted targets are connected according to similarity. to get

取得部１３１は、ユーザが利用する端末装置１０から一の対象に関する対象情報を取得する。取得部１３１は、各々が個人である複数の対象の各々に対応する複数のノードが連結されたグラフ情報と、一の個人である一の対象に関する対象情報を取得する。取得部１３１は、各々が複数人を含むグループである複数の対象の各々に対応する複数のノードが連結されたグラフ情報と、一のグループである一の対象に関する対象情報を取得する。 The acquisition unit 131 acquires target information about one target from the terminal device 10 used by the user. The acquiring unit 131 acquires graph information in which a plurality of nodes corresponding to each of a plurality of subjects each of which is an individual are connected, and target information regarding one subject which is one individual. The acquisition unit 131 acquires graph information in which a plurality of nodes corresponding to each of a plurality of targets, each of which is a group including a plurality of people, are connected, and target information regarding one target, which is one group.

例えば、取得部１３１は、データ検索の対象となる複数のノード（ベクトル）を取得する。例えば、取得部１３１は、複数のノードと、複数のノードの各々を連結する複数の有向エッジを含む有向エッジ群を取得する。 For example, the acquisition unit 131 acquires a plurality of nodes (vectors) to be searched for data. For example, the acquiring unit 131 acquires a directed edge group including a plurality of nodes and a plurality of directed edges connecting each of the plurality of nodes.

例えば、取得部１３１は、外部の情報処理装置からグラフ情報（グラフデータ）を取得する。例えば、取得部１３１は、グラフ情報記憶部１２３からグラフ情報を取得する。例えば、取得部１３１は、グラフ情報を取得する。図１の例では、取得部１３１は、グラフ情報ＧＲ１１を取得する。 For example, the acquisition unit 131 acquires graph information (graph data) from an external information processing device. For example, the acquisition unit 131 acquires graph information from the graph information storage unit 123 . For example, the acquisition unit 131 acquires graph information. In the example of FIG. 1, the acquisition unit 131 acquires graph information GR11.

例えば、取得部１３１は、外部の情報処理装置からインデックス情報（インデックスデータ）を取得する。例えば、取得部１３１は、インデックス情報記憶部１２２からインデックス情報を取得する。例えば、取得部１３１は、木構造型のインデックス情報を取得する。図１の例では、取得部１３１は、インデックス情報ＩＮＤ１１を取得する。 For example, the acquisition unit 131 acquires index information (index data) from an external information processing device. For example, the acquisition unit 131 acquires index information from the index information storage unit 122 . For example, the acquisition unit 131 acquires tree-structured index information. In the example of FIG. 1, the acquisition unit 131 acquires index information IND11.

また、取得部１３１は、ユーザが利用する端末装置１０から一の対象に関する対象情報を取得する。例えば、取得部１３１は、検索クエリとして、一の対象の対象情報を取得する。例えば、取得部１３１は、対象（個人）に関する検索クエリを取得する。取得部１３１は、ユーザＵ１が利用する端末装置１０から個人Ｘに関する対象情報ＴＤＴ１１を取得する。 Also, the acquisition unit 131 acquires target information about one target from the terminal device 10 used by the user. For example, the acquisition unit 131 acquires target information of one target as a search query. For example, the acquisition unit 131 acquires a search query regarding an object (individual). The acquisition unit 131 acquires the target information TDT11 regarding the person X from the terminal device 10 used by the user U1.

例えば、取得部１３１は、一の対象に関する対象情報（個人情報）を取得する。例えば、取得部１３１は、端末装置１０から個人Ｘに関する対象情報ＴＤＴ１１を取得する。例えば、取得部１３１は、グラフ情報記憶部１２３から対象に関するグラフ情報ＧＲ１１を取得する。また、例えば、取得部１３１は、インデックス情報記憶部１２２（図５参照）から、グラフ情報ＧＲ１１における検索の起点となるノード（起点ベクトル）の決定に用いるインデックス情報ＩＮＤ１１を取得する。 For example, the acquisition unit 131 acquires target information (personal information) regarding one target. For example, the acquisition unit 131 acquires the target information TDT11 regarding the person X from the terminal device 10 . For example, the acquisition unit 131 acquires the graph information GR11 about the object from the graph information storage unit 123. FIG. Further, for example, the acquiring unit 131 acquires index information IND11 used for determining a node (originating vector) serving as a starting point for searching in the graph information GR11 from the index information storage unit 122 (see FIG. 5).

（生成部１３２）
生成部１３２は、各種情報を生成する。例えば、生成部１３２は、対象情報記憶部１２１に記憶された学習データ（対象情報）を用いて、モデル情報記憶部１２４に示すようなモデルを生成する。例えば、生成部１３２は、取得部１３１により取得された学習データに基づいて、入力した対象情報と同様の情報を出力するモデル（オートエンコーダ）を生成する。例えば、生成部１３２は、入力する対象情報自体を正解情報として、入力した対象情報と同様の情報を出力するモデル（オートエンコーダ）を生成する。 (Generating unit 132)
The generator 132 generates various types of information. For example, the generation unit 132 generates a model shown in the model information storage unit 124 using learning data (target information) stored in the target information storage unit 121 . For example, the generation unit 132 generates a model (autoencoder) that outputs information similar to the input target information based on the learning data acquired by the acquisition unit 131 . For example, the generation unit 132 generates a model (autoencoder) that outputs information similar to the input target information, using the input target information itself as correct information.

例えば、生成部１３２は、モデルＭ１等を生成し、生成したモデルＭ１等をモデル情報記憶部１２４に格納する。なお、生成部１３２は、いかなる学習アルゴリズムを用いてモデルＭ１を生成してもよい。例えば、生成部１３２は、ニューラルネットワーク（neural network）等の学習アルゴリズムを用いてモデルＭ１を生成する。一例として、生成部１３２がニューラルネットワークを用いてモデルＭ１等を生成する場合、モデルＭ１等は、一以上のニューロンを含む入力層と、一以上のニューロンを含む中間層と、一以上のニューロンを含む出力層とを有する。 For example, the generation unit 132 generates the model M1 and the like, and stores the generated model M1 and the like in the model information storage unit 124 . Note that the generation unit 132 may generate the model M1 using any learning algorithm. For example, the generator 132 generates the model M1 using a learning algorithm such as a neural network. As an example, when the generation unit 132 generates the model M1 or the like using a neural network, the model M1 or the like includes an input layer including one or more neurons, an intermediate layer including one or more neurons, and one or more neurons. and an output layer containing

生成部１３２は、対象の対象情報が入力される入力層と、出力層と、入力層から出力層までのいずれかの層であって出力層以外の層に属する第１要素と、第１要素と第１要素の重みとに基づいて値が算出される第２要素と、を含み、入力層に入力された対象情報に対し、出力層以外の各層に属する各要素を第１要素として、第１要素と第１要素の重みとに基づく演算を行うことにより、入力層に入力される情報と同様の情報を出力層から出力するモデルを生成する。 The generation unit 132 generates an input layer to which target information of a target is input, an output layer, a first element belonging to any layer from the input layer to the output layer other than the output layer, and a first element and a second element whose value is calculated based on the weight of the first element. By performing calculations based on the one element and the weight of the first element, a model is generated in which the same information as that input to the input layer is output from the output layer.

例えば、生成部１３２は、学習データに基づいてモデルを生成する。例えば、生成部１３２は、学習データに基づいてモデルを生成する。例えば、生成部１３２は、対象情報記憶部１２１中の対象情報ＴＤＴ１、ＴＤＴ２等を学習データ（教師データ）として、学習を行なうことにより、モデルを生成する。 For example, the generator 132 generates a model based on learning data. For example, the generator 132 generates a model based on learning data. For example, the generation unit 132 generates a model by performing learning using the target information TDT1, TDT2, etc. in the target information storage unit 121 as learning data (teacher data).

例えば、生成部１３２は、対象情報ＴＤＴ１が入力された場合に、モデルＭ１が対象情報ＴＤＴ１と同様の情報を出力するように、学習処理を行う。例えば、生成部１３２は、対象情報ＴＤＴ２が入力された場合に、モデルＭ１が対象情報ＴＤＴ２と同様の情報を出力するように、学習処理を行う。 For example, when the target information TDT1 is input, the generation unit 132 performs learning processing so that the model M1 outputs information similar to the target information TDT1. For example, the generation unit 132 performs learning processing so that the model M1 outputs information similar to the target information TDT2 when the target information TDT2 is input.

なお、生成部１３２は、オートエンコーダとしてのモデルＭ１に限らず、種々の学習アルゴリズムを用いて対象「個人：営業適正」に対応するモデルＭ２１や対象「グループ：事業成功」に対応するＭ２２等を生成してもよい。例えば、生成部１３２は、ニューラルネットワーク（neural network）、サポートベクターマシン（ＳＶＭ）、クラスタリング、強化学習等の学習アルゴリズムを用いてモデルＭ２１、Ｍ２２等を生成する。一例として、生成部１３２がニューラルネットワークを用いてモデルＭ２１、Ｍ２２等を生成する場合、モデルＭ２１、Ｍ２２等は、一以上のニューロンを含む入力層と、一以上のニューロンを含む中間層と、一以上のニューロンを含む出力層とを有する。 Note that the generation unit 132 is not limited to the model M1 as an autoencoder, and uses various learning algorithms to generate a model M21 corresponding to the target "individual: business suitability" and a model M22 corresponding to the target "group: business success". may be generated. For example, the generation unit 132 generates the models M21, M22, etc. using learning algorithms such as neural networks, support vector machines (SVM), clustering, and reinforcement learning. As an example, when the generation unit 132 generates the models M21, M22, etc. using a neural network, the models M21, M22, etc. include an input layer including one or more neurons, an intermediate layer including one or more neurons, and a and an output layer containing the above neurons.

生成部１３２は、モデルを生成し、生成したモデルをモデル情報記憶部１２４に格納する。具体的には、生成部１３２は、対象の対象情報が入力される入力層と、出力層と、入力層から出力層までのいずれかの層であって出力層以外の層に属する第１要素と、第１要素と第１要素の重みとに基づいて値が算出される第２要素と、を含み、入力層に入力された対象情報に対し、出力層以外の各層に属する各要素を第１要素として、第１要素と第１要素の重みとに基づく演算を行うことにより、所定の目的に適した対象かどうかの予測に用いられるスコアの値を出力層から出力するモデルを生成する。例えば、生成部１３２は、ある対象の対象情報が入力される入力層と、出力層と、入力層から出力層までのいずれかの層であって出力層以外の層に属する第１要素と、第１要素と第１要素の重みとに基づいて値が算出される第２要素と、を含み、入力層に入力された対象情報に対し、出力層以外の各層に属する各要素を第１要素として、第１要素と第１要素の重みとに基づく演算を行うことにより、その対象が所定の目的に適した対象かどうかの予測に用いられるスコアの値を出力層から出力するモデルＭ２１を生成する。例えば、生成部１３２は、営業に適した個人の対象情報が入力された場合に、スコア「１」を出力するようにモデルＭ２１を生成する。例えば、生成部１３２は、営業に適さない個人の対象情報が入力された場合に、スコア「０」を出力するようにモデルＭ２１を生成する。なお、情報処理装置１００は、情報提供装置５０等の他の外部装置からモデルを取得する場合、生成部１３２を有しなくてもよい。 The generation unit 132 generates a model and stores the generated model in the model information storage unit 124 . Specifically, the generation unit 132 generates a first element belonging to an input layer to which the target information of the target is input, an output layer, and any layer from the input layer to the output layer other than the output layer. and a second element whose value is calculated based on the first element and the weight of the first element. By performing calculations based on the first element and the weight of the first element as one element, a model is generated in which the output layer outputs a score value used for predicting whether a target is suitable for a predetermined purpose. For example, the generation unit 132 includes an input layer to which target information of a certain target is input, an output layer, a first element belonging to any layer from the input layer to the output layer other than the output layer, and a second element whose value is calculated based on the first element and the weight of the first element. , the calculation based on the first element and the weight of the first element generates a model M21 that outputs from the output layer the value of the score used to predict whether the target is suitable for a given purpose. do. For example, the generating unit 132 generates the model M21 so as to output a score of "1" when personal target information suitable for sales is input. For example, the generating unit 132 generates the model M21 so as to output a score of "0" when personal target information unsuitable for sales is input. Note that the information processing apparatus 100 may not have the generation unit 132 when acquiring a model from another external apparatus such as the information providing apparatus 50 .

図１の例では、生成部１３２は、対象情報の特徴を抽出するモデルを用いて各対象の対象情報からＮ次元ベクトルを生成する。生成部１３２は、モデル情報記憶部１２４（図７）に示すモデルＭ１を用いて、各対象（個人）の対象情報（個人情報）からベクトルを生成する。 In the example of FIG. 1, the generation unit 132 generates an N-dimensional vector from the target information of each target using a model for extracting features of the target information. The generation unit 132 generates a vector from the target information (personal information) of each target (individual) using the model M1 shown in the model information storage unit 124 (FIG. 7).

また、生成部１３２は、一の対象に対応する対象情報からグラフ情報の探索に用いるベクトルを生成する。図１の例では、生成部１３２は、処理群ＰＳ１１に示すような処理により、個人Ｘに対応するベクトルを生成する。生成部１３２は、個人Ｘに関する対象情報ＴＤＴ１１をモデルＭ１に入力する。具体的には、生成部１３２は、個人Ｘの対象（個人）の年齢、性別等の属性情報や顔や全体を撮像した画像や音声や行動等の情報を含む対象情報ＴＤＴ１１をモデルＭ１に入力する。そして、生成部１３２は、対象情報ＴＤＴ１１の入力後のモデルＭ１中の情報を用いて、ベクトルを生成する。例えば、生成部１３２は、対象情報ＴＤＴ１１が入力されたモデルＭ１中の各要素を用いて、ベクトルデータを生成する。 The generation unit 132 also generates a vector used for searching graph information from target information corresponding to one target. In the example of FIG. 1, the generation unit 132 generates a vector corresponding to the individual X by performing the processing shown in the processing group PS11. The generation unit 132 inputs the target information TDT11 regarding the individual X to the model M1. Specifically, the generation unit 132 inputs target information TDT11 including attribute information such as the age and gender of the target (individual) of the individual X, and information such as an image of the face and the whole image, voice, behavior, etc. to the model M1. do. Then, the generation unit 132 generates a vector using information in the model M1 after the input of the target information TDT11. For example, the generator 132 generates vector data using each element in the model M1 to which the target information TDT11 is input.

図１の例では、生成部１３２は、対象情報ＴＤＴ１１が入力されたモデルＭ１中の各要素の値を用いて、ベクトルＶＤ１１を生成する。例えば、生成部１３２は、個人Ｘの対象情報ＴＤＴ１１が入力された場合における、モデルＭ１のニューロンＮＬ１に対応する値ＶＥ１（図１０参照）やニューロンＮＬ２に対応する値ＶＥ２（図１０参照）を用いて、ベクトルを生成する。例えば、生成部１３２は、個人Ｘの対象情報ＴＤＴ１１が入力された場合に、算出されるニューロンＮＬ１に対応する値ＶＥ１をベクトルＶＤ１１の１次元目の要素として抽出してもよい。また、例えば、生成部１３２は、対象の対象情報が入力された場合に、算出されるニューロンＮＬ２に対応する値ＶＥ２をベクトルＶＤ１１の２次元目の要素として、ベクトルＶＤ１１を生成する。図１の例では、生成部１３２は、１次元目の要素が「３５」であり、２次元目の要素が「６３」であるようなベクトルＶＤ１１を生成する。 In the example of FIG. 1, the generator 132 generates the vector VD11 using the values of the elements in the model M1 to which the target information TDT11 is input. For example, the generating unit 132 uses the value VE1 (see FIG. 10) corresponding to the neuron NL1 and the value VE2 (see FIG. 10) corresponding to the neuron NL2 of the model M1 when the target information TDT11 of the person X is input. to generate a vector. For example, when the target information TDT11 of the individual X is input, the generation unit 132 may extract the calculated value VE1 corresponding to the neuron NL1 as the first-dimensional element of the vector VD11. Further, for example, when the target information of the target is input, the generating unit 132 generates the vector VD11 by using the calculated value VE2 corresponding to the neuron NL2 as the second-dimensional element of the vector VD11. In the example of FIG. 1, the generator 132 generates a vector VD11 whose first-dimensional element is "35" and whose second-dimensional element is "63."

例えば、生成部１３２は、モデルＭ２１を用いて各対象の対象情報からＮ次元ベクトルを生成してもよい。例えば、生成部１３２は、対象に対応する対象情報をモデルＭ２１に入力し、各対象の対象情報からベクトルを生成する。例えば、生成部１３２は、モデルＭ２１に対象の対象情報を入力することにより、モデルＭ２１中の各要素（ニューロン）の値を演算し、入力した対象情報に対応するスコアを出力する。例えば、生成部１３２は、モデルＭ２１の中間層の各要素（ニューロン）の値を特徴量として抽出し、各対象に対応するＮ次元のベクトルデータを生成してもよい。 For example, the generation unit 132 may generate an N-dimensional vector from the target information of each target using the model M21. For example, the generation unit 132 inputs target information corresponding to the target to the model M21 and generates a vector from the target information of each target. For example, the generation unit 132 inputs target information of a target to the model M21, calculates the value of each element (neuron) in the model M21, and outputs a score corresponding to the input target information. For example, the generation unit 132 may extract the value of each element (neuron) in the intermediate layer of the model M21 as a feature amount, and generate N-dimensional vector data corresponding to each object.

（決定部１３３）
決定部１３３は、各種情報を決定する。決定部１３３は、起点ノードの決定に用いるインデックス情報に基づいて、起点ノードを決定する。例えば、決定部１３３は、取得部１３１により取得された対象情報と、グラフ情報の検索の起点となる起点ベクトルに関する情報とに基づいて、起点ベクトルを決定する。例えば、決定部１３３は、対象情報と、グラフ情報の検索の起点ベクトルの決定の基準となる情報とに基づいて、起点ベクトルを決定する。また、決定部１３３は、起点ベクトルの決定に用いるインデックス情報に基づいて、起点ベクトルを決定する。また、決定部１３３は、木構造型のインデックス情報に基づいて、起点ベクトルを決定する。 (Determination unit 133)
The determination unit 133 determines various types of information. The determination unit 133 determines the origin node based on the index information used for determining the origin node. For example, the determination unit 133 determines the starting vector based on the target information acquired by the acquiring unit 131 and the information about the starting vector that is the starting point for searching the graph information. For example, the determination unit 133 determines the starting vector based on the target information and the information that serves as a reference for determining the starting vector for searching the graph information. Further, the determination unit 133 determines the origin vector based on the index information used for determining the origin vector. Further, the determination unit 133 determines the origin vector based on the tree-structured index information.

図１の例では、決定部１３３は、一の対象（クエリ）に対応する起点ベクトルを決定（特定）するために、インデックス情報ＩＮＤ１１を用いる。例えば、決定部１３３は、個人ＸのベクトルＶＤ１１に対応する起点ベクトルを決定（特定）するために、インデックス情報ＩＮＤ１１を用いる。決定部１３３は、ベクトルＶＤ１１とインデックス情報ＩＮＤ１１とを用いて、グラフ情報ＧＲ１１における起点ベクトルを決定する。 In the example of FIG. 1, the determination unit 133 uses the index information IND11 to determine (specify) the origin vector corresponding to one target (query). For example, the determination unit 133 uses the index information IND11 to determine (specify) the origin vector corresponding to the person X's vector VD11. The determining unit 133 determines the starting vector in the graph information GR11 using the vector VD11 and the index information IND11.

例えば、決定部１３３は、図１中のインデックス情報ＩＮＤ１１に示すような木構造型のインデックス情報を用いて、グラフ情報ＧＲ１１における起点ベクトルを決定する。図１の例では、決定部１３３は、ベクトルＶＤ１１を生成した後、インデックス情報ＩＮＤ１１を上から下へ辿ることにより、インデックス情報ＩＮＤ１１の近傍候補となる起点ベクトルを特定することにより、効率的に検索クエリ（一の対象）に対応する起点ベクトルを決定する。 For example, the determining unit 133 determines the origin vector in the graph information GR11 using tree-structured index information as indicated by the index information IND11 in FIG. In the example of FIG. 1, after generating the vector VD11, the determination unit 133 traces the index information IND11 from top to bottom to identify the origin vector that is a candidate for the neighborhood of the index information IND11, thereby efficiently searching. Determine the origin vector corresponding to the query (one object).

例えば、決定部１３３は、インデックス情報ＩＮＤ１１をルートＲＴからリーフノード（グラフ情報ＧＲ１１中のノード（ベクトル））まで辿ることにより、ベクトルＶＤ１１に対応する起点ベクトルを決定する。図１の例では、決定部１３３は、インデックス情報ＩＮＤ１１をルートＲＴからノードＮ４５１まで辿ることにより、ノードＮ４５１を起点ベクトルとして決定する。 For example, the determining unit 133 determines the origin vector corresponding to the vector VD11 by tracing the index information IND11 from the root RT to leaf nodes (nodes (vectors) in the graph information GR11). In the example of FIG. 1, the determining unit 133 determines the node N451 as the origin vector by tracing the index information IND11 from the root RT to the node N451.

（検索部１３４）
検索部１３４は、各種情報を検索する。例えば、検索部１３４は、取得部１３１により取得された情報を用いて、各種情報を検索する。例えば、検索部１３４は、各種情報を抽出する。例えば、検索部１３４は、取得部１３１により取得された情報を用いて、各種情報を抽出する。例えば、検索部１３４は、各種判定処理を行う。例えば、検索部１３４は、取得部１３１により取得された情報を用いて、各種情報を判定する。例えば、検索部１３４は、記憶部１２０に記憶された各種情報を検索する。例えば、取得部１３１は、対象情報記憶部１２１や、インデックス情報記憶部１２２や、グラフ情報記憶部１２３や、モデル情報記憶部１２４等に記憶された各種情報を検索する。例えば、検索部１３４は、各種情報を用いて算出処理を行う。例えば、検索部１３４は、ベクトルに関する情報を用いて、ベクトル間の距離を算出する。例えば、検索部１３４は、グラフ情報記憶部１２３に記憶された各ノード（ベクトル）間を連結するエッジの長さ（距離）の情報を用いてもよいし、各ノードのベクトル情報から各ノード（ベクトル）間を連結するエッジの長さ（距離）の情報を算出し、算出した長さ（距離）の情報を用いてもよい。 (Search unit 134)
The search unit 134 searches for various types of information. For example, the search unit 134 searches various information using the information acquired by the acquisition unit 131 . For example, the search unit 134 extracts various information. For example, the search unit 134 uses information acquired by the acquisition unit 131 to extract various types of information. For example, the search unit 134 performs various determination processes. For example, the search unit 134 uses information acquired by the acquisition unit 131 to determine various types of information. For example, the search unit 134 searches various information stored in the storage unit 120 . For example, the acquisition unit 131 searches various types of information stored in the target information storage unit 121, the index information storage unit 122, the graph information storage unit 123, the model information storage unit 124, and the like. For example, the search unit 134 performs calculation processing using various information. For example, the search unit 134 uses information about vectors to calculate the distance between the vectors. For example, the search unit 134 may use information on the length (distance) of edges connecting nodes (vectors) stored in the graph information storage unit 123, or may use vector information on each node ( It is also possible to calculate information on the length (distance) of the edge connecting the vectors) and use the information on the calculated length (distance).

検索部１３４は、取得部１３１により取得されたグラフ情報の複数のノードのうち、所定の基準に基づいて決定されたグラフ情報の検索の起点となる起点ノードを起点として、グラフ情報を検索することにより、複数の対象のうち、一の対象と所定の対応関係を有する対象である対応対象を抽出する。検索部１３４は、決定部１３３により決定された起点ノードを起点として、対応対象を抽出する。検索部１３４は、複数の対象のうち、一の対象と類似する対象を対応対象として抽出する。 The retrieving unit 134 retrieves graph information starting from a starting node, which is determined based on a predetermined criterion, among the plurality of nodes of the graph information acquired by the acquiring unit 131. extracts a corresponding target that has a predetermined correspondence relationship with one of the plurality of targets. The search unit 134 extracts corresponding objects starting from the starting node determined by the determining unit 133 . The search unit 134 extracts targets similar to one target from among the plurality of targets as corresponding targets.

検索部１３４は、取得部１３１により取得されたグラフ情報の複数のベクトルのうち、所定の基準に基づいて決定されたグラフ情報の検索の起点となる起点ベクトルを起点としてグラフ情報を検索することにより、複数の対象のうち、一の対象に類似する対象である類似対象（類似個人）を抽出する。検索部１３４は、各種情報を抽出する。例えば、検索部１３４は、決定部１３３により決定された起点ベクトルを起点としてグラフ情報を検索することにより、複数の対象のうち、一の対象に類似する対象である類似対象を抽出する。 The retrieving unit 134 retrieves the graph information from among the plurality of vectors of the graph information acquired by the acquiring unit 131, starting from the starting vector that is the starting point for searching the graph information determined based on a predetermined criterion. , to extract a similar object (similar individual) that is similar to one object from among a plurality of objects. The search unit 134 extracts various information. For example, the retrieval unit 134 retrieves graph information using the starting vector determined by the determination unit 133 as a starting point, thereby extracting a similar target that is similar to one target from among the plurality of targets.

図１の例では、検索部１３４は、個人Ｘに類似する対象（個人）を検索する。例えば、検索部１３４は、グラフ情報ＧＲ１１を検索することにより、個人Ｘの類似対象を抽出する。検索部１３４は、ノードＮ４５１の近傍に位置するノードを類似対象として抽出する。例えば、検索部１３４は、ノードＮ４５１からの距離が近いノードを類似対象として抽出する。例えば、検索部１３４は、ノードＮ４５１を起点として、エッジを辿ることにより、ノードＮ４５１から到達可能なノードを類似対象として抽出する。例えば、検索部１３４は、所定数（例えば、２個や１０個等）のノードを類似対象として抽出する。例えば、検索部１３４は、図１１に示すような検索処理により、個人Ｘの類似対象を抽出する。図１の例では、検索部１３４は、ノードＮ４５１を起点として、グラフ情報ＧＲ１１を探索することにより、ノードＮ４５１やノードＮ３５を類似対象として抽出する。 In the example of FIG. 1, the search unit 134 searches for objects (individuals) similar to person X. In FIG. For example, the search unit 134 extracts a similar target of the individual X by searching the graph information GR11. The search unit 134 extracts nodes located near the node N451 as similar objects. For example, the search unit 134 extracts a node that is close to the node N451 as a similar target. For example, the search unit 134 extracts nodes reachable from the node N451 as similar objects by tracing edges starting from the node N451. For example, the search unit 134 extracts a predetermined number (for example, 2 or 10) of nodes as similar targets. For example, the search unit 134 extracts a similar target of the individual X through search processing as shown in FIG. In the example of FIG. 1, the search unit 134 extracts the node N451 and the node N35 as similar objects by searching the graph information GR11 with the node N451 as the starting point.

検索部１３４は、複数の対象のうち、一の対象と非類似の対象を対応対象として抽出する。例えば、検索部１３４は、非類似連結グラフを用いて、一の対象と非類似の対象を抽出してもよい。非類似連結グラフにおける複数の対象のうち、一の対象と非類似の対象を対応対象として抽出してもよい。この場合、検索部１３４は、は、一の対象と類似性が低いという対応関係を有する対応対象（非類似対象）を抽出する。 The search unit 134 extracts a target dissimilar to one target from among the plurality of targets as a corresponding target. For example, the search unit 134 may extract a target that is dissimilar to one target using a dissimilar connection graph. A target that is dissimilar to one of the plurality of targets in the dissimilar connection graph may be extracted as a corresponding target. In this case, the search unit 134 extracts a corresponding target (non-similar target) having a correspondence relationship of low similarity with one target.

（提供部１３５）
提供部１３５は、各種情報を提供する。例えば、提供部１３５は、端末装置１０や情報提供装置５０に各種情報を送信する。例えば、提供部１３５は、端末装置１０や情報提供装置５０に各種情報を配信する。例えば、提供部１３５は、端末装置１０や情報提供装置５０に各種情報を提供する。提供部１３５は、検索部１３４により抽出された対応対象に基づいて、所定のサービスを提供する。提供部１３５は、対応対象に関する情報提供サービスを提供する。提供部１３５は、端末装置１０に対応対象に関する情報を提供する。提供部１３５は、検索部１３４により抽出された類似対象に基づいて、所定のサービスを提供する。また、提供部１３５は、類似対象に関する情報提供サービスを提供する。提供部１３５は、端末装置１０に類似対象に関する情報を提供する。 (Providing unit 135)
The providing unit 135 provides various information. For example, the providing unit 135 transmits various information to the terminal device 10 and the information providing device 50 . For example, the providing unit 135 distributes various types of information to the terminal device 10 and the information providing device 50 . For example, the providing unit 135 provides various information to the terminal device 10 and the information providing device 50 . The providing unit 135 provides a predetermined service based on the correspondence target extracted by the searching unit 134 . The providing unit 135 provides an information providing service regarding the correspondence target. The providing unit 135 provides the terminal device 10 with information on the correspondence target. The providing unit 135 provides a predetermined service based on the similar objects extracted by the searching unit 134 . In addition, the providing unit 135 provides an information providing service regarding similar objects. The providing unit 135 provides the terminal device 10 with information on the similar target.

例えば、提供部１３５は、クエリに対応するオブジェクトＩＤを検索結果として提供する。例えば、提供部１３５は、検索部１３４により選択されたオブジェクトＩＤを情報提供装置５０へ提供する。提供部１３５は、検索部１３４により選択されたオブジェクトＩＤをクエリに対応するベクトルを示す情報として情報提供装置５０に提供する。また、提供部１３５は、生成部１３２により生成されたモデルを外部の情報処理装置へ提供してもよい。 For example, the providing unit 135 provides object IDs corresponding to queries as search results. For example, the providing unit 135 provides the information providing device 50 with the object ID selected by the searching unit 134 . The providing unit 135 provides the information providing device 50 with the object ID selected by the searching unit 134 as information indicating the vector corresponding to the query. Further, the providing unit 135 may provide the model generated by the generating unit 132 to an external information processing device.

図１の例では、提供部１３５は、検索部１３４により抽出された類似対象に関する情報を提供する。例えば、提供部１３５は、ノードＮ４５１に対応する個人＃４５１や、ノードＮ３５に対応する個人＃３５を個人Ｘに類似する対象（個人）としてユーザＵ１が利用する端末装置１０に提供する。 In the example of FIG. 1 , the providing unit 135 provides information about similar targets extracted by the searching unit 134 . For example, the providing unit 135 provides individual #451 corresponding to the node N451 and individual #35 corresponding to the node N35 as objects (individuals) similar to the individual X to the terminal device 10 used by the user U1.

〔４．情報処理のフロー〕
次に、図８を用いて、実施形態に係る情報処理システム１による情報処理の手順について説明する。図８は、実施形態に係る情報処理の一例を示すフローチャートである。 [4. Information processing flow]
Next, the procedure of information processing by the information processing system 1 according to the embodiment will be described with reference to FIG. FIG. 8 is a flowchart illustrating an example of information processing according to the embodiment.

図８に示すように、情報処理装置１００は、人を要素とする一の対象に関する対象情報する（ステップＳ１０１）。図１の例では、情報処理装置１００は、端末装置１０から個人Ｘに関する対象情報ＴＤＴ１１を取得する。 As shown in FIG. 8, the information processing apparatus 100 obtains target information about one target whose element is a person (step S101). In the example of FIG. 1 , the information processing device 100 acquires target information TDT11 regarding the person X from the terminal device 10 .

情報処理装置１００は、人を要素とする対象に関するグラフ情報を取得する（ステップＳ１０２）。例えば、情報処理装置１００は、グラフ情報記憶部１２３から対象（個人）に関するグラフ情報ＧＲ１１を取得する。 The information processing apparatus 100 acquires graph information about an object whose element is a person (step S102). For example, the information processing apparatus 100 acquires the graph information GR11 regarding the target (individual) from the graph information storage unit 123 .

そして、情報処理装置１００は、モデルを用いて一の対象に関する対象情報からベクトルを生成する（ステップＳ１０３）。図１の例では、情報処理装置１００は、モデル情報記憶部１２４に記憶されたモデルＭ１を用いて、対象情報ＴＤＴ１１からベクトルＶＤ１１を生成する。 Then, the information processing apparatus 100 uses the model to generate a vector from the target information regarding one target (step S103). In the example of FIG. 1, the information processing apparatus 100 uses the model M1 stored in the model information storage unit 124 to generate the vector VD11 from the target information TDT11.

そして、情報処理装置１００は、生成したベクトルとインデックス情報を用いて起点ベクトルを決定する（ステップＳ１０４）。図１の例では、情報処理装置１００は、ベクトルＶＤ１１と、インデックス情報記憶部１２２に記憶されたインデックス情報ＩＮＤ１１とを用いて、起点ベクトルをノードＮ４５１に決定する。 Then, the information processing apparatus 100 determines the origin vector using the generated vector and the index information (step S104). In the example of FIG. 1, the information processing apparatus 100 uses the vector VD11 and the index information IND11 stored in the index information storage unit 122 to determine the origin vector to be the node N451.

そして、情報処理装置１００は、グラフ情報を検索することにより、一の対象と所定の対応関係を有する対象である対応対象を抽出する（ステップＳ１０５）。例えば、情報処理装置１００は、グラフ情報を検索することにより、一の対象と類似する対象を対応対象として抽出する。図１の例では、情報処理装置１００は、ノードＮ４５１を起点として、グラフ情報ＧＲ１１を探索することにより、ノードＮ４５１やノードＮ３５を類似対象として抽出する。 Then, the information processing apparatus 100 extracts a corresponding target that has a predetermined correspondence relationship with the one target by searching the graph information (step S105). For example, the information processing apparatus 100 extracts targets similar to one target as corresponding targets by searching graph information. In the example of FIG. 1, the information processing apparatus 100 extracts the node N451 and the node N35 as similar objects by searching the graph information GR11 with the node N451 as a starting point.

そして、情報処理装置１００は、抽出した類似対象に関する情報を提供する（ステップＳ１０６）。図１の例では、情報処理装置１００は、ノードＮ４５１に対応する個人＃４５１や、ノードＮ３５に対応する個人＃３５を個人Ｘに類似する対象（類似個人）としてユーザＵ１が利用する端末装置１０に提供する。 Then, the information processing apparatus 100 provides information on the extracted similar target (step S106). In the example of FIG. 1, the information processing apparatus 100 treats the individual #451 corresponding to the node N451 and the individual #35 corresponding to the node N35 as targets (similar individuals) similar to the individual X. provide to

〔５．生成処理のフロー〕
次に、図９を用いて、実施形態に係る情報処理システム１による生成処理の手順について説明する。図９は、実施形態に係る生成処理の一例を示すフローチャートである。 [5. Generation process flow]
Next, a procedure of generation processing by the information processing system 1 according to the embodiment will be described with reference to FIG. 9 . FIG. 9 is a flowchart illustrating an example of generation processing according to the embodiment.

図９に示すように、情報処理装置１００は、学習データを取得する（ステップＳ２０１）。例えば、情報処理装置１００は、対象情報記憶部１２１から学習データを取得する。例えば、情報処理装置１００は、対象情報記憶部１２１から対象情報ＴＤＴ１、ＴＤＴ２等を学習データとして取得する。 As shown in FIG. 9, the information processing apparatus 100 acquires learning data (step S201). For example, the information processing device 100 acquires learning data from the target information storage unit 121 . For example, the information processing apparatus 100 acquires the target information TDT1, TDT2, etc. from the target information storage unit 121 as learning data.

その後、情報処理装置１００は、学習データに基づきモデルを生成する（ステップＳ２０２）。例えば、情報処理装置１００は、対象情報記憶部１２１から学習データを用いてモデルＭ１を生成する。例えば、情報処理装置１００は、入力層に入力される情報（対象情報）と同様の情報（対象情報）を出力層から出力するようにモデルＭ１を生成する。例えば、情報処理装置１００は、対象（個人）の対象情報（個人情報）を入力とするオートエンコーダとしてのモデルＭ１を生成する。 After that, the information processing apparatus 100 generates a model based on the learning data (step S202). For example, the information processing apparatus 100 uses learning data from the target information storage unit 121 to generate the model M1. For example, the information processing apparatus 100 generates the model M1 so that information (target information) similar to information (target information) input to the input layer is output from the output layer. For example, the information processing apparatus 100 generates a model M1 as an autoencoder that receives target information (personal information) of a target (individual) as input.

〔６．検索例〕
ここで、上述したグラフ情報を用いた検索の一例を示す。なお、グラフ情報（グラフデータ）を用いた検索は下記に限らず、種々の手順により行われてもよい。この点について、図１１を一例として説明する。図１１は、グラフデータ（グラフ情報）を用いた検索処理の一例を示すフローチャートである。以下に説明する検索処理は、例えば情報処理装置１００の検索部１３４によって行われる。また、以下でいうオブジェクトは、ベクトルやノードと読み替えてもよい。なお、以下では、情報処理装置１００が検索処理を行うものとして説明するが、検索処理は他の装置により行われてもよい。例えば、情報処理装置１００は、検索クエリとして、一の対象の対象情報から生成されたベクトルデータを用いる。例えば、情報処理装置１００は、一の対象の対象情報から生成されたベクトルデータとインデックス情報とに基づいて決定された起点ベクトルを起点としてグラフデータを検索する。図１の例では、情報処理装置１００は、個人ＸのベクトルＶＤ１１とインデックス情報ＩＮＤ１１とに基づいて決定された起点ベクトルであるノードＮ４５１を起点としてグラフ情報ＧＲ１１を検索する。 [6. Search example]
Here, an example of retrieval using the graph information described above will be shown. The search using graph information (graph data) is not limited to the following, and may be performed by various procedures. This point will be described with reference to FIG. 11 as an example. FIG. 11 is a flowchart illustrating an example of search processing using graph data (graph information). The search processing described below is performed by the search unit 134 of the information processing apparatus 100, for example. Also, the object referred to below may be read as a vector or a node. In the following description, the information processing apparatus 100 performs the search process, but the search process may be performed by another apparatus. For example, the information processing apparatus 100 uses vector data generated from target information of one target as a search query. For example, the information processing apparatus 100 searches for graph data starting from an origin vector determined based on vector data generated from object information of one object and index information. In the example of FIG. 1, the information processing apparatus 100 searches for the graph information GR11 starting from the node N451, which is the starting vector determined based on the vector VD11 of the person X and the index information IND11.

ここでは、近傍オブジェクト集合Ｎ（Ｇ，ｙ）は、ノードｙに付与されているエッジにより関連付けられている近傍のオブジェクトの集合である。「Ｇ」は、所定のグラフデータ（例えば、グラフ情報ＧＲ１１等）であってもよい。例えば、情報処理装置１００は、ｋ近傍検索処理を実行する。 Here, the neighborhood object set N(G, y) is the set of neighborhood objects associated by the edge attached to node y. "G" may be predetermined graph data (for example, graph information GR11, etc.). For example, the information processing apparatus 100 executes k-nearest neighbor search processing.

例えば、情報処理装置１００は、超球の半径ｒを∞（無限大）に設定し（ステップＳ３００）、既存のオブジェクト集合から部分集合Ｓを抽出する（ステップＳ３０１）。例えば、情報処理装置１００は、ルートノード（起点ベクトル）として選択されたオブジェクト（ノード）を部分集合Ｓとして抽出してもよい。図１の例では、情報処理装置１００は、起点ベクトルであるノードＮ４５１等を部分集合Ｓとして抽出してもよい。また、例えば、超球とは、検索範囲を示す仮想的な球である。なお、ステップＳ３０１において抽出されたオブジェクト集合Ｓに含まれるオブジェクトは、同時に検索結果のオブジェクト集合Ｒの初期集合にも含められる。 For example, the information processing apparatus 100 sets the radius r of the hypersphere to ∞ (infinity) (step S300), and extracts the subset S from the existing object set (step S301). For example, the information processing apparatus 100 may extract, as the subset S, an object (node) selected as a root node (origin vector). In the example of FIG. 1, the information processing apparatus 100 may extract the node N451, which is the origin vector, etc., as the subset S. In the example of FIG. Also, for example, a hypersphere is a virtual sphere that indicates a search range. The objects included in the object set S extracted in step S301 are also included in the initial set of the object set R of the search results.

次に、情報処理装置１００は、オブジェクト集合Ｓに含まれるオブジェクトの中で、検索クエリオブジェクトをｙとするとオブジェクトｙとの距離が最も短いオブジェクトを抽出し、オブジェクトｓとする（ステップＳ３０２）。図１の例では、情報処理装置１００は、オブジェクト集合Ｓに含まれるオブジェクトの中で、検索クエリオブジェクトであるベクトルＶＤ１１との距離が最も短いオブジェクトを抽出し、オブジェクトｓとする。例えば、情報処理装置１００は、オブジェクト集合Ｓに含まれるオブジェクトの中で、ベクトルＶＤ１１との距離が最も短いノードＮ４５１を抽出し、オブジェクトｓとする。例えば、情報処理装置１００は、ルートノード（起点ベクトル）として選択されたオブジェクト（ノード）のみがオブジェクト集合Ｓの要素の場合には、結果的にルートノード（起点ベクトル）がオブジェクトｓとして抽出される。次に、情報処理装置１００は、オブジェクトｓをオブジェクト集合Ｓから除外する（ステップＳ３０３）。 Next, the information processing apparatus 100 extracts an object having the shortest distance from the object y, where y is the search query object, among the objects included in the object set S, and sets it as an object s (step S302). In the example of FIG. 1, the information processing apparatus 100 extracts the object with the shortest distance from the vector VD11, which is the search query object, from among the objects included in the object set S, and sets it as the object s. For example, the information processing apparatus 100 extracts the node N451 having the shortest distance from the vector VD11 among the objects included in the object set S, and sets it as the object s. For example, when only the object (node) selected as the root node (originating vector) is an element of the object set S, the information processing apparatus 100 extracts the root node (originating vector) as the object s. . Next, the information processing apparatus 100 excludes the object s from the object set S (step S303).

次に、情報処理装置１００は、オブジェクトｓとオブジェクトｙとの距離ｄ（ｓ，ｙ）がｒ（１＋ε）を超えるか否かを判定する（ステップＳ３０４）。ここで、εは拡張要素であり、ｒ（１＋ε）は、探索範囲（この範囲内のノードのみを探索する。検索範囲よりも大きくすることで精度を高めることができる）の半径を示す値である。オブジェクトｓとオブジェクトｙとの距離ｄ（ｓ，ｙ）がｒ（１＋ε）を超える場合（ステップＳ３０４：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｒをオブジェクトｙの近傍オブジェクト集合として出力し（ステップＳ３０５）、処理を終了する。 Next, the information processing apparatus 100 determines whether or not the distance d(s, y) between the object s and the object y exceeds r(1+ε) (step S304). Here, ε is an expansion factor, and r(1+ε) is a value that indicates the radius of the search range (only nodes within this range are searched; making it larger than the search range increases the accuracy). be. When the distance d(s, y) between the object s and the object y exceeds r(1+ε) (step S304: Yes), the information processing device 100 outputs the object set R as a neighborhood object set of the object y (step S305), the process ends.

オブジェクトｓと検索クエリオブジェクトｙとの距離ｄ（ｓ，ｙ）がｒ（１＋ε）を超えない場合（ステップＳ３０４：Ｎｏ）、情報処理装置１００は、オブジェクトｓの近傍オブジェクト集合Ｎ（Ｇ，ｓ）の要素であるオブジェクトの中からオブジェクト集合Ｃに含まれないオブジェクトを一つ選択し、選択したオブジェクトｕを、オブジェクト集合Ｃに格納する（ステップＳ３０６）。オブジェクト集合Ｃは、重複検索を回避するために便宜上設けられるものであり、処理開始時には空集合に設定される。 If the distance d(s, y) between the object s and the search query object y does not exceed r(1+ε) (step S304: No), the information processing apparatus 100 calculates the neighborhood object set N(G, s) of the object s. One object not included in the object set C is selected from the objects that are elements of , and the selected object u is stored in the object set C (step S306). The object set C is provided for convenience in order to avoid duplicate searches, and is set to an empty set at the start of processing.

次に、情報処理装置１００は、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ（１＋ε）以下であるか否かを判定する（ステップＳ３０７）。オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ（１＋ε）以下である場合（ステップＳ３０７：Ｙｅｓ）、情報処理装置１００は、オブジェクトｕをオブジェクト集合Ｓに追加する（ステップＳ３０８）。また、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ（１＋ε）以下ではない場合（ステップＳ３０７：Ｎｏ）、情報処理装置１００は、ステップＳ３０９の判定（処理）を行う。 Next, the information processing apparatus 100 determines whether or not the distance d(u, y) between the object u and the object y is r(1+ε) or less (step S307). If the distance d(u, y) between the object u and the object y is less than or equal to r(1+ε) (step S307: Yes), the information processing apparatus 100 adds the object u to the object set S (step S308). If the distance d(u, y) between the object u and the object y is not less than r(1+ε) (step S307: No), the information processing apparatus 100 performs determination (processing) in step S309.

次に、情報処理装置１００は、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ以下であるか否かを判定する（ステップＳ３０９）。オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒを超える場合（ステップＳ３０９：Ｎｏ）、情報処理装置１００は、ステップＳ３１５の判定（処理）を行う。また、オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ以下ではない場合（ステップＳ３０９：Ｎｏ）、情報処理装置１００は、ステップＳ３１５の判定（処理）を行う。 Next, the information processing apparatus 100 determines whether or not the distance d(u, y) between the object u and the object y is less than or equal to r (step S309). When the distance d(u, y) between the object u and the object y exceeds r (step S309: No), the information processing apparatus 100 performs determination (processing) of step S315. If the distance d(u, y) between the object u and the object y is not equal to or less than r (step S309: No), the information processing apparatus 100 performs determination (processing) in step S315.

オブジェクトｕとオブジェクトｙとの距離ｄ（ｕ，ｙ）がｒ以下である場合（ステップＳ３０９：Ｙｅｓ）、情報処理装置１００は、オブジェクトｕをオブジェクト集合Ｒに追加する（ステップＳ３１０）。そして、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクト数がｋｓを超えるか否かを判定する（ステップＳ３１１）。所定数ｋｓは、任意に定められる自然数である。例えば、ｋｓ＝２やｋｓ＝１０等の種々の設定であってもよい。オブジェクト集合Ｒに含まれるオブジェクト数がｋｓを超えない場合（ステップＳ３１１：Ｎｏ）、情報処理装置１００は、ステップＳ３１３の判定（処理）を行う。 If the distance d(u, y) between the object u and the object y is less than or equal to r (step S309: Yes), the information processing apparatus 100 adds the object u to the object set R (step S310). Then, the information processing apparatus 100 determines whether or not the number of objects included in the object set R exceeds ks (step S311). The predetermined number ks is an arbitrarily determined natural number. For example, various settings such as ks=2 and ks=10 may be used. When the number of objects included in the object set R does not exceed ks (step S311: No), the information processing apparatus 100 performs determination (processing) in step S313.

オブジェクト集合Ｒに含まれるオブジェクト数がｋｓを超える場合（ステップＳ３１１：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクトの中でオブジェクトｙとの距離が最も長い（遠い）オブジェクトを、オブジェクト集合Ｒから除外する（ステップＳ３１２）。 If the number of objects included in the object set R exceeds ks (step S311: Yes), the information processing apparatus 100 selects the object that is the longest (farthest) from the object y among the objects included in the object set R, It is excluded from the object set R (step S312).

次に、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクト数がｋｓと一致するか否かを判定する（ステップＳ３１３）。オブジェクト集合Ｒに含まれるオブジェクト数がｋｓと一致しない場合（ステップＳ３１３：Ｎｏ）、情報処理装置１００は、ステップＳ３１５の判定（処理）を行う。また、オブジェクト集合Ｒに含まれるオブジェクト数がｋｓと一致する場合（ステップＳ３１３：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクトの中でオブジェクトｙとの距離が最も長い（遠い）オブジェクトと、オブジェクトｙとの距離を、新たなｒに設定する（ステップＳ３１４）。 Next, the information processing apparatus 100 determines whether or not the number of objects included in the object set R matches ks (step S313). If the number of objects included in the object set R does not match ks (step S313: No), the information processing apparatus 100 performs determination (processing) in step S315. If the number of objects included in the object set R matches ks (step S313: Yes), the information processing apparatus 100 determines that the object y has the longest (farthest) distance among the objects included in the object set R. The distance between the object and object y is set to a new r (step S314).

そして、情報処理装置１００は、オブジェクトｓの近傍オブジェクト集合Ｎ（Ｇ，ｓ）の要素であるオブジェクトから全てのオブジェクトを選択してオブジェクト集合Ｃに格納し終えたか否かを判定する（ステップＳ３１５）。オブジェクトｓの近傍オブジェクト集合Ｎ（Ｇ，ｓ）の要素であるオブジェクトから全てのオブジェクトを選択してオブジェクト集合Ｃに格納し終えていない場合（ステップＳ３１５：Ｎｏ）、情報処理装置１００は、ステップＳ３０６に戻って処理を繰り返す。 Then, the information processing apparatus 100 determines whether or not all objects have been selected from objects that are elements of the neighborhood object set N(G, s) of the object s and stored in the object set C (step S315). . If all the objects that are elements of the neighborhood object set N(G, s) of the object s have not been selected and stored in the object set C (step S315: No), the information processing apparatus 100 proceeds to step S306. and repeat the process.

オブジェクトｓの近傍オブジェクト集合Ｎ（Ｇ，ｓ）の要素であるオブジェクトから全てのオブジェクトを選択してオブジェクト集合Ｃに格納し終えた場合（ステップＳ３１５：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｓが空集合であるか否かを判定する（ステップＳ３１６）。オブジェクト集合Ｓが空集合でない場合（ステップＳ３１６：Ｎｏ）、情報処理装置１００は、ステップＳ３０２に戻って処理を繰り返す。また、オブジェクト集合Ｓが空集合である場合（ステップＳ３１６：Ｙｅｓ）、情報処理装置１００は、オブジェクト集合Ｒを出力し、処理を終了する（ステップＳ３１７）。例えば、情報処理装置１００は、オブジェクト集合Ｒに含まれるオブジェクト（ノード）を検索クエリ（入力オブジェクトｙ）に対応する検索結果として、検索を行った端末装置１０等へ提供してもよい。図１の例では、情報処理装置１００は、オブジェクト集合Ｒに含まれるノードＮ４５１やノードＮ３５を検索クエリ（個人ＸのベクトルＶＤ１１）に対応する検索結果として、検索を行った端末装置１０等へ提供してもよい。例えば、情報処理装置１００は、ノードＮ４５１に対応する個人＃４５１や、ノードＮ３５に対応する個人＃３５を個人Ｘに類似する対象（個人）としてユーザＵ１が利用する端末装置１０に提供する。 When all objects are selected from the objects that are elements of the neighborhood object set N(G, s) of the object s and stored in the object set C (step S315: Yes), the information processing apparatus 100 returns the object set S is an empty set (step S316). If the object set S is not an empty set (step S316: No), the information processing apparatus 100 returns to step S302 and repeats the process. If the object set S is an empty set (step S316: Yes), the information processing apparatus 100 outputs the object set R and ends the process (step S317). For example, the information processing apparatus 100 may provide objects (nodes) included in the object set R as search results corresponding to the search query (input object y) to the terminal apparatus 10 or the like that performed the search. In the example of FIG. 1, the information processing apparatus 100 provides the node N451 and the node N35 included in the object set R as search results corresponding to the search query (the vector VD11 of the individual X) to the terminal device 10 or the like that performed the search. You may For example, the information processing apparatus 100 provides a person #451 corresponding to the node N451 and a person #35 corresponding to the node N35 as objects (individuals) similar to the person X to the terminal device 10 used by the user U1.

〔７．効果〕
上述してきたように、実施形態に係る情報処理装置１００は、取得部１３１と、検索部１３４とを有する。取得部１３１は、人を要素とする複数の対象の各々に対応する複数のノードが、複数の対象の類似性に応じて連結されたグラフ情報と、一の対象に関する対象情報を取得する。検索部１３４は、取得部１３１により取得されたグラフ情報の複数のノードのうち、所定の基準に基づいて決定されたグラフ情報の検索の起点となる起点ノードを起点として、グラフ情報を検索することにより、複数の対象のうち、一の対象と所定の対応関係を有する対象である対応対象を抽出する。 [7. effect〕
As described above, the information processing apparatus 100 according to the embodiment has the acquisition unit 131 and the search unit 134 . The acquiring unit 131 acquires graph information in which a plurality of nodes corresponding to each of a plurality of targets having people as elements are connected according to the similarity of the plurality of targets, and target information about one target. The retrieving unit 134 retrieves graph information starting from a starting node, which is determined based on a predetermined criterion, among the plurality of nodes of the graph information acquired by the acquiring unit 131. extracts a corresponding target that has a predetermined correspondence relationship with one of the plurality of targets.

このように、実施形態に係る情報処理装置１００は、起点ベクトルを起点としてグラフ情報を検索し、複数の対象のうち、一の対象と所定の対応関係を有する対象である対応対象を抽出することにより、所望の対象と所定の対応関係を有する対象を適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment searches the graph information with the starting vector as the starting point, and extracts a corresponding target having a predetermined correspondence relationship with one of the plurality of targets. , it is possible to appropriately extract a target having a predetermined correspondence with a desired target.

また、実施形態に係る情報処理装置１００は、決定部１３３を有する。決定部１３３は、起点ノードの決定に用いるインデックス情報に基づいて、起点ノードを決定する。検索部１３４は、決定部１３３により決定された起点ノードを起点として、対応対象を抽出する。 Further, the information processing apparatus 100 according to the embodiment has a determination unit 133 . The determination unit 133 determines the origin node based on the index information used for determining the origin node. The search unit 134 extracts corresponding objects starting from the starting node determined by the determining unit 133 .

このように、実施形態に係る情報処理装置１００は、起点ベクトルの決定に用いるインデックス情報に基づいて、起点ベクトルを決定することにより、所望の対象と所定の対応関係を有する対象を適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment appropriately extracts a target having a predetermined correspondence with a desired target by determining the starting vector based on the index information used for determining the starting vector. be able to.

また、実施形態に係る情報処理装置１００において、決定部１３３は、木構造型のインデックス情報に基づいて、起点ベクトルを決定する。 Further, in the information processing apparatus 100 according to the embodiment, the determining unit 133 determines the origin vector based on the tree-structured index information.

このように、実施形態に係る情報処理装置１００は、木構造型のインデックス情報に基づいて、起点ベクトルを決定することにより、所望の対象と所定の対応関係を有する対象を適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment can appropriately extract a target having a predetermined correspondence with a desired target by determining the origin vector based on the tree-structured index information. can.

また、実施形態に係る情報処理装置１００において、取得部１３１は、複数の対象の各々の特徴を示す複数のベクトルが類似性に応じて連結されたグラフ情報を取得する。 Further, in the information processing apparatus 100 according to the embodiment, the acquisition unit 131 acquires graph information in which a plurality of vectors representing characteristics of each of a plurality of targets are connected according to similarity.

このように、実施形態に係る情報処理装置１００は、複数の対象の各々の特徴を示す複数のベクトルが類似性に応じて連結されたグラフ情報を取得することにより、所望の対象と所定の対応関係を有する対象を適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment obtains graph information in which a plurality of vectors representing characteristics of each of a plurality of targets are connected according to similarity, thereby obtaining a desired target and a predetermined correspondence. Objects with relationships can be appropriately extracted.

また、実施形態に係る情報処理装置１００において、取得部１３１は、所定のモデルを用いて複数の対象の各々から抽出された特徴量を要素とする複数のベクトルが、類似性に応じて連結されたグラフ情報を取得する。 Further, in the information processing apparatus 100 according to the embodiment, the acquisition unit 131 connects a plurality of vectors whose elements are feature amounts extracted from each of a plurality of targets using a predetermined model according to similarity. get graph information.

このように、実施形態に係る情報処理装置１００は、所定のモデルを用いて複数の対象の各々から抽出された特徴量を要素とする複数のベクトルが、類似性に応じて連結されたグラフ情報を取得することにより、所望の対象と所定の対応関係を有する対象を適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment provides graph information in which a plurality of vectors whose elements are feature amounts extracted from each of a plurality of targets using a predetermined model are connected according to similarity. By obtaining , it is possible to appropriately extract an object having a predetermined correspondence with a desired object.

また、実施形態に係る情報処理装置１００において、取得部１３１は、複数の対象に関する情報を所定のモデルに入力することにより、抽出される複数の対象の各々の特徴量を要素とする複数のベクトルが、類似性に応じて連結されたグラフ情報を取得する。 Further, in the information processing apparatus 100 according to the embodiment, the acquisition unit 131 inputs information about a plurality of targets into a predetermined model, and obtains a plurality of vectors whose elements are feature amounts of each of the extracted targets. obtains graph information connected according to similarity.

このように、実施形態に係る情報処理装置１００は、複数の対象に関する情報を所定のモデルに入力することにより、抽出される複数の対象の各々の特徴量を要素とする複数のベクトルが、類似性に応じて連結されたグラフ情報を取得することにより、所望の対象と所定の対応関係を有する対象を適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment inputs information about a plurality of targets into a predetermined model, so that a plurality of vectors whose elements are feature amounts of each of the plurality of targets to be extracted are similar to each other. By acquiring the graph information linked according to the gender, it is possible to appropriately extract the target having a predetermined correspondence with the desired target.

また、実施形態に係る情報処理装置１００は、提供部１３５を有する。提供部１３５は、検索部１３４により抽出された対応対象に基づいて、所定のサービスを提供する。 The information processing apparatus 100 according to the embodiment also has a providing unit 135 . The providing unit 135 provides a predetermined service based on the correspondence target extracted by the searching unit 134 .

このように、実施形態に係る情報処理装置１００は、抽出した対応対象（例えば類似対象）に基づいて、所定のサービスを提供することにより、所望の対象と所定の対応関係を有する対象に関する情報を用いたサービスを適切に提供することができる。 As described above, the information processing apparatus 100 according to the embodiment provides a predetermined service based on the extracted corresponding target (for example, a similar target), thereby obtaining information about a target having a predetermined correspondence relationship with a desired target. It is possible to appropriately provide the services used.

また、実施形態に係る情報処理装置１００において、提供部１３５は、対応対象に関する情報提供サービスを提供する。 In addition, in the information processing apparatus 100 according to the embodiment, the providing unit 135 provides an information providing service regarding correspondence targets.

このように、実施形態に係る情報処理装置１００は、対応対象に関する情報提供サービスを提供することにより、所望の対象と所定の対応関係を有する対象に関する情報を用いたサービスを適切に提供することができる。 As described above, the information processing apparatus 100 according to the embodiment can appropriately provide a service using information about an object having a predetermined correspondence relationship with a desired object by providing an information providing service about the corresponding object. can.

また、実施形態に係る情報処理装置１００において、取得部１３１は、ユーザが利用する端末装置１０から一の対象に関する対象情報を取得する。提供部１３５は、端末装置１０に対応対象に関する情報を提供する。 Further, in the information processing apparatus 100 according to the embodiment, the acquisition unit 131 acquires target information regarding one target from the terminal device 10 used by the user. The providing unit 135 provides the terminal device 10 with information on the correspondence target.

このように、実施形態に係る情報処理装置１００は、ユーザが利用する端末装置１０から一の対象に関する対象情報を取得し、端末装置１０に対応対象に関する情報を提供することにより、対応対象（例えば類似対象）に関する情報を用いたサービスを適切に提供することができる。 As described above, the information processing apparatus 100 according to the embodiment acquires target information related to one target from the terminal device 10 used by the user, and provides the terminal device 10 with information related to the target to be handled. It is possible to appropriately provide services using information on similar objects).

また、実施形態に係る情報処理装置１００において、取得部１３１は、各々が個人である複数の対象の各々に対応する複数のノードが連結されたグラフ情報と、一の個人である一の対象に関する対象情報を取得する。 Further, in the information processing apparatus 100 according to the embodiment, the acquisition unit 131 obtains graph information in which a plurality of nodes each corresponding to each of a plurality of subjects, each of which is an individual, are connected, and Get target information.

このように、実施形態に係る情報処理装置１００は、各々が個人である複数の対象の各々に対応する複数のノードが連結されたグラフ情報と、一の個人である一の対象に関する対象情報を取得することにより、所望の個人と所定の対応関係を有する個人を適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment combines graph information in which a plurality of nodes each corresponding to each of a plurality of subjects, each of which is an individual, are connected, and target information regarding one subject, which is one individual. By acquiring the information, it is possible to appropriately extract individuals having a predetermined correspondence with the desired individual.

また、実施形態に係る情報処理装置１００において、取得部１３１は、各々が複数人を含むグループである複数の対象の各々に対応する複数のノードが連結されたグラフ情報と、一のグループである一の対象に関する対象情報を取得する。 In addition, in the information processing apparatus 100 according to the embodiment, the acquisition unit 131 obtains graph information in which a plurality of nodes corresponding to each of a plurality of targets, each of which is a group including a plurality of people, is connected, and Get object information about one object.

このように、実施形態に係る情報処理装置１００は、各々が複数人を含むグループである複数の対象の各々に対応する複数のノードが連結されたグラフ情報と、一のグループである一の対象に関する対象情報を取得することにより、所望のグループと所定の対応関係を有するグループを適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment provides graph information in which a plurality of nodes corresponding to each of a plurality of targets, each of which is a group including a plurality of people, is connected, and one target that is one group. By acquiring the target information regarding, it is possible to appropriately extract a group having a predetermined correspondence with a desired group.

また、実施形態に係る情報処理装置１００において、検索部１３４は、複数の対象のうち、一の対象と類似する対象を対応対象として抽出する。 Further, in the information processing apparatus 100 according to the embodiment, the search unit 134 extracts targets similar to one target from among the plurality of targets as corresponding targets.

このように、実施形態に係る情報処理装置１００は、複数の対象のうち、一の対象と類似する対象を対応対象として抽出することにより、所望の対象と類似する対象（類似対象）を適切に抽出することができる。この場合、情報処理装置１００は、所望の対象と類似性が高い対象（類似対象）を適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment extracts a target similar to one target from among a plurality of targets as a corresponding target, thereby appropriately selecting a target similar to a desired target (similar target). can be extracted. In this case, the information processing apparatus 100 can appropriately extract targets (similar targets) that are highly similar to the desired target.

また、実施形態に係る情報処理装置１００において、検索部１３４は、複数の対象のうち、一の対象と非類似の対象を対応対象として抽出する。 Further, in the information processing apparatus 100 according to the embodiment, the search unit 134 extracts a target dissimilar to one target from among the plurality of targets as a corresponding target.

このように、実施形態に係る情報処理装置１００は、複数の対象のうち、一の対象と非類似の対象を対応対象として抽出することにより、所望の対象と類似しない対象（非類似対象）を適切に抽出することができる。この場合、情報処理装置１００は、所望の対象と類似性が低い対象（非類似対象）を適切に抽出することができる。 As described above, the information processing apparatus 100 according to the embodiment extracts a target dissimilar to one target from among a plurality of targets as a corresponding target, thereby extracting a target dissimilar to the desired target (dissimilar target). Can be properly extracted. In this case, the information processing apparatus 100 can appropriately extract targets having low similarity to the desired target (dissimilar targets).

〔８．ハードウェア構成〕
上述してきた実施形態に係る情報処理装置１００は、例えば図１３に示すような構成のコンピュータ１０００によって実現される。図１３は、情報処理装置の機能を実現するコンピュータの一例を示すハードウェア構成図である。コンピュータ１０００は、ＣＰＵ１１００、ＲＡＭ１２００、ＲＯＭ（Read Only Memory）１３００、ＨＤＤ（Hard Disk Drive）１４００、通信インターフェイス（Ｉ／Ｆ）１５００、入出力インターフェイス（Ｉ／Ｆ）１６００、及びメディアインターフェイス（Ｉ／Ｆ）１７００を有する。 [8. Hardware configuration]
The information processing apparatus 100 according to the embodiments described above is implemented by a computer 1000 configured as shown in FIG. 13, for example. FIG. 13 is a hardware configuration diagram showing an example of a computer that implements the functions of the information processing apparatus. The computer 1000 includes a CPU 1100, a RAM 1200, a ROM (Read Only Memory) 1300, a HDD (Hard Disk Drive) 1400, a communication interface (I/F) 1500, an input/output interface (I/F) 1600, and a media interface (I/F). ) 1700.

ＣＰＵ１１００は、ＲＯＭ１３００またはＨＤＤ１４００に格納されたプログラムに基づいて動作し、各部の制御を行う。ＲＯＭ１３００は、コンピュータ１０００の起動時にＣＰＵ１１００によって実行されるブートプログラムや、コンピュータ１０００のハードウェアに依存するプログラム等を格納する。 The CPU 1100 operates based on programs stored in the ROM 1300 or HDD 1400 and controls each section. The ROM 1300 stores a boot program executed by the CPU 1100 when the computer 1000 is started up, a program depending on the hardware of the computer 1000, and the like.

ＨＤＤ１４００は、ＣＰＵ１１００によって実行されるプログラム、及び、かかるプログラムによって使用されるデータ等を格納する。通信インターフェイス１５００は、ネットワークＮを介して他の機器からデータを受信してＣＰＵ１１００へ送り、ＣＰＵ１１００が生成したデータをネットワークＮを介して他の機器へ送信する。 The HDD 1400 stores programs executed by the CPU 1100, data used by the programs, and the like. Communication interface 1500 receives data from other devices via network N, sends the data to CPU 1100, and transmits data generated by CPU 1100 to other devices via network N. FIG.

ＣＰＵ１１００は、入出力インターフェイス１６００を介して、ディスプレイやプリンタ等の出力装置、及び、キーボードやマウス等の入力装置を制御する。ＣＰＵ１１００は、入出力インターフェイス１６００を介して、入力装置からデータを取得する。また、ＣＰＵ１１００は、生成したデータを入出力インターフェイス１６００を介して出力装置へ出力する。 The CPU 1100 controls output devices such as displays and printers, and input devices such as keyboards and mice, through an input/output interface 1600 . CPU 1100 acquires data from an input device via input/output interface 1600 . CPU 1100 also outputs the generated data to an output device via input/output interface 1600 .

メディアインターフェイス１７００は、記録媒体１８００に格納されたプログラムまたはデータを読み取り、ＲＡＭ１２００を介してＣＰＵ１１００に提供する。ＣＰＵ１１００は、かかるプログラムを、メディアインターフェイス１７００を介して記録媒体１８００からＲＡＭ１２００上にロードし、ロードしたプログラムを実行する。記録媒体１８００は、例えばＤＶＤ（Digital Versatile Disc）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto-Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等である。 Media interface 1700 reads programs or data stored in recording medium 1800 and provides them to CPU 1100 via RAM 1200 . CPU 1100 loads such a program from recording medium 1800 onto RAM 1200 via media interface 1700, and executes the loaded program. The recording medium 1800 is, for example, an optical recording medium such as a DVD (Digital Versatile Disc) or a PD (Phase change rewritable disc), a magneto-optical recording medium such as an MO (Magneto-Optical disk), a tape medium, a magnetic recording medium, or a semiconductor memory. etc.

例えば、コンピュータ１０００が実施形態に係る情報処理装置１００として機能する場合、コンピュータ１０００のＣＰＵ１１００は、ＲＡＭ１２００上にロードされたプログラムまたはデータ（例えば、モデルＭ１（モデルデータＭＤＴ１））を実行することにより、制御部１３０の機能を実現する。コンピュータ１０００のＣＰＵ１１００は、これらのプログラムまたはデータ（例えば、モデルＭ１（モデルデータＭＤＴ１））を記録媒体１８００から読み取って実行するが、他の例として、他の装置からネットワークＮを介してこれらのプログラムを取得してもよい。 For example, when the computer 1000 functions as the information processing apparatus 100 according to the embodiment, the CPU 1100 of the computer 1000 executes a program or data (for example, model M1 (model data MDT1)) loaded on the RAM 1200 to It implements the functions of the control unit 130 . CPU 1100 of computer 1000 reads these programs or data (for example, model M1 (model data MDT1)) from recording medium 1800 and executes them. may be obtained.

以上、本願の実施形態のいくつかを図面に基づいて詳細に説明したが、これらは例示であり、発明の開示の行に記載の態様を始めとして、当業者の知識に基づいて種々の変形、改良を施した他の形態で本発明を実施することが可能である。 As described above, some of the embodiments of the present application have been described in detail based on the drawings, but these are merely examples, and various modifications, It is possible to carry out the invention in other forms with modifications.

〔９．その他〕
また、上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文書中や図面中で示した処理手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。例えば、各図に示した各種情報は、図示した情報に限られない。 [9. others〕
Further, among the processes described in the above embodiments, all or part of the processes described as being automatically performed can be manually performed, or the processes described as being performed manually can be performed manually. All or part of this can also be done automatically by known methods. In addition, information including processing procedures, specific names, various data and parameters shown in the above documents and drawings can be arbitrarily changed unless otherwise specified. For example, the various information shown in each drawing is not limited to the illustrated information.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 Also, each component of each device illustrated is functionally conceptual, and does not necessarily need to be physically configured as illustrated. In other words, the specific form of distribution and integration of each device is not limited to the one shown in the figure, and all or part of them can be functionally or physically distributed and integrated in arbitrary units according to various loads and usage conditions. Can be integrated and configured.

また、上述してきた各実施形態に記載された各処理は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。 Further, each process described in each embodiment described above can be appropriately combined within a range in which the contents of the process are not inconsistent.

また、上述してきた「部（section、module、unit）」は、「手段」や「回路」などに読み替えることができる。例えば、取得部は、取得手段や取得回路に読み替えることができる。 Also, the above-mentioned "section, module, unit" can be read as "means" or "circuit". For example, the acquisition unit can be read as acquisition means or an acquisition circuit.

１情報処理システム
１００情報処理装置
１２１対象情報記憶部
１２２インデックス情報記憶部
１２３グラフ情報記憶部
１２４モデル情報記憶部
１３０制御部
１３１取得部
１３２生成部
１３３決定部
１３４検索部
１３５提供部
１０端末装置
５０情報提供装置
Ｎネットワーク 1 information processing system 100 information processing device 121 target information storage unit 122 index information storage unit 123 graph information storage unit 124 model information storage unit 130 control unit 131 acquisition unit 132 generation unit 133 determination unit 134 search unit 135 provision unit 10 terminal device 50 Information providing device N network

Claims

Graph information in which a plurality of nodes corresponding to each of a plurality of subjects, each of which is an individual, are linked according to the similarity of the plurality of subjects, and one individual who has made an important contribution in one company an acquisition unit that acquires target information about a target from a terminal device used by a user who organizes a project in a company different from the one company;
retrieving the graph information starting from a starting point node determined based on a predetermined criterion among the plurality of nodes of the graph information obtained by the obtaining unit and serving as a starting point for searching the graph information; , a search unit for extracting, from among the plurality of targets , a corresponding target similar to the individual who has made an important contribution to the one company, which is the one target;
The user who organizes the project uses information indicating that the person who is the target of correspondence extracted by the search unit is a person similar to the person who has made an important contribution to the one company. a providing unit that transmits to the terminal device to
An information processing device comprising:

a determination unit that determines the origin node based on index information used to determine the origin node;
further comprising
The search unit is
The information processing apparatus according to claim 1, wherein the correspondence target is extracted with the starting node determined by the determination unit as a starting point.

The decision unit
3. The information processing apparatus according to claim 2, wherein the starting node is determined based on the tree-structured index information.

The acquisition unit
4. The method according to any one of claims 1 to 3, wherein graph information in which the plurality of nodes are connected is acquired according to the similarity of the plurality of vectors corresponding to each of the plurality of objects. Information processing equipment.

The acquisition unit
5. The information processing apparatus according to claim 4, wherein the graph information is acquired in which the plurality of vectors indicating the characteristics of each of the plurality of targets are connected according to similarity.

The acquisition unit
5. Obtaining the graph information in which the plurality of vectors whose elements are feature amounts extracted from each of the plurality of targets using a predetermined model are connected according to similarity. Or the information processing apparatus according to claim 5 .

The acquisition unit
By inputting information about the plurality of targets into a predetermined model, the plurality of vectors whose elements are feature amounts of each of the plurality of targets to be extracted form the graph information linked according to similarity. 7. The information processing apparatus according to any one of claims 4 to 6, characterized in that:

A computer-executed information processing method comprising:
Graph information in which a plurality of nodes corresponding to each of a plurality of subjects, each of which is an individual, are linked according to the similarity of the plurality of subjects, and one individual who has made an important contribution in one company an acquisition step of acquiring target information about a target from a terminal device used by a user who organizes a project in a company different from the one company;
retrieving the graph information starting from a starting point node determined based on a predetermined criterion among the plurality of nodes of the graph information obtained by the obtaining step, and serving as a starting point for searching the graph information; , a search step of extracting, from among the plurality of targets , a corresponding target similar to the individual who has made an important contribution to the one company, which is the one target;
The user who organizes the project uses information indicating that the person who is the target of correspondence extracted by the search step is a person similar to the person who has made an important contribution to the one company. a providing step of transmitting to the terminal device to
An information processing method characterized by including

Graph information in which a plurality of nodes corresponding to each of a plurality of subjects, each of which is an individual, are linked according to the similarity of the plurality of subjects, and one individual who has made an important contribution in one company an acquisition procedure for acquiring target information about a target from a terminal device used by a user who organizes a project in a company different from the one company;
Searching for the graph information starting from a starting point node determined based on a predetermined criterion among the plurality of nodes of the graph information obtained by the obtaining procedure. , a search procedure for extracting, from among the plurality of targets , a corresponding target similar to the individual who has made an important contribution to the one company, which is the one target;
The user who organizes the project uses information indicating that the person who is the target of correspondence extracted by the search procedure is a person similar to the person who has made an important contribution to the one company. a providing procedure for transmitting to the terminal device to
An information processing program characterized by causing a computer to execute