JP7008152B1

JP7008152B1 - Information processing equipment, information processing methods and information processing programs

Info

Publication number: JP7008152B1
Application number: JP2021034630A
Authority: JP
Inventors: 孝太坪内; 拓明田口
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2021-03-04
Filing date: 2021-03-04
Publication date: 2022-01-25
Anticipated expiration: 2041-03-04
Also published as: JP2022135055A

Abstract

【課題】クエリ間の関係性を適切に推定することができる情報処理装置、情報処理方法および情報処理プログラムを提供する。【解決手段】本願に係る情報処理装置は、取得部と、抽出部と、推定部とを備える。取得部は、ユーザによって入力された第１クエリによる検索要求に基づいて第１クエリとは異なる第２クエリを用いた検索結果が提供された場合のユーザの検索行動に関する履歴情報を取得する。抽出部は、取得部によって取得された履歴情報に基づいて検索行動に際してのコンテキストを含む検索行動の特徴情報を抽出する。推定部は、抽出部によって抽出された特徴情報の変化に基づいて、第１クエリと第２クエリとの間の関係性を推定する。【選択図】図５PROBLEM TO BE SOLVED: To provide an information processing device, an information processing method and an information processing program capable of appropriately estimating a relationship between queries. An information processing apparatus according to the present application includes an acquisition unit, an extraction unit, and an estimation unit. The acquisition unit acquires history information regarding the user's search behavior when a search result using a second query different from the first query is provided based on the search request by the first query input by the user. The extraction unit extracts the characteristic information of the search behavior including the context in the search behavior based on the history information acquired by the acquisition unit. The estimation unit estimates the relationship between the first query and the second query based on the change in the feature information extracted by the extraction unit. [Selection diagram] FIG. 5

Description

本発明は、情報処理装置、情報処理方法および情報処理プログラムに関する。 The present invention relates to an information processing apparatus, an information processing method and an information processing program.

従来、ユーザが入力した文字列を検索クエリ（以下、単に「クエリ」と呼ぶ）として、様々な情報の検索サービスを行うための技術が提供されている。そのような検索サービスにおいては、ユーザが入力したクエリに誤記が含まれる場合等において、その誤記を変換した上で検索を行い、検索結果をユーザに提供する技術が提供されている（例えば、特許文献１参照）。 Conventionally, there has been provided a technique for performing a search service for various information by using a character string input by a user as a search query (hereinafter, simply referred to as a "query"). In such a search service, when a query entered by a user contains an error, a technique is provided in which the error is converted and then searched, and the search result is provided to the user (for example, a patent). See Document 1).

特許第６５２９４５６号公報Japanese Patent No. 6529456

しかしながら、上記の従来技術を用いた場合、入力されたクエリと実際の検索に用いるべきクエリの間の関係性を判断することが難しい。例えば、従来技術では、第１文字列とその第１文字列の誤記として入力され得る第２文字列とを対応付けた辞書を用いて、第２文字列が入力された場合に、第２文字列を第１文字列へと変換しているに過ぎない。 However, when the above-mentioned conventional technique is used, it is difficult to determine the relationship between the input query and the query to be used for the actual search. For example, in the prior art, when the second character string is input by using a dictionary in which the first character string and the second character string that can be input as an erroneous description of the first character string are associated with each other, the second character is input. It's just converting the column to the first string.

このため、従来技術では、ユーザが入力した第２文字列をそのままクエリとしたい場合には、ユーザは望む検索結果を得られないおそれがある。したがって、クエリ間の関係性を適切に推定することが望まれている。 Therefore, in the prior art, if the user wants to use the second character string input by the user as a query as it is, the user may not be able to obtain the desired search result. Therefore, it is desired to properly estimate the relationship between queries.

本願は、上記に鑑みてなされたものであって、クエリ間の関係性を適切に推定することができる情報処理装置、情報処理方法および情報処理プログラムを提供することを目的とする。 The present application has been made in view of the above, and an object of the present application is to provide an information processing apparatus, an information processing method, and an information processing program capable of appropriately estimating the relationship between queries.

本願に係る情報処理装置は、取得部と、抽出部と、推定部とを備える。前記取得部は、ユーザによって入力された第１クエリによる検索要求に基づいて前記第１クエリとは異なる第２クエリを用いた検索結果が提供された場合の前記ユーザの検索行動に関する履歴情報を取得する。前記抽出部は、前記取得部によって取得された前記履歴情報に基づいて前記検索行動に際してのコンテキストを含む前記検索行動の特徴情報を抽出する。前記推定部は、前記抽出部によって抽出された前記特徴情報の変化に基づいて、前記第１クエリと前記第２クエリとの間の関係性を推定する。 The information processing apparatus according to the present application includes an acquisition unit, an extraction unit, and an estimation unit. The acquisition unit acquires history information regarding the search behavior of the user when a search result using a second query different from the first query is provided based on a search request by the first query input by the user. do. The extraction unit extracts characteristic information of the search behavior including a context for the search behavior based on the history information acquired by the acquisition unit. The estimation unit estimates the relationship between the first query and the second query based on the change in the feature information extracted by the extraction unit.

実施形態の一態様によれば、クエリ間の関係性を適切に推定することができるという効果を奏する。 According to one aspect of the embodiment, there is an effect that the relationship between queries can be appropriately estimated.

図１は、実施形態に係る情報処理の一例を示す図である。FIG. 1 is a diagram showing an example of information processing according to an embodiment. 図２は、実施形態に係る情報処理の処理説明図（その１）である。FIG. 2 is a processing explanatory diagram (No. 1) of information processing according to the embodiment. 図３は、実施形態に係る情報処理の処理説明図（その２）である。FIG. 3 is a process explanatory diagram (No. 2) of information processing according to the embodiment. 図４は、実施形態に係る情報処理システムの構成例を示す図である。FIG. 4 is a diagram showing a configuration example of an information processing system according to an embodiment. 図５は、実施形態に係る情報処理装置の構成例を示すブロック図である。FIG. 5 is a block diagram showing a configuration example of the information processing apparatus according to the embodiment. 図６は、実施形態に係る推定処理の処理説明図（その１）である。FIG. 6 is a process explanatory diagram (No. 1) of the estimation process according to the embodiment. 図７は、実施形態に係る推定処理の処理説明図（その２）である。FIG. 7 is a process explanatory diagram (No. 2) of the estimation process according to the embodiment. 図８は、実施形態に係る推定処理の処理説明図（その３）である。FIG. 8 is a process explanatory diagram (No. 3) of the estimation process according to the embodiment. 図９は、実施形態に係る情報処理装置が実行する処理手順を示すフローチャートである。FIG. 9 is a flowchart showing a processing procedure executed by the information processing apparatus according to the embodiment. 図１０は、実施形態に係る情報処理装置の機能を実現するコンピュータの一例を示すハードウェア構成図である。FIG. 10 is a hardware configuration diagram showing an example of a computer that realizes the functions of the information processing apparatus according to the embodiment.

以下に、本願に係る情報処理装置、情報処理方法および情報処理プログラムを実施するための形態（以下、「実施形態」と呼ぶ）について図面を参照しつつ詳細に説明する。なお、この実施形態により本願に係る情報処理装置、情報処理方法および情報処理プログラムが限定されるものではない。また、各実施形態は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。また、以下の各実施形態において同一の部位には同一の符号を付し、重複する説明は省略される。 Hereinafter, an information processing apparatus, an information processing method, and an embodiment for implementing an information processing program (hereinafter referred to as “embodiments”) according to the present application will be described in detail with reference to the drawings. Note that this embodiment does not limit the information processing apparatus, information processing method, and information processing program according to the present application. In addition, each embodiment can be appropriately combined as long as the processing contents do not contradict each other. Further, in each of the following embodiments, the same parts are designated by the same reference numerals, and duplicate explanations are omitted.

〔１．情報処理の一例〕
まず、実施形態に係る情報処理の一例について、図１～図３を用いて説明する。図１は、実施形態に係る情報処理の一例を示す図である。また、図２は、実施形態に係る情報処理の処理説明図（その１）である。また、図３は、実施形態に係る情報処理の処理説明図（その３）である。 [1. An example of information processing]
First, an example of information processing according to the embodiment will be described with reference to FIGS. 1 to 3. FIG. 1 is a diagram showing an example of information processing according to an embodiment. Further, FIG. 2 is a processing explanatory diagram (No. 1) of information processing according to the embodiment. Further, FIG. 3 is a processing explanatory diagram (No. 3) of information processing according to the embodiment.

図１～図３では、実施形態に係る情報処理システム１に含まれる情報処理装置１００が、ユーザＵによって入力された第１クエリによる検索要求に基づいて第１クエリとは異なる第２クエリを用いた検索結果が提供された場合のユーザＵの検索行動に関する履歴情報を取得し、取得された履歴情報に基づいて検索行動に際してのコンテキストを含む検索行動の特徴情報を抽出し、抽出された特徴情報の変化に基づいて、第１クエリと第２クエリとの間の関係性を推定する処理を実行する例を示す。 In FIGS. 1 to 3, the information processing apparatus 100 included in the information processing system 1 according to the embodiment uses a second query different from the first query based on the search request by the first query input by the user U. The history information about the search behavior of the user U when the searched result is provided is acquired, and the feature information of the search behavior including the context of the search behavior is extracted based on the acquired history information, and the extracted feature information is extracted. An example of executing a process of estimating the relationship between the first query and the second query based on the change of is shown.

図１に示すように、実施形態に係る情報処理システム１は、ユーザ端末１０と、情報処理装置１００とを含む。 As shown in FIG. 1, the information processing system 1 according to the embodiment includes a user terminal 10 and an information processing device 100.

ユーザ端末１０は、ユーザＵすなわちユーザＵ１，Ｕ２…がそれぞれ利用する端末装置であるユーザ端末１０－１，１０－２…である。ユーザＵは、ユーザ端末１０から図示略のネットワークを介し、情報処理装置１００へアクセスすることで、情報処理装置１００が提供するサービスの一つである検索サービスを利用する。なお、以下では、「ユーザＵ」は、適宜「ユーザ端末１０」と読み替えることができる。 The user terminal 10 is a user terminal 10-1, 10-2 ... Which is a terminal device used by the user U, that is, the users U1, U2 ... The user U uses the search service, which is one of the services provided by the information processing apparatus 100, by accessing the information processing apparatus 100 from the user terminal 10 via a network (not shown). In the following, "user U" can be appropriately read as "user terminal 10".

情報処理装置１００は、各種サービスをユーザＵへ提供するサーバ装置である。情報処理装置１００が提供するサービスは、例えば、ユーザ端末１０にインストールされた各種アプリやブラウザを介して各種情報を提供するサービスである。提供されるサービスには、検索サービスの他、例えば、ニュース提供サービスや、オークションサービス、天気予報サービス、ショッピングサービス、金融取引（株取引等）サービス、路線検索サービス、地図提供サービス、旅行サービス、飲食店紹介サービス、ブログサービス等が含まれてもよい。 The information processing device 100 is a server device that provides various services to the user U. The service provided by the information processing apparatus 100 is, for example, a service that provides various information via various applications and browsers installed in the user terminal 10. In addition to search services, the services provided include, for example, news services, auction services, weather forecast services, shopping services, financial trading (stock trading, etc.) services, route search services, map providing services, travel services, and food and beverage services. Store introduction services, blog services, etc. may be included.

ユーザＵは、検索サービスを利用する場合、情報処理装置１００へアクセス後、ユーザ端末１０から検索キーワードとなる文字列であるクエリを入力する。以下、ユーザＵが入力するクエリを「第１クエリ」と呼ぶ。そして、ユーザＵは、第１クエリによる検索要求を情報処理装置１００へ送信する（ステップＳ１）。 When using the search service, the user U inputs a query, which is a character string as a search keyword, from the user terminal 10 after accessing the information processing apparatus 100. Hereinafter, the query input by the user U is referred to as a "first query". Then, the user U transmits the search request by the first query to the information processing apparatus 100 (step S1).

ここで、第１クエリには誤記が含まれる場合がある。情報処理装置１００は、例えば誤記として入力され得る第１クエリと、第１クエリに対し正しいと想定される文字列である第２クエリとを対応付けた辞書情報を有しており、これを用いて第１クエリが入力された場合に、第１クエリを第２クエリへ変換して第２クエリを用いた情報の検索を行う。 Here, the first query may contain an error. The information processing apparatus 100 has dictionary information in which a first query that can be input as an erroneous description and a second query, which is a character string that is assumed to be correct for the first query, are associated with each other, and is used. When the first query is input, the first query is converted into the second query and the information is searched using the second query.

そして、情報処理装置１００は、かかる第２クエリを用いた検索結果をユーザＵへ送信する（ステップＳ２）。ユーザＵは、これに応じ、閲覧したい検索結果のうちの一つを選択する（ステップＳ３）。 Then, the information processing apparatus 100 transmits the search result using the second query to the user U (step S2). The user U selects one of the search results to be browsed accordingly (step S3).

ステップＳ１～Ｓ３までの流れを分かりやすく図２に示した。図２に示すのは、ユーザ端末１０に表示された検索結果一覧画面の一部の例である。図２では、ユーザＵが第１クエリ「ＸＸＸＸ」を入力し、検索を実行したところ、「ＸＸＸＸ」は「ＹＹＹＹ」の誤記候補に含まれており、情報処理装置１００は、これに応じて「ＹＹＹＹ」を第２クエリとして用いた検索結果を提供した例を示している。 The flow from steps S1 to S3 is shown in FIG. 2 in an easy-to-understand manner. FIG. 2 shows a part of the search result list screen displayed on the user terminal 10. In FIG. 2, when the user U inputs the first query "XXXX" and executes the search, "XXXXX" is included in the error candidate of "YYYY", and the information processing apparatus 100 responds to "". An example of providing a search result using "YYYY" as a second query is shown.

なお、同図に示すように、第１クエリ「ＸＸＸＸ」による検索結果も下方に表示されており、ユーザＵがかかる第１クエリによる検索結果を望んでいるのであれば、これを選択することはできる。ただし、下方に表示されるなど、第２クエリによる検索結果よりも低い優先度で取り扱われる都合上、ユーザＵがこれを選択するまでには、第２クエリによる検索結果を選択するよりも時間がかかることとなる。 As shown in the figure, the search result by the first query "XXXX" is also displayed below, and if the user U wants the search result by the first query, he / she can select this. can. However, because it is treated with a lower priority than the search result by the second query, such as being displayed at the bottom, it takes longer than the user U to select the search result by the second query before selecting this. It will take.

図１の説明に戻る。情報処理装置１００は、これらステップＳ１～Ｓ３のユーザＵの検索行動を含むユーザＵの行動に関する履歴情報をユーザ端末１０から随時収集し、蓄積する。 Returning to the description of FIG. The information processing apparatus 100 collects and stores historical information about the user U's behavior including the search behavior of the user U in steps S1 to S3 from the user terminal 10 at any time.

履歴情報には、例えば、ユーザＵが入力した第１クエリによる検索が要求された時点から、ユーザＵに第１クエリとは異なる第２クエリを用いた検索結果を提供し、ユーザＵによりその検索結果が選択された時点までの経過時間が含まれる。また、履歴情報には、例えば、ユーザＵの行動から推定されるコンテキストが含まれる。 For the history information, for example, from the time when the search by the first query input by the user U is requested, the search result using the second query different from the first query is provided to the user U, and the search is performed by the user U. Includes the elapsed time to the point when the result was selected. Further, the history information includes, for example, a context estimated from the behavior of the user U.

コンテキストは、検索行動を含む各種の行動をユーザＵが取るに際してのユーザＵの状況や、ユーザＵの環境等を示す情報である。具体的には、コンテキストには、ユーザＵのジオグラフィック属性（検索行動をとった際の位置等）や、デモグラフィック属性（年齢や、性別、収入、職業等）、サイコグラフィック属性（価値観や、ライフスタイル、性格、好み等）等が含まれる。 The context is information indicating the situation of the user U when the user U takes various actions including the search action, the environment of the user U, and the like. Specifically, the context includes the geographic attribute of the user U (position when the search action is taken, etc.), the demographic attribute (age, gender, income, occupation, etc.), and the psychographic attribute (values, etc.). , Lifestyle, personality, taste, etc.) are included.

そして、情報処理装置１００は随時、履歴情報に基づいてユーザＵの検索行動に関する特徴情報を抽出する（ステップＳ４）。特徴情報は、前述した選択までの経過時間に関する特徴や、検索行動に際してのコンテキストに関する特徴を含む。 Then, the information processing apparatus 100 extracts feature information regarding the search behavior of the user U based on the history information at any time (step S4). The feature information includes the feature regarding the elapsed time until the selection described above and the feature regarding the context in the search behavior.

これら特徴は、例えば統計的な特徴である。情報処理装置１００は、例えば前述の経過時間の標準偏差を抽出する。例えば、図１には、情報処理装置１００が、経過時間の中央値μに対する標準偏差σを抽出した例を示している。また、情報処理装置１００は、例えば前述のコンテキストの分布を抽出する。 These features are, for example, statistical features. The information processing apparatus 100 extracts, for example, the standard deviation of the elapsed time described above. For example, FIG. 1 shows an example in which the information processing apparatus 100 extracts the standard deviation σ with respect to the median μ of the elapsed time. Further, the information processing apparatus 100 extracts, for example, the above-mentioned context distribution.

そして、情報処理装置１００は、抽出された時系列上の特徴情報の変化に基づいて、クエリ間、すなわち第１クエリと第２クエリとの間の関係性を推定する（ステップＳ５）。 Then, the information processing apparatus 100 estimates the relationship between the queries, that is, between the first query and the second query, based on the change in the extracted feature information on the time series (step S5).

具体的には、図３に示すように、情報処理装置１００は、例えば経過時間のばらつき度合いが小さい間は、第１クエリと第２クエリとの関係性が高いと推定する。すなわち、情報処理装置１００は、第１クエリが第２クエリの誤記であると推定する。 Specifically, as shown in FIG. 3, the information processing apparatus 100 estimates that the relationship between the first query and the second query is high, for example, while the degree of variation in the elapsed time is small. That is, the information processing apparatus 100 estimates that the first query is an error in the second query.

また、情報処理装置１００は、例えば経過時間のばらつき度合いが大きくなると、第１クエリと第２クエリとの関係性が低いと推定する。すなわち、情報処理装置１００は、第１クエリが第２クエリの誤記でなく、新規のワードであると推定する。これにより、情報処理装置１００は、例えば前述の第２クエリの誤記候補から第１クエリを除外することができる。 Further, the information processing apparatus 100 estimates that the relationship between the first query and the second query is low, for example, when the degree of variation in the elapsed time becomes large. That is, the information processing apparatus 100 estimates that the first query is not an error in the second query, but a new word. Thereby, the information processing apparatus 100 can exclude the first query from, for example, the above-mentioned error candidate of the second query.

また、情報処理装置１００は、例えばコンテキストの分布のばらつき度合いが小さい間は、第１クエリと第２クエリとの関係性が高いと推定する。すなわち、情報処理装置１００は、この場合、ユーザＵの検索行動の際のコンテキストが例えば一様であり、これまでの既知の用法により、第１クエリが用いられていると推定する。したがって、第１クエリが第２クエリの誤記である場合を含む。 Further, the information processing apparatus 100 estimates that the relationship between the first query and the second query is high, for example, while the degree of variation in the context distribution is small. That is, in this case, the information processing apparatus 100 estimates that the context of the user U's search behavior is uniform, and that the first query is used according to the known usage so far. Therefore, the case where the first query is an error of the second query is included.

また、情報処理装置１００は、例えばコンテキストの分布のばらつき度合いが大きくなると、第１クエリと第２クエリとの関係性が低いと推定する。すなわち、情報処理装置１００は、既知の用法とは異なる新規の用法で用いられていると推定する。したがって、第１クエリが第２クエリの誤記でなく、新規のワードとして入力される場合を含む。 Further, the information processing apparatus 100 estimates that the relationship between the first query and the second query is low, for example, when the degree of variation in the context distribution becomes large. That is, it is presumed that the information processing apparatus 100 is used in a new usage different from the known usage. Therefore, the case where the first query is input as a new word instead of the typographical error of the second query is included.

このように、新規の用法で用いられていることを推定できる、言い換えれば、特定の文字列に新たな特徴が表出してくるのを検出できることによって、ターゲティング等の各種分析や、いわゆる「バズる」といった事象発生の検出等に活用することができる。 In this way, it is possible to presume that it is used in a new usage, in other words, by being able to detect that a new feature appears in a specific character string, various analyzes such as targeting and so-called "buzz" It can be used to detect the occurrence of an event such as ".

上述したように、実施形態に係る情報処理方法では、ユーザＵによって入力された第１クエリによる検索要求に基づいて第１クエリとは異なる第２クエリを用いた検索結果が提供された場合のユーザＵの検索行動に関する履歴情報を取得し、取得された履歴情報に基づいて検索行動に際してのコンテキストを含む検索行動の特徴情報を抽出し、抽出された特徴情報の変化に基づいて、第１クエリと第２クエリとの間の関係性を推定する。 As described above, in the information processing method according to the embodiment, the user when the search result using the second query different from the first query is provided based on the search request by the first query input by the user U. The history information about the search behavior of U is acquired, the feature information of the search behavior including the context of the search behavior is extracted based on the acquired history information, and the first query and the first query are based on the change of the extracted feature information. Estimate the relationship with the second query.

したがって、実施形態に係る情報処理方法によれば、クエリ間の関係性を適切に推定することができる。 Therefore, according to the information processing method according to the embodiment, the relationship between the queries can be appropriately estimated.

以下、上記のような情報処理を行う情報処理装置１００、および、かかる情報処理装置１００を含む情報処理システム１について詳細に説明する。 Hereinafter, the information processing apparatus 100 that performs the above-mentioned information processing and the information processing system 1 including the information processing apparatus 100 will be described in detail.

〔２．情報処理システム１の構成〕
次に、図４を用いて、実施形態に係る情報処理システム１の構成について説明する。図４は、実施形態に係る情報処理システム１の構成例を示す図である。図４に例示するように、実施形態に係る情報処理システム１は、複数のユーザ端末１０－１～１０－ｎと、情報処理装置１００と、を含む。 [2. Configuration of information processing system 1]
Next, the configuration of the information processing system 1 according to the embodiment will be described with reference to FIG. FIG. 4 is a diagram showing a configuration example of the information processing system 1 according to the embodiment. As illustrated in FIG. 4, the information processing system 1 according to the embodiment includes a plurality of user terminals 10-1 to 10-n and an information processing apparatus 100.

これらの各種装置は、ネットワークＮを介して、有線または無線により通信可能に接続される。ネットワークＮは、ＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）、電話網（携帯電話網、固定電話網等）、地域ＩＰ（Internet Protocol）網、インターネット等の通信ネットワークである。ネットワークＮには、有線ネットワークが含まれていてもよいし、無線ネットワークが含まれていてもよい。 These various devices are connected so as to be communicable by wire or wirelessly via the network N. The network N is a communication network such as a LAN (Local Area Network), a WAN (Wide Area Network), a telephone network (mobile telephone network, fixed telephone network, etc.), a regional IP (Internet Protocol) network, and the Internet. The network N may include a wired network or may include a wireless network.

ユーザ端末１０は、上記したように、ユーザＵによって利用される端末装置である。ユーザ端末１０は、例えば、スマートフォンを含む携帯電話機や、タブレット端末や、デスクトップ型ＰＣや、ノート型ＰＣや、ＰＤＡ（Personal Digital Assistant）等の情報処理装置である。また、ユーザ端末１０には、眼鏡型や時計型の情報処理装置であるウェアラブルデバイス（wearable device）も含まれる。 As described above, the user terminal 10 is a terminal device used by the user U. The user terminal 10 is, for example, an information processing device such as a mobile phone including a smartphone, a tablet terminal, a desktop PC, a notebook PC, or a PDA (Personal Digital Assistant). Further, the user terminal 10 also includes a wearable device which is a glasses-type or watch-type information processing device.

ユーザ端末１０は、ユーザＵによる操作や、ユーザ端末１０が有する機能（例えば、検索サービスを利用するためのアプリを実行する機能や、ブラウザ機能等）に応じて各種情報を取得し、取得した情報に応じた情報を生成して送信する。例えば、ユーザ端末１０は、ネットワークＮを介して、情報処理装置１００が提供する検索サービスのサイトへアクセスする。そして、ユーザＵが例えば検索したい検索キーワードであるクエリを指定することによって、ユーザ端末１０は、情報処理装置１００に対してクエリを送信する。 The user terminal 10 acquires various information according to the operation by the user U and the function of the user terminal 10 (for example, a function of executing an application for using a search service, a browser function, etc.), and the acquired information. Generates and sends information according to the above. For example, the user terminal 10 accesses the site of the search service provided by the information processing apparatus 100 via the network N. Then, by designating a query that is a search keyword that the user U wants to search for, for example, the user terminal 10 transmits the query to the information processing apparatus 100.

情報処理装置１００は、ユーザ端末１０に対し、検索サービスを提供するサービス提供サーバとして機能するサーバ装置である。また、情報処理装置１００は、検索サービスの履歴情報に基づいてユーザＵの検索行動の特徴情報を抽出するとともに、抽出された特徴情報の変化に基づいてクエリ間の関係性を推定する。 The information processing device 100 is a server device that functions as a service providing server that provides a search service to the user terminal 10. Further, the information processing apparatus 100 extracts the feature information of the search behavior of the user U based on the history information of the search service, and estimates the relationship between the queries based on the change of the extracted feature information.

なお、図４では、１つの情報処理装置１００が、検索サービスの提供、特徴情報の抽出、クエリ間の関係性の推定の各機能を兼ね備える場合を示したが、例えば、複数の情報処理装置１００が、これらの各機能を分担して行ってもよい。 Note that FIG. 4 shows a case where one information processing device 100 has functions of providing a search service, extracting feature information, and estimating a relationship between queries. For example, a plurality of information processing devices 100 are shown. However, each of these functions may be shared.

〔３．情報処理装置１００〕
次に、図５を用いて、情報処理装置１００の構成例について説明する。図５は、実施形態に係る情報処理装置１００の構成例を示すブロック図である。なお、図５では、情報処理装置１００の説明に必要となる構成要素のみを示しており、一般的な構成要素についての記載を省略している。 [3. Information processing device 100]
Next, a configuration example of the information processing apparatus 100 will be described with reference to FIG. FIG. 5 is a block diagram showing a configuration example of the information processing apparatus 100 according to the embodiment. Note that FIG. 5 shows only the components necessary for the explanation of the information processing apparatus 100, and the description of general components is omitted.

図５に示すように、情報処理装置１００は、通信部１１０と、記憶部１２０と、制御部１３０とを有する。なお、情報処理装置１００は、情報処理装置１００を利用する管理者等から各種操作を受け付ける入力部（例えば、キーボードやマウス等）や、各種情報を表示するための表示部（例えば、液晶ディスプレイ等）を有してもよい。 As shown in FIG. 5, the information processing apparatus 100 includes a communication unit 110, a storage unit 120, and a control unit 130. The information processing device 100 includes an input unit (for example, a keyboard, a mouse, etc.) that receives various operations from an administrator or the like who uses the information processing device 100, and a display unit (for example, a liquid crystal display, etc.) for displaying various information. ) May have.

（通信部１１０について）
通信部１１０は、例えば、ＮＩＣ（Network Interface Card）等によって実現される。通信部１１０は、ネットワークＮと有線または無線で接続され、ネットワークＮを介して、ユーザ端末１０との間で情報の送受信を行う。 (About communication unit 110)
The communication unit 110 is realized by, for example, a NIC (Network Interface Card) or the like. The communication unit 110 is connected to the network N by wire or wirelessly, and transmits / receives information to / from the user terminal 10 via the network N.

（記憶部１２０について）
記憶部１２０は、例えば、ＲＡＭ（Random Access Memory）、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現され、図５の例では、記憶部１２０は、履歴情報記憶部１２１と、特徴情報記憶部１２２と、推定情報記憶部１２３とを有する。 (About the storage unit 120)
The storage unit 120 is realized by, for example, a semiconductor memory element such as a RAM (Random Access Memory) or a flash memory (Flash Memory), or a storage device such as a hard disk or an optical disk. In the example of FIG. 5, the storage unit 120 is a storage unit 120. It has a history information storage unit 121, a feature information storage unit 122, and an estimation information storage unit 123.

（履歴情報記憶部１２１）
履歴情報記憶部１２１は、後述する取得部１３１によって取得されるユーザＵの検索行動の履歴に関する各種情報、すなわち上述した履歴情報を記憶する。したがって、履歴情報記憶部１２１は、例えば、ユーザＵが入力した第１クエリによる検索が要求された第１時点から、ユーザＵに第１クエリとは異なる第２クエリを用いた検索結果を提供し、ユーザＵによりその検索結果が選択された第２時点までの経過時間を記憶する。また、履歴情報記憶部１２１は、例えば、ユーザＵの検索行動から推定されるコンテキストを記憶する。 (History information storage unit 121)
The history information storage unit 121 stores various information regarding the history of the search behavior of the user U acquired by the acquisition unit 131 described later, that is, the above-mentioned history information. Therefore, the history information storage unit 121 provides the user U with a search result using a second query different from the first query from the first time point when the search by the first query input by the user U is requested, for example. , The elapsed time up to the second time point when the search result is selected by the user U is stored. Further, the history information storage unit 121 stores, for example, the context estimated from the search behavior of the user U.

（特徴情報記憶部１２２）
特徴情報記憶部１２２は、後述する抽出部１３２によって抽出されるユーザＵの検索行動に関する特徴情報を記憶する。特徴情報は、上述したように、前述の経過時間に関する統計的な特徴や、コンテキストに関する統計的な特徴を含む。 (Characteristic information storage unit 122)
The feature information storage unit 122 stores feature information related to the search behavior of the user U extracted by the extraction unit 132, which will be described later. As described above, the feature information includes the above-mentioned statistical features related to elapsed time and statistical features related to context.

（推定情報記憶部１２３）
推定情報記憶部１２３は、後述する推定部１３３によって推定された推定結果に関する情報を記憶する。例えば、推定情報記憶部１２３は、新たなワードとして推定した新単語を記憶する。また、例えば、推定情報記憶部１２３は、推定した新たな用法を単語（新単語を含む）に対応付けて記憶する。 (Estimated information storage unit 123)
The estimation information storage unit 123 stores information regarding the estimation result estimated by the estimation unit 133, which will be described later. For example, the estimation information storage unit 123 stores a new word estimated as a new word. Further, for example, the estimation information storage unit 123 stores the estimated new usage in association with a word (including a new word).

（制御部１３０について）
制御部１３０は、コントローラ（controller）であり、例えば、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等によって、情報処理装置１００内部の記憶装置に記憶されている各種プログラム（情報処理プログラムの一例に相当）がＲＡＭを作業領域として実行されることにより実現される。また、制御部１３０は、例えば、コントローラであり、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路により実現される。 (About control unit 130)
The control unit 130 is a controller, and for example, various programs (information processing programs) stored in a storage device inside the information processing device 100 by a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like. (Corresponding to one example) is realized by executing RAM as a work area. Further, the control unit 130 is, for example, a controller, and is realized by an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

図５に示すように、制御部１３０は、取得部１３１と、抽出部１３２と、推定部１３３と、処理部１３４と、提供部１３５とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部１３０の内部構成は、図５に示した構成に限られず、後述する情報処理を行うことができる構成であれば他の構成であってもよい。また、制御部１３０が有する各処理部の接続関係は、図５に示した接続関係に限られず、他の接続関係であってもよい。 As shown in FIG. 5, the control unit 130 includes an acquisition unit 131, an extraction unit 132, an estimation unit 133, a processing unit 134, and a provision unit 135, and has functions and operations of information processing described below. Realize or execute. The internal configuration of the control unit 130 is not limited to the configuration shown in FIG. 5, and may be any other configuration as long as it can perform information processing described later. Further, the connection relationship of each processing unit included in the control unit 130 is not limited to the connection relationship shown in FIG. 5, and may be another connection relationship.

制御部１３０は、ユーザＵによって入力された第１クエリによる検索要求に基づいて第１クエリとは異なる第２クエリを用いた検索結果が提供された場合のユーザＵの検索行動に関する履歴情報を取得し、取得された履歴情報に基づいて検索行動に際してのコンテキストを含む検索行動の特徴情報を抽出し、抽出された特徴情報の変化に基づいて、第１クエリと第２クエリとの間の関係性を推定する。 The control unit 130 acquires history information regarding the search behavior of the user U when a search result using a second query different from the first query is provided based on the search request by the first query input by the user U. Then, based on the acquired history information, the characteristic information of the search behavior including the context of the search behavior is extracted, and the relationship between the first query and the second query is based on the change of the extracted characteristic information. To estimate.

（取得部１３１について）
取得部１３１は、ユーザＵによって入力された第１クエリによる検索要求に基づいて第１クエリとは異なる第２クエリを用いた検索結果が提供された場合のユーザＵの検索行動に関する履歴情報を取得する。 (About acquisition unit 131)
The acquisition unit 131 acquires history information regarding the search behavior of the user U when a search result using a second query different from the first query is provided based on the search request by the first query input by the user U. do.

具体的には、取得部１３１は、通信部１１０を介して、ユーザＵによって入力された第１クエリによる検索要求を取得する。また、取得部１３１は、通信部１１０を介して、ユーザＵによって選択された検索結果を取得する。また、取得部１３１は、通信部１１０を介して履歴情報を取得し、履歴情報記憶部１２１へ記憶させる。 Specifically, the acquisition unit 131 acquires the search request by the first query input by the user U via the communication unit 110. Further, the acquisition unit 131 acquires the search result selected by the user U via the communication unit 110. Further, the acquisition unit 131 acquires the history information via the communication unit 110 and stores it in the history information storage unit 121.

（抽出部１３２について）
抽出部１３２は、履歴情報記憶部１２１へ記憶された履歴情報に基づいて、ユーザＵの検索行動に際してのコンテキストを含む検索行動の特徴情報を抽出する。 (About the extraction unit 132)
The extraction unit 132 extracts the characteristic information of the search behavior including the context in the search behavior of the user U based on the history information stored in the history information storage unit 121.

抽出部１３２は、例えば前述の経過時間の標準偏差を抽出する。また、抽出部１３２は、例えば前述のコンテキストの分布を抽出する。コンテキストの分布については、図６～図８を用いた説明で後述する。また、抽出部１３２は、抽出された特徴情報を特徴情報記憶部１２２へ記憶させる。 The extraction unit 132 extracts, for example, the standard deviation of the elapsed time described above. Further, the extraction unit 132 extracts, for example, the above-mentioned context distribution. The distribution of the context will be described later with reference to FIGS. 6 to 8. Further, the extraction unit 132 stores the extracted feature information in the feature information storage unit 122.

（推定部１３３について）
推定部１３３は、特徴情報記憶部１２２へ記憶された特徴情報の変化に基づいて、第１クエリと第２クエリとの間の関係性を推定する。また、推定部１３３は、推定された推定結果に関する情報を推定情報記憶部１２３へ記憶させる。 (About estimation unit 133)
The estimation unit 133 estimates the relationship between the first query and the second query based on the change in the feature information stored in the feature information storage unit 122. Further, the estimation unit 133 stores information about the estimated estimation result in the estimation information storage unit 123.

ここで、推定部１３３が実行する推定処理について、より具体的に説明する。図６は、実施形態に係る推定処理の処理説明図（その１）である。また、図７は、実施形態に係る推定処理の処理説明図（その２）である。また、図８は、実施形態に係る推定処理の処理説明図（その３）である。 Here, the estimation process executed by the estimation unit 133 will be described more specifically. FIG. 6 is a process explanatory diagram (No. 1) of the estimation process according to the embodiment. Further, FIG. 7 is a process explanatory diagram (No. 2) of the estimation process according to the embodiment. Further, FIG. 8 is a process explanatory diagram (No. 3) of the estimation process according to the embodiment.

なお、図６～図８を用いた説明では、上述したコンテキストの分布に基づいてクエリ間の関係性を推定する場合について説明する。 In the description using FIGS. 6 to 8, a case where the relationship between queries is estimated based on the above-mentioned context distribution will be described.

図６に示すように、例えば既知のメディア作品に関する既知のワード「鬼滅の巨人」があり、これに対する第１クエリとして「撲滅の巨人」があるものとする。当初、「撲滅の巨人」は、「鬼滅の巨人」の誤記候補であるものとする。 As shown in FIG. 6, for example, it is assumed that there is a known word "Kimetsu no Giant" regarding a known media work, and there is "Eradication Giant" as the first query for this. Initially, the "Giant of Eradication" is assumed to be a false candidate for the "Giant of Demon".

かかる場合に、第１クエリ「撲滅の巨人」が入力されたユーザＵの検索行動から推定されるコンテキスト群が、図７に示すように、時点Ｔ１においては、年齢「１０代」、性別「男性および女性」、職業「学生」、検索時間帯「「鬼滅の巨人」の放送時間帯」、位置情報「全国各地」等であったとする。 In such a case, the context group estimated from the search behavior of the user U to which the first query "Giant of Eradication" is input is, as shown in FIG. 7, at the time point T1, the age "teen" and the gender "male". And women, profession "student", search time zone "broadcast time zone of" demon giant "", location information "all over the country", etc.

一方、同図に示すように、時点Ｔ１より後の時点Ｔ２においては、年齢「２０代以上」、性別「男性および女性」、職業「社会人」、検索時間帯「Ｂ県警の「事故撲滅の巨人」キャンペーンＣＭ放送後」、位置情報「Ｂ県」等であったとする。なお、時点Ｔ１から時点Ｔ２の間は通常、ある程度の日数が空く場合が多い。 On the other hand, as shown in the figure, at time point T2 after time point T1, age "20s or older", gender "male and female", occupation "worker", search time zone "B prefectural police" accident eradication " It is assumed that "Giants" campaign CM broadcast ", location information" B prefecture ", etc. It should be noted that there is usually a certain number of days between the time point T1 and the time point T2.

推定部１３３は、これら時点Ｔ１，Ｔ２のそれぞれにおけるコンテキストの分布を比較することによって、クエリ間の関係性を推定する。なお、同図に示すようにコンテキストの内容およびその変化は多様であり、言い換えればコンテキストは多次元の特徴を有すると言える。 The estimation unit 133 estimates the relationship between queries by comparing the distribution of contexts at each of these time points T1 and T2. As shown in the figure, the contents of the context and its changes are various, in other words, it can be said that the context has multidimensional characteristics.

そこで、推定部１３３は、例えば機械学習のアルゴリズムを用いた時点Ｔ１，Ｔ２それぞれにおけるコンテキストの分布を算出する。 Therefore, the estimation unit 133 calculates the distribution of the context at each of the time points T1 and T2 using, for example, a machine learning algorithm.

具体的には、図８に示すように、例えば推定部１３３は、ＵＭＡＰ（Uniform Manifold Approximation and Projection）を用いた次元削減により、コンテキストの分布を算出する。ＵＭＡＰは、機械学習による非線形次元圧縮手法であり、リーマン幾何学と代数トポロジーに基づき、高次元空間のデータ構造を保ち、トポロジー間のクロス・エントロピーを最小にしながら低次元のデータに変換する。 Specifically, as shown in FIG. 8, for example, the estimation unit 133 calculates the distribution of the context by dimension reduction using UMAP (Uniform Manifold Approximation and Projection). UMAP is a machine learning non-linear dimensional compression technique that, based on Lehman geometry and algebraic topology, preserves the data structure of high dimensional space and converts it into low dimensional data while minimizing cross-entropy between topologies.

すなわち、推定部１３３は、かかるＵＭＡＰを用いて、時点Ｔ１，Ｔ２それぞれにおける検索行動のコンテキストの高次元空間における特徴量を、次元削減による低次元表現へ変換する。同図では、推定部１３３が、コンテキストの高次元空間における特徴量を三次元の低次元空間へ変換して埋め込んだ例を示している。図中の丸印の一つ一つは、それぞれユーザＵの検索行動の一つ一つに対応する。 That is, the estimation unit 133 uses the UMAP to convert the feature amount in the high-dimensional space of the context of the search behavior at each of the time points T1 and T2 into a low-dimensional representation by dimension reduction. The figure shows an example in which the estimation unit 133 converts and embeds a feature amount in a high-dimensional space of a context into a three-dimensional low-dimensional space. Each of the circles in the figure corresponds to each of the search actions of the user U.

ここで、同図に示すように、例えばコンテキストの分布が、時点Ｔ１においてはある領域Ｒ１にまとまっていたが、時点Ｔ２においては、領域Ｒ１については減少したものの、領域Ｒ１とは明らかに異なる領域Ｒ２へ新たにまとまって見られるようになったとする。 Here, as shown in the figure, for example, the distribution of the context was gathered in a certain region R1 at the time point T1, but at the time point T2, although the region R1 decreased, the region clearly different from the region R1. It is assumed that the R2 can be seen as a new group.

かかる場合、推定部１３３は、同図に示す三次元の特徴量空間においてコンテキストの分布のばらつき度合いが大きくなったと判定できるので、第１クエリと第２クエリとの関係性が低いと推定する。すなわち、推定部１３３は、時点Ｔ２における第１クエリ「撲滅巨人」は、既知の用法とは異なる新規の用法で用いられていると推定する。なお、コンテキストの分布のばらつき度合いが大きいか否かは、例えば上述の三次元空間におけるコンテキストの分布の時点Ｔ１から時点Ｔ２にかけての変化量が、所定の閾値以上である否かによって判定することができる。 In such a case, the estimation unit 133 can determine that the degree of variation in the context distribution has increased in the three-dimensional feature space shown in the figure, and therefore it is estimated that the relationship between the first query and the second query is low. That is, the estimation unit 133 estimates that the first query "eradication giant" at time point T2 is used in a new usage different from the known usage. Whether or not the degree of variation in the context distribution is large can be determined, for example, by whether or not the amount of change in the context distribution from the time point T1 to the time point T2 in the above-mentioned three-dimensional space is equal to or greater than a predetermined threshold value. can.

また、図８を用いた説明では、次元削減のアルゴリズムとしてＵＭＡＰを用いる例を挙げたが、用いるアルゴリズムを限定するものではない。例えば他のアルゴリズムとして、主成分分析や、ｔ分布型確率的近傍埋め込み法（ｔ－ＳＮＥ：t-distributed Stochastic Neighbor Embedding）等を用いてもよい。 Further, in the description using FIG. 8, an example in which UMAP is used as the dimension reduction algorithm is given, but the algorithm to be used is not limited. For example, as another algorithm, principal component analysis, t-distributed Stochastic Neighbor Embedding, or the like may be used.

（処理部１３４について）
図５の説明に戻る。処理部１３４は、各種の処理を実行する。処理部１３４は、ユーザＵが入力した第１クエリに基づく検索処理を実行する。処理部１３４は、ユーザＵが入力した第１クエリを用いて検索処理を実行する。また、処理部１３４は、ユーザＵへ提供するコンテンツを含む各種情報を生成する。 (About processing unit 134)
Returning to the description of FIG. The processing unit 134 executes various processes. The processing unit 134 executes a search process based on the first query input by the user U. The processing unit 134 executes the search process using the first query input by the user U. Further, the processing unit 134 generates various information including contents to be provided to the user U.

また、処理部１３４は、ユーザＵが入力した第１クエリとは異なる第２クエリを用いて検索処理を実行する。処理部１３４は、ユーザＵが入力した第１クエリが前述の誤記候補に含まれる場合、第１クエリを、その第１クエリに対応付けられた第２クエリへ変換する。そして、処理部１３４は、変換後の第２クエリを用いて検索処理を実行する。 Further, the processing unit 134 executes the search process using a second query different from the first query input by the user U. When the first query input by the user U is included in the above-mentioned error candidate, the processing unit 134 converts the first query into the second query associated with the first query. Then, the processing unit 134 executes the search process using the converted second query.

（提供部１３５について）
提供部１３５は、通信部１１０を介して、ユーザ端末１０へ各種情報を送信する。提供部１３５は、ユーザＵへ検索サービスを提供する。例えば、提供部１３５は、処理部１３４による検索処理の結果である検索結果をユーザ端末１０へ送信する。また、提供部１３５は、処理部１３４により生成された各種情報をユーザ端末１０へ送信する。また、提供部１３５は、推定部１３３によって推定された推定結果を情報処理装置１００のオペレータ等へ提供する。 (About the provider 135)
The providing unit 135 transmits various information to the user terminal 10 via the communication unit 110. The providing unit 135 provides a search service to the user U. For example, the providing unit 135 transmits the search result, which is the result of the search processing by the processing unit 134, to the user terminal 10. Further, the providing unit 135 transmits various information generated by the processing unit 134 to the user terminal 10. Further, the providing unit 135 provides the estimation result estimated by the estimation unit 133 to the operator of the information processing apparatus 100 and the like.

〔４．情報処理装置１００の処理手順〕
次に、実施形態に係る情報処理装置１００が実行する処理手順について説明する。図９は、実施形態に係る情報処理装置１００が実行する処理手順を示すフローチャートである。 [4. Processing procedure of information processing apparatus 100]
Next, a processing procedure executed by the information processing apparatus 100 according to the embodiment will be described. FIG. 9 is a flowchart showing a processing procedure executed by the information processing apparatus 100 according to the embodiment.

図９に示すように、まず取得部１３１が、ユーザＵによって入力された第１クエリによる検索要求に基づいて第１クエリとは異なる第２クエリを用いた検索結果が提供された場合のユーザＵの検索行動に関する履歴情報を取得する（ステップＳ１０１）。 As shown in FIG. 9, first, the user U when the acquisition unit 131 provides a search result using a second query different from the first query based on the search request by the first query input by the user U. Acquire history information regarding the search behavior of (step S101).

そして、抽出部１３２が、取得された履歴情報に基づいて検索行動に際してのコンテキストを含む検索行動の特徴情報を抽出する（ステップＳ１０２）。 Then, the extraction unit 132 extracts the characteristic information of the search action including the context in the search action based on the acquired history information (step S102).

そして、推定部１３３が、抽出された特徴情報の変化に基づいて、第１クエリと第２クエリとの間の関係性を推定する（ステップＳ１０３）。そして、処理を終了する。 Then, the estimation unit 133 estimates the relationship between the first query and the second query based on the change in the extracted feature information (step S103). Then, the process is terminated.

〔５．ハードウェア構成〕
上述してきた実施形態に係る情報処理装置１００やユーザ端末１０は、例えば図１０に示すような構成のコンピュータ１０００によって実現される。以下、情報処理装置１００を例に挙げて説明する。図１０は、実施形態に係る情報処理装置１００の機能を実現するコンピュータの一例を示すハードウェア構成図である。コンピュータ１０００は、ＣＰＵ（Central Processing Unit）１１００、ＲＡＭ（Random Access Memory）１２００、ＲＯＭ（Read Only Memory）１３００、ＨＤＤ（Hard Disk Drive）１４００、通信インターフェイス（Ｉ／Ｆ）１５００、入出力インターフェイス（Ｉ／Ｆ）１６００、およびメディアインターフェイス（Ｉ／Ｆ）１７００を備える。 [5. Hardware configuration]
The information processing apparatus 100 and the user terminal 10 according to the above-described embodiment are realized by, for example, a computer 1000 having a configuration as shown in FIG. Hereinafter, the information processing apparatus 100 will be described as an example. FIG. 10 is a hardware configuration diagram showing an example of a computer that realizes the functions of the information processing apparatus 100 according to the embodiment. The computer 1000 includes a CPU (Central Processing Unit) 1100, a RAM (Random Access Memory) 1200, a ROM (Read Only Memory) 1300, an HDD (Hard Disk Drive) 1400, a communication interface (I / F) 1500, and an input / output interface (I). It includes a / F) 1600 and a media interface (I / F) 1700.

ＣＰＵ１１００は、ＲＯＭ１３００またはＨＤＤ１４００に格納されたプログラムに基づいて動作し、各部の制御を行う。ＲＯＭ１３００は、コンピュータ１０００の起動時にＣＰＵ１１００によって実行されるブートプログラムや、コンピュータ１０００のハードウェアに依存するプログラム等を格納する。 The CPU 1100 operates based on a program stored in the ROM 1300 or the HDD 1400, and controls each part. The ROM 1300 stores a boot program executed by the CPU 1100 when the computer 1000 is started, a program depending on the hardware of the computer 1000, and the like.

ＨＤＤ１４００は、ＣＰＵ１１００によって実行されるプログラムおよび当該プログラムによって使用されるデータ等を格納する。通信インターフェイス１５００は、通信網５００（図４に示したネットワークＮに対応）を介して他の機器からデータを受信してＣＰＵ１１００へ送り、ＣＰＵ１１００が生成したデータを、通信網５００を介して他の機器へ送信する。 The HDD 1400 stores a program executed by the CPU 1100, data used by the program, and the like. The communication interface 1500 receives data from another device via the communication network 500 (corresponding to the network N shown in FIG. 4) and sends the data to the CPU 1100, and the data generated by the CPU 1100 is transmitted to another device via the communication network 500. Send to the device.

ＣＰＵ１１００は、入出力インターフェイス１６００を介して、ディスプレイやプリンタ等の出力装置、および、キーボードやマウス等の入力装置を制御する。ＣＰＵ１１００は、入出力インターフェイス１６００を介して、入力装置からデータを取得する。また、ＣＰＵ１１００は、生成したデータを、入出力インターフェイス１６００を介して出力装置へ出力する。 The CPU 1100 controls an output device such as a display or a printer, and an input device such as a keyboard or a mouse via the input / output interface 1600. The CPU 1100 acquires data from the input device via the input / output interface 1600. Further, the CPU 1100 outputs the generated data to the output device via the input / output interface 1600.

メディアインターフェイス１７００は、記録媒体１８００に格納されたプログラムまたはデータを読み取り、ＲＡＭ１２００を介してＣＰＵ１１００に提供する。ＣＰＵ１１００は、当該プログラムを、メディアインターフェイス１７００を介して記録媒体１８００からＲＡＭ１２００上にロードし、ロードしたプログラムを実行する。記録媒体１８００は、例えばＤＶＤ（Digital Versatile Disc）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto-Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等である。 The media interface 1700 reads a program or data stored in the recording medium 1800 and provides the program or data to the CPU 1100 via the RAM 1200. The CPU 1100 loads the program from the recording medium 1800 onto the RAM 1200 via the media interface 1700, and executes the loaded program. The recording medium 1800 is, for example, an optical recording medium such as a DVD (Digital Versatile Disc) or PD (Phase change rewritable Disk), a magneto-optical recording medium such as MO (Magneto-Optical disk), a tape medium, a magnetic recording medium, or a semiconductor memory. And so on.

例えば、コンピュータ１０００が実施形態に係る情報処理装置１００として機能する場合、コンピュータ１０００のＣＰＵ１１００は、ＲＡＭ１２００上にロードされたプログラムを実行することにより、制御部１３０の各機能を実現する。また、ＨＤＤ１４００には、記憶部１２０内のデータが記憶される。コンピュータ１０００のＣＰＵ１１００は、これらのプログラムを、記録媒体１８００から読み取って実行するが、他の例として、他の装置から、通信網５００を介してこれらのプログラムを取得してもよい。 For example, when the computer 1000 functions as the information processing apparatus 100 according to the embodiment, the CPU 1100 of the computer 1000 realizes each function of the control unit 130 by executing a program loaded on the RAM 1200. Further, the data in the storage unit 120 is stored in the HDD 1400. The CPU 1100 of the computer 1000 reads and executes these programs from the recording medium 1800, but as another example, these programs may be acquired from another device via the communication network 500.

〔６．その他〕
また、上記実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文書中や図面中で示した処理手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。例えば、各図に示した各種情報は、図示した情報に限られない。 [6. others〕
Further, among the processes described in the above-described embodiment, all or a part of the processes described as being automatically performed can be manually performed, or the processes described as being manually performed can be performed. All or part of it can be done automatically by a known method. In addition, information including processing procedures, specific names, various data and parameters shown in the above documents and drawings can be arbitrarily changed unless otherwise specified. For example, the various information shown in each figure is not limited to the information shown in the figure.

また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 Further, each component of each of the illustrated devices is a functional concept, and does not necessarily have to be physically configured as shown in the figure. That is, the specific form of distribution / integration of each device is not limited to the one shown in the figure, and all or part of them may be functionally or physically distributed / physically in any unit according to various loads and usage conditions. Can be integrated and configured.

例えば、図５に示した取得部１３１と提供部１３５とは、統合されてもよい。また、例えば、抽出部１３２と推定部１３３とは、統合されてもよい。また、例えば、記憶部１２０に記憶される情報は、ネットワークＮを介して、外部に備えられた所定の記憶装置に記憶されてもよい。 For example, the acquisition unit 131 and the provision unit 135 shown in FIG. 5 may be integrated. Further, for example, the extraction unit 132 and the estimation unit 133 may be integrated. Further, for example, the information stored in the storage unit 120 may be stored in a predetermined storage device provided externally via the network N.

また、上記実施形態では、情報処理装置１００が、例えば、ユーザＵによって入力された第１クエリによる検索要求に基づいて第１クエリとは異なる第２クエリを用いた検索結果が提供された場合のユーザＵの検索行動に関する履歴情報を取得する取得処理と、取得された履歴情報に基づいて検索行動に際してのコンテキストを含む検索行動の特徴情報を抽出する抽出処理と、抽出された特徴情報の変化に基づいて、第１クエリと第２クエリとの間の関係性を推定する推定処理と、を行う例を示した。しかし、上述した情報処理装置１００は、取得処理を行う取得装置と、抽出処理を行う抽出装置と、推定処理を行う推定装置とが分離されてもよい。この場合、取得装置は、少なくとも取得部１３１を有する。抽出装置は、少なくとも抽出部１３２を有する。推定装置は、少なくとも推定部１３３を有する。そして、上記の情報処理装置１００による処理は、取得装置と、抽出装置と、推定装置との各装置を含む情報処理システム１によって実現される。 Further, in the above embodiment, when the information processing apparatus 100 is provided with a search result using a second query different from the first query based on the search request by the first query input by the user U, for example. For acquisition processing to acquire history information related to the search behavior of user U, extraction processing to extract feature information of the search behavior including the context of the search behavior based on the acquired history information, and change of the extracted feature information. Based on this, an example of performing estimation processing for estimating the relationship between the first query and the second query is shown. However, in the information processing device 100 described above, the acquisition device that performs the acquisition process, the extraction device that performs the extraction process, and the estimation device that performs the estimation process may be separated. In this case, the acquisition device has at least the acquisition unit 131. The extraction device has at least an extraction unit 132. The estimation device has at least an estimation unit 133. The processing by the information processing device 100 is realized by the information processing system 1 including the acquisition device, the extraction device, and the estimation device.

また、上述してきた実施形態及び変形例は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。 Further, the above-described embodiments and modifications can be appropriately combined as long as the processing contents do not contradict each other.

〔７．効果〕
実施形態に係る情報処理装置１００は、取得部１３１と、抽出部１３２と、推定部１３３とを備える。取得部１３１は、ユーザＵによって入力された第１クエリによる検索要求に基づいて第１クエリとは異なる第２クエリを用いた検索結果が提供された場合のユーザＵの検索行動に関する履歴情報を取得する。抽出部１３２は、取得部１３１によって取得された履歴情報に基づいて上記検索行動に際してのコンテキストを含む上記検索行動の特徴情報を抽出する。推定部１３３は、抽出部１３２によって抽出された特徴情報の変化に基づいて、第１クエリと第２クエリとの間の関係性を推定する。 [7. effect〕
The information processing apparatus 100 according to the embodiment includes an acquisition unit 131, an extraction unit 132, and an estimation unit 133. The acquisition unit 131 acquires history information regarding the search behavior of the user U when a search result using a second query different from the first query is provided based on the search request by the first query input by the user U. do. The extraction unit 132 extracts the feature information of the search behavior including the context for the search behavior based on the history information acquired by the acquisition unit 131. The estimation unit 133 estimates the relationship between the first query and the second query based on the change in the feature information extracted by the extraction unit 132.

したがって、実施形態に係る情報処理装置１００によれば、クエリ間の関係性を適切に推定することができる。 Therefore, according to the information processing apparatus 100 according to the embodiment, the relationship between queries can be appropriately estimated.

また、推定部１３３は、特徴情報に含まれる上記検索行動に関する統計的な特徴の時系列上の変化に基づいて、第１クエリと第２クエリとの間の関係性を推定する。 In addition, the estimation unit 133 estimates the relationship between the first query and the second query based on the time-series changes in the statistical features related to the search behavior included in the feature information.

したがって、実施形態に係る情報処理装置１００によれば、統計的な特徴の時系列上の変化に基づいて、クエリ間の関係性を適切に推定することができる。 Therefore, according to the information processing apparatus 100 according to the embodiment, the relationship between queries can be appropriately estimated based on the time-series changes in statistical features.

また、推定部１３３は、特徴情報に含まれるコンテキストの分布のばらつき度合いに基づいて、第１クエリと第２クエリとの間の関係性を推定する。 Further, the estimation unit 133 estimates the relationship between the first query and the second query based on the degree of variation in the distribution of the context included in the feature information.

したがって、実施形態に係る情報処理装置１００によれば、コンテキストの分布のばらつき度合いに基づいて、クエリ間の関係性を適切に推定することができる。 Therefore, according to the information processing apparatus 100 according to the embodiment, the relationship between queries can be appropriately estimated based on the degree of variation in the distribution of the context.

また、推定部１３３は、時点Ｔ１（「第１時点」の一例に相当）における上記ばらつき度合いと、時点Ｔ１よりも後の時点Ｔ２（「第２時点」の一例に相当）における上記ばらつき度合いとを比較することによって、第１クエリと第２クエリとの間の関係性を推定する。 Further, the estimation unit 133 determines the degree of variation at the time point T1 (corresponding to an example of the "first time point") and the degree of variation at the time point T2 after the time point T1 (corresponding to an example of the "second time point"). Estimate the relationship between the first query and the second query by comparing.

したがって、実施形態に係る情報処理装置１００によれば、時点Ｔ１における上記ばらつき度合いと、時点Ｔ２における上記ばらつき度合いとの比較により、クエリ間の関係性を適切に推定することができる。 Therefore, according to the information processing apparatus 100 according to the embodiment, the relationship between the queries can be appropriately estimated by comparing the degree of variation at the time point T1 with the degree of variation at the time point T2.

また、推定部１３３は、時点Ｔ１から時点Ｔ２にかけての上記ばらつき度合いの変化量が所定の閾値以上である場合に、第１クエリが既知の用法とは異なる新規の用法で用いられていると推定する。 Further, the estimation unit 133 estimates that the first query is used in a new usage different from the known usage when the amount of change in the degree of variation from the time point T1 to the time point T2 is equal to or more than a predetermined threshold value. do.

したがって、実施形態に係る情報処理装置１００によれば、上記ばらつき度合いに基づいて、第１クエリが既知の用法とは異なる新規の用法で用いられていると推定することができる。また、特定の文字列に新たな特徴が表出してくるのを検出できることによって、ターゲティング等の各種分析や、いわゆる「バズる」といった事象発生の検出等に活用することができる。 Therefore, according to the information processing apparatus 100 according to the embodiment, it can be estimated that the first query is used in a new usage different from the known usage based on the above degree of variation. In addition, by being able to detect the appearance of a new feature in a specific character string, it can be utilized for various analyzes such as targeting and detection of the occurrence of an event such as so-called "buzz".

以上、本願の実施形態を図面に基づいて詳細に説明したが、これは例示であり、発明の開示の欄に記載の態様を始めとして、当業者の知識に基づいて種々の変形、改良を施した他の形態で本発明を実施することが可能である。 The embodiments of the present application have been described in detail with reference to the drawings, but this is an example, and various modifications and improvements are made based on the knowledge of those skilled in the art, including the embodiments described in the disclosure column of the invention. It is possible to carry out the present invention in other forms described above.

また、上述してきた「部（section、module、unit）」は、「手段」や「回路」などに読み替えることができる。例えば、取得部は、取得手段や取得回路に読み替えることができる。 Further, the above-mentioned "section, module, unit" can be read as "means" or "circuit". For example, the acquisition unit can be read as an acquisition means or an acquisition circuit.

１情報処理システム
１０ユーザ端末
１００情報処理装置
１１０通信部
１２０記憶部
１２１履歴情報記憶部
１２２特徴情報記憶部
１２３推定情報記憶部
１３０制御部
１３１取得部
１３２抽出部
１３３推定部
１３４処理部
１３５提供部 1 Information processing system 10 User terminal 100 Information processing device 110 Communication unit 120 Storage unit 121 History information storage unit 122 Feature information storage unit 123 Estimated information storage unit 130 Control unit 131 Acquisition unit 132 Extraction unit 133 Estimating unit 134 Processing unit 135 Providing unit

Claims

An acquisition unit that acquires history information regarding the user's search behavior when a search result using a second query different from the first query is provided based on a search request by the first query input by the user.
An extraction unit that extracts characteristic information of the search behavior including a context for the search behavior based on the history information acquired by the acquisition unit, and an extraction unit.
An information processing apparatus including an estimation unit that estimates a relationship between the first query and the second query based on a change in the feature information extracted by the extraction unit.

The estimation unit
Claim 1 is characterized in that the relationship between the first query and the second query is estimated based on the time-series changes in the statistical features of the search behavior included in the feature information. The information processing device described in.

The estimation unit
The information processing apparatus according to claim 2, wherein the relationship between the first query and the second query is estimated based on the degree of variation in the distribution of the context included in the feature information.

The estimation unit
To estimate the relationship between the first query and the second query by comparing the degree of variation at the first time point with the degree of variation at the second time point after the first time point. The information processing apparatus according to claim 3.

The estimation unit
When the amount of change in the degree of variation from the first time point to the second time point is equal to or greater than a predetermined threshold value, it is estimated that the first query is used in a new usage different from the known usage. The information processing apparatus according to claim 4.

It is an information processing method executed by a computer.
An acquisition process for acquiring history information regarding the user's search behavior when a search result using a second query different from the first query is provided based on a search request by the first query input by the user.
An extraction step of extracting characteristic information of the search behavior including a context for the search behavior based on the history information acquired in the acquisition step, and an extraction step.
An information processing method comprising an estimation step of estimating a relationship between the first query and the second query based on a change in the feature information extracted in the extraction step.

An acquisition procedure for acquiring history information regarding the user's search behavior when a search result using a second query different from the first query is provided based on a search request by the first query input by the user, and an acquisition procedure.
An extraction procedure for extracting characteristic information of the search behavior including a context for the search behavior based on the history information acquired by the acquisition procedure, and an extraction procedure.
An information processing program characterized by causing a computer to execute an estimation procedure for estimating a relationship between the first query and the second query based on a change in the feature information extracted by the extraction procedure. ..