JP7209297B2

JP7209297B2 - Preference analyzer

Info

Publication number: JP7209297B2
Application number: JP2019197278A
Authority: JP
Inventors: 智彦野口; 拓見安増; 佑西原; 重和近藤
Original assignee: Nomura Research Institute Ltd
Current assignee: Nomura Research Institute Ltd
Priority date: 2019-10-30
Filing date: 2019-10-30
Publication date: 2023-01-20
Anticipated expiration: 2039-10-30
Also published as: JP2021071853A

Description

本発明は、ユーザの嗜好を分析する技術、に関する。 The present invention relates to technology for analyzing user preferences.

インターネットには２０億近いウェブサイトが存在するといわれる。情報の豊富化により、目的とする情報がインターネットに存在する可能性が高まったともいえるが、最適な情報を見つけるのが難しくなったともいえる。加えて、ユーザは自分が本当はどんな情報を求めているのかを明確に認識あるいは表現できないことも多い。 It is said that there are nearly 2 billion websites on the Internet. It can be said that the possibility that the desired information exists on the Internet has increased due to the abundance of information, but it can also be said that it has become difficult to find the most suitable information. Additionally, users often fail to clearly recognize or express what information they really want.

このような現状において、インターネットにおける行動履歴からユーザの好みや興味、思想、信念などの嗜好を分析する研究が盛んに行われている。ユーザの嗜好を適切に分析し、分析結果に基づいて情報フィルタリングをすれば、ユーザは自分が望んでいた情報にたどり着きやすくなる。また、ターゲティング広告の精度をいっそう向上させることもできると考えられる。 Under such circumstances, studies are actively being conducted to analyze preferences such as user preferences, interests, thoughts, beliefs, etc. from behavior histories on the Internet. Appropriate analysis of user preferences and filtering of information based on the analysis results will make it easier for users to find the information they desire. It is also believed that the accuracy of targeting advertisements can be further improved.

特開２００３－１４１１８０号公報Japanese Patent Application Laid-Open No. 2003-141180 特開平９－２４５０５８号公報JP-A-9-245058

本発明者らは、ユーザの嗜好（思考傾向）をより深く分析するためには、ユーザ自身すら認識していない思考や感情を形成する素因、いいかえれば、ユーザの潜在意識に隠れている「思考の素」を引き出し、それを言語化する必要があると考える。 In order to deeply analyze the user's preferences (thinking tendencies), the present inventors have investigated the predispositions that form thoughts and feelings that even the user is not aware of, in other words, the "thoughts" hidden in the subconscious of the user. I think that it is necessary to draw out the “source of the essence” and verbalize it.

本発明は、上記課題認識に基づいて完成された発明であり、その主たる目的は、ユーザとの対話を通してユーザの嗜好を分析するための技術、を提供することにある。 The present invention has been completed based on the recognition of the above problems, and its main purpose is to provide a technique for analyzing user preferences through dialogue with the user.

本発明のある態様における嗜好分析装置は、分析対象となるユーザから、基準用語から想起される派生用語の入力を受け付ける用語取得部と、基準用語と派生用語を関連づけることにより、用語の関連性を示す用語関連情報を更新する関連設定部と、ユーザにとっての重要用語を特定する重要用語特定部と、を備える。
関連設定部は、派生用語を次の基準用語とした上で新たな派生用語が取得されたときに用語関連情報を更新する。
重要用語特定部は、用語関連情報に含まれる複数の用語それぞれの出現態様に基づいて複数の用語から重要用語を特定する。 A preference analysis device according to an aspect of the present invention includes: a term acquiring unit that receives input of derived terms evoked from reference terms by a user to be analyzed; It includes a related setting unit that updates the term-related information to be displayed, and an important term specifying unit that specifies important terms for the user.
The relationship setting unit updates the term-related information when a new derived term is acquired after setting the derived term as the next reference term.
The important term identification unit identifies important terms from a plurality of terms based on the appearance of each of the plurality of terms included in the term-related information.

本発明によれば、ユーザの嗜好を深く分析しやすくなる。 ADVANTAGE OF THE INVENTION According to this invention, it becomes easy to deeply analyze a user's taste.

図１（ａ）は、サーバ型における嗜好分析方法を説明するための模式図である。図１（ｂ）は、エッジ型における嗜好分析方法を説明するための模式図である。FIG. 1(a) is a schematic diagram for explaining a preference analysis method in a server type. FIG. 1(b) is a schematic diagram for explaining a preference analysis method in the edge type. 本実施形態における文書情報の選別方法を説明するための模式図である。FIG. 4 is a schematic diagram for explaining a document information sorting method according to the embodiment; ユーザ端末の機能ブロック図である。3 is a functional block diagram of a user terminal; FIG. 対話画面の画面図である。It is a screen figure of a dialog screen. 用語関連情報の生成過程を示す模式図である。FIG. 4 is a schematic diagram showing a process of generating term-related information; 用語関連情報の構造を示す模式図である。FIG. 4 is a schematic diagram showing the structure of term-related information; 重要用語の評価座標を示す図である。It is a figure which shows the evaluation coordinate of an important term. ユーザ分析過程を示すフローチャートである。4 is a flow chart showing a user analysis process; 文書情報の評価座標を示す図である。FIG. 4 is a diagram showing evaluation coordinates of document information; 文書分析過程および適合判定過程のフローチャートである。4 is a flow chart of a document analysis process and a match determination process;

本実施形態においては、ユーザとの対話を通してユーザの嗜好（思考傾向）を分析し、その嗜好に適合するコンテンツ（文書情報）を選別する方法について説明する。まず、ユーザの嗜好を分析するシステムとして、サーバ型とエッジ型の２種類の構成について説明する。なお、本実施形態においてはエッジ型を想定する。 In this embodiment, a method of analyzing the user's preference (thinking tendency) through dialogue with the user and selecting content (document information) that matches the preference will be described. First, as a system for analyzing user preferences, two types of configurations, a server type and an edge type, will be described. Note that the edge type is assumed in this embodiment.

図１（ａ）は、サーバ型における嗜好分析方法を説明するための模式図である。
複数の企業（サービサー）は複数のサーバ１０２（自社サイト）から文書情報を発信する。ここでいう「文書情報」は、広告、論評、紹介、ニュースなどに関するテキスト情報を含むデジタルコンテンツであればよい。図１において、企業Ａはサーバ１０２ａを運営し、企業Ｂはサーバ１０２ｂを運営する。企業Ａは、主として、ユーザＰ１のサーバ１０２ａに対するアクセス履歴から、ユーザＰ１の性別、年齢、注文履歴、アクセス時間帯などの情報を抽出し、これをユーザデータとして外部データベース１０４ａに蓄積する。ユーザデータに基づいて、企業ＡはユーザＰ１の嗜好、たとえば、どんなことに興味をもっているか、どんなものが好きなのか、どんなことを気にするのか、といった情報を分析する。企業Ｂも、ユーザＰ１のサーバ１０２ｂに対するアクセス履歴に基づいて、外部データベース１０４ｂにユーザデータを独自に蓄積する。 FIG. 1(a) is a schematic diagram for explaining a preference analysis method in a server type.
A plurality of companies (servicers) transmit document information from a plurality of servers 102 (own company sites). The "document information" referred to here may be any digital content including text information relating to advertisements, comments, introductions, news, and the like. In FIG. 1, company A operates server 102a and company B operates server 102b. Company A mainly extracts information such as user P1's sex, age, order history, and access time zone from the access history of user P1 to server 102a, and stores this as user data in external database 104a. Based on the user data, company A analyzes user P1's preferences, for example, information about what he/she is interested in, what he likes, and what he cares about. Company B also independently accumulates user data in the external database 104b based on the access history of the user P1 to the server 102b.

企業Ａは、自社の外部データベース１０４ａに蓄積されるユーザデータに基づいて、ユーザＰ１が興味を持つと思われる文書情報を選び、ユーザ端末１００に送信する。ユーザ端末１００は、スマートフォンなどの携帯端末であってもよいし、ラップトップコンピュータあるいはタブレットコンピュータであってもよい。企業Ｂも同様である。このようにサーバ１０２がユーザデータを管理し、ユーザデータからユーザの嗜好を分析し、サーバ１０２がユーザに提供すべき情報を選ぶ方式が現在の主流である。 Company A selects document information that may be of interest to user P1 based on the user data accumulated in the company's external database 104a, and transmits the document information to user terminal 100. FIG. The user terminal 100 may be a mobile terminal such as a smart phone, a laptop computer, or a tablet computer. The same applies to Company B. In this manner, the current mainstream method is that the server 102 manages user data, analyzes user preferences from the user data, and selects information to be provided to the user by the server 102 .

サーバ型の場合、個人情報が企業に収集されることになるため、ユーザは抵抗感や警戒感をもちやすい。また、ユーザはサーバ１０２ａ以外にもさまざまなサーバ１０２にアクセスするため、１つの企業がユーザデータを漏れなく集めることは不可能に近い。この結果として、企業からユーザＰ１に提供される文書情報とユーザＰ１の嗜好にずれが生じる可能性がある。サーバ型の場合、ユーザは一方的に興味のない文書情報を押し付けられているかのような気持ちになることが多い。 In the case of the server type, personal information is collected by the company, so users tend to feel resistance and caution. In addition, since users access various servers 102 in addition to the server 102a, it is nearly impossible for one company to collect all user data. As a result, there is a possibility that the document information provided by the company to the user P1 and the preference of the user P1 will be different. In the case of the server type, the user often feels as if he/she is being forced to unilaterally receive uninteresting document information.

図１（ｂ）は、エッジ型における嗜好分析方法を説明するための模式図である。
エッジ型においては、ユーザ端末１００にコンシェルジュとして機能するソフトウェア（以下、「コンシェルジュ機能」とよぶ）をインストールする。コンシェルジュ機能を搭載したユーザ端末１００が「嗜好分析装置」として機能する。コンシェルジュは、いわゆるチャットボット（Chatbot）であり、ユーザとの対話を通して得られたユーザデータをユーザ端末１００（内部データベース１０６）に蓄積する。内部データベース１０６はユーザ端末１００が内蔵する不揮発性メモリである。ユーザ端末１００のコンシェルジュ機能は、定期的にサーバ１０２を巡回し、文書情報をダウンロードして検証し、ユーザに提供すべき情報か否かを判断する。 FIG. 1(b) is a schematic diagram for explaining a preference analysis method in the edge type.
In the edge type, software that functions as a concierge (hereinafter referred to as “concierge function”) is installed in the user terminal 100 . A user terminal 100 equipped with a concierge function functions as a "preference analyzer". The concierge is a so-called chatbot, and accumulates user data obtained through interaction with the user in the user terminal 100 (internal database 106). The internal database 106 is a non-volatile memory built into the user terminal 100 . The concierge function of the user terminal 100 periodically visits the server 102, downloads and verifies the document information, and determines whether or not the information should be provided to the user.

エッジ型の場合、コンシェルジュ機能は、内部データベース１０６に蓄積されたユーザデータに基づいてユーザの嗜好を分析する。ユーザは個人情報を外部に渡す必要はない。行動の相手となる多数のサーバ１０２ではなく、行動の起点となる唯一のユーザ端末１００においてユーザデータを収集するため、サーバ型よりもよりユーザデータを集約および充実させやすく、高精度な情報フィルタリングが可能である。 In the edge type, the concierge function analyzes user preferences based on user data accumulated in internal database 106 . Users do not need to pass personal information to the outside. Since the user data is collected by the single user terminal 100 that is the starting point of the action, rather than by a large number of servers 102 that act as counterparts, it is easier to collect and enrich the user data than in the server type, and high-precision information filtering is possible. It is possible.

図２は、本実施形態における文書情報の選別方法を説明するための模式図である。
ユーザ端末１００（嗜好分析装置）は、ユーザの嗜好を分析するとともにサーバ１０２からさまざまな文書情報を取得する。ユーザ端末１００は、文書情報がユーザに嗜好に適合していれば、いいかえれば、この文書情報がユーザに求められている情報であると推測されるならば文書情報をユーザに提示し、適合していなければ提示しない。文書情報の選別には（１）ユーザの嗜好を分析するユーザ分析過程（２）文書情報の特性を分析する文書分析過程（３）ユーザと文書情報の適合性を判定する適合判定過程の３つの処理過程が含まれる。まず、３種類の処理の概要を説明し、詳細については図４以降に関連して後述する。 FIG. 2 is a schematic diagram for explaining a document information sorting method according to the present embodiment.
The user terminal 100 (preference analysis device) analyzes the user's preferences and acquires various document information from the server 102 . The user terminal 100 presents the document information to the user if the document information matches the user's taste, in other words, if the document information is presumed to be the information that the user wants. If not, do not present. There are three processes for selecting document information: (1) a user analysis process for analyzing user preferences, (2) a document analysis process for analyzing characteristics of document information, and (3) a match determination process for determining compatibility between users and document information. Includes processing. First, an outline of the three types of processing will be described, and details will be described later with reference to FIG. 4 and subsequent figures.

（１）ユーザ分析過程
まず、ユーザは、ユーザ端末１００において「用語」を入力する。詳細は後述するが自由連想方式にて思いついた用語をユーザはユーザ端末１００に次々と入力する。ユーザ端末１００は用語と用語のつながりを示す「用語関連情報」をつくる。この用語関連情報はユーザの心の中に浮かんだ言葉のつながりを示す情報であり、いわば、「心の中のネットワーク」を模した情報である。用語関連情報については図５、図６に関連して詳述する。ユーザ端末１００は、用語関連情報からユーザの嗜好を特徴づける「重要用語」を抽出する。重要用語は、ユーザが嗜好を形成するときの重要な素因（思考の素）となっていると想定される概念を言語化したものである。次に、ユーザは重要用語を評価する。このようにユーザ分析過程においては、重要用語を見つけ出し、その重要用語に対するユーザの評価（印象）を設定する。 (1) User Analysis Process First, the user inputs a “term” on the user terminal 100 . Although the details will be described later, the user inputs terms one after another into the user terminal 100 by using the free association method. The user terminal 100 creates "term-related information" that indicates connections between terms. This term-related information is information indicating connections between words that come to mind in the user's mind, and is, so to speak, information that simulates a "mental network." Term-related information will be described in detail with reference to FIGS. The user terminal 100 extracts "important terms" that characterize the user's preferences from the term-related information. An important term is a verbalization of a concept assumed to be an important predisposing factor (element of thought) when a user forms a preference. The user then rates the key terms. Thus, in the user analysis process, important terms are found and the user's evaluation (impression) of the important terms is set.

（２）文書分析過程
企業はサーバ１０２からさまざまな文書情報を提供する。ユーザ端末１００はサーバ１０２から文書情報をウェブページとして取得し、文書情報の内容を特徴づける用語として「特徴用語」を抽出する。いいかえれば、ユーザ端末１００は、１以上の特徴用語から文書情報の特性を定義する。 (2) Document Analysis Process The company provides various document information from the server 102 . The user terminal 100 acquires document information as a web page from the server 102 and extracts "feature terms" as terms that characterize the content of the document information. In other words, the user terminal 100 defines characteristics of document information from one or more feature terms.

（３）適合判定過程
ユーザ端末１００は、ユーザの嗜好を重要用語とその評価値に基づいて定義し、文書情報の特性を特徴用語に基づいて定義し、それぞれを比較することで文書情報とユーザとの適合性を判定する。たとえば、ユーザが強い興味を抱いている重要用語に類似する特徴用語を含む文書情報は適合性が高いと判断される。ユーザ端末１００は、ユーザの嗜好に適合すると考えられる文書情報を提示対象とすることにより、膨大な文書情報をフィルタリングする。 (3) Conformity Judgment Process The user terminal 100 defines user preferences based on important terms and their evaluation values, defines characteristics of document information based on feature terms, and compares the document information and the user. Determine compatibility with For example, document information containing feature terms similar to important terms that the user has a strong interest in is judged to have high relevance. The user terminal 100 filters a large amount of document information by presenting document information that is considered to match the user's taste.

図３は、ユーザ端末１００の機能ブロック図である。
ユーザ端末１００の各構成要素は、ＣＰＵ（Central Processing Unit）および各種コプロセッサなどの演算器、メモリやストレージといった記憶装置、それらを連結する有線または無線の通信線を含むハードウェアと、記憶装置に格納され、演算器に処理命令を供給するソフトウェアによって実現される。コンピュータプログラムは、デバイスドライバ、オペレーティングシステム、それらの上位層に位置する各種アプリケーションプログラム、また、これらのプログラムに共通機能を提供するライブラリによって構成されてもよい。以下に説明する各ブロックは、ハードウェア単位の構成ではなく、機能単位のブロックを示している。 FIG. 3 is a functional block diagram of the user terminal 100. As shown in FIG.
Each component of the user terminal 100 includes computing units such as a CPU (Central Processing Unit) and various coprocessors, storage devices such as memory and storage, hardware including wired or wireless communication lines connecting them, and storage devices. It is implemented by software that is stored and provides processing instructions to the calculator. A computer program may consist of a device driver, an operating system, various application programs located in their higher layers, and a library that provides common functions to these programs. Each block described below represents a functional block rather than a hardware configuration.

ユーザ端末１００は、ユーザインタフェース処理部１１０、通信部１１４、データ処理部１１２およびデータ格納部１１６を含む。
ユーザインタフェース処理部１１０は、タッチパネルを介してユーザからの操作を受け付けるほか、画像表示や音声出力など、ユーザインタフェースに関する処理を担当する。通信部１１４は、インターネットを介してサーバ１０２や他のユーザ端末１００との通信処理を担当する。データ格納部１１６は各種データを格納する。データ処理部１１２は、ユーザインタフェース処理部１１０や通信部１１４により取得されたデータ、データ格納部１１６に格納されているデータに基づいて各種処理を実行する。データ処理部１１２は、ユーザインタフェース処理部１１０、通信部１１４およびデータ格納部１１６のインタフェースとしても機能する。 User terminal 100 includes user interface processing unit 110 , communication unit 114 , data processing unit 112 and data storage unit 116 .
The user interface processing unit 110 receives operations from the user via the touch panel, and is in charge of user interface processing such as image display and audio output. The communication unit 114 is in charge of communication processing with the server 102 and other user terminals 100 via the Internet. The data storage unit 116 stores various data. The data processing unit 112 executes various processes based on data acquired by the user interface processing unit 110 and the communication unit 114 and data stored in the data storage unit 116 . Data processing unit 112 also functions as an interface for user interface processing unit 110 , communication unit 114 and data storage unit 116 .

ユーザインタフェース処理部１１０は、ユーザからの入力を受け付ける入力部１１８と、ユーザに対して画像や音声等の各種情報を出力する出力部１２０を含む。 The user interface processing unit 110 includes an input unit 118 that receives input from a user, and an output unit 120 that outputs various information such as images and sounds to the user.

入力部１１８は、用語取得部１２２と評価入力部１２４を含む。
用語取得部１２２は、ユーザから用語の入力を受け付ける。評価入力部１２４は、ユーザから重要用語に対する評価値の入力を受け付ける。 The input unit 118 includes a term acquisition unit 122 and an evaluation input unit 124 .
The term acquiring unit 122 receives term input from the user. The evaluation input unit 124 receives an input of an evaluation value for important terms from the user.

通信部１１４は、データを送信する送信部１３８と、データを受信する受信部１４０を含む。受信部１４０は文書取得部１４２を含む。文書取得部１４２は、インターネットを巡回し、さまざまなサーバ１０２からウェブページ（文書情報）を自動的にダウンロードする。 Communication unit 114 includes a transmission unit 138 that transmits data and a reception unit 140 that receives data. Reception unit 140 includes document acquisition unit 142 . The document acquisition unit 142 crawls the Internet and automatically downloads web pages (document information) from various servers 102 .

データ処理部１１２は、関連設定部１２６、重要用語特定部１２８、重要用語評価部１３０、特徴用語抽出部１３２、特徴用語評価部１３４および適合判定部１３６を含む。
関連設定部１２６は、用語と用語の関連性（リンク）を設定し、用語関連情報を更新する（図５、図６に関連して後述）。用語関連情報はデータ格納部１１６に格納される。重要用語特定部１２８は、用語関連情報に基づいて重要用語を抽出する（図６に関連して後述）。重要用語評価部１３０は、ユーザからの入力にしたがって重要用語の評価値を設定する（図７に関連して後述）。特徴用語抽出部１３２は、文書情報から特徴用語を抽出する（図９に関連して後述）。特徴用語評価部１３４は、特徴用語の評価値を計算する（図９に関連して後述）。適合判定部１３６は、文書情報のユーザに対する適合性を判定する（図９に関連して後述）。 The data processing unit 112 includes a relation setting unit 126 , an important term specifying unit 128 , an important term evaluating unit 130 , a characteristic term extracting unit 132 , a characteristic term evaluating unit 134 and a conformity determining unit 136 .
The relationship setting unit 126 sets relationships (links) between terms and updates term-related information (described later with reference to FIGS. 5 and 6). The term-related information is stored in data storage unit 116 . The important term identification unit 128 extracts important terms based on the term-related information (described later with reference to FIG. 6). The important term evaluation unit 130 sets the evaluation value of the important term according to the input from the user (described later with reference to FIG. 7). The feature term extraction unit 132 extracts feature terms from the document information (described later with reference to FIG. 9). The feature term evaluation unit 134 calculates the evaluation value of the feature term (described later with reference to FIG. 9). The suitability determination unit 136 determines suitability of the document information for the user (described later with reference to FIG. 9).

図４は、対話画面１５０の画面図である。
ユーザは、ユーザ端末１００においてコンシェルジュ機能を起動したとき、出力部１２０は対話画面１５０を表示させる。ユーザは執事１５２（キャラクタ）と対話する。ユーザと執事１５２の対話を通してコンシェルジュ機能はユーザの嗜好を探る。 FIG. 4 is a screen diagram of the dialogue screen 150. As shown in FIG.
When the user activates the concierge function on the user terminal 100, the output unit 120 causes the interactive screen 150 to be displayed. A user interacts with butler 152 (a character). Through interaction between the user and butler 152, the concierge function explores the user's preferences.

図４の対話画面１５０においては、執事１５２はユーザに対して「気になるモノはありますか？」という質問を投げかけている。これに対して、ユーザは用語入力領域１５４に「沖縄」と入力している。ユーザが返答ボタン１５６をタッチすると、ユーザの気になる用語として「沖縄」が登録される。 On the interactive screen 150 of FIG. 4, the butler 152 asks the user, "Is there anything that interests you?" On the other hand, the user has entered "Okinawa" in the term input area 154 . When the user touches the reply button 156, "Okinawa" is registered as a term of interest to the user.

続いて、執事１５２は「沖縄といえば何ですか？」「沖縄の何が気になるの？」「沖縄の好きなところは何？」「沖縄の芸能人といえば誰？」のように「沖縄」にまつわる質問をユーザに投げかける。ユーザは、たとえば、「沖縄といえば何ですか？」という質問に対して「ミミガー」と答えたとする。この場合、用語「沖縄」は連想の起点となる用語（以下、「基準用語」という）であり、用語「ミミガー」は基準用語「沖縄」から想起される用語（以下、「派生用語」という）である。関連設定部１２６は、基準用語「沖縄」と派生用語「ミミガー」に関連性（リンク）を設定して用語関連情報に登録する。 Then, butler 152 asked, "What do you think of Okinawa?" "What do you like about Okinawa?" "What do you like about Okinawa?" ” to users. For example, it is assumed that the user answers "Mimigar" to the question "What do you think of Okinawa?". In this case, the term ``Okinawa'' is the term that serves as the starting point of association (hereinafter referred to as the ``standard term''), and the term ``Mimigar'' is a term evoked from the standard term ``Okinawa'' (hereinafter referred to as the ``derivative term''). is. The relation setting unit 126 sets a relation (link) between the reference term “Okinawa” and the derivative term “Mimigar” and registers them in the term relation information.

次に、執事１５２は「ミミガー」を新たな基準用語として同様の質問をする。ユーザが「ミミガー」から想起される用語として「肉」と答えたとき、用語「肉」は基準用語「ミミガー」の派生用語として用語関連情報に登録される。このように、基準用語に基づいて派生用語を入力させ、派生用語を新たな基準用語として次の派生用語を入力させる処理を繰り返すことにより、用語取得部１２２はユーザから複数の用語を取得する。また、新たな用語が取得されるごとに、関連設定部１２６は用語関連情報を更新する。 Butler 152 then asks a similar question using "mimigar" as the new reference term. When the user answers "niku" as a term evoked from "mimi ga", the term "niku" is registered in the term-related information as a derived term of the reference term "mimi ga". In this way, by repeating the process of inputting a derived term based on a reference term and inputting the next derived term using the derived term as a new reference term, the term acquiring unit 122 acquires a plurality of terms from the user. Also, each time a new term is acquired, the association setting unit 126 updates the term association information.

図５は、用語関連情報の生成過程を示す模式図である。
ユーザは、「沖縄」から「ミミガー」を想起し（Ｓ１）、「ミミガー」から「肉」を想起し（Ｓ２）、「肉」から「ビール」を想起したとする（Ｓ３）。更に、ユーザは「ビール」から「ウィスキー」を想起し（Ｓ４）、「ウィスキー」から「アイルランド」を想起し（Ｓ６）、「アイルランド」から再び「ビール」を想起したとする（Ｓ６）。ユーザは「ビール」という用語を２回想起しているため、このユーザにとって「ビール」はユーザの嗜好、たとえば、欲しいもの、好きなもの、気になるものを探る上で重要なキーワードである可能性がある。 FIG. 5 is a schematic diagram showing the process of generating term-related information.
It is assumed that the user recalls "mimi ga" from "Okinawa" (S1), "mimi ga" from "mimi ga" (S2), and "beer" from "meat" (S3). Further, the user recalls "whiskey" from "beer" (S4), "Ireland" from "whiskey" (S6), and "beer" from "Ireland" again (S6). Since the user recalls the term "beer" twice, it is possible that "beer" is an important keyword for this user in exploring the user's tastes, for example, what they want, what they like, and what they care about. have a nature.

続いて、「ビール」のあと「夏」（Ｓ７）、「沖縄」（Ｓ８）、「海」（Ｓ９）が早期されている。「沖縄」も再登場した用語であるため、「沖縄」もこのユーザの潜在意識において重要な地位を確立しているキーワードであると考えられる。このように、自由連想の結果として想起されやすい用語が「重要用語の候補」となる。 After "beer", "summer" (S7), "Okinawa" (S8), and "sea" (S9) are played early. Since "Okinawa" is also a reappearing term, "Okinawa" is also considered to be a keyword that has established an important position in the subconscious of this user. In this way, terms that are likely to be recalled as a result of free association are “candidates for important terms”.

図６は、用語関連情報の構造を示す模式図である。
上述したように、ユーザはさまざまな用語をユーザ端末１００に入力する。関連設定部１２６は、用語取得部１２２が用語を取得するごとに基準用語と派生用語の関連（連想関係）を設定することで用語関連情報を充実化させる。最初の基準用語はユーザに自由に入力させてもよいし、関連設定部１２６が執事１５２を介してランダムにテーマを投げかけてもよい。たとえば、執事１５２は最初に「大阪といえば？」と質問し、用語取得部１２２は「大阪」を基準用語とする派生用語を取得してもよい。この場合、最初の基準用語「大阪」はユーザではなく執事１５２（関連設定部１２６）が初期設定している。一方、図４の例に示した「沖縄」のように最初の基準用語をユーザが自由に入力してもよい。 FIG. 6 is a schematic diagram showing the structure of term-related information.
As described above, users enter various terms into user terminal 100 . The relationship setting unit 126 enriches the term-related information by setting a relationship (associative relationship) between the reference term and the derivative term each time the term acquisition unit 122 acquires a term. The first reference term may be freely input by the user, or the related setting unit 126 may randomly propose a theme via the butler 152 . For example, the butler 152 may first ask, "When you think of Osaka?" In this case, the first reference term “Osaka” is initially set by butler 152 (relationship setting unit 126) instead of by the user. On the other hand, the user may freely input the first reference term such as "Okinawa" shown in the example of FIG.

ユーザは、基準用語１から派生用語２を入力し、派生用語２を新たな基準用語２とした上で次の派生用語３を入力する。また、複数種類の基準用語が提供されてもよい。たとえば、用語取得部１２２は、基準用語「大阪」を起点として想起された派生用語群を取得したあと、別のまったく異なる基準用語「木星」を起点として派生用語群を取得してもよい。何回も入力される用語もあれば、１回しか入力されない用語もある。 The user inputs the derivative term 2 from the reference term 1 , sets the derived term 2 as a new reference term 2 , and then inputs the next derived term 3 . Also, multiple types of reference terms may be provided. For example, the term acquiring unit 122 may acquire a derived term group evoked with the reference term “Osaka” as a starting point, and then acquire a derived term group with a completely different reference term “Jupiter” as a starting point. Some terms are entered multiple times, while others are entered only once.

本実施形態においては、重要用語特定部１２８はこれらの用語群から、関連性の数（リンク数）において上位１０位以内であり、かつ、５種類以上の他の用語と関連性を有する用語を「重要用語」として特定する。図６の丸印は用語を示す。図６においては「英国」、「ロック」、「下関」、「韓国」などが多数の関連性を有するため、重要用語として特定される。重要用語は、ユーザが執事１５２との対話を通して想起した膨大な用語群のうち、特に、ユーザの深層心理において重要な地位を占める用語であり、ユーザの嗜好を分析する上で重要な手がかりとなるべきキーワードである。 In this embodiment, the important term identification unit 128 selects terms that are ranked within the top 10 in terms of the number of relationships (number of links) from these term groups and that are related to five or more other terms. Identify as “key terms”. Circles in FIG. 6 indicate terms. In FIG. 6, "UK", "Rock", "Shimonoseki", "Korea", etc. are identified as important terms because they have many relationships. Important terms are terms that occupy an important position in the user's deep psychology, among the huge group of terms that the user recalls through dialogue with the butler 152, and serve as important clues in analyzing the user's preferences. should be a keyword.

図７は、重要用語の評価座標を示す図である。
重要用語特定部１２８は１以上の重要用語を特定し、出力部１２０は重要用語をユーザに示す。ユーザは、重要用語に対して評価値を入力する。たとえば、重要用語「英国」に関して、執事１５２は「英国は好きですか」「英国に興味はありますか」という質問をする。ユーザは、英国が大好きなら「５」、大嫌いなら「－５」、嫌いでも好きでもないなら「０」を入力する。数値入力により、ユーザはどのくらい「英国」が好きかを評価する。評価入力部１２４は、英国の「好き／嫌い」という評価軸に対して、ユーザからの評価値を取得する（以下、「好悪評価」とよぶ）。また、「興味」という評価軸についても同様に評価値を取得する（以下、「興味評価」とよぶ）。 FIG. 7 is a diagram showing evaluation coordinates of important terms.
A key term identifier 128 identifies one or more key terms, and an output component 120 presents the key terms to the user. The user enters rating values for key terms. For example, with respect to the key term "Britain", butler 152 asks the question "Do you like England?" and "Are you interested in England?" The user enters '5' if they love the UK, '-5' if they hate it, and '0' if they do not like or dislike it. Numerical inputs rate how much the user likes "UK". The evaluation input unit 124 acquires an evaluation value from the user with respect to the evaluation axis of "like/dislike" in the UK (hereinafter referred to as "like/dislike evaluation"). Similarly, an evaluation value is obtained for the evaluation axis of "interest" (hereinafter referred to as "interest evaluation").

重要用語評価部１３０は、他の重要用語についても同様にしてユーザからの評価値を取得することにより、重要用語の「評価空間」における座標を特定する。ここでいう「評価空間」とは、さまざまな用語について評価値を座標表現することで形成される概念上の空間である。図７に示す評価空間は好悪評価と興味評価の２軸により構成される二次元空間である。図７によれば、ユーザは「ゴルフ」については少し好きだがそれほど興味があるわけでもなく、「下関」については好きでも嫌いでもないし興味もない。一方、ユーザは「ロック」については強い興味をもつだけでなく大好きであることがわかる。評価空間により、ユーザの深層心理において重要な地位を占める重要用語と、その重要用語に対するユーザの印象が模式的に表現される。 The important term evaluation unit 130 similarly obtains evaluation values from the user for other important terms, thereby specifying the coordinates of the important terms in the “evaluation space”. The “evaluation space” here is a conceptual space formed by expressing evaluation values for various terms in coordinates. The evaluation space shown in FIG. 7 is a two-dimensional space composed of two axes of favorable evaluation and interest evaluation. According to FIG. 7, the user likes "golf" a little, but is not so interested in it, and does not like, dislike, or is not interested in "Shimonoseki". On the other hand, it can be seen that users not only have a strong interest in "rock" but also love it. The evaluation space schematically expresses important terms that occupy an important position in the user's deep psychology and the user's impression of the important terms.

評価空間は、２次元空間である必要はなく、３以上の評価軸によって座標が定められる高次元空間であってもよい。３次元以下のときには、出力部１２０は図７に示す評価空間をグラフィック表示させてもよいが、評価空間をグラフィック表示させることは必須ではない。 The evaluation space need not be a two-dimensional space, and may be a high-dimensional space whose coordinates are defined by three or more evaluation axes. When the dimension is three or less, the output unit 120 may graphically display the evaluation space shown in FIG. 7, but it is not essential to graphically display the evaluation space.

図８は、ユーザ分析過程を示すフローチャートである。
上述したように、ユーザ分析過程においては、執事１５２がユーザに質問を投げかけることでユーザから用語（言葉）を引き出し、重要用語を特定し、重要用語に対するユーザの評価値を取得することでユーザの嗜好を探る。 FIG. 8 is a flow chart showing the user analysis process.
As described above, in the user analysis process, the butler 152 elicits terms (words) from the user by asking questions to the user, identifies important terms, and obtains the user's evaluation value for the important terms. explore your tastes.

用語取得部１２２は、まず、ユーザから用語を取得する（Ｓ１０）。関連設定部１２６は新たな用語が取得されるごとに用語関連情報を更新する（Ｓ１２）。用語入力を続けるときには（Ｓ１４のＮ）、処理はＳ１０に戻り、前回入力された用語から想起される派生用語、または、新たな基準用語が入力される。用語入力を終了するときには（Ｓ１４のＹ）、重要用語特定部１２８は取得済みの用語群から重要用語を特定する（Ｓ１６）。出力部１２０は重要用語をユーザに提示し、ユーザは重要用語に対して評価値を入力する（Ｓ１８）。重要用語評価部１３０は、重要用語とその評価値を対応づけてデータ格納部１１６に登録する。 The term acquiring unit 122 first acquires terms from the user (S10). The association setting unit 126 updates the term association information each time a new term is acquired (S12). When continuing term input (N of S14), the process returns to S10, and a derivative term recalled from the previously input term or a new reference term is input. When term input ends (Y of S14), the important term identification unit 128 identifies important terms from the obtained term group (S16). The output unit 120 presents the important terms to the user, and the user inputs an evaluation value for the important terms (S18). The important term evaluation unit 130 associates the important terms with their evaluation values and registers them in the data storage unit 116 .

図９は、文書情報の評価座標を示す図である。
図９においては、好悪評価軸をｘ座標、興味評価軸をｙ座標として説明する。文書取得部１４２は、「九州の温泉のデジタルアーツ」に関する文書情報Ｄ１を取得したとする。ユーザ端末１００は、この文書情報Ｄ１がユーザにとって適切な情報であるか、いいかえれば、ユーザに提示すべき価値のある情報であるかを判定する。 FIG. 9 is a diagram showing evaluation coordinates of document information.
In FIG. 9, the like/dislike evaluation axis is the x-coordinate, and the interest evaluation axis is the y-coordinate. It is assumed that the document acquisition unit 142 acquires document information D1 related to "digital arts of hot springs in Kyushu". The user terminal 100 determines whether this document information D1 is appropriate information for the user, in other words, whether it is valuable information that should be presented to the user.

まず、特徴用語抽出部１３２は、文書情報Ｄ１から特徴用語を抽出する。特徴用語は、文書情報Ｄ１を特徴づけるキーワードである。まず、特徴用語抽出部１３２は、既知の形態素解析手法により、文書情報Ｄ１から用語（単語）を抽出する。これらの用語それぞれについて特徴用語抽出部１３２は「特異度」を計算する。特異度とは、その用語が文書情報Ｄ１の内容を特徴づける度合いの大きさを示す。特徴用語抽出部１３２は、文書情報Ｄ１で頻出する用語、タイトルや見出しなどの重要部分に含まれる用語に対して高い特異度を設定する。特徴用語抽出部１３２は、ＴＦ／ＩＤＦ法（Term Frequency／Inverse Document Frequency）などの既知の文書解析手法を応用することで文書情報Ｄ１に含まれる用語の特異度を計算してもよい。ＴＦ／ＩＤＦ法においては、一般的な文書群（コーパス）においてはあまり使われていないが文書情報Ｄ１において出現している用語を特異性の高い用語と判断する。 First, the feature term extraction unit 132 extracts feature terms from the document information D1. A characteristic term is a keyword that characterizes the document information D1. First, the feature term extraction unit 132 extracts terms (words) from the document information D1 by a known morphological analysis method. The feature term extraction unit 132 calculates “specificity” for each of these terms. The specificity indicates the extent to which the term characterizes the content of the document information D1. The feature term extraction unit 132 sets a high degree of specificity for terms that frequently appear in the document information D1 and terms that are included in important parts such as titles and headings. The feature term extraction unit 132 may calculate the specificity of terms included in the document information D1 by applying a known document analysis method such as the TF/IDF (Term Frequency/Inverse Document Frequency) method. In the TF/IDF method, terms that are rarely used in general document groups (corpus) but appear in the document information D1 are determined to be highly specific terms.

特徴用語抽出部１３２は、特異度が所定値以上であり、かつ、５位以内となる用語を文書情報の特性を示す特徴用語として特定する。文書情報Ｄ１からは、特徴用語として「九州」「温泉」「デジタルアート」が特定されたとする。 The feature term extraction unit 132 identifies terms having a specificity equal to or higher than a predetermined value and ranked within the top five as feature terms indicating the characteristics of the document information. It is assumed that "Kyushu", "hot spring" and "digital art" are specified as characteristic terms from the document information D1.

次に、特徴用語「九州」の評価空間における座標（評価値）を求める。ここで、重要用語「韓国」の座標を（ｘａ、ｙａ）、重要用語「焼酎」の座標を（ｘｂ、ｙｂ）とする。特徴用語評価部１３４は、まず、特徴用語「九州」と重要用語「韓国」の類似度を計算する。本実施形態においては「word2vec」として知られる既知のアルゴリズムに基づいて、特徴用語評価部１３４は重要用語「韓国」と特徴用語「九州」の類似度を計算する。word2vecに限らず、類似度の計算方法は任意であり、言葉としての意味やカテゴリの近さを示す指標値として定められればよい。 Next, the coordinates (evaluation value) in the evaluation space of the characteristic term "Kyushu" are obtained. Here, let the coordinates of the important term “Korea” be (xa, ya) and the coordinates of the important term “shochu” be (xb, yb). The characteristic term evaluation unit 134 first calculates the degree of similarity between the characteristic term “Kyushu” and the important term “Korea”. Based on a known algorithm known as "word2vec" in this embodiment, the feature term evaluation unit 134 calculates the similarity between the key term "Korea" and the feature term "Kyushu". The similarity calculation method is not limited to word2vec, and any method may be used as long as it is determined as an index value that indicates the meaning of words and the closeness of categories.

特徴用語「九州」と重要用語「韓国」の類似度がｒ１、特徴用語「九州」と重要用語「小中」の類似度はｒ２であったとする。この場合、特徴用語「九州」の座標（ｘｃ，ｙｃ）は、
ｘｃ＝（ｒ１×ｘａ＋ｒ２×ｘｂ）／（ｒ１＋ｒ２）
ｙｃ＝（ｒ１×ｙａ＋ｒ２×ｙｂ）／（ｒ１＋ｒ２）
として算出される。すなわち、特徴用語「九州」は、重要用語「韓国」および「焼酎」それぞれの中間にあり、かつ、類似度が高い重要用語の方に引き寄せられる。このように、重要用語とその評価値（座標）、重要用語と特徴用語の類似度に基づいて、特徴用語の評価空間における座標が定められる。 Assume that the similarity between the characteristic term "Kyushu" and the important term "Korea" is r1, and the similarity between the characteristic term "Kyushu" and the important term "konaka" is r2. In this case, the coordinates (xc, yc) of the characteristic term "Kyushu" are
xc=(r1*xa+r2*xb)/(r1+r2)
yc=(r1×ya+r2×yb)/(r1+r2)
calculated as In other words, the feature term "Kyushu" is located between the key terms "Korea" and "shochu" and is attracted to key terms with a high degree of similarity. In this way, the coordinates in the feature term evaluation space are determined based on the key term and its evaluation value (coordinates) and the similarity between the key term and the feature term.

重要用語が３以上のときも同様であり、特徴用語「九州」の座標は、複数の重要用語（「韓国」「焼酎」「下関」「ゴルフ」「英国」「ロック」）それぞれに対する類似度に基づいて定められる。図９によれば特徴用語「九州」は、重要用語「焼酎」「下関」「韓国」などとは類似しているが、重要用語「英国」「ロック」とはあまり類似していないことがわかる。
特徴用語評価部１３４は、特徴用語「温泉」および「デジタルアート」についても同様の方法により座標計算する。 The same is true when there are three or more important terms, and the coordinates of the characteristic term “Kyushu” are calculated based on the similarity to each of the plurality of important terms (“Korea”, “Shochu”, “Shimonoseki”, “Golf”, “British”, “Rock”). determined based on According to FIG. 9, the feature term "Kyushu" is similar to key terms such as "shochu", "shimonoseki" and "korea", but is not very similar to key terms such as "UK" and "rock". .
The feature term evaluation unit 134 also performs coordinate calculation for the feature terms “hot spring” and “digital art” in a similar manner.

適合判定部１３６は、次に、文書情報Ｄ１の評価空間における座標（以下、「文書座標」とよぶ）を計算する。本実施形態においては、適合判定部１３６は３つの特徴用語「九州」「温泉」「デジタルアート」の座標の重心を文書情報Ｄ１の文書座標（ｘ１、ｙ１）とする。文書情報Ｄ１の文書座標は、特徴用語の座標に基づいて決定されればよく、特徴用語の座標の重心に限る必要はない。たとえば、特異度が比較的小さな特徴用語の座標よりも比較的大きな特徴用語の座標に近づくように文書座標を調整してもよい。 The conformity determination unit 136 then calculates the coordinates of the document information D1 in the evaluation space (hereinafter referred to as "document coordinates"). In this embodiment, the conformity determination unit 136 sets the center of gravity of the coordinates of the three characteristic terms "Kyushu", "hot spring" and "digital art" to the document coordinates (x1, y1) of the document information D1. The document coordinates of the document information D1 may be determined based on the coordinates of the characteristic term, and need not be limited to the center of gravity of the coordinates of the characteristic term. For example, the document coordinates may be adjusted to approximate the coordinates of feature terms with relatively high specificity more than the coordinates of feature terms with relatively low specificity.

以上の処理結果によれば文書情報Ｄ１は、ｘ１（好き）＞０、ｙ１（興味）＜０となっているため、「好きかもしれないけれども興味がなさそうな情報」であると推測される。一方、別の文書情報Ｄ２の座標（ｘ２，ｙ２）は、ｘ２（好き）＞０、ｙ２（興味）＞０となっていれば、文書情報Ｄ２は「好きな内容であり、興味を持ってもらえそうな情報」であるという推測が成り立つ。この場合、文書情報Ｄ１よりも文書情報Ｄ２の方が「適合性」が高い。 According to the above processing results, document information D1 satisfies x1 (favorite)>0 and y1 (interest)<0, so it is presumed to be "information that may be liked but is not likely to be of interest". . On the other hand, if the coordinates (x2, y2) of another document information D2 satisfy x2 (like)>0 and y2 (interest)>0, then the document information D2 is "favorite content and interested Information that is likely to be obtained" is conjectured. In this case, the document information D2 has a higher "suitability" than the document information D1.

適合判定部１３６は、ある文書情報の文書座標（ｘｄ、ｙｄ）において「ｘｄが閾値ＴＸ以上かつｙｄも閾値ＴＹ以上」という条件を満たすとき、この文書情報はユーザの嗜好に適合していると判定する。ここでいう閾値ＴＸ、ＴＹは任意に設定可能であり、ユーザからの設定に基づいて適合判定部１３６はこれらの閾値を変更してもよい。 When the document coordinate (xd, yd) of certain document information satisfies the condition that "xd is equal to or greater than the threshold value TX and yd is also equal to or greater than the threshold value TY", the matching determination unit 136 determines that the document information is compatible with the user's preference. judge. The thresholds TX and TY referred to here can be arbitrarily set, and the conformity determination unit 136 may change these thresholds based on the settings from the user.

図１０は、文書分析過程および適合判定過程のフローチャートである。
図１０において文書分析過程はＳ２０からＳ２６までが対応し、適合判定過程はＳ２８およびＳ３０が対応する。文書取得部１４２は、定期的にインターネットを巡回（クローリング）し、文書情報を収集する。図１０に示す処理は文書情報が取得されるごとに実行される。 FIG. 10 is a flow chart of the document analysis process and match determination process.
In FIG. 10, S20 to S26 correspond to the document analysis process, and S28 and S30 correspond to the match determination process. The document acquisition unit 142 regularly crawls the Internet to collect document information. The processing shown in FIG. 10 is executed each time document information is acquired.

（文書分析過程）
特徴用語評価部１３４は、新たに取得された文書情報から１以上の特徴用語を抽出する（Ｓ２０）。次に、特徴用語評価部１３４は、すべての特徴用語とすべての重要用語の類似度を計算する（Ｓ２２）。特徴用語評価部１３４は、重要用語の座標値（評価値）、重要用語と特徴用語の類似度に基づいて、特徴用語の評価空間における座標値を計算する（Ｓ２４）。いいかえれば、特徴用語評価部１３４は重要用語の評価値に基づいて特徴用語を評価する。適合判定部１３６は、特徴用語の座標値に基づいて文書座標を計算する（Ｓ２６）。まとめると、適合判定部１３６は特徴用語の評価に基づいて文書情報を評価する。特徴用語の評価は、重要用語の評価に基づく。そして、重要用語の選択とその評価はユーザの嗜好を反映している。 (Document analysis process)
The feature term evaluation unit 134 extracts one or more feature terms from the newly acquired document information (S20). Next, the feature term evaluation unit 134 calculates similarities between all feature terms and all key terms (S22). The feature term evaluation unit 134 calculates the coordinate values of the feature terms in the evaluation space based on the coordinate values (evaluation values) of the key terms and the degree of similarity between the key terms and the feature terms (S24). In other words, the characteristic term evaluation unit 134 evaluates the characteristic term based on the evaluation value of the important term. The matching determination unit 136 calculates document coordinates based on the coordinate values of the feature terms (S26). In summary, the match determination unit 136 evaluates document information based on the evaluation of feature terms. The evaluation of feature terms is based on the evaluation of key terms. The selection of important terms and their evaluation reflect the user's preferences.

（適合判定過程）
適合判定部１３６は、文書座標に基づいて、この文書情報がユーザの嗜好に適合するか否かを判定する（Ｓ２８）。適合するときには（Ｓ２８のＹ）、適合判定部１３６は出力部１２０に指示してこの文書情報を画面表示させる（Ｓ３０）。適合しないときにはＳ３０の処理はスキップされる。適合するときにはユーザ端末１００のデータ格納部１１６に文書情報を保存し、適合しないときには文書情報を破棄するとしてもよい。 (Compliance judgment process)
Based on the document coordinates, the suitability determination unit 136 determines whether or not the document information matches the user's preferences (S28). When it matches (Y of S28), the match determining section 136 instructs the output section 120 to display this document information on the screen (S30). If not, the process of S30 is skipped. The document information may be stored in the data storage unit 116 of the user terminal 100 if it matches, and the document information may be discarded if it does not match.

以上、実施形態に基づいてユーザ端末１００（嗜好分析装置）のコンシェルジュ機能について説明した。
本実施形態によれば、ユーザは執事１５２との対話を通して、自分の心の中に溜め込んでいる「思考の素」を言語化しやすくなる。ユーザが自由な連想によってたくさんの用語を入力するとき、関連設定部１２６は用語のつながりを示す用語関連情報を充実化させる。重要用語特定部１２８は、この用語関連情報から、ユーザの嗜好を特徴づける重要用語を見つけ出す。執事１５２は、ユーザに対して「あなたの心の中にはこんな言葉が溢れているんだよ」として、重要用語を提示してもよい。このとき、ユーザは執事１５２が「自分発見」を手伝ってくれるかのような感覚を味わうことができる。 The concierge function of the user terminal 100 (preference analysis device) has been described above based on the embodiment.
According to the present embodiment, the user can easily verbalize the "source of thoughts" stored in his or her mind through dialogue with the butler 152 . When the user inputs many terms by free association, the association setting unit 126 enriches the term association information indicating the connection of the terms. The important term identifying unit 128 finds out important terms that characterize the user's preferences from this term-related information. The butler 152 may present important terms to the user as "Your mind is overflowing with such words." At this time, the user can feel as if the butler 152 is helping him "discover himself."

適合判定部１３６は、文書情報を特徴づける特徴用語とユーザの嗜好を特徴づける重要用語に基づいて、ユーザが意識的または無意識的に求めている情報が文書情報に含まれているか否かを判定する。このようなコンシェルジュ機能により、ユーザの嗜好に合った文書情報を合理的に選別できる。 Based on characteristic terms that characterize the document information and key terms that characterize the user's preferences, the conformity determination unit 136 determines whether or not the information that the user is consciously or unconsciously seeking is included in the document information. do. With such a concierge function, it is possible to rationally select document information that matches the user's taste.

図１（ｂ）に関連して説明したように、コンシェルジュ機能はユーザ端末１００において実装されるため、重要用語などのプライバシーに深く関わる情報を外部に提供する必要がない。このため、ユーザは安心して気軽に執事１５２と対話しやすくなる。ユーザのコンシェルジュ機能（執事１５２）に対する安心感は、ユーザの本音に近い重要用語を引き出す上でも重要である。ユーザは執事１５２と気軽に、遊び感覚で対話をしながら、執事１５２が自分を深く理解していく過程を楽しむことができる。 As described with reference to FIG. 1(b), the concierge function is implemented in the user terminal 100, so there is no need to externally provide information that is deeply related to privacy, such as important terms. Therefore, the user can easily interact with the butler 152 with peace of mind. The user's sense of security with respect to the concierge function (butler 152) is also important for eliciting important terms close to the user's true intentions. The user can enjoy the process of the butler 152 gaining a deeper understanding of himself while casually interacting with the butler 152 in a playful way.

なお、本発明は上記実施形態や変形例に限定されるものではなく、要旨を逸脱しない範囲で構成要素を変形して具体化することができる。上記実施形態や変形例に開示されている複数の構成要素を適宜組み合わせることにより種々の発明を形成してもよい。また、上記実施形態や変形例に示される全構成要素からいくつかの構成要素を削除してもよい。 It should be noted that the present invention is not limited to the above-described embodiments and modifications, and can be embodied by modifying constituent elements without departing from the scope of the invention. Various inventions may be formed by appropriately combining a plurality of constituent elements disclosed in the above embodiments and modifications. Also, some constituent elements may be deleted from all the constituent elements shown in the above embodiments and modifications.

［変形例］
上記したほかにもさまざまな変形例が考えられる。 [Modification]
Various modifications other than those described above are conceivable.

本実施形態においては、基準用語から派生用語を連想したとき、基準用語と派生用語に関連性（リンク）を設定するとして説明した。関連性は、このような連想に限る必要はなく、「基準用語に基づいて派生用語が想起される関係」でさえあればよい。たとえば、基準用語に対して反対、類似、下位概念、上位概念、例示などの関係を有する用語を派生用語としてもよい。 In the present embodiment, it has been described that when a derived term is associated with a standard term, a relationship (link) is set between the standard term and the derived term. Relevance is not limited to such an association, and may be "a relationship in which a derived term is recollected based on a reference term". For example, terms having opposite, similar, subordinate concept, superordinate concept, exemplified, etc. relationships to the reference term may be derived terms.

本実施形態における「用語」は「九州」「ロック」などの単語を想定して説明した。処理対象となる「用語」は、単語に限らず、複合語あるいは文であってもよい。 The "terms" in this embodiment have been described assuming words such as "Kyushu" and "rock". The "terms" to be processed are not limited to words, and may be compound words or sentences.

本実施形態においては、重要用語特定部１２８はある用語と他の用語の関連性の多さに基づいて重要用語を特定するとして説明した。このほかにも、重要用語特定部１２８は用語関連情報において所定回数以上出現する用語を重要用語として特定してもよいし、用語１から用語２が派生し、用語２からも用語１が派生したとき、すなわち、双方向の連想が成り立つときに用語１，２をいずれも重要用語として特定してもよい。このように、関連性（リンク）の多さに限らず、他の用語との関連、用語の出現頻度など、重要用語特定部１２８は用語関連情報における用語のさまざまな出現態様に基づいて重要用語を特定すればよい。 In the present embodiment, the important term identification unit 128 has been described as identifying important terms based on the degree of relevance between a certain term and other terms. In addition, the important term identification unit 128 may identify a term that appears more than a predetermined number of times in the term-related information as an important term, and term 2 is derived from term 1, and term 1 is derived from term 2. Both terms 1 and 2 may be identified as important terms when, ie, when two-way association is established. In this way, the important term identifying unit 128 identifies important terms based on various modes of occurrence of terms in the term-related information, such as relationships with other terms, frequency of occurrence of terms, and the like, as well as the number of relationships (links). should be specified.

執事１５２は、さまざまな質問をユーザに投げかけることで重要用語を引き出してもよい。たとえば、執事１５２は「好きな花はなんですか？」「１００億円あったら何をしたい？」「おすすめのアーティストって誰？」など多様な質問を用意しておき、いろいろな角度から多様な基準用語を引き出してもよい。 The butler 152 may elicit key terms by posing various questions to the user. For example, butler 152 prepares various questions such as "What is your favorite flower?", "What would you do if you had 10 billion yen?" Reference terms may be derived.

本実施形態においては、ユーザは執事１５２との対話に答えるかたちで用語を入力するとして説明した。このほかにもユーザがＳＮＳ（Social Networking Service）において投稿したテキストからユーザの嗜好を探ってもよい。たとえば、ユーザが「九州でトライアスロンをしたよ」という投稿をしたときには、用語取得部１２２は「九州」「トライアスロン」を抽出する。関連設定部１２６は同一文に出現した「九州」と「トライアスロン」を関連づけて用語関連情報に登録する。また、「トライアスロンのあとで飲み会になった」という投稿があれば、「トライアスロン」と「飲み会」が関連付けられる。このように、関連設定部１２６はユーザのＳＮＳにおける投稿記事に基づいて用語関連情報を更新してもよい。関連設定部１２６は、執事１５２との対話と投稿記事の双方に基づいて用語関連情報を更新してもよい。 In this embodiment, it has been described that the user inputs terms by responding to dialogue with the butler 152 . In addition, the user's preferences may be searched for from text posted by the user on SNS (Social Networking Service). For example, when the user posts "I did a triathlon in Kyushu", the term acquiring unit 122 extracts "Kyushu" and "triathlon". The association setting unit 126 associates "Kyushu" and "triathlon" appearing in the same sentence and registers them in the term association information. Also, if there is a post saying "I had a drinking party after the triathlon", the "triathlon" and the "drinking party" are associated. In this way, the association setting unit 126 may update the term-related information based on the article posted by the user on the SNS. The association setting unit 126 may update the term association information based on both the dialogue with the butler 152 and the posted article.

本実施形態においてはコンシェルジュ機能をユーザ端末１００に搭載するエッジ型を想定して説明した。変形例として、コンシェルジュ機能をサーバ１０２に搭載するサーバ型においても、ユーザの嗜好分析は可能である。この場合には、ユーザはユーザ端末１００を介してサーバ１０２が提供するコンシェルジュ機能を対話し、サーバ１０２のコンシェルジュ機能はユーザが提供する用語から重要用語を分析すればよい。 In the present embodiment, the description has been given assuming an edge type in which the user terminal 100 is equipped with the concierge function. As a modification, a server type in which the server 102 is equipped with a concierge function can also analyze user preferences. In this case, the user may interact with the concierge function provided by the server 102 via the user terminal 100, and the concierge function of the server 102 may analyze important terms from the terms provided by the user.

本実施形態においては、ユーザは重要用語に対して評価値を数値入力するとして説明した。変形例として、重要用語評価部１３０はＡＨＰ（Analytic Hierarchy Process）などの既知の意思決定モデルに基づいて各重要用語の評価値を算出してもよい。このように、ユーザは評価値を直接入力してもよいし、ユーザからの入力に基づいて意思決定モデルが評価値を算出してもよい。 In the present embodiment, it has been described that the user numerically inputs evaluation values for important terms. As a modification, the important term evaluation unit 130 may calculate the evaluation value of each important term based on a known decision-making model such as AHP (Analytic Hierarchy Process). Thus, the user may directly input the evaluation value, or the decision-making model may calculate the evaluation value based on the input from the user.

ＳＮＳの投稿記事から、重要用語評価部１３０は重要用語の評価値を算出してもよい。たとえば、「九州に行きたいです」という投稿記事があれば、重要用語評価部１３０は「行きたい」という興味を示す用語に基づいて、重要用語「九州」の興味に対する評価値を上昇させてもよい。別の例として、「特に８０年代のロックには凄い名曲が多い」という投稿記事があれば、重要用語評価部１３０は「ロック」について「好き」「興味」に対する評価値を上昇させてもよい。「凄い」「名曲」などの肯定的用語は、「好き」「興味」を含意することをあらかじめ設定しておけばよい。また、「８０年代のロックばっかり聞いていたので、最近は違うのも聞こうかと思う」という投稿記事から、重要用語評価部１３０は「ロック」について「興味」の評価値を低下させてもよい。このように、投稿記事を自然言語解析することで、重要用語評価部１３０は投稿記事に含まれる重要用語の評価を適宜増減させてもよい。 The important term evaluation unit 130 may calculate the evaluation value of the important term from the article posted on the SNS. For example, if there is a posted article "I want to go to Kyushu", the important term evaluation unit 130 may increase the evaluation value for the interest of the important term "Kyushu" based on the term "I want to go" indicating interest. good. As another example, if there is a posted article stating that "especially 80's rock has many great songs", the important term evaluation unit 130 may increase the evaluation values for "like" and "interest" for "rock". . Affirmative terms such as "amazing" and "masterpiece" may be set in advance to imply "like" and "interest". Also, the important term evaluation unit 130 may reduce the evaluation value of "interest" for "rock" from the posted article "I listened to nothing but 80's rock, so I think I should listen to something different these days." By performing natural language analysis on the posted article in this way, the important term evaluation unit 130 may appropriately increase or decrease the evaluation of the important terms included in the posted article.

本実施形態においては、重要用語あるいは特徴用語の評価軸として好悪評価と興味評価の２軸を例示した。このほかの評価軸としては「身近である／身近ではない」「詳しい／詳しくない」「怖い／怖くない」「楽しい／退屈」などが考えられる。 In the present embodiment, two axes of favorable evaluation and interest evaluation are exemplified as evaluation axes of important terms or characteristic terms. Other evaluation criteria include "familiar/not familiar", "familiar/not detailed", "scary/not scary", and "fun/boring".

ＳＮＳに対する投稿記事に基づいて、特徴用語評価部１３４は重要用語の評価値を調整してもよい。たとえば、ユーザが「沖縄に行った」のように重要用語「沖縄」に対して経験を表す「行った」という用語が付与されているとき、特徴用語評価部１３４は重要用語「沖縄」の「興味」に関する評価値を減少させてもよい。このような制御方法によれば、ユーザが沖縄に興味をもっている場合でも、ユーザが実際に沖縄を体験したときには興味が下がるという心理傾向を評価空間において早期に反映させることができる。 The feature term evaluation unit 134 may adjust the evaluation value of the important term based on the article posted to the SNS. For example, when the user has given the term “was” representing an experience to the important term “Okinawa”, such as “I went to Okinawa”, the feature term evaluation unit 134 determines that the important term “Okinawa” The evaluation value for "interest" may be decreased. According to such a control method, even when the user is interested in Okinawa, it is possible to quickly reflect in the evaluation space the psychological tendency of the user to lose interest when the user actually experiences Okinawa.

ユーザは、重要用語に対して停止または削除を指示する入力をしてもよい。評価入力部１２４は、ユーザから重要用語「ロック」について「６ヶ月停止」の指示を受け付けたとする。この場合、重要用語特定部１２８は用語「ロック」は一時的に重要用語ではなくなる。このような設定により、ユーザは「ロック」に関連する文書情報の提供を一時的に抑制できる。たとえば、ロックに興味があるとしても、ロックに関連する文章情報を大量に提供されるとユーザはこのような文書情報に食傷感を覚える可能性がある。このような制御方法によれば、「ロックは好きだし興味はあるけれども、しばらくはロックに関する情報の提供は控えてほしい」という意思をユーザはコンシェルジュ機能に伝えることができる。 The user may input an instruction to stop or delete important terms. It is assumed that the evaluation input unit 124 has received from the user an instruction to “suspend for 6 months” with respect to the important term “lock”. In this case, the important term specifying unit 128 temporarily removes the term "lock" from being an important term. With such a setting, the user can temporarily suppress provision of document information related to "lock". For example, even if the user is interested in rock, if a large amount of textual information related to rock is provided, the user may feel sick to such textual information. According to such a control method, the user can inform the concierge function that "I like locks and am interested, but I would like you to refrain from providing information on locks for a while."

ユーザは、重要用語の評価値を一時的に変更してもよい。評価入力部１２４は、ユーザから重要用語「英国」について「６ヶ月間は「興味」の評価値をゼロにする」旨の設定を受け付けたとする。この場合、重要用語評価部１３０は重要用語「英国」の興味に関する評価値を６ヶ月間だけゼロに設定し、６ヶ月が経過したときに元の評価値に戻す。このような制御方法によってもユーザは一時的な興味の変化をコンシェルジュ機能に伝えることができる。 A user may temporarily change the evaluation value of an important term. It is assumed that the evaluation input unit 124 has received a setting from the user to the effect that "the evaluation value of 'interest' will be zero for 6 months" for the important term "UK". In this case, the important term evaluation unit 130 sets the interest-related evaluation value of the important term “UK” to zero for only six months, and restores the original evaluation value after six months. Such a control method also enables the user to inform the concierge function of temporary changes in interest.

関連設定部１２６は時間経過にともなって用語関連情報を更新してもよい。たとえば、第１の期間において取得された用語１，２，３のうち、続く第２の期間において用語１が再取得されなかったときには用語１を用語関連情報から削除してもよい。このとき、重要用語特定部１２８は、更新後の用語関連情報に基づいて重要用語を再特定する。このような処理方法によれば、ユーザは執事１５２との対話を繰り返しつつも、心を占めなくなった用語を用語関連情報から効果的に除外できる。 The association setting unit 126 may update the term association information over time. For example, of terms 1, 2, and 3 acquired in the first period, term 1 may be deleted from the term-related information when term 1 is not reacquired in the subsequent second period. At this time, the important term specifying unit 128 re-specifies the important term based on the updated term-related information. According to such a processing method, the user can effectively exclude terms that no longer occupy his mind from the term-related information while repeatedly interacting with the butler 152 .

１００ユーザ端末、１０２サーバ、１０４外部データベース、１０６内部データベース、１１０ユーザインタフェース処理部、１１２データ処理部、１１４通信部、１１６データ格納部、１１８入力部、１２０出力部、１２２用語取得部、１２４評価入力部、１２６関連設定部、１２８重要用語特定部、１３０重要用語評価部、１３２特徴用語抽出部、１３４特徴用語評価部、１３６適合判定部、１３８送信部、１４０受信部、１４２文書取得部、１５０対話画面、１５２執事、１５４用語入力領域、１５６返答ボタン 100 user terminal, 102 server, 104 external database, 106 internal database, 110 user interface processing unit, 112 data processing unit, 114 communication unit, 116 data storage unit, 118 input unit, 120 output unit, 122 term acquisition unit, 124 evaluation input unit, 126 relation setting unit, 128 important term identification unit, 130 important term evaluation unit, 132 feature term extraction unit, 134 feature term evaluation unit, 136 match determination unit, 138 transmission unit, 140 reception unit, 142 document acquisition unit, 150 dialogue screen, 152 butler, 154 term input area, 156 reply button

Claims

a term acquisition unit that receives an input of a derivative term recollected from the reference term from the user to be analyzed;
a relevance setting unit that updates term-related information indicating relevance of terms by associating a reference term with a derived term;
an important term identification unit that identifies important terms for the user,
The association setting unit updates the term-related information when a new derivative term is acquired after setting the derivative term as the next reference term,
The preference analysis device, wherein the important term identification unit identifies the important term from the plurality of terms based on the manner of appearance of each of the plurality of terms included in the term related information.

The important term identification unit determines whether or not the first term is an important term based on the number of relationships of the first term to be verified with other terms. The preference analyzer according to claim 1.

3. The preference analysis device according to claim 1, further comprising an important term evaluation unit that gives evaluation values to important terms.

a document acquisition unit that acquires document information;
a feature term extraction unit that extracts feature terms from the document information;
a feature term evaluation unit that calculates an evaluation value of the feature term based on the similarity between the key term for the user and the extracted feature term and an evaluation value of the key term;
4. The preference analysis apparatus according to claim 3, further comprising a compatibility determination unit that determines compatibility between the document information and the user based on evaluation values of the characteristic terms.

A function to receive input of derived terms recalled from the reference term from the user to be analyzed;
the ability to update term-related information that indicates the relevance of terms by associating base terms and derived terms;
A function of updating the term-related information when a new derived term is acquired with the derived term as the next reference term;
A preference analysis program for causing a computer to exhibit a function of identifying important terms for the user from the plurality of terms based on the appearance of each of the plurality of terms included in the term-related information.