JP7212599B2

JP7212599B2 - Information provision system

Info

Publication number: JP7212599B2
Application number: JP2019167336A
Authority: JP
Inventors: 知憲石原; 将之秦; 理恵角田
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2019-09-13
Filing date: 2019-09-13
Publication date: 2023-01-25
Anticipated expiration: 2039-09-13
Also published as: JP2021043878A

Description

本開示の一側面は情報提供システムに関する。 One aspect of the present disclosure relates to an information providing system.

コンピュータ上での処理のために用いられるキーワードをユーザに提案する手法が知られている。例えば、特許文献１には、放送内容に関連するキーワードを自動的にユーザに提供するキーワード提供システムが記載されている。このシステムは、所定のキーワード群から、放送内容に対応するキーワードと一致するキーワードを抽出し、その抽出されたキーワードを放送情報に関連するキーワードとしてユーザ端末装置に提供する。 Techniques for proposing keywords for use in computer processing to users are known. For example, Patent Literature 1 describes a keyword providing system that automatically provides users with keywords related to broadcast content. This system extracts keywords that match keywords corresponding to broadcast content from a predetermined keyword group, and provides the extracted keywords to user terminal devices as keywords related to broadcast information.

特開２００６－３１９４５６号公報JP 2006-319456 A

ユーザによって用いられる蓋然性が高いキーワードをより高い精度で該ユーザに提示することができれば、より便宜である。そこで、そのようなキーワードの提示をより高い精度で行う仕組みが望まれている。 It would be more convenient if keywords that are likely to be used by the user could be presented to the user with higher accuracy. Therefore, a mechanism for presenting such keywords with higher accuracy is desired.

本開示の一側面に係る情報提供システムは、少なくとも一つのプロセッサを備える。少なくとも一つのプロセッサは、対象ユーザを含む複数の閲覧者が第１情報源から提供される第１コンテンツにアクセスしたことを示す閲覧履歴を記憶する第１データベースを参照し、該閲覧履歴に基づいて少なくとも一つの第１キーワードを選択し、第２情報源から提供される第２コンテンツでのキーワードの出現頻度を示すメタ情報を記憶する第２データベースを参照し、該メタ情報に基づいて少なくとも一つの第２キーワードを選択し、少なくとも一つの第１キーワードのうちの少なくとも一つと、少なくとも一つの第２キーワードのうちの少なくとも一つとを含むキーワードリストを生成し、キーワードリストを対象ユーザの端末上に表示させる。 An information providing system according to one aspect of the present disclosure includes at least one processor. At least one processor refers to a first database that stores browsing histories indicating that a plurality of viewers, including the target user, have accessed first content provided from a first information source, and based on the browsing histories, selecting at least one first keyword, referring to a second database storing meta information indicating frequency of occurrence of the keyword in second content provided from a second information source, and based on the meta information, at least one selecting a second keyword, generating a keyword list including at least one of the at least one first keyword and at least one of the at least one second keyword, and displaying the keyword list on the terminal of the target user; Let

このような側面においては、第１情報源に関する閲覧履歴に基づいて選択された第１キーワードと、第２情報源での出現頻度に基づいて選択された第２キーワードとの双方を含むキーワードリストが対象ユーザの端末上に表示される。第１キーワードは対象ユーザの関心が高いと推定されるキーワードであるといえ、第２キーワードは人々の間で関心が高いと推定されるキーワードであるといえる。したがって、これら２種類のキーワードの混合を端末上に表示させることで、対象ユーザによって用いられる蓋然性が高いキーワードを精度良く提供することが可能になる。 In such an aspect, a keyword list including both a first keyword selected based on the browsing history regarding the first information source and a second keyword selected based on the appearance frequency in the second information source is provided. Displayed on the terminal of the target user. It can be said that the first keyword is a keyword that is presumed to be of high interest to the target user, and that the second keyword is a keyword that is presumed to be of high interest to people. Therefore, by displaying a mixture of these two types of keywords on the terminal, it is possible to accurately provide keywords that are likely to be used by the target user.

本開示の一側面によれば、ユーザによって用いられる蓋然性が高いキーワードを精度良く提供することができる。 According to one aspect of the present disclosure, it is possible to accurately provide keywords that are highly likely to be used by users.

実施形態に係る情報提供システムの利用の一例を示す図である。It is a figure which shows an example of utilization of the information provision system which concerns on embodiment. 実施形態に係る情報提供システムの構成の一例を示す図である。It is a figure showing an example of composition of an information service system concerning an embodiment. 実施形態に係る情報提供システムの動作の一例を示すフローチャートである。It is a flow chart which shows an example of operation of an information service system concerning an embodiment. 第１候補キーワードを抽出する処理の一例を示すフローチャートである。9 is a flow chart showing an example of processing for extracting a first candidate keyword; キーワード辞書の一例を示す図である。It is a figure which shows an example of a keyword dictionary. 第１候補キーワードを抽出するための中間レコードの一例を示す図である。It is a figure which shows an example of the intermediate record for extracting a 1st candidate keyword. 閲覧者のクラスタリングの一例を示す図である。FIG. 5 is a diagram showing an example of clustering of viewers; 第１候補キーワードを抽出するための中間レコードの一例を示す図である。It is a figure which shows an example of the intermediate record for extracting a 1st candidate keyword. 第２キーワードを選択する処理の一例を示すフローチャートである。10 is a flowchart showing an example of processing for selecting a second keyword; 実施形態に係る情報提供システムで用いられるコンピュータのハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions of the computer used with the information provision system which concerns on embodiment.

以下、添付図面を参照しながら本開示での実施形態を詳細に説明する。なお、図面の説明において同一または同等の要素には同一の符号を付し、重複する説明を省略する。 Hereinafter, embodiments of the present disclosure will be described in detail with reference to the accompanying drawings. In the description of the drawings, the same or equivalent elements are denoted by the same reference numerals, and overlapping descriptions are omitted.

実施形態に係る情報提供システム１は、コンピュータ上での処理のために用いられるキーワードを対象ユーザに提案するコンピュータシステムである。情報提供システム１は１以上のキーワードを対象ユーザのユーザ端末に送信することで該ユーザ端末上にそのキーワードを表示させ、これにより対象ユーザは提案されたキーワードを用いることができる。対象ユーザとは、キーワードを提供する宛先になるユーザのことをいう。 The information providing system 1 according to the embodiment is a computer system that proposes a keyword to be used for processing on a computer to a target user. By transmitting one or more keywords to the user terminal of the target user, the information providing system 1 displays the keywords on the user terminal, so that the target user can use the suggested keywords. A target user is a user to whom a keyword is provided.

キーワードの目的および利用方法は限定されない。一例では、情報提供システム１は情報の検索に用いられるキーワードを提案してもよい。図１は情報提供システム１の利用の一例を示す図である。この例では、ユーザ端末２０はニュースなどの様々な情報を表示するニュース・アプリケーション・プログラムを実行しており、複数の記事２０１と、複数のキーワードから成るキーワードリスト２０２とを表示している。対象ユーザはそのキーワードリストから好みのキーワードを選択および登録することで、そのキーワードに対応する記事を検索して該記事をユーザ端末２０上に表示させることができる。 The purpose and usage of keywords are not limited. In one example, the information providing system 1 may suggest keywords that are used to search for information. FIG. 1 is a diagram showing an example of use of the information providing system 1. As shown in FIG. In this example, the user terminal 20 is running a news application program that displays various information such as news, and displays a plurality of articles 201 and a keyword list 202 consisting of a plurality of keywords. By selecting and registering a favorite keyword from the keyword list, the target user can retrieve an article corresponding to the keyword and display the article on the user terminal 20 .

情報提供システム１は２種類の手法を用いてキーワードを選択し、それぞれの手法によって得られたキーワードの混合をキーワードリストとしてユーザ端末２０に提供する。第１の手法は、対象ユーザを含む複数の閲覧者が第１情報源から提供される第１コンテンツにアクセスしたことを示す閲覧履歴に基づいて少なくとも一つの第１キーワードを選択する手法である。第２の手法は、第１情報源とは異なる第２情報源から提供される第２コンテンツでのキーワードの出現頻度に基づいて少なくとも一つの第２キーワードを選択する手法である。図１に示すキーワードリスト２０２は、少なくとも一つの第１キーワードと少なくとも一つの第２キーワードとによって構成される。 The information providing system 1 selects keywords using two types of techniques, and provides the user terminal 20 with a mixture of keywords obtained by the respective techniques as a keyword list. A first method is to select at least one first keyword based on viewing histories indicating that a plurality of viewers, including the target user, have accessed first content provided from a first information source. A second method is to select at least one second keyword based on the appearance frequency of the keyword in the second content provided from a second information source different from the first information source. The keyword list 202 shown in FIG. 1 consists of at least one first keyword and at least one second keyword.

第１キーワードは対象ユーザが第１コンテンツを閲覧する傾向に基づいて提示されるキーワードであり、したがって、対象ユーザの関心が高いと推定されるキーワードであるといえる。一方、第２キーワードは世間での話題性に基づいて推定されるキーワードであり、したがって、他の人々と同様に対象ユーザが関心を持つ見込みが高いと推定されるキーワードであるといえる。このような２種類のキーワードをキーワードリストに含めることで、キーワードの選択の幅を効果的に拡げることができる。その結果、対象ユーザによって用いられる蓋然性が高いキーワードを精度良く提供することが可能になる。 The first keyword is a keyword presented based on the tendency of the target user to browse the first content, and therefore can be said to be a keyword that is presumed to be of high interest to the target user. On the other hand, the second keyword is a keyword that is estimated based on public topicality, and therefore can be said to be a keyword that is highly likely to be of interest to the target user as well as other people. By including these two types of keywords in the keyword list, it is possible to effectively expand the range of keyword selection. As a result, it is possible to accurately provide keywords that are highly likely to be used by the target user.

図２は情報提供システム１の構成の一例を示す図である。情報提供システム１は、キーワードを対象ユーザに提供するコンピュータであるサーバ１０を備える。サーバ１０は通信ネットワークを介してユーザ端末２０とデータ通信を実行することができる。図２ではユーザ端末２０を一つのみ示すが、サーバ１０とデータ通信するユーザ端末２０の個数は何ら限定されず、サーバ１０は複数のユーザ端末２０と通信接続してもよい。さらに、サーバ１０は通信ネットワークを介してデータベース群３０にアクセスすることができる。通信ネットワークの構成は限定されず、任意の方針で設計されてよい。例えば、それぞれの通信ネットワークは移動体通信網、インターネット、イントラネット、ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）のうちの少なくとも一つを含んで構成されてもよい。 FIG. 2 is a diagram showing an example of the configuration of the information providing system 1. As shown in FIG. The information providing system 1 includes a server 10 which is a computer that provides keywords to target users. The server 10 can perform data communication with the user terminal 20 via the communication network. Although only one user terminal 20 is shown in FIG. 2, the number of user terminals 20 that communicate with the server 10 is not limited, and the server 10 may be connected to a plurality of user terminals 20 for communication. Furthermore, the server 10 can access the database group 30 via a communication network. The configuration of the communication network is not limited and may be designed according to any policy. For example, each communication network may include at least one of a mobile communication network, the Internet, an intranet, and a WAN (Wide Area Network).

サーバ１０は機能要素として閲覧履歴解析部１１、受付部１２、第１選択部１３、第２選択部１４、リスト生成部１５、および送信部１６を備える。閲覧履歴解析部１１は、対象ユーザを含む複数の閲覧者が第１情報源から提供される第１コンテンツにアクセスしたことを示す閲覧履歴を解析して第１候補キーワードを抽出する機能要素である。第１候補キーワードとは、第１キーワードの候補となる語句のことをいう。受付部１２はキーワードを提供せよとの指示を受け付ける機能要素である。第１選択部１３は少なくとも一つの第１キーワードを選択する機能要素である。第２選択部１４は、第２情報源から提供される第２コンテンツでのキーワードの出現頻度に基づいて少なくとも一つの第２キーワードを選択する機能要素である。リスト生成部１５は少なくとも一つの第１キーワードと少なくとも一つの第２キーワードとを含むキーワードリストを生成する機能要素である。送信部１６はそのキーワードリストをユーザ端末２０に送信する機能要素であり、これによりキーワードがユーザ端末２０上に表示される。 The server 10 includes a viewing history analysis unit 11, a reception unit 12, a first selection unit 13, a second selection unit 14, a list generation unit 15, and a transmission unit 16 as functional elements. The viewing history analysis unit 11 is a functional element that analyzes viewing histories indicating that a plurality of viewers, including the target user, have accessed first content provided from a first information source, and extracts first candidate keywords. . A first candidate keyword is a phrase that is a candidate for the first keyword. The receiving unit 12 is a functional element that receives an instruction to provide a keyword. The first selection unit 13 is a functional element that selects at least one first keyword. The second selection unit 14 is a functional element that selects at least one second keyword based on the appearance frequency of keywords in the second content provided from the second information source. The list generator 15 is a functional element that generates a keyword list including at least one first keyword and at least one second keyword. The transmission unit 16 is a functional element that transmits the keyword list to the user terminal 20 , whereby the keywords are displayed on the user terminal 20 .

サーバ１０は少なくとも一つのコンピュータを用いて構成される。複数のコンピュータが用いられる場合には、これらのコンピュータが通信ネットワークを介して相互に接続することで、論理的に一つのサーバ１０が構築される。 The server 10 is configured using at least one computer. When a plurality of computers are used, one server 10 is logically constructed by connecting these computers to each other via a communication network.

ユーザ端末２０は、対象ユーザによって操作されるコンピュータである。ユーザ端末２０の種類は限定されない。例えば、ユーザ端末２０は、携帯電話機、高機能携帯電話機（スマートフォン）、タブレット端末、ウェアラブル端末（例えば、スマートウォッチ、ヘッドマウントディスプレイ（ＨＭＤ）など）、ラップトップなどの携帯端末でもよい。あるいは、ユーザ端末２０は据置型のパーソナルコンピュータでもよい。 The user terminal 20 is a computer operated by the target user. The type of user terminal 20 is not limited. For example, the user terminal 20 may be a portable terminal such as a mobile phone, a high-performance mobile phone (smartphone), a tablet terminal, a wearable terminal (for example, a smart watch, a head-mounted display (HMD), etc.), or a laptop. Alternatively, the user terminal 20 may be a stationary personal computer.

データベース群３０は、情報提供システム１において必要なデータを記憶するデータベースの集合である。本実施形態では、データベース群３０は閲覧履歴データベース３１、第１コンテンツデータベース３２、第１候補キーワードデータベース３３、メタ情報データベース３４、およびユーザデータベース３５を含む。 The database group 30 is a set of databases that store necessary data in the information providing system 1 . In this embodiment, the database group 30 includes a browsing history database 31 , a first content database 32 , a first candidate keyword database 33 , a meta information database 34 and a user database 35 .

閲覧履歴データベース３１は、対象ユーザを含む複数の閲覧者が第１情報源から提供される第１コンテンツにアクセスしたことを示す閲覧履歴を記憶する非一時的な記憶媒体または記憶装置である。一例では、閲覧履歴の個々のレコードはユーザＩＤ、コンテンツＩＤ、コンテンツ日時、および操作種別を含む。ユーザＩＤは、第１コンテンツにアクセスしたユーザ（すなわち閲覧者）を一意に特定する識別子である。コンテンツＩＤは個々の第１コンテンツを一意に特定する識別子である。コンテンツ日時は第１コンテンツが生成または公開された日時である。操作種別は、第１コンテンツに対するユーザ（閲覧者）の操作の種類を示す。例えば、操作種別は、ユーザが特定の第１コンテンツをユーザ端末２０上で見たことを示す「閲覧」、ユーザが特定の第１コンテンツ中に存在するリンクをクリックしたことを示す「クリック」などの様々な操作を示し得る。 The viewing history database 31 is a non-temporary storage medium or storage device that stores viewing histories indicating that a plurality of viewers including the target user have accessed the first content provided from the first information source. In one example, each record of viewing history includes a user ID, a content ID, a content date and time, and an operation type. A user ID is an identifier that uniquely identifies a user (that is, a viewer) who has accessed the first content. A content ID is an identifier that uniquely identifies each first content. The content date and time is the date and time when the first content was generated or published. The operation type indicates the type of user's (viewer's) operation on the first content. For example, the operation type may be "browse" indicating that the user has viewed a specific first content on the user terminal 20, or "click" indicating that the user has clicked on a link in a specific first content. can represent various manipulations of

第１情報源の種類は限定されず、これに対応して、第１コンテンツの種類も限定されない。例えば、第１情報源は通信ネットワークを介して任意の端末または装置に情報を提供する情報サービスまたは情報発信者でもよい。一例では、第１情報源は、キーワードリストを表示する機能を有するアプリケーション・プログラムのために第１コンテンツを提供する情報サービスまたは情報発信者でもよい。図１の例では、第１情報源は、ニュース・アプリケーション・プログラムのために第１コンテンツを提供する情報発信者または情報サービスでもよい。あるいは、第１情報源は、キーワードリストを表示する機能を有するアプリケーション・プログラムとは独立した別のアプリケーション・プログラムのために第１コンテンツを提供する情報発信者または情報サービスであってもよい。第１コンテンツは可視要素を含んで構成され、例えば、テキスト、画像（静止画または動画）、またはそれらの組合せを含んで構成される。第１コンテンツは個人または法人によって作成された記事であってもよく、例えば、新聞、雑誌、オンライン・ニュース、ブログ、ソーシャル・ネットワーキング・サービス（ＳＮＳ）などによって提供される記事でもよい。記事とは事柄を伝えるための文章のことをいう。記事は少なくとも文字列を含み、画像（静止画または動画）をさらに含んでもよい。 The type of the first information source is not limited, and correspondingly, the type of the first content is also not limited. For example, the first information source may be an information service or information originator that provides information to any terminal or device over a communication network. In one example, the first information source may be an information service or information originator that provides first content for an application program capable of displaying a keyword list. In the example of FIG. 1, the primary information source may be an information publisher or information service that provides primary content for a news application program. Alternatively, the first information source may be an information originator or information service that provides the first content for another application program independent of the application program having the function of displaying the keyword list. The first content includes visible elements, such as text, images (still or moving images), or a combination thereof. The first content may be an article created by an individual or a corporation, such as an article provided by newspapers, magazines, online news, blogs, social networking services (SNS), and the like. An article is a piece of writing that tells a story. An article includes at least a character string and may further include an image (still image or moving image).

閲覧履歴はアクセス管理システムによって生成されて閲覧履歴データベース３１に格納される。アクセス管理システムの構成は限定されない。例えば、情報提供システム１がアクセス管理システムとしての機能を有してもよい。あるいは、情報提供システム１とは異なるコンピュータシステムがアクセス管理システムとして機能してもよい。アクセス管理システムは、１以上のユーザ端末２０からの第１コンテンツへのアクセスを監視し、その監視結果に基づいて閲覧履歴のレコードを生成し、そのレコードを閲覧履歴データベース３１に格納する。個々の第１コンテンツに対する個々のアクセスが監視されることで、閲覧履歴データベース３１に閲覧履歴が蓄積される。 The browsing history is generated by the access management system and stored in the browsing history database 31 . The configuration of the access management system is not limited. For example, the information providing system 1 may have a function as an access management system. Alternatively, a computer system different from the information providing system 1 may function as the access management system. The access management system monitors access to the first content from one or more user terminals 20 , generates a browsing history record based on the monitoring results, and stores the record in the browsing history database 31 . A browsing history is accumulated in the browsing history database 31 by monitoring individual accesses to individual first contents.

第１コンテンツデータベース３２は第１コンテンツを記憶する非一時的な記憶媒体または記憶装置である。それぞれの第１コンテンツはコンテンツＩＤと関連付けられる。 The first content database 32 is a non-temporary storage medium or storage device that stores first content. Each first content is associated with a content ID.

第１候補キーワードデータベース３３は、閲覧履歴に基づいて抽出された第１候補キーワードに関する第１候補キーワード情報を記憶する非一時的な記憶媒体または記憶装置である。一例では、第１候補キーワード情報の個々のレコードはジャンルＩＤと、クラスタＩＤと、１以上の第１候補キーワードに関する特徴ベクトルとを含む。ジャンルＩＤは、コンテンツの分類または種別であるジャンルを一意に特定するための識別子である。クラスタＩＤは、ユーザが属するクラスタを一意に特定するための識別子である。個々のユーザのクラスタは、データの集合を複数のクラスタ（部分集合）に分類する処理であるクラスタリングによって決定される。クラスタリングの詳細は後述する。特徴ベクトルは、第１候補キーワードとスコアとの組合せによって表現される成分を１以上含んで構成される。特徴ベクトルの詳細も後述する。 The first candidate keyword database 33 is a non-temporary storage medium or storage device that stores first candidate keyword information related to first candidate keywords extracted based on browsing history. In one example, each record of first candidate keyword information includes a genre ID, a cluster ID, and a feature vector for one or more first candidate keywords. A genre ID is an identifier for uniquely identifying a genre, which is a classification or type of content. A cluster ID is an identifier for uniquely identifying a cluster to which a user belongs. Clusters of individual users are determined by clustering, which is the process of classifying a set of data into multiple clusters (subsets). The details of clustering will be described later. A feature vector includes one or more components expressed by a combination of a first candidate keyword and a score. Details of the feature vector will also be described later.

メタ情報データベース３４は、第２情報源から提供される第２コンテンツでのキーワードの出現頻度を示すメタ情報を永続的に記憶する非一時的な記憶媒体または記憶装置である。一例では、メタ情報の個々のレコードはキーワードおよび出現頻度を含む。キーワードは、第２コンテンツ上に現われた語句である。出現頻度は、所定の期間においてそのキーワードが１以上の第２コンテンツ上に現われた程度を示す指標である。 The meta-information database 34 is a non-temporary storage medium or storage device that permanently stores meta-information indicating the appearance frequency of keywords in the second content provided from the second information source. In one example, each record of meta information includes keywords and frequency of occurrence. A keyword is a phrase that appears on the second content. Appearance frequency is an index that indicates the extent to which the keyword appears in one or more second contents during a predetermined period.

第２情報源の種類は限定されず、これに対応して、第２コンテンツの種類も限定されない。ただし、第２情報源は第１情報源と異なる。例えば、第２情報源は通信ネットワークまたは放送ネットワークを介して任意の端末または装置に情報を提供することができる情報サービスまたは情報発信者でもよい。一例では、第２情報源は放送ネットワークまたは通信ネットワークを介してテレビ番組またはラジオ番組を放送する放送局でもよいし、インターネットを介して記事を提供する発信者でもよい。第２コンテンツは可視要素を含んで構成され、例えば、テキスト、画像（静止画または動画）、またはそれらの組合せを含んで構成される。第２コンテンツは個人または法人によって作成されたテレビ番組、ラジオ番組、または記事であってもよい。 The type of the second information source is not limited, and correspondingly, the type of the second content is also not limited. However, the second information source is different from the first information source. For example, the second information source may be an information service or information originator capable of providing information to any terminal or device via a communication network or broadcast network. In one example, the second source may be a broadcast station that broadcasts television or radio programs over a broadcast or communications network, or a publisher that provides articles over the Internet. The second content includes visible elements, such as text, images (still or moving images), or a combination thereof. The secondary content may be television programs, radio programs, or articles produced by individuals or corporations.

メタ情報はメタ情報管理システムによって生成されてメタ情報データベース３４に格納される。メタ情報管理システムの構成は限定されない。例えば、情報提供システム１がメタ情報管理システムとしての機能を有してもよい。あるいは、情報提供システム１とは異なるコンピュータシステムがメタ情報管理システムとして機能してもよい。メタ情報管理システムは、所定の期間内に放送または配信された第２コンテンツのそれぞれを解析することで、それぞれの第２コンテンツ中にテキスト、画像、または音声によって表された所定のキーワードを抽出する。そして、メタ情報管理システムは個々のキーワードの出現回数をカウントし、この集計結果に基づいてメタ情報のレコードを生成し、そのレコードをメタ情報データベース３４に格納する。個々の第２コンテンツが解析されることで、メタ情報データベース３４にメタ情報が蓄積される。出現頻度の設定方法は限定されない。例えば、或るキーワードが１以上の第２コンテンツ中に１回以上現われた場合に、メタ情報管理システムはその出現回数をそのまま出現頻度として設定してもよいし、そのキーワードが現われた第２コンテンツの個数を出現頻度として設定してもよい。 Meta information is generated by the meta information management system and stored in the meta information database 34 . The configuration of the meta information management system is not limited. For example, the information providing system 1 may have a function as a meta information management system. Alternatively, a computer system different from the information providing system 1 may function as the meta information management system. The meta-information management system analyzes each of the secondary contents broadcasted or distributed within a predetermined period of time to extract predetermined keywords represented by text, images, or sounds in each of the secondary contents. . Then, the meta-information management system counts the number of appearances of each keyword, generates a meta-information record based on the result of counting, and stores the record in the meta-information database 34 . Meta information is accumulated in the meta information database 34 by analyzing each second content. The setting method of appearance frequency is not limited. For example, when a certain keyword appears one or more times in one or more second contents, the meta information management system may set the number of times of appearance as the frequency of appearance, or the second contents in which the keyword appears. may be set as the appearance frequency.

ユーザデータベース３５は対象ユーザによって選択および登録された好みのキーワードを示すキーワード情報を永続的に記憶する非一時的な記憶媒体または記憶装置である。一例では、キーワード情報の個々のレコードは、ユーザＩＤ、キーワード、およびキーワード種別を含む。ユーザＩＤはキーワードを選択および登録した対象ユーザを一意に特定する識別子である。キーワードは、対象ユーザによって選択および登録されたキーワードのことをいい、例えば、図１に示すキーワードリスト２０２から選択されたキーワードであり得る。キーワード種別は、そのキーワードの由来を示すデータ項目であり、例えば、そのキーワードが第１キーワードおよび第２キーワードのうちのどちらであったかを示す。キーワード情報はキーワード管理システムによって生成されてユーザデータベース３５に格納される。キーワード管理システムの構成は限定されない。例えば、情報提供システム１がキーワード管理システムとしての機能を有してもよい。あるいは、情報提供システム１とは異なるコンピュータシステムがキーワード管理システムとして機能してもよい。キーワード管理システムは、ユーザ端末２０において選択されたキーワードと、そのキーワードに関連付けられたキーワード種別と、ユーザＩＤとをそのユーザ端末２０から受信する。そして、キーワード管理システムはこれらのデータ項目に基づいてキーワード情報のレコードを生成し、そのレコードをユーザデータベース３５に格納する。 The user database 35 is a non-temporary storage medium or storage device that permanently stores keyword information indicating preferred keywords selected and registered by the target user. In one example, each record of keyword information includes a user ID, a keyword, and a keyword type. A user ID is an identifier that uniquely identifies a target user who has selected and registered a keyword. A keyword refers to a keyword selected and registered by a target user, and can be, for example, a keyword selected from the keyword list 202 shown in FIG. The keyword type is a data item indicating the origin of the keyword, and indicates, for example, whether the keyword was the first keyword or the second keyword. Keyword information is generated by the keyword management system and stored in the user database 35 . The configuration of the keyword management system is not limited. For example, the information providing system 1 may have a function as a keyword management system. Alternatively, a computer system different from the information providing system 1 may function as the keyword management system. The keyword management system receives from the user terminal 20 the keyword selected by the user terminal 20, the keyword type associated with the keyword, and the user ID. Then, the keyword management system generates a keyword information record based on these data items and stores the record in the user database 35 .

個々のデータベースに格納される個々の情報のデータ構造は限定されず、任意の方針で設計されてよい。例えば、閲覧履歴、第１候補キーワード情報、メタ情報、およびキーワード情報の少なくとも一つが任意の方針で正規化または非正規化されて一または複数のデータテーブル上に記憶されてもよい。 The data structure of each piece of information stored in each database is not limited, and may be designed according to any policy. For example, at least one of browsing history, first candidate keyword information, meta information, and keyword information may be normalized or non-normalized according to an arbitrary policy and stored on one or more data tables.

図３は情報提供システム１の動作の一例を処理フローＳ１として示すフローチャートである。ステップＳ１１では、受付部１２がキーワードリストの要求を受け付ける。この要求は、キーワードリストの提供を指示するためのデータ信号である。要求は、キーワードリストの送信先であるユーザ端末２０に対応する対象ユーザのユーザＩＤを含む。要求の受付方法は限定されない。例えば、受付部１２はユーザ端末２０での所定の操作に基づいて該ユーザ端末２０から送信されてきた要求を受信してもよい。あるいは、受付部１２は情報提供システム１内の他の機能要素から入力された要求を取得してもよい。要求は少なくとも一つのジャンルＩＤを含んでもよく、この場合には、それぞれのジャンルＩＤに対応するキーワードによって構成されるキーワードリストがユーザ端末２０に送信される。 FIG. 3 is a flow chart showing an example of the operation of the information providing system 1 as a processing flow S1. In step S11, the receiving unit 12 receives a request for a keyword list. This request is a data signal for instructing provision of the keyword list. The request includes the user ID of the target user corresponding to the user terminal 20 to which the keyword list is sent. The request acceptance method is not limited. For example, the reception unit 12 may receive a request transmitted from the user terminal 20 based on a predetermined operation on the user terminal 20 . Alternatively, the reception unit 12 may acquire requests input from other functional elements within the information providing system 1 . The request may include at least one genre ID, in which case a keyword list consisting of keywords corresponding to each genre ID is sent to the user terminal 20 .

ステップＳ１２では、第１選択部１３がその要求に応答して少なくとも一つの第１キーワードを選択する。この選択のために、閲覧履歴解析部１１が予め、第１候補キーワードを抽出して第１候補キーワード情報を第１候補キーワードデータベース３３に格納する。図４は第１候補キーワードを抽出する処理の一例を処理フローＳ２として示すフローチャートである。一例では、閲覧履歴解析部１１は定期的な（例えば１時間毎の）バッチ処理によって処理フローＳ２を実行し、これにより、第１候補キーワード情報が最新の状態に更新される。 At step S12, the first selection unit 13 selects at least one first keyword in response to the request. For this selection, the browsing history analysis unit 11 extracts first candidate keywords in advance and stores first candidate keyword information in the first candidate keyword database 33 . FIG. 4 is a flow chart showing an example of processing for extracting the first candidate keyword as a processing flow S2. In one example, the browsing history analysis unit 11 executes the process flow S2 by periodic (for example, hourly) batch processing, thereby updating the first candidate keyword information to the latest state.

ステップＳ２１では、閲覧履歴解析部１１は閲覧履歴の個々のレコードについて第１候補キーワードの特徴ベクトルを算出する。閲覧履歴解析部１１は閲覧履歴のそれぞれのレコード、すなわち、閲覧履歴で示されるそれぞれのアクセスについて、第１コンテンツから１以上の第１候補キーワードを特定する。閲覧履歴解析部１１は閲覧履歴データベース３１から１レコードを読み出し、そのコンテンツＩＤに対応する第１コンテンツを第１コンテンツデータベース３２から読み出す。閲覧履歴解析部１１はその第１コンテンツのタイトルおよび本文のうちの少なくとも一方を解析することで１以上の第１候補キーワードを該第１コンテンツから特定する。閲覧履歴解析部１１は閲覧履歴の個々のレコードについてその処理を実行する。 In step S21, the browsing history analysis unit 11 calculates the feature vector of the first candidate keyword for each record of the browsing history. The browsing history analysis unit 11 identifies one or more first candidate keywords from the first content for each record of the browsing history, that is, for each access indicated by the browsing history. The viewing history analysis unit 11 reads one record from the viewing history database 31 and reads the first content corresponding to the content ID from the first content database 32 . The browsing history analysis unit 11 identifies one or more first candidate keywords from the first content by analyzing at least one of the title and text of the first content. The viewing history analysis unit 11 executes the processing for each record of the viewing history.

続いて、閲覧履歴解析部１１は閲覧履歴のそれぞれのレコード（閲覧履歴で示されるそれぞれのアクセス）について、１以上の第１候補キーワードのそれぞれの特徴量を算出し、第１候補キーワードおよび特徴量の１以上の組合せを含む特徴ベクトルを生成する。 Subsequently, the browsing history analysis unit 11 calculates the feature amount of each of the one or more first candidate keywords for each record of the browsing history (each access indicated in the browsing history), and calculates the first candidate keyword and the feature amount. generate a feature vector containing one or more combinations of

閲覧履歴解析部１１はこの処理のために、第１候補キーワードと、基準特徴量と、ジャンルＩＤとの関連付けを示すキーワード辞書を参照する。このキーワード辞書は予め用意されて情報提供システム１内の任意の記憶装置に記憶される。このキーワード辞書は少なくとも一部の第１候補キーワードについての表記ゆれ、類義語、または同義語をさらに示してもよい。図５は第１候補キーワードに関するキーワード辞書の一例を示す。この例では、キーワード辞書の個々のレコードは、第１候補キーワードである主語句と、主語句に対応する副語句と、個々の副語句の基準特徴量（図５において括弧書きで示される数値）と、ジャンルＩＤとを含む。一つの第１候補キーワードが複数のジャンルに関連付けられてもよい。 For this process, the browsing history analysis unit 11 refers to a keyword dictionary that indicates the association between the first candidate keyword, the reference feature amount, and the genre ID. This keyword dictionary is prepared in advance and stored in an arbitrary storage device within the information providing system 1 . The keyword dictionary may further indicate alternatives, synonyms, or synonyms for at least some of the first candidate keywords. FIG. 5 shows an example of a keyword dictionary for the first candidate keyword. In this example, each record in the keyword dictionary contains the subject phrase that is the first candidate keyword, the sub-phrase corresponding to the subject phrase, and the reference feature amount of each sub-phrase (numerical values shown in parentheses in FIG. 5). and a genre ID. One first candidate keyword may be associated with multiple genres.

閲覧履歴解析部１１はそのキーワード辞書を参照して、第１コンテンツのジャンルＩＤに対応する第１候補キーワードおよび基準特徴量を特定する。図５の例に関して、副語句「ＮａｔｉｏｎａｌＴｅａｍ」から第１候補キーワード「日本代表」が得られた場合には、閲覧履歴解析部１１はその第１候補キーワードの基準特徴量を０．５に設定する。閲覧履歴解析部１１は特定された基準特徴量に、第１コンテンツが生成されてからの経過時間と、第１コンテンツに対する操作種別とのうちの少なくとも一方に基づく重みを適用することで特徴量を算出してもよい。例えば、閲覧履歴解析部１１は基準特徴量に重みを乗ずることで特徴量を得てもよい。第１コンテンツが生成されてからの経過時間はコンテンツ日時に基づいて求めることができる。例えば、その重みは、経過時間が短いほど（すなわち、コンテンツ日時が新しいほど）特徴量が高くなるように設定されてもよい。あるいは、重みは、「閲覧」よりも「クリック」の方の特徴量が高くなるように設定されてもよい。 The browsing history analysis unit 11 refers to the keyword dictionary to specify the first candidate keyword and the reference feature amount corresponding to the genre ID of the first content. Regarding the example of FIG. 5, when the first candidate keyword "Japanese representative" is obtained from the subphrase "National Team", the browsing history analysis unit 11 sets the reference feature amount of the first candidate keyword to 0.5. do. The browsing history analysis unit 11 applies a weight based on at least one of the elapsed time after the generation of the first content and the type of operation on the first content to the specified reference feature value to determine the feature value. can be calculated. For example, the browsing history analysis unit 11 may obtain a feature amount by multiplying the reference feature amount by a weight. The elapsed time since the first content was generated can be obtained based on the date and time of the content. For example, the weight may be set such that the shorter the elapsed time (that is, the newer the date and time of the content), the higher the feature amount. Alternatively, the weight may be set so that the feature amount of "click" is higher than that of "view".

閲覧履歴解析部１１は閲覧履歴の個々のレコードについて第１候補キーワードの特徴ベクトルを算出し、その特徴ベクトルを示す第１中間レコードを生成する。個々の第１中間レコードはユーザＩＤ、コンテンツＩＤ、および特徴ベクトルを含む。図６は第１中間レコードの一例を示す。この例では説明の便宜のために、特定の閲覧者であるユーザＡに関する５個の第１中間レコード３０１のみを示すが、当然ながら、閲覧履歴解析部１１は個々の閲覧者の個々の閲覧履歴について第１候補キーワードの特徴ベクトルを算出し第１中間レコードを生成する。図６の例ではコンテンツＩＤ「Ｃ４０４２」が２レコード存在し、これは、ユーザＡがそのコンテンツＩＤで識別される第１コンテンツに２回アクセスしたことを意味する。 The browsing history analysis unit 11 calculates the feature vector of the first candidate keyword for each record of the browsing history, and generates a first intermediate record indicating the feature vector. Each first intermediate record contains a user ID, a content ID, and a feature vector. FIG. 6 shows an example of the first intermediate record. In this example, for convenience of explanation, only five first intermediate records 301 related to user A, who is a specific viewer, are shown. is calculated to generate a first intermediate record. In the example of FIG. 6, there are two records with the content ID "C4042", which means that the user A has accessed the first content identified by the content ID twice.

ステップＳ２２では、閲覧履歴解析部１１は閲覧者と第１コンテンツのジャンルとの組合せごとに特徴ベクトルを合算する。閲覧履歴解析部１１は或る一人の閲覧者について次のように処理する。すなわち、閲覧履歴解析部１１はその閲覧者の１以上の第１中間レコードのそれぞれについて第１コンテンツのジャンルＩＤを特定し、これにより、その閲覧者に対応する１以上のジャンルＩＤを特定する。そして、閲覧履歴解析部１１は特定された１以上のジャンルＩＤのそれぞれについて、該ジャンルＩＤの特徴ベクトルを合算し、その計算結果を示す第２中間レコードを生成する。特徴ベクトルの合算とは、具体的には、１以上の第１候補キーワードのそれぞれについて特徴量の和を求める処理のことをいう。閲覧履歴解析部１１は複数の閲覧者のそれぞれについて、このような一連の処理を実行する。 In step S22, the viewing history analysis unit 11 sums the feature vectors for each combination of the viewer and the genre of the first content. The browsing history analysis unit 11 processes a certain browsing person as follows. That is, the viewing history analysis unit 11 identifies the genre ID of the first content for each of the one or more first intermediate records of the viewer, thereby identifying one or more genre IDs corresponding to the viewer. Then, the browsing history analysis unit 11 sums up the feature vectors of the genre IDs for each of the specified one or more genre IDs, and generates a second intermediate record indicating the calculation result. Summation of feature vectors specifically refers to a process of obtaining the sum of feature amounts for each of the one or more first candidate keywords. The viewing history analysis unit 11 executes such a series of processes for each of a plurality of viewers.

図６は特徴ベクトルの合算により得られる第２中間レコードの一例も示す。この例では、コンテンツＩＤが「Ｃ４０４２」、「Ｃ４０４２」、「Ｃ４０５３」である３レコードがジャンルＩＤ「３」に対応し、コンテンツＩＤが「Ｃ４００１」、「Ｃ４１０１」である２レコードがジャンルＩＤ「１５」に対応するとする。閲覧履歴解析部１１はこれらの第１中間レコード３０１について特徴ベクトルの合算を実行することで２個の第２中間レコード３０２を生成する。ジャンルＩＤ「３」の第２中間レコード３０２について説明すると、閲覧履歴解析部１１は第１候補キーワード「サッカー」の特徴量を１＋１．５＝２．５と合算する。同様に、第１候補キーワード「日本代表」の特徴量は１＋１．５＋０．７＝３．２と合算され、第１候補キーワード「チームＲ」の特徴量は０．９＋１．４＝２．３と合算される。第１候補キーワード「サッカーＷ杯」は１レコードでしか現われていないので、この第１候補キーワードの合算値は０．７である。 FIG. 6 also shows an example of the second intermediate record obtained by summing the feature vectors. In this example, three records with content IDs "C4042", "C4042", and "C4053" correspond to genre ID "3", and two records with content IDs "C4001", "C4101" correspond to genre ID "C4042", "C4042", and "C4053". 15". The browsing history analysis unit 11 generates two second intermediate records 302 by adding feature vectors for these first intermediate records 301 . For the second intermediate record 302 with the genre ID "3", the browsing history analysis unit 11 sums the feature amount of the first candidate keyword "soccer" as 1+1.5=2.5. Similarly, the feature amount of the first candidate keyword "Japan national team" is 1+1.5+0.7=3.2, and the feature amount of the first candidate keyword "team R" is 0.9+1.4=2.3. added up. Since the first candidate keyword "Soccer World Cup" appears in only one record, the total value of this first candidate keyword is 0.7.

ステップＳ２３では、閲覧履歴解析部１１は特徴ベクトルに基づく閲覧者のクラスタリングをジャンル毎に実行する。閲覧履歴解析部１１がクラスタリングを実行することで複数の閲覧者が複数のクラスタに分類され、これにより、共通の特徴を有する１以上の閲覧者を含むクラスタが複数個生成される。クラスタリングの手法は限定されず、閲覧履歴解析部１１は１以上の任意の手法を用いて閲覧者をクラスタリングしてよい。例えば、閲覧履歴解析部１１はコサイン類似度およびＬｏｃａｌｉｔｙＳｅｎｓｉｔｉｖｅＨａｓｈｉｎｇ（ＬＳＨ）を用いて閲覧者をクラスタリングしてよい。より具体的には、閲覧履歴解析部１１はコサイン類似度を用いて１回目のクラスタリングを実行し、ＬＳＨを用いた２回目のクラスタリングを実行することでクラスタを再調整してもよい。 In step S23, the viewing history analysis unit 11 performs clustering of viewers based on feature vectors for each genre. A plurality of viewers are classified into a plurality of clusters by clustering performed by the viewing history analysis unit 11, thereby generating a plurality of clusters each including one or more viewers having common features. The clustering method is not limited, and the viewing history analysis unit 11 may cluster viewers using one or more arbitrary methods. For example, the viewing history analysis unit 11 may cluster viewers using cosine similarity and Locality Sensitive Hashing (LSH). More specifically, the browsing history analysis unit 11 may perform first clustering using cosine similarity, and may readjust clusters by performing second clustering using LSH.

図７は或る一つのジャンルにおける閲覧者のクラスタリングの一例を示す図である。この例では、ユーザＡ，Ｂがクラスタ４０１に分類され、ユーザＣがクラスタ４０２に分類され、ユーザＤ，Ｅがクラスタ４０３に分類されている。それぞれクラスタの中心に描かれた点は該クラスタの重心を示す。図７は個々のクラスタが存在する空間を便宜的に３次元座標で示すが、クラスタリングにおいて考慮される次元数は限定されず、任意に設定されてよい。 FIG. 7 is a diagram showing an example of viewer clustering in a certain genre. In this example, users A and B are classified into cluster 401 , user C is classified into cluster 402 , and users D and E are classified into cluster 403 . The point drawn at the center of each cluster indicates the centroid of the cluster. Although FIG. 7 shows the space in which individual clusters exist in three-dimensional coordinates for the sake of convenience, the number of dimensions considered in clustering is not limited and may be set arbitrarily.

個々のジャンルにおいて、閲覧履歴解析部１１は個々の閲覧者が属するクラスタを特定し、個々の第２中間レコードにクラスタＩＤを付加する。図８は、クラスタＩＤが付加された第２中間レコードの一例を示す図である。この例では、ジャンルＩＤ「３」についての５人のユーザＡ，Ｂ，Ｃ，Ｄ，Ｅの第２中間レコード３０３を示す。ユーザＡ，Ｂはクラスタ「１」に分類され、ユーザＣはクラスタ「２」に分類され、ユーザＤ，Ｅはクラスタ「３」に分類されている。 In each genre, the viewing history analysis unit 11 identifies the cluster to which each viewer belongs, and adds a cluster ID to each second intermediate record. FIG. 8 is a diagram showing an example of a second intermediate record to which a cluster ID is added. This example shows the second intermediate record 303 of five users A, B, C, D, and E for genre ID "3". Users A and B are classified into cluster "1", user C is classified into cluster "2", and users D and E are classified into cluster "3".

ステップＳ２４では、閲覧履歴解析部１１はジャンルおよびクラスタの組合せ毎に特徴ベクトルを合算して第１候補キーワード情報を生成する。閲覧履歴解析部１１はジャンルおよびクラスタの組合せのそれぞれについて、該組合せに対応する１以上の第２中間レコードの特徴ベクトルを合算し、この計算結果を示す第３中間レコードを生成する。この合算も、１以上の第１候補キーワードのそれぞれについて特徴量の和を求める処理である。第３中間レコードは第１候補キーワード情報のレコードに対応する。図８は、第２中間レコード３０３の特徴ベクトルを合算して第３中間レコード３０４を生成する処理をさらに示す。ジャンルＩＤ「３」およびクラスタＩＤ「１」の組合せについていうと、閲覧履歴解析部１１は第１候補キーワード「サッカー」の特徴量を２．５＋１．０＝３．５と合算し、第１候補キーワード「サッカーＷ杯」の特徴量を０．７＋１．０＝１．７と合算する。第１候補キーワード「日本代表」、「チームＲ」、「イングランド代表」はそれぞれ１レコードでしか現われていないので、これら３語の特徴量はそのまま第３中間レコード３０４に組み込まれる。閲覧履歴解析部１１はクラスタＩＤ「２」、「３」のそれぞれについても同様に第３中間レコード３０４を生成する。 In step S24, the viewing history analysis unit 11 generates first candidate keyword information by summing the feature vectors for each combination of genre and cluster. For each combination of genre and cluster, browsing history analysis unit 11 sums the feature vectors of one or more second intermediate records corresponding to the combination, and generates a third intermediate record indicating this calculation result. This addition is also a process of obtaining the sum of feature amounts for each of the one or more first candidate keywords. The third intermediate record corresponds to the record of the first candidate keyword information. FIG. 8 further illustrates the process of summing the feature vectors of the second intermediate record 303 to generate the third intermediate record 304 . Regarding the combination of the genre ID “3” and the cluster ID “1”, the browsing history analysis unit 11 sums the feature amount of the first candidate keyword “soccer” as 2.5+1.0=3.5, and obtains the first candidate The feature amount of the keyword “soccer world cup” is summed up as 0.7+1.0=1.7. Since the first candidate keywords "Japan national team", "Team R", and "England national team" appear only in one record each, the feature values of these three words are incorporated into the third intermediate record 304 as they are. The browsing history analysis unit 11 similarly generates the third intermediate records 304 for each of the cluster IDs "2" and "3".

ステップＳ２５では、閲覧履歴解析部１１は生成された１以上の第３中間レコードを第１候補キーワード情報のレコードとして第１候補キーワードデータベース３３に登録する。或るジャンルおよびクラスタの組合せについて、第１候補キーワードデータベース３３がその組合せに対応するレコードを既に記憶している場合には、閲覧履歴解析部１１はそのレコードを第３中間レコードで上書きすることで第１候補キーワード情報を更新する。 In step S25, the browsing history analysis unit 11 registers the generated one or more third intermediate records in the first candidate keyword database 33 as records of first candidate keyword information. If the first candidate keyword database 33 already stores a record corresponding to the combination of genre and cluster, the browsing history analysis unit 11 overwrites the record with the third intermediate record. Update the first candidate keyword information.

図３に戻り、ステップＳ１２では、第１選択部１３は処理フローＳ２によって予め用意された第１候補キーワード情報に基づいて少なくとも一つの第１キーワードを選択する。第１選択部１３は、要求に対応するジャンルと対象ユーザが属するクラスタとの組合せに対応する１以上の第１候補キーワードのうちの少なくとも一つを第１キーワードとして選択する。第１選択部１３は第１候補キーワードデータベース３３を参照して、その組合せに対応するレコードを読み出し、相対的に特徴量が高い第１候補キーワードを第１キーワードとして選択する。例えば、第１選択部１３は特徴量の降順に第１候補キーワードを並べた上で、先頭から１以上の第１候補キーワードを第１キーワードとして選択してもよい。このように、第１選択部１３は対象ユーザが属するクラスタに対応する１以上の特徴ベクトルに基づいて複数の第１候補キーワードから少なくとも一つの第１キーワードを選択する。具体的には、第１選択部１３は対象ユーザが属するジャンルおよびクラスタの組合せに対応する１以上の特徴ベクトルに基づいて１以上の第１キーワードを選択する。典型的には、第１選択部１３は要求に対応するジャンルと対象ユーザが属するクラスタとに対応する１以上の特徴ベクトルに基づいて１以上の第１キーワードを選択する。 Returning to FIG. 3, in step S12, the first selection unit 13 selects at least one first keyword based on the first candidate keyword information prepared in advance by the process flow S2. The first selection unit 13 selects at least one of the one or more first candidate keywords corresponding to the combination of the genre corresponding to the request and the cluster to which the target user belongs, as the first keyword. The first selection unit 13 refers to the first candidate keyword database 33, reads a record corresponding to the combination, and selects a first candidate keyword having a relatively high feature amount as the first keyword. For example, the first selection unit 13 may arrange the first candidate keywords in descending order of feature amount, and then select one or more first candidate keywords from the top as the first keyword. Thus, the first selection unit 13 selects at least one first keyword from a plurality of first candidate keywords based on one or more feature vectors corresponding to the cluster to which the target user belongs. Specifically, the first selection unit 13 selects one or more first keywords based on one or more feature vectors corresponding to a combination of genres and clusters to which the target user belongs. Typically, the first selection unit 13 selects one or more first keywords based on one or more feature vectors corresponding to the genre corresponding to the request and the cluster to which the target user belongs.

ステップＳ１３では、第２選択部１４が要求に応答して少なくとも一つの第２キーワードを選択する。ステップＳ１３の詳細を図９に示す。図９は第２キーワードを選択する処理の一例を示すフローチャートである。 In step S13, the second selection unit 14 selects at least one second keyword in response to the request. Details of step S13 are shown in FIG. FIG. 9 is a flowchart showing an example of processing for selecting a second keyword.

ステップＳ１３１では、第２選択部１４はメタ情報データベースを参照してキーワードを取得する。例えば、第２選択部１４は出現頻度が所定の基準よりも大きいキーワードを取得する。具体的には、第２選択部１４は所定数のキーワードを出現頻度の降順に取得してもよいし、出現頻度が所定の閾値以上であるキーワードを取得してもよい。１以上のジャンルが要求によって指定されている場合には、第２選択部１４はその指定されたジャンルに属するキーワードを取得する。 In step S131, the second selection unit 14 refers to the meta information database and acquires keywords. For example, the second selection unit 14 acquires keywords whose frequency of appearance is higher than a predetermined standard. Specifically, the second selection unit 14 may acquire a predetermined number of keywords in descending order of appearance frequency, or may acquire keywords whose appearance frequency is equal to or higher than a predetermined threshold. If one or more genres are specified by the request, the second selection unit 14 acquires keywords belonging to the specified genres.

ステップＳ１３２では、第２選択部１４は表記揺れを吸収することで第２候補キーワードを特定する。第２選択部１４はキーワード辞書を参照して、ステップＳ１３１で取得したキーワードのそれぞれについて主語句を特定し、その主語句を第２候補キーワードとして特定する。第２選択部１４は、キーワードが主語句であればそのキーワードをそのまま第２候補キーワードとして特定し、キーワードが副語句であればそのキーワードに対応する主語句を第２候補キーワードとして特定する。この処理の結果、複数のキーワードが一つの主語句に対応する場合には、第２選択部１４は該複数のキーワードの出現頻度を合算することで、第２候補キーワードの出現頻度を得る。この処理によって、第２選択部１４はそれぞれの第２候補キーワードの出現頻度を示す中間リストを得る。 In step S132, the second selection unit 14 identifies the second candidate keyword by absorbing spelling variations. The second selection unit 14 refers to the keyword dictionary, specifies the subject phrase for each of the keywords acquired in step S131, and specifies the subject phrase as the second candidate keyword. If the keyword is a subject phrase, the second selection unit 14 identifies the keyword as it is as the second candidate keyword, and if the keyword is a subphrase, identifies the subject phrase corresponding to the keyword as the second candidate keyword. As a result of this processing, when a plurality of keywords correspond to one subject phrase, the second selection unit 14 adds up the frequency of appearance of the plurality of keywords to obtain the frequency of appearance of the second candidate keyword. Through this process, the second selection unit 14 obtains an intermediate list indicating the appearance frequency of each second candidate keyword.

ステップＳ１３３では、第２選択部１４は特定の第２候補キーワードを排除する。排除の条件は限定されず、任意の方針で設定されてよい。例えば、第２選択部１４は、文字数が所定の閾値以上である第２候補キーワードを排除することで、文字数が長い第２候補キーワードが第２キーワードとして選択されることを防止してもよい。あるいは、第２選択部１４は予め定められたルールに基づいて、公序良俗の観点から相応しくない第２候補キーワードを排除してもよい。あるいは、第２選択部１４は第２候補キーワードに対応する第１コンテンツを検索して、該検索によって抽出される第１コンテンツの個数をヒット数として取得し、そのヒット数が所定の閾値未満である第２候補キーワードを排除してもよい。この処理によって、対応する第１コンテンツが少ない第２候補キーワードが第２キーワードとして選択されることを防止できる。言い換えると、第２選択部１４は、ヒット数が所定の閾値以上である第２候補キーワードを残す。第２選択部１４はこれら３種類の条件のうちの任意の２以上を用いて第２候補キーワードを排除してもよい。 In step S133, the second selection unit 14 excludes specific second candidate keywords. Conditions for exclusion are not limited and may be set by any policy. For example, the second selection unit 14 may prevent a second candidate keyword having a long number of characters from being selected as the second keyword by excluding second candidate keywords having a number of characters equal to or greater than a predetermined threshold. Alternatively, the second selection unit 14 may exclude second candidate keywords that are inappropriate from the viewpoint of public order and morals, based on predetermined rules. Alternatively, the second selection unit 14 searches for the first content corresponding to the second candidate keyword, acquires the number of the first content extracted by the search as the number of hits, and if the number of hits is less than a predetermined threshold, Certain second candidate keywords may be excluded. This processing can prevent a second candidate keyword with few corresponding first contents from being selected as a second keyword. In other words, the second selection unit 14 leaves the second candidate keywords whose number of hits is equal to or greater than the predetermined threshold. The second selection unit 14 may exclude the second candidate keyword using any two or more of these three types of conditions.

ステップＳ１３４では、第２選択部１４は残った第２候補キーワードを第２キーワードとして選択する。以上の処理により、第２選択部１４は１以上の第２キーワードを選択する。 In step S134, the second selection unit 14 selects the remaining second candidate keywords as second keywords. Through the above processing, the second selection unit 14 selects one or more second keywords.

図３に戻って、ステップＳ１４では、リスト生成部１５が少なくとも一つの第１キーワードと少なくとも一つの第２キーワードとを含むキーワードリストを生成する。一例では、リスト生成部１５は複数の第１キーワードから一部をランダムに選択し、複数の第２キーワードから一部をランダムに選択する。 Returning to FIG. 3, in step S14, the list generator 15 generates a keyword list including at least one first keyword and at least one second keyword. In one example, the list generator 15 randomly selects some of the plurality of first keywords and randomly selects some of the plurality of second keywords.

別の例では、リスト生成部１５はユーザデータベース３５にアクセスして、対象ユーザに対応するキーワード情報を読み出すことで、第２キーワードとして対象ユーザによって提示され且つ該対象ユーザによって選択されたキーワードの個数を特定する。そして、リスト生成部１５はその個数が所定の閾値未満である場合には、キーワードリストにおける第２キーワードの割合をＰａに設定し、その個数が該閾値以上である場合にはその割合をＰｂ（ただし、Ｐｂ＜Ｐａ）に設定する。これは、自身が関心を持つキーワードを多く選択する傾向がある対象ユーザに、他の人々が感心を持つキーワードをより多く提示することを意図する。Ｐａ＋Ｐｂ＝１００（％）になるように割合Ｐａ，Ｐｂが設定されてもよい。 In another example, the list generating unit 15 accesses the user database 35 and reads out keyword information corresponding to the target user, thereby determining the number of keywords presented by the target user as the second keywords and selected by the target user. identify. Then, the list generation unit 15 sets the ratio of the second keywords in the keyword list to Pa when the number is less than the predetermined threshold, and sets the ratio to Pb ( However, it is set to Pb<Pa). This is intended to present more keywords that other people are interested in to target users who tend to select more keywords that interest them. The ratios Pa and Pb may be set so that Pa+Pb=100(%).

あるいは、リスト生成部１５は、ユーザＩＤに対応する、ジャンルＩＤおよびクラスタＩＤの組合せの個数を特定する。そして、リスト生成部１５は、その個数が所定の閾値未満である場合には、キーワードリストにおける第２キーワードの割合をＰｃに設定し、その個数が該閾値以上である場合にはその割合をＰｄ（ただし、Ｐｄ＜Ｐｃ）に設定する。これは、自身の関心の範囲が狭い傾向がある対象ユーザに、様々な分野のキーワードをより多く提示することを意図する。Ｐｃ＋Ｐｄ＝１００（％）になるように割合Ｐｃ，Ｐｄが設定されてもよい。 Alternatively, the list generator 15 identifies the number of combinations of genre IDs and cluster IDs corresponding to user IDs. Then, the list generating unit 15 sets the ratio of the second keywords in the keyword list to Pc when the number is less than the predetermined threshold, and sets the ratio to Pd when the number is equal to or greater than the threshold. (However, Pd<Pc). This is intended to present more keywords in various fields to target users who tend to have narrower interests. The ratios Pc and Pd may be set so that Pc+Pd=100(%).

リスト生成部１５はキーワードリスト内で第１キーワードおよび第２キーワードをシャッフルしてもよい。同じ語句が第１キーワードおよび第２キーワードの双方から選択された場合には、リスト生成部１５はその語句を重複してキーワードリストに含めるのではなく、一つのみをキーワードリストに含める。一例では、リスト生成部１５は第１キーワードおよび第２キーワードのそれぞれに、対応するキーワード種別を関連付ける。 The list generator 15 may shuffle the first keyword and the second keyword within the keyword list. When the same word/phrase is selected from both the first keyword and the second keyword, the list generation unit 15 does not duplicate the word/phrase in the keyword list, but includes only one word/phrase in the keyword list. In one example, the list generator 15 associates each of the first keyword and the second keyword with the corresponding keyword type.

ステップＳ１５では、送信部１６がキーワードリストをユーザ端末２０に送信する。ユーザ端末２０はそのキーワードリストを受信および表示する。例えば、ユーザ端末２０は図１に示すキーワードリスト２０２を表示する。送信部１６は、第１情報源での第１コンテンツの検索のために用いられるキーワードを対象ユーザに提供するためにキーワードリストをユーザ端末２０上に表示させてもよい。 In step S<b>15 , the transmission unit 16 transmits the keyword list to the user terminal 20 . User terminal 20 receives and displays the keyword list. For example, the user terminal 20 displays the keyword list 202 shown in FIG. The transmitting unit 16 may cause the keyword list to be displayed on the user terminal 20 in order to provide the target user with keywords used for searching for the first content in the first information source.

コンテンツのジャンルは必須の構成要素ではなく、したがって、情報提供システム１はジャンルを考慮することなく、キーワードリストを生成および送信するための一連の処理を実行してもよい。 The genre of content is not an essential component, and therefore the information providing system 1 may perform a series of processes for generating and transmitting keyword lists without considering genre.

閲覧履歴解析部１１および第１候補キーワードデータベース３３は必須の構成要素ではなく、第１選択部１３が閲覧履歴解析部１１の役割も担ってもよい。この場合、第１選択部１３は要求に応答して閲覧履歴データベース３１および第１コンテンツデータベース３２を参照し、閲覧履歴に基づいて少なくとも一つの第１キーワードを選択する。すなわち、閲覧履歴に基づいて少なくとも一つの第１キーワードを選択する一連の処理は、上記実施形態のように定期的なバッチ処理を含んでもよいし、すべてリアルタイムに処理されてもよい。 The viewing history analysis unit 11 and the first candidate keyword database 33 are not essential components, and the first selection unit 13 may also play the role of the viewing history analysis unit 11 . In this case, the first selection unit 13 refers to the browsing history database 31 and the first content database 32 in response to the request, and selects at least one first keyword based on the browsing history. That is, a series of processes for selecting at least one first keyword based on browsing history may include periodic batch processing as in the above embodiment, or may be processed in real time.

上記実施形態の説明に用いたブロック図は、機能単位のブロックを示している。これらの機能ブロック（構成部）は、ハードウェア及びソフトウェアの少なくとも一方の任意の組み合わせによって実現される。また、各機能ブロックの実現方法は特に限定されない。すなわち、各機能ブロックは、物理的又は論理的に結合した１つの装置を用いて実現されてもよいし、物理的又は論理的に分離した２つ以上の装置を直接的又は間接的に（例えば、有線、無線などを用いて）接続し、これら複数の装置を用いて実現されてもよい。機能ブロックは、上記１つの装置又は上記複数の装置にソフトウェアを組み合わせて実現されてもよい。 The block diagrams used in the description of the above embodiments show blocks for each function. These functional blocks (components) are realized by any combination of at least one of hardware and software. Also, the method of implementing each functional block is not particularly limited. That is, each functional block may be implemented using one device that is physically or logically coupled, or directly or indirectly using two or more devices that are physically or logically separated (e.g. , wired, wireless, etc.) and may be implemented using these multiple devices. A functional block may be implemented by combining software in the one device or the plurality of devices.

機能には、判断、決定、判定、計算、算出、処理、導出、調査、探索、確認、受信、送信、出力、アクセス、解決、選択、選定、確立、比較、想定、期待、見做し、報知（broadcasting）、通知（notifying）、通信（communicating）、転送（forwarding）、構成（configuring）、再構成（reconfiguring）、割り当て（allocating、mapping）、割り振り（assigning）などがあるが、これらに限られない。たとえば、送信を機能させる機能ブロック（構成部）は、送信部（transmitting unit）または送信機（transmitter）と呼称される。いずれも、上述したとおり、実現方法は特に限定されない。 Functions include judging, determining, determining, calculating, calculating, processing, deriving, investigating, searching, checking, receiving, transmitting, outputting, accessing, resolving, selecting, choosing, establishing, comparing, assuming, expecting, assuming, Broadcasting, notifying, communicating, forwarding, configuring, reconfiguring, allocating, mapping, assigning, etc. can't For example, the functional block responsible for transmission is called a transmitting unit or transmitter. In either case, as described above, the implementation method is not particularly limited.

例えば、本開示の一実施の形態における情報提供システム１またはサーバ１０は、本開示の処理を行うコンピュータとして機能してもよい。図１０は、情報提供システム１またはサーバ１０として機能するコンピュータ１００のハードウェア構成の一例を示す図である。コンピュータ１００は、物理的には、プロセッサ１００１、メモリ１００２、ストレージ１００３、通信装置１００４、入力装置１００５、出力装置１００６、バス１００７などを含んでもよい。 For example, the information providing system 1 or the server 10 in one embodiment of the present disclosure may function as a computer that performs the processing of the present disclosure. FIG. 10 is a diagram showing an example of a hardware configuration of a computer 100 functioning as the information providing system 1 or server 10. As shown in FIG. Computer 100 may physically include processor 1001, memory 1002, storage 1003, communication device 1004, input device 1005, output device 1006, bus 1007, and the like.

なお、以下の説明では、「装置」という文言は、回路、デバイス、ユニットなどに読み替えることができる。サーバ１０のハードウェア構成は、図に示した各装置を１つ又は複数含むように構成されてもよいし、一部の装置を含まずに構成されてもよい。 Note that in the following description, the term "apparatus" can be read as a circuit, device, unit, or the like. The hardware configuration of the server 10 may be configured to include one or more of each device shown in the figure, or may be configured without some of the devices.

サーバ１０における各機能は、プロセッサ１００１、メモリ１００２などのハードウェア上に所定のソフトウェア（プログラム）を読み込ませることによって、プロセッサ１００１が演算を行い、通信装置１００４による通信を制御したり、メモリ１００２及びストレージ１００３におけるデータの読み出し及び書き込みの少なくとも一方を制御したりすることによって実現される。 Each function in the server 10 is performed by causing the processor 1001 to perform calculations, controlling communication by the communication device 1004, controlling communication by the communication device 1004, and controlling the communication by the memory 1002 and It is realized by controlling at least one of data reading and writing in the storage 1003 .

プロセッサ１００１は、例えば、オペレーティングシステムを動作させてコンピュータ全体を制御する。プロセッサ１００１は、周辺装置とのインターフェース、制御装置、演算装置、レジスタなどを含む中央処理装置（ＣＰＵ：Central Processing Unit）によって構成されてもよい。 The processor 1001, for example, operates an operating system to control the entire computer. The processor 1001 may be configured by a central processing unit (CPU) including an interface with peripheral devices, a control device, an arithmetic device, registers, and the like.

また、プロセッサ１００１は、プログラム（プログラムコード）、ソフトウェアモジュール、データなどを、ストレージ１００３及び通信装置１００４の少なくとも一方からメモリ１００２に読み出し、これらに従って各種の処理を実行する。プログラムとしては、上述の実施の形態において説明した動作の少なくとも一部をコンピュータに実行させるプログラムが用いられる。例えば、サーバ１０の各機能要素は、メモリ１００２に格納され、プロセッサ１００１において動作する制御プログラムによって実現されてもよい。上述の各種処理は、１つのプロセッサ１００１によって実行される旨を説明してきたが、２以上のプロセッサ１００１により同時又は逐次に実行されてもよい。プロセッサ１００１は、１以上のチップによって実装されてもよい。なお、プログラムは、電気通信回線を介してネットワークから送信されてもよい。 The processor 1001 also reads programs (program codes), software modules, data, etc. from at least one of the storage 1003 and the communication device 1004 to the memory 1002, and executes various processes according to these. As the program, a program that causes a computer to execute at least part of the operations described in the above embodiments is used. For example, each functional element of the server 10 may be stored in the memory 1002 and implemented by a control program running on the processor 1001 . Although it has been explained that the above-described various processes are executed by one processor 1001, they may be executed simultaneously or sequentially by two or more processors 1001. FIG. Processor 1001 may be implemented by one or more chips. Note that the program may be transmitted from a network via an electric communication line.

メモリ１００２は、コンピュータ読み取り可能な記録媒体であり、例えば、ＲＯＭ（Read Only Memory）、ＥＰＲＯＭ（Erasable Programmable ＲＯＭ）、ＥＥＰＲＯＭ（Electrically Erasable Programmable ＲＯＭ）、ＲＡＭ（Random Access Memory）などの少なくとも１つによって構成されてもよい。メモリ１００２は、レジスタ、キャッシュ、メインメモリ（主記憶装置）などと呼ばれてもよい。メモリ１００２は、本開示の一実施の形態に係る方法を実施するために実行可能なプログラム（プログラムコード）、ソフトウェアモジュールなどを保存することができる。 The memory 1002 is a computer-readable recording medium, and is composed of at least one of, for example, ROM (Read Only Memory), EPROM (Erasable Programmable ROM), EEPROM (Electrically Erasable Programmable ROM), and RAM (Random Access Memory). may be The memory 1002 may also be called a register, cache, main memory (main storage device), or the like. The memory 1002 can store executable programs (program code), software modules, etc. to perform a method according to an embodiment of the present disclosure.

ストレージ１００３は、コンピュータ読み取り可能な記録媒体であり、例えば、ＣＤ－ＲＯＭ（Compact Disc ＲＯＭ）などの光ディスク、ハードディスクドライブ、フレキシブルディスク、光磁気ディスク（例えば、コンパクトディスク、デジタル多用途ディスク、Ｂｌｕ－ｒａｙ（登録商標）ディスク）、スマートカード、フラッシュメモリ（例えば、カード、スティック、キードライブ）、フロッピー（登録商標）ディスク、磁気ストリップなどの少なくとも１つによって構成されてもよい。ストレージ１００３は、補助記憶装置と呼ばれてもよい。上述の記憶媒体は、例えば、メモリ１００２及びストレージ１００３の少なくとも一方を含むデータベース、サーバその他の適切な媒体であってもよい。 The storage 1003 is a computer-readable recording medium, for example, an optical disc such as a CD-ROM (Compact Disc ROM), a hard disk drive, a flexible disc, a magneto-optical disc (for example, a compact disc, a digital versatile disc, a Blu-ray disk), smart card, flash memory (eg, card, stick, key drive), floppy disk, magnetic strip, and/or the like. Storage 1003 may also be called an auxiliary storage device. The storage medium described above may be, for example, a database, server, or other suitable medium including at least one of memory 1002 and storage 1003 .

通信装置１００４は、有線ネットワーク及び無線ネットワークの少なくとも一方を介してコンピュータ間の通信を行うためのハードウェア（送受信デバイス）であり、例えばネットワークデバイス、ネットワークコントローラ、ネットワークカード、通信モジュールなどともいう。通信装置１００４は、例えば周波数分割複信（ＦＤＤ：Frequency Division Duplex）及び時分割複信（ＴＤＤ：Time Division Duplex）の少なくとも一方を実現するために、高周波スイッチ、デュプレクサ、フィルタ、周波数シンセサイザなどを含んで構成されてもよい。 The communication device 1004 is hardware (transmitting/receiving device) for communicating between computers via at least one of a wired network and a wireless network, and is also called a network device, a network controller, a network card, a communication module, or the like. The communication device 1004 includes a high-frequency switch, a duplexer, a filter, a frequency synthesizer, and the like, for example, in order to realize at least one of frequency division duplex (FDD) and time division duplex (TDD). may consist of

入力装置１００５は、外部からの入力を受け付ける入力デバイス（例えば、キーボード、マウス、マイクロフォン、スイッチ、ボタン、センサなど）である。出力装置１００６は、外部への出力を実施する出力デバイス（例えば、ディスプレイ、スピーカ、LEDランプなど）である。なお、入力装置１００５及び出力装置１００６は、一体となった構成（例えば、タッチパネル）であってもよい。 The input device 1005 is an input device (for example, keyboard, mouse, microphone, switch, button, sensor, etc.) that receives input from the outside. The output device 1006 is an output device (for example, display, speaker, LED lamp, etc.) that outputs to the outside. Note that the input device 1005 and the output device 1006 may be integrated (for example, a touch panel).

また、プロセッサ１００１、メモリ１００２などの各装置は、情報を通信するためのバス１００７によって接続される。バス１００７は、単一のバスを用いて構成されてもよいし、装置間ごとに異なるバスを用いて構成されてもよい。 Devices such as the processor 1001 and the memory 1002 are connected by a bus 1007 for communicating information. The bus 1007 may be configured using a single bus, or may be configured using different buses between devices.

また、コンピュータ１００は、マイクロプロセッサ、デジタル信号プロセッサ（ＤＳＰ：Digital Signal Processor）、ＡＳＩＣ（Application Specific Integrated Circuit）、ＰＬＤ（Programmable Logic Device）、ＦＰＧＡ（Field Programmable Gate Array）などのハードウェアを含んで構成されてもよく、当該ハードウェアにより、各機能ブロックの一部又は全てが実現されてもよい。例えば、プロセッサ１００１は、これらのハードウェアの少なくとも１つを用いて実装されてもよい。 Further, the computer 100 includes hardware such as a microprocessor, a digital signal processor (DSP), an ASIC (Application Specific Integrated Circuit), a PLD (Programmable Logic Device), and an FPGA (Field Programmable Gate Array). A part or all of each functional block may be implemented by the hardware. For example, processor 1001 may be implemented using at least one of these pieces of hardware.

以上説明したように、本開示の一側面に係る情報提供システムは、少なくとも一つのプロセッサを備える。少なくとも一つのプロセッサは、対象ユーザを含む複数の閲覧者が第１情報源から提供される第１コンテンツにアクセスしたことを示す閲覧履歴を記憶する第１データベースを参照し、該閲覧履歴に基づいて少なくとも一つの第１キーワードを選択し、第２情報源から提供される第２コンテンツでのキーワードの出現頻度を示すメタ情報を記憶する第２データベースを参照し、該メタ情報に基づいて少なくとも一つの第２キーワードを選択し、少なくとも一つの第１キーワードのうちの少なくとも一つと、少なくとも一つの第２キーワードのうちの少なくとも一つとを含むキーワードリストを生成し、キーワードリストを対象ユーザの端末上に表示させる。 As described above, the information providing system according to one aspect of the present disclosure includes at least one processor. At least one processor refers to a first database that stores browsing histories indicating that a plurality of viewers, including the target user, have accessed first content provided from a first information source, and based on the browsing histories, selecting at least one first keyword, referring to a second database storing meta information indicating frequency of occurrence of the keyword in second content provided from a second information source, and based on the meta information, at least one selecting a second keyword, generating a keyword list including at least one of the at least one first keyword and at least one of the at least one second keyword, and displaying the keyword list on the terminal of the target user; Let

他の側面に係る情報提供システムでは、少なくとも一つのプロセッサが、閲覧履歴で示されるそれぞれのアクセスについて第１コンテンツから１以上の第１候補キーワードを特定し、閲覧履歴で示されるそれぞれのアクセスについて、１以上の第１候補キーワードのそれぞれについて特徴量を算出し、第１候補キーワードおよび特徴量の１以上の組合せを含む特徴ベクトルを生成し、それぞれの特徴ベクトルに基づいて複数の閲覧者をクラスタリングすることで複数のクラスタを生成し、対象ユーザが属するクラスタに対応する１以上の特徴ベクトルに基づいて、複数の第１候補キーワードから少なくとも一つの第１キーワードを選択してもよい。第１候補キーワードの特徴量を示す特徴ベクトルに基づいて閲覧者をクラスタリングし、対象ユーザが属するクラスタに対応する特徴ベクトルに基づいて第１キーワードを選択することで、対象ユーザの関心が高いと推定されるキーワードを高い精度で提供することができる。 In the information providing system according to another aspect, at least one processor identifies one or more first candidate keywords from the first content for each access indicated by the browsing history, and for each access indicated by the browsing history, A feature amount is calculated for each of the one or more first candidate keywords, a feature vector including one or more combinations of the first candidate keyword and the feature amount is generated, and a plurality of viewers are clustered based on each feature vector. By doing so, a plurality of clusters may be generated, and at least one first keyword may be selected from the plurality of first candidate keywords based on one or more feature vectors corresponding to the cluster to which the target user belongs. By clustering the viewers based on the feature vector indicating the feature amount of the first candidate keyword and selecting the first keyword based on the feature vector corresponding to the cluster to which the target user belongs, it is estimated that the target user is highly interested. It is possible to provide keywords with high accuracy.

他の側面に係る情報提供システムでは、少なくとも一つのプロセッサが、閲覧者と第１コンテンツのジャンルとの組合せ毎に特徴ベクトルを合算し、ジャンル毎に、複数の閲覧者をクラスタリングすることで複数のクラスタを生成し、対応するジャンルと対象ユーザが属するクラスタとの組合せに対応する特徴ベクトルに基づいて少なくとも一つの第１キーワードを選択してもよい。第１コンテンツのジャンルを考慮して閲覧者をクラスタリングし特徴ベクトルを選択することで、対象ユーザの関心が高いジャンルに対応するキーワードを提供することができる。 In the information providing system according to another aspect, at least one processor sums the feature vectors for each combination of the viewer and the genre of the first content, and clusters the plurality of viewers for each genre. Clusters may be generated and at least one first keyword may be selected based on feature vectors corresponding to combinations of corresponding genres and clusters to which the target user belongs. By clustering viewers in consideration of the genre of the first content and selecting a feature vector, it is possible to provide keywords corresponding to the genre of which the target user is highly interested.

他の側面に係る情報提供システムでは、少なくとも一つのプロセッサが、１以上の第１候補キーワードのそれぞれについて、所定の基準特徴量を取得し、１以上の第１候補キーワードのそれぞれについて、該第１候補キーワードに対応する第１コンテンツが生成されてからの経過時間と、該第１コンテンツに対する閲覧操作とのうちの少なくとも一つに基づいて重みを設定し、１以上の第１候補キーワードのそれぞれについて、基準特徴量に重みを適用することで特徴量を算出してもよい。この処理によって、特徴量は第１コンテンツの鮮度と第１コンテンツに対する閲覧者の関心の度合いとを示す。このような特徴量に基づいて第１キーワードを選択することで、対象ユーザの関心が高いと推定されるキーワードを高い精度で提供することができる。 In the information providing system according to another aspect, at least one processor obtains a predetermined reference feature amount for each of the one or more first candidate keywords, and for each of the one or more first candidate keywords, obtains the first A weight is set based on at least one of the elapsed time since the first content corresponding to the candidate keyword was generated and the viewing operation for the first content, and for each of the one or more first candidate keywords , the feature amount may be calculated by applying a weight to the reference feature amount. Through this process, the feature quantity indicates the freshness of the first content and the degree of interest of the viewer in the first content. By selecting the first keyword based on such a feature amount, it is possible to provide a keyword that is estimated to be of high interest to the target user with high accuracy.

他の側面に係る情報提供システムでは、少なくとも一つのプロセッサが、出現頻度が所定の基準よりも大きいキーワードを少なくとも一つの第２キーワードとして選択してもよい。この場合には、人々の間で特に関心が高いと推定されるキーワードを提供することができる。 In the information providing system according to another aspect, at least one processor may select, as at least one second keyword, a keyword whose appearance frequency is higher than a predetermined standard. In this case, keywords that are presumed to be of particular interest to people can be provided.

他の側面に係る情報提供システムでは、少なくとも一つのプロセッサが、メタ情報で示される１以上のキーワードのそれぞれについて、該キーワードに対応する第１コンテンツのヒット数を取得し、ヒット数が所定の閾値以上である場合に、該キーワードを少なくとも一つの第２キーワードとして選択してもよい。この処理により、人々の間で関心が高く、且つ第１コンテンツの検索に貢献できると推定されるキーワードを提供することができる。 In the information providing system according to another aspect, at least one processor acquires the number of hits of the first content corresponding to each of the one or more keywords indicated by the meta information, and the number of hits exceeds a predetermined threshold. If so, the keyword may be selected as at least one second keyword. Through this process, it is possible to provide keywords that are of high interest among people and that are estimated to contribute to searches for the first content.

他の側面に係る情報提供システムでは、少なくとも一つのプロセッサが、第１情報源での第１コンテンツの検索のために用いられるキーワードを対象ユーザに提供するために、キーワードリストを端末上に表示させてもよい。この場合には、対象ユーザが第１コンテンツを検索するために用いる蓋然性が高いキーワードを精度良く提供することが可能になる。 In the information providing system according to another aspect, at least one processor causes a keyword list to be displayed on a terminal in order to provide a target user with keywords used for searching for first content in a first information source. may In this case, it is possible to accurately provide keywords that are highly likely to be used by the target user to search for the first content.

以上、本開示について詳細に説明したが、当業者にとっては、本開示が本開示中に説明した実施形態に限定されるものではないということは明らかである。本開示は、請求の範囲の記載により定まる本開示の趣旨及び範囲を逸脱することなく修正及び変更態様として実施することができる。したがって、本開示の記載は、例示説明を目的とするものであり、本開示に対して何ら制限的な意味を有するものではない。 Although the present disclosure has been described in detail above, it should be apparent to those skilled in the art that the present disclosure is not limited to the embodiments described in this disclosure. The present disclosure can be practiced with modifications and variations without departing from the spirit and scope of the present disclosure as defined by the claims. Accordingly, the description of the present disclosure is for illustrative purposes and is not meant to be limiting in any way.

情報の通知は、本開示において説明した態様／実施形態に限られず、他の方法を用いて行われてもよい。例えば、情報の通知は、物理レイヤシグナリング（例えば、ＤＣＩ（Downlink Control Information）、ＵＣＩ（Uplink Control Information））、上位レイヤシグナリング（例えば、ＲＲＣ（Radio Resource Control）シグナリング、ＭＡＣ（Medium Access Control）シグナリング、報知情報（ＭＩＢ（Master Information Block）、ＳＩＢ（System Information Block）））、その他の信号又はこれらの組み合わせによって実施されてもよい。また、ＲＲＣシグナリングは、ＲＲＣメッセージと呼ばれてもよく、例えば、ＲＲＣ接続セットアップ（RRC Connection Setup）メッセージ、ＲＲＣ接続再構成（RRC Connection Reconfiguration）メッセージなどであってもよい。 Notification of information is not limited to the aspects/embodiments described in this disclosure, and may be performed using other methods. For example, notification of information includes physical layer signaling (e.g., DCI (Downlink Control Information), UCI (Uplink Control Information)), higher layer signaling (e.g., RRC (Radio Resource Control) signaling, MAC (Medium Access Control) signaling, It may be implemented by broadcast information (MIB (Master Information Block), SIB (System Information Block)), other signals, or a combination thereof. RRC signaling may also be called an RRC message, and may be, for example, an RRC connection setup message, an RRC connection reconfiguration message, or the like.

本開示において説明した各態様／実施形態は、ＬＴＥ（Long Term Evolution）、ＬＴＥ－Ａ（LTE-Advanced）、ＳＵＰＥＲ３Ｇ、ＩＭＴ－Ａｄｖａｎｃｅｄ、４Ｇ（4th generation mobile communication system）、５Ｇ（5th generation mobile communication system）、ＦＲＡ（Future Radio Access）、ＮＲ（new Radio）、Ｗ－ＣＤＭＡ（登録商標）、ＧＳＭ（登録商標）、ＣＤＭＡ２０００、ＵＭＢ（Ultra Mobile Broadband）、ＩＥＥＥ８０２．１１（Ｗｉ－Ｆｉ（登録商標））、ＩＥＥＥ８０２．１６（ＷｉＭＡＸ（登録商標））、ＩＥＥＥ８０２．２０、ＵＷＢ（Ultra-WideBand）、Ｂｌｕｅｔｏｏｔｈ（登録商標）、その他の適切なシステムを利用するシステム及びこれらに基づいて拡張された次世代システムの少なくとも一つに適用されてもよい。また、複数のシステムが組み合わされて（例えば、ＬＴＥ及びＬＴＥ－Ａの少なくとも一方と５Ｇとの組み合わせ等）適用されてもよい。 Each aspect/embodiment described in the present disclosure includes LTE (Long Term Evolution), LTE-A (LTE-Advanced), SUPER 3G, IMT-Advanced, 4G (4th generation mobile communication system), 5G (5th generation mobile communication system), FRA (Future Radio Access), NR (new Radio), W-CDMA (registered trademark), GSM (registered trademark), CDMA2000, UMB (Ultra Mobile Broadband), IEEE 802.11 (Wi-Fi (registered trademark) )), IEEE 802.16 (WiMAX®), IEEE 802.20, UWB (Ultra-WideBand), Bluetooth®, and other suitable systems and extended It may be applied to at least one of the next generation systems. Also, a plurality of systems may be applied in combination (for example, a combination of at least one of LTE and LTE-A and 5G, etc.).

本開示において説明した各態様／実施形態の処理手順、シーケンス、フローチャートなどは、矛盾の無い限り、順序を入れ替えてもよい。例えば、本開示において説明した方法については、例示的な順序を用いて様々なステップの要素を提示しており、提示した特定の順序に限定されない。 The processing procedures, sequences, flowcharts, etc. of each aspect/embodiment described in this disclosure may be rearranged as long as there is no contradiction. For example, the methods described in this disclosure present elements of the various steps using a sample order, and are not limited to the specific order presented.

本開示において基地局によって行われるとした特定動作は、場合によってはその上位ノード（upper node）によって行われることもある。基地局を有する１つ又は複数のネットワークノード（network nodes）からなるネットワークにおいて、端末との通信のために行われる様々な動作は、基地局及び基地局以外の他のネットワークノード（例えば、ＭＭＥ又はＳ－ＧＷなどが考えられるが、これらに限られない）の少なくとも１つによって行われ得ることは明らかである。上記において基地局以外の他のネットワークノードが１つである場合を例示したが、複数の他のネットワークノードの組み合わせ（例えば、ＭＭＥ及びＳ－ＧＷ）であってもよい。 Certain operations that are described in this disclosure as being performed by a base station may also be performed by its upper node in some cases. In a network consisting of one or more network nodes with a base station, various operations performed for communication with a terminal may be performed by the base station and other network nodes other than the base station (e.g. MME or S-GW, etc. (including but not limited to). Although the case where there is one network node other than the base station is exemplified above, it may be a combination of a plurality of other network nodes (for example, MME and S-GW).

情報等は、上位レイヤ（又は下位レイヤ）から下位レイヤ（又は上位レイヤ）へ出力され得る。複数のネットワークノードを介して入出力されてもよい。 Information, etc., may be output from a higher layer (or lower layer) to a lower layer (or higher layer). It may be input and output via multiple network nodes.

入出力された情報等は特定の場所（例えば、メモリ）に保存されてもよいし、管理テーブルを用いて管理してもよい。入出力される情報等は、上書き、更新、又は追記され得る。出力された情報等は削除されてもよい。入力された情報等は他の装置へ送信されてもよい。 Input/output information and the like may be stored in a specific location (for example, memory), or may be managed using a management table. Input/output information and the like can be overwritten, updated, or appended. The output information and the like may be deleted. The entered information and the like may be transmitted to another device.

判定は、１ビットで表される値（０か１か）によって行われてもよいし、真偽値（Boolean：true又はfalse）によって行われてもよいし、数値の比較（例えば、所定の値との比較）によって行われてもよい。 The determination may be made by a value represented by one bit (0 or 1), by a true/false value (Boolean: true or false), or by numerical comparison (for example, a predetermined value).

本開示において説明した各態様／実施形態は単独で用いてもよいし、組み合わせて用いてもよいし、実行に伴って切り替えて用いてもよい。また、所定の情報の通知（例えば、「Ｘであること」の通知）は、明示的に行うものに限られず、暗黙的（例えば、当該所定の情報の通知を行わない）ことによって行われてもよい。 Each aspect/embodiment described in the present disclosure may be used alone, may be used in combination, or may be used by switching according to execution. In addition, the notification of predetermined information (for example, notification of “being X”) is not limited to being performed explicitly, but may be performed implicitly (for example, not notifying the predetermined information). good too.

ソフトウェアは、ソフトウェア、ファームウェア、ミドルウェア、マイクロコード、ハードウェア記述言語と呼ばれるか、他の名称で呼ばれるかを問わず、命令、命令セット、コード、コードセグメント、プログラムコード、プログラム、サブプログラム、ソフトウェアモジュール、アプリケーション、ソフトウェアアプリケーション、ソフトウェアパッケージ、ルーチン、サブルーチン、オブジェクト、実行可能ファイル、実行スレッド、手順、機能などを意味するよう広く解釈されるべきである。 Software, whether referred to as software, firmware, middleware, microcode, hardware description language or otherwise, includes instructions, instruction sets, code, code segments, program code, programs, subprograms, and software modules. , applications, software applications, software packages, routines, subroutines, objects, executables, threads of execution, procedures, functions, and the like.

また、ソフトウェア、命令、情報などは、伝送媒体を介して送受信されてもよい。例えば、ソフトウェアが、有線技術（同軸ケーブル、光ファイバケーブル、ツイストペア、デジタル加入者回線（ＤＳＬ：Digital Subscriber Line）など）及び無線技術（赤外線、マイクロ波など）の少なくとも一方を使用してウェブサイト、サーバ、又は他のリモートソースから送信される場合、これらの有線技術及び無線技術の少なくとも一方は、伝送媒体の定義内に含まれる。 Software, instructions, information, etc. may also be sent and received over a transmission medium. For example, the software uses wired technology (coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), etc.) and/or wireless technology (infrared, microwave, etc.) to create websites, Wired and/or wireless technologies are included within the definition of transmission medium when sent from a server or other remote source.

本開示において説明した情報、信号などは、様々な異なる技術のいずれかを使用して表されてもよい。例えば、上記の説明全体に渡って言及され得るデータ、命令、コマンド、情報、信号、ビット、シンボル、チップなどは、電圧、電流、電磁波、磁界若しくは磁性粒子、光場若しくは光子、又はこれらの任意の組み合わせによって表されてもよい。 Information, signals, etc. described in this disclosure may be represented using any of a variety of different technologies. For example, data, instructions, commands, information, signals, bits, symbols, chips, etc. that may be referred to throughout the above description may refer to voltages, currents, electromagnetic waves, magnetic fields or magnetic particles, light fields or photons, or any of these. may be represented by a combination of

なお、本開示において説明した用語及び本開示の理解に必要な用語については、同一の又は類似する意味を有する用語と置き換えてもよい。例えば、チャネル及びシンボルの少なくとも一方は信号（シグナリング）であってもよい。また、信号はメッセージであってもよい。また、コンポーネントキャリア（ＣＣ：Component Carrier）は、キャリア周波数、セル、周波数キャリアなどと呼ばれてもよい。 The terms explained in this disclosure and the terms necessary for understanding the present disclosure may be replaced with terms having the same or similar meanings. For example, the channel and/or symbols may be signaling. A signal may also be a message. A component carrier (CC) may also be called a carrier frequency, a cell, a frequency carrier, or the like.

本開示において使用する「システム」及び「ネットワーク」という用語は、互換的に使用される。 As used in this disclosure, the terms "system" and "network" are used interchangeably.

また、本開示において説明した情報、パラメータなどは、絶対値を用いて表されてもよいし、所定の値からの相対値を用いて表されてもよいし、対応する別の情報を用いて表されてもよい。例えば、無線リソースはインデックスによって指示されるものであってもよい。 In addition, the information, parameters, etc. described in the present disclosure may be expressed using absolute values, may be expressed using relative values from a predetermined value, or may be expressed using other corresponding information. may be represented. For example, radio resources may be indexed.

上述したパラメータに使用する名称はいかなる点においても限定的な名称ではない。さらに、これらのパラメータを使用する数式等は、本開示で明示的に開示したものと異なる場合もある。様々なチャネル（例えば、ＰＵＣＣＨ、ＰＤＣＣＨなど）及び情報要素は、あらゆる好適な名称によって識別できるので、これらの様々なチャネル及び情報要素に割り当てている様々な名称は、いかなる点においても限定的な名称ではない。 The names used for the parameters described above are not limiting names in any way. Further, the formulas, etc., using these parameters may differ from those expressly disclosed in this disclosure. Since the various channels (e.g., PUCCH, PDCCH, etc.) and information elements can be identified by any suitable name, the various names assigned to these various channels and information elements are in no way restrictive names. is not.

本開示においては、「基地局（ＢＳ：Base Station）」、「無線基地局」、「固定局（fixed station）」、「ＮｏｄｅＢ」、「ｅＮｏｄｅＢ（ｅＮＢ）」、「ｇＮｏｄｅＢ（ｇＮＢ）」、「アクセスポイント（access point）」、「送信ポイント（transmission point）」、「受信ポイント（reception point）、「送受信ポイント（transmission/reception point）」、「セル」、「セクタ」、「セルグループ」、「キャリア」、「コンポーネントキャリア」などの用語は、互換的に使用され得る。基地局は、マクロセル、スモールセル、フェムトセル、ピコセルなどの用語で呼ばれる場合もある。 In the present disclosure, "base station (BS)", "radio base station", "fixed station", "NodeB", "eNodeB (eNB)", "gNodeB (gNB)", " "access point", "transmission point", "reception point", "transmission/reception point", "cell", "sector", "cell group", " Terms such as "carrier", "component carrier" may be used interchangeably. A base station may also be referred to by terms such as macrocell, small cell, femtocell, picocell, and the like.

基地局は、１つ又は複数（例えば、３つ）のセルを収容することができる。基地局が複数のセルを収容する場合、基地局のカバレッジエリア全体は複数のより小さいエリアに区分でき、各々のより小さいエリアは、基地局サブシステム（例えば、屋内用の小型基地局（ＲＲＨ：ＲｅｍｏｔｅＲａｄｉｏＨｅａｄ）によって通信サービスを提供することもできる。「セル」又は「セクタ」という用語は、このカバレッジにおいて通信サービスを行う基地局及び基地局サブシステムの少なくとも一方のカバレッジエリアの一部又は全体を指す。 A base station may serve one or more (eg, three) cells. When a base station accommodates multiple cells, the overall coverage area of the base station can be partitioned into multiple smaller areas, each smaller area being associated with a base station subsystem (e.g., an indoor small base station (RRH: The term "cell" or "sector" refers to part or all of the coverage area of a base station and/or base station subsystem serving communication in this coverage. point to

本開示においては、「移動局（ＭＳ：Mobile Station）」、「ユーザ端末（user terminal）」、「ユーザ装置（ＵＥ：User Equipment）」、「端末」などの用語は、互換的に使用され得る。 In this disclosure, terms such as “Mobile Station (MS),” “user terminal,” “User Equipment (UE),” “terminal,” etc. may be used interchangeably. .

移動局は、当業者によって、加入者局、モバイルユニット、加入者ユニット、ワイヤレスユニット、リモートユニット、モバイルデバイス、ワイヤレスデバイス、ワイヤレス通信デバイス、リモートデバイス、モバイル加入者局、アクセス端末、モバイル端末、ワイヤレス端末、リモート端末、ハンドセット、ユーザエージェント、モバイルクライアント、クライアント、又はいくつかの他の適切な用語で呼ばれる場合もある。 A mobile station is defined by those skilled in the art as a subscriber station, mobile unit, subscriber unit, wireless unit, remote unit, mobile device, wireless device, wireless communication device, remote device, mobile subscriber station, access terminal, mobile terminal, wireless It may also be called a terminal, remote terminal, handset, user agent, mobile client, client, or some other suitable term.

基地局及び移動局の少なくとも一方は、送信装置、受信装置、通信装置などと呼ばれてもよい。なお、基地局及び移動局の少なくとも一方は、移動体に搭載されたデバイス、移動体自体などであってもよい。当該移動体は、乗り物（例えば、車、飛行機など）であってもよいし、無人で動く移動体（例えば、ドローン、自動運転車など）であってもよいし、ロボット（有人型又は無人型）であってもよい。なお、基地局及び移動局の少なくとも一方は、必ずしも通信動作時に移動しない装置も含む。例えば、基地局及び移動局の少なくとも一方は、センサなどのＩｏＴ（Internet of Things）機器であってもよい。 At least one of a base station and a mobile station may be called a transmitter, a receiver, a communication device, and the like. At least one of the base station and the mobile station may be a device mounted on a mobile object, the mobile object itself, or the like. The mobile object may be a vehicle (e.g., car, airplane, etc.), an unmanned mobile object (e.g., drone, self-driving car, etc.), or a robot (manned or unmanned ). Note that at least one of the base station and the mobile station includes devices that do not necessarily move during communication operations. For example, at least one of the base station and the mobile station may be an IoT (Internet of Things) device such as a sensor.

また、本開示における基地局は、ユーザ端末で読み替えてもよい。例えば、基地局及びユーザ端末間の通信を、複数のユーザ端末間の通信（例えば、Ｄ２Ｄ（Device-to-Device）、Ｖ２Ｘ（Vehicle-to-Everything）などと呼ばれてもよい）に置き換えた構成について、本開示の各態様／実施形態を適用してもよい。この場合、基地局が有する機能をユーザ端末が有する構成としてもよい。また、「上り」及び「下り」などの文言は、端末間通信に対応する文言（例えば、「サイド（side）」）で読み替えられてもよい。例えば、上りチャネル、下りチャネルなどは、サイドチャネルで読み替えられてもよい。 Also, the base station in the present disclosure may be read as a user terminal. For example, communication between a base station and a user terminal is replaced with communication between multiple user terminals (for example, D2D (Device-to-Device), V2X (Vehicle-to-Everything), etc.) Regarding the configuration, each aspect/embodiment of the present disclosure may be applied. In this case, the user terminal may have the functions that the base station has. Also, words such as "up" and "down" may be replaced with words corresponding to inter-terminal communication (for example, "side"). For example, uplink channels, downlink channels, etc. may be read as side channels.

同様に、本開示におけるユーザ端末は、基地局で読み替えてもよい。この場合、ユーザ端末が有する機能を基地局が有する構成としてもよい。 Similarly, user terminals in the present disclosure may be read as base stations. In this case, the base station may have the functions that the user terminal has.

本開示で使用する「判断（determining）」、「決定（determining）」という用語は、多種多様な動作を包含する場合がある。「判断」、「決定」は、例えば、判定（judging）、計算（calculating）、算出（computing）、処理（processing）、導出（deriving）、調査（investigating）、探索（looking up、search、inquiry）（例えば、テーブル、データベース又は別のデータ構造での探索）、確認（ascertaining）した事を「判断」「決定」したとみなす事などを含み得る。また、「判断」、「決定」は、受信（receiving）（例えば、情報を受信すること）、送信（transmitting）（例えば、情報を送信すること）、入力（input）、出力（output）、アクセス（accessing）（例えば、メモリ中のデータにアクセスすること）した事を「判断」「決定」したとみなす事などを含み得る。また、「判断」、「決定」は、解決（resolving）、選択（selecting）、選定（choosing）、確立（establishing）、比較（comparing）などした事を「判断」「決定」したとみなす事を含み得る。つまり、「判断」「決定」は、何らかの動作を「判断」「決定」したとみなす事を含み得る。また、「判断（決定）」は、「想定する（assuming）」、「期待する（expecting）」、「みなす（considering）」などで読み替えられてもよい。 As used in this disclosure, the terms "determining" and "determining" may encompass a wide variety of actions. "Judgement", "determining" are, for example, judging, calculating, computing, processing, deriving, investigating, looking up, searching, inquiring (eg, lookup in a table, database, or other data structure), ascertaining as "judged" or "determined", and the like. Also, "judgment" and "decision" are used for receiving (e.g., receiving information), transmitting (e.g., transmitting information), input, output, access (accessing) (for example, accessing data in memory) may include deeming that something has been "determined" or "decided". In addition, "judgment" and "decision" are considered to be "judgment" and "decision" by resolving, selecting, choosing, establishing, comparing, etc. can contain. In other words, "judgment" and "decision" may include considering that some action is "judgment" and "decision". Also, "judgment (decision)" may be read as "assuming", "expecting", "considering", or the like.

「接続された（connected）」、「結合された（coupled）」という用語、又はこれらのあらゆる変形は、２又はそれ以上の要素間の直接的又は間接的なあらゆる接続又は結合を意味し、互いに「接続」又は「結合」された２つの要素間に１又はそれ以上の中間要素が存在することを含むことができる。要素間の結合又は接続は、物理的なものであっても、論理的なものであっても、或いはこれらの組み合わせであってもよい。例えば、「接続」は「アクセス」で読み替えられてもよい。本開示で使用する場合、２つの要素は、１又はそれ以上の電線、ケーブル及びプリント電気接続の少なくとも一つを用いて、並びにいくつかの非限定的かつ非包括的な例として、無線周波数領域、マイクロ波領域及び光（可視及び不可視の両方）領域の波長を有する電磁エネルギーなどを用いて、互いに「接続」又は「結合」されると考えることができる。 The terms "connected," "coupled," or any variation thereof mean any direct or indirect connection or connection between two or more elements, It can include the presence of one or more intermediate elements between two elements being "connected" or "coupled." Couplings or connections between elements may be physical, logical, or a combination thereof. For example, "connection" may be read as "access". As used in this disclosure, two elements are defined using at least one of one or more wires, cables, and printed electrical connections and, as some non-limiting and non-exhaustive examples, in the radio frequency domain. , electromagnetic energy having wavelengths in the microwave and optical (both visible and invisible) regions, and the like.

本開示において使用する「に基づいて」という記載は、別段に明記されていない限り、「のみに基づいて」を意味しない。言い換えれば、「に基づいて」という記載は、「のみに基づいて」と「に少なくとも基づいて」の両方を意味する。 As used in this disclosure, the phrase "based on" does not mean "based only on," unless expressly specified otherwise. In other words, the phrase "based on" means both "based only on" and "based at least on."

本開示において使用する「第１の」、「第２の」などの呼称を使用した要素へのいかなる参照も、それらの要素の量又は順序を全般的に限定しない。これらの呼称は、２つ以上の要素間を区別する便利な方法として本開示において使用され得る。したがって、第１及び第２の要素への参照は、２つの要素のみが採用され得ること、又は何らかの形で第１の要素が第２の要素に先行しなければならないことを意味しない。 Any reference to elements using the "first," "second," etc. designations used in this disclosure does not generally limit the quantity or order of those elements. These designations may be used in this disclosure as a convenient method of distinguishing between two or more elements. Thus, reference to a first and second element does not imply that only two elements can be employed or that the first element must precede the second element in any way.

本開示において、「含む（include）」、「含んでいる（including）」及びそれらの変形が使用されている場合、これらの用語は、用語「備える（comprising）」と同様に、包括的であることが意図される。さらに、本開示において使用されている用語「又は（or）」は、排他的論理和ではないことが意図される。 Where "include," "including," and variations thereof are used in this disclosure, these terms are inclusive, as is the term "comprising." is intended. Furthermore, the term "or" as used in this disclosure is not intended to be an exclusive OR.

本開示において、例えば、英語でのa, an及びtheのように、翻訳により冠詞が追加された場合、本開示は、これらの冠詞の後に続く名詞が複数形であることを含んでもよい。 In this disclosure, where articles have been added by translation, such as a, an, and the in English, the disclosure may include the plural nouns following these articles.

本開示において、「ＡとＢが異なる」という用語は、「ＡとＢが互いに異なる」ことを意味してもよい。なお、当該用語は、「ＡとＢがそれぞれＣと異なる」ことを意味してもよい。「離れる」、「結合される」などの用語も、「異なる」と同様に解釈されてもよい。 In the present disclosure, the term "A and B are different" may mean "A and B are different from each other." The term may also mean that "A and B are different from C". Terms such as "separate," "coupled," etc. may also be interpreted in the same manner as "different."

１…情報提供システム、１０…サーバ、１１…閲覧履歴解析部、１２…受付部、１３…第１選択部、１４…第２選択部、１５…リスト生成部、１６…送信部、２０…ユーザ端末、３０…データベース群、３１…閲覧履歴データベース、３２…第１コンテンツデータベース、３３…第１候補キーワードデータベース、３４…メタ情報データベース、３５…ユーザデータベース、２０２…キーワードリスト。 DESCRIPTION OF SYMBOLS 1... Information provision system, 10... Server, 11... Browsing history analysis part, 12... Reception part, 13... 1st selection part, 14... 2nd selection part, 15... List generation part, 16... Transmission part, 20... User Terminal, 30... Database group, 31... Browsing history database, 32... First content database, 33... First candidate keyword database, 34... Meta information database, 35... User database, 202... Keyword list.

Claims

comprising at least one processor,
the at least one processor
Referring to a first database storing browsing histories indicating that a plurality of viewers, including the target user, have accessed first content provided from a first information source, and determining at least one first keyword based on the browsing histories. and select
referring to a second database storing meta information indicating the appearance frequency of keywords in the second content provided from a second information source, selecting at least one second keyword based on the meta information;
generating a keyword list including at least one of the at least one first keyword and at least one of the at least one second keyword;
displaying the keyword list on the terminal of the target user;
information system.

the at least one processor
identifying one or more first candidate keywords from the first content for each access indicated by the browsing history;
calculating a feature amount for each of the one or more first candidate keywords for each access indicated by the browsing history, and generating a feature vector containing one or more combinations of the first candidate keyword and the feature amount;
generating a plurality of clusters by clustering the plurality of viewers based on their respective feature vectors;
Selecting the at least one first keyword from the plurality of first candidate keywords based on the one or more feature vectors corresponding to the cluster to which the target user belongs;
The information providing system according to claim 1.

the at least one processor
summing the feature vectors for each combination of the viewer and the genre of the first content;
generating the plurality of clusters by clustering the plurality of viewers for each genre;
selecting the at least one first keyword based on the feature vector corresponding to the combination of the corresponding genre and the cluster to which the target user belongs;
The information providing system according to claim 2.

the at least one processor
obtaining a predetermined reference feature amount for each of the one or more first candidate keywords;
for each of the one or more first candidate keywords, based on at least one of the elapsed time since the first content corresponding to the first candidate keyword was generated, and browsing operation on the first content to set the weights, and
calculating the feature amount by applying the weight to the reference feature amount for each of the one or more first candidate keywords;
The information providing system according to claim 2 or 3.

The at least one processor selects, as the at least one second keyword, the keyword whose appearance frequency is greater than a predetermined criterion.
The information providing system according to any one of claims 1 to 4.

the at least one processor
obtaining, for each of the one or more keywords indicated by the meta information, the number of hits of the first content corresponding to the keyword;
selecting the keyword as the at least one second keyword if the number of hits is greater than or equal to a predetermined threshold;
The information providing system according to any one of claims 1 to 5.

the at least one processor causing the keyword list to be displayed on the terminal to provide the target user with keywords used for searching the first content at the first information source;
The information providing system according to any one of claims 1-6.