JP2014106943A

JP2014106943A - Group extraction device, group extraction method, and program

Info

Publication number: JP2014106943A
Application number: JP2012262005A
Authority: JP
Inventors: Tomoya Takeyoshi; 朋也武吉; Erdmann Maike; マイケエルドマン; Keiichiro Hoashi; 啓一郎帆足
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2012-11-30
Filing date: 2012-11-30
Publication date: 2014-06-09
Anticipated expiration: 2032-11-30
Also published as: JP6042188B2

Abstract

PROBLEM TO BE SOLVED: To dynamically extract a friend group in consideration of time series or to extract a group keyword.SOLUTION: A steady feature of a friend set is calculated from a behavior history. Based on steady feature quantity and the behavior history within a specific period, a feature which does not appear regularly but appears only within the specific period is calculated. A friend relationship in the friend set is weighted. A group of friends who have a similar interest or concern within the specific period is extracted from the friend relationship which is weighted.

Description

本発明は、ソーシャルメディア上で、ユーザがつながる他者のアカウント集合から、特定の期間内での興味あるいは、関心が類似するアカウントグループを抽出し、更に当該グループ毎に興味・関心を表すキーワードを抽出するグループ抽出装置、グループ抽出方法およびプログラムに関する。 In the present invention, on social media, an account group having similar interests or interests within a specific period is extracted from an account set of others connected to the user, and a keyword representing interests / interests for each group is extracted. The present invention relates to a group extraction device, a group extraction method, and a program.

今まで接点の無かったユーザ同士をメンバに含むようなコミュニティを形成可能なコミュニティ形成システムを提供する技術が知られている。この従来技術の処理手順の概要は、（１）ある話題でコミュニケーションが頻繁なユーザ集合をクラスタとして抽出し、複数のクラスタから新規クラスタ形成の基になるクラスタ対の候補を選択する。次に（２）クラスタ対に属するユーザ情報（主に所属や趣味等のプロフィール属性）からコミュニティを特徴付けるよう共通するトピック属性を検出する。（３）トピック属性で対象のクラスタ対を選択的に併合してコミュニティ（本発明ではグループに相当）を形成、提示するものである（例えば、特許文献１参照。）。 There is known a technology for providing a community formation system that can form a community that includes users who have not had contacts until now as members. The outline of the processing procedure of this prior art is as follows: (1) A user set that is frequently communicated on a certain topic is extracted as a cluster, and a cluster pair candidate that forms a basis for forming a new cluster is selected from a plurality of clusters. Next, (2) common topic attributes that characterize the community are detected from user information belonging to the cluster pair (mainly profile attributes such as affiliation and hobbies). (3) A target cluster pair is selectively merged with a topic attribute to form and present a community (corresponding to a group in the present invention) (see, for example, Patent Document 1).

また、ソーシャルメディア上でつながるアカウントを複数グループに分割し、各グループの特徴を表すキーワードをグループ内の投稿内容から抽出する技術が知られている。この従来技術の処理手順の概要は、（１）友人関係の有無の疎密に基づき、友人関係が密なアカウントの集合をクリーク（本発明ではグループに相当）として抽出する。次に、（２）各クリーク内で影響力の高いアカウント（グループ内での友人関係が多いアカウント、あるいはニュースソースらしいアカウント）が発した単語の重要度を高くし、クリーク内の投稿内容から総合的に重要度の高い単語をキーワードとして抽出、提示するものである（例えば、非特許文献１参照。）。 In addition, a technique is known in which accounts connected on social media are divided into a plurality of groups, and keywords representing the characteristics of each group are extracted from the content posted in the group. The outline of this prior art processing procedure is as follows: (1) A set of accounts with close friendships is extracted as a clique (corresponding to a group in the present invention) based on the density of friendships. Next, (2) increase the importance of words from accounts that have a strong influence within each creek (accounts with many friends in the group, or accounts that seem to be news sources) A word having a high importance is extracted and presented as a keyword (see, for example, Non-Patent Document 1).

特開２０１０−２８６８６８号公報JP 2010-286868 A ＭａｉｋｅＥｒｄｍａｎｎ, ＴｏｍｏｙａＴａｋｅｙｏｓｈｉ, ＧｅｎＨａｔｔｏｒｉａｎｄＣｈｉｈｉｒｏＯｎｏ, “ＥｘｔｒａｃｔｉｏｎａｎｄＡｎｎｏｔａｔｉｏｎｏｆＰｅｒｓｏｎａｌＣｌｉｑｕｅｓｆｒｏｍＳｏｃｉａｌＮｅｔｗｏｒｋｓ,” Ｔｈｅ１２ｔｈＩＥＥＥ/ＩＰＳＪＩｎｔｅｒｎａｔｉｏｎａｌＳｙｍｐｏｓｉｕｍｏｎＡｐｐｌｉｃａｔｉｏｎａｎｄｔｈｅＩｎｔｅｒｎｅｔ（ＳＡＩＮＴ２０１２）, Ｊｕｌｙ, ２０１２.Maike Erdmann, Tomoya Takeyoshi, Gen Hattori and Chihiro Ono, "Extraction and Annotation of Personal Cliques from Social Networks," The 12th IEEE / IPSJ International Symposium on Application and the Internet (SAINT2012), July, 2012.

ところが、上記の特許文献１および非特許文献１に記載の技術では、過去からこれまでの定常的な話題と特定の期間内（例えば、以下では、直近１週間を例にとって説明する）の話題の差異といった、時間変化に伴う興味、関心の変化（時系列性）を考慮していないため、特定の期間内の興味や関心に基づくグループが優先的に形成されない。また、グループを特徴付けるキーワードがプロフィール属性に基づくため、特定の期間内の興味・関心が正しくキーワードとして表されないという問題があった。 However, in the techniques described in Patent Literature 1 and Non-Patent Literature 1 described above, the regular topics from the past to the past and topics within a specific period (for example, the following is described taking the most recent week as an example) Since interest such as differences and changes in interest (change in time series) are not considered, a group based on interest and interest within a specific period is not formed preferentially. In addition, since the keywords that characterize the group are based on profile attributes, there is a problem that interests within a specific period are not correctly expressed as keywords.

そこで、本発明は、上述の課題に鑑みてなされたものであり、時系列性を考慮する動的な友人グループの抽出、あるいはグループキーワードの抽出が可能なグループ抽出装置、グループ抽出方法およびプログラムを提供することを目的とする。 Therefore, the present invention has been made in view of the above-described problems, and includes a group extraction apparatus, a group extraction method, and a program capable of performing dynamic friend group extraction or group keyword extraction considering time series. The purpose is to provide.

本発明は、上記の課題を解決するために、以下の事項を提案している。なお、理解を容易にするために、本発明の実施形態に対応する符号を付して説明するが、これに限定されるものではない。 The present invention proposes the following matters in order to solve the above problems. In addition, in order to make an understanding easy, although the code | symbol corresponding to embodiment of this invention is attached | subjected and demonstrated, it is not limited to this.

（１）本発明は、ソーシャルメディア上の友人集合と友人の行動履歴から、特定の期間内の興味あるいは関心が類似するグループを抽出するグループ抽出装置であって、前記行動履歴から友人集合の定常的な特徴を算出する大域特徴算出部（例えば、図１の大域特徴算出部２０２に相当）と、前記定常的な特徴量と特定の期間内の行動履歴とに基づいて、定常的に出現せず、特定の期間内にのみ出現する特徴を算出し、前記友人集合における友人間の関係性を重み付けする重み算出部（例えば、図１の重み算出部２０３に相当）と、前記重み付けされた友人関係から、特定の期間内の興味あるいは関心が類似する友人のグループを抽出するグループ抽出部（例えば、図１のグループ抽出部２０３に相当）と、を備えることを特徴とするグループ抽出装置を提案している。 (1) The present invention is a group extraction apparatus that extracts a group of similar interests or interests within a specific period from a friend set on social media and a friend's action history, and the steady state of the friend set from the action history. On the basis of the global feature calculation unit (for example, equivalent to the global feature calculation unit 202 in FIG. 1), the steady feature amount, and the action history within a specific period. First, a weight calculation unit (for example, corresponding to the weight calculation unit 203 in FIG. 1) that calculates features that appear only within a specific period and weights the relationship between friends in the friend set, and the weighted friend A group extracting unit (for example, equivalent to the group extracting unit 203 in FIG. 1) that extracts a group of friends having similar interests or similar interests within a specific period from the relationship. It has proposed the extraction device.

この発明によれば、大域特徴算出部が、行動履歴から友人集合の定常的な特徴を算出し、重み算出部が、定常的な特徴量と特定の期間内の行動履歴とに基づいて、定常的に出現せず、特定の期間内にのみ出現する特徴を算出し、友人集合における友人間の関係性を重み付けし、グループ抽出部が、重み付けされた友人関係から、特定の期間内の興味あるいは関心が類似する友人のグループを抽出する。したがって、グループ抽出結果を提供することで、ソーシャルメディア上でつながる友人について、特定の期間内の興味・関心、およびその興味・関心を有する友人を概観することができる。 According to the present invention, the global feature calculation unit calculates the steady feature of the friend set from the action history, and the weight calculation unit calculates the steady feature based on the steady feature value and the action history within a specific period. Characteristic that does not appear in a specific manner and appears only within a specific period, weights the relationship between friends in the friend set, and the group extraction unit determines whether the interest within the specific period or the Extract a group of friends with similar interests. Therefore, by providing the group extraction result, it is possible to give an overview of the interests / interests within a specific period and the friends who have the interests / interests for friends connected on social media.

（２）本発明は、（１）のグループ抽出装置について、前記グループ毎に、特定の期間内の友人の投稿内容からグループを特徴付けるキーワードを抽出するグループキーワード抽出部（例えば、図１のグループキーワード抽出部２０５に相当）を備えたことを特徴とするグループ抽出装置を提案している。 (2) The present invention relates to a group keyword extraction unit (for example, the group keyword in FIG. 1) that extracts a keyword that characterizes a group from the content posted by a friend within a specific period for each group. A group extraction apparatus characterized by having an extraction unit 205) is proposed.

この発明によれば、グループキーワード抽出部が、グループ毎に、特定の期間内の友人の投稿内容からグループを特徴付けるキーワードを抽出する。つまり、グループ毎に、特定の期間内の友人の投稿内容からグループを特徴付けるキーワードを抽出することから、特定の期間内の興味や関心に基づく、正確なキーワードを抽出することができる。 According to the present invention, the group keyword extraction unit extracts, for each group, a keyword that characterizes the group from the posted contents of friends within a specific period. In other words, for each group, keywords that characterize the group are extracted from the content posted by friends within a specific period, so that accurate keywords based on interests and interests within a specific period can be extracted.

（３）本発明は、（２）のグループ抽出装置について、前記グループキーワードをユーザに提示し、ユーザの興味あるいは関心に合致するグループの指定をユーザから受け付け、指定されたグループに所属する友人の特定の期間内の投稿内容のみを取得し、出力するグループフィルタリング部（例えば、図９のグループフィルタリング部２０６に相当）を備えたことを特徴とするグループ抽出装置を提案している。 (3) In the group extraction device of (2), the present invention presents the group keyword to the user, accepts designation of a group that matches the user's interests or interests from the user, and identifies the friends belonging to the designated group A group extraction apparatus is proposed that includes a group filtering unit (for example, equivalent to the group filtering unit 206 in FIG. 9) that acquires and outputs only the posted contents within a specific period.

この発明によれば、グループフィルタリング部は、グループキーワードをユーザに提示し、ユーザの興味あるいは関心に合致するグループの指定をユーザから受け付け、指定されたグループに所属する友人の特定の期間内の投稿内容のみを取得し、出力する。したがって、グループ抽出結果を基にタイムラインのフィルタ機能を提供することで、ある興味、関心に基づく投稿内容をユーザが効率的に追うことができる。 According to this invention, the group filtering unit presents a group keyword to the user, receives designation of a group that matches the user's interest or interest from the user, and posts within a specific period of friends belonging to the designated group Get only the contents and output. Therefore, by providing a timeline filter function based on the group extraction result, the user can efficiently follow the posting contents based on a certain interest and interest.

（４）本発明は、（２）のグループ抽出装置について、前記グループキーワードのうち、ユーザがプロフィール（属性）情報として登録する興味あるいは関心を表す単語、あるいはユーザの投稿内容から抽出された単語に合致するグループキーワードを友人毎に求めるキーワードマッチング部（例えば、図１２のキーワードマッチング部２０７に相当）を備えたことを特徴とするグループ抽出装置を提案している。 (4) The present invention relates to the group extraction device of (2), wherein among the group keywords, a word representing an interest or interest registered as profile (attribute) information by a user, or a word extracted from a user's posted content A group extraction apparatus is proposed that includes a keyword matching unit (for example, equivalent to the keyword matching unit 207 in FIG. 12) for obtaining a matching group keyword for each friend.

この発明によれば、キーワードマッチング部は、グループキーワードのうち、ユーザがプロフィール（属性）情報として登録する興味あるいは関心を表す単語、あるいはユーザの投稿内容から抽出された単語に合致するグループキーワードを友人毎に求める。したがって、キーワードマッチングを行い、合致した友人とキーワードを列挙し、これらのリストを提示すれば、ユーザの発信先設定が支援可能となる。 According to the present invention, the keyword matching unit adds a group keyword that matches a word representing an interest or interest registered as profile (attribute) information by a user or a word extracted from the posted content of the user among the group keywords. Ask every time. Therefore, by performing keyword matching, enumerating matched friends and keywords, and presenting these lists, it is possible to support the user's destination setting.

（５）本発明は、（１）から（４）のグループ抽出装置について、前記行動履歴は、ソーシャルメディア上の友人の投稿内容あるいは、コミュニケーション履歴であり、前記大域特徴算出部は、投稿内容に定常的に出現する単語群とその出現頻度、および定常的にコミュニケーションを行う友人の組み合わせとその頻度を抽出し、前記重み算出部は、前記定常的に出現する単語群に比して特定の期間内で出現頻度が向上した単語を、共に投稿内容に含む友人間の重みを高く設定し、更に定常的なコミュニケーション頻度に比して特定の期間内でコミュニケーション頻度が向上した友人間の重みを高く設定することを特徴とするグループ抽出装置を提案している。 (5) The present invention relates to the group extraction device of (1) to (4), wherein the action history is a post content or a communication history of a friend on social media, and the global feature calculation unit The word group that appears regularly and its appearance frequency, and the combination and frequency of friends that regularly communicate, are extracted, and the weight calculation unit has a specific period compared to the word group that appears regularly The weight of the friend who includes the words whose appearance frequency has improved in the posting content is set higher, and the weight of the friend who has improved the communication frequency within a specific period is higher than the regular communication frequency. A group extraction device characterized by setting is proposed.

この発明によれば、行動履歴は、ソーシャルメディア上の友人の投稿内容あるいは、コミュニケーション履歴であり、大域特徴算出部は、投稿内容に定常的に出現する単語群とその出現頻度、および定常的にコミュニケーションを行う友人の組み合わせとその頻度を抽出し、重み算出部は、定常的に出現する単語群に比して特定の期間内で出現頻度が向上した単語を、共に投稿内容に含む友人間の重みを高く設定し、更に定常的なコミュニケーション頻度に比して特定の期間内でコミュニケーション頻度が向上した友人間の重みを高く設定する。このような処理を行うことにより、類似する話題を発信し、かつ、コミュニケーションが活性化しやすいグループの抽出が容易となる。 According to the present invention, the action history is a post content or communication history of a friend on social media, and the global feature calculation unit is configured to constantly and regularly express a group of words that appear in the post content and their appearance frequency. The combination of friends who communicate and their frequency are extracted, and the weight calculation unit compares the words whose appearance frequency has improved within a specific period compared to the words that appear regularly. The weight is set high, and the weight between friends whose communication frequency is improved within a specific period is set higher than the regular communication frequency. By performing such processing, it becomes easy to extract a group that transmits a similar topic and that is likely to activate communication.

（６）本発明は、（１）から（４）のグループ抽出装置について、前記重み算出部は、さらに、プロフィール属性の類似度が高い友人間の重みを高く設定することを特徴とするグループ抽出装置を提案している。 (6) In the group extraction device according to any one of (1) to (4), the weight calculation unit further sets a high weight between friends having high similarity of profile attributes. A device is proposed.

この発明によれば、重み算出部は、さらに、プロフィール属性の類似度が高い友人間の重みを高く設定する。これにより、類似あるいは同一の勤務先や類似あるいは同一の出身校である友人が所属するグループの抽出が容易となる。 According to this invention, the weight calculation unit further sets a high weight between friends whose profile attribute similarity is high. Thereby, it becomes easy to extract a group to which a friend who belongs to a similar or identical work place or a similar or identical school belongs.

（７）本発明は、ソーシャルメディア上の友人集合と友人の行動履歴から、特定の期間内の興味あるいは関心が類似するグループを抽出するグループ抽出装置におけるグループ抽出方法であって、前記行動履歴から友人集合の定常的な特徴を算出する第１のステップ（例えば、図２のステップＳ１００に相当）と、前記定常的な特徴量と特定の期間内の行動履歴を基に、定常的には出現せず、特定の期間内に出現する特徴を算出し、前記友人集合における友人間の関係性を重み付けする第２のステップ（例えば、図２のステップＳ２００に相当）と、前記重み付けされた友人関係から、特定の期間内の興味あるいは関心が類似する友人のグループを抽出する第３のステップ（例えば、図２のステップＳ３００に相当）と、を備えることを特徴とするグループ抽出方法を提案している。 (7) The present invention is a group extraction method in a group extraction device for extracting a group of similar interests or interests within a specific period from a friend set on social media and a friend's action history, and from the action history Based on the first step (for example, equivalent to step S100 in FIG. 2) for calculating the steady feature of the friend set, the steady feature amount, and the action history within a specific period, it appears regularly. 2nd step (for example, corresponding to step S200 in FIG. 2) of calculating the features that appear within a specific period and weighting the relationship between friends in the friend set, and the weighted friend relationship And a third step (for example, corresponding to step S300 in FIG. 2) for extracting a group of friends having similar interests or similar interests within a specific period. It has proposed a group extraction method to.

この発明によれば、行動履歴から友人集合の定常的な特徴を算出し、定常的な特徴量と特定の期間内の行動履歴とに基づいて、定常的に出現せず、特定の期間内にのみ出現する特徴を算出し、友人集合における友人間の関係性を重み付けして、重み付けされた友人関係から、特定の期間内の興味あるいは関心が類似する友人のグループを抽出する。したがって、グループ抽出結果を提供することで、ソーシャルメディア上でつながる友人について、特定の期間内の興味・関心、およびその興味・関心を有する友人を概観することができる。 According to this invention, the stationary feature of the friend set is calculated from the behavior history, and based on the stationary feature amount and the behavior history within a specific period, it does not appear regularly, but within a specific period. The feature which appears only is calculated, the relationship between friends in a friend set is weighted, and the group of friends with similar interest or interest within a specific period is extracted from the weighted friend relationships. Therefore, by providing the group extraction result, it is possible to give an overview of the interests / interests within a specific period and the friends who have the interests / interests for friends connected on social media.

（８）本発明は、（７）のグループ抽出方法について、前記グループ毎に、特定の期間内の友人の投稿内容からグループを特徴付けるキーワードを抽出する第４のステップ（例えば、図２のステップＳ４００に相当）を備えたことを特徴とするグループ抽出方法を提案している。 (8) In the group extraction method of (7), the present invention provides a fourth step (for example, step S400 in FIG. 2) for extracting a keyword characterizing a group from the posted contents of friends within a specific period for each group. A group extraction method characterized by comprising

この発明によれば、さらに、グループ毎に、特定の期間内の友人の投稿内容からグループを特徴付けるキーワードを抽出する。つまり、グループ毎に、特定の期間内の友人の投稿内容からグループを特徴付けるキーワードを抽出することから、特定の期間内の興味や関心に基づく、正確なキーワードを抽出することができる。 Further, according to the present invention, for each group, a keyword that characterizes the group is extracted from the content posted by friends within a specific period. In other words, for each group, keywords that characterize the group are extracted from the content posted by friends within a specific period, so that accurate keywords based on interests and interests within a specific period can be extracted.

（９）本発明は、（８）のグループ抽出方法について、前記第４のステップにおいて、前記グループキーワードをユーザに提示し、ユーザの興味あるいは関心に合致するグループの指定をユーザから受け付け、指定されたグループに所属する友人の特定の期間内の投稿内容のみを取得し、出力することを特徴とするグループ抽出方法を提案している。 (9) In the group extraction method of (8), the present invention presents the group keyword to the user in the fourth step, accepts designation of a group that matches the user's interest or interest from the user, and is designated A group extraction method is proposed in which only the contents of postings within a specific period of friends belonging to a group are acquired and output.

この発明によれば、グループキーワードをユーザに提示し、ユーザの興味あるいは関心に合致するグループの指定をユーザから受け付け、指定されたグループに所属する友人の特定の期間内の投稿内容のみを取得し、出力する。したがって、グループ抽出結果を基にタイムラインのフィルタ機能を提供することで、ある興味、関心に基づく投稿内容をユーザが効率的に追うことができる。 According to this invention, the group keyword is presented to the user, the user's interest or the designation of the group that matches the interest is accepted from the user, and only the posting contents within a specific period of friends belonging to the designated group are acquired. ,Output. Therefore, by providing a timeline filter function based on the group extraction result, the user can efficiently follow the posting contents based on a certain interest and interest.

（１０）本発明は、（８）のグループ抽出方法について、前記第４のステップにおいて、前記グループキーワードのうち、ユーザがプロフィール（属性）情報として登録する興味あるいは関心を表す単語、あるいはユーザの投稿内容から抽出された単語に合致するグループキーワードを友人毎に求めることを特徴とするグループ抽出方法を提案している。 (10) The present invention relates to the group extraction method of (8), in the fourth step, among the group keywords, a word representing interest or interest that the user registers as profile (attribute) information, or a user's post A group extraction method is proposed in which a group keyword matching a word extracted from content is obtained for each friend.

この発明によれば、グループキーワードのうち、ユーザがプロフィール（属性）情報として登録する興味あるいは関心を表す単語、あるいはユーザの投稿内容から抽出された単語に合致するグループキーワードを友人毎に求める。したがって、キーワードマッチングを行い、合致した友人とキーワードを列挙し、これらのリストを提示すれば、ユーザの発信先設定が支援可能となる。 According to this invention, among group keywords, a group keyword that matches an interest or interest word registered as profile (attribute) information by the user or a word extracted from the posted content of the user is obtained for each friend. Therefore, by performing keyword matching, enumerating matched friends and keywords, and presenting these lists, it is possible to support the user's destination setting.

（１１）本発明は、ソーシャルメディア上の友人集合と友人の行動履歴から、特定の期間内の興味あるいは関心が類似するグループを抽出するグループ抽出装置におけるグループ抽出方法をコンピュータに実行させるためのプログラムであって、前記行動履歴から友人集合の定常的な特徴を算出する第１のステップ（例えば、図２のステップＳ１００に相当）と、前記定常的な特徴量と特定の期間内の行動履歴を基に、定常的には出現せず、特定の期間内に出現する特徴を算出し、前記友人集合における友人間の関係性を重み付けする第２のステップ（例えば、図２のステップＳ２００に相当）と、前記重み付けされた友人関係から、特定の期間内の興味あるいは関心が類似する友人のグループを抽出する第３のステップ（例えば、図２のステップＳ３００に相当）と、をコンピュータに実行させるためのプログラムを提案している。 (11) The present invention is a program for causing a computer to execute a group extraction method in a group extraction apparatus that extracts groups having similar interests or interests within a specific period from a friend set on social media and a friend's action history. A first step (for example, corresponding to step S100 in FIG. 2) for calculating a stationary feature of a friend set from the behavior history, the stationary feature amount, and a behavior history within a specific period. Based on the second step of calculating features that do not appear regularly but appear within a specific period, and weight the relationship between friends in the friend set (for example, corresponding to step S200 in FIG. 2) And a third step of extracting a group of friends having similar interests or similar interests within a specific period from the weighted friendship (for example, the step of FIG. 2). And Tsu equivalent to up S300), has proposed a program to be executed by the computer.

（１２）本発明は、（１１）のプログラムについて、前記グループ毎に、特定の期間内の友人の投稿内容からグループを特徴付けるキーワードを抽出する第４のステップ（例えば、図２のステップＳ４００に相当）を備えたことを特徴とするプログラムを提案している。 (12) The present invention relates to the fourth step of extracting a keyword characterizing a group from the content posted by friends within a specific period for each group in the program of (11) (for example, corresponding to step S400 of FIG. 2). ) Has been proposed.

（１３）本発明は、（１２）のプログラムについて、前記第４のステップにおいて、前記グループキーワードをユーザに提示し、ユーザの興味あるいは関心に合致するグループの指定をユーザから受け付け、指定されたグループに所属する友人の特定の期間内の投稿内容のみを取得し、出力することを特徴とするプログラムを提案している。 (13) In the program of (12), the present invention presents the group keyword to the user in the fourth step, receives designation of a group that matches the user's interest or interest from the user, and designates the group We propose a program that acquires and outputs only the content of posts within a specific period of friends belonging to.

（１４）本発明は、（１２）のプログラムについて、前記第４のステップにおいて、前記グループキーワードのうち、ユーザがプロフィール（属性）情報として登録する興味あるいは関心を表す単語、あるいはユーザの投稿内容から抽出された単語に合致するグループキーワードを友人毎に求めることを特徴とするプログラムを提案している。 (14) The present invention relates to the program of (12), from the word representing the interest or interest that the user registers as profile (attribute) information among the group keywords in the fourth step, or the content posted by the user. We have proposed a program characterized by finding a group keyword that matches the extracted word for each friend.

本発明によれば、グループ抽出結果を提供することで、ソーシャルメディア上でつながる友人について、特定の期間内の興味・関心、およびその興味・関心を有する友人を概観することが可能になるという効果がある。また、グループ抽出結果を基にタイムラインのフィルタ機能を提供することで、ある興味、関心に基づく投稿内容をユーザが効率的に追うことができるという効果がある。 According to the present invention, by providing a group extraction result, it is possible to give an overview of interests and interests within a specific period and friends who have interests and interests in friends connected on social media. There is. Also, by providing a timeline filter function based on the group extraction result, there is an effect that the user can efficiently follow the posting contents based on a certain interest and interest.

さらに、グループ抽出結果の提示（グループ構成アカウントとキーワードの提示）により、既知の友人に対する新たな気付きの提供が可能となり、コミュニケーションの活性化が期待できるという効果がある。また、ユーザがある内容（例：Ｗｅｂニュース）をシェア、あるいは送信する際に、上記グループ構成メンバのみを宛先とするシェア、送信を可能にすることで、ユーザの発信先設定が支援可能となるという効果がある。 Further, the presentation of the group extraction result (presentation of the group configuration account and the keyword) makes it possible to provide a new awareness to a known friend and to expect the activation of communication. In addition, when a user shares or transmits a certain content (for example, Web news), sharing and transmission with only the group member as a destination can be supported, so that setting of a destination of the user can be supported. There is an effect.

本発明の第１の実施形態に係るグループ抽出装置の構成を示す図である。It is a figure which shows the structure of the group extraction apparatus which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態に係るグループ抽出装置の処理を示す図である。It is a figure which shows the process of the group extraction apparatus which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態に係る重み算出の処理を示す図である。It is a figure which shows the process of weight calculation which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態に係る投稿内容の類似度の算出処理を示す図である。It is a figure which shows the calculation process of the similarity of the content of contribution concerning the 1st Embodiment of this invention. 本発明の第１の実施形態に係るコミュニケーション頻度の算出処理を示す図である。It is a figure which shows the calculation process of the communication frequency which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態に係る属性類似度の算出処理を示す図である。It is a figure which shows the calculation process of the attribute similarity based on the 1st Embodiment of this invention. 本発明の概要を示す図である。It is a figure which shows the outline | summary of this invention. 本発明の出力内容を示す図である。It is a figure which shows the output content of this invention. 本発明の第２の実施形態に係るグループ抽出装置の構成を示す図である。It is a figure which shows the structure of the group extraction apparatus which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施形態に係るグループ抽出装置の処理を示す図である。It is a figure which shows the process of the group extraction apparatus which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施形態に係る入出力の関係を示す図である。It is a figure which shows the relationship of the input / output which concerns on the 2nd Embodiment of this invention. 本発明の第３の実施形態に係るグループ抽出装置の構成を示す図である。It is a figure which shows the structure of the group extraction apparatus which concerns on the 3rd Embodiment of this invention. 本発明の第３の実施形態に係るグループ抽出装置の処理を示す図である。It is a figure which shows the process of the group extraction apparatus which concerns on the 3rd Embodiment of this invention. 本発明の第４の実施形態に係るグループ抽出装置の構成を示す図である。It is a figure which shows the structure of the group extraction apparatus which concerns on the 4th Embodiment of this invention. 本発明の第４の実施形態に係るグループ抽出装置の処理を示す図である。It is a figure which shows the process of the group extraction apparatus which concerns on the 4th Embodiment of this invention.

＜本発明の概要＞
実施形態の詳細な説明の前に、図７を用いて、本発明の概要について説明する。
まず、グループ抽出の対象とする特定の期間内（例えば、直近１週間）と、過去〜グループ抽出期間前とを分割（図７参照）し、後者の期間に含まれるソーシャルメディア上の行動（投稿やコミュニケーション）履歴から、定常的に出現する話題語や、定常的にコミュニケーションをとる友人間を表す特徴を抽出する。 <Outline of the present invention>
Prior to detailed description of the embodiment, the outline of the present invention will be described with reference to FIG.
First, a group extraction target period (for example, the last one week) and the past to the group extraction period before (see FIG. 7) are divided into social media actions (posts) included in the latter period. And communication), a topic word that appears regularly and a feature representing a friend who regularly communicates are extracted.

次に、特定の期間内での興味・関心が類似する友人を同一グループのメンバとして抽出可能とするため、上記大域的特徴、および特定の期間内の特徴の比較から友人間で下記２種類（または３種類）の指標値を算出し、更にこれら指標値を用いて友人関係の重みを決定する。 Next, in order to make it possible to extract friends with similar interests within a specific period as members of the same group, the following two types (between the global characteristics and the characteristics within a specific period are compared between friends: Alternatively, three types of index values are calculated, and the weights of friendships are determined using these index values.

友人関係の重みを決定するパラメータとして、投稿内容の類似度：Ｓ（Ａ、Ｂ）、コミュニケーション頻度：Ｆ（Ａ、Ｂ）または、プロフィール属性の類似度：Ｐ（Ａ、Ｂ）が考えられる。 As parameters for determining the weight of friendship, the similarity of posted contents: S (A, B), the communication frequency: F (A, B), or the similarity of profile attributes: P (A, B) can be considered.

投稿内容の類似度：Ｓ（Ａ、Ｂ）は、投稿内容について、ＡさんとＢさんとの間の類似度である。特定の期間内の投稿内容が類似する場合、友人関係の重みを高くする。その際、定常的には出現せず、特定の期間内になって出現頻度が高くなった単語が、ＡさんとＢさんの投稿内容にともに出現する場合、特に友人関係の重みを高くする。これにより、同じ話題に言及している友人が同一グループに含まれやすくする。 Posting content similarity: S (A, B) is the similarity between A and B regarding the posting content. If the posted content within a specific period is similar, the weight of the friendship is increased. At that time, when words that do not appear regularly but have a high appearance frequency within a specific period appear together in the posted contents of Mr. A and Mr. B, the weight of the friendship is particularly increased. This makes it easier for friends who are referring to the same topic to be included in the same group.

コミュニケーション頻度：Ｆ（Ａ、Ｂ）は、ＡさんとＢさんの間でのコミュニケーション頻度である。特定の期間内のコミュニケーション頻度が高い場合、友人関係の重みを高くする。これにより、ある話題下でコミュニケーションをとる（＝興味を示す）友人が同一グループに含まれやすくする。例えば、Ａさんの「フットサル日本代表の試合感動したー」という投稿にたいし、Ｂさんが反応している場合、Ｂさんも「フットサル」あるいは「日本代表」に興味を示していると考える。 Communication frequency: F (A, B) is a communication frequency between Mr. A and Mr. B. When the communication frequency within a specific period is high, the weight of the friendship is increased. This makes it easier for friends who communicate under a certain topic (= show interest) to be included in the same group. For example, if Mr. B is responding to Mr. A's post “I was impressed with the game of Japan Futsal National Team”, Mr. B thinks that Mr. B is also interested in “Futsal” or “Japan National Team”.

プロフィール属性の類似度：Ｐ（Ａ、Ｂ）は、プロフィール属性について、ＡさんとＢさんとの間の類似度である。プロフィール属性の類似度が高い場合、友人関係の重みを高くする。友人グループに対し、勤務先や出身校といった属性を加味したい場合は、プロフィール属性の類似度を採用し、重み付けに対する重視度合を投稿内容の類似度：Ｓ（Ａ、Ｂ）、コミュニケーション頻度：Ｆ（Ａ、Ｂ）と同程度、もしくは、投稿内容の類似度：Ｓ（Ａ、Ｂ）、コミュニケーション頻度：Ｆ（Ａ、Ｂ）よりも高くする（つまり、後述のα、β、γについて、γ>=α、βとする）。 Profile attribute similarity: P (A, B) is the similarity between A and B for profile attributes. If the similarity of profile attributes is high, the weight of friendship is increased. If you want to add attributes such as work or school to a friend group, you can use the similarity of profile attributes, and the degree of emphasis on weighting is the similarity of posted content: S (A, B), communication frequency: F ( A or B) or higher than the similarity of posted content: S (A, B), communication frequency: F (A, B) (that is, for α, β, γ described later, γ> = α, β).

例えば、「サッカーといった同じ興味・関心でも、なるべく同僚だけのグループ、大学の同級生のグループに分けたい」といったケースである。ただし、プロフィール属性が取得可能な環境下では、これのみを用いてフィルタリングを行うことも可能であるため、本発明ではオプショナルの位置付けとする。 For example, “I want to divide even the same interests and interests such as soccer into a group of colleagues only and a group of classmates as much as possible”. However, in an environment where profile attributes can be acquired, filtering can be performed using only the profile attribute. Therefore, in the present invention, the position is optional.

上記の値から、ＡさんとＢさんの間の「友人関係の重み」Ｗ（Ａ、Ｂ）を式（１）により決定する。
Ｗ（Ａ、Ｂ）＝α×Ｓ（Ａ、Ｂ）＋β×Ｆ（Ａ、Ｂ）＋γ×P（Ａ、Ｂ）…（１）
ここで、α、β、γ（全て０以上）は各指標値の重みに与える影響度合いを表し、本発明をサービス、あるいはアプリケーションとして提供する際に、自由に設定可能とする。 From the above values, the “friendship weight” W (A, B) between Mr. A and Mr. B is determined by equation (1).
W (A, B) = α × S (A, B) + β × F (A, B) + γ × P (A, B) (1)
Here, α, β, and γ (all 0 or more) represent the degree of influence on the weight of each index value, and can be freely set when the present invention is provided as a service or application.

具体的には、下記３種類のパターン、および抽出されるグループに対する効果が考えられる。
αの影響度合いが高い：投稿内容の類似性を重視し、類似する話題を発信するグループが抽出されやすくなる。
βの影響度合いが高い：コミュニケーションの発生を重視し、コミュニケーションが活性化しやすいグループが抽出されやすくなる。
γの影響度合いが高い：プロフィールの類似性を重視し、類似・同一の勤務先や出身校である友人が所属するグループが抽出されやすくなる。 Specifically, the following three types of patterns and effects on extracted groups are conceivable.
The degree of influence of α is high: Emphasis is placed on the similarity of the posted contents, and it is easy to extract groups that transmit similar topics.
High degree of influence of β: Emphasis is placed on the occurrence of communication, and it is easy to extract groups that facilitate communication.
High degree of influence of γ: Emphasizing the similarity of profiles, it becomes easy to extract a group to which friends who are similar or at the same work place or school belong.

次いで、重み付けされた友人関係（ソーシャルグラフ）から、グループ内の友人関係の数、あるいは重みの総計が高くなるようなグループを求める。そして、各グループに所属する友人の投稿内容から、グループを特徴付けるキーワードを抽出する。この時、定常的には出現せず、特定の期間内で出現した話題語を優先的に選択する。 Next, from the weighted friendship (social graph), the number of friendships in the group or a group with a high total weight is obtained. And the keyword which characterizes a group is extracted from the contribution content of the friend who belongs to each group. At this time, topic words that do not appear regularly but appear within a specific period are preferentially selected.

以下、本発明の実施形態について、図面を用いて、詳細に説明する。
なお、本実施形態における構成要素は適宜、既存の構成要素等との置き換えが可能であり、また、他の既存の構成要素との組合せを含む様々なバリエーションが可能である。したがって、本実施形態の記載をもって、特許請求の範囲に記載された発明の内容を限定するものではない。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
Note that the constituent elements in the present embodiment can be appropriately replaced with existing constituent elements and the like, and various variations including combinations with other existing constituent elements are possible. Therefore, the description of the present embodiment does not limit the contents of the invention described in the claims.

＜第１の実施形態＞
図１から図８を用いて、本発明の実施形態について説明する。 <First Embodiment>
An embodiment of the present invention will be described with reference to FIGS.

＜グループ抽出装置の構成＞
本実施形態に係るグループ抽出装置は、図１に示すように、ソーシャルメディアと、ＡＰＩ１００と、データ収集部２０１と、大域特徴算出部２０２と、重み算出部２０３と、グループ抽出部２０４と、グループキーワード抽出部２０５と、プロフィールＤＢ２１１と、投稿内容ＤＢ２１２と、コミュニケーション履歴ＤＢ２１３と、友人関係ＤＢ２１４と、表示部３００とから構成されている。 <Configuration of group extraction device>
As shown in FIG. 1, the group extraction apparatus according to the present embodiment includes social media, an API 100, a data collection unit 201, a global feature calculation unit 202, a weight calculation unit 203, a group extraction unit 204, and a group. The keyword extraction unit 205, the profile DB 211, the posted content DB 212, the communication history DB 213, the friend relationship DB 214, and the display unit 300 are configured.

ソーシャルメディアは、アカウント間の関係性（例：友人関係）をベースにするサービスである。ＡＰＩ１００は、各種ソーシャルメディアが提供し、外部からソーシャルメディア上のデータを取得可能とするインターフェースである。表示部３００は、出力情報をユーザに提示する。 Social media is a service based on relationships between accounts (eg, friendships). The API 100 is an interface provided by various social media and capable of acquiring data on the social media from the outside. The display unit 300 presents output information to the user.

データ収集部２０１は、ＡＰＩ１００を通じ、各種ソーシャルメディアから必要なデータを収集する。大域特徴算出部２０２は。収集・蓄積済みデータから、大域特徴を算出する。ここで、大域特徴とは、時系列上、グループ抽出の対象外であり、かつ過去から蓄積されたデータを基に算出する特徴を指す（図７の「過去」に相当する期間、およびデータ）。グループ抽出は特定の期間内のデータ（例えば、最近１週間）を対象にする。 The data collection unit 201 collects necessary data from various social media through the API 100. The global feature calculation unit 202. Calculate global features from collected and accumulated data. Here, the global feature refers to a feature that is not subject to group extraction in time series and is calculated based on data accumulated from the past (period and data corresponding to “past” in FIG. 7). . Group extraction covers data within a specific time period (eg, last week).

重み算出部２０３は、大域特徴、および特定の期間内のデータから、ソーシャルグラフ上の各友人関係について、重みを算出する。グループ抽出部２０４は、ソーシャルグラフ、および友人関係の重みから、動的な友人グループを抽出する。グループキーワード抽出部２０５は、各グループについて、グループ内の友人の投稿内容から、グループを特徴付けるキーワードを抽出する。 The weight calculation unit 203 calculates a weight for each friend relationship on the social graph from the global feature and data within a specific period. The group extraction unit 204 extracts a dynamic friend group from the social graph and the weight of friendship. For each group, the group keyword extraction unit 205 extracts a keyword that characterizes the group from the content posted by friends in the group.

プロフィールＤＢ２１１は、ユーザの友人がソーシャルメディアに登録しているプロフィール（属性）情報を格納するデータベースである。投稿内容ＤＢ２１２は、ユーザの友人がソーシャルメディア上で投稿した内容を格納するデータベースである。コミュニケーション履歴ＤＢ２１３は、ユーザの友人がソーシャルメディア上で反応した履歴を格納するデータベースである。ここで反応とは、例えば、Ｆａｃｅｂｏｏｋであればコメント入力行動、「いいね！」ボタンの押下行動を指す。友人関係ＤＢ２１４は、ソーシャルグラフ（誰と誰が友人であるかを表すネットワーク）を格納するデータベースである。 The profile DB 211 is a database that stores profile (attribute) information registered by a user's friend in social media. The posted content DB 212 is a database that stores content posted by user's friends on social media. The communication history DB 213 is a database that stores a history of how a user's friend reacted on social media. Here, the reaction indicates, for example, a comment input action in the case of Facebook and a pressing action of a “Like” button. The friend relationship DB 214 is a database that stores a social graph (a network representing who and who are friends).

＜グループ抽出装置の処理＞
図２から図６を用いて、本実施形態に係るグループ抽出装置の処理について説明する。 <Process of group extraction device>
Processing of the group extraction device according to the present embodiment will be described with reference to FIGS.

図１は、グループ抽出装置の処理の大きな流れを示したものである。
この図によれば、まず、大域特徴算出部２０２は、ＤＢ群に蓄積された各種データから、大域特徴量を算出する（図２のステップＳ１００）。算出対象となるデータは、過去から蓄積された全データより、グループ抽出の対象となる特定の期間内のデータを除いたデータである（図７の「過去」に相当する期間、およびデータ）。ここで算出された特徴量と特定の期間内のデータの比較により、図２のステップＳ２００において友人関係の重みを決定する。 FIG. 1 shows a large flow of processing of the group extraction apparatus.
According to this figure, first, the global feature calculation unit 202 calculates a global feature amount from various data accumulated in the DB group (step S100 in FIG. 2). The data to be calculated is data obtained by excluding data within a specific period to be group extraction from all data accumulated from the past (period and data corresponding to “past” in FIG. 7). The weight of the friendship is determined in step S200 of FIG. 2 by comparing the feature amount calculated here with data within a specific period.

ここで、ステップＳ１００において、算出する大域特徴は、Ａ）各単語に関する大域特徴、Ｂ）各友人に関する大域特徴であり、Ａ）各単語に関する大域特徴では、単語ｉが出現する投稿数ｄｆ（ｉ)、つまり、過去のデータ上、ある単語が出現する投稿の総数を単語ごとに集計して求める。 Here, in step S100, the calculated global features are A) a global feature for each word, B) a global feature for each friend, and A) for the global feature for each word, the number of posts df (i ) That is, the total number of posts in which a certain word appears in the past data is calculated for each word.

一方、Ｂ）各友人に関する大域特徴では、ユーザのある友人Ａの投稿数Ｐ（Ａ）つまり、過去のデータ上、ある友人の投稿の総数を友人毎に集計して求める。また、ユーザとある友人Ａさんとの共通友人の投稿総数Ｇ（Ａ）、ユーザとある友人Ａさんとの共通友人の投稿に対する、Ａさんのコミュニケーションの総数Ｃ（Ａ）、他者（例えば、共通友人Ｂさん）の投稿に対するＡさんのコミュニケーションの回数Ｃ（Ａ、Ｂ）を求める。ここで、共通友人とは、例えば、ユーザの友人Ａさんに着目する場合、ユーザとＡさんとの間での共通友人を指す。このとき、Ａさんが共通友人の投稿に対して行ったコミュニケーションの総数を集計して求める。（ＡさんとＢさんとの間、ＡさんとＣさんとの間、…と集計する） On the other hand, in B) the global feature regarding each friend, the number of posts P (A) of a friend A of the user, that is, the total number of posts of a friend on the past data is calculated for each friend. In addition, the total number of posts G (A) of common friends with the user and a friend A, the total number C (A) of communication of A with respect to the posts of the common friends with the user and a friend A, and others (for example, The number of times C (A, B) of Mr. A's communication with respect to the post by the common friend B) is obtained. Here, the common friend refers to a common friend between the user and Mr. A, for example, when paying attention to the friend A of the user. At this time, the total number of communications performed by Mr. A with respect to the posting of the common friend is calculated and obtained. (Between Mr. A and Mr. B, Mr. A and Mr. C, and so on)

ステップＳ２００では、重み算出部２０３が、ステップＳ１００で算出した大域特徴と、特定の期間内のデータとの比較から友人関係の重みを決定する。 In step S200, the weight calculation unit 203 determines the weight of the friendship from the comparison between the global feature calculated in step S100 and the data within a specific period.

ここで、図３を用いて、ステップＳ２００での処理を詳細に説明する。
まず、ステップＳ２１０では、友人関係の抽出を行う。具体的には、ユーザの各友人間について、ソーシャルメディア上での明示的な友人関係を友人関係ＤＢから取得し、友人同士である友人のペアを生成する。例えば、ユーザの友人Ａ、Ｂ、Ｃ、Ｄさんがいたとした場合、友人同士である下記ペアをリストアップする。（以下、下記のリストをリスト１とする）
１）ＡさんとＢさん
２）ＡさんとＣさん
３）ＢさんとＣさん Here, the processing in step S200 will be described in detail with reference to FIG.
First, in step S210, friendship is extracted. Specifically, for each friend of the user, an explicit friend relationship on social media is acquired from the friend relationship DB, and a pair of friends who are friends is generated. For example, if there are user friends A, B, C, and D, the following pairs that are friends are listed. (Hereafter, the following list is called List 1)
1) A and B 2) A and C 3) B and C

ステップＳ２２０では、投稿内容の類似度の算出を行う。具体的な内容については、図４を参照して説明する。
まず、ステップＳ２２１では、対象友人リストの作成を行う。つまり、ステップＳ２２０で処理対象となる友人のリストを、ステップＳ２１０で取得した友人関係（友人同士であるペア）から生成する。例えば、リスト１からはＡ、Ｂ、Ｃさんを抽出し、リスト（以下、このリストをリスト２とする）を生成する。 In step S220, the similarity of the posted content is calculated. Specific contents will be described with reference to FIG.
First, in step S221, a target friend list is created. That is, a list of friends to be processed in step S220 is generated from the friend relationship (a pair that is a friend) acquired in step S210. For example, A, B, and C are extracted from list 1 and a list (hereinafter, this list is referred to as list 2) is generated.

ステップＳ２２２では、ステップＳ２２３からステップＳ２２６の処理を実施していない友人が存在する場合は、ステップＳ２２３へ進む。存在しない場合は、ステップＳ２２７へ進む。ステップＳ２２３では、未処理友人の投稿内容の抽出を行う。つまり、投稿内容ＤＢから、対象となる友人の特定の期間内の投稿内容（テキスト）を抽出する。 In step S222, if there is a friend who has not performed the processing from step S223 to step S226, the process proceeds to step S223. If not, the process proceeds to step S227. In step S223, the post content of the unprocessed friend is extracted. That is, the posting content (text) within a specific period of the target friend is extracted from the posting content DB.

ステップＳ２２４では、形態素解析を行う。つまり、投稿内容に形態素解析を適用し、投稿内容中に出現する単語を取得する。ステップＳ２２５では、単語の重要度の算出を行う。つまり、各単語について、大域特徴と比較し、友人Ａの投稿内における、ある単語ｉの重要度ｗ（ｉ、Ａ）を算出する。算出式は下記の通りとする。
ｗ（ｉ、Ａ）＝ｄｆ（ｉ、Ａ）×ｉｄｆ（ｉ)…（２）
ここで、ｄｆ（ｉ、Ａ）は、友人Ａの特定の期間内の投稿内容のうち、単語ｉを含む投稿数を表し、ｉｄｆ（ｉ)は前述のｄｆ（ｉ)を用いて下記の式で算出されるＩｎｖｅｒｓｅｄＤｏｃｕｍｅｎｔＦｒｅｑｕｅｎｃｙである。
ｉｄｆ（ｉ)＝ｌｏｇ{ ｜Ｄ｜／ｄｆ（ｉ) }…（３）
ここで、｜Ｄ｜は、過去のデータにおける、ユーザの友人の投稿総数である。また、ｄｆ（ｉ)＝０となる可能性がある場合には、ｄｆ（ｉ)＝１と読み替え、あるいは、ｄｆ（ｉ)をｄｆ（ｉ)＋１と置き換える。 In step S224, morphological analysis is performed. That is, morphological analysis is applied to the posted content, and words appearing in the posted content are acquired. In step S225, the importance of the word is calculated. That is, for each word, the importance w (i, A) of a certain word i in the posting of the friend A is calculated by comparing with the global feature. The calculation formula is as follows.
w (i, A) = df (i, A) × idf (i) (2)
Here, df (i, A) represents the number of posts including the word i among the posted contents of the friend A within a specific period, and idf (i) is expressed by the following formula using the above-described df (i). Inverse Document Frequency calculated in (1).
idf (i) = log {| D | / df (i)} (3)
Here, | D | is the total number of posts of the user's friend in the past data. When there is a possibility that df (i) = 0, it is read as df (i) = 1, or df (i) is replaced with df (i) +1.

ステップＳ２２６では、特徴ベクトルを生成する。具体的には、友人Ａの特定の期間内の投稿内容に出現する単語ｉ、および単語の重要度ｗ（ｉ、Ａ）から、友人毎に特徴ベクトルを生成する。ステップＳ２２７では、ステップＳ２２８からステップＳ２２９の処理を実施していない友人関係（ペア）が存在する場合は、ステップＳ２２８へ進む。存在しない場合は、本処理フローを終了する。 In step S226, a feature vector is generated. Specifically, a feature vector is generated for each friend from the word i appearing in the posted content of the friend A in a specific period and the importance w (i, A) of the word. In step S227, if there is a friendship (pair) for which the processing from step S228 to step S229 is not performed, the process proceeds to step S228. If it does not exist, this processing flow ends.

ステップＳ２２８では、未処理２友人の特徴ベクトルを取得する。つまり、未処理の友人関係（ペア）について、対象となる友人それぞれの特徴ベクトル（ステップＳ２２３からステップＳ２２６で生成済）を取得する。 In step S228, feature vectors of two unprocessed friends are acquired. That is, for an unprocessed friend relationship (pair), a feature vector (generated from step S223 to step S226) of each target friend is acquired.

ステップＳ２２９では、特徴ベクトル間の類似度を算出する。具体的には、特徴ベクトル間の類似度を算出し、投稿内容の類似度Ｓ（Ａ、Ｂ）とする。例えば、ベクトル間の距離に相当するコサイン類似度を算出する。この類似度は、２友人間の投稿内容についての類似度に相当する。 In step S229, the similarity between feature vectors is calculated. Specifically, the similarity between the feature vectors is calculated and set as the similarity S (A, B) of the posted content. For example, the cosine similarity corresponding to the distance between vectors is calculated. This degree of similarity corresponds to the degree of similarity regarding the posted content between two friends.

ステップＳ２３０では、コミュニケーション頻度の算出を行う。この詳細について、図５を参照して説明する。
まず、ステップＳ２３１では、ステップＳ２３２からステップＳ２３３の処理を実施していない友人関係（ペア）が存在する場合は、ステップＳ２３２へ進む。存在しない場合は本処理フローを終了する。 In step S230, the communication frequency is calculated. The details will be described with reference to FIG.
First, in step S231, if there is a friendship (pair) for which the processing from step S232 to step S233 is not performed, the process proceeds to step S232. If it does not exist, this processing flow ends.

ステップＳ２３２では、未処理２友人のコミュニケーション履歴・大域特徴を取得する。具体的には、未処理２友人を仮にＡさん、Ｂさんとする。過去のデータから、前述の大域特徴Ｐ（Ａ）、Ｐ（Ｂ）、Ｃ（Ａ）、Ｃ（Ｂ）、Ｃ（Ａ、Ｂ）、Ｃ（Ｂ、Ａ）、Ｇ（Ａ）、Ｇ（Ｂ）を取得する。次に、過去のデータにおけるＡさんからＢさんへのコミュニケーションの希少性を表す数値Ｒ（Ａ、Ｂ）を次式により算出する。
Ｒ（Ａ、Ｂ）＝ (Ｃ（Ａ）／Ｇ（Ａ）) × (Ｃ（Ａ、Ｂ）／Ｐ（Ｂ）) …（４） In step S232, the communication history / global feature of two unprocessed friends is acquired. Specifically, suppose that two unprocessed friends are Mr. A and Mr. B. From the past data, the global features P (A), P (B), C (A), C (B), C (A, B), C (B, A), G (A), G ( B) is acquired. Next, a numerical value R (A, B) representing the rarity of communication from Mr. A to Mr. B in the past data is calculated by the following equation.
R (A, B) = (C (A) / G (A)) × (C (A, B) / P (B)) (4)

ステップＳ２３３では、２友人間のコミュニケーション頻度を算出する。ここで、２友人間のコミュニケーション頻度Ｆ（Ａ、Ｂ）を次式により算出する。
Ｆ（Ａ、Ｂ）＝
( ｉＲ（Ａ、Ｂ）× Ｒ‘（Ａ、Ｂ）＋ｉＲ（Ｂ、Ａ）×Ｒ’（Ｂ、Ａ）)／２…（５）
ここで、ｉＲ（Ａ、Ｂ）は式（４）の逆数を表し、Ｒ‘（Ａ、Ｂ）は特定の期間内の友人の投稿のみを用いてＲ（Ａ、Ｂ）を算出した値を表す。なお、式（４）と式（５）との適用に際し、分母が０となる可能性がある場合には、１と読み替え、あるいはデフォルトで１を加算して適用する。 In step S233, the communication frequency between two friends is calculated. Here, the communication frequency F (A, B) between two friends is calculated by the following equation.
F (A, B) =
(iR (A, B) × R ′ (A, B) + iR (B, A) × R ′ (B, A)) / 2 (5)
Here, iR (A, B) represents the reciprocal of Equation (4), and R ′ (A, B) is a value obtained by calculating R (A, B) using only a friend's post within a specific period. Represent. In addition, when the expressions (4) and (5) are applied, if there is a possibility that the denominator becomes 0, the denominator is replaced with 1, or 1 is added by default.

ステップＳ２４１では、属性の類似度の算出を行う。具体的な内容について、図６を参照して説明する。 In step S241, the attribute similarity is calculated. Specific contents will be described with reference to FIG.

ステップＳ２４１では、ステップＳ２４２からステップＳ２４４の処理を実施していない友人関係（ペア）が存在する場合は、ステップＳ２４２へ進む。一方で、存在しない場合は、本処理フローを終了する。 In step S241, if there is a friendship (pair) for which the processing from step S242 to step S244 is not performed, the process proceeds to step S242. On the other hand, if it does not exist, this processing flow ends.

ステップＳ２４２では、未処理２友人の属性情報を取得する。つまり、未処理２友人を仮にＡさん、Ｂさんとすると、両者のプロフィールおよび属性情報をプロフィールＤＢから取得する。 In step S242, attribute information of two unprocessed friends is acquired. That is, if two unprocessed friends are assumed to be Mr. A and Mr. B, their profile and attribute information are acquired from the profile DB.

ステップＳ２４３では、各友人の属性ベクトルを生成する。具体的には、取得可能なプロフィールおよび属性項目に応じて、Ａさん、Ｂさんのプロフィール属性をベクトル化する。例えば、性別、出身地、勤務先が取得可能な場合には、属性ベクトルは、「男性、 ○○県、××会社」とする。 In step S243, an attribute vector for each friend is generated. Specifically, the profile attributes of Mr. A and Mr. B are vectorized according to the obtainable profile and attribute items. For example, if gender, birthplace, and office can be acquired, the attribute vector is “male, XX prefecture, xx company”.

ステップＳ２４４では、属性ベクトル間の類似度を算出する。例えば、ＡさんとＢさんの属性ベクトル間で類似度を算出し、プロフィール属性の類似度Ｐ（Ａ、Ｂ）とする。このとき、シンプソン係数を算出して、プロフィール属性の類似度Ｐ（Ａ、Ｂ）を求めてもよい。 In step S244, the similarity between attribute vectors is calculated. For example, the similarity is calculated between the attribute vectors of Mr. A and Mr. B, and the profile attribute similarity P (A, B) is obtained. At this time, the Simpson coefficient may be calculated to obtain the profile attribute similarity P (A, B).

ステップＳ２５０では、友人関係の重みを出力する。つまり、ステップＳ２２０からステップＳ２４０において算出したＳ（Ａ、Ｂ）、Ｆ（Ａ、Ｂ）、Ｐ（Ａ、Ｂ）を式（１）に代入し、ＡさんとＢさんとの間の友人関係の重みＷ（Ａ、Ｂ）とする。 In step S250, the friendship weight is output. That is, S (A, B), F (A, B), and P (A, B) calculated in steps S220 to S240 are substituted into equation (1), and the friendship between A and B The weight W (A, B) of

ステップＳ３００では、グループの抽出を行う。グループ抽出部２０４は、ソーシャルグラフ上の友人関係をステップＳ２００で出力された値で重み付けし、重み付きグラフから動的にグループを抽出する。ここでは、グループ内のエッジ数、および重みの総計が高くなるようなグループを抽出することが、特定の期間内の興味、関心が類似するグループを抽出することに相当する。例えば、文献「Ａ．Ｃｌａｕｓｅｔ，Ｍ．Ｅ．Ｊ．ＮｅｗｍａｎａｎｄＣ．Ｍｏｏｒｅ，“Ｆｉｎｄｉｎｇｃｏｍｍｕｎｉｔｙｓｔｒｕｃｔｕｒｅｉｎｖｅｒｙｌａｒｇｅｎｅｔｗｏｒｋｓ，” Ｐｈｙｓ．Ｒｅｖ．Ｅ７０，０６６１１１（２００４）．」に示されている方法を適用することで、上記のようなグループを抽出可能である。 In step S300, a group is extracted. The group extraction unit 204 weights the friendship on the social graph with the value output in step S200, and dynamically extracts a group from the weighted graph. Here, extracting a group having a high total number of edges and weights in the group corresponds to extracting a group having similar interests and interests within a specific period. For example, the method described in the document “A. Clauset, M. E. J. Newman and C. Moore,“ Finding community structure in large large networks, ”Phys. Rev. E70, 066111 (2004).” Is applied. By doing so, the above groups can be extracted.

ステップＳ４００では、グループキーワードを抽出する。グループキーワード抽出部２０５は、ステップＳ３００で抽出された各グループについて、グループ内の友人の特定の期間内の投稿内容から、グループを特徴付けるキーワードを抽出する。例えば、算出対象となるグループにＡさんが所属する場合、Ａさんの特定の期間内の投稿内容に含まれる各単語について、単語の重要度ｗ（ｉ、Ａ）を総計し、その値が高い単語をグループキーワードとして抽出する。 In step S400, group keywords are extracted. The group keyword extraction unit 205 extracts, for each group extracted in step S300, a keyword that characterizes the group from the contents posted within a specific period of friends in the group. For example, when Mr. A belongs to a group to be calculated, the word importance level w (i, A) is totaled for each word included in the posted content of Mr. A in a specific period, and the value is high. Extract words as group keywords.

つまり、図８に示すように、ソーシャルグラフの構造から所属や興味が似ているアカウントをグループ化し、グループに所属する友人の投稿内容からグループを特徴づけるキーワードを抽出する。したがって、友人リストを自動生成することにより、興味や関心が類似するアカウントごとに投稿内容を確認することができる。 That is, as shown in FIG. 8, accounts having similar affiliations and interests are grouped from the structure of the social graph, and keywords characterizing the group are extracted from the posted contents of friends belonging to the group. Therefore, by automatically generating a friend list, the posted content can be confirmed for each account with similar interests.

以上、説明したように、本実施形態によれば、グループ抽出結果を提供することで、ソーシャルメディア上でつながる友人について、特定の期間内の興味・関心、およびその興味・関心を有する友人を概観することができる。また、グループ毎に、特定の期間内の友人の投稿内容からグループを特徴付けるキーワードを抽出することから、特定の期間内の興味や関心に基づく、正確なキーワードを抽出することができる。 As described above, according to the present embodiment, by providing a group extraction result, an overview of the interests / interests within a specific period and the friends who have the interests / interests for friends connected on social media. can do. Moreover, since the keyword which characterizes a group is extracted from the contribution content of the friend in a specific period for every group, the exact keyword based on the interest and interest in a specific period can be extracted.

＜第２の実施形態＞
図９から図１１を用いて、本発明の実施形態について説明する。 <Second Embodiment>
The embodiment of the present invention will be described with reference to FIGS.

＜グループ抽出装置の構成＞
本実施形態に係るグループ抽出装置は、図９に示すように、ソーシャルメディアと、ＡＰＩ１００と、データ収集部２０１と、大域特徴算出部２０２と、重み算出部２０３と、グループ抽出部２０４と、グループキーワード抽出部２０５と、グループフィルタ部２０６と、プロフィールＤＢ２１１と、投稿内容ＤＢ２１２と、コミュニケーション履歴ＤＢ２１３と、友人関係ＤＢ２１４と、表示部３００とから構成されている。なお、第１の実施形態と同一の符号を付す構成要素については、同様の機能を有することから、その詳細な説明は省略する。 <Configuration of group extraction device>
As shown in FIG. 9, the group extraction apparatus according to the present embodiment includes social media, an API 100, a data collection unit 201, a global feature calculation unit 202, a weight calculation unit 203, a group extraction unit 204, and a group. The keyword extraction unit 205, the group filter unit 206, the profile DB 211, the posted content DB 212, the communication history DB 213, the friend relationship DB 214, and the display unit 300 are configured. In addition, about the component which attaches | subjects the same code | symbol as 1st Embodiment, since it has the same function, the detailed description is abbreviate | omitted.

グループフィルタ部２０６は、ユーザの指定したグループに所属する友人の投稿内容のみを取得し、提示する。なお、本実施形態は、第１の実施形態で取得したグループを基に、ユーザが閲覧可能なソーシャルストリーム（友人の更新情報の流れ）をフィルタリングする機能を提供するものである。なお、グループの指定からフィルタリング適用結果までの具体例を画面イメージとして図１１に示す。 The group filter unit 206 acquires and presents only the posting contents of friends belonging to the group designated by the user. In addition, this embodiment provides the function which filters the social stream (flow of a friend's update information) which a user can browse based on the group acquired in 1st Embodiment. A specific example from the group designation to the filtering application result is shown in FIG. 11 as a screen image.

＜グループ抽出装置の処理＞
図１０を用いて、本実施形態に係るグループ抽出装置の処理について説明する。 <Process of group extraction device>
The process of the group extraction device according to this embodiment will be described with reference to FIG.

まず、ステップＳ１００からステップＳ４００の処理内容については、第１の実施形態と同様である。 First, the processing content from step S100 to step S400 is the same as in the first embodiment.

ステップＳ５００では、グループ指定の受付を行う。つまり、ユーザのフィルタリング要求に応じて、ステップＳ４００で抽出したグループ毎に、グループキーワード一覧を提示し、ユーザがキーワードを選択することにより、グループの指定が可能となる。ここで、グループキーワードが複数個存在する場合には、所定の個数（例：１個）を提示することとし、選択基準はグループキーワード抽出時に採用した重要度を用いる。 In step S500, group designation is accepted. In other words, in response to a user's filtering request, a group keyword list is presented for each group extracted in step S400, and the user can select a keyword to select a group. Here, when there are a plurality of group keywords, a predetermined number (for example, one) is presented, and the importance used at the time of group keyword extraction is used as the selection criterion.

ステップＳ６００では、グループフィルタの適用を行う。具体的には、ユーザの指定したグループに所属する友人の投稿内容のみを取得し、提示する。 In step S600, a group filter is applied. Specifically, only the posted contents of friends belonging to the group designated by the user are acquired and presented.

以上、説明したように、本実施形態によれば、第１の実施形態で取得したグループを基に、ユーザが閲覧可能なソーシャルストリーム（友人の更新情報の流れ）をフィルタリングする機能を提供することにより、抽出および提示されたグループキーワードを選択することによって、ユーザは自身の興味・関心に合致するグループを選択し、指定されたグループに所属する友人の投稿内容のみを閲覧することが可能となる。 As described above, according to the present embodiment, the function of filtering the social stream (flow of friend update information) that can be browsed by the user is provided based on the group acquired in the first embodiment. By selecting a group keyword extracted and presented, the user can select a group that matches his / her interest and view only the posted content of friends belonging to the specified group. .

＜第３の実施形態＞
図１２および図１３を用いて、本発明の実施形態について説明する。 <Third Embodiment>
The embodiment of the present invention will be described with reference to FIGS.

＜グループ抽出装置の構成＞
本実施形態に係るグループ抽出装置は、図１２に示すように、ソーシャルメディアと、ＡＰＩ１００と、データ収集部２０１と、大域特徴算出部２０２と、重み算出部２０３と、グループ抽出部２０４と、グループキーワード抽出部２０５と、キーワードマッチング部２０７と、プロフィールＤＢ２１１と、投稿内容ＤＢ２１２と、コミュニケーション履歴ＤＢ２１３と、友人関係ＤＢ２１４と、表示部３００とから構成されている。なお、第１の実施形態と同一の符号を付す構成要素については、同様の機能を有することから、その詳細な説明は省略する。 <Configuration of group extraction device>
As shown in FIG. 12, the group extraction apparatus according to this embodiment includes social media, an API 100, a data collection unit 201, a global feature calculation unit 202, a weight calculation unit 203, a group extraction unit 204, and a group. The keyword extraction unit 205, the keyword matching unit 207, the profile DB 211, the posted content DB 212, the communication history DB 213, the friend relationship DB 214, and the display unit 300 are configured. In addition, about the component which attaches | subjects the same code | symbol as 1st Embodiment, since it has the same function, the detailed description is abbreviate | omitted.

キーワードマッチング部２０７は、ユーザのプロフィール情報、またはユーザの投稿内容から、ユーザの興味・関心を表すキーワードを取得する。なお、本実施形態は、第１の実施形態で取得したグループ、およびグループキーワードから、友人に対する特定の期間内の興味・関心についてのラベリングを行い、ユーザの興味・関心に合致しそうな友人とその理由（グループキーワード）を提示する機能を提供する。 The keyword matching unit 207 acquires a keyword representing the user's interest / interest from the user's profile information or the posted content of the user. In this embodiment, from the group and the group keyword acquired in the first embodiment, the interest is labeled with respect to the friend within a specific period, and the friend who is likely to match the user's interest Provides a function to present the reason (group keyword).

＜グループ抽出装置の処理＞
図１３を用いて、本実施形態に係るグループ抽出装置の処理について説明する。 <Process of group extraction device>
The process of the group extraction apparatus according to the present embodiment will be described with reference to FIG.

ステップＳ７００では、ユーザキーワードの取得を行う。具体的には、ユーザのプロフィール情報、またはユーザの投稿内容から、ユーザの興味・関心を表すキーワードを取得する。なお、「趣味」や「興味」、「関心」といったカテゴリに対し、ユーザ自身が特定のキーワード（例えば、アーティスト名といった固有名詞）を登録済みの場合は、該当するキーワードを列挙する。また、ユーザの投稿内容を取得可能な場合は、友人の投稿内容に対して、単語の重要度を算出した際の処理（図４のステップＳ２２４からステップＳ２２５）と同様の処理により、投稿内容に含まれる各単語に重要度を割り振り、重要度の高い単語をキーワードとする。 In step S700, a user keyword is acquired. Specifically, a keyword representing the user's interest / interest is acquired from the user's profile information or the user's posted content. If the user has already registered specific keywords (for example, proper nouns such as artist names) for categories such as “hobby”, “interest”, and “interest”, the corresponding keywords are listed. In addition, when the user's posted content can be acquired, the posted content is converted into the posted content by processing similar to the processing (step S224 to step S225 in FIG. 4) when the importance of the word is calculated with respect to the posted content of the friend. Assign importance to each word included, and use words with high importance as keywords.

ステップＳ８００では、キーワードマッチングを行う。具体的には、グループに所属する友人それぞれに対して、友人の興味・関心を表すキーワードを割り当てる。例えば、グループキーワードとして抽出された単語を特定の期間内の投稿内容に含む場合は、その友人に当該単語をキーワードとして割り当てる。次に、ユーザと各友人との間で、キーワードのマッチングを行い、合致した友人、およびキーワードを列挙する。 In step S800, keyword matching is performed. Specifically, a keyword representing the interest of the friend is assigned to each friend who belongs to the group. For example, when a post extracted within a specific period includes a word extracted as a group keyword, the word is assigned to the friend as a keyword. Next, keyword matching is performed between the user and each friend, and the matched friend and keyword are listed.

ステップＳ９００では、友人およびキーワードの対を提示する。つまり、ステップＳ８００において、合致した友人、およびキーワードのリストを提示する。 In step S900, a friend and keyword pair is presented. That is, in step S800, a list of matched friends and keywords is presented.

以上、説明したように、本実施形態によれば、第１の実施形態で取得したグループ、およびグループキーワードから、友人に対する特定の期間内の興味・関心についてのラベリングを行い、ユーザの興味・関心に合致しそうな友人とその理由（グループキーワード）を提示する機能を提供することにより、友人の興味や関心が変化する中で、ユーザ自身とコミュニケーションの活性化が期待できる友人を容易に知ることが可能となる。 As described above, according to the present embodiment, the interest and interest of a user within a specific period are labeled from the group and the group keyword acquired in the first embodiment, and the user's interest and interest are expressed. By providing a function that presents friends who are likely to match and the reason (group keyword), it is possible to easily know friends that can be expected to activate communication with the users themselves while their interests and interests change. It becomes possible.

＜第４の実施形態＞
図１４および図１５を用いて、本発明の実施形態について説明する。 <Fourth Embodiment>
The embodiment of the present invention will be described with reference to FIGS. 14 and 15.

本実施形態は、グループ抽出の対象となる期間をずらしつつ（例えば、１週間前、２週間前、…）、第１の実施形態に示したグループ抽出手段、およびグループキーワード抽出手段を蓄積済みデータに適用し、各期間におけるグループ抽出結果を取得する。更に、各グループのバースト性（盛り上がり度合い）を評価することで、友人間で発生・共有された特殊なイベント（例えば、サッカーＷ杯や友人の記念日）に係るグループを抽出、列挙する。つまり、第１の実施形態から第３の実施形態では、特定の期間から、共通の話題やコミュニケーション頻度の向上にフォーカスしてグループを抽出するが、本実施形態では更に個人、あるいは世間一般の間で特に盛り上がっている話題に関連するグループが存在する場合に、特殊なイベント関連グループとして抽出することを特徴とする。 In the present embodiment, the group extraction means and the group keyword extraction means shown in the first embodiment are stored in the accumulated data while shifting the period for group extraction (for example, one week ago, two weeks ago,...). To obtain the group extraction result in each period. Further, by evaluating the burst property (exciting degree) of each group, a group related to a special event (for example, a soccer World Cup or a friend's anniversary) generated and shared between friends is extracted and listed. In other words, in the first to third embodiments, groups are extracted from a specific period with a focus on improving common topics and communication frequency. In this embodiment, individuals or the general public are further extracted. In the case where there is a group related to a topic that is particularly exciting in the above, it is extracted as a special event-related group.

＜グループ抽出装置の構成＞
本実施形態に係るグループ抽出装置は、図１４に示すように、ソーシャルメディアと、ＡＰＩ１００と、データ収集部２０１と、大域特徴算出部２０２と、重み算出部２０３と、グループ抽出部２０４と、グループキーワード抽出部２０５と、グループ評価部２０８と、プロフィールＤＢ２１１と、投稿内容ＤＢ２１２と、コミュニケーション履歴ＤＢ２１３と、友人関係ＤＢ２１４と、表示部３００とから構成されている。なお、第１の実施形態と同一の符号を付す構成要素については、同様の機能を有することから、その詳細な説明は省略する。 <Configuration of group extraction device>
As shown in FIG. 14, the group extraction apparatus according to the present embodiment includes social media, an API 100, a data collection unit 201, a global feature calculation unit 202, a weight calculation unit 203, a group extraction unit 204, and a group. The keyword extraction unit 205, the group evaluation unit 208, the profile DB 211, the posted content DB 212, the communication history DB 213, the friend relationship DB 214, and the display unit 300 are configured. In addition, about the component which attaches | subjects the same code | symbol as 1st Embodiment, since it has the same function, the detailed description is abbreviate | omitted.

グループ評価部２０８は、抽出された各グループのデータからバースト性（盛り上がり）を判定し、特に盛り上がっているグループをイベント連動グループと判定する。 The group evaluation unit 208 determines burstiness (swell) from the extracted data of each group, and particularly determines a group that is rising as an event-linked group.

＜グループ抽出装置の処理＞
図１５を用いて、本実施形態に係るグループ抽出装置の処理について説明する。 <Process of group extraction device>
The process of the group extraction device according to the present embodiment will be described with reference to FIG.

ステップＳ１１００では、グループ抽出処理が未実施の期間が存在する場合には、ステップＳ１２００へ進み、存在しない場合には本処理フローを終了する。 In step S1100, if there is a period in which the group extraction process has not been performed, the process proceeds to step S1200, and if not, the process flow ends.

ステップＳ１２００では、データの分割を行う。具体的には、グループ抽出対象となる期間に係るデータと、大域特徴の算出対象となる期間に係るデータとを分割する。分割単位は任意のもの（例えば、１週間）とし、サービスやアプリケーション提供者、またはユーザが指定可能としても良い。 In step S1200, data is divided. Specifically, the data related to the period to be the group extraction target and the data related to the period to be the global feature calculation target are divided. The division unit may be arbitrary (for example, one week), and may be specified by the service or application provider or the user.

ステップＳ１３００では、キーワードグループの評価を行う。具体的には、抽出された各グループのデータからバースト性（盛り上がり）を判定し、特に盛り上がっているグループをイベント連動グループと判定する。ここで、盛り上がりの判定には閾値処理を適用し、例えば「全グループから算出する盛り上がり指標の平均値に比して、あるグループの盛り上がり指標の値がＸ倍以上の場合にイベント関連グループと判定」、といった処理を行う。なお、盛り上がり指標としては、例えば、「グループ内の投稿総数」、「グループ内のコミュニケーション回数の総数」のうちいずれか１種類、あるいは複数を組み合わせて判定する。 In step S1300, the keyword group is evaluated. Specifically, burstiness (swelling) is determined from the extracted data of each group, and a particularly exciting group is determined as an event-linked group. Here, threshold processing is applied to the determination of the climax, for example, “if the climax index value of a certain group is X times or more compared to the average value of the climax index calculated from all groups, it is determined as an event-related group ”Is performed. Note that, as the climax index, for example, any one of “total number of posts in the group” and “total number of communication times in the group” or a combination of a plurality of the determinations is determined.

以上、説明したように、本実施形態によれば、グループ抽出の対象となる期間をずらしつつ（例えば、１週間前、２週間前、…）、第１の実施形態に示したグループ抽出手段、およびグループキーワード抽出手段を蓄積済みデータに適用し、各期間におけるグループ抽出結果を取得し、更に、各グループのバースト性（盛り上がり度合い）を評価することで、友人間で発生・共有された特殊なイベント（例えば、サッカーＷ杯や友人の記念日）に係るグループを抽出、列挙することから、個人（友人）、あるいは世間一般に発生している特殊なイベントについて、これに関連するグループを概観可能となる。 As described above, according to the present embodiment, the group extraction means shown in the first embodiment, while shifting the period for group extraction (for example, one week ago, two weeks ago,...) And applying the group keyword extraction means to the accumulated data, obtaining group extraction results for each period, and evaluating the burstiness (excitement level) of each group, so that it is generated and shared between friends By extracting and enumerating groups related to events (for example, soccer World Cup or friends' anniversary), it is possible to overview the groups related to individual (friends) or special events that occur in general. Become.

なお、グループ抽出装置の処理をコンピュータシステムが読み取り可能な記録媒体に記録し、この記録媒体に記録されたプログラムをグループ抽出装置に読み込ませ、実行することによって本発明のグループ抽出装置を実現することができる。ここでいうコンピュータシステムとは、ＯＳや周辺装置等のハードウェアを含む。 Note that the group extraction apparatus of the present invention is realized by recording the processing of the group extraction apparatus on a recording medium readable by the computer system, causing the group extraction apparatus to read and execute the program recorded on the recording medium. Can do. The computer system here includes an OS and hardware such as peripheral devices.

また、「コンピュータシステム」は、ＷＷＷ（ＷｏｒｌｄＷｉｄｅＷｅｂ）システムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。また、上記プログラムは、このプログラムを記憶装置等に格納したコンピュータシステムから、伝送媒体を介して、あるいは、伝送媒体中の伝送波により他のコンピュータシステムに伝送されてもよい。ここで、プログラムを伝送する「伝送媒体」は、インターネット等のネットワーク（通信網）や電話回線等の通信回線（通信線）のように情報を伝送する機能を有する媒体のことをいう。 Further, the “computer system” includes a homepage providing environment (or display environment) if a WWW (World Wide Web) system is used. The program may be transmitted from a computer system storing the program in a storage device or the like to another computer system via a transmission medium or by a transmission wave in the transmission medium. Here, the “transmission medium” for transmitting the program refers to a medium having a function of transmitting information, such as a network (communication network) such as the Internet or a communication line (communication line) such as a telephone line.

また、上記プログラムは、前述した機能の一部を実現するためのものであってもよい。さらに、前述した機能をコンピュータシステムにすでに記録されているプログラムとの組合せで実現できるもの、いわゆる差分ファイル（差分プログラム）であってもよい。 The program may be for realizing a part of the functions described above. Furthermore, what can implement | achieve the function mentioned above in combination with the program already recorded on the computer system, what is called a difference file (difference program) may be sufficient.

以上、この発明の実施形態につき、図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The embodiments of the present invention have been described in detail with reference to the drawings. However, the specific configuration is not limited to the embodiments, and includes designs and the like that do not depart from the gist of the present invention.

１００；ＡＰＩ
２０１；データ収集部
２０２；大域特徴算出部
２０３；重み算出部
２０４；グループ抽出部
２０５；グループキーワード抽出部
２０６；グループフィルタ部
２０７；キーワードマッチング部
２０８：グループ評価部
２１１；プロフィールＤＢ
２１２；投稿内容ＤＢ
２１３；コミュニケーション履歴ＤＢ
２１４；友人関係ＤＢ
３００；表示部
100; API
201; data collection unit 202; global feature calculation unit 203; weight calculation unit 204; group extraction unit 205; group keyword extraction unit 206; group filter unit 207; keyword matching unit 208: group evaluation unit 211;
212; Post content DB
213: Communication history DB
214; Friendship DB
300: Display section

Claims

A group extraction device that extracts a group of similar interests or interests in a specific period from a friend collection on social media and a friend's action history,
A global feature calculation unit for calculating a steady feature of a friend set from the action history;
A weight that calculates a feature that does not appear regularly but only appears in a specific period based on the steady feature value and an action history in a specific period, and weights the relationship between friends in the friend set A calculation unit;
A group extractor for extracting a group of friends having similar interests or interests within a specific period from the weighted friendships;
A group extracting apparatus comprising:

The group extraction device according to claim 1, further comprising a group keyword extraction unit that extracts a keyword that characterizes a group from the content posted by a friend within a specific period for each group.

Group filtering that presents the group keyword to the user, accepts designation of the group that matches the user's interest or interest from the user, and obtains and outputs only the posting contents within a specific period of friends belonging to the designated group The group extraction apparatus according to claim 2, further comprising a unit.

Among the group keywords, a keyword matching unit is provided for each friend to find a group keyword that matches a word representing an interest or interest registered as profile (attribute) information by the user or a word extracted from the posted content of the user. The group extraction apparatus according to claim 2, wherein:

The action history is a posted content of a friend on social media or a communication history,
The global feature calculation unit extracts a group of words that regularly appear in the posted content and its appearance frequency, and a combination and frequency of friends that regularly communicate.
The weight calculation unit sets a high weight between friends including both words whose appearance frequency is improved within a specific period as compared to the regularly appearing word group in the posted content, and further steady communication. The group extraction device according to any one of claims 1 to 4, wherein a weight between friends whose communication frequency is improved within a specific period is set higher than a frequency.

5. The group extraction apparatus according to claim 1, wherein the weight calculation unit further sets a high weight between friends having high similarity of profile attributes.

A group extraction method in a group extraction device for extracting a group of similar interests or interests within a specific period from a friend set on social media and a friend's action history,
A first step of calculating a stationary feature of a friend set from the behavior history;
Based on the steady feature value and the action history within a specific period, a feature that does not appear regularly but appears within a specific period is calculated, and the relationship between friends in the friend set is weighted A second step;
A third step of extracting from the weighted friendships a group of friends with similar interests or interests within a specific time period;
A group extraction method comprising:

The group extraction method according to claim 7, further comprising a fourth step of extracting a keyword characterizing the group from the content posted by a friend within a specific period for each group.

In the fourth step, the group keyword is presented to the user, designation of the group that matches the user's interest or interest is accepted from the user, and only the content posted within a specific period of friends belonging to the designated group is received. The group extraction method according to claim 8, wherein the group extraction method is obtained and output.

In the fourth step, among the group keywords, a group keyword that matches an interest or interest expressing word registered as profile (attribute) information by the user or a word extracted from the posted content of the user is obtained for each friend. The group extraction method according to claim 8, wherein:

A program for causing a computer to execute a group extraction method in a group extraction apparatus that extracts a group of similar interests or interests within a specific period from a friend set on social media and a friend's action history,
A first step of calculating a stationary feature of a friend set from the behavior history;
Based on the steady feature value and the action history within a specific period, a feature that does not appear regularly but appears within a specific period is calculated, and the relationship between friends in the friend set is weighted A second step;
A third step of extracting from the weighted friendships a group of friends with similar interests or interests within a specific time period;
A program that causes a computer to execute.

The program according to claim 11, further comprising a fourth step of extracting a keyword characterizing the group from the content posted by a friend within a specific period for each group.

In the fourth step, the group keyword is presented to the user, designation of the group that matches the user's interest or interest is accepted from the user, and only the content posted within a specific period of friends belonging to the designated group is received. The program according to claim 12, wherein the program is acquired and output.

In the fourth step, among the group keywords, a group keyword that matches an interest or interest expressing word registered as profile (attribute) information by the user or a word extracted from the posted content of the user is obtained for each friend. The program according to claim 12, wherein: