JP5159451B2

JP5159451B2 - Information processing apparatus, analysis system, network behavior analysis method and program for analyzing network behavior

Info

Publication number: JP5159451B2
Application number: JP2008155237A
Authority: JP
Inventors: ルディ・レイモンド・ハリー・プテラ; 弘揮 ▲柳▼澤; 一星吉田; 明子鈴木
Original assignee: International Business Machines Corp
Current assignee: International Business Machines Corp
Priority date: 2008-06-13
Filing date: 2008-06-13
Publication date: 2013-03-06
Anticipated expiration: 2028-06-13
Also published as: JP2009301334A

Description

本発明は、ネットワークにアクセスするユーザの、特定のテーマに関連する行動を分析する技術に関し、より詳細には、キーワード等の特徴パラメータによって特徴付けられるコンテンツへのアクセスから、ユーザ集団のネットワーク行動を分析する、情報処理装置、分析システム、ネットワーク行動の分析方法およびプログラムに関する。 The present invention relates to a technique for analyzing behavior related to a specific theme of a user who accesses a network, and more particularly, from the access to contents characterized by characteristic parameters such as keywords, the network behavior of a user group. The present invention relates to an information processing apparatus, an analysis system, a network behavior analysis method, and a program.

ネットワーク通信網の普及に伴い、ブログ、ＳＮＳ(Social
Network Service)などを介してユーザ間での交流が一般化している。ネットワーク上での情報交換は、どのユーザ間でも同じ程度の情報交換が行われている訳ではなく、実世界での情報交換と同様に、頻繁に情報交換が行われる内容やユーザ集団が存在する。 Blogs, SNS (Social
Networking between users via Network Service) has become common. Information exchange on the network does not mean the same level of information exchange among all users, but there are contents and user groups that are frequently exchanged as in the real world information exchange. .

頻繁に情報交換が行われている内容を有するコンテンツに関連付けて当該情報交換を行っているユーザ集団を分析することにより、特定のユーザ集団における情報の拡散度合いを分析することが可能となる。また、特定の内容を有するコンテンツに関連するユーザ集団の規模を時系列的に分析し、コンテンツに関連して特徴付けられるユーザ集団の拡散／縮小の傾向を分析することにより、マーケッティング情報およびユーザ集団情報などを取得することができ、ネットワーク行動を効率的にビジネス・ソリューションに反映させることができる。 It is possible to analyze the degree of information diffusion in a specific user group by analyzing the user group performing the information exchange in association with the content having the information exchanged frequently. Further, by analyzing the size of the user group related to the content having a specific content in time series and analyzing the tendency of the user group to be spread / reduced characterized in relation to the content, the marketing information and the user group are analyzed. Information can be acquired and network behavior can be efficiently reflected in business solutions.

これまで、ネットワークを介して伝送される情報を使用して、伝送されるメッセージの中から重要なテーマを抽出する技術が知られている。例えば、特開２００７−３２８６１０号公報（特許文献１）では、親子関係のあるメッセージ間で関連度を示すスコアを算出し、スコアを用いたクラスタリングによって、テーマ抽出対象となるメッセージ群の絞り込み、電子掲示板などのコミュニケーションではメッセージ間に親子関係が形成されていることを利用して、抽出テーマを選択する技術が提案されている。 Conventionally, a technique for extracting an important theme from a transmitted message using information transmitted via a network is known. For example, in Japanese Patent Application Laid-Open No. 2007-328610 (Patent Document 1), a score indicating a degree of association between messages having a parent-child relationship is calculated, and a group of messages to be subject to theme extraction is narrowed down by clustering using the score. In communication such as a bulletin board, a technique for selecting an extraction theme by utilizing the fact that a parent-child relationship is formed between messages has been proposed.

上述した特許文献１は、掲示板の中から重要なスレッドを抽出することを目的とするものであり、メッセージを、トランザクションの返答関係を使用してリンクを定義してツリー構造を作成するものである。上述した特許文献１では、例えば、ユーザ間のリンクに対し、スコアを付与する際に、限定されたユーザ集団内で高密度のトランザクションが発生する場合に高スコアを与える。しかしながら、特定の内容を有するコンテンツに関連して集合離散するユーザ集団を分析することを目的とするものではない。 Patent Document 1 described above is intended to extract important threads from a bulletin board, and creates a tree structure by defining a link for a message using a transaction response relationship. . In Patent Document 1 described above, for example, when a score is given to a link between users, a high score is given when a high-density transaction occurs in a limited user group. However, it is not intended to analyze a user group that is aggregated and dispersed in relation to content having specific contents.

例えば、ブログやＳＮＳ上で特定の内容を有するコンテンツがアップロードされた場合、当該コンテンツに関連するメッセージをネットワーク上で送受信する複数のユーザは、予測がつかず、また事前に予測して解析することは極めて困難である。特定のコンテンツに関連して生成するユーザ間のトランザクション頻度を使用して複数のユーザを、特定のユーザ集団として識別することは、ユーザ集団の特徴付けを可能とし、特定のユーザ集団を対象とした宣伝広告などを行うための重要な情報を提供する。 For example, when content having specific contents is uploaded on a blog or SNS, a plurality of users who send and receive messages related to the content on the network cannot predict and analyze in advance. Is extremely difficult. Identifying multiple users as a specific user population using the frequency of transactions between users generated in connection with specific content allows the characterization of the user population and is targeted to a specific user population Providing important information for advertising.

また、逆に特定のユーザ集団がどのようなコンテンツに関心があるのか、およびユーザ集団の特徴を解析することは、コンテンツに関連するビジネスに対する次世代ソリューションの検討を可能とするものということができる。
特開２００７−３２８６１０号公報 Conversely, what kind of content a particular user group is interested in and analyzing the characteristics of the user group can be considered as a next-generation solution for content-related business. .
JP 2007-328610 A

上述したように、情報の広がり方は、ユーザの属性だけを見ていても判断することができない。例えば、「キーワードとして“ＩＢＭ東京基礎研究所”を含むコンテンツは、いったいどのようなユーザ層に広まっているのか？」に関して、その情報伝播を解析することが必要な場合があった。さらにＳＮＳ内での情報拡散および縮小を時系列的に追跡することによって、以下の項目についての情報を取得することが可能となる。
○情報を最初に発信した人・集団は？
○情報は広がる傾向か？それとも収まる傾向か？
○情報は移動するか？どのような経路での移動か？
○情報を広げる方向にもっとも貢献する人・集団は？
以上の点で、上述したように、特許文献１は、特定のコンテンツに関連した不特定のユーザをユーザ集団として識別することを可能とする技術ではなかった。また、特許文献１は、特定のコンテンツに関連したユーザ集団の時系列的な解析を使用して、ユーザ手段の拡散および縮小を追跡することを可能とするものではない。さらに特許文献１は、特定のユーザ集団に分類されたユーザに共通する属性を分析することにより、ネットワークを介して集合離散するユーザを解析することによってユーザのネットワーク行動を分析することを可能とするものではない。 As described above, it is not possible to determine how the information is spread even if only the user attributes are viewed. For example, there is a case where it is necessary to analyze the information propagation regarding “what kind of user group is content including“ IBM Tokyo Basic Research Laboratories ”as a keyword?”. Furthermore, information on the following items can be acquired by tracking information diffusion and reduction in the SNS in time series.
○ Who was the first person to send information?
○ Does the information tend to spread? Or do they tend to fit?
○ Does the information move? What is the route of travel?
○ Who or group contributes most to the direction of expanding information?
In the above points, as described above, Patent Document 1 is not a technique that enables an unspecified user related to specific content to be identified as a user group. Further, Patent Document 1 does not enable tracking of diffusion and reduction of user means using a time-series analysis of a user group related to specific content. Furthermore, Patent Literature 1 enables analysis of user network behavior by analyzing users that are aggregated and dispersed via a network by analyzing attributes common to users classified into a specific user group. It is not a thing.

すなわち、これまで、特定のコンテンツに関連して複数のユーザが形成するユーザ集団を特徴付ける技術が必要とされていた。 That is, until now, there has been a need for a technique for characterizing a user group formed by a plurality of users in relation to specific content.

また、これまで、特徴付けされたユーザ集団の規模が時系列的にどのように推移して行くのかを分析し、コンテンツの内容に関連したネットワーク行動を分析する技術が必要とされていた。 In addition, until now, there has been a need for a technique for analyzing how the scale of the characterized user group changes over time and analyzing network behavior related to the content.

さらに、特徴付けられたユーザ集団に帰属されるユーザに共通する属性を取得して分析する技術が必要とされていた。 Furthermore, there is a need for a technique for acquiring and analyzing attributes common to users belonging to a characterized user group.

本発明は上記の課題を解決するために、情報処理装置は、ネットワークを介してユーザ・コンピュータから情報に対するアクセスを受付け、ユーザ・コンピュータ間に、情報を特徴付けるパラメータに関連して情報に対するネットワーク行動を生成させる。情報の広がりを示す一つの塊を、インフォバブルとして参照し、当該塊を構成するユーザを、特徴付けされたユーザ集団として参照する。ネットワーク行動は、行動履歴として記憶され、インフォバブルを生成するために利用される。情報処理装置は、インフォバブル生成部を含んでおり、インフォバブル生成部は、特徴パラメータを含む情報を抽出し、抽出した情報に対するネットワーク行動を取得する。そして、前記ユーザ・コンピュータをノードとするユーザ・リンクのユーザから、抽出した情報に対するネットワークでのユーザ関わり度ｇを計算し、さらに設定されたサンプリング間隔で計算したユーザ関わり度ｇを累積計算してスコア値を計算し、少なくとも１つのユーザをインフォバブルに登録し、インフォバブルを生成する。生成されたインフォバブルは、ネットワーク行動解析部により読み出され、特徴パラメータに関連付けられた情報に対するユーザのネットワーク行動を分析するために使用される。 In order to solve the above-described problems, the information processing apparatus receives access to information from a user computer via a network, and performs network action on the information in relation to parameters characterizing the information between the user computers. Generate. One lump indicating the spread of information is referred to as an info bubble, and users constituting the lump are referred to as a characterized user group. Network behavior is stored as behavior history and used to generate info bubbles. The information processing apparatus includes an info bubble generation unit, which extracts information including feature parameters and acquires network behavior for the extracted information. Then, from the user of the user link having the user computer as a node, the user's relevance g in the network for the extracted information is calculated, and the user relevance g calculated at the set sampling interval is cumulatively calculated. A score value is calculated, at least one user is registered in the info bubble, and an info bubble is generated. The generated info bubble is read by the network behavior analysis unit and used to analyze the user's network behavior with respect to the information associated with the feature parameter.

サンプリング周期ごとに前記ユーザ関わり度により計算されるスコア値を累積計算し、直前生成したインフォバブルを拡張するか、または直前生成したインフォバブルを収縮させるかを決定して、時系列的に複数の異なるユーザを含むインフォバブル・シーケンスを生成する。 A cumulative calculation of the score value calculated according to the degree of user involvement for each sampling period determines whether to expand the info bubble generated immediately before or to contract the info bubble generated immediately before, and Generate an info bubble sequence that includes different users.

インフォバブルの膨張・収縮は、スコア値に設定された膨張しきい値および収縮しきい値を参照して決定される。インフォバブルの膨張は、ユーザ・リンクの隣接ユーザを直前生成したインフォバブルに含ませた場合に、膨張後のバブル平均スコアが収縮しきい値以下とならないことを基準として実行される。 The expansion / contraction of the info bubble is determined with reference to the expansion threshold value and the contraction threshold value set in the score value. The expansion of the info bubble is executed based on the fact that the bubble average score after the expansion does not become the contraction threshold value or less when the adjacent user of the user link is included in the previously generated info bubble.

また、インフォバブルの収縮は、直前生成したインフォバブルが含むスコア値の最小のユーザを選択し、バブル平均スコアが収縮しきい値を超えるまでユーザをインフォバブルから削除することにより実行される。 Further, the shrinkage of the info bubble is executed by selecting the user with the smallest score value included in the info bubble generated immediately before and deleting the user from the info bubble until the bubble average score exceeds the shrinkage threshold.

さらに、ネットワーク行動分析部は、識別値で指定されるインフォバブルを読み出し、インフォバブルの時間進化を使用し特徴パラメータを含む情報の情報伝播を分析し、さらに、読み出した複数のインフォバブルのユーザ属性の類似性を使用してインフォバブルのクラスタを生成し、クラスタを形成するユーザを分析する。 Further, the network behavior analysis unit reads the info bubble specified by the identification value, analyzes the information propagation of the information including the characteristic parameter using the time evolution of the info bubble, and further, the user attributes of the plurality of read info bubbles Generate the info bubble cluster using the similarity of and analyze the users who form the cluster.

すなわち、本発明によれば、特定のコンテンツに関連して生成するユーザ間のトランザクション頻度を使用して複数のユーザを、特定のユーザ集団として識別することにより、ユーザ集団の特徴付けを可能とする、情報処理装置、分析システム、分析方法およびプログラムを提供することが可能となる。 That is, according to the present invention, it is possible to characterize a user group by identifying a plurality of users as a specific user group using a transaction frequency between users generated in relation to a specific content. It is possible to provide an information processing apparatus, an analysis system, an analysis method, and a program.

また、本発明では、上述したユーザ集団の特徴付けを使用することで、逆に特定のユーザ集団がどのようなコンテンツに関心があるのか、およびユーザ集団の特徴を解析することは、コンテンツに関連するビジネスに対する次世代ソリューションの検討を可能とする、情報処理装置、分析システム、分析方法およびプログラムを提供することが可能となる。 In addition, in the present invention, by using the above-described characterization of the user group, it is related to the content to analyze what kind of content the specific user group is interested in and to analyze the characteristics of the user group. It is possible to provide an information processing apparatus, an analysis system, an analysis method, and a program that make it possible to examine next-generation solutions for business.

＜セクション１：ハードウェア構成＞
以下、本発明を実施の形態をもって説明するが、本発明は、後述する実施形態に限定されるものではない。図１は、本実施形態のユーザのネットワーク行動を分析する分析システム１００の実施形態を示す。分析システム１００は、ネットワーク１１６に接続され、ユーザにより操作されて、ネットワーク１１６を介してサーバ１２２にアクセスする複数のユーザ・コンピュータ（以下、説明の便宜上、ユーザとして参照する。）１１２、１１４を含んで構成されている。ユーザ１１２は、例えばＳＮＳなどで友人などとして登録されたユーザ・グループを形成する。また、ユーザ１１４は、これとは異なるユーザ・グループを形成している。ユーザのネットワーク行動は極めて多岐にわたり、また広範なコンテンツがアプリケーション・サーバなどに登録されるので、各ユーザ・グループは、常時完全に分離しているわけではなく、時系列的に相互関連し合い、特定のユーザ・グループに帰属されているユーザが、他のユーザ・グループに参加し、また離脱するなどの行動を取る。 <Section 1: Hardware configuration>
The present invention will be described below with reference to embodiments, but the present invention is not limited to the embodiments described below. FIG. 1 shows an embodiment of an analysis system 100 for analyzing a user's network behavior according to this embodiment. The analysis system 100 includes a plurality of user computers (hereinafter referred to as users for convenience of explanation) 112 and 114 connected to a network 116 and operated by a user to access the server 122 via the network 116. It consists of For example, the user 112 forms a user group registered as a friend or the like in SNS or the like. Further, the user 114 forms a different user group. User network behavior is very diverse, and a wide range of content is registered with application servers, etc., so user groups are not always completely separated, but are related to each other in time series, A user belonging to a specific user group takes an action such as joining or leaving another user group.

ネットワーク１１６には、サーバ機能部１２０が接続されていて、情報検索、ＳＮＳ、ウェブ・サービス、メール・サービスなどを含むアプリケーション・サービスをユーザに対して提供している。サーバ機能部１２０は、情報処理装置１２２と、複数のデータベース１２４〜１３０を含んで構成されている。情報処理装置１２２は、本実施形態では、ＳＮＳ、情報検索などのサービスを提供するアプリケーション・サーバモジュールおよびユーザのネットワーク行動を分析するための分析モジュールとを含んで実装されている。これらの各モジュール構成については、より詳細に後述する。 A server function unit 120 is connected to the network 116 and provides application services including information retrieval, SNS, web service, mail service, and the like to users. The server function unit 120 includes an information processing device 122 and a plurality of databases 124 to 130. In the present embodiment, the information processing apparatus 122 is implemented including an application server module that provides services such as SNS and information search, and an analysis module for analyzing network behavior of the user. Each of these module configurations will be described in detail later.

また、情報処理装置１２２は、複数のデータベース１２４〜１３０を使用してユーザのネットワーク行動を分析する。ネットワーク・データ記憶部１２４は、ユーザのネットワーク・アドレスなどを含むネットワーク・データを記憶している。テキスト・データ記憶部１２６は、ユーザがアップデートしたコンテンツのテキスト部分、メール、ブログなどのコメントなどといったネットワーク行動を分析するために使用するテキスト・データを記憶する。行動履歴記憶部１２８は、情報処理装置１２２へのユーザ・アクセスの履歴、いわゆるアクセスログを記録して、後の分析を実行するためのデータを提供する。 In addition, the information processing apparatus 122 analyzes a user's network behavior using the plurality of databases 124 to 130. The network data storage unit 124 stores network data including a user's network address and the like. The text data storage unit 126 stores text data used for analyzing network behavior such as a text portion of content updated by a user, a comment such as an email or a blog. The action history storage unit 128 records a history of user access to the information processing apparatus 122, so-called access log, and provides data for performing later analysis.

さらに、情報処理装置１２２は、ユーザ情報記憶部１３０を管理していて、ネットワーク１１６を介してアクセスするユーザを、例えばユーザ識別値（ユーザＩＤ）、ハンドルネーム、ＩＰアドレスなどを使用して識別し、行動履歴記憶部１２８による行動履歴作成を可能とする。その他、ユーザ情報管理部１３０は、ＳＮＳや特定の権限を有するサービスへのアクセス管理を実行するための情報を含んでいて、情報処理装置１２２によるサービス提供の管理を可能とさせている。本実施形態では、ネットワーク１１６は、インターネットなどのネットワークを含むことが好ましいが、インターネット以外にもＷＡＮ(Wide Area Network)、ＬＡＮ(Local Area network)などを含んで構成されていてもよい。 Furthermore, the information processing apparatus 122 manages the user information storage unit 130 and identifies a user who accesses through the network 116 using, for example, a user identification value (user ID), a handle name, an IP address, and the like. The action history storage unit 128 can create an action history. In addition, the user information management unit 130 includes information for executing access management to an SNS or a service having a specific authority, and allows the information processing apparatus 122 to manage service provision. In the present embodiment, the network 116 preferably includes a network such as the Internet, but may include a WAN (Wide Area Network), a LAN (Local Area Network), and the like in addition to the Internet.

また、情報処理装置１２２は、図１に示すように、アプリケーション・モジュールと分析モジュールとが一体として構成することができる。他の実施形態では、アプリケーション・モジュールを、アプリケーション・サーバとして構成し、分析モジュールを分離して分析サーバとして配置する実装形式を採用することもできる。いずれの実装形式を採用するかについては、特定の用途に依存して、適宜選択することができる。 Further, as shown in FIG. 1, the information processing apparatus 122 can be configured by integrating an application module and an analysis module. In another embodiment, the application module may be configured as an application server, and an implementation form in which the analysis module is separated and arranged as an analysis server may be employed. Which mounting format is adopted can be appropriately selected depending on a specific application.

上述した情報処理装置１２２は、ＰＥＮＴＩＵＭ（登録商標）、ＰＥＮＴＩＵＭ（登録商標）互換チップ、などのＣＩＳＣアーキテクチャのマイクロプロセッサ、または、ＰＯＷＥＲＰＣ（登録商標）などのＲＩＳＣアーキテクチャのマイクロプロセッサを実装することができる。また、情報処理装置１２２は、ＷＩＮＤＯＷＳ（登録商標）２００Ｘ、ＵＮＩＸ（登録商標）、ＬＩＮＵＸ（登録商標）などのオペレーティング・システムにより制御されていて、Ｃ＋＋、ＪＡＶＡ（登録商標）、ＪＡＶＡ（登録商標）ＢＥＡＮＳ、ＰＥＲＬ、ＲＵＢＹなどのプログラミング言語を使用して実装される、ＣＧＩ、サーブレット、ＡＰＡＣＨＥなどのサーバ・プログラムを実行し、ユーザ１１２、１１４からの要求を処理し、サービスを提供する。 The information processing apparatus 122 described above may be implemented with a CISC architecture microprocessor such as PENTIUM (registered trademark) or a PENTIUM (registered trademark) compatible chip, or a RISC architecture microprocessor such as POWER PC (registered trademark). it can. The information processing apparatus 122 is controlled by an operating system such as WINDOWS (registered trademark) 200X, UNIX (registered trademark), or LINUX (registered trademark), and includes C ++, JAVA (registered trademark), and JAVA (registered trademark). It executes server programs such as CGI, servlets, and APACHE implemented using programming languages such as BEANS, PERL, and RUBY, processes requests from users 112 and 114, and provides services.

ユーザ１１２、１１４と、情報処理装置１２２との間は、ＴＣＰ／ＩＰなどのトランザクション・プロトコルに基づき、ＨＴＴＰプロトコルなどのファイル転送プロトコルを使用するトランザクションを使用したネットワークで接続される。ユーザ１１２、１１４は、情報処理装置１２２にアクセスし、ファイルのアップロード、ダウンロード、ブログ書込み、ブログ読出し、感想・意見の記述、チャット、フォーム送信、フォーム・ダウンロード、コンテンツ・アップロード、コンテンツ・ダウンロードなどを行っている。以下、ユーザが情報処理装置１２２に対してネットワーク１１６を介して行う各種の行動を、ユーザのネットワーク行動として参照する。 The users 112 and 114 and the information processing apparatus 122 are connected via a network using a transaction using a file transfer protocol such as an HTTP protocol based on a transaction protocol such as TCP / IP. The users 112 and 114 access the information processing apparatus 122 and perform file upload, download, blog write, blog read, comment / opinion description, chat, form transmission, form download, content upload, content download, etc. Is going. Hereinafter, various actions that the user performs on the information processing apparatus 122 via the network 116 are referred to as the user's network actions.

一方、ユーザ１１２、１１４は、パーソナル・コンピュータまたはワークステーションなどを使用して実装でき、また、そのマイクロプロセッサ（ＭＰＵ）は、これまで知られたいかなるシングルコア・プロセッサまたはデュアルコア・プロセッサを含んでいてもよい。また、ユーザ１１２、１１４は、ＷＩＮＤＯＷＳ（登録商標）、ＵＮＩＸ（登録商標）、ＬＩＮＵＸ（登録商標）、ＭＡＣＯＳなど、いかなるオペレーティング・システムにより制御することができる。ユーザ１１２、１１４がウェブ・クライアントとして機能する場合には、ユーザ１１２、１１４は、Internet Explorer（商標）、Mozilla、Opera、Netscape Navigator（商標）などのブラウザ・ソフトウェアを使用して、情報処理装置１２２にアクセスする。 On the other hand, the users 112 and 114 can be implemented using a personal computer or a workstation, and the microprocessor (MPU) includes any single-core processor or dual-core processor known so far. May be. The users 112 and 114 can be controlled by any operating system such as WINDOWS (registered trademark), UNIX (registered trademark), LINUX (registered trademark), and MAC OS. When the users 112 and 114 function as web clients, the users 112 and 114 use browser software such as Internet Explorer (trademark), Mozilla, Opera, and Netscape Navigator (trademark) to process the information processing apparatus 122. To access.

図２は、図１に示した情報処理装置１２２の機能ブロック構成２００を示す。情報処理装置１２２は、ネットワーク１１６を介してユーザ１１２、１１４からの要求を受領し、またサービスの提供を行うため、ＯＳＩ基本参照モデルでは、物理層／データリンク層／ネットワーク層レベルの処理を実行するネットワーク・アダプタ２８０を含んで構成される。ネットワーク・アダプタ２８０が受領した要求は、アプリケーション提供部２７０に送られて、情報処理装置１２２によるサービス処理が実行され、その実行結果は、ネットワーク・アダプタ２８０を介してネットワーク１１６を経由してユーザ１１２、１１４に送付される。 FIG. 2 shows a functional block configuration 200 of the information processing apparatus 122 shown in FIG. The information processing apparatus 122 receives requests from the users 112 and 114 via the network 116 and provides services. Therefore, in the OSI basic reference model, processing at the physical layer / data link layer / network layer level is executed. Network adapter 280 to be configured. The request received by the network adapter 280 is sent to the application providing unit 270, and service processing by the information processing apparatus 122 is executed. The execution result is sent to the user 112 via the network 116 via the network adapter 280. , 114.

一方、ユーザ１１２、１１４が情報処理装置１２２に対して行った要求、アップロード、ダウンロード、書き込み、読み出しなどのネットワーク行動は、アプリケーション提供部２７０の処理を処理履歴として各データベースに送付される。例えば、ユーザ１１２、１１４のネットワーク行動は、そのトランザクションが発生したタイムスタンプ、ユーザＩＤ、コンテンツ内容などが抽出され、行動履歴記憶部１２８、テキスト・データ記憶部１２６などに登録され、以後、ユーザ・グループの時間的進化を分析するために利用される。 On the other hand, network actions such as requests, uploads, downloads, writes, and reads made by the users 112 and 114 to the information processing apparatus 122 are sent to each database as processing history of the application providing unit 270. For example, the network behavior of the users 112 and 114 is extracted from the time stamp when the transaction occurred, the user ID, the content content, etc., and registered in the behavior history storage unit 128, the text data storage unit 126, etc. Used to analyze group temporal evolution.

情報処理装置１２２は、アプリケーション提供部２７０の他、ユーザのネットワーク行動を分析するための複数のモジュールを含んで構成される。情報処理装置１２２は、ユーザのネットワーク行動を分析するために、パラメータ制御部２１０、インフォバブル生成部２２０とを含んで構成される。パラメータ制御部２１０は、ユーザのネットワーク行動を、ネットワークを介して送受信されたデータから分析するために使用する、キーワードなどの特徴パラメータを設定・変更する処理を実行する。 The information processing apparatus 122 includes a plurality of modules for analyzing a user's network behavior in addition to the application providing unit 270. The information processing apparatus 122 includes a parameter control unit 210 and an info bubble generation unit 220 in order to analyze a user's network behavior. The parameter control unit 210 executes a process of setting / changing a characteristic parameter such as a keyword used for analyzing a user's network behavior from data transmitted / received via the network.

また、インフォバブル生成部２２０は、ネットワークを介して送受信されたコンテンツから、設定された特徴パラメータを含むデータを選択して抽出し、当該トランザクションに関わるユーザを、ユーザＩＤ、ハンドルネーム、ＩＰアドレスなどにより抽出する。そして、インフォバブル生成部２２０は、ユーザについて、ユーザ・グループを割当てる処理を実行する。なお、以後、特定の特徴パラメータにより識別されるユーザ・グループを、インフォバブル（ＩＢ）として参照する。 In addition, the info bubble generation unit 220 selects and extracts data including the set characteristic parameters from the content transmitted / received via the network, and identifies the user involved in the transaction as the user ID, handle name, IP address, etc. Extract by And the info bubble production | generation part 220 performs the process which allocates a user group about a user. Hereinafter, a user group identified by a specific feature parameter is referred to as an info bubble (IB).

インフォバブル生成部２２０は、上述した処理を実行するため、リレーショナル・データベース機能を含んで構成することができ、ＳＱＬ(Structured Query Language）を使用して、設定されたタイムウィンドウごとにネットワーク行動をモニタして、ユーザをインフォバブルに帰属し、インフォバブルの時間的進化を分析する。また、インフォバブル生成部２２０は、生成したインフォバブル・データを、インフォバブル記憶部２６０に登録し、後述するインフォバブルのクラスタリングなどのために利用する。 The info bubble generation unit 220 can be configured to include a relational database function to execute the above-described processing, and monitors network behavior for each set time window using SQL (Structured Query Language). Then, the user is attributed to the info bubble and the time evolution of the info bubble is analyzed. Further, the info bubble generation unit 220 registers the generated info bubble data in the info bubble storage unit 260 and uses it for information bubble clustering, which will be described later.

インフォバブル記憶部２６０に登録されたインフォバブル・データは、適切なＡＰＩ(Application Programming Interface)２３０を介してネットワーク行動分析部２４０に送付される。ネットワーク行動分析部２４０は、インフォバブル・データを特定のタイムウィンドウ内について分析することにより、インフォバブルを構成するユーザのついての情報を取得し、インフォバブルを構成する特徴パラメータについてのユーザ分析などを実行する。 The info bubble data registered in the info bubble storage unit 260 is sent to the network behavior analysis unit 240 via an appropriate API (Application Programming Interface) 230. The network behavior analysis unit 240 analyzes information bubble data within a specific time window to acquire information about users constituting the info bubble, and performs user analysis on characteristic parameters constituting the info bubble. Run.

また、ネットワーク行動分析部２４０は、タイムウィンドウごとのインフォバブルの変化などの時間進化を取得して、特徴パラメータについてのインフォバブルの形成期間、種類などを分析する。さらに、ネットワーク行動分析部２４０は、インフォバブルに関連してインフォバブルを、特定のユーザ属性などを使用してクラスタリングし、より、大域的なネットワーク行動の分析を実行する。なお、インフォバブル生成部２２０およびネットワーク行動分析部２４０の詳細な処理については、詳細に後述する。 In addition, the network behavior analysis unit 240 acquires time evolution such as a change in info bubble for each time window, and analyzes the formation period and type of the info bubble for the characteristic parameter. Furthermore, the network behavior analysis unit 240 clusters the info bubbles in relation to the info bubbles using specific user attributes and the like, and performs a more global network behavior analysis. Detailed processing of the info bubble generation unit 220 and the network behavior analysis unit 240 will be described later in detail.

ネットワーク行動分析部２４０の分析結果は、テーブル、リスト、またはグラフなどの適切な表現形式でネットワーク行動分析結果出力部２５０に送付され、出力が行われる。なお、ネットワーク行動分析結果出力部２５０は、情報処理装置１２２がローカルに管理する出力デバイスとすることもできるし、ネットワーク・アダプタ２８０を介して管理者端末などにデータをアップロードする例えば、ＦＴＰモジュール、ＨＴＴＰモジュールとして構成することができる。 The analysis result of the network behavior analysis unit 240 is sent to the network behavior analysis result output unit 250 in an appropriate expression format such as a table, list, or graph, and output. The network behavior analysis result output unit 250 can be an output device that is locally managed by the information processing apparatus 122, or uploads data to an administrator terminal or the like via the network adapter 280. For example, an FTP module, It can be configured as an HTTP module.

＜セクション２：インフォバブル定義づけ＞
図３は、本実施形態のインフォバブル３００の例示的な実施形態を示す。インフォバブル３００は、複数のユーザ３２０、３１０のリンクについて、特定の特徴パラメータについてのアクセス頻度が一定以上のユーザ３１０を登録することにより生成される。以下、本実施形態では、特定の情報Ｉに対するアクセス頻度に対して、ユーザ関わり度ｇおよびスコア関数ｆ（ｕ，Ｉ，Ｔ）を定義して定式化する。図３に示した実施形態では、インフォバブル境界３３０が定義され、その中に、情報Ｉについて知っているユーザが多く含まれている。 <Section 2: Defining Info Bubble>
FIG. 3 shows an exemplary embodiment of the info bubble 300 of the present embodiment. The info bubble 300 is generated by registering a user 310 whose access frequency with respect to a specific feature parameter is equal to or greater than a certain value for a link of a plurality of users 320 and 310. Hereinafter, in the present embodiment, the user association degree g and the score function f (u, I, T) are defined and formulated with respect to the access frequency for the specific information I. In the embodiment shown in FIG. 3, an info bubble boundary 330 is defined, which includes many users who know about the information I.

また、インフォバブル境界３３０の内側には、情報Ｉについて知らない（アクセスしない）ユーザも少数含まれる。この情報Ｉを知らないユーザは、可能性として、当該情報Ｉにまだアクセスしていないだけの可能性もあるので、インフォバブル３００を構成するインフォバブル境界３３０内の他のユーザとの関係でインフォバブル３００の要素ユーザとして登録される。なお、インフォバブル境界３３０の内側のユーザは、インフォバブル・データを構成するユーザ集合として、そのユーザＩＤ、ハンドルネーム、ＩＰアドレスなどをＣＳＶ、スペース区切り、カンマ区切りなどのフォーマットで、リストに登録される。 In addition, a small number of users who do not know (do not access) the information I are included inside the info bubble boundary 330. A user who does not know the information I may possibly have not yet accessed the information I, so that the information I is related to other users in the info bubble boundary 330 constituting the info bubble 300. It is registered as an element user of bubble 300. In addition, the users inside the info bubble boundary 330 are registered in the list in the format such as CSV, space delimiter, comma delimiter, etc., as the user set constituting the info bubble data. The

また、ユーザ３２０は、情報Ｉに関連してアクセス頻度が高くないので、図３に示した実施形態では、インフォバブル境界３３０の外側に示されている。なお、時間経過に対応して、ユーザ３２０が情報Ｉについてもアクセス頻度がしきい値を超えるようになる場合、インフォバブル境界３３０がユーザ３２０を含むように拡張される。 Further, since the user 320 is not frequently accessed in relation to the information I, the user 320 is shown outside the info bubble boundary 330 in the embodiment shown in FIG. If the access frequency of the user 320 for the information I exceeds the threshold value as time passes, the info bubble boundary 330 is expanded to include the user 320.

図３に示すユーザ・リンクは、予めインフォバブル生成部２２０が管理する。ユーザ・リンクは、ユーザがノードとされ、ノード間のリンクは、ノード間のネットワーク行動に関連する関わり合いの存在を示すものである。ユーザ間のリンクを生成する処理は、種々想定でき、第１の実施形態としては、情報処理装置１２２がＳＮＳサービスを提供する場合には、個々のユーザにより登録された登録関係を使用して、リンクを生成し、適切な記憶領域に記憶させておくことができる。なお、このリンクは、例えば、ユーザ情報記憶部１３０に登録しておき、インフォバブル生成部２２０がインフォバブルを生成する処理に応じてアクセスすることができる。また、ユーザ・リンクは、ネットワーク・データとして、ネットワーク・データ記憶部１２４に格納することもできる。 The user bubble shown in FIG. 3 is managed in advance by the info bubble generation unit 220. The user link indicates that the user is a node, and the link between nodes indicates the existence of an association related to the network behavior between the nodes. Various processes for generating a link between users can be assumed. In the first embodiment, when the information processing apparatus 122 provides an SNS service, a registration relationship registered by each user is used. A link can be generated and stored in an appropriate storage area. In addition, this link can be registered in the user information storage unit 130, for example, and can be accessed according to the process in which the info bubble generation unit 220 generates the info bubble. The user link can also be stored in the network data storage unit 124 as network data.

また、第２の実施形態では、ユーザ間で送受信された情報の行動を分析するための行動マトリックスを定義し、行動マトリックス特定のキーワードといった特徴パラメータの共有割合の高いユーザ間にリンクを定義する方法を採用することができる。行動マトリックスを使用する詳細な処理は、特願２００７−３３６９１９号明細に記載されているが、本明細書においてその処理の概略を説明する。 In the second embodiment, a method of defining a behavior matrix for analyzing behavior of information transmitted and received between users, and defining a link between users having a high sharing ratio of feature parameters such as behavior matrix specific keywords Can be adopted. Detailed processing using the behavior matrix is described in Japanese Patent Application No. 2007-336919, and the outline of the processing will be described in this specification.

図４は、行動マトリックスのデータ構成およびその生成処理を示した概略図である。ユーザ４０２は、例えばオンライン・コミュニティ・サービスにおいて、行動４０４として示した、「メッセージを書く」４０４ａ、「掲示板に書き込む」４０４ｂ、「ブログを書く」４０４ｃ・・・「メッセージを読む」４０４ｉ、「掲示板を読む」４０４ｊ、「ブログを読む」４０４ｋ、「ニュースを読む」４０４ｌ・・などの行動（活動）を行う。 FIG. 4 is a schematic diagram showing the data structure of the behavior matrix and its generation process. For example, in the online community service, the user 402 indicates “write message” 404 a, “write to bulletin board” 404 b, “write blog” 404 c... “Read message” 404 i, “bulletin board” shown as actions 404. ”404 j,“ Read blog ”404 k,“ Read news ”404 l...

このような、ユーザがオンライン・コミュニティ・サービス内でとり得る行動の種類は、情報処理装置１２２の適切な処理モジュール、例えば、パラメータ制御部２１０などに予め設定されている。上述した活動の各々に対して、ある一定期間に、ユーザ４０２が、メッセージとして読み書きしたテキスト４０６ａ、掲示板に読み書きしたテキスト４０６ｂ、ブログとして読み書きしたテキスト４０６ｃ、ニュースとして読んだテキスト４０６ｄ・・・を、テキスト情報４０６として、情報処理装置１２２が管理するテキスト・データ記憶部１２６に一旦保存する。 The types of actions that the user can take in the online community service are preset in an appropriate processing module of the information processing apparatus 122, such as the parameter control unit 210. For each of the above-mentioned activities, the user 402 reads / writes a text 406a read / written as a message, a text 406b read / written on a bulletin board, a text 406c read / written as a blog, a text 406d ... The text information 406 is temporarily stored in the text / data storage unit 126 managed by the information processing apparatus 122.

図４では、情報を、読書きしたテキスト４０６ａのように一括して参照するが、読んだテキストおよび書いたテキストは、別個に識別可能に保存される。これは、掲示板やブログについても、同様である。そこで、特開２００１−８４２５０、特開２００２−２５１４０２、特開２００４−２４６４４０などから周知の構文解析技術などにより、情報が含むキーワードや特定表現の頻度を、ＴＦ−ＩＤＦ(Term Frequency−Invert Document Frequency)などの手法を使用してパラメータ制御部２１０などが解析する。解析結果を使用して、情報処理装置１２２は、ことにより実行し、当該ユーザ４０２に対する行動マトリクス４０８を生成する。 In FIG. 4, information is collectively referred to as read / written text 406a, but the read text and the written text are stored separately and identifiable. The same applies to bulletin boards and blogs. Therefore, the frequency of keywords and specific expressions included in the information is changed to TF-IDF (Term Frequency-Invert Document Frequency) using a syntax analysis technique known from JP-A-2001-84250, JP-A-2002-251402, and JP-A-2004-246440. The parameter control unit 210 or the like analyzes using a method such as). Using the analysis result, the information processing apparatus 122 executes the process to generate an action matrix 408 for the user 402.

図４に示した特定の実施形態では、本発明の行動マトリクス４０８は、行をキーワード、列を行動種別として定義されている。行動マトリックス４０８は、キーワードなどの特徴パラメータを登録する行が多くの場合数千行であり、行動種類は、ネットワーク行動に関連して多くの場合それよりも少なく、列が数十程度であり、矩形行列を構成する。また、行に現れるキーワードは、テキスト情報４０６として保存されたすべてのテキストから構文解析により抽出されたキーワードのすべてを網羅する。そして、行動マトリックスの要素は、特定のキーワードなどの特徴パラメータの行の、特定行動の列の成分の値として、当該行動に関連するテキストから取得された、当該する特徴パラメータの出現頻度を設定する。 In the specific embodiment shown in FIG. 4, the behavior matrix 408 of the present invention is defined with the rows as keywords and the columns as behavior types. The behavior matrix 408 has thousands of rows for registering feature parameters such as keywords in many cases, the behavior type is often smaller than that in connection with network behavior, and the number of columns is about tens. Construct a rectangular matrix. The keywords appearing on the line cover all the keywords extracted by the syntax analysis from all the texts stored as the text information 406. The element of the behavior matrix sets the appearance frequency of the relevant feature parameter acquired from the text related to the behavior as the value of the component of the specific behavior column in the row of the feature parameter such as the particular keyword. .

上述した行動マトリクスの値は、ユーザごとに、個別のファイルとして、適切な記憶部、例えば行動履歴記憶部１２８に保存される。このとき、保存する形式は、例えば、ＣＳＶ、ＨＴＭＬ、ＸＭＬなど、行と列の値として、Ｃ、Ｃ＋＋、Ｃ＃、Ｊａｖａ（登録商標）、Ｐｅｒｌ、Ｒｕｂｙ、ＰＨＰなどのプログラミング・ツールによって識別可能な任意の形式とすることができる。 The behavior matrix values described above are stored in an appropriate storage unit, for example, the behavior history storage unit 128, as an individual file for each user. At this time, the format to be saved can be identified by programming tools such as C, C ++, C #, Java (registered trademark), Perl, Ruby, and PHP as row and column values such as CSV, HTML, and XML. Any format can be used.

上述のように作成された行動マトリックスは、Hausholder法などによる特異値計算により、特定のネットワーク行動においてその時点で送受信される特徴パラメータを取得するためにも使用することができる。また、上述のようにして生成された行動マトリックス４０８は、本発明においては、特徴パラメータに関連したユーザ間のリンクを生成するために使用することができる。例えば、行動マトリックス４０８の要素値は、特徴パラメータの出現頻度なので、特徴パラメータの出現頻度についてしきい値を設定し、しきい値を超えた特徴パラメータを共有するユーザ間にリンクを割当てることにより図３のユーザ・リンクを生成させ、登録しておくことができる。 The behavior matrix created as described above can also be used to acquire feature parameters that are transmitted / received at that time in a specific network behavior by singular value calculation by Hausholder method or the like. In addition, the behavior matrix 408 generated as described above can be used in the present invention to generate a link between users related to the characteristic parameter. For example, since the element value of the behavior matrix 408 is the appearance frequency of the feature parameter, a threshold is set for the appearance frequency of the feature parameter, and a link is allocated between users who share the feature parameter exceeding the threshold. Three user links can be generated and registered.

この際、トランザクションが直接行われたユーザ間についてまずリンクを生成し、トランザクションのノードとなったユーザから、同一の特徴パラメータの出現頻度について設定される第２のしきい値以上を有するリンクを当該ノードについての子ノードとして設定するなどの処理を使用してユーザ・リンクを拡張する。 At this time, a link is first generated between users who have directly performed the transaction, and a link having a second threshold value or higher set for the appearance frequency of the same feature parameter is determined from the user who has become the node of the transaction. Extend the user link using a process such as setting it as a child node for the node.

図５は、本実施形態のインフォバブルの時間進化を説明する説明図である。インフォバブル生成部２２０は、周期的に、行動履歴記憶部１２８などにアクセスして、設定されたタイムウィンドウ間の行動履歴を抽出し、抽出した行動履歴に含まれるユーザについて、インフォバブルを生成する。例えば、インフォバブル生成部は、サンプリングタイムＴ＝ｔ_０で、タイムウィンドウＴ_０５４０の期間についての行動履歴をサンプリングして、ユーザ・リンク５１０に登録されたユーザに対してインフォバブル５１２、５１４、５１６を生成し、インフォバブル記憶部２６０に登録する。 FIG. 5 is an explanatory diagram for explaining the time evolution of the info bubble according to the present embodiment. The info bubble generation unit 220 periodically accesses the action history storage unit 128 and the like, extracts the action history between the set time windows, and generates an info bubble for the user included in the extracted action history. . For example, the info bubble generation unit samples the action history for the period of the time window T ₀ 540 at the sampling time T = t ₀ and gives info bubbles 512 514 514 to the users registered in the user link 510. 516 is generated and registered in the info bubble storage unit 260.

その後、インフォバブル生成部２２０は、サンプリング周期が経過したサンプリングタイムＴ＝ｔ_１でタイムウィンドウＴ_１５５０の期間にわたり、同所の処理を実行して、ユーザ・リンク５２０についてインフォバブル５２２、５２４、５２６を生成する。この時、特徴パラメータに関連してユーザの興味や話題が変化する場合、インフォバブル５１２に対応するインフォバブル５２２のサイズ、すなわち要素ユーザ数が変化する。インフォバブルの時間進化は、それぞれインフォバブル５１４、５２４、インフォバブル５１６、５２６についても生成することができる。同様に、サンプリングタイムＴ＝t_２においてもタイムウィンドウＴ_２が設定され、インフォバブルを生成するために利用される。 After that, the info bubble generation unit 220 performs the same processing over the period of the time window T ₁ 550 at the sampling time T = t ₁ at which the sampling period has elapsed, and the info bubbles 522, 524, 526 is generated. At this time, when the user's interest or topic changes in relation to the feature parameter, the size of the info bubble 522 corresponding to the info bubble 512, that is, the number of element users changes. The time evolution of info bubbles can also be generated for info bubbles 514 and 524 and info bubbles 516 and 526, respectively. Similarly, a time window T ₂ is set at the sampling time T = t ₂ and is used to generate an info bubble.

インフォバブル生成部２２０は、サンプリングタイムＴ＝Ｔ_３となった場合、再度、タイムウィンドウ５６０の間の行動履歴を取得して、インフォバブル５３２、５３４、５３６を生成し、ユーザ・リンク５３０に各インフォバブルをインフォバブル・シーケンスとして生成させて行くことにより、特徴パラメータに関連付けられたインフォバブルの時間進化を分析することが可能となる。なお、インフォバブル生成部２２０は、行動履歴記憶部１２８にすでに登録されたデータを参照してインフォバブルを生成することができるし、行動履歴記憶部１２８に登録されるアクセスログをサンプリングタイムが到来した段階で、タイムウィンドウの期間だけ取得して、インフォバブルを生成してもよい。また、サンプリングウィンドウＴ_ｍ、サンプリング周期（ｔ_ｍ＋１−ｔ_ｍ）の比率は、適切な精度でインフォバブルを生成することができる限り、特に限定はない。 When the sampling time T = T ₃ , the info bubble generation unit 220 acquires the action history during the time window 560 again, generates the info bubbles 532, 534, and 536, and sets the user links 530 to each of the user links 530. By generating an info bubble as an info bubble sequence, it is possible to analyze the time evolution of the info bubble associated with the feature parameter. The info bubble generation unit 220 can generate an info bubble by referring to data already registered in the action history storage unit 128, and the access log registered in the action history storage unit 128 has been sampled. At this stage, the information bubble may be generated by acquiring only the period of the time window. Further, the ratio of the sampling window T _m and the sampling period (t _{m + 1} −t _m ) is not particularly limited as long as an info bubble can be generated with appropriate accuracy.

＜セクション３：インフォバブル生成処理＞
インフォバブル生成部２２０は、インフォバブルを生成するためのタイムウィンドウの期間にわたり、特徴パラメータに関連する情報について、ユーザ関わり度ｇを計算する。ユーザ関わり度ｇは、情報Ｉに対して何度ユーザが書き込みアクセスを行い、また読み込みアクセスを行ったかを、行動履歴から取得し、書き込みアクセスがある場合については、下記式（１）で、書き込みアクセスはないが、読み込みアクセスがある場合については、下記式（２）で与えられる値ｇ（ｕ，Ｉ，Ｔ_ｍ，Ｔ_ｍ＋１）で定義される。また、情報Iに対して書き・読み込みアクセス両方がないユーザに対しては、対応するそのユーザ関わり度をゼロとする。なお、Ｔ_ｍは、タイムウィンドウの時系列順序を示すための識別値であり、ｍは、０以上の整数である。 <Section 3: Info bubble generation processing>
The info bubble generation unit 220 calculates a user relationship g for information related to the feature parameter over the period of the time window for generating the info bubble. The degree of user involvement g is obtained from the action history as to how many times the user has made write access to the information I, and read access has been made. If there is write access, the following formula (1) When there is no access but there is a read access, it is defined by values g (u, I, T _m , T _{m + 1} ) given by the following equation (2). For users who do not have both write and read access to information I, the corresponding degree of user involvement is set to zero. T _m is an identification value for indicating the time series order of the time window, and m is an integer of 0 or more.

上記式中、ｕは、ユーザＩＤ、Ｉは、情報ＩＤ、ｎ＞０は、アクセス回数、ｊは、正の整数である。

In the above formula, u is a user ID, I is an information ID, n> 0 is the number of accesses, and j is a positive integer.

また、本実施形態では、ユーザがアクセスする可能性のある関連事項例えば、「日本アイ・ビー・エム」、「ＩＢＭ」などについては、特徴パラメータに関連付けられるものとして、小さな定数を導入し、上記式（１）、（２）の右辺の係数として設定し、ユーザ関わり度の計算に含ませることができる。 Further, in the present embodiment, related items that the user may access, such as “Japan IBM”, “IBM”, etc., are introduced as a small constant as being associated with the feature parameter, and It can be set as a coefficient on the right side of the equations (1) and (2) and included in the calculation of the user relation.

さらに、インフォバブル生成部２２０は、インフォバブル境界を、現時点で登録されている、インフォバブル境界から膨張させるか、または収縮させるかを判断するためのスコア関数ｆ（ｕ，Ｉ，Ｔ）を実装する。スコア関数ｆ（ｕ，Ｉ，Ｔ）は、説明する実施形態では、下記式（３）として実装することができる。 Further, the info bubble generation unit 220 implements a score function f (u, I, T) for determining whether the info bubble boundary is expanded or contracted from the info bubble boundary currently registered. To do. The score function f (u, I, T) can be implemented as the following equation (3) in the embodiment to be described.

上記式（３）中、αは、０＜α＜１の実数である。ただし、減少の場合、時系列的になめらかな関数であって、アクセス回数に関連して値を増減することができる関数であれば、関数の型式に特に限定はない。なお、特定のインフォバブル内でのスコア値の平均値を、以下、バブル平均スコア(Average Info Bubble Scoreであり、以後、これを略してAvgInfoBubbleとする。)と定義し、インフォバブルの膨張・収縮判断のために提供する。 In the above formula (3), α is a real number of 0 <α <1. However, in the case of a decrease, the function type is not particularly limited as long as it is a smooth function in time series and can increase or decrease a value in relation to the number of accesses. The average score value in a specific info bubble is hereinafter defined as the bubble average score (Average Info Bubble Score, hereinafter abbreviated as AvgInfoBubble). Provide for judgment.

図６は、ユーザ関わり度ｇを使用して生成されるスコア関数ｆ（ｕ，Ｉ，Ｔ）の実施形態および、インフォバブルの膨張・収縮判断について説明した図である。図６に示すように、上記式（１）〜（３）を使用して、各時刻のスコア関数ｆ（ｕ，Ｉ，Ｔ）は直前の時刻からのタイムウィンドウに関連して計算される。図６に示したスコア・ポイント６１０は、各タイムウィンドウで計算されたスコア値を示すものである。各スコア・ポイントの間は、スプライン関数や各種補間関数などを使用してなめらかに接続されるように計算されてもよい。インフォバブル生成部２２０は、スコア値に対して、膨張しきい値および収縮しきい値を設定し、スコア値と各しきい値とを比較して、インフォバブル境界の膨張・収縮処理を実行する。 FIG. 6 is a diagram illustrating an embodiment of a score function f (u, I, T) generated using the user relation g and info bubble expansion / contraction determination. As shown in FIG. 6, using the above equations (1) to (3), the score function f (u, I, T) at each time is calculated in relation to the time window from the immediately preceding time. The score points 610 shown in FIG. 6 indicate the score values calculated in each time window. Each score point may be calculated so as to be smoothly connected using a spline function, various interpolation functions, or the like. The info bubble generation unit 220 sets an expansion threshold value and a contraction threshold value for the score value, compares the score value with each threshold value, and executes an expansion / contraction process of the info bubble boundary. .

インフォバブル境界の膨張とは、特定のインフォバブルを構成するインフォバブル・データに、ユーザ・リンクで隣接し、他のインフォバブルに含まれるか隣接するか、またはインフォバブルに含まれていないユーザを追加する処理を意味する。また、インフォバブルの収縮とは、膨張の逆の処理に対応し、インフォバブル・データに含まれる要素ユーザを、インフォバブル・データから削除する処理を意味する。例えば、図６に示したスコア値では、ポイント６２０では、インフォバブルの収縮処理を実行し、ポイント６３０では、インフォバブルの膨張処理を実行する。なお、膨張処理および収縮処理については、より詳細に後述する。 The expansion of an info bubble boundary refers to the info bubble data that constitutes a specific info bubble, which is adjacent to the user link by a user link, is included in another info bubble, is adjacent, or is not included in an info bubble. Means processing to be added. The contraction of the info bubble corresponds to a process opposite to the expansion, and means a process of deleting the element user included in the info bubble data from the info bubble data. For example, in the score value shown in FIG. 6, the info bubble contraction process is executed at point 620, and the info bubble expansion process is executed at point 630. The expansion process and the contraction process will be described later in detail.

図７は、本実施形態で生成されるインフォバブル・データ７００のデータ構造を示す。図７に示すインフォバブル・データ７００は、変数名と、当該変数に対して登録する変数のデータ内容とを含む構成として示す。図７に示すように、フィールド７１０は、変数名＝Ｉｎｆｏとしてインフォバブルを定義するための情報内容が登録される。この値としては、例えば特徴パラメータなどを使用することができる。また、フィールド７２０には、サンプリングした時刻を、例えばサンプリングについてのタイムウィンドウの終点のＴ_ｍ＋１の値を代表させるなどして、タイムスタンプとして使用する。 FIG. 7 shows a data structure of the info bubble data 700 generated in the present embodiment. The info bubble data 700 shown in FIG. 7 is shown as a configuration including a variable name and data contents of a variable registered for the variable. As shown in FIG. 7, in the field 710, information contents for defining an info bubble with variable name = Info are registered. As this value, for example, a characteristic parameter can be used. In the field 720, the sampled time is used as a time stamp, for example, by representing the value of T _{m + 1} at the end point of the time window for sampling.

また、フィールド７３０は、Ｐｅｏｐｌｅとして、インフォバブル７００に含まれるユーザ集合を、ユーザＩＤを列記する型式などで、登録する。さらに、フィールド７５０は、当該インフォバブルが含むユーザのユーザ関わり度の平均値AvgInfoScoreを登録するフィールドである。なお、AvgInfoScoreは、０≦AvgInfoScore≦１を満たす実数である。図７に示したインフォバブル・データ７００は、例えば、B(I，T，P，AIS)として固有に識別され、インフォデータ記憶部２６０に登録される。
＜セクション３：インフォバブル生成部の処理＞
以下、本実施形態の各処理について説明するが、使用するパラメータ値の定義および内容は、以下の通り、
○Ｉ＝Ｉｎｆｏの値
○Ｔ＝時刻・タイムスタンプの値
○Ｐ＝Ｐｅｏｐｌｅの値
○ＡＩＳ＝AvgInfoScoreの値
○膨張しきい値＝ExpandThreshold
○収縮しきい値＝ShrinkThreshold
○インフォバブル＝Ｂ
○インフォバブル集合＝ＩＢＳｅｔ
である。 In the field 730, a user set included in the info bubble 700 is registered as a People by a model that lists user IDs. Further, the field 750 is a field for registering the average value AvgInfoScore of the user involvement of the user included in the info bubble. AvgInfoScore is a real number that satisfies 0 ≦ AvgInfoScore ≦ 1. The info bubble data 700 shown in FIG. 7 is uniquely identified as B (I, T, P, AIS), for example, and registered in the info data storage unit 260.
<Section 3: Processing of Info Bubble Generation Unit>
Hereinafter, each process of the present embodiment will be described. Definitions and contents of parameter values to be used are as follows:
○ I = Info value ○ T = Time / time stamp value ○ P = People value ○ AIS = AvgInfoScore value ○ Expansion threshold = ExpandThreshold
○ Shrink threshold = ShrinkThreshold
○ Info Bubble = B
○ Info bubble set = IBSet
It is.

図８は、インフォバブル生成処理の実施形態のフローチャートである。なお、図８の処理では、時刻Ｔ_Ｎまでデータが登録されているものとする。図８の処理は、ステップＳ８００から開始し、ステップＳ８０１で、時刻のカウンタおよびＩＢＳｅｔを、それぞれｉ＝１、ＩＢＳｅｔ＝φ（空集合）として初期化する。ステップＳ８０２では、インフォバブルを初期化し、ステップＳ８０３で、時刻が時刻の最大値Ｔ_Ｎ以下であるか否かを判断する。時刻のカウンタ値がＮを超える場合（Ｎｏ）、処理をステップＳ８０６に分岐させて処理を終了させる。ステップＳ８０３で、ｉ≦Ｎの場合（Ｙｅｓ）、ステップＳ８０４で、時刻Ｔ_ｉのインフォバブルの生成処理を実行する。ステップＳ８０５では、時刻のカウンタｉをインクリメントし、処理をステップＳ８０３に戻し、時刻のカウンタがＮを超えるまで、インフォバブルの生成処理を反復させる。 FIG. 8 is a flowchart of an embodiment of the info bubble generation process. In the process of FIG. 8, it is assumed that data is registered until time _TN . The processing in FIG. 8 starts from step S800, and in step S801, the time counter and IBSet are initialized as i = 1 and IBSet = φ (empty set), respectively. In step S802, the info bubble is initialized, and in step S803, it is determined whether or not the time is equal to or less than the maximum time value _TN . If the time counter value exceeds N (No), the process branches to step S806 and the process is terminated. In step S803, when the i ≦ N (Yes), at step S804, the executing the generation processing of the info bubble time _{T i.} In step S805, the time counter i is incremented, the process returns to step S803, and the info bubble generation process is repeated until the time counter exceeds N.

図９は、図８のステップＳ８０２のインフォバブル初期化処理の実施形態についてのフローチャートである。処理は、ステップＳ９００から開始し、ステップＳ９０１で、処理対象のユーザがまだ残っているか否かを判断し、処理対象のユーザが残っていない場合（Ｎｏ）、処理をステップＳ９０５に分岐させる。 FIG. 9 is a flowchart of an embodiment of the info bubble initialization process in step S802 of FIG. The process starts from step S900. In step S901, it is determined whether or not a user to be processed still remains. If no user to be processed remains (No), the process branches to step S905.

ステップＳ９０１で、処理対象のユーザがまだ残っていると判断された場合（Ｙｅｓ）ステップＳ９０２で、未処理のユーザｕを選択し、スコア値ｆ（ｕ，Ｉ，Ｔ）を計算する。ステップＳ９０３では、スコア値ｆ（ｕ，Ｉ，Ｔ）が収縮しきい値ShrinkThresholdよりも大きいか否かを判断し、スコア値が収縮しきい値以下の場合（Ｎｏ）、処理をステップＳ９０１に戻し、処理を反復させる。 If it is determined in step S901 that there are still users to be processed (Yes), an unprocessed user u is selected in step S902, and a score value f (u, I, T) is calculated. In step S903, it is determined whether or not the score value f (u, I, T) is larger than the shrinkage threshold ShrinkThreshold. If the score value is equal to or smaller than the shrinkage threshold (No), the process returns to step S901. Repeat the process.

ステップＳ９０３で、スコア値が収縮しきい値よりも大きいと判断された場合（Ｙｅｓ）、ステップＳ９０４でＩＢＳｅｔの要素として、インフォバブル・データを登録する。ステップＳ９０５では、時刻Ｔ_１（つまり、時刻Ｔ_０からＴ₁までのタイムウィンドウ）のインフォバブル膨張処理を実行させ、ステップＳ９０６で、生成したインフォバブルのＩＢ識別値を修正して付け直しステップＳ９０７で処理を終了させる。 If it is determined in step S903 that the score value is larger than the contraction threshold (Yes), info bubble data is registered as an element of IBSet in step S904. In step S905, the info bubble expansion process at time T ₁ (that is, the time window from time T ₀ to T ₁ ) is executed. In step S906, the IB identification value of the generated info bubble is corrected and reattached. To end the process.

図１０は、タイムウィンドウＴ_ｉ（ｉ＞０）におけるインフォバブル生成処理の実施形態のフローチャートを示す。図１０の処理は、ステップＳ１０００から開始し、ステップＳ１００１で各Ｂ∈ＩＢＳｅｔに対するＡＩＳの再計算を実行する。ステップＳ１００２では、時刻Ｔ_ｉにおけるインフォバブル収縮処理を実行し、ステップＳ１００３では、時刻Ｔ_ｉ−１のインフォバブルに関わらないユーザ集合のインフォバブルを初期化する。ステップＳ１００４では、時刻Ｔ_ｉにおけるインフォバブル膨張処理を実行して、処理をステップＳ１００５で終了させる。 FIG. 10 shows a flowchart of an embodiment of the info bubble generation process in the time window T _i (i> 0). The process of FIG. 10 starts from step S1000, and recalculates the AIS for each BεIBSet in step S1001. In step S1002, an info bubble contraction process at time T _i is executed, and in step S1003, an info bubble of a user set not related to the info bubble at time T _i-1 is initialized. In step S1004, the info bubble expansion process at time T _i is executed, and the process ends in step S1005.

図１１は、時刻Ｔ_ｉでのインフォバブル膨張処理の実施形態についてのフローチャートを示す。図１１の処理は、ステップＳ１１００から開始し、ステップＳ１１０１で、インフォバブルが変更されたか否かを示す変更識別値ｉｓＣｈａｎｇｅｄ＝ｆａｌｓｅに設定する。ステップＳ１１０２は、統合可能なインフォバブルＢ１、Ｂ２がＩＢＳｅｔの要素として存在するか否かを判断するステップである。Ｓ１１０２の判断で統合可能なインフォバブルＢ１、Ｂ２がない場合には（Ｎｏ）、処理をステップＳ１１０４に分岐させる。 Figure 11 shows a flowchart of an embodiment of the info bubble expansion process at the time T _i. The processing in FIG. 11 starts from step S1100, and in step S1101, a change identification value isChanged = false indicating whether or not the info bubble has been changed. Step S1102 is a step of determining whether or not the info bubbles B1 and B2 that can be integrated exist as elements of the IBSet. If there is no info bubble B1, B2 that can be integrated in the determination in S1102 (No), the process branches to step S1104.

一方、ステップＳ１１０２で統合可能なＢ１、Ｂ２が存在する場合（Ｙｅｓ）、ステップＳ１１０３で、統合可能なＢ１、Ｂ２の組を１つ選択し、集合和を計算し、その和集合をＢ１、Ｂ２の値を削除したＩＢＳｅｔと統合し、新しいＩＢＳｅｔを得る。その後、変更識別値ｉｓＣｈａｎｇｅｄ＝ｔｒｕｅに変更する。 On the other hand, if B1 and B2 that can be integrated exist in step S1102 (Yes), one set of B1 and B2 that can be integrated is selected in step S1103, a set sum is calculated, and the union is calculated as B1 and B2. Is integrated with the deleted IBSet to obtain a new IBSet. After that, the change identification value isChanged = true.

ステップＳ１１０４では、追加可能なユーザｕとインフォバブルＢの組み合わせが存在するか否かを判断し、存在しない場合（Ｎｏ）、処理をステップＳ１１０６に分岐させ、変更識別値ｉｓＣｈａｎｇｅｄがｔｒｕｅであるか否かを判断し、ｉｓＣｈａｎｇｅｄ＝ｆａｌｓｅである場合（Ｎｏ）、処理をステップＳ１１０７で終了させる。一方、ステップＳ１１０６で変更識別値ｉｓＣｈａｎｇｅｄがｔｒｕｅである場合（Ｙｅｓ）、処理をステップＳ１１０１に分岐させ、統合可能なＢ１、Ｂ２がＩＢＳｅｔ内に存在しなくなるまで、処理を反復する。図１１の処理によって、時刻Ｔ_ｉに対応するインフォバブルが生成される。 In step S1104, it is determined whether there is a combination of user u and info bubble B that can be added. If there is no combination (No), the process branches to step S1106, and whether the change identification value isChanged is true. If isChanged = false (No), the process ends in step S1107. On the other hand, if the change identification value isChanged is true in step S1106 (Yes), the process branches to step S1101, and the process is repeated until there is no B1 and B2 that can be integrated in the IBSet. The info bubble corresponding to the time T _i is generated by the process of FIG.

なお、以下の、ステップＳ１１０３の統合処理およびステップＳ１１０５の追加処理の処理プロセスを説明する。 Note that the following processing process of the integration process in step S1103 and the additional process in step S1105 will be described.

[統合処理]
例えば、インフォバブルＢ１、Ｂ２が統合可能であるということは、ｕ∈Ｂ１、ｖ∈Ｂ２であり、ｕ∈Ｎ（ｖ）を満たすｕ、ｖが存在することと同値である。ここで、Ｎ（ｖ）とは、ユーザ・リンクを構成するユーザ集合の中からユーザｖと直接的に接続される、または隣接する、ユーザ集合のことである。統合処理は、インフォバブルB1(I，T，P1，AIS1)およびインフォバブルB2(I，T，P2，AIS2) に対し、merge(B1，B2)：=(I，T，P1∪P2，AIS)で指定されるインフォバブルを生成する処理に対応する。ここで、統合されたインフォバブルのＡＩＳについては重付き平均を使用して計算し、AIS：＝p1×AIS1+p2×AIS2，p1=|P1|/|P1∪P2|，p2=|P2|/|P1∪P2|で与えるものとする。 [Integration processing]
For example, the fact that info bubbles B1 and B2 can be integrated is equivalent to the existence of u and v satisfying uεN (v) because uεB1 and vεB2. Here, N (v) is a user set that is directly connected to or adjacent to the user v from among the user sets constituting the user link. Integration process is merge (B1, B2): = (I, T, P1∪P2, AIS) for info bubble B1 (I, T, P1, AIS1) and info bubble B2 (I, T, P2, AIS2) This corresponds to the process of generating the info bubble specified by). Here, the AIS of the integrated info bubble is calculated using a weighted average, and AIS: = p1 × AIS1 + p2 × AIS2, p1 = | P1 | / | P1∪P2 |, p2 = | P2 | / | P1∪P2 |.

[追加処理]
ユーザｕは、インフォバブルＢに追加可能であるとは、インフォバブルＢのＡＩＳがExpandThreshold値以上であって、かつｕがどのインフォバブルの要素ではなく、かつ、merge(B，{u})のＡＩＳがShrinkThresholdよりも大きいことを要件として実行される。 [Additional processing]
The user u can be added to the info bubble B when the info bubble B's AIS is equal to or greater than the ExpandThreshold value, u is not an info bubble element, and merge (B, {u}) It is implemented with the requirement that AIS is greater than ShrinkThreshold.

図１２は、図１０のステップＳ１００２で説明したインフォバブル収縮処理の実施形態についてのフローチャートを示す。図１２の処理は、ステップＳ１２００から開始し、ステップＳ１２０１で未処理のインフォバブルがまだ残っているか否かを判断し、残っていない場合（Ｎｏ）、処理をステップＳ１２０５に分岐させ、ＩＢＳｅｔを更新し、処理をステップＳ１２０６で終了させる。 FIG. 12 shows a flowchart of the embodiment of the info bubble contraction process described in step S1002 of FIG. The process in FIG. 12 starts from step S1200. In step S1201, it is determined whether or not an unprocessed info bubble still remains. If not (No), the process branches to step S1205 to update IBSet. Then, the process ends in step S1206.

一方、ステップＳ１２０１で未処理のインフォバブルが残っていると判断された場合（Ｙｅｓ）、ステップＳ１２０２で、未処理のインフォバブルＢをＩＢＳｅｔから選択し、ステップＳ１２０３で、インフォバブルＢのＡＩＳが収縮しきい値ShrinkThreshold以上か否かを判断する。ステップＳ１２０３で、インフォバブルＢのＡＩＳが収縮しきい値ShrinkThreshold以上ではないと判断された場合（Ｎｏ）、ステップＳ１２０４でＢに対して、ユーザの削除および分離を実行し、新に生成されたインフォバブルに処理済みのマークを付けて、ＩＢＳｅｔに登録する。 On the other hand, if it is determined in step S1201 that an unprocessed info bubble remains (Yes), an unprocessed info bubble B is selected from IBSet in step S1202, and the AIS of the info bubble B contracts in step S1203. It is determined whether or not the threshold value is equal to or greater than ShrinkThreshold. If it is determined in step S1203 that the AIS of the info bubble B is not greater than or equal to the shrinkage threshold ShrinkThreshold (No), the user is deleted and separated from B in step S1204, and the newly generated info Mark the bubble as processed and register it in the IBSet.

一方、ステップＳ１２０３で、インフォバブルＢのＡＩＳが収縮しきい値ShrinkThereshold以上と判断された場合（ｙｅｓ）、処理をステップＳ１２０１に分岐させ、未処理のインフォバブルがなくなるまで、処理を反復させる。 On the other hand, if it is determined in step S1203 that the AIS of the info bubble B is equal to or greater than the shrinkage threshold ShrinkThereshold (yes), the process branches to step S1201, and the process is repeated until there is no unprocessed info bubble.

以下、図１２のステップＳ１２０４の削除・分離処理についての処理プロセスを説明する。 Hereinafter, a processing process for the deletion / separation processing in step S1204 of FIG. 12 will be described.

[削除処理]
インフォバブルB(I，T，P，AIS)から、Ｐ∋ｕであって、ｆ（ｕ，Ｉ，Ｔ）が最も小さいｕを順に選択し、削除後のインフォバブルのAISの値がShrinkThreshold以上となるようにユーザｕを次々にインフォバブルＢから取り除く処理である。この処理は、削除された全ユーザのfが必ずShrinkThreshold以下で、かつ削除された後のインフォバブルＢのAISを必ずShrinkThreshold以上とする処理であり、細切れのインフォバブルの生成を最小化させるための処理である。 [Delete processing]
From info bubble B (I, T, P, AIS), select u that has P∋u and the smallest f (u, I, T), and AIS value of info bubble after deletion is equal to or greater than ShrinkThreshold In this process, the user u is removed from the info bubble B one after another. This process is a process in which f of all deleted users is always equal to or less than ShrinkThreshold, and the AIS of the deleted info bubble B is always equal to or greater than ShrinkThreshold, and is for minimizing the generation of shredded info bubbles. It is processing.

[分離処理]
削除を行ったインフォバブルB(I，T，P，AIS)が連結できない場合、インフォバブルＢの各統合・連結成分を新たにインフォバブルとして生成する処理である。Ｉ、Ｔ、その他のバブル情報は、元のインフォバブルＢから継承するが、Ｐと、ＡＩＳとを新しく計算して登録する。 [Separation process]
When the deleted info bubble B (I, T, P, AIS) cannot be connected, it is a process of newly generating each integrated / connected component of the info bubble B as an info bubble. I, T, and other bubble information are inherited from the original info bubble B, but P and AIS are newly calculated and registered.

図１３は、図１０〜図１２に説明したインフォバブルの膨張・収縮処理について、グラフ表現１３００を使用して説明する図である。図１３に示す膨張収縮処理では、インフォバブル１３１０は、インフォバブル１３１０をバブルＡ、バブルＢ、バブルＣを統合して生成される。統合前のバブルＡ、バブルＢ、バブルＣは、それぞれ破線で示したものであり、図１１の処理を使用して、統合が実行されている。また、インフォバブル１３２０は、ＡＩＳの値が膨張しきい値ExpansionThreshold以上の場合に、インフォバブル境界を拡張させ、インフォバブル１３２０を構成する要素ユーザをインフォバブル１３２０に追加登録することにより実行される。 FIG. 13 is a diagram illustrating the expansion / contraction process of the info bubble described with reference to FIGS. In the expansion / contraction process shown in FIG. 13, the info bubble 1310 is generated by integrating the info bubble 1310 with the bubble A, the bubble B, and the bubble C. Bubble A, bubble B, and bubble C before integration are indicated by broken lines, respectively, and integration is executed using the processing of FIG. Further, the info bubble 1320 is executed by expanding the info bubble boundary and additionally registering the element users constituting the info bubble 1320 in the info bubble 1320 when the AIS value is equal to or greater than the expansion threshold ExpansionThreshold.

また、インフォバブル１３３０は、ＡＩＳが収縮しきい値以下となって、インフォバブル１３３０が収縮される。この例では、ユーザ１３４０のスコア関数が同じインフォバブルに属するユーザの中で最小であるため、ユーザ１３４０がインフォバブル１３３０から削除されている。インフォバブルの収縮とユーザの削除を併用することにより、インフォバブル１３３０のＡＩＳがShrinkThreshold以上になるような、最適なサイズのインフォバブルが残ることになる。 Further, in the info bubble 1330, the AIS becomes equal to or smaller than the contraction threshold value, and the info bubble 1330 is contracted. In this example, since the score function of the user 1340 is the smallest among users belonging to the same info bubble, the user 1340 is deleted from the info bubble 1330. By using the shrinkage of the info bubble and the deletion of the user in combination, an info bubble of an optimal size remains such that the AIS of the info bubble 1330 is equal to or greater than ShrinkThreshold.

図１４は、インフォバブルを初期化した段階で与えられるインフォバブル１４１０、１４２０、１４３０、１４４０を、グラフ表現で示したものである。図１４で示した初期化では、収縮しきい値：ShrinkThreshold＝０．５５、膨張しきい値：ExpandThreshold＝０．７５を使用した。図１４に示すように、インフォバブル生成部２２０は、定義されたユーザ・リンクに含まれるユーザに対して最小単位のインフォバブルを割当てている。図１４でインフォバブルが割当てられているユーザは、スコア値ｆ（ｕ，Ｉ，Ｔ_０）が、設定以上の値を有するユーザであり、図１４に示した実施形態では、スコア値ｆ（ｕ，Ｉ，Ｔ_０）が、０．５５以上のユーザについてインフォバブルを割り当てている。 FIG. 14 is a graph representation of info bubbles 1410, 1420, 1430, and 1440 given at the stage of initializing the info bubble. In the initialization shown in FIG. 14, the shrinkage threshold: ShrinkThreshold = 0.55 and the expansion threshold: ExpandThreshold = 0.75 were used. As shown in FIG. 14, the info bubble generation unit 220 assigns a minimum unit info bubble to a user included in a defined user link. The user to whom the info bubble is assigned in FIG. 14 is a user whose score value f (u, I, T ₀ ) is greater than or equal to the set value. In the embodiment shown in FIG. 14, the score value f (u , I, T ₀ ) assigns info bubbles to users of 0.55 or more.

なお、初期にインフォバブルを割当てるための値は、適宜設定することができる。図１４でインフォバブルが割当てられていないユーザは、当該特徴パラメータを含む情報Ｉについてアクセスしていない、または、充分にアクセスしていないためである。初期化された図１４の状態から、インフォバブル生成部２２０は、インフォバブルを可能か限り膨張させる処理を実行する。これが、図９のステップＳ９０５が実行する時刻Ｔ_１での膨張処理に対応する。図ではユーザに対応するノード・節点には、ユーザを固有に識別するためのハンドルネームおよび当該ユーザのスコア値を併せて示す。 Note that a value for assigning an info bubble in the initial stage can be set as appropriate. This is because a user who is not assigned an info bubble in FIG. 14 does not access the information I including the characteristic parameter or does not access it sufficiently. From the initialized state of FIG. 14, the info bubble generation unit 220 executes a process of expanding info bubbles as much as possible. This corresponds to the expansion processing at the time T ₁ in which step S905 of FIG. 9 is executed. In the figure, the node / node corresponding to the user is shown together with a handle name for uniquely identifying the user and the score value of the user.

図１５は、図１４に示した初期膨張処理によって生成されたインフォバブルの実施形態を示す。図１５に示した実施形態では、インフォバブル１５１０は、ハンドルネーム＝Ｈａｒｒｙ（０．６１）のユーザのみを含んでおり、またインフォバブル１５２０には、ハンドルネーム＝Ｆｕｍｉｏ（０．５８）のユーザのみを含んでいる。そして、それぞれハンドルネーム＝Ｈａｒｒｙのインフォバブル１５１０は、ＩＢＳｅｔ集合において、識別値＝３として登録され、ハンドルネーム＝Ｆｕｍｉｏのインフォバブル１５２０は、識別値＝２として登録されている。 FIG. 15 shows an embodiment of an info bubble generated by the initial expansion process shown in FIG. In the embodiment shown in FIG. 15, the info bubble 1510 includes only the user with handle name = Harry (0.61), and the info bubble 1520 includes only the user with handle name = Fumio (0.58). Is included. The info bubble 1510 with handle name = Harry is registered as identification value = 3 in the IBSet set, and the info bubble 1520 with handle name = Fumio is registered as identification value = 2.

さらに、図１５を参照すると、ハンドルネーム＝Ｈａｒｒｙのスコア値がExpandThreshold＝０．７５より低いので、インフォバブル１５１０は、初期膨張処理では、変化しない。この状況は、ハンドルネーム＝Ｆｕｍｉｏ（０．５８）についても同様であるため、ハンドルネーム＝Ｆｕｍｉｏについてもインフォバブルは変化しない。 Further, referring to FIG. 15, since the score value of handle name = Harry is lower than ExpandThreshold = 0.75, the info bubble 1510 does not change in the initial expansion process. This situation is the same for handle name = Fumio (0.58), so the info bubble does not change for handle name = Fumio.

一方、ハンドルネーム＝Ｈｉｔｏｓｈｉ（０．７５）、ハンドルネーム＝Ａｋｉｋｏ（０．９８）については、それぞれのスコア値がExpandThreshold＝０．７５よりも大きい。初期設定時にはインフォバブル１４３０、１４４０がそれぞれ割当てられていたものである。初期膨張処理では、このインフォバブルが隣接するために、両方をまず統合する。統合後のインフォバブル１５３０の要素ユーザのスコア値の平均ＡＩＳが、膨張しきい値を超える限り、インフォバブルを統合するとともに、直隣接するユーザの中でスコア関数が一番高いユーザの追加の可能性を判断し、可能である場合、すなわち、追加後のＡＩＳの値が収縮しきい値以上である場合には、インフォバブルにユーザの追加を実行する。このため、図１４のインフォバブル１４３０、１４４０が統合され、さらに、ハンドルネーム＝Ａｋｉｋｏに隣接するハンドルネーム＝Ｉｓｓｅｉのユーザが統合されたインフォバブルに直隣接するユーザの中からスコア関数が最大であることと、ハンドルネーム＝Ｉｓｓｅｉのユーザを含めた場合でもＡＩＳ＝０．６８、つまり、ShrinkThresholdの０．５５以上であるので、統合後のインフォバブル１５３０には、ハンドルネーム＝Ｉｓｓｅｉが追加されている。 On the other hand, for handle name = Hitoshi (0.75) and handle name = Akiko (0.98), the respective score values are larger than ExpandThreshold = 0.75. Info bubbles 1430 and 1440 were assigned at the time of initial setting. In the initial expansion process, since the info bubbles are adjacent, both are first integrated. As long as the average AIS of the score values of the element users of the integrated info bubble 1530 exceeds the expansion threshold, it is possible to integrate the info bubbles and add the user with the highest score function among the immediately adjacent users If it is possible, that is, if the value of the added AIS is equal to or greater than the contraction threshold value, the user is added to the info bubble. For this reason, the info bubbles 1430 and 1440 in FIG. 14 are integrated, and the score function is the largest among the users immediately adjacent to the info bubble in which the user with the handle name = Issei adjacent to the handle name = Akiko is integrated. Even when a user with handle name = Issei is included, since AIS = 0.68, that is, ShrinkThreshold is 0.55 or more, handle name = Issei is added to the integrated info bubble 1530. .

その後、インフォバブル１５３０のＡＩＳがExpandThreshold＝０．７５よりも小さい（０．６８）値をとることから、ユーザ・リンク上で隣接してハンドルネーム＝Ｈｉｄｅｏ（０．２８）のユーザとハンドルネーム＝Ｒｉｓａ（０．０５）のユーザが存在しても、インフォバブル１５３０には追加されず、初期膨張処理は、インフォバブル１７３０の要素ユーザとして、ユーザ＝Ｈｉｔｏｓｈｉ、Ａｋｉｋｏ、Ｉｓｓｅｉを登録し、各インフォバブルの識別値を割当て直して、統合後のインフォバブル１５３０を、ＩＢＳｅｔにおいて、識別値＝１として登録する。 After that, since the AIS of the info bubble 1530 takes a value (0.68) smaller than ExpandThreshold = 0.75, the user and the handle name = Hide (0.28) are adjacent to each other on the user link. Even if there is a user of Risa (0.05), it is not added to the info bubble 1530, and the initial expansion process registers user = Hitoshi, Akiko, Issei as element users of the info bubble 1730, and each info bubble And the integrated information bubble 1530 is registered as identification value = 1 in the IBSet.

図１６は、インフォバブルの時間的進化に対応するインフォバブル生成処理１６００の実施形態を示す。図１６に示すように、時刻Ｔ_ｍで生成されたＩＢＳｅｔ１６１０には、インフォバブル１６２０およびインフォバブル１６３０が登録されている。図１６に示した実施形態では、ユーザ＝Ｈａｒｒｙが形成するインフォバブル１６２０は、識別値＝７で示される値が付され、インフォバブル１６３０には、識別値＝１．５が付されている。 FIG. 16 shows an embodiment of an info bubble generation process 1600 that corresponds to the temporal evolution of info bubbles. As shown in FIG. 16, the IBSet1610 generated at time _{T m,} Info bubble 1620 and Info bubbles 1630 are registered. In the embodiment shown in FIG. 16, the information bubble 1620 formed by the user = Harry is assigned a value indicated by the identification value = 7, and the information bubble 1630 is assigned the identification value = 1.5.

識別値１．５は、インフォバブルの時間的進化を追跡することを可能とするため、初期膨張処理で与えられた識別値を、時刻Ｔ_ｍごとに当該インフォバブルが存在している場合、統合された側のインフォバブルの識別値を昇順にピリオドで区切り、識別値＝Ｖ_０．Ｖ_１．Ｖ_２．・・・．Ｖ_ｍとして生成し、インフォバブルに割り当てて登録する。例えば、ＩＢＳｅｔ１６１０では、インフォバブル１６３０は、ユーザ＝Ｈｉｔｏｓｈｉがインフォバブルの識別値１から削除され、ユーザ＝Ｒｉｓａが追加されているものの、初期膨張時からインフォバブル１が存在し、図１６に示した実施形態での時刻Ｔ_ｍでは、識別値Ｖ_ｍ＝５に割当てられたインフォバブルと統合されることによって、新しくインフォバブルの識別値１．５ができたことを示している。 Identification value 1.5, in order to make it possible to track the temporal evolution of the info bubble, if the identification value given in the initial expansion process, the info bubble is present for each time T _m, integrated The identification values of the information bubbles on the side that have been assigned are separated by periods in ascending order, and the identification value = V ₀ . V ₁ . V ₂ . .... V _m is generated and assigned to the info bubble and registered. For example, in the IBSet 1610, the info bubble 1630 has the user = Hitoshi deleted from the info bubble identification value 1 and the user = Risa added, but the info bubble 1 exists from the initial expansion, and is shown in FIG. at time T _m of a in the embodiment, shows that by being integrated with the info bubble assigned to identification value V _{m =} 5, could identification value 1.5 of new info bubble.

ここで、さらに時間が経過して、時刻Ｔ_ｍ＋１について、インフォバブル生成部２２０がインフォバブルの生成処理を実行した結果がＩＢＳｅｔ１６５０で示されている。ＩＢＳｅｔ１６５０には、さらに複数のインフォバブルが登録されており、ＩＢＳｅｔ１６１０で生成されたインフォバブル７がさらに拡大してインフォバブル１６６０として生成され、その識別値＝７．１０が割り当てられている。なお、インフォバブル１６６０は、膨張処理前には、インフォバブル１０およびインフォバブル７として登録されていたものが統合処理によって、生成されたものである。 Here, IBSet 1650 shows the result of the info bubble generation unit 220 executing the info bubble generation process at time T _{m + 1} after further elapse of time. A plurality of info bubbles are further registered in the IBSet 1650, and the info bubble 7 generated in the IBSet 1610 is further expanded to be generated as an info bubble 1660, and the identification value = 7.10 is assigned. In addition, the information bubble 1660 was generated by the integration process that was registered as the information bubble 10 and the information bubble 7 before the expansion process.

一方、ＩＢＳｅｔ１６５０には、さらに他のインフォバブル１６９０も登録される。インフォバブル１６９０は、初期のインフォバブルである識別値＝１から進化したものであり、その識別値＝１．５．１３であり、それ以前には、識別値＝１．５として参照されていたインフォバブル１６３０とインフォバブル１６９５（識別値＝１３）とが統合され生成されたものである。図１６に示した実施形態では、時刻Ｔ_ｍから時刻Ｔ_ｍ＋１の間に、ユーザ＝Ｈａｒｒｙを起源とするインフォバブルが拡大し、一方、ユーザ＝Ｆｕｍｉｏを起源とするインフォバブルが再度、ユーザ＝Ｈｉｔｏｓｈｉのインフォバブル１６９５を吸収して拡大して行くのが示されている。以上のように、時刻ごとにインフォバブルを生成することにより、初期に生成したインフォバブルの時間的進化を追跡することが可能となり、特定の特徴パラメータを含む情報のユーザ間での共有または関心の拡大または縮小を追跡することが可能となる。 On the other hand, another info bubble 1690 is also registered in the IBSet 1650. The info bubble 1690 evolved from the initial info bubble identification value = 1, the identification value = 1.5.13, and before that, it was referred to as the identification value = 1.5. The info bubble 1630 and the info bubble 1695 (identification value = 13) are integrated and generated. In the embodiment shown in FIG. 16, between time T _m and time T _{m + 1} , the info bubble originating from the user = Hary expands, while the info bubble originating from the user = Fumio again becomes the user = Hitoshi. The information bubble 1695 is absorbed and expanded. As described above, by generating an info bubble for each time, it is possible to track the temporal evolution of the initially generated info bubble, and information including specific feature parameters can be shared between users or It becomes possible to track enlargement or reduction.

したがって、例えばＳＮＳなどで特徴パラメータを有する情報についての情報伝播が、どのユーザを起源とするものであるかを追跡することが可能となる。また、特徴パラメータに関連する情報についての時間発展を追跡できることから、特定時刻の直後から、情報処理装置１２２に現にアクセスしているユーザに対するバナー広告などの表示制御や広告内容などの選択にフィードバックすることが可能となる。 Therefore, for example, it is possible to track which user originates information propagation for information having characteristic parameters in SNS or the like. In addition, since it is possible to track the time development of information related to the characteristic parameter, the feedback to the display control such as the banner advertisement and the selection of the advertisement content for the user who is currently accessing the information processing apparatus 122 is fed back immediately after the specific time. It becomes possible.

＜インフォバブルを使用したネットワーク行動解析＞
以上説明したインフォバブル生成処理は、ユーザ・リンクを使用し、特徴パラメータに関心を有するユーザ集合を生成するものである。このことは、逆に、特徴パラメータに関心を有するユーザ集合を構成するユーザのユーザ属性と特徴パラメータとを対応付け、ユーザ属性ごとにクラスタリングすることを可能とする。クラスタリングのために、インフォバブル生成に使用しないユーザの属性（例えば年齢、職業、趣味など）を用いることができ、その類似度を見ることにより，ある情報に関するインフォバブルのユーザが、そのユーザ属性に関連する多様性、分布、性向があるかを分析することができる。 <Network behavior analysis using info bubbles>
The info bubble generation process described above uses a user link to generate a user set that is interested in feature parameters. On the contrary, this makes it possible to associate the user attributes of the users constituting the user set interested in the feature parameters with the feature parameters and perform clustering for each user attribute. For clustering, user attributes that are not used for generating information bubbles (for example, age, occupation, hobbies, etc.) can be used. Analyzes whether there is diversity, distribution and propensity related.

図１７は、本実施形態で、インフォバブルをクラスタリングする処理の実施形態を示す。時刻Ｔで生成されたユーザ・リンク１７１０内には、特徴パラメータに関連して複数のインフォバブル、例えばインフォバブル１７２０〜インフォバブル１７４０が生成されている。ユーザ・リンク１７１０を対象とし、インフォバブルを構成するユーザのユーザ属性の類似性を使用して、インフォバブルをクラスタ・リストなどとして登録することにより、インフォバブル１７２０〜１７４０をユーザ属性の類似性に基づいてクラスタリングすることができる。ユーザ属性の類似性を判定するためには、ユーザ属性について属性類似リストを提供し、属性類似リストの同一レコードに分類されるユーザ属性を類似として判断することが好ましい。下記表１に、属性類似リストの実施形態を示す。下記表１の属性類似リストのエントリ項目は、列の属性それぞれを使用して類似判断に利用することもできるし、複数列の属性を組み合わせて、より詳細な類似判断を行うことができる。 FIG. 17 shows an embodiment of a process for clustering info bubbles in this embodiment. In the user link 1710 generated at time T, a plurality of info bubbles, for example, info bubble 1720 to info bubble 1740, are generated in association with the feature parameter. By registering the info bubble as a cluster list or the like by using the similarity of the user attributes of the users constituting the info bubble for the user link 1710, the info bubbles 1720 to 1740 are changed to the similarity of the user attributes. Clustering can be performed based on this. In order to determine the similarity of user attributes, it is preferable to provide an attribute similarity list for the user attributes and determine that the user attributes classified in the same record of the attribute similarity list are similar. Table 1 below shows an embodiment of the attribute similarity list. The entry items of the attribute similarity list in Table 1 can be used for similarity determination using each column attribute, or more detailed similarity determination can be performed by combining attributes of a plurality of columns.

クラスタリング処理は、情報処理装置１２２のネットワーク行動分析部２４０の特定モジュールとして構成することができ、適切なＡＰＩ２３０を介して、インフォバブル・データを取得し、登録されたユーザのユーザ属性を上記表１にマッピングして、ユーザ・リンク１７５０に対してインフォバブル１７２０〜１７４０を含むクラスタ１７６０、１７７０、１７８０などを、ユーザ属性に対応付けて生成する。さらに、タイムウィンドウごとにクラスタリング処理を実行することで、クラスタの時間進化を分析することができる。以下、クラスタリング処理についてさらに詳細に説明する。 The clustering process can be configured as a specific module of the network behavior analysis unit 240 of the information processing apparatus 122. Info bubble data is acquired via an appropriate API 230, and user attributes of registered users are listed in Table 1 above. And the clusters 1760, 1770, and 1780 including the info bubbles 1720 to 1740 for the user link 1750 are generated in association with the user attributes. Furthermore, the time evolution of the cluster can be analyzed by executing the clustering process for each time window. Hereinafter, the clustering process will be described in more detail.

[クラスタ生成処理]
クラスタ生成処理は、種々の方法で行うことができ、例えばＫ−ｍｅａｎｓや
ａｇｇｌｏｍｅｒａｔｉｖｅｃｌｕｓｔｅｒｉｎｇなど既存手法のうち、いかなるものでも用いることができる。クラスタ生成処理では、入力は、時刻Ｔにおけるインフォバブルの集合ＩＢＳｅｔの要素集合：IB(T)＝{IB1, IB2, …, IBm }および実装するクラスタリング・アルゴリズムに与えるパラメータ、例えばＫ−ｍｅａｎｓの場合には、クラスタの個数ｋである。また、出力は、例えばK-means
の場合は、IB(T)の分割IB(T)1，IB(T)2，…， IB(T)kであって、IB(T)i ∩IB(T)j＝φ(if i ≠ j)、∪_i=1 ^k
IB(T)i＝IB(T)の条件を満たすものである。また、ｉ，ｊに対して IB(T)iとIB(T)jの類似度を出力しても良い。 [Cluster generation processing]
The cluster generation process can be performed by various methods. For example, any of existing methods such as K-means and aggregate clustering can be used. In the cluster generation process, the input is an element set of an info bubble set IBSet at time T: IB (T) = {IB1, IB2,..., IBm} and a parameter given to the clustering algorithm to be implemented, for example, K-means Is the number of clusters k. The output is, for example, K-means
IB (T) is divided into IB (T) 1, IB (T) 2, ..., IB (T) k, and IB (T) i ∩IB (T) j = φ (if i ≠ j), ∪ _{i = 1} ^k
The condition of IB (T) i = IB (T) is satisfied. Also, the similarity between IB (T) i and IB (T) j may be output for i and j.

また、本実施形態では、クラスタ生成は、サンプリング周期ごとの時刻Ｔ_ｍについて生成される。このため、時刻Ｔ_ｍ（ｍ≧０の整数）ごとに生成されたクラスタの類似度を使用してネットワーク行動分析を行うことが可能となる。図１８は、生成されたクラスタの異なる時刻Ｔ_ｍおよびＴ_ｍ＋１の間におけるクラスタの類似判断処理の概略図である。 In the present embodiment, the cluster generation is performed for the time T _m for each sampling period. For this reason, it becomes possible to perform network behavior analysis using the similarity of the clusters generated at each time T _m (m ≧ 0). FIG. 18 is a schematic diagram of cluster similarity determination processing between different times T _m and T _{m + 1} of the generated cluster.

図１８に示す実施形態に示すように、時刻Ｔ_ｍに対応するユーザ・リンク１８１０内には、クラスタ１８３０、１８４０などが生成されている。その後時間が経過し、時刻Ｔ_ｍ＋１に対応する時点では、ユーザ・リンク１８２０内には、クラスタ１８３０、１８５０などが生成されている。図１８に示した実施形態では、クラスタ１８３０は、そのまま生存しているものの、クラスタ１８４０は、他のクラスタを吸収してクラスタ１８５０として生成されている。ネットワーク行動解析部２４０は、上述した時刻Ｔ_ｍ、Ｔ_ｍ＋１において生成された各クラスタについてクラスタ間の対応付けを行う。 As shown in the embodiment shown in FIG. 18, the user links 1810 corresponding to the time _{T m,} and cluster 1830, 1840 are generated. Thereafter, when the time elapses and the time corresponding to the time T _{m + 1} , clusters 1830, 1850 and the like are generated in the user link 1820. In the embodiment shown in FIG. 18, the cluster 1830 remains alive, but the cluster 1840 is generated as a cluster 1850 by absorbing other clusters. The network behavior analysis unit 240 associates the clusters with respect to each cluster generated at the above-described times T _m and T _{m + 1} .

クラスタの類似度判断処理は、類似度１８６０を使用して計算することができる。図１８に示すように、類似度１８６０は、本実施形態では、クラスタＸおよびクラスタＹに帰属されるユーザの共通性を使用して計算することが好ましい。より具体的には、類似度１８６０は、クラスタＸに含まれるユーザと、クラスタＹに含まれるユーザの要素数を重複を排除して合計する。そしてクラスタＸおよびクラスタＹのユーザ集合の重複要素数を合計し、Ｓ_ｍ＋１＝（重複要素数）／（クラスタＸとクラスタＹの重複を除いた要素数）として、類似度を計算する。図１８に示した実施形態では、クラスタＸと、クラスタＹとの間の類似度Ｓ＝０．４が与えられる。クラスタが時間進化により分裂した場合や統合された場合は、クラスタの対応付けは、各クラスタが含むインフォバブルの識別子をトレースし、クラスタ間の対応付けを行ない、類似度を計算することができる。なお、ここに示した以外の、クラスタ間に定義される任意の類似度を用いることもできる。 The cluster similarity determination process can be calculated using the similarity 1860. As shown in FIG. 18, the similarity 1860 is preferably calculated using the commonality of users belonging to the cluster X and the cluster Y in this embodiment. More specifically, the similarity 1860 is obtained by summing up the number of elements of the users included in the cluster X and the users included in the cluster Y without duplication. Then, the number of overlapping elements of the user sets of the cluster X and the cluster Y is summed, and the similarity is calculated as S _{m + 1} = (number of overlapping elements) / (number of elements excluding the overlapping of the cluster X and the cluster Y). In the embodiment shown in FIG. 18, the similarity S = 0.4 between the cluster X and the cluster Y is given. When clusters are divided or integrated due to time evolution, cluster association can trace the info bubble identifiers included in each cluster, perform association between clusters, and calculate the degree of similarity. Note that any degree of similarity defined between clusters other than those shown here can also be used.

図１９は、クラスタの類似度計算の実施形態についてのフローチャートを示す。処理は、ステップＳ１９００から開始し、ステップＳ１９０１で、カウンタｉをｉ＝０に初期設定する。ステップＳ１９０２では、時刻Ｔ_ｉのインフォバブルを生成し、ステップＳ１９０３で、ｉ＜Ｎであるか否かを判断し、ｉ＜Ｎでない場合（Ｎｏ）、次にクラスタ計算するべきインフォバブルがまだ生成されていないので、ステップＳ１９０７に分岐させて処理を終了させる。 FIG. 19 shows a flowchart for an embodiment of cluster similarity calculation. The process starts from step S1900, and in step S1901, a counter i is initialized to i = 0. In step S1902, an info bubble at time T _i is generated. In step S1903, whether i <N is determined. If i <N is not satisfied (No), an info bubble to be clustered next is still generated. Since it has not been done, the process branches to step S1907 to end the process.

ステップＳ１９０３で、ｉ＜Ｎであると判断された場合（Ｙｅｓ）、ステップＳ１９０４で、時刻Ｔ_ｉ＋１のインフォバブルを生成する。その後、ステップＳ１９０５では、Ｔ_ｉとＴ_ｉ＋１のクラスタについて類似度を計算し、適切な記憶領域に類似度を保存する。ステップＳ１９０６では、カウンタｉを、ｉ＝ｉ＋１としてインクリメントし、処理をステップＳ１９０３に戻し、ステップＳ１９０３の判断が否定的な値を返すまで、処理を反復させ、クラスタの時間進化に対応する類似度シーケンスを生成する。生成した類似度シーケンスは、インフォバブル・データ、クラスタ・データなどとともにネットワーク行動分析結果出力部２５０に送られて、以後の処理において利用される。 If it is determined in step S1903 that i <N (Yes), an info bubble at time T _{i + 1} is generated in step S1904. After that, in step S1905, the similarity is calculated for the clusters of T _i and T _{i + 1} , and the similarity is stored in an appropriate storage area. In step S 1906, the counter i is incremented as i = i + 1, the process returns to step S 1903, and the process is repeated until the determination in step S 1903 returns a negative value, and the similarity sequence corresponding to the time evolution of the cluster Is generated. The generated similarity sequence is sent to the network behavior analysis result output unit 250 together with info bubble data, cluster data, etc., and used in the subsequent processing.

図２０は、図１９に示したステップＳ１９０５の類似度計算処理の実施形態のフローチャートである。図２０の処理は、ステップＳ２０００から開始し、ステップＳ２００１で、時刻Ｔ_ｉのクラスタがまだ残っているか否かを判断し、残されていない場合（Ｎｏ）には、ステップＳ２００８に処理を分岐させて処理を終了させる。 FIG. 20 is a flowchart of an embodiment of the similarity calculation process in step S1905 shown in FIG. The process in FIG. 20 starts from step S2000. In step S2001, it is determined whether or not the cluster at time T _i still remains. If not (No), the process branches to step S2008. To end the process.

一方、ステップＳ２００１で、未処理のクラスタが残っている場合（Ｙｅｓ）、ステップＳ２００２で未処理の時刻Ｔ_ｉのクラスタＸを選択しＳｃｏｒｅを０に初期化し、Ｙ（Ｘ）を空集合に初期化する。ステップＳ２００３で時刻Ｔ_ｉ＋１のクラスタＸに対応付けられたクラスタが残っているか否かを判断し、残っている場合（Ｙｅｓ）、ステップＳ２００４で未処理の時刻Ｔ_ｉ＋１のクラスタＹを選択し、類似度スコアＳ（Ｘ、Ｙ）を計算する。ステップＳ２００５では、Ｓｃｏｒｅ＜Ｓ（Ｘ、Ｙ）か否かを判断し、Ｓ（Ｘ、Ｙ）がＳｃｏｒｅを超える場合（Ｙｅｓ）、ステップＳ２００６で、Ｓｃｏｒｅ＝Ｓ（Ｘ、Ｙ）に設定し、Ｙ（Ｘ）＝Ｙとする。 On the other hand, if an unprocessed cluster remains in step S2001 (Yes), an unprocessed cluster X at time T _i is selected in step S2002, Score is initialized to 0, and Y (X) is initialized to an empty set. Turn into. In step S2003, it is determined whether or not there is a cluster associated with the cluster X at time T _{i + 1.} If so (Yes), an unprocessed cluster Y at time T _{i + 1} is selected in step S2004 and similar. A degree score S (X, Y) is calculated. In step S2005, it is determined whether or not Score <S (X, Y). If S (X, Y) exceeds Score (Yes), in step S2006, Score = S (X, Y) is set. Let Y (X) = Y.

その後、処理をステップＳ２００３に戻し、クラスタＸに対応付けされるクラスタＹが存在しなくなるまで（ステップＳ２００３でＮｏ）、処理を反復させ、ステップＳ２００３の判断が否定的な結果を返す場合（Ｎｏ）、処理をステップＳ２００７に分岐させ、クラスタＸに対して最も高いＳｃｏｒｅの値を返すクラスタＹ（Ｘ）を最も類似度の高いクラスタとして出力する。その後、処理をステップＳ２００１に戻し、時刻Ｔ_ｉのクラスタＸの全部について類似度計算を終了させる。 Thereafter, the process returns to step S2003, and the process is repeated until the cluster Y associated with the cluster X does not exist (No in step S2003), and the determination in step S2003 returns a negative result (No). Then, the process is branched to step S2007, and the cluster Y (X) that returns the highest Score value for the cluster X is output as the cluster having the highest similarity. Thereafter, the process returns to step S2001, and the similarity calculation is completed for all the clusters X at time T _i .

以上の処理を使用することにより、特徴パラメータを含む情報のユーザ・リンク内での情報伝播を追跡することが可能となるとともに、タイムウィンドウごとに生成されるインフォバブルのユーザ集合を解析することによって、特徴パラメータに関連したユーザのネットワーク行動を、特定のユーザについてのミクロな追跡ではなく、マクロな追跡を使用して解析することが可能となる。 By using the above processing, it is possible to track information propagation within the user link of information including feature parameters, and by analyzing the user set of info bubbles generated for each time window The user's network behavior associated with the feature parameters can be analyzed using macro tracking rather than micro tracking for a particular user.

さらにインフォバブルのクラスタ化を可能とするので、インフォバブルを構成するユーザ集団の特性を分析することが可能となり、ネットワーク行動解析の結果表示、バナー広告制御、流行予測などのソリューション、または情報伝播分析などのために利用することが可能となる。 In addition, info bubbles can be clustered, so it is possible to analyze the characteristics of the user groups that make up the info bubble, display network behavior analysis results, banner advertisement control, trend prediction solutions, or information propagation analysis. It can be used for such purposes.

本実施形態の上記機能は、Ｃ＋＋、Ｊａｖａ（登録商標）、Ｊａｖａ（登録商標）Ｂｅａｎｓ、Ｊａｖａ（登録商標）Ａｐｐｌｅｔ、Ｊａｖａ（登録商標）Ｓｃｒｉｐｔ、Ｐｅｒｌ、Ｒｕｂｙなどのオブジェクト指向プログラミング言語などで記述された装置実行可能なプログラムにより実現でき、当該プログラムは、ハードディスク装置、ＣＤ−ＲＯＭ、ＭＯ、フレキシブルディスク、ＥＥＰＲＯＭ、ＥＰＲＯＭなどの装置可読な記録媒体に格納して頒布することができ、また他装置が可能な形式でネットワークを介して伝送することができる。 The functions of this embodiment are described in an object-oriented programming language such as C ++, Java (registered trademark), Java (registered trademark) Beans, Java (registered trademark) Applet, Java (registered trademark) Script, Perl, and Ruby. The program can be realized by a program executable by the apparatus, and the program can be stored in a device-readable recording medium such as a hard disk device, CD-ROM, MO, flexible disk, EEPROM, EPROM, and distributed. It can be transmitted over the network in a possible format.

これまで本実施形態につき説明してきたが、本発明は、上述した実施形態に限定されるものではなく、他の実施形態、追加、変更、削除など、当業者が想到することができる範囲内で変更することができ、いずれの態様においても本発明の作用・効果を奏する限り、本発明の範囲に含まれるものである。 Although the present embodiment has been described so far, the present invention is not limited to the above-described embodiment, and other embodiments, additions, changes, deletions, and the like can be conceived by those skilled in the art. It can be changed, and any aspect is within the scope of the present invention as long as the effects and effects of the present invention are exhibited.

本実施形態の分析システム１００の実施形態を示した図。The figure which showed embodiment of the analysis system 100 of this embodiment. 図１に示した情報処理装置１２２の機能ブロックを示した図。The figure which showed the functional block of the information processing apparatus 122 shown in FIG. 本実施形態のインフォバブル３００の例示的な実施形態を示した図。The figure which showed exemplary embodiment of the info bubble 300 of this embodiment. 行動間トリックのデータ構成およびその生成処理を示した概略図。Schematic which showed the data structure of the trick between actions, and its production | generation process. 本実施形態のインフォバブルの時間進化を説明する説明図。Explanatory drawing explaining the time evolution of the info bubble of this embodiment. ユーザ関わり度ｇを使用して生成されるスコア関数ｆ（ｕ，Ｉ，Ｔ）の実施形態および、インフォバブルの膨張・収縮判断について説明した図。The figure explaining embodiment of the score function f (u, I, T) produced | generated using the user involvement degree g, and expansion / contraction determination of an info bubble. 本実施形態で生成されるインフォバブル・データ７００のデータ構造を示した図。The figure which showed the data structure of the info bubble data 700 produced | generated by this embodiment. インフォバブル生成処理の実施形態のフローチャート。The flowchart of embodiment of an info bubble production | generation process. 図８のステップＳ８０２のインフォバブル初期化処理の実施形態についてのフローチャート。The flowchart about embodiment of the info bubble initialization process of FIG.8 S802. タイムウィンドウＴ_ｉ（ｉ＞０）におけるインフォバブル生成処理の実施形態のフローチャート。The flowchart of embodiment of the info bubble production | generation process in time window _Ti (i> 0). タイムウィンドウＴ_ｉでのインフォバブル膨張処理の実施形態についてのフローチャート。Flowchart of an embodiment of the info bubble expansion process in the time window T _i. 図１０のステップＳ１００２で説明したインフォバブル収縮処理の実施形態についてのフローチャート。The flowchart about embodiment of the info bubble contraction process demonstrated by step S1002 of FIG. 図１０〜図１２に説明したインフォバブルの膨張・収縮処理について、グラフ表現１３００を使用して説明する図FIG. 10 is a diagram for explaining the expansion / contraction process of the info bubble described in FIGS. インフォバブルを初期化した段階で与えられるインフォバブル１４１０、１４２０、１４３０、１４４０のグラフ表現。Graphic representation of info bubbles 1410, 1420, 1430, 1440 given at the stage of initializing info bubbles. 図１４に示した初期膨張処理によって生成されたインフォバブルの実施形態を示した図。The figure which showed embodiment of the info bubble produced | generated by the initial expansion process shown in FIG. インフォバブルの時間的進化に対応するインフォバブル生成処理１６００の実施形態を示した図。The figure which showed embodiment of the info bubble production | generation process 1600 corresponding to the temporal evolution of an info bubble. 本実施形態で、インフォバブルをクラスタリングする処理の実施形態を示した図。The figure which showed embodiment of the process which clusters an info bubble in this embodiment. 生成されたクラスタの異なるタイムウィンドウＴ_ｍおよびＴ_ｍ＋１の間におけるクラスタの類似判断処理の概略図Schematic diagram of cluster similarity determination process between different time windows T _m and T _{m + 1} of the generated cluster クラスタの類似度計算の実施形態についてのフローチャート。The flowchart about embodiment of the similarity calculation of a cluster. 図１９に示したステップＳ１９０５の類似度計算処理の実施形態のフローチャート。The flowchart of embodiment of the similarity calculation process of step S1905 shown in FIG.

Explanation of symbols

１００…分析システム、１１２、１１４…ユーザ・コンピュータ、１１６…ネットワーク、１２０…サーバ機能部、１２２…情報処理装置、１２４…ネットワーク・データ記憶部、１２６…テキスト・データ記憶部、１２８…行動履歴記憶部、１３０…ユーザ情報記憶部、２００…機能ブロック（情報処理装置）、２１０…パラメータ制御部、２２０…インフォバブル生成部、２３０…ＡＰＩ、２４０…ネットワーク行動分析部、２５０…ネットワーク行動分析結果出力部、２６０…インフォバブル記憶部、２７０…アプリケーション提供部、２８０…ネットワーク・アダプタ DESCRIPTION OF SYMBOLS 100 ... Analysis system, 112, 114 ... User computer, 116 ... Network, 120 ... Server function part, 122 ... Information processing apparatus, 124 ... Network data storage part, 126 ... Text data storage part, 128 ... Action history storage , 130 ... user information storage unit, 200 ... functional block (information processing device), 210 ... parameter control unit, 220 ... info bubble generation unit, 230 ... API, 240 ... network behavior analysis unit, 250 ... network behavior analysis result output 260, Info bubble storage unit, 270 ... Application providing unit, 280 ... Network adapter

Claims

An information processing apparatus for analyzing network behavior, wherein the information processing apparatus includes:
An application providing unit that receives access to information from a user computer via a network and generates the network behavior for the information to the user computer;
An action history storage unit for storing an action history of the network action;
Information including feature parameters is extracted, network behavior is acquired for the extracted information, and the degree of user involvement in the network for the extracted information is obtained from a user link user having the user computer as a node. Registering at least one user in an info bubble that is a characterized user group, and generating an info bubble;
To read the info bubble, to analyze the network behavior of the user, look at including a network behavior analysis unit,
The info bubble generation unit cumulatively calculates a score value calculated according to the degree of user involvement for each sampling period in which the info bubble is set, and expands the info bubble generated immediately before or the info bubble generated immediately before An information processing apparatus that determines whether to contract and generates an info bubble sequence including a plurality of different users in time series.

The information processing apparatus according to claim 1 , wherein the info bubble generation unit determines expansion or contraction of the info bubble with reference to an expansion threshold value and a contraction threshold value set in the score value.

The information bubble generation unit, based on the fact that the bubble average score after expansion does not become the contraction threshold or less when the adjacent user of the user link is included in the information bubble generated immediately before The information processing apparatus according to claim 2 , wherein the info bubble is expanded so as to include the information bubble.

The info bubble generation unit selects the user with the smallest score value included in the immediately generated info bubble, and deletes the user from the info bubble until the bubble average score exceeds the contraction threshold value. The information processing apparatus according to claim 3 , wherein the info bubble is contracted.

The network behavior analysis unit reads the info bubble specified by an identification value, analyzes information propagation of the information including the feature parameter using time evolution of the info bubble, and further reads the plurality of the read The information processing apparatus according to claim 4 , wherein a cluster of the info bubbles is generated using similarity of user attributes of the info bubble, and a user who forms the cluster is analyzed.

An analysis system including an information processing apparatus for analyzing network behavior between a user and a computer, wherein the information processing apparatus includes:
An application providing unit that receives access to information from a user computer via a network and generates the network behavior for the information to the user computer;
An action history storage unit for storing an action history of the network action;
Information including feature parameters is extracted, network behavior is acquired for the extracted information, and the degree of user involvement in the network for the extracted information is obtained from a user link user having the user computer as a node. An at least one user is registered in an info bubble, which is a characterized user group, and an info bubble is generated,
To read the info bubble, to analyze the network behavior of the user, look at including a network behavior analysis unit,
The info bubble generation unit cumulatively calculates a score value calculated according to the degree of user involvement for each sampling period in which the info bubble is set, and expands the info bubble generated immediately before or the info bubble generated immediately before An analysis system that determines whether to contract and generates an info bubble sequence that includes multiple different users over time.

The analysis system according to claim 6 , wherein the info bubble generation unit determines expansion or contraction of the info bubble with reference to an expansion threshold value and a contraction threshold value set in the score value.

An analysis method executed by an information processing apparatus for network behavior of a user computer, wherein the analysis method includes:
Receiving access to information from the user computer over a network and generating the network behavior for the information to the user computer;
Storing an action history of the network action;
Information including feature parameters is extracted, network behavior for the extracted information is acquired, and the degree of user involvement in the network for the extracted information is calculated from a user link user whose node is the user computer. Steps,
Generate a score value by accumulatively calculating the degree of user involvement, expand the info bubble that is the characterized user group generated immediately before referring to the score value, or contract the info bubble generated immediately before A step of determining whether to
Generating an info bubble sequence including a plurality of different users in time series by using or adding at least one user to or removing from the info bubble;
Reading the info bubble and analyzing the network behavior of the user.

The step of generating the info bubble sequence is based on the fact that the bubble average score after expansion does not become less than or equal to the contraction threshold when an adjacent user of the user link is included in the immediately generated info bubble. 9. The method of claim 8 , comprising inflating the info bubble to include the neighboring user as.

The step of generating the info bubble sequence selects a user having the smallest score value included in the immediately generated info bubble, and removes the user from the info bubble until the bubble average score exceeds the contraction threshold value. The method of claim 9 , comprising shrinking the info bubble by deleting.

The step of analyzing the network behavior includes reading the info bubble specified by an identification value, and analyzing information propagation of the information including the feature parameter using temporal evolution of the info bubble. 10. The method according to 10 .

The step of analyzing the network behavior includes generating a cluster of the info bubbles using similarity of user attributes of the plurality of read info bubbles, and analyzing users who form the cluster. 11. The method according to 11 .

An information processing apparatus executable program for executing an analysis method for network behavior of a user computer, the program comprising:
Means for accepting access to information from the user computer over a network and generating the network behavior for the information to the user computer;
Means for storing an action history of the network action;
Information including feature parameters is extracted, network behavior for the extracted information is acquired, and the degree of user involvement in the network for the extracted information is calculated from a user link user whose node is the user computer. Means,
Cumulative calculation of the degree of user involvement to generate a score value, which is a characterized user group generated immediately before referring to the score value, expands an info bubble, or contracts an info bubble generated immediately before A means of deciding whether to
Means for generating an info bubble sequence comprising a plurality of different users in time series by using or adding or removing at least one user to the info bubble;
An information processing apparatus executable program that reads the info bubble and causes it to function as a means for analyzing the network behavior of the user.

The means for generating the info bubble sequence is based on the fact that the bubble average score after expansion is not less than or equal to the contraction threshold when an adjacent user of the user link is included in the information bubble generated immediately before. The program according to claim 13 , comprising means for inflating the info bubble to include the adjacent user.

The means for generating the info bubble sequence selects a user having the smallest score value included in the immediately generated info bubble, and removes the user from the info bubble until the bubble average score exceeds the contraction threshold value. The program according to claim 14 , comprising means for contracting the info bubble by deleting.

The means for analyzing the network behavior includes means for reading the info bubble specified by an identification value and analyzing information propagation of the information including the feature parameter using temporal evolution of the info bubble. 15. The program according to 15 .

The means for analyzing the network behavior includes means for generating a cluster of the info bubbles using similarity of user attributes of the plurality of read info bubbles and analyzing the users forming the cluster. 16. The program according to 16 .