JP2021192237A

JP2021192237A - Related score calculation system, method and program

Info

Publication number: JP2021192237A
Application number: JP2021124963A
Authority: JP
Inventors: 洋介本橋; Yosuke Motohashi; 昌子今西; Masako Imanishi
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2017-11-06
Filing date: 2021-07-30
Publication date: 2021-12-16
Anticipated expiration: 2037-11-06
Also published as: JP6972935B2; JP2019086940A; JP7103496B2; JP7375861B2; JP2022133401A

Abstract

To provide a related score calculation system capable of clarifying the strength of the relationship between a person in an organization and a word.SOLUTION: A collection unit 86 collects schedule information describing a person, an event name, and a time zone in which the person has been involved in an event having the event name. A word extraction unit 87 extracts a word from the event names described in each piece of schedule information. A related score calculation unit 88 calculates a related score indicating a strength of a relationship between the person and the word based on each piece of schedule information.SELECTED DRAWING: Figure 20

Description

本発明は、人と単語との関連の強さを数値化する関連スコア算出システム、関連スコア算出方法および関連スコア算出プログラムに関する。 The present invention relates to a related score calculation system, a related score calculation method, and a related score calculation program for quantifying the strength of a relationship between a person and a word.

特許文献１には、パーソナルコンピュータまたは携帯情報端末に導入されているアプリケーションプログラムがどの程度使用されているかを判定するために必要な情報を含む操作ログによって、ユーザが使用しているアプリケーションプログラムを判定する情報提供装置が記載されている。また、特許文献１には、情報提供装置が、ユーザがどの程度アプリケーションプログラムを使用しているかを判定したり、ユーザのアプリケーションプログラムに対する知識レベルを判定したりすることも記載されている。 Patent Document 1 determines an application program used by a user by an operation log including information necessary for determining how much the application program installed in a personal computer or a personal digital assistant is used. The information providing device to be used is described. Further, Patent Document 1 also describes that the information providing device determines how much the user is using the application program and determines the knowledge level of the user for the application program.

特許文献２には、プロファイルデータベースに、人材に関する情報の登録、削除、更新等を行い、プロファイル情報を参照して検索キーワードに合致する人材を検索する人材検索システムが記載されている。また、特許文献２には、人が著作者となっている文書のキーワードを抽出し、上位キーワードを得ることによって、人材の専門分野や業務についての情報を得ることが記載されている。 Patent Document 2 describes a human resources search system that registers, deletes, updates, etc., information about human resources in a profile database, and searches for human resources that match the search keyword by referring to the profile information. Further, Patent Document 2 describes that information on the specialized field and business of human resources can be obtained by extracting keywords of a document in which a person is the author and obtaining high-ranking keywords.

特開２０１３−３７５８４号公報Japanese Unexamined Patent Publication No. 2013-37584 特開２００５−３２７０２８号公報Japanese Unexamined Patent Publication No. 2005-327028

企業等の組織内において、特定の分野や技術に精通している人や、あるプロジェクトに参加したことのある人を見つけられることが好ましい。また、ある人が精通している分野、技術や、その人が参加したことのあるプロジェクトを容易に知ることができることが好ましい。しかし、大企業等の大きな組織では、「誰がどの分野やどの技術に詳しいか」、「誰がどのプロジェクトに参加したことがあるか」等は、長年、その組織にいないと分からない知識となってしまう。特に、「過去において、誰がどの分野やどの技術に詳しかったか」、「過去において、誰がどのプロジェクトに参加したか」等の情報については、その傾向が強くなる。そのため、特に、新入社員や派遣社員にとって、聞きたいことを誰にきけばよいのか分からなくなってしまう。その結果、例えば、製品開発の効率が低下する場合が生じ得る。 It is preferable to be able to find people who are familiar with a specific field or technology or who have participated in a certain project within an organization such as a company. It is also preferable to be able to easily know the fields and technologies that a person is familiar with and the projects that the person has participated in. However, in a large organization such as a large company, "who is familiar with which field and which technology", "who has participated in which project", etc. become knowledge that can only be understood by the organization for many years. It ends up. In particular, the tendency becomes stronger for information such as "who was familiar with which field and which technology in the past" and "who participated in which project in the past". Therefore, especially for new employees and dispatched employees, it becomes difficult to know who to ask what they want to hear. As a result, for example, the efficiency of product development may decrease.

そのため、本発明の発明者らは、組織内の人と、単語との関連の強さを明確化できることが好ましいと考えた。本発明の発明者らは、例えば、ある人と、「人工知能」という単語の関連の強さを明確化できれば、その人が、「人工知能」の分野や技術に詳しいかどうかや、その人が「人工知能」に関するプロジェクトに参加したことがあるかどうかを推定しやすいと考えた。 Therefore, the inventors of the present invention considered that it is preferable to be able to clarify the strength of the relationship between the person in the organization and the word. For example, if the inventors of the present invention can clarify the strength of the relationship between a person and the word "artificial intelligence", whether the person is familiar with the field or technology of "artificial intelligence" and the person I thought it would be easy to estimate if he had participated in a project on "artificial intelligence".

そこで、本発明は、組織内の人と単語との関連の強さを明確化することができる関連スコア算出システム、関連スコア算出方法および関連スコア算出プログラムを提供することを目的とする。 Therefore, an object of the present invention is to provide a related score calculation system, a related score calculation method, and a related score calculation program that can clarify the strength of the relationship between a person and a word in an organization.

また、本発明による関連スコア算出システムは、人と、イベント名と、そのイベント名を有するイベントにその人が関わった時間帯とを記述したスケジュール情報を収集する収集部と、各スケジュール情報に記述されているイベント名から単語を抽出する単語抽出部と、各スケジュール情報に基づいて、人と単語との関連の強さを表す関連スコアを算出する関連スコア算出部とを備え、関連スコア算出部が、一の人と一の単語の関連スコアを、一の単語をイベント名に含む各イベントについての組織内の全ての人のイベント参加時間の総和に対する、一の単語をイベント名に含む各イベントについての一の人のイベント参加時間の総和の割合と、単語をイベント名に含む各イベントについての一の人のイベント参加時間の総和を単語毎に求めた場合におけるその総和の総和に対する、一の単語をイベント名に含む各イベントについての一の人のイベント参加時間の総和の割合とに基づいて算出することを特徴とする。 Further, the related score calculation system according to the present invention describes a person, an event name, and a collection unit that collects schedule information describing the time zone in which the person was involved in the event having the event name, and each schedule information. It is equipped with a word extraction unit that extracts words from the event names, and a related score calculation unit that calculates a related score that indicates the strength of the relationship between a person and a word based on each schedule information. However, each event that includes one word in the event name for the total event participation time of all people in the organization for each event that includes one word in the event name and the related score of one person and one word. The ratio of the total event participation time of one person for each event and the total sum of event participation time of one person for each event that includes a word in the event name are calculated for each word. It is characterized in that it is calculated based on the ratio of the total event participation time of one person for each event including the word in the event name.

また、本発明による関連スコア算出方法は、コンピュータが、人と、イベント名と、前記イベント名を有するイベントに前記人が関わった時間帯とを記述したスケジュール情報を収集し、各スケジュール情報に記述されているイベント名から単語を抽出し、各スケジュール情報に基づいて、人と単語との関連の強さを表す関連スコアを算出し、関連スコアを算出するときに、一の人と一の単語の関連スコアを、一の単語をイベント名に含む各イベントについての組織内の全ての人のイベント参加時間の総和に対する、一の単語をイベント名に含む各イベントについての一の人のイベント参加時間の総和の割合と、単語をイベント名に含む各イベントについての一の人のイベント参加時間の総和を単語毎に求めた場合におけるその総和の総和に対する、一の単語をイベント名に含む各イベントについての一の人のイベント参加時間の総和の割合とに基づいて算出することを特徴とする。 Further, in the related score calculation method according to the present invention, the computer collects schedule information describing a person, an event name, and a time zone in which the person is involved in an event having the event name, and describes the schedule information in each schedule information. Words are extracted from the event name, and based on each schedule information, the relation score indicating the strength of the relation between the person and the word is calculated, and when the relation score is calculated, one person and one word The related score of is the sum of the event participation time of all people in the organization for each event that includes one word in the event name, and the event participation time of one person for each event that includes one word in the event name. For each event that includes one word in the event name for each word when the ratio of the sum of the total and the sum of the event participation time of one person for each event that includes the word in the event name is calculated for each word. It is characterized in that it is calculated based on the ratio of the total event participation time of one person.

また、本発明による関連スコア算出プログラムは、コンピュータに、人と、イベント名と、前記イベント名を有するイベントに前記人が関わった時間帯とを記述したスケジュール情報を収集する収集処理、各スケジュール情報に記述されているイベント名から単語を抽出する単語抽出処理、および、各スケジュール情報に基づいて、人と単語との関連の強さを表す関連スコアを算出する関連スコア算出処理を実行させ、関連スコア算出処理で、一の人と一の単語の関連スコアを、一の単語をイベント名に含む各イベントについての組織内の全ての人のイベント参加時間の総和に対する、一の単語をイベント名に含む各イベントについての一の人のイベント参加時間の総和の割合と、単語をイベント名に含む各イベントについての一の人のイベント参加時間の総和を単語毎に求めた場合におけるその総和の総和に対する、一の単語をイベント名に含む各イベントについての一の人のイベント参加時間の総和の割合とに基づいて算出させることを特徴とする。 Further, the related score calculation program according to the present invention is a collection process for collecting schedule information describing a person, an event name, and a time zone in which the person is involved in an event having the event name, and each schedule information. A word extraction process for extracting words from the event name described in, and a related score calculation process for calculating a related score indicating the strength of the relationship between a person and a word based on each schedule information are executed and related. In the score calculation process, the related score of one person and one word is included in the event name. One word is used as the event name for the total event participation time of all people in the organization for each event. For each word, the ratio of the total event participation time of one person for each event including the event and the total event participation time of one person for each event containing a word in the event name are calculated for each word. , It is characterized in that it is calculated based on the ratio of the total event participation time of one person for each event including one word in the event name.

本発明によれば、組織内の人と単語との関連の強さを明確化することができる。 According to the present invention, it is possible to clarify the strength of the relationship between a person and a word in an organization.

本発明の第１の実施形態の関連スコア算出システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the related score calculation system of 1st Embodiment of this invention. 操作ログの例を示す模式図である。It is a schematic diagram which shows the example of the operation log. ユーザと単語の組合せ毎に算出された関連スコアの例を示す模式図である。It is a schematic diagram which shows the example of the relation score calculated for each combination of a user and a word. 第１の実施形態の関連スコア算出システムの処理経過の例を示すフローチャートである。It is a flowchart which shows the example of the processing progress of the relation score calculation system of 1st Embodiment. 第１の実施形態の変形例を示すブロック図である。It is a block diagram which shows the modification of 1st Embodiment. 記憶部に記憶されているユーザ名と単語と関連スコアとの組の例を示す模式図である。It is a schematic diagram which shows the example of the set of the user name, the word, and the relation score stored in the storage part. 本発明の第２の実施形態の関連スコア算出システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the related score calculation system of the 2nd Embodiment of this invention. 関連スコア算出結果（ユーザ名と単語と関連スコアとの組の集合）の例を示す模式図である。It is a schematic diagram which shows the example of the relation score calculation result (the set of the set of the user name, the word, and the relation score). 第１のテーブルの例を示す模式図である。It is a schematic diagram which shows the example of the 1st table. 第２のテーブルの例を示す模式図である。It is a schematic diagram which shows the example of the 2nd table. 第２の実施形態の処理経過の例を示すフローチャートである。It is a flowchart which shows the example of the processing progress of 2nd Embodiment. 選択ユーザとキーワード単語の検索スコアを算出する処理の例を示すフローチャートである。It is a flowchart which shows the example of the process of calculating the search score of a selected user and a keyword word. スコア算出対象単語とキーワードユーザ名の検索スコアを算出する処理の例を示すフローチャートである。It is a flowchart which shows the example of the process of calculating the search score of the score calculation target word and the keyword user name. 本発明の第３の実施形態の関連スコア算出システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the related score calculation system of the 3rd Embodiment of this invention. クラスタリング結果を示す画面の例を示す模式図である。It is a schematic diagram which shows the example of the screen which shows the clustering result. 第４の実施形態の関連スコア算出システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the related score calculation system of 4th Embodiment. スケジュール情報の例を示す模式図である。It is a schematic diagram which shows the example of the schedule information. 本発明の各実施形態に係るコンピュータの構成例を示す概略ブロック図である。It is a schematic block diagram which shows the structural example of the computer which concerns on each embodiment of this invention. 本発明の概要を示すブロック図である。It is a block diagram which shows the outline of this invention. 本発明の概要の他の例を示すブロック図である。It is a block diagram which shows another example of the outline of this invention.

以下、本発明の実施形態を図面を参照して説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

実施形態１．
図１は、本発明の第１の実施形態の関連スコア算出システムの構成例を示すブロック図である。ただし、図１では、通信ネットワークを介して本発明の関連スコア算出システムに接続されている装置も図示している。 Embodiment 1.
FIG. 1 is a block diagram showing a configuration example of a related score calculation system according to the first embodiment of the present invention. However, FIG. 1 also illustrates a device connected to the related score calculation system of the present invention via a communication network.

本発明の関連スコア算出システム１には、通信ネットワーク１０を介して、パーソナルコンピュータ（以下、ＰＣと記す。）９１が接続されている。 A personal computer (hereinafter referred to as a PC) 91 is connected to the related score calculation system 1 of the present invention via a communication network 10.

ＰＣ９１は、例えば、会社や企業等の組織に属する人によって使用される。各ＰＣ９１は、１つの組織内に設けられているものとする。以下、組織が会社である場合を例にして説明する。ただし、ＰＣ９１を使用する人が属する組織は会社や企業でなくてもよい。また、組織は、複数の会社等によって形成される組織であってもよく、また、１つの会社や企業等の一部門であってもよい。 The PC 91 is used, for example, by a person who belongs to an organization such as a company or a company. It is assumed that each PC 91 is provided in one organization. Hereinafter, the case where the organization is a company will be described as an example. However, the organization to which the person who uses the PC91 belongs does not have to be a company or a company. Further, the organization may be an organization formed by a plurality of companies or the like, or may be a department of one company or a company or the like.

個々のＰＣ９１は、ファイルを操作するユーザによって、ファイルに関する１つの操作ログを作成し、記憶する。後述するように、各ＰＣ９１が記憶している操作ログは、関連スコア算出システム１（より具体的には、関連スコア算出システム１の収集部２）によって収集される。 Each PC 91 creates and stores one operation log related to the file by the user who operates the file. As will be described later, the operation log stored in each PC 91 is collected by the related score calculation system 1 (more specifically, the collecting unit 2 of the related score calculation system 1).

図２は、ＰＣ９１で作成される操作ログの例を示す模式図である。操作ログは、例えば、ファイル名と、ファイルを操作したユーザのユーザ名と、操作の内容と、その操作が行われた日時とを関連付けている（図２参照）。図２では、便宜的に、操作ログの番号も図示している。なお、図２は、操作ログの例であり、操作ログは、図２に示す例に限定されない。 FIG. 2 is a schematic diagram showing an example of an operation log created by the PC 91. The operation log associates, for example, the file name, the user name of the user who operated the file, the content of the operation, and the date and time when the operation was performed (see FIG. 2). In FIG. 2, the operation log numbers are also shown for convenience. Note that FIG. 2 is an example of an operation log, and the operation log is not limited to the example shown in FIG.

ＰＣ９１は、操作ログに、ファイル名として、パス名を含むファイル名を記述する。 The PC 91 describes a file name including a path name as a file name in the operation log.

また、ＰＣ９１は、操作ログに記述するユーザ名を、例えば、ユーザがＰＣ９１にログインする際に用いるＩＤ（Identification）から判定すればよい。ただし、ＰＣ９１がユーザ名を判定する方法は、この方法に限定されない。 Further, the PC 91 may determine the user name described in the operation log from, for example, an ID (Identification) used when the user logs in to the PC 91. However, the method by which the PC 91 determines the user name is not limited to this method.

なお、各実施形態では、ユーザＩＤ（ユーザの識別情報）として、ユーザ名を用いる。 In each embodiment, a user name is used as a user ID (user identification information).

操作ログに記述される操作内容の例として、例えば、「ファイルオープン」、「キータッチ」、「更新（保存）」、「ファイルクローズ」等が挙げられる。ただし、操作ログに記述される操作の内容は、これらに限定されず、「新規作成」等であってもよい。 Examples of the operation contents described in the operation log include "file open", "key touch", "update (save)", "file close" and the like. However, the content of the operation described in the operation log is not limited to these, and may be "new creation" or the like.

例えば、ユーザ「山田」が、２０１７年１０月１０日の１３時１５分に、ファイル“/・・/人工知能/・・/Ａ社の機械学習.pptx”を開いた場合、ＰＣ９１は、図２に例示する１番目の操作ログを作成する。また、例えば、ユーザ「山田」が、２０１７年１０月１０日の１３時１６分に、そのファイルに対して、キータッチ（キー入力）を行った場合、ＰＣ９１は、図２に例示する２番目の操作ログを作成する。また、例えば、ユーザ「山田」が、２０１７年１０月１０日の１５時１７分に、そのファイルを更新（保存）した場合、ＰＣ９１は、図２に例示するｍ−１番目の操作ログを作成する。また、例えば、ユーザ「山田」が、２０１７年１０月１０日の１５時４５分に、そのファイルを閉じた場合には、ＰＣ９１は、図２に例示するｍ番目の操作ログを作成する。 For example, if the user "Yamada" opens the file "/ .../ Artificial Intelligence / ... / A Company's Machine Learning.pptx" at 13:15 on October 10, 2017, the PC91 will be shown in the figure. Create the first operation log illustrated in 2. Further, for example, when the user "Yamada" performs a key touch (key input) on the file at 13:16 on October 10, 2017, the PC 91 is the second example illustrated in FIG. Create an operation log of. Further, for example, when the user "Yamada" updates (saves) the file at 15:17 on October 10, 2017, the PC 91 creates the m-1st operation log illustrated in FIG. do. Further, for example, when the user "Yamada" closes the file at 15:45 on October 10, 2017, the PC 91 creates the m-th operation log illustrated in FIG. 2.

図２に示すｍ＋１番目からｎ番目までの操作ログは、ユーザ「山田」が別のファイル“/・・/人工知能/・・/ディープラーニング.docx”を操作した際における操作ログの例である。 The operation logs from the m + 1st to the nth shown in FIG. 2 are examples of operation logs when the user "Yamada" operates another file "/.../ artificial intelligence / ... / deep learning.docx". ..

各ＰＣ９１は、それぞれ、同様に、ユーザがファイルに対して操作を行う毎に、操作ログを追加し、記憶していく。 Similarly, each PC 91 adds and stores an operation log each time the user performs an operation on the file.

関連スコア算出システム１は、収集部２と、単語抽出部３と、スコア算出部４と、記憶部５とを備える。 The related score calculation system 1 includes a collection unit 2, a word extraction unit 3, a score calculation unit 4, and a storage unit 5.

収集部２は、各ＰＣ９１から、各ＰＣ９１に記憶されている操作ログを収集する。図２に示すように、個々の操作ログは、パス名を含むファイル名と、そのファイル名を有するファイルを使用したユーザのユーザ名とを含む。個々の操作ログは、ユーザと単語との関連の強さを表す指標値である関連スコアを導出可能な情報も含む。図２に示す例では、「日時」および「操作」として記載された情報が、関連スコアを導出可能な情報に該当する。ただし、関連スコアは、１つの操作ログからは導出されず、複数の操作ログから導出される。 The collecting unit 2 collects the operation log stored in each PC 91 from each PC 91. As shown in FIG. 2, each operation log includes a file name including a path name and a user name of a user who used a file having the file name. Each operation log also contains information from which the association score, which is an index value indicating the strength of the association between the user and the word, can be derived. In the example shown in FIG. 2, the information described as "date and time" and "operation" corresponds to the information from which the related score can be derived. However, the related score is not derived from one operation log, but is derived from a plurality of operation logs.

単語抽出部３は、収集部２によって収集された各操作ログに記述されている各ファイル名（パス名を含むファイル名）に対して形態素解析を実行することによって、パス名を含むファイル名に含まれている単語を抽出する。ただし、単語抽出部３は、同一の単語を、重複して抽出しない。例えば、単語抽出部３は、「人工知能」という単語を既に抽出している場合、２回目以降に抽出された「人工知能」という単語については無視する。 The word extraction unit 3 performs morphological analysis on each file name (file name including the path name) described in each operation log collected by the collection unit 2 to obtain a file name including the path name. Extract the contained words. However, the word extraction unit 3 does not extract the same word in duplicate. For example, when the word extraction unit 3 has already extracted the word "artificial intelligence", the word "artificial intelligence" extracted from the second time onward is ignored.

例えば、単語抽出部３は、“/・・/人工知能/・・/Ａ社の機械学習.pptx”というパス名を含むファイル名に対して形態素解析を実行することによって、「人工知能」、「Ａ社」、「機械学習」等の単語を抽出する。なお、以下の説明において、各ユーザが属している会社（組織）が「Ａ社」であるものとして説明する。 For example, the word extraction unit 3 performs morphological analysis on a file name including the path name “/ ・・ / artificial intelligence / ・・ / machine learning of company A.pptx” to obtain “artificial intelligence”. Extract words such as "Company A" and "machine learning". In the following description, it is assumed that the company (organization) to which each user belongs is "Company A".

さらに、例えば、単語抽出部３は、“/・・/人工知能/・・/ディープラーニング.docx”というパス名を含むファイル名に対して形態素解析を実行することによって、「人工知能」、「ディープラーニング」等の単語を抽出する。ただし、前述のように、「人工知能」は既に抽出されているので、単語抽出部３は、ここで抽出された「人工知能」という単語については無視する。 Further, for example, the word extraction unit 3 performs morphological analysis on a file name including the path name “/ ・・ / artificial intelligence / ・・ / deep learning.docx” to obtain “artificial intelligence” and “artificial intelligence”. Extract words such as "deep learning". However, as described above, since "artificial intelligence" has already been extracted, the word extraction unit 3 ignores the word "artificial intelligence" extracted here.

単語抽出部３は、同様の処理を、各操作ログに記述されている各ファイル名に対して行うことによって、単語の集合を得る。これらの単語は、互いに異なる。 The word extraction unit 3 obtains a set of words by performing the same processing for each file name described in each operation log. These words are different from each other.

スコア算出部４は、ファイルを操作した各ユーザと、単語抽出部３によって抽出された各単語の組合せ毎に、ユーザと単語との関連の強さを表す指標値である関連スコアを算出する。なお、ユーザは、操作ログに記述されるユーザ名で表される。 The score calculation unit 4 calculates the association score, which is an index value indicating the strength of the association between the user and the word, for each combination of each user who operated the file and each word extracted by the word extraction unit 3. The user is represented by a user name described in the operation log.

なお、本発明の第２の実施形態等では、単語同士の関連の強さを表す指標値も用いる。本発明において、ユーザと単語との関連の強さを表す指標値を「関連スコア」と称し、単語同士の関連の強さを表す指標値を「関連度」と称することによって、２種類の指標値を区別する。 In the second embodiment of the present invention, an index value indicating the strength of the relationship between words is also used. In the present invention, the index value indicating the strength of the relationship between the user and the word is referred to as "relationship score", and the index value indicating the strength of the relationship between words is referred to as "relevance degree". Distinguish between values.

スコア算出部４は、ユーザ（ユーザ名）と単語の組合せ毎に、関連スコアを算出し、そのユーザ名と単語と関連スコアとの組を記憶部５に記憶させる。 The score calculation unit 4 calculates a related score for each combination of a user (user name) and a word, and stores the set of the user name, the word, and the related score in the storage unit 5.

記憶部５は、ユーザ（ユーザ名）と単語と関連スコアとの組を記憶する記憶装置である。 The storage unit 5 is a storage device that stores a set of a user (user name), a word, and a related score.

関連スコアの算出方法は、複数、存在する。以下、関連スコアの算出方法として、３種類の方法を説明する。以下に示す３種類のいずれの方法においても、スコア算出部４は、ユーザと単語の組合せ毎に、関連スコアを算出し、記憶部５に記憶させる。 There are multiple methods for calculating the related score. Hereinafter, three types of methods will be described as methods for calculating the related score. In any of the three types of methods shown below, the score calculation unit 4 calculates the related score for each combination of the user and the word, and stores it in the storage unit 5.

第１の算出方法は、一のユーザ（以下、ユーザＵと記す。）と一の単語（以下、単語Ｗと記す。）の関連スコアとして、単語Ｗをファイル名に含む各ファイルについてのユーザＵの操作時間の総和を算出する方法である。操作時間が長いほど、ユーザＵと単語Ｗの関連が強く、操作時間が短いほど、ユーザＵと単語Ｗの関連が弱いと言える。従って、操作時間を、関連スコアとして用いることができる。 The first calculation method is the user U for each file containing the word W in the file name as the association score between one user (hereinafter referred to as user U) and one word (hereinafter referred to as word W). It is a method of calculating the total operation time of. It can be said that the longer the operation time, the stronger the relationship between the user U and the word W, and the shorter the operation time, the weaker the relationship between the user U and the word W. Therefore, the operation time can be used as a related score.

ここではまず、実質参照時間を操作時間として扱う場合を例にして説明する。 Here, first, a case where the actual reference time is treated as the operation time will be described as an example.

なお、ファイル名は、パス名を含むファイル名である。従って、単語Ｗがパス名の方に含まれている場合であっても、単語Ｗはファイル名に含まれているものとして扱う。この点は、後述の第２の算出方法および第３の算出方法においても同様である。 The file name is a file name including a path name. Therefore, even if the word W is included in the path name, the word W is treated as if it is included in the file name. This point is the same in the second calculation method and the third calculation method described later.

実質参照時間は、ＰＣ９１において、ファイルの内容を表しているウィンドウがアクティブになっている時間（すなわち、ファイルの内容を表しているウィンドウがユーザから見て一番手前に表示されている時間）である。 The actual reference time is the time during which the window representing the contents of the file is active on the PC91 (that is, the time when the window representing the contents of the file is displayed in the foreground when viewed from the user). be.

関連スコアの第１の算出方法では、スコア算出部４は、単語Ｗをファイル名に含む各ファイルについてのユーザＵの実質参照時間の総和を算出し、その総和を、ユーザＵと単語Ｗの関連スコアとする。 In the first calculation method of the association score, the score calculation unit 4 calculates the sum of the actual reference times of the user U for each file including the word W in the file name, and the sum is the relationship between the user U and the word W. Let it be a score.

キータッチが行われていれば、ファイルの内容を表すウィンドウはアクティブである。従って、例えば、スコア算出部４は、単語Ｗをファイル名に含む１つのファイルに関して、ユーザＵによってキータッチが続けて行われている状態を操作ログから判断し、その状態における最初のキータッチ時刻から、最後のキータッチ時刻までの時間を、そのファイルにおけるユーザＵの実質参照時間とする。 If a key touch has been made, the window showing the contents of the file is active. Therefore, for example, the score calculation unit 4 determines from the operation log the state in which the key touch is continuously performed by the user U for one file containing the word W in the file name, and the first key touch time in that state is determined. The time from to the last key touch time is taken as the actual reference time of the user U in the file.

さらに、単語Ｗをファイル名に含み、ユーザＵに操作された他のファイルがあれば、スコア算出部４は、そのファイルに関しても同様に、実質参照時間を算出する。 Further, if the file name includes the word W and there is another file operated by the user U, the score calculation unit 4 similarly calculates the actual reference time for that file.

そして、スコア算出部４は、単語Ｗをファイル名に含み、ユーザＵに操作されたファイル毎に算出した実質参照時間の総和を算出し、その総和をユーザＵと単語Ｗの関連スコアとする。 Then, the score calculation unit 4 includes the word W in the file name, calculates the sum of the actual reference times calculated for each file operated by the user U, and sets the sum as the related score between the user U and the word W.

また、操作ログにおいて、アクティブ状態となった開始時刻および終了時刻を明示しているのであれば、スコア算出部４は、操作ログにおいて明示されているそれらの時刻に基づいて、実質参照時間を算出してもよい。 If the operation log clearly indicates the start time and end time of the active state, the score calculation unit 4 calculates the actual reference time based on those times specified in the operation log. You may.

また、スコア算出部４は、ファイルオープンからファイルクローズまでの時間を操作時間として算出してもよい。この場合、スコア算出部４は、単語Ｗをファイル名に含み、ユーザＵに操作されたファイル毎に、ファイルオープンからファイルクローズまでの時間を算出し、その時間の総和をユーザＵと単語Ｗの関連スコアとすればよい。 Further, the score calculation unit 4 may calculate the time from the file opening to the file closing as the operation time. In this case, the score calculation unit 4 includes the word W in the file name, calculates the time from the file opening to the file closing for each file operated by the user U, and the sum of the times is the sum of the time for the user U and the word W. It can be a related score.

なお、実質参照時間を操作時間として扱うことが好ましい。 It is preferable to treat the actual reference time as the operation time.

関連スコアの第２の算出方法は、一のユーザ（ユーザＵ）と一の単語（単語Ｗ）の関連スコアとして、単語Ｗをファイル名に含む各ファイルをユーザＵが操作した際のキータッチの回数の総和を算出する方法である。キータッチの回数が多いほど、単語Ｗをファイル名に含むファイルをユーザＵが操作した量が多いことになる。よって、キータッチの回数が多いほど、ユーザＵと単語Ｗの関連が強く、キータッチの回数が少ないほど、ユーザＵと単語Ｗの関連が弱いと言える。従って、キータッチの回数を、関連スコアとして用いることができる。 The second calculation method of the relation score is a key touch when the user U operates each file containing the word W in the file name as the relation score of one user (user U) and one word (word W). This is a method of calculating the total number of times. As the number of key touches increases, the amount of operations by the user U on the file containing the word W in the file name increases. Therefore, it can be said that the greater the number of key touches, the stronger the relationship between the user U and the word W, and the smaller the number of key touches, the weaker the relationship between the user U and the word W. Therefore, the number of key touches can be used as the related score.

第２の算出方法では、スコア算出部４は、操作ログを参照して、単語Ｗをファイル名に含む一つのファイルをユーザＵが操作した際のキータッチの回数をカウントすることによって、そのファイルにおけるキータッチの回数を求める。 In the second calculation method, the score calculation unit 4 refers to the operation log and counts the number of key touches when the user U operates one file containing the word W in the file name. Find the number of key touches in.

さらに、単語Ｗをファイル名に含み、ユーザＵに操作された他のファイルがあれば、スコア算出部４は、そのファイルに関しても同様に、キータッチの回数を求める。 Further, if the file name includes the word W and there is another file operated by the user U, the score calculation unit 4 similarly obtains the number of key touches for that file.

そして、スコア算出部４は、単語Ｗをファイル名に含み、ユーザＵに操作されたファイル毎に算出したキータッチの回数の総和を算出し、その総和をユーザＵと単語Ｗの関連スコアとする。 Then, the score calculation unit 4 includes the word W in the file name, calculates the total number of key touches calculated for each file operated by the user U, and sets the total as the related score between the user U and the word W. ..

関連スコアの第３の算出方法は、一のユーザ（ユーザＵ）と一の単語（単語Ｗ）の関連スコアを、次に説明する２つの割合に基づいて算出する方法である。この２つの割合のうち、一方の割合をＲ_１と記し、もう一方の割合をＲ_２と記す。 The third method of calculating the association score is a method of calculating the association score of one user (user U) and one word (word W) based on the two ratios described below. Of these two ratios, one ratio is referred to as R ₁ and the other ratio is referred to as R ₂ .

Ｒ_１は、単語Ｗをファイル名に含む各ファイルについての組織内の全ユーザの操作時間の総和に対する、単語Ｗをファイル名に含む各ファイルについてのユーザＵの操作時間の総和の割合である。すなわち、Ｒ_１は、以下に示す式（１）で表される。 R ₁ is, to the sum of the operation times of all the users in the organization for each file containing the word W in the file name, which is the ratio of the sum of the operation time of the user U for each file containing the word W in the file name. That is, R ₁ is represented by the following equation (1).

Ｒ_２は、個々の単語に着目した場合における、着目した単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和に対する、単語Ｗをファイル名に含む各ファイルについてのユーザＵの操作時間の総和の割合である。すなわち、Ｒ_２は、以下に示す式（２）で表される。 R ₂ is the operation time of the user U for each file containing the word W in the file name with respect to the total operation time of the user U for each file containing the word of interest in the file name when focusing on each word. It is the ratio of the total of. That is, R ₂ is represented by the following equation (2).

Ｒ_１について説明する。単語Ｗをファイル名に含む各ファイルについてのユーザＵの操作時間の総和（式（１）の右辺の分子）は、前述の第１の算出方法で算出される関連スコアに相当する。すなわち、スコア算出部４は、前述の第１の算出方法で説明した方法で、単語Ｗをファイル名に含む各ファイルについてのユーザＵの操作時間の総和を算出すればよい。 R ₁ will be described. The sum of the operation times of the user U (the numerator on the right side of the equation (1)) for each file containing the word W in the file name corresponds to the association score calculated by the first calculation method described above. That is, the score calculation unit 4 may calculate the total operation time of the user U for each file including the word W in the file name by the method described in the first calculation method described above.

単語Ｗをファイル名に含む各ファイルについての組織内の全ユーザの操作時間の総和（式（１）の右辺の分母）について説明する。スコア算出部４は、単語Ｗをファイル名に含む各ファイルについての、組織内の一人目のユーザの操作時間の総和も、前述の第１の算出方法で説明した方法で算出する。同様に、スコア算出部４は、単語をＷファイル名に含む各ファイルについての、組織内の二人目のユーザの操作時間の総和も、前述の第１の算出方法で説明した方法で算出する。同様に、スコア算出部４は、組織に属する一人一人について、単語Ｗをファイル名に含む各ファイルについてのユーザの操作時間の総和を算出する。さらに、スコア算出部４は、組織に属する一人一人について算出した「単語Ｗをファイル名に含む各ファイルについてのユーザの操作時間の総和」の総和を算出する。この値が、単語Ｗをファイル名に含む各ファイルについての組織内の全ユーザの操作時間の総に該当する。 The sum of the operation times of all users in the organization for each file containing the word W in the file name (the denominator on the right side of the equation (1)) will be described. The score calculation unit 4 also calculates the total operation time of the first user in the organization for each file containing the word W in the file name by the method described in the first calculation method described above. Similarly, the score calculation unit 4 also calculates the total operation time of the second user in the organization for each file containing the word in the W file name by the method described in the first calculation method described above. Similarly, the score calculation unit 4 calculates the total operation time of the user for each file including the word W in the file name for each person belonging to the organization. Further, the score calculation unit 4 calculates the sum of "the sum of the user's operation times for each file containing the word W in the file name" calculated for each person belonging to the organization. This value corresponds to the total operating time of all users in the organization for each file containing the word W in the file name.

例えば、単語Ｗが「人工知能」であり、ユーザＵが「山田」であるとする。また、山田が属する組織「Ａ社」に３００人のユーザがいるとする。この場合、スコア算出部４は、「人工知能」をファイル名に含むファイルについてのユーザ「山田」の操作時間の総和を、式（１）の右辺の分子として求める。また、スコア算出部４は、「人工知能」をファイル名に含むファイルについてのユーザの操作時間の総和を、３００人の個々のユーザ毎に算出し、さらに、個々のユーザ毎に算出した「操作時間の総和」の総和を、式（１）の右辺の分母として求める。そして、スコア算出部４は、式（１）によって、Ｒ_１を算出する。 For example, assume that the word W is "artificial intelligence" and the user U is "Yamada". Further, it is assumed that there are 300 users in the organization "Company A" to which Yamada belongs. In this case, the score calculation unit 4 obtains the total operation time of the user "Yamada" for the file containing "artificial intelligence" in the file name as the numerator on the right side of the equation (1). Further, the score calculation unit 4 calculates the total operation time of the user for the file containing "artificial intelligence" in the file name for each of 300 individual users, and further calculates the "operation" for each individual user. The sum of "sum of time" is calculated as the denominator on the right side of equation (1). Then, the score calculation unit 4 calculates R ₁ by the equation (1).

次に、Ｒ_２について説明する。式（２）の右辺の分子は、式（１）の右辺の分子と同じである。従って、スコア算出部４は、前述の第１の算出方法で説明した方法で、単語Ｗをファイル名に含む各ファイルについてのユーザＵの操作時間の総和を算出すればよい。 Next, R ₂ will be described. The molecule on the right side of the formula (2) is the same as the molecule on the right side of the formula (1). Therefore, the score calculation unit 4 may calculate the total operation time of the user U for each file including the word W in the file name by the method described in the first calculation method described above.

個々の単語に着目した場合における、着目した単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和（式（２）の右辺の分母）について説明する。スコア算出部４は、単語抽出部３によって抽出された個々の単語に着目する（換言すれば、個々の単語を１つ１つ選択する）。そして、スコア算出部４は、着目した単語（選択した単語）をファイル名に含む各ファイルについてのユーザＵの操作時間の総和を、前述の第１の算出方法で説明した方法で算出する。スコア算出部４は、次の単語に着目し（換言すれば、次の単語を選択し）、着目した単語（選択した単語）をファイル名に含む各ファイルについてのユーザＵの操作時間の総和を、前述の第１の算出方法で説明した方法で算出する。このように、スコア算出部４は、単語毎に、単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和を算出する。そして、スコア算出部４は、単語毎に算出した「単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和」の総和を算出する。この値が、式（２）の右辺の分母に該当する。「個々の単語に着目した場合における、着目した単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和（式（２）の右辺の分母）」は、「単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和を単語毎に求めた場合における前記総和の総和」であると言うことができる。 The sum of the operation times of the user U (the denominator on the right side of the equation (2)) for each file including the word of interest in the file name when focusing on each word will be described. The score calculation unit 4 pays attention to the individual words extracted by the word extraction unit 3 (in other words, selects each individual word one by one). Then, the score calculation unit 4 calculates the total operation time of the user U for each file including the word of interest (selected word) in the file name by the method described in the first calculation method described above. The score calculation unit 4 focuses on the next word (in other words, selects the next word), and sums up the operation time of the user U for each file containing the focused word (selected word) in the file name. , Calculate by the method described in the above-mentioned first calculation method. In this way, the score calculation unit 4 calculates the total operation time of the user U for each file including the word in the file name for each word. Then, the score calculation unit 4 calculates the sum of "the sum of the operation times of the user U for each file including the word in the file name" calculated for each word. This value corresponds to the denominator on the right side of equation (2). "The total operation time of the user U for each file containing the word of interest in the file name when focusing on each word (the denominator on the right side of equation (2))" is "each including the word in the file name." It can be said that it is "the sum of the sums when the sum of the operation times of the user U for the file is obtained for each word".

例えば、前述の例のように、単語Ｗが「人工知能」であり、ユーザＵが「山田」であるとする。この場合、スコア算出部４は、「人工知能」をファイル名に含むファイルについてのユーザ「山田」の操作時間の総和を、式（２）の右辺の分子として求める。また、スコア算出部４は、「人工知能」、「Ａ社」、「ディープラーニング」等の抽出された単語毎に、単語をファイル名に含むファイルについてのユーザ「山田」の操作時間の総和を算出する。さらに、個々の単語毎に算出した「操作時間の総和」の総和を、式（２）の右辺の分母として、求める。そして、スコア算出部４は、式（２）によって、Ｒ_２を算出する。 For example, as in the above example, it is assumed that the word W is "artificial intelligence" and the user U is "Yamada". In this case, the score calculation unit 4 obtains the total operation time of the user "Yamada" for the file containing "artificial intelligence" in the file name as the numerator on the right side of the equation (2). In addition, the score calculation unit 4 sums up the operation time of the user "Yamada" for the file containing the word in the file name for each extracted word such as "artificial intelligence", "company A", and "deep learning". calculate. Further, the sum of the "sum of operation times" calculated for each word is obtained as the denominator on the right side of the equation (2). Then, the score calculation unit 4 calculates R ₂ by the equation (2).

スコア算出部４は、Ｒ_１，Ｒ_２を求めた後、ユーザＵと単語Ｗの関連スコアを、以下に示す式（３）によって算出する。 _{After obtaining R 1} and R ₂ , the score calculation unit 4 calculates the related score between the user U and the word W by the following equation (3).

関連スコア＝Ｒ_１×ｌｏｇ（Ｒ_２）・・・（３） Related score = R ₁ x log (R ₂ ) ・・・ (3)

この第３の算出方法で関連スコアを算出した場合、組織に属する多くの人に関連のある単語については、関連スコアの値が低くなり、組織に属する特定の人に関連のある単語については、関連スコアの値が高くなる。例えば、各ユーザはＡ社に属しているので、「Ａ社」という単語は、各ユーザと関連があると考えられる。しかし、「Ａ社」という単語と、各ユーザの関連が強いということは、自明であると言える。そのため、「Ａ社」という単語と各ユーザの関連スコアを高くしても、あまり意味がなく、関連スコアを低くした方が好ましい。また、組織に属する特定のユーザのみが、「人工知能」という単語と関連している場合、そのユーザと「人工知能」という単語の関連スコアは高くした方が好ましい。第３の算出方法では、そのように、関連スコアを算出することができる。 When the association score is calculated by this third calculation method, the value of the association score is low for words related to many people belonging to the organization, and the value of the association score is low for words related to a specific person belonging to the organization. The value of the related score is high. For example, since each user belongs to company A, the word "company A" is considered to be related to each user. However, it can be said that it is self-evident that the word "Company A" is strongly related to each user. Therefore, it does not make much sense to increase the association score between the word "Company A" and each user, and it is preferable to decrease the association score. Further, when only a specific user belonging to an organization is associated with the word "artificial intelligence", it is preferable that the association score between the user and the word "artificial intelligence" is high. In the third calculation method, the related score can be calculated as such.

図３は、ユーザと単語の組合せ毎に算出された関連スコアの例を示す模式図である。図３に示す第１の関連スコアは、第１の算出方法で算出された関連スコアである。第２の関連スコアは、第２の算出方法で算出された関連スコアである。第３の関連スコアは、第３の算出方法で算出された関連スコアである。図３では、３種類の関連スコアを図示したが、スコア算出部４は、いずれか１種類の関連スコアを算出すればよい。ただし、スコア算出部４は、２種類以上の関連スコアを算出してもよい。 FIG. 3 is a schematic diagram showing an example of the association score calculated for each combination of the user and the word. The first association score shown in FIG. 3 is the association score calculated by the first calculation method. The second association score is the association score calculated by the second calculation method. The third association score is the association score calculated by the third calculation method. Although three types of related scores are shown in FIG. 3, the score calculation unit 4 may calculate any one type of related score. However, the score calculation unit 4 may calculate two or more types of related scores.

既に説明したように、スコア算出部４は、ユーザと単語の組合せ毎に、関連スコアを算出し、ユーザ名と単語と関連スコアとの組を記憶部５に記憶させる。 As described above, the score calculation unit 4 calculates the association score for each combination of the user and the word, and stores the pair of the user name, the word, and the association score in the storage unit 5.

収集部２は、例えば、関連スコア算出プログラムに従って動作するコンピュータのＣＰＵ（Central Processing Unit ）およびそのコンピュータの通信インタフェースによって実現される。例えば、ＣＰＵが、コンピュータのプログラム記憶装置等のプログラム記録媒体から関連スコア算出プログラムを読み込み、関連スコア算出プログラムに従って、通信インタフェースを用いて、収集部２として動作すればよい。また、単語抽出部３およびスコア算出部４も、例えば、関連スコア算出プログラムに従って動作する上記のコンピュータのＣＰＵによって実現される。すなわち、上記のように、関連スコア算出プログラムを読み込んだＣＰＵが、関連スコア算出プログラムに従って、単語抽出部３およびスコア算出部４として動作すればよい。記憶部５は、上記のコンピュータの記憶装置によって実現される。また、収集部２、単語抽出部３およびスコア算出部４がそれぞれ別々のハードウェアによって実現されてもよい。 The collecting unit 2 is realized by, for example, a CPU (Central Processing Unit) of a computer operating according to a related score calculation program and a communication interface of the computer. For example, the CPU may read the related score calculation program from a program recording medium such as a program storage device of a computer, and operate as the collecting unit 2 by using the communication interface according to the related score calculation program. Further, the word extraction unit 3 and the score calculation unit 4 are also realized by, for example, the CPU of the above-mentioned computer that operates according to the related score calculation program. That is, as described above, the CPU that has read the related score calculation program may operate as the word extraction unit 3 and the score calculation unit 4 according to the related score calculation program. The storage unit 5 is realized by the storage device of the above-mentioned computer. Further, the collection unit 2, the word extraction unit 3, and the score calculation unit 4 may be realized by separate hardware.

また、関連スコア算出システム１は、２つ以上の物理的に分離した装置が有線または無線で接続されている構成であってもよい。この点は、後述する他の実施形態でも同様である。 Further, the related score calculation system 1 may be configured such that two or more physically separated devices are connected by wire or wirelessly. This point is the same in other embodiments described later.

次に、第１の実施形態の処理経過について説明する。図４は、第１の実施形態の関連スコア算出システムの処理経過の例を示すフローチャートである。なお、既に説明した事項については、詳細な説明を省略する。 Next, the processing progress of the first embodiment will be described. FIG. 4 is a flowchart showing an example of the processing progress of the related score calculation system of the first embodiment. The details of the matters already described will be omitted.

まず、収集部２が、会社内に設けられている各ＰＣ９１から、操作ログを収集する（ステップＳ１）。 First, the collection unit 2 collects operation logs from each PC 91 provided in the company (step S1).

次に、単語抽出部３が、各操作ログに記述されているファイル名（パス名を含むファイル名）に対して形態素解析を行うことにより、単語を抽出する（ステップＳ２）。 Next, the word extraction unit 3 extracts words by performing morphological analysis on the file names (file names including path names) described in each operation log (step S2).

次に、スコア算出部４が、各操作ログに基づいて、操作ログに記述されているユーザ名と、ステップＳ２で抽出された単語との組み合わせ毎に、そのユーザ名が表わすユーザと単語との関連の強さを表す関連スコアを算出。そして、スコア算出部４は、ユーザ名と単語と関連スコアとの組を記憶部５に記憶させる（ステップＳ３）。スコア算出部４は、前述の第１の算出方法、第２の算出方法、および、第３の算出方法のうちの、いずれの方法で関連スコアを算出してもよい。 Next, the score calculation unit 4 sets the user and the word represented by the user name for each combination of the user name described in the operation log and the word extracted in step S2 based on each operation log. Calculate the association score that represents the strength of the association. Then, the score calculation unit 4 stores the set of the user name, the word, and the related score in the storage unit 5 (step S3). The score calculation unit 4 may calculate the related score by any of the above-mentioned first calculation method, second calculation method, and third calculation method.

この結果、記憶部５には、ユーザ名と単語と関連スコアとの組が複数組、記憶される。 As a result, a plurality of pairs of the user name, the word, and the related score are stored in the storage unit 5.

本実施形態によれば、スコア算出部４が、各操作ログに基づいて、ユーザと単語の組合せ毎に、関連スコアを算出する。そして、前述の第１の算出方法、第２の算出方法、および、第３の算出方法は、いずれも、基本的に、単語Ｗをファイル名に含むファイルに対するユーザＵの操作の量（キータッチの回数、操作時間等）が多いほど、関連スコアとして大きな値を算出する。従って、ユーザと単語の組合せ毎に、ユーザと単語との関連の強さが、適切に数値化される。よって、組織内の人と単語との関連の強さを明確化することができる。 According to the present embodiment, the score calculation unit 4 calculates the related score for each combination of the user and the word based on each operation log. Then, in each of the above-mentioned first calculation method, second calculation method, and third calculation method, basically, the amount of operation of the user U on the file containing the word W in the file name (key touch). The larger the number of times, the operation time, etc.), the larger the value is calculated as the related score. Therefore, the strength of the relationship between the user and the word is appropriately quantified for each combination of the user and the word. Therefore, it is possible to clarify the strength of the relationship between a person and a word in an organization.

次に、第１の実施形態の変形例について説明する。図５は、第１の実施形態の変形例を示すブロック図である。図５に示す関連スコア算出システム１は、収集部２、単語抽出部３、スコア算出部４および記憶部５に加えて、キーワード受付部６と、検索部７と、出力部８とを備える。図１に示す要素と同様の要素については、図１と同一の符号を付し、説明を省略する。 Next, a modification of the first embodiment will be described. FIG. 5 is a block diagram showing a modified example of the first embodiment. The related score calculation system 1 shown in FIG. 5 includes a keyword reception unit 6, a search unit 7, and an output unit 8 in addition to the collection unit 2, the word extraction unit 3, the score calculation unit 4, and the storage unit 5. The same elements as those shown in FIG. 1 are designated by the same reference numerals as those shown in FIG. 1, and the description thereof will be omitted.

以下に示す例では、ユーザ名と、単語と、前述の第３の算出方法によって算出された関連スコアとの組が、複数、記憶部５に記憶されているものとして説明する。図６は、記憶部５に記憶されている複数の組の例を示す模式図である。 In the example shown below, it is assumed that a plurality of pairs of the user name, the word, and the related score calculated by the above-mentioned third calculation method are stored in the storage unit 5. FIG. 6 is a schematic diagram showing an example of a plurality of sets stored in the storage unit 5.

図５に例示する関連スコア算出システム１は、単語を検索キーワードとして受け付け、その単語に応じたユーザのユーザ名を検索する。あるいは、関連スコア算出システム１は、ユーザ名を検索キーワードとして受け付け、そのユーザ名に応じた単語を検索する。また、関連スコア算出システム１は、上記の２種類の検索をそれぞれ実行可能であってもよい。 The related score calculation system 1 illustrated in FIG. 5 accepts a word as a search keyword and searches for the user name of the user corresponding to the word. Alternatively, the related score calculation system 1 accepts the user name as a search keyword and searches for a word corresponding to the user name. Further, the related score calculation system 1 may be able to execute each of the above two types of searches.

キーワード受付部６は、検索者から検索キーワードを受け付ける。 The keyword reception unit 6 receives a search keyword from a searcher.

検索部７は、検索キーワードに応じて、検索を実行する。 The search unit 7 executes a search according to the search keyword.

出力部８は、検索結果を出力する。 The output unit 8 outputs the search result.

なお、キーワード受付部６は、例えば、検索者の使用する端末装置（図示略）から、通信ネットワークを介して、検索キーワードを受け付け、出力部８は、その端末装置に対して、検索結果を送信すればよい。以下、このようにキーワード受付部６が検索キーワードを受け付け、出力部８がこのように検索結果を出力する場合を例にして説明する。ただし、検索キーワードの受け付け態様や、検索結果の出力態様は、この例に限定されない。例えば、キーワード受付部６は、関連スコア算出システム１が備える入力デバイス（図示略）を介して検索キーワードを受け付けてもよい。また、出力部８は、関連スコア算出システム１が備えるディスプレイ装置（図示略）に検索結果を出力（表示）してもよい。 The keyword receiving unit 6 receives a search keyword from, for example, a terminal device (not shown) used by the searcher via a communication network, and the output unit 8 transmits the search result to the terminal device. do it. Hereinafter, a case where the keyword receiving unit 6 accepts the search keyword and the output unit 8 outputs the search result in this way will be described as an example. However, the mode of accepting search keywords and the mode of outputting search results are not limited to this example. For example, the keyword receiving unit 6 may accept a search keyword via an input device (not shown) included in the related score calculation system 1. Further, the output unit 8 may output (display) the search result to a display device (not shown) included in the related score calculation system 1.

キーワード受付部６および出力部８は、収集部２と同様に、例えば、関連スコア算出プログラムに従って動作するコンピュータのＣＰＵおよびそのコンピュータの通信インタフェースによって実現される。また、検索部７は、関連スコア算出プログラムに従って動作するそのコンピュータのＣＰＵによって実現される。また、キーワード受付部６、検索部７、出力部８、および他の構成要素がそれぞれ別々のハードウェアによって実現されてもよい。 Similar to the collection unit 2, the keyword reception unit 6 and the output unit 8 are realized by, for example, a CPU of a computer operating according to a related score calculation program and a communication interface of the computer. Further, the search unit 7 is realized by the CPU of the computer that operates according to the related score calculation program. Further, the keyword reception unit 6, the search unit 7, the output unit 8, and other components may be realized by separate hardware.

次に、関連スコア算出システム１が、単語を検索キーワードとして受け付け、その単語に応じたユーザのユーザ名を検索する処理の例について説明する。 Next, an example of a process in which the related score calculation system 1 accepts a word as a search keyword and searches for a user name of a user corresponding to the word will be described.

まず、キーワード受付部６が、検索者から単語を検索キーワードとして受け付ける。 First, the keyword reception unit 6 accepts a word from a searcher as a search keyword.

次に、検索部７が、記憶部５に記憶されている、ユーザ名と単語と関連スコアとの組の中から、検索キーワードに該当する単語と、閾値（例えば、０．５）以上の関連スコアを含む組を特定し、その組に含まれているユーザ名を検索結果として特定する。 Next, the search unit 7 associates the word corresponding to the search keyword with the word corresponding to the search keyword from the set of the user name, the word, and the association score stored in the storage unit 5, and the association of the threshold value (for example, 0.5) or more. Identify the set that includes the score, and identify the user names included in that set as search results.

例えば、キーワード受付部６が検索キーワードとして、「人工知能」という単語を受け付けたとする。また、上記の閾値が０．５であるとする。この場合、検索部７は、図６に例示する複数の組の中から、「人工知能」という単語と、０．５以上の関連スコアを含む組を特定する。本例では、図６に示す１番目の組が特定される。検索部７は、特定した組含まれるユーザ名「山田」を検索結果として得る。出力部８は、その検索結果を出力する。なお、検索部７は、「人工知能」という単語と、０．５以上の関連スコアを含む組が複数存在するならば、その組を全て特定し、その各組から得られるユーザ名を検索結果とする。従って、検索結果として得られるユーザ名は１つとは限らない。出力部８は、検索結果として得た複数のユーザ名を、関連スコアの高い順に並べて出力してもよい。 For example, suppose that the keyword reception unit 6 accepts the word "artificial intelligence" as a search keyword. Further, it is assumed that the above threshold value is 0.5. In this case, the search unit 7 identifies a group including the word "artificial intelligence" and a related score of 0.5 or more from the plurality of sets illustrated in FIG. In this example, the first set shown in FIG. 6 is specified. The search unit 7 obtains the user name "Yamada" included in the specified group as a search result. The output unit 8 outputs the search result. If there are a plurality of pairs including the word "artificial intelligence" and a related score of 0.5 or more, the search unit 7 identifies all the pairs and searches for the user name obtained from each pair. And. Therefore, the number of user names obtained as a search result is not limited to one. The output unit 8 may output a plurality of user names obtained as search results in descending order of the related score.

このように、単語からユーザ名が検索できるので、検索者は、検索キーワードとして指定した単語が表わす分野、技術、プロジェクト等に強く関わったユーザのユーザ名を容易に知ることができる。 Since the user name can be searched from the word in this way, the searcher can easily know the user name of the user who is strongly involved in the field, technology, project, etc. represented by the word designated as the search keyword.

次に、関連スコア算出システム１が、ユーザ名をキーワードとして受け付け、そのユーザ名に応じた単語を検索する処理の例について説明する。 Next, an example of a process in which the related score calculation system 1 accepts a user name as a keyword and searches for a word corresponding to the user name will be described.

まず、キーワード受付部６が、検索者からユーザ名を検索キーワードとして受け付ける。 First, the keyword reception unit 6 accepts a user name as a search keyword from a searcher.

次に、検索部７が、記憶部５に記憶されている、ユーザ名と単語と関連スコアとの組の中から、検索キーワードに該当するユーザ名と、閾値（例えば、０．５）以上の関連スコアを含む組を特定し、その組に含まれている単語を検索結果として特定する。このとき、検索キーワードに該当するユーザ名と、閾値以上の関連スコアを含む組が複数存在するならば、検索部７は、その組を全て特定し、その各組から得られる単語を検索結果とする。従って、検索結果として得られる単語は１つとは限らない。出力部８は、検索結果として得た複数の単語を、関連スコアの高い順に並べて出力してもよい。 Next, the search unit 7 has a user name corresponding to the search keyword and a threshold value (for example, 0.5) or more from the set of the user name, the word, and the related score stored in the storage unit 5. Identify the set that contains the relevant score, and identify the words contained in that set as search results. At this time, if there are a plurality of pairs including the user name corresponding to the search keyword and the related score equal to or higher than the threshold value, the search unit 7 identifies all the pairs, and the word obtained from each pair is used as the search result. do. Therefore, the number of words obtained as a search result is not limited to one. The output unit 8 may output a plurality of words obtained as search results in descending order of the related score.

例えば、キーワード受付部６が検索キーワードとして、「山田」というユーザ名を受け付けたとする。また、上記の閾値が０．５であるとする。この場合、検索部７は、図６に例示する複数の組の中から、「山田」というユーザ名と、０．５以上の関連スコアを含む組を特定する。本例では、図６に示す１番目の組、３番目の組および４番目の組が特定される。検索部７は、特定した各組に含まれる単語を検索結果として得る。すなわち、検索部７は、「人工知能」、「機械学習」および「ディープラーニング」を検索結果として得る。出力部８は、その検索結果を出力する。 For example, it is assumed that the keyword reception unit 6 accepts the user name "Yamada" as a search keyword. Further, it is assumed that the above threshold value is 0.5. In this case, the search unit 7 identifies a group including the user name "Yamada" and a related score of 0.5 or more from the plurality of groups illustrated in FIG. In this example, the first set, the third set, and the fourth set shown in FIG. 6 are specified. The search unit 7 obtains the words included in each specified set as the search result. That is, the search unit 7 obtains "artificial intelligence", "machine learning", and "deep learning" as search results. The output unit 8 outputs the search result.

このように、ユーザ名から単語を検索できるので、検索者は、検索キーワードとして指定したユーザ名を有する人が精通している分野、技術等を容易に推定したり、その人が参加したことがあるプロジェクト等を容易に推定したりすることができる。 In this way, since the word can be searched from the user name, the searcher can easily estimate the field, technology, etc. that the person having the user name specified as the search keyword is familiar with, or that the person participated. It is possible to easily estimate a certain project or the like.

なお、上記の閾値“０．５”は例示であり、閾値は０．５でなくてもよい。また、閾値は、関連スコアの算出方法に応じて定めておけばよい。 The above threshold value “0.5” is an example, and the threshold value does not have to be 0.5. Further, the threshold value may be set according to the calculation method of the related score.

実施形態２．
図７は、本発明の第２の実施形態の関連スコア算出システムの構成例を示すブロック図である。第１の実施形態の関連スコア算出システム（図１参照）や第１の実施形態の変形例（図５参照）に示す構成要素と同様の構成要素については、図１や図５に示す符号と同一の符号を付し、説明を省略する。 Embodiment 2.
FIG. 7 is a block diagram showing a configuration example of the related score calculation system according to the second embodiment of the present invention. The same components as those shown in the related score calculation system of the first embodiment (see FIG. 1) and the modified example of the first embodiment (see FIG. 5) are referred to as reference numerals shown in FIGS. 1 and 5. The same reference numerals are given, and the description thereof will be omitted.

第２の実施形態では、関連スコア算出システム１は、収集部２と、単語抽出部３と、スコア算出部４と、記憶部５と、第１のテーブル生成部１１と、第２のテーブル生成部１２と、キーワード受付部６と、検索部１７と、出力部８とを備える。 In the second embodiment, the related score calculation system 1 includes a collection unit 2, a word extraction unit 3, a score calculation unit 4, a storage unit 5, a first table generation unit 11, and a second table generation. A unit 12, a keyword reception unit 6, a search unit 17, and an output unit 8 are provided.

収集部２、単語抽出部３およびスコア算出部４は、第１の実施形態（図１参照）や、第１の実施形態の変形例（図５参照）で示したそれらの各要素と同様である。 The collection unit 2, the word extraction unit 3, and the score calculation unit 4 are the same as those of the first embodiment (see FIG. 1) and the modified examples of the first embodiment (see FIG. 5). be.

また、キーワード受付部６および出力部８は、第１の実施形態の変形例（図５参照）で示したそれらの各要素と同様である。 Further, the keyword receiving unit 6 and the output unit 8 are the same as the respective elements shown in the modified example (see FIG. 5) of the first embodiment.

記憶部５は、第１の実施形態（図１参照）や、第１の実施形態の変形例（図５参照）における記憶部５と同様である。ただし、本実施形態では、記憶部５は、ユーザ名と単語と関連スコアとの組を複数組記憶するだけでなく、後述の第１のテーブル２１および第２のテーブル２２も記憶する。以下、スコア算出部４が記憶部５に記憶させる、ユーザ名と単語と関連スコアとの組の集合（例えば、図６に例示する複数の組）を関連スコア算出結果２０と記す。 The storage unit 5 is the same as the storage unit 5 in the first embodiment (see FIG. 1) and the modified example of the first embodiment (see FIG. 5). However, in the present embodiment, the storage unit 5 not only stores a plurality of sets of the user name, the word, and the related score, but also stores the first table 21 and the second table 22, which will be described later. Hereinafter, a set of a set of a user name, a word, and a related score (for example, a plurality of sets illustrated in FIG. 6) stored in the storage unit 5 by the score calculation unit 4 is referred to as a related score calculation result 20.

また、本実施形態では、スコア算出部４が、前述の第３の算出方法で関連スコアを算出する場合を例にして説明する。ただし、スコア算出部４は、前述の第１の算出方法または第２の算出方法で関連スコアを算出してもよい。 Further, in the present embodiment, the case where the score calculation unit 4 calculates the related score by the above-mentioned third calculation method will be described as an example. However, the score calculation unit 4 may calculate the related score by the above-mentioned first calculation method or the second calculation method.

以下の説明では、スコア算出部４が、既に関連スコアを算出し、記憶部５に関連スコア算出結果２０を記憶させているものとして説明する。図８は、関連スコア算出結果２０（ユーザ名と単語と関連スコアとの組の集合）の例を示す模式図である。ここでは、図８に例示する関連スコア算出結果２０が記憶部５に記憶されている場合を例にして説明する。 In the following description, it is assumed that the score calculation unit 4 has already calculated the related score and the storage unit 5 stores the related score calculation result 20. FIG. 8 is a schematic diagram showing an example of the related score calculation result 20 (a set of a set of a user name, a word, and a related score). Here, a case where the related score calculation result 20 illustrated in FIG. 8 is stored in the storage unit 5 will be described as an example.

第１のテーブル生成部１１は、関連スコア算出結果２０に基づいて、第１のテーブル２１を生成し、記憶部５に記憶させる。第１のテーブル２１は、ユーザＩＤと、単語と、関連スコアとの関係を記述したテーブルである。より具体的には、第１のテーブル２１は、関連スコア算出結果２０に含まれているユーザ名を縦軸と横軸のうちの一方の軸に並べ、関連スコア算出結果２０に含まれている単語を他方の軸に並べ、ユーザ名と単語とが交差する欄に、そのユーザ名を有するユーザとその単語の関連スコアを記述したテーブルである。 The first table generation unit 11 generates the first table 21 based on the related score calculation result 20, and stores it in the storage unit 5. The first table 21 is a table describing the relationship between the user ID, the word, and the related score. More specifically, in the first table 21, the user names included in the related score calculation result 20 are arranged on one of the vertical axis and the horizontal axis, and are included in the related score calculation result 20. It is a table in which words are arranged on the other axis and the association score of the user having the user name and the word is described in the column where the user name and the word intersect.

本例では、第１のテーブル生成部１１が、第１のテーブルを生成する際に、ユーザ名を縦軸に並べ、単語を横軸に並べる場合を例にして説明するが、第１のテーブル生成部１１は、ユーザ名を横軸に並べ、単語を縦軸に並べてもよい。 In this example, when the first table generation unit 11 generates the first table, the case where the user names are arranged on the vertical axis and the words are arranged on the horizontal axis will be described as an example. The generation unit 11 may arrange the user names on the horizontal axis and the words on the vertical axis.

また、第１のテーブル生成部１１は、ユーザ名を軸に沿って並べる際、同一ユーザのユーザ名を重複させずに並べる。例えば、図８に示す関連スコア算出結果２０において、１番目の組にもユーザ名「山田」が含まれ、２番目の組にもユーザ名「山田」が含まれている。このユーザ名「山田」は、同一ユーザのユーザ名である。従って、第１のテーブル生成部１１は、ユーザ名を軸に沿って並べる際、ユーザ名「山田」を１回並べればよい。 Further, when arranging the user names along the axis, the first table generation unit 11 arranges the user names of the same user without duplication. For example, in the related score calculation result 20 shown in FIG. 8, the user name “Yamada” is also included in the first group, and the user name “Yamada” is also included in the second group. This user name "Yamada" is a user name of the same user. Therefore, when arranging the user names along the axis, the first table generation unit 11 may arrange the user names "Yamada" once.

同様に、第１のテーブル生成部１１は、単語を軸に沿って並べる際、同一の単語を重複させずに並べる。例えば、図８に示す関連スコア算出結果２０において、複数の組で「人工知能」という単語が含まれている。しかし、第１のテーブル生成部１１は、単語を軸に沿って並べる際、「人工知能」という単語を一回並べればよい。 Similarly, when arranging the words along the axis, the first table generation unit 11 arranges the same words without duplication. For example, in the related score calculation result 20 shown in FIG. 8, the word "artificial intelligence" is included in a plurality of sets. However, the first table generation unit 11 may arrange the word "artificial intelligence" once when arranging the words along the axis.

第１のテーブル生成部１１によって生成される第１のテーブルの例を、図９に示す。第１のテーブル生成部１１は、関連スコア算出結果２０に含まれているユーザ名（「山田」、「鈴木」、「田中」等）を、縦軸の方向に沿って並べる（図９参照）。また、第１のテーブル生成部１１は、関連スコア算出結果２０に含まれている単語（「人工知能」、「Ａ社」、「機械学習」、「ディープラーニング」等）を、横軸の方向に沿って並べる（図９参照）。 An example of the first table generated by the first table generation unit 11 is shown in FIG. The first table generation unit 11 arranges the user names (“Yamada”, “Suzuki”, “Tanaka”, etc.) included in the related score calculation result 20 along the direction of the vertical axis (see FIG. 9). .. Further, the first table generation unit 11 sets the words (“artificial intelligence”, “company A”, “machine learning”, “deep learning”, etc.) included in the related score calculation result 20 in the direction of the horizontal axis. Arrange along (see FIG. 9).

そして、第１のテーブル生成部１１は、ユーザ名と単語とが交差する欄に、そのユーザ名を有するユーザとその単語の関連スコアを記述する。例えば、図８に示す例で、ユーザ名「山田」と単語「人工知能」の関連スコアは、“０．８０”である。従って、第１のテーブル生成部１１は、第１のテーブル２１において、「山田」と「人工知能」とが交差する欄に“０．８０”を記述する（図９参照）。また、例えば、図８に示す例で、ユーザ名「山田」と単語「Ａ社」の関連スコアは、“０．１０”である。従って、第１のテーブル生成部１１は、第１のテーブル２１において、「山田」と「Ａ社」とが交差する欄に“０．１０”を記述する。第１のテーブル生成部１１は、ユーザ名と単語とが交差する欄毎に、同様に、関連スコアを記述する。 Then, the first table generation unit 11 describes the association score between the user having the user name and the word in the column where the user name and the word intersect. For example, in the example shown in FIG. 8, the association score between the user name “Yamada” and the word “artificial intelligence” is “0.80”. Therefore, the first table generation unit 11 describes "0.80" in the column where "Yamada" and "artificial intelligence" intersect in the first table 21 (see FIG. 9). Further, for example, in the example shown in FIG. 8, the related score of the user name “Yamada” and the word “Company A” is “0.10”. Therefore, the first table generation unit 11 describes "0.10" in the column where "Yamada" and "Company A" intersect in the first table 21. The first table generation unit 11 similarly describes the related score for each column where the user name and the word intersect.

第１のテーブル生成部１１は、上記のようにして生成した第１のテーブル２１を、記憶部５に記憶させる。 The first table generation unit 11 stores the first table 21 generated as described above in the storage unit 5.

第２のテーブル生成部１２は、第１のテーブル２１に基づいて、第２のテーブル２２を生成し、記憶部５に記憶させる。第２のテーブルは、第１のテーブルに基づいて算出した単語同士の関連の強さを表す関連度を記述したテーブルである。より具体的には、第２のテーブル２２は、縦軸と横軸の両方に単語を並べ、単語同士が交差する欄に、第１のテーブル２１に基づいて算出したその単語同士の関連の強さを表す関連度を記述したテーブルである。既に説明したように、本発明では、ユーザと単語との関連の強さを表す指標値を「関連スコア」と称し、単語同士の関連の強さを表す指標値を「関連度」と称することによって、２種類の指標値を区別する。 The second table generation unit 12 generates a second table 22 based on the first table 21, and stores it in the storage unit 5. The second table is a table describing the degree of relevance representing the strength of the relevance between words calculated based on the first table. More specifically, in the second table 22, words are arranged on both the vertical axis and the horizontal axis, and in the column where the words intersect, the strength of the relationship between the words calculated based on the first table 21 is strong. It is a table that describes the degree of relevance that represents the value. As described above, in the present invention, the index value indicating the strength of the relationship between the user and the word is referred to as "relationship score", and the index value indicating the strength of the relationship between words is referred to as "relevance degree". Distinguish between two types of index values.

図１０は、第２のテーブルの例を示す模式図である。本例では、第２のテーブル生成部１２が、縦軸、横軸それぞれに、第１のテーブル２１と同じ順番で単語を並べる場合を例にして説明する。例えば、図９に例示する第１のテーブルでは、単語が、「人工知能」、「Ａ社」、「機械学習」、「ディープラーニング」、・・・の順に並べられている。第２のテーブル生成部１２は、第２のテーブルの縦軸、横軸それぞれにおいても、その順番と同じ順番で単語を並べる（図１０参照）。 FIG. 10 is a schematic diagram showing an example of the second table. In this example, a case where the second table generation unit 12 arranges words in the same order as the first table 21 on each of the vertical axis and the horizontal axis will be described as an example. For example, in the first table illustrated in FIG. 9, words are arranged in the order of "artificial intelligence", "company A", "machine learning", "deep learning", and so on. The second table generation unit 12 arranges words in the same order on the vertical axis and the horizontal axis of the second table (see FIG. 10).

そして、第２のテーブル生成部１２は、１つの単語と１つの単語との組合せ毎に、その単語同士の関連の強さを表す関連度を算出し、その単語同士が交差する欄に、その関連度を記述する。なお、第２のテーブル生成部１２は、同一の単語同士に関しても、関連度を算出する。例えば、第２のテーブル生成部１２は、「人工知能」と「人工知能」の関連度も算出する。 Then, the second table generation unit 12 calculates the degree of relevance indicating the strength of the relevance between the words for each combination of one word and one word, and puts the relevance in the column where the words intersect. Describe the degree of relevance. The second table generation unit 12 calculates the degree of relevance even for the same words. For example, the second table generation unit 12 also calculates the degree of relevance between "artificial intelligence" and "artificial intelligence".

ここで、単語同士の関連度について説明する。まず、図９に例示する第１のテーブ２１において、「人工知能」と「機械学習」という２つの単語に着目した場合について説明する。 Here, the degree of relevance between words will be described. First, in the first power strip 21 illustrated in FIG. 9, a case where two words "artificial intelligence" and "machine learning" are focused on will be described.

「人工知能」と個々のユーザの関連スコアを、第１のテーブル２１におけるユーザ名順に並べると、以下のようになる。 The association scores of "artificial intelligence" and individual users are arranged in the order of user names in the first table 21 as follows.

０．８０，０．２０，０．６０，０．４３，・・・ 0.80, 0.20, 0.60, 0.43, ...

また、「機械学習」と個々のユーザの関連スコアを、第１のテーブル２１におけるユーザ名順に並べると、以下のようになる。 Further, the related scores of "machine learning" and each user are arranged in the order of user names in the first table 21 as follows.

０．７９，０．２２，０．５８，０．４５，・・・ 0.79, 0.22, 0.58, 0.45, ...

上記の関連スコアの並びにおける関連スコアの変化の傾向は似ていると言える。この場合、「人工知能」と「機械学習」の関連度は高いことになる。 It can be said that the tendency of the change of the related score in the above-mentioned sequence of the related score is similar. In this case, the degree of relevance between "artificial intelligence" and "machine learning" is high.

次に、「人工知能」と「Ａ社」という２つの単語に着目した場合について説明する。 Next, a case where the two words "artificial intelligence" and "company A" are focused on will be described.

前述のように、「人工知能」と個々のユーザの関連スコアを、第１のテーブル２１におけるユーザ名順に並べると、以下のようになる。 As described above, the related scores of "artificial intelligence" and individual users are arranged in the order of user names in the first table 21 as follows.

また、「Ａ社」と個々のユーザの関連スコアを、第１のテーブル２１におけるユーザ名順に並べると、以下のようになる。 Further, the related scores of "Company A" and each user are arranged in the order of user names in the first table 21 as follows.

０．１０，０．４０，０．３５，０．０５，・・・ 0.10, 0.40, 0.35, 0.05, ...

上記の関連スコアの並びにおける関連スコアの変化の傾向は似ていないと言える。この場合、「人工知能」と「Ａ社」の関連度は低いことになる。 It can be said that the tendency of the change of the related score in the above-mentioned sequence of related scores is not similar. In this case, the degree of relevance between "artificial intelligence" and "company A" is low.

第２のテーブル生成部１２は、単語同士の関連度として、単語同士の相関係数を算出すればよい。ここでは、単語ｗ_１と単語ｗ_２の関連度として、単語ｗ_１と単語ｗ_２の相関係数を算出する場合について説明する。単語ｗ_１と単語ｗ_２の相関係数は、より具体的には、単語ｗ_１と個々のユーザの関連スコアの並びと、単語ｗ_２と個々のユーザの関連スコアの並びとの相関関数である。 The second table generation unit 12 may calculate the correlation coefficient between words as the degree of relevance between words. Here, as relevance of a word w ₁ and word w _2, it will be described for calculating a correlation coefficient of words w ₁ and word w _2. Correlation coefficient of words w ₁ and word w _2, more specifically, the arrangement of words w ₁ and related scores for individual users, the correlation function of the sequence of words w ₂ and associated scores for individual users be.

１つの単語と個々のユーザの関連スコアを、第１のテーブル２１におけるユーザ名順に並べた場合における関連スコア（数値）の並びを、その単語の系列と称することとする。例えば、図９に示す例において、「人工知能」の系列は、以下のようになる。 The sequence of related scores (numerical values) when the related scores of one word and each user are arranged in the order of user names in the first table 21 is referred to as a sequence of the words. For example, in the example shown in FIG. 9, the series of "artificial intelligence" is as follows.

単語ｗ_１の系列が（ｘ_１，ｘ_２，・・・，ｘ_ｎ）であるとする。そして、この系列をｘとする。なお、図９に示す例では、単語の系列は、数値を縦方向に並べたものであるが、ここでは、便宜的に、（ｘ_１，ｘ_２，・・・，ｘ_ｎ）と横に並べて示す。この点は、次に述べる単語ｗ_２についても同様である。 It is assumed that the sequence of the word w ₁ _{is (x 1} , x ₂ , ..., X _n). Then, let x be this series. In the example shown in FIG. 9, the word sequence is a series of words arranged in the vertical direction, but here, for convenience, (x ₁ , x ₂ , ..., X _n ) and horizontally. Shown side by side. This is also true for the word w ₂ described next.

また、単語ｗ_２の系列が（ｙ_１，ｙ_２，・・・，ｙ_ｎ）であるとする。そして、この系列をｙとする。 Further, it is assumed that the sequence of _{words w 2} _{is (y 1} , y ₂ , ..., Y _n). Then, let y be this series.

上記のように系列に属する関連スコアの数をｎ個とする。 As described above, the number of related scores belonging to the series is n.

第２のテーブル生成部１２は、単語ｗ_１と単語ｗ_２の関連度として、ｘとｙの相関係数を算出すればよい。ｘとｙの相関係数をｒとする。第２のテーブル生成部１２は、以下に示す式（４）の計算により、ｘとｙの相関係数ｒを算出する。 Second table generator 12, a relevance of words w ₁ and word w _2, may be calculated a correlation coefficient of x and y. Let r be the correlation coefficient between x and y. The second table generation unit 12 calculates the correlation coefficient r between x and y by the calculation of the following equation (4).

式（４）において、ｓ_ｘｙは、ｘとｙの共分散である。また、ｓ_ｘは、ｘの標準偏差であり、ｓ_ｙは、ｙの標準偏差である。ｘ_ｉは、ｘにおけるｉ番目の関連スコアである。ｙ_ｉは、ｙにおけるｉ番目の関連スコアである。 In equation (4), s _xy is a covariance of x and y. Further, s _x is the standard deviation of x, and _sy is the standard deviation of y. x _i is the i-th association score in x. y _i is the i-th association score in y.

は、ｘの平均値である。

Is the average value of x.

は、ｙの平均値である。

Is the average value of y.

第２のテーブル生成部１２は、１つの単語と１つの単語との組合せ毎に、式（４）の計算により、相関係数を算出し、その相関係数を関連度として、第２のテーブル２２に記述する。組み合わせをなす２つの単語が異なる単語である場合、その２つの単語の関連度は、第２のテーブル２２において、２箇所に記述される。例えば、図１０に示す例において、「人工知能」と「Ａ社」の関連度は、第１行第２列と、第２行第１列にそれぞれ記述される。 The second table generation unit 12 calculates the correlation coefficient by the calculation of the equation (4) for each combination of one word and one word, and uses the correlation coefficient as the degree of relevance as the second table. 22 is described. When two words in a combination are different words, the degree of relevance of the two words is described in two places in the second table 22. For example, in the example shown in FIG. 10, the degree of relevance between "artificial intelligence" and "company A" is described in the first row and the second column and the second row and the first column, respectively.

第２のテーブル生成部１２は、上記のようにして生成した第２のテーブル２２を、記憶部５に記憶させる。 The second table generation unit 12 stores the second table 22 generated as described above in the storage unit 5.

第２の実施形態の関連スコア算出システム１は、単語を検索キーワードとして受け付け、その単語に応じたユーザのユーザ名を検索する。あるいは、第２の実施形態の関連スコア算出システム１は、ユーザ名を検索キーワードとして受け付け、そのユーザ名に応じた単語を検索する。また、第２の実施形態の関連スコア算出システム１は、上記の２種類の検索をそれぞれ実行可能であってもよい。 The related score calculation system 1 of the second embodiment accepts a word as a search keyword and searches for the user name of the user corresponding to the word. Alternatively, the related score calculation system 1 of the second embodiment accepts the user name as a search keyword and searches for a word corresponding to the user name. Further, the related score calculation system 1 of the second embodiment may be capable of executing each of the above two types of searches.

検索者に指定された単語に応じたユーザのユーザ名を検索する場合、キーワード受付部６が、検索者から単語を検索キーワードとして受け付ける。そして、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、検索キーワードに該当する単語に応じたユーザ名を検索する。 When searching for a user name of a user corresponding to a word designated as a searcher, the keyword receiving unit 6 accepts the word from the searcher as a search keyword. Then, the search unit 17 searches for the user name corresponding to the word corresponding to the search keyword based on the first table 21 and the second table 22.

また、検索者に指定されたユーザ名に応じた単語を検索する場合、キーワード受付部６が、検索者からユーザ名を検索キーワードとして受け付ける。そして、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、検索キーワードに該当するユーザ名に応じた単語を検索する。 Further, when searching for a word corresponding to the user name designated by the searcher, the keyword receiving unit 6 accepts the user name from the searcher as a search keyword. Then, the search unit 17 searches for a word corresponding to the user name corresponding to the search keyword based on the first table 21 and the second table 22.

第２の実施形態において、収集部２、キーワード受付部６および出力部８は、関連スコア算出プログラムに従って動作するコンピュータのＣＰＵおよびそのコンピュータの通信インタフェースによって実現される。例えば、ＣＰＵが、コンピュータのプログラム記憶装置等のプログラム記録媒体から関連スコア算出プログラムを読み込み、関連スコア算出プログラムに従って、通信インタフェースを用いて、収集部２、キーワード受付部６および出力部８として動作すればよい。また、単語抽出部３、スコア算出部４、第１のテーブル生成部１１、第２のテーブル生成部１２および検索部１７も、例えば、関連スコア算出プログラムに従って動作する上記のコンピュータのＣＰＵによって実現される。すなわち、上記のように、関連スコア算出プログラムを読み込んだＣＰＵが、関連スコア算出プログラムに従って、単語抽出部３、スコア算出部４、第１のテーブル生成部１１、第２のテーブル生成部１２および検索部１７として動作すればよい。また、収集部２、キーワード受付部６、出力部８、単語抽出部３、スコア算出部４、第１のテーブル生成部１１、第２のテーブル生成部１２および検索部１７がそれぞれ別々のハードウェアによって実現されてもよい。 In the second embodiment, the collection unit 2, the keyword reception unit 6, and the output unit 8 are realized by the CPU of the computer operating according to the related score calculation program and the communication interface of the computer. For example, the CPU reads a related score calculation program from a program recording medium such as a program storage device of a computer, and operates as a collecting unit 2, a keyword receiving unit 6, and an output unit 8 using a communication interface according to the related score calculation program. Just do it. Further, the word extraction unit 3, the score calculation unit 4, the first table generation unit 11, the second table generation unit 12, and the search unit 17 are also realized by, for example, the CPU of the above computer operating according to the related score calculation program. To. That is, as described above, the CPU that has read the related score calculation program performs the word extraction unit 3, the score calculation unit 4, the first table generation unit 11, the second table generation unit 12, and the search according to the related score calculation program. It may operate as a unit 17. Further, the collection unit 2, the keyword reception unit 6, the output unit 8, the word extraction unit 3, the score calculation unit 4, the first table generation unit 11, the second table generation unit 12, and the search unit 17 have separate hardware. May be realized by.

図１１は、第２の実施形態の処理経過の例を示すフローチャートである。第１の実施形態で説明した動作と同様の動作や、第２の実施形態で既に説明した動作については、詳細な説明を省略する。 FIG. 11 is a flowchart showing an example of the processing progress of the second embodiment. Detailed description of the operation similar to the operation described in the first embodiment and the operation already described in the second embodiment will be omitted.

ステップＳ１〜ステップＳ３は、第１の実施形態におけるステップＳ１〜Ｓ３（図４参照）と同様であり、説明を省略する。 Steps S1 to S3 are the same as steps S1 to S3 (see FIG. 4) in the first embodiment, and the description thereof will be omitted.

ステップＳ３の次に、第１のテーブル生成部１１は、関連スコア算出結果２０に基づいて、第１のテーブル２１（図９参照）を生成し、第１のテーブル２１を記憶部５に記憶させる（ステップＳ４）。第１のテーブル２１を生成する動作については、既に説明したので、ここでは説明を省略する。 After step S3, the first table generation unit 11 generates the first table 21 (see FIG. 9) based on the related score calculation result 20, and stores the first table 21 in the storage unit 5. (Step S4). Since the operation of generating the first table 21 has already been described, the description thereof will be omitted here.

次に、第２のテーブル生成部１２は、第１のテーブル２１に基づいて、第２のテーブル２２（図１０参照）を生成し、第２のテーブル２２を記憶部５に記憶させる（ステップＳ５）。第２のテーブル２２を生成する動作についても、既に説明したので、ここでは説明を省略する。 Next, the second table generation unit 12 generates a second table 22 (see FIG. 10) based on the first table 21, and stores the second table 22 in the storage unit 5 (step S5). ). Since the operation of generating the second table 22 has already been described, the description thereof will be omitted here.

第１のテーブル２１および第２のテーブル２２が生成された後に、キーワード受付部６は、検索者から、検索キーワードを受け付ける（ステップＳ６）。キーワード受付部６は、検索キーワードとして、単語を受け付けてもよい。また、キーワード受付部６は、検索キーワードとして、ユーザ名を受け付けてもよい。 After the first table 21 and the second table 22 are generated, the keyword receiving unit 6 receives the search keyword from the searcher (step S6). The keyword reception unit 6 may accept a word as a search keyword. Further, the keyword receiving unit 6 may accept a user name as a search keyword.

次に、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、検索キーワードに応じた検索結果を求める（ステップＳ７）。検索キーワードが単語である場合、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、その単語に応じたユーザ名を検索する。また、検索キーワードがユーザ名である場合、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、そのユーザ名に応じた単語を検索する。ステップＳ７の動作の詳細については、後述する。 Next, the search unit 17 obtains search results according to the search keyword based on the first table 21 and the second table 22 (step S7). When the search keyword is a word, the search unit 17 searches for a user name corresponding to the word based on the first table 21 and the second table 22. When the search keyword is a user name, the search unit 17 searches for a word corresponding to the user name based on the first table 21 and the second table 22. The details of the operation of step S7 will be described later.

ステップＳ７の後、出力部８が検索結果を出力する（ステップＳ８）。出力部８が検索結果を出力する態様は、第１の実施形態の変形例と同様である。 After step S7, the output unit 8 outputs the search result (step S8). The mode in which the output unit 8 outputs the search result is the same as the modification of the first embodiment.

次に、ステップＳ７の動作について説明する。まず、ステップＳ６において、キーワード受付部６が選択キーワードとして単語を受け付け、検索部１７が単語に応じたユーザ名を検索する場合について説明する。以下、検索キーワードに該当する単語を、キーワード単語と記す。 Next, the operation of step S7 will be described. First, in step S6, a case where the keyword receiving unit 6 accepts a word as a selection keyword and the search unit 17 searches for a user name corresponding to the word will be described. Hereinafter, the word corresponding to the search keyword is referred to as a keyword word.

検索部１７は、個々のユーザ名を順次選択し、選択したユーザ名（以下、選択ユーザ名と記す。）とキーワード単語の検索スコアを算出する。検索スコアは、選択ユーザ名を有するユーザと単語の関連の強さを示す指標値であるが、既に説明した関連スコアを用いて算出され、関連スコアとは算出方法が異なる。そのため、以下の説明では、関連スコアと区別して、検索スコアという語を用いる。 The search unit 17 sequentially selects individual user names, and calculates a search score for the selected user name (hereinafter referred to as a selected user name) and a keyword word. The search score is an index value indicating the strength of the association between the user having the selected user name and the word, but it is calculated using the association score already described, and the calculation method is different from the association score. Therefore, in the following description, the term search score is used to distinguish it from the related score.

検索部１７が、１つの選択ユーザ名を選択しているとする。検索部１７が選択ユーザ名とキーワード単語の検索スコアを算出する動作について説明する。図１２は、選択ユーザとキーワード単語の検索スコアを算出する処理の例を示すフローチャートである。 It is assumed that the search unit 17 has selected one selected user name. The operation of the search unit 17 to calculate the search score of the selected user name and the keyword word will be described. FIG. 12 is a flowchart showing an example of a process of calculating the search score of the selected user and the keyword word.

まず、検索部１７は、選択ユーザ名の検索スコアの値を０に初期化する（ステップＳ１１）。 First, the search unit 17 initializes the value of the search score of the selected user name to 0 (step S11).

次に、検索部１７は、第１のテーブルの軸（第２テーブルの軸でもよい。）に並べられている単語の中から、未だステップＳ１２で選択されていない単語を１つ選択する（ステップＳ１２）。ステップＳ１２で選択した単語を、以下、選択単語と記す。なお、選択単語がキーワード単語と同一である場合もあり得る。 Next, the search unit 17 selects one word that has not yet been selected in step S12 from the words arranged on the axis of the first table (may be the axis of the second table) (step). S12). The word selected in step S12 is hereinafter referred to as a selected word. The selected word may be the same as the keyword word.

次に、検索部１７は、キーワード単語と選択単語の関連度と、選択ユーザ名と選択単語の関連スコアとの積を算出する（ステップＳ１３）。検索部１７は、ステップＳ１３で用いる関連度を第２のテーブルから読み込み、ステップＳ１３で用いる関連スコアを第１のテーブルから読み込めばよい。 Next, the search unit 17 calculates the product of the degree of relevance between the keyword word and the selected word and the relevance score between the selected user name and the selected word (step S13). The search unit 17 may read the relevance degree used in step S13 from the second table and the relevance score used in step S13 from the first table.

次に、検索部１７は、ステップＳ１３で算出した積を、検索スコアに加算する（ステップＳ１４）。 Next, the search unit 17 adds the product calculated in step S13 to the search score (step S14).

次に、検索部１７は、ステップＳ１２で選択されていない単語があるか否かを判定する（ステップＳ１５）。未選択の単語がある場合（ステップＳ１５のＹｅｓ）、検索部１７は、ステップＳ１２以降の処理を繰り返す。 Next, the search unit 17 determines whether or not there is a word not selected in step S12 (step S15). If there is an unselected word (Yes in step S15), the search unit 17 repeats the processes after step S12.

未選択の単語がない場合（ステップＳ１５のＮｏ）、検索部１７は、その時点における検索スコアの値を、選択ユーザ名とキーワード単語の検索スコアとして確定し、処理を終了する。 When there is no unselected word (No in step S15), the search unit 17 determines the value of the search score at that time as the search score of the selected user name and the keyword word, and ends the process.

上記の検索スコアの算出処理は、以下の式（５）で表すことができる。 The above search score calculation process can be expressed by the following equation (5).

式（５）において、単語ｉは、ｉ番目に選択された選択単語を意味する。 In equation (5), the word i means the i-th selected selected word.

例えば、図９に示す第１のテーブル２１と図１０に示す第２のテーブル２２が記憶部５に記憶されているとする。そして、キーワード単語（検索キーワードに該当する単語）が「人工知能」であり、選択ユーザ名が「山田」であるとする。この場合、検索部１７は、以下の式によって、選択ユーザ名「山田」とキーワード単語「人工知能」の検索スコアを算出する（図９、図１０を参照）。 For example, it is assumed that the first table 21 shown in FIG. 9 and the second table 22 shown in FIG. 10 are stored in the storage unit 5. Then, it is assumed that the keyword word (word corresponding to the search keyword) is "artificial intelligence" and the selected user name is "Yamada". In this case, the search unit 17 calculates the search score of the selected user name "Yamada" and the keyword word "artificial intelligence" by the following formula (see FIGS. 9 and 10).

検索スコア＝1.00×0.80＋0.07×0.10＋0.87×0.79＋0.79×0.82＋・・・ Search score = 1.00 x 0.80 + 0.07 x 0.10 + 0.87 x 0.79 + 0.79 x 0.82 + ...

また、選択ユーザ名が「鈴木」であるとする。この場合、検索部１７は、以下の式によって、選択ユーザ名「鈴木」とキーワード単語「人工知能」の検索スコアを算出する（図９、図１０を参照）。 Further, it is assumed that the selected user name is "Suzuki". In this case, the search unit 17 calculates the search score of the selected user name "Suzuki" and the keyword word "artificial intelligence" by the following formula (see FIGS. 9 and 10).

検索スコア＝1.00×0.20＋0.07×0.40＋0.87×0.22＋0.79×0.18＋・・・ Search score = 1.00 x 0.20 + 0.07 x 0.40 + 0.87 x 0.22 + 0.79 x 0.18 + ...

検索部１７は、第１のテーブル２１に記述されているユーザ名毎に、上記の処理によって検索スコアを得る。そして、検索部１７は、検索スコアが閾値以上になっているユーザ名を検索結果として得る。従って、検索結果として得られるユーザ名は、複数個となり得る。出力部８は、検索結果として得られたユーザ名を出力する。出力部８は、検索結果として得た複数のユーザ名を、検索スコアの高い順に並べて出力してもよい。 The search unit 17 obtains a search score by the above processing for each user name described in the first table 21. Then, the search unit 17 obtains a user name whose search score is equal to or higher than the threshold value as a search result. Therefore, the number of user names obtained as a search result may be plural. The output unit 8 outputs the user name obtained as the search result. The output unit 8 may output a plurality of user names obtained as search results in descending order of search score.

この検索方法では、単語同士の関連度を示す第２のテーブルも用いている。従って、検索者が検索キーワードとして指定した単語との関連度が高い別の単語との関連が強いユーザのユーザ名も検索結果として得ることができる。 This search method also uses a second table showing the degree of relevance between words. Therefore, the user name of a user who is strongly related to another word which is highly related to the word specified by the searcher as the search keyword can also be obtained as the search result.

例えば、各ユーザが属している会社において、「レッドロケッツ」および「グリーンロケッツ」が重要な製品の製品名であり、その二つの製品の関連性が強いとする。この場合、上記の検索方法によれば、「レッドロケッツ」を検索キーワードにした場合であっても、「レッドロケッツ」と関連の強いユーザのユーザ名だけでなく、「レッドロケッツ」と関連性のある「グリーンロケッツ」と関連の強いユーザのユーザ名も検索結果として得ることができる。従って、検索者が検索キーワードとして指定した単語に基づいて、ユーザ名を幅広く検索することができる。 For example, in the company to which each user belongs, "Red Rockets" and "Green Rockets" are product names of important products, and the two products are strongly related. In this case, according to the above search method, even if "Red Rockets" is used as the search keyword, not only the user name of the user who is strongly related to "Red Rockets" but also the user name related to "Red Rockets". The user name of a user who is strongly related to a certain "Green Rockets" can also be obtained as a search result. Therefore, it is possible to search a wide range of user names based on the words specified by the searcher as the search keyword.

また、本実施形態では、会社に設けられたＰＣ９１から収集した操作ログに含まれる単語を用いて、第２のテーブルを生成する。従って、上記に例示したような「レッドロケッツ」および「グリーンロケッツ」等のその会社独自で用いることの多い単語を、第２のテーブルに含めることができる。仮に、第１のテーブルや第２のテーブルを人手で作成する場合、膨大な手間がかかるだけでなく、組織独自で用いる単語等は、第１のテーブルおよび第２のテーブルから漏れやすい。従って、組織独自で用いる単語も漏らさずに、第１のテーブルや第２のテーブルを容易に作成することができ、さらに上記のように、検索キーワードとして指定した単語に基づいて、ユーザ名を幅広く検索することができる。 Further, in the present embodiment, the second table is generated by using the words included in the operation log collected from the PC 91 provided in the company. Therefore, words often used by the company such as "Red Rockets" and "Green Rockets" as exemplified above can be included in the second table. If the first table and the second table are manually created, not only is it time-consuming, but also words and the like used by the organization are likely to leak from the first table and the second table. Therefore, it is possible to easily create the first table and the second table without leaking the words used by the organization, and as described above, a wide range of user names are used based on the words specified as the search keywords. You can search.

次に、ステップＳ６において、キーワード受付部６が選択キーワードとしてユーザ名を受け付け、検索部１７がユーザ名に応じた単語を検索する場合について説明する。以下、検索キーワードに該当するユーザ名を、キーワードユーザ名と記す。 Next, in step S6, a case where the keyword receiving unit 6 accepts a user name as a selection keyword and the search unit 17 searches for a word corresponding to the user name will be described. Hereinafter, the user name corresponding to the search keyword is referred to as a keyword user name.

検索部１７は、個々の単語を順次選択し、選択した単語とキーワードユーザ名の検索スコアを算出する。この選択された単語をスコア算出対象単語と記す。また、１つのスコア算出対象単語を選択した後にも、後述の説明で示すように、別途、単語を順次選択する（後述の図１３におけるステップＳ２２を参照）。後述のステップＳ２２で選択される単語を、選択単語と記す。 The search unit 17 sequentially selects individual words and calculates a search score for the selected word and the keyword user name. This selected word is referred to as a score calculation target word. Further, even after selecting one score calculation target word, the words are separately sequentially selected as described later (see step S22 in FIG. 13 described later). The word selected in step S22 described later is referred to as a selected word.

検索部１７が、１つのスコア算出対象単語を選択しているとする。以下に、検索部１７がスコア算出対象単語とキーワードユーザ名の検索スコアを算出する動作について説明する。図１３は、スコア算出対象単語とキーワードユーザ名の検索スコアを算出する処理の例を示すフローチャートである。 It is assumed that the search unit 17 has selected one score calculation target word. The operation of the search unit 17 to calculate the search score of the score calculation target word and the keyword user name will be described below. FIG. 13 is a flowchart showing an example of a process of calculating the search score of the score calculation target word and the keyword user name.

まず、検索部１７は、スコア算出対象単語の検索スコアの値を０に初期化する（ステップＳ２１）。 First, the search unit 17 initializes the value of the search score of the score calculation target word to 0 (step S21).

次に、検索部１７は、第１のテーブルの軸（第２テーブルの軸でもよい。）に並べられている単語の中から、未だステップＳ２２で選択されていない単語を１つ選択する（ステップＳ２２）。既に述べたように、ステップＳ２２で選択した単語を、選択単語と記す。なお、選択単語がスコア算出対象単語と同一である場合もあり得る。 Next, the search unit 17 selects one word that has not yet been selected in step S22 from the words arranged on the axis of the first table (may be the axis of the second table) (step). S22). As described above, the word selected in step S22 is referred to as a selected word. The selected word may be the same as the score calculation target word.

次に、検索部１７は、キーワードユーザ名と選択単語の関連スコアと、スコア算出対象単語と選択単語の関連度との積を算出する（ステップＳ２３）。検索部１７は、ステップＳ２３で用いる関連スコアを第１のテーブルから読み込み、ステップＳ２３で用いる関連度を第２のテーブルから読み込めばよい。 Next, the search unit 17 calculates the product of the relevance score between the keyword user name and the selected word and the relevance degree between the score calculation target word and the selected word (step S23). The search unit 17 may read the association score used in step S23 from the first table and the association degree used in step S23 from the second table.

次に、検索部１７は、ステップＳ２３で算出した積を、検索スコアに加算する（ステップＳ２４）。 Next, the search unit 17 adds the product calculated in step S23 to the search score (step S24).

次に、検索部１７は、ステップＳ２２で選択されていない単語があるか否かを判定する（ステップＳ２５）。未選択の単語がある場合（ステップＳ２５のＹｅｓ）、検索部１７は、ステップＳ２２以降の処理を繰り返す。 Next, the search unit 17 determines whether or not there is a word not selected in step S22 (step S25). If there is an unselected word (Yes in step S25), the search unit 17 repeats the processes after step S22.

未選択の単語がない場合（ステップＳ２５のＮｏ）、検索部１７は、その時点における検索スコアの値を、スコア算出対象単語とキーワードユーザ名の検索スコアとして確定し、処理を終了する。 When there is no unselected word (No in step S25), the search unit 17 determines the value of the search score at that time as the search score of the word to be scored and the keyword user name, and ends the process.

上記の検索スコアの算出処理は、以下の式（６）で表すことができる。 The above search score calculation process can be expressed by the following equation (6).

式（６）において、単語ｉは、ｉ番目に選択された選択単語を意味する。 In equation (6), the word i means the i-th selected selected word.

例えば、図９に示す第１のテーブル２１と図１０に示す第２のテーブル２２が記憶部５に記憶されているとする。そして、キーワードユーザ名（検索キーワードに該当するユーザ名）が「山田」であり、スコア算出対象単語が「人工知能」であるとする。この場合、検索部１７は、以下の式によって、スコア算出対象単語「人工知能」とキーワードユーザ名「山田」の検索スコアを算出する（図９、図１０を参照）。 For example, it is assumed that the first table 21 shown in FIG. 9 and the second table 22 shown in FIG. 10 are stored in the storage unit 5. Then, it is assumed that the keyword user name (user name corresponding to the search keyword) is "Yamada" and the score calculation target word is "artificial intelligence". In this case, the search unit 17 calculates the search score of the score calculation target word "artificial intelligence" and the keyword user name "Yamada" by the following formula (see FIGS. 9 and 10).

検索スコア＝0.08×1.00＋0.10×0.07＋0.79×0.87＋0.82×0.79＋・・・ Search score = 0.08 x 1.00 + 0.10 x 0.07 + 0.79 x 0.87 + 0.82 x 0.79 + ...

また、スコア算出対象単語が「Ａ社」であるとする。この場合、検索部１７は、以下の式によって、スコア算出対象単語が「Ａ社」とキーワードユーザ名「山田」の検索スコアを算出する（図９、図１０を参照）。 Further, it is assumed that the word for which the score is calculated is "Company A". In this case, the search unit 17 calculates the search score for the score calculation target word "Company A" and the keyword user name "Yamada" by the following formula (see FIGS. 9 and 10).

検索スコア＝0.08×0.07＋0.10×1.00＋0.79×0.09＋0.82×0.11＋・・・ Search score = 0.08 x 0.07 + 0.10 x 1.00 + 0.79 x 0.09 + 0.82 x 0.11 + ...

検索部１７は、順次選択するスコア算出対象単語毎に、上記の処理によって検索スコアを得る。そして、そして、検索部１７は、検索スコアが閾値以上になっている単語を検索結果として得る。従って、検索結果として得られる単語は、複数個となり得る。出力部８は、検索結果として得られた単語を出力する。出力部８は、検索結果として得た複数の単語を、検索スコアの高い順に並べて出力してもよい。 The search unit 17 obtains a search score by the above processing for each score calculation target word to be sequentially selected. Then, the search unit 17 obtains a word whose search score is equal to or higher than the threshold value as a search result. Therefore, the number of words obtained as a search result may be plural. The output unit 8 outputs the word obtained as the search result. The output unit 8 may output a plurality of words obtained as search results in descending order of search score.

上記のようにユーザ名から単語を検索する検索方法においても、単語同士の関連度を示す第２のテーブルも用いている。従って、検索者が指定したキーワードユーザ名が表わすユーザ名との関連度が高い単語だけでなく、その単語との関連が強い別の単語も検索結果として得ることができる。 In the search method for searching for a word from a user name as described above, a second table showing the degree of relevance between words is also used. Therefore, not only a word having a high degree of relevance to the user name represented by the keyword user name designated by the searcher but also another word having a strong relevance to the word can be obtained as a search result.

例えば、前述の例のように、各ユーザが属している会社において、「レッドロケッツ」および「グリーンロケッツ」が重要な製品の製品名であり、その二つの製品の関連性が強いとする。そして、ユーザ「山田」と単語「レッドロケッツ」との関連が強いとする。この場合、検索者がキーワードユーザ名として「山田」を指定した場合、「山田」と関連の強い単語「レッドロケッツ」だけでなく、単語「レッドロケッツ」と関連の強い「グリーンロケッツ」も検索結果として得ることができる。「レッドロケッツ」と「グリーンロケッツ」との関連が強いので、「レッドロケッツ」との関連が強い「山田」は、「グリーンロケッツ」とも関連が強いと考えられる。上記の方法によれば、検索キーワードとして指定されたユーザ名が示すユーザと関連が強いと考えられる単語を幅広く検索することができる。 For example, as in the above example, in the company to which each user belongs, "Red Rockets" and "Green Rockets" are product names of important products, and it is assumed that the two products are strongly related. Then, it is assumed that the user "Yamada" and the word "Red Rockets" are strongly related. In this case, if the searcher specifies "Yamada" as the keyword user name, not only the word "Red Rockets" that is strongly related to "Yamada" but also the word "Green Rockets" that is strongly related to the word "Red Rockets" will be searched. Can be obtained as. Since "Red Rockets" and "Green Rockets" are strongly related, "Yamada", which is strongly related to "Red Rockets", is considered to be strongly related to "Green Rockets". According to the above method, it is possible to search a wide range of words that are considered to be strongly related to the user indicated by the user name specified as the search keyword.

また、組織独自で用いる単語も漏らさずに、第１のテーブルや第２のテーブルを容易に作成することができるという点については、既に説明した通りである。 Further, as described above, the first table and the second table can be easily created without leaking the words used by the organization.

以上に説明したように、本実施形態では、キーワード受付部６が選択キーワードとして単語を受け付け、検索部１７が単語に応じたユーザ名を検索する場合、ユーザ名を幅広く検索することができる。また、キーワード受付部６が選択キーワードとしてユーザ名を受け付け、検索部１７がユーザ名に応じた単語を検索する場合、単語を幅広く検索することができる。また、組織独自で用いる単語も漏らさずに、第１のテーブルや第２のテーブルを容易に作成することができる。 As described above, in the present embodiment, when the keyword receiving unit 6 accepts a word as a selection keyword and the search unit 17 searches for a user name corresponding to the word, the user name can be widely searched. Further, when the keyword receiving unit 6 accepts a user name as a selection keyword and the search unit 17 searches for a word corresponding to the user name, the word can be widely searched. In addition, the first table and the second table can be easily created without leaking the words used by the organization.

実施形態３．
図１４は、本発明の第３の実施形態の関連スコア算出システムの構成例を示すブロック図である。第３の実施形態の関連スコア算出システムは、第２の実施形態の関連スコア算出システム（図７参照）が備える構成要素に加え、さらに、クラスタリング部３１と、クラスタ出力部３２と、除外対象単語受付部３３とを備える。図１４において、図７に示す要素と同様の要素については、図７と同一の符号を付し、説明を省略する。なお、第１のテーブル生成部１１および第２のテーブル生成部１２はそれぞれ、第２の実施形態で説明した動作に加え、後述の動作も行う。 Embodiment 3.
FIG. 14 is a block diagram showing a configuration example of the related score calculation system according to the third embodiment of the present invention. In addition to the components included in the related score calculation system of the second embodiment (see FIG. 7), the related score calculation system of the third embodiment further includes a clustering unit 31, a cluster output unit 32, and excluded words. It is equipped with a reception unit 33. In FIG. 14, the same elements as those shown in FIG. 7 are designated by the same reference numerals as those in FIG. 7, and the description thereof will be omitted. The first table generation unit 11 and the second table generation unit 12 perform the operations described later in addition to the operations described in the second embodiment, respectively.

クラスタリング部３１は、第２のテーブル２２（例えば、図１０を参照）を記憶部５から読み込み、第２のテーブル２２に基づいて、第２のテーブル２２に記述されている単語に対してクラスタリングを行う。クラスタリング部３１は、例えば、ｋ−ｍｅａｎｓ法または階層型クラスタリングアルゴリズム等のクラスタリング方法によって、単語に対するクラスタリングを行う。ｋ−ｍｅａｎｓ法または階層型クラスタリングアルゴリズム等のクラスタリング方法では、第２のテーブル２２を入力データとして、第２のテーブル２２の軸に並ぶ単語に対してクラスタリングを行うことができる。クラスタリング部３１は、各クラスタに対して、クラスタの識別情報として、例えば、クラスタ番号を付し、クラスタとそのクラスタに属する単語との関係を、クラスタリング結果２３として記憶部５に記憶させる。 The clustering unit 31 reads the second table 22 (see, for example, FIG. 10) from the storage unit 5, and clusters the words described in the second table 22 based on the second table 22. conduct. The clustering unit 31 clusters words by, for example, a clustering method such as a k-means method or a hierarchical clustering algorithm. In a clustering method such as the k-means method or a hierarchical clustering algorithm, clustering can be performed on words arranged on the axis of the second table 22 using the second table 22 as input data. The clustering unit 31 assigns, for example, a cluster number to each cluster as cluster identification information, and stores the relationship between the cluster and the word belonging to the cluster in the storage unit 5 as the clustering result 23.

クラスタ出力部３２は、例えば、関連スコア算出システム１の管理者（以下、単に管理者と記す。）の端末装置（図示略）から、通信ネットワークを介して、クラスタリング結果の出力要求を受け付け、その出力要求に応じて、クラスタリング結果を示す画面の画面情報を、その端末装置（図示略）に送信する。図１５は、クラスタリング結果を示す画面の例を示す模式図である。クラスタリング結果を示す画面では、例えば、クラスタ毎に、クラスタ番号と、クラスタに属する各単語が表示される。また、各クラスタ番号および各単語とともに、それぞれチェックボックス４１が表示される。図１５に示すように、各クラスタに属する単語の数は異なっていてよい。また、クラスタリング結果を示す画面には、確定ボタン４２が含まれる。 The cluster output unit 32 receives, for example, a clustering result output request from a terminal device (not shown) of an administrator of the related score calculation system 1 (hereinafter, simply referred to as an administrator) via a communication network. In response to the output request, the screen information of the screen showing the clustering result is transmitted to the terminal device (not shown). FIG. 15 is a schematic diagram showing an example of a screen showing a clustering result. On the screen showing the clustering result, for example, the cluster number and each word belonging to the cluster are displayed for each cluster. In addition, a check box 41 is displayed together with each cluster number and each word. As shown in FIG. 15, the number of words belonging to each cluster may be different. Further, the screen showing the clustering result includes a confirmation button 42.

クラスタ出力部３２は、通信ネットワークを介して、管理者の端末装置からクラスタリング結果の出力要求を受け付けると、記憶部５からクラスタリング結果２３を読み込む。そして、クラスタ出力部３２は、クラスタリング結果２３に基づいて、クラスタ番号と、そのクラスタ番号が示すクラスタに属する単語を表示するとともに、各クラスタ番号および各単語とともにそれぞれチェックボックス４１を表示し、さらに、確定ボタン４２も含む画面（例えば、図１５に例示する画面）の画面情報を生成する。そして、クラスタ出力部３２は、通信ネットワークを介して、その画面情報を管理者の端末装置に送信する。 When the cluster output unit 32 receives a clustering result output request from the administrator's terminal device via the communication network, the cluster output unit 32 reads the clustering result 23 from the storage unit 5. Then, the cluster output unit 32 displays the cluster number and the word belonging to the cluster indicated by the cluster number based on the clustering result 23, displays each cluster number and the check box 41 together with each word, and further. The screen information of the screen including the confirmation button 42 (for example, the screen illustrated in FIG. 15) is generated. Then, the cluster output unit 32 transmits the screen information to the terminal device of the administrator via the communication network.

管理者の端末装置は、クラスタ出力部３２からその画面情報を受信すると、その画面情報に基づいて、例えば、図１５に例示する画面を表示する。 When the administrator's terminal device receives the screen information from the cluster output unit 32, the administrator's terminal device displays, for example, the screen illustrated in FIG. 15 based on the screen information.

管理者は、図１５に例示する画面（単語のクラスタリング結果を表示する画面）を確認し、管理者が除外すべきと判断した単語に対応するチェックボックス４１にチェックを入れる。また、管理者は、１つのクラスタに属する単語全てを除外すべきであると判断した場合、そのクラスタに対応するチェックボックス４１にチェックを入れる。単語に対応するチェックボックス４１にチェックが入れられたということは、その単語を除外すべきと判断されたことを意味する。また、クラスタに対応するチェックボックス４１にチェックが入れられたということは、そのクラスタに属する単語全てを除外すべきであると判断されたことを意味する。また、ここで、「除外すべき」とは、第１のテーブルの横軸および第２のテーブルの各軸に並ぶ単語から除外すべきであるということを意味する。 The administrator confirms the screen illustrated in FIG. 15 (the screen displaying the clustering result of the word), and checks the check box 41 corresponding to the word determined to be excluded by the administrator. If the administrator determines that all words belonging to one cluster should be excluded, the check box 41 corresponding to that cluster is checked. The fact that the check box 41 corresponding to a word is checked means that it is determined that the word should be excluded. Further, the fact that the check box 41 corresponding to the cluster is checked means that it is determined that all the words belonging to the cluster should be excluded. Further, here, "to be excluded" means that the word should be excluded from the words arranged on the horizontal axis of the first table and each axis of the second table.

確定ボタン４２は、各単語に対する、除外すべきか否かの判断が完了したことを入力するためのボタンである。管理者の端末装置は、管理者によって確定ボタン４２をクリックされると、どのチェックボックス４１にチェックが入れられたかに応じて、管理者が除外すべきと判断した単語を判定し、管理者によって除外すべきと判断された単語を、関連スコア算出システム１に送信する。 The confirmation button 42 is a button for inputting that the determination as to whether or not to exclude each word has been completed. When the confirmation button 42 is clicked by the administrator, the terminal device of the administrator determines a word that the administrator has determined to be excluded according to which check box 41 is checked, and the administrator determines. Words determined to be excluded are transmitted to the related score calculation system 1.

関連スコア算出システム１の除外対象単語受付部３３は、管理者の端末装置が送信した単語（管理者によって、除外すべきと判断された単語）を、通信ネットワークを介して、受信する。除外対象単語受付部３３が受信する単語は、１つとは限らない。除外対象単語受付部３３は、第１のテーブルの横軸および第２のテーブルの各軸に並ぶ単語から除外すべき単語の指定を受け付けていると言うことができる。 The exclusion target word reception unit 33 of the related score calculation system 1 receives the words transmitted by the terminal device of the administrator (words determined by the administrator to be excluded) via the communication network. The number of words received by the exclusion target word reception unit 33 is not limited to one. It can be said that the exclusion target word reception unit 33 accepts the designation of words to be excluded from the words arranged on the horizontal axis of the first table and each axis of the second table.

除外対象単語受付部３３は、管理者の端末装置から受信した単語（除外すべき単語）を第１のテーブル生成部１１に通知する。第１のテーブル生成部１１は、その単語の通知を受けると、既に生成済みの第１のテーブル２１の横軸に並ぶ単語から、通知された単語を除外して、第１のテーブル２１を再度生成する。そして、第１のテーブル生成部１１は、記憶部５に記憶されている第１のテーブル２１を、再度生成した第１のテーブル２１で置き換える。 The exclusion target word reception unit 33 notifies the first table generation unit 11 of the words (words to be excluded) received from the terminal device of the administrator. Upon receiving the notification of the word, the first table generation unit 11 excludes the notified word from the words arranged on the horizontal axis of the already generated first table 21, and re-uses the first table 21. Generate. Then, the first table generation unit 11 replaces the first table 21 stored in the storage unit 5 with the regenerated first table 21.

第１のテーブル生成部１１が第１のテーブル２１を再度生成すると、第２のテーブル生成部１２は、新たに生成された第１のテーブル２１に基づいて、第２のテーブルを再度生成する。このとき、第２のテーブル生成部１２は、除外対象単語受付部３３が受信した単語を、各軸に並ぶ単語から除外して、第２のテーブルを生成する。そして、第２のテーブル生成部１２は、記憶部５に記憶されている第２のテーブル２２を、再度生成した第２のテーブル２２で置き換える。 When the first table generation unit 11 regenerates the first table 21, the second table generation unit 12 regenerates the second table based on the newly generated first table 21. At this time, the second table generation unit 12 excludes the words received by the exclusion target word reception unit 33 from the words arranged on each axis to generate the second table. Then, the second table generation unit 12 replaces the second table 22 stored in the storage unit 5 with the second table 22 generated again.

第１のテーブル２１を再度生成する動作および第２のテーブル２２を再度生成する動作は、それぞれ、軸に並ぶ単語数が減少している点を除けば、第２の実施形態で説明した第１のテーブル２１を生成する動作および第２のテーブル２２を生成する動作と同様である。 The operation of regenerating the first table 21 and the operation of regenerating the second table 22 are the first described in the second embodiment, except that the number of words arranged on the axis is reduced. It is the same as the operation of generating the table 21 and the operation of generating the second table 22.

クラスタ出力部３２および除外対象単語受付部３３は、例えば、関連スコア算出プログラムに従って動作するコンピュータのＣＰＵおよびそのコンピュータの通信インタフェースによって実現される。また、クラスタリング部３１は、関連スコア算出プログラムに従って動作するそのコンピュータのＣＰＵによって実現される。 The cluster output unit 32 and the exclusion target word reception unit 33 are realized, for example, by the CPU of a computer operating according to the related score calculation program and the communication interface of the computer. Further, the clustering unit 31 is realized by the CPU of the computer that operates according to the related score calculation program.

本実施形態によれば、管理者によって、第１のテーブルおよび第２のテーブルから除外すべきと判断された単語を、第１のテーブルおよび第２のテーブルから除外することができる。その結果、例えば、ユーザ名を検索キーワードとして、単語の検索を行う場合、管理者が除外すべき単語として指定した単語を、検索結果に含まれないようにすることができる。 According to the present embodiment, words determined by the administrator to be excluded from the first table and the second table can be excluded from the first table and the second table. As a result, for example, when searching for a word using a user name as a search keyword, the word specified by the administrator as a word to be excluded can be prevented from being included in the search result.

管理者によって、除外すべきと判断される単語の例について説明する。例えば、ユーザ「山田」が、新規作成したり閲覧したりするファイル名に、ユーザ名「山田」を含めることがあり得る。その場合、ユーザ名「山田」を検索キーワードとして、単語の検索を行う場合に、検索結果に「山田」という単語が含まれ得る。しかし、ユーザ名「山田」を検索キーワードとして単語の検索を行う場合に、検索結果に「山田」という単語が含まれていても、検索者にとってはあまり意味がない。 Here are some examples of words that the administrator decides should be excluded. For example, the user name "Yamada" may be included in the file name newly created or browsed by the user "Yamada". In that case, when searching for a word using the user name "Yamada" as a search keyword, the word "Yamada" may be included in the search result. However, when searching for a word using the user name "Yamada" as a search keyword, even if the search result includes the word "Yamada", it does not make much sense to the searcher.

そこで、管理者は、例えば、図１５に例示する画面が表示された場合、明らかに、ユーザ名と同じ文字列であると考えられる単語「山田」は除外すべきであると判断し、図１５に示す単語「山田」に対応するチェックボックス４１にチェックを入れ、確定ボタン４２をクリックすればよい。その結果、第１のテーブル生成部１１は、横軸の単語の並びから「山田」を除外した、新たな第１のテーブルを生成する。続いて、第２のテーブル生成部１２は、縦軸および横軸それぞれの単語の並びから「山田」を除外した、新たな第２のテーブルを生成する。その結果、例えば、ユーザ名「山田」を検索キーワードとして単語の検索を行う場合に、検索結果に「山田」という単語が含まれないようにすることができる。 Therefore, for example, when the screen illustrated in FIG. 15 is displayed, the administrator determines that the word "Yamada", which is clearly considered to be the same character string as the user name, should be excluded, and FIG. 15 Check the check box 41 corresponding to the word "Yamada" shown in 1 and click the confirm button 42. As a result, the first table generation unit 11 generates a new first table excluding "Yamada" from the sequence of words on the horizontal axis. Subsequently, the second table generation unit 12 generates a new second table by excluding "Yamada" from the sequence of words on the vertical axis and the horizontal axis. As a result, for example, when searching for a word using the user name "Yamada" as a search keyword, it is possible to prevent the word "Yamada" from being included in the search results.

なお、クラスタ出力部３２は、クラスタリング結果を示す画面を、関連スコア算出システム１が備えるディスプレイ装置（図示略）に表示してもよい。また、除外対象単語受付部３３は、関連スコア算出システム１が備える入力デバイス（図示略）によって、そのディスプレイ装置に表示された画面に対する操作（チェックボックス４１へのチェックの入力、および、確定ボタン４２のクリック）を受け付け、その操作に応じて、除外すべき単語の指定を受け付けてもよい。 The cluster output unit 32 may display a screen showing the clustering result on a display device (not shown) included in the related score calculation system 1. Further, the exclusion target word receiving unit 33 operates on the screen displayed on the display device by the input device (not shown) included in the related score calculation system 1 (input of a check in the check box 41 and the confirmation button 42). (Click) may be accepted, and the designation of words to be excluded may be accepted according to the operation.

実施形態４．
第１の実施形態から第３の実施形態では、収集部２が操作ログを収集し、単語抽出部３が、各操作ログに記述されている各ファイル名から単語を抽出し、スコア算出部４が、ファイルを操作した各ユーザと抽出された各単語の組合せ毎に、関連スコアを算出する場合を示した。 Embodiment 4.
In the first to third embodiments, the collection unit 2 collects the operation log, the word extraction unit 3 extracts words from each file name described in each operation log, and the score calculation unit 4 Shown the case where the association score is calculated for each combination of each user who operated the file and each extracted word.

第４の実施形態では、収集部がファイルの操作ログではなく、スケジュール情報を収集する場合を例にして説明する。 In the fourth embodiment, a case where the collecting unit collects schedule information instead of the operation log of the file will be described as an example.

図１６は、第４の実施形態の関連スコア算出システムの構成例を示すブロック図である。図１６に示す関連スコア算出システム１は、第１の実施形態における収集部２、単語抽出部３およびスコア算出部４を、収集部５２、単語抽出部５３およびスコア算出部５４に置き換えたものである。記憶部５は、第１の実施形態における記憶部５と同様である。 FIG. 16 is a block diagram showing a configuration example of the related score calculation system of the fourth embodiment. The related score calculation system 1 shown in FIG. 16 replaces the collection unit 2, the word extraction unit 3, and the score calculation unit 4 in the first embodiment with the collection unit 52, the word extraction unit 53, and the score calculation unit 54. be. The storage unit 5 is the same as the storage unit 5 in the first embodiment.

また、図１６に示す関連スコア算出システム１は、組織に属する各人のスケジュール情報を保持するスケジュール管理サーバ６１に、通信ネットワーク１０を介して接続されている。 Further, the related score calculation system 1 shown in FIG. 16 is connected to the schedule management server 61 that holds the schedule information of each person belonging to the organization via the communication network 10.

収集部５２は、スケジュール管理サーバ６１に記憶されている、組織に属する各人のスケジュール情報を、スケジュール管理サーバ６１から収集する。なお、スケジュール情報は個々のＰＣ（図１６において図示略）に記憶されていてもよい。この場合、収集部５２は、その個々のＰＣから組織に属する各人のスケジュール情報を収集すればよい。 The collection unit 52 collects the schedule information of each person belonging to the organization stored in the schedule management server 61 from the schedule management server 61. The schedule information may be stored in individual PCs (not shown in FIG. 16). In this case, the collecting unit 52 may collect the schedule information of each person belonging to the organization from the individual PCs.

以下、組織に属する人の識別情報を、人識別情報と記す。また、人識別情報として、例えば、「山田」等の人の名を用いる場合を例にして説明する。 Hereinafter, the identification information of the person belonging to the organization is referred to as the person identification information. Further, a case where a person's name such as "Yamada" is used as the person identification information will be described as an example.

図１７は、スケジュール情報の例を示す模式図である。スケジュール情報は、例えば、図１７に例示するように、組織に属する人の人識別情報と、イベント名と、そのイベント名を有するイベントにその人が関わった時間帯とが関連付けて記述されている。時間帯は、開始日時および終了日時によって表される。 FIG. 17 is a schematic diagram showing an example of schedule information. The schedule information is described, for example, as illustrated in FIG. 17, in which the person identification information of a person belonging to the organization, the event name, and the time zone in which the person was involved in the event having the event name are associated with each other. .. The time zone is represented by the start date and time and the end date and time.

図１７に示す１番目のスケジュール情報は、「山田」という人が、２０１７年８月１７日１０時から同日の１２時まで、「人工知能開発会議」に関わった（換言すれば、出席した）ということを表している。以下、ある人が、あるイベントに関わった時間をイベント参加時間と記す。１つのイベントに関するイベント参加時間は、そのイベントに関連付けて記述された終了日時から開始日時を減算して得られる時間である。 In the first schedule information shown in Fig. 17, a person named "Yamada" was involved in the "Artificial Intelligence Development Conference" from 10:00 on August 17, 2017 to 12:00 on the same day (in other words, attended). It represents that. Hereinafter, the time when a person is involved in a certain event is referred to as the event participation time. The event participation time for one event is the time obtained by subtracting the start date and time from the end date and time described in association with the event.

単語抽出部５３は、収集部５２によって収集された各スケジュール情報に記述されている各イベント名に対して形態素解析を実行することによって、イベント名に含まれている単語を抽出する。 The word extraction unit 53 extracts words included in the event name by performing morphological analysis on each event name described in each schedule information collected by the collection unit 52.

ただし、単語抽出部５３は、同一の単語を、重複して抽出しない。例えば、単語抽出部５３は、「人工知能」という単語を既に抽出している場合、２回目以降に抽出された「人工知能」という単語については無視する。この点は、既に説明した単語抽出部３と同様である。 However, the word extraction unit 53 does not extract the same word in duplicate. For example, when the word extraction unit 53 has already extracted the word "artificial intelligence", the word "artificial intelligence" extracted from the second time onward is ignored. This point is the same as the word extraction unit 3 already described.

例えば、単語抽出部５３は、「人工知能開発会議」というイベント名に対して形態素解析を実行することによって、「人工知能」、「開発」、「会議」等の単語を抽出する。 For example, the word extraction unit 53 extracts words such as "artificial intelligence", "development", and "meeting" by performing morphological analysis on the event name "artificial intelligence development meeting".

単語抽出部５３は、この処理を、各スケジュール情報に記述されている各イベント名に対して行うことによって、単語の集合を得る。同一の単語は重複して抽出されないので、この集合に属する単語は、互いに異なる。 The word extraction unit 53 obtains a set of words by performing this process for each event name described in each schedule information. Words that belong to this set are different from each other because the same word is not extracted more than once.

スコア算出部５４は、組織に属する各人と、単語抽出部５３によって抽出された各単語の組合せ毎に、人と単語との関連の強さを示す関連スコアを算出する。この関連スコアは、第１の実施形態から第３の実施形態までにおける関連スコアと同様である。ただし、第４の実施形態では、キータッチ回数は、関連スコアとして用いられない。すなわち、単語抽出部５３は、第１の実施形態で説明した第２の算出方法で関連スコアを算出することはない。 The score calculation unit 54 calculates a related score indicating the strength of the relationship between the person and the word for each combination of each person belonging to the organization and each word extracted by the word extraction unit 53. This association score is the same as the association score in the first embodiment to the third embodiment. However, in the fourth embodiment, the number of key touches is not used as the related score. That is, the word extraction unit 53 does not calculate the related score by the second calculation method described in the first embodiment.

以下、第４の実施形態における関連スコアの算出方法として、２種類の方法を説明する。以下に示す２種類のいずれの方法においても、スコア算出部５４は、人と単語の組合せ毎に、関連スコアを算出し、記憶部５に記憶させる。 Hereinafter, two types of methods will be described as methods for calculating the related score in the fourth embodiment. In any of the two types of methods shown below, the score calculation unit 54 calculates a related score for each combination of a person and a word and stores it in the storage unit 5.

第４の実施形態における関連スコアの第１の算出方法は、第１の実施形態における関連スコアの第１の算出方法と同様の方法である。ただし、ファイルの操作時間の代わりに、イベント参加時間を用いる。 The first calculation method of the association score in the fourth embodiment is the same method as the first calculation method of the association score in the first embodiment. However, the event participation time is used instead of the file operation time.

第４の実施形態における関連スコアの第１の算出方法は、一の人（以下、人Ｈと記す。）と一の単語（以下、単語Ｗと記す。）の関連スコアとして、単語Ｗをイベント名に含む各イベントへの人Ｈのイベント参加時間の総和を算出する方法である。イベント参加時間が長いほど、人Ｈと単語Ｗの関連が強く、イベント参加時間が短いほど、人Ｈと単語Ｗの関連が弱いと言える。従って、イベント参加時間を、関連スコアとして用いることができる。 In the first calculation method of the association score in the fourth embodiment, the word W is used as an event as the association score of one person (hereinafter referred to as person H) and one word (hereinafter referred to as word W). This is a method of calculating the total event participation time of person H for each event included in the name. It can be said that the longer the event participation time, the stronger the relationship between the person H and the word W, and the shorter the event participation time, the weaker the relationship between the person H and the word W. Therefore, the event participation time can be used as a related score.

第１の算出方法では、スコア算出部５４は、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を算出し、その総和を、人Ｈと単語Ｗの関連スコアとする。 In the first calculation method, the score calculation unit 54 calculates the total event participation time of the person H for each event including the word W in the event name, and the total is used as the related score between the person H and the word W.

すなわち、スコア算出部５４は、単語Ｗをイベント名に含み、人Ｈが参加したイベント毎にイベント参加時間を算出し、その総和を、人Ｈと単語Ｗの関連スコアとする。 That is, the score calculation unit 54 includes the word W in the event name, calculates the event participation time for each event in which the person H participates, and sets the total as the related score between the person H and the word W.

第４の実施形態における関連スコアの第２の算出方法は、第１の実施形態における関連スコアの第３の算出方法と同様の方法である。ただし、ファイルの操作時間の代わりに、イベント参加時間を用いる。 The second calculation method of the association score in the fourth embodiment is the same method as the third calculation method of the association score in the first embodiment. However, the event participation time is used instead of the file operation time.

本実施形態における関連スコアの第２の算出方法は、一の人（人Ｈ）と一の単語（単語Ｗ）の関連スコアを、次に説明する２つの割合に基づいて算出する方法である。この２つの割合のうち、一方の割合をＱ_１と記し、もう一方の割合をＱ_２と記す。 The second method of calculating the association score in the present embodiment is a method of calculating the association score of one person (person H) and one word (word W) based on the two ratios described below. Of the two proportions, describing one ratio of the Q _1, mark the other ratio between Q _2.

Ｑ_１は、単語Ｗをイベント名に含む各イベントに対する組織内の全ての人のイベント参加時間の総和に対する、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和の割合である。すなわち、Ｑ_１は、以下に示す式（７）で表される。 Q ₁ is, for all of the people of the events of participation time the sum of the organization for each event, including the word W in the event name, there is a percentage of the sum of the events participation time of people H for each event, including the word W in the event name .. That is, Q ₁ is represented by the following equation (7).

Ｑ_２は、個々の単語に着目した場合における、着目した単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和に対する、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和の割合である。すなわち、Ｑ２は、以下に示す式（８）で表される。 Q ₂ is, in the case of focusing on individual words, to the sum of events participation time of people H for each event, including the word that focuses on the event name, event participation time of people H for each event, including the word W in the event name It is the ratio of the total of. That is, Q2 is represented by the following equation (8).

Ｑ_１について説明する。単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和（式（７）の右辺の分子）は、前述の第１の算出方法で算出される関連スコアに相当する。すなわち、スコア算出部５４は、前述の第１の算出方法で説明した方法で、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を算出すればよい。 Q ₁ will be explained. The sum of the event participation times of the person H for each event including the word W in the event name (the numerator on the right side of the equation (7)) corresponds to the association score calculated by the first calculation method described above. That is, the score calculation unit 54 may calculate the total event participation time of the person H for each event including the word W in the event name by the method described in the first calculation method described above.

単語Ｗをイベント名に含む各イベントに対する組織内の全ての人のイベント参加時間の総和（式（７）の右辺の分母）について、説明する。スコア算出部５４は、単語Ｗをイベント名に含む各イベントに対する、組織内の一人目の人のイベント参加時間の総和も、前述の第１の算出方法で説明した方法で算出する。同様に、スコア算出部５４は、単語Ｗをイベント名に含む各イベントに対する、組織内の二人目の人のイベント参加時間の総和も、前述の第１の算出方法で説明した方法で算出する。同様に、スコア算出部５４は、組織に属する一人一人について、単語Ｗをイベント名に含む各イベントに対する人のイベント参加時間の総和を算出する。さらに、スコア算出部５４は、組織に属する一人一人について算出した「単語Ｗをイベント名に含む各イベントに対する人のイベント参加時間の総和」の総和を算出する。この値が、単語Ｗをイベント名に含む各イベントに対する組織内の全ての人のイベント参加時間の総和に該当する。 The sum of the event participation times of all the people in the organization for each event including the word W in the event name (the denominator on the right side of the equation (7)) will be described. The score calculation unit 54 also calculates the total event participation time of the first person in the organization for each event including the word W in the event name by the method described in the first calculation method described above. Similarly, the score calculation unit 54 also calculates the total event participation time of the second person in the organization for each event including the word W in the event name by the method described in the first calculation method described above. Similarly, the score calculation unit 54 calculates the total event participation time of a person for each event including the word W in the event name for each person belonging to the organization. Further, the score calculation unit 54 calculates the sum of "the sum of the event participation time of a person for each event including the word W in the event name" calculated for each person belonging to the organization. This value corresponds to the sum of the event participation times of all people in the organization for each event including the word W in the event name.

スコア算出部５４は、上記のように算出した式（７）の右辺の分子、分母を用いて、式（７）の計算により、Ｑ_１を算出する。 Score calculation unit 54, the molecules of the right-hand side of formula (7) calculated as described above, by using a denominator, the calculation of equation (7), and calculates the Q _1.

次に、Ｑ_２について説明する。式（８）の右辺の分子は、式（７）の右辺の分子と同じである。従って、スコア算出部５４は、前述の第１の算出方法で説明した方法で、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を算出すればよい。 Next, the _{Q 2} will be described. The molecule on the right side of the formula (8) is the same as the molecule on the right side of the formula (7). Therefore, the score calculation unit 54 may calculate the total event participation time of the person H for each event including the word W in the event name by the method described in the first calculation method described above.

個々の単語に着目した場合における、着目した単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和（式（８）の右辺の分母）について説明する。スコア算出部５４は、単語抽出部５３によって抽出された個々の単語に着目する（換言すれば、個々の単語を１つ１つ選択する）。そして、スコア算出部５４は、着目した単語（選択した単語）をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を、前述の第１の算出方法で説明した方法で算出する。スコア算出部５４は、次の単語に着目し（換言すれば、次の単語を選択し）、着目した単語（選択した単語）をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を、前述の第１の算出方法で説明した方法で算出する。このように、スコア算出部５４は、単語毎に、単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を算出する。そして、スコア算出部５４は、単語毎に算出した「単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和」の総和を算出する。この値が、式（８）の分母に該当する。「個々の単語に着目した場合における、着目した単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和（式（８）の右辺の分母）」は、「単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を単語毎に求めた場合における前記総和の総和」であると言うことができる。 The sum of the event participation time of the person H for each event including the focused word in the event name (denominator on the right side of the equation (8)) when focusing on each word will be described. The score calculation unit 54 pays attention to the individual words extracted by the word extraction unit 53 (in other words, each individual word is selected one by one). Then, the score calculation unit 54 calculates the total event participation time of the person H for each event including the word of interest (selected word) in the event name by the method described in the first calculation method described above. The score calculation unit 54 focuses on the next word (in other words, selects the next word), and sums up the event participation time of the person H for each event including the focused word (selected word) in the event name. , Calculate by the method described in the above-mentioned first calculation method. In this way, the score calculation unit 54 calculates the total event participation time of the person H for each event including the word in the event name for each word. Then, the score calculation unit 54 calculates the total of "the total of the event participation time of the person H for each event including the word in the event name" calculated for each word. This value corresponds to the denominator of the equation (8). "When focusing on individual words, the sum of the event participation times of person H for each event that includes the focused word in the event name (the denominator on the right side of equation (8))" is "each including the word in the event name. It can be said that it is the total sum of the total sum of the event participation times of the person H for the event for each word.

スコア算出部５４は、上記のように算出した式（８）の右辺の分子、分母を用いて、式（８）の計算により、Ｑ_２を算出する。 Score calculation unit 54, the molecules of the right-hand side of Equation (8) calculated as described above, by using a denominator, the calculation of equation (8), and calculates the Q _2.

スコア算出部５４は、Ｑ_１，Ｑ_２を求めた後、人Ｈと単語Ｗの関連スコアを、以下に示す式（９）によって算出する。 Score calculation unit _54, after obtaining the Q 1, _{Q 2,} the relevance score of human H and words W, is calculated by the equation (9) below.

関連スコア＝Ｑ_１×ｌｏｇ（Ｑ_２）・・・（９） Related score = Q ₁ x log (Q ₂ ) ・・・ (9)

第４の実施形態における第２の算出方法で関連スコアを算出した場合、第１の実施形態で説明した第３の算出方法で関連スコアを算出した場合と同様の効果が得られる。 When the related score is calculated by the second calculation method in the fourth embodiment, the same effect as when the related score is calculated by the third calculation method described in the first embodiment can be obtained.

収集部５２は、例えば、関連スコア算出プログラムに従って動作するコンピュータのＣＰＵおよびそのコンピュータの通信インタフェースによって実現される。例えば、ＣＰＵが、コンピュータのプログラム記憶装置等のプログラム記録媒体から関連スコア算出プログラムを読み込み、関連スコア算出プログラムに従って、通信インタフェースを用いて、収集部５２として動作すればよい。また、単語抽出部５３およびスコア算出部５４も、例えば、関連スコア算出プログラムに従って動作する上記のコンピュータのＣＰＵによって実現される。すなわち、上記のように、関連スコア算出プログラムを読み込んだＣＰＵが、関連スコア算出プログラムに従って、単語抽出部５３およびスコア算出部５４として動作すればよい。記憶部５は、上記のコンピュータの記憶装置によって実現される。また、収集部５２、単語抽出部５３およびスコア算出部５４がそれぞれ別々のハードウェアによって実現されてもよい。 The collecting unit 52 is realized by, for example, a CPU of a computer operating according to a related score calculation program and a communication interface of the computer. For example, the CPU may read the related score calculation program from a program recording medium such as a program storage device of a computer, and operate as the collecting unit 52 using the communication interface according to the related score calculation program. Further, the word extraction unit 53 and the score calculation unit 54 are also realized by, for example, the CPU of the above-mentioned computer that operates according to the related score calculation program. That is, as described above, the CPU that has read the related score calculation program may operate as the word extraction unit 53 and the score calculation unit 54 according to the related score calculation program. The storage unit 5 is realized by the storage device of the above-mentioned computer. Further, the collection unit 52, the word extraction unit 53, and the score calculation unit 54 may be realized by separate hardware.

第４の実施形態は、第１の実施形態の変形例（図５参照）、第２の実施形態（図７参照）、および第３の実施形態（図１４参照）に適用されてもよい。すなわち、第１の実施形態の変形例（図５参照）において、収集部２、単語抽出部３およびスコア算出部４を、第４の実施形態で説明した収集部５２、単語抽出部５３およびスコア算出部５４に置き換えてもよい。また、第２の実施形態（図７参照）において、収集部２、単語抽出部３およびスコア算出部４を、第４の実施形態で説明した収集部５２、単語抽出部５３およびスコア算出部５４に置き換えてもよい。また、第３の実施形態（図１４参照）において、収集部２、単語抽出部３およびスコア算出部４を、第４の実施形態で説明した収集部５２、単語抽出部５３およびスコア算出部５４に置き換えてもよい。 The fourth embodiment may be applied to a modification of the first embodiment (see FIG. 5), a second embodiment (see FIG. 7), and a third embodiment (see FIG. 14). That is, in the modified example of the first embodiment (see FIG. 5), the collection unit 2, the word extraction unit 3, and the score calculation unit 4 are divided into the collection unit 52, the word extraction unit 53, and the score described in the fourth embodiment. It may be replaced with the calculation unit 54. Further, in the second embodiment (see FIG. 7), the collection unit 2, the word extraction unit 3 and the score calculation unit 4 are combined with the collection unit 52, the word extraction unit 53 and the score calculation unit 54 described in the fourth embodiment. May be replaced with. Further, in the third embodiment (see FIG. 14), the collection unit 2, the word extraction unit 3 and the score calculation unit 4 are combined with the collection unit 52, the word extraction unit 53 and the score calculation unit 54 described in the fourth embodiment. May be replaced with.

図１８は、本発明の各実施形態に係るコンピュータの構成例を示す概略ブロック図である。コンピュータ１０００は、ＣＰＵ１００１と、主記憶装置１００２と、補助記憶装置１００３と、インタフェース１００４と、通信インタフェース１００５とを備える。 FIG. 18 is a schematic block diagram showing a configuration example of a computer according to each embodiment of the present invention. The computer 1000 includes a CPU 1001, a main storage device 1002, an auxiliary storage device 1003, an interface 1004, and a communication interface 1005.

本発明の各実施形態の関連スコア算出システム１は、コンピュータ１０００に実装される。関連スコア算出システム１の動作は、関連スコア算出プログラムの形式で補助記憶装置１００３に記憶されている。ＣＰＵ１００１は、その関連スコア算出プログラムを補助記憶装置１００３から読み出して主記憶装置１００２に展開し、その関連スコア算出プログラムに従って上記の処理を実行する。 The related score calculation system 1 of each embodiment of the present invention is implemented in the computer 1000. The operation of the related score calculation system 1 is stored in the auxiliary storage device 1003 in the form of the related score calculation program. The CPU 1001 reads the related score calculation program from the auxiliary storage device 1003, deploys it to the main storage device 1002, and executes the above processing according to the related score calculation program.

補助記憶装置１００３は、一時的でない有形の媒体の例である。一時的でない有形の媒体の他の例として、インタフェース１００４を介して接続される磁気ディスク、光磁気ディスク、ＣＤ−ＲＯＭ（Compact Disk Read Only Memory ）、ＤＶＤ−ＲＯＭ（Digital Versatile Disk Read Only Memory ）、半導体メモリ等が挙げられる。また、このプログラムが通信回線によってコンピュータ１０００に配信される場合、配信を受けたコンピュータ１０００がそのプログラムを主記憶装置１００２に展開し、上記の処理を実行してもよい。 Auxiliary storage 1003 is an example of a non-temporary tangible medium. Other examples of non-temporary tangible media include magnetic disks, optical magnetic disks, CD-ROMs (Compact Disk Read Only Memory), DVD-ROMs (Digital Versatile Disk Read Only Memory), which are connected via interface 1004. Examples include semiconductor memory. Further, when this program is distributed to the computer 1000 by a communication line, the distributed computer 1000 may expand the program to the main storage device 1002 and execute the above processing.

また、プログラムは、前述の処理の一部を実現するためのものであってもよい。さらに、プログラムは、補助記憶装置１００３に既に記憶されている他のプログラムとの組み合わせで前述の処理を実現する差分プログラムであってもよい。 Further, the program may be for realizing a part of the above-mentioned processing. Further, the program may be a difference program that realizes the above-mentioned processing in combination with another program already stored in the auxiliary storage device 1003.

また、各構成要素の一部または全部は、汎用または専用の回路（circuitry ）、プロセッサ等やこれらの組み合わせによって実現されてもよい。これらは、単一のチップによって構成されてもよいし、バスを介して接続される複数のチップによって構成されてもよい。各構成要素の一部または全部は、上述した回路等とプログラムとの組み合わせによって実現されてもよい。 Further, a part or all of each component may be realized by a general-purpose or dedicated circuitry, a processor, or a combination thereof. These may be composed of a single chip or may be composed of a plurality of chips connected via a bus. A part or all of each component may be realized by the combination of the circuit or the like and the program described above.

各構成要素の一部または全部が複数の情報処理装置や回路等により実現される場合には、複数の情報処理装置や回路等は集中配置されてもよいし、分散配置されてもよい。例えば、情報処理装置や回路等は、クライアントアンドサーバシステム、クラウドコンピューティングシステム等、各々が通信ネットワークを介して接続される形態として実現されてもよい。 When a part or all of each component is realized by a plurality of information processing devices and circuits, the plurality of information processing devices and circuits may be centrally arranged or distributed. For example, the information processing device, the circuit, and the like may be realized as a form in which each is connected via a communication network, such as a client-and-server system and a cloud computing system.

次に、本発明の概要について説明する。図１９は、本発明の概要を示すブロック図である。本発明の関連スコア算出システムは、収集部８２と、単語抽出部８３と、関連スコア算出部８４とを備える。 Next, the outline of the present invention will be described. FIG. 19 is a block diagram showing an outline of the present invention. The related score calculation system of the present invention includes a collecting unit 82, a word extraction unit 83, and a related score calculation unit 84.

収集部８２（例えば、収集部２）は、ユーザがファイルを操作した記録である操作ログを、端末装置から収集する。 The collection unit 82 (for example, the collection unit 2) collects an operation log, which is a record of the user operating the file, from the terminal device.

単語抽出部８３（例えば、単語抽出部３）は、各操作ログに記述されているファイル名から単語を抽出する。 The word extraction unit 83 (for example, the word extraction unit 3) extracts words from the file names described in each operation log.

関連スコア算出部８４（例えば、スコア算出部４）は、各操作ログに基づいて、ユーザと単語との関連の強さを表す関連スコアを算出する。 The related score calculation unit 84 (for example, the score calculation unit 4) calculates a related score indicating the strength of the relationship between the user and the word based on each operation log.

そのような構成によって、組織内の人と単語との関連の強さを明確化することができる。 Such a structure can clarify the strength of the relationship between a person and a word in an organization.

図２０は、本発明の概要の他の例を示すブロック図である。本発明の関連スコア算出システムは、収集部８６と、単語抽出部８７と、関連スコア算出部８８とを備える。 FIG. 20 is a block diagram showing another example of the outline of the present invention. The related score calculation system of the present invention includes a collecting unit 86, a word extraction unit 87, and a related score calculation unit 88.

収集部８６（例えば、収集部５２）は、人と、イベント名と、そのイベント名を有するイベントにその人が関わった時間帯とを記述したスケジュール情報を収集する。 The collecting unit 86 (for example, the collecting unit 52) collects schedule information describing a person, an event name, and a time zone in which the person is involved in an event having the event name.

単語抽出部８７（例えば、単語抽出部５３）は、各スケジュール情報に記述されているイベント名から単語を抽出する。 The word extraction unit 87 (for example, the word extraction unit 53) extracts words from the event names described in each schedule information.

関連スコア算出部８８（例えば、スコア算出部５４）は、各スケジュール情報に基づいて、人と単語との関連の強さを表す関連スコアを算出する。 The related score calculation unit 88 (for example, the score calculation unit 54) calculates a related score indicating the strength of the relationship between a person and a word based on each schedule information.

そのような構成によっても、組織内の人と単語との関連の強さを明確化することができる。 Such a structure can also clarify the strength of the relationship between a person and a word in an organization.

上記の本発明の各実施形態は、以下の付記のようにも記載され得るが、以下に限定されるわけではない。 Each of the above embodiments of the present invention may be described as in the appendix below, but is not limited to the following.

（付記１）
ユーザがファイルを操作した記録である操作ログを、端末装置から収集する収集部と、
各操作ログに記述されているファイル名から単語を抽出する単語抽出部と、
各操作ログに基づいて、ユーザと単語との関連の強さを表す関連スコアを算出する関連スコア算出部とを備える
ことを特徴とする関連スコア算出システム。 (Appendix 1)
A collection unit that collects operation logs, which are records of user operations on files, from terminal devices.
A word extractor that extracts words from the file names described in each operation log,
A related score calculation system including a related score calculation unit that calculates a related score indicating the strength of the relationship between a user and a word based on each operation log.

（付記２）
関連スコア算出部は、一のユーザと一の単語の関連スコアとして、前記一の単語をファイル名に含む各ファイルについての前記一のユーザの操作時間の総和を算出する
付記１に記載の関連スコア算出システム。 (Appendix 2)
The related score calculation unit calculates the total operation time of the one user for each file containing the one word in the file name as the related score of one user and one word. The related score described in Appendix 1. Calculation system.

（付記３）
関連スコア算出部は、一のユーザと一の単語の関連スコアとして、前記一の単語をファイル名に含む各ファイルを前記一のユーザが操作した際のキータッチの回数の総和を算出する
付記１に記載の関連スコア算出システム。 (Appendix 3)
The related score calculation unit calculates the total number of key touches when the one user operates each file containing the one word in the file name as the related score of one user and one word. Appendix 1 Related score calculation system described in.

（付記４）
関連スコア算出部は、
一のユーザと一の単語の関連スコアを、
前記一の単語をファイル名に含む各ファイルについての組織内の全ユーザの操作時間の総和に対する、前記一の単語をファイル名に含む各ファイルについての前記一のユーザの操作時間の総和の割合と、
単語をファイル名に含む各ファイルについての前記一のユーザの操作時間の総和を単語毎に求めた場合における前記総和の総和に対する、前記一の単語をファイル名に含む各ファイルについての前記一のユーザの操作時間の総和の割合と
に基づいて算出する
付記１に記載の関連スコア算出システム。 (Appendix 4)
The related score calculation unit
The related score of one user and one word,
The ratio of the total operation time of the one user for each file containing the one word to the total operation time of all users in the organization for each file containing the one word in the file name. ,
The one user for each file containing the one word in the file name with respect to the sum of the sums when the sum of the operation times of the one user for each file containing the word is obtained for each word. The related score calculation system described in Appendix 1, which is calculated based on the ratio of the total operation time of.

（付記５）
検索キーワードを受け付けるキーワード受付部と、
検索キーワードに応じて検索を実行する検索部とを備え、
前記キーワード受付部は、
検索キーワードとして単語を受け付け、
前記検索部は、
関連スコアに基づいて、前記単語に応じたユーザＩＤを検索する
付記１から付記４のうちのいずれかに記載の関連スコア算出システム。 (Appendix 5)
The keyword reception department that accepts search keywords and
It is equipped with a search unit that executes a search according to the search keyword.
The keyword reception department
Accept words as search keywords,
The search unit
The related score calculation system according to any one of Supplements 1 to 4, which searches for a user ID corresponding to the word based on the related score.

（付記６）
検索キーワードを受け付けるキーワード受付部と、
検索キーワードに応じて検索を実行する検索部とを備え、
前記キーワード受付部は、
検索キーワードとしてユーザＩＤを受け付け、
前記検索部は、
関連スコアに基づいて、前記ユーザＩＤに応じた単語を検索する
付記１から付記５のうちのいずれかに記載の関連スコア算出システム。 (Appendix 6)
The keyword reception department that accepts search keywords and
It is equipped with a search unit that executes a search according to the search keyword.
The keyword reception department
Accepts user ID as a search keyword,
The search unit
The related score calculation system according to any one of Supplements 1 to 5, which searches for a word corresponding to the user ID based on the related score.

（付記７）
ユーザＩＤと、単語と、関連スコアとの関係を記述した第１のテーブルを生成する第１のテーブル生成部と、
第１のテーブルに基づいて算出した単語同士の関連の強さを表す関連度を記述した第２のテーブル生成する第２のテーブル生成部と、
検索キーワードを受け付けるキーワード受付部と、
検索キーワードに応じて検索を実行する検索部とを備え、
前記キーワード受付部は、
検索キーワードとして単語を受け付け、
前記検索部は、
前記第１のテーブルおよび前記第２のテーブルに基づいて、前記単語に応じたユーザＩＤを検索する
付記１から付記４のうちのいずれかに記載の関連スコア算出システム。 (Appendix 7)
A first table generator that generates a first table that describes the relationship between a user ID, a word, and a related score.
A second table generator that describes the degree of relevance that represents the strength of the relevance between words calculated based on the first table, and a second table generator that generates the second table.
The keyword reception department that accepts search keywords and
It is equipped with a search unit that executes a search according to the search keyword.
The keyword reception department
Accept words as search keywords,
The search unit
The related score calculation system according to any one of Supplementary note 1 to Supplementary note 4, which searches for a user ID corresponding to the word based on the first table and the second table.

（付記８）
ユーザＩＤと、単語と、関連スコアとの関係を記述した第１のテーブルを生成する第１のテーブル生成部と、
第１のテーブルに基づいて算出した単語同士の関連の強さを表す関連度を記述した第２のテーブル生成する第２のテーブル生成部と、
検索キーワードを受け付けるキーワード受付部と、
検索キーワードに応じて検索を実行する検索部とを備え、
前記キーワード受付部は、
検索キーワードとしてユーザＩＤを受け付け、
前記検索部は、
前記第１のテーブルおよび前記第２のテーブルに基づいて、前記ユーザＩＤに応じた単語を検索する
付記１から付記４および付記７のうちのいずれかに記載の関連スコア算出システム。 (Appendix 8)
A first table generator that generates a first table that describes the relationship between a user ID, a word, and a related score.
A second table generator that describes the degree of relevance that represents the strength of the relevance between words calculated based on the first table, and a second table generator that generates the second table.
The keyword reception department that accepts search keywords and
It is equipped with a search unit that executes a search according to the search keyword.
The keyword reception department
Accepts user ID as a search keyword,
The search unit
The related score calculation system according to any one of Supplementary note 1, Supplementary note 4, and Supplementary note 7, which searches for a word according to the user ID based on the first table and the second table.

（付記９）
第２のテーブルに基づいて、単語をクラスタリングするクラスタリング部と、
クラスタ毎に単語を提示する単語提示部と、
提示された単語のうち、第１のテーブルおよび第２のテーブルから除外すべき単語の指定を受け付ける削除対象受付部とを備え、
第１のテーブル生成部は、
除外すべき単語として指定された単語を除外して、第１のテーブルを再度、生成し、
第２のテーブル生成部は、
前記第１のテーブルに基づいて、第２のテーブルを再度、生成する
付記７または付記８に記載の関連スコア算出システム。 (Appendix 9)
A clustering part that clusters words based on the second table,
A word presentation unit that presents words for each cluster,
It is equipped with a deletion target reception unit that accepts the designation of words to be excluded from the first table and the second table among the presented words.
The first table generator is
Exclude the word specified as the word to be excluded, generate the first table again, and
The second table generator is
The related score calculation system according to Appendix 7 or Appendix 8, which regenerates the second table based on the first table.

（付記１０）
人と、イベント名と、前記イベント名を有するイベントに前記人が関わった時間帯とを記述したスケジュール情報を収集する収集部と、
各スケジュール情報に記述されているイベント名から単語を抽出する単語抽出部と、
各スケジュール情報に基づいて、人と単語との関連の強さを表す関連スコアを算出する関連スコア算出部とを備える
ことを特徴とする関連スコア算出システム。 (Appendix 10)
A collection unit that collects schedule information that describes a person, an event name, and a time zone in which the person was involved in an event having the event name.
A word extractor that extracts words from the event names described in each schedule information,
A related score calculation system including a related score calculation unit that calculates a related score indicating the strength of the relationship between a person and a word based on each schedule information.

（付記１１）
ユーザがファイルを操作した記録である操作ログを、端末装置から収集し、
各操作ログに記述されているファイル名から単語を抽出し、
各操作ログに基づいて、ユーザと単語との関連の強さを表す関連スコアを算出する
ことを特徴とする関連スコア算出方法。 (Appendix 11)
The operation log, which is a record of the user operating the file, is collected from the terminal device and
Extract words from the file names described in each operation log and
A related score calculation method characterized in that a related score indicating the strength of the relationship between a user and a word is calculated based on each operation log.

（付記１２）
人と、イベント名と、前記イベント名を有するイベントに前記人が関わった時間帯とを記述したスケジュール情報を収集し、
各スケジュール情報に記述されているイベント名から単語を抽出し、
各スケジュール情報に基づいて、人と単語との関連の強さを表す関連スコアを算出することを特徴とする関連スコア算出方法。 (Appendix 12)
Collect schedule information that describes the person, the event name, and the time zone in which the person was involved in the event having the event name.
Extract words from the event names described in each schedule information,
A related score calculation method characterized in that a related score indicating the strength of a person-word relationship is calculated based on each schedule information.

（付記１３）
コンピュータに、
ユーザがファイルを操作した記録である操作ログを、端末装置から収集する収集処理、
各操作ログに記述されているファイル名から単語を抽出する単語抽出処理、および、
各操作ログに基づいて、ユーザと単語との関連の強さを表す関連スコアを算出する関連スコア算出処理
を実行させるための関連スコア算出プログラム。 (Appendix 13)
On the computer
Collection processing that collects operation logs, which are records of user operations on files, from terminal devices.
Word extraction processing that extracts words from the file names described in each operation log, and
A related score calculation program for executing a related score calculation process that calculates a related score indicating the strength of the relationship between a user and a word based on each operation log.

（付記１４）
コンピュータに、
人と、イベント名と、前記イベント名を有するイベントに前記人が関わった時間帯とを記述したスケジュール情報を収集する収集処理、
各スケジュール情報に記述されているイベント名から単語を抽出する単語抽出処理、および、
各スケジュール情報に基づいて、人と単語との関連の強さを表す関連スコアを算出する関連スコア算出処理
を実行させるための関連スコア算出プログラム。 (Appendix 14)
On the computer
A collection process that collects schedule information that describes a person, an event name, and a time zone in which the person was involved in an event having the event name.
Word extraction process that extracts words from the event name described in each schedule information, and
A related score calculation program for executing a related score calculation process that calculates a related score that indicates the strength of the relationship between a person and a word based on each schedule information.

本発明は、人と単語との関連の強さを数値化する関連スコア算出システムに好適に適用される。 The present invention is suitably applied to a related score calculation system that quantifies the strength of a relationship between a person and a word.

１関連スコア算出システム
２，５２収集部
３，５３単語抽出部
４，５４スコア算出部
５記憶部
６キーワード受付部
７，１７検索部
８出力部
３１クラスタリング部
３２クラスタ出力部
３３除外対象単語受付部 1 Related score calculation system 2,52 Collection unit 3,53 Word extraction unit 4,54 Score calculation unit 5 Storage unit 6 Keyword reception unit 7,17 Search unit 8 Output unit 31 Clustering unit 32 Cluster output unit 33 Excluded word reception unit

Claims

A collection unit that collects schedule information that describes a person, an event name, and a time zone in which the person was involved in an event having the event name.
A word extractor that extracts words from the event names described in each schedule information,
It is equipped with a related score calculation unit that calculates a related score that indicates the strength of the relationship between a person and a word based on each schedule information.
The related score calculation unit
The related score of one person and one word,
The total event participation time of the one person for each event containing the one word in the event name with respect to the total event participation time of all persons in the organization for each event containing the one word in the event name. Percentage and
The above-mentioned one for each event containing the one word in the event name with respect to the total of the total of the sum of the event participation times of the one person for each event including the word in the event name. A related score calculation system characterized in that it is calculated based on the ratio of the total time of a person's participation in an event.

The computer
Collect schedule information that describes the person, the event name, and the time zone in which the person was involved in the event having the event name.
Extract words from the event names described in each schedule information,
Based on each schedule information, a relation score indicating the strength of the relation between a person and a word is calculated.
When calculating the relevant score,
The related score of one person and one word,
The total event participation time of the one person for each event containing the one word in the event name with respect to the total event participation time of all persons in the organization for each event containing the one word in the event name. Percentage and
The above-mentioned one for each event containing the one word in the event name with respect to the total of the total of the sum of the event participation times of the one person for each event including the word in the event name. A related score calculation method characterized in that it is calculated based on the ratio of the total time of a person's participation in an event.

On the computer
A collection process that collects schedule information that describes a person, an event name, and a time zone in which the person was involved in an event having the event name.
Word extraction process that extracts words from the event name described in each schedule information, and
Based on each schedule information, the related score calculation process that calculates the related score indicating the strength of the relationship between the person and the word is executed.
In the related score calculation process,
The related score of one person and one word,
The total event participation time of the one person for each event containing the one word in the event name with respect to the total event participation time of all persons in the organization for each event containing the one word in the event name. Percentage and
The above-mentioned one for each event containing the one word in the event name with respect to the total of the total of the sum of the event participation times of the one person for each event including the word in the event name. A related score calculation program for calculating based on the ratio of the total time a person participates in an event.