JP2022133401A - Relevance score calculation system, method, and program

Info

Publication number: JP2022133401A
Application number: JP2022108819A
Authority: JP
Inventors: 洋介本橋; Yosuke Motohashi; 昌子今西; Masako Imanishi
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2017-11-06
Filing date: 2022-07-06
Publication date: 2022-09-13
Anticipated expiration: 2037-11-06
Also published as: JP2019086940A; JP2021192237A; JP7375861B2; JP7103496B2; JP6972935B2

Abstract

PROBLEM TO BE SOLVED: To provide a relevance score calculation system, relevance score calculation method, and relevance score calculation program for quantifying the strength of relation between a person and a word.

SOLUTION: In a relevance score calculation system, a keyword reception unit 6 receives a keyword from a searcher, and a search unit 7 acquires, from a storage unit 5, a relevance score based on the total operation time during which a user operated each file for each combination of a word included in each file name and each user who operated the file in an operation log, identifies a pair containing the word corresponding to the search keyword and the relevance score equal to or greater than a threshold from among pairs of a user name, word, and relevance score, and identifies the user name included in the pair as a search result.

SELECTED DRAWING: Figure 5

Description

The present invention relates to a relevance score calculation system, a relevance score calculation method, and a relevance score calculation program for quantifying the strength of the relation between a person and a word.

Patent Document 1 describes an information providing device that determines which application programs a user is using from an operation log containing the information needed to determine how much the application programs installed on a personal computer or a mobile information terminal are used. Patent Document 1 also describes that the information providing device determines how much the user uses an application program and determines the user's level of knowledge of the application program.

Patent Document 2 describes a personnel search system that registers, deletes, and updates information about personnel in a profile database and searches for personnel matching a search keyword by referring to the profile information. Patent Document 2 also describes obtaining information about a person's field of expertise and work by extracting keywords from documents authored by that person and obtaining the top-ranked keywords.

Patent Document 1: JP 2013-37584 A
Patent Document 2: JP 2005-327028 A

Within an organization such as a company, it is desirable to be able to find people who are familiar with a specific field or technology, or who have participated in a particular project. It is also desirable to be able to easily find out which fields and technologies a person is familiar with and which projects that person has participated in. In a large organization such as a large corporation, however, knowledge such as "who is familiar with which field or technology" and "who has participated in which project" ends up being known only to people who have been in the organization for many years. This tendency is especially strong for information about the past, such as "who was familiar with which field or technology" and "who participated in which project". As a result, new employees and temporary staff in particular do not know whom to ask about what they want to know, and the efficiency of product development, for example, may decrease.

The inventors of the present invention therefore considered it desirable to be able to clarify the strength of the relation between people in an organization and words. For example, if the strength of the relation between a certain person and the word "artificial intelligence" can be clarified, it becomes easier to estimate whether that person is familiar with the field and technology of artificial intelligence and whether that person has participated in a project related to artificial intelligence.

Accordingly, an object of the present invention is to provide a relevance score calculation system, a relevance score calculation method, and a relevance score calculation program capable of clarifying the strength of the relation between a person in an organization and a word.

A search system according to the present invention includes a keyword reception unit that receives a keyword from a searcher, and a search unit that searches for a user related to the keyword based on the sum, among one user's operation times for individual files, of the operation times of the files whose file names include the keyword.

A search method according to the present invention receives a keyword from a searcher and searches for a user related to the keyword based on the sum, among one user's operation times for individual files, of the operation times of the files whose file names include the keyword.

A search program according to the present invention causes a computer to execute a process of receiving a keyword from a searcher, and a process of searching for a user related to the keyword based on the sum, among one user's operation times for individual files, of the operation times of the files whose file names include the keyword.

A search system according to the present invention includes a keyword reception unit that receives a keyword from a searcher, and a search unit that searches for a user related to the keyword based on the sum, among one user's participation times in events, of the participation times in the events whose event names include the keyword.

A search method according to the present invention receives a keyword from a searcher and searches for a user related to the keyword based on the sum, among one user's participation times in events, of the participation times in the events whose event names include the keyword.

A search program according to the present invention causes a computer to execute a process of receiving a keyword from a searcher, and a process of searching for a user related to the keyword based on the sum, among one user's participation times in events, of the participation times in the events whose event names include the keyword.

According to the present invention, the strength of the relation between a person in an organization and a word can be clarified.

FIG. 1 is a block diagram showing a configuration example of the relevance score calculation system of the first embodiment of the present invention.
FIG. 2 is a schematic diagram showing an example of operation logs.
FIG. 3 is a schematic diagram showing an example of relevance scores calculated for each combination of a user and a word.
FIG. 4 is a flowchart showing an example of the processing flow of the relevance score calculation system of the first embodiment.
FIG. 5 is a block diagram showing a modification of the first embodiment.
FIG. 6 is a schematic diagram showing an example of sets of a user name, a word, and a relevance score stored in the storage unit.
FIG. 7 is a block diagram showing a configuration example of the relevance score calculation system of the second embodiment of the present invention.
FIG. 8 is a schematic diagram showing an example of a relevance score calculation result (a collection of sets of a user name, a word, and a relevance score).
FIG. 9 is a schematic diagram showing an example of the first table.
FIG. 10 is a schematic diagram showing an example of the second table.
FIG. 11 is a flowchart showing an example of the processing flow of the second embodiment.
FIG. 12 is a flowchart showing an example of processing for calculating the search score of a selected user and a keyword word.
FIG. 13 is a flowchart showing an example of processing for calculating the search score of a score calculation target word and a keyword user name.
FIG. 14 is a block diagram showing a configuration example of the relevance score calculation system of the third embodiment of the present invention.
FIG. 15 is a schematic diagram showing an example of a screen presenting a clustering result.
FIG. 16 is a block diagram showing a configuration example of the relevance score calculation system of the fourth embodiment.
FIG. 17 is a schematic diagram showing an example of schedule information.
FIG. 18 is a schematic block diagram showing a configuration example of a computer according to each embodiment of the present invention.
FIG. 19 is a block diagram showing an overview of the present invention.
FIG. 20 is a block diagram showing another example of the overview of the present invention.

Embodiments of the present invention are described below with reference to the drawings.

Embodiment 1.
FIG. 1 is a block diagram showing a configuration example of the relevance score calculation system of the first embodiment of the present invention. FIG. 1 also shows the devices connected to the relevance score calculation system of the present invention via a communication network.

Personal computers (hereinafter, PCs) 91 are connected to the relevance score calculation system 1 of the present invention via a communication network 10.

Each PC 91 is used, for example, by a person belonging to an organization such as a company or enterprise. The PCs 91 are assumed to be installed within one organization. The following description takes the case where the organization is a company as an example. However, the organization to which the people using the PCs 91 belong need not be a company or enterprise. The organization may also be one formed by a plurality of companies, or it may be one division of a single company or enterprise.

Each PC 91 creates and stores an operation log for a file when a user operates that file. As described later, the operation logs stored in each PC 91 are collected by the relevance score calculation system 1 (more specifically, by the collection unit 2 of the relevance score calculation system 1).

FIG. 2 is a schematic diagram showing an example of operation logs created by a PC 91. Each operation log associates, for example, a file name, the user name of the user who operated the file, the content of the operation, and the date and time at which the operation was performed (see FIG. 2). For convenience, FIG. 2 also shows operation log numbers. FIG. 2 is only an example, and operation logs are not limited to the form shown there.

The PC 91 writes, as the file name in an operation log, a file name that includes the path name.

The PC 91 may determine the user name to be written in an operation log from, for example, the ID (identification) that the user uses to log in to the PC 91. However, the method by which the PC 91 determines the user name is not limited to this.

In each embodiment, the user name is used as the user ID (user identification information).

Examples of the operation content written in an operation log include "file open", "key touch", "update (save)", and "file close". However, the operation content written in an operation log is not limited to these and may be, for example, "new creation".

For example, when user "Yamada" opens the file "/.../artificial intelligence/.../Machine learning of company A.pptx" at 13:15 on October 10, 2017, the PC 91 creates the first operation log illustrated in FIG. 2. When user "Yamada" performs a key touch (key input) on that file at 13:16 on the same day, the PC 91 creates the second operation log illustrated in FIG. 2. When user "Yamada" updates (saves) the file at 15:17 on the same day, the PC 91 creates the (m-1)-th operation log illustrated in FIG. 2, and when user "Yamada" closes the file at 15:45 on the same day, the PC 91 creates the m-th operation log illustrated in FIG. 2.

The (m+1)-th through n-th operation logs shown in FIG. 2 are examples of the operation logs created when user "Yamada" operated another file, "/.../artificial intelligence/.../deep learning.docx".

In the same way, each PC 91 adds and stores an operation log each time a user performs an operation on a file.
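The operation log described above can be pictured as a list of small records. The following is a minimal sketch, assuming a record type named `OperationLog` and a helper named `append_log`; both names, and the operation strings, are hypothetical and do not appear in the source. The elided path segments are kept exactly as the description gives them.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class OperationLog:
    """One operation log: a single operation on a single file (hypothetical type)."""
    file_name: str       # file name including the path name
    user_name: str       # user name, used as the user ID
    operation: str       # e.g. "open", "key_touch", "save", "close"
    timestamp: datetime  # date and time at which the operation was performed


# The file name is quoted from the description; "/.../" marks parts the source elides.
FILE = "/.../artificial intelligence/.../Machine learning of company A.pptx"

# The first, second, (m-1)-th, and m-th operation logs described for FIG. 2.
logs = [
    OperationLog(FILE, "Yamada", "open", datetime(2017, 10, 10, 13, 15)),
    OperationLog(FILE, "Yamada", "key_touch", datetime(2017, 10, 10, 13, 16)),
    OperationLog(FILE, "Yamada", "save", datetime(2017, 10, 10, 15, 17)),
    OperationLog(FILE, "Yamada", "close", datetime(2017, 10, 10, 15, 45)),
]


def append_log(store, file_name, user_name, operation, when):
    """Add one operation log per operation, as each PC 91 is described as doing."""
    store.append(OperationLog(file_name, user_name, operation, when))
    return store
```

A flat list of immutable records keeps the sketch close to the table in FIG. 2; a real collector would more likely write to a file or database.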

The relevance score calculation system 1 includes a collection unit 2, a word extraction unit 3, a score calculation unit 4, and a storage unit 5.

The collection unit 2 collects, from each PC 91, the operation logs stored in that PC 91. As shown in FIG. 2, each operation log includes a file name including the path name and the user name of the user who used the file having that file name. Each operation log also includes information from which a relevance score, an index value representing the strength of the relation between a user and a word, can be derived. In the example shown in FIG. 2, the information given as "date and time" and "operation" corresponds to the information from which a relevance score can be derived. However, a relevance score is derived from a plurality of operation logs, not from a single operation log.

The word extraction unit 3 extracts the words contained in each file name (file name including the path name) written in the operation logs collected by the collection unit 2 by performing morphological analysis on that file name. However, the word extraction unit 3 does not extract the same word more than once. For example, if the word "artificial intelligence" has already been extracted, the word extraction unit 3 ignores the word "artificial intelligence" when it is extracted a second or subsequent time.

For example, the word extraction unit 3 performs morphological analysis on the file name including the path name "/.../artificial intelligence/.../Machine learning of company A.pptx" and thereby extracts words such as "artificial intelligence", "company A", and "machine learning". In the following description, the company (organization) to which each user belongs is assumed to be "company A".

Further, for example, the word extraction unit 3 performs morphological analysis on the file name including the path name "/.../artificial intelligence/.../deep learning.docx" and thereby extracts words such as "artificial intelligence" and "deep learning". However, because "artificial intelligence" has already been extracted as described above, the word extraction unit 3 ignores the word "artificial intelligence" extracted here.

The word extraction unit 3 obtains a set of words by performing the same processing on each file name written in the operation logs. The words in this set are all different from one another.
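The de-duplicating extraction described above can be sketched as follows. This is only an illustration: the source uses morphological analysis, whereas the `tokenize` function here is a deliberately simple stand-in that drops the extension and splits on path separators and underscores. The function names and the ASCII file names are hypothetical.

```python
import posixpath
import re


def tokenize(file_name):
    """Stand-in for morphological analysis (NOT the analyzer the source assumes).

    Drops the file extension, then splits on "/" and "_".
    """
    stem, _extension = posixpath.splitext(file_name)
    return [token for token in re.split(r"[/_]+", stem) if token]


def extract_words(file_names):
    """Collect each word once, ignoring second and later occurrences."""
    seen = set()
    words = []
    for name in file_names:
        for token in tokenize(name):
            if token in seen:
                continue  # already extracted, so ignore it
            seen.add(token)
            words.append(token)
    return words


# Hypothetical file names standing in for the two files in the description.
file_names = [
    "/artificial-intelligence/companyA_machine-learning.pptx",
    "/artificial-intelligence/deep-learning.docx",
]
words = extract_words(file_names)
# words == ["artificial-intelligence", "companyA", "machine-learning", "deep-learning"]
```

For Japanese file names, the stand-in tokenizer would have to be replaced by a real morphological analyzer; the de-duplication loop would stay the same.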

The score calculation unit 4 calculates, for each combination of a user who operated a file and a word extracted by the word extraction unit 3, a relevance score, which is an index value representing the strength of the relation between the user and the word. A user is represented by the user name written in the operation logs.

The second and later embodiments of the present invention also use an index value representing the strength of the relation between words. In the present invention, the two kinds of index value are distinguished by calling the index value representing the strength of the relation between a user and a word a "relevance score" and the index value representing the strength of the relation between words a "relevance degree".

The score calculation unit 4 calculates a relevance score for each combination of a user (user name) and a word, and stores the set of the user name, the word, and the relevance score in the storage unit 5.

The storage unit 5 is a storage device that stores sets of a user (user name), a word, and a relevance score.

There are several methods of calculating the relevance score. Three methods are described below. In each of the three methods, the score calculation unit 4 calculates a relevance score for each combination of a user and a word and stores it in the storage unit 5.

The first calculation method calculates, as the relevance score between one user (hereinafter, user U) and one word (hereinafter, word W), the sum of user U's operation times for the files whose file names include word W. The longer the operation time, the stronger the relation between user U and word W; the shorter the operation time, the weaker the relation. The operation time can therefore be used as the relevance score.

The description here first takes the case where the effective reference time is treated as the operation time.

The file name is a file name including the path name. Therefore, even when word W is contained only in the path name, word W is treated as being included in the file name. The same applies to the second and third calculation methods described later.

The effective reference time is the time during which the window showing the contents of the file is active on the PC 91 (that is, the time during which that window is displayed frontmost from the user's point of view).

In the first calculation method, the score calculation unit 4 calculates the sum of user U's effective reference times for the files whose file names include word W and uses that sum as the relevance score between user U and word W.

While key touches are being made, the window showing the contents of the file is active. Therefore, for example, for one file whose file name includes word W, the score calculation unit 4 determines from the operation logs a state in which user U makes key touches in succession, and takes the time from the first key touch to the last key touch in that state as user U's effective reference time for that file.

If there is another file whose file name includes word W and which was operated by user U, the score calculation unit 4 calculates the effective reference time for that file in the same way.

The score calculation unit 4 then calculates the sum of the effective reference times calculated for each file whose file name includes word W and which was operated by user U, and uses that sum as the relevance score between user U and word W.
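The key-touch-based version of the first calculation method can be sketched as below. The source does not say how "key touches in succession" is decided, so this sketch assumes that two key touches belong to the same run when they are at most `MAX_GAP` apart; that threshold, the function names, the tuple layout, and the sample data are all hypothetical. Word membership is approximated with a substring test on the file name.

```python
from collections import defaultdict
from datetime import datetime, timedelta

MAX_GAP = timedelta(minutes=5)  # assumed threshold; not given in the source


def effective_reference_time(key_touch_times, max_gap=MAX_GAP):
    """Sum, over runs of successive key touches, of (last touch - first touch)."""
    times = sorted(key_touch_times)
    total = timedelta(0)
    if not times:
        return total
    run_start = prev = times[0]
    for t in times[1:]:
        if t - prev > max_gap:        # the run ended at `prev`
            total += prev - run_start
            run_start = t
        prev = t
    return total + (prev - run_start)


def relevance_score_time(logs, user, word):
    """logs: iterable of (file_name, user_name, operation, timestamp) tuples."""
    touches = defaultdict(list)
    for file_name, user_name, operation, ts in logs:
        # Substring test stands in for "word W is included in the file name".
        if user_name == user and operation == "key_touch" and word in file_name:
            touches[file_name].append(ts)
    per_file = (effective_reference_time(ts_list) for ts_list in touches.values())
    return sum(per_file, timedelta(0))


def at(hour, minute):
    return datetime(2017, 10, 10, hour, minute)


logs = [
    ("/robotics/arm.pptx", "Yamada", "key_touch", at(13, 16)),
    ("/robotics/arm.pptx", "Yamada", "key_touch", at(13, 18)),
    ("/robotics/arm.pptx", "Yamada", "key_touch", at(13, 20)),  # run 1: 4 minutes
    ("/robotics/arm.pptx", "Yamada", "key_touch", at(14, 0)),
    ("/robotics/arm.pptx", "Yamada", "key_touch", at(14, 3)),   # run 2: 3 minutes
    ("/robotics/legs.docx", "Yamada", "key_touch", at(9, 0)),
    ("/robotics/legs.docx", "Yamada", "key_touch", at(9, 2)),   # 2 minutes
    ("/robotics/arm.pptx", "Sato", "key_touch", at(10, 0)),     # other user, ignored
    ("/sales/plan.xlsx", "Yamada", "key_touch", at(11, 0)),     # other word, ignored
]
score = relevance_score_time(logs, "Yamada", "robotics")  # timedelta(minutes=9)
```

If the operation log records window activation and deactivation explicitly, those times would replace the gap heuristic entirely.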

If the operation logs explicitly state the start time and end time of the active state, the score calculation unit 4 may calculate the effective reference time based on the times stated in the operation logs.

The score calculation unit 4 may also calculate the time from file open to file close as the operation time. In this case, the score calculation unit 4 calculates the time from file open to file close for each file whose file name includes word W and which was operated by user U, and uses the sum of those times as the relevance score between user U and word W.
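The open-to-close variant can be sketched in a few lines. As before, the tuple layout, the operation strings, the function name, and the sample data are hypothetical, and a substring test stands in for "the file name includes word W". An open without a matching close is simply ignored here, which is an assumption the source does not address.

```python
from datetime import datetime, timedelta


def open_to_close_score(logs, user, word):
    """Sum of (close time - open time) over user's files whose names contain word.

    logs: iterable of (file_name, user_name, operation, timestamp) tuples.
    """
    opened_at = {}
    total = timedelta(0)
    for file_name, user_name, operation, ts in sorted(logs, key=lambda r: r[3]):
        if user_name != user or word not in file_name:
            continue
        if operation == "open":
            opened_at[file_name] = ts
        elif operation == "close" and file_name in opened_at:
            total += ts - opened_at.pop(file_name)
    return total


logs = [
    ("/robotics/arm.pptx", "Yamada", "open", datetime(2017, 10, 10, 13, 15)),
    ("/robotics/arm.pptx", "Yamada", "close", datetime(2017, 10, 10, 15, 45)),
    ("/robotics/legs.docx", "Yamada", "open", datetime(2017, 10, 10, 16, 0)),
    ("/robotics/legs.docx", "Yamada", "close", datetime(2017, 10, 10, 16, 20)),
    ("/robotics/arm.pptx", "Sato", "open", datetime(2017, 10, 11, 9, 0)),
    ("/robotics/arm.pptx", "Sato", "close", datetime(2017, 10, 11, 9, 30)),
]
score = open_to_close_score(logs, "Yamada", "robotics")  # 2 h 30 min + 20 min
```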

It is preferable to treat the effective reference time as the operation time.

The second calculation method calculates, as the relevance score between one user (user U) and one word (word W), the total number of key touches made when user U operated the files whose file names include word W. The more key touches there are, the more user U has worked on files whose file names include word W. It can therefore be said that the more key touches there are, the stronger the relation between user U and word W, and the fewer key touches there are, the weaker the relation. The number of key touches can therefore be used as the relevance score.

In the second calculation method, the score calculation unit 4 refers to the operation logs and counts the key touches made when user U operated one file whose file name includes word W, thereby obtaining the number of key touches for that file.

If there is another file whose file name includes word W and which was operated by user U, the score calculation unit 4 obtains the number of key touches for that file in the same way.

The score calculation unit 4 then calculates the sum of the numbers of key touches obtained for each file whose file name includes word W and which was operated by user U, and uses that sum as the relevance score between user U and word W.
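The second calculation method reduces to counting. A minimal sketch, assuming logs are `(file_name, user_name, operation)` tuples and using hypothetical function names and sample data; the substring test again stands in for "the file name includes word W":

```python
from collections import Counter


def key_touch_counts(logs, user, word):
    """Number of key touches per file, for user's files whose names contain word."""
    counts = Counter()
    for file_name, user_name, operation in logs:
        if user_name == user and operation == "key_touch" and word in file_name:
            counts[file_name] += 1
    return counts


def relevance_score_key_touches(logs, user, word):
    """Sum of the per-file key-touch counts."""
    return sum(key_touch_counts(logs, user, word).values())


logs = [
    ("/robotics/arm.pptx", "Yamada", "open"),
    ("/robotics/arm.pptx", "Yamada", "key_touch"),
    ("/robotics/arm.pptx", "Yamada", "key_touch"),
    ("/robotics/arm.pptx", "Yamada", "key_touch"),
    ("/robotics/legs.docx", "Yamada", "key_touch"),
    ("/robotics/legs.docx", "Yamada", "key_touch"),
    ("/robotics/arm.pptx", "Sato", "key_touch"),
]
score = relevance_score_key_touches(logs, "Yamada", "robotics")  # 3 + 2
```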

The third calculation method calculates the relevance score between one user (user U) and one word (word W) based on the two ratios described next. One of the two ratios is denoted R_1 and the other R_2.

R_1 is the ratio of the sum of user U's operation times for the files whose file names include word W to the sum of the operation times of all users in the organization for the files whose file names include word W. That is, R_1 is expressed by formula (1) shown below.
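Formula (1) itself did not survive in this text. The following is a reconstruction from the prose definition only, not a copy of the original drawing. The notation is an assumption: T(u, w) denotes the sum of user u's operation times for the files whose file names include word w, and 𝒰 denotes the set of all users in the organization.

```latex
R_1 = \frac{T(U, W)}{\sum_{u \in \mathcal{U}} T(u, W)} \tag{1}
```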

R_2 is the ratio of the sum of user U's operation times for the files whose file names include word W to the total obtained by focusing on each individual word in turn and summing user U's operation times for the files whose file names include the word of interest. That is, R_2 is expressed by formula (2) shown below.
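Formula (2) is likewise missing from this text. The following is a reconstruction from the prose definition only. The notation is an assumption: T(u, w) denotes the sum of user u's operation times for the files whose file names include word w, and 𝒲 denotes the set of words extracted by the word extraction unit 3.

```latex
R_2 = \frac{T(U, W)}{\sum_{w \in \mathcal{W}} T(U, w)} \tag{2}
```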

R_1 is described first. The sum of user U's operation times for the files whose file names include word W (the numerator on the right side of formula (1)) corresponds to the relevance score calculated by the first calculation method described above. That is, the score calculation unit 4 may calculate the sum of user U's operation times for the files whose file names include word W by the method described for the first calculation method.

Next, the sum of the operation times of all users in the organization for the files whose file names include word W (the denominator on the right side of formula (1)) is described. The score calculation unit 4 calculates the sum of the operation times of the first user in the organization for the files whose file names include word W, again by the method described for the first calculation method. In the same way, it calculates the sum of the operation times of the second user in the organization for the files whose file names include word W. Continuing in this way, the score calculation unit 4 calculates, for every person belonging to the organization, the sum of that user's operation times for the files whose file names include word W. The score calculation unit 4 then calculates the sum of these per-user sums. This value is the sum of the operation times of all users in the organization for the files whose file names include word W.

For example, suppose that word W is "artificial intelligence" and user U is "Yamada", and that the organization "company A" to which Yamada belongs has 300 users. In this case, the score calculation unit 4 obtains the sum of user "Yamada"'s operation times for the files whose file names include "artificial intelligence" as the numerator on the right side of formula (1). The score calculation unit 4 also calculates, for each of the 300 users, the sum of that user's operation times for the files whose file names include "artificial intelligence", and obtains the sum of these per-user sums as the denominator on the right side of formula (1). The score calculation unit 4 then calculates R_1 by formula (1).
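The calculation of R_1 can be sketched as follows, assuming the per-user, per-word operation-time sums from the first calculation method are already available in a dictionary keyed by `(user_name, word)`. The function name, the dictionary layout, the users other than Yamada, and all numbers are hypothetical. Returning 0.0 when nobody has operated a matching file is also an assumption; the source does not cover that case.

```python
def r1(total_time, user, word):
    """Share of all users' operation time on files containing `word` that is `user`'s.

    total_time maps (user_name, word) -> summed operation time in seconds.
    """
    numerator = total_time.get((user, word), 0)
    # Denominator: sum of the per-user sums for this word.
    denominator = sum(t for (u, w), t in total_time.items() if w == word)
    return numerator / denominator if denominator else 0.0


# Hypothetical per-(user, word) sums in seconds; a real organization would have
# one entry per user (300 in the example above) rather than three.
total_time = {
    ("Yamada", "artificial intelligence"): 1800,
    ("Sato", "artificial intelligence"): 600,
    ("Suzuki", "artificial intelligence"): 1200,
    ("Yamada", "deep learning"): 5400,
}
share = r1(total_time, "Yamada", "artificial intelligence")  # 1800 / 3600
```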

次に、Ｒ_２について説明する。式（２）の右辺の分子は、式（１）の右辺の分子と同じである。従って、スコア算出部４は、前述の第１の算出方法で説明した方法で、単語Ｗをファイル名に含む各ファイルについてのユーザＵの操作時間の総和を算出すればよい。 Next, _R2 will be explained. The numerator on the right side of equation (2) is the same as the numerator on the right side of equation (1). Therefore, the score calculation unit 4 may calculate the sum of the operation time of the user U for each file including the word W in the file name by the method described in the first calculation method.

個々の単語に着目した場合における、着目した単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和（式（２）の右辺の分母）について説明する。スコア算出部４は、単語抽出部３によって抽出された個々の単語に着目する（換言すれば、個々の単語を１つ１つ選択する）。そして、スコア算出部４は、着目した単語（選択した単語）をファイル名に含む各ファイルについてのユーザＵの操作時間の総和を、前述の第１の算出方法で説明した方法で算出する。スコア算出部４は、次の単語に着目し（換言すれば、次の単語を選択し）、着目した単語（選択した単語）をファイル名に含む各ファイルについてのユーザＵの操作時間の総和を、前述の第１の算出方法で説明した方法で算出する。このように、スコア算出部４は、単語毎に、単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和を算出する。そして、スコア算出部４は、単語毎に算出した「単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和」の総和を算出する。この値が、式（２）の右辺の分母に該当する。「個々の単語に着目した場合における、着目した単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和（式（２）の右辺の分母）」は、「単語をファイル名に含む各ファイルについてのユーザＵの操作時間の総和を単語毎に求めた場合における前記総和の総和」であると言うことができる。 Next, the denominator on the right side of Equation (2), that is, the sum over the individual words of user U's total operation time for the files containing each word in their file names, will be described. The score calculation unit 4 focuses on the individual words extracted by the word extraction unit 3 (in other words, selects the words one by one). The score calculation unit 4 then calculates user U's total operation time for the files whose file names contain the word of interest (the selected word) by the method described for the first calculation method. The score calculation unit 4 then focuses on the next word (in other words, selects the next word) and calculates, in the same way, user U's total operation time for the files whose file names contain that word. In this way, the score calculation unit 4 calculates, for each word, user U's total operation time for the files containing the word in their file names. The score calculation unit 4 then sums these per-word totals. This value is the denominator on the right side of Equation (2). In other words, the denominator on the right side of Equation (2) can be described as "the sum of the totals obtained when user U's total operation time for the files containing a word in their file names is calculated for each word."

例えば、前述の例のように、単語Ｗが「人工知能」であり、ユーザＵが「山田」であるとする。この場合、スコア算出部４は、「人工知能」をファイル名に含むファイルについてのユーザ「山田」の操作時間の総和を、式（２）の右辺の分子として求める。また、スコア算出部４は、「人工知能」、「Ａ社」、「ディープラーニング」等の抽出された単語毎に、単語をファイル名に含むファイルについてのユーザ「山田」の操作時間の総和を算出する。さらに、個々の単語毎に算出した「操作時間の総和」の総和を、式（２）の右辺の分母として、求める。そして、スコア算出部４は、式（２）によって、Ｒ_２を算出する。 For example, as in the example above, assume that the word W is "artificial intelligence" and the user U is "Yamada". In this case, the score calculation unit 4 obtains the total operation time of the user "Yamada" for the files containing "artificial intelligence" in their file names as the numerator on the right side of Equation (2). The score calculation unit 4 also calculates, for each extracted word such as "artificial intelligence", "Company A", and "deep learning", the total operation time of the user "Yamada" for the files containing that word in their file names. It then obtains the sum of these per-word totals as the denominator on the right side of Equation (2). The score calculation unit 4 then calculates R₂ by Equation (2).
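
The R₂ ratio for a single user can be sketched in the same style. Again this is a minimal illustration, not the implementation: `op_time`, its values, and the function name `r2` are hypothetical, and the per-word totals are assumed to be available already.

```python
# Hypothetical input: total operation time in seconds per (user, word).
op_time = {
    ("Yamada", "artificial intelligence"): 600.0,
    ("Yamada", "Company A"): 100.0,
    ("Yamada", "deep learning"): 300.0,
    ("Suzuki", "artificial intelligence"): 150.0,
}


def r2(op_time, user, word):
    """R_2: the user's time on files naming `word`, divided by the sum,
    over every extracted word, of the same user's per-word totals."""
    numerator = op_time.get((user, word), 0.0)
    denominator = sum(t for (u, _), t in op_time.items() if u == user)
    if denominator == 0:
        return None  # assumption: undefined when the user has no logged time
    return numerator / denominator


r2_yamada_ai = r2(op_time, "Yamada", "artificial intelligence")
```

Here the denominator runs over words for one user, whereas the R₁ denominator runs over users for one word. A file whose name contains several extracted words contributes to several per-word totals, as in the description above.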

スコア算出部４は、Ｒ_１，Ｒ_２を求めた後、ユーザＵと単語Ｗの関連スコアを、以下に示す式（３）によって算出する。 After obtaining R₁ and R₂, the score calculation unit 4 calculates the relevance score between the user U and the word W using Equation (3) below.

関連スコア＝Ｒ_１×ｌｏｇ（Ｒ_２）・・・（３） Relevance score = R₁ × log(R₂) ... (3)

この第３の算出方法で関連スコアを算出した場合、組織に属する多くの人に関連のある単語については、関連スコアの値が低くなり、組織に属する特定の人に関連のある単語については、関連スコアの値が高くなる。例えば、各ユーザはＡ社に属しているので、「Ａ社」という単語は、各ユーザと関連があると考えられる。しかし、「Ａ社」という単語と、各ユーザの関連が強いということは、自明であると言える。そのため、「Ａ社」という単語と各ユーザの関連スコアを高くしても、あまり意味がなく、関連スコアを低くした方が好ましい。また、組織に属する特定のユーザのみが、「人工知能」という単語と関連している場合、そのユーザと「人工知能」という単語の関連スコアは高くした方が好ましい。第３の算出方法では、そのように、関連スコアを算出することができる。 When the relevance score is calculated by this third calculation method, a word related to many people in the organization receives a low relevance score, and a word related only to specific people in the organization receives a high relevance score. For example, since every user belongs to Company A, the word "Company A" is considered to be related to every user. However, a strong relation between the word "Company A" and each user is self-evident. Raising the relevance score between "Company A" and each user therefore has little meaning, and a lower score is preferable. Conversely, if only specific users in the organization are related to the word "artificial intelligence", a higher relevance score between those users and that word is preferable. The third calculation method can calculate relevance scores in this way.
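
Equation (3) can be written down literally as follows. This is a sketch under assumptions: the function name `relevance_score` is hypothetical, the base of the logarithm is not stated in the text (the natural logarithm is assumed here), and the handling of R₂ = 0 is a choice made only for this example.

```python
import math


def relevance_score(r1, r2):
    """Literal reading of Equation (3): R_1 * log(R_2).

    Assumptions: natural logarithm; None when the logarithm is undefined.
    Under the definitions above R_2 is at most 1, so log(R_2) in this
    literal reading is zero or negative. Any further scaling is not
    described in this passage and is not modeled here.
    """
    if r2 is None or r1 is None or r2 <= 0:
        return None
    return r1 * math.log(r2)


score = relevance_score(0.6, 0.6)
```

If a different base is intended, `math.log(r2, base)` can be substituted without changing the structure of the calculation.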

図３は、ユーザと単語の組合せ毎に算出された関連スコアの例を示す模式図である。図３に示す第１の関連スコアは、第１の算出方法で算出された関連スコアである。第２の関連スコアは、第２の算出方法で算出された関連スコアである。第３の関連スコアは、第３の算出方法で算出された関連スコアである。図３では、３種類の関連スコアを図示したが、スコア算出部４は、いずれか１種類の関連スコアを算出すればよい。ただし、スコア算出部４は、２種類以上の関連スコアを算出してもよい。 FIG. 3 is a schematic diagram showing an example of association scores calculated for each combination of a user and a word. The first related score shown in FIG. 3 is the related score calculated by the first calculation method. The second related score is the related score calculated by the second calculation method. The third related score is the related score calculated by the third calculation method. Although three types of related scores are shown in FIG. 3, the score calculation unit 4 may calculate any one type of related score. However, the score calculator 4 may calculate two or more types of related scores.

既に説明したように、スコア算出部４は、ユーザと単語の組合せ毎に、関連スコアを算出し、ユーザ名と単語と関連スコアとの組を記憶部５に記憶させる。 As already explained, the score calculation unit 4 calculates a relation score for each combination of a user and a word, and causes the storage unit 5 to store the set of the user name, the word, and the relation score.

収集部２は、例えば、関連スコア算出プログラムに従って動作するコンピュータのＣＰＵ（Central Processing Unit ）およびそのコンピュータの通信インタフェースによって実現される。例えば、ＣＰＵが、コンピュータのプログラム記憶装置等のプログラム記録媒体から関連スコア算出プログラムを読み込み、関連スコア算出プログラムに従って、通信インタフェースを用いて、収集部２として動作すればよい。また、単語抽出部３およびスコア算出部４も、例えば、関連スコア算出プログラムに従って動作する上記のコンピュータのＣＰＵによって実現される。すなわち、上記のように、関連スコア算出プログラムを読み込んだＣＰＵが、関連スコア算出プログラムに従って、単語抽出部３およびスコア算出部４として動作すればよい。記憶部５は、上記のコンピュータの記憶装置によって実現される。また、収集部２、単語抽出部３およびスコア算出部４がそれぞれ別々のハードウェアによって実現されてもよい。 The collection unit 2 is implemented by, for example, a CPU (Central Processing Unit) of a computer that operates according to a related score calculation program and a communication interface of the computer. For example, the CPU may read a related score calculation program from a program recording medium such as a program storage device of the computer, and operate as the collection unit 2 using a communication interface according to the related score calculation program. The word extractor 3 and the score calculator 4 are also implemented by, for example, the CPU of the above computer that operates according to the related score calculator program. That is, as described above, the CPU that has read the related score calculation program should operate as the word extractor 3 and the score calculator 4 according to the related score calculation program. The storage unit 5 is implemented by the storage device of the computer described above. Also, the collection unit 2, the word extraction unit 3, and the score calculation unit 4 may be realized by separate hardware.

また、関連スコア算出システム１は、２つ以上の物理的に分離した装置が有線または無線で接続されている構成であってもよい。この点は、後述する他の実施形態でも同様である。 Also, the related score calculation system 1 may have a configuration in which two or more physically separated devices are connected by wire or wirelessly. This point also applies to other embodiments described later.

次に、第１の実施形態の処理経過について説明する。図４は、第１の実施形態の関連スコア算出システムの処理経過の例を示すフローチャートである。なお、既に説明した事項については、詳細な説明を省略する。 Next, the processing flow of the first embodiment will be described. FIG. 4 is a flowchart showing an example of the processing flow of the relevance score calculation system of the first embodiment. Detailed description of matters already explained is omitted.

まず、収集部２が、会社内に設けられている各ＰＣ９１から、操作ログを収集する（ステップＳ１）。 First, the collection unit 2 collects operation logs from each PC 91 installed in the company (step S1).

次に、単語抽出部３が、各操作ログに記述されているファイル名（パス名を含むファイル名）に対して形態素解析を行うことにより、単語を抽出する（ステップＳ２）。 Next, the word extraction unit 3 extracts words by performing morphological analysis on the file names (file names including path names) described in each operation log (step S2).

次に、スコア算出部４が、各操作ログに基づいて、操作ログに記述されているユーザ名と、ステップＳ２で抽出された単語との組み合わせ毎に、そのユーザ名が表わすユーザと単語との関連の強さを表す関連スコアを算出する。そして、スコア算出部４は、ユーザ名と単語と関連スコアとの組を記憶部５に記憶させる（ステップＳ３）。スコア算出部４は、前述の第１の算出方法、第２の算出方法、および、第３の算出方法のうちの、いずれの方法で関連スコアを算出してもよい。 Next, based on the operation logs, the score calculation unit 4 calculates, for each combination of a user name described in the operation logs and a word extracted in step S2, a relevance score representing the strength of the relation between the user represented by that user name and the word. The score calculation unit 4 then stores each set of user name, word, and relevance score in the storage unit 5 (step S3). The score calculation unit 4 may calculate the relevance score by any of the first, second, and third calculation methods described above.

この結果、記憶部５には、ユーザ名と単語と関連スコアとの組が複数組、記憶される。 As a result, the storage unit 5 stores a plurality of sets of user names, words, and related scores.
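
Steps S1 to S3 can be sketched end to end as follows. Everything concrete here is an assumption made for illustration: the log record layout, the sample paths, and the function names are hypothetical; a simple split on non-alphanumeric characters stands in for the morphological analysis of step S2 (it would not segment Japanese file names); and the stored value is a raw total of operation time, used as one possible operation-amount measure.

```python
import re
from collections import defaultdict

# Step S1 (assumed already done): hypothetical collected log records of
# (user name, file name including path, operation time in seconds).
logs = [
    ("Yamada", "C:/projects/ai_report.docx", 120.0),
    ("Yamada", "C:/projects/ai_slides.pptx", 60.0),
    ("Suzuki", "C:/projects/budget.xlsx", 30.0),
]


def extract_words(path):
    """Step S2 stand-in: split the path on non-alphanumeric characters.
    A real system would apply morphological analysis instead."""
    return {tok.lower() for tok in re.split(r"[^0-9A-Za-z]+", path) if tok}


def total_time_per_pair(logs):
    """Step S3 sketch: accumulate operation time per (user, word)."""
    totals = defaultdict(float)
    for user, path, seconds in logs:
        for word in extract_words(path):
            totals[(user, word)] += seconds
    return dict(totals)


stored = total_time_per_pair(logs)
```

Using a set in `extract_words` means a word that occurs twice in one path is counted once for that file, which is a design choice of this sketch rather than something the text specifies.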

本実施形態によれば、スコア算出部４が、各操作ログに基づいて、ユーザと単語の組合せ毎に、関連スコアを算出する。そして、前述の第１の算出方法、第２の算出方法、および、第３の算出方法は、いずれも、基本的に、単語Ｗをファイル名に含むファイルに対するユーザＵの操作の量（キータッチの回数、操作時間等）が多いほど、関連スコアとして大きな値を算出する。従って、ユーザと単語の組合せ毎に、ユーザと単語との関連の強さが、適切に数値化される。よって、組織内の人と単語との関連の強さを明確化することができる。 According to this embodiment, the score calculation unit 4 calculates a relevance score for each combination of a user and a word based on the operation logs. With each of the first, second, and third calculation methods described above, the larger the amount of user U's operations (number of key touches, operation time, etc.) on files containing the word W in their file names, the larger the value that is basically calculated as the relevance score. Therefore, the strength of the relation between a user and a word is appropriately quantified for each combination of a user and a word. This makes it possible to clarify the strength of the relation between people in the organization and words.

次に、第１の実施形態の変形例について説明する。図５は、第１の実施形態の変形例を示すブロック図である。図５に示す関連スコア算出システム１は、収集部２、単語抽出部３、スコア算出部４および記憶部５に加えて、キーワード受付部６と、検索部７と、出力部８とを備える。図１に示す要素と同様の要素については、図１と同一の符号を付し、説明を省略する。 Next, a modification of the first embodiment will be described. FIG. 5 is a block diagram showing a modification of the first embodiment. The relevance score calculation system 1 shown in FIG. 5 includes a keyword reception unit 6, a search unit 7, and an output unit 8 in addition to the collection unit 2, the word extraction unit 3, the score calculation unit 4, and the storage unit 5. Elements similar to those shown in FIG. 1 are denoted by the same reference numerals as in FIG. 1, and description thereof is omitted.

以下に示す例では、ユーザ名と、単語と、前述の第３の算出方法によって算出された関連スコアとの組が、複数、記憶部５に記憶されているものとして説明する。図６は、記憶部５に記憶されている複数の組の例を示す模式図である。 In the example below, it is assumed that a plurality of sets, each consisting of a user name, a word, and a relevance score calculated by the third calculation method, are stored in the storage unit 5. FIG. 6 is a schematic diagram showing an example of the plurality of sets stored in the storage unit 5.

図５に例示する関連スコア算出システム１は、単語を検索キーワードとして受け付け、その単語に応じたユーザのユーザ名を検索する。あるいは、関連スコア算出システム１は、ユーザ名を検索キーワードとして受け付け、そのユーザ名に応じた単語を検索する。また、関連スコア算出システム１は、上記の２種類の検索をそれぞれ実行可能であってもよい。 The related score calculation system 1 illustrated in FIG. 5 accepts a word as a search keyword, and searches for the username of the user according to the word. Alternatively, the related score calculation system 1 accepts a user name as a search keyword and searches for words corresponding to the user name. Also, the related score calculation system 1 may be capable of executing each of the above two types of searches.

キーワード受付部６は、検索者から検索キーワードを受け付ける。 The keyword reception unit 6 receives a search keyword from a searcher.

検索部７は、検索キーワードに応じて、検索を実行する。 The search unit 7 executes a search according to the search keyword.

出力部８は、検索結果を出力する。 The output unit 8 outputs search results.

なお、キーワード受付部６は、例えば、検索者の使用する端末装置（図示略）から、通信ネットワークを介して、検索キーワードを受け付け、出力部８は、その端末装置に対して、検索結果を送信すればよい。以下、このようにキーワード受付部６が検索キーワードを受け付け、出力部８がこのように検索結果を出力する場合を例にして説明する。ただし、検索キーワードの受け付け態様や、検索結果の出力態様は、この例に限定されない。例えば、キーワード受付部６は、関連スコア算出システム１が備える入力デバイス（図示略）を介して検索キーワードを受け付けてもよい。また、出力部８は、関連スコア算出システム１が備えるディスプレイ装置（図示略）に検索結果を出力（表示）してもよい。 The keyword reception unit 6 may, for example, receive a search keyword via a communication network from a terminal device (not shown) used by a searcher, and the output unit 8 may transmit the search result to that terminal device. The following description takes as an example the case where the keyword reception unit 6 receives a search keyword and the output unit 8 outputs the search result in this manner. However, the manner of receiving search keywords and the manner of outputting search results are not limited to this example. For example, the keyword reception unit 6 may receive a search keyword via an input device (not shown) included in the relevance score calculation system 1. The output unit 8 may also output (display) the search result on a display device (not shown) included in the relevance score calculation system 1.

キーワード受付部６および出力部８は、収集部２と同様に、例えば、関連スコア算出プログラムに従って動作するコンピュータのＣＰＵおよびそのコンピュータの通信インタフェースによって実現される。また、検索部７は、関連スコア算出プログラムに従って動作するそのコンピュータのＣＰＵによって実現される。また、キーワード受付部６、検索部７、出力部８、および他の構成要素がそれぞれ別々のハードウェアによって実現されてもよい。 The keyword reception unit 6 and the output unit 8 are implemented, like the collection unit 2, by, for example, a CPU of a computer that operates according to a related score calculation program and a communication interface of the computer. Also, the search unit 7 is realized by the CPU of the computer that operates according to the related score calculation program. Moreover, the keyword reception unit 6, the search unit 7, the output unit 8, and other components may be realized by separate hardware.

次に、関連スコア算出システム１が、単語を検索キーワードとして受け付け、その単語に応じたユーザのユーザ名を検索する処理の例について説明する。 Next, an example of processing in which the related score calculation system 1 accepts a word as a search keyword and searches for the username of the user according to the word will be described.

まず、キーワード受付部６が、検索者から単語を検索キーワードとして受け付ける。 First, the keyword reception unit 6 receives a word as a search keyword from a searcher.

次に、検索部７が、記憶部５に記憶されている、ユーザ名と単語と関連スコアとの組の中から、検索キーワードに該当する単語と、閾値（例えば、０．５）以上の関連スコアを含む組を特定し、その組に含まれているユーザ名を検索結果として特定する。 Next, the search unit 7 identifies, from among the sets of user name, word, and relevance score stored in the storage unit 5, each set that contains the word matching the search keyword and a relevance score equal to or greater than a threshold (for example, 0.5), and identifies the user name contained in that set as a search result.

例えば、キーワード受付部６が検索キーワードとして、「人工知能」という単語を受け付けたとする。また、上記の閾値が０．５であるとする。この場合、検索部７は、図６に例示する複数の組の中から、「人工知能」という単語と、０．５以上の関連スコアを含む組を特定する。本例では、図６に示す１番目の組が特定される。検索部７は、特定した組に含まれるユーザ名「山田」を検索結果として得る。出力部８は、その検索結果を出力する。なお、検索部７は、「人工知能」という単語と、０．５以上の関連スコアを含む組が複数存在するならば、その組を全て特定し、その各組から得られるユーザ名を検索結果とする。従って、検索結果として得られるユーザ名は１つとは限らない。出力部８は、検索結果として得た複数のユーザ名を、関連スコアの高い順に並べて出力してもよい。 For example, suppose that the keyword reception unit 6 receives the word "artificial intelligence" as a search keyword, and that the threshold is 0.5. In this case, the search unit 7 identifies, from among the sets illustrated in FIG. 6, each set containing the word "artificial intelligence" and a relevance score of 0.5 or higher. In this example, the first set shown in FIG. 6 is identified. The search unit 7 obtains the user name "Yamada" contained in the identified set as the search result, and the output unit 8 outputs that search result. If there are multiple sets containing the word "artificial intelligence" and a relevance score of 0.5 or higher, the search unit 7 identifies all of them and takes the user names obtained from those sets as the search result. The search result is therefore not limited to a single user name. The output unit 8 may output multiple user names obtained as the search result in descending order of relevance score.
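
The word-to-user search described above amounts to a filter followed by a sort. The sketch below assumes the stored sets are held as a list of tuples; the list `triples`, its scores, and the function name `search_users` are hypothetical and are not taken from FIG. 6.

```python
# Hypothetical stored sets of (user name, word, relevance score).
# The values are illustrative only and do not reproduce FIG. 6.
triples = [
    ("Yamada", "artificial intelligence", 0.80),
    ("Yamada", "Company A", 0.10),
    ("Suzuki", "artificial intelligence", 0.20),
    ("Tanaka", "artificial intelligence", 0.60),
]


def search_users(triples, keyword, threshold=0.5):
    """Return user names whose score for `keyword` is at least `threshold`,
    ordered from the highest relevance score to the lowest."""
    hits = [
        (user, score)
        for user, word, score in triples
        if word == keyword and score >= threshold
    ]
    hits.sort(key=lambda hit: hit[1], reverse=True)
    return [user for user, _ in hits]


result = search_users(triples, "artificial intelligence")
```

With this illustrative data two users pass the 0.5 threshold, which shows the case where the search result contains more than one user name.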

このように、単語からユーザ名が検索できるので、検索者は、検索キーワードとして指定した単語が表わす分野、技術、プロジェクト等に強く関わったユーザのユーザ名を容易に知ることができる。 Since user names can be searched from words in this way, the searcher can easily know the user names of users who have been strongly involved in the field, technology, project, etc. indicated by the word specified as the search keyword.

次に、関連スコア算出システム１が、ユーザ名をキーワードとして受け付け、そのユーザ名に応じた単語を検索する処理の例について説明する。 Next, an example of processing in which the related score calculation system 1 accepts a user name as a keyword and searches for a word corresponding to the user name will be described.

まず、キーワード受付部６が、検索者からユーザ名を検索キーワードとして受け付ける。 First, the keyword reception unit 6 receives a user name as a search keyword from a searcher.

次に、検索部７が、記憶部５に記憶されている、ユーザ名と単語と関連スコアとの組の中から、検索キーワードに該当するユーザ名と、閾値（例えば、０．５）以上の関連スコアを含む組を特定し、その組に含まれている単語を検索結果として特定する。このとき、検索キーワードに該当するユーザ名と、閾値以上の関連スコアを含む組が複数存在するならば、検索部７は、その組を全て特定し、その各組から得られる単語を検索結果とする。従って、検索結果として得られる単語は１つとは限らない。出力部８は、検索結果として得た複数の単語を、関連スコアの高い順に並べて出力してもよい。 Next, the search unit 7 identifies, from among the sets of user name, word, and relevance score stored in the storage unit 5, each set that contains the user name matching the search keyword and a relevance score equal to or greater than a threshold (for example, 0.5), and identifies the word contained in that set as a search result. If there are multiple sets containing the user name matching the search keyword and a relevance score equal to or greater than the threshold, the search unit 7 identifies all of them and takes the words obtained from those sets as the search result. The search result is therefore not limited to a single word. The output unit 8 may output multiple words obtained as the search result in descending order of relevance score.

例えば、キーワード受付部６が検索キーワードとして、「山田」というユーザ名を受け付けたとする。また、上記の閾値が０．５であるとする。この場合、検索部７は、図６に例示する複数の組の中から、「山田」というユーザ名と、０．５以上の関連スコアを含む組を特定する。本例では、図６に示す１番目の組、３番目の組および４番目の組が特定される。検索部７は、特定した各組に含まれる単語を検索結果として得る。すなわち、検索部７は、「人工知能」、「機械学習」および「ディープラーニング」を検索結果として得る。出力部８は、その検索結果を出力する。 For example, it is assumed that the keyword reception unit 6 has received the user name "Yamada" as a search keyword. It is also assumed that the above threshold is 0.5. In this case, the search unit 7 identifies a set including the user name "Yamada" and a related score of 0.5 or higher from among the plurality of sets illustrated in FIG. In this example, the first, third and fourth sets shown in FIG. 6 are identified. The search unit 7 obtains words included in each identified pair as a search result. That is, the search unit 7 obtains "artificial intelligence", "machine learning" and "deep learning" as search results. The output unit 8 outputs the search result.
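
The user-to-word search is the mirror image of the word-to-user search: the filter is on the user name and the words are returned. In the sketch below the list `triples`, its scores, and the function name `search_words` are hypothetical; the scores are chosen only so that three words pass the 0.5 threshold, as in the example above.

```python
# Hypothetical stored sets of (user name, word, relevance score).
# The values are illustrative only and do not reproduce FIG. 6.
triples = [
    ("Yamada", "artificial intelligence", 0.80),
    ("Yamada", "Company A", 0.10),
    ("Yamada", "machine learning", 0.79),
    ("Yamada", "deep learning", 0.70),
    ("Suzuki", "Company A", 0.40),
]


def search_words(triples, user_name, threshold=0.5):
    """Return (word, score) pairs for `user_name` with score >= threshold,
    ordered from the highest relevance score to the lowest."""
    hits = [
        (word, score)
        for user, word, score in triples
        if user == user_name and score >= threshold
    ]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


result = search_words(triples, "Yamada")
```

Returning the score together with each word lets the output side decide whether to display the scores or only the ordered words.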

このように、ユーザ名から単語を検索できるので、検索者は、検索キーワードとして指定したユーザ名を有する人が精通している分野、技術等を容易に推定したり、その人が参加したことがあるプロジェクト等を容易に推定したりすることができる。 Since words can be searched for from a user name in this way, the searcher can easily infer the fields, technologies, and the like with which the person having the user name specified as the search keyword is familiar, and can easily infer the projects and the like in which that person has participated.

なお、上記の閾値“０．５”は例示であり、閾値は０．５でなくてもよい。また、閾値は、関連スコアの算出方法に応じて定めておけばよい。 Note that the above threshold "0.5" is an example, and the threshold does not have to be 0.5. Also, the threshold may be determined according to the calculation method of the related score.

実施形態２． Embodiment 2.
図７は、本発明の第２の実施形態の関連スコア算出システムの構成例を示すブロック図である。第１の実施形態の関連スコア算出システム（図１参照）や第１の実施形態の変形例（図５参照）に示す構成要素と同様の構成要素については、図１や図５に示す符号と同一の符号を付し、説明を省略する。 FIG. 7 is a block diagram showing a configuration example of a relevance score calculation system according to the second embodiment of the present invention. Components similar to those of the relevance score calculation system of the first embodiment (see FIG. 1) and of the modification of the first embodiment (see FIG. 5) are denoted by the same reference numerals as in FIGS. 1 and 5, and description thereof is omitted.

第２の実施形態では、関連スコア算出システム１は、収集部２と、単語抽出部３と、スコア算出部４と、記憶部５と、第１のテーブル生成部１１と、第２のテーブル生成部１２と、キーワード受付部６と、検索部１７と、出力部８とを備える。 In the second embodiment, the relevance score calculation system 1 includes a collection unit 2, a word extraction unit 3, a score calculation unit 4, a storage unit 5, a first table generation unit 11, a second table generation unit 12, a keyword reception unit 6, a search unit 17, and an output unit 8.

収集部２、単語抽出部３およびスコア算出部４は、第１の実施形態（図１参照）や、第１の実施形態の変形例（図５参照）で示したそれらの各要素と同様である。 The collection unit 2, the word extraction unit 3, and the score calculation unit 4 are the same as the corresponding elements shown in the first embodiment (see FIG. 1) and in the modification of the first embodiment (see FIG. 5).

また、キーワード受付部６および出力部８は、第１の実施形態の変形例（図５参照）で示したそれらの各要素と同様である。 Also, the keyword reception unit 6 and the output unit 8 are the same as those elements shown in the modified example of the first embodiment (see FIG. 5).

記憶部５は、第１の実施形態（図１参照）や、第１の実施形態の変形例（図５参照）における記憶部５と同様である。ただし、本実施形態では、記憶部５は、ユーザ名と単語と関連スコアとの組を複数組記憶するだけでなく、後述の第１のテーブル２１および第２のテーブル２２も記憶する。以下、スコア算出部４が記憶部５に記憶させる、ユーザ名と単語と関連スコアとの組の集合（例えば、図６に例示する複数の組）を関連スコア算出結果２０と記す。 The storage unit 5 is the same as the storage unit 5 in the first embodiment (see FIG. 1) and the modification of the first embodiment (see FIG. 5). However, in this embodiment, the storage unit 5 not only stores a plurality of pairs of user names, words, and related scores, but also stores a first table 21 and a second table 22, which will be described later. Hereinafter, a set of pairs of a user name, a word, and a related score (for example, a plurality of pairs illustrated in FIG. 6) stored in the storage unit 5 by the score calculator 4 will be referred to as a related score calculation result 20 .

また、本実施形態では、スコア算出部４が、前述の第３の算出方法で関連スコアを算出する場合を例にして説明する。ただし、スコア算出部４は、前述の第１の算出方法または第２の算出方法で関連スコアを算出してもよい。 In this embodiment, the case where the score calculation unit 4 calculates relevance scores by the third calculation method described above is described as an example. However, the score calculation unit 4 may calculate relevance scores by the first or second calculation method described above.

以下の説明では、スコア算出部４が、既に関連スコアを算出し、記憶部５に関連スコア算出結果２０を記憶させているものとして説明する。図８は、関連スコア算出結果２０（ユーザ名と単語と関連スコアとの組の集合）の例を示す模式図である。ここでは、図８に例示する関連スコア算出結果２０が記憶部５に記憶されている場合を例にして説明する。 In the following description, it is assumed that the score calculation unit 4 has already calculated the related score and the storage unit 5 has stored the related score calculation result 20 . FIG. 8 is a schematic diagram showing an example of a related score calculation result 20 (set of pairs of user names, words, and related scores). Here, a case where the related score calculation result 20 illustrated in FIG. 8 is stored in the storage unit 5 will be described as an example.

第１のテーブル生成部１１は、関連スコア算出結果２０に基づいて、第１のテーブル２１を生成し、記憶部５に記憶させる。第１のテーブル２１は、ユーザＩＤと、単語と、関連スコアとの関係を記述したテーブルである。より具体的には、第１のテーブル２１は、関連スコア算出結果２０に含まれているユーザ名を縦軸と横軸のうちの一方の軸に並べ、関連スコア算出結果２０に含まれている単語を他方の軸に並べ、ユーザ名と単語とが交差する欄に、そのユーザ名を有するユーザとその単語の関連スコアを記述したテーブルである。 The first table generation unit 11 generates the first table 21 based on the relevance score calculation result 20 and stores it in the storage unit 5. The first table 21 is a table describing the relationship among user IDs, words, and relevance scores. More specifically, the first table 21 is a table in which the user names included in the relevance score calculation result 20 are arranged along one of the vertical and horizontal axes, the words included in the relevance score calculation result 20 are arranged along the other axis, and each cell where a user name and a word intersect contains the relevance score between the user having that user name and that word.

本例では、第１のテーブル生成部１１が、第１のテーブルを生成する際に、ユーザ名を縦軸に並べ、単語を横軸に並べる場合を例にして説明するが、第１のテーブル生成部１１は、ユーザ名を横軸に並べ、単語を縦軸に並べてもよい。 In this example, the case where the first table generation unit 11 arranges user names along the vertical axis and words along the horizontal axis when generating the first table is described, but the first table generation unit 11 may instead arrange user names along the horizontal axis and words along the vertical axis.

また、第１のテーブル生成部１１は、ユーザ名を軸に沿って並べる際、同一ユーザのユーザ名を重複させずに並べる。例えば、図８に示す関連スコア算出結果２０において、１番目の組にもユーザ名「山田」が含まれ、２番目の組にもユーザ名「山田」が含まれている。このユーザ名「山田」は、同一ユーザのユーザ名である。従って、第１のテーブル生成部１１は、ユーザ名を軸に沿って並べる際、ユーザ名「山田」を１回並べればよい。 When arranging the user names along the axis, the first table generation unit 11 arranges the user names of the same user without duplication. For example, in the related score calculation result 20 shown in FIG. 8, the user name "Yamada" is included in the first group, and the user name "Yamada" is included in the second group. This user name "Yamada" is the user name of the same user. Therefore, when arranging the user names along the axis, the first table generation unit 11 only needs to arrange the user name "Yamada" once.

同様に、第１のテーブル生成部１１は、単語を軸に沿って並べる際、同一の単語を重複させずに並べる。例えば、図８に示す関連スコア算出結果２０において、複数の組で「人工知能」という単語が含まれている。しかし、第１のテーブル生成部１１は、単語を軸に沿って並べる際、「人工知能」という単語を一回並べればよい。 Similarly, when arranging words along the axis, the first table generation unit 11 arranges the same words without duplication. For example, in the related score calculation result 20 shown in FIG. 8, the word "artificial intelligence" is included in a plurality of sets. However, when arranging the words along the axis, the first table generation unit 11 only needs to arrange the word "artificial intelligence" once.

第１のテーブル生成部１１によって生成される第１のテーブルの例を、図９に示す。第１のテーブル生成部１１は、関連スコア算出結果２０に含まれているユーザ名（「山田」、「鈴木」、「田中」等）を、縦軸の方向に沿って並べる（図９参照）。また、第１のテーブル生成部１１は、関連スコア算出結果２０に含まれている単語（「人工知能」、「Ａ社」、「機械学習」、「ディープラーニング」等）を、横軸の方向に沿って並べる（図９参照）。 An example of the first table generated by the first table generator 11 is shown in FIG. The first table generation unit 11 arranges the user names (“Yamada”, “Suzuki”, “Tanaka”, etc.) included in the related score calculation result 20 along the direction of the vertical axis (see FIG. 9). . In addition, the first table generation unit 11 arranges the words ("artificial intelligence", "company A", "machine learning", "deep learning", etc.) included in the related score calculation result 20 in the direction of the horizontal axis. (See FIG. 9).

そして、第１のテーブル生成部１１は、ユーザ名と単語とが交差する欄に、そのユーザ名を有するユーザとその単語の関連スコアを記述する。例えば、図８に示す例で、ユーザ名「山田」と単語「人工知能」の関連スコアは、“０．８０”である。従って、第１のテーブル生成部１１は、第１のテーブル２１において、「山田」と「人工知能」とが交差する欄に“０．８０”を記述する（図９参照）。また、例えば、図８に示す例で、ユーザ名「山田」と単語「Ａ社」の関連スコアは、“０．１０”である。従って、第１のテーブル生成部１１は、第１のテーブル２１において、「山田」と「Ａ社」とが交差する欄に“０．１０”を記述する。第１のテーブル生成部１１は、ユーザ名と単語とが交差する欄毎に、同様に、関連スコアを記述する。 Then, the first table generation unit 11 writes the user having the user name and the related score of the word in the column where the user name and the word intersect. For example, in the example shown in FIG. 8, the association score between the user name "Yamada" and the word "artificial intelligence" is "0.80". Therefore, the first table generator 11 writes "0.80" in the column where "Yamada" and "artificial intelligence" intersect in the first table 21 (see FIG. 9). Also, for example, in the example shown in FIG. 8, the association score between the user name "Yamada" and the word "A company" is "0.10". Therefore, the first table generation unit 11 writes “0.10” in the column where “Yamada” and “A company” intersect in the first table 21 . The first table generation unit 11 similarly writes a related score for each column where a user name and a word intersect.

第１のテーブル生成部１１は、上記のようにして生成した第１のテーブル２１を、記憶部５に記憶させる。 The first table generation unit 11 causes the storage unit 5 to store the first table 21 generated as described above.
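
The construction of the first table can be sketched as a pivot from the list of sets into a user-by-word grid. The function name `build_first_table` and the use of `None` for a combination that has no stored score are assumptions of this sketch; the four scores are the ones quoted in the text for "Yamada" and "Suzuki".

```python
# Sets of (user name, word, relevance score); the scores are those quoted
# in the description for the first two users and the first two words.
triples = [
    ("Yamada", "artificial intelligence", 0.80),
    ("Yamada", "Company A", 0.10),
    ("Suzuki", "artificial intelligence", 0.20),
    ("Suzuki", "Company A", 0.40),
]


def build_first_table(triples):
    """Return (users, words, rows): users label the vertical axis, words the
    horizontal axis, and rows[i][j] is the score of users[i] for words[j]."""
    users, words = [], []
    for user, word, _ in triples:
        if user not in users:   # each user name appears once on its axis
            users.append(user)
        if word not in words:   # each word appears once on its axis
            words.append(word)
    cell = {(user, word): score for user, word, score in triples}
    # Assumption: a combination with no stored score is left as None.
    rows = [[cell.get((user, word)) for word in words] for user in users]
    return users, words, rows


users, words, rows = build_first_table(triples)
```

Reading one column of `rows` from top to bottom gives the scores of a single word across all users, in user order.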

第２のテーブル生成部１２は、第１のテーブル２１に基づいて、第２のテーブル２２を生成し、記憶部５に記憶させる。第２のテーブルは、第１のテーブルに基づいて算出した単語同士の関連の強さを表す関連度を記述したテーブルである。より具体的には、第２のテーブル２２は、縦軸と横軸の両方に単語を並べ、単語同士が交差する欄に、第１のテーブル２１に基づいて算出したその単語同士の関連の強さを表す関連度を記述したテーブルである。既に説明したように、本発明では、ユーザと単語との関連の強さを表す指標値を「関連スコア」と称し、単語同士の関連の強さを表す指標値を「関連度」と称することによって、２種類の指標値を区別する。 The second table generation unit 12 generates the second table 22 based on the first table 21 and stores it in the storage unit 5. The second table is a table describing degrees of association, each representing the strength of the relation between two words and calculated based on the first table. More specifically, the second table 22 is a table in which words are arranged along both the vertical and horizontal axes, and each cell where two words intersect contains the degree of association between those words calculated based on the first table 21. As already explained, in the present invention the two kinds of index values are distinguished by calling the index value representing the strength of the relation between a user and a word the "relevance score" and the index value representing the strength of the relation between words the "degree of association".

図１０は、第２のテーブルの例を示す模式図である。本例では、第２のテーブル生成部１２が、縦軸、横軸それぞれに、第１のテーブル２１と同じ順番で単語を並べる場合を例にして説明する。例えば、図９に例示する第１のテーブルでは、単語が、「人工知能」、「Ａ社」、「機械学習」、「ディープラーニング」、・・・の順に並べられている。第２のテーブル生成部１２は、第２のテーブルの縦軸、横軸それぞれにおいても、その順番と同じ順番で単語を並べる（図１０参照）。 FIG. 10 is a schematic diagram showing an example of the second table. In this example, a case where the second table generation unit 12 arranges words in the same order as in the first table 21 on each of the vertical axis and the horizontal axis will be described. For example, in the first table illustrated in FIG. 9, the words are arranged in order of "artificial intelligence", "company A", "machine learning", "deep learning", and so on. The second table generation unit 12 arranges the words in the same order as the order on the vertical axis and the horizontal axis of the second table (see FIG. 10).

そして、第２のテーブル生成部１２は、１つの単語と１つの単語との組合せ毎に、その単語同士の関連の強さを表す関連度を算出し、その単語同士が交差する欄に、その関連度を記述する。なお、第２のテーブル生成部１２は、同一の単語同士に関しても、関連度を算出する。例えば、第２のテーブル生成部１２は、「人工知能」と「人工知能」の関連度も算出する。 The second table generation unit 12 then calculates, for each combination of one word and another word, the degree of association representing the strength of the relation between those words, and writes that degree of association in the cell where the two words intersect. The second table generation unit 12 also calculates the degree of association between a word and itself. For example, the second table generation unit 12 also calculates the degree of association between "artificial intelligence" and "artificial intelligence".

ここで、単語同士の関連度について説明する。まず、図９に例示する第１のテーブ２１において、「人工知能」と「機械学習」という２つの単語に着目した場合について説明する。 Here, the degree of association between words will be described. First, in the first table 21 illustrated in FIG. 9, a case will be described where attention is paid to the two words "artificial intelligence" and "machine learning".

「人工知能」と個々のユーザの関連スコアを、第１のテーブル２１におけるユーザ名順に並べると、以下のようになる。 When the association scores of "artificial intelligence" and individual users are arranged in the order of the user names in the first table 21, they are as follows.

０．８０，０．２０，０．６０，０．４３，・・・ 0.80, 0.20, 0.60, 0.43,...

また、「機械学習」と個々のユーザの関連スコアを、第１のテーブル２１におけるユーザ名順に並べると、以下のようになる。 In addition, the relation scores of “machine learning” and individual users are arranged in the order of user names in the first table 21 as follows.

０．７９，０．２２，０．５８，０．４５，・・・ 0.79, 0.22, 0.58, 0.45, ...

上記の関連スコアの並びにおける関連スコアの変化の傾向は似ていると言える。この場合、「人工知能」と「機械学習」の関連度は高いことになる。 It can be said that the trends of changes in the related scores in the above related score sequences are similar. In this case, the degree of relevance between “artificial intelligence” and “machine learning” is high.

次に、「人工知能」と「Ａ社」という２つの単語に着目した場合について説明する。 Next, a case of focusing on two words, "artificial intelligence" and "A company", will be described.

As described above, when the relevance scores between "artificial intelligence" and the individual users are arranged in the order of the user names in the first table 21, the result is as follows.

0.80, 0.20, 0.60, 0.43, ...

また、「Ａ社」と個々のユーザの関連スコアを、第１のテーブル２１におけるユーザ名順に並べると、以下のようになる。 Also, when the association scores of "company A" and individual users are arranged in the order of the user names in the first table 21, the results are as follows.

０．１０，０．４０，０．３５，０．０５，・・・ 0.10, 0.40, 0.35, 0.05, ...

The two sequences of relevance scores above do not rise and fall in a similar way from user to user. In this case, the degree of association between "artificial intelligence" and "company A" is low.

The second table generation unit 12 may calculate the correlation coefficient between two words as their degree of association. Here, the case of calculating the correlation coefficient between a word w_1 and a word w_2 as the degree of association between w_1 and w_2 is described. More specifically, the correlation coefficient between w_1 and w_2 is the correlation coefficient between the sequence of relevance scores of w_1 with the individual users and the sequence of relevance scores of w_2 with the individual users.

The sequence of relevance scores (numerical values) obtained by arranging the relevance scores between one word and the individual users in the order of the user names in the first table 21 is referred to as the series of that word. For example, in the example shown in FIG. 9, the series of "artificial intelligence" is as follows.

0.80, 0.20, 0.60, 0.43, ...

Let the series of word w_1 be (x_1, x_2, ..., x_n), and let this series be denoted x. In the example shown in FIG. 9, the series of a word is a column of numerical values arranged vertically, but here it is written horizontally as (x_1, x_2, ..., x_n) for convenience. The same applies to word w_2 described next.

Likewise, let the series of word w_2 be (y_1, y_2, ..., y_n), and let this series be denoted y.

As above, n is the number of relevance scores belonging to each series.

The second table generation unit 12 may calculate the correlation coefficient between x and y as the degree of association between word w_1 and word w_2. Let r denote the correlation coefficient between x and y. The second table generation unit 12 calculates r by equation (4) below.

$$ r = \frac{s_{xy}}{s_x s_y} = \frac{\sum_{i=1}^{n}(x_i-\bar{x})(y_i-\bar{y})}{\sqrt{\sum_{i=1}^{n}(x_i-\bar{x})^2}\,\sqrt{\sum_{i=1}^{n}(y_i-\bar{y})^2}} \tag{4} $$

In equation (4), s_xy is the covariance of x and y, s_x is the standard deviation of x, and s_y is the standard deviation of y. x_i is the i-th relevance score in x, and y_i is the i-th relevance score in y. \bar{x} is the mean of x, and \bar{y} is the mean of y.

For each combination of one word and another word, the second table generation unit 12 calculates the correlation coefficient by equation (4) and writes it in the second table 22 as the degree of association. When the two words of a combination are different, their degree of association is written in two places in the second table 22. For example, in the example shown in FIG. 10, the degree of association between "artificial intelligence" and "company A" is written in both the first row, second column and the second row, first column.
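
The calculation of equation (4) and the filling of the second table can be sketched as follows. This is a minimal illustration, not the implementation of the embodiment: the function names `pearson` and `build_second_table` and the dictionary layout are assumptions, and each series is truncated to the four values quoted in the text, so the results will not match the values in FIG. 10.

```python
from math import sqrt

def pearson(x, y):
    """Correlation coefficient r of two equal-length series (equation (4))."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Numerator: sum of products of deviations (covariance up to a constant factor).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Denominator: product of the square roots of the summed squared deviations.
    dev_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    dev_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    # Assumption: neither series is constant, otherwise this divides by zero.
    return cov / (dev_x * dev_y)

# Series truncated to the four values quoted in the text (one value per user).
series = {
    "artificial intelligence": [0.80, 0.20, 0.60, 0.43],
    "company A": [0.10, 0.40, 0.35, 0.05],
    "machine learning": [0.79, 0.22, 0.58, 0.45],
}

def build_second_table(series):
    """Word-by-word table of degrees of association, indexed as table[w1][w2]."""
    words = list(series)
    return {w1: {w2: pearson(series[w1], series[w2]) for w2 in words} for w1 in words}

second_table = build_second_table(series)
for w1, row in second_table.items():
    print(w1, {w2: round(r, 2) for w2, r in row.items()})
```

Because `pearson(x, y)` equals `pearson(y, x)`, the table is symmetric, which corresponds to each degree of association being written in two places. The diagonal entries, where a word is paired with itself, come out as 1.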

第２のテーブル生成部１２は、上記のようにして生成した第２のテーブル２２を、記憶部５に記憶させる。 The second table generation unit 12 causes the storage unit 5 to store the second table 22 generated as described above.

第２の実施形態の関連スコア算出システム１は、単語を検索キーワードとして受け付け、その単語に応じたユーザのユーザ名を検索する。あるいは、第２の実施形態の関連スコア算出システム１は、ユーザ名を検索キーワードとして受け付け、そのユーザ名に応じた単語を検索する。また、第２の実施形態の関連スコア算出システム１は、上記の２種類の検索をそれぞれ実行可能であってもよい。 The related score calculation system 1 of the second embodiment accepts a word as a search keyword, and searches for the username of the user according to the word. Alternatively, the related score calculation system 1 of the second embodiment accepts a user name as a search keyword, and searches for words corresponding to the user name. Also, the related score calculation system 1 of the second embodiment may be capable of executing each of the above two types of searches.

検索者に指定された単語に応じたユーザのユーザ名を検索する場合、キーワード受付部６が、検索者から単語を検索キーワードとして受け付ける。そして、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、検索キーワードに該当する単語に応じたユーザ名を検索する。 When searching for a user name of a user corresponding to a word specified by a searcher, the keyword reception unit 6 receives the word from the searcher as a search keyword. Based on the first table 21 and the second table 22, the search unit 17 searches for user names corresponding to words corresponding to the search keyword.

また、検索者に指定されたユーザ名に応じた単語を検索する場合、キーワード受付部６が、検索者からユーザ名を検索キーワードとして受け付ける。そして、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、検索キーワードに該当するユーザ名に応じた単語を検索する。 When searching for a word corresponding to a user name specified by a searcher, the keyword reception unit 6 receives the user name from the searcher as a search keyword. Based on the first table 21 and the second table 22, the search unit 17 searches for a word corresponding to the user name corresponding to the search keyword.

In the second embodiment, the collection unit 2, the keyword reception unit 6, and the output unit 8 are implemented by the CPU of a computer that operates according to the relevance score calculation program and by the communication interface of that computer. For example, the CPU may read the relevance score calculation program from a program recording medium such as a program storage device of the computer and, according to the program, operate as the collection unit 2, the keyword reception unit 6, and the output unit 8 using the communication interface. The word extraction unit 3, the score calculation unit 4, the first table generation unit 11, the second table generation unit 12, and the search unit 17 are likewise implemented, for example, by the CPU of the same computer operating according to the relevance score calculation program. That is, the CPU that has read the program may operate as the word extraction unit 3, the score calculation unit 4, the first table generation unit 11, the second table generation unit 12, and the search unit 17 according to the program. Alternatively, the collection unit 2, the keyword reception unit 6, the output unit 8, the word extraction unit 3, the score calculation unit 4, the first table generation unit 11, the second table generation unit 12, and the search unit 17 may each be implemented by separate hardware.

図１１は、第２の実施形態の処理経過の例を示すフローチャートである。第１の実施形態で説明した動作と同様の動作や、第２の実施形態で既に説明した動作については、詳細な説明を省略する。 FIG. 11 is a flowchart showing an example of the progress of processing according to the second embodiment. Detailed descriptions of operations similar to those described in the first embodiment and operations already described in the second embodiment will be omitted.

ステップＳ１～ステップＳ３は、第１の実施形態におけるステップＳ１～Ｓ３（図４参照）と同様であり、説明を省略する。 Steps S1 to S3 are the same as steps S1 to S3 (see FIG. 4) in the first embodiment, and description thereof will be omitted.

ステップＳ３の次に、第１のテーブル生成部１１は、関連スコア算出結果２０に基づいて、第１のテーブル２１（図９参照）を生成し、第１のテーブル２１を記憶部５に記憶させる（ステップＳ４）。第１のテーブル２１を生成する動作については、既に説明したので、ここでは説明を省略する。 After step S3, the first table generation unit 11 generates the first table 21 (see FIG. 9) based on the related score calculation result 20, and stores the first table 21 in the storage unit 5. (Step S4). Since the operation of generating the first table 21 has already been described, the description is omitted here.

Next, the second table generation unit 12 generates the second table 22 (see FIG. 10) based on the first table 21 and stores the second table 22 in the storage unit 5 (step S5). The operation of generating the second table 22 has also already been described, so its description is omitted here.

第１のテーブル２１および第２のテーブル２２が生成された後に、キーワード受付部６は、検索者から、検索キーワードを受け付ける（ステップＳ６）。キーワード受付部６は、検索キーワードとして、単語を受け付けてもよい。また、キーワード受付部６は、検索キーワードとして、ユーザ名を受け付けてもよい。 After the first table 21 and the second table 22 are generated, the keyword reception unit 6 receives a search keyword from the searcher (step S6). The keyword accepting unit 6 may accept words as search keywords. Also, the keyword accepting unit 6 may accept a user name as a search keyword.

次に、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、検索キーワードに応じた検索結果を求める（ステップＳ７）。検索キーワードが単語である場合、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、その単語に応じたユーザ名を検索する。また、検索キーワードがユーザ名である場合、検索部１７は、第１のテーブル２１および第２のテーブル２２に基づいて、そのユーザ名に応じた単語を検索する。ステップＳ７の動作の詳細については、後述する。 Next, the search unit 17 obtains search results corresponding to the search keyword based on the first table 21 and the second table 22 (step S7). When the search keyword is a word, the search unit 17 searches for user names corresponding to the word based on the first table 21 and the second table 22 . Moreover, when the search keyword is a user name, the search unit 17 searches for words corresponding to the user name based on the first table 21 and the second table 22 . Details of the operation of step S7 will be described later.

ステップＳ７の後、出力部８が検索結果を出力する（ステップＳ８）。出力部８が検索結果を出力する態様は、第１の実施形態の変形例と同様である。 After step S7, the output unit 8 outputs the search result (step S8). The manner in which the output unit 8 outputs the search results is the same as in the modified example of the first embodiment.

Next, the operation of step S7 is described. First, the case where the keyword reception unit 6 receives a word as the search keyword in step S6 and the search unit 17 searches for user names corresponding to that word is described. Hereinafter, the word corresponding to the search keyword is referred to as the keyword word.

The search unit 17 sequentially selects the individual user names and calculates a search score between the selected user name (hereinafter referred to as the selected user name) and the keyword word. Like the relevance score, the search score is an index value indicating the strength of the relationship between the user having the selected user name and the word. However, it is calculated from the relevance scores already described, by a method different from that used for the relevance score. In the following description, the term search score is therefore used to distinguish it from the relevance score.

Assume that the search unit 17 has selected one selected user name. The operation by which the search unit 17 calculates the search score between the selected user name and the keyword word is described below. FIG. 12 is a flowchart showing an example of the process of calculating the search score between the selected user name and the keyword word.

まず、検索部１７は、選択ユーザ名の検索スコアの値を０に初期化する（ステップＳ１１）。 First, the search unit 17 initializes the value of the search score of the selected user name to 0 (step S11).

次に、検索部１７は、第１のテーブルの軸（第２テーブルの軸でもよい。）に並べられている単語の中から、未だステップＳ１２で選択されていない単語を１つ選択する（ステップＳ１２）。ステップＳ１２で選択した単語を、以下、選択単語と記す。なお、選択単語がキーワード単語と同一である場合もあり得る。 Next, the search unit 17 selects one word that has not yet been selected in step S12 from among the words arranged along the axis of the first table (or the axis of the second table) (step S12). The words selected in step S12 are hereinafter referred to as selected words. Note that the selected word may be the same as the keyword word.

次に、検索部１７は、キーワード単語と選択単語の関連度と、選択ユーザ名と選択単語の関連スコアとの積を算出する（ステップＳ１３）。検索部１７は、ステップＳ１３で用いる関連度を第２のテーブルから読み込み、ステップＳ１３で用いる関連スコアを第１のテーブルから読み込めばよい。 Next, the search unit 17 calculates the product of the degree of association between the keyword word and the selected word and the association score between the selected user name and the selected word (step S13). The search unit 17 may read the degree of association used in step S13 from the second table, and read the association score used in step S13 from the first table.

次に、検索部１７は、ステップＳ１３で算出した積を、検索スコアに加算する（ステップＳ１４）。 Next, the search unit 17 adds the product calculated in step S13 to the search score (step S14).

次に、検索部１７は、ステップＳ１２で選択されていない単語があるか否かを判定する（ステップＳ１５）。未選択の単語がある場合（ステップＳ１５のＹｅｓ）、検索部１７は、ステップＳ１２以降の処理を繰り返す。 Next, the search unit 17 determines whether or not there is a word that has not been selected in step S12 (step S15). If there is an unselected word (Yes in step S15), the search unit 17 repeats the processes after step S12.

未選択の単語がない場合（ステップＳ１５のＮｏ）、検索部１７は、その時点における検索スコアの値を、選択ユーザ名とキーワード単語の検索スコアとして確定し、処理を終了する。 If there is no unselected word (No in step S15), the search unit 17 determines the value of the search score at that time as the search score of the selected user name and keyword word, and terminates the process.

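The search score calculation process described above can be expressed by equation (5) below.

$$ \text{search score} = \sum_{i} \bigl(\text{degree of association between the keyword word and word } i\bigr) \times \bigl(\text{relevance score between the selected user name and word } i\bigr) \tag{5} $$

In equation (5), word i means the i-th selected word.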

For example, assume that the first table 21 shown in FIG. 9 and the second table 22 shown in FIG. 10 are stored in the storage unit 5. Assume also that the keyword word (the word corresponding to the search keyword) is "artificial intelligence" and the selected user name is "Yamada". In this case, the search unit 17 calculates the search score between the selected user name "Yamada" and the keyword word "artificial intelligence" by the following formula (see FIGS. 9 and 10).

検索スコア＝1.00×0.80＋0.07×0.10＋0.87×0.79＋0.79×0.82＋・・・ Search score = 1.00 x 0.80 + 0.07 x 0.10 + 0.87 x 0.79 + 0.79 x 0.82 + ...

また、選択ユーザ名が「鈴木」であるとする。この場合、検索部１７は、以下の式によって、選択ユーザ名「鈴木」とキーワード単語「人工知能」の検索スコアを算出する（図９、図１０を参照）。 It is also assumed that the selected user name is "Suzuki". In this case, the search unit 17 calculates the search score for the selected user name "Suzuki" and the keyword word "artificial intelligence" using the following formula (see FIGS. 9 and 10).

検索スコア＝1.00×0.20＋0.07×0.40＋0.87×0.22＋0.79×0.18＋・・・ Search score = 1.00 x 0.20 + 0.07 x 0.40 + 0.87 x 0.22 + 0.79 x 0.18 + ...
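
The loop of steps S11 to S15 can be sketched as follows. The function name `search_score` and the nested-dictionary layout of the two tables are assumptions made for illustration. Only the four terms quoted in the two formulas above are included, so the results are partial sums rather than the full search scores.

```python
def search_score(keyword_word, user, first_table, second_table):
    """Sum, over every word, of (degree of association between the keyword word
    and the word) x (relevance score between the user and the word)."""
    score = 0.0                                    # step S11: initialize to 0
    for word in first_table[user]:                 # steps S12 and S15: visit each word once
        product = second_table[keyword_word][word] * first_table[user][word]  # step S13
        score += product                           # step S14
    return score

WORDS = ["artificial intelligence", "company A", "machine learning", "deep learning"]

# Relevance scores (first table), limited to the four terms quoted in the text.
first_table = {
    "Yamada": dict(zip(WORDS, [0.80, 0.10, 0.79, 0.82])),
    "Suzuki": dict(zip(WORDS, [0.20, 0.40, 0.22, 0.18])),
}
# Row of the second table for the keyword word, as quoted in the text.
second_table = {
    "artificial intelligence": dict(zip(WORDS, [1.00, 0.07, 0.87, 0.79])),
}

yamada = search_score("artificial intelligence", "Yamada", first_table, second_table)
suzuki = search_score("artificial intelligence", "Suzuki", first_table, second_table)
print(round(yamada, 4), round(suzuki, 4))
```

With these four terms the partial sums are 2.1421 for "Yamada" and 0.5616 for "Suzuki", so "Yamada" ranks well above "Suzuki" for the keyword word "artificial intelligence".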

検索部１７は、第１のテーブル２１に記述されているユーザ名毎に、上記の処理によって検索スコアを得る。そして、検索部１７は、検索スコアが閾値以上になっているユーザ名を検索結果として得る。従って、検索結果として得られるユーザ名は、複数個となり得る。出力部８は、検索結果として得られたユーザ名を出力する。出力部８は、検索結果として得た複数のユーザ名を、検索スコアの高い順に並べて出力してもよい。 The search unit 17 obtains a search score for each user name described in the first table 21 through the above process. Then, the search unit 17 obtains user names whose search scores are equal to or higher than the threshold as search results. Therefore, a plurality of user names can be obtained as search results. The output unit 8 outputs the user name obtained as the search result. The output unit 8 may output a plurality of user names obtained as search results, arranging them in descending order of search score.
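
The selection and ordering of search results might look like the following sketch. The function name `select_results`, the threshold value, the user "Sato", and the scores are all hypothetical; the text does not state how the threshold is chosen.

```python
def select_results(search_scores, threshold):
    """Keep names whose search score is at or above the threshold,
    ordered from the highest search score to the lowest."""
    hits = [(name, s) for name, s in search_scores.items() if s >= threshold]
    return [name for name, s in sorted(hits, key=lambda item: item[1], reverse=True)]

# Hypothetical search scores and threshold, for illustration only.
scores = {"Yamada": 2.14, "Suzuki": 0.56, "Sato": 1.30}
print(select_results(scores, threshold=1.0))  # ['Yamada', 'Sato']
```

The same helper applies unchanged when the keys are words rather than user names.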

この検索方法では、単語同士の関連度を示す第２のテーブルも用いている。従って、検索者が検索キーワードとして指定した単語との関連度が高い別の単語との関連が強いユーザのユーザ名も検索結果として得ることができる。 This search method also uses a second table that indicates the degree of association between words. Therefore, it is possible to obtain the user name of a user who is strongly related to another word that is highly related to the word specified as the search keyword by the searcher as a search result.

For example, suppose that in the company to which the users belong, "Red Rockets" and "Green Rockets" are the names of important products and that the two products are closely related. In this case, with the above search method, a search using "Red Rockets" as the search keyword returns not only the user names of users strongly related to "Red Rockets" but also the user names of users strongly related to "Green Rockets", which is itself related to "Red Rockets". User names can therefore be searched broadly on the basis of the word that the searcher specified as the search keyword.
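
The broadening effect can be illustrated with a small sketch. All numbers, the users "Tanaka" and "Sato", and the threshold are hypothetical, since the text gives no values for these products.

```python
# Hypothetical values: the text gives no numbers for these products or users.
relevance = {  # first table: user name -> word -> relevance score
    "Tanaka": {"Red Rockets": 0.00, "Green Rockets": 0.90},
    "Sato": {"Red Rockets": 0.70, "Green Rockets": 0.10},
}
association = {  # row of the second table for the keyword word
    "Red Rockets": {"Red Rockets": 1.00, "Green Rockets": 0.90},
}
THRESHOLD = 0.5

keyword = "Red Rockets"
# Search using the first table alone: only users directly related to the keyword.
direct = [u for u, row in relevance.items() if row[keyword] >= THRESHOLD]
# Search using both tables: relevance scores weighted by degrees of association.
expanded = [
    u for u, row in relevance.items()
    if sum(association[keyword][w] * row[w] for w in row) >= THRESHOLD
]
print(direct)    # ['Sato']
print(expanded)  # ['Tanaka', 'Sato']
```

"Tanaka" has no direct relation to "Red Rockets" but is found through "Green Rockets", because the product 0.90 x 0.90 = 0.81 alone exceeds the threshold.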

In addition, in this embodiment, the second table is generated using the words contained in the operation logs collected from the PCs 91 installed in the company. Words that are used mostly within that company, such as "Red Rockets" and "Green Rockets" in the example above, can therefore be included in the second table. If the first table and the second table were created manually, this would not only take an enormous amount of effort, but words unique to the organization would also be easily left out of both tables. With this embodiment, the first table and the second table can be created easily without omitting words unique to the organization and, as described above, user names can be searched broadly on the basis of the word specified as the search keyword.

Next, the case where the keyword reception unit 6 receives a user name as the search keyword in step S6 and the search unit 17 searches for words corresponding to that user name is described. Hereinafter, the user name corresponding to the search keyword is referred to as the keyword user name.

The search unit 17 sequentially selects the individual words and calculates a search score between the selected word and the keyword user name. The word selected here is referred to as the score calculation target word. In addition, after one score calculation target word has been selected, words are again selected one by one in a separate loop, as described below (see step S22 in FIG. 13). A word selected in step S22 is referred to as a selected word.

検索部１７が、１つのスコア算出対象単語を選択しているとする。以下に、検索部１７がスコア算出対象単語とキーワードユーザ名の検索スコアを算出する動作について説明する。図１３は、スコア算出対象単語とキーワードユーザ名の検索スコアを算出する処理の例を示すフローチャートである。 Assume that the search unit 17 has selected one score calculation target word. The operation of the search unit 17 for calculating the search score of the score calculation target word and the keyword user name will be described below. FIG. 13 is a flow chart showing an example of processing for calculating a search score for a score calculation target word and a keyword user name.

まず、検索部１７は、スコア算出対象単語の検索スコアの値を０に初期化する（ステップＳ２１）。 First, the search unit 17 initializes the value of the search score of the score calculation target word to 0 (step S21).

次に、検索部１７は、第１のテーブルの軸（第２テーブルの軸でもよい。）に並べられている単語の中から、未だステップＳ２２で選択されていない単語を１つ選択する（ステップＳ２２）。既に述べたように、ステップＳ２２で選択した単語を、選択単語と記す。なお、選択単語がスコア算出対象単語と同一である場合もあり得る。 Next, the search unit 17 selects one word that has not been selected in step S22 from among the words arranged along the axis of the first table (or the axis of the second table) (step S22). As already mentioned, the words selected in step S22 are referred to as selected words. Note that the selected word may be the same as the score calculation target word.

次に、検索部１７は、キーワードユーザ名と選択単語の関連スコアと、スコア算出対象単語と選択単語の関連度との積を算出する（ステップＳ２３）。検索部１７は、ステップＳ２３で用いる関連スコアを第１のテーブルから読み込み、ステップＳ２３で用いる関連度を第２のテーブルから読み込めばよい。 Next, the search unit 17 calculates the product of the relevance score between the keyword user name and the selected word and the degree of relevance between the score calculation target word and the selected word (step S23). The search unit 17 may read the association score used in step S23 from the first table, and read the association degree used in step S23 from the second table.

次に、検索部１７は、ステップＳ２３で算出した積を、検索スコアに加算する（ステップＳ２４）。 Next, the search unit 17 adds the product calculated in step S23 to the search score (step S24).

次に、検索部１７は、ステップＳ２２で選択されていない単語があるか否かを判定する（ステップＳ２５）。未選択の単語がある場合（ステップＳ２５のＹｅｓ）、検索部１７は、ステップＳ２２以降の処理を繰り返す。 Next, the search unit 17 determines whether or not there is a word that has not been selected in step S22 (step S25). If there is an unselected word (Yes in step S25), the search unit 17 repeats the processes after step S22.

未選択の単語がない場合（ステップＳ２５のＮｏ）、検索部１７は、その時点における検索スコアの値を、スコア算出対象単語とキーワードユーザ名の検索スコアとして確定し、処理を終了する。 If there is no unselected word (No in step S25), the search unit 17 determines the value of the search score at that time as the search score of the score calculation target word and the keyword user name, and ends the process.

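The search score calculation process described above can be expressed by equation (6) below.

$$ \text{search score} = \sum_{i} \bigl(\text{relevance score between the keyword user name and word } i\bigr) \times \bigl(\text{degree of association between the score calculation target word and word } i\bigr) \tag{6} $$

In equation (6), word i means the i-th selected word.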

For example, assume that the first table 21 shown in FIG. 9 and the second table 22 shown in FIG. 10 are stored in the storage unit 5. Assume also that the keyword user name (the user name corresponding to the search keyword) is "Yamada" and the score calculation target word is "artificial intelligence". In this case, the search unit 17 calculates the search score between the score calculation target word "artificial intelligence" and the keyword user name "Yamada" by the following formula (see FIGS. 9 and 10).

Search score = 0.80 × 1.00 + 0.10 × 0.07 + 0.79 × 0.87 + 0.82 × 0.79 + ...

Next, assume that the score calculation target word is "company A". In this case, the search unit 17 calculates the search score between the score calculation target word "company A" and the keyword user name "Yamada" by the following formula (see FIGS. 9 and 10).

Search score = 0.80 × 0.07 + 0.10 × 1.00 + 0.79 × 0.09 + 0.82 × 0.11 + ...
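
Steps S21 to S25 can be sketched in the same style. The function name `word_search_score` and the dictionary layout are assumptions. The relevance score between "Yamada" and "artificial intelligence" is taken as 0.80, the value given in the series quoted earlier for the first table, and only the four quoted terms are included, so the results are partial sums.

```python
def word_search_score(target_word, keyword_user, first_table, second_table):
    """Sum, over every word, of (relevance score between the keyword user and the
    word) x (degree of association between the target word and the word)."""
    score = 0.0                                                  # step S21
    for word, relevance in first_table[keyword_user].items():    # steps S22 and S25
        score += relevance * second_table[target_word][word]     # steps S23 and S24
    return score

WORDS = ["artificial intelligence", "company A", "machine learning", "deep learning"]

# Relevance scores of "Yamada" (first table), limited to the four quoted terms.
first_table = {"Yamada": dict(zip(WORDS, [0.80, 0.10, 0.79, 0.82]))}
# Rows of the second table for the two score calculation target words in the text.
second_table = {
    "artificial intelligence": dict(zip(WORDS, [1.00, 0.07, 0.87, 0.79])),
    "company A": dict(zip(WORDS, [0.07, 1.00, 0.09, 0.11])),
}

for target in second_table:
    print(target, round(word_search_score(target, "Yamada", first_table, second_table), 4))
```

With these four terms, "artificial intelligence" scores 2.1421 and "company A" scores 0.3173 for "Yamada". Viewed as linear algebra, computing this for every target word at once is a product of the second table with the user's row of the first table.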

検索部１７は、順次選択するスコア算出対象単語毎に、上記の処理によって検索スコアを得る。そして、そして、検索部１７は、検索スコアが閾値以上になっている単語を検索結果として得る。従って、検索結果として得られる単語は、複数個となり得る。出力部８は、検索結果として得られた単語を出力する。出力部８は、検索結果として得た複数の単語を、検索スコアの高い順に並べて出力してもよい。 The search unit 17 obtains a search score by the above-described processing for each of the score calculation target words that are sequentially selected. Then, the search unit 17 obtains words whose search scores are equal to or higher than the threshold as search results. Therefore, a plurality of words can be obtained as search results. The output unit 8 outputs words obtained as search results. The output unit 8 may output a plurality of words obtained as search results, arranging them in descending order of search scores.

The search method described above, which searches for words from a user name, also uses the second table indicating the degree of association between words. Therefore, not only words strongly related to the user indicated by the keyword user name specified by the searcher, but also other words strongly related to those words, can be obtained as search results.

For example, as in the earlier example, suppose that in the company to which the users belong, "Red Rockets" and "Green Rockets" are the names of important products and that the two products are closely related. Suppose also that the user "Yamada" is strongly related to the word "Red Rockets". In this case, when the searcher specifies "Yamada" as the keyword user name, not only the word "Red Rockets", which is strongly related to "Yamada", but also "Green Rockets", which is strongly related to the word "Red Rockets", can be obtained as search results. Because "Red Rockets" and "Green Rockets" are strongly related, "Yamada", who is strongly related to "Red Rockets", is considered to be strongly related to "Green Rockets" as well. With the above method, words considered to be strongly related to the user indicated by the user name specified as the search keyword can be searched broadly.

また、組織独自で用いる単語も漏らさずに、第１のテーブルや第２のテーブルを容易に作成することができるという点については、既に説明した通りである。 Moreover, as already explained, the first table and the second table can be easily created without omitting the words used uniquely to the organization.

As described above, in this embodiment, when the keyword reception unit 6 receives a word as the search keyword and the search unit 17 searches for user names corresponding to that word, user names can be searched broadly. Likewise, when the keyword reception unit 6 receives a user name as the search keyword and the search unit 17 searches for words corresponding to that user name, words can be searched broadly. In addition, the first table and the second table can be created easily without omitting words unique to the organization.

Embodiment 3.
FIG. 14 is a block diagram showing a configuration example of the relevance score calculation system according to the third embodiment of the present invention. In addition to the components of the relevance score calculation system of the second embodiment (see FIG. 7), the system of the third embodiment further includes a clustering unit 31, a cluster output unit 32, and an exclusion target word reception unit 33. In FIG. 14, elements similar to those shown in FIG. 7 are denoted by the same reference numerals as in FIG. 7, and their description is omitted. Note that the first table generation unit 11 and the second table generation unit 12 each perform the operations described later in addition to the operations described in the second embodiment.

The clustering unit 31 reads the second table 22 (see, for example, FIG. 10) from the storage unit 5 and, based on the second table 22, clusters the words described in it. The clustering unit 31 clusters the words by a clustering method such as the k-means method or a hierarchical clustering algorithm. Such clustering methods can take the second table 22 as input data and cluster the words arranged along its axes. The clustering unit 31 assigns identification information, for example a cluster number, to each cluster and stores the relationship between each cluster and the words belonging to it in the storage unit 5 as a clustering result 23.
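
As a simplified stand-in for the clustering methods named above, the following sketch performs single-linkage agglomerative clustering cut at a fixed level, which reduces to grouping words connected by a chain of sufficiently high degrees of association. The function name `cluster_words`, the cut level of 0.5, and the degree of association between "machine learning" and "deep learning" (0.85) are assumptions; the remaining values are those quoted in the worked examples.

```python
def cluster_words(second_table, min_association):
    """Put two words in the same cluster whenever their degree of association is
    at least min_association, directly or through a chain of other words."""
    words = list(second_table)
    parent = {w: w for w in words}  # union-find forest, one tree per cluster

    def find(w):
        while parent[w] != w:
            parent[w] = parent[parent[w]]  # path halving keeps trees shallow
            w = parent[w]
        return w

    for i, w1 in enumerate(words):
        for w2 in words[i + 1:]:
            if second_table[w1][w2] >= min_association:
                parent[find(w1)] = find(w2)  # merge the two clusters

    clusters = {}
    for w in words:
        clusters.setdefault(find(w), []).append(w)
    # Assign cluster numbers from 1 in order of first appearance.
    return {number: members for number, members in enumerate(clusters.values(), start=1)}

WORDS = ["artificial intelligence", "company A", "machine learning", "deep learning"]
ROWS = [
    [1.00, 0.07, 0.87, 0.79],
    [0.07, 1.00, 0.09, 0.11],
    [0.87, 0.09, 1.00, 0.85],  # 0.85 is a hypothetical value, not from the text
    [0.79, 0.11, 0.85, 1.00],
]
second_table = {w: dict(zip(WORDS, row)) for w, row in zip(WORDS, ROWS)}

clustering_result = cluster_words(second_table, min_association=0.5)
print(clustering_result)
```

Here the three technical terms fall into cluster 1 and "company A" forms cluster 2 on its own. A k-means or full hierarchical implementation would take the same table as input but could produce different groupings.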

The cluster output unit 32 receives a clustering result output request via a communication network from, for example, a terminal device (not shown) of an administrator of the relevance score calculation system 1 (hereinafter simply referred to as the administrator) and, in response to the request, transmits screen information for a screen showing the clustering result to that terminal device. FIG. 15 is a schematic diagram showing an example of a screen showing the clustering result. On this screen, for example, the cluster number and each word belonging to the cluster are displayed for each cluster. A check box 41 is displayed alongside each cluster number and each word. As shown in FIG. 15, the number of words may differ from cluster to cluster. The screen showing the clustering result also includes a confirm button 42.

Upon receiving a clustering result output request from the administrator's terminal device via the communication network, the cluster output unit 32 reads the clustering result 23 from the storage unit 5. Based on the clustering result 23, the cluster output unit 32 then generates screen information for a screen (for example, the screen illustrated in FIG. 15) that displays each cluster number and the words belonging to the cluster indicated by that number, displays a check box 41 alongside each cluster number and each word, and also includes the confirm button 42. The cluster output unit 32 then transmits the screen information to the administrator's terminal device via the communication network.

管理者の端末装置は、クラスタ出力部３２からその画面情報を受信すると、その画面情報に基づいて、例えば、図１５に例示する画面を表示する。 When receiving the screen information from the cluster output unit 32, the administrator's terminal device displays, for example, the screen shown in FIG. 15 based on the screen information.

管理者は、図１５に例示する画面（単語のクラスタリング結果を表示する画面）を確認し、管理者が除外すべきと判断した単語に対応するチェックボックス４１にチェックを入れる。また、管理者は、１つのクラスタに属する単語全てを除外すべきであると判断した場合、そのクラスタに対応するチェックボックス４１にチェックを入れる。単語に対応するチェックボックス４１にチェックが入れられたということは、その単語を除外すべきと判断されたことを意味する。また、クラスタに対応するチェックボックス４１にチェックが入れられたということは、そのクラスタに属する単語全てを除外すべきであると判断されたことを意味する。また、ここで、「除外すべき」とは、第１のテーブルの横軸および第２のテーブルの各軸に並ぶ単語から除外すべきであるということを意味する。 The administrator confirms the screen illustrated in FIG. 15 (screen for displaying the clustering result of words), and puts a check in the check box 41 corresponding to the word determined by the administrator to be excluded. When the administrator determines that all words belonging to one cluster should be excluded, the administrator checks the check box 41 corresponding to that cluster. A check in the check box 41 corresponding to a word means that the word should be excluded. Also, the fact that the check box 41 corresponding to a cluster is checked means that it has been determined that all the words belonging to that cluster should be excluded. Here, "to be excluded" means to be excluded from the words arranged on the horizontal axis of the first table and the respective axes of the second table.

確定ボタン４２は、各単語に対する、除外すべきか否かの判断が完了したことを入力するためのボタンである。管理者の端末装置は、管理者によって確定ボタン４２をクリックされると、どのチェックボックス４１にチェックが入れられたかに応じて、管理者が除外すべきと判断した単語を判定し、管理者によって除外すべきと判断された単語を、関連スコア算出システム１に送信する。 The confirm button 42 is a button for inputting that the determination of whether or not each word should be excluded has been completed. When the confirmation button 42 is clicked by the administrator, the terminal device of the administrator determines the word that the administrator has determined to be excluded according to which check box 41 is checked. The words determined to be excluded are sent to the related score calculation system 1.
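The check box semantics described above (a checked word excludes that word, a checked cluster excludes every word in the cluster) can be illustrated with a minimal Python sketch. The function name `words_to_exclude`, the dictionary layout of the clustering result, and the sample words are assumptions made for illustration only; the specification does not prescribe a data format.

```python
def words_to_exclude(clusters, checked_clusters, checked_words):
    """Return the set of words marked for exclusion.

    clusters: dict mapping a cluster number to the list of words in that cluster
              (a hypothetical layout for the clustering result).
    checked_clusters: cluster numbers whose check box is checked.
    checked_words: individual words whose check box is checked.
    """
    excluded = set(checked_words)
    for number in checked_clusters:
        # A checked cluster means every word belonging to it is excluded.
        excluded.update(clusters[number])
    return excluded


# Hypothetical clustering result with two clusters of different sizes.
clusters = {1: ["Yamada", "Suzuki"], 2: ["development", "meeting", "design"]}
excluded = words_to_exclude(clusters, checked_clusters=[1], checked_words=["meeting"])
```

In this sketch the terminal device would send the contents of `excluded` to the related score calculation system 1 when the confirm button 42 is clicked.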

関連スコア算出システム１の除外対象単語受付部３３は、管理者の端末装置が送信した単語（管理者によって、除外すべきと判断された単語）を、通信ネットワークを介して、受信する。除外対象単語受付部３３が受信する単語は、１つとは限らない。除外対象単語受付部３３は、第１のテーブルの横軸および第２のテーブルの各軸に並ぶ単語から除外すべき単語の指定を受け付けていると言うことができる。 The exclusion target word reception unit 33 of the related score calculation system 1 receives, via the communication network, the word transmitted from the administrator's terminal device (the word determined by the administrator to be excluded). The number of words received by the exclusion target word reception unit 33 is not limited to one. It can be said that the exclusion target word receiving unit 33 receives designation of words to be excluded from the words arranged on the horizontal axis of the first table and the respective axes of the second table.

除外対象単語受付部３３は、管理者の端末装置から受信した単語（除外すべき単語）を第１のテーブル生成部１１に通知する。第１のテーブル生成部１１は、その単語の通知を受けると、既に生成済みの第１のテーブル２１の横軸に並ぶ単語から、通知された単語を除外して、第１のテーブル２１を再度生成する。そして、第１のテーブル生成部１１は、記憶部５に記憶されている第１のテーブル２１を、再度生成した第１のテーブル２１で置き換える。 The exclusion target word reception unit 33 notifies the first table generation unit 11 of the words (words to be excluded) received from the administrator's terminal device. Upon receiving this notification, the first table generation unit 11 removes the notified words from the words arranged on the horizontal axis of the already generated first table 21 and generates the first table 21 again. The first table generation unit 11 then replaces the first table 21 stored in the storage unit 5 with the regenerated first table 21.

第１のテーブル生成部１１が第１のテーブル２１を再度生成すると、第２のテーブル生成部１２は、新たに生成された第１のテーブル２１に基づいて、第２のテーブルを再度生成する。このとき、第２のテーブル生成部１２は、除外対象単語受付部３３が受信した単語を、各軸に並ぶ単語から除外して、第２のテーブルを生成する。そして、第２のテーブル生成部１２は、記憶部５に記憶されている第２のテーブル２２を、再度生成した第２のテーブル２２で置き換える。 When the first table generator 11 regenerates the first table 21 , the second table generator 12 regenerates the second table based on the newly generated first table 21 . At this time, the second table generation unit 12 generates the second table by excluding the words received by the exclusion target word reception unit 33 from the words arranged along each axis. Then, the second table generation unit 12 replaces the second table 22 stored in the storage unit 5 with the regenerated second table 22 .

第１のテーブル２１を再度生成する動作および第２のテーブル２２を再度生成する動作は、それぞれ、軸に並ぶ単語数が減少している点を除けば、第２の実施形態で説明した第１のテーブル２１を生成する動作および第２のテーブル２２を生成する動作と同様である。 The operation of regenerating the first table 21 and the operation of regenerating the second table 22 are the same as the operations of generating the first table 21 and the second table 22 described in the second embodiment, respectively, except that fewer words are arranged along the axes.

クラスタ出力部３２および除外対象単語受付部３３は、例えば、関連スコア算出プログラムに従って動作するコンピュータのＣＰＵおよびそのコンピュータの通信インタフェースによって実現される。また、クラスタリング部３１は、関連スコア算出プログラムに従って動作するそのコンピュータのＣＰＵによって実現される。 The cluster output unit 32 and the exclusion target word reception unit 33 are realized by, for example, a CPU of a computer that operates according to the related score calculation program and a communication interface of the computer. Also, the clustering unit 31 is realized by the CPU of the computer that operates according to the related score calculation program.

本実施形態によれば、管理者によって、第１のテーブルおよび第２のテーブルから除外すべきと判断された単語を、第１のテーブルおよび第２のテーブルから除外することができる。その結果、例えば、ユーザ名を検索キーワードとして、単語の検索を行う場合、管理者が除外すべき単語として指定した単語を、検索結果に含まれないようにすることができる。 According to this embodiment, words determined by the administrator to be excluded from the first and second tables can be excluded from the first and second tables. As a result, for example, when a user name is used as a search keyword to perform a word search, the words specified by the administrator as words to be excluded can be prevented from being included in the search results.

管理者によって、除外すべきと判断される単語の例について説明する。例えば、ユーザ「山田」が、新規作成したり閲覧したりするファイル名に、ユーザ名「山田」を含めることがあり得る。その場合、ユーザ名「山田」を検索キーワードとして、単語の検索を行う場合に、検索結果に「山田」という単語が含まれ得る。しかし、ユーザ名「山田」を検索キーワードとして単語の検索を行う場合に、検索結果に「山田」という単語が含まれていても、検索者にとってはあまり意味がない。 Examples of words that are determined by the administrator to be excluded will be described. For example, a user "Yamada" may include the user name "Yamada" in the name of a file that is newly created or browsed. In that case, when a word search is performed using the user name "Yamada" as a search keyword, the word "Yamada" may be included in the search results. However, when performing a word search using the user name "Yamada" as a search keyword, even if the word "Yamada" is included in the search results, it does not make much sense for the searcher.

そこで、管理者は、例えば、図１５に例示する画面が表示された場合、明らかに、ユーザ名と同じ文字列であると考えられる単語「山田」は除外すべきであると判断し、図１５に示す単語「山田」に対応するチェックボックス４１にチェックを入れ、確定ボタン４２をクリックすればよい。その結果、第１のテーブル生成部１１は、横軸の単語の並びから「山田」を除外した、新たな第１のテーブルを生成する。続いて、第２のテーブル生成部１２は、縦軸および横軸それぞれの単語の並びから「山田」を除外した、新たな第２のテーブルを生成する。その結果、例えば、ユーザ名「山田」を検索キーワードとして単語の検索を行う場合に、検索結果に「山田」という単語が含まれないようにすることができる。 Therefore, for example, when the screen illustrated in FIG. 15 is displayed, the administrator may determine that the word "Yamada", which is clearly the same character string as a user name, should be excluded, check the check box 41 corresponding to the word "Yamada" shown in FIG. 15, and click the confirm button 42. As a result, the first table generation unit 11 generates a new first table in which "Yamada" is excluded from the words on the horizontal axis. Subsequently, the second table generation unit 12 generates a new second table in which "Yamada" is excluded from the words on both the vertical and horizontal axes. As a result, for example, when a word search is performed using the user name "Yamada" as the search keyword, the word "Yamada" can be kept out of the search results.

なお、クラスタ出力部３２は、クラスタリング結果を示す画面を、関連スコア算出システム１が備えるディスプレイ装置（図示略）に表示してもよい。また、除外対象単語受付部３３は、関連スコア算出システム１が備える入力デバイス（図示略）によって、そのディスプレイ装置に表示された画面に対する操作（チェックボックス４１へのチェックの入力、および、確定ボタン４２のクリック）を受け付け、その操作に応じて、除外すべき単語の指定を受け付けてもよい。 Note that the cluster output unit 32 may display the screen showing the clustering result on a display device (not shown) included in the related score calculation system 1. In that case, the exclusion target word reception unit 33 may accept operations on the screen displayed on that display device (checking a check box 41 and clicking the confirm button 42) through an input device (not shown) included in the related score calculation system 1, and may accept the designation of words to be excluded according to those operations.

実施形態４． Embodiment 4.
第１の実施形態から第３の実施形態では、収集部２が操作ログを収集し、単語抽出部３が、各操作ログに記述されている各ファイル名から単語を抽出し、スコア算出部４が、ファイルを操作した各ユーザと抽出された各単語の組合せ毎に、関連スコアを算出する場合を示した。 The first to third embodiments described the case in which the collection unit 2 collects operation logs, the word extraction unit 3 extracts words from each file name described in each operation log, and the score calculation unit 4 calculates a related score for each combination of a user who operated a file and an extracted word.

第４の実施形態では、収集部がファイルの操作ログではなく、スケジュール情報を収集する場合を例にして説明する。 In the fourth embodiment, an example will be described in which the collection unit collects schedule information instead of file operation logs.

図１６は、第４の実施形態の関連スコア算出システムの構成例を示すブロック図である。図１６に示す関連スコア算出システム１は、第１の実施形態における収集部２、単語抽出部３およびスコア算出部４を、収集部５２、単語抽出部５３およびスコア算出部５４に置き換えたものである。記憶部５は、第１の実施形態における記憶部５と同様である。 FIG. 16 is a block diagram showing a configuration example of the related score calculation system according to the fourth embodiment. The related score calculation system 1 shown in FIG. 16 is obtained by replacing the collection unit 2, the word extraction unit 3, and the score calculation unit 4 of the first embodiment with a collection unit 52, a word extraction unit 53, and a score calculation unit 54. The storage unit 5 is the same as the storage unit 5 in the first embodiment.

また、図１６に示す関連スコア算出システム１は、組織に属する各人のスケジュール情報を保持するスケジュール管理サーバ６１に、通信ネットワーク１０を介して接続されている。 Also, the related score calculation system 1 shown in FIG. 16 is connected via a communication network 10 to a schedule management server 61 that holds schedule information for each person belonging to an organization.

収集部５２は、スケジュール管理サーバ６１に記憶されている、組織に属する各人のスケジュール情報を、スケジュール管理サーバ６１から収集する。なお、スケジュール情報は個々のＰＣ（図１６において図示略）に記憶されていてもよい。この場合、収集部５２は、その個々のＰＣから組織に属する各人のスケジュール情報を収集すればよい。 The collecting unit 52 collects from the schedule management server 61 the schedule information of each person belonging to the organization, which is stored in the schedule management server 61 . The schedule information may be stored in individual PCs (not shown in FIG. 16). In this case, the collection unit 52 may collect the schedule information of each person belonging to the organization from each individual PC.

以下、組織に属する人の識別情報を、人識別情報と記す。また、人識別情報として、例えば、「山田」等の人の名を用いる場合を例にして説明する。 Hereinafter, the identification information of a person belonging to an organization is referred to as person identification information. Also, a case where a person's name such as "Yamada" is used as the person identification information will be described as an example.

図１７は、スケジュール情報の例を示す模式図である。スケジュール情報は、例えば、図１７に例示するように、組織に属する人の人識別情報と、イベント名と、そのイベント名を有するイベントにその人が関わった時間帯とが関連付けて記述されている。時間帯は、開始日時および終了日時によって表される。 FIG. 17 is a schematic diagram showing an example of schedule information. In the schedule information, as illustrated in FIG. 17 for example, the person identification information of a person belonging to the organization, an event name, and the time period during which that person was involved in the event having that event name are described in association with one another. The time period is represented by a start date and time and an end date and time.

図１７に示す１番目のスケジュール情報は、「山田」という人が、２０１７年８月１７日１０時から同日の１２時まで、「人工知能開発会議」に関わった（換言すれば、出席した）ということを表している。以下、ある人が、あるイベントに関わった時間をイベント参加時間と記す。１つのイベントに関するイベント参加時間は、そのイベントに関連付けて記述された終了日時から開始日時を減算して得られる時間である。 The first item of schedule information shown in FIG. 17 indicates that a person named "Yamada" was involved in (in other words, attended) the "Artificial Intelligence Development Conference" from 10:00 to 12:00 on August 17, 2017. Hereinafter, the time during which a person was involved in an event is referred to as the event participation time. The event participation time for one event is the time obtained by subtracting the start date and time from the end date and time described in association with that event.
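The subtraction that defines the event participation time can be sketched in Python using the example entry from FIG. 17. The function name `event_participation_time` is an assumption introduced here, not a name from the specification.

```python
from datetime import datetime


def event_participation_time(start, end):
    """Event participation time: end date and time minus start date and time."""
    return end - start


# Example entry from FIG. 17: "Yamada", 2017-08-17, 10:00 to 12:00.
start = datetime(2017, 8, 17, 10, 0)
end = datetime(2017, 8, 17, 12, 0)
duration = event_participation_time(start, end)
hours = duration.total_seconds() / 3600  # participation time expressed in hours
```

For this entry the participation time is two hours.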

単語抽出部５３は、収集部５２によって収集された各スケジュール情報に記述されている各イベント名に対して形態素解析を実行することによって、イベント名に含まれている単語を抽出する。 The word extraction unit 53 extracts words included in the event names by executing morphological analysis on each event name described in each schedule information collected by the collection unit 52 .

ただし、単語抽出部５３は、同一の単語を、重複して抽出しない。例えば、単語抽出部５３は、「人工知能」という単語を既に抽出している場合、２回目以降に抽出された「人工知能」という単語については無視する。この点は、既に説明した単語抽出部３と同様である。 However, the word extraction unit 53 does not extract the same word redundantly. For example, when the word extraction unit 53 has already extracted the word "artificial intelligence", it ignores the word "artificial intelligence" extracted for the second and subsequent times. This point is the same as that of the word extraction unit 3 already described.

例えば、単語抽出部５３は、「人工知能開発会議」というイベント名に対して形態素解析を実行することによって、「人工知能」、「開発」、「会議」等の単語を抽出する。 For example, the word extraction unit 53 extracts words such as "artificial intelligence", "development", and "meeting" by executing morphological analysis on the event name "artificial intelligence development conference".

単語抽出部５３は、この処理を、各スケジュール情報に記述されている各イベント名に対して行うことによって、単語の集合を得る。同一の単語は重複して抽出されないので、この集合に属する単語は、互いに異なる。 The word extraction unit 53 obtains a set of words by performing this process on each event name described in each schedule information. The words belonging to this set are different from each other because the same word is not extracted redundantly.
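The extraction of a duplicate-free word set from event names can be sketched as follows. Real morphological analysis requires a dedicated analyzer; the `toy_tokenize` function below is only a stand-in that performs greedy longest-match splitting against a small hand-written vocabulary, and both the vocabulary and the second event name are hypothetical.

```python
def toy_tokenize(event_name, vocabulary):
    """Greedy longest-match split; a stand-in for real morphological analysis."""
    words = []
    i = 0
    candidates = sorted(vocabulary, key=len, reverse=True)
    while i < len(event_name):
        match = next((w for w in candidates if event_name.startswith(w, i)), None)
        if match is None:
            i += 1  # skip characters not covered by the toy vocabulary
        else:
            words.append(match)
            i += len(match)
    return words


def extract_words(event_names, vocabulary):
    """Collect words from all event names, ignoring words already extracted."""
    extracted = []
    for name in event_names:
        for word in toy_tokenize(name, vocabulary):
            if word not in extracted:  # a repeated word is ignored
                extracted.append(word)
    return extracted


vocabulary = ["人工知能", "開発", "会議", "報告"]
words = extract_words(["人工知能開発会議", "人工知能報告会議"], vocabulary)
```

Because repeated words are skipped, every element of `words` is distinct, matching the property of the word set described above.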

スコア算出部５４は、組織に属する各人と、単語抽出部５３によって抽出された各単語の組合せ毎に、人と単語との関連の強さを示す関連スコアを算出する。この関連スコアは、第１の実施形態から第３の実施形態までにおける関連スコアと同様である。ただし、第４の実施形態では、キータッチ回数は、関連スコアとして用いられない。すなわち、単語抽出部５３は、第１の実施形態で説明した第２の算出方法で関連スコアを算出することはない。 The score calculation unit 54 calculates, for each combination of each person belonging to the organization and each word extracted by the word extraction unit 53, a relationship score indicating the strength of the relationship between the person and the word. This relevance score is similar to the relevance scores in the first to third embodiments. However, in the fourth embodiment, the number of key touches is not used as a related score. That is, the word extraction unit 53 does not calculate the related score by the second calculation method described in the first embodiment.

以下、第４の実施形態における関連スコアの算出方法として、２種類の方法を説明する。以下に示す２種類のいずれの方法においても、スコア算出部５４は、人と単語の組合せ毎に、関連スコアを算出し、記憶部５に記憶させる。 Two types of methods will be described below as the method of calculating the related score in the fourth embodiment. In any of the two types of methods described below, the score calculation unit 54 calculates a related score for each combination of a person and a word, and causes the storage unit 5 to store the score.

第４の実施形態における関連スコアの第１の算出方法は、第１の実施形態における関連スコアの第１の算出方法と同様の方法である。ただし、ファイルの操作時間の代わりに、イベント参加時間を用いる。 The first calculation method of the related score in the fourth embodiment is the same method as the first calculation method of the related score in the first embodiment. However, event participation time is used instead of file operation time.

第４の実施形態における関連スコアの第１の算出方法は、一の人（以下、人Ｈと記す。）と一の単語（以下、単語Ｗと記す。）の関連スコアとして、単語Ｗをイベント名に含む各イベントへの人Ｈのイベント参加時間の総和を算出する方法である。イベント参加時間が長いほど、人Ｈと単語Ｗの関連が強く、イベント参加時間が短いほど、人Ｈと単語Ｗの関連が弱いと言える。従って、イベント参加時間を、関連スコアとして用いることができる。 The first method of calculating the related score in the fourth embodiment calculates, as the related score between one person (hereinafter referred to as person H) and one word (hereinafter referred to as word W), the sum of person H's event participation times for the events whose event names contain word W. The longer the event participation time, the stronger the relation between person H and word W; the shorter the event participation time, the weaker the relation. The event participation time can therefore be used as a related score.

第１の算出方法では、スコア算出部５４は、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を算出し、その総和を、人Ｈと単語Ｗの関連スコアとする。 In the first calculation method, the score calculation unit 54 calculates the sum of the event participation times of the person H for each event including the word W in the event name, and sets the sum as the relation score between the person H and the word W.

すなわち、スコア算出部５４は、単語Ｗをイベント名に含み、人Ｈが参加したイベント毎にイベント参加時間を算出し、その総和を、人Ｈと単語Ｗの関連スコアとする。 That is, the score calculation unit 54 calculates the event participation time for each event that contains word W in its event name and in which person H participated, and uses the sum of those times as the related score between person H and word W.
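The first calculation method can be sketched in Python as a filtered sum. The tuple layout of `schedule`, the function name `total_participation`, and the sample entries are assumptions for illustration. The sketch also tests whether a word appears in an event name with a plain substring check, which is a simplification of matching against the words obtained by morphological analysis.

```python
from datetime import datetime, timedelta


def total_participation(schedule, person, word):
    """Sum of the person's participation time over events whose name contains the word."""
    total = timedelta()
    for who, event_name, start, end in schedule:
        # Substring check is a simplification (assumption) of word matching.
        if who == person and word in event_name:
            total += end - start
    return total


# Hypothetical schedule information: (person, event name, start, end).
schedule = [
    ("Yamada", "AI development meeting", datetime(2017, 8, 17, 10), datetime(2017, 8, 17, 12)),
    ("Yamada", "AI study group", datetime(2017, 8, 18, 13), datetime(2017, 8, 18, 14)),
    ("Suzuki", "AI development meeting", datetime(2017, 8, 17, 10), datetime(2017, 8, 17, 12)),
]
score = total_participation(schedule, "Yamada", "AI")
```

Here `score` is the related score between "Yamada" and "AI" under the first method: two hours from the first event plus one hour from the second.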

第４の実施形態における関連スコアの第２の算出方法は、第１の実施形態における関連スコアの第３の算出方法と同様の方法である。ただし、ファイルの操作時間の代わりに、イベント参加時間を用いる。 The second calculation method of the relation score in the fourth embodiment is similar to the third calculation method of the relation score in the first embodiment. However, event participation time is used instead of file operation time.

本実施形態における関連スコアの第２の算出方法は、一の人（人Ｈ）と一の単語（単語Ｗ）の関連スコアを、次に説明する２つの割合に基づいて算出する方法である。この２つの割合のうち、一方の割合をＱ_１と記し、もう一方の割合をＱ_２と記す。 The second method of calculating the related score in this embodiment calculates the related score between one person (person H) and one word (word W) based on the two ratios described below. Of these two ratios, one is denoted Q_1 and the other Q_2.

Ｑ_１は、単語Ｗをイベント名に含む各イベントに対する組織内の全ての人のイベント参加時間の総和に対する、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和の割合である。すなわち、Ｑ_１は、以下に示す式（７）で表される。 Q_1 is the ratio of the sum of person H's event participation times for the events containing word W in the event name to the sum of the event participation times of all persons in the organization for the events containing word W in the event name. That is, Q_1 is represented by equation (7) below.
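Equation (7) does not appear in this text (it was presumably rendered as an image). A reconstruction from the verbal definition above is sketched below; the symbols T(h, w) and U are notation introduced here for illustration and are not taken from the specification.

```latex
% T(h, w): sum of person h's event participation times over events whose name contains word w
% U: set of all persons in the organization
Q_1 = \frac{T(H, W)}{\sum_{h \in U} T(h, W)} \tag{7}
```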

Ｑ_２は、個々の単語に着目した場合における、着目した単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和に対する、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和の割合である。すなわち、Ｑ２は、以下に示す式（８）で表される。 Q_2 is the ratio of the sum of person H's event participation times for the events containing word W in the event name to the total obtained by taking, for each individual word, the sum of person H's event participation times for the events containing that word in the event name, and adding these sums over all words. That is, Q_2 is represented by equation (8) below.
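Equation (8) likewise does not appear in this text. A reconstruction from the verbal definition above is sketched below; the symbols T(h, w) and V are notation introduced here for illustration and are not taken from the specification.

```latex
% T(h, w): sum of person h's event participation times over events whose name contains word w
% V: set of words extracted by the word extraction unit 53
Q_2 = \frac{T(H, W)}{\sum_{w \in V} T(H, w)} \tag{8}
```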

Ｑ_１について説明する。単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和（式（７）の右辺の分子）は、前述の第１の算出方法で算出される関連スコアに相当する。すなわち、スコア算出部５４は、前述の第１の算出方法で説明した方法で、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を算出すればよい。 Q_1 will now be explained. The sum of person H's event participation times for the events containing word W in the event name (the numerator on the right side of equation (7)) corresponds to the related score calculated by the first calculation method described above. That is, the score calculation unit 54 may calculate this sum by the method described for the first calculation method.

単語Ｗをイベント名に含む各イベントに対する組織内の全ての人のイベント参加時間の総和（式（７）の右辺の分母）について、説明する。スコア算出部５４は、単語Ｗをイベント名に含む各イベントに対する、組織内の一人目の人のイベント参加時間の総和も、前述の第１の算出方法で説明した方法で算出する。同様に、スコア算出部５４は、単語Ｗをイベント名に含む各イベントに対する、組織内の二人目の人のイベント参加時間の総和も、前述の第１の算出方法で説明した方法で算出する。同様に、スコア算出部５４は、組織に属する一人一人について、単語Ｗをイベント名に含む各イベントに対する人のイベント参加時間の総和を算出する。さらに、スコア算出部５４は、組織に属する一人一人について算出した「単語Ｗをイベント名に含む各イベントに対する人のイベント参加時間の総和」の総和を算出する。この値が、単語Ｗをイベント名に含む各イベントに対する組織内の全ての人のイベント参加時間の総和に該当する。 The sum of event participation times of all persons in the organization (the denominator on the right side of Equation (7)) for each event containing the word W in the event name will be described. The score calculation unit 54 also calculates the sum of the event participation times of the first person in the organization for each event including the word W in the event name, using the method described in the first calculation method above. Similarly, the score calculation unit 54 calculates the sum of the event participation times of the second person in the organization for each event including the word W in the event name, using the method described in the first calculation method above. Similarly, the score calculation unit 54 calculates the sum of the event participation time of each person belonging to the organization for each event including the word W in the event name. Furthermore, the score calculation unit 54 calculates the sum of "the sum of the event participation time of people for each event including the word W in the event name" calculated for each person belonging to the organization. This value corresponds to the sum of event participation times of all persons in the organization for each event containing the word W in the event name.

スコア算出部５４は、上記のように算出した式（７）の右辺の分子、分母を用いて、式（７）の計算により、Ｑ_１を算出する。 Using the numerator and denominator on the right side of equation (7) calculated as described above, the score calculation unit 54 calculates Q_1 according to equation (7).

次に、Ｑ_２について説明する。式（８）の右辺の分子は、式（７）の右辺の分子と同じである。従って、スコア算出部５４は、前述の第１の算出方法で説明した方法で、単語Ｗをイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を算出すればよい。 Next, Q_2 will be explained. The numerator on the right side of equation (8) is the same as the numerator on the right side of equation (7). Therefore, the score calculation unit 54 may calculate the sum of person H's event participation times for the events containing word W in the event name by the method described for the first calculation method.

個々の単語に着目した場合における、着目した単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和（式（８）の右辺の分母）について説明する。スコア算出部５４は、単語抽出部５３によって抽出された個々の単語に着目する（換言すれば、個々の単語を１つ１つ選択する）。そして、スコア算出部５４は、着目した単語（選択した単語）をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を、前述の第１の算出方法で説明した方法で算出する。スコア算出部５４は、次の単語に着目し（換言すれば、次の単語を選択し）、着目した単語（選択した単語）をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を、前述の第１の算出方法で説明した方法で算出する。このように、スコア算出部５４は、単語毎に、単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を算出する。そして、スコア算出部５４は、単語毎に算出した「単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和」の総和を算出する。この値が、式（８）の分母に該当する。「個々の単語に着目した場合における、着目した単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和（式（８）の右辺の分母）」は、「単語をイベント名に含む各イベントに対する人Ｈのイベント参加時間の総和を単語毎に求めた場合における前記総和の総和」であると言うことができる。 The denominator on the right side of equation (8), that is, the sum of person H's event participation times for the events containing the focused word in the event name when each individual word is focused on in turn, will now be explained. The score calculation unit 54 focuses on the words extracted by the word extraction unit 53 one at a time (in other words, it selects the words one by one). For the focused (selected) word, the score calculation unit 54 calculates the sum of person H's event participation times for the events containing that word in the event name, using the method described for the first calculation method. It then focuses on (selects) the next word and calculates the same sum for that word in the same way. In this manner, the score calculation unit 54 calculates, for each word, the sum of person H's event participation times for the events containing that word in the event name. The score calculation unit 54 then adds up these per-word sums. The resulting value is the denominator of equation (8). In other words, the denominator on the right side of equation (8) is the sum of the per-word sums obtained when the sum of person H's event participation times for the events containing a word in the event name is calculated for each word.

スコア算出部５４は、上記のように算出した式（８）の右辺の分子、分母を用いて、式（８）の計算により、Ｑ_２を算出する。 Using the numerator and denominator on the right side of equation (8) calculated as described above, the score calculation unit 54 calculates Q_2 according to equation (8).

スコア算出部５４は、Ｑ_１，Ｑ_２を求めた後、人Ｈと単語Ｗの関連スコアを、以下に示す式（９）によって算出する。 After obtaining Q_1 and Q_2, the score calculation unit 54 calculates the related score between person H and word W using equation (9) below.

関連スコア＝Ｑ_１×ｌｏｇ（Ｑ_２）・・・（９） Related score = Q_1 × log(Q_2) ... (9)

第４の実施形態における第２の算出方法で関連スコアを算出した場合、第１の実施形態で説明した第３の算出方法で関連スコアを算出した場合と同様の効果が得られる。 When the relation score is calculated by the second calculation method in the fourth embodiment, the same effect as when the relation score is calculated by the third calculation method described in the first embodiment can be obtained.
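The second calculation method can be sketched in Python by following the verbal definitions of Q_1 and Q_2 and equation (9) as written. Several points are assumptions: the nested dictionary `hours` (per-person, per-word participation totals already computed by the first method), the function name `second_method_score`, the sample values, and the use of the natural logarithm, since the text does not state the base. The text also does not say how a zero participation time is handled, so the sketch simply raises an error in that case.

```python
import math


def second_method_score(hours, person, word):
    """Related score = Q_1 * log(Q_2), following equation (9) as written.

    hours[p][w]: sum of person p's participation time (in hours) over events
    whose name contains word w (hypothetical precomputed layout).
    """
    numerator = hours[person].get(word, 0.0)
    if numerator <= 0:
        # Assumption: the source does not define the score for this case.
        raise ValueError("score undefined when the person has no time for the word")
    # Denominator of (7): the same word, summed over every person.
    all_people = sum(per_word.get(word, 0.0) for per_word in hours.values())
    # Denominator of (8): the same person, summed over every word.
    all_words = sum(hours[person].values())
    q1 = numerator / all_people
    q2 = numerator / all_words
    return q1 * math.log(q2)  # natural logarithm assumed


hours = {
    "Yamada": {"AI": 2.0, "meeting": 2.0},
    "Suzuki": {"AI": 2.0, "design": 4.0},
}
score = second_method_score(hours, "Yamada", "AI")
```

In this example both ratios equal 0.5. Because Q_2 cannot exceed 1, log(Q_2) is never positive, so a score computed literally from equation (9) is zero or negative.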

収集部５２は、例えば、関連スコア算出プログラムに従って動作するコンピュータのＣＰＵおよびそのコンピュータの通信インタフェースによって実現される。例えば、ＣＰＵが、コンピュータのプログラム記憶装置等のプログラム記録媒体から関連スコア算出プログラムを読み込み、関連スコア算出プログラムに従って、通信インタフェースを用いて、収集部５２として動作すればよい。また、単語抽出部５３およびスコア算出部５４も、例えば、関連スコア算出プログラムに従って動作する上記のコンピュータのＣＰＵによって実現される。すなわち、上記のように、関連スコア算出プログラムを読み込んだＣＰＵが、関連スコア算出プログラムに従って、単語抽出部５３およびスコア算出部５４として動作すればよい。記憶部５は、上記のコンピュータの記憶装置によって実現される。また、収集部５２、単語抽出部５３およびスコア算出部５４がそれぞれ別々のハードウェアによって実現されてもよい。 The collection unit 52 is implemented by, for example, a CPU of a computer that operates according to a related score calculation program and a communication interface of the computer. For example, the CPU may read the related score calculation program from a program recording medium such as a program storage device of the computer, and operate as the collection unit 52 using the communication interface according to the related score calculation program. The word extractor 53 and the score calculator 54 are also realized by the CPU of the above computer that operates according to the related score calculator program, for example. That is, as described above, the CPU that has read the related score calculation program should operate as the word extractor 53 and the score calculator 54 according to the related score calculation program. The storage unit 5 is implemented by the storage device of the computer described above. Also, the collecting unit 52, the word extracting unit 53, and the score calculating unit 54 may be realized by separate hardware.

第４の実施形態は、第１の実施形態の変形例（図５参照）、第２の実施形態（図７参照）、および第３の実施形態（図１４参照）に適用されてもよい。すなわち、第１の実施形態の変形例（図５参照）において、収集部２、単語抽出部３およびスコア算出部４を、第４の実施形態で説明した収集部５２、単語抽出部５３およびスコア算出部５４に置き換えてもよい。また、第２の実施形態（図７参照）において、収集部２、単語抽出部３およびスコア算出部４を、第４の実施形態で説明した収集部５２、単語抽出部５３およびスコア算出部５４に置き換えてもよい。また、第３の実施形態（図１４参照）において、収集部２、単語抽出部３およびスコア算出部４を、第４の実施形態で説明した収集部５２、単語抽出部５３およびスコア算出部５４に置き換えてもよい。 The fourth embodiment may be applied to the modification of the first embodiment (see FIG. 5), the second embodiment (see FIG. 7), and the third embodiment (see FIG. 14). That is, in the modification of the first embodiment (see FIG. 5), the collection unit 2, the word extraction unit 3, and the score calculation unit 4 may be replaced with the collection unit 52, the word extraction unit 53, and the score calculation unit 54 described in the fourth embodiment. The same replacement may be made in the second embodiment (see FIG. 7) and in the third embodiment (see FIG. 14).

図１８は、本発明の各実施形態に係るコンピュータの構成例を示す概略ブロック図である。コンピュータ１０００は、ＣＰＵ１００１と、主記憶装置１００２と、補助記憶装置１００３と、インタフェース１００４と、通信インタフェース１００５とを備える。 FIG. 18 is a schematic block diagram showing a configuration example of a computer according to each embodiment of the present invention. Computer 1000 includes CPU 1001 , main memory 1002 , auxiliary memory 1003 , interface 1004 , and communication interface 1005 .

本発明の各実施形態の関連スコア算出システム１は、コンピュータ１０００に実装される。関連スコア算出システム１の動作は、関連スコア算出プログラムの形式で補助記憶装置１００３に記憶されている。ＣＰＵ１００１は、その関連スコア算出プログラムを補助記憶装置１００３から読み出して主記憶装置１００２に展開し、その関連スコア算出プログラムに従って上記の処理を実行する。 A related score calculation system 1 according to each embodiment of the present invention is implemented in a computer 1000 . The operation of the related score calculation system 1 is stored in the auxiliary storage device 1003 in the form of a related score calculation program. The CPU 1001 reads out the associated score calculation program from the auxiliary storage device 1003, develops it in the main storage device 1002, and executes the above processing according to the associated score calculation program.

補助記憶装置１００３は、一時的でない有形の媒体の例である。一時的でない有形の媒体の他の例として、インタフェース１００４を介して接続される磁気ディスク、光磁気ディスク、ＣＤ－ＲＯＭ（Compact Disk Read Only Memory ）、ＤＶＤ－ＲＯＭ（Digital Versatile Disk Read Only Memory ）、半導体メモリ等が挙げられる。また、このプログラムが通信回線によってコンピュータ１０００に配信される場合、配信を受けたコンピュータ１０００がそのプログラムを主記憶装置１００２に展開し、上記の処理を実行してもよい。 The auxiliary storage device 1003 is an example of a non-transitory tangible medium. Other examples of non-transitory tangible media include a magnetic disk, a magneto-optical disk, a CD-ROM (Compact Disk Read Only Memory), a DVD-ROM (Digital Versatile Disk Read Only Memory), and a semiconductor memory connected via the interface 1004. When the program is distributed to the computer 1000 via a communication line, the computer 1000 that receives it may load the program into the main storage device 1002 and execute the above processing.

また、プログラムは、前述の処理の一部を実現するためのものであってもよい。さらに、プログラムは、補助記憶装置１００３に既に記憶されている他のプログラムとの組み合わせで前述の処理を実現する差分プログラムであってもよい。 The program may also implement only part of the processing described above. Furthermore, the program may be a differential program that implements the processing described above in combination with another program already stored in the auxiliary storage device 1003.

また、各構成要素の一部または全部は、汎用または専用の回路（circuitry ）、プロセッサ等やこれらの組み合わせによって実現されてもよい。これらは、単一のチップによって構成されてもよいし、バスを介して接続される複数のチップによって構成されてもよい。各構成要素の一部または全部は、上述した回路等とプログラムとの組み合わせによって実現されてもよい。 Also, part or all of each component may be realized by general-purpose or dedicated circuitry, processors, etc., or combinations thereof. These may be composed of a single chip, or may be composed of multiple chips connected via a bus. A part or all of each component may be implemented by a combination of the above-described circuit or the like and a program.

各構成要素の一部または全部が複数の情報処理装置や回路等により実現される場合には、複数の情報処理装置や回路等は集中配置されてもよいし、分散配置されてもよい。例えば、情報処理装置や回路等は、クライアントアンドサーバシステム、クラウドコンピューティングシステム等、各々が通信ネットワークを介して接続される形態として実現されてもよい。 When a part or all of each component is realized by a plurality of information processing devices, circuits, etc., the plurality of information processing devices, circuits, etc. may be arranged centrally or distributedly. For example, the information processing device, circuits, and the like may be implemented as a client-and-server system, a cloud computing system, or the like, each of which is connected via a communication network.

Next, an overview of the present invention will be described. FIG. 19 is a block diagram showing an overview of the present invention. The relation score calculation system of the present invention includes a collection unit 82, a word extraction unit 83, and a relation score calculation unit 84.

The collection unit 82 (for example, the collection unit 2) collects, from terminal devices, operation logs, which are records of users' file operations.

The word extraction unit 83 (for example, the word extraction unit 3) extracts words from the file names described in the operation logs.

The relation score calculation unit 84 (for example, the score calculation unit 4) calculates, based on the operation logs, a relation score representing the strength of the relation between a user and a word.

Such a configuration makes it possible to clarify the strength of the relation between people in an organization and words.
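The three units above can be illustrated with a minimal Python sketch. It is not the implementation of the embodiments: the record layout `(user_id, file_name, operation_seconds)`, the sample data, and the regex tokenizer are all assumptions made for illustration (Japanese file names would likely need morphological analysis instead), and the score used here is the total operation time, which is only one of the score definitions this document mentions.

```python
import re
from collections import defaultdict

# Hypothetical operation-log records: (user_id, file_name, operation_seconds).
LOGS = [
    ("u001", "sales_forecast_2017.xlsx", 1200),
    ("u001", "sales_report.docx", 600),
    ("u002", "forecast_model.py", 900),
]


def extract_words(file_name):
    """Split a file name into lowercase words (assumed, simplistic tokenizer)."""
    stem = file_name.rsplit(".", 1)[0]  # drop the extension, if any
    return {w.lower() for w in re.split(r"[^A-Za-z]+", stem) if w}


def relation_scores(logs):
    """Sum the operation time for each (user, word) pair."""
    scores = defaultdict(int)
    for user, file_name, seconds in logs:
        for word in extract_words(file_name):
            scores[(user, word)] += seconds
    return dict(scores)


scores = relation_scores(LOGS)
print(scores[("u001", "sales")])  # 1800
```

Pairs that never occur in the logs, such as `("u002", "sales")`, are simply absent from the result rather than stored with a score of zero.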

FIG. 20 is a block diagram showing another example of an overview of the present invention. The relation score calculation system of the present invention includes a collection unit 86, a word extraction unit 87, and a relation score calculation unit 88.

The collection unit 86 (for example, the collection unit 52) collects schedule information describing a person, an event name, and the time period during which the person was involved in the event having that event name.

The word extraction unit 87 (for example, the word extraction unit 53) extracts words from the event names described in the schedule information.

The relation score calculation unit 88 (for example, the score calculation unit 54) calculates, based on the schedule information, a relation score representing the strength of the relation between a person and a word.

Such a configuration also makes it possible to clarify the strength of the relation between people in an organization and words.
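The schedule-based variant can be sketched in the same way. The entry layout, the timestamp format, and the whitespace tokenizer below are assumptions for illustration only; the score is taken to be the total time a person spent in events whose names contain the word, in line with the participation-time totals recited in the claims.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical schedule entries: (person, event_name, start, end).
SCHEDULE = [
    ("alice", "budget review", "2017-11-06 10:00", "2017-11-06 11:30"),
    ("alice", "budget planning", "2017-11-07 13:00", "2017-11-07 14:00"),
    ("bob", "design review", "2017-11-06 10:00", "2017-11-06 10:30"),
]

FMT = "%Y-%m-%d %H:%M"  # assumed timestamp format


def schedule_scores(entries):
    """Sum participation minutes for each (person, word) pair."""
    totals = defaultdict(float)
    for person, event_name, start, end in entries:
        delta = datetime.strptime(end, FMT) - datetime.strptime(start, FMT)
        minutes = delta.total_seconds() / 60
        # Assumed tokenizer: split the event name on whitespace.
        for word in set(event_name.lower().split()):
            totals[(person, word)] += minutes
    return dict(totals)


schedule_totals = schedule_scores(SCHEDULE)
print(schedule_totals[("alice", "budget")])  # 150.0
```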

The embodiments of the present invention described above can also be described as in the following supplementary notes (appendices), but are not limited to them.

(Appendix 1)
A relation score calculation system comprising:
a collection unit that collects, from a terminal device, operation logs, which are records of file operations performed by users;
a word extraction unit that extracts words from the file names described in the operation logs; and
a relation score calculation unit that calculates, based on the operation logs, a relation score representing the strength of the relation between a user and a word.

(Appendix 2)
The relation score calculation system according to Appendix 1, wherein the relation score calculation unit calculates, as the relation score between one user and one word, the sum of the one user's operation times for the files whose file names contain the one word.

(Appendix 3)
The relation score calculation system according to Appendix 1, wherein the relation score calculation unit calculates, as the relation score between one user and one word, the sum of the numbers of key touches made when the one user operated the files whose file names contain the one word.

(Appendix 4)
The relation score calculation system according to Appendix 1, wherein the relation score calculation unit calculates the relation score between one user and one word based on:
the ratio of the sum of the one user's operation times for the files whose file names contain the one word to the sum of the operation times of all users in the organization for those files; and
the ratio of the sum of the one user's operation times for the files whose file names contain the one word to the total of the per-word sums obtained when the sum of the one user's operation times for the files whose file names contain a word is computed for each word.
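The two ratios of Appendix 4 can be made concrete with a short Python sketch. The input mapping `TOTALS` (summed operation time per user and word) and its values are hypothetical. Appendix 4 only says the score is "based on" the two ratios, so multiplying them in the last line is an assumption, not the method stated in this document.

```python
# Hypothetical input: (user_id, word) -> summed operation time in seconds.
TOTALS = {
    ("u001", "sales"): 1800,
    ("u002", "sales"): 600,
    ("u001", "forecast"): 1200,
}


def ratio_scores(totals, user, word):
    """Return (share of the word's time held by the user,
    share of the user's time spent on the word)."""
    own = totals.get((user, word), 0)
    # Denominator 1: all users' time on files containing this word.
    all_users = sum(t for (u, w), t in totals.items() if w == word)
    # Denominator 2: this user's per-word sums, added over all words.
    all_words = sum(t for (u, w), t in totals.items() if u == user)
    share_of_word = own / all_users if all_users else 0.0
    share_of_user = own / all_words if all_words else 0.0
    return share_of_word, share_of_user


share_of_word, share_of_user = ratio_scores(TOTALS, "u001", "sales")
combined = share_of_word * share_of_user  # assumed way of combining the ratios
```

Here user `u001` accounts for 1800 of the 2400 seconds spent on "sales" files across all users, and 1800 of the 3000 seconds in that user's own per-word totals. The first ratio is high for a user who dominates work on a word, and the second is high when the word dominates that user's work.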

(Appendix 5)
The relation score calculation system according to any one of Appendices 1 to 4, further comprising:
a keyword reception unit that receives a search keyword; and
a search unit that executes a search according to the search keyword,
wherein the keyword reception unit receives a word as the search keyword, and
the search unit searches for a user ID corresponding to the word based on the relation scores.

(Appendix 6)
The relation score calculation system according to any one of Appendices 1 to 5, further comprising:
a keyword reception unit that receives a search keyword; and
a search unit that executes a search according to the search keyword,
wherein the keyword reception unit receives a user ID as the search keyword, and
the search unit searches for a word corresponding to the user ID based on the relation scores.
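The two search directions of Appendices 5 and 6 can be sketched as lookups over a list of (user ID, word, relation score) rows. The table contents and function names are hypothetical. The threshold filter follows the abstract, which selects pairs whose relation score is equal to or greater than a threshold; ordering the results by descending score is an added assumption.

```python
# Hypothetical rows: (user_id, word, relation_score).
SCORE_TABLE = [
    ("u001", "sales", 1800),
    ("u002", "sales", 600),
    ("u003", "sales", 2400),
    ("u001", "forecast", 1200),
]


def users_for_word(table, word, threshold=0):
    """User IDs whose score for `word` is at least `threshold`, best first."""
    hits = [(s, uid) for uid, w, s in table if w == word and s >= threshold]
    return [uid for s, uid in sorted(hits, reverse=True)]


def words_for_user(table, user_id, threshold=0):
    """Words whose score for `user_id` is at least `threshold`, best first."""
    hits = [(s, w) for uid, w, s in table if uid == user_id and s >= threshold]
    return [w for s, w in sorted(hits, reverse=True)]


print(users_for_word(SCORE_TABLE, "sales", threshold=1000))  # ['u003', 'u001']
print(words_for_user(SCORE_TABLE, "u001"))  # ['sales', 'forecast']
```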

(Appendix 7)
The relation score calculation system according to any one of Appendices 1 to 4, further comprising:
a first table generation unit that generates a first table describing the relationship among user IDs, words, and relation scores;
a second table generation unit that generates a second table describing degrees of relatedness, each representing the strength of the relation between words, calculated based on the first table;
a keyword reception unit that receives a search keyword; and
a search unit that executes a search according to the search keyword,
wherein the keyword reception unit receives a word as the search keyword, and
the search unit searches for a user ID corresponding to the word based on the first table and the second table.
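One possible reading of Appendix 7 is sketched below in Python. This passage does not state how the degree of relatedness between words is computed or how the search combines the two tables, so both choices here are assumptions: relatedness is taken as the cosine similarity of the words' per-user score vectors, and the search expands the keyword to sufficiently related words before summing each user's scores. All names and data are hypothetical.

```python
import math
from collections import defaultdict

# Hypothetical first table rows: (user_id, word, relation_score).
FIRST_TABLE = [
    ("u001", "sales", 3.0),
    ("u001", "forecast", 4.0),
    ("u002", "forecast", 2.0),
    ("u002", "budget", 2.0),
]


def build_second_table(first_table):
    """Word-pair relatedness as cosine similarity of per-user score vectors (assumed)."""
    vectors = defaultdict(dict)  # word -> {user_id: score}
    for uid, word, score in first_table:
        vectors[word][uid] = score
    norms = {w: math.sqrt(sum(s * s for s in v.values())) for w, v in vectors.items()}
    words = sorted(vectors)
    table = {}
    for i, a in enumerate(words):
        for b in words[i + 1:]:
            dot = sum(s * vectors[b].get(u, 0.0) for u, s in vectors[a].items())
            denom = norms[a] * norms[b]
            table[(a, b)] = dot / denom if denom else 0.0
    return table


def search_users(first_table, second_table, keyword, min_relatedness=0.5):
    """Rank users by summed score over the keyword and its related words."""
    related = {keyword}
    for (a, b), r in second_table.items():
        if r >= min_relatedness and keyword in (a, b):
            related.update((a, b))
    totals = defaultdict(float)
    for uid, word, score in first_table:
        if word in related:
            totals[uid] += score
    return sorted(totals, key=totals.get, reverse=True)


second_table = build_second_table(FIRST_TABLE)
result = search_users(FIRST_TABLE, second_table, "sales")
print(result)  # ['u001', 'u002']
```

In this toy data, user `u002` never touched a "sales" file but is still returned, because "forecast" is closely related to "sales" in the second table. A search over the first table alone would return only `u001`.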

(Appendix 8)
The relation score calculation system according to any one of Appendices 1 to 4 and 7, further comprising:
a first table generation unit that generates a first table describing the relationship among user IDs, words, and relation scores;
a second table generation unit that generates a second table describing degrees of relatedness, each representing the strength of the relation between words, calculated based on the first table;
a keyword reception unit that receives a search keyword; and
a search unit that executes a search according to the search keyword,
wherein the keyword reception unit receives a user ID as the search keyword, and
the search unit searches for a word corresponding to the user ID based on the first table and the second table.

(Appendix 9)
The relation score calculation system according to Appendix 7 or Appendix 8, further comprising:
a clustering unit that clusters words based on the second table;
a word presentation unit that presents the words for each cluster; and
a deletion target reception unit that receives a designation, from among the presented words, of words to be excluded from the first table and the second table,
wherein the first table generation unit regenerates the first table with the designated words excluded, and
the second table generation unit regenerates the second table based on the first table.
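The clustering and exclusion flow of Appendix 9 can be illustrated as follows. The clustering method is not specified in this passage, so the sketch assumes a simple single-link grouping in which two words fall into the same cluster when their relatedness is at or above a threshold. The table contents, the threshold, and the function names are hypothetical.

```python
# Hypothetical second table: (word_a, word_b) -> degree of relatedness.
SECOND_TABLE = {
    ("forecast", "sales"): 0.9,
    ("budget", "tmp"): 0.7,
    ("budget", "sales"): 0.1,
}

# Hypothetical first table rows: (user_id, word, relation_score).
FIRST_TABLE = [
    ("u001", "sales", 1800),
    ("u001", "tmp", 300),
    ("u002", "budget", 900),
]


def cluster_words(second_table, threshold=0.5):
    """Group words linked by relatedness >= threshold (assumed single-link rule)."""
    parent = {}

    def find(w):
        parent.setdefault(w, w)
        while parent[w] != w:
            parent[w] = parent[parent[w]]  # path compression
            w = parent[w]
        return w

    for (a, b), relatedness in second_table.items():
        root_a, root_b = find(a), find(b)
        if relatedness >= threshold and root_a != root_b:
            parent[root_a] = root_b
    clusters = {}
    for w in parent:
        clusters.setdefault(find(w), set()).add(w)
    return sorted(sorted(c) for c in clusters.values())


def exclude_words(first_table, excluded):
    """Regenerate the first table without the words designated for exclusion."""
    return [row for row in first_table if row[1] not in excluded]


clusters = cluster_words(SECOND_TABLE)
print(clusters)  # [['budget', 'tmp'], ['forecast', 'sales']]
cleaned = exclude_words(FIRST_TABLE, {"tmp"})
```

Presenting words cluster by cluster lets a reviewer spot noise words such as the hypothetical "tmp" next to their neighbors. After exclusion, the second table would be rebuilt from the cleaned first table by whatever relatedness calculation the system uses.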

(Appendix 10)
A relation score calculation system comprising:
a collection unit that collects schedule information describing a person, an event name, and the time period during which the person was involved in the event having the event name;
a word extraction unit that extracts words from the event names described in the schedule information; and
a relation score calculation unit that calculates, based on the schedule information, a relation score representing the strength of the relation between a person and a word.

(Appendix 11)
A relation score calculation method comprising:
collecting, from a terminal device, operation logs, which are records of file operations performed by users;
extracting words from the file names described in the operation logs; and
calculating, based on the operation logs, a relation score representing the strength of the relation between a user and a word.

(Appendix 12)
A relation score calculation method comprising:
collecting schedule information describing a person, an event name, and the time period during which the person was involved in the event having the event name;
extracting words from the event names described in the schedule information; and
calculating, based on the schedule information, a relation score representing the strength of the relation between a person and a word.

(Appendix 13)
A relation score calculation program for causing a computer to execute:
a collection process of collecting, from a terminal device, operation logs, which are records of file operations performed by users;
a word extraction process of extracting words from the file names described in the operation logs; and
a relation score calculation process of calculating, based on the operation logs, a relation score representing the strength of the relation between a user and a word.

(Appendix 14)
A relation score calculation program for causing a computer to execute:
a collection process of collecting schedule information describing a person, an event name, and the time period during which the person was involved in the event having the event name;
a word extraction process of extracting words from the event names described in the schedule information; and
a relation score calculation process of calculating, based on the schedule information, a relation score representing the strength of the relation between a person and a word.

INDUSTRIAL APPLICABILITY The present invention is suitably applied to a relation score calculation system that quantifies the strength of the relation between a person and a word.

1 Relation score calculation system
2, 52 Collection unit
3, 53 Word extraction unit
4, 54 Score calculation unit
5 Storage unit
6 Keyword reception unit
7, 17 Search unit
8 Output unit
31 Clustering unit
32 Cluster output unit
33 Exclusion target word reception unit

Claims

A search system comprising:
a keyword reception unit that receives a keyword from a searcher; and
a search unit that searches for a user associated with the keyword based on the sum of the operation times of the files whose file names contain the keyword, among the operation times of the files operated by one user.

2. The search system of claim 1, wherein the search unit searches for a user associated with the keyword based on the ratio of the sum of the one user's operation times for the files whose file names contain the keyword to the sum of the operation times of all users in the organization for those files.

3. The search system according to claim 1, wherein said operation time is a time during which a window showing contents of a file is active.

A search method comprising:
receiving a keyword from a searcher; and
searching for a user associated with the keyword based on the sum of the operation times of the files whose file names contain the keyword, among the operation times of the files operated by one user.

A search program for causing a computer to execute:
a process of receiving a keyword from a searcher; and
a process of searching for a user associated with the keyword based on the sum of the operation times of the files whose file names contain the keyword, among the operation times of the files operated by one user.

A search system comprising:
a keyword reception unit that receives a keyword from a searcher; and
a search unit that searches for a user associated with the keyword based on the sum of the participation times of the events whose event names contain the keyword, among the participation times of one user in events.

The search system of claim 1, wherein the search unit searches for a user associated with the keyword based on the ratio of the sum of the one user's participation times for the events whose event names contain the keyword to the sum of the participation times of all users in the organization for those events.

A search method comprising:
receiving a keyword from a searcher; and
searching for a user associated with the keyword based on the sum of the participation times of the events whose event names contain the keyword, among the participation times of one user in events.

A search program for causing a computer to execute:
a process of receiving a keyword from a searcher; and
a process of searching for a user associated with the keyword based on the sum of the participation times of the events whose event names contain the keyword, among the participation times of one user in events.