JP7472652B2

JP7472652B2 - Classification program, classification method, and classification device

Info

Publication number: JP7472652B2
Application number: JP2020089137A
Authority: JP
Inventors: 修也阿部; 利光鈴木
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2020-05-21
Filing date: 2020-05-21
Publication date: 2024-04-23
Anticipated expiration: 2040-05-21
Also published as: JP2021184164A

Description

本発明は、分類技術に関する。 The present invention relates to classification technology.

従来、例えば、企業の従業員が利用するＰＣ（Personal Computer）のログに基づいて、当該従業員がどのような仕事にどれくらいの時間を費やしているかを把握できるようにする技術が知られている。また、管理システムが、予め設定された事業に関するキーワードに基づいてカレンダーの登録内容を分類する技術も知られている（例えば、特許文献１を参照）。 Conventionally, for example, technology is known that allows a company's employees to understand how much time they spend on what tasks based on logs from their personal computers (PCs). There is also known technology in which a management system classifies calendar entries based on pre-set business-related keywords (see, for example, Patent Document 1).

特開２０１８－１４７１７４号公報JP 2018-147174 A

しかしながら、予め設定されているキーワードに基づく分類では、システムがログを適切に分類できない場合がある。 However, when classifying logs based on pre-defined keywords, the system may not be able to classify them properly.

一側面では、ログを適切に分類することを目的とする。 On one hand, the goal is to properly classify logs.

一つの案では、キーワードに基づく第１分類規則により第１ログを分類する処理と、前記第１ログを分類する処理によって特定のグループに分類された第２ログから、スケジュール情報、通信元情報、通信先情報、及び通信日時情報のうち少なくとも一つに関連する特定の情報を抽出する処理と、前記特定の情報に基づく第２分類規則により第３ログを分類する処理と、をコンピュータに実行させる分類プログラムを提供する。
In one proposal, a classification program is provided that causes a computer to execute the following processes: classifying a first log according to a first classification rule based on keywords; extracting specific information related to at least one of schedule information, communication source information, communication destination information, and communication date and time information from a second log classified into a specific group by the process of classifying the first log; and classifying a third log according to a second classification rule based on the specific information.

一側面によれば、ログが適切に分類される。 In one aspect, logs are properly classified.

実施形態に係る分類システムの構成例を示す図である。FIG. 1 is a diagram illustrating an example of the configuration of a classification system according to an embodiment. 実施形態に係る情報処理装置のハードウェア構成例を示す図である。FIG. 2 is a diagram illustrating an example of a hardware configuration of an information processing device according to an embodiment. 実施形態に係る情報処理装置の機能構成の一例を示す図である。FIG. 2 is a diagram illustrating an example of a functional configuration of an information processing device according to an embodiment. 第１の実施形態に係る情報処理装置において実行される処理の一例を示すフローチャートである。5 is a flowchart illustrating an example of processing executed in the information processing apparatus according to the first embodiment. 第１の実施形態に係る分類規則データの一例について説明する図である。5 is a diagram illustrating an example of classification rule data according to the first embodiment; FIG. 実施形態に係るログの一例について説明する図である。FIG. 4 is a diagram illustrating an example of a log according to the embodiment. 実施形態に係る管理者端末の表示画面の一例について説明する図である。FIG. 13 is a diagram illustrating an example of a display screen of an administrator terminal according to the embodiment. 第２の実施形態に係るサーバにおいて実行される処理の一例を示すフローチャートである。13 is a flowchart illustrating an example of a process executed in a server according to the second embodiment. 第２の実施形態に係る分類規則データの一例について説明する図である。FIG. 13 is a diagram illustrating an example of classification rule data according to the second embodiment.

以下、図面を参照して、本発明の実施形態を説明する。 The following describes an embodiment of the present invention with reference to the drawings.

［第１の実施形態］
第１の実施形態では、分類システム１が、設定されたキーワードに基づいて、ユーザの業務に関するログを分類する。そして、分類システム１が、特定のグループに分類されたログに基づいて、ログの分類に用いる情報を抽出する例について説明する。これにより、分類システム１は、ログを適切に分類することができる。 [First embodiment]
In the first embodiment, the classification system 1 classifies logs related to user operations based on set keywords. An example will be described in which the classification system 1 extracts information used to classify logs based on logs classified into specific groups. This allows the classification system 1 to appropriately classify logs.

また、分類システム１は、例えば、ユーザが業務で使用するＰＣ（Personal Computer）のログ（例えば、操作情報、スケジュール情報、通信情報）を各グループに分類することで、当該ユーザがどのような種別の業務等を行っているかを可視化してもよい。 The classification system 1 may also visualize what type of work a user is performing by classifying into groups the logs (e.g., operation information, schedule information, communication information) of a PC (Personal Computer) that the user uses for work.

＜システム構成＞
図１は、実施形態に係る分類システム１の構成例を示す図である。図１において、分類システム１は、サーバ１０（「分類装置」の一例。）、端末２０Ａ、端末２０Ｂ、端末２０Ｃ（以下で、それぞれを区別する必要がない場合は、単に「端末２０」と称する。）、管理者端末３０を含む。なお、サーバ１０、端末２０、管理者端末３０のそれぞれの数は、図１の例に限定されない。更に、サーバ１０と管理者端末３０とが１つの装置であってもよいし、端末２０の１つと管理者端末３０とが１つの装置であってもよい。 <System Configuration>
FIG. 1 is a diagram showing an example of the configuration of a classification system 1 according to an embodiment. In FIG. 1, the classification system 1 includes a server 10 (an example of a "classification device"), a terminal 20A, a terminal 20B, and a terminal 20C (hereinafter, when there is no need to distinguish between them, they will be simply referred to as "terminals 20"), and an administrator terminal 30. Note that the number of servers 10, terminals 20, and administrator terminals 30 is not limited to the example in FIG. 1. Furthermore, the server 10 and the administrator terminal 30 may be one device, or one of the terminals 20 and the administrator terminal 30 may be one device.

サーバ１０と端末２０と管理者端末３０とは、それぞれ、例えば、インターネット、ＬＡＮ（Local Area Network）、無線ＬＡＮ、または携帯電話網等のネットワークＮにより接続されている。 The server 10, the terminal 20, and the administrator terminal 30 are each connected to a network N, such as the Internet, a LAN (Local Area Network), a wireless LAN, or a mobile phone network.

端末２０、及び管理者端末３０は、例えば、デスクトップＰＣ、ノートＰＣ、タブレットＰＣ、スマートフォン等の情報処理装置（コンピュータ）である。端末２０は、例えば、社員であるユーザが、各種の業務を行うために用いられてもよい。 The terminal 20 and the administrator terminal 30 are, for example, information processing devices (computers) such as desktop PCs, notebook PCs, tablet PCs, and smartphones. The terminal 20 may be used, for example, by employees who perform various tasks.

サーバ１０は、例えば、サーバ用の情報処理装置である。サーバ１０は、端末２０における各種のログを分類する。管理者端末３０は、サーバ１０により分類されたログに基づく情報を表示させる。 The server 10 is, for example, an information processing device for a server. The server 10 classifies various logs in the terminal 20. The administrator terminal 30 displays information based on the logs classified by the server 10.

＜ハードウェア構成＞
図２は、実施形態に係るサーバ１０、端末２０、及び管理者端末３０のハードウェア構成例を示す図である。以下では、サーバ１０を例として説明する。端末２０、及び管理者端末３０のハードウェア構成は、サーバ１０のハードウェア構成と同様でもよい。 <Hardware Configuration>
2 is a diagram showing an example of the hardware configuration of the server 10, the terminal 20, and the administrator terminal 30 according to the embodiment. In the following, the server 10 will be described as an example. The hardware configuration of the terminal 20 and the administrator terminal 30 may be the same as the hardware configuration of the server 10.

図２のサーバ１０は、それぞれバスＢで相互に接続されているドライブ装置１００、補助記憶装置１０２、メモリ装置１０３、ＣＰＵ１０４、インタフェース装置１０５、表示装置１０６、及び入力装置１０７等を有する。 The server 10 in FIG. 2 includes a drive device 100, an auxiliary storage device 102, a memory device 103, a CPU 104, an interface device 105, a display device 106, and an input device 107, all of which are interconnected via a bus B.

サーバ１０での処理を実現するプログラム（分類プログラム）は、記録媒体１０１によって提供されてもよい。この場合、プログラムを記録した記録媒体１０１がドライブ装置１００にセットされると、プログラムが記録媒体１０１からドライブ装置１００を介して補助記憶装置１０２にインストールされてもよい。また、当該プログラムは、例えば、ネットワークを介して他のコンピュータからサーバ１０にダウンロードされ、サーバ１０にインストールされてもよい。 The program (classification program) that realizes the processing in the server 10 may be provided by the recording medium 101. In this case, when the recording medium 101 on which the program is recorded is set in the drive device 100, the program may be installed in the auxiliary storage device 102 from the recording medium 101 via the drive device 100. The program may also be downloaded to the server 10 from another computer via a network, for example, and installed in the server 10.

補助記憶装置１０２は、インストールされたプログラムを格納すると共に、必要なファイルやデータ等を格納する。メモリ装置１０３は、プログラムの起動指示があった場合に、補助記憶装置１０２からプログラムを読み出して格納する。ＣＰＵ１０４は、メモリ装置１０３に格納されたプログラムに従ってサーバ１０に係る機能を実現する。インタフェース装置１０５は、ネットワークに接続するためのインタフェースとして用いられる。 The auxiliary storage device 102 stores the installed program as well as necessary files, data, etc. The memory device 103 reads out and stores the program from the auxiliary storage device 102 when an instruction to start the program is received. The CPU 104 realizes the functions related to the server 10 according to the program stored in the memory device 103. The interface device 105 is used as an interface for connecting to a network.

表示装置１０６はプログラムによるＧＵＩ（Graphical User Interface）等を表示する装置である。表示装置１０６は、例えば、液晶ディスプレイ、タッチパネル、及びプロジェクタ等でもよい。 The display device 106 is a device that displays a GUI (Graphical User Interface) or the like according to a program. The display device 106 may be, for example, a liquid crystal display, a touch panel, a projector, or the like.

入力装置１０７は様々な操作指示の入力を受け付ける。入力装置１０７は、例えば、キーボード、及びマウス等でもよい。 The input device 107 accepts input of various operational instructions. The input device 107 may be, for example, a keyboard, a mouse, etc.

なお、記録媒体１０１の一例としては、ＣＤ－ＲＯＭ、ＤＶＤディスク、又はＵＳＢメモリ等の可搬型の記録媒体が挙げられる。また、補助記憶装置１０２の一例としては、ＨＤＤ（Hard Disk Drive）又はフラッシュメモリ等が挙げられる。記録媒体１０１及び補助記憶装置１０２のいずれについても、コンピュータ読み取り可能な記録媒体に相当する。 Note that examples of the recording medium 101 include portable recording media such as CD-ROMs, DVDs, and USB memory. Examples of the auxiliary storage device 102 include HDDs (Hard Disk Drives) and flash memory. Both the recording medium 101 and the auxiliary storage device 102 correspond to computer-readable recording media.

＜機能構成＞
次に、図３を参照し、実施形態に係るサーバ１０の機能構成について説明する。図３は、実施形態に係るサーバ１０の機能構成の一例を示す図である。 <Functional configuration>
Next, a functional configuration of the server 10 according to the embodiment will be described with reference to Fig. 3. Fig. 3 is a diagram showing an example of a functional configuration of the server 10 according to the embodiment.

サーバ１０は、記憶部１１、受付部１２、取得部１３、分類部１４、抽出部１５、及び出力部１６を有する。これら各部の機能は、サーバ１０にインストールされた１以上のプログラムと、サーバ１０のＣＰＵ１０４等との協働により実現されてもよい。 The server 10 has a memory unit 11, a reception unit 12, an acquisition unit 13, a classification unit 14, an extraction unit 15, and an output unit 16. The functions of each of these units may be realized by cooperation between one or more programs installed on the server 10 and the CPU 104 of the server 10, etc.

記憶部１１は、各種の情報を記憶する。記憶部１１は、例えば、端末２０のログを分類するための規則（条件）の情報を含む分類規則データ５０１を記憶する。 The memory unit 11 stores various types of information. For example, the memory unit 11 stores classification rule data 501 including information on rules (conditions) for classifying logs of the terminal 20.

受付部１２は、管理者が管理者端末３０に入力した各種の情報を、管理者端末３０から受け付ける。受付部１２は、例えば、取得部１３により取得される端末２０のログを分類するための、管理者により入力された特定のキーワード（文字列）に基づく規則（指定規則。「第１分類規則」の一例。）を管理者端末３０から受け付ける。取得部１３は、各種の情報を取得する。取得部１３は、例えば、端末２０のログを取得する。取得部１３は、端末２０のログを端末２０から直接取得してもよい。また、取得部１３は、各端末２０から収集されたログを記憶するデータベースから、端末２０のログを取得してもよい。 The reception unit 12 receives various pieces of information input by the administrator to the administrator terminal 30 from the administrator terminal 30. The reception unit 12 receives, for example, from the administrator terminal 30, rules (designated rules; an example of "first classification rules") based on specific keywords (character strings) input by the administrator for classifying the logs of the terminals 20 acquired by the acquisition unit 13. The acquisition unit 13 acquires various pieces of information. The acquisition unit 13 acquires, for example, the logs of the terminals 20. The acquisition unit 13 may acquire the logs of the terminals 20 directly from the terminals 20. The acquisition unit 13 may also acquire the logs of the terminals 20 from a database that stores logs collected from each terminal 20.

分類部１４は、取得部１３により取得された端末２０のログを、分類用の規則に基づいて、各種別（グループ）に分類する。抽出部１５は、分類部１４により特定の種別に分類されたログに基づいて、端末２０のログを当該特定の種別に分類するための規則で用いる情報を抽出し、抽出した情報に基づく規則を生成する。 The classification unit 14 classifies the logs of the terminal 20 acquired by the acquisition unit 13 into various types (groups) based on the classification rules. The extraction unit 15 extracts information used in the rules for classifying the logs of the terminal 20 into a specific type based on the logs classified into the specific type by the classification unit 14, and generates rules based on the extracted information.

出力部１６は、各種の情報を管理者端末３０に出力する。出力部１６は、例えば、分類部１４による分類結果に基づく情報を管理者端末３０に出力する。 The output unit 16 outputs various information to the administrator terminal 30. For example, the output unit 16 outputs information based on the classification results by the classification unit 14 to the administrator terminal 30.

＜処理＞
次に、図４、図５を参照し、第１の実施形態に係るサーバ１０において実行される処理の一例について説明する。図４は、第１の実施形態に係るサーバ１０において実行される処理の一例を示すフローチャートである。図５は、第１の実施形態に係る分類規則データ５０１の一例について説明する図である。図６は、実施形態に係るログの一例について説明する図である。図７は、実施形態に係る管理者端末３０の表示画面の一例について説明する図である。以下では、ユーザＡが使用する端末２０Ａの第１期間におけるログを分類する例について説明する。 <Processing>
Next, an example of processing executed in the server 10 according to the first embodiment will be described with reference to Figs. 4 and 5. Fig. 4 is a flowchart showing an example of processing executed in the server 10 according to the first embodiment. Fig. 5 is a diagram explaining an example of classification rule data 501 according to the first embodiment. Fig. 6 is a diagram explaining an example of a log according to the embodiment. Fig. 7 is a diagram explaining an example of a display screen of the administrator terminal 30 according to the embodiment. An example of classifying logs in a first period of the terminal 20A used by user A will be described below.

サーバ１０の受付部１２は、分類用の指定規則の登録要求を管理者端末３０から受け付ける（ステップＳ１）。ここで、分類用の指定規則は、例えば、ユーザＡ、及びユーザＡの上司に対するヒアリングに基づいて管理者により抽出されたキーワードに基づく分類の規則でもよい。 The reception unit 12 of the server 10 receives a request to register a designation rule for classification from the administrator terminal 30 (step S1). Here, the designation rule for classification may be, for example, a classification rule based on keywords extracted by the administrator based on interviews with user A and user A's superior.

続いて、サーバ１０の記憶部１１は、分類用の指定規則を、分類規則データ５０１に登録する（ステップＳ２）。図５の分類規則データ５０１の例では、指定規則として、メールの件名に「コンプライアンス」という文字列（キーワード）が含まれる場合、「教育」の種別に分類する等の規則が登録されている。 Next, the storage unit 11 of the server 10 registers the designation rules for classification in the classification rule data 501 (step S2). In the example of the classification rule data 501 in FIG. 5, a designation rule is registered such that if the subject of an email contains the character string (keyword) "compliance," the email is classified as "education."

続いて、サーバ１０の取得部１３は、第１期間における端末２０Ａの各種のログを取得する（ステップＳ３）。ここで、サーバ１０は、例えば、ユーザＡの操作情報、ユーザＡのスケジュール情報、及びユーザＡの通信情報のログを端末２０Ａから取得する。 Then, the acquisition unit 13 of the server 10 acquires various logs of the terminal 20A during the first period (step S3). Here, the server 10 acquires, for example, logs of the operation information of the user A, the schedule information of the user A, and the communication information of the user A from the terminal 20A.

ユーザＡの操作情報には、端末２０Ａに対する操作履歴が含まれる。端末２０Ａに対する操作履歴は、例えば、端末２０Ａの電源ＯＮ／ＯＦＦの日時の情報、端末２０Ａの位置情報、ユーザＡのプレゼンス（入力操作の受け付け状態）に関する情報、各種アプリケーションに対する操作履歴が含まれてもよい。各種アプリケーションには、例えば、文書作成アプリケーション、メールを送受信するアプリケーション（メーラー）、及びインスタントメッセンジャー等が含まれてもよい。アプリケーションに対する操作履歴は、例えば、動作したアプリケーションの名前の情報、アプリケーションの起動／終了の日時の情報、アプリケーションのウィンドウのタイトル（例えば、ファイルのタイトル）の情報が含まれてもよい。 The operation information of user A includes an operation history for terminal 20A. The operation history for terminal 20A may include, for example, information on the date and time when terminal 20A was turned on/off, location information of terminal 20A, information on user A's presence (input operation acceptance status), and operation history for various applications. The various applications may include, for example, a word processing application, an application for sending and receiving e-mail (mailer), and an instant messenger. The operation history for an application may include, for example, information on the name of the application that was run, information on the date and time when the application was started/ended, and information on the title of the application window (for example, the title of the file).

ユーザＡのスケジュール情報には、例えば、スケジュール管理アプリケーションに登録された情報が含まれる。スケジュール情報は、例えば、登録されている出張に関する情報、及び登録されている会議に関する情報が含まれてもよい。登録されている出張に関する情報には、例えば、出張開始日時、出張終了日時、出張先の場所、相手の氏名、及び同行者の氏名等の情報が含まれてもよい。登録されている会議に関する情報には、例えば、会議開始日時、会議終了日時、会議が開催される場所、会議の参加者の氏名、及び会議の参加者の所属する部署名、会議の参加者の職制等の情報が含まれてもよい。 User A's schedule information includes, for example, information registered in a schedule management application. The schedule information may include, for example, information related to registered business trips and information related to registered conferences. The registered information related to business trips may include, for example, information such as the start date and time of the business trip, the end date and time of the business trip, the location of the business trip destination, the names of the people traveling with the business trip, and the names of the people accompanying the business trip. The registered information related to conferences may include, for example, information such as the start date and time of the conference, the end date and time of the conference, the location where the conference is being held, the names of the conference participants, the names of the departments to which the conference participants belong, and the job titles of the conference participants.

ユーザＡの通信情報には、例えば、ユーザＡが送受信した、メール、インスタントメッセージ（チャット）、ユーザＡが行った通話（ＩＰ電話）、オンライン会議の通信履歴が含まれてもよい。通信履歴は、例えば、送信／受信アドレス、送信／受信日時、通話相手、通話日時、会議参加者、会議日時の情報が含まれてもよい。また、ユーザＡの通信情報には、例えば、ユーザＡが閲覧していたＷｅｂサイトの閲覧履歴が含まれてもよい。 User A's communication information may include, for example, emails and instant messages (chats) sent and received by User A, calls (IP telephone calls) made by User A, and a communication history of online conferences. The communication history may include, for example, sending/receiving addresses, sending/receiving dates and times, call recipients, call dates and times, conference participants, and conference dates and times. User A's communication information may also include, for example, a browsing history of websites that User A viewed.

続いて、サーバ１０の分類部１４は、分類規則データ５０１に登録されている指定規則に基づいて、端末２０Ａから取得された第１期間におけるログを分類する（ステップＳ４）。ここで、サーバ１０は、分類の対象となる第１期間におけるログのうち、特定の種別の指定規則に合致するものを、当該特定の種別のログとして分類する。なお、サーバ１０は、例えば、図５に示す分類規則データ５０１に登録されている各種別の順に、各種別の各指定規則に合致するか否かを判定し、ログを分類してもよい。 The classification unit 14 of the server 10 then classifies the logs for the first period acquired from the terminal 20A based on the designation rules registered in the classification rule data 501 (step S4). Here, the server 10 classifies, among the logs for the first period to be classified, those that match the designation rules for a specific type as logs of that specific type. Note that the server 10 may classify the logs by determining whether or not they match each designation rule for each type in the order of the types registered in the classification rule data 501 shown in FIG. 5, for example.

図６には、端末２０Ａにより受信されたメールのうち、図５に示す分類規則データ５０１に登録されている指定規則により「教育」の種別に分類されたメール６０１の例が示されている。サーバ１０は、件名に「コンプライアンス」という文字列が含まれているため、メール６０１を「教育」の種別に分類されている。また、図６の例では、メール６０１の送信元メールアドレス（Ｆｒｏｍ。「送信元情報」の一例。）は「education-no-reply@our-company.example.co.jp」（以下で、「アドレスＡ」とも称する。）であり、送信先メールアドレス（Ｔｏ。「送信先情報」の一例。）は「all-employee@our-company.example.co.jp」（以下で、「アドレスＢ」とも称する。）である。 Figure 6 shows an example of email 601, which has been received by terminal 20A and classified as "education" according to the designated rules registered in classification rule data 501 shown in Figure 5. Server 10 classifies email 601 as "education" because the subject line contains the character string "compliance". In addition, in the example of Figure 6, the sender email address (From. An example of "sender information") of email 601 is "education-no-reply@our-company.example.co.jp" (hereinafter also referred to as "Address A"), and the destination email address (To. An example of "destination information") is "all-employee@our-company.example.co.jp" (hereinafter also referred to as "Address B").

なお、サーバ１０は、図５に示す分類規則データ５０１に登録されている各規則のいずれにも合致しないため、いずれの種別にも分類されなかったログを、「その他」の種別に分類してもよい。 Note that the server 10 may classify logs that do not match any of the rules registered in the classification rule data 501 shown in FIG. 5 and therefore are not classified into any category, into the "other" category.

続いて、サーバ１０の抽出部１５は、指定規則により各種別に分類された各ログに基づいて、分類用の第１追加規則（「第２分類規則」の一例。）の候補を１以上生成する（ステップＳ５）。ここで、サーバ１０は、例えば、指定規則により「教育」の種別に分類されたメール６０１に基づいて、送信元メールアドレスがアドレスＡ（「特定の情報」の一例。）の場合は「教育」の種別に分類するという第１追加規則の候補を生成してもよい。また、サーバ１０は、送信元メールアドレスがアドレスＢの場合は「教育」の種別に分類するという第１追加規則の候補を生成してもよい。また、サーバ１０は、送信元メールアドレスがアドレスＡであり、送信元メールアドレスがアドレスＢである場合は「教育」の種別に分類するという第１追加規則の候補を生成してもよい。 Next, the extraction unit 15 of the server 10 generates one or more candidates for the first additional rule for classification (an example of a "second classification rule") based on each log classified into various types according to the specified rule (step S5). Here, the server 10 may generate a candidate for the first additional rule of classifying the email into the "education" category when the sender email address is address A (an example of "specific information"), based on the email 601 classified into the "education" category according to the specified rule. The server 10 may also generate a candidate for the first additional rule of classifying the email into the "education" category when the sender email address is address B. The server 10 may also generate a candidate for the first additional rule of classifying the email into the "education" category when the sender email address is address A and the sender email address is address B.

なお、サーバ１０は、指定規則により分類された端末２０Ａのログに含まれる各項目のうち、例えば、時間の経過による変化の頻度が比較的少ない特定の項目の情報に基づいて、第１追加規則の候補を生成してもよい。この場合、時間の経過による変化の頻度が比較的少ない項目は、例えば、管理者によりサーバ１０に予め設定されてもよい。また、時間の経過による変化の頻度が比較的少ない項目は、例えば、ディープラーニング等を用いるＡＩ（Artificial Intelligence）により決定（推論）されてもよい。 The server 10 may generate candidates for the first additional rule based on information on a specific item that changes relatively infrequently over time among the items included in the log of the terminal 20A classified according to the specified rule. In this case, the items that change relatively infrequently over time may be set in advance in the server 10 by an administrator, for example. In addition, the items that change relatively infrequently over time may be determined (inferred) by AI (Artificial Intelligence) using deep learning, etc., for example.

続いて、サーバ１０の分類部１４は、生成された第１追加規則の候補に基づいて、端末２０から取得された第１期間におけるログを分類する（ステップＳ６）。なお、生成された第１追加規則の候補が複数ある場合、サーバ１０は、各候補に基づいて、端末２０から取得されたログをそれぞれ分類する。 Then, the classification unit 14 of the server 10 classifies the logs acquired from the terminal 20 for the first period based on the generated candidates for the first additional rule (step S6). Note that if there are multiple candidates for the generated first additional rule, the server 10 classifies each of the logs acquired from the terminal 20 based on each candidate.

続いて、サーバ１０の抽出部１５は、指定規則による分類の結果と、第１追加規則の候補による分類の結果とに基づいて、各候補のうち、第１追加規則とするものを決定する（ステップＳ７）。ここで、サーバ１０は、第１追加規則の候補により特定の種別に分類されるログが、指定規則により特定の種別に分類される第１度合いが閾値以上となる場合、当該候補を第１追加規則として決定する。この場合、サーバ１０は、第１度合いを、第１追加規則の候補により特定の種別に分類されたログ（以下で、「ログＢ」とも称する。）が、指定規則により当該特定の種別に分類されたログ（以下で、「ログＡ」とも称する。）に含まれる度合い（部分集合となっている度合い）に基づいて決定してもよい。そして、サーバ１０は、第１度合いが第１閾値以上である場合、当該候補を第１追加規則として決定してもよい。 Then, the extraction unit 15 of the server 10 determines which of the candidates is to be the first additional rule based on the result of classification according to the specified rule and the result of classification according to the candidate for the first additional rule (step S7). Here, if the first degree of classification of a log classified into a specific type by the candidate for the first additional rule into a specific type by the specified rule is equal to or greater than a threshold, the server 10 determines the candidate as the first additional rule. In this case, the server 10 may determine the first degree based on the degree to which a log classified into a specific type by the candidate for the first additional rule (hereinafter also referred to as "log B") is included in a log classified into the specific type by the specified rule (hereinafter also referred to as "log A") (the degree to which the log is a subset). Then, if the first degree is equal to or greater than a first threshold, the server 10 may determine the candidate as the first additional rule.

この場合、サーバ１０は、例えば、ログＡとログＢとで共通するログの数を、ログＡに含まれないログＢの数で除算した値を、第１度合いの値してもよい。この場合、第１度合いの逆数Ｅ（以下で、適宜「誤り率Ｅ」とも称する。）は、以下の式（１）で表すことができる。式（１）において、ＡはログＡの集合、ＢはログＢの集合を示す。 In this case, the server 10 may determine the value of the first degree as, for example, the number of logs common to log A and log B divided by the number of logs B that are not included in log A. In this case, the reciprocal E of the first degree (hereinafter also referred to as "error rate E" as appropriate) can be expressed by the following formula (1). In formula (1), A represents the set of logs A, and B represents the set of logs B.

Ｅ＝（Ｂ－Ａ）／Ａ∩Ｂ・・・（１）
続いて、サーバ１０の記憶部１１は、決定した第１追加規則を、図５に示す分類規則データ５０１に登録する（ステップＳ８）。なお、図５の例では、送信元メールアドレスがアドレスＡである場合、「教育」の種別に分類するという第１追加規則が登録されている。 E = (B - A) / A ∩ B ... (1)
Next, the storage unit 11 of the server 10 registers the determined first additional rule in the classification rule data 501 shown in Fig. 5 (step S8). In the example of Fig. 5, the first additional rule is registered such that if the sender email address is address A, the email is classified into the "education" category.

続いて、サーバ１０の分類部１４は、分類規則データ５０１に登録されている第１追加規則に基づいて、端末２０Ａの第１期間より後の各期間におけるログを分類する（ステップＳ９）。ここで、サーバ１０は、第１期間より後の各期間における端末２０Ａのログを、分類規則データ５０１に登録されている各規則に基づいて分類してもよい。この場合、サーバ１０は、まず、各期間における端末２０Ａのログを、指定規則に基づいて分類してもよい。そして、サーバ１０は、各期間における端末２０Ａのログのうち、指定規則によりいずれの種別にも分類されなかったログ（「その他」の種別に分類されたログ）を、第１追加規則に基づいて再分類してもよい。これにより、サーバ１０は、例えば、ユーザＡへのヒアリング等に基づいて設定されたキーワードでは分類できないログ等を、適切に分類することができる。例えば、ｅ－ｌｅａｒｎｉｎｇの講座名等に含まれる特定のキーワードを用いた指定規則により「教育」に分類されたメールがアドレスＡから送信されていた場合、サーバ１０は、送信元メールアドレス等を用いた分類規則を生成する。これにより、サーバ１０は、講座名に当該特定のキーワードが含まれない講座のメールも、適切に分類することができる。また、例えば、ユーザＡが受講している講座がコンプライアンスの教育からセキュリティの教育に変化し、講座のメールの件名に「コンプライアンス」という文字列が含まれないように変化した場合でも、送信元メールアドレスがアドレスＡであるため、追加規則により「教育」に分類される。 Next, the classification unit 14 of the server 10 classifies the logs of the terminal 20A in each period after the first period based on the first additional rule registered in the classification rule data 501 (step S9). Here, the server 10 may classify the logs of the terminal 20A in each period after the first period based on each rule registered in the classification rule data 501. In this case, the server 10 may first classify the logs of the terminal 20A in each period based on the specified rule. Then, the server 10 may reclassify the logs of the terminal 20A in each period that were not classified into any type by the specified rule (logs classified into the type "other") based on the first additional rule. This allows the server 10 to appropriately classify logs that cannot be classified by keywords set based on, for example, a hearing with the user A. For example, if an email classified as "education" based on a specified rule using a specific keyword included in the name of an e-learning course was sent from the address A, the server 10 generates a classification rule using the sender's email address, etc. This allows the server 10 to properly classify emails for courses that do not include the specific keyword in the course name. Also, for example, if the course that user A is taking changes from compliance training to security training, and the subject of the email for the course changes so that it no longer includes the character string "compliance," the email will still be classified as "education" according to the additional rule because the sender email address is address A.

また、サーバ１０は、第１期間における端末２０Ａのログを、分類規則データ５０１に登録されている各規則に基づいて分類してもよい。 The server 10 may also classify the logs of the terminal 20A during the first period based on each rule registered in the classification rule data 501.

例えば、講座名等に含まれる特定のキーワードを用いた指定規則により「教育」に分類されたメールがアドレスＡから送信されていた場合、サーバ１０は、時間の経過による変化の頻度が比較的少ないと考えられる送信元メールアドレス等を用いた分類規則を生成する。これにより、例えば、「教育」に分類するための当該特定のキーワードを講座名に含む講座が終了する等により、別の講座名に変更された場合であっても、サーバ１０はログを適切に分類することができる。また、例えば、「顧客対応」に分類するための特定のキーワードを含む会社名が変更された場合であっても、サーバ１０はログを適切に分類することができる。 For example, if an email classified as "education" according to a designation rule using a specific keyword contained in the course name, etc., was sent from address A, server 10 generates a classification rule using the sender email address, etc., which is thought to change relatively infrequently over time. This allows server 10 to classify logs appropriately, for example, even if a course that includes the specific keyword in its name for classification as "education" ends and the course name is changed to another one. Also, for example, server 10 can classify logs appropriately, even if a company name that includes a specific keyword for classification as "customer service" is changed.

（分類結果の出力について）
サーバ１０の出力部１６は、端末２０Ａの所定期間におけるログの分類結果等に基づく情報を管理者端末３０に出力する。図７の例では、管理者端末３０は、サーバ１０の出力部１６により出力された情報に基づき、当該所定期間におけるユーザＡの各分類の作業時間の比率を示す円グラフ７０１を画面に表示させている。円グラフ７０１において、領域７０２の面積は、「組織業務」の作業時間の比率を示している。同様に、領域７０３から領域７０７の各面積は、それぞれ、「社内対応」、「顧客対応」、「教育」、「その他」、「業務外」の作業時間の比率を示している。これにより、例えば、ユーザＡの働き方の改善等に役立てることができる。 (About output of classification results)
The output unit 16 of the server 10 outputs information based on the classification results of the logs of the terminal 20A for a predetermined period to the administrator terminal 30. In the example of FIG. 7, the administrator terminal 30 displays a pie chart 701 showing the ratio of the work time of each classification of the user A for the predetermined period on the screen based on the information output by the output unit 16 of the server 10. In the pie chart 701, the area of the area 702 indicates the ratio of the work time of the "organizational work". Similarly, the areas of the areas 703 to 707 respectively indicate the ratio of the work time of the "internal correspondence", "customer correspondence", "education", "other", and "non-work". This can be useful for improving the way the user A works, for example.

なお、サーバ１０の出力部１６は、端末２０Ａの所定期間におけるログに含まれるユーザＡの操作情報に基づいて、各分類の作業時間を算出してもよい。この場合、サーバ１０は、例えば、インスタントメッセンジャーのプレゼンス機能によりユーザＡが「在席中」であり、「教育」に分類されたメールを表示するウィンドウにフォーカスが付与されている時間長を、「教育」の作業時間と判定してもよい。なお、端末２０Ａは、ユーザＡによりウィンドウがマウス等でクリックされた場合、当該ウィンドウにフォーカスを付与し、当該ウィンドウに対してキーボード等の操作による入力を受け付ける。 The output unit 16 of the server 10 may calculate the task time for each category based on the operation information of user A contained in the log for a specified period of time of terminal 20A. In this case, the server 10 may determine the task time for "education" as the length of time during which user A is "present" by the presence function of the instant messenger and the window displaying the email classified as "education" has focus. When user A clicks on a window with a mouse or the like, terminal 20A focuses on the window and accepts input to the window via keyboard or other operations.

［第２の実施形態］
第２の実施形態では、第１の実施形態と同様に、サーバ１０が、特定のグループに分類されたログに基づいて、ユーザの業務に関するログを当該特定のグループに分類するための第１情報を抽出する。そして、サーバ１０は、第１情報に基づいてユーザの業務に関するログを分類し、第１情報により特定のグループに分類されたログに基づいて、ユーザの業務に関するログを当該特定のグループに分類するための第２情報を抽出する例について説明する。これにより、サーバ１０は、各種のログを適切に分類することができる。なお、第２の実施形態は一部を除いて第１の実施形態と同様であるため、適宜説明を省略する。以下では、第１の実施形態と共通する部分については説明を省略し、異なる部分についてのみ説明する。なお、サーバ１０は、第１の実施形態の各処理と、第２の実施形態の各処理とを、適宜組み合わせて行うことができる。 Second Embodiment
In the second embodiment, similarly to the first embodiment, the server 10 extracts first information for classifying logs related to the user's work into a specific group based on the logs classified into the specific group. Then, the server 10 classifies logs related to the user's work based on the first information, and extracts second information for classifying logs related to the user's work into the specific group based on the logs classified into the specific group by the first information. This allows the server 10 to appropriately classify various logs. Since the second embodiment is similar to the first embodiment except for a part, the description will be omitted as appropriate. In the following, the description of the parts common to the first embodiment will be omitted, and only the different parts will be described. The server 10 can perform each process of the first embodiment and each process of the second embodiment in appropriate combination.

＜処理＞
次に、図８を参照し、第２の実施形態に係るサーバ１０において実行される処理の一例について説明する。図８は、第２の実施形態に係るサーバ１０において実行される処理の一例を示すフローチャートである。図９は、第２の実施形態に係る分類規則データ５０１Ａの一例について説明する図である。 <Processing>
Next, an example of processing executed in the server 10 according to the second embodiment will be described with reference to Fig. 8. Fig. 8 is a flowchart showing an example of processing executed in the server 10 according to the second embodiment. Fig. 9 is a diagram for explaining an example of classification rule data 501A according to the second embodiment.

サーバ１０は、図４の各処理が実行された後、一定時間が経過した場合に図８の処理を実行してもよい。この場合、サーバ１０は、図４のステップＳ１からステップＳ７の処理が実行された後、所定の種別における指定規則、及び第１追加規則での分類の精度が低下した場合に、当該所定の種別に対して図８の処理を実行してもよい。 The server 10 may execute the process of FIG. 8 when a certain time has elapsed after each process of FIG. 4 has been executed. In this case, the server 10 may execute the process of FIG. 8 for a specific type when the accuracy of classification using the specified rule and the first additional rule for the specific type has decreased after the processes of steps S1 to S7 of FIG. 4 have been executed.

この場合、サーバ１０は、例えば、運用当初（例えば、第１期間等を含む期間）のログが、各規則（指定規則、及び第１追加規則）により当該所定の種別へ分類される数（ログ数、件数）と、第１期間よりも後の第２期間のログが、当該各規則により当該所定の種別へ分類される数とに基づいて、図８の処理を実行してもよい。この場合、サーバ１０は、例えば、運用当初のログが各規則により当該所定の種別へ分類される割合と、第１期間よりも後の第２期間のログが各規則により当該所定の種別へ分類される割合との比の値が閾値以上になった場合、図８の処理を実行してもよい。また、この場合、サーバ１０は、分類対象のログの件数と、いずれの種別にも分類できないログ（「その他」の種別へ分類されたログ）の件数との比の値が閾値以上になった場合、図８の処理を実行してもよい。 In this case, the server 10 may execute the process of FIG. 8 based on, for example, the number (number of logs, number of cases) of logs at the beginning of operation (for example, a period including the first period, etc.) classified into the specified type by each rule (designated rule and first additional rule) and the number of logs in the second period after the first period classified into the specified type by each rule. In this case, the server 10 may execute the process of FIG. 8 when, for example, the ratio value between the proportion of logs at the beginning of operation classified into the specified type by each rule and the proportion of logs in the second period after the first period classified into the specified type by each rule becomes equal to or greater than a threshold value. In addition, in this case, the server 10 may execute the process of FIG. 8 when the ratio value between the number of logs to be classified and the number of logs that cannot be classified into any type (logs classified into the "other" type) becomes equal to or greater than a threshold value.

以下では、第１期間よりも後の第２期間における端末２０Ａのログを分類する例について説明する。 The following describes an example of classifying the logs of terminal 20A in a second period that is later than the first period.

サーバ１０の取得部１３は、第２期間における端末２０Ａの各種のログを取得する（ステップＳ１０１）。続いて、サーバ１０の分類部１４は、分類規則データ５０１に登録されている第１追加規則に基づいて、端末２０Ａの第２期間におけるログを分類する（ステップＳ１０２）。 The acquisition unit 13 of the server 10 acquires various logs of the terminal 20A during the second period (step S101). Next, the classification unit 14 of the server 10 classifies the logs of the terminal 20A during the second period based on the first additional rule registered in the classification rule data 501 (step S102).

続いて、サーバ１０の抽出部１５は、第１追加規則により分類されたログに基づいて、分類用の第２追加規則（「第３分類規則」の一例。）の候補を１以上生成する（ステップＳ１０３）。ここで、サーバ１０は、例えば、ステップＳ１０２の処理により各種別に分類された各ログに含まれる文章を形態素解析することにより名詞の単語を抽出してもよい。そして、サーバ１０は、抽出した単語（キーワード）に基づく分類規則を、第２追加規則の候補としてもよい。 Next, the extraction unit 15 of the server 10 generates one or more candidates for a second additional rule for classification (an example of a "third classification rule") based on the logs classified according to the first additional rule (step S103). Here, the server 10 may extract noun words, for example, by morphologically analyzing sentences included in each log classified into various types by the processing of step S102. Then, the server 10 may set a classification rule based on the extracted words (keywords) as a candidate for the second additional rule.

続いて、サーバ１０の分類部１４は、第２追加規則の候補に基づいて、端末２０Ａのログを分類する（ステップＳ１０４）。続いて、サーバ１０の抽出部１５は、指定規則による端末２０Ａのログの分類の結果と、第２追加規則の候補による端末２０Ａのログの分類の結果とに基づいて、各候補のうち、第２追加規則とするものを決定する（ステップＳ１０５）。これにより、サーバ１０は、時間の経過による変化の頻度が比較的少ない項目の情報を用いる第１追加規則による分類結果に基づいて、時間の経過により変化した後のキーワード等による規則の候補を生成できる。 The classification unit 14 of the server 10 then classifies the log of the terminal 20A based on the candidates for the second additional rule (step S104). The extraction unit 15 of the server 10 then determines which of the candidates will be the second additional rule based on the results of classifying the log of the terminal 20A according to the specified rule and the results of classifying the log of the terminal 20A according to the candidates for the second additional rule (step S105). This allows the server 10 to generate candidates for rules based on keywords, etc., that have changed over time based on the results of classification according to the first additional rule, which uses information on items that change relatively infrequently over time.

ここで、サーバ１０は、ステップＳ１０２の処理により種別Ａに分類されたログにおける単語の出現頻度と、種別Ａ以外の各種別に分類されたログにおける当該単語の出現頻度とに基づいて、当該単語を第２追加規則に用いる単語として抽出するか否かを判定してもよい。この場合、サーバ１０は、ステップＳ１０２の処理により種別Ａに分類されたログにおける出現頻度が比較的高く、種別Ａ以外の各種別に分類されたログにおける出現頻度が比較的低い単語を抽出してもよい。そして、サーバ１０は、当該単語に基づく分類規則を、種別Ａに分類するための第２追加規則として決定してもよい。 Here, the server 10 may determine whether or not to extract the word as a word to be used in the second additional rule based on the frequency of occurrence of the word in logs classified into type A by the processing of step S102 and the frequency of occurrence of the word in logs classified into various types other than type A. In this case, the server 10 may extract a word that appears relatively frequently in logs classified into type A by the processing of step S102 and relatively low frequently in logs classified into various types other than type A. Then, the server 10 may determine the classification rule based on the word as the second additional rule for classifying into type A.

この場合、サーバ１０は、例えば、以下の式（２）により、キーワードの候補である各単語に対するスコアＳを算出し、第２閾値以上のスコアＳを有する単語に基づく分類規則を、第２追加規則の候補としてもよい。式（２）において、ｉは各単語のインデックスを示す。また、ｊは各種別のインデックスを示す。 In this case, the server 10 may calculate a score S for each word that is a keyword candidate, for example, using the following formula (2), and may set a classification rule based on a word that has a score S equal to or greater than the second threshold as a candidate for the second additional rule. In formula (2), i indicates the index of each word, and j indicates the index for each type.

Ｓ_ｉ＝ α_ｉ，ｊ＋ｔｆｉｄｆ_ｉ，ｊ・・・（２）
ここで、α_ｉ，ｊは、インデックスｉにより示される単語ｔ_ｉが、インデックスｊにより示される種別ｄ_ｊに分類されたログにのみ出現する場合は定数Ａ（例えば、１）であり、それ以外の場合は定数Ａよりも小さい定数Ｂ（例えば、０）である。また、ｔｆｉｄｆ_ｉ，ｊは、以下の式（３）により算出される。 S _i = α _i,j + t fidf _i,j ... (2)
Here, α _i,j is a constant A (e.g., 1) when the word t _i indicated by index i appears only in logs classified into the type d _j indicated by index j, and is a constant B (e.g., 0) smaller than the constant A otherwise. tfidf _i,j is calculated by the following formula (3).

ｔｆｉｄｆ_ｉ，ｊ＝ｔｆ_ｉ，ｊ・ｉｄｆ_ｊ・・・（３）
ここで、ｔｆ_ｉ，ｊは、種別ｄ_ｊにおける単語ｔ_ｉの出現頻度である。ｔｆ_ｉ，ｊは、例えば、種別ｄ_ｊにおける単語ｔ_ｉの出現回数を、種別ｄ_ｊにおける全ての単語の出現回数の和の値で除算した値でもよい。また、ｉｄｆ_ｊは、種別の総数を、単語ｔ_ｉを含む種別の数で除算した値の対数（ｌｏｇ）でもよい。 tfidf _i,j = tfi _,j · idf _j ... (3)
Here, tf _i,j is the frequency of occurrence of word t _i in type d _j . tf _i,j may be, for example, the value obtained by dividing the number of occurrences of word t _i in type d _j by the sum of the number of occurrences of all words in type d _j . Also, idf _j may be the logarithm (log) of the value obtained by dividing the total number of types by the number of types that include word t _i .

（第２閾値について）
サーバ１０は、指定規則に設定されているキーワードの運用当初（例えば、第１期間等を含む期間）のログに対するスコアＳと、第２追加規則の候補に用いられるキーワードの第２期間のログに対するスコアＳとが同様の程度となるように、第２閾値を決定してもよい。これにより、例えば、サーバ１０は、ヒアリングに基づいて設定されたキーワードと同様の精度で、一定時間経過後のログを分類可能なキーワードを決定できる。 (Regarding the second threshold)
The server 10 may determine the second threshold value so that the score S of the keyword set in the specified rule for the log at the beginning of operation (for example, a period including the first period, etc.) is similar to the score S of the keyword used as a candidate for the second additional rule for the log in the second period. This allows the server 10 to determine keywords that can classify logs after a certain time has passed with the same accuracy as the keywords set based on the hearing, for example.

この場合、サーバ１０は、以下のような処理を行ってもよい。まず、サーバ１０は、第２期間よりも前の期間における端末２０Ａのログを取得する。そして、サーバ１０は、第１追加規則に基づいて、端末２０Ａの第２期間よりも前の期間におけるログを分類する。そして、サーバ１０は、第１追加規則による分類結果に基づいて、指定規則に設定されている種別Ａの各キーワードに対するスコアＳを、上述した式（２）によりそれぞれ算出する。そして、サーバ１０は、種別Ａの各キーワードに対するスコアＳの代表値（例えば、平均値、最頻値、または中央値）に所定の係数を乗算等した値を、種別Ａに対する第２閾値として決定する。そして、サーバ１０は、種別Ａ以外の各種別についても、同様に、第２閾値を決定する。 In this case, the server 10 may perform the following process. First, the server 10 acquires the logs of the terminal 20A for the period before the second period. Then, the server 10 classifies the logs of the terminal 20A for the period before the second period based on the first additional rule. Then, the server 10 calculates the score S for each keyword of type A set in the specified rule based on the classification result by the first additional rule using the above-mentioned formula (2). Then, the server 10 determines a value obtained by multiplying a representative value (e.g., average value, mode, or median) of the scores S for each keyword of type A by a predetermined coefficient, as the second threshold value for type A. Then, the server 10 similarly determines the second threshold value for each type other than type A.

続いて、サーバ１０の抽出部１５は、第１追加規則の候補と、第２追加規則の候補とに基づいて、分類用の第３追加規則（「第３分類規則」の一例。）の候補を１以上生成する（ステップＳ１０６）。ここで、サーバ１０は、第１追加規則の候補の規則と、第２追加規則の候補の規則とを組み合わせた規則を、第３追加規則の候補としてもよい。この場合、サーバ１０は、例えば、図４のステップＳ７の処理で算出した式（１）誤り率Ｅが閾値以下である第１追加規則の候補と、ステップＳ１０５の処理で算出した式（２）のスコアＳが閾値以上である第２追加規則の候補の規則とを組み合わせた規則を、第３追加規則の候補としてもよい。 Then, the extraction unit 15 of the server 10 generates one or more candidates for a third additional rule for classification (an example of a "third classification rule") based on the candidates for the first additional rule and the candidates for the second additional rule (step S106). Here, the server 10 may set a rule that combines a candidate for the first additional rule with a candidate for the second additional rule as a candidate for the third additional rule. In this case, the server 10 may set a candidate for the third additional rule as a rule that combines, for example, a candidate for the first additional rule whose error rate E of formula (1) calculated in the process of step S7 in FIG. 4 is equal to or less than a threshold, and a candidate for the second additional rule whose score S of formula (2) calculated in the process of step S105 is equal to or greater than a threshold.

続いて、サーバ１０の抽出部１５は、各候補のうち、第３追加規則とするものを決定する（ステップＳ１０７）。ここで、サーバ１０は、例えば、第３追加規則の候補に含まれる第１追加規則の候補に対するスコアＳの値を、当該第３追加規則の候補に含まれる第１追加規則の候補に対する誤り率Ｅの値で除算したスコアＳ_２（Ｓ_２＝Ｓ／Ｅ）を算出してもよい。そして、サーバ１０は、スコアＳ_２が閾値以上である場合のみ、当該第３追加規則の候補を第３追加規則として決定してもよい。 Next, the extraction unit 15 of the server 10 determines which of the candidates is to be the third additional rule (step S107). Here, the server 10 may calculate a score S2 (S2 = S/E) by dividing the value of the score S for the candidate for the first additional rule included in the candidates for the third additional rule by the value of the error rate E for the candidate for the first additional rule included in the candidates for the third additional rule. Then, the server 10 may determine the candidate for the _third _additional rule as the third additional rule only when the score _S2 is equal to or greater than a threshold value.

続いて、サーバ１０の記憶部１１は、決定した第２追加規則、及び第３追加規則を、図９に示す分類規則データ５０１Ａに登録する（ステップＳ１０８）。続いて、サーバ１０の分類部１４は、分類規則データ５０１に登録されている第２追加規則、及び第３追加規則のそれぞれに基づいて、端末２０Ａのログを分類する（ステップＳ１０９）。これにより、例えば、端末２０から取得されたログのうち、指定規則に基づいて分類されず、第１追加規則に基づいても分類されないログを、第２追加規則、または第３追加規則に基づいて分類させることができる。なお、サーバ１０は、例えば、第２追加規則を用いないようにしてもよい。 Then, the storage unit 11 of the server 10 registers the determined second and third additional rules in the classification rule data 501A shown in FIG. 9 (step S108). The classification unit 14 of the server 10 then classifies the logs of the terminal 20A based on the second and third additional rules registered in the classification rule data 501 (step S109). This allows, for example, logs acquired from the terminal 20 that are not classified based on the specified rule and are not classified based on the first additional rule to be classified based on the second or third additional rule. Note that the server 10 may not use the second additional rule, for example.

なお、図９に示す分類規則データ５０１Ａの例では、メールの件名に「法令遵守」という文字列が含まれる場合、「教育」の種別に分類するという第２追加規則が登録されている。また、メールの件名に「優秀活動事例」という文字列が含まれる、かつ、送信先メールアドレスがアドレスＢである場合、「教育」の種別に分類するという第３追加規則が登録されている。 In the example of classification rule data 501A shown in FIG. 9, a second additional rule is registered such that if the subject of an email contains the character string "legal compliance", the email is classified as "education". Also, a third additional rule is registered such that if the subject of an email contains the character string "excellent activity example" and the destination email address is address B, the email is classified as "education".

（分類用の規則の適用順について）
サーバ１０は、分類規則データ５０１に登録されている各規則を、所定の順番で適用して、端末２０のログを分類してもよい。この場合、サーバ１０は、例えば、まず、指定規則に含まれる各規則を、当該各規則に対して予め設定されている所定の順番で適用してログを分類してもよい。そして、サーバ１０は、第１追加に含まれる各規則を、図４のステップＳ７の処理で算出した式（１）の誤り率Ｅが小さい順番で適用してログを分類してもよい。 (Regarding the order of application of classification rules)
The server 10 may classify the logs of the terminal 20 by applying the rules registered in the classification rule data 501 in a predetermined order. In this case, the server 10 may, for example, first classify the logs by applying the rules included in the specified rules in a predetermined order that is preset for each rule. Then, the server 10 may classify the logs by applying the rules included in the first addition in the order of the smallest error rate E of formula (1) calculated in the process of step S7 in FIG. 4.

そして、サーバ１０は、第２追加に含まれる各規則を、ステップＳ１０５の処理で算出した式（２）のスコアＳが小さい順番で適用してログを分類してもよい。そして、サーバ１０は、第３追加に含まれる各規則を、ステップＳ１０７の処理で算出したスコアＳ_２が小さい順番で適用してログを分類してもよい。 The server 10 may then classify the logs by applying each rule included in the second addition in ascending order of the score S of formula (2) calculated in the process of step S105. The server 10 may then classify the logs by applying each rule included in the third addition in ascending order of the score _S2 calculated in the process of step S107.

＜変形例＞
サーバ１０の各機能部は、例えば１以上のコンピュータにより提供されるクラウドコンピューティングにより実現されていてもよい。 <Modification>
Each functional unit of the server 10 may be realized, for example, by cloud computing provided by one or more computers.

以上、本発明の実施例について詳述したが、本発明は斯かる特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。 Although the above describes in detail an embodiment of the present invention, the present invention is not limited to such a specific embodiment, and various modifications and variations are possible within the scope of the gist of the present invention described in the claims.

以上の説明に関し、更に以下の項を開示する。
（付記１）
キーワードに基づく第１分類規則により第１ログを分類する処理と、
前記第１ログを分類する処理によって特定のグループに分類された第２ログから、操作情報、スケジュール情報、及び通信情報のうち少なくとも一つに関連する特定の情報を抽出する処理と、
前記特定の情報に基づく第２分類規則により第３ログを分類する処理と、
をコンピュータに実行させることを特徴とする分類プログラム。
（付記２）
抽出された前記特定の情報と、前記特定のグループの情報とに基づいて、前記第２分類規則を生成する処理、
を前記コンピュータに実行させることを特徴とする付記１に記載の分類プログラム。
（付記３）
前記第３ログを分類する処理は、前記第３ログが前記第２分類規則に合致する場合、前記第３ログを前記特定のグループに分類する処理を含む、
ことを特徴とする付記１または２に記載の分類プログラム。
（付記４）
前記第２分類規則により前記第１ログを分類した場合に前記特定のグループに分類される第４ログのうち、前記第２ログと共通のログの割合は閾値以上である、
ことを特徴とする付記１乃至３のいずれか一項に記載の分類プログラム。
（付記５）
前記特定の情報は、前記通信情報に含まれるメッセージの送信元情報とメッセージの送信先情報との少なくとも一方の情報を含む、
ことを特徴とする付記１乃至４のいずれか一項に記載の分類プログラム。
（付記６）
前記特定のグループに分類されたログにおける単語の出現頻度と、前記特定のグループ以外のグループに分類されたログにおける単語の出現頻度とに基づいて、特定の単語を抽出する処理と、
抽出された前記特定の単語に基づく第３分類規則により第４ログを分類する処理と、
を前記コンピュータに実行させることを特徴とする付記１乃至５のいずれか一項に記載の分類プログラム。
（付記７）
前記特定の単語を抽出する処理は、第１期間に取得されたログのうち前記特定のグループに分類されるログの数と、前記第１期間よりも後の第２期間に取得されたログのうち前記特定のグループに分類されるログの数との比較に基づいて、実行される、
ことを特徴とする付記６に記載の分類プログラム。
（付記８）
キーワードに基づく第１分類規則により第１ログを分類する処理と、
前記第１ログを分類する処理によって特定のグループに分類された第２ログから、操作情報、スケジュール情報、及び通信情報のうち少なくとも一つに関連する特定の情報を抽出する処理と、
前記特定の情報に基づく第２分類規則により第３ログを分類する処理と、
をコンピュータが実行する分類方法。
（付記９）
キーワードに基づく第１分類規則により第１ログを分類する分類部と、
前記第１ログを分類する処理によって特定のグループに分類された第２ログから、操作情報、スケジュール情報、及び通信情報のうち少なくとも一つに関連する特定の情報を抽出する抽出部と、を備え、
前記分類部は、前記特定の情報に基づく第２分類規則に合致するログを前記特定のグループに分類する、分類装置。 In addition to the above description, the following items are disclosed.
(Appendix 1)
A process of classifying the first log according to a first classification rule based on a keyword;
A process of extracting specific information related to at least one of operation information, schedule information, and communication information from the second logs classified into the specific groups by the process of classifying the first logs;
A process of classifying the third log according to a second classification rule based on the specific information;
A classification program characterized by causing a computer to execute the above steps.
(Appendix 2)
generating the second classification rule based on the extracted specific information and information of the specific group;
The classification program according to claim 1, characterized in that the classification program is executed by the computer.
(Appendix 3)
The process of classifying the third log includes a process of classifying the third log into the specific group when the third log matches the second classification rule.
3. The classification program according to claim 1 or 2.
(Appendix 4)
a ratio of logs common to the second log among the fourth logs classified into the specific group when the first log is classified according to the second classification rule is equal to or greater than a threshold value;
4. The classification program according to claim 1,
(Appendix 5)
The specific information includes at least one of source information of a message included in the communication information and destination information of the message.
5. The classification program according to claim 1,
(Appendix 6)
A process of extracting a specific word based on the frequency of appearance of a word in the log classified into the specific group and the frequency of appearance of a word in the log classified into a group other than the specific group;
A process of classifying the fourth log according to a third classification rule based on the extracted specific word;
6. The classification program according to claim 1, wherein the classification program is executed by the computer.
(Appendix 7)
the process of extracting the specific word is executed based on a comparison between the number of logs classified into the specific group among logs acquired in a first period and the number of logs classified into the specific group among logs acquired in a second period that is later than the first period.
7. The classification program according to claim 6,
(Appendix 8)
A process of classifying the first log according to a first classification rule based on a keyword;
a process of extracting specific information related to at least one of operation information, schedule information, and communication information from the second logs classified into the specific groups by the process of classifying the first logs;
A process of classifying the third log according to a second classification rule based on the specific information;
A classification method performed by a computer.
(Appendix 9)
a classification unit that classifies the first log according to a first classification rule based on a keyword;
an extracting unit that extracts specific information related to at least one of operation information, schedule information, and communication information from the second log classified into the specific group by the processing for classifying the first log,
The classification unit classifies logs that match a second classification rule based on the specific information into the specific group.

１分類システム
１０サーバ
１１記憶部
１２受付部
１３取得部
１４分類部
１５抽出部
１６出力部
２０端末
３０管理者端末 Reference Signs List 1 Classification system 10 Server 11 Storage unit 12 Reception unit 13 Acquisition unit 14 Classification unit 15 Extraction unit 16 Output unit 20 Terminal 30 Administrator terminal

Claims

A process of classifying the first log according to a first classification rule based on a keyword;
a process of extracting specific information related to at least one of schedule information, communication source information, communication destination information, and communication date and time information from the second logs classified into the specific groups by the process of classifying the first logs;
A process of classifying the third log according to a second classification rule based on the specific information;
A classification program characterized by causing a computer to execute the above steps.

generating the second classification rule based on the extracted specific information and information of the specific group;
2. The classification program according to claim 1, wherein the computer executes the steps of:

The process of classifying the third log includes a process of classifying the third log into the specific group when the third log matches the second classification rule.
3. The classification program according to claim 1 or 2.

a ratio of logs common to the second log among the fourth logs classified into the specific group when the first log is classified according to the second classification rule is equal to or greater than a threshold value;
4. The classification program according to claim 1, wherein the classification program is a program for performing a classification process.

The communication source information is information about a sender of a message,
The communication destination information is message destination information.
5. The classification program according to claim 1, wherein the classification program is a program for performing a classification process.

A process of extracting a specific word based on the frequency of appearance of a word in the log classified into the specific group and the frequency of appearance of a word in the log classified into a group other than the specific group;
A process of classifying the fourth log according to a third classification rule based on the extracted specific word;
6. The classification program according to claim 1, wherein the classification program is executed by the computer.

the process of extracting the specific word is executed based on a comparison between the number of logs classified into the specific group among logs acquired in a first period and the number of logs classified into the specific group among logs acquired in a second period that is later than the first period.
7. The classification program according to claim 6.

A process of classifying the first log according to a first classification rule based on a keyword;
a process of extracting specific information related to at least one of schedule information, communication source information, communication destination information, and communication date and time information from the second logs classified into the specific groups by the process of classifying the first logs;
A process of classifying the third log according to a second classification rule based on the specific information;
A classification method performed by a computer.

a classification unit that classifies the first log according to a first classification rule based on a keyword;
an extracting unit that extracts specific information related to at least one of schedule information, communication source information, communication destination information, and communication date and time information from the second log classified into a specific group by the process of classifying the first log,
The classification unit classifies logs that match a second classification rule based on the specific information into the specific group.