JP2012141928A

JP2012141928A - Computer, document presentation method and document presentation program

Info

Publication number: JP2012141928A
Application number: JP2011001220A
Authority: JP
Inventors: Keiichi Hiroki; 桂一廣木
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2011-01-06
Filing date: 2011-01-06
Publication date: 2012-07-26

Abstract

PROBLEM TO BE SOLVED: To provide a computer for presenting a document to which a title as a valid determination material of the selection of information is applied.SOLUTION: In a computer connected to a document database in which a document is stored, a processor is configured to extract an inter-word causal relation included in the document stored in the document database, create causal network data expressing the extracted inter-word causal relation with a network structure, extract a word having the causal relation with an interesting word as a word which interests a user from the causal network data, create a title of the document based on a first word and a second word if defining the interesting word as the first word and the extracted word as the second word, and create data for presenting the document to which the created title is applied via an output device to the user.

Description

本発明は、計算機、文書提示方法及び文書提示プログラムに関し、計算機の利用者に対して文書を提示する技術に関する。 The present invention relates to a computer, a document presentation method, and a document presentation program, and to a technique for presenting a document to a computer user.

計算機を使用して作業する作業者に対して、作業の助けとなるような文書（補助資料）を計算機上で提示することによって、作業を支援する技術が知られている（特許文献１参照）。特許文献１には、見出しを付与した文書を提示することによって、作業者の取捨選択や情報に対する概要把握を支援する技術が開示されている。 A technique for supporting work by presenting a document (auxiliary material) that assists the work to a worker who works using the computer on the computer is known (see Patent Document 1). . Japanese Patent Application Laid-Open No. H10-228561 discloses a technique that supports selection of an operator and grasping an outline of information by presenting a document with a headline.

特開２０１０−１１７７９０号公報JP 2010-117790 A

しかしながら、特許文献１に開示された技術において文書に付与される見出しは、作業者の作業状況を考慮したものではない。そのため、情報の取捨選択の判断材料として適切でない問題があった。 However, the headline given to the document in the technique disclosed in Patent Document 1 does not consider the work status of the worker. For this reason, there is a problem that is not appropriate as a judgment material for selecting information.

本発明は、上述した課題を考慮したものであって、情報の取捨選択の有効な判断材料となる見出しを付与した文書を提示する計算機を提供することを目的とする。 An object of the present invention is to provide a computer that presents a document with a heading that is an effective determination material for selecting information, in consideration of the above-described problems.

上記課題を解決するために、例えば特許請求の範囲に記載の構成を採用する。 In order to solve the above problems, for example, the configuration described in the claims is adopted.

本願は上記課題を解決する手段を複数含んでいるが、その一例を挙げるならば、プログラムを実行するプロセッサと、前記プロセッサによって実行されるプログラムを格納するメモリと、前記プロセッサに接続されたインタフェースと、を備え、文書を格納する文書データベースに接続された計算機であって、前記プロセッサは、前記文書データベースに格納された文書に含まれる単語間の因果関係を抽出し、抽出された前記単語間の因果関係をネットワーク構造で表した因果ネットワークデータを作成し、前記因果ネットワークデータから、利用者が興味を持っている単語である興味語と因果関係にある単語を抽出し、前記興味語を第１ワード、抽出された単語を第２ワードとした場合における前記第１ワードと、前記第２ワードとに基づいて、文書の見出しを作成し、作成された前記見出しが付与された文書を、出力装置を介して利用者に提示するためのデータを生成することを特徴とする。 The present application includes a plurality of means for solving the above problems. To give an example, a processor that executes a program, a memory that stores a program executed by the processor, and an interface connected to the processor, A computer connected to a document database for storing documents, wherein the processor extracts a causal relationship between words contained in the document stored in the document database, and the extracted words Causal network data representing a causal relationship in a network structure is created, and from the causal network data, an interesting word that is a word that a user is interested in is extracted and a word that is causal is extracted. Word, based on the first word and the second word when the extracted word is the second word Te, create a heading documents, a document the title that was created is assigned, and generating the data for presentation to a user via the output device.

本発明によれば、情報の取捨選択の有効な判断材料となる見出しを付与した文書を提示することができる。 According to the present invention, it is possible to present a document with a heading that is an effective determination material for selecting information.

上記した以外の課題、構成及び効果は、以下の実施形態の説明により明らかにされる。 Problems, configurations, and effects other than those described above will be clarified by the following description of embodiments.

本発明の実施形態の計算機システムの利用者の操作概要を示すフローチャートである。It is a flowchart which shows the operation | movement outline | summary of the user of the computer system of embodiment of this invention. 本発明の実施形態の計算機システムの構成例を説明する図である。It is a figure explaining the structural example of the computer system of embodiment of this invention. 本発明の実施形態の計算機システムのハードウェア構成例を示す図である。It is a figure which shows the hardware structural example of the computer system of embodiment of this invention. 本発明の実施形態の個人作業状態データの例を示す図である。It is a figure which shows the example of the personal work status data of embodiment of this invention. 本発明の実施形態の個人作業履歴データの例を示す図である。It is a figure which shows the example of the personal work log | history data of embodiment of this invention. 本発明の実施形態の個人プロフィールデータの例を示す図である。It is a figure which shows the example of the personal profile data of embodiment of this invention. 本発明の実施形態の情報提供サーバの全体処理を示すフローチャートである。It is a flowchart which shows the whole process of the information provision server of embodiment of this invention. 本発明の実施形態の熟練度判定部の制御ロジックを示すフローチャートである。It is a flowchart which shows the control logic of the skill level determination part of embodiment of this invention. 本発明の実施形態の多義性解決部の制御ロジックを示すフローチャートである。It is a flowchart which shows the control logic of the ambiguity resolution part of embodiment of this invention. 本発明の実施形態の因果抽出部の制御ロジックを示すフローチャートである。It is a flowchart which shows the control logic of the causal extraction part of embodiment of this invention. 本発明の実施形態の因果抽出部によって作成される因果ネットワークデータの例を説明する図である。It is a figure explaining the example of the causal network data produced by the causal extraction part of embodiment of this invention. 本発明の実施形態の興味語抽出部の制御ロジックを示すフローチャートである。It is a flowchart which shows the control logic of the interested word extraction part of embodiment of this invention. 本発明の実施形態の見出し作成部の制御ロジックを示すフローチャートである。It is a flowchart which shows the control logic of the headline preparation part of embodiment of this invention. 本発明の実施形態の見出し作成部によって作成される見出しの一例を示す図である。It is a figure which shows an example of the headline produced by the headline preparation part of embodiment of this invention. 本発明の実施形態の見出しリスト表示部の制御ロジックを示すフローチャートである。It is a flowchart which shows the control logic of the heading list display part of embodiment of this invention. 本発明の実施形態の見出しリスト表示部による表示画面の例を示す図である。It is a figure which shows the example of the display screen by the heading list display part of embodiment of this invention.

以下、本発明の実施の形態について図面を参照して説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

本発明の実施形態の計算機システム１（図２参照）では、製造現場においてクライアントコンピュータ４０を用いて作業する作業者（以下、「利用者」という。）に、過去の不具合事例や作業ノウハウに関する文書を提示することによって、不具合再発防止や作業の効率化を実現する。 In the computer system 1 (see FIG. 2) according to the embodiment of the present invention, a document on past trouble cases and work know-how is given to a worker (hereinafter referred to as a “user”) who uses the client computer 40 at a manufacturing site. By presenting, it is possible to prevent the recurrence of defects and improve work efficiency.

図１は、本発明の実施形態の計算機システム１の利用者の操作概要を示すフローチャートである。利用者の操作概要について、図２を適宜参照して説明する。 FIG. 1 is a flowchart showing an outline of the operation of the user of the computer system 1 according to the embodiment of this invention. An outline of user operations will be described with reference to FIG.

はじめに、利用者は、業務管理システムクライアント４０２を介して、業務管理システム２０１にログインし、ユーザＩＤを入力する（Ｓ１０１）。その後、クライアントコンピュータ４０の出力部（ディスプレイ等）に業務の一覧が提示されるので、利用者は実行する業務を選択し、業務を開始する（Ｓ１０２）。そうすると、クライアントコンピュータ４０上で、選択された業務の作業状態に関連する情報の見出し一覧が表示される（Ｓ１０３）。 First, the user logs in to the business management system 201 via the business management system client 402 and inputs a user ID (S101). Thereafter, since a list of tasks is presented on the output unit (display or the like) of the client computer 40, the user selects a task to be executed and starts the task (S102). Then, a heading list of information related to the work status of the selected job is displayed on the client computer 40 (S103).

利用者は、表示された見出し一覧の中で、興味を引く見出しがあるか否かを判断する（Ｓ１０４）。興味を引く見出しがある場合（Ｓ１０４でＹＥＳ）、利用者は、この見出しを選択し、情報提供サーバ１０から業務に有用な情報を取得する（Ｓ１０５）。一方、興味を引く見出しがない場合（Ｓ１０４でＮＯ）、利用者は業務を継続する（Ｓ１０６）。 The user determines whether there is a headline that attracts interest in the displayed heading list (S104). When there is a headline that attracts interest (YES in S104), the user selects this headline, and acquires information useful for business from the information providing server 10 (S105). On the other hand, when there is no headline that attracts interest (NO in S104), the user continues the business (S106).

その後、業務中の場合（Ｓ１０７でＮＯ）、ステップ１０３に戻る。この場合、表示される見出し一覧が、その時点における業務の作業状態に応じて更新される。一方、業務が終了すると（Ｓ１０７でＹＥＳ）、利用者は業務管理システム２０１からログアウトする（Ｓ１０８）。 Thereafter, when the job is in progress (NO in S107), the process returns to step 103. In this case, the displayed heading list is updated according to the work status of the task at that time. On the other hand, when the business is completed (YES in S107), the user logs out from the business management system 201 (S108).

以下、ステップ１０３の処理、すなわち見出し一覧の表示方法について説明する。 Hereinafter, the process of step 103, that is, the heading list display method will be described.

図２は、本発明の実施形態の計算機システム１の構成例を説明する図である。 FIG. 2 is a diagram illustrating a configuration example of the computer system 1 according to the embodiment of this invention.

図２において、実線長方形で示すブロック（熟練度判定部１０１等）は、プログラムモジュールである。平行四辺形で示すブロック（ユーザＩＤ１１１等）は、データである。破線長方形で示すブロック（業務管理システム２０１等）は、情報提供サーバ１０と別の独立した外部システムである。円柱で示すブロック（業務情報ＤＢ５０等）は、データベースである。 In FIG. 2, blocks (skill level determination unit 101 and the like) indicated by solid line rectangles are program modules. A block (user ID 111 or the like) indicated by a parallelogram is data. A block (business management system 201 or the like) indicated by a broken-line rectangle is an independent external system separate from the information providing server 10. A block (business information DB 50 or the like) indicated by a cylinder is a database.

計算機システム１は、情報提供サーバ１０、業務管理サーバ２０、検索サーバ３０、クライアントコンピュータ４０、業務情報ＤＢ５０、個人情報ＤＢ６０、文書ＤＢ７０を備える。 The computer system 1 includes an information providing server 10, a business management server 20, a search server 30, a client computer 40, a business information DB 50, a personal information DB 60, and a document DB 70.

情報提供サーバ１０は、クライアントコンピュータ４０に見出し付き文書データ１１８を提供するコンピュータ装置である。この情報提供サーバ１０は、熟練度判定部１０１、興味語抽出部１０２、多義性解決部１０３、因果抽出部１０４、見出し作成部１０５を備える。各機能部については詳細に後述する。 The information providing server 10 is a computer device that provides document data 118 with a header to the client computer 40. The information providing server 10 includes a skill level determination unit 101, an interesting word extraction unit 102, an ambiguity resolution unit 103, a causal extraction unit 104, and a headline creation unit 105. Each functional unit will be described in detail later.

この情報提供サーバ１０は、業務管理サーバ２０から受信したユーザＩＤ１１１に基づいて、業務管理サーバ２０を介して又は介さずに、業務情報ＤＢ５０、個人情報ＤＢ６０から、それぞれ個人作業状態データ１１２、個人プロフィールデータ１１４を入力する。 Based on the user ID 111 received from the business management server 20, the information providing server 10 receives the personal work status data 112 and the personal profile from the business information DB 50 and the personal information DB 60 with or without the business management server 20. Data 114 is input.

業務管理サーバ２０は、クライアントコンピュータ４０からの要求に応じて、業務管理システムクライアント４０２にサービスを提供する業務管理システム２０１を備えたコンピュータ装置である。この業務管理サーバ２０は、クライアントコンピュータ４０から、利用者のユーザＩＤ４１１、業務の作業状態に関するデータ４１２を受信する。 The business management server 20 is a computer device including a business management system 201 that provides a service to the business management system client 402 in response to a request from the client computer 40. The business management server 20 receives from the client computer 40 the user ID 411 of the user and data 412 relating to the work status of the business.

検索サーバ３０は、情報を検索する検索エンジン３０１を備えたコンピュータ装置である。この検索サーバ３０は、情報提供サーバ１０から興味語データ１１７を受信し、興味語データ１１７に基づいて、文書ＤＢ７０中に記録された文書データに対して検索を実行し、検索結果として抽出される文書データ３１１を情報提供サーバ１０に送信する。 The search server 30 is a computer device that includes a search engine 301 that searches for information. The search server 30 receives the interesting word data 117 from the information providing server 10, performs a search on the document data recorded in the document DB 70 based on the interesting word data 117, and is extracted as a search result. The document data 311 is transmitted to the information providing server 10.

クライアントコンピュータ４０は、見出しリスト表示部４０１、業務管理システムクライアント４０２を備えたコンピュータ装置である。利用者は、クライアントコンピュータ４０を用いて自身の業務を実行する。 The client computer 40 is a computer device that includes a heading list display unit 401 and a business management system client 402. The user executes his / her business using the client computer 40.

業務情報ＤＢ５０は、クライアントコンピュータ４０の各利用者によって遂行される業務に関する情報（業務名、関連文書一覧、関連語一覧等、詳細は後述）が記録されたデータベースである。個人情報ＤＢ６０は、クライアントコンピュータ４０の各利用者の個人プロフィールデータ１１４が記録されたデータベースである。文書ＤＢ７０は、各種文書データが記録されたデータベースである。 The business information DB 50 is a database in which information related to business performed by each user of the client computer 40 (business name, related document list, related word list, etc., details will be described later) is recorded. The personal information DB 60 is a database in which personal profile data 114 of each user of the client computer 40 is recorded. The document DB 70 is a database in which various document data are recorded.

図３は、本発明の実施形態の計算機システム１のハードウェア構成例を説明する図である。なお、図３では、図２と同様の機能を果たす部分には同一の符号を付して重複する説明を適宜省略する。 FIG. 3 is a diagram illustrating a hardware configuration example of the computer system 1 according to the embodiment of this invention. In FIG. 3, the same reference numerals are given to portions that perform the same functions as those in FIG. 2, and redundant descriptions are omitted as appropriate.

図３に示す計算機システム１は、ネットワーク８０を介して互いに接続された情報提供サーバ１０、業務管理サーバ２０、検索サーバ３０、クライアントコンピュータ４０を備える。なお、ネットワーク８０は、ＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）、又は、インターネット等のグローバルネットワークである。 The computer system 1 shown in FIG. 3 includes an information providing server 10, a business management server 20, a search server 30, and a client computer 40 that are connected to each other via a network 80. The network 80 is a global network such as a LAN (Local Area Network), a WAN (Wide Area Network), or the Internet.

以下、各装置のハードウェア構成について説明する。 Hereinafter, the hardware configuration of each apparatus will be described.

情報提供サーバ１０は、ネットワークインタフェース１１、ＣＰＵ（Central Processing Unit）１２、主記憶装置１３、二次記憶装置１４、及び、これらを相互に接続するバス１５を備える。 The information providing server 10 includes a network interface 11, a CPU (Central Processing Unit) 12, a main storage device 13, a secondary storage device 14, and a bus 15 that interconnects them.

ネットワークインタフェース１１は、情報提供サーバ１０がネットワーク８０に接続するためのインタフェースである。ＣＰＵ１２は、主記憶装置１３に記憶されているプログラムを実行する演算処理装置である。主記憶装置１３は、ＣＰＵ１２によって実行されるプログラム、及び、プログラムの実行に必要なデータを記憶するＲＡＭ（Random Access Memory）等の記憶装置である。プログラムとは、例えば、不図示のＯＳ（Operating System）である。二次記憶装置１４は、ハードディスク装置などの磁気記憶媒体である。 The network interface 11 is an interface for the information providing server 10 to connect to the network 80. The CPU 12 is an arithmetic processing device that executes a program stored in the main storage device 13. The main storage device 13 is a storage device such as a RAM (Random Access Memory) that stores a program executed by the CPU 12 and data necessary for executing the program. The program is, for example, an OS (Operating System) (not shown). The secondary storage device 14 is a magnetic storage medium such as a hard disk device.

業務管理サーバ２０、検索サーバ３０、クライアントコンピュータ４０のそれぞれのハードウェア構成は、情報提供サーバ１０と同様であるとして、ここでは説明を省略する。 The hardware configurations of the business management server 20, the search server 30, and the client computer 40 are the same as those of the information providing server 10, and a description thereof is omitted here.

図４Ａは、本発明の実施形態の個人作業状態データ１１２の例を示す図である。 FIG. 4A is a diagram illustrating an example of the personal work status data 112 according to the embodiment of this invention.

個人作業状態データ１１２は、各利用者の作業状態を示すデータである。この個人作業状態データ１１２は、業務ＩＤ１１２Ａ、作成中文書１１２Ｂ、閲覧中文書１１２Ｃ、検索エンジン状態１１２Ｄを含む。 The personal work status data 112 is data indicating the work status of each user. The personal work status data 112 includes a business ID 112A, a document being created 112B, a document being browsed 112C, and a search engine status 112D.

業務ＩＤ１１２Ａとは、利用者が従事している業務を一意に識別する識別子である。この業務ＩＤ１１２Ａは、あらかじめ工場内又は製造工程内の全ての業務に振られている。作成中文書１１２Ｂとは、利用者が作業時間中にクライアントコンピュータ４０上で作成している文書、例えば設計図、仕様書等である。この作成中文書１１２Ｂは、常駐ソフトによる監視や、一定のタイミングで業務管理サーバ２０にアップロードする規則の制定などによって取得される。閲覧中文書１１２Ｃとは、利用者が作業時間中にクライアントコンピュータ４０上で閲覧している文書、例えば仕様書、手順書、Ｗｅｂページ等である。この閲覧中文書１１２Ｃは、常駐ソフトによる監視や、ブラウザの履歴の抽出などによって取得される。検索エンジン状態１１２Ｄとは、検索エンジン３０１に入力された単語や検索オプションのことである。この検索エンジン状態１１２Ｄは、常駐ソフトによる監視やブラウザからの情報抽出によって取得される。 The business ID 112A is an identifier that uniquely identifies the business that the user is engaged in. This work ID 112A is assigned in advance to all work in the factory or manufacturing process. The document 112B being created is a document created on the client computer 40 by the user during work, such as a design drawing or a specification. The document 112B being created is acquired by monitoring with resident software or by establishing rules for uploading to the business management server 20 at a certain timing. The browsing document 112C is a document that the user is browsing on the client computer 40 during working hours, such as a specification, a procedure manual, a Web page, and the like. This browsing document 112C is acquired by monitoring with resident software, extracting browser history, or the like. The search engine state 112D is a word or a search option input to the search engine 301. This search engine state 112D is acquired by monitoring with resident software or extracting information from a browser.

図４Ｂは、本発明の実施形態の個人作業履歴データ１１３の例を示す図である。 FIG. 4B is a diagram illustrating an example of the personal work history data 113 according to the embodiment of this invention.

個人作業履歴データ１１３は、各利用者の作業状態の履歴を示すデータである。この個人作業履歴データ１１３は、作成文書履歴１１３Ａ、閲覧文書履歴１１３Ｂを含む。 The personal work history data 113 is data indicating a work state history of each user. The personal work history data 113 includes a created document history 113A and a browsing document history 113B.

作成文書履歴１１３Ａとは、個人作業状態データ１１２の作成中文書１１２Ｂのうち、同じ業務ＩＤの作成中文書１１２Ｂに、時刻情報を加えたものである。閲覧文書履歴１１３Ｂとは、個人作業状態データ１１２の閲覧中文書１１２Ｃのうち、同じ業務ＩＤの閲覧中文書１１２Ｃに、時刻情報を加えたものである。 The created document history 113A is obtained by adding time information to a document 112B being created with the same job ID in the document 112B being created of the personal work status data 112. The browsing document history 113B is obtained by adding time information to the browsing document 112C having the same job ID among the browsing documents 112C of the personal work state data 112.

図４Ｃは、本発明の実施形態の個人プロフィールデータ１１４の例を示す図である。 FIG. 4C is a diagram illustrating an example of the personal profile data 114 according to the embodiment of this invention.

個人プロフィールデータ１１４は、各利用者のプロフィールを示すデータである。この個人プロフィールデータ１１４は、担当業務データ１１４Ａ、所属部署データ１１４Ｂ、過去作成文書データ１１４Ｃ、過去閲覧文書データ１１４Ｄを含む。 The personal profile data 114 is data indicating the profile of each user. The personal profile data 114 includes responsible business data 114A, affiliation department data 114B, past created document data 114C, and past browse document data 114D.

担当業務データ１１４Ａとは、利用者の担当業務に関するデータである。この担当業務データ１１４Ａは、担当業務の業務名、担当業務に関連する文書（以下、「関連文書」という。）の一覧と関連係数、担当業務に関連する語（以下、「関連語」という。）の一覧と関連係数、担当業務に関連する業務のＩＤ（以下、「関連業務ＩＤ」という。）の一覧、担当業務の担当期間を含む。関連係数とは、業務に携わる全ての利用者の閲覧文書や作成文書における単語の出現頻度を用いること等によって算出される。この担当業務データ１１４Ａは、業務ＩＤを用いて読み出すことができる。 The assigned job data 114A is data related to the user's assigned job. The assigned job data 114A is referred to as a job name of the assigned job, a list of documents related to the assigned job (hereinafter referred to as “related documents”) and related coefficients, and a word related to the assigned job (hereinafter referred to as “related word”). ) List and related coefficients, a list of work IDs related to the work in charge (hereinafter referred to as “related work ID”), and a charge period of the work in charge. The related coefficient is calculated by using the appearance frequency of words in the browsed documents and created documents of all users engaged in the business. The assigned business data 114A can be read using the business ID.

所属部署データ１１４Ｂとは、利用者の所属部署に関するデータである。この所属部署データ１１４Ｂは、所属部署の部署名、所属部署の関連業務ＩＤの一覧と関連係数、所属部署の所属メンバの一覧、所属部署における所属期間を含む。過去作成文書データ１１４Ｃとは、業務ＩＤに関わらず過去の全ての作成文書に、時刻情報を加えたものである。過去閲覧文書データ１１４Ｄとは、業務ＩＤに関わらず過去の全ての閲覧文書に、時刻情報を加えたものである。 The belonging department data 114B is data relating to the user's belonging department. The belonging department data 114B includes the department name of the belonging department, a list of related business IDs and related coefficients of the belonging department, a list of belonging members of the belonging department, and a belonging period in the belonging department. The past created document data 114C is obtained by adding time information to all past created documents regardless of the business ID. The past browsing document data 114D is obtained by adding time information to all past browsing documents regardless of the business ID.

図５は、本発明の実施形態の情報提供サーバ１０の全体処理を示すフローチャートである。ここでは、情報提供サーバ１０の全体処理について、図２を適宜参照して説明する。 FIG. 5 is a flowchart showing the overall processing of the information providing server 10 according to the embodiment of this invention. Here, the overall processing of the information providing server 10 will be described with reference to FIG. 2 as appropriate.

はじめに、利用者がクライアントコンピュータ４０を用いて業務管理システム２０１にログインすると、情報提供サーバ１０はユーザＩＤ１１１を受理する（Ｓ５０１）。以降ステップ５０２〜５０４の処理により、情報提供サーバ１０は、ステップ５０１で受理したユーザＩＤ１１１をキーとして、各情報を読み出す。 First, when a user logs in to the business management system 201 using the client computer 40, the information providing server 10 receives the user ID 111 (S501). Thereafter, the information providing server 10 reads each piece of information using the user ID 111 received in step 501 as a key through the processing of steps 502 to 504.

ステップ５０２において、情報提供サーバ１０は、業務管理システム２０１から、利用者の担当業務の業務ＩＤを受理する（Ｓ５０２）。その後ステップ５０３において、因果ネットワークデータ１０６を読み出す（Ｓ５０３）。因果ネットワークデータ１０６とは、文書ＤＢ７０内の複数の文書に含まれる事象の因果関係を表すネットワーク構造のデータである。図９を用いて詳細に後述する。 In step 502, the information providing server 10 receives the business ID of the business in charge of the user from the business management system 201 (S502). Thereafter, in step 503, the causal network data 106 is read (S503). The causal network data 106 is data having a network structure representing the causal relationship of events included in a plurality of documents in the document DB 70. Details will be described later with reference to FIG.

その後ステップ５０４において、情報提供サーバ１０は、個人情報ＤＢ６０から個人プロフィールデータ１１４を読み出す（Ｓ５０４）。その後ステップ５０５において、情報提供サーバ１０（熟練度判定部１０１）は、ステップ５０４で読み出された個人プロフィールデータ１１４を基に、利用者の熟練度を計算する（Ｓ５０５）。このステップ５０５の処理については、図６を用いて詳細に後述する。 Thereafter, in step 504, the information providing server 10 reads the personal profile data 114 from the personal information DB 60 (S504). Thereafter, in step 505, the information providing server 10 (skill level determination unit 101) calculates the skill level of the user based on the personal profile data 114 read in step 504 (S505). The processing in step 505 will be described later in detail with reference to FIG.

その後ステップ５０６において、情報提供サーバ１０は、業務管理システム２０１から、その時点での個人作業状態データ１１２を受理する（Ｓ５０６）。 Thereafter, in step 506, the information providing server 10 receives the personal work status data 112 at that time from the business management system 201 (S506).

その後ステップ５０７において、情報提供サーバ１０は、個人作業履歴データ１１３を更新する（Ｓ５０７）。具体的には、ステップ５０６で受信した個人作業状態データ１１２に含まれる作成中文書１１２Ｂ、閲覧中文書１１２Ｃを、個人作業履歴データ１１３に追加する。その後ステップ５０８において、興味語抽出部１０２が、利用者の興味語を抽出する（Ｓ５０８）。このステップ５０８の処理については、図１０を用いて詳細に後述する。 Thereafter, in step 507, the information providing server 10 updates the personal work history data 113 (S507). Specifically, the creating document 112B and the browsing document 112C included in the personal work status data 112 received in step 506 are added to the personal work history data 113. Thereafter, in step 508, the interested word extracting unit 102 extracts the user's interested words (S508). The processing in step 508 will be described later in detail with reference to FIG.

その後ステップ５０９において、情報提供サーバ１０は、ステップ５０８で抽出された興味語に基づき、当該興味語に関連する文書を、文書ＤＢ７０から検索する（Ｓ５０９）。ここでは、既存の検索エンジン３０１を用いて検索する。興味語に関連する文書とは、興味語と因果関係にあるキーワードを有する文書である。その後ステップ５１０において、多義性解決部１０３が、興味語の多義性を解決する（Ｓ５１０）。このステップ５１０の処理については、図７を用いて詳細に後述する。 Thereafter, in step 509, the information providing server 10 searches the document DB 70 for a document related to the interesting word based on the interesting word extracted in step 508 (S509). Here, the search is performed using the existing search engine 301. A document related to an interesting word is a document having a keyword that has a causal relationship with the interesting word. Thereafter, in step 510, the ambiguity resolution unit 103 resolves the ambiguity of the interested word (S510). The processing of step 510 will be described later in detail with reference to FIG.

その後ステップ５１１において、見出し作成部１０５が、ステップ５０９で検索された文書に対して付与する見出しを作成する（Ｓ５１１）。このステップ５１１の処理については、図１１を用いて詳細に後述する。 Thereafter, in step 511, the headline creation unit 105 creates a headline to be assigned to the document searched in step 509 (S511). The processing in step 511 will be described later in detail with reference to FIG.

その後ステップ５１２において、見出しリスト表示部４０１は、ステップ５１１で作成された見出しを、出力部（ディスプレイ等）に表示することによって、利用者に提示する（Ｓ５１２）。 Thereafter, in step 512, the heading list display unit 401 presents the heading created in step 511 to the user by displaying it on the output unit (display or the like) (S512).

その後ステップ５１３において、情報提供サーバ１０は、利用者の入力を受け付ける（Ｓ５１３）。ここでいう利用者の入力とは、ステップ５１２で提示された見出しに対する利用者の閲覧要求である。その後ステップ５１４において、情報提供サーバ１０は、文書閲覧要求があるか否かを判定する（Ｓ５１４）。 Thereafter, in step 513, the information providing server 10 accepts user input (S513). The user input here is a user browsing request for the headline presented in step 512. Thereafter, in step 514, the information providing server 10 determines whether there is a document browsing request (S514).

文書閲覧要求がある場合（Ｓ５１４でＹＥＳ）、情報提供サーバ１０は、当該要求に対応する文書を提示し（Ｓ５１５）、ステップ５１２に戻って処理を繰り返す。一方、文書閲覧要求がない場合（Ｓ５１４でＮＯ）、情報提供サーバ１０は、待機状態になり、利用者がログアウトしたか否かを判定する（Ｓ５１６）。 If there is a document browsing request (YES in S514), the information providing server 10 presents a document corresponding to the request (S515), returns to step 512, and repeats the process. On the other hand, when there is no document browsing request (NO in S514), the information providing server 10 enters a standby state and determines whether or not the user has logged out (S516).

ログアウトした場合（Ｓ５１６でＹＥＳ）、情報提供サーバ１０の待機状態は終了し、処理を終了する。一方、ログアウトしていない場合（Ｓ５１６でＮＯ）、情報提供サーバ１０は、個人作業状態データ１１２が変化したか否か（更新されたか否か）を判定する（Ｓ５１７）。個人作業状態データ１１２が変化した場合（Ｓ５１７でＹＥＳ）、ステップ５０６へ戻って処理を繰り返す。個人作業状態データ１１２が変化していない場合（Ｓ５１７でＮＯ）、ステップ５１６へ戻って処理を繰り返す。 When logging out (YES in S516), the standby state of the information providing server 10 ends, and the process ends. On the other hand, if the user has not logged out (NO in S516), the information providing server 10 determines whether or not the personal work status data 112 has been changed (updated) (S517). If the personal work status data 112 has changed (YES in S517), the process returns to step 506 and the process is repeated. If the personal work status data 112 has not changed (NO in S517), the process returns to step 516 to repeat the process.

以下、熟練度判定部１０１、興味語抽出部１０２、多義性解決部１０３、因果抽出部１０４、見出し作成部１０５、見出しリスト表示部４０１の処理フローについて説明する。 Hereinafter, the processing flow of the skill level determination unit 101, the interested word extraction unit 102, the ambiguity resolution unit 103, the causal extraction unit 104, the headline creation unit 105, and the headline list display unit 401 will be described.

図６は、本発明の実施形態の熟練度判定部１０１の制御ロジックを示すフローチャートである。ここでは、図５のステップ５０５の処理について、図２を適宜参照して説明する。 FIG. 6 is a flowchart illustrating the control logic of the skill level determination unit 101 according to the embodiment of this invention. Here, the processing of step 505 in FIG. 5 will be described with reference to FIG. 2 as appropriate.

はじめに、熟練度判定部１０１は、ユーザＩＤ１１１をキーに、個人情報ＤＢ６０から個人プロフィールデータ１１４を読み込む（Ｓ６０１）。次に、業務管理システム２０１から業務ＩＤを受信し、受信した業務ＩＤをキーに、業務情報ＤＢ５０から業務関連語一覧を読み込む（Ｓ６０２）。 First, the skill level determination unit 101 reads the personal profile data 114 from the personal information DB 60 using the user ID 111 as a key (S601). Next, a business ID is received from the business management system 201, and a business related word list is read from the business information DB 50 using the received business ID as a key (S602).

その後ステップ６０３において、熟練度判定部１０１は、ステップ６０１で読み込まれた個人プロフィールデータ１１４内の過去作成文書データ１１４Ｃが、ステップ６０２で読み込まれた業務関連語一覧中の業務関連語を含む場合、この業務関連語の出現頻度を取得する（Ｓ６０３）。その後ステップ６０４において、ステップ６０１で読み込まれた個人プロフィールデータ１１４内の過去閲覧文書データ１１４Ｄが、ステップ６０２で読み込まれた業務関連語一覧中の業務関連語を含む場合、この業務関連語の出現頻度を取得する（Ｓ６０４）。 Thereafter, in step 603, the skill level determination unit 101 determines that the past-created document data 114C in the personal profile data 114 read in step 601 includes the business-related words in the business-related word list read in step 602. The appearance frequency of the business related word is acquired (S603). Thereafter, in step 604, if the past browsing document data 114D in the personal profile data 114 read in step 601 includes the business-related words in the business-related word list read in step 602, the appearance frequency of this business-related word. Is acquired (S604).

その後ステップ６０５において、熟練度判定部１０１は、個人プロフィールデータ１１４内の所属部署データ１１４Ｂのうち、関連業務ＩＤ一覧と所属期間を取得する（Ｓ６０５）。その後ステップ６０６において、個人プロフィールデータ１１４内の担当業務データ１１４Ａのうち、関連業務ＩＤと担当期間を取得する（Ｓ６０６）。その後ステップ６０７において、取得された各種データに基づき、式（１）に従って、業務ＩＤで示される業務に対する利用者の熟練度を計算し、利用者熟練度データ１１５として出力する（Ｓ６０７）。 Thereafter, in step 605, the skill level determination unit 101 acquires a list of related business IDs and a belonging period in the belonging department data 114B in the personal profile data 114 (S605). Thereafter, in step 606, the related work ID and the charge period are acquired from the work data 114A in the personal profile data 114 (S606). Thereafter, in step 607, the user's skill level for the work indicated by the work ID is calculated according to the formula (1) based on the acquired various data, and is output as the user skill level data 115 (S607).

式（１）は、業務に対する利用者の熟練度を算出するための式である。式（１）において、部署関連度（業務ＩＤ、部署ＩＤ）とは、部署への所属期間による関連業務への知識習得度（業務と部署の関連の深さ）である。Ｌとは、部署や業務に従事した期間と、関連知識の習熟度との相関を示す習熟関数である。Ｌ（所属期間（部署ＩＤ））とは、部署に所属した期間による業務への習熟度である。Ｌ（担当期間（業務ＩＤ））とは、業務に従事した期間による業務への習熟度である。p（w）は、過去に作成した文書（過去作成文書データ１１４Ｃ）における業務関連語の出現頻度である。D（w）は、業務ＩＤの関連語一覧に基づく単語の専門性である。すなわち、Σ（p（w）×D（w））は、過去に作成した文書から推定される業務への習熟度である。なお、過去に作成した文書が業務関連語を多く含むほど、習熟度は高いと判断される。q（w）は、過去に閲覧した文書（過去閲覧文書データ１１４Ｄ）における業務関連語の出現頻度である。すなわち、Σ（q（w）×D（w））は、過去に閲覧した文書から推定される業務への習熟度である。なお、過去に閲覧した文書が業務関連語を多く含むほど、習熟度は高いと判断される。また、α、β、γ、δは、それぞれα≧０、β≧０、γ≧δ≧０の定数である。 Expression (1) is an expression for calculating the skill level of the user for the business. In equation (1), the degree of department relevance (business ID, department ID) is the degree of knowledge acquisition of related work (depth of relation between business and department) according to the period of belonging to the department. L is a proficiency function that indicates the correlation between the period in which a department or business has been engaged and the proficiency level of related knowledge. L (Affiliation period (department ID)) is the proficiency level for the work by the period of belonging to the department. L (in charge period (business ID)) is a proficiency level for the business during the business period. p (w) is a frequency of appearance of business related words in a document created in the past (past created document data 114C). D (w) is the word specialty based on the related word list of the business ID. In other words, Σ (p (w) × D (w)) is a proficiency level for work estimated from documents created in the past. It is determined that the proficiency level is higher as the documents created in the past contain more business related words. q (w) is a frequency of appearance of business related words in a document browsed in the past (past browsed document data 114D). In other words, Σ (q (w) × D (w)) is a proficiency level for work estimated from documents browsed in the past. In addition, it is judged that the proficiency is higher as the documents browsed in the past contain more business-related words. Α, β, γ, and δ are constants of α ≧ 0, β ≧ 0, and γ ≧ δ ≧ 0, respectively.

以上に示す処理により、熟練度判定部１０１は、業務に対する利用者の熟練度を判定し、数値化することができる。 Through the processing described above, the skill level determination unit 101 can determine and quantify the user's skill level for business.

図７は、本発明の実施形態の多義性解決部１０３の制御ロジックを示すフローチャートである。ここでは、図５のステップ５１０の処理について、図２を適宜参照して説明する。 FIG. 7 is a flowchart illustrating the control logic of the ambiguity resolution unit 103 according to the embodiment of this invention. Here, the processing of step 510 in FIG. 5 will be described with reference to FIG. 2 as appropriate.

はじめに、多義性解決部１０３は、個人作業履歴データ１１３、因果ネットワークデータ１０６、興味語データ１１６を読み込む（Ｓ７０１〜Ｓ７０３）。次にステップ７０４において、入力された興味語データ１１６が因果ネットワークデータ１０６（図９参照）内の２箇所以上のノードに登場するかどうかを判定する（Ｓ７０４）。 First, the ambiguity resolution unit 103 reads the personal work history data 113, the causal network data 106, and the interesting word data 116 (S701 to S703). Next, in step 704, it is determined whether or not the input interesting word data 116 appears in two or more nodes in the causal network data 106 (see FIG. 9) (S704).

１箇所以下のノードに登場する場合（Ｓ７０４でＮＯ）、この興味語データ１１６が新出単語、又は一つの意味しかない単語であるので、ステップ７０５に進み、多義性解決部１０３は、この興味語データ１１６をそのまま出力し（Ｓ７０５）、処理を終了する。一方、２箇所以上（複数）のノードに登場する場合（Ｓ７０４でＹＥＳ）、複数のノードのそれぞれについて、式（２）によって、ノードの文脈適合度を計算する（Ｓ７０６）。文脈適合度とは、文書における興味語の適合性を示す指標である。 If it appears in one or less nodes (NO in S704), since this interesting word data 116 is a new word or a word that has only one meaning, the process proceeds to step 705, and the ambiguity resolution unit 103 determines this interest word. The word data 116 is output as it is (S705), and the process is terminated. On the other hand, when the node appears in two or more (plural) nodes (YES in S704), the node's context suitability is calculated for each of the plurality of nodes by Expression (2) (S706). The context suitability is an index indicating the suitability of an interesting word in a document.

式（２）は、ノードの文脈適合度を計算するための式である。式（２）において、ｄ（ｗ、ｉ）は、個人作業履歴データ１１３に登場する単語ｗと、興味語データ１１６で示される興味語ｉとの、因果ネットワークデータ１０６上の（意味的）距離である。このｄ（
ｗ、ｉ）は、ノードと単語とが強い因果関係にあるほど小さな値になる。ｔ（ｗ）は、単
語ｗが最後に文脈に登場してからの経過時間（文脈内での時間的距離）である。Ｃ１、Ｃ２は定数である。なお、該当する興味語のノードの近辺に位置する単語が、個人作業履歴データ１１３上に多く登場するほど、及び、個人作業履歴データ１１３上での登場時刻が新しいほど、式（２）によって計算される文脈適合度は高くなる。 Expression (2) is an expression for calculating the context suitability of the node. In Expression (2), d (w, i) is a (semantic) distance on the causal network data 106 between the word w appearing in the personal work history data 113 and the interest word i indicated by the interest word data 116. It is. This d (
w, i) becomes smaller as the causal relationship between the node and the word is stronger. t (w) is the elapsed time (temporal distance within the context) since the word w last appeared in the context. C1 and C2 are constants. It is to be noted that the more words that are located in the vicinity of the relevant interesting word node appear on the personal work history data 113 and the newer the appearance time on the personal work history data 113, the more the calculation is performed according to the equation (2) The degree of context suitability is high.

その後ステップ７０７において、多義性解決部１０３は、ステップ７０６で計算された各ノードの文脈適合度を比較し、文脈適合度の高いノードを採用する（Ｓ７０７）。その後ステップ７０８において、ステップ７０７で採用されたノード（意味を限定した興味語）を出力する（Ｓ７０８）。 Thereafter, in step 707, the ambiguity resolution unit 103 compares the context suitability of each node calculated in step 706, and adopts a node having a high context suitability (S707). Thereafter, in step 708, the node (interested word with limited meaning) adopted in step 707 is output (S708).

以上に示す処理により、多義性解決部１０３は、興味語が複数の意味を有する場合、この多義性を解決して、最も適した興味語に意味を限定し、この興味語を出力することができる。 By the processing described above, the ambiguity resolution unit 103 can resolve this ambiguity, limit the meaning to the most suitable interest word, and output this interest word when the interest word has a plurality of meanings. it can.

図８は、本発明の実施形態の因果抽出部１０４の制御ロジックを示すフローチャートである。ここでは、図５のステップ５１０の処理について、図２を適宜参照して説明する。 FIG. 8 is a flowchart illustrating the control logic of the causal extraction unit 104 according to the embodiment of this invention. Here, the processing of step 510 in FIG. 5 will be described with reference to FIG. 2 as appropriate.

はじめに、因果抽出部１０４は、文書ＤＢ７０から未処理の文書（処理対象の文書）を一つ読み出す（Ｓ８０１）。次にステップ８０２において、因果抽出部１０４は、ステップ６０１で読み出された文書から、因果関係を有する単語ペアを一つ抽出する（Ｓ８０２）。 First, the causal extraction unit 104 reads one unprocessed document (processing target document) from the document DB 70 (S801). In step 802, the causal extraction unit 104 extracts one word pair having a causal relationship from the document read in step 601 (S802).

その後ステップ８０３において、因果抽出部１０４は、抽出された単語ペアを構成する２単語が、共に因果ネットワークデータ１０６内にノードとして登録されているか否かを判定する（Ｓ８０３）。 Thereafter, in step 803, the causal extraction unit 104 determines whether or not two words constituting the extracted word pair are both registered as nodes in the causal network data 106 (S803).

２単語が共に登録されている場合（Ｓ８０３でＹＥＳ）、ステップ８０４に進み、因果抽出部１０４は、登録されている二つのノードをリンク（リンクの強度１）で結合する（Ｓ８０４）。リンクの強度とは、ノード間の因果関係の強さを示す指標である。なお、このステップ８０４では、既に二つのノードがリンクで結合されている場合、リンクの強度に１を加算する。一方、２単語共に登録されてはいない場合（Ｓ８０３でＮＯ）、ステップ８０５に進み、因果抽出部１０４は、いずれか片方の単語が因果ネットワークデータ１０６内にノードとして登録されているか否かを判定する（Ｓ８０５）。 If the two words are registered together (YES in S803), the process proceeds to step 804, and the causal extraction unit 104 connects the two registered nodes with a link (link strength 1) (S804). The link strength is an index indicating the strength of the causal relationship between nodes. In step 804, if two nodes are already connected by a link, 1 is added to the link strength. On the other hand, if both words are not registered (NO in S803), the process proceeds to step 805, and the causal extraction unit 104 determines whether one of the words is registered as a node in the causal network data 106. (S805).

片方の単語のみ登録されている場合（Ｓ８０５でＹＥＳ）、ステップ８０６に進み、因果抽出部１０４は、登録されていない方の単語を新しいノードとして登録し、二つのノードをリンク（リンクの強度１）で結合する（Ｓ８０６）。一方、２単語がいずれも登録されていない場合（Ｓ８０５でＮＯ）、ステップ８０７に進み、因果抽出部１０４は、２単語をそれぞれ新しいノードとして登録し、二つのノードをリンク（リンクの強度１）で結合する（Ｓ８０７）。 If only one word is registered (YES in S805), the process proceeds to step 806, where the causal extraction unit 104 registers the unregistered word as a new node, and links the two nodes (link strength 1). ) (S806). On the other hand, if neither of the two words is registered (NO in S805), the process proceeds to step 807, where the causal extraction unit 104 registers the two words as new nodes, and links the two nodes (link strength 1). (S807).

その後ステップ８０８により、因果抽出部１０４は、因果関係を有する他の単語ペアがあるか否かを判定し（Ｓ８０８）、他の単語ペアがある場合（Ｓ８０８でＹＥＳ）、ステップ８０２に戻って処理を繰り返す。またステップ８０９により、因果抽出部１０４は、文書ＤＢ７０に未処理の文書があるか否かを判定し（Ｓ８０９）、他の未処理の文書がある場合（Ｓ８０９でＹＥＳ）、ステップ８０１に戻って処理を繰り返す。 Thereafter, in step 808, the causal extraction unit 104 determines whether there is another word pair having a causal relationship (S808). If there is another word pair (YES in S808), the process returns to step 802 and the process is performed. repeat. In step 809, the causal extraction unit 104 determines whether there is an unprocessed document in the document DB 70 (S809). If there is another unprocessed document (YES in S809), the process returns to step 801. Repeat the process.

以上に示す処理により、因果抽出部１０４は、文書ＤＢ７０中の全ての文書における単語間の因果関係を抽出し、因果ネットワークデータ１０６を作成する。 Through the processing described above, the causal extraction unit 104 extracts the causal relationship between words in all documents in the document DB 70 and creates the causal network data 106.

特に、製造現場における技術文書から因果関係を抽出することによって、設計支援、不具合の原因究明、不良の再発防止等の目的で過去の文書を活用することができる。 In particular, by extracting causal relationships from technical documents at the manufacturing site, past documents can be used for the purposes of design support, investigation of the cause of defects, prevention of recurrence of defects, and the like.

図９は、本発明の実施形態の因果抽出部１０４によって作成される因果ネットワークデータ１０６の例を説明する図である。因果抽出部１０４は、文書ＤＢ７０に格納された製造現場における事故・不良報告書に基づいて、図９に示すような因果ネットワークデータ１０６を作成する。 FIG. 9 is a diagram illustrating an example of the causal network data 106 created by the causal extraction unit 104 according to the embodiment of this invention. The causal extraction unit 104 creates causal network data 106 as shown in FIG. 9 based on the accident / defect report at the manufacturing site stored in the document DB 70.

図９において、長方形はノードを示している。また、矢印はリンク、すなわちノード間の因果関係を表している。例えば「摩擦９０１」と「発熱９０２」とは、「摩擦９０１」が「発熱９０２」を引き起こす因果関係であることを示している。また、「発熱９０２」と「部品変形９０３」とは、「発熱９０２」が「部品変形９０３」を引き起こす因果関係であることを示している。 In FIG. 9, the rectangle indicates a node. An arrow represents a link, that is, a causal relationship between nodes. For example, “friction 901” and “heat generation 902” indicate that “friction 901” is a causal relationship that causes “heat generation 902”. Further, “heat generation 902” and “component deformation 903” indicate that “heat generation 902” is a causal relationship that causes “component deformation 903”.

図１０は、本発明の実施形態の興味語抽出部１０２の制御ロジックを示すフローチャートである。ここでは、図５のステップ５０８の処理について、図２を適宜参照して説明する。 FIG. 10 is a flowchart illustrating the control logic of the interesting word extraction unit 102 according to the embodiment of this invention. Here, the processing of step 508 in FIG. 5 will be described with reference to FIG. 2 as appropriate.

はじめに、興味語抽出部１０２は、個人作業状態データ１１２を読み込む（Ｓ１００１）。次にステップ１００２において、利用者熟練度データ１１５を読み込む（Ｓ１００２）。その後ステップ１００３において、個人作業履歴データ１１３を読み込む（Ｓ１００３）。その後ステップ１００４において、興味語候補を初期化する（Ｓ１００４）。興味語候補とは、興味語抽出部１０２が興味語を抽出するための候補となる興味語群である。 First, the interested word extraction unit 102 reads the personal work state data 112 (S1001). Next, in step 1002, the user skill level data 115 is read (S1002). Thereafter, in step 1003, the personal work history data 113 is read (S1003). Thereafter, in step 1004, the interested word candidate is initialized (S1004). An interesting word candidate is an interesting word group which becomes a candidate for the interesting word extraction part 102 to extract an interesting word.

その後ステップ１００５〜１００８により、興味語抽出部１０２は、個人プロフィールデータ１１４中の関連語一覧、個人作業状態データ１１２中の作成中文書１１２Ｂ内の特徴語、閲覧中文書１１２Ｃ内の特徴語、検索エンジン状態１１２Ｄ中の検索エンジン３０１に入力された検索式に含まれる単語を、興味語候補に加える（Ｓ１００５〜１００８）。ここで、文書内の特徴語とは、その文書内に多く含まれるなど、文書との関連の高い単語のことであり、例えばＴＦ（Term Frequency）−ＩＤＦ（Inverse Document Frequency）の高い単語を特徴語として扱うことができる。ＴＦ（Term Frequency）−ＩＤＦ（Inverse Document Frequency）とは、文書中の特徴語らしさを表す値であって、ＴＦをＤＦ（Document Frequency）で除算することによって求められる。ＴＦは、該当文書中の単語の出現頻度である。ＤＦは、全文書中の単語の出現頻度である。その後ステップ１００９において、興味語抽出部１０２は、全ての興味語候補のそれぞれの興味度を、式（３）に従って計算する（Ｓ８０９）。興味度とは、興味語に対する利用者の興味の度合いを示す指標である。 Thereafter, in steps 1005 to 1008, the interested word extracting unit 102 searches the related word list in the personal profile data 114, the feature words in the document 112B being created in the personal work status data 112, the feature words in the document 112C being browsed, and the search. A word included in the search expression input to the search engine 301 in the engine state 112D is added to the candidate for interesting words (S1005 to 1008). Here, the characteristic words in the document are words that are highly related to the document, such as being included in many in the document. For example, the word is characterized by a word having a high TF (Term Frequency) -IDF (Inverse Document Frequency). Can be treated as a word. TF (Term Frequency) -IDF (Inverse Document Frequency) is a value representing the likelihood of a feature word in a document, and is obtained by dividing TF by DF (Document Frequency). TF is the frequency of appearance of words in the document. DF is the appearance frequency of words in all documents. Thereafter, in step 1009, the interested word extraction unit 102 calculates the interest levels of all the interested word candidates according to the equation (3) (S809). The degree of interest is an index indicating the degree of interest of the user with respect to the interesting word.

式（３）は、興味語候補の興味度を算出するための式である。式（３）において、業務関連係数とは、業務ＩＤによって呼び出される関連語の関連係数である。利用者熟練度とは、ステップ１００２で読み込まれた利用者熟練度データ１１５である。適正熟練度とは、単語の専門性であり、その単語に興味を持つ可能性が高い利用者の熟練度である。専門性の高い単語は出現頻度が低いと仮定すれば、単語の専門性は、ＤＦに基づいて算出することができる。 Expression (3) is an expression for calculating the interest level of the interesting word candidate. In Equation (3), the business related coefficient is a related coefficient of a related word called by the business ID. The user skill level is the user skill level data 115 read in step 1002. The appropriate skill level is a word specialty and a skill level of a user who is highly likely to be interested in the word. If it is assumed that a highly specialized word has a low appearance frequency, the word specialty can be calculated based on the DF.

検索式内重要度とは、検索式における単語の各々に割り振られる単語の重要度である。例えば検索式がＡＮＤ検索である場合、ＡＮＤで結ばれた各単語に重要度を等分に分配する。一方、検索式がＯＲ検索の場合、ＯＲで結ばれた各単語に重要度を等分せずにそのまま付する。 The importance in the search expression is the importance of the word assigned to each word in the search expression. For example, when the search formula is an AND search, importance is equally distributed to each word connected by AND. On the other hand, when the search expression is an OR search, the importance is assigned to each word connected by OR without being divided equally.

例えば検索式が（Ａ∨Ｂ）∧Ｃ）である場合、検索式内重要度は（Ａ、Ｂ、Ｃ）＝（0.5、0.5、0.5）である。また、検索式が（Ａ∧Ｂ）∨Ｃ）である場合、検索式内重要度は（Ａ、Ｂ、Ｃ）＝（0.5、0.5、1.0）である。また、α、β、γ、δ、εは、それぞれ０以上の定数である。 For example, when the search formula is (A∨B) ∧C), the importance in the search formula is (A, B, C) = (0.5, 0.5, 0.5). When the search formula is (A∧B) ∨C), the importance in the search formula is (A, B, C) = (0.5, 0.5, 1.0). Α, β, γ, δ, and ε are constants of 0 or more.

この式（３）によれば、作業中の業務に関連が深い単語、作成文書中や閲覧文書中で特に多く出現する単語、作業者の熟練度と同程度の専門性を持つ単語、作業者が検索システムを使っている場合は検索式に含まれる単語ほど、興味度が高くなる。 According to this equation (3), words that are closely related to the work being worked on, words that appear particularly frequently in the created document and the read document, words that have the same level of expertise as the skill level of the worker, and the worker When using a search system, the more the word included in the search formula, the higher the degree of interest.

その後ステップ８１０において、興味語抽出部１０２は、興味度が一定以上の興味語候補を抽出し、興味語データ１１６として出力する（Ｓ８１０）。 Thereafter, in step 810, the interested word extracting unit 102 extracts an interesting word candidate having a certain degree of interest and outputs it as interesting word data 116 (S810).

以上に示す処理により、興味語抽出部１０２は、個人作業履歴データ１１３、個人プロフィールデータ１１４等に基づいて、その時点において利用者が興味を持つ可能性が高い単語を抽出することができる。 Through the processing described above, the interested word extraction unit 102 can extract words that are likely to be of interest to the user at that time based on the personal work history data 113, the personal profile data 114, and the like.

図１１は、本発明の実施形態の見出し作成部１０５の制御ロジックを示すフローチャートである。ここでは、図５のステップ５１１の処理について、図２を適宜参照して説明する。 FIG. 11 is a flowchart illustrating the control logic of the headline creation unit 105 according to the embodiment of this invention. Here, the processing of step 511 in FIG. 5 will be described with reference to FIG. 2 as appropriate.

はじめに、見出し作成部１０５は、因果ネットワークデータ１０６を読み込む（Ｓ１１０１）。次にステップ１１０２において、検索エンジン３０１による検索結果の文書データ３１１と、興味語データ１１７に示される興味語（以下、「検索語」という。）とのセットを読み込む（Ｓ１１０２）。 First, the headline creation unit 105 reads the causal network data 106 (S1101). Next, in step 1102, a set of document data 311 as a search result by the search engine 301 and an interesting word (hereinafter referred to as “search word”) indicated in the interesting word data 117 is read (S1102).

その後ステップ１１０３において、見出し作成部１０５は、因果ネットワークデータ１０６上で、検索語と一定距離内の因果関係にある単語を抽出する（Ｓ１１０３）。その後ステップ１１０４において、検索語を第１ワード、ステップ１１０３で抽出された単語を第２ワードとする（Ｓ１１０４）。 Thereafter, in step 1103, the headline creation unit 105 extracts words having a causal relationship within a certain distance from the search word on the causal network data 106 (S1103). Thereafter, in step 1104, the search word is the first word, and the word extracted in step 1103 is the second word (S1104).

具体的には、利用者とコンテンツとの関連性の根拠となる興味語を、第１ワードとする。この第１ワードは複数でもよい。利用者が実行中の業務に関連する単語、作成中文書内の単語、閲覧文書履歴の特徴語、検索キーワード等が第１ワードの候補である。このような第１ワードを見出しの一部にすることによって、提示するコンテンツに対する納得感（透過性）を高めることができる。次に、コンテンツの内容において、第１ワードと最も関係の深い第２ワードを決定する。 Specifically, the interesting word that is the basis for the relationship between the user and the content is the first word. This first word may be plural. Words related to the business being executed by the user, words in the document being created, feature words of the browsing document history, search keywords, etc. are candidates for the first word. By making such a first word a part of the headline, the satisfaction (transparency) with respect to the presented content can be enhanced. Next, in the contents, the second word that is most closely related to the first word is determined.

その後ステップ１１０５において、見出し作成部１０５は、第１ワード、第２ワード、補間表現を組み合わせて文書の見出しを作成する（Ｓ１１０５）。ここでの補間表現とは、第１ワードと第２ワードとの因果関係を表す、「〜による…」、「〜を原因とする…」、「〜を引き起こす…」のような表現を示す。 Thereafter, in step 1105, the headline creation unit 105 creates a headline for the document by combining the first word, the second word, and the interpolated expression (S1105). The interpolated expression here indicates expressions such as “by ...”, “cause of ...”, “cause ...”, which represent the causal relationship between the first word and the second word.

このように、見出し作成部１０５は、第１ワードと第２ワードを組み合わせ、これらのワードの関係性を表す表現を補間して見出しを作成する。例えば製造現場における技術文書を扱う場合、技術文書は、原因、現象、対策の構造を有する確率が高い。文書の中で、第１ワードと原因−現象、原因−対策、現象−対策のいずれかの関係にある単語を、第２ワードとして選択する。 In this way, the headline creation unit 105 combines the first word and the second word, and creates a headline by interpolating the expression representing the relationship between these words. For example, when handling a technical document at a manufacturing site, the technical document has a high probability of having a cause, phenomenon, and countermeasure structure. In the document, a word that is in any one of cause-phenomenon, cause-countermeasure, phenomenon-countermeasure relationship with the first word is selected as the second word.

図１２は、本発明の実施形態の見出し作成部１０５によって作成される見出しの一例を示す図である。 FIG. 12 is a diagram illustrating an example of a headline created by the headline creation unit 105 according to the embodiment of this invention.

第１ワード（興味語）が「短絡」、第２ワードが「経年動作」である場合、作成される見出しは、図１２に示すように「経年動作による短絡」となる。また、第２ワードが複数（「ボルト緩み」、「局部過熱」、「ボルト通電」、「溶融物」等）抽出された場合、作成される見出しも複数（「ボルト緩みによる短絡」、「局部過熱による短絡」、「ボルト通電による短絡」、「溶融物による短絡」等）である。一つの文書から複数の見出しが作成された場合、他の文書にも多く登場する因果関係の見出しを優先する。つまり、因果ネットワークデータ１０６上で最もリンク強度の強い２単語によって作成された見出し（図１２に示す例では、「溶融物による短絡」）を採用する。第１ワードと第２ワードとを直接結ぶリンクがない場合、経由する全てのリンクの強度の逆数の総和を計算し、逆数の総和を因果関係の強弱を判定するための指標としてもよい。 When the first word (interesting word) is “short circuit” and the second word is “aging operation”, the created heading is “short circuit by aging operation” as shown in FIG. Also, if the second word is extracted (“bolt loosening”, “local overheating”, “bolt energization”, “melt”, etc.), multiple headlines are created (“short circuit due to bolt loosening”, “local”). Short circuit due to overheating, "short circuit due to volt energization", "short circuit due to melt", etc.). When multiple headings are created from one document, priority is given to causal headings that frequently appear in other documents. That is, a headline (in the example shown in FIG. 12, “short circuit due to melt”) created by two words having the strongest link strength on the causal network data 106 is employed. When there is no link that directly connects the first word and the second word, the sum of the reciprocals of the strengths of all the links that pass through may be calculated, and the sum of the reciprocals may be used as an index for determining the strength of the causal relationship.

図１１に戻り、その後ステップ１１０６において、見出し作成部１０５は、ステップ１１０５で作成された見出しを文書に付与し、見出し付き文書データ１１８として出力する（Ｓ１１０６）。その後ステップ１１０７において、検索結果の他の文書があるか否かを判定し（Ｓ１１０７）、他の文書がある場合（Ｓ１１０７でＹＥＳ）、ステップ１１０２に戻って処理を繰り返す。 Returning to FIG. 11, in step 1106, the headline creation unit 105 adds the headline created in step 1105 to the document and outputs it as headlined document data 118 (S1106). Thereafter, in step 1107, it is determined whether there is another document as a search result (S1107). If there is another document (YES in S1107), the process returns to step 1102 to repeat the process.

以上に示す処理により、見出し作成部１０５は、利用者の作業状況や知識に適合した見出しを作成し、作成された見出しを文書に付与して出力する。なお、見出し作成部１０５は、複数のコンテンツから同一の見出しを作成する場合がある。この場合、これら複数のコンテンツを、同質のコンテンツとして同一の見出しを付与してグルーピングしてもよい。これにより、内容が重複する文書の閲覧を防ぐことができる。 Through the processing described above, the headline creation unit 105 creates a headline that matches the user's work situation and knowledge, and assigns the created headline to the document for output. The headline creation unit 105 may create the same headline from a plurality of contents. In this case, the plurality of contents may be grouped by giving the same heading as the same quality contents. This prevents browsing of documents with overlapping contents.

図１３は、本発明の実施形態の見出しリスト表示部４０１の制御ロジックを示すフローチャートである。ここでは図５のステップ５１２の処理について、図２を適宜参照して説明する。 FIG. 13 is a flowchart illustrating the control logic of the heading list display unit 401 according to the embodiment of this invention. Here, the processing of step 512 in FIG. 5 will be described with reference to FIG. 2 as appropriate.

はじめに、見出しリスト表示部４０１は、見出し作成部１０５から出力された見出し付き文書データ１１８を全て読み込む（Ｓ１３０１）。次にステップ１３０２において、ステップ１３０１で読み込まれた見出し付き文書データ１１８のうち、重複しない全ての見出しを表示する（Ｓ１３０２）。その後ステップ１３０３において、利用者の入力を待機する（Ｓ１３０３）。 First, the heading list display unit 401 reads all the headlined document data 118 output from the heading creation unit 105 (S1301). Next, in step 1302, all the headings that do not overlap among the headlined document data 118 read in step 1301 are displayed (S1302). Thereafter, in step 1303, the user input is waited (S1303).

その後ステップ１３０４において、見出しに対して利用者の閲覧要求があり（Ｓ１３０４でＹＥＳ）、且つ、見出しに対応する文書が二つ以上ある場合（Ｓ１３０５でＹＥＳ）、見出しリスト表示部４０１は、一つの文書を表示し、さらに閲覧要求があった場合に他の文書を表示する（Ｓ１３０６）。 Thereafter, in step 1304, when there is a user browsing request for the headline (YES in S1304) and there are two or more documents corresponding to the headline (YES in S1305), the heading list display unit 401 displays The document is displayed, and when there is a browsing request, another document is displayed (S1306).

一方、見出しに対して利用者の閲覧要求があり（Ｓ１３０４でＹＥＳ）、且つ、見出しに対応する文書が一つである場合（Ｓ１３０５でＮＯ）、見出しリスト表示部４０１は、当該見出しに対応する文書を表示する（Ｓ１３０７）。文書を表示した後は、ステップ１３０３に戻って、再び利用者の入力を待機する。なお、ステップ１３０４において、利用者の閲覧要求がない場合（Ｓ１３０４でＮＯ）、処理を終了する。 On the other hand, when there is a user browsing request for a headline (YES in S1304) and there is one document corresponding to the headline (NO in S1305), the headline list display unit 401 corresponds to the headline. The document is displayed (S1307). After the document is displayed, the process returns to step 1303 to wait for user input again. In step 1304, if there is no user browsing request (NO in step S1304), the process ends.

以上に示す処理により、見出しリスト表示部４０１は、見出し作成部１０５によって作成された見出し付き文書データ１１８を表示する。 Through the processing described above, the heading list display unit 401 displays the document data 118 with a heading created by the heading creation unit 105.

図１４は、本発明の実施形態の見出しリスト表示部４０１による表示画面１４０１の例を示す図である。 FIG. 14 is a diagram showing an example of a display screen 1401 by the heading list display unit 401 according to the embodiment of this invention.

図１４に示す表示画面１４０１において、タイトル１４０２には、見出し作成部１０５によって作成された見出しが表示される。本文確認ボタン１４０３は、タイトル１４０２に対応する文書の本文を確認するためのボタンである。事業者名１４０４、製品名１４０５には、タイトル１４０２に関する事業者名、製品名が表示される。関連ワード１４０６には、タイトル１４０２に関連するワードが表示される。 In the display screen 1401 shown in FIG. 14, a title created by the header creation unit 105 is displayed in the title 1402. A text confirmation button 1403 is a button for confirming the text of the document corresponding to the title 1402. In the business name 1404 and product name 1405, the business name and product name related to the title 1402 are displayed. In the related word 1406, a word related to the title 1402 is displayed.

図１４に示すような表示画面１４０１を表示することにより、利用者は、表示された情報の要不要を迅速且つ正確に判断することができる。 By displaying the display screen 1401 as shown in FIG. 14, the user can quickly and accurately determine whether or not the displayed information is necessary.

以上、本発明の実施形態について説明したが、上記実施形態は本発明の適用例の一つを示したものであり、本発明の技術的範囲を上記実施形態の具体的構成に限定する趣旨ではない。本発明の要旨を逸脱しない範囲において種々変更可能である。 As mentioned above, although embodiment of this invention was described, the said embodiment showed one of the application examples of this invention, and in the meaning which limits the technical scope of this invention to the specific structure of the said embodiment. Absent. Various modifications can be made without departing from the scope of the present invention.

１計算機システム
１０情報提供サーバ
２０業務管理サーバ
３０検索サーバ
４０クライアントコンピュータ
５０業務情報ＤＢ
６０個人情報ＤＢ
７０文書ＤＢ
１０１熟練度判定部
１０２興味語抽出部
１０３多義性解決部
１０４因果抽出部
１０５見出し作成部
１０６因果ネットワークデータ
１１２個人作業状態データ
１１３個人作業履歴データ
１１４個人プロフィールデータ
１１５利用者熟練度データ
１１６興味語データ
１１８見出し付き文書データ
４０１見出しリスト表示部 1 Computer System 10 Information Providing Server 20 Business Management Server 30 Search Server 40 Client Computer 50 Business Information DB
60 Personal Information DB
70 Document DB
101 Skill Level Determination Unit 102 Interesting Word Extraction Unit 103 Ambiguity Resolution Unit 104 Causal Extraction Unit 105 Headline Creation Unit 106 Causal Network Data 112 Personal Work Status Data 113 Personal Work History Data 114 Personal Profile Data 115 User Skill Level Data 116 Interesting Words Data 118 Headlined document data 401 Headline list display section

Claims

A computer comprising a processor for executing a program, a memory for storing a program executed by the processor, and an interface connected to the processor, and connected to a document database for storing documents,
The processor is
Extracting the causal relationship between words included in the document stored in the document database, creating causal network data representing the extracted causal relationship between the words in a network structure,
From the causal network data, extract words that are causal with an interest word that is a word that the user is interested in,
Based on the first word and the second word when the interesting word is the first word and the extracted word is the second word, a headline of the document is created,
A computer that generates data for presenting a created document with the heading to a user via an output device.

The processor extracts an interest word of the user based on work state data indicating a work state of the user for a predetermined job and profile data indicating a profile of the user. Item 1. The computer according to item 1.

The processor creates user skill data indicating the skill level of the user for the predetermined job based on the profile data,
The computer according to claim 2, further comprising extracting an interest word of the user based on the worker skill level data.

When there are a plurality of interesting words, the processor selects an interesting word in which the words around the interesting word in the causal network data appear with the highest appearance frequency in the work state data. The computer according to claim 1, wherein the computer is an interesting word of the user.

The said processor produces the heading of the said document by the said 1st word, the said 2nd word, and the interpolation expression showing the causal relationship of the said 1st word and the said 2nd word. 1. The computer according to 1.

A document presentation method in a computer comprising a processor for executing a program, a memory for storing a program executed by the processor, and an interface connected to the processor, and connected to a document database for storing the document. ,
The processor is
A procedure for extracting a causal relationship between words included in a document stored in the document database, and creating causal network data representing the extracted causal relationship between the words in a network structure;
A procedure for extracting a word that is causal with an interesting word that is a word that the user is interested in from the causal network data;
Creating a document heading based on the first word and the second word when the interesting word is the first word and the extracted word is the second word;
A procedure for generating data for presenting the created document with the heading to a user via an output device;
The document presentation method characterized by performing.

A document presentation program executed on a computer connected to a document database for storing a document, comprising: a processor for executing the program; a memory for storing the program executed by the processor; and an interface connected to the processor. Because
A procedure for extracting a causal relationship between words included in a document stored in the document database, and creating causal network data representing the extracted causal relationship between the words in a network structure;
A procedure for extracting a word that is causal with an interesting word that is a word that the user is interested in from the causal network data;
Creating a document heading based on the first word and the second word when the interesting word is the first word and the extracted word is the second word;
A procedure for generating data for presenting the created document with the heading to a user via an output device;
Is executed by the computer.