JP2007073072A

JP2007073072A - Related document display device

Info

Publication number: JP2007073072A
Application number: JP2006315390A
Authority: JP
Inventors: Hiroshi Tsuda; 宏津田; Kanji Uchino; 寛治内野; Kunio Matsui; くにお松井
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1997-09-08
Filing date: 2006-11-22
Publication date: 2007-03-22
Anticipated expiration: 2018-03-27
Also published as: JP4348357B2

Abstract

PROBLEM TO BE SOLVED: To support an operator to retrieve a necessary document or a document group which may include the necessary document out of a set of documents including a large number of documents. SOLUTION: This related document display device includes a content estimation means 3504, an input means 3509, a retrieval engine means 3506 and a view generation means 3508. The content estimation means 3504 estimates the content of the document group from a document database 3503 comprising document groups having a reference relationship based on the document content and a posting pattern among authors to generate an index corresponding to a topic pattern of the content. The input means 3509 inputs a retrieval request for the document database from a user. The retrieval engine means 3506 retrieves a document of the document database. The view generation means 3508 generates one or more views by using the retrieval result from the retrieval engine means and the index and displays the one or more views on the display device 3510 by changing over them. COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、参照関係にある文書群を整理し、その文書群をいろいろな観点から表示することによって、ユーザによる必要な情報へのアクセスを支援する技術に関する。 The present invention relates to a technique for assisting a user to access necessary information by organizing a document group having a reference relationship and displaying the document group from various viewpoints.

パソコン通信又はコンピュータネットワーク上で運用される電子会議室や電子ニュース等において、順次蓄積される文書集合の中から必要な文書をより迅速かつ簡単に見つけ出したいという要請が、従来からある。 2. Description of the Related Art Conventionally, there has been a demand for finding a necessary document more quickly and easily from a collection of documents stored sequentially in an electronic conference room or electronic news operated on a personal computer communication or a computer network.

このような要請に対して、文書集合中の各文書のタイトルを作成日順に並び替え、その結果得られるタイトルリストをユーザに掲示するという従来技術が知られている。
また、文書集合を互いに参照関係のある文書から構成される文書群に分類し、文書群中の各文書のタイトルをインデントして表示することにより、各文書の参照関係を掲示する従来技術や、文書群中の各文書の番号をツリーで表示することにより各文書の参照関係を掲示する従来技術も知られている。 In response to such a request, a conventional technique is known in which the titles of the documents in the document set are rearranged in order of creation date, and a title list obtained as a result is posted to the user.
In addition, by classifying a document set into a document group composed of documents having a reference relationship with each other, and displaying indented titles of each document in the document group, a conventional technique for displaying the reference relationship of each document, There is also known a conventional technique for displaying the reference relationship of each document by displaying the number of each document in the document group as a tree.

更に、文書集合の中から、特定のキーワードを含む文書を全文検索し、その検索結果を羅列的に掲示する従来技術も知られている。 Furthermore, a conventional technique is also known in which a full text search is performed for a document including a specific keyword from a document set and the search results are displayed in a list.

しかし、これらの従来技術におけるような限定された情報表示のみでは、以下に示される問題点を解決することができなかった。
１．雑多な文書集合の中から必要な文書又は必要な文書が含まれているであろう文書群を見つけ出すためには、掲示される文書のタイトルに頼るしかない。タイトルは、必ずしも文書内容を正確に表わしているとは限らないため、正確な検索が困難である。 However, only the limited information display as in these prior arts cannot solve the following problems.
1. In order to find out a necessary document or a group of documents that may contain a necessary document from the miscellaneous document set, it is necessary to rely on the title of the posted document. Since the title does not necessarily accurately represent the document content, it is difficult to perform an accurate search.

２．タイトルのインデント表示や文書番号のツリー表示だけでは、文書群全体の構造を把握すること、及び文書群における話題の推移を把握することが困難である。
３．種々の観点から必要な文書にアクセスすることができない。 2. It is difficult to grasp the structure of the entire document group and the transition of topics in the document group only by the indent display of the title and the tree display of the document number.
3. Necessary documents cannot be accessed from various viewpoints.

４．検索結果が多数件ある場合に、更に絞り込み検索を実行するか検索結果のリストを１件１件チェックしなければ、必要な文書にアクセスすることができない。
一方、複数の特定の文書からキーワードを抽出し、共通のキーワードを含む各文書に対し自動的に他の文書へのリンクを設定する技術が、知られている。この従来技術は、特許の公知例や研究論文等の特定文書中で互いの文献を相互参照することを可能にすることによって、関連する複数の文書を効率的に読み広げることを可能にする。 4). If there are a large number of search results, it is not possible to access a necessary document unless further narrowing search is executed or the search result list is checked one by one.
On the other hand, a technique for extracting a keyword from a plurality of specific documents and automatically setting a link to another document for each document including a common keyword is known. This prior art makes it possible to efficiently read a plurality of related documents by making it possible to cross-reference each other's documents in a specific document such as a known example of a patent or a research paper.

しかし、このような従来技術は、関連する文書の参照を容易にすることを目的としており、電子会議室や電子ニュース等の文書集合からの、必要な文書又は必要な文書が含まれているであろう文書群の検索の支援に、適用することはできなかった。 However, such prior art is intended to facilitate the reference of related documents, and includes necessary documents or necessary documents from a set of documents such as an electronic conference room and electronic news. It could not be applied to support searching for a group of documents.

本発明の課題は、大量の文書が含まれる文書集合からの、必要な文書又は必要な文書が含まれているであろう文書群の検索を、支援することにある。 An object of the present invention is to support a search for a necessary document or a group of documents that may contain a necessary document from a document set including a large number of documents.

本発明は、上記課題を解決するため、参照関係を有する文書からなる文書群を表示する関連文書表示装置であって、参照関係を有する文書群からなる文書データベースから、文書内容及び作者間の投稿パターンに基づいて前記文書群の内容を推定して、該内容の話題パターンに対応するインデックスを生成する内容推定手段と、利用者からの前記文書データベースに対する検索要求を入力する入力手段と、前記文書データベースの文書を検索する検索エンジン手段と、該検索エンジン手段からの検索結果と前記インデックスを利用して１つ以上のビユーを生成し、該１つ以上のビューを切り替えて表示装置に表示するビュー生成手段と、を含むことを特徴とするものである。 In order to solve the above-mentioned problem, the present invention is a related document display device for displaying a document group consisting of documents having a reference relationship. Content estimation means for estimating the content of the document group based on a pattern and generating an index corresponding to the topic pattern of the content; input means for inputting a search request for the document database from a user; and the document A search engine means for searching for documents in a database, a view for generating one or more views using a search result from the search engine means and the index, and switching the one or more views to display on a display device Generating means.

なお、本発明は、コンピュータにより使用されたときに、上述の本発明の構成によって実現される機能と同様の機能をコンピュータに行わせるためのコンピュータ読出し可能記録媒体として構成することもできる。 The present invention can also be configured as a computer-readable recording medium for causing a computer to perform the same functions as those realized by the above-described configuration of the present invention when used by a computer.

本発明によれば、自動的に推定された話題と共に検索結果が表示されるため、検索結果のスレッド数が多い場合でも、利用者は検索結果の概要を容易に把握することが可能となる。 According to the present invention, since the search result is displayed together with the automatically estimated topic, the user can easily grasp the outline of the search result even when the number of threads of the search result is large.

また、スレッド中の文書量が多くても、同じ作者が何度も投稿している場合がある。本発明によれば、作者を中心に見せるビユーが提供されることにより、スレッド内のキーパーソンが把握可能となるだけでなく、スレッドの全体構造もコンパクトに表示することが可能となる。 Also, even if the amount of documents in the thread is large, the same author may post many times. According to the present invention, by providing a view showing mainly the author, not only the key person in the thread can be grasped, but also the entire structure of the thread can be displayed in a compact manner.

このように、本発明によれば、文書群に対して種々の観点からアクセスすることが可能となる。 As described above, according to the present invention, it is possible to access a document group from various viewpoints.

以下、図面を参照しながら本発明の実施の形態について詳細に説明する。
〔全体構成〕
図１は、本発明の実施の形態が対象とする、文書集合及び文書群の例を示す図である。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
〔overall structure〕
FIG. 1 is a diagram showing an example of a document set and a document group targeted by the embodiment of the present invention.

この例では、コンピュータネットワーク上に、ユーザが話題別に議論を行うフォーラムと呼ばれる仮想的な公開討論会場が設けられており、各フォーラムは、会議室と呼ばれる、更に細分化された話題を扱う複数の仮想的な会場に分類されている。ユーザがこの会議室に、発言文書を投稿（アップロード）することによって、議論が進行する。フォーラム及び会議室とも、サーバコンピュータ上のストレージエリアとして構成され、それらには、上述した分類基準に従って、文書が蓄積される。また、各会議室において、互いに参照関係を有する複数の文書からなる文書群がスレッドを構成する。 In this example, a virtual public discussion venue called a forum where users discuss by topic is provided on a computer network. Each forum is called a conference room, and a plurality of subdivided topics are handled. It is classified as a virtual venue. The discussion proceeds by the user posting (uploading) a comment document to the conference room. Both the forum and the conference room are configured as storage areas on the server computer, in which documents are stored according to the above-described classification criteria. In each conference room, a document group composed of a plurality of documents having a reference relationship with each other forms a thread.

ユーザが投稿する文書は、例えば、図２に示されるデータ構造を有しており、その文書の番号を示す文書番号、日付、タイトル、その文書が参照する文書の番号である参照文書番号、作者名（発言者名）等の文書の属性フィールドが記載されるヘッダ部と、文書の本体が記載される内容部とから構成されている。 The document posted by the user has, for example, the data structure shown in FIG. 2, and includes a document number indicating the number of the document, a date, a title, a reference document number that is the number of the document referenced by the document, It consists of a header portion in which an attribute field of a document such as a name (speaker name) is described, and a content portion in which the main body of the document is described.

本発明の実施の形態では、以下のような表示形態が可能となる。
１．キーワードビユー：会議室を構成する文書集合において、その文書集合中の各スレッド毎に、そのスレッドを構成する文書群からキーワードが抽出され、それらのキーワードが、それらの文書数及びそれらが含まれるスレッドのタイトルと合わせて、図２５に示される表示形態で表示される。 In the embodiment of the present invention, the following display forms are possible.
1. Keyword view: In a document set constituting a conference room, for each thread in the document set, keywords are extracted from a group of documents constituting the thread, and those keywords are the number of those documents and the thread in which they are included. The title is displayed in the display form shown in FIG.

キーワードビユーによって、ユーザは、キーワードを頼りにして、雑多な文書集合の中から必要な文書が含まれているであろうスレッド（文書群）を容易に見つけ出すことが可能となる。 The keyword view allows the user to easily find a thread (document group) that may contain a necessary document from a miscellaneous document set, relying on the keyword.

２．スレッドビュー：文書の参照関係、タイトル、作者名、及び行数が一目にわかる図２６に示される表示形態で、各スレッドを構成する文書群が表示される。
スレッドビューにより、スレッド全体の構造を把握し話題の推移を容易に把握することが可能となる。 2. Thread view: A document group constituting each thread is displayed in a display form shown in FIG. 26 in which the document reference relationship, title, author name, and number of lines can be seen at a glance.
With the thread view, it is possible to grasp the structure of the entire thread and easily grasp the transition of topics.

３．発言者ビュー：各文書のタイトルが、発言者（作者）毎に分類され、かつ発言者が発言の多い順にソートされ、同一発言者内では日付順で、図２７に示される表示形態で、表示される。 3. Speaker View: The title of each document is classified by the speaker (author) and the speakers are sorted in descending order of the speakers, and are displayed in the date order within the same speaker in the display form shown in FIG. Is done.

発言者及び発言日付という観点から、文書集合（会議室）内の文書を参照することが可能となる。
４．各ビューへの検索結果の反映：ユーザが指定した検索キーワードに関連する文書が、図３２又は図３３に示される表示形態で、キーワードビユー、スレッドビュー等の表示中で強調表示される。 It is possible to refer to documents in a document set (conference room) from the viewpoint of a speaker and a statement date.
4). Reflecting search results to each view: A document related to a search keyword designated by the user is highlighted in the display of keyword view, thread view, etc. in the display form shown in FIG.

この結果、より正確な文書の把握が可能となる。
５．各ビューの切替え機能：上述のキーワードビユー、スレッドビュー、及び発言者ビューが任意に切替え可能とされることにより、種々の観点から必要な文書にアクセス可能となる。 As a result, a more accurate document can be grasped.
5. Switching function of each view: Necessary documents can be accessed from various viewpoints by arbitrarily switching the above-described keyword view, thread view, and speaker view.

以上のような表示形態を可能とする本発明の実施の形態について、詳細に説明する。
図３及び図４は、本発明の実施の形態のシステム構成図である。
フォーラム／会議室内の文書群は、所定のサーバコンピュータ上の文書群データベース３０１として、蓄積される。 An embodiment of the present invention that enables the display form as described above will be described in detail.
3 and 4 are system configuration diagrams according to the embodiment of this invention.
The document group in the forum / meeting room is stored as a document group database 301 on a predetermined server computer.

文書群解析装置３０２は、文書群データベース３０１内の各会議室に対応する文書集合毎に、それに含まれる文書群の解析を行う。
集計装置３０３は、文書群解析装置３０２による解析結果に基づいて、メタインデックス３０４、スレッドインデックス３０５、及び索引ファイル４０４を生成する。 The document group analysis device 302 analyzes a document group included in each document set corresponding to each conference room in the document group database 301.
The aggregation device 303 generates a meta index 304, a thread index 305, and an index file 404 based on the analysis result by the document group analysis device 302.

表示装置３０６は、メタインデックス３０４とスレッドインデックス３０５を用いて、キーワードビユー、スレッドビュー、又は発言者ビューの何れかの表示形態で、文書群を表示する。 The display device 306 uses the meta index 304 and the thread index 305 to display a document group in any of the keyword view, thread view, and speaker view display forms.

また、文字列検索装置４０５は、ユーザによる検索語の指定に基づいて、索引ファイル４０４を用いながら文書群データベース３０１内の文書集合を構成する各文書に対して検索を実行する。表示装置３０６は、その検索結果を、キーワードビユー又はスレッドビューに反映させて表示する。 Further, the character string search device 405 executes a search for each document constituting the document set in the document group database 301 using the index file 404 based on the specification of the search word by the user. The display device 306 reflects the search result on the keyword view or thread view and displays it.

文書群解析装置３０２は、書式解析部４０１、構造解析部４０２、及び内容解析部４０３とから構成される。
書式解析部４０１は、文書群データベース３０１内の文書集合を構成する図２のデータ構造を有する各文書のヘッダ部から、文書番号、タイトル、作者名、日付、及び参照文書番号を抽出し、また、各文書の内容部の行数を算出し、それらを、集計装置３０３を経由して、図５に示されるデータ構造を有するメタインデックス３０４に登録する。 The document group analysis apparatus 302 includes a format analysis unit 401, a structure analysis unit 402, and a content analysis unit 403.
The format analysis unit 401 extracts the document number, title, author name, date, and reference document number from the header portion of each document having the data structure shown in FIG. Then, the number of lines in the content part of each document is calculated, and these are registered in the meta index 304 having the data structure shown in FIG.

構造解析部４０２は、書式解析部４０１が各文書から抽出した文書番号と参照文書番号に基づいて、各文書をスレッドを単位とする文書群に分類し、集計装置３０３を経由して、スレッド毎に、それを構成する文書の参照関係のリストであるスレッドインデックス３０５を作成する。 The structure analysis unit 402 classifies each document into a document group with a thread as a unit based on the document number and the reference document number extracted from each document by the format analysis unit 401, and passes through the aggregation device 303 for each thread. In addition, a thread index 305 is created which is a list of reference relationships of documents constituting the document.

図６は、スレッドインデックス３０５のデータ構造を示す図である。各スレッド毎に、ルート文書番号と、文書数と、スレッドの構造を示すリストとが登録される。リストは、
（親文書番号子文書番号／サブツリー子文書番号／サブツリー....）
という記述形式によって記述され、”子文書番号／サブツリー”の部分には、更に再帰的（リカーシブ）に、子リストを記述することができる。 FIG. 6 is a diagram illustrating the data structure of the thread index 305. For each thread, a root document number, the number of documents, and a list indicating the thread structure are registered. The list is
(Parent document number Child document number / Subtree Child document number / Subtree ....)
The child list can be further recursively described in the “child document number / subtree” part.

図６に例示される２つのスレッドの各リストにより表現される各参照関係は、同図の表の右側に示される如くである。
また、構造解析部４０２は、解析したスレッドを構成する文書群を、更にタイトルを同一とするサブ文書群に分類し、各サブ文書群に色番号を付与し、各サブ文書群に含まれる文書に対応する図５のデータ構造を有するメタインデックス３０４内のエントリに、その文書が属するサブ文書群に付与された色番号を登録する。 Each reference relationship represented by each list of two threads illustrated in FIG. 6 is as shown on the right side of the table of FIG.
The structure analysis unit 402 further classifies the document group constituting the analyzed thread into sub-document groups having the same title, assigns a color number to each sub-document group, and includes documents included in each sub-document group. Is registered in the entry in the meta index 304 having the data structure of FIG.

内容解析部４０３は、構造解析部４０２によって分類されたスレッド毎に、そのスレッドを構成する文書群を１つの結合文書ファイルにまとめ、その結合文書からキーワードを抽出する。日本語文書からキーワードを抽出する技術としては、種々の公知技術を採用することができる。この場合に、キーワード抽出の精度を向上させるために、ノイズとなる文字がパターンマッチングによって除去される。また、例えば、上位所定個数のキーワードのみが抽出される。 For each thread classified by the structure analysis unit 402, the content analysis unit 403 collects a group of documents constituting the thread into one combined document file, and extracts a keyword from the combined document. Various known techniques can be adopted as a technique for extracting a keyword from a Japanese document. In this case, in order to improve the accuracy of keyword extraction, the noise character is removed by pattern matching. Also, for example, only the upper predetermined number of keywords are extracted.

内容解析部４０３によって抽出された各スレッドのキーワードは、集計装置３０３を経由して、そのスレッドのルート文書に対応する図５のデータ構造を有するメタインデックス３０４のエントリに、登録される。 The keywords of each thread extracted by the content analysis unit 403 are registered in the entry of the meta index 304 having the data structure of FIG. 5 corresponding to the root document of the thread via the aggregation device 303.

また、内容解析部４０３は、スレッド毎に抽出したキーワードから、そのキーワードに含まれる索引語を抽出し、集計装置３０３を経由して、図７に示されるデータ構造を有する索引ファイル４０４を生成する。 Further, the content analysis unit 403 extracts index words included in the keywords from the keywords extracted for each thread, and generates an index file 404 having the data structure shown in FIG. .

この索引ファイル４０４は、前述したように、文字列検索装置４０５によって参照される。
〔文書群解析装置３０２の詳細説明〕
図８は、図４の文書群解析装置３０２内の書式解析部４０１及び構造解析部４０２が実現する制御動作を示す動作フローチャートである。 The index file 404 is referred to by the character string search device 405 as described above.
[Detailed Description of Document Group Analysis Device 302]
FIG. 8 is an operation flowchart showing control operations realized by the format analysis unit 401 and the structure analysis unit 402 in the document group analysis apparatus 302 of FIG.

まず、書式解析部４０１は、文書群データベース３０１から新たに登録された新規文書ファイルから文書データを１行ずつ読み込みながら、その文書ファイルのヘッダ部（図２参照）から、文書番号、タイトル、作者名、日付、及び参照文書番号を抽出する（ステップ８０１→８０２→８０３→８０１のループ）。 First, the format analysis unit 401 reads document data line by line from a newly registered new document file from the document group database 301, and reads the document number, title, author from the header part of the document file (see FIG. 2). The name, date, and reference document number are extracted (loop of steps 801 → 802 → 803 → 801).

書式解析部４０１は、ヘッダ部の抽出を終了すると、集計装置３０３を経由して、図５のデータ構造を有するメタインデックス３０４において新規エントリを生成し、そのエントリに、抽出した文書番号、タイトル、作者名、日付、及び参照文書番号を登録する（ステップ８０２→８０４）。 When finishing the extraction of the header part, the format analysis unit 401 generates a new entry in the meta index 304 having the data structure of FIG. 5 via the aggregation device 303, and the extracted document number, title, The author name, date, and reference document number are registered (step 802 → 804).

次に、書式解析部４０１は、上記新規文書ファイル内のヘッダ部以降の内容部（図２参照）から文書データを１行ずつ読み込みながら、文書末尾（ＥＯＦ：エンドオブファイル）が検出されるまで、内容部の行数をカウントする（ステップ８０５→８０６→８０７→８０５のループ）。 Next, the format analysis unit 401 reads document data line by line from the content part (see FIG. 2) after the header part in the new document file until the end of the document (EOF: end of file) is detected. The number of lines in the content part is counted (loop of steps 805 → 806 → 807 → 805).

書式解析部４０１は、文書末尾を検出すると、集計装置３０３を経由して、それまでにカウントした内容部の行数を、図５のデータ構造を有するメタインデックス３０４の、現在処理中の新規文書の文書番号に対応するエントリに、登録する（ステップ８０６→８０８）。 When the format analysis unit 401 detects the end of the document, the total number of lines of the content portion counted so far is calculated via the counting device 303, and the new document currently being processed in the meta index 304 having the data structure of FIG. Is registered in the entry corresponding to the document number (step 806 → 808).

なお、上述の行数のカウント処理において、他の文書から引用している行（例えば”> ”で始まる行）については、行数のカウントには算入しないことによって、その文書が実質的に発言している行数をカウントするように構成されてもよい。 In the above-mentioned line count processing, lines cited from other documents (for example, lines starting with “>”) are not included in the line count, so that the document is substantially remarked. It may be configured to count the number of lines being processed.

続いて、構造解析部４０２に制御が移り、構造解析部４０２は、まず、現在処理中の新規文書の文書番号を、集計装置３０３を経由して、図６のデータ構造を有するスレッドインデックス３０５に登録する（ステップ８０９）。 Subsequently, control is transferred to the structure analysis unit 402, and the structure analysis unit 402 first assigns the document number of the new document currently being processed to the thread index 305 having the data structure of FIG. Register (step 809).

図９は、上記ステップ８０９の登録動作を示す動作フローチャートである。
まず、構造解析部４０２は、現在処理中の新規文書が、或るスレッドのルート文書であるか否かを判定する（ステップ９０１）。具体的には、構造解析部４０２は、図８のステップ８０１〜８０３のループにおいて、現在処理中の新規文書から参照文書番号が検出されなかった場合に、その文書はルート文書であると判定する。 FIG. 9 is an operation flowchart showing the registration operation in step 809.
First, the structure analysis unit 402 determines whether or not the new document currently being processed is a root document of a certain thread (step 901). Specifically, when the reference document number is not detected from the new document currently being processed in the loop of steps 801 to 803 in FIG. 8, the structure analysis unit 402 determines that the document is the root document. .

構造解析部４０２は、現在処理中の新規文書が或るスレッドのルート文書であると判定した場合には、集計装置３０３を経由して、スレッドインデックス３０５において新規エントリを生成し、そのエントリに現在処理中の新規文書の文書番号をルート文書番号として登録する（ステップ９０１→９０２）。 If the structure analysis unit 402 determines that the new document currently being processed is the root document of a certain thread, the structure analysis unit 402 generates a new entry in the thread index 305 via the totalization device 303 and adds the current entry to the current entry. The document number of the new document being processed is registered as the root document number (steps 901 → 902).

構造解析部４０２は、ステップ９０２の処理の後、上記エントリの文書数を１に初期設定し（ステップ９０６）、図８のステップ８０９の処理を終了する。
一方、構造解析部４０２は、現在処理中の新規文書が或るスレッドのルート文書ではないと判定した場合には、現在処理中の新規文書のヘッダ部から抽出されている参照文書番号が含まれるスレッドインデックス３０５中のエントリに、その参照文書番号を親文書番号とするリストが存在するか否かを判定する（ステップ９０１→９０３）。 After the processing in step 902, the structure analysis unit 402 initializes the number of documents in the entry to 1 (step 906), and ends the processing in step 809 in FIG.
On the other hand, if the structure analysis unit 402 determines that the new document currently being processed is not the root document of a certain thread, the reference document number extracted from the header portion of the new document currently being processed is included. It is determined whether or not a list having the reference document number as the parent document number exists in the entry in the thread index 305 (step 901 → 903).

構造解析部４０２は、上述のエントリに、現在処理中の新規文書のヘッダ部から抽出されている参照文書番号を親文書番号とするリストが存在すると判定した場合には、現在処理中の新規文書のヘッダ部から抽出されている文書番号を、そのリストの子文書番号として登録する（ステップ９０３→９０５）。 If the structure analysis unit 402 determines that the list includes the reference document number extracted from the header portion of the new document currently being processed as the parent document number in the above entry, the new document currently being processed The document number extracted from the header part is registered as the child document number of the list (step 903 → 905).

一方、構造解析部４０２は、上述のエントリに、現在処理中の新規文書のヘッダ部から抽出されている参照文書番号を親文書番号とするリストが存在しないと判定した場合には、そのエントリに、その参照文書番号を親文書番号とするリストを生成した上で、現在処理中の新規文書のヘッダ部から抽出されている文書番号を、そのリストの子文書番号として登録する（ステップ９０３→９０４→９０５）。 On the other hand, if the structure analysis unit 402 determines that there is no list having the reference document number extracted from the header part of the new document currently being processed as the parent document number in the above-described entry, Then, after generating a list with the reference document number as the parent document number, the document number extracted from the header portion of the new document currently being processed is registered as a child document number of the list (steps 903 → 904). → 905).

構造解析部４０２は、ステップ９０５の処理の後、上述のエントリの文書数を更新（プラス１）し（ステップ９０６）、図８のステップ８０９の処理を終了する。
上記図９の動作フローチャートによって実現される制御動作の具体例につき、図１０の説明図を用いて説明する。この図は、図６のスレッドインデックス３０５において、ルート文書番号が”００１”であるスレッドのエントリのリストが生成される過程を示すものである。 After the processing in step 905, the structure analysis unit 402 updates the number of documents in the entry (plus 1) (step 906), and ends the processing in step 809 in FIG.
A specific example of the control operation realized by the operation flowchart of FIG. 9 will be described with reference to the explanatory diagram of FIG. This figure shows a process of generating a list of entries of a thread whose root document number is “001” in the thread index 305 of FIG.

まず、文書番号”００１”のルート文書が処理される時点で、図９のステップ９０１→９０２が実行されることにより、スレッドインデックス３０５において新規エントリが生成され、そのエントリに文書番号”００１”がルート文書番号として登録され（図１０の（１））、上記エントリの文書数が１に初期設定される。 First, when the root document with the document number “001” is processed, a new entry is generated in the thread index 305 by executing steps 901 → 902 in FIG. 9, and the document number “001” is added to the entry. It is registered as the root document number ((1) in FIG. 10), and the number of documents in the entry is initialized to 1.

次に、文書番号”００２”の文書が処理される時点で、図９のステップ９０１→９０３→９０４→９０５→９０６が実行されることにより、スレッドインデックス３０５内のルート文書番号”００１”のエントリにおいて、文書番号”００２”の文書から抽出された参照文書番号”００１”を親文書番号とするリストが生成された後（図１０の（２））、文書番号”００２”がそのリストの子文書番号として登録され（図１０の（３）の下線部）、上記エントリの文書数が２に更新される。 Next, when the document with the document number “002” is processed, the entry of the root document number “001” in the thread index 305 is executed by executing the steps 901 → 903 → 904 → 905 → 906 in FIG. In FIG. 10, after the list having the reference document number “001” extracted from the document with the document number “002” as the parent document number is generated ((2) in FIG. 10), the document number “002” is a child of the list. Registered as a document number (underlined part (3) in FIG. 10), the number of documents in the entry is updated to 2.

次に、文書番号”００３”の文書が処理される時点で、図９のステップ９０１→９０３→９０４→９０５→９０６が実行されることにより、スレッドインデックス３０５内のルート文書番号”００１”のエントリにおいて、文書番号”００３”の文書から抽出された参照文書番号”００２”を親文書番号とするリストが生成された後（図１０の（４）の下線部）、文書番号”００３”がそのリストの子文書番号として登録され（図１０の（５）の下線部）、上記エントリの文書数が３に更新される。 Next, when the document with the document number “003” is processed, the entry of the root document number “001” in the thread index 305 is executed by executing the steps 901 → 903 → 904 → 905 → 906 in FIG. In FIG. 10, after the list having the reference document number “002” extracted from the document with the document number “003” as the parent document number is generated (underlined part (4) in FIG. 10), the document number “003” is It is registered as a child document number of the list (underlined part (5) in FIG. 10), and the number of documents in the entry is updated to 3.

次に、文書番号”００４”の文書が処理される時点で、図９のステップ９０１→９０３→９０５→９０６が実行されることにより、スレッドインデックス３０５内のルート文書番号”００１”のエントリにおいて、文書番号”００４”の文書から抽出された参照文書番号”００１”を親文書番号とするリストの子文書番号として、文書番号”００４”が登録され（図１０の（６）の下線部）、上記エントリの文書数が４に更新される。 Next, when the document with the document number “004” is processed, the steps 901 → 903 → 905 → 906 in FIG. 9 are executed, so that the entry of the root document number “001” in the thread index 305 is The document number “004” is registered as the child document number of the list having the reference document number “001” extracted from the document with the document number “004” as the parent document number (the underlined portion of (6) in FIG. 10). The number of documents in the entry is updated to 4.

次に、文書番号”００５”の文書が処理される時点で、図９のステップ９０１→９０３→９０４→９０５→９０６が実行されることにより、スレッドインデックス３０５内のルート文書番号”００１”のエントリにおいて、文書番号”００５”の文書から抽出された参照文書番号”００４”を親文書番号とするリストが生成された後（図１０の（７）の下線部）、文書番号”００５”がそのリストの子文書番号として登録され（図１０の（８）の下線部）、上記エントリの文書数が５に更新される。 Next, when the document with the document number “005” is processed, the entry of the root document number “001” in the thread index 305 is executed by executing steps 901 → 903 → 904 → 905 → 906 in FIG. In FIG. 10, after the list having the reference document number “004” extracted from the document with the document number “005” as the parent document number is generated (underlined part (7) in FIG. 10), the document number “005” is It is registered as a child document number of the list (underlined part of (8) in FIG. 10), and the number of documents in the entry is updated to 5.

最後に、文書番号”００６”の文書が処理される時点で、図９のステップ９０１→９０３→９０４→９０５→９０６が実行されることにより、スレッドインデックス３０５内のルート文書番号”００１”のエントリにおいて、文書番号”００６”の文書から抽出された参照文書番号”００５”を親文書番号とするリストが生成された後（図１０の（９）の下線部）、文書番号”００６”がそのリストの子文書番号として登録され（図１０の（１０）の下線部）、上記エントリの文書数が６に更新される。 Finally, when the document with the document number “006” is processed, the entry of the root document number “001” in the thread index 305 is executed by executing steps 901 → 903 → 904 → 905 → 906 in FIG. In FIG. 10, after the list having the reference document number “005” extracted from the document with the document number “006” as the parent document number is generated (underlined part (9) in FIG. 10), the document number “006” is It is registered as a child document number of the list (underlined part of (10) in FIG. 10), and the number of documents in the entry is updated to 6.

以上説明した図８のステップ８０９の処理の後、構造解析部４０２は、現在処理中の新規文書について色番号を決定し、その色番号を図５のデータ構造を有するメタインデックス３０４中の上記新規文書の文書番号に対応するエントリに登録する処理を実行する（図８のステップ８１０）。 After the processing of step 809 in FIG. 8 described above, the structure analysis unit 402 determines a color number for the new document currently being processed, and the color number is used for the new index in the meta index 304 having the data structure in FIG. A process of registering in the entry corresponding to the document number of the document is executed (step 810 in FIG. 8).

図１１は上記ステップ８１０の登録動作を示す動作フローチャートである。なお、この登録動作では、図１２に示されるデータ構造を有するカラーテーブルが使用される。このテーブルは、特には図示しない記憶装置に記憶される。 FIG. 11 is an operation flowchart showing the registration operation in step 810. In this registration operation, a color table having the data structure shown in FIG. 12 is used. This table is stored in a storage device (not shown).

まず、構造解析部４０２は、現在処理中の新規文書が、或るスレッドのルート文書であるか否かを判定する（ステップ１１０１）。具体的には、構造解析部４０２は、図８のステップ８０１〜８０３のループにおいて、現在処理中の新規文書から参照文書番号が検出されなかった場合に、その文書はルート文書であると判定する。 First, the structure analysis unit 402 determines whether or not the new document currently being processed is a root document of a certain thread (step 1101). Specifically, when the reference document number is not detected from the new document currently being processed in the loop of steps 801 to 803 in FIG. 8, the structure analysis unit 402 determines that the document is the root document. .

構造解析部４０２は、現在処理中の新規文書が或るスレッドのルート文書であると判定した場合は、そのルート文書の文書番号に対応するエントリを、図１２のデータ構造を有するカラーテーブルに登録し、そのエントリに現在処理中の新規文書から抽出された文書番号及びタイトル（図８のステップ８０４参照）と、初期色番号を登録する（ステップ１１０１→１１０２→１１０３）。図１２の例では、ルート文書番号”００１”の色番号”＃１”に対応するエントリが登録され、そのタイトルはメイントピックとなり、また、そのエントリの文書番号フィールドには、当初はルート文書番号”００１”のみが登録される。 If the structure analysis unit 402 determines that the new document currently being processed is a root document of a certain thread, the structure analysis unit 402 registers an entry corresponding to the document number of the root document in the color table having the data structure of FIG. Then, the document number and title extracted from the new document currently processed (see step 804 in FIG. 8) and the initial color number are registered in the entry (steps 1101 → 1102 → 1103). In the example of FIG. 12, the entry corresponding to the color number “# 1” of the root document number “001” is registered, the title is the main topic, and the root document number is initially set in the document number field of the entry. Only “001” is registered.

その後、構造解析部４０２は、図５のデータ構造を有するメタインデックス３０４中の上記新規文書の文書番号に対応するエントリに、ステップ１１０３で登録した初期色番号を登録し（ステップ１１０３→１１１０）、図８のステップ８１０の処理を終了する。 After that, the structure analysis unit 402 registers the initial color number registered in step 1103 in the entry corresponding to the document number of the new document in the meta index 304 having the data structure of FIG. 5 (steps 1103 to 1110). The process of step 810 in FIG. 8 ends.

一方、構造解析部４０２は、現在処理中の新規文書が或るスレッドのルート文書ではないと判定した場合には、現在処理中の新規文書から抽出されたタイトル（図８のステップ８０４参照）が、”Ｒｅ：”等の参照記号を含んでいるか否かを判定する（ステップ１１０１→１１０４）。 On the other hand, if the structure analysis unit 402 determines that the new document currently being processed is not the root document of a certain thread, the title extracted from the new document currently being processed (see step 804 in FIG. 8). , “Re:” or the like is included (step 1101 → 1104).

構造解析部４０２は、現在処理中の新規文書から抽出されたタイトルが参照記号を含んでいると判定した場合はそのタイトルから参照記号を削除し（ステップ１１０４→１１０５）、現在処理中の新規文書から抽出されたタイトルが参照記号を含んではいないと判定した場合にはステップ１１０５は実行しない。 When the structure analysis unit 402 determines that the title extracted from the new document currently being processed includes a reference symbol, the structure analysis unit 402 deletes the reference symbol from the title (step 1104 → 1105), and the new document currently being processed. If it is determined that the title extracted from does not contain a reference symbol, step 1105 is not executed.

その後、構造解析部４０２は、図１２のデータ構造を有するカラーテーブル中の現在処理中の新規文書が属するスレッドに対応する何れかのエントリに、現在処理中の新規文書から抽出され参照記号を含まないタイトルと同じタイトルが登録されているか否かを判定する（ステップ１１０６）。現在処理中の新規文書が属するスレッドとそのルート文書番号は、図８のステップ８０９の処理において図６のデータ構造を有するスレッドインデックス３０５のエントリが決定される際に検出されるため、そのルート文書番号からカラーテーブル中のエントリが決定される。例えば、現在処理中の新規文書が文書番号”００２”の文書である場合には、図１２に示されるカラーテーブルにおいて、ルート文書番号”００１”に属するエントリが検出される。 After that, the structure analysis unit 402 includes a reference symbol extracted from the new document currently being processed in any entry corresponding to the thread to which the new document currently being processed belongs in the color table having the data structure of FIG. It is determined whether or not the same title as a non-existing title is registered (step 1106). Since the thread to which the new document currently being processed belongs and its root document number are detected when the entry of the thread index 305 having the data structure of FIG. 6 is determined in the processing of Step 809 of FIG. An entry in the color table is determined from the number. For example, if the new document currently being processed is a document with the document number “002”, an entry belonging to the root document number “001” is detected in the color table shown in FIG.

構造解析部４０２は、図１２のデータ構造を有するカラーテーブル中の現在処理中の新規文書が属するスレッドに対応する何れかのエントリに、現在処理中の新規文書から抽出され参照記号を含まないタイトルと同じタイトルが登録されていると判定した場合には、そのエントリの文書番号フィールドに、現在処理中の新規文書の文書番号を登録する（ステップ１１０６→１１０７）。例えば、現在処理中の新規文書が文書番号”００２”の文書である場合には、図１２に示されるカラーテーブルにおいて、ルート文書番号”００１”に属し色番号”＃１”が登録されているエントリの文書番号フィールドに、文書番号”００２”が登録される。 The structure analysis unit 402 extracts a title that is extracted from the new document being processed and does not include a reference symbol in any entry corresponding to the thread to which the new document currently being processed belongs in the color table having the data structure of FIG. If the same title is registered, the document number of the new document currently being processed is registered in the document number field of the entry (steps 1106 → 1107). For example, if the new document currently being processed is the document with the document number “002”, the color number “# 1” belonging to the root document number “001” is registered in the color table shown in FIG. The document number “002” is registered in the document number field of the entry.

その後、構造解析部４０２は、図５のデータ構造を有するメタインデックス３０４中の上記新規文書の文書番号に対応するエントリに、ステップ１１０７で登録が行われたカラーテーブル中のエントリに設定されている色番号を登録し（ステップ１１０７→１１１０）、図８のステップ８１０の処理を終了する。 Thereafter, the structure analysis unit 402 is set in the entry corresponding to the document number of the new document in the meta index 304 having the data structure of FIG. 5 as the entry in the color table registered in step 1107. The color number is registered (step 1107 → 1110), and the process of step 810 in FIG.

一方、構造解析部４０２は、図１２のデータ構造を有するカラーテーブル中の現在処理中の新規文書が属するスレッドに対応する何れのエントリにも、現在処理中の新規文書から抽出され参照記号を含まないタイトルと同じタイトルが登録されてはいないと判定した場合は、カラーテーブルにおいて上記スレッドに対応する新たなエントリを作成し（ステップ１１０８）、その作成したエントリに、そのスレッド内で新たな色番号と、現在処理中の新規文書から抽出された文書番号及びタイトル（図８のステップ８０４参照）を登録する（ステップ１１０６→１１０８→１１０９）。例えば、現在処理中の新規文書が文書番号”００３”の文書である場合には、図１２のカラーテーブルにおいて、ルート文書番号”００１”に属する新たなエントリが作成され、そのエントリに、色番号”＃２”と、文書番号”００３”の文書のタイトルと、文書番号”００３”とが登録される。このタイトルは、ルート文書番号”００１”のタイトルであるメイントピックに対して、サブトピック１となる。 On the other hand, the structure analysis unit 402 includes a reference symbol extracted from the new document currently being processed in any entry corresponding to the thread to which the new document currently being processed belongs in the color table having the data structure of FIG. If it is determined that the same title as the non-existing title is not registered, a new entry corresponding to the thread is created in the color table (step 1108), and a new color number in the thread is added to the created entry. Then, the document number and title (see step 804 in FIG. 8) extracted from the new document currently being processed are registered (steps 1106 → 1108 → 1109). For example, if the new document currently being processed is the document with the document number “003”, a new entry belonging to the root document number “001” is created in the color table of FIG. “# 2”, the title of the document with the document number “003”, and the document number “003” are registered. This title is subtopic 1 with respect to the main topic that is the title of the root document number “001”.

その後、構造解析部４０２は、図５のデータ構造を有するメタインデックス３０４中の上記新規文書の文書番号に対応するエントリに、ステップ１１０９でカラーテーブル中の新たなエントリに設定された新たな色番号を登録し（ステップ１１０９→１１１０）、図８のステップ８１０の処理を終了する。 After that, the structure analysis unit 402 sets the new color number set in the new entry in the color table in step 1109 to the entry corresponding to the document number of the new document in the meta index 304 having the data structure of FIG. Is registered (step 1109 → 1110), and the processing of step 810 in FIG.

内容解析部４０３は、図６のデータ構造を有するスレッドインデックス３０５を参照することにより、前述したように、スレッド毎に、そのスレッドを構成する文書群を１つの結合文書ファイルにまとめ、その結合文書からキーワードを抽出する。この結果、抽出された各スレッドのキーワードは、そのスレッドのルート文書に対応する図５のデータ構造を有するメタインデックス３０４のエントリに、登録される。 The content analysis unit 403 refers to the thread index 305 having the data structure shown in FIG. 6, and collects a group of documents constituting the thread into one combined document file for each thread as described above. Extract keywords from. As a result, the extracted keyword of each thread is registered in the entry of the meta index 304 having the data structure of FIG. 5 corresponding to the root document of the thread.

〔表示装置３０６の詳細説明〕
表示装置３０６は、前述したように、図５のデータ構造を有するメタインデックス３０４と図６のデータ構造を有するスレッドインデックス３０５を用いて、キーワードビユー、スレッドビュー、又は発言者ビューの何れかの表示形態で、文書群を表示することができる。 [Detailed Description of Display Device 306]
As described above, the display device 306 uses the meta index 304 having the data structure of FIG. 5 and the thread index 305 having the data structure of FIG. 6 to display any of the keyword view, thread view, and speaker view. A group of documents can be displayed in a form.

ここで例えば、図４のシステムが、ホームページの表示を制御するＷｅｂサーバに接続されるように構成されれば、ユーザは、パーソナルコンピュータ等の手元の端末上のＷｅｂブラウザアプリケーションから上記Ｗｅｂサーバに接続して特定のフォーラムの特定の会議室にログインした後に、所定の各ＧＵＩ（グラフィックユーザインタフェース）ボタンをマウス装置等でクリックすることによって、キーワードビユー、スレッドビュー、又は発言者ビューを切り替えて表示させることができる。 Here, for example, if the system of FIG. 4 is configured to be connected to a Web server that controls display of a home page, the user can connect to the Web server from a Web browser application on a terminal at hand such as a personal computer. Then, after logging in to a specific conference room in a specific forum, clicking a predetermined GUI (graphic user interface) button with a mouse device or the like switches the keyword view, thread view, or speaker view to be displayed. be able to.

より具体的には、表示装置３０６は、Ｗｅｂサーバに対して例えばＣＧＩ（コモンゲートウエイインタフェース）アプリケーションとして機能し、Ｗｅｂサーバから引き渡されたユーザからのリクエストに応答して、キーワードビユー、スレッドビュー、又は発言者ビュー等の各ビューを表現するＨＴＭＬ（ハイパーテキストマークアップ言語）による文書データを生成し、それをＷｅｂサーバに引き渡す。そして、これらのＨＴＭＬ文書データをＷｅｂサーバがユーザにインターネット等のコンピュータネットワークを経由して返信することにより、ユーザの端末上のＷｅｂブラウザアプリケーションに、上記ビューが表示される。 More specifically, the display device 306 functions as, for example, a CGI (Common Gateway Interface) application to the Web server, and responds to a request from the user delivered from the Web server in response to a keyword view, thread view, or Document data in HTML (Hyper Text Markup Language) representing each view such as a speaker view is generated and delivered to a Web server. Then, when the HTML server returns the HTML document data to the user via a computer network such as the Internet, the view is displayed on the Web browser application on the user terminal.

まず、表示装置３０６が実現するキーワードビユーの表示動作について説明する。
前述したようにキーワードビユーにおいては、スレッド毎に、そのスレッドを構成する文書群から抽出されているキーワードが、その文書群の文書数及びそのスレッドのタイトルと合わせて、図２５に示される表示形態で表示される。 First, a keyword view display operation realized by the display device 306 will be described.
As described above, in the keyword view, for each thread, the keyword extracted from the document group constituting the thread is displayed together with the number of documents in the document group and the title of the thread, as shown in FIG. Is displayed.

図１３は、表示装置３０６が実行するキーワードビユーの表示動作を示す動作フローチャートである。。まず、表示装置３０６は、図５のデータ構造を有するメタインデックス３０４のファイルを読み込む（ステップ１３０１）。 FIG. 13 is an operation flowchart showing a keyword view display operation executed by the display device 306. . First, the display device 306 reads a file of the meta index 304 having the data structure of FIG. 5 (step 1301).

次に、表示装置３０６は、メタインデックス３０４のファイルから１エントリずつデータを読み込みながら、ルート文書が登録されているエントリを検索する（ステップ１３０１→１３０２→１３０１のループ）。各エントリがルート文書が登録されているエントリであるか否かは、各エントリの参照文書番号フィールドの値が無効なデータ値であるか否かによって判定することができる。 Next, the display device 306 searches the entry in which the root document is registered while reading data from the meta index 304 file one entry at a time (step 1301 → 1302 → 1301 loop). Whether or not each entry is an entry in which a root document is registered can be determined by whether or not the value of the reference document number field of each entry is an invalid data value.

表示装置３０６は、ルート文書が登録されているエントリを検出すると、そのルート文書番号を、そのルート文書番号に対応する文書群データベース３０１内のルート文書を表示するためのアプリケーションへの統一されたアドレス情報であるＵＲＬ（Uniform Resource Locator）がＨＲＥＦ属性の値として指定されるアンカータグに変換する（ステップ１３０２→１３０３）。 When the display device 306 detects an entry in which the root document is registered, a unified address to the application for displaying the root document number in the document group database 301 corresponding to the root document number is displayed. A URL (Uniform Resource Locator) as information is converted into an anchor tag specified as a value of the HREF attribute (steps 1302 → 1303).

次に表示装置３０６は、図６のデータ構造を有するスレッドインデックス３０５において、上記ルート文書番号に対応するエントリを参照することにより、そのスレッドに含まれる文書数（子文書数）を取得する（ステップ１３０４）。 Next, the display device 306 refers to the entry corresponding to the root document number in the thread index 305 having the data structure shown in FIG. 6, thereby acquiring the number of documents (number of child documents) included in the thread (step). 1304).

そして、表示装置３０６は、図５のデータ構造を有するメタインデックス３０４において、上記ルート文書が登録されているエントリから、タイトル（メイントピック）と、キーワードとを抽出し、それらと、ステップ１３０３で変換されたアンカータグ形式のルート文書番号、及びステップ１３０４で取得した子文書数からなるデータ列を１テーブルレコードとして含むＨＴＭＬテーブル文書データを作成する（ステップ１３０５）。 Then, the display device 306 extracts the title (main topic) and the keyword from the entry in which the root document is registered in the meta index 304 having the data structure of FIG. 5, and converts them in step 1303. HTML table document data including a data string composed of the root document number in the anchor tag format and the number of child documents acquired in step 1304 as one table record is created (step 1305).

続いて、表示装置３０６は、メタインデックス３０４のファイルから文書末尾（ＥＯＦ）を検出するまで、上記ステップ１３０１〜１３０５の一連の処理を繰り返し実行することにより、各スレッド毎のＨＴＭＬテーブル文書データを作成する（ステップ１３０６→１３０１）。 Subsequently, the display device 306 creates HTML table document data for each thread by repeatedly executing a series of steps 1301 to 1305 until the end of the document (EOF) is detected from the meta index 304 file. (Step 1306 → 1301).

表示装置３０６は、メタインデックス３０４のファイルから文書末尾を検出すると（ステップ１３０６の判定がＹＥＳ）、最終的に得られたＨＴＭＬテーブル文書データをＷｅｂサーバに引き渡して、キーワードビユーの表示動作を終了する。この結果、ユーザの端末のＷｅｂブラウザアプリケーション上に、図２５に例示されるようなテーブル形式で、キーワードビユーが表示される。 When the display device 306 detects the end of the document from the file of the meta index 304 (YES in step 1306), the display device 306 hands over the finally obtained HTML table document data to the Web server, and ends the keyword view display operation. . As a result, the keyword view is displayed on the Web browser application of the user's terminal in the table format illustrated in FIG.

ユーザは、キーワードビユー上の各スレッド毎のキーワードを頼りにして、雑多な文書集合の中から必要な文書が含まれているであろうスレッドを容易に見つけ出すことが可能となる。 The user can easily find a thread that may contain a necessary document from a miscellaneous document set by relying on a keyword for each thread on the keyword view.

また、ユーザは、ルート文書に対応するアンカーをマウス装置等でクリックすることによって、所望のスレッドのルート文書に即座にアクセスすることができる。
上述のキーワードビユーの表示動作において、子文書数に応じて、各スレッドのテーブルレコードを色分けして表示するように構成されてもよい。これによって、ユーザは、スレッド毎の発言数を一目で判別することができる。 In addition, the user can immediately access the root document of a desired thread by clicking an anchor corresponding to the root document with a mouse device or the like.
In the above-mentioned keyword view display operation, the table record of each thread may be displayed in different colors according to the number of child documents. As a result, the user can determine the number of utterances for each thread at a glance.

続いて、表示装置３０６が実現するスレッドビューの表示動作について説明する。
前述したように、スレッドビューにおいては、文書の参照関係、タイトル、作者名、及び行数が一目にわかる図２６に示される表示形態で、各スレッドを構成する文書群が表示される。 Next, a thread view display operation realized by the display device 306 will be described.
As described above, in the thread view, a document group constituting each thread is displayed in the display form shown in FIG. 26 in which the document reference relationship, title, author name, and number of lines can be seen at a glance.

図２６において、スレッドの参照関係及び話題の推移が色付きツリーによって表示される。各ツリーのノードは、各文書に対応し、その文書の作者名の先頭文字（２バイト）とその文書の行数を用いて、作者名［行数］の形式で表示される。また、各ノードの前後には、”＊”、”＋”、”＝”、又は”．”等の記号が付される。これらの記号の意味は、下記の通りである。 In FIG. 26, thread reference relationships and topic transitions are displayed in a colored tree. The node of each tree corresponds to each document and is displayed in the form of author name [number of lines] using the first character (2 bytes) of the author name of the document and the number of lines of the document. Further, symbols such as “*”, “+”, “=”, or “.” Are attached before and after each node. The meaning of these symbols is as follows.

”＊” この記号が付される文書がルート文書である。
”＋” この記号が付される文書が参照している文書が他の文書によっても参照されている。 “*” The document with this symbol is the root document.
“+” A document referred to by a document to which this symbol is attached is also referred to by another document.

”＝” この記号が付される文書を参照している文書が存在する。
”．” この記号が付される文書を参照している文書が存在しない。また、図２６において、”ＭａｉｎＴｏｐｉｃ：”に続いてそのスレッドのルート文書のタイトルが表示され、”ＳｕｂＴｏｐｉｃ：”に続いてそのスレッド中に現れるルート文書のタイトル以外のタイトルが表示される。そして、各タイトルは色分けされ、各タイトルと同じタイトル（参照記号を除く）を有する文書に対応するノードは、そのタイトルの色と同じ色で表示される。 “=” There is a document that refers to a document with this symbol.
“.” There is no document referring to the document with this symbol. In FIG. 26, the title of the root document of the thread is displayed after “Main Topic:”, and the title other than the title of the root document appearing in the thread is displayed after “Sub Topic:”. Each title is color-coded, and a node corresponding to a document having the same title (excluding reference symbols) as each title is displayed in the same color as the title.

これによって、ユーザは、スレッド全体の構造を把握しスレッド内の話題の推移を一目で把握することが可能となる。
更に、各ノードはアンカーとして表示される。これにより、ユーザは、各ノードをマウス装置等によってクリックすることにより、そのノードに対応する文書に即座にアクセスすることができる。 As a result, the user can grasp the entire structure of the thread and grasp the transition of the topic in the thread at a glance.
Furthermore, each node is displayed as an anchor. Thereby, the user can immediately access a document corresponding to the node by clicking each node with a mouse device or the like.

図１４は、表示装置３０６が実行するスレッドビユーの表示動作を示す動作フローチャートである。
まず、表示装置３０６は、図６のデータ構造を有するスレッドインデックス３０５のファイルから、１つのスレッドに対応する１つのエントリ（１行）のリストと、そのスレッドに含まれる文書数を、読み込む（ステップ１４０１）。例えば、図６のデータ構造を有するスレッドインデックス３０５において、ルート文書番号”００１”に対応するリストとして、
(001 (002 003) (004 (005 006)))
が読み込まれ、文書数として”６”が読み込まれる。 FIG. 14 is an operation flowchart illustrating a thread view display operation executed by the display device 306.
First, the display device 306 reads a list of one entry (one line) corresponding to one thread and the number of documents included in the thread from the file of the thread index 305 having the data structure of FIG. 1401). For example, in the thread index 305 having the data structure of FIG. 6, as a list corresponding to the root document number “001”,
(001 (002 003) (004 (005 006)))
Is read and “6” is read as the number of documents.

次に、表示装置３０６は、読み込んだリストから、例えば図６の表の右側に示されるスレッドのツリー構造を復元する（ステップ１４０２）。このツリー構造を表現するために、表示装置３０６は、例えば図１５に示されるような配列データを生成する。 Next, the display device 306 restores, for example, the thread tree structure shown on the right side of the table of FIG. 6 from the read list (step 1402). In order to express this tree structure, the display device 306 generates array data as shown in FIG. 15, for example.

次に、表示装置３０６は、読み込んだリストの各ノードを構成する文書番号毎に、その文書番号に対応する図５のデータ構造を有するメタインデックス３０４のエントリを抽出し、そのエントリから、作者名、行数、色番号、及びタイトルを抽出する（ステップ１４０３）。これらの抽出されたデータは、上記各ノードに対応付けて記憶される。 Next, the display device 306 extracts, for each document number constituting each node of the read list, an entry of the meta index 304 having the data structure of FIG. 5 corresponding to the document number, and the author name is extracted from the entry. The number of lines, the color number, and the title are extracted (step 1403). These extracted data are stored in association with the respective nodes.

次に、表示装置３０６は、ステップ１４０１で読み込んだ文書数と、ステップ１４０３で抽出した各ノードの色番号とから、スレッドビューの先頭で表示される各タイトルの色を決定する（ステップ１４０４）。この動作は、各色番号に実際の色をマッピングする動作として実現される。 Next, the display device 306 determines the color of each title displayed at the head of the thread view from the number of documents read in step 1401 and the color number of each node extracted in step 1403 (step 1404). This operation is realized as an operation for mapping an actual color to each color number.

次に、表示装置３０６は、スレッドに含まれるルート文書のタイトルとその他のタイトルを、”ＭａｉｎＴｏｐｉｃ：”及び”ＳｕｂＴｏｐｉｃ：”に続けて表示するためのＨＴＭＬ文書を作成する。この場合に、各タイトルは、前述した構造解析部４０２が管理する図１２に示されるカラーテーブルの上記スレッドに属する各エントリから順次読み出され、同時に順次読み出される各色番号からステップ１４０４で決定された各色が算出され、その各色での表示が順次指定される。各色は、ＨＴＭＬ文書の色指定命令（ タグ等）によって指定される。 Next, the display device 306 creates an HTML document for displaying the title of the root document included in the thread and other titles after “Main Topic:” and “Sub Topic:”. In this case, each title is sequentially read from each entry belonging to the thread of the color table shown in FIG. 12 managed by the structure analysis unit 402 described above, and determined in step 1404 from each color number read sequentially. Each color is calculated, and display in each color is sequentially specified. Each color is designated by a color designation command ( tag or the like) of the HTML document.

最後に、表示装置３０６は、ステップ１４０２で復元したスレッドのツリー構造を示す配列データを構成する左端のノードの文書番号から順に処理することにより、そのツリー構造を表示するためのＨＴＭＬ文書を作成する（ステップ１４０６）。この場合、前述したように、表示装置３０６は、ステップ１４０３で抽出した各ノードの作者名、行数、及び色番号に基づいて、ツリー構造の各ノードの文書番号を、そのノードに対応する文書の作者名の先頭文字（２バイト）とその文書の行数とからなる表示データ、
作者名［行数］
に変換し、更に、その表示データをそのノードの色番号に対応する色で表示させるためのＨＴＭＬ文書データを生成する。色番号と実際の色との対応関係は、ステップ１４０４で決定された対応関係に従う。また、前述したように、表示装置３０６は、各ノードに対応する上記表示データの前後に、その接続関係に基づいて、”＊”、”＋”、”＝”、又は”．”等の記号を表示するためのＨＴＭＬ文書データを生成する。ここで、ツリー構造をそのままの形式で表示可能とするために、例えば、ＨＴＭＬにおける制御用タグであるプリフォーマットタグ <PRE>が使用される。更に、上記ノード毎の表示データは、そのノードに対応する文書群データベース３０１内の文書データを表示するためのアプリケーションへのＵＲＬがＨＲＥＦ属性の値として指定されるアンカータグとして生成される。 Finally, the display device 306 creates an HTML document for displaying the tree structure by processing sequentially from the document number of the leftmost node constituting the array data indicating the tree structure of the thread restored in step 1402. (Step 1406). In this case, as described above, the display device 306 determines the document number of each node in the tree structure based on the author name, the number of lines, and the color number of each node extracted in step 1403 as the document corresponding to the node. Display data consisting of the first character (2 bytes) of the author's name and the number of lines in the document,
Author name [Number of lines]
Furthermore, HTML document data for displaying the display data in a color corresponding to the color number of the node is generated. The correspondence relationship between the color number and the actual color follows the correspondence relationship determined in step 1404. In addition, as described above, the display device 306 has a symbol such as “*”, “+”, “=”, or “.” Before and after the display data corresponding to each node based on the connection relationship. HTML document data for displaying is generated. Here, in order to be able to display the tree structure in its original form, for example, a preformat tag <PRE>, which is a control tag in HTML, is used. Further, the display data for each node is generated as an anchor tag in which the URL to the application for displaying the document data in the document group database 301 corresponding to the node is designated as the value of the HREF attribute.

続いて、表示装置３０６は、スレッドインデックス３０５のファイルから文書末尾（ＥＯＦ）を検出するまで、上記ステップ１４０１〜１４０６の一連の処理を繰り返し実行することにより、各スレッド毎のビューデータを作成する（ステップ１４０７→１４０１）。 Subsequently, the display device 306 creates view data for each thread by repeatedly executing a series of processes in steps 1401 to 1406 until the end of the document (EOF) is detected from the file of the thread index 305 (see FIG. Step 1407 → 1401).

表示装置３０６は、スレッドインデックス３０５のファイルから文書末尾を検出すると（ステップ１４０７の判定がＹＥＳ）、最終的に得られたＨＴＭＬテーブル文書データをＷｅｂサーバに引き渡して、スレッドビユーの表示動作を終了する。この結果、ユーザの端末のＷｅｂブラウザアプリケーション上に、図２６に例示されるような形式で、スレッドビユーが表示される。 When the display device 306 detects the end of the document from the file of the thread index 305 (YES in step 1407), the display device 306 hands over the finally obtained HTML table document data to the Web server, and ends the thread view display operation. . As a result, the thread view is displayed in the format illustrated in FIG. 26 on the Web browser application of the user's terminal.

次に、表示装置３０６が実現する発言者ビューの表示動作につき説明する。前述したように、発言者ビューにおいては、各文書のタイトルが、発言者（作者）毎に分類され、かつ発言者が発言の多い順にソートされ、同一発言者内では日付順で、図２７に示される表示形態で、表示される。 Next, a speaker view display operation realized by the display device 306 will be described. As described above, in the speaker view, the titles of the respective documents are classified for each speaker (author), and the speakers are sorted in the descending order of the speakers. Displayed in the display form shown.

図１６は、表示装置３０６が実行する発言者ビユーの表示動作を示す動作フローチャートである。
表示装置３０６は、発言者ビューを実現するために、図１７のデータ構造を有する作者配列データを使用する。そして、表示装置３０６は、発言者ビューの表示開始時に、この作者配列データを初期化する（ステップ１６０１）。 FIG. 16 is an operation flowchart showing the display operation of the speaker view executed by the display device 306.
The display device 306 uses the author array data having the data structure of FIG. 17 in order to realize the speaker view. Then, the display device 306 initializes the author array data at the start of the display of the speaker view (step 1601).

次に、表示装置３０６は、図５のデータ構造を有するメタインデックス３０４のファイルから１つのエントリのデータを読み込む（ステップ１６０２）。
次に、表示装置３０６は、このエントリから抽出される作者名の作者が、作者配列データに含まれていない作者であるか否かを判定する（ステップ１６０３）。 Next, the display device 306 reads data of one entry from the file of the meta index 304 having the data structure of FIG. 5 (step 1602).
Next, the display device 306 determines whether or not the author of the author name extracted from this entry is an author not included in the author array data (step 1603).

表示装置３０６は、上記エントリから抽出される作者名の作者が、作者配列データに含まれていない作者である場合には、作者配列データに新しい作者項目を追加する（ステップ１６０３→１６０４）。表示装置３０６は、上記エントリから抽出される作者名の作者が、作者配列データに含まれている作者である場合には、ステップ１６０４の処理は実行しない。 When the author of the author name extracted from the entry is an author not included in the author array data, the display device 306 adds a new author item to the author array data (steps 1603 → 1604). The display device 306 does not execute the process of step 1604 when the author of the author name extracted from the entry is the author included in the author array data.

次に、表示装置３０６は、作者配列データ中の該当する作者項目に、上記エントリから抽出される文書番号を登録する（ステップ１６０５）。
続いて、表示装置３０６は、メタインデックス３０４のファイルから文書末尾（ＥＯＦ）を検出するまで、上記ステップ１６０２〜１６０５の一連の処理を繰り返し実行することにより、メタインデックス３０４に登録されている全ての文書番号を、作者別に作者配列データに登録する。 Next, the display device 306 registers the document number extracted from the entry in the corresponding author item in the author array data (step 1605).
Subsequently, the display device 306 repeatedly executes a series of processes in steps 1602 to 1605 until the end of the document (EOF) is detected from the file of the meta index 304, whereby all of the registrations in the meta index 304 are performed. The document number is registered in the author arrangement data for each author.

表示装置３０６は、メタインデックス３０４のファイルから文書末尾を検出すると（ステップ１６０６の判定がＮＯ）、作者配列データ中の各作者項目を、それぞれの項目に登録されている文書番号の数、即ち各作者毎の発言文書数に基づいてソートする（ステップ１６０７）。 When the display device 306 detects the end of the document from the file of the meta index 304 (NO in step 1606), the display device 306 sets each author item in the author array data to the number of document numbers registered in each item, that is, each item. Sorting is performed based on the number of utterance documents for each author (step 1607).

続いて、表示装置３０６は、作者配列データ中の同一作者項目内で、文書番号を、それに対応するメタインデックス３０４中のエントリから抽出される日付に基づいてソートする（ステップ１６０８）。 Subsequently, the display device 306 sorts the document numbers in the same author item in the author arrangement data based on the date extracted from the corresponding entry in the meta index 304 (step 1608).

最後に、表示装置３０６は、上記ステップ１６０７及び１６０８でのソートの結果得られる作者配列データの各作者項目毎に、作者名と、その項目内の各文書番号に対応するメタインデックス３０４中のエントリから抽出される日付及びタイトルを表示するためのＨＴＭＬテーブル文書データを生成し、それをＷｅｂサーバに引き渡して、発言者ビューの表示動作を終了する。この結果、ユーザの端末のＷｅｂブラウザアプリケーション上に、図２７に例示されるようなテーブル形式で、発言者ビューが表示される。 Finally, for each author item of the author array data obtained as a result of the sorting in steps 1607 and 1608, the display device 306 includes an author name and an entry in the meta index 304 corresponding to each document number in the item. HTML table document data for displaying the date and title extracted from is generated, delivered to the Web server, and the display operation of the speaker view is terminated. As a result, the speaker view is displayed in the table format illustrated in FIG. 27 on the Web browser application of the user terminal.

ユーザは、発言者ビュー上で、発言者及び発言日付という観点から、文書集合（会議室）内の文書を参照することが可能となる。
また、或る発言者の発言を時間を追って参照したり、会議室内で多くの発言をするリーダー的な発言者を一目で確認することができる。 On the speaker view, the user can refer to the documents in the document set (conference room) from the viewpoint of the speaker and the statement date.
In addition, it is possible to refer to a speaker's speech over time, or to confirm at a glance a leader speaker who makes a lot of speech in the conference room.

〔表示装置３０６の他の表示態様〕
次に、上記各ビューの表示動作以外に表示装置３０６が実現する各表示動作の態様について説明する。 [Other Display Modes of Display Device 306]
Next, aspects of each display operation realized by the display device 306 in addition to the display operation of each view will be described.

まず、表示装置３０６が実現する発言内容表示の動作につき説明する。前述したように、ユーザは、キーワードビユーにおけるそれぞれのスレッド上のアンカー又はスレッドビューにおける各ノード上のアンカーを、マウス装置等でクリックすることにより、各スレッドのルート文書又は各ノードに対応する文書等に、即座にアクセスすることができる。 First, an operation of displaying the message content realized by the display device 306 will be described. As described above, the user clicks the anchor on each thread in the keyword view or the anchor on each node in the thread view with the mouse device or the like, so that the root document of each thread or the document corresponding to each node, etc. Can be accessed immediately.

ユーザによってこれらの操作が実行された場合には、Ｗｅｂサーバから指示によって、表示装置３０６によって実行される図１８に示される動作フローチャートの処理が例えばＣＧＩとして起動される。この場合、この処理には、ユーザによって指定されたアンカータグに含まれる文書番号の情報が引き渡される。 When these operations are executed by the user, the process of the operation flowchart shown in FIG. 18 executed by the display device 306 is started as CGI, for example, according to an instruction from the Web server. In this case, the document number information included in the anchor tag designated by the user is delivered to this process.

この結果まず、表示装置３０６は、上記文書番号の情報を読み込んだ後（ステップ１８０１）、ヘッダ部に上記読み込んだ文書番号と同じ文書番号を含んでいる文書ファイルを読み込むまで、文書群データベース３０１からの文書ファイルの読込みを行う（ステップ１８０２→１８０３→１８０２のループ）。 As a result, the display device 306 first reads the document number information (step 1801) and then reads from the document group database 301 until a document file containing the same document number as the read document number is read in the header portion. The document file is read (loop of steps 1802 → 1803 → 1802).

表示装置３０６は、ヘッダ部に上記読み込んだ文書番号と同じ文書番号を含んでいる文書ファイルを読み込むと（ステップ１８０３の判定がＹＥＳ）、新しい文書のヘッダ部を読み込むまで、ステップ１８０４〜１８０９のループにより、上記文書ファイルから１行ずつデータを読み込み、そのデータを１行分のＨＴＭＬ文書データに変換し、そのＨＴＭＬ文書データをＷｅｂサーバに出力する（ステップ１８０８）。 When the display device 306 reads a document file that includes the same document number as the read document number in the header portion (YES in step 1803), the display device 306 loops steps 1804 to 1809 until the header portion of a new document is read. Thus, data is read line by line from the document file, the data is converted into HTML document data for one line, and the HTML document data is output to the Web server (step 1808).

この場合に、各行のデータが他の文書等へのＵＲＬを含んでいる場合には、表示装置３０６は、そのデータを上記ＵＲＬがＨＲＥＦ属性の値として指定されるアンカータグに変換した上で出力する（ステップ１８０４→１８０５）。 In this case, when the data of each line includes a URL to another document or the like, the display device 306 converts the data into an anchor tag in which the URL is designated as the value of the HREF attribute, and outputs the converted data. (Step 1804 → 1805).

この結果、ユーザは、発言内容の表示中のアンカーを更にマウス装置等によってクリックすることにより、更に他のリソースにジャンプすることができる。
また、各行のデータが他の文書の行を引用したコメント行である場合には、表示装置３０６は、そのデータの色を変換するタグを追加した上で出力する（ステップ１８０６→１８０７）。 As a result, the user can jump to another resource by clicking on the anchor whose message content is being displayed with the mouse device or the like.
If the data of each line is a comment line quoting a line of another document, the display device 306 adds a tag for converting the color of the data and outputs the result (steps 1806 → 1807).

この結果、ユーザは、コメント行を一目で判別することができる。表示装置３０６は、該当する文書データの出力処理を終了すると、上記文書を含むスレッドのツリー構造を表示するＨＴＭＬ文書データを生成し出力して、発言内容表示の動作を終了する（ステップ１８０９→１８１０）。この処理は、前述した図１４の動作フローチャートで示されるスレッドビューの表示動作と同様にして実現できる。 As a result, the user can distinguish the comment line at a glance. When the output processing of the corresponding document data is finished, the display device 306 generates and outputs HTML document data that displays the tree structure of the thread including the document, and finishes the message content display operation (steps 1809 to 1810). ). This process can be realized in the same manner as the thread view display operation shown in the operation flowchart of FIG.

以上の表示動作の結果、ユーザの端末のＷｅｂブラウザアプリケーション上には、例えば図２８に示されるように、表示画面の上半分に発言内容が表示され、表示画面の下半分にはその発言内容の文書を含むスレッドのツリー構造が表示される。なお、この表示画面には、図２８に示されるように、キーワードビユーやスレッドビューを表示させるためのアンカーや、検索を実行するためのアンカー等を同時に表示させることもできる。 As a result of the above display operation, the content of the message is displayed on the upper half of the display screen on the Web browser application of the user terminal, as shown in FIG. 28, for example. A tree structure of threads containing the document is displayed. On this display screen, as shown in FIG. 28, an anchor for displaying a keyword view and a thread view, an anchor for executing a search, and the like can be displayed at the same time.

これらのビューの切替え機能により、例えば、キーワードビユー → スレッドビュー → 発言内容表示 → 発言者ビュー発言内容表示 → スレッドビュー → ・・・というように、会議室内の文書（発言）をユーザの嗜好に応じて横断的に参照してゆくことが可能となる。 With these view switching functions, for example, Keyword View → Thread View → Speech Content Display → Speaker View Speech Content Display → Thread View → ・・・ According to user preference It is possible to refer to them cross-sectionally.

次に、表示装置３０６が実現する作者別／日付別色分け表示の動作につき説明する。図１９は、その動作を示す動作フローチャートである。
まず、表示装置３０６は、メタインデックス３０４及びスレッドインデックス３０５に基づいて、図２０(a) に示されるように作者項目毎に文書番号が分類された作者配列データと、図２０(b) に示されるように日付項目毎に文書番号が分類された日付配列データとを予め作成する。これらの作成処理の詳細は省略するが、前述した図１６の動作フローチャートと同様の処理によって実現できる。そして、作者配列データ中の各作者項目又は日付配列データ中の各日付項目に、それぞれ異なる色が割り当てられる。この色の割当ては、作者項目毎の作者の総数又は日付項目毎の日付の総数から決定される。 Next, an operation of color-coded display by author / date realized by the display device 306 will be described. FIG. 19 is an operation flowchart showing the operation.
First, the display device 306 displays, based on the meta index 304 and the thread index 305, author array data in which document numbers are classified for each author item as shown in FIG. 20 (a), and FIG. 20 (b). As described above, date arrangement data in which document numbers are classified for each date item is created in advance. Although details of these creation processes are omitted, they can be realized by a process similar to the operation flowchart of FIG. A different color is assigned to each author item in the author array data or each date item in the date array data. This color assignment is determined from the total number of authors per author item or the total number of dates per date item.

次に、表示装置３０６は、ユーザの指定に基づく項目選択ボタン情報をＷｅｂサーバを経由して取得し、作者ボタンが押されたか日付選択ボタンが押されたかを判定する（ステップ１９０２、１９０４、図２９参照）。 Next, the display device 306 acquires item selection button information based on the user's designation via the Web server, and determines whether the author button or the date selection button has been pressed (steps 1902 and 1904, FIG. 29).

表示装置３０６は、作者ボタンが押されたと判定した場合には、図２０(a) に示される作者配列データを参照することにより、スレッドツリーの表示データを作成し出力する（ステップ１９０２→１９０３）。この処理は、前述した図１４の動作フローチャートと同様の処理によって実現されるが、この場合に、ツリーの各ノードは、そのノードに対応する作者名に対応する作者配列データ中の作者項目に割当てられている色で表示される。 If the display device 306 determines that the author button has been pressed, the display device 306 creates and outputs thread tree display data by referring to the author array data shown in FIG. 20A (steps 1902 → 1903). . This process is realized by a process similar to the operation flowchart of FIG. 14 described above. In this case, each node of the tree is assigned to the author item in the author array data corresponding to the author name corresponding to the node. The displayed color is displayed.

一方、表示装置３０６は、日付選択ボタンが押されたと判定した場合には、図２０(b) に示される日付配列データを参照することにより、スレッドツリーの表示データを作成し出力する（ステップ１９０４→１９０５）。この処理も、前述した図１４の動作フローチャートと同様の処理によって実現されるが、この場合に、ツリーの各ノードは、そのノードに対応する日付に対応する日付配列データ中の日付項目に割当てられている色で表示される。 On the other hand, when it is determined that the date selection button has been pressed, the display device 306 creates and outputs thread tree display data by referring to the date array data shown in FIG. 20B (step 1904). → 1905). This process is also realized by a process similar to the operation flowchart of FIG. 14 described above. In this case, each node of the tree is assigned to a date item in the date array data corresponding to the date corresponding to the node. Displayed in the color.

以上の表示動作の結果、例えばユーザが作者ボタンを押した場合には、ユーザの端末のＷｅｂブラウザアプリケーション上には、例えば図２９に示されるように、作者別に色分けされたスレッドのツリーが表示され、ユーザは同一の作者の文書を一目で確認することができる。 As a result of the above display operation, for example, when the user presses the author button, a tree of threads color-coded by author is displayed on the Web browser application of the user's terminal as shown in FIG. 29, for example. The user can check the document of the same author at a glance.

次に、表示装置３０６が実現するスレッドビューを使った検索結果の強調表示の動作につき説明する。図２１はその動作を示す動作フローチャートである。
まず、表示装置３０６は、検索後入力フォーム画面を表示するためのＨＴＭＬ文書データを生成し出力する（ステップ２１０１）。この結果、ユーザの端末のＷｅｂブラウザアプリケーション上には、例えば図３０に示されるような検索入力フォーム画面が表示される。ユーザは、この検索入力フォームに検索語を入力して検索の実行を指定する。 Next, the search result highlighting operation using the thread view realized by the display device 306 will be described. FIG. 21 is an operation flowchart showing the operation.
First, the display device 306 generates and outputs HTML document data for displaying the post-search input form screen (step 2101). As a result, a search input form screen as shown in FIG. 30, for example, is displayed on the Web browser application of the user terminal. The user inputs a search term in this search input form and designates execution of the search.

上記検索入力フォームに入力された検索語は、Ｗｅｂサーバを経由して文字列検索装置４０５（図４）に引き渡される。文字列検索装置４０５は、ユーザによる検索語の指定に基づいて、索引ファイル４０４を用いながら文書群データベース３０１内の指定されたスレッドを構成する各文書に対して全文検索を実行し、その検索語を含む文書番号を出力する（ステップ２１０２、２１０３）。 The search term input in the search input form is delivered to the character string search device 405 (FIG. 4) via the Web server. The character string search device 405 performs a full text search on each document constituting the specified thread in the document group database 301 using the index file 404 based on the specification of the search word by the user, and the search word Is output (steps 2102 and 2103).

表示装置３０６は、上記検索語を含む文書番号を受け取ると、その文書番号を含むスレッドのツリー構造を表示するＨＴＭＬ文書データを、前述した図１４の動作フローチャートと同様の処理によって表示する。この場合に、表示装置３０６は、上記文書番号を含むノードの色を強調色に指定する（ステップ２１０４、２１０５）。 When the display device 306 receives the document number including the search term, the display device 306 displays the HTML document data displaying the tree structure of the thread including the document number by the same processing as the operation flowchart of FIG. In this case, the display device 306 designates the color of the node including the document number as a highlight color (steps 2104 and 2105).

この結果、ユーザの端末のＷｅｂブラウザアプリケーション上において、例えば図３１に示されるような検索結果に基づくスレッドビューの強調表示が実現される。これにより、ユーザは、スレッドの構造を把握しつつ、検索を実行することができる。 As a result, the thread view highlighting display based on the search result as shown in FIG. 31, for example, is realized on the Web browser application of the user terminal. Thereby, the user can execute a search while grasping the thread structure.

次に表示装置３０６が実現するキーワードビユーを使った検索結果の強調表示の動作につき説明する。図２２はその動作を示す動作フローチャートである。
まず、表示装置３０６は、図２１のステップ２１０１の場合と同様に、検索後入力フォーム画面を表示するためのＨＴＭＬ文書データを生成し出力する（ステップ２２０１）。ユーザは、この検索入力フォームに検索語を入力して検索の実行を指定する。 Next, the search result highlighting operation using the keyword view realized by the display device 306 will be described. FIG. 22 is an operation flowchart showing the operation.
First, the display device 306 generates and outputs HTML document data for displaying the post-search input form screen, as in step 2101 of FIG. 21 (step 2201). The user inputs a search term in this search input form and designates execution of the search.

上記検索入力フォームに入力された検索語は、Ｗｅｂサーバを経由して文字列検索装置４０５（図４）に引き渡される。文字列検索装置４０５は、ユーザによる検索語の指定に基づいて、索引ファイル４０４を用いながら文書群データベース３０１内の指定された会議室を構成する各文書に対して全文検索を実行し、その検索語を含む文書番号を出力する（ステップ２２０２、２２０３）。 The search term input in the search input form is delivered to the character string search device 405 (FIG. 4) via the Web server. The character string search device 405 performs a full-text search on each document constituting the designated conference room in the document group database 301 using the index file 404 based on the specification of the search word by the user, and the search The document number including the word is output (steps 2202 and 2203).

表示装置３０６は、上記検索語を含む文書番号を受け取ると、まず、図６のデータ構造を有するスレッドインデックス３０５を参照して、上記文書番号を含むエントリに対応するルート文書番号を抽出する（ステップ２２０４）。 Upon receiving the document number including the search term, the display device 306 first extracts the root document number corresponding to the entry including the document number with reference to the thread index 305 having the data structure of FIG. 2204).

続いて、表示装置３０６は、指定された会議室に関するキーワードビユーを表示するＨＴＭＬ文書データを、前述した図１３の動作フローチャートと同様の処理によって表示する。この場合に、表示装置３０６は、ステップ２２０４で抽出されたルート文書番号に対応するスレッドのタイトル又はその表示エリア全体の色を強調色に指定し、更に、表示されるキーワード中に検索語が含まれている場合には、そのキーワードも強調色に指定する（ステップ２２０５、２２０６、２２０７）。 Subsequently, the display device 306 displays the HTML document data for displaying the keyword view related to the designated conference room by the same processing as the operation flowchart of FIG. 13 described above. In this case, the display device 306 designates the title of the thread corresponding to the root document number extracted in step 2204 or the color of the entire display area as a highlight color, and further includes a search word in the displayed keyword. If so, the keyword is also designated as the highlight color (steps 2205, 2206, 2207).

この結果、ユーザの端末のＷｅｂブラウザアプリケーション上において、例えば図３２に示されるような検索結果に基づくキーワードビユーの強調表示が実現される。これにより、ユーザは、検索語を含むスレッドを一目で把握することができる。 As a result, the keyword view highlighting display based on the search result as shown in FIG. 32, for example, is realized on the Web browser application of the user terminal. Thereby, the user can grasp | ascertain the thread | sled containing a search word at a glance.

なお、表示装置３０６は、検索結果の文書番号とそれに対応するタイトルを、例えば図３３に示されるように羅列して表示するように構成することも可能である。
最後に、表示装置３０６が実現するサブトピック毎のキーワードビユーの表示動作について説明する。 The display device 306 can also be configured to display the document number of the search result and the title corresponding to the search result, for example, as shown in FIG.
Finally, a keyword view display operation for each subtopic realized by the display device 306 will be described.

前述したキーワードビユーは、スレッド毎にキーワードを表示するものであった。これに対して、サブトピック毎のキーワードビユーでは、１つのスレッド内のサブトピック毎に、キーワードを抽出して表示することができる。 The keyword view described above displays a keyword for each thread. In contrast, in the keyword view for each subtopic, keywords can be extracted and displayed for each subtopic within one thread.

この動作において、表示装置３０６は、図２４のデータ構造を有するサブトピックインデックスを使用する。サブトピックインデックスは、図１２に示されるカラーテーブルのデータ構造に対して、キーワードフィールドが追加されたデータ構造を有する。 In this operation, the display device 306 uses a subtopic index having the data structure of FIG. The subtopic index has a data structure in which a keyword field is added to the data structure of the color table shown in FIG.

サブトピックインデックスは、実質的には前述したカラーテーブルを置き換えるものであるため、サブトピックインデックスにおけるキーワードフィールド以外のフィールドの内容は、構造解析部４０２による前述した図８のステップ８１０の処理によって予め登録されている。この場合、図８のステップ８１０の処理の説明において前述したように、カラーテーブルであるサブトピックインデックスには、ルート文書番号毎（スレッド毎）に、それに含まれるルート文書のタイトルを示すメイントピックと、それ以外の文書のタイトルを示すサブトピックのそれぞれに対応するエントリが得られる。表示装置３０６は、この登録内容を利用する。 Since the subtopic index substantially replaces the color table described above, the contents of the fields other than the keyword field in the subtopic index are registered in advance by the process of step 810 in FIG. Has been. In this case, as described above in the description of the processing of step 810 in FIG. 8, the subtopic index which is a color table includes a main topic indicating the title of the root document included in each root document number (for each thread). , Entries corresponding to the subtopics indicating the titles of the other documents are obtained. The display device 306 uses this registered content.

図２３は、表示装置３０６が実現するサブトピックからのキーワード抽出の制御を示す動作フローチャートである。
まず、表示装置３０６は、各スレッドについて、サブトピックインデックス内のそのスレッドに含まれる各エントリに登録されている文書番号に基づいて、メイントピック及びサブトピック単位で、それぞれに属する文書群を各結合文書ファイルにまとめ（ステップ２３０１）、その結果得られる各結合文書ファイルを内容解析部４０３（図４）に入力する（ステップ２３０２）。 FIG. 23 is an operation flowchart illustrating keyword extraction control from subtopics realized by the display device 306.
First, for each thread, the display device 306 combines each group of documents belonging to each main topic and subtopic based on the document number registered in each entry included in that thread in the subtopic index. The document files are collected (step 2301), and the resulting combined document files are input to the content analysis unit 403 (FIG. 4) (step 2302).

内容解析部４０３は、各結合文書ファイル別にキーワードを抽出し、その結果を表示装置３０６に返す。表示装置３０６は、内容解析部４０３から返された各結合文書ファイル別のキーワードを、サブトピックインデックス内の上記各結合文書ファイルに対応するエントリのキーワードフィールドに登録する（ステップ２３０３）。 The content analysis unit 403 extracts a keyword for each combined document file and returns the result to the display device 306. The display device 306 registers the keyword for each combined document file returned from the content analysis unit 403 in the keyword field of the entry corresponding to each combined document file in the subtopic index (step 2303).

以上のようにして、各スレッドについて、メイントピック及びサブトピック単位で、それぞれに属する文書群からキーワードが抽出される。
その後は、表示装置３０６は、サブトピックインデックスの内容に基づいて、ユーザにより指定されたスレッドに関して、そのスレッドのメイントピック及びサブトピック単位で、それぞれのタイトルとそれぞれに属するキーワードを表示するためのＨＴＭＬ文書データを生成し出力する。 As described above, keywords are extracted from the document group belonging to each thread for each main topic and subtopic.
Thereafter, the display device 306 displays HTML for displaying the respective titles and keywords belonging to the main topic and the subtopic of the thread specified by the user based on the contents of the subtopic index. Generate and output document data.

この結果、ユーザの端末のＷｅｂブラウザアプリケーション上には、例えば、図３４に示されるような形式で、サブトピック毎のキーワードビユーが表示される。これにより、ユーザは、キーワードによるより精密なトピックの絞込みを行うことができる。 As a result, the keyword view for each subtopic is displayed on the Web browser application of the user's terminal, for example, in the format shown in FIG. Thereby, the user can narrow down the topic more precisely by the keyword.

〔本発明の他の実施の形態（第２の実施の形態）〕
次に、本発明の他の実施の形態（以下、第２の実施の形態という）について説明する。
〔本発明の第２の実施の形態が実現する機能〕
まず、第２の実施の形態では、以下の３つの機能が実現される。 [Another embodiment of the present invention (second embodiment)]
Next, another embodiment of the present invention (hereinafter referred to as a second embodiment) will be described.
[Functions realized by the second embodiment of the present invention]
First, in the second embodiment, the following three functions are realized.

１．狭い画面の中でのスレッドの全体構造の把握機能：
スレッドのツリーが縮退（curtail ）させられることにより、ある大きさの画面内でツリー構造の全体表示が可能となる。第２の実施の形態では、ＴＴＹキャラクタ端末上での表示を例に説明する。 1. Capability to understand the overall structure of threads within a narrow screen:
By reducing the thread tree, the entire tree structure can be displayed in a screen of a certain size. In the second embodiment, display on a TTY character terminal will be described as an example.

キャラクタ端末では、１行に１ノードが表示されるため、ｎ行の画面内にはｎ個のノードを描画することができる。描画されるノードの選択基準としては、以下のものがある。
・そのノードを参照している子ノードの個数。 Since one node is displayed in one line on the character terminal, n nodes can be drawn on the screen of n lines. The selection criteria for nodes to be drawn include the following.
The number of child nodes that refer to that node.

・検索結果として得られる、そのノードを参照している子ノードの個数。
・ルートノード又は親ノードと異なるタイトルを持つノード。
２．スレッド内の話題の進行の推測機能：
「質問−答−お礼」といった特定の会話パターンが検出され、その情報が表示・検索に使用されることにより、効率の良い情報アクセスが可能となる。 -The number of child nodes referring to the node obtained as a search result.
A node with a different title from the root node or parent node.
2. Ability to guess the progress of topics in a thread:
A specific conversation pattern such as "question-answer-thank" is detected, and the information is used for display / search, thereby enabling efficient information access.

より具体的には、第２の実施の形態では、文書の属性情報（タイトル、作者、参照関係）と、文書の内容を特徴づける特定の文章パターンが推測されることにより、スレッド内の話題パターンが抽出される。 More specifically, in the second embodiment, a topic pattern in a thread is estimated by inferring document attribute information (title, author, reference relationship) and a specific sentence pattern characterizing the contents of the document. Is extracted.

３．利用者の発言パターンの視覚化：
ネットワークニュースでは、読んでいる人に対して発言する人の割合は非常に少ない。大きなスレッドであっても実は数人の人が論争しているだけという場合も少なくない。また、特定のニュースグループにおいて、有用な情報を発信する人が決まっている場合も多い。そこで、第２の実施の形態では、記事を投稿する利用者の観点から、ニュースやスレッドが整理されることにより、新たなビューが提供される。 3. Visualizing user speech patterns:
In network news, the percentage of people who speak is very small. In many cases, even with a large thread, only a few people are arguing. Moreover, there are many cases where a person who sends useful information is determined in a specific news group. Therefore, in the second embodiment, a new view is provided by organizing news and threads from the viewpoint of a user who posts an article.

より具体的には、第２の実施の形態では、ニュースグループ内の投稿履歴と、前述の話題推測機能に基づいて、利用者を観点とするビューが提供される。
〔本発明の第２の実施の形態の全体構成〕
図３５は、本発明の第２の実施の形態の構成図である。 More specifically, in the second embodiment, a view from the viewpoint of the user is provided based on the posting history in the news group and the topic estimation function described above.
[Overall Configuration of Second Embodiment of the Present Invention]
FIG. 35 is a block diagram of the second embodiment of the present invention.

まず、検索フェーズの前の準備フェーズにおいては、以下の動作が実行される。
処理装置３５０１内の文書取得部３５０２は、ネットワークを通じて、参照関係のある文書群を取得し、二次記憶装置３５０３内に格納する。 First, in the preparation phase before the search phase, the following operations are executed.
A document acquisition unit 3502 in the processing device 3501 acquires a document group having a reference relationship through the network, and stores the document group in the secondary storage device 3503.

内容推定部３５０４は、二次記憶装置３５０３に格納されている文書群の文書内容、文書付随情報、文書間の参照関係に基づいて、表示用インデックス３５０５を作成する。
検索エンジン３５０６は、二次記憶装置３５０３に格納されている文書群の文書内容に基づいて、検索用インデックス３５０７を作成する。 The content estimation unit 3504 creates a display index 3505 based on the document content of the document group stored in the secondary storage device 3503, the document accompanying information, and the reference relationship between documents.
The search engine 3506 creates a search index 3507 based on the document contents of the document group stored in the secondary storage device 3503.

例えばネットワークニュースサービスでは、文書が随時投稿されてゆく。そのため、上記の準備フェーズは、例えば一日に一度のように定期的に実行され、二次記憶装置３５０３には、常に最新の文書群が格納される。 For example, in a network news service, documents are posted at any time. Therefore, the above preparation phase is periodically executed, for example, once a day, and the secondary storage device 3503 always stores the latest document group.

検索フェーズの実行時には、以下の動作が実行される。
利用者は、入力装置３５０９から入力指示を行う。入力される情報には、検索キーワードと、検索結果を表示させるためのビューの種類、ビューの表示領域の大きさが含まれる。 When the search phase is executed, the following operations are executed.
The user gives an input instruction from the input device 3509. The input information includes the search keyword, the type of view for displaying the search result, and the size of the display area of the view.

ビュー生成部３５０８は、入力装置３５０９からの入力指示に基づいて、検索エンジン３５０６を呼び出し、それに対して二次記憶装置３５０３に格納されている文書群の中から上記入力指示に対応する文書群を検索させる。 The view generation unit 3508 calls the search engine 3506 based on the input instruction from the input device 3509, and selects a document group corresponding to the input instruction from the document group stored in the secondary storage device 3503. Search.

ビュー生成部３５０８は、表示用インデックス３５０５を利用して、検索エンジン３５０６が検索した文書群を表示するための結果ビューを作成し、それを表示装置３５１０に出力する。この場合に、後述するスレッド木の縮退処理が実行される。 The view generation unit 3508 uses the display index 3505 to create a result view for displaying the document group searched by the search engine 3506, and outputs it to the display device 3510. In this case, a thread tree degeneration process to be described later is executed.

以上の動作は、利用者との間の対話処理に基づいて実行される。つまり、利用者は、結果表示を見て、検索キーワードを追加又は変更し、或いは、結果ビューを切り替える。
〔表示用インデックス３５０５の構造〕
第２の実施の形態において、検索前の準備フェーズでは、以下の種類のインデックスが作成される。 The above operation is executed based on a dialogue process with the user. That is, the user sees the result display, adds or changes a search keyword, or switches the result view.
[Structure of display index 3505]
In the second embodiment, the following types of indexes are created in the pre-search preparation phase.

１．ユーザインデックス：
このインデックスは、ユーザの管理を行うためのインデックスであり、図３６に示されるように、エントリ毎に、下記情報を保持する。 1. User index:
This index is an index for managing users, and holds the following information for each entry as shown in FIG.

・ユーザＩＤ（ＵｓｅｒＩＤ）：そのエントリに対応するユーザのＩＤ（キー）である。
・名前：そのエントリに対応するユーザの名前である。 User ID (UserID): ID (key) of the user corresponding to the entry.
Name: The name of the user corresponding to the entry.

・略称：そのエントリに対応するユーザの略称である。
・発言数（回答数）：そのエントリに対応するユーザの、会議室内における発言の総数と、Ｑ and Ａパターンにおける回答文書の数である。 Abbreviated name: Abbreviated name of the user corresponding to the entry.
Number of utterances (number of answers): The total number of utterances in the conference room and the number of answer documents in the Q and A pattern of the user corresponding to the entry.

２．文書インデックス：
このインデックスは、文書毎の情報管理を行うためのインデックスであり、図３７に示されるように、エントリ毎に、下記情報を保持する。 2. Document index:
This index is an index for performing information management for each document, and holds the following information for each entry as shown in FIG.

・文書ＩＤ：そのエントリに対応する文書のＩＤ（キー）である。
・ユーザＩＤ（ＵｓｅｒＩＤ）：そのエントリに対応する文書を作成したユーザのＩＤ（キー）である。 Document ID: ID (key) of the document corresponding to the entry.
User ID (UserID): ID (key) of the user who created the document corresponding to the entry.

・タイトル：そのエントリに対応する文書のタイトルである。
・日付：そのエントリに対応する文書の作成日である。
・参照子孫数：そのエントリに対応する文書を参照する文書の総数である。 Title: The title of the document corresponding to the entry.
Date: The date of creation of the document corresponding to the entry.
Reference number of descendants: The total number of documents that refer to the document corresponding to the entry.

・ルートまでのパス：そのエントリに対応する文書が参照する先頭記事からその文書までのパスである。
・タイトルの識別番号：そのエントリに対応する文書のタイトルが、その文書が含まれるスレッド（文書群）中の何番目のタイトルであるかを示す番号である。 Path to root: A path from the first article referenced by the document corresponding to the entry to the document.
Title identification number: This is a number indicating the number of the title in the thread (document group) in which the document corresponding to the entry is included.

・記事種別：そのエントリに対応する文書が、Ｑand Ａパターンに含まれる場合に、その文書がＱ（質問）文書、Ａ（答）文書、又はＴ（お礼）文書の何れにあたるかを示す情報である。 Article type: When the document corresponding to the entry is included in the Qand A pattern, information indicating whether the document corresponds to a Q (question) document, an A (answer) document, or a T (thank you) document is there.

３．スレッドインデックス：
このインデックスは、スレッド毎の情報管理を行うためのインデックスであり、図３８に示されるように、エントリ毎に、下記情報を保持する。 3. Thread index:
This index is an index for managing information for each thread, and holds the following information for each entry, as shown in FIG.

・スレッドＩＤ：そのエントリに対応するスレッドのＩＤ（キー）である。
・スレッドの木構造：そのエントリに対応するスレッド内の文書の参照関係を文書ＩＤのリストで表現したものである。 Thread ID: ID (key) of the thread corresponding to the entry.
Thread tree structure: A reference relation of documents in a thread corresponding to the entry is expressed by a list of document IDs.

・文書数：そのエントリに対応するスレッド内の文書の総数である。
・作者数：そのエントリに対応するスレッド内の文書の作者の数である。
・最多発言ＵＩＤ：そのエントリに対応するスレッド内で最も多く発言した作者のユーザＩＤである。 Document number: the total number of documents in the thread corresponding to the entry.
Number of authors: The number of authors of the document in the thread corresponding to the entry.
-Most utterance UID: The user ID of the author who made the most utterance in the thread corresponding to the entry.

・内容リスト：そのエントリに対応するスレッドに含まれるＱ and Ａパターン、論争（Ｄｉｓｃｕｓｓｉｏｎ）パターン、又は雑談（Ｃｈａｔ）パターンのパターンＩＤのリストである。Ｑ and ＡパターンのパターンＩＤであるＱＡ＿ＩＤは、後述するＱＡインデックス内のいずれかのエントリに登録されている。ＤｉｓｃｕｓｓｉｏｎパターンのパターンＩＤであるＤＳ−ＩＤは、後述するＤＩＳＣＵＳＳインデックス内のいずれかのエントリに登録されている。ＣｈａｔパターンのパターンＩＤであるＣＴ＿ＩＤは、後述するＣＨＡＴインデックス内のいずれかのエントリに登録されている。 Content list: A list of pattern IDs of Q and A patterns, discussion patterns, or chat patterns included in the thread corresponding to the entry. QA_ID, which is the pattern ID of the Q and A pattern, is registered in any entry in the QA index described later. The DS-ID that is the pattern ID of the Discussion pattern is registered in one of the entries in the DISCUSS index described later. CT_ID which is the pattern ID of the Chat pattern is registered in one of the entries in the CHAT index described later.

４．ＱＡインデックス：
このインデックスは、Ｑ and Ａパターンの情報管理を行うためのインデックスであり、図３９に示されるように、エントリ毎に、下記情報を保持する。 4). QA index:
This index is an index for performing Q and A pattern information management, and holds the following information for each entry as shown in FIG.

・ＱＡ＿ＩＤ：そのエントリに対応するＱ and ＡパターンのＩＤ（キー）である。
・Ｑｕｅｓｔｉｏｎ：そのエントリに対応するＱand Ａパターンを構成するＱ（質問）文書に対応する文書ＩＤを格納するフィールドである。 QA_ID: ID (key) of the Q and A pattern corresponding to the entry.
Question: A field for storing a document ID corresponding to a Q (question) document constituting a Qand A pattern corresponding to the entry.

・Ａｎｓｗｅｒ：そのエントリに対応するＱand Ａパターンを構成するＡ（答え）文書群に対応する文書ＩＤ列を格納するフィールドである。
・Ｔｈａｎｋｓ：そのエントリに対応するＱand Ａパターンを構成するＴ（お礼）文書に対応する文書ＩＤを格納するフィールドである。 Answer: A field for storing a document ID string corresponding to an A (answer) document group constituting a Qand A pattern corresponding to the entry.
Tanks: a field for storing a document ID corresponding to a T (thank you) document constituting a Qand A pattern corresponding to the entry.

・ＭａｘＡｎｓｗｅｒＵＩＤ：そのエントリに対応するＱ and Ａパターンを構成する各Ａ（答え）文書の作者、すなわち、そのエントリ内の「Ａｎｓｗｅｒ」フィールドに登録されている文書ＩＤ列中の各文書ＩＤに対応する文書の作者うち、もっとも登場回数が多い人（又は人達）のユーザＩＤ（又はユーザＩＤ列）を格納するフィールドである。 MaxAnswerUID: corresponding to each document ID in the document ID column registered in the “Answer” field in the entry, that is, the author of each A (answer) document constituting the Q and A pattern corresponding to the entry This is a field for storing the user ID (or user ID string) of the person (or people) who appears most frequently among the authors of the document.

・ＴｈｒｅａｄＩＤ：そのエントリに対応するＱand Ａパターンが存在するスレッドのＩＤである。このスレッドＩＤは、スレッドインデックス内のいずれかのエントリに登録されている。 ThreadID: ID of a thread in which a Qand A pattern corresponding to the entry exists. This thread ID is registered in any entry in the thread index.

５．ＤＩＳＣＵＳＳインデックス：
このインデックスは、Ｄｉｓｃｕｓｓｉｏｎパターンの情報管理を行うためのインデックスであり、図４０に示されるように、エントリ毎に、下記情報を保持する。 5. DISCUSS index:
This index is an index for managing information of the Discusion pattern, and holds the following information for each entry as shown in FIG.

・ＤＳ＿ＩＤ：そのエントリに対応するＤｉｓｃｕｓｓｉｏｎパターンのＩＤ（キー）である。
・記事ＩＤリスト：そのエントリに対応するＤｉｓｃｕｓｓｉｏｎパターンを構成する文書群の文書ＩＤ列を格納するフィールドである。 DS_ID: ID (key) of the Discussion pattern corresponding to the entry.
Article ID list: a field for storing a document ID string of a document group constituting a Discusion pattern corresponding to the entry.

・ＵＩＤ：そのエントリに対応するＤｉｓｃｕｓｓｉｏｎパターンを構成する文書群のユーザＩＤ列である。
・ＴｈｒｅａｄＩＤ：そのエントリに対応するＤｉｓｃｕｓｓｉｏｎパターンが存在するスレッドのＩＤである。このスレッドＩＤは、スレッドインデックス内のいずれかのエントリに登録されている。 UID: a user ID column of a document group that constitutes a Discussion pattern corresponding to the entry.
ThreadID: ID of a thread in which a Discusion pattern corresponding to the entry exists. This thread ID is registered in any entry in the thread index.

６．ＣＨＡＴインデックス：
このインデックスは、Ｃｈａｔパターンの情報管理を行うためのインデックスであり、図４１に示されるように、エントリ毎に、下記情報を保持する。 6). CHAT index:
This index is an index for performing information management of the Chat pattern, and holds the following information for each entry as shown in FIG.

・ＣＴ＿ＩＤ：そのエントリに対応するＣｈａｔパターンのＩＤ（キー）である。
・Ｃｈａｔリスト：そのエントリに対応するＣｈａｔパターンを構成する文書群の文書ＩＤ列を格納するフィールドである。 CT_ID: ID (key) of the Chat pattern corresponding to the entry.
Chat list: A field for storing a document ID string of a document group constituting a Chat pattern corresponding to the entry.

・ＴｈｒｅａｄＩＤ：そのエントリに対応するＣｈａｔパターンが存在するスレッドのＩＤである。このスレッドＩＤは、スレッドインデックス内のいずれかのエントリに登録されている。 ThreadID: ID of a thread in which a Chat pattern corresponding to the entry exists. This thread ID is registered in any entry in the thread index.

〔内容推定部３５０４の構成及び動作〕
図３５に示される第２の実施の形態における内容推定部３５０４の動作について、以下に詳細に説明する。 [Configuration and operation of content estimation unit 3504]
The operation of the content estimation unit 3504 in the second embodiment shown in FIG. 35 will be described in detail below.

前述したように、内容推定部３５０４は、二次記憶装置３５０３に格納されている文書群の文書内容、文書付随情報、文書間の参照関係に基づいて、表示用インデックス３５０５であるユーザインデックス、文書インデックス、スレッドインデックス、ＱＡインデックス、ＤＩＳＣＵＳＳインデックス、及びＣＨＡＴインデックスを作成する。 As described above, the content estimation unit 3504 is based on the document content of the document group stored in the secondary storage device 3503, the document accompanying information, and the reference relationship between documents, and the user index and document that are the display index 3505. Create an index, thread index, QA index, DISCUSS index, and CHAT index.

図４２は、内容推定部３５０４が実行する動作を示す動作フローチャートである。
まず、図３７に示されるデータ構成を有する文書インデックスと図３８に示されるデータ構成を有するスレッドインデックスが作成される（ステップ４２０１）。これらの詳細は省略するが、基本的に、前述した図８及び図９に示される動作フローチャートと同様の動作によって実現できる。この場合には、前述したメタインデックスが文書インデックスに対応する。このとき同時に、各文書中に現れる作成ユーザ名とユーザＩＤ、略称、及び発言数（回答数）を対応づけるための図３６に示されるデータ構成を有するユーザインデックスも作成される。 FIG. 42 is an operation flowchart showing an operation executed by the content estimation unit 3504.
First, a document index having the data structure shown in FIG. 37 and a thread index having the data structure shown in FIG. 38 are created (step 4201). Although these details are omitted, it can be basically realized by an operation similar to the operation flowchart shown in FIGS. 8 and 9 described above. In this case, the above-described meta index corresponds to the document index. At the same time, a user index having a data structure shown in FIG. 36 for associating a created user name appearing in each document with a user ID, an abbreviation, and the number of utterances (number of answers) is also created.

次に、スレッドインデックス内の各エントリが参照されることにより、各エントリに対応するスレッド文書群が読み込まれ（ステップ４２０２）、全てのエントリに対するスレッド文書群の処理が終了したと判定されるまで（ステップ４２０６）、読み込まれたスレッド文書群毎に、Ｑ and Ａパターンの判定処理（ステップ４２０３）、Ｄｉｓｃｕｓｓｉｏｎパターンの判定処理（ステップ４２０４）、及びＣｈａｔパターンの判定処理（ステップ４２０５）が実行される。 Next, by referring to each entry in the thread index, a thread document group corresponding to each entry is read (step 4202) until it is determined that the processing of the thread document group for all the entries has been completed ( Step 4206), Q and A pattern determination processing (Step 4203), Discussion pattern determination processing (Step 4204), and Chat pattern determination processing (Step 4205) are executed for each read thread document group.

図４３は、図４２のステップ４２０３のＱ and Ａパターンの判定処理の動作フローチャートである。この動作フローチャートでは、スレッド文書群内の各参照パス毎に、Ｑ and Ａパターンが推測される。 FIG. 43 is an operation flowchart of the Q and A pattern determination processing in step 4203 of FIG. In this operation flowchart, a Q and A pattern is estimated for each reference path in the thread document group.

まず、スレッドインデックス内の該当エントリの「スレッドの木構造」フィールドが参照されることによって、リーフ文書（パスの末端の文書）に対応する文書ＩＤが１つ選択される（ステップ４３０１）。 First, by referring to the “thread tree structure” field of the corresponding entry in the thread index, one document ID corresponding to the leaf document (the document at the end of the path) is selected (step 4301).

次に、文書インデックスにおいて、ステップ４３０１で選択された文書ＩＤを「文書ＩＤ」フィールドに含むエントリ内の「ルートまでのパス」フィールドから、下記条件を満たす文書ＩＤが検索される（ステップ４３０２）。 Next, in the document index, a document ID satisfying the following conditions is searched from the “path to root” field in the entry including the document ID selected in step 4301 in the “document ID” field (step 4302).

（条件）文書インデックスにおいて、その文書ＩＤを「文書ＩＤ」フィールドに含むエントリ内の「記事種別」フィールドが未登録である。
続いて、上記条件を満たす文書ＩＤが見つかったか否かが判定される（ステップ４３０３）。 (Condition) In the document index, the “article type” field in the entry including the document ID in the “document ID” field is not registered.
Subsequently, it is determined whether or not a document ID satisfying the above conditions is found (step 4303).

上記条件を満たす文書ＩＤ（以下、処理文書ＩＤという）が見つかりステップ４３０３の判定がＹＥＳとなった場合には、その処理文書ＩＤに対応する文書が二次記憶装置３５０３（図３５）から読み出され、その文書中に、図４４に示されるような、センテンスパターンが存在するか否かが判定される（ステップ４３０４）。 If a document ID satisfying the above conditions (hereinafter referred to as a processing document ID) is found and the determination in step 4303 is YES, a document corresponding to the processing document ID is read from the secondary storage device 3503 (FIG. 35). Then, it is determined whether or not a sentence pattern as shown in FIG. 44 exists in the document (step 4304).

ステップ４３０４の判定がＮＯならば、ステップ４３０８にジャンプする。
ステップ４３０４の判定がＹＥＳならば、文書インデックスのステップ４３０２で参照されたエントリ内の「ルートまでのパス」フィールドに登録されている文書ＩＤのうち、下記条件を満たす文書ＩＤが存在するか否かが判定される（ステップ４３０５）。 If the determination in step 4304 is no, the process jumps to step 4308.
If the determination in step 4304 is YES, whether or not there is a document ID satisfying the following condition among the document IDs registered in the “path to root” field in the entry referenced in step 4302 of the document index. Is determined (step 4305).

（条件）その文書ＩＤに対応する文書は、処理文書ＩＤの作者によって作成されたものであって、かつその文書ＩＤは、図３９に示されるデータ構成を有するＱＡインデックス内のいずれかのエントリ内の「Ｔｈａｎｋｓ」フィールドに登録されている。 (Condition) The document corresponding to the document ID is created by the author of the processing document ID, and the document ID is in any of the entries in the QA index having the data structure shown in FIG. In the “Tanks” field.

ステップ４３０５の判定がＹＥＳなら、ステップ４３０５で参照されたＱＡインデックス内のエントリの「Ｑｕｅｓｔｉｏｎ」フィールドに、処理文書ＩＤが追加される。また、図３７に示されるデータ構成を有する文書インデックスにおいて、処理文書ＩＤを「文書ＩＤ」フィールドに含むエントリ内の「記事種別」フィールドに、記号「Ｑ」が追加される（ステップ４３０６）。 If the determination in step 4305 is YES, the processing document ID is added to the “Question” field of the entry in the QA index referenced in step 4305. Also, in the document index having the data structure shown in FIG. 37, the symbol “Q” is added to the “article type” field in the entry including the processed document ID in the “document ID” field (step 4306).

更に、文書インデックスのステップ４３０２で参照されたエントリ内の「ルートまでのパス」フィールドに登録されている文書ＩＤ群のうち、ステップ４３０５で参照されたＱＡインデックス内のエントリの「Ｑｕｅｓｔｉｏｎ」フィールドに登録された処理文書ＩＤとそのエントリの「Ｔｈａｎｋｓ」フィールドに登録された文書ＩＤに挟まれた文書ＩＤ群が、そのエントリの「Ａｎｓｗｅｒ」フィールドに追加される。また、図３７に示されるデータ構成を有する文書インデックスにおいて、上記登録が行われた各文書ＩＤを各「文書ＩＤ」フィールドに含む各エントリ内の「記事種別」フィールドに、それぞれ記号「Ａ」が追加される（ステップ４３０６）。 Further, among the document ID groups registered in the “path to root” field in the entry referenced in step 4302 of the document index, registration is performed in the “Question” field of the entry in the QA index referenced in step 4305. A group of document IDs sandwiched between the processed document ID and the document ID registered in the “Thanks” field of the entry is added to the “Answer” field of the entry. Also, in the document index having the data structure shown in FIG. 37, the symbol “A” is respectively displayed in the “article type” field in each entry including each registered document ID in each “document ID” field. It is added (step 4306).

一方、ステップ４３０６の判定がＮＯなら、ＱＡインデックスにおいて、｛（そのインデックス内のＱＡ＿ＩＤの最大値）＋１｝の値を「ＱＡ＿ＩＤ」フィールドの値として有するエントリが作成され、そのエントリ内の「Ｑｕｅｓｔｉｏｎ」フィールドに、処理文書ＩＤが登録される。また、図３７に示されるデータ構成を有する文書インデックスにおいて、処理文書ＩＤを「文書ＩＤ」フィールドに含むエントリ内の「記事種別」フィールドに、記号「Ｑ」が登録される（ステップ４３０７）。 On the other hand, if the determination in step 4306 is NO, in the QA index, an entry having a value of {(maximum value of QA_ID in the index) +1} as a value of the “QA_ID” field is created, and “Question” in the entry is created. The processing document ID is registered in the field. Also, in the document index having the data structure shown in FIG. 37, the symbol “Q” is registered in the “article type” field in the entry including the processed document ID in the “document ID” field (step 4307).

上記ステップ４３０６又は４３０７の処理の後、又はステップ４３０４の判定がＮＯとなった場合には、二次記憶装置３５０３から読み出されている処理文書ＩＤに対応する文書中に、図４５に示されるような、センテンスパターンが存在するか否かが判定される（ステップ４３０８）。 FIG. 45 shows the document corresponding to the processing document ID read from the secondary storage device 3503 after the processing at step 4306 or 4307 or when the determination at step 4304 is NO. It is determined whether or not such a sentence pattern exists (step 4308).

ステップ４３０８の判定がＮＯならば、ステップ４３０２に戻る。
ステップ４３０８の判定がＹＥＳならば、文書インデックスのステップ４３０２で参照されたエントリ内の「ルートまでのパス」フィールドに登録されている文書ＩＤのうち、下記条件を満たす文書ＩＤが存在するか否かが判定される（ステップ４３０９）。 If the determination in step 4308 is no, the process returns to step 4302.
If the determination in step 4308 is YES, whether or not there is a document ID satisfying the following conditions among the document IDs registered in the “path to root” field in the entry referenced in step 4302 of the document index. Is determined (step 4309).

（条件）その文書ＩＤに対応する文書は、処理文書ＩＤの作者によって作成されたものであって、かつその文書ＩＤは、図３９に示されるデータ構成を有するＱＡインデックス内のいずれかのエントリ内の「Ｑｕｅｓｔｉｏｎ」フィールドに登録されている。 (Condition) The document corresponding to the document ID is created by the author of the processing document ID, and the document ID is in any of the entries in the QA index having the data structure shown in FIG. In the “Question” field.

ステップ４３０９の判定がＹＥＳなら、ステップ４３０９で参照されたＱＡインデックス内のエントリの「Ｔｈａｎｋｓ」フィールドに、処理文書ＩＤが追加される。また、図３７に示されるデータ構成を有する文書インデックスにおいて、処理文書ＩＤを「文書ＩＤ」フィールドに含むエントリ内の「記事種別」フィールドに、記号「Ｔ」が追加される（ステップ４３１０）。 If the determination in step 4309 is YES, the processing document ID is added to the “Thanks” field of the entry in the QA index referenced in step 4309. Also, in the document index having the data structure shown in FIG. 37, the symbol “T” is added to the “article type” field in the entry including the processed document ID in the “document ID” field (step 4310).

更に、文書インデックスのステップ４３０２で参照されたエントリ内の「ルートまでのパス」フィールドに登録されている文書ＩＤ群のうち、ステップ４３０９で参照されたＱＡインデックス内のエントリの「Ｔｈａｎｋｓ」フィールドに登録された処理文書ＩＤとそのエントリの「Ｑｕｅｓｔｉｏｎ」フィールドに登録された文書ＩＤに挟まれた文書ＩＤ群が、そのエントリの「Ａｎｓｗｅｒ」フィールドに追加される。また、図３７に示されるデータ構成を有する文書インデックスにおいて、上記登録が行われた各文書ＩＤを各「文書ＩＤ」フィールドに含む各エントリ内の「記事種別」フィールドに、それぞれ記号「Ａ」が追加される（ステップ４３１０）。 Further, among the document ID groups registered in the “path to root” field in the entry referenced in step 4302 of the document index, registration is performed in the “Thanks” field of the entry in the QA index referenced in step 4309. A group of document IDs sandwiched between the processed document ID and the document ID registered in the “Question” field of the entry is added to the “Answer” field of the entry. Also, in the document index having the data structure shown in FIG. 37, the symbol “A” is respectively displayed in the “article type” field in each entry including each registered document ID in each “document ID” field. It is added (step 4310).

一方、ステップ４３０９の判定がＮＯなら、ＱＡインデックスにおいて、｛（そのインデックス内のＱＡ＿ＩＤの最大値）＋１｝の値を「ＱＡ＿ＩＤ」フィールドの値として有するエントリが作成され、そのエントリ内の「Ｔｈａｎｋｓ」フィールドに、処理文書ＩＤが登録される。また、図３７に示されるデータ構成を有する文書インデックスにおいて、処理文書ＩＤを「文書ＩＤ」フィールドに含むエントリ内の「記事種別」フィールドに、記号「Ｔ」が登録される（ステップ４３１１）。 On the other hand, if the determination in step 4309 is NO, in the QA index, an entry having a value of {(maximum value of QA_ID in the index) +1} as a value of the “QA_ID” field is created, and “Tanks” in the entry is created. The processing document ID is registered in the field. Also, in the document index having the data structure shown in FIG. 37, the symbol “T” is registered in the “article type” field in the entry including the processed document ID in the “document ID” field (step 4311).

上記ステップ４３１０又は４３１１の処理の後、ステップ４３０２に戻り、次の文書ＩＤの検索が実行される。
上記ステップ４３０２〜４３１１の処理が繰り返された結果、ステップ４３０３で、ステップ４３０２における条件を満たす文書ＩＤが見つからなかったと判定された場合には、スレッドインデックス内の現在処理中のエントリの「スレッドの木構造」フィールドが参照されることによって、全てのリーフ文書に対応する文書ＩＤに対する処理が試行されたか否かが判定される（ステップ４３１２）。 After the processing of step 4310 or 4311, the process returns to step 4302 to search for the next document ID.
If it is determined in step 4303 that the document ID satisfying the condition in step 4302 has not been found as a result of repeating the processes in steps 4302 to 4311, the “thread tree” of the entry currently being processed in the thread index is displayed. By referring to the “structure” field, it is determined whether or not the processing for the document IDs corresponding to all the leaf documents has been attempted (step 4312).

全てのリーフ文書に対応する文書ＩＤに対する処理が試行されてはおらずステップ４３１２の判定がＮＯの場合には、ステップ４３０１に戻り、次のパスに対応する話題パターンの推測処理が繰り返される。 If processing for the document IDs corresponding to all leaf documents has not been attempted and the determination in step 4312 is NO, the process returns to step 4301, and the topic pattern inference process corresponding to the next path is repeated.

全てのリーフ文書に対応する文書ＩＤに対する処理が試行されステップ４３１２の判定がＹＥＳとなった場合には、図４２のステップ４２０３のＱ and Ａパターンの判定処理を終了する。 If processing for document IDs corresponding to all leaf documents is attempted and the determination in step 4312 is YES, the Q and A pattern determination processing in step 4203 of FIG. 42 ends.

図４６及び図４７に、上述のＱ and Ａパターンの判定処理によって抽出されるスレッド構造とそれに対応する文書群の例を示す。
なお、文書インデックスの「記事種別」フィールドに記号「Ａ」が付与されたエントリの文書ＩＤに対応する文書の作者について、それに対応するユーザインデックス（図３６参照）のエントリが参照され、そのエントリ内の「発言数（回答数）」フィールドの内容が更新される。 FIG. 46 and FIG. 47 show examples of the thread structure extracted by the above-described Q and A pattern determination processing and the corresponding document group.
Note that the entry of the user index (see FIG. 36) corresponding to the author of the document corresponding to the document ID of the entry assigned the symbol “A” in the “article type” field of the document index is referred to. The content of the “Number of utterances (number of responses)” field is updated.

図４８は、図４２のステップ４２０４のＤｉｓｃｕｓｓｉｏｎパターンの判定処理の動作フローチャートである。この動作フローチャートでは、スレッド文書群内の各参照パス毎に、Ｄｉｓｃｕｓｓｉｏｎパターンが推測される。 FIG. 48 is an operation flowchart of the determination process of the Discussion pattern in step 4204 of FIG. In this operation flowchart, a Discusion pattern is estimated for each reference path in the thread document group.

まず、スレッドインデックス内の該当エントリの「スレッドの木構造」フィールドが参照されることによって、リーフ文書（パスの末端の文書）に対応する文書ＩＤが検索される（ステップ４８０１）。 First, the document ID corresponding to the leaf document (the document at the end of the path) is searched by referring to the “thread tree structure” field of the corresponding entry in the thread index (step 4801).

次に、上記検索の結果、全てのリーフ文書に対応する文書ＩＤに対する処理が試行されたか否かが判定される（ステップ４８０２）。
全てのリーフ文書に対応する文書ＩＤに対する処理が試行されてはおらずステップ４８０２の判定がＮＯの場合には、文書インデックスにおいて、ステップ４８０１で検索された文書ＩＤを「文書ＩＤ」フィールドに含むエントリ内の「ルートまでのパス」フィールドが参照され、上記リーフ文書に対応する文書ＩＤからルート文書までの長さ（文書ＩＤの数）が６以上であるか否かが判定される（ステップ４８０３）。 Next, as a result of the search, it is determined whether or not processing for document IDs corresponding to all leaf documents has been attempted (step 4802).
If processing for document IDs corresponding to all leaf documents has not been attempted and the determination in step 4802 is NO, in the document index, in the entry including the document ID searched in step 4801 in the “document ID” field. The “path to root” field is referred to, and it is determined whether or not the length (number of document IDs) from the document ID corresponding to the leaf document to the root document is 6 or more (step 4803).

上記長さが６以上ではなくステップ４８０３の判定がＮＯの場合には、その参照パスの話題パターンはＤｉｓｃｕｓｓｉｏｎパターンではないと推測され、ステップ４８０１に戻って次のリーフ文書に対する処理が実行される。 If the length is not 6 or more and the determination in step 4803 is NO, it is inferred that the topic pattern of the reference path is not a Discussion pattern, and the process returns to step 4801 to execute processing for the next leaf document.

上記長さが６以上であってステップ４８０３の判定がＹＥＳの場合には、ステップ４８０３で参照された「ルートまでのパス」フィールドに含まれる文書ＩＤ群に対応する文書群において、相異なるユーザＩＤの数がカウントされる（ステップ４８０４）。 If the length is 6 or more and the determination in step 4803 is YES, different user IDs in the document group corresponding to the document ID group included in the “path to root” field referenced in step 4803. Are counted (step 4804).

次に、｛上記「ルートまでのパス」フィールドに含まれる文書ＩＤの数（総文書数）｝に対する｛上記相異なるユーザＩＤの数｝の割合が、０．３より小さいか否かが判定される（ステップ４８０５）。 Next, it is determined whether the ratio of {the number of different user IDs} to the number of document IDs (total number of documents) included in the “path to root” field is smaller than 0.3. (Step 4805).

この判定がＮＯの場合には、特定の少数のユーザによる論争が行われてはいないと推測され、ステップ４８０１に戻って次のリーフ文書に対する処理が実行される。
一方、ステップ４８０５の判定がＹＥＳの場合には、特定の少数のユーザによる論争が行われていると推測され、図４０に示されるデータ構成を有するＤＩＳＣＵＳＳインデックスにおいて、｛（そのインデックス内のＤＳ＿ＩＤの最大値）＋１｝の値を「ＤＳ＿ＩＤ」フィールドの値として有するエントリが作成される。そして、そのエントリ内の「記事ＩＤ」フィールドに、ステップ４８０３で参照された「ルートまでのパス」フィールドに含まれる文書ＩＤ群がリストとして登録され、その登録内容に基づいて、「ＵＩＤ」フィールド及び「ＴｈｒｅａｄＩＤ」フィールドの内容が登録される。また、図３７に示されるデータ構成を有する文書インデックスにおいて、上記各文書ＩＤ群を各「文書ＩＤ」フィールドに含む各エントリ内の「記事種別」フィールドに、記号「Ｄ」が登録される（ステップ４８０６）。その後、ステップ４８０１に戻って次のリーフ文書に対する処理が実行される。 If this determination is NO, it is presumed that no disputes have been made by a specific small number of users, and the processing returns to step 4801 to execute processing for the next leaf document.
On the other hand, if the determination in step 4805 is YES, it is inferred that a dispute is being made by a specific small number of users, and in the DISCUSS index having the data structure shown in FIG. An entry having a value of (maximum value) +1} as a value of the “DS_ID” field is created. Then, in the “article ID” field in the entry, the document ID group included in the “path to route” field referenced in step 4803 is registered as a list, and based on the registered contents, the “UID” field and The contents of the “ThreadID” field are registered. Also, in the document index having the data structure shown in FIG. 37, the symbol “D” is registered in the “article type” field in each entry that includes each document ID group in each “document ID” field (step) 4806). Thereafter, the process returns to step 4801 to execute processing for the next leaf document.

全てのリーフ文書に対応する文書ＩＤに対する処理が試行されステップ４８０２の判定がＹＥＳとなった場合には、図４２のステップ４２０４のＤｉｓｃｕｓｓｉｏｎパターンの判定処理を終了する。 If processing for document IDs corresponding to all leaf documents is attempted and the determination in step 4802 is YES, the determination process for the discrimination pattern in step 4204 in FIG. 42 ends.

図４９に、上述のＤｉｓｃｕｓｓｉｏｎパターンの判定処理によって抽出されるスレッド構造の例を示す。少数のユーザの頭文字のみが多く現れていることがわかり、このスレッドにおいては論争が行われていると推測できる。 FIG. 49 shows an example of a thread structure extracted by the above-described determination process of the Discus pattern. It can be seen that only a few user acronyms are appearing, and that this thread is controversial.

図５０は、図４２のステップ４２０５のＣｈａｔパターンの判定処理の動作フローチャートである。この動作フローチャートでは、スレッド文書群内の各参照パス毎に、Ｃｈａｔパターンが推測される。 FIG. 50 is an operation flowchart of the Chat pattern determination process in step 4205 of FIG. In this operation flowchart, a Chat pattern is estimated for each reference path in the thread document group.

まず、スレッドインデックス内の該当エントリの「スレッドの木構造」フィールドが参照されることによって、リーフ文書（パスの末端の文書）に対応する文書ＩＤが検索される（ステップ５００１）。 First, the document ID corresponding to the leaf document (the document at the end of the path) is searched by referring to the “thread tree structure” field of the corresponding entry in the thread index (step 5001).

次に、上記検索の結果、全てのリーフ文書に対応する文書ＩＤに対する処理が試行されたか否かが判定される（ステップ５００２）。
全てのリーフ文書に対応する文書ＩＤに対する処理が試行されてはおらずステップ５００２の判定がＮＯの場合には、文書インデックスにおいて、ステップ５００１で検索された文書ＩＤを「文書ＩＤ」フィールドに含むエントリ内の「ルートまでのパス」フィールドが参照され、上記リーフ文書に対応する文書ＩＤからルート文書までの長さ（文書ＩＤの数）が６以上であるか否かが判定される（ステップ５００３）。 Next, as a result of the search, it is determined whether or not processing for document IDs corresponding to all leaf documents has been attempted (step 5002).
If processing for document IDs corresponding to all leaf documents has not been attempted and the determination in step 5002 is NO, in the document index, in the entry including the document ID searched in step 5001 in the “document ID” field. The “path to root” field is referred to, and it is determined whether or not the length (number of document IDs) from the document ID corresponding to the leaf document to the root document is 6 or more (step 5003).

上記長さが６以上ではなくステップ５００３の判定がＮＯの場合には、その参照パスの話題パターンはＣｈａｔパターンではないと推測され、ステップ５００１に戻って次のリーフ文書に対する処理が実行される。 If the length is not 6 or more and the determination in step 5003 is NO, it is estimated that the topic pattern of the reference path is not a Chat pattern, and the process returns to step 5001 to execute processing for the next leaf document.

上記長さが６以上であってステップ５００３の判定がＹＥＳの場合には、ステップ５００３で参照された「ルートまでのパス」フィールドに含まれる文書ＩＤ群に対応する文書群において、相異なるユーザＩＤの数がカウントされる（ステップ５００４）。 If the length is 6 or more and the determination in step 5003 is YES, different user IDs in the document group corresponding to the document ID group included in the “path to root” field referenced in step 5003. Are counted (step 5004).

次に、｛上記「ルートまでのパス」フィールドに含まれる文書ＩＤの数（総文書数）｝に対する｛上記相異なるユーザＩＤの数｝の割合が、０．６より大きいか否かが判定される（ステップ５００５）。 Next, it is determined whether the ratio of {the number of different user IDs} to the number of document IDs (total number of documents) included in the “path to root” field is greater than 0.6. (Step 5005).

この判定がＮＯの場合には、多数のユーザによる雑談（チャット）が行われてはいないと推測され、ステップ５００１に戻って次のリーフ文書に対する処理が実行される。
一方、ステップ５００５の判定がＹＥＳの場合には、多数のユーザによる雑談が行われていると推測され、図４１に示されるデータ構成を有するＣＨＡＴインデックスにおいて、｛（そのインデックス内のＣＴ＿ＩＤの最大値）＋１｝の値を「ＣＴ＿ＩＤ」フィールドの値として有するエントリが作成される。そして、そのエントリ内の「Ｃｈａｔリスト」フィールドに、ステップ５００３で参照された「ルートまでのパス」フィールドに含まれる文書ＩＤ群がリストとして登録され、その登録内容に基づいて、「ＵＩＤ」フィールド及び「ＴｈｒｅａｄＩＤ」フィールドの内容が登録される（ステップ４８０６）。その後、ステップ５００１に戻って次のリーフ文書に対する処理が実行される。 If this determination is NO, it is estimated that chatting (chatting) by a large number of users has not been performed, and the process returns to step 5001 to execute processing for the next leaf document.
On the other hand, if the determination in step 5005 is YES, it is presumed that there is a chat by a large number of users, and {(maximum value of CT_ID in the index) in the CHAT index having the data configuration shown in FIG. ) +1} as the value of the “CT_ID” field is created. Then, in the “Chat list” field in the entry, the document ID group included in the “path to root” field referred to in step 5003 is registered as a list. Based on the registered contents, the “UID” field and The contents of the “ThreadID” field are registered (step 4806). Thereafter, the process returns to step 5001 to execute processing for the next leaf document.

全てのリーフ文書に対応する文書ＩＤに対する処理が試行されステップ５００２の判定がＹＥＳとなった場合には、図４２のステップ４２０５のＣｈａｔパターンの判定処理を終了する。 If processing for document IDs corresponding to all leaf documents is attempted and the determination in step 5002 is YES, the chat pattern determination processing in step 4205 of FIG. 42 ends.

図５１に、上述のＣｈａｔパターンの判定処理によって抽出されるスレッド構造の例を示す。多数のユーザの頭文字が雑多に現れていることがわかり、このスレッドにおいては雑談が行われていると推測できる。 FIG. 51 shows an example of a thread structure extracted by the above-described Chat pattern determination process. It turns out that the acronyms of many users appear in various ways, and it can be inferred that chatting is taking place in this thread.

〔スレッド木の縮退処理の原理〕
次に、第２の実施の形態におけるスレッド木の縮退処理の原理について説明する。
図５２及び図５３は、スレッド構造の表示例を示す図である。図５２は、従来の伝統的なニュースリーダにおける表示例、図５３は、前述した図１４のスレッドビューの表示処理に基づく表示例である。 [Principle of thread tree degeneration processing]
Next, the principle of thread tree degeneration processing according to the second embodiment will be described.
52 and 53 are diagrams showing display examples of the thread structure. FIG. 52 is a display example of a conventional traditional news reader, and FIG. 53 is a display example based on the thread view display process of FIG. 14 described above.

わずか１７本の記事からなるスレッドにおいても、図５２に示されるように行数が増えたり、図５３に示されるように横方向にはみだしたりして、スレッド全体を見るのに画面のスクロールが必要となり、全体構造の把握が難しいことがわかる。 Even in a thread consisting of only 17 articles, the number of lines increases as shown in FIG. 52, or it protrudes in the horizontal direction as shown in FIG. 53, and it is necessary to scroll the screen to see the whole thread. Thus, it is difficult to understand the overall structure.

図５４は、スレッド内の上位ｎ個の子孫ノード（図５４の例ではｎ＝６）に対する表示例である。行頭の“＋”記号は子ノードが省略されていることを表し、行末のかっこ付き数字は子孫ノードの個数（図３７に示される文書インデックスにおける参照子孫数）を表す。画面やウインドウサイズに合わせて値ｎを調整することにより、必要な部分のみを表示することが可能となる。 FIG. 54 is a display example for the top n descendant nodes (n = 6 in the example of FIG. 54) in the thread. The “+” symbol at the beginning of a line indicates that a child node is omitted, and the parenthesized number at the end of the line indicates the number of descendant nodes (the number of reference descendants in the document index shown in FIG. 37). By adjusting the value n in accordance with the screen or window size, it is possible to display only necessary portions.

図５５は、子孫ノードのうち、同一タイトル（先頭の“Re: ”は除く）を持つノードが省略された表示例である。行頭の“＋”記号は子ノードが省略されていることを表す。同一スレッドの文書においては、デフォルトではタイトルは、親ノードと同じであるか、又は、最初にフォローを表す“Re: ”が付加されるかである。作者が意図的にタイトルを変えたというのは、そこで話題が変わったことを明示している。このビューにより、スレッド内にどのような話題の変化があったかを容易に把握することができる。 FIG. 55 is a display example in which nodes having the same title (excluding the first “Re:”) among the descendant nodes are omitted. A “+” symbol at the beginning of a line indicates that a child node is omitted. In a document of the same thread, by default, the title is the same as that of the parent node, or “Re:” indicating follow is added first. The author's intentional change of title clearly indicates that the topic has changed. With this view, it is possible to easily grasp what topic has changed in the thread.

図５６は、１０／１から１０／５という時間区間に作成された文書のスレッド構造の表示例である。ノードは、作者のイニシャルを表す。このビューでは、スレッドの時間的展開と一定区間内の情報だけを見ることができる。また、パソコンの画面上でスケジューラなどの時間的情報のあるアプリケーションと並べて見ることによって、自分のスケジュールや世の中の出来事と関連づけて文書情報を見ることができる。 FIG. 56 is a display example of the thread structure of a document created in the time interval from 10/1 to 10/5. The node represents the author's initials. In this view, you can see only the time evolution of threads and information within a certain interval. In addition, by viewing alongside an application with time information, such as a scheduler, on a personal computer screen, it is possible to view document information in association with one's own schedule and events in the world.

図５７は、スレッドの作者をノードとしたグラフ構造である。ノードは、作者のイニシャルである。二重丸で表されたノードは、スレッドの最初の記事の発言者を表す。リンクの濃さにより作者間のやりとりの回数が表される。更に、図３８に示されるスレッドインデックス中の該当エントリの最多発言ＵＩＤに登録されているユーザＩＤに対応するユーザのノードは、例えば強調表示される。このスレッドが、小川さん（小）とパーツィバルさん（パ）とのやりとりが中心であることが容易に理解できる。 FIG. 57 shows a graph structure in which the author of a thread is a node. The node is the author's initials. The node represented by a double circle represents the speaker of the first article in the thread. The number of interactions between authors is expressed by the strength of the link. Furthermore, the node of the user corresponding to the user ID registered in the most frequent UID of the corresponding entry in the thread index shown in FIG. 38 is highlighted, for example. It can be easily understood that this thread is centered on the exchange between Mr. Ogawa (small) and Mr. Partibar (pa).

〔検索フェーズの実行時の動作〕
次に、上述のスレッド木の縮退処理を含む検索フェーズの実行時の動作について説明する。 [Operation during search phase]
Next, an operation at the time of executing a search phase including the above-described thread tree degeneration processing will be described.

検索時には、利用者は、入力装置３５０９から検索要求を指示する。
図５８は、検索要求の入力画面である。入力項目としては、下記に示されるものがある。 At the time of searching, the user instructs a search request from the input device 3509.
FIG. 58 is a search request input screen. The input items include the following items.

・探したい記事に含まれるキーワード列（必須）。
・探したい記事が含んではいけないキーワード列。
・検索対象の記事の種別として、全ての記事か、Ｑand Ａパターンに相当する記事だけか。省略時は全ての記事。・ Keyword string included in the article you want to find (required)
・ Keywords that should not contain the article you are looking for.
-As the types of articles to be searched, all articles or only articles corresponding to the Qand A pattern. The default is all articles.

・検索対象の記事の日付として、全区間か、一ヶ月以内か、一週間以内か。省略時は、全区間。
図５８に示される入力画面の下部には、検索前の準備フェーズにおいて記事が二次記憶装置３５０３に格納（ダウンロード）された最新の日時が表示されている。 -Whether the date of the article to be searched is all sections, within a month, or within a week. The default is all sections.
In the lower part of the input screen shown in FIG. 58, the latest date and time when the article is stored (downloaded) in the secondary storage device 3503 in the preparation phase before the search is displayed.

検索結果としては、下記に示されるものがある。
・スレッド一覧（図６０参照）。
・スレッド構造表示（参照数による縮退表示、同一タイトルによる縮退表示を含む）（図６１参照）。 Search results include those shown below.
・ Thread list (see FIG. 60).
Thread structure display (including a reduced display by the number of references and a reduced display by the same title) (see FIG. 61).

・時間区間スレッド表示（図６２参照）。
・Ｑ and Ａ対照表示。
・作者ノードグラフ表示（図５７参照）。 -Time interval thread display (see FIG. 62).
・ Q and A contrast display.
-Author node graph display (see FIG. 57).

・作者投稿一覧表示。
・記事本文表示。
これらの表示画面は、図５９に示されるように相互に切り替えることができる。これらの表示画面のうち、代表的なものについて以下に説明する。・ List of author posts.
-Article text display.
These display screens can be switched to each other as shown in FIG. Of these display screens, typical ones will be described below.

〔出力結果１：スレッド一覧〕
例えば、検索キーワードとして「エンジン」が入力された場合、図６０に示されるようなスレッド一覧画面が表示される。図６０で表示される検索結果は、下記に示されるものである。 [Output result 1: List of threads]
For example, when “engine” is input as a search keyword, a thread list screen as shown in FIG. 60 is displayed. The search results displayed in FIG. 60 are shown below.

・スレッドのトップ記事のタイトル。
・作者の名前。
・日付。・ Title of the top article of the thread.
・ The author's name.
·date.

・サイズ（スレッドの記事数、全体の記事サイズ）。
・スレッドの内容（ＱＡ：Ｑ and Ａパターン、ＤＣ：Ｄｉｓｃｕｓｓｉｏｎパターン、ＣＴ：Ｃｈａｔパターン）。 -Size (number of articles in the thread, overall article size).
Thread contents (QA: Q and A pattern, DC: Discussion pattern, CT: Chat pattern).

検索時にはスレッドのサイズに基づくソーティング処理が実行され、上位１０スレッドが表示される。ユーザが「次の１０スレッド」をクリックすると、次の１０スレッドが表示される。 When searching, sorting processing based on the thread size is executed, and the top 10 threads are displayed. When the user clicks on “next 10 threads”, the next 10 threads are displayed.

また、結果が多い場合には、更にキーワードを追加することにより絞り込み検索を実行させることも可能である。
他の画面へは、次の方法で移動することができる。 In addition, when there are many results, it is possible to execute a narrowing search by adding more keywords.
You can move to another screen in the following way.

・タイトルをクリックすると、スレッド構造が表示される。
・作者の名前をクリックすると、作者ノードグラフが表示される。
・日付をクリックすると、時間区間スレッド表示が表示される。 -Click on the title to display the thread structure.
・ Click on the author's name to display the author node graph.
-Click on the date to display the time interval thread display.

・スレッドの内容のＱＡをクリックすると、ＱＡの対が表示される。
〔出力結果２：スレッド構造表示〕
図６１は、スレッド構造の表示例である。 -Clicking on QA of the thread contents will display QA pairs.
[Output result 2: Thread structure display]
FIG. 61 is a display example of a thread structure.

図５４に示されるように、スレッド構造が、参照ノード数に基づいて縮退された木構造として表示される。表示領域の行数（縦方向の長さ）に応じて、参照ノード数の少ないノードは省略して表示される。ノードの表示内容は、下記のとおりである。表示領域の桁数（横方向の長さ）に応じて、各ノードにおいて表示される項目も適宜省略される。 As shown in FIG. 54, the thread structure is displayed as a tree structure reduced based on the number of reference nodes. Depending on the number of lines in the display area (vertical length), nodes with a small number of reference nodes are omitted and displayed. The display contents of the node are as follows. Depending on the number of digits in the display area (the length in the horizontal direction), items displayed at each node are also omitted as appropriate.

・行頭の“＋”記号は、省略された子ノードがある場合に付加される。
・ユーザが入力したキーワードを含む記事は、タイトルと作者部分が強調表示される（図６１では、矩形によって囲まれた部分）。 -The "+" sign at the beginning of a line is added when there are omitted child nodes.
In the article including the keyword input by the user, the title and the author part are highlighted (in FIG. 61, a part surrounded by a rectangle).

・記事タイトル。フォロー記事には“Re: ”記号が付加される。
・記事の作者名。
・記事の種別。内容推定部３５０４（図３５）によって推定された話題パターンに応じて、Ｑ（質問）、Ａ（答）、Ｄ（論争）が付加される。 -Article title. “Re:” symbol is added to the follow article.
-The author name of the article.
-Article type. Q (question), A (answer), and D (controversy) are added according to the topic pattern estimated by the content estimation unit 3504 (FIG. 35).

・自ノードの子孫ノードの数。“＋”記号が付加されたノードに対してのみ付加される。
また、このスレッド内において、更にキーワードを指定して絞り込み検索を実行することも可能である。 -Number of descendant nodes of the current node. It is added only to nodes to which the “+” sign is added.
Further, it is also possible to execute a narrowing search by further specifying a keyword in this thread.

他の画面へは、次の方法で移動することができる。
・タイトルをクリックすると、記事本文が表示される。
・作者の名前をクリックすると、作者ノードグラフが表示される。 You can move to another screen in the following way.
-Click the title to display the article text.
・ Click on the author's name to display the author node graph.

・「タイトル一覧」をクリックすると、スレッド構造が、同一タイトルに基づいて縮退された木構造として表示される（図５５参照）。
〔出力結果３：時間区間スレッド表示〕
図６２(a) は、一定の時間区間におけるスレッドの表示例である。キーワードが含まれる記事は黒丸によって、そうでない記事は灰色の丸によって表示されている。また、作者のイニシャルが各ノードの下に付加される。 • When “Title List” is clicked, the thread structure is displayed as a tree structure reduced based on the same title (see FIG. 55).
[Output result 3: Time interval thread display]
FIG. 62 (a) is a display example of threads in a certain time interval. Articles that contain keywords are displayed with black circles, and articles that do not have keywords are displayed with gray circles. Also, the author's initials are added below each node.

この画面では、日付の表示区間、日付の縦横表示、ウインドウのサイズ、セルの幅などが可変である。そこで、例えば他のスケジューラとサイズを合わせることが可能である。例えば、図６２(b) は、他のスケジューラであり、それと図６２(a) に示される時間区間スレッド表示とで、各セル幅が合わせられている。 On this screen, the date display section, date vertical and horizontal display, window size, cell width, and the like are variable. Therefore, for example, it is possible to match the size with other schedulers. For example, FIG. 62 (b) shows another scheduler, and the cell widths are matched in this and the time interval thread display shown in FIG. 62 (a).

他の画面へは、次の方法で移動することができる。
・黒丸又は灰色の丸をクリックすると、記事本文が表示される。
・作者のイニシャルをクリックすると、作者ノードグラフが表示される。 You can move to another screen in the following way.
・ Click on the black circle or gray circle to display the article text.
・ Click the author's initial to display the author node graph.

〔出力結果４：ＱＡ対表示〕
ＱＡ対表示とは、内容推定部３５０４（図３５）によって推測されたＱ andＡパターンに対応する質問と回答の対が、テーブルとして表示されたものである。テーブルの一行には、下記の情報が表示される。 [Output result 4: QA vs. display]
In the QA pair display, a question and answer pair corresponding to the Q and A pattern estimated by the content estimation unit 3504 (FIG. 35) is displayed as a table. The following information is displayed in one row of the table.

・タイトル。
・質問者。
・回答者（複数）。
他の画面へは、次の方法で移動することができる。 ·title.
·Questioner.
-Respondents (multiple).
You can move to another screen in the following way.

・タイトルをクリックすると、スレッド構造が表示される。
・作者の名前をクリックすると、作者ノードグラフが表示される。
〔出力結果５：作者ノードグラフ〕
作者ノードグラフは、そのスレッド内の各記事の作者間の会話関係がグラフ化されたものである。前述の図５７がその表示例である。 -Click on the title to display the thread structure.
・ Click on the author's name to display the author node graph.
[Output result 5: Author node graph]
The author node graph is a graph of the conversation relationship between the authors of each article in the thread. FIG. 57 is an example of the display.

他の画面へは、次の方法で移動することができる。
・作者をクリックすると、その作者の投稿一覧が表示される。
・リンクをクリックすると、スレッド構造が表示される。 You can move to another screen in the following way.
・ Clicking on the author will display the author's post list.
・ Click on the link to display the thread structure.

〔出力結果６：作者の投稿一覧〕
作者の投稿一覧は、各作者が投稿した記事の一覧を見るための画面である。日付、タイトル、記事の種別（Ｑ、Ａ、Ｄ）が日付順に表示される。 [Output result 6: Author's post list]
The author's post list is a screen for viewing a list of articles posted by each author. The date, title, and article type (Q, A, D) are displayed in date order.

他の画面へは、次の方法で移動することができる。
・タイトルをクリックすると、記事本文が表示される。
〔出力結果７：記事本文〕
これは、記事の本文である。他に、作者名、タイトル、日付、親記事へのリンクが表示される。 You can move to another screen in the following way.
-Click the title to display the article text.
[Output result 7: Article text]
This is the body of the article. In addition, the author name, title, date, and link to the parent article are displayed.

他の画面へは、次の方法で移動することができる。
・タイトルをクリックすると、スレッド構造が表示される。
・日付をクリックすると、時間区間スレッド表示が表示される。 You can move to another screen in the following way.
-Click on the title to display the thread structure.
-Click on the date to display the time interval thread display.

・作者の名前をクリックすると、作者ノードグラフが表示される。
〔時間区間スレッド表示の動作〕
図６３は、ビュー生成部３５０８（図３５）が実行する時間区間スレッド表示の動作フローチャートである。・ Click on the author's name to display the author node graph.
[Time section thread display operation]
FIG. 63 is an operation flowchart of time interval thread display executed by the view generation unit 3508 (FIG. 35).

まず、図３８に示されるデータ構成を有するスレッドインデックスにおいて、表示対象スレッドに対応するエントリ内の「スレッドの木構造」フィールドが参照されることにより、そのスレッドに含まれる文書ＩＤが１つ選択される（ステップ６３０１）。 First, in the thread index having the data structure shown in FIG. 38, one “document ID” included in the thread is selected by referring to the “thread tree structure” field in the entry corresponding to the display target thread. (Step 6301).

次に、図３７に示されるデータ構成を有する文書インデックスにおいて、上記選択された文書ＩＤに対応するエントリ内の「日付」フィールドが参照され、その日付が図６４に示されるデータ構成を有するカレンダインデックスに登録される（ステップ６３０２）。 Next, in the document index having the data structure shown in FIG. 37, the “date” field in the entry corresponding to the selected document ID is referred to, and the date is a calendar index having the data structure shown in FIG. (Step 6302).

次に、スレッドインデックスの表示対象スレッドに対応するエントリ内の「スレッドの木構造」フィールド内の全ての文書ＩＤに対する処理が試行されたか否かが判定される（ステップ６３０３）。 Next, it is determined whether processing for all document IDs in the “thread tree structure” field in the entry corresponding to the thread index display target thread has been attempted (step 6303).

全ての文書ＩＤに対する処理が試行されてはおらずステップ６３０３の判定がＮＯの場合には、ステップ６３０１に戻り、次の文書ＩＤに対する処理が繰り返される。
全ての文書ＩＤに対する処理が試行されステップ６３０３の判定がＹＥＳとなった場合には、図６４に示されるデータ構成を有するカレンダインデックスが参照されることにより、カレンダに文書ノードがマッピングされる。参照関係のエッジは、スレッドインデックスを参照して表示される。 If the process for all document IDs has not been attempted and the determination in step 6303 is NO, the process returns to step 6301 and the process for the next document ID is repeated.
If processing for all document IDs is attempted and the determination in step 6303 is YES, the calendar node having the data structure shown in FIG. 64 is referred to, and the document node is mapped to the calendar. The edge of the reference relationship is displayed with reference to the thread index.

〔作者ノードグラフの表示動作〕
図６５は、ビュー生成部３５０８（図３５）が実行する作者ノードグラフの表示動作を示す動作フローチャートである。 [Author node graph display operation]
FIG. 65 is an operation flowchart showing the display operation of the author node graph executed by the view generation unit 3508 (FIG. 35).

まず、図３８に示されるデータ構成を有するスレッドインデックスにおいて、表示対象スレッドに対応するエントリ内の「スレッドの木構造」フィールドが参照されることにより、そのスレッドに含まれる文書ＩＤが１つ選択される。次に、図３７に示されるデータ構成を有する文書インデックスと図３６に示されるデータ構成を有するユーザインデックスとが参照されることにより、上記選択された文書ＩＤに対応する文書の親文書（親発言）のユーザＩＤが取得される（ステップ６５０１）。 First, in the thread index having the data structure shown in FIG. 38, one “document ID” included in the thread is selected by referring to the “thread tree structure” field in the entry corresponding to the display target thread. The Next, by referring to the document index having the data structure shown in FIG. 37 and the user index having the data structure shown in FIG. 36, the parent document (parent message) of the document corresponding to the selected document ID is referred to. ) Is acquired (step 6501).

次に、図６６に示されるデータ構成を有する発言者配列内に、上記親子関係に対応するエントリが存在するか否かが判定される（ステップ６５０２）。
そのエントリが存在するなら、ステップ６５０４の処理に進む。 Next, it is determined whether or not an entry corresponding to the parent-child relationship exists in the speaker array having the data structure shown in FIG. 66 (step 6502).
If the entry exists, the process proceeds to step 6504.

そのエントリが存在しないなら、発言者配列の横軸又は縦軸のエントリが追加される（ステップ６５０３）。
その後、上記エントリの数字が１だけインクリメントされる（ステップ６５０４）。 If the entry does not exist, an entry on the horizontal axis or vertical axis of the speaker array is added (step 6503).
Thereafter, the number of the entry is incremented by 1 (step 6504).

次に、スレッドインデックスの表示対象スレッドに対応するエントリ内の「スレッドの木構造」フィールド内の全ての文書ＩＤに対する処理が試行されたか否かが判定される（ステップ６５０５）。 Next, it is determined whether or not processing for all document IDs in the “thread tree structure” field in the entry corresponding to the thread index display target thread has been attempted (step 6505).

全ての文書ＩＤに対する処理が試行されてはおらずステップ６５０５の判定がＮＯの場合には、ステップ６５０１に戻り、次の文書ＩＤに対する処理が繰り返される。
全ての文書ＩＤに対する処理が試行されステップ６５０５の判定がＹＥＳとなった場合には、図６７に示されるように、図６６に示されるデータ構成を有する発言者配列のエントリの数だけノードが描画され、親から子供に向かって“→”線が描画される。この線の太さは、その親子間の会話の度数に応じて決定される（ステップ６５０６）。 If processing for all document IDs has not been attempted and the determination in step 6505 is NO, processing returns to step 6501 and processing for the next document ID is repeated.
If processing for all document IDs is attempted and the determination in step 6505 is YES, as shown in FIG. 67, nodes are drawn by the number of entries in the speaker array having the data structure shown in FIG. Then, a “→” line is drawn from the parent to the child. The thickness of this line is determined according to the frequency of conversation between the parent and child (step 6506).

以上説明した第２の実施の形態において、スレッドの木構造が縮退されることにより、画面の表示範囲に応じたスレッドの表示可能となる。
また、自動的に推定された話題と共に検索結果が表示されるため、検索結果のスレッド数が多い場合でも、利用者は検索結果の概要を容易に把握することが可能となる。 In the second embodiment described above, the thread tree can be displayed according to the display range of the screen by reducing the tree structure of the thread.
In addition, since the search result is displayed together with the automatically estimated topic, even when the number of threads of the search result is large, the user can easily grasp the outline of the search result.

更に、スレッド中の文書量が多くても、同じ作者が何度も投稿している場合がある。作者を中心に見せるビユーが提供されることにより、スレッド内のキーパーソンが把握可能となるだけでなく、スレッドの全体構造もコンパクトに表示することが可能となる。 Furthermore, even if the amount of documents in the thread is large, the same author may post many times. By providing a view that mainly shows the author, not only can the key person in the thread be grasped, but also the entire structure of the thread can be displayed in a compact manner.

〔各実施の形態を実現するプログラムが記録された記録媒体についての補足〕
本発明は、計算機により使用されたときに、上述の本発明の各実施の形態の各構成によって実現される機能と同様の機能を計算機に行わせるための計算機読出し可能記憶媒体として構成することもできる。 [Supplementary information about a recording medium on which a program for realizing each embodiment is recorded]
The present invention may also be configured as a computer-readable storage medium for causing a computer to perform the same function as the function realized by each configuration of the above-described embodiments of the present invention when used by a computer. it can.

この場合に、図６８に示されるように、例えばフロッピィディスク、ＣＤ−ＲＯＭディスク、光ディスク、リムーバブルハードディスク等の可搬型記憶媒体６８０２や、ネットワーク回線６８０３経由で、本発明の好適実施例の各種機能を実現するプログラムが、コンピュータ６８０１の本体６８０４内のメモリ（ＲＡＭ又はハードディスク等）６８０５にロードされて、実行される。 In this case, as shown in FIG. 68, various functions of the preferred embodiment of the present invention can be performed via a portable storage medium 6802 such as a floppy disk, a CD-ROM disk, an optical disk, a removable hard disk, and the network line 6803. A program to be realized is loaded into a memory (RAM or hard disk or the like) 6805 in the main body 6804 of the computer 6801 and executed.

参照関係を有する文書群の例を示す図である。It is a figure which shows the example of the document group which has a reference relationship. 文書のデータ構造の例を示す図である。It is a figure which shows the example of the data structure of a document. 本発明の実施の形態のシステム構成図（その１）である。It is a system configuration figure (the 1) of an embodiment of the invention. 本発明の実施の形態のシステム構成図（その２）である。It is a system configuration figure (the 2) of an embodiment of the invention. メタインデックスのデータ構造を示す図である。It is a figure which shows the data structure of a meta index. スレッドインデックスのデータ構造を示す図である。It is a figure which shows the data structure of a thread index. 索引ファイルのデータ構造を示す図である。It is a figure which shows the data structure of an index file. 書式解析部と構造解析部の動作フローチャートである。It is an operation | movement flowchart of a format analysis part and a structure analysis part. 文書番号のスレッドインデックスへの登録の動作フローチャートである。It is an operation | movement flowchart of registration to the thread index of a document number. 文書番号のスレッドインデックスへの登録の動作説明図である。It is operation | movement explanatory drawing of registration to the thread index of a document number. 色番号登録の動作フローチャートである。It is an operation | movement flowchart of color number registration. カラーテーブルの例を示す図である。It is a figure which shows the example of a color table. キーワードビユーの動作フローチャートである。It is an operation | movement flowchart of a keyword view. スレッドビューの動作フローチャートである。It is an operation | movement flowchart of a thread view. スレッドビューの制御用配列の例を示す図である。It is a figure which shows the example of the array for thread view control. 発言者ビューの動作フローチャートである。It is an operation | movement flowchart of a speaker view. 発言者ビューの制御用配列の例を示す図である。It is a figure which shows the example of the arrangement | sequence for control of a speaker view. 発言内容表示の動作フローチャートである。It is an operation | movement flowchart of message content display. 作者別／日付別色分け表示の動作フローチャートである。It is an operation | movement flowchart of a color-coded display according to author / date. 作者別／日付別色分け表示用配列の例を示す図である。It is a figure which shows the example of the arrangement | sequence for color-coded display according to author / date. スレッドビューを使った検索結果の強調表示の動作フローチャートである。It is an operation | movement flowchart of the highlight display of the search result using a thread view. キーワードビユーを使った検索結果の強調表示の動作フローチャートである。It is an operation | movement flowchart of the highlighting of the search result using a keyword view. サブトピックからのキーワード抽出の制御動作フローチャートである。It is a control operation | movement flowchart of the keyword extraction from a subtopic. サブトピックインデックスの例を示す図である。It is a figure which shows the example of a subtopic index. キーワードビユーの表示例を示す図である。It is a figure which shows the example of a keyword view display. スレッドビューの表示例を示す図である。It is a figure which shows the example of a display of a thread | sled view. 発言者ビューの表示例を示す図である。It is a figure which shows the example of a display of a speaker view. 発言内容の表示例を示す図である。It is a figure which shows the example of a display of message content. スレッドビューを用いた文書属性「作者」の強調（色別）表示の例を示す図である。It is a figure which shows the example of the emphasis (by color) display of the document attribute "author" using a thread view. 会議室内の全発言の検索（入力）表示例を示す図である。It is a figure which shows the example of search (input) display of all the utterances in a meeting room. スレッドビューを用いた文字列「プロトコル」を含むノードの強調表示の例を示す図である。It is a figure which shows the example of the highlight display of the node containing the character string "protocol" using a thread view. キーワードビユーを用いた文字列「プロトコル」を含むスレッドの強調表示の例を示す図である。It is a figure which shows the example of the highlight display of the thread | sled containing the character string "protocol" using a keyword view. 会議室内の全発言の検索（結果出力）表示例を示す図である。It is a figure which shows the search (result output) display example of all the utterances in a meeting room. サブトピックから抽出したキーワードの表示例を示す図である。It is a figure which shows the example of a display of the keyword extracted from the subtopic. 本発明の他の実施の形態（第２の実施の形態）の構成図である。It is a block diagram of other embodiment (2nd Embodiment) of this invention. ユーザインデックスの構成図である。It is a block diagram of a user index. 文書インデックスの構成図である。It is a block diagram of a document index. スレッドインデックスの構成図である。It is a block diagram of a thread index. ＱＡインデックスの構成図である。It is a block diagram of a QA index. ＤＩＳＣＵＳＳインデックスの構成図である。It is a block diagram of a DISCUSS index. ＣＨＡＴインデックスの構成図である。It is a block diagram of a CHAT index. 内容推定部の動作フローチャートである。It is an operation | movement flowchart of a content estimation part. Ｑ and Ａパターン判定処理の動作フローチャートである。It is an operation | movement flowchart of a Q and A pattern determination process. Ｑ（質問）文書に含まれるパターンの例を示す図である。It is a figure which shows the example of the pattern contained in a Q (question) document. お礼文書に含まれるパターンの例を示す図である。It is a figure which shows the example of the pattern contained in a thank-you document. Ｑ and Ａパターンの判定処理が推測するスレッド構造の例を示す図である。It is a figure which shows the example of the thread structure which the determination process of Q and A pattern guesses. Ｑ and Ａパターンの判定処理が推測するスレッド文書群の例を示す図である。It is a figure which shows the example of the thread document group which the determination process of Q and A pattern guesses. 論争パターンの判定処理の動作フローチャートである。It is an operation | movement flowchart of the determination process of a dispute pattern. Ｄｉｓｃｕｓｓｉｏｎパターンの判定処理が推測するスレッド構造の例を示す図である。It is a figure which shows the example of the thread | sled structure which the determination process of a Discussion pattern guesses. 雑談パターンの判定処理の動作フローチャートである。It is an operation | movement flowchart of the determination process of a chatting pattern. Ｃｈａｔパターンの判定処理が推測するスレッド構造の例を示す図である。It is a figure which shows the example of the thread structure which the determination process of Chat pattern guesses. オリジナルのスレッド構造の表示例を示す図である。It is a figure which shows the example of a display of an original thread structure. オリジナルのスレッド構造の表示例を示す図である。It is a figure which shows the example of a display of an original thread structure. 参照ノード数により縮退したスレッド構造の表示例を示す図である。It is a figure which shows the example of a display of the thread structure degenerated by the number of reference nodes. 同一タイトル文書を縮退したスレッド構造の表示例を示す図である。It is a figure which shows the example of a display of the thread | sled structure which reduced the same title document. 時間区間で取り出したスレッド構造の表示例を示す図である。It is a figure which shows the example of a display of the thread structure taken out in the time area. 作者をノードとしたグラフ構造を示す図である。It is a figure which shows the graph structure which made the author a node. ネットワークニュース検索システムの表示例を示す図である。It is a figure which shows the example of a display of a network news search system. 検索画面の一覧を示す図である。It is a figure which shows the list of a search screen. 出力結果１：スレッド一覧を示す図である。Output result 1: a list of threads. 出力結果２：スレッド構造表示を示す図である。Output result 2: It is a figure which shows thread structure display. 出力結果３：時間区間スレッド表示を示す図である。Output result 3: time interval thread display. 時間区間スレッド表示の作成フローを示す図である。It is a figure which shows the creation flow of a time interval thread display. カレンダインデックスのデータ構成図である。It is a data block diagram of a calendar index. 作者ノードグラフの作成フローを示す図である。It is a figure which shows the creation flow of an author node graph. 発言者配列のデータ構成図である。It is a data block diagram of a speaker arrangement | sequence. 作者ノードグラフの作成説明図である。It is creation explanatory drawing of an author node graph. 本実施の形態を実現するプログラムが記録された記録媒体の説明図である。It is explanatory drawing of the recording medium with which the program which implement | achieves this Embodiment was recorded.

Explanation of symbols

３０１文書群データベース
３０２文書群解析装置
３０３集計装置
３０４メタインデックス
３０５スレッドインデックス
３０６表示装置
４０１書式解析部
４０２構造解析部
４０３内容解析部
４０４索引ファイル
４０５文字列検索装置
３５０１処理装置
３５０２文書取得部
３５０３二次記憶装置
３５０４内容推定部
３５０５表示用インデックス
３５０６検索エンジン
３５０７検索用インデックス
３５０８ビュー生成部
３５０９入力装置
３５１０表示装置 301 Document Group Database 302 Document Group Analysis Device 303 Total Device 304 Meta Index 305 Thread Index 306 Display Device 401 Format Analysis Unit 402 Structure Analysis Unit 403 Content Analysis Unit 404 Index File 405 Character String Search Device 3501 Processing Device 3502 Document Acquisition Unit 3503 Two Next storage device 3504 Content estimation unit 3505 Display index 3506 Search engine 3507 Search index 3508 View generation unit 3509 Input device 3510 Display device

Claims

A related document display device for displaying a document group consisting of documents having a reference relationship,
Content estimation means for estimating the content of the document group based on the document content and a posting pattern between authors from a document database consisting of a document group having a reference relationship, and generating an index corresponding to the topic pattern of the content;
An input means for inputting a search request for the document database from a user;
Search engine means for searching for documents in the document database;
View generation means for generating one or more views using a search result from the search engine means and the index, and switching the one or more views to display on the display device;
A related document display device comprising:

The view generation means summarizes the documents having the same attribute in the displayed documents, displays the tree structure of the reference relationship between the displayed documents and the information regarding the node corresponding to each document, The related document display device according to claim 1, wherein the related document display device allows a user to easily grasp an entire structure of a reference relationship between documents displayed.

The view generation means selects only a document having a large number of references in the reference relationship between displayed documents according to the size of the screen, thereby displaying a tree structure of the reference relationship between displayed documents and each document. The related document display device according to claim 1, wherein information related to a corresponding node is simply displayed so that a user can easily grasp an entire structure of a reference relationship between documents displayed.

The view generation means selects only a document including a keyword input by a user and has a large number of references to the document, thereby corresponding to a tree structure of reference relationships between displayed documents and each document. 2. The related document display device according to claim 1, wherein the related document display device makes it possible for the user to easily grasp the entire structure of the search result for the document database by simply displaying the information about the node.

The view generation means displays the reference relationship between the displayed documents together with the topic pattern estimated by the content estimation means for each document, so that the entire structure of the search result for the document database is displayed to the user. The related document display device according to claim 1, wherein the related document display device makes it possible to easily grasp.

The view generation means displays only a document in a specific time section among displayed documents in a calendar format, thereby allowing a user to easily grasp a search result for the document database in association with temporal information. The related document display device according to claim 1.

The view generation means highlights and displays a specific topic pattern estimated by the content estimation means, thereby allowing a user to easily grasp only an important part of a document reference relationship in a document database. The related document display device according to claim 1.

The view generation unit causes the search engine unit to execute a search only on documents corresponding to the question and the answer among the specific topic patterns estimated by the content estimation unit, thereby obtaining a user question The related document display device according to claim 1, wherein a pair of a question and an answer corresponding to a matter is easily grasped.

The view generation means highlights and displays a specific author based on a posting history for each document in the document database and a specific topic pattern estimated by the content estimation means, thereby displaying a document to the user. The related document display device according to claim 1, wherein only the important part of the reference relationship of documents in the database is easily grasped.

The view generation means displays a directed graph in which the author of each document in the document database is a node, the reference relationship between the documents is a link, and the number of references is the link emphasis level. The related document display device according to claim 1, wherein the reference relationship of the document is easily grasped from the viewpoint of the author.

2. The related document display device according to claim 1, wherein the document stored in the document database is an article document of network news downloaded through a network.

A recording medium that records a program read by the computer when used,
A function that estimates the content of the document group based on the document content and a posting pattern between the authors from a document database including a document group having a reference relationship, and generates an index corresponding to the topic pattern of the content;
A function of inputting a search request for the document database from a user;
A function for searching for documents in the document database;
Generating one or more views using the search result and the index, and switching the one or more views to display on the display device;
A computer-readable recording medium on which a program for causing the computer to execute is recorded.