JP2001134596A

JP2001134596A - Managing device and retrieving method for structured document

Info

Publication number: JP2001134596A
Application number: JP31672099A
Authority: JP
Inventors: Takashi Shimojima; 崇下島; Takeshi Tsurubayashi; 健鶴林; Shinichi Nakai; 信一中井; Chuichi Kikuchi; 忠一菊池
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1999-11-08
Filing date: 1999-11-08
Publication date: 2001-05-18

Abstract

PROBLEM TO BE SOLVED: To enable a document management system, which handles structured documents, to perform retrieval wherein various logical structures are specified and speedily perform, specially, structure specified retrieval wherein an intermediate layer is specified. SOLUTION: In registration, a structure analyzing means 107 analyzes the logical structure of a document to be registered, an ID generating means 108 allocates different IDs to a terminal element among elements divided into logical structure units and elements other than the terminal element, and an index information generation part 108 generates index information using the two kinds of IDs. In retrieval, high-speed full-text retrieval is performed under a retrieval condition inputted by an index information retrieving means 110 by using an index information storage means 114. A result generating means 111 generates an entity for retrieval result list and text body display.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、ＳＧＭＬやＸＭＬ
などの論理的な構造要素を有する構造化文書を計算機を
用いて管理する文書管理システムにおける、論理構造を
指定した全文検索を行なう構造化文書検索方法、および
要素単位のアクセスコントロールについての構造化文書
管理に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to SGML and XML
In a document management system that uses a computer to manage structured documents having logical structural elements such as a document, a structured document search method for performing a full-text search specifying a logical structure, and a structured document for element-based access control Regarding management.

【０００２】[0002]

【従来の技術】電子化文書の増大に伴い、マニュアル、
議事録、仕様書等、論理的構造を有する文書を扱う構造
化文書に対する関心が高まっている。それにより、構造
化文書の特長を生かした、論理構造を指定した検索を行
なう機能が重要となる。構造化文書はその論理構造がＤ
ＴＤ(Document Type Definition:文書型定義)によって
定義される。2. Description of the Related Art With the increase in electronic documents, manuals,
There is an increasing interest in structured documents that handle documents having a logical structure, such as minutes and specifications. Therefore, a function of performing a search specifying a logical structure by utilizing the features of the structured document becomes important. A structured document whose logical structure is D
It is defined by TD (Document Type Definition).

【０００３】従来、構造化文書管理システムにおける文
書の検索装置としては、特開平１０−２４０７５２号公
報（以下、公知例と呼ぶ）に記載された発明が知られて
いる。Conventionally, as a document retrieval apparatus in a structured document management system, an invention described in Japanese Patent Application Laid-Open No. 10-240752 (hereinafter referred to as a known example) is known.

【０００４】以下、公知例の概要について説明する。そ
の文書登録システムの構成図は図１８に示すとおりであ
る。公知例では登録する際、まず文書構造解析プログラ
ム１８０１にて登録対象文書の持つ論理構造を解析し
て、解析済み文書データを作成し解析済み文書データ格
納領域１８０５に登録する。The outline of the known example will be described below. The configuration diagram of the document registration system is as shown in FIG. In registration in the known example, first, the document structure analysis program 1801 analyzes the logical structure of the document to be registered, creates analyzed document data, and registers it in the analyzed document data storage area 1805.

【０００５】次に、構造インデックス作成プログラム１
８０２にて各登録対象文書の持つ論理構造を、登録順に
従って順次重ね合わせ、文書中における出現位置および
種別が同じである要素群は単一のメタ要素によって代表
させ、文書中における出現位置が同じである文字列デー
タ群は単一のメタ文字列データによって代表させること
により、メタ要素群およびメタ文字列データ群（公知例
ではこれらを総称してメタノードと呼ぶ）の木構造から
構成される構造インデックスを生成し該構造インデック
スを構成する全てのメタノードに対して、それらを構造
インデックスの中で一意に識別する識別子（公知例では
これを文脈識別子と呼ぶ）を与え、構造インデックス格
納領域１８０６に登録する。Next, a structure index creation program 1
At 802, the logical structures of the documents to be registered are sequentially superimposed in the order of registration, and elements having the same appearance position and type in the document are represented by a single meta element, and the appearance positions in the document are the same. Is represented by a single meta-character string data, thereby forming a meta-element group and a meta-character string data group (in a known example, these are collectively called a metanode). An index is generated, and an identifier (this is called a context identifier in a known example) that uniquely identifies the metanodes constituting the structure index is assigned to all metanodes constituting the structure index, and registered in the structure index storage area 1806. I do.

【０００６】図１９は上記構造インデックスを作成する
過程を示す図である。図１９において、文書１、文書
２、文書３は、それぞれ登録対象文書の解析済み文書デ
ータを表わしている。これらの解析済み文書データの構
造を既存の構造インデックス上に順次重ね合わせること
により、構造インデックスが形成されていく。まず最初
に文書１が入力されると、最初の段階では構造インデッ
クスは初期状態（空）であるため、該解析済みデータと
等価な木構造が生成されてそのまま構造インデックスに
登録され、構造インデックスは１９０１に示す状態とな
る。新たに生成されたメタ要素にはＥ１からＥ５までの
文脈識別子、新たに生成されたメタ文字列データにはＣ
１からＣ３までの文脈識別子が割り当てられる。次に文
書２が入力されると、既存の構造インデックス（１９０
１）と構造が重複する部分については何も行わず、１９
０２上に対応する部分がなかった部分構造（図中の網掛
け部分）だけが新たに登録される。新たに生成されたメ
タ要素には文脈識別子Ｅ６およびＥ７、新たに生成され
たメタ文字列データには文脈識別子Ｃ４が割り当てられ
る。次に文書３が入力されると、既存の構造インデック
ス（１９０２）と構造が重複する部分については何も行
わず、１９０２上に対応する部分がなかった部分構造
（図中の網掛け部分）だけが新たに登録される。新たに
生成されたメタ要素には文脈識別子Ｅ８、Ｅ９およびＥ
１０、新たに生成されたメタ文字列データには文脈識別
子Ｃ５およびＣ６が割り当てられる。このようにして、
３個の文書が登録された段階で、構造インデックスは１
９０３に示す状態となる。FIG. 19 is a diagram showing a process of creating the structure index. In FIG. 19, Document 1, Document 2, and Document 3 each represent analyzed document data of a registration target document. The structure index is formed by sequentially superimposing the structure of the analyzed document data on the existing structure index. First, when document 1 is input, since the structure index is in the initial state (empty) at the initial stage, a tree structure equivalent to the analyzed data is generated and registered as it is in the structure index. The state shown in 1901 is obtained. The newly generated meta element has context identifiers from E1 to E5, and the newly generated meta character string data has C
Context identifiers from 1 to C3 are assigned. Next, when document 2 is input, the existing structure index (190
No action is taken for the part where the structure overlaps with 1), and 19
Only the partial structure (the shaded portion in the figure) for which there is no corresponding part on 02 is newly registered. The context identifiers E6 and E7 are assigned to the newly generated meta element, and the context identifier C4 is assigned to the newly generated meta character string data. Next, when the document 3 is input, nothing is performed on a portion whose structure overlaps with the existing structure index (1902), and only the partial structure (shaded portion in the figure) having no corresponding portion on 1902 Is newly registered. Context identifiers E8, E9 and E
10. Context identifiers C5 and C6 are assigned to the newly generated meta-character string data. In this way,
When three documents have been registered, the structure index is 1
The state shown in FIG.

【０００７】次に、構造化全文データ生成プログラム１
８０３にて各登録対象文書について、その文書に対応す
る解析済み文書データ中に含まれるすべての文字列と、
その文字列を構造インデックス中で示される文脈識別子
との対応関係の定義から構成されるデータ（公知例では
これを構造化全文データと呼ぶ）を生成し、構造化全文
データ格納領域１８０７に登録する。Next, a structured full-text data generation program 1
At 803, for each document to be registered, all the character strings included in the analyzed document data corresponding to the document,
Data composed of the definition of the correspondence between the character string and the context identifier indicated in the structure index (this is called structured full-text data in a known example) is generated and registered in the structured full-text data storage area 1807. .

【０００８】次に、文字列インデックス作成プログラム
１８０４にて、各登録対象文書に対応する構造化全文デ
ータから、前記文脈識別子を含んだ全文検索を行なうた
めの文字列インデックスを作成し、文字列インデックス
格納領域１８０８に登録する。[0008] Next, a character string index creating program 1804 creates a character string index for performing a full-text search including the context identifier from the structured full-text data corresponding to each document to be registered. Register in the storage area 1808.

【０００９】また、公知例における検索は、まず前記構
造インデックスを参照し、指定された構造条件を満たす
文脈識別子の集合を決定する。In the search in the known example, first, a set of context identifiers satisfying a specified structure condition is determined by referring to the structure index.

【００１０】次に、それらの文脈識別子をキーとして文
字列を検索することにより、指定条件を満たす文書群を
求める。Next, a document group that satisfies the designated condition is obtained by searching a character string using the context identifier as a key.

【００１１】[0011]

【発明が解決しようとする課題】上記従来技術の方法で
は「章以下に“○○”を含む文書」というような検索条
件の場合、構造インデックスを参照して該当する全ての
識別子の集合を求め、その識別子をもとにＯＲ検索を行
なわなければならないため検索が遅くなるという課題を
有していた。According to the method of the prior art described above, in the case of a search condition such as "document including" xx "in a chapter or less", a set of all applicable identifiers is obtained by referring to a structure index. However, since the OR search must be performed on the basis of the identifier, the search becomes slow.

【００１２】また、構造化文書を扱う文書管理装置とし
て、構造化文書の特長を生かし要素単位のアクセス権の
設定ができる機能が求められる。[0012] Further, as a document management apparatus for handling structured documents, a function capable of setting an access right for each element by utilizing the features of structured documents is required.

【００１３】本発明は上記従来技術の課題を解決するも
ので、構造化文書を対象とした全文検索において、様々
な論理構造指定検索に対応すること、および要素単位の
アクセス権の設定を目的とする。An object of the present invention is to solve the above-mentioned problems of the prior art, and to provide a full-text search for a structured document, to support various logical structure designation searches, and to set an access right for each element. I do.

【００１４】[0014]

【課題を解決するための手段】上記課題を解決するため
に、本発明では論理構造指定検索における第一の手段と
して、構造化文書の登録の際に登録文書の論理構造を解
析する構造解析手段と、前記構造解析手段によって論理
構造に分けられた要素を一意に特定するＩＤを作成する
ＩＤ作成手段と、全文検索のための索引情報を作成する
索引情報作成手段とを備え、かつ前記ＩＤ作成手段が論
理構造の位置を末端要素と末端要素以外とで別々にＩＤ
を割当てることを特徴とする。これにより構造を指定し
た高速な検索ができるという作用を有する。According to the present invention, as a first means in a logical structure designation search, a structure analysis means for analyzing a logical structure of a registered document at the time of registration of a structured document. ID creation means for creating an ID for uniquely specifying an element divided into a logical structure by the structure analysis means, and index information creation means for creating index information for full-text search, and Means are used to identify the position of the logical structure separately for the terminal element and the non-terminal element.
Is assigned. This has an effect that a high-speed search specifying a structure can be performed.

【００１５】論理構造指定検索における第二の手段とし
て前記ＩＤ作成手段が論理構造の位置を示すＩＤを、素
数どうしの乗算値で割当てることを特徴とする。これに
より、構造化文書の上位層または中間層を指定する検索
（例えば、「章」は「タイトル」と「節」からなってい
る場合に「章」以下から検索するような場合）におい
て、構造を指定しない検索と同じような検索時間で検索
できるという作用を有する。As a second means in the logical structure designation search, the ID creating means assigns an ID indicating the position of the logical structure by a multiplication value of prime numbers. Thus, in a search that specifies the upper layer or the middle layer of the structured document (for example, when “chapter” is composed of “title” and “section” and is searched from “chapter” and below), This has the effect that the search can be performed in the same search time as the search in which no is specified.

【００１６】また、要素単位のアクセス権設定における
第一の手段として、登録対象構造化文書に各要素に関す
るアクセス権の情報を持たせ、この情報を用いて全末端
要素へのアクセス権を参照可能なテーブルを作成するア
クセス権管理テーブル作成手段を備えることを特徴とす
る。これにより要素単位のアクセス権を考慮した結果一
覧作成および実体取得を実現することができるという作
用を有する。Further, as a first means for setting an access right for each element, the structured document to be registered is provided with access right information on each element, and the access right to all terminal elements can be referred to using this information. Access right management table creating means for creating a simple table. As a result, there is an effect that it is possible to realize the result list creation and entity acquisition in consideration of the access right of each element.

【００１７】要素単位のアクセス権設定における第二の
手段として、本システムで扱う構造化文書のＤＴＤ毎
に、各要素へのアクセス権をに関する情報を格納したア
クセス権情報格納手段を備えることを特徴とする。これ
により登録対象構造化文書にアクセス権に関する情報が
付加されていない場合でも、アクセス権を考慮した結果
一覧作成および実体取得を実現することができるという
作用を有する。As a second means for setting an access right for each element, an access right information storage unit for storing information on an access right to each element is provided for each DTD of a structured document handled by the present system. And As a result, even when the information on the access right is not added to the structured document to be registered, the result list creation and the entity acquisition in consideration of the access right can be realized.

【００１８】[0018]

【発明の実施の形態】以下、本発明の実施の形態につい
て説明する。なお、本発明はこれら実施の形態に何ら限
定されるものではなく、その要旨を逸脱しない範囲にお
いて、種々なる態様で実施し得る。Embodiments of the present invention will be described below. The present invention is not limited to these embodiments at all, and can be implemented in various modes without departing from the gist thereof.

【００１９】（実施の形態１）図１は本発明の実施の形
態１における構造化文書管理装置の構成図である。(Embodiment 1) FIG. 1 is a configuration diagram of a structured document management apparatus according to Embodiment 1 of the present invention.

【００２０】図１に示す構造化文書管理装置は、端末１
０１、構造化文書入力手段１０２、検索条件入力手段１
０３、結果表示手段１０４、検索エンジン１０５、デー
タ格納部１０６からなる。端末１０１は、文書検索にお
ける検索条件の指定および検索結果の表示に使用する。
構造化文書入力手段１０２は、登録対象文書を格納して
おき文書の登録を行なう際にここから検索エンジン１０
５へデータを送る。The structured document management device shown in FIG.
01, structured document input means 102, search condition input means 1
03, a result display unit 104, a search engine 105, and a data storage unit 106. The terminal 101 is used to specify search conditions in document search and to display search results.
The structured document input unit 102 stores the document to be registered, and when the document is registered, starts searching the search engine 10.
Send data to 5.

【００２１】検索条件入力手段１０３は、端末１０１か
ら入力された検索条件を検索エンジン１０５へ送る。結
果表示手段１０４は、検索結果を検索エンジン１０５か
ら受け取り、端末１０１に表示する。検索エンジン１０
５は、実際に検索対象の文書の登録、検索および検索結
果の作成を行なう。まず、登録に関して、１０７は登録
対象文書の論理構造を解析する構造解析手段、１０８は
構造解析済みデータの各要素に対してＩＤを作成し割当
てるＩＤ作成手段、１０９は高速に検索を行なうための
索引情報を作成する索引情報作成手段である。これら１
０７、１０８、および１０９についての詳細は、文書登
録処理の流れの説明の中で述べる。次に検索に関して、
１１０は検索条件入力手段１０３から受けた検索条件
で、実際に索引情報を用いて検索処理を行なう索引情報
検索手段である。１１０についての詳細は文書検索の流
れの説明の中で述べる。次に結果作成に関して、１１１
は検索結果の一覧や、本文表示のための実体を作成し結
果表示手段１０４へ渡す結果作成手段である。データ格
納部１０６は、構造解析手段１０７によって作成された
構造解析済みデータを格納する構造解析済みデータ格納
手段１１２、ＩＤ作成手段によって作成された、構造を
識別するためのＩＤのテーブルを格納するＩＤテーブル
格納手段１１３、索引情報作成手段１０９によって作成
された索引情報を格納する索引情報格納手段１１４、登
録文書の実体データを格納する実体データ格納手段１１
５、検索結果一覧のための書誌データを格納する一覧デ
ータ格納手段１１６からなり、構造化文書の検索および
結果表示に用いるデータの格納に使用する。The search condition input means 103 sends the search condition input from the terminal 101 to the search engine 105. The result display means 104 receives the search result from the search engine 105 and displays it on the terminal 101. Search engine 10
Reference numeral 5 actually registers a document to be searched, performs a search, and creates a search result. First, regarding registration, 107 is a structure analyzing means for analyzing the logical structure of the document to be registered, 108 is an ID creating means for creating and assigning an ID to each element of the structurally analyzed data, and 109 is a means for performing a high-speed search. Index information creating means for creating index information. These one
Details of 07, 108, and 109 will be described in the description of the flow of the document registration process. Next, regarding the search,
Reference numeral 110 denotes a search condition received from the search condition input means 103, which is an index information search means for actually performing a search process using the index information. Details of 110 will be described in the description of the document search flow. Next, regarding the result creation, 111
Is a result creation unit that creates a list of search results and an entity for displaying the text and passes it to the result display unit 104. The data storage unit 106 includes a structure analyzed data storage unit 112 that stores the structure analyzed data created by the structure analysis unit 107, and an ID that stores an ID table for identifying the structure created by the ID creation unit. Table storage means 113, index information storage means 114 for storing the index information created by index information creation means 109, and entity data storage means 11 for storing the entity data of the registered document
5. A list data storage unit 116 for storing bibliographic data for a search result list is used for storing data used for structured document search and result display.

【００２２】次に本実施の形態における文書登録の処理
を具体的な構造化文書の例を用いて説明する。Next, the document registration process in the present embodiment will be described using a specific example of a structured document.

【００２３】まず、構造化文書入力手段１０２から登録
対象文書を読み込む。次に構造解析手段１０７によって
登録対象文書の構造を理解できる形に変換する。この構
造解析手段１０７は登録対象文書のＤＴＤを参照してそ
の構造を解析する。この構造解析手段１０７によって、
文字の並びとしての構造化文書がＩＤ作成手段１０８に
理解できるデータ構造に変換され（以下、構造解析済み
データと呼ぶ）、構造解析済みデータ格納手段１１２に
格納する。First, a document to be registered is read from the structured document input means 102. Next, the structure of the registration target document is converted into a form that can be understood by the structure analysis unit 107. The structure analyzing means 107 analyzes the structure of the document to be registered with reference to the DTD. By this structural analysis means 107,
The structured document as a sequence of characters is converted into a data structure that can be understood by the ID creating unit 108 (hereinafter, referred to as structurally analyzed data), and stored in the structurally analyzed data storage unit 112.

【００２４】次にＩＤ作成手段１０８で、構造を識別す
るためのＩＤの作成、割当てを行なう。図２は構造化文
書の論理構造を表すタグの例を示す図である。図２にお
いて、「論文」は「書誌」と「本文」から構成され、
「書誌」は「タイトル」と「著者」と「日付」から構成
され、「本文」は「章」から構成され、「章」は「タイ
トル」と「節」と「段落」から構成され、「節」は「タ
イトル」と「項」と「段落」から構成され、「項」は
「タイトル」と「段落」から構成されることを示してい
る。以下では、この論理構造を持った構造化文書を中
心に説明する。図２において、木構造の末端に位置する
要素は網掛けで表示されている。この末端要素は構造化
文書の実体を構成する部分である。すなわち文書中の文
字データは、これら末端要素中にのみ存在する。このよ
うな末端要素は、同一構造化文書中に同じタグ名で繰返
し出現することが図２からわかる。図３はこれら末端要
素に対して割当てるＩＤの例である（以下、このＩＤを
メンバＩＤと呼ぶ）。このメンバＩＤはある木構造中の
全ての末端要素に一意に割当てるのではなく、タグ名お
よびその出現順序が等しい場合は同じＩＤを割当てるも
のとする。ここで出現順序とは、同じ親要素を持つ同じ
タグ名の要素の中で何番目に出現した要素かを示す番号
である。Next, ID creating means 108 creates and assigns an ID for identifying the structure. FIG. 2 is a diagram illustrating an example of a tag representing a logical structure of a structured document. In FIG. 2, the "paper" is composed of "bibliography" and "text".
"Bibliography" is composed of "Title", "Author" and "Date", "Body" is composed of "Chapter", "Chapter" is composed of "Title", "Section" and "Paragraph", The “section” is composed of “title”, “section”, and “paragraph”, and the “section” is composed of “title” and “paragraph”. Hereinafter, a description will be given mainly of a structured document having this logical structure. In FIG. 2, elements located at the ends of the tree structure are shaded. This terminal element is a part constituting the entity of the structured document. That is, character data in a document exists only in these terminal elements. It can be seen from FIG. 2 that such terminal elements repeatedly appear with the same tag name in the same structured document. FIG. 3 shows an example of an ID assigned to these terminal elements (hereinafter, this ID is referred to as a member ID). This member ID is not uniquely assigned to all terminal elements in a certain tree structure, but the same ID is assigned when the tag names and the order of appearance are the same. Here, the appearance order is a number indicating the number of an element that appears in an element having the same parent element and the same tag name.

【００２５】これに対し、末端要素以外の要素について
もＩＤを割当てる（以下、このＩＤをグループＩＤと呼
ぶ）。このグループＩＤは、メンバＩＤとは異なり、タ
グ名に関係なく木構造中でその論理位置を一意に決定す
るＩＤを割当てるものとする。このようにグループＩＤ
およびメンバＩＤを割当てた場合、論理構造中のある末
端要素をグループＩＤとメンバＩＤの組み合わせによっ
て一意に特定することが可能となる。以下で例を示す。On the other hand, an ID is assigned to an element other than the terminal element (hereinafter, this ID is called a group ID). This group ID is different from the member ID and is assigned an ID that uniquely determines its logical position in the tree structure regardless of the tag name. Thus, the group ID
When a member ID is assigned, a terminal element in a logical structure can be uniquely specified by a combination of a group ID and a member ID. An example is shown below.

【００２６】図４は図２の論理構造に基づいて書かれた
構造化文書の一例である。図４の構造化文書に対して、
グループＩＤおよびメンバＩＤを割当てた例が図５であ
る。FIG. 4 shows an example of a structured document written based on the logical structure of FIG. For the structured document of FIG.
FIG. 5 shows an example in which a group ID and a member ID are assigned.

【００２７】図５の５０１に当たる末端要素を一意に特
定するグループＩＤおよびメンバＩＤの組み合わせは
（Ｇ２，Ｍ１）であり、５０２に当たる末端要素を一意
に特定するグループＩＤおよびメンバＩＤの組み合わせ
は（Ｇ４、Ｍ１）である。すなわち、ある末端要素を特
定するためには、その親要素のグループＩＤと該末端要
素自身のメンバＩＤを指定すればよい。The combination of the group ID and the member ID uniquely specifying the terminal element 501 in FIG. 5 is (G2, M1), and the combination of the group ID and the member ID uniquely specifying the terminal element 502 is (G4). , M1). That is, in order to specify a certain terminal element, the group ID of the parent element and the member ID of the terminal element may be specified.

【００２８】さらに、５０３は検索時に複数の要素を指
定している例であり、メンバＩＤは特に指定せず、グル
ープＩＤがＧ５またはＧ６またはＧ７として検索すれ
ば、５０３の要素がヒットするといった要素の「集合」
である。Further, reference numeral 503 denotes an example in which a plurality of elements are designated at the time of retrieval. If the member ID is not specified and the group ID is retrieved as G5, G6 or G7, the element 503 is hit. The "set"
It is.

【００２９】索引情報作成手段１０９では、このように
ＩＤ作成手段１０８で割当てられたグループＩＤおよび
メンバＩＤを用いて検索用索引情報を作成する。索引情
報作成手段１０９の処理の流れを図６を用いて説明す
る。まず構造解析済みデータ格納手段１１２から登録対
象文書の構造解析済みデータを読み込む（ステップ６０
１）。次にこの読み込んだ登録対象文書ごとに一意な番
号（以下、文書番号と呼ぶ）を割当てる。また、この読
み込んだ文書のひとつひとつの構成要素（末端要素）ご
とに、その文書内で一意な番号（以下、要素番号と呼
ぶ）を割当てる（ステップ６０２）。なお、この文書番
号と要素番号の組み合わせにより、検索エンジンに登録
された文書のひとつひとつの要素を特定することが可能
となる。次にその構造解析済みデータ中の文字列につい
てあらかじめ定めた文字数の文字連鎖を取り出す（ステ
ップ６０３）。この文字連鎖について、該当する文書番
号、要素番号、該当する要素を特定するグループＩＤと
メンバＩＤ、および該文字連鎖先頭文字がその要素内に
おいて何番目の文字かを表す番号（以下、文字位置番号
と呼ぶ）の情報を索引情報に追加する（ステップ６０
４）。この処理を登録対象文書の全ての文字列について
繰り返す（ステップ６０５）。最後に索引情報を索引情
報格納手段１１４に追加する（ステップ６０６）。The index information creating means 109 creates search index information using the group ID and the member ID assigned by the ID creating means 108 as described above. The processing flow of the index information creating means 109 will be described with reference to FIG. First, the structurally analyzed data of the document to be registered is read from the structurally analyzed data storage unit 112 (step 60).
1). Next, a unique number (hereinafter, referred to as a document number) is assigned to each of the read registration target documents. Also, a unique number (hereinafter, referred to as an element number) within the document is assigned to each component (terminal element) of the read document (step 602). The combination of the document number and the element number makes it possible to specify each element of the document registered in the search engine. Next, a character chain having a predetermined number of characters is extracted from the character string in the structurally analyzed data (step 603). Regarding this character chain, the corresponding document number, element number, group ID and member ID for specifying the corresponding element, and a number indicating the number of the first character in the character chain in the element (hereinafter, character position number) ) Is added to the index information (step 60).
4). This process is repeated for all the character strings of the registration target document (step 605). Finally, the index information is added to the index information storage unit 114 (step 606).

【００３０】図７は索引情報作成手段１０９によって図
４の構造化文書のうち３行目の「<タイトル> 構造化文
書管理 < /タイトル>」という要素について作成した索
引情報の例の一部を示した図である。図７の７０１は、
「文書番号が“１”の文書の、要素番号が“１”の要素
で、グループＩＤが“Ｇ２”、メンバＩＤが“Ｍ１”で
論理構造が特定されるこの要素の文字列中に、“構造”
という文字連鎖が先頭から“１”文字目の位置から存在
する」ということを表している。なお、図７は索引情報
の一部しか示していないが、実際は登録対象文書の全て
の文字列について索引情報が作成される。FIG. 7 shows a part of an example of the index information created by the index information creating means 109 for the element “<title> structured document management </ title>” in the third line of the structured document of FIG. FIG. 701 in FIG.
"In the character string of the document whose document number is" 1 ", whose element number is" 1 ", whose group ID is" G2 "and whose member ID is" M1 "and whose logical structure is specified,"Construction"
Is present from the position of the "1" -th character from the beginning. " Although FIG. 7 shows only a part of the index information, the index information is actually created for all the character strings of the registration target document.

【００３１】なお、この例では２文字ずつ文字連鎖を取
り出してそれぞれに索引情報を作成しているが、この文
字連鎖は２文字ずつでなくても構わない。また、以上の
登録処理を登録対象文書が入力されるごとに繰り返すこ
とにより、索引情報が追加されてゆく。In this example, a character chain is extracted two characters at a time and index information is created for each character chain. However, the character chain does not have to be two characters. The index information is added by repeating the above registration process each time a document to be registered is input.

【００３２】なお、図３他においてグループＩＤおよび
メンバＩＤは“Ｇ２”や“Ｍ１”といった文字で表現し
ているが、これらはそれぞれ、木構造中でその論理位置
を一意に決定するＩＤ、タグ名およびその出現順序が等
しいものを一意に識別するＩＤであればどのようなもの
でも構わない。In FIG. 3 and other figures, the group ID and the member ID are represented by characters such as "G2" and "M1", which are respectively an ID and a tag for uniquely determining the logical position in the tree structure. Any ID may be used as long as it is an ID that uniquely identifies a name and those having the same appearance order.

【００３３】次に本実施の形態における文書検索の処理
の流れを具体例を示して説明する。Next, the flow of document retrieval processing in the present embodiment will be described with a specific example.

【００３４】まず検索条件入力手段１０３を通して、端
末１０１から「タイトルに“構造化”という文字列が含
まれる文書」という条件が与えられたとする。図８はこ
の例での索引情報検索手段１１０での処理を図に示した
ものである。ここでは、説明を簡単にするため索引情報
格納手段１１４には図４に示す構造化文書の索引情報の
みが格納され、ＩＤテーブル格納手段１１３には図５の
ように割当てられたＩＤのテーブルが格納されていると
する。この状態において、まず「構造化」という文字列
で索引情報格納手段１１４を用いて全文検索を行なう
と、文書番号、要素番号、グループＩＤ、メンバＩＤの
組み合わせとして８０１に示す２組が得られる。一方、
検索の論理構造条件に該当する要素を示すメンバＩＤま
たはグループＩＤまたはそれら両方の値をＩＤテーブル
格納手段１１３から求める。この例（論理構造条件が
「タイトル」）の場合、８０２に示す条件となる（８０
２において、グループＩＤはどのような値でも構わない
ことを示している）。最後に８０１の２組のうち８０２
を満たすもの（８０３）が最終検索結果として得られ
る。First, it is assumed that a condition “a document including a character string“ structured ”in a title” is given from the terminal 101 through the search condition input unit 103. FIG. 8 shows the processing in the index information search means 110 in this example. Here, for the sake of simplicity, only the index information of the structured document shown in FIG. 4 is stored in the index information storage unit 114, and the ID table assigned as shown in FIG. It is assumed that it is stored. In this state, when a full-text search is first performed using the character string “structured” using the index information storage unit 114, two sets indicated by 801 are obtained as a combination of a document number, an element number, a group ID, and a member ID. on the other hand,
The member ID and / or group ID indicating the element corresponding to the logical structure condition of the search or both values are obtained from the ID table storage unit 113. In this example (the logical structure condition is “title”), the condition shown in 802 is satisfied (80
2 indicates that the group ID may be any value.) Finally, 802 out of the two sets of 801
(803) that satisfies is obtained as the final search result.

【００３５】なお、８０２において、図５の５０３を指
定して検索したのと同様に、グループＩＤだけ指定した
検索もできる。また、この例では論理構造条件が“タイ
トル”であり、これは論理構造の末端要素を指定した検
索であった。論理構造条件の指定にはこのほかにも“章
のタイトル”といった末端要素を更に限定した指定や
“１章１節”といった末端要素以外を指定する場合も考
えられる。この２つの例について、メンバＩＤおよびグ
ループＩＤの指定方法ついて以下で説明する。論理構造
条件の指定が“章のタイトル”であった場合、図５よ
り、５０２に当たる要素が該当する。この要素を一意に
特定するにはグループＩＤがＧ４、メンバＩＤがＭ１で
ある。It should be noted that, similarly to 802 in 802, a search in which only the group ID is specified can be performed. In this example, the logical structure condition is “title”, which is a search that specifies the terminal element of the logical structure. In addition to the specification of the logical structure condition, a specification in which terminal elements such as “chapter title” are further limited or a non-terminal element such as “chapter 1 section 1” may be specified. Regarding these two examples, a method of specifying a member ID and a group ID will be described below. When the designation of the logical structure condition is “chapter title”, the element corresponding to 502 corresponds to FIG. To uniquely identify this element, the group ID is G4 and the member ID is M1.

【００３６】次に論理構造条件の指定が“１章１節”で
あった場合、図５より、５０３に当たる７つの要素が該
当する。これらの要素を特定するにはメンバＩＤは特に
指定せず、グループＩＤがＧ５またはＧ６またはＧ７と
指定する。このように５０３に当たる７つの要素を指定
して検索する場合にグループＩＤを用いれば３回のＯＲ
検索で済むことになる。（木構造全ての要素に一意な識
別子を割当てている公知例では、７回のＯＲ検索が必
要。）最後に検索結果作成・表示処理について説明す
る。結果作成手段１１１は検索結果として得られた文書
の書誌情報（タイトル、著者、日付など）を結果一覧表
示用のデータとして、一覧データ格納手段１１６に格納
する。このデータを結果表示手段１０４を通して端末１
０１に表示する。次に端末１０１から実体表示要求とし
てこの検索結果一覧の中からどれか１つの文書が選択さ
れると、結果作成手段１１１が実体データ格納手段１１
５から指定された文書の実体を取得し、結果表示手段１
０４を通して端末１０１に表示する。なお、構造解析手
段１０７によって要素に分割された単位で、登録対象文
書を実体データ格納手段１１５に登録しておくことによ
り、検索結果作成・表示処理において要素毎の結果一覧
の作成、および要素毎の実体取得も可能である。Next, when the designation of the logical structure condition is “Chapter 1 Section 1”, seven elements corresponding to 503 are applicable from FIG. To specify these elements, the member ID is not specified, and the group ID is specified as G5, G6, or G7. In this way, when performing a search by designating seven elements corresponding to 503, if a group ID is used, three OR operations are performed.
You only have to search. (In a known example in which a unique identifier is assigned to all elements in the tree structure, seven OR searches are required.) Finally, the search result creation / display process will be described. The result creating unit 111 stores the bibliographic information (title, author, date, etc.) of the document obtained as a search result in the list data storage unit 116 as data for displaying the result list. This data is sent to the terminal 1 through the result display means 104.
01 is displayed. Next, when one of the documents is selected from the search result list as the entity display request from the terminal 101, the result creation unit 111 causes the entity data storage unit 11
5, the entity of the designated document is obtained, and the result display means 1
04 on the terminal 101. By registering the registration target document in the entity data storage unit 115 in units divided into elements by the structure analysis unit 107, it is possible to create a result list for each element in the search result creation / display processing, and It is also possible to obtain the entity.

【００３７】以上のように本実施の形態では、構造化文
書の論理構造情報を、末端要素以外の要素を一意に決定
するグループＩＤと、タグ名およびその出現順序が等し
い末端要素が同じ値を持つメンバＩＤを用いて登録する
ことによって、検索を行なう際に、様々な論理構造を指
定した検索が可能となる。また、論理構造を指定した検
索において末端要素以外が指定された場合、該当するグ
ループＩＤだけを指定すれば良いので全ての要素に一意
な識別子を割当てた場合に比べて、ＯＲ検索の回数が少
なくてすむため、より高速な検索が可能となる。As described above, in the present embodiment, the logical structure information of the structured document is obtained by assuming that the group ID for uniquely determining an element other than the terminal element and the terminal element having the same tag name and the same appearance order have the same value. By registering using the member ID that the user has, it is possible to perform a search specifying various logical structures when performing a search. Also, when a non-terminal element is specified in a search specifying a logical structure, only the corresponding group ID needs to be specified, so that the number of OR searches is smaller than when a unique identifier is assigned to all elements. As a result, faster search is possible.

【００３８】なお、本実施の形態では１文書の構造化文
書の登録、検索について説明したが、複数文書の場合で
も同様の処理で実現が可能である。また本実施の形態で
は、一種類のＤＴＤにおけるＩＤの作成方法について説
明したが、本システムに複数の異なるＤＴＤの文書の登
録要求が起こった場合においても、ＤＴＤの種類毎に別
の値のＩＤを割当てるか、もしくは同じ値を利用してＩ
Ｄを割当て、更にＤＴＤの種類を特定する識別子（以
下、ＤＴＤ識別子と呼ぶ）を登録対象構造化文書毎に持
たせ、索引情報にＤＴＤ識別子を追加しておくことによ
り、論理構造を指定した検索が実現可能である。In the present embodiment, registration and retrieval of a structured document of one document have been described. However, the same processing can be realized for a plurality of documents. In the present embodiment, the method of creating an ID in one type of DTD has been described. However, even when a request for registering a plurality of different DTD documents occurs in the present system, an ID having a different value for each type of DTD is provided. Or use the same value to
D is assigned, and an identifier (hereinafter, referred to as a DTD identifier) for specifying the type of DTD is provided for each structured document to be registered, and the DTD identifier is added to the index information, thereby performing a search specifying a logical structure. Is feasible.

【００３９】（実施の形態２）以下、本発明の実施の形
態２について説明する。実施の形態２における構造化文
書管理装置の構成図は実施の形態１の図１と同様であ
る。ただし、ＩＤ作成手段１０８におけるＩＤの作成方
法が実施の形態１とは異なり、それに伴い索引情報作成
手段１０９、索引情報検索手段１１０における処理、お
よびＩＤテーブル格納手段１１３に格納するデータが実
施の形態１とは異なる。(Embodiment 2) Hereinafter, Embodiment 2 of the present invention will be described. The configuration diagram of the structured document management device in the second embodiment is the same as that in FIG. 1 of the first embodiment. However, the method of creating an ID in the ID creating means 108 is different from that in the first embodiment, and accordingly, the processing in the index information creating means 109, the processing in the index information searching means 110, and the data stored in the ID table storage means 113 are different from those in the first embodiment. Different from 1.

【００４０】ここで実施の形態２における登録処理の流
れについて説明する。まず構造化文書入力手段１０２お
よび構造解析手段１０７の処理は実施の形態１と同様で
あるため説明を省略する。Here, the flow of the registration process according to the second embodiment will be described. First, the processes of the structured document input unit 102 and the structure analysis unit 107 are the same as those in the first embodiment, and thus the description is omitted.

【００４１】次に図９を用いて、実施の形態２における
ＩＤ作成手段１０８でのＩＤの割当て方法について説明
する。実施の形態２におけるＩＤ作成手段１０８では、
まず登録対象文書の各要素に対し重複しないように素数
を割当てる。そして、実際に各要素の論理構造を特定す
るために割当てるＩＤは、該要素に割当てられた素数
と、該要素の親要素に割当てられたＩＤとの積とする
（ただし、ルートノードに割当てるＩＤは“１”もしく
は他の要素と重複しない素数とする）。以下、このＩＤ
を積ＩＤと呼ぶ。図９は実施の形態１の説明で用いた図
４の構造化文書の例に対して、実施の形態２の方法でＩ
Ｄを割り当てた例の図である。図９の各要素の２段目の
値が割当てられた素数であり、３段目の値が実際に割当
てられる積ＩＤである。Next, a method of assigning IDs by ID creating means 108 according to the second embodiment will be described with reference to FIG. In the ID creating means 108 according to the second embodiment,
First, a prime number is assigned to each element of the document to be registered so as not to overlap. The ID actually assigned to specify the logical structure of each element is the product of the prime number assigned to the element and the ID assigned to the parent element of the element (however, the ID assigned to the root node) Is "1" or a prime number which does not overlap with other elements.) Hereafter, this ID
Is called a product ID. FIG. 9 shows an example of the structured document of FIG. 4 used in the description of the first embodiment,
It is a figure of the example to which D was allocated. The values in the second row of each element in FIG. 9 are assigned prime numbers, and the values in the third row are product IDs actually assigned.

【００４２】このように積ＩＤを割当てた場合、構造化
文書の上位層または中間層を指定する検索（例えば、
「章」は「タイトル」と「段落」と「節」からなってい
る場合に「章」以下から検索するような場合）におい
て、構造を指定しない検索と同じような検索時間で検索
することが可能となる。以下で例を示す。When the product ID is assigned in this manner, a search for specifying the upper layer or the intermediate layer of the structured document (for example,
When "Chapter" is composed of "Title", "Paragraph", and "Section", search from "Chapter" and below) can be searched in the same search time as search without specifying structure It becomes possible. An example is shown below.

【００４３】図９の例において、「第１章の中から検索
する」という要求があった場合、検索対象となるのは９
０１の点線に囲まれた末端要素である。実施の形態２に
おけるＩＤ割当て方法の場合、９０１に囲まれた９つの
要素の積ＩＤはすべて「第１章」の要素の積ＩＤの値
（＝３３）で割り切れる、ということに着目して検索対
象を特定することが可能である。In the example of FIG. 9, if there is a request to “search in Chapter 1”, the search target is 9
It is a terminal element surrounded by a dotted line of 01. In the case of the ID assignment method according to the second embodiment, search is performed by focusing on the fact that the product IDs of the nine elements surrounded by 901 are all divisible by the product ID value (= 33) of the elements in “Chapter 1”. It is possible to specify the target.

【００４４】索引情報作成手段１０９では、このように
割当てられた積ＩＤを用いて検索用索引情報を作成す
る。実施の形態２における索引情報作成手段１０９の処
理の流れは、実施の形態１における索引情報作成手段の
処理と次に述べる１点だけ異なる。実施の形態２では、
索引情報に格納するＩＤが実施の形態１で格納したグル
ープＩＤ、メンバＩＤの２つではなく、積ＩＤである。The index information creating means 109 creates search index information using the product ID assigned in this manner. The processing flow of the index information creating means 109 in the second embodiment differs from the processing of the index information creating means in the first embodiment by one point described below. In the second embodiment,
The ID stored in the index information is not the two of the group ID and the member ID stored in the first embodiment, but the product ID.

【００４５】図１０は実施の形態２における索引情報作
成手段１０９によって図４の構造化文書のうち３行目の
「<タイトル> 構造化文書管理 < /タイトル>」という
要素について作成した索引情報の例の一部を示した図で
ある。図１０の１００１は、「文書番号が“１”の文書
の、要素番号が“１”の要素で、積ＩＤが“１０”で論
理構造が特定される要素の文字列中に、“構造”という
文字連鎖が先頭から“１”文字目の位置から存在する」
ということを表している。FIG. 10 shows the index information created by the index information creating means 109 of the second embodiment for the element “<title> structured document management </ title>” in the third line of the structured document of FIG. It is a figure showing a part of example. 1001 in FIG. 10 indicates that “the element having the element number“ 1 ”of the document having the document number“ 1 ”, and the character string of the element whose product ID is“ 10 ”and whose logical structure is specified includes“ structure ” Exists from the position of the "1" character from the beginning "
It represents that.

【００４６】次に本実施の形態における文書検索の処理
の流れについて具体例を示して説明する。まず検索条件
入力手段１０３を通して、端末１０１から「第１章以下
に“構造化”という文字列が含まれる文書」という条件
が与えられたとする。図１１は、この例での索引情報検
索手段１１０での処理を図に示したものである。ここで
は、説明を簡単にするため索引情報格納手段１１４には
図４に示す構造化文書の索引情報のみが格納され、ＩＤ
テーブル格納手段１１３には図９のように割当てられた
ＩＤのテーブルが格納されているものとする。この状態
において、まず「構造化」という文字列で索引情報格納
手段１１４を用いて全文検索を行なうと、文書番号、要
素番号、積ＩＤの組み合わせとして１１０１に示す２組
が得られる。一方、検索の論理構造条件に該当する要素
を求めるための積ＩＤの値をＩＤテーブル格納手段１１
３から求める。この例（論理構造条件が「第１章以
下」）の場合、１１０２に示す積ＩＤを求める。最後に
この例では論理構造条件が末端要素を指定したものでは
なく、中間層を指定しているので、１１０１の二組の積
ＩＤを、「第１章」を表す積ＩＤ“３３”で割った剰余
を求め、この剰余が０となる要素を最終的な検索結果と
する。Next, the flow of document retrieval processing in the present embodiment will be described with reference to a specific example. First, it is assumed that a condition “a document including a character string“ structured ”in the first chapter and below” is given from the terminal 101 through the search condition input unit 103. FIG. 11 shows the processing in the index information search means 110 in this example. Here, in order to simplify the description, only the index information of the structured document shown in FIG.
It is assumed that the table storage means 113 stores a table of assigned IDs as shown in FIG. In this state, when a full-text search is first performed using a character string “structured” using the index information storage unit 114, two sets 1101 are obtained as a combination of a document number, an element number, and a product ID. On the other hand, the value of the product ID for finding the element corresponding to the logical structure condition of the search is stored in the ID table storage unit 11.
Find from 3. In this example (the logical structure condition is “first and second chapter”), the product ID shown in 1102 is obtained. Finally, in this example, since the logical structure condition does not specify the end element but specifies the intermediate layer, the two product IDs of 1101 are divided by the product ID “33” representing “Chapter 1”. The element in which the remainder is 0 is determined as the final search result.

【００４７】なお、この例では論理構造条件が“第１章
以下”であり、これは論理構造の中間層を指定した検索
であったため、「第１章」を表す積ＩＤ“３３”で割っ
た剰余が０となる要素を検索結果とした。一方、論理構
造条件が“タイトル”といった末端要素を指定した検索
の場合は、論理構造条件に該当する全要素の積ＩＤをＩ
Ｄテーブル格納手段１１３から求め、文字列全文検索で
得られた要素うち、積ＩＤが論理構造条件から求めた積
ＩＤと一致するものだけを最終検索結果とする。実施の
形態２における検索結果作成・表示処理については、実
施の形態１と同様であるため、説明を省略する。In this example, the logical structure condition is “first and subsequent chapters”. Since this is a search specifying the intermediate layer of the logical structure, it is divided by the product ID “33” representing “first chapter”. The element whose remainder is 0 is defined as a search result. On the other hand, when the logical structure condition is a search specifying an end element such as “title”, the product ID of all elements corresponding to the logical structure condition is set to I
Of the elements obtained from the D-table storage unit 113 and obtained by the character string full-text search, only those whose product ID matches the product ID obtained from the logical structure condition are used as final search results. The search result creation / display processing according to the second embodiment is the same as that in the first embodiment, and a description thereof will not be repeated.

【００４８】以上のように本実施の形態では、構造化文
書の各要素を特定するＩＤを割当てる方法として、まず
登録対象文書の各要素に対し重複しないように素数を割
り当て、更に実際に各要素に割当てるＩＤを該要素に割
当てられた素数と、該要素の親要素に割当てられたＩＤ
との積とすることにより、構造化文書の上位層または中
間層を指定した検索の場合に、該当する上位層または中
間層のＩＤで割り切れるＩＤを持つ要素を求めれば良い
ため、構造を指定しない検索と同じような検索時間で検
索することが可能となる。As described above, in the present embodiment, as a method of assigning an ID for specifying each element of the structured document, first, a prime number is assigned to each element of the registration target document so as not to be duplicated, and each element is actually actually assigned. The prime number assigned to the element with the ID assigned to the element, and the ID assigned to the parent element of the element
In the case of a search in which the upper layer or the middle layer of the structured document is specified, an element having an ID divisible by the ID of the corresponding upper layer or the middle layer may be obtained, so that the structure is not specified. A search can be performed in the same search time as the search.

【００４９】なお、本実施の形態では１文書の構造化文
書の登録、検索について説明したが、複数文書の場合で
も同様の処理で実現が可能である。また本実施の形態で
は、一種類のＤＴＤにおけるＩＤの作成方法について説
明したが、本システムに複数の異なるＤＴＤの文書の登
録要求が起こった場合においても、実施の形態１と同様
に、ＤＴＤの種類毎に別の値のＩＤを割当てるか、もし
くは同じ値を利用してＩＤを割当て、更にＤＴＤの種類
を特定する識別子を登録対象構造化文書毎に持たせてお
くことにより、論理構造を指定した検索が実現可能であ
る。また、本実施の形態における説明では、各要素に対
し重複しないように素数を割当てたが、同じ親要素を持
つ末端要素間で重複しない限り、末端要素に割当てる素
数の値を再利用することにより、実際に割当てる積ＩＤ
の値の発散を小さくすることが可能である。In the present embodiment, registration and retrieval of a structured document of one document have been described. However, the same processing can be realized for a plurality of documents. In the present embodiment, the method of creating an ID in one type of DTD has been described. However, even when a request for registration of a plurality of different DTD documents occurs in the present system, the DTD of Assign a different value ID for each type, or assign an ID using the same value, and assign an identifier for specifying the type of DTD to each registered target structured document to specify the logical structure Search can be realized. Also, in the description of the present embodiment, prime numbers are assigned to each element so as not to overlap, but as long as there is no overlap between terminal elements having the same parent element, the value of the prime number assigned to the terminal element can be reused. , The product ID actually assigned
Can be made smaller.

【００５０】（実施の形態３）以下、本実施の形態３に
ついて説明する。図１２は実施の形態３における構造化
文書管理装置の構成図である。図１の構成と異なるの
は、検索エンジン１０５にアクセス権管理テーブル作成
手段１２０１を、またデータ格納部１０６に利用者情報
格納手段１２０２およびアクセス権管理テーブル格納手
段１２０３を新たに備えているところである。Embodiment 3 Hereinafter, Embodiment 3 will be described. FIG. 12 is a configuration diagram of the structured document management device according to the third embodiment. The difference from the configuration of FIG. 1 is that the search engine 105 is newly provided with an access right management table creation unit 1201 and the data storage unit 106 is newly provided with a user information storage unit 1202 and an access right management table storage unit 1203. .

【００５１】利用者情報格納手段１２０２は、本システ
ムを利用可能な個人（アカウント）またはグループに関
する情報が格納されている。The user information storage means 1202 stores information on individuals (accounts) or groups who can use this system.

【００５２】アクセス権管理テーブル作成手段１２０１
は、登録対象構造化文書中の各要素に対するアクセス権
を管理するテーブルを作成する。Access right management table creating means 1201
Creates a table for managing access rights to each element in the structured document to be registered.

【００５３】アクセス権管理テーブル格納手段１２０３
は、アクセス権管理テーブル作成手段１２０１によって
作成されたアクセス権管理テーブルを格納する。Access right management table storage means 1203
Stores the access right management table created by the access right management table creation unit 1201.

【００５４】ここで実施の形態３における処理の流れに
ついて説明する。まず実施の形態３における構造化文書
入力手段１０２では、登録対象構造化文書として各要素
にアクセス権に関する情報を付加されたものを扱うこと
が可能であるものとする。Here, the flow of processing in the third embodiment will be described. First, it is assumed that the structured document input unit 102 according to the third embodiment can handle, as a registration-target structured document, a document in which information regarding an access right is added to each element.

【００５５】図１３の１３０１は本実施の形態における
登録対象構造化文書の例である。なお図１３の１３０１
の例では、「論文」タグの中に「アクセス権＝“ＧＲＰ
１”」というようにタグの属性指定でアクセス権に関す
る情報が付加されているが、要素に対してアクセス権の
情報を付加することができればどのような方法でも構わ
ない。この“ＧＲＰ１”は、利用者情報格納手段１２０
２に格納されているグループに該当するものであり、
“ＵＳＲ１”、“ＵＳＲ２”、“ＵＳＲ３”は利用者情
報格納手段１２０２に格納されているアカウントに該当
するものとし、その階層構造は図１３の１３０２に示さ
れる関係であるものとする。Reference numeral 1301 in FIG. 13 is an example of a structured document to be registered in the present embodiment. Note that 1301 in FIG.
In the example of “access right =“ GRP
Although information about the access right is added by specifying the attribute of the tag such as "1"", any method may be used as long as the information of the access right can be added to the element. This “GRP1” is stored in the user information storage unit 120.
2 corresponds to the group stored in
“USR1”, “USR2”, and “USR3” correspond to the accounts stored in the user information storage unit 1202, and the hierarchical structure thereof has the relationship shown in 1302 in FIG.

【００５６】次に実施の形態３における構造解析手段１
０７の処理は実施の形態１および実施の形態２と同様で
あるため説明を省略する。Next, the structure analysis means 1 in the third embodiment
The processing of 07 is the same as that of the first and second embodiments, and therefore the description is omitted.

【００５７】次に実施の形態３におけるＩＤ作成手段１
０８および索引情報作成手段１０９における処理、なら
びに検索条件入力手段１０３および索引情報検索手段１
１０における処理は、実施の形態１での処理、または実
施の形態２での処理のどちらでも構わない。Next, the ID creating means 1 in the third embodiment
08 and the processing in the index information creation means 109, and the search condition input means 103 and the index information search means 1
The processing in 10 may be either the processing in the first embodiment or the processing in the second embodiment.

【００５８】次にアクセス権管理テーブル作成手段１２
０１での処理について説明する。Next, access right management table creating means 12
01 will be described.

【００５９】図１４はアクセス権管理テーブル作成手段
１２０１における処理フローの図である。まずステップ
１４０１において登録対象文書のルート要素にアクセス
権が設定されているかどうか調べ、設定されていればそ
のアクセス権を変数（ここでは仮に“buf”とする）に
代入する。次にステップ１４０２において現在の要素か
ら子要素をたどり、たどった要素にアクセス権指定があ
れば変数bufの値を置換える。次にステップ１４０３に
て、該要素が末端要素であるかどうか調べ、末端要素で
なければステップ１４０２に戻り、更に子要素をたど
る。該要素が末端要素であればステップ１４０４へ進
み、該末端要素のアクセス権を変数bufの値に設定す
る。次にステップ１４０５にて、アクセス権管理テーブ
ルに、該末端要素の文書番号・要素番号・アクセス権か
らなるレコードを追加する。次にステップ１４０６に
て、登録対象構造化文書の全ての末端要素についてアク
セス権管理テーブルにレコードを追加し終わったかどう
か確認し、全て終わっていれば処理終了、終わっていな
ければステップ１４０７にて、今参照している要素か
ら、まだ経由していない子要素のある親要素までさかの
ぼる。このさかのぼる処理の際に、経由する要素でアク
セス権の指定があれば変数bufの値を置換える。そして
全ての末端要素のレコードをアクセス権管理テーブルに
追加するまで１４０２から１４０６までの処理を繰返
す。FIG. 14 is a diagram showing a processing flow in the access right management table creating means 1201. First, in step 1401, it is checked whether an access right has been set for the root element of the document to be registered. If the access right has been set, the access right is assigned to a variable (here, temporarily set to “buf”). Next, in step 1402, the child element is traced from the current element, and if the traced element has an access right designation, the value of the variable buf is replaced. Next, in step 1403, it is checked whether or not the element is a terminal element. If not, the process returns to step 1402 to further trace child elements. If the element is a terminal element, the process proceeds to step 1404, and the access right of the terminal element is set to the value of the variable buf. Next, in step 1405, a record including the document number, element number, and access right of the terminal element is added to the access right management table. Next, in step 1406, it is checked whether or not records have been added to the access right management table for all end elements of the structured document to be registered. If all records have been added, the process ends. It goes from the element currently being referenced to the parent element that has child elements that have not yet passed through. At the time of this processing, if the access right is specified in the element passing through, the value of the variable buf is replaced. Then, the processing from 1402 to 1406 is repeated until records of all end elements are added to the access right management table.

【００６０】図１５は、図１３の１３０１の登録対象構
造化文書に対して図１４の処理を行なった結果、末端要
素に対して設定されるアクセス権を図示したものであ
る。FIG. 15 illustrates the access right set to the terminal element as a result of performing the processing in FIG. 14 on the registration-target structured document 1301 in FIG.

【００６１】各要素の２行目が、登録対象構造化文書
（１３０１）中で設定されているアクセス権であり、設
定されていない要素は空白で示してある。また各末端要
素の左下に記述されている値が、図１４の処理の結果末
端要素に設定されたアクセス権である。この結果から分
かる様に図１４の処理によって、登録対象構造化文書の
全要素にアクセス権が設定されていなくても、親要素の
アクセス権を子要素が継承し、また子要素にアクセス権
が設定されている場合は子要素の設定を優先するかたち
で、全末端要素のアクセス権を決定できる。The second line of each element is the access right set in the registration target structured document (1301), and the unset elements are shown in blank. The value described at the lower left of each terminal element is the access right set to the terminal element as a result of the processing in FIG. As can be seen from the result, by the processing in FIG. 14, even if the access right is not set to all elements of the structured document to be registered, the access right of the parent element is inherited by the child element, and the access right is assigned to the child element. If it is set, the access right of all terminal elements can be determined by giving priority to the setting of child elements.

【００６２】図１６は、１３０１の登録対象構造化文書
に対して図１４の処理を行なった結果、アクセス権管理
テーブル１２０３に格納されるテーブルの例である。こ
のアクセス権管理テーブルを参照することにより、検索
結果一覧作成時、および実体取得時に各末端要素のアク
セス権を確認することが可能となる。FIG. 16 is an example of a table stored in the access right management table 1203 as a result of performing the processing in FIG. 14 on the registered structured document 1301. By referring to this access right management table, it is possible to confirm the access right of each terminal element when creating a search result list and when obtaining the entity.

【００６３】次に本実施の形態における文書検索および
検索結果作成・表示処理について説明する。本実施の形
態では、要素毎の結果一覧作成および実体取得について
説明する。ここでは「段落に“入力”という文字列が含
まれる」という検索要求の場合で説明する。また、説明
を簡単にするため索引情報格納手段１１４には図１３の
１３０１に示す構造化文書の索引情報のみが格納され、
アクセス権管理テーブル格納手段１２０３には図１６の
ようなデータが格納されているものとする。Next, a description will be given of document retrieval and retrieval result creation / display processing in this embodiment. In the present embodiment, a description will be given of a result list creation and entity acquisition for each element. Here, a case of a search request that “a paragraph includes a character string“ input ”” will be described. Further, for the sake of simplicity, the index information storage unit 114 stores only the index information of the structured document shown by 1301 in FIG.
It is assumed that data as shown in FIG. 16 is stored in the access right management table storage means 1203.

【００６４】具体的な検索処理については、実施の形態
１または実施の形態２と同様であるため説明を省略する
が、｛文書番号，要素番号｝の組み合わせからなる検索
結果の集合として｛１，７｝、｛１，９｝、｛１，１
２｝の３組が得られる。次に結果作成部１１１での処理
を説明する。ここで検索要求を行なった利用者が“ＵＳ
Ｒ１”であったとする。図１６のアクセス権管理テーブ
ルより、｛１，７｝および｛１，９｝の要素のアクセス
権は“ＧＲＰ１”であり、図１３の１３０２より“ＵＳ
Ｒ１”は“ＧＲＰ１”に属するので、この利用者は
｛１，７｝および｛１，９｝で特定される要素のアクセ
ス権を有することになる。一方｛１，１２｝に設定され
たアクセス権は図１６より“ＵＳＲ３”であるため、こ
の利用者（ＵＳＲ１）は｛１，１２｝で特定される要素
へのアクセス権を持たないことになる。よって作成する
検索結果一覧では、｛１，７｝および｛１，９｝の要素
だけを含み、｛１，１２｝の要素は除くことが可能とな
る。なお、この例では図１４の処理を行なった結果全て
の末端要素に対してアクセス権が設定されたが、アクセ
ス権が設定されない末端要素が存在した場合（図１４の
ステップ１４０１の時点で変数buf=NULLとなる場合）、
全ての利用者がこの末端要素のアクセス権を持つものと
考える。The specific search processing is the same as that in the first or second embodiment, and therefore will not be described. However, as a set of search results including a combination of {document number, element number}, {1, 7｝, {1,9}, ｛1,1
3 sets of 2｝ are obtained. Next, processing in the result creating unit 111 will be described. The user who made the search request here is "US
It is assumed that the access right for the elements {1, 7} and {1, 9} is “GRP1” from the access right management table in FIG.
Since “R1” belongs to “GRP1”, this user has access rights to the elements specified by {1,7} and {1,9}, while the access set to {1,12} 16, the user (USR1) does not have the access right to the element specified by {1, 12}, so that in the search result list created, {1} , 7} and the elements of {1, 9}, and the elements of {1, 12} can be removed.In this example, as a result of performing the processing of FIG. When the access right is set, but there is a terminal element for which the access right is not set (when variable buf = NULL at step 1401 in FIG. 14)
It is assumed that all users have access to this end element.

【００６５】以上のように本実施の形態によれば、登録
対象構造化文書に各要素へのアクセス権に関する情報を
持たせ、アクセス権管理テーブル作成手段１２０１によ
って登録対象構造化文書の全末端要素へのアクセス権を
参照可能なテーブルを作成することにより、結果作成手
段１１１および結果表示手段１０４にて各文書、各要素
単位のアクセス権を考慮した結果一覧作成および実体取
得が可能となる。As described above, according to the present embodiment, the structured document to be registered is provided with information on the access right to each element, and the access right management table creating means 1201 creates all the terminal elements of the structured document to be registered. By creating a table capable of referring to the access right to the document, the result creation unit 111 and the result display unit 104 can create a result list and acquire the entity in consideration of the access right for each document and each element.

【００６６】なお、本実施の形態で用いている、アカウ
ントを階層型のグループ構造で管理する方法は、従来の
技術「特開平１０−１４３４３５」で行なっている方法
で実現することが考えられる。また、本実施の形態では
要素単位の結果一覧および実体取得として説明したが、
文書単位に結果一覧を作成した場合でも、実体取得時に
各要素のアクセス権を考慮し、アクセス権のない要素を
除いた文書として実体取得を行なうことも可能である。
また本実施の形態では、結果一覧作成の際に利用者がア
クセス権のない要素を除外する方法について述べたが、
結果一覧では検索結果全てを表示し、その中でアクセス
権のある要素のみ実体取得可能とする実施形態も考えら
れる。The method of managing accounts in a hierarchical group structure used in the present embodiment can be realized by the method used in the prior art "Japanese Patent Laid-Open No. 10-143435". Also, in the present embodiment, the description has been made as a result list and entity acquisition in element units.
Even when a result list is created for each document, it is also possible to take into account the access rights of each element at the time of obtaining the entity and obtain the entity as a document excluding the elements without the access right.
Also, in the present embodiment, the method of excluding elements to which the user does not have access when creating the result list has been described.
An embodiment in which all the search results are displayed in the result list, and only the elements to which the user has access right can be obtained.

【００６７】（実施の形態４）以下、本実施の形態４に
ついて説明する。図１７は実施の形態４における構造化
文書管理装置の構成図である。実施の形態３における構
成図である図１２と異なるのは、新たにアクセス権情報
格納手段１７０１を備えているところである。Embodiment 4 Hereinafter, Embodiment 4 will be described. FIG. 17 is a configuration diagram of a structured document management device according to the fourth embodiment. The difference from the configuration diagram of FIG. 12 according to the third embodiment is that an access right information storage unit 1701 is newly provided.

【００６８】アクセス権情報格納手段１７０１は、本シ
ステムで扱う構造化文書のＤＴＤ毎に、各論理構造への
アクセス権に関する情報をあらかじめ設定し、格納す
る。The access right information storage means 1701 presets and stores information on the access right to each logical structure for each DTD of the structured document handled by the present system.

【００６９】ここで実施の形態４における登録処理の流
れについて説明する。まず構造化文書入力手段１０２、
構造解析手段１０７、ＩＤ作成手段１０８、索引情報作
成手段１０９の処理は実施の形態３と同様であるため説
明を省略する。ただし、本実施の形態では、実施の形態
３と異なり、登録対象構造化文書中に各要素に対するア
クセス権の情報が特に付加されていないものとする。Here, the flow of the registration process according to the fourth embodiment will be described. First, the structured document input means 102,
The processing of the structure analysis unit 107, the ID creation unit 108, and the index information creation unit 109 is the same as that of the third embodiment, and thus the description is omitted. However, in the present embodiment, unlike the third embodiment, it is assumed that access right information for each element is not particularly added to the registered structured document.

【００７０】次にアクセス権管理テーブル作成手段１２
０１での処理について説明する。Next, access right management table creating means 12
01 will be described.

【００７１】実施の形態４におけるアクセス権管理テー
ブル作成手段１２０１での処理フローは実施の形態３の
図１４と次に述べる一点だけ異なる。ステップ１４０
１、ステップ１４０２、ステップ１４０７において変数
bufに値を代入する際、実施の形態３では、該登録対象
構造化文書中で設定されているアクセス権を参照した
が、実施の形態４では、アクセス権情報格納手段１７０
１に格納されているアクセス権情報を参照する。The processing flow in the access right management table creating means 1201 in the fourth embodiment differs from FIG. 14 in the third embodiment only in the following point. Step 140
1. Variables in steps 1402 and 1407
In substituting a value for buf in the third embodiment, the access right set in the registration target structured document is referred to. In the fourth embodiment, the access right information storage unit 170 is used.
1 is referred to.

【００７２】実施の形態４における検索処理、および検
索結果作成・表示処理については実施の形態３と同様で
あるため説明を省略する。The search processing and the search result creation / display processing according to the fourth embodiment are the same as those in the third embodiment, and a description thereof will be omitted.

【００７３】以上のように本実施の形態によれば、登録
部１０６に本システムで扱う構造化文書のＤＴＤ毎に、
各要素へのアクセス権に関する情報をあらかじめ設定
し、格納したアクセス権情報格納手段１７０１を備える
ことにより、登録対象構造化文書で特にアクセス権が設
定されていない場合でも、結果作成手段１１１および結
果表示手段１０４にてアクセス権を考慮した結果一覧作
成および実体取得が可能となる。As described above, according to the present embodiment, the registration unit 106 stores, for each DTD of a structured document handled by this system,
By providing the access right information storage unit 1701 in which information on the access right to each element is set in advance and stored, even if the access right is not particularly set in the structured document to be registered, the result creation unit 111 and the result display unit By means 104, it is possible to create a result list and obtain the entity in consideration of the access right.

【００７４】なお本実施の形態では、登録対象構造化文
書中に各要素に対するアクセス権の情報が特に付加され
ていないものとして説明したが、アクセス権情報格納手
段１７０１をあらかじめ備え、かつ登録対象構造化文書
中に各要素に対するアクセス権の情報が付加されている
場合、アクセス権管理テーブル作成手段１２０１におけ
る処理で要素のアクセス権指定を確認する際、登録対象
構造化文書中に該要素に関する指定があればそちらを優
先し、なければアクセス権情報格納手段１７０１での指
定を設定する、という実施形態を取ることも可能であ
る。Although the present embodiment has been described on the assumption that access right information for each element is not particularly added to the registered structured document, the access right information storage unit 1701 is provided in advance and the registration target structure When the access right information for each element is added to the structured document, when the access right designation of the element is confirmed in the process of the access right management table creating means 1201, the specification concerning the element is not included in the registration target structured document. It is also possible to adopt an embodiment in which if there is, priority is given to that, and if not, the designation in the access right information storage means 1701 is set.

【００７５】[0075]

【発明の効果】以上のように、本発明によれば構造化文
書の様々な論理構造を指定した検索ができるという効果
を有する。また、構造化文書の上位層または中間層を指
定した検索の場合においても、構造を指定しない検索と
同じような検索時間で検索することができるという効果
を有する。また、登録対象構造化文書に各要素へのアク
セス権に関する情報を持たせ、アクセス権管理テーブル
作成手段によって登録対象構造化文書の全末端要素への
アクセス権を参照可能なテーブルを作成することによ
り、各文書の各要素毎のアクセス権を考慮した結果一覧
作成および実体取得ができるという効果を有する。ま
た、アクセス権情報格納手段に、あらかじめ各要素への
アクセス権に関する情報を格納することで、登録対象構
造化文書で特にアクセス権が設定されていない場合でも
アクセス権を考慮した結果一覧作成および実体取得がで
きるという効果を有する。As described above, according to the present invention, it is possible to perform a search by designating various logical structures of a structured document. Also, in the case of a search specifying the upper layer or the middle layer of the structured document, there is an effect that the search can be performed in the same search time as the search without specifying the structure. Also, by giving information on the access right to each element in the registered structured document, and creating a table that can refer to the access rights to all terminal elements of the registered structured document by the access right management table creating means. This has the effect that a result list can be created and the entity can be obtained in consideration of the access right for each element of each document. In addition, by storing information on the access right to each element in advance in the access right information storage means, even if the access right is not particularly set in the structured document to be registered, a result list taking into account the access right and the entity list are created. It has the effect that it can be obtained.

[Brief description of the drawings]

【図1】本発明の実施の形態１における構造化文書管理
装置の構成図FIG. 1 is a configuration diagram of a structured document management device according to a first embodiment of the present invention.

【図２】本発明の実施の形態１における構造化文書の論
理構造の一例を示す図FIG. 2 is a diagram showing an example of a logical structure of a structured document according to the first embodiment of the present invention.

【図３】本発明の実施の形態１におけるメンバＩＤの一
例を示す図FIG. 3 is a diagram showing an example of a member ID according to the first embodiment of the present invention.

【図４】本発明の実施の形態１における構造化文書の一
例を示す図FIG. 4 is a diagram showing an example of a structured document according to the first embodiment of the present invention.

【図５】本発明の実施の形態１におけるグループＩＤお
よびメンバＩＤを割当てた例を示す図FIG. 5 is a diagram showing an example in which a group ID and a member ID are assigned according to the first embodiment of the present invention.

【図６】本発明の実施の形態１における索引情報作成手
段の処理手順を示すフロー図FIG. 6 is a flowchart showing a processing procedure of an index information creating unit according to the first embodiment of the present invention.

【図７】本発明の実施の形態１における索引情報の一例
を示す図FIG. 7 shows an example of index information according to the first embodiment of the present invention.

【図８】本発明の実施の形態１における索引情報検索手
段の処理の例を示す図FIG. 8 is a diagram showing an example of processing of an index information search unit according to the first embodiment of the present invention.

【図９】本発明の実施の形態２における積ＩＤを割当て
た例を示す図FIG. 9 is a diagram showing an example of allocating a product ID according to the second embodiment of the present invention.

【図1０】本発明の実施の形態２における索引情報の一
例を示す図FIG. 10 is a diagram showing an example of index information according to Embodiment 2 of the present invention.

【図1１】本発明の実施の形態２における索引情報検索
手段の処理の例を示す図FIG. 11 is a diagram illustrating an example of processing of an index information search unit according to the second embodiment of the present invention.

【図1２】本発明の実施の形態３における構造化文書管
理装置の構成図FIG. 12 is a configuration diagram of a structured document management device according to a third embodiment of the present invention.

【図1３】本発明の実施の形態３における構造化文書の
一例、およびアカウント、グループの階層構造の例を示
す図FIG. 13 is a diagram illustrating an example of a structured document and an example of a hierarchical structure of accounts and groups according to Embodiment 3 of the present invention.

【図1４】本発明の実施の形態３におけるアクセス権管
理テーブル作成手段の処理手順を示すフロー図FIG. 14 is a flowchart illustrating a processing procedure of an access right management table creating unit according to the third embodiment of the present invention.

【図1５】本発明の実施の形態３における末端要素に対
して設定されるアクセス権の例を示す図FIG. 15 is a diagram illustrating an example of an access right set for a terminal element according to the third embodiment of the present invention.

【図1６】本発明の実施の形態３におけるアクセス権管
理テーブルの例を示す図FIG. 16 is a diagram illustrating an example of an access right management table according to the third embodiment of the present invention.

【図1７】本発明の実施の形態４における構造化文書管
理装置の構成図FIG. 17 is a configuration diagram of a structured document management device according to a fourth embodiment of the present invention.

【図1８】従来の技術における文書登録システムの構成
を示す図FIG. 18 is a diagram showing a configuration of a document registration system according to a conventional technique.

【図１９】従来の技術における構造インデックスの生成
過程を示す図FIG. 19 is a diagram showing a process of generating a structure index in the related art.

[Explanation of symbols]

１０１端末１０２構造化文書入力手段１０３検索条件入力手段１０４結果表示手段１０５検索エンジン１０６データ格納部１０７構造解析手段１０８ＩＤ作成手段１０９索引情報作成手段１１０索引情報検索手段１１１結果作成手段１１２構造解析済みデータ格納手段１１３ＩＤテーブル格納手段１１４索引情報格納手段１１５実体データ格納手段１１６一覧データ格納手段１２０１アクセス権管理テーブル作成手段１２０２利用者情報格納手段１２０３アクセス権管理テーブル格納手段１７０１アクセス権情報格納手段 Reference Signs List 101 terminal 102 structured document input means 103 search condition input means 104 result display means 105 search engine 106 data storage unit 107 structure analysis means 108 ID creation means 109 index information creation means 110 index information search means 111 result creation means 112 structure analyzed Data storage means 113 ID table storage means 114 Index information storage means 115 Entity data storage means 116 List data storage means 1201 Access right management table creation means 1202 User information storage means 1203 Access right management table storage means 1701 Access right information storage means

───────────────────────────────────────────────────── フロントページの続き (72)発明者中井信一大阪府門真市大字門真1006番地松下電器産業株式会社内 (72)発明者菊池忠一大阪府門真市大字門真1006番地松下電器産業株式会社内Ｆターム(参考） 5B009 QA07 VA02 5B075 KK50 KK54 ND02 ND34 PP12 PP23 PQ02 PQ46 UU06 ──────────────────────────────────────────────────の Continued on the front page (72) Inventor Shinichi Nakai 1006 Kazuma Kadoma, Osaka Prefecture Matsushita Electric Industrial Co., Ltd. F term (reference) 5B009 QA07 VA02 5B075 KK50 KK54 ND02 ND34 PP12 PP23 PQ02 PQ46 UU06

Claims

[Claims]

In a document management system for handling a structured document having a logical structure, a structured document input means for inputting a structured document to be registered, and a logical structure of the registered document when registering the structured document Creating an ID for uniquely identifying a logical structure of each element analyzed by the structural analysis means; a structurally analyzed data storing means for storing data analyzed by the structural analysis means; ID creation means, ID table storage means for storing the ID created by the ID creation means, index information creation means for creating index information for performing a search, and index information storage means for storing the index information An entity storing means for storing an entity of a structured document to be registered, a search condition input means capable of designating a logical structure element as a search condition, and receiving from the search condition input means. A structured document management apparatus comprising: an index information search unit for performing a search using index information under a retrieved search condition.

2. A result creating means for receiving a search result obtained by the index information searching means and creating an entity for displaying a search result list and a text, and storing a search result list created by the result creating means. 2. The structured document management apparatus according to claim 1, further comprising: a result list storage unit that performs the search, and a result display unit that displays a search result stored in the result list storage unit on a terminal.

3. The index information search device according to claim 2, wherein the ID creating means creates a group ID for uniquely determining an element other than the terminal element and a member ID having the same value for the terminal element having the same tag name and the same appearance order. A structured document search method, wherein a search is possible by specifying a logical structure using the group ID, the member ID, or both IDs.

4. The ID creating means assigns a prime number to each element of a document to be registered so as not to be duplicated, further assigns an ID actually assigned to each element to the element, and a parent element of the element. A structured document search method characterized in that a product ID is created as a product of the ID assigned to the product, and the index information search means can perform a search specifying a logical structure using the product ID.

5. The method according to claim 4, wherein the ID creation means reuses the value of the prime number assigned to the terminal element unless the ID is duplicated between terminal elements having the same parent element. The structured document management device according to claim 1, wherein:

6. The structured document input means can handle a document to which information on access right is added to each element as a registered structured document, and further relates to an individual or a group who can use the present system. User information storage means for storing information, access right management table creation means for creating a table for managing access rights to each element in the registered structured document, and access created by the access right management table creation means 3. The structured document management device according to claim 2, further comprising an access right management table storage unit that stores a right management table.

7. The method according to claim 1, wherein the result creating unit determines, based on information in the access right management table storage unit, only elements to which the user has access right from a set of search results obtained by the processing of the index information searching unit. 7. The structured document management device according to claim 6, wherein a search result list is created.

8. An access right information storage unit newly storing information on an access right to each logical structure of a structured document handled by the present system, wherein the access right management table creating unit includes the access right information storage unit. 7. The structured document management device according to claim 6, wherein an access right management table is created by using the access right information stored in the device.

9. The access right management table creating means, when information on an access right for each element is added to the registration-target structured document, uses the access right added to the registration-target structured document, When information on the access right to each element is not added to the registered structured document, an access right management table is created using the access right information stored in the access right information storage unit. 9. The structured document management device according to claim 8, wherein