JPH0844766A

JPH0844766A - Document retrieval device

Info

Publication number: JPH0844766A
Application number: JP6196288A
Authority: JP
Inventors: Hirofumi Komatsubara; 弘文小松原; Miki Watanabe; 美樹渡辺; Nobuhiro Yamazaki; 伸宏山崎
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1994-07-29
Filing date: 1994-07-29
Publication date: 1996-02-16

Abstract

PURPOSE:To increase the speed of retrieval for specifying a desired document and reduce a retrieval noise on the retrieval device for a document requiring version management. CONSTITUTION:A document storage means stores a document as a group of document constituent elements and can store plural versions for one document. A version management means 1-132 manages the derivation relation of versions by documents stored in a document storage means 1-131 by using an order tree. Namely, data on versions are managed by using the data structure which regards each node of the order tree in tree structure as one version and a train of nodes obtained by tracing from a root or elder brother node to the eldest son as a branch of versions. The retrieval device retrieves a document or document constituent element as to the version specified by the version management of the version management means 1-132. The object of the retrieval is narrowed down to a specific version, e.g. the latest version through the version management, so the efficiency of the retrieval is improved.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は文書および文書構成要素
を検索する検索装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a retrieval device for retrieving documents and document components.

【０００２】[0002]

【従来の技術】電子文書の発展、普及に伴い、文書を作
成、編集する際に既存の文書の一部を再利用することで
文書の作成効率を向上させることが求められている。従
来この種の利用を行なうためには、あらかじめ再利用し
たい部分が含まれている文書を表示し、目的の部分をカ
ットあるいはコピーしてペーストすることにより取り込
んでいた（例えば、特開平４−７６６６４号公報）。こ
うしたことを行なう場合、再利用したい文書構成要素が
どの文書に含まれているかが明らかな場合はよいがそう
でない場合は、いちいち文書を表示して目的とする文書
構成要素があるかどうかを探さなければならない。この
ような問題を解決するために再利用したい文書構成要素
だけを選択的に格納して、効率的に再利用を行なおうと
する試みがなされている（例えば、特開平２−１４８２
５０号公報や特開平３−８０８７号公報等）。また、格
納された文書の構成要素を構成要素レベルで検索するこ
とを可能とすることにより、目的とする再利用したい文
書構成要素を迅速に探し出す試みもなされている（例え
ば、特開平４−３４８４６８号公報）。また、文書の変
更履歴を管理するために文書の版管理が行なわれるよう
になってきている。2. Description of the Related Art With the development and popularization of electronic documents, it is required to improve the document creation efficiency by reusing a part of the existing document when creating and editing the document. Conventionally, in order to use this kind of use, a document including a portion to be reused is displayed in advance, and the target portion is cut or copied and pasted to be incorporated (for example, Japanese Patent Laid-Open No. 4-76664). Issue). When doing this, it is good if it is clear which document contains the document component you want to reuse, but if not, view each document and search for the desired document component. There must be. In order to solve such a problem, it has been attempted to selectively store only the document constituent elements that are desired to be reused and to efficiently reuse them (for example, Japanese Patent Laid-Open No. 2-1482).
No. 50 and Japanese Patent Laid-Open No. 3-8087). Further, by making it possible to search the constituent elements of the stored document at the constituent element level, an attempt has been made to quickly find a desired document constituent element to be reused (for example, Japanese Patent Laid-Open No. 4-348468). Issue). Further, version management of documents has come to be performed in order to manage the change history of documents.

【０００３】また、内容が全く同じ文章、図表などを複
数の文書内に作成しようとする場合、ポインタなどを利
用して複数の文書で同じ内容を指すことにより、文書の
内容の共有化が実現され、正確さ、時間短縮に加え、記
憶容量の削減などの効果が得られている。文書について
複写、共有化などの再利用を行なう場合に対象となる文
書の一部をいかに効率良く取得するかということが問題
となる。これまでの手法では再利用の単位となる文書の
一部分は、それを含むと推測される文書をキーワードや
文書名で検索してそれの文書全体を表示した後にその文
書の一部を再利用するかどうかの判断をするといった方
式や、予め再利用する文書構成要素の条件や処理の内容
を文書プログラムに記述して文書構成要素を検索などの
方法（例えば、特開平５−２４７５号公報）によって選
択し、ユーザが介在することなしに自動的に新規文書を
再利用により作成するなどの方式が採用されてきた。Further, in the case where a sentence, a diagram, etc. having exactly the same contents are to be created in a plurality of documents, the contents of the documents can be shared by pointing the same contents in a plurality of documents using a pointer or the like. Therefore, in addition to accuracy and time reduction, the effect of reducing the storage capacity is obtained. When reusing a document such as copying or sharing, how to efficiently acquire a part of the target document becomes a problem. For the part of the document that is the unit of reuse in the conventional method, search the document that is supposed to contain it by keyword or document name, display the entire document, and then reuse part of the document By a method of determining whether or not, or a method of searching the document component by describing the condition of the document component to be reused and the content of the process in the document program in advance (for example, Japanese Patent Laid-Open No. 5-2475). A method of selecting and automatically creating a new document by reuse without user intervention has been adopted.

【０００４】[0004]

【発明が解決しようとする課題】しかし、従来例（特開
平２−１４８２５０号公報や特開平３−８０８７号公報
等）では、文書構成要素を再利用するためには、予め文
書構成要素ごとにデータベースに登録する必要がある。
これは一々文書構成要素を個別に登録するという手間と
再利用されるかどうかということを登録時に判断する必
要がある。文書構成要素として登録されてなければ再利
用することはできないという問題がある。また、従来例
では文書の版が複数存在する場合の管理について考慮さ
れていない。また、他の従来例（特開平４−３４８４６
８号公報）のように格納された文書の構成要素を構成要
素レベルで検索することを目的としたものがあるが、検
索対象の文書の版管理については考慮されていない。文
書の版管理が行なわれている文書に対しては、版をどう
扱うかが問題となる。文書の個々の版を独立した検索対
象としてしまえば、検索を行なうことができるが、この
ような場合、検索のノイズの増加がおこる。同一文書の
別の版の場合、同じ内容をもっているものが多いので、
検索結果としていくつもの版が検索され、所望の文書を
特定するのに時間がかかる。また、検索対象となるデー
タ量が増加してしまうという問題がある。However, in the conventional example (Japanese Patent Laid-Open No. 2-148250, Japanese Patent Laid-Open No. 3-8087, etc.), in order to reuse the document constituent elements, each of the document constituent elements has to be reused in advance. Must be registered in the database.
For this, it is necessary to judge at the time of registration whether the document components are individually registered and whether they are reused. There is a problem that it cannot be reused unless it is registered as a document component. Further, the conventional example does not consider management in the case where there are a plurality of document versions. In addition, another conventional example (Japanese Patent Laid-Open No. 4-34846)
However, there is no consideration for version management of the document to be searched. For a document whose version is managed, how to handle the version becomes a problem. If each edition of a document is set as an independent search target, the search can be performed, but in such a case, the search noise increases. Since different versions of the same document often have the same content,
Several versions are searched as the search result, and it takes time to specify a desired document. Further, there is a problem that the amount of data to be searched increases.

【０００５】文書検索と文書内容検索を利用する従来の
方法を用いた場合、再利用の候補となる文書の構成要素
の情報を得るためには、まずそれを含むと推測される文
書を検索し、文書表示後に文書内部で必要な文書の一部
を検索しなくてはならず、２度の検索が生じること、最
初に文書全体を表示するために文書全体のデータを読み
込まなくてはならないことなどから、操作が煩雑であ
る、処理が高速に行なえないなどの問題がある。また再
利用処理を自動的に行なう従来の方法（特開平４−３４
８４６８号公報）を利用した場合、再利用する文書構成
要素の内容を確認する必要が生じた場合や、再利用する
文書構成要素がどの文書に格納されているかわからずに
検索を思考錯誤的に何度も行なう必要がある場合にはな
ど対処することが困難である。When the conventional method using the document search and the document content search is used, in order to obtain the information of the constituent elements of the document which are candidates for reuse, first, the document which is supposed to include it is searched. , After the document is displayed, the required part of the document must be searched inside the document, two searches must be performed, and the data of the entire document must be read in order to display the entire document first. Therefore, there are problems that the operation is complicated and the processing cannot be performed at high speed. In addition, a conventional method for automatically performing a reuse process (Japanese Patent Laid-Open No. 4-34)
No. 8468 gazette), it is necessary to check the contents of the document components to be reused, or the search is thoughtless and error without knowing in which document the document components to be reused are stored. It is difficult to deal with when it is necessary to do it many times.

【０００６】本発明は、これらの従来技術の問題を解決
することを目的とするものである。すなわち、本発明は
版管理を伴う文書の検索装置において、所望の文書また
は文書構成要素を特定する検索の速度を高くすること、
および検索ノイズを少なくすることを目的とする。ま
た、本発明は文書の構成要素単位での検索、表示を可能
とし、対話的に、かつ効率よく文書データの再利用処理
を支援することを目的とする。The present invention is directed to overcoming these problems of the prior art. That is, the present invention is to increase the speed of search for identifying a desired document or document component in a document search device involving version management.
And to reduce search noise. Another object of the present invention is to enable retrieval and display of a document in units of constituent elements, and to interactively and efficiently support document data reuse processing.

【０００７】[0007]

【課題を解決するための手段】第１の発明は、文書構成
要素からなる文書を記憶する文書記憶手段と、各ノード
を一つの版とし、ルートまたは兄のあるノードから長男
をたどって得られるノードの列を版の枝とする順序木に
よる各文書ごとの版管理情報により、版の派生関係を管
理する版管理手段と、前記文書記憶手段に記憶された文
書または文書構成要素を版管理手段により特定された版
について検索する検索手段とを備えた文書検索装置であ
る。The first invention is obtained by tracing the eldest son from a node having a root or an elder brother with a document storing means for storing a document consisting of document constituent elements and each node as one version. Version management means for managing the derivation relationship of versions by version management information for each document based on an order tree having a row of nodes as version branches, and version management means for managing documents or document components stored in the document storage means. The document retrieval apparatus includes a retrieval unit that retrieves the version specified by.

【０００８】第２の発明は、文書構成要素からなる文書
を記憶する文書記憶手段と、各ノードを一つの版とし、
ルートまたは兄のあるノードから長男をたどって得られ
るノードの列を版の枝とする順序木による各文書ごとの
版管理情報により、版の派生関係を管理する版管理手段
と、検索対象の版を指定する情報を含む検索用データを
記憶する記憶手段と、前記版管理情報の更新に応じて前
記検索用データを更新する更新手段と、前記検索用デー
タにより文書構成要素を検索する検索手段とを備えた文
書検索装置である。According to a second aspect of the present invention, a document storage means for storing a document composed of document constituent elements and each node as one version are provided.
The version management means for managing the derivation relationship of the version and the version to be searched by the version management information for each document based on the order tree with the sequence of nodes obtained by tracing the eldest son from the node with the root or elder brother Storage means for storing search data including information designating information, update means for updating the search data in response to update of the version management information, and search means for searching the document constituent element by the search data. A document retrieval device equipped with.

【０００９】第３の発明は、文書構成要素からなる文書
を記憶する文書記憶手段と、検索条件により文書構成要
素の見出し情報を検索し、検索条件を満たす文書構成要
素の見出し情報とその文書構成要素へのポインタを得る
構成要素検索手段と、前記文書構成要素検索手段の検索
結果として得られた文書構成要素へのポインタによって
指された文書構成要素を−−−−文書記憶手段から読み
込みその内容を表示する表示手段とを備えた文書検索装
置である。According to a third aspect of the present invention, a document storage means for storing a document composed of document constituent elements, and heading information of a document constituent element is searched by a search condition, and the heading information of the document constituent element satisfying the search condition and the document structure thereof. A constituent element search means for obtaining a pointer to the element, and a document constituent element pointed by the pointer to the document constituent element obtained as a result of the search by the document constituent element search means are read from the document storage means, and the contents thereof are read. And a display means for displaying.

【００１０】[0010]

【作用】第１の発明において、文書記憶手段は、文書を
文書構成要素の集まりとして記憶する。また、文書記憶
手段は一つの文書に対して複数の版を記憶することがで
きる。版管理手段は、文書記憶手段に記憶されている文
書ごとに、版の派生関係を順序木により表した版管理情
報により管理する。すなわち、版管理手段は、木構造で
ある順序木の各ノードを一つの版とし、ルートまたは兄
のあるノードから長男をたどって得られるノードの列を
版の枝とするデータ構造を用いることにより版のデータ
を管理する。例えば、図４は文書Ａの版管理の木構造の
例を示すもので、ルートのノードｍａｉｎ１とそれから
作成された長男にあたる版のノードがｍａｉｎ２とがリ
ンクし、次にノードｍａｉｎ２の長男の版のノードｍａ
ｉｎ３がリンクし、版の作成された経緯が表されてお
り、さらに、ノードｍａｉｎ２から、その次男の版のノ
ードｅｄａ１が作成され、次にｅｄａ２と名付けられた
版が作成された別の経緯がノードとリンクで示されてい
る。ルートの版ｍａｉｎ１、その長男の版ｍａｉｎ２お
よびさらにその長男の版ｍａｉｎ３の経緯が木構造の版
の一つの枝を構成し、また、ｍａｉｎ２から分岐する兄
のある版のノードｅｄａ１およびその長男のノードｅｄ
ａ２のリンクによって表される経緯が別の枝を構成して
いる。検索手段は版管理手段の版管理により特定される
版について、文書または文書構成要素の検索を行う。例
えば、検索手段は、検索条件に適合する文書または文書
構成要素を探索する際、文書の有する版管理情報を調
べ、文書の検索対象の版を特定のもの例えば最も新しい
版に限定して検索を行う。本発明は、このように版管理
が行なわれている文書の検索対象の版を特定のものに限
定するようにしたことにより、検索対象となるデータ量
の減少と、同一文書の類似情報による検索ノイズを改善
することができ、検索効率を向上させることができる。In the first aspect of the invention, the document storage means stores the document as a set of document constituent elements. Further, the document storage means can store a plurality of versions for one document. The version management unit manages, for each document stored in the document storage unit, version management information in which the derivation relationship of versions is represented by a sequential tree. That is, the version management means uses a data structure in which each node of an ordered tree that is a tree structure is one version, and a sequence of nodes obtained by tracing the eldest son from a node with a root or elder brother is a version branch. Manage version data. For example, FIG. 4 shows an example of a tree structure of version management of document A. The root node main1 and the version node created from it, which is the eldest son, are linked to main2, and then the version of the oldest version of node main2 is linked. Node ma
In3 is linked to show how the version was created. Furthermore, from node main2, the second-edition version of node eda1 was created, and then another version of the version named eda2 was created. It is shown with nodes and links. The root version main1, the eldest son's version main2, and the eldest son's version main3 make up one branch of the tree-structured version, and the older version node eda1 and its eldest son node branch from main2. ed
The process represented by the link a2 constitutes another branch. The search means searches for a document or a document component with respect to the version specified by the version management of the version management means. For example, when searching for a document or a document component that meets the search condition, the search means checks the version management information of the document and limits the search target version of the document to a specific version, for example, the latest version. To do. According to the present invention, by limiting the search target version of a document whose version is managed in this way to a specific version, the amount of data to be searched is reduced and a search is performed using similar information of the same document. Noise can be improved and search efficiency can be improved.

【００１１】第２の発明の文書検索装置は、第１の発明
において、検索用データ記憶手段および検索用データを
用いて検索する検索手段を設けたものである。また、版
が更新されたときなどのために検索用データ更新手段を
設けている。検索用データは、通常の検索用の属性情報
の他に検索対象の版を指定する情報を含んでいる。図７
の例では検索用データは、文書検索用の集合と構成要素
検索用の集合を持つ。文書検索用の集合は、Ｄｏｃａｔ
ｔｒの要素からなる集合である。構成要素検索用の集合
は、ＥｌｅｍｅｎｔＡｔｔｒの要素からなる集合であ
る。ＤｏｃＡｔｔｒ７１は、ｄｏｃ、ｂｏｄ、ｎａｍ
ｅ、ｏｔｈｅｒ＿ａｔｔｒｓ等を有する。ｄｏｃは検索
対象となる文書、ｂｏｄは検索対象となる版の枝に対応
するもの、ｎａｍｅは文書名、ｏｔｈｅｒ＿ａｔｔｒｓ
は作成日やキーワード等検索を行なう属性値である。Ｅ
ｌｅｍｅｎｔＡｔｔｒ７２はｅｌｅｍｅｎｔ、ｂｏｄ、
ｔｉｔｌｅ、ｏｔｈｅｒ＿ａｔｔｒｓ等を有する。ｅｌ
ｅｍｅｎｔは、文書構成要素の部分木の頂点となる要
素、ｂｏｄは文書構成要素が属している文書の枝を管理
する版管理用データ、ｔｉｔｌｅは文書構成要素の見出
しやキャプション、ｏｔｈｅｒ＿ａｔｔｒｓは検索対象
となるその他の属性で、作成日やキーワード等である。
検索手段は、検索条件が指定され検索が指示されると、
検索用データに対して検索を行なう。検索対象の版に対
する検索条件に合った文書または文書構成要素の格納位
置の情報を得る。その得られた情報を基に、文書または
文書構成要素にアクセスすることができる。検索用デー
タは、版管理用データを指す情報を持ち、版管理用デー
タは前記各枝における最新版の情報を有しているので、
検索中に文書にアクセスする場合や、検索結果から文書
内容にアクセスする場合に、版管理用データを用いて版
を最新版に特定することができる。更新手段は、文書の
版が更新されて版管理情報が更新されたときに、前記検
索用データを更新する。例えば、文書の版が更新される
際に、文書の最新版を検索対象とするように検索用デー
タの更新を行なう。文書に枝版が作成された場合には、
その枝版の最新版を検索対象として追加する。また、枝
版が削除されたならその枝を検索対象から削除する。第
２の発明は、第１の発明と同様に版管理が行なわれてい
る文書の検索対象を絞り込むことにより、検索対象とな
るデータ量の減少と、同一文書の類似情報による検索ノ
イズを改善することができる。また、本発明は版管理情
報と関連づけた検索用データを用いるので、一層高速に
検索を行うことができ、検索効率をさらに向上させるこ
とができる。A document retrieval apparatus of a second invention is the document retrieval apparatus of the first invention, further comprising retrieval data storage means and retrieval means for performing retrieval using the retrieval data. Further, a search data updating means is provided in case the version is updated. The search data includes information that specifies the version to be searched, in addition to the normal search attribute information. Figure 7
In the example, the search data has a set for document search and a set for component search. The set for document search is Docat
It is a set of elements of tr. The set for component search is a set of Elements of ElementAttr. DocAttr71 is doc, bod, nam
e, other_attrs, etc. doc is the document to be searched, bod corresponds to the branch of the version to be searched, name is the document name, other_attrs
Is an attribute value for performing a search such as a creation date and a keyword. E
elementAttr72 is element, bod,
It has title, other_attrs, and the like. el
element is an element that is the top of the subtree of the document component, bod is version management data that manages the branch of the document to which the document component belongs, title is the heading or caption of the document component, and other_attrs is the search target. Other attributes such as creation date and keyword.
When the search condition is specified and the search is instructed, the search means
Search the search data. Obtain information about the storage location of a document or document component that meets the search conditions for the version to be searched. Based on the obtained information, the document or the document component can be accessed. Since the search data has information indicating the version management data, and the version management data has the latest version information in each branch,
When accessing a document during a search or accessing a document content from a search result, the version can be specified as the latest version by using the version management data. The updating means updates the search data when the version of the document is updated and the version management information is updated. For example, when the version of a document is updated, the search data is updated so that the latest version of the document is the search target. If a branch is created in the document,
Add the latest version of that branch as a search target. Also, if the branch version is deleted, the branch is deleted from the search target. The second invention, like the first invention, narrows down the search targets of the documents for which version management is performed, thereby reducing the amount of data to be searched and improving search noise due to similar information of the same document. be able to. Further, since the present invention uses the search data associated with the version management information, it is possible to perform the search at a higher speed and further improve the search efficiency.

【００１２】第３の発明において、構成要素検索手段
は、文書記憶手段に記憶された文書構成要素を検索条件
により検索し、検索条件を満たす文書構成要素の見出し
情報とその文書構成要素へのポインタ（条件を満たす文
書構成要素が複数ある場合にはそのリスト）を検索結果
として得る。ユーザは検索条件を満たす文書構成要素の
見出し情報を見て、その文書構成要素の内容情報をも見
たいときには、その内容の表示を指示する。それに応じ
て、表示手段は、前記文書構成要素検索手段の検索結果
として得られた文書構成要素へのポインタによって指さ
れた文書構成要素を−−−−文書記憶手段から読み込み
その内容を表示する。これにより、簡単な操作で効率よ
く再利用の候補である文書の一部の内容の確認をするこ
とができ、文書構成要素の再利用の支援を行うことがで
きる。In the third invention, the constituent element searching means searches the document constituent elements stored in the document storing means by the search condition, and the heading information of the document constituent element satisfying the search condition and the pointer to the document constituent element. (If there are a plurality of document constituent elements that satisfy the condition, the list thereof is obtained) as a search result. When the user wants to see the heading information of the document constituent element that satisfies the search condition and also wants to see the content information of the document constituent element, the user gives an instruction to display the content. In response to this, the display means reads the document constituent pointed by the pointer to the document constituent obtained as the search result of the document constituent search means from the document storage means and displays the contents thereof. As a result, a part of the document that is a candidate for reuse can be efficiently confirmed by a simple operation, and the reuse of the document constituent element can be supported.

【００１３】[0013]

【実施例】図１は本発明の第１の実施例の構成を示す図
である。この装置は、図１に示すように、入力部１−
１、文書指定部１−２、検索部１−３、表示部１−４、
文書編集部１−５、印刷部１−６、メモリ１−７、文書
格納制御部１−８、文書読込部１−９、一時文書記憶部
１−１０、文書登録部１−１１、文書操作部１−１２お
よび文書管理部１−１３および検索用データ更新部１−
１４等を備えている。文書操作部１−１２は、文書指定
部１−２により文書と版の名前が指定されると版管理部
を用いて対象データを特定し、入力部１−１により指定
された操作（処理）を行なう。指定される操作として
は、文書管理部１−１３により管理されている文書記憶
部１−１３１中の文書の文書一時記憶部１−１０への取
り出し、文書の削除、文書の版の削除等である。文書管
理部のデータは、複数のユーザからアクセスされる。文
書一時記憶部１−１０は、個人用の作業スペースで、文
書の編集のために使われる。文書管理部１−１３から、
文書一時記憶部１−１０へ取り込まれた文書は、文書管
理部１−１３では、書込み操作に対して排他的なロック
が掛けられる。ただし、読み込み操作は、行うことがで
きる。文書一時記憶部１−１０へ取り込まれた文書は、
編集作業がなされ、編集作業が終了した時点で、文書管
理部１−１３へ戻される。1 is a diagram showing the configuration of a first embodiment of the present invention. This device, as shown in FIG.
1, document designation unit 1-2, search unit 1-3, display unit 1-4,
Document editing unit 1-5, printing unit 1-6, memory 1-7, document storage control unit 1-8, document reading unit 1-9, temporary document storage unit 1-10, document registration unit 1-11, document operation Section 1-12, document management section 1-13, and search data updating section 1-
14 and so on. When the document designation unit 1-2 designates the name of the document and the version, the document operation unit 1-12 identifies the target data using the version management unit, and the operation (process) designated by the input unit 1-1. Do. The designated operation includes fetching a document in the document storage unit 1-131 managed by the document management unit 1-13 to the document temporary storage unit 1-10, deleting the document, and deleting the document version. is there. The data in the document management section is accessed by a plurality of users. The document temporary storage unit 1-10 is a personal work space and is used for editing a document. From the document management unit 1-13,
The document fetched in the document temporary storage unit 1-10 is exclusively locked to the writing operation in the document management unit 1-13. However, the read operation can be performed. The document captured in the document temporary storage unit 1-10 is
The editing work is performed, and when the editing work is completed, it is returned to the document management unit 1-13.

【００１４】文書読込部１−９は、文書一時記憶部１−
１０からメモリ１−７に文書データを読み込む。文書の
表示部１−４による表示や文書編集部１−５による編集
は、メモリ１−７に読み込まれた文書データを対象にそ
の操作が行われる。一時的な編集結果は、文書格納制御
部１−８により、文書一時記憶部１−１０に格納され
る。文書の編集作業が終了したら、文書登録部１−１１
は、一時文書記憶部１−１０から文書管理部１−１３
に、更新した文書の登録を行ない、版の更新を行なう。
文書管理部１−１３は、文書記憶部１−１３１、版管理
部１−１３２、版構造記憶部１−１３３および検索用デ
ータ記憶部１−１３４等を有しており、文書記憶部１−
１３１に記憶する構造化文書の管理を行うとともに、文
書の版の管理をも行う。版管理部１−１３２は、文書記
憶部１−１３１に記憶されている文書の版を木構造で管
理し、その管理データは版構造記憶部１−１３３に記憶
する。その版管理部１−１３２により文書と版の名前か
ら、ある文書の版を特定することができる。文書データ
記憶部１−１３１は、文書構成要素を木構造として記憶
し、版管理部により文書に複数の版を記憶することがで
きる。検索用データ記憶部１−１３４は、文書検索用の
集合と構成要素検索用の集合を記憶する。検索部１−３
は、検索用データを通して文書、文書構成要素の検索を
行ない、結果を表示部１−４に表示する。検索用データ
更新部１−１４では、文書登録部１−１１や文書操作部
１−１０の操作による文書記憶部１−１３１のデータの
更新に検索用データが対応できるように、検索用データ
記憶部１−１３４のデータの更新を行なう。The document reading unit 1-9 is a document temporary storage unit 1-.
Document data is read from the memory 10 into the memory 1-7. The display of the document by the display unit 1-4 and the editing by the document editing unit 1-5 are performed on the document data read in the memory 1-7. The temporary editing result is stored in the document temporary storage unit 1-10 by the document storage control unit 1-8. When the document editing work is completed, the document registration unit 1-11
From the temporary document storage unit 1-10 to the document management unit 1-13
Then, the updated document is registered and the version is updated.
The document management unit 1-13 includes a document storage unit 1-131, a version management unit 1-132, a version structure storage unit 1-133, a search data storage unit 1-134, and the like.
In addition to managing the structured document stored in 131, it also manages the version of the document. The version management unit 1-132 manages the versions of the document stored in the document storage unit 1-131 in a tree structure, and stores the management data in the version structure storage unit 1-133. The version management unit 1-132 can specify the version of a document from the name of the document and the version. The document data storage unit 1-131 stores the document constituent elements as a tree structure, and the version management unit can store a plurality of versions in the document. The search data storage unit 1-134 stores a set for document search and a set for component search. Search unit 1-3
Searches for documents and document constituent elements through the search data, and displays the results on the display unit 1-4. The search data update unit 1-14 stores the search data so that the search data can correspond to the update of the data in the document storage unit 1-131 by the operation of the document registration unit 1-11 or the document operation unit 1-10. The data of the part 1-134 is updated.

【００１５】以下、順を追って以上のように構成された
本発明の実施例の動作を説明する。図２に文書の例を示
す。本実施例で説明する文書は、章、節、段落、枠等の
論理構造に分割された要素から構成される構造化文書で
あり、木構造によって表現される。本発明でいう文書構
成要素とは、章、節、段落、枠といった論理構造で分割
されるものである。文書記憶部では、図２に示した文書
を図３に示すような論理構造の形式で記憶している。The operation of the embodiment of the present invention configured as described above will be described below step by step. FIG. 2 shows an example of a document. The document described in this embodiment is a structured document composed of elements divided into logical structures such as chapters, sections, paragraphs, and frames, and is represented by a tree structure. The document constituent element in the present invention is one that is divided by a logical structure such as a chapter, a section, a paragraph, and a frame. The document storage unit stores the document shown in FIG. 2 in a logical structure format as shown in FIG.

【００１６】版管理部１−１３２は、図４に示すよう
に、文書ごとに、文書の複数の版を、版を表すノードと
版の派生関係を表すリンクとからなる木構造によって管
理する。図４の例では、文書Ａのｍａｉｎ１と名付けら
れた初版からｍａｉｎ２と名付けられた版が作成され、
次にｍａｉｎ３と名付けられた版が作成された経緯が示
されており、さらに、ｍａｉｎ２から、ｅｄａ１と名付
けられた版が作成され、次にｅｄａ２と名付けられた版
が作成された経緯が示されている。ｍａｉｎ１、ｍａｉ
ｎ２およびｍａｉｎ３の経緯が木構造の一つの枝を構成
し、この例ではｍａｉｎの枝と呼び、また、ｍａｉｎ２
から分岐するｅｄａ１およびｅｄａ２の経緯の枝をｅｄ
ａの枝と呼ぶ。As shown in FIG. 4, the version management unit 1-132 manages, for each document, a plurality of versions of the document by a tree structure composed of nodes representing versions and links representing derivation relationships of the versions. In the example of FIG. 4, the first version of document A named main1 is created from the version named main2,
It shows how the version named main3 was created, and from main2 the version named eda1 was created, and then the version named eda2 was created. ing. main1, mai
The history of n2 and main3 constitutes one branch of the tree structure, which is called the main branch in this example.
Eda1 and eda2 branching from ed
Call it a branch.

【００１７】版管理の対象となるのは、図３のＤｏｃの
ｂｏｄｌｉｓｔ以外の値である。版管理の対象になって
いるデータは、文書版管理部により管理され、特定の版
を指定することにより、版管理部からその版の値を得る
ことができる。文書内の版は、名前によって管理されて
おり、文書と版の名前を指定することにより所望の版を
特定し、その版の文書データにアクセスする。The object of version management is a value other than the bodlist of Doc in FIG. The data that is the target of version management is managed by the document version management unit, and by specifying a specific version, the value of that version can be obtained from the version management unit. The version in the document is managed by the name, the desired version is specified by designating the name of the document and the version, and the document data of the version is accessed.

【００１８】検索や、文書の版指定のために文書版管理
用データとしてＢｏｄ（ＢｕｒａｎｃｈｏｆＤｏｃｕ
ｍｅｎｔ）を使用する。Ｂｏｄは、版の枝ごとに存在
し、枝の中での版の管理に用いられる。Ｂｏｄのデータ
構造は、図５の５１に示すように、ｄｏｃ、ｃｂ＿ｎａ
ｍｅ、ｂ＿ｎａｍｅ、ｖ＿ｌｉｓｔのデータを含んでい
る。ｄｏｃは、対象の文書の文書管理用データ、ｃｂ＿
ｎａｍｅは、枝の最新版の名前、ｂ＿ｎａｍｅは、枝の
分岐点となった版の名前、ｖ＿ｌｉｓｔは、枝に属して
いる版の名前のリストである。図５のｍａｉｎの枝にお
いて、最新版はｍａｉｎ３であるのでｃｂ＿ｎａｍｅは
ｍａｉｎ３を保持し、版の枝はｍａｉｎ１，ｍａｉｎ
２，ｍａｉｎ３からなるのでｖ＿ｌｉｓｔはそれらを保
持する。Ｂｏｄは、枝版の作成が行なわれた場合に、１
つ生成され、Ｄｏｃのｂｏｄｌｉｓｔに追加する。図５
の例では版の枝はｍａｉｎの枝のみであるので、文書版
管理用データはそれに対応するＢｏｄ＿１のみである
が、枝が追加された図６の例ではｍａｉｎの枝を管理す
るＢｏｄ＿１と、ｅｄａの枝を管理するＢｏｄ＿２とか
らなっており、文書管理用データＤｏｃのｂｏｄｌｉｓ
ｔにはＢｏｄ＿１とＢｏｄ＿２がリストされている。For the purpose of searching and specifying the version of a document, Bod (Burch of Docu) is used as document version management data.
ment) is used. The Bod exists for each branch of the plate and is used for managing the plate in the branch. The data structure of the Bod is doc, cb_na as shown in 51 of FIG.
It includes data of me, b_name, and v_list. doc is the document management data of the target document, cb_
name is the name of the latest version of the branch, b_name is the name of the version that became the branch point of the branch, and v_list is a list of the names of the versions that belong to the branch. In the main branch of FIG. 5, since the latest version is main3, cb_name holds main3, and the main branch has main1 and main3.
V_list holds them because they consist of 2, main3. The Bod is 1 when the stencil is created.
Is generated and added to the Doc's bodylist. Figure 5
In the example of FIG. 6, since the version branch is only the main branch, the document version management data is only the corresponding Bod_1, but in the example of FIG. 6 in which the branch is added, the Bod_1 that manages the main branch and the eda Of the document management data Doc.
Bod_1 and Bod_2 are listed in t.

【００１９】検索用データ記憶部１−１３４は、文書検
索用の集合と構成要素検索用の集合を持つ。文書検索用
の集合は、Ｄｏｃａｔｔｒの要素からなる集合である。
構成要素検索用の集合は、ＥｌｅｍｅｎｔＡｔｔｒの要
素からなる集合である。図７に示す様にＤｏｃＡｔｔｒ
７１は、ｄｏｃ、ｂｏｄ、ｎａｍｅ、ｏｔｈｅｒ＿ａｔ
ｔｒｓ等を有する。ｄｏｃは検索対象となる文書、ｂｏ
ｄは検索対象となる版の枝に対応するもの、ｎａｍｅは
文書名、ｏｔｈｅｒ＿ａｔｔｒｓは作成日やキーワード
等検索を行なう属性値である。ＥｌｅｍｅｎｔＡｔｔｒ
７２はｅｌｅｍｅｎｔ、ｂｏｄ、ｔｉｔｌｅ、ｏｔｈｅ
ｒ＿ａｔｔｒｓ等を有する。ｅｌｅｍｅｎｔは、文書構
成要素の部分木の頂点となる要素、ｂｏｄは文書構成要
素が属している文書の枝を管理するＢｏｄ、ｔｉｔｌｅ
は文書構成要素の見出しやキャプション、ｏｔｈｅｒ＿
ａｔｔｒｓは検索対象となるその他の属性で、作成日や
キーワード等である。The search data storage unit 1-134 has a set for document search and a set for component search. The set for document search is a set of elements of Docattr.
The set for component search is a set of Elements of ElementAttr. As shown in FIG. 7, DocAttr
71 is doc, bod, name, other_at
with trs etc. doc is a document to be searched, bo
d is the one corresponding to the branch of the version to be searched, name is the document name, and other_attrs is the attribute value for performing the search such as the creation date and the keyword. ElementAttr
72 is element, body, title, and the
r_attrs and the like. element is an element that is the top of the subtree of the document constituent element, and bod is a Bod or title that manages the branch of the document to which the document constituent element belongs.
Is a document component heading, caption, other_
attrs are other attributes to be searched, such as a creation date and a keyword.

【００２０】検索用データは、文書の版の枝ごとに作成
し、その最新版を検索対象とする。図８に検索用データ
と版管理を伴った文書、文書構成要素の関係の概要を示
す。図８に示された枝版に対する検索用集合の要素ｄａ
１は文書Ａのｍａｉｎ３に対応し、ｄａ２は、文書Ａの
ｅｄａ１に対応する。要素ｅ１，ｅ２は文書Ａのｍａｉ
ｎ３の構成要素であり、ｅ３，ｅ４は文書Ａのｅｄａ１
に属する構成要素である。図９に図８に示したｄａ１，
ｄａ２，ｅ１，ｅ２，ｅ３，ｅ４とＢｏｄの関係を示
す。この例において、文書検索用集合の要素ｄａ１のｂ
ｏｄは図８のｍａｉｎの枝に対応する版管理用データＢ
ｏｄ＿１を指しており、また、構成要素検索用の集合の
要素ｅ１，ｅ２のｂｏｄも図８のｍａｉｎの枝に対応す
る版管理用データＢｏｄ＿１を指している。同様に要素
ｄａ２，ｅ３，ｅ４は図８のｅｄａの枝に対応する版管
理用データＢｏｄ＿２を指している。このように、検索
用集合の各要素は、その要素が属しているＢｏｄを指し
ているので、検索中に文書にアクセスする場合や、検索
結果から文書内容にアクセスする場合に、Ｂｏｄを用い
て版を特定することができる。The search data is created for each branch of the document version, and the latest version is used as the search target. FIG. 8 shows an outline of the relationship between the search data, the document accompanied by version management, and the document constituent elements. Element da of the search set for the version shown in FIG.
1 corresponds to main3 of document A, and da2 corresponds to eda1 of document A. The elements e1 and e2 are the mai of the document A
n3 is a constituent element, and e3 and e4 are eda1 of document A
Is a component belonging to. In FIG. 9, da1 shown in FIG.
The relationship between da2, e1, e2, e3, e4 and Bod is shown. In this example, b of the element da1 of the document search set
od is the version management data B corresponding to the main branch of FIG.
It also refers to od_1, and the bods of the elements e1 and e2 of the set for component element search also point to the version management data Bod_1 corresponding to the main branch in FIG. Similarly, the elements da2, e3, and e4 indicate the version management data Bod_2 corresponding to the branch of eda in FIG. In this way, each element of the search set points to the Bod to which the element belongs, so when accessing a document during a search or when accessing the document contents from a search result, the Bod is used. The version can be specified.

【００２１】検索部１−３は、検索用データを通して検
索を行ない、検索条件に合った要素を保持する。検索結
果表示部は、検索結果の要素を表示する。文書名”文書
Ａ”の文書に版ｍａｉｎ１，ｍａｉｎ２，ｍａｉｎ３，
ｅｄａ１が存在した場合、２つの枝が存在しているの
で、文書名、”文書Ａ”の検索により、２つの文書検索
用データｄａ１，ｄａ２が検索される。この検索結果に
対してｂｏｄのｃｂ＿ｎａｍｅから版名を表示をさ
せ、どちらの文書を指定するかを決定できる。それらの
文書はいずれも版の枝の最新版である。従って、その
後、文書内容検索を行うときには、枝の最新版の内容が
検索対象となる。The search unit 1-3 performs a search through the search data and holds an element that meets the search condition. The search result display section displays the elements of the search result. Versions main1, main2, main3, and the like of the document with the document name "Document A"
When eda1 exists, two branches exist, and therefore, two document search data da1 and da2 are searched by searching the document name "document A". It is possible to display the version name from the cb_name of the bod for this search result and determine which document is designated. Each of these documents is the latest version of the version branch. Therefore, when the document content search is performed thereafter, the content of the latest version of the branch becomes the search target.

【００２２】なお、検索部１−３により全版を検索対象
として指定できるようにし、検索用要素の各Ｂｏｄのｖ
＿ｌｉｓｔを利用して最新版だけではなく、全版を検索
対象とするように構成することもできる。また、文書指
定部１−２により文書と版名が指定されれば、最新版で
なくても版管理部１−１３２を用いて中間の版の文書を
操作することができる。It should be noted that the search unit 1-3 allows all editions to be designated as a search target, and v of each Bod of the search element is designated.
By using _list, it is possible to configure not only the latest version but all the versions as search targets. Further, if the document and the version name are designated by the document designating section 1-2, an intermediate version of the document can be operated using the version managing section 1-132 even if it is not the latest version.

【００２３】文書構成要素の検索は、構成要素検索用の
集合に対して検索を行ない、検索条件に合った要素を得
る。その得られた検索用データの要素（ｅ１，ｅ２等）
における、部分木の頂点となる文書構成要素を指すｅｌ
ｅｍｅｎｔを基に、文書構成要素にアクセスすることが
できる。即ち、文書の構成要素へのアクセスは、文書構
成要素の部分木の頂点を辿ることによりその部分木をア
クセスすることができる。In the document component search, a set for component search is searched to obtain an element that meets the search condition. Elements of the obtained search data (e1, e2, etc.)
El that points to the document element that is the vertex of the subtree in
Based on the element, the document component can be accessed. That is, in order to access a document constituent element, the subtree can be accessed by tracing the vertices of the subtree of the document constituent element.

【００２４】文書データは版管理が行なわれているの
で、版が更新、削除された場合には、検索用データの更
新を行なわなければならない。検索用データ更新部１−
１４は、文書記憶部１−１３１の文書の版が更新される
際に、文書の最新版を検索対象とするように検索用デー
タの更新を行なう。文書に枝版が作成された場合には、
その枝版の最新版を検索対象として追加する。また、枝
版が削除されたならその枝を検索対象から削除する。Since the document data is managed by version, when the version is updated or deleted, the search data must be updated. Search data update unit 1-
When the version of the document in the document storage unit 1-131 is updated, 14 updates the search data so that the latest version of the document becomes the search target. If a branch is created in the document,
Add the latest version of that branch as a search target. Also, if the branch version is deleted, the branch is deleted from the search target.

【００２５】図１０は、文書登録時の検索用データと文
書の版の登録、更新の処理のフローを示す図である。ステップ（１０−１）文書の登録をするときに、それ
が初めての文書の登録か否かを調べる。ステップ（１０−２）初めての文書の登録である時に
は、版管理用のデータＢｏｄを作成して、値を設定す
る。Ｂｏｄのｃｂ＿ｎａｍｅに登録する版の名前を設定
する。また、ｖ＿ｌｉｓｔに版の名前を追加する。ステップ（１０−３）ステップ（１０−１）で初めて
の文書登録ではないことがわかったときには、同一の枝
の版の更新か否かを判定する。ステップ（１０−４）同じ枝の版の更新であるときに
は、版管理用のデータＢｏｄの値を更新する。即ち、最
新版を指すｃｂ＿ｎａｍｅを、登録する版の名前に設定
する。また、ｖ＿ｌｉｓｔにその登録する版の名前を追
加する。ステップ（１０−５）検索用データのデータを更新す
る。ステップ（１０−６）ステップ（１０−３）の判定
で、同一の枝の版の更新ではないとされたときには、版
管理用のデータＢｏｄを作成して、値を設定する。Ｂｏ
ｄのｃｂ＿ｎａｍｅに登録する版の名前を設定し、ｂ＿
ｎａｍｅに元の版の名前を設定し、ｖ＿ｌｉｓｔに版の
名前を追加する。ステップ（１０−７）登録する文書に対応する文書管
理用データＤｏｃのＢｏｄｌｉｓｔに、ステップ（１０
−２）またはステップ（１０−６）において作成したＢ
ｏｄを追加する。ステップ（１０−８）そして、検索用データにデータ
を登録する。FIG. 10 is a diagram showing a flow of processing for registering and updating the search data and the document version when registering the document. Step (10-1) When registering a document, it is checked whether it is the first document registration. Step (10-2) When the document is registered for the first time, data Bod for version management is created and a value is set. Set the name of the version to be registered in cb_name of Bod. Also, the name of the version is added to v_list. Step (10-3) If it is found in step (10-1) that this is not the first document registration, it is determined whether or not the version of the same branch is updated. Step (10-4) When the version of the same branch is updated, the value of the version management data Bod is updated. That is, cb_name indicating the latest version is set to the name of the version to be registered. Also, the name of the version to be registered is added to v_list. Step (10-5) Update the search data. Step (10-6) If it is determined in step (10-3) that the version of the same branch is not updated, version management data Bod is created and a value is set. Bo
Set the name of the version to be registered in cb_name of d, and b_
The name of the original version is set in name, and the version name is added to v_list. Step (10-7) Step (10-7) is set in the Bodlist of the document management data Doc corresponding to the document to be registered.
-2) or B created in step (10-6)
Add od. Step (10-8) Then, the data is registered in the search data.

【００２６】図１１および図１２は、文書削除、版削除
時の検索用データと文書の版の削除の処理のフローを示
す図である。ステップ（１１−１）削除の対象が全ての版を含む文
書全体であるか否かを判定する。ステップ（１１−２）文書全体の削除であるときに
は、検索用データから削除対象文書と関係のある要素を
全て削除する。ステップ（１１−３）また、文書全体と、その文書を
管理しているＤｏｃおよび文書の版を管理しているＢｏ
ｄを全て削除する。ステップ（１１−４）ステップ（１１−１）の判定
で、文書全体の削除ではないとされたときには、さらに
最新版の削除か否かを判定する。ステップ（１１−５）ステップ（１１−４）で最新版
の削除でないと判定されたときには、枝版のルートの削
除であるか否かを判定する。ステップ（１１−６）枝版のルートの削除ではないと
きには、Ｂｏｄの値を再設定する。即ち、ｖ＿ｌｉｓｔ
から削除される版の名前を削除する。ステップ（１１−７）ステップ（１１−５）で枝版の
ルートの削除であると判定されたときには、その版の削
除を行わない。ステップ（１１−８）ステップ（１１−４）で最新版
の削除であると判定されたときには、枝に他の版がある
かを判定する。ステップ（１１−９）削除対象の版の属する枝に他の
版がないと判定されたときには、他に枝があるかを判定
する。ステップ（１１−１０）ステップ（１１−９）でｎｏ
の判定がなされたとき、即ち、削除対象の版が最新版で
あり、その属する枝に版がなく、また他に枝もない場合
には、検索用データから削除対象の版の要素を全て削除
する。ステップ（１１−１１）また、この場合には、削除対
象の版の文書が、その文書の全体であるので、文書全体
と、その文書の管理用データＤｏｃおよび文書の版を管
理しているＢｏｄを全て削除する。ステップ（１１−１２）ステップ（１１−９）でｙｅ
ｓの判定がなされたとき、即ち、削除対象の版が最新版
であり、その属する枝に他に版がなく、かつ他の枝は存
在する場合には、検索用データから削除対象の版の要素
を全て削除する。ステップ（１１−１３）また、削除しようとする文書
の版の版管理用データＢｏｄを削除する。ステップ（１１−１４）ステップ（１１−８）でｙｅ
ｓの判定がなされたとき、即ち、削除対象の版が最新版
であり、その属する枝に他の版がある場合には、検索用
データから削除対象の版の要素を全て削除する。ステップ（１１−１５）次に、その版の属する枝に対
応する版管理用データＢｏｄの値を再設定する。即ち、
Ｂｏｄのｃｂ＿ｎａｍｅに一つ前の版の名前を設定す
る。また、ｖ＿ｌｉｓｔから削除される版の名前を削除
する。ステップ（１１−１６）そして、検索用データを一つ
前の版の値に更新する。FIG. 11 and FIG. 12 are diagrams showing a flow of processing for deleting a document, search data when a version is deleted, and a version of the document. Step (11-1) It is determined whether the deletion target is the entire document including all the editions. Step (11-2) When the entire document is to be deleted, all the elements related to the deletion target document are deleted from the search data. Step (11-3) In addition, the entire document, the Doc that manages the document, and the Bo that manages the document version
Delete all d. Step (11-4) If it is determined in step (11-1) that the entire document is not deleted, it is further determined whether the latest version is deleted. Step (11-5) When it is determined in step (11-4) that the latest version is not deleted, it is determined whether or not the root of the branched version is deleted. Step (11-6) When it is not the deletion of the branching route, the value of Bod is reset. That is, v_list
Remove the name of the edition that will be deleted from. Step (11-7) When it is determined in step (11-5) that the root of the edgy is deleted, the version is not deleted. Step (11-8) When it is determined in step (11-4) that the latest version is deleted, it is determined whether there is another version on the branch. Step (11-9) If it is determined that there is no other version in the branch to which the version to be deleted belongs, then it is determined whether there is another branch. Step (11-10) No in Step (11-9)
If it is judged that the version to be deleted is the latest version, there is no version in the branch to which it belongs, and there are no other branches, delete all the elements of the version to be deleted from the search data. To do. Step (11-11) Further, in this case, since the document of the version to be deleted is the entire document, the Bod managing the entire document, the management data Doc of the document, and the version of the document. Delete all. Step (11-12) Yes in Step (11-9)
When the determination of s is made, that is, when the version to be deleted is the latest version, there is no other version in the branch to which it belongs, and there is another branch, the version to be deleted from the search data is Delete all elements. Step (11-13) Also, the version management data Bod of the version of the document to be deleted is deleted. Step (11-14) Yes in Step (11-8)
When s is determined, that is, when the version to be deleted is the latest version and there is another version in the branch to which it belongs, all the elements of the version to be deleted are deleted from the search data. Step (11-15) Next, the value of the version management data Bod corresponding to the branch to which the version belongs is reset. That is,
Set the name of the previous version in cb_name of the Bod. Also, the name of the version to be deleted from v_list is deleted. Step (11-16) Then, the search data is updated to the value of the previous version.

【００２７】以上により、版管理されている文書に対し
てその最新版の文書と文書構成要素を検索対象とするこ
とができる。さらに、全ての版を検索対象と指定するこ
とにより、全ての版に対して文書検索を行なうことがで
きる。As described above, with respect to the document whose version is managed, the latest version of the document and the document component can be searched. Furthermore, by designating all the editions as search targets, it is possible to perform a document search for all the editions.

【００２８】［第２の実施例］第２の実施例は、第１の
実施例に、文書記憶部から見出し文字列などを検索キー
として文書構成要素を検索し、検索結果の要素あるいは
その要素に関連する要素に対しその内容を文書記憶部よ
り読みだして内容表示を行なうことにより、再利用の候
補である文書の一部の内容を確認でき、文書構成要素の
再利用をする場合の判断支援を行う構成を付加した実施
例である。この第２の実施例も第１の実施例と同様に図
１の概略構成を有するが、上記付加部分である図１３の
構成が図１の文書検索装置の機能の一部として追加され
ている。この追加部分を中心に第２の実施例について説
明する。なお、図１３の機能と第１の実施例の装置の構
成要素との関係を次に示す。検索条件入力部１３−１
は、入力部１−１に含まれる。文書構成要素検索部１３
−２は検索部１−３に含まれる。文書記憶部１３−４は
文書管理部１−１３に含まれる。検索結果記憶部１３−
４はメモリ１−７に含まれる。文書構成要素選択部１３
−５は文書指定部１−２に含まれる。文書構成要素表示
部１３−６は文書表示部１−４に含まれる。[Second Embodiment] The second embodiment is the same as the first embodiment except that the document constituent elements are searched from the document storage section using the index character string or the like as a search key, and the search result element or its element is searched. By reading the contents of the elements related to the document from the document storage unit and displaying the contents, it is possible to confirm a part of the document that is a candidate for reuse, and to judge when the document components are reused. This is an example in which a configuration for supporting is added. The second embodiment also has the schematic configuration of FIG. 1 similarly to the first embodiment, but the configuration of FIG. 13 which is the additional portion is added as a part of the function of the document retrieval apparatus of FIG. . The second embodiment will be described focusing on this additional portion. The relationship between the functions of FIG. 13 and the constituent elements of the apparatus according to the first embodiment is shown below. Search condition input section 13-1
Is included in the input unit 1-1. Document component search unit 13
-2 is included in the search unit 1-3. The document storage unit 13-4 is included in the document management unit 1-13. Search result storage unit 13-
4 is included in memories 1-7. Document component selection unit 13
-5 is included in the document designation section 1-2. The document component display unit 13-6 is included in the document display unit 1-4.

【００２９】図１３において、検索条件入力部１３−１
はキーボードなどの入力装置によって文書構成要素を検
索する際の条件を入力するものである。文書構成要素検
索部１３−２は検索条件入力部１３−１で得た条件によ
りデータから文書構成要素を検索し、結果として文書構
成要素へのポインタのリストを返す機能を有する。文書
記憶部１３−３は複数の文書データを文書構成要素の集
合として記憶する記憶部であり、図１の文書記憶部１−
１３１と同じものである。文書データは文書構成要素の
検索、文書構成要素単位での読みだしを可能とするデー
タ構造を有している。検索結果記憶部１３−４は文書構
成要素検索部１３−２の検索結果の出力である文書構成
要素へのポインタのリストを記憶しておく部分である。
文書構成要素選択部１３−５は検索結果記憶部１３−４
に記憶している検索結果から表示する文書構成要素を選
択する部分である。文書構成要素表示部１３−６は文書
構成要素選択部１３−５で選択した文書構成要素を必要
な部分だけ文書記憶部１３−３から読み込んでディスプ
レイ等の表示部１−４に表示する部分である。In FIG. 13, a search condition input unit 13-1
Is for inputting a condition for searching a document constituent element with an input device such as a keyboard. The document constituent element searching unit 13-2 has a function of searching the data for a document constituent element according to the condition obtained by the search condition input unit 13-1 and returning a list of pointers to the document constituent element as a result. The document storage unit 13-3 is a storage unit that stores a plurality of pieces of document data as a set of document constituent elements, and the document storage unit 1- of FIG.
It is the same as 131. The document data has a data structure that enables retrieval of document constituent elements and reading in document constituent element units. The search result storage unit 13-4 is a unit for storing a list of pointers to document components, which are output of the search results of the document component search unit 13-2.
The document component selection unit 13-5 is a search result storage unit 13-4.
This is a part for selecting a document component to be displayed from the search results stored in. The document component display unit 13-6 is a unit for reading only the required portion of the document component selected by the document component selection unit 13-5 from the document storage unit 13-3 and displaying it on the display unit 1-4 such as a display. is there.

【００３０】ここで、文書構成要素の例として章、節、
図表を取り扱うことにする。文書は文書記憶部１３−３
において図３に示すような木構造で記憶されている。文
書検索用の集合と文書構成要素検索用の集合は、図７お
よび図８に示したような関係で文書情報と関係付けられ
ている。この関係を図１４を用いてさらに説明する。図
１４（ａ）に示すように文書名とその文書の木構造にお
けるルートへのポインタ（格納位置）との対応関係を示
す索引情報を作成しておき、この索引情報を用いること
により文書のルートを高速に検索し、また、図１４
（ａ）（ｂ）に示すように文書構成要素の見出し文字列
とその文書構成要素へのポインタ（格納位置）との対応
関係を示す文書構成要素検索用の索引情報を作成してお
き、これを用いることにより文書構成要素の検索を高速
化する。Here, as an example of document constituent elements, chapters, sections,
I will handle the charts. The document is stored in the document storage unit 13-3.
In FIG. 3, it is stored in a tree structure as shown in FIG. The document search set and the document component search set are associated with the document information in the relationship shown in FIGS. 7 and 8. This relationship will be further described with reference to FIG. As shown in FIG. 14A, index information indicating the correspondence between the document name and the pointer (storage position) to the root in the tree structure of the document is created, and this index information is used to create the root of the document. Is searched at high speed, and as shown in FIG.
As shown in (a) and (b), index information for searching a document component is created in advance, which indicates the correspondence between the index character string of the document component and the pointer (storage position) to the document component. To speed up the retrieval of document components.

【００３１】このように構成された本実施例の文書処理
装置における処理の過程を図１５のフローチャートによ
り詳細に説明する。ステップ（１５−１）まずキーボードなどの入力手
段である検索条件入力部１３−１により文書構成要素検
索条件を入力する。ここで言う検索条件とは、例えば、
内容に「構造化文書」という文字列を含む段落や、見出
し文字列に「データ構造」を含む表、さらに第一章に含
まれる図等の条件である。ステップ（１５−２）文書構成要素検索部１３−３
は、ステップ（１５−１）で取得した検索条件により検
索を行なう。検索の方法は、図１４で説明した検索用の
集合を利用して見出し文字列などで検索を行なう方法
と、構造木を辿って各文書構成要素について判定する方
法があるが、本実施例では前者の方法を用いることによ
り高速に検索する。検索結果として検索条件を満たす文
書構成要素へのポインタと文書構成要素を代表するよう
な文字列、例えば章、節の見出し文字列や図形のキャプ
ションなどを対にしてリストの形で出力する。ステップ（１５−３）検索結果が１以上得られたか否
かを判定し、得られなかった場合には処理を終了し、検
索結果が１以上得られたときにはステップ１５−４に移
る。ステップ（１５−４）ステップ（１５−２）で取得し
た検索結果を検索結果記憶部１３−４に一時的に記憶す
る。ステップ（１５−５）ステップ（１５−４）で記憶
した結果のリストの文書構成要素に対応する文字列をデ
ィスプレイなどの表示部１−４に表示する。ステップ（１５−６）実際に内容を表示したい文書構
成要素の文字列を選択する。ここで候補となる文書構成
要素の全てを選択することや複数を選択することも可能
である。The process steps in the document processing apparatus of the present embodiment thus constructed will be described in detail with reference to the flowchart of FIG. Step (15-1) First, the document condition search conditions are input by the search condition input unit 13-1 which is an input means such as a keyboard. The search condition here is, for example,
The conditions include paragraphs that include the character string “structured document” in the content, tables that include the “data structure” in the heading character string, and the figures included in Chapter 1. Step (15-2) Document component search unit 13-3
Performs a search according to the search condition acquired in step (15-1). As a search method, there are a method of searching with a headline character string or the like using the set for search described in FIG. 14 and a method of determining each document constituent element by tracing a structure tree. The former method is used to search at high speed. As a search result, a pointer to a document constituent element that satisfies the search condition and a character string that represents the document constituent element, such as a chapter or section heading character string or a figure caption, are output as a pair. Step (15-3) It is determined whether or not one or more search results have been obtained. If no search result has been obtained, the process ends, and if one or more search results have been obtained, the process proceeds to step 15-4. Step (15-4) The search result acquired in step (15-2) is temporarily stored in the search result storage unit 13-4. Step (15-5) A character string corresponding to the document constituent element of the result list stored in step (15-4) is displayed on the display unit 1-4 such as a display. Step (15-6) Select the character string of the document component whose contents are to be actually displayed. Here, it is possible to select all or a plurality of candidate document constituent elements.

【００３２】ステップ（１５−７）ステップ（１５−
６）で選択した文書構成要素を表示する。表示の実現方
法は選択した文書構成要素に対応する部分木を一時記憶
に読みだし、一時記憶中でその部分木からなる文書を構
成して通常の表示方法と同じ方法で表示する。検索結果
から文書構成要素を複数選択した場合は先頭の要素をま
ず表示し、次の要素の表示を指示できるようにすること
で対処できる。ステップ（１５−８）ステップ（１５−７）で表示し
た文書構成要素に対してステップ（１５−７）編集
や複写などの処理を行なうか否かを判定し、操作を行わ
ない場合には処理を終了する。ステップ（１５−９）操作を行う場合には表示した文
書構成要素に対して操作を実行する。Step (15-7) Step (15-
The document component selected in 6) is displayed. As a display realizing method, a subtree corresponding to the selected document constituent element is read into the temporary storage, a document composed of the subtree is constructed in the temporary storage, and the document is displayed in the same manner as the normal display method. When a plurality of document constituent elements are selected from the search result, the first element is displayed first, and the next element can be instructed to be displayed. Step (15-8) Step (15-7) It is determined whether or not processing such as editing or copying is performed on the document component displayed in step (15-7). If no operation is performed, processing is performed. To finish. Step (15-9) When performing an operation, the operation is performed on the displayed document constituent element.

【００３３】この第２の実施例によれば、文書構成要素
の検索結果となる文書構成要素に対して、検索対象とな
っている文書構成要素の部分木の頂点を直接アクセスす
ることができ、そこから文書構成要素をなす部分木のデ
ータをアクセスすることにより、その文書構成要素の内
容を内容を文書全体を読み込むことなしに表示、確認で
きる。文書の一部について再利用したい場合にその判断
を対話的にかつ短時間で行なうことができる。According to the second embodiment, it is possible to directly access the vertex of the subtree of the document component to be searched for the document component which is the search result of the document component. By accessing the data of the subtree that constitutes the document constituent element from, the contents of the document constituent element can be displayed and confirmed without reading the entire document. When it is desired to reuse a part of a document, the judgment can be made interactively and in a short time.

【００３４】[0034]

【発明の効果】以上により、第１の発明によれば、検索
装置は版管理手段の版管理により特定される版につい
て、文書または文書構成要素の検索を行うことにより、
検索対象を絞り込むことができるので、検索対象となる
データ量の減少と、同一文書の類似情報による検索ノイ
ズを改善することができ、検索効率を向上させることが
できる。また、版管理を版の枝とそれに属する版のノー
ドを持つ木構造によって行うので、版の更新があっても
特定の版例えば最新版の文書と文書構成要素を検索対象
とすることができる。さらに、全ての版を検索対象と指
定することにより、全ての版に対して文書検索を行なう
ようにすることもできる。As described above, according to the first aspect of the invention, the search device searches for a document or a document component with respect to the version specified by the version management of the version management means.
Since the search target can be narrowed down, the amount of data to be searched can be reduced, and the search noise due to the similar information of the same document can be improved, and the search efficiency can be improved. Further, since the version management is performed by the tree structure having the branch of the version and the node of the version belonging to the version, even if the version is updated, the specific version, for example, the latest version of the document and the document component can be searched. Furthermore, by designating all editions as search targets, document retrieval can be performed for all editions.

【００３５】また、第２の発明によれば、第１の発明と
同様の効果を奏することができるほかに、版管理と関係
づけた検索用データを用いて検索をおこなうので、検索
の速度がさらに向上する。Further, according to the second invention, the same effect as that of the first invention can be obtained, and since the search is performed by using the search data associated with the version management, the search speed can be improved. Further improve.

【００３６】また、第３の発明によれば、文書構成要素
単位での検索と内容の読出しができ、従って文書構成要
素を検索してその内容を文書全体を読み込むことなしに
確認できるため、文書の一部について再利用したい場合
にその判断を対話的にかつ短時間で行なうことができ
る。Further, according to the third aspect of the invention, it is possible to search and read the contents of each document component, and therefore the contents of a document can be searched and confirmed without reading the entire document. If you want to reuse a part of the, you can make the decision interactively and in a short time.

[Brief description of drawings]

【図１】本発明の第１実施例の構成を示す図FIG. 1 is a diagram showing a configuration of a first embodiment of the present invention.

【図２】本発明の対象とする文書の例を示す図FIG. 2 is a diagram showing an example of a document targeted by the present invention.

【図３】文書の論理構造の例を示す図FIG. 3 is a diagram showing an example of a logical structure of a document.

【図４】版の管理を説明するための図FIG. 4 is a diagram for explaining plate management.

【図５】版の管理を説明するための図FIG. 5 is a diagram for explaining plate management.

【図６】版の管理を説明するための図FIG. 6 is a diagram for explaining plate management.

【図７】検索用データのデータ構造を示す図FIG. 7 is a diagram showing a data structure of search data.

【図８】検索用データと版管理を伴った文書、文書構
成要素の関係の概要を示す図。FIG. 8 is a diagram showing an outline of a relationship between search data, a document accompanied by version management, and document constituent elements.

【図９】検索用データと版管理用データとの関係の一
例を示す図FIG. 9 is a diagram showing an example of a relationship between search data and version management data.

【図１０】文書登録時の検索用データと文書の版の登
録、更新の処理のフローを示す図FIG. 10 is a diagram showing a flow of processing for registering and updating search data and a document version at the time of document registration.

【図１１】文書削除、版削除時の検索用データと文書
の版の削除の処理のフローを示す図FIG. 11 is a diagram showing a flow of processing for deleting a document, a search data at the time of version deletion, and a version of a document.

【図１２】文書削除、版削除時の検索用データと文書
の版の削除の処理のフローを示す図（図１１の続き）FIG. 12 is a diagram showing the flow of processing for deleting document, search data and version of a document when deleting a version (continuation of FIG. 11).

【図１３】本発明の第２の実施例の構成を示す図FIG. 13 is a diagram showing a configuration of a second exemplary embodiment of the present invention.

【図１４】（ａ）および（ｂ）は文書および文書構成
要素の索引情報を示す図14A and 14B are diagrams showing index information of a document and document constituent elements.

【図１５】第２の実施例の処理のフローを示す図FIG. 15 is a diagram showing a processing flow of a second embodiment.

[Explanation of symbols]

１−１…入力部、１−２…文書指定部、１−３…検索
部、１−４…表示部、１−５…文書編集部、１−６…印
刷部、１−７…メモリ、１−８…文書格納制御部、１−
９…文書読込部、１−１０…一時文書記憶部、１−１１
…検索用データ更新部、１−１２…文書操作部、１−１
３…文書管理部、１−１３１…文書記憶部、１−１３２
…版管理部、１−１３３…版構造記憶部、１−１３４…
検索用データ記憶部。1-1 ... Input unit, 1-2 ... Document designation unit, 1-3 ... Search unit, 1-4 ... Display unit, 1-5 ... Document editing unit, 1-6 ... Printing unit, 1-7 ... Memory, 1-8 ... Document storage control unit, 1-
9 ... Document reading unit, 1-10 ... Temporary document storage unit, 1-11
... search data update unit, 1-12 ... document operation unit, 1-1
3 ... Document management unit, 1-131 ... Document storage unit, 1-132
... Plate management unit, 1-133 ... Plate structure storage unit, 1-134 ...
Search data storage unit.

───────────────────────────────────────────────────── フロントページの続き (72)発明者山崎伸宏神奈川県川崎市高津区坂戸３丁目２番１号ＫＳＰＲ＆Ｄビジネスパークビル富士ゼロックス株式会社内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Nobuhiro Yamazaki 3-2-1, Sakado, Takatsu-ku, Kawasaki City, Kanagawa Prefecture KSP R & D Business Park Building Fuji Xerox Co., Ltd.

Claims

[Claims]

1. A document storage unit for storing a document composed of document constituent elements, and a plurality of versions for each document are managed in a tree structure, each node of the tree structure being one version, and a node having a root or an elder brother. From the eldest son, the version management information expressed by a sequence tree with a node sequence obtained as a version branch, and version management means for managing the derivation relationship of versions, and documents or documents stored in the document storage means A document search device comprising: a search unit that searches a version whose component is specified by the version management unit.

2. A document storage unit for storing a document composed of document constituent elements, and a plurality of versions for each document are managed in a tree structure, each node of the tree structure is one version, and a node having a root or an elder brother. A version management information that manages the derivation relationship of the version by the version management information expressed by the order tree with the node sequence obtained by tracing the eldest son from the version, and the search including the information that specifies the version of the search target Storage means for storing data for use, update means for updating the search data in response to update of the version management information, and search means for searching for a document constituent element by the search data. Document retrieval device that does.

3. Document storage means for storing a document composed of document constituent elements, heading information of the document constituent elements is searched by a search condition, and heading information of the document constituent element satisfying the search condition and a pointer to the document constituent element. And a display unit for reading the document component pointed by the pointer to the document component obtained as the search result of the document component search unit from the document storage unit and displaying the content information. A document retrieval device characterized by being provided.