JPH04281563A

JPH04281563A - Document retrieving device

Info

Publication number: JPH04281563A
Application number: JP3044666A
Authority: JP
Inventors: Shigemi Nishikawa; 慈海西川; Kunio Kumamoto; 国夫熊本; Kenichi Morizaki; 謙一森崎; Hirokazu Yaegashi; 八重樫　浩和; Koki Koshida; 弘毅越田
Original assignee: Hitachi Ltd; Hitachi Communication Systems Inc
Current assignee: Hitachi Ltd; Hitachi Information Technology Co Ltd
Priority date: 1991-03-11
Filing date: 1991-03-11
Publication date: 1992-10-07

Abstract

PURPOSE:To output only an explanatory part with respect to a certain retrieval word comprehensively from plural relating documents. CONSTITUTION:A first storage means 5 which stores document information representing a document in which a descriptive part with respect to a certain retrieval word exists and descriptive part information representing the start point and end point of the descriptive part in the document at every retrieval word, and a second storage means 4 which stores the contents of the plural relating documents at every document, respectively are provided, and the document information with respect to an inputted retrieval word and the descriptive part information are retrieved from the first storage means 5, and all the corresponding data are outputted from the second storage means 4, thereby, a purpose described above can be attained.

Description

[Detailed description of the invention]

【産業上の利用分野】本発明は記憶装置に記憶されたド
キュメント中の指定された検索用語あるいはこの検索用
語に関する記載部分を検索し出力する検索装置に係り、
特に複数の関連するドキュメントから指定された検索用
語に関する記載部分を抽出して一括して出力できるドキ
ュメント検索装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a search device for searching and outputting a specified search term or a written portion related to the search term in a document stored in a storage device.
In particular, the present invention relates to a document search device that can extract descriptions related to a specified search term from a plurality of related documents and output them all at once.

【従来の技術】例えばコンピュータ等の使用時にはある
機能に関する設計方法、操作方法、及び操作時のメッセ
ージをまとめて見たいという場合がしばしばある。しか
しこの要求に従って機能毎に分類してマニュアルを作成
すると、複数の機能に重複する事項を各機能毎に記載す
ることになり、マニュアルの頁が大幅に増えるという欠
点があった。そのためコンピュータ等のマニュアルは一
般に「設計マニュアル」、「操作マニュアル」、「メッ
セージマニュアル」のように分けられており、このよう
に分けられたマニュアルを使って例えば「ファイルの作
成」という事項について調べるときは、設計方法、操作
方法、及び操作時のメッセージについて上記３種類のマ
ニュアルをそれぞれ調べる必要があった。このような必
要性から、ドキュメント中からある指定された用語を検
索する技術としては、例えば特開昭６２−４２２２９号
公報に記載のような技術がある。これは文書を記憶手段
に記憶しておき、さらに文書中の見出しをその表示頁、
表示行とともに記憶しておき、ある見出しが選択指定さ
れた場合はその見出しの表示頁、表示行以降の文書を読
み出して表示するようになっていた。また、ワードプロ
セッサの用語検索機能は、指定された用語を先頭頁から
順に検索し、その用語が最初に見つけられた頁を表示す
る。そして継続検索が指定されると、次にその用語が現
れる位置を検索するようになっている。2. Description of the Related Art For example, when using a computer, it is often necessary to view the design method, operation method, and operation messages related to a certain function all at once. However, if a manual is created by classifying the manuals by function in accordance with this requirement, items that overlap with multiple functions will be described for each function, which has the disadvantage of significantly increasing the number of pages in the manual. For this reason, manuals for computers, etc. are generally divided into ``design manuals,'' ``operation manuals,'' and ``message manuals.'' When using manuals divided in this way to investigate matters such as ``creating files,'' It was necessary to examine each of the above three types of manuals regarding the design method, operation method, and messages during operation. In view of this need, there is a technique described in, for example, Japanese Unexamined Patent Publication No. 62-42229 as a technique for searching a specified term in a document. This stores the document in a storage means, and also displays the headings in the document on its display page.
It is stored together with the display line, and when a certain heading is selected and designated, the display page of that heading and the document after the display line are read and displayed. Further, the term search function of a word processor searches for a specified term from the first page and displays the page where the term is first found. When a continuous search is specified, the next position where that term appears is searched.

【発明が解決しようとする課題】上記従来技術は、指定
された見出しあるいは用語を含む章、節、または段落だ
けを自動的に取り出すことができなかった。そのため操
作者自らが検索された用語の前後を読んで、指定された
見出しあるいは用語に関係する章、節、または段落を判
断して取り出す必要があるという問題があった。また上
記マニュアルのように複数に分けられたドキュメントに
対して検索を行う場合は、各ドキュメントに対して操作
者が上記作業を行う必要があるという問題があった。本
発明は、指定された検索用語に関する説明箇所の開始位
置から終了位置までを細かく出力することを目的とする
。また本発明は、検索用語を一度指定するだけで複数の
ドキュメントから検索用語に関する説明箇所をそれぞれ
取り出し一括して出力することを目的とする。SUMMARY OF THE INVENTION The above-mentioned prior art cannot automatically extract only chapters, sections, or paragraphs that include specified headings or terms. Therefore, there is a problem in that the operator must read before and after the searched term to determine and extract the chapter, section, or paragraph related to the specified heading or term. Furthermore, when searching for a document divided into a plurality of documents such as the above-mentioned manual, there is a problem in that the operator needs to perform the above operations for each document. An object of the present invention is to output detailed information from the start position to the end position of an explanation part regarding a specified search term. Another object of the present invention is to extract and output explanatory sections related to the search term from a plurality of documents at once by specifying the search term once.

【課題を解決するための手段】上記目的を達成するため
に本発明は、検索用語を入力する入力手段と、検索用語
に基づいて検索した結果を出力する出力手段とを有する
検索装置において、ある検索用語に関する記載部分が存
在するドキュメントを示すドキュメント情報とドキュメ
ントにおける記載部分の開始点と終了点を示す記載部分
情報を検索用語ごとに記憶する第１の記憶手段と、関連
した複数のドキュメントの内容をそれぞれのドキュメン
トごとに記憶した第２の記憶手段と、入力手段から入力
された検索用語を第１の記憶手段から検索し、検索用語
に対応するドキュメント情報と記載部分情報を読み出し
て、これらの情報に係る記載部分を第２の記憶手段から
抽出する抽出手段を設けたものである。[Means for Solving the Problems] In order to achieve the above object, the present invention provides a search device having an input means for inputting a search term and an output means for outputting a search result based on the search term. a first storage means for storing, for each search term, document information indicating a document in which a written portion related to the search term exists and written portion information indicating a start point and an end point of the written portion in the document; and contents of a plurality of related documents. is stored for each document, and the first storage means is searched for the search term input from the input means, and document information and written partial information corresponding to the search term are read out. An extraction means is provided for extracting a written part related to information from the second storage means.

【作用】検索用語毎にその検索用語に関する記載部分が
存在するドキュメントを示すドキュメント情報とこのド
キュメントにおける記載部分の開始位置と終了位置を示
す記載部分情報を記憶する第１の記憶手段と、複数のド
キュメントの内容をそれぞれのドキュメント毎に記憶す
る第２の記憶手段を設け、検索用語を指定することによ
ってその検索用語が第１の記憶手段に登録されているか
否かを検索し、登録されていればその検索用語に関する
記載部分が存在するドキュメントの情報とそのドキュメ
ントにおける開始位置情報と終了位置情報に基づき、第
２の記憶手段から該当する全てのデータを出力できる。[Operation] A first storage means for storing, for each search term, document information indicating a document in which a description part related to the search term exists and description part information indicating the start position and end position of the description part in this document; A second storage means is provided for storing the content of the document for each document, and by specifying a search term, it is searched whether or not the search term is registered in the first storage means, and whether it is registered or not. For example, all the corresponding data can be output from the second storage means based on the information on the document in which the description part related to the search term exists, and the start position information and end position information in the document.

【実施例】以下本発明の一実施例を図面を用いて詳細に
説明する図１は本発明の全体構成を示す図である。ＣＰ
Ｕ１には端末装置２とプリンタ装置３とドキュメントフ
ァイル４と検索用語ドキュメント関連付けファイル５が
接続されている。そしてＣＰＵ１は端末装置２との会話
形式による処理や、検索結果のプリンタ装置３への出力
、ドキュメントファイル４からの検索用語に対する説明
ページの取り出し処理等を行う。検索用語ドキュメント
関連付けファイル５は磁気ディスク装置等から成り、全
ての検索用語に対して、それぞれのドキュメント中でそ
の説明が記載されている箇所の開始ページと終了ページ
を記憶しているファイルである。図２は検索用語ドキュ
メント関連付けファイル５を詳細に説明するための図で
ある。図２においては、最初に「検索用語１」に関して
ドキュメントｌ中で説明されている部分の開始ページと
終了ページが対になって記憶されており、次に同じ「検
索用語１」に関してドキュメントｍ中で説明されている
部分の開始ページと終了ページが対になって記憶されて
いる。そしてその次に「検索用語２」に関してドキュメ
ントｎ中で説明されている部分の開始ページと終了ペー
ジが対になって記憶されている。ここで例えば、「検索
用語１」に関する説明がドキュメントｎ中にも存在すれ
ば、ドキュメントｍにおける開始ページと終了ページの
後に記憶させればよく、同じように「検索用語２」に関
する説明がドキュメントｌやドキュメントｍ中に存在す
れば、ドキュメントｎにおける開始ページと終了ページ
の前に記憶させればよい。尚、検索用語ドキュメント関
連付けファイル５における各ドキュメント中の開始ペー
ジと終了ページの記憶順序はｌ，ｍ，ｎ…のようにする
必要はない。　　ドキュメントファイル４は磁気ディス
ク装置等から成り、各ドキュメントの全ページが記憶さ
れている。図３は検索用語ドキュメント関連付けファイ
ル５とドキュメントファイル４の関係を説明するための
図である。図３において、「ファイル」という検索用語
に関しては、ドキュメントＡの３ページから５ページに
「ファイル容量の見積り方」について、１３ページから
１８ページに「ファイルの互換性」について、またドキ
ュメントＢの８ページから１０ページに「ファイルの操
作方法」について記載されている。同様に「回線接続」
という検索用語に関しては、ドキュメントＢの１８ペー
ジから２０ページに「回線の操作方法」について記載さ
れている。次に図４を用いて本実施例における処理の流
れを説明する。端末装置２から検索用語が入力されると
ＣＰＵ１は検索用語ドキュメント関連付けファイル５を
読み込み、その中に該検索用語が存在するかどうかサー
チし（ステップ３０）、該検索用語が存在したか否かを
判定する（ステップ３１）。検索用語が検索用語ドキュ
メント関連付けファイル５に存在しなかった場合は、検
索用語が検索用語ドキュメント関連付けファイル５に登
録されていない旨のエラーメッセージを端末装置２に表
示し（ステップ３９）、処理を終了する。一方、検索用
語が検索用語ドキュメント関連付けファイル５に存在し
たならば、次レコードのドキュメントナンバーを読み取
り（ステップ３２）、読み取られたレコードが”ＥＮＤ
”であるか否かを判定する（ステップ３３）。ステップ
３３で読み取られたレコードが”ＥＮＤ”でない場合は
、次レコードの開始ページを読み取り（ステップ３４）
、読み取られたレコードが”ＥＮＤ”であるか否かを判
定する（ステップ３５）。ステップ３５で読み取られた
レコードが”ＥＮＤ”でない場合は、次レコードの終了
ページを読み取る（ステップ３６）。次にステップ３２
で得たドキュメントナンバーに対応するドキュメントフ
ァイル４から、ステップ３４とステップ３６で得た開始
ページから終了ページまでのデータを読み取る（ステッ
プ３７）。そしてステップ３７で得られたデータをプリ
ンタ装置３に出力し（ステップ３８）、ステップ３４か
らの処理を繰り返す。ステップ３５で読み取られたレコ
ードが”ＥＮＤ”となった場合は、ステップ３２に戻り
次のドキュメントナンバーを読み取る。そしてステップ
３３で、読み取られたレコードが”ＥＮＤ”となるまで
ステップ３２からステップ３８の処理を繰り返し、ステ
ップ３３で、読み取られたレコードが”ＥＮＤ”となる
と処理は終了する。本実施例によれば、検索用語を入力
するだけで、この検索用語に関する説明を各ドキュメン
トから順に自動的に取り出すことができるので、操作者
が各ドキュメントを読んで検索用語に関する説明の開始
ページから終了ページを判断する必要が無くなるという
効果がある。尚、上記実施例においては、検索用語に関
する説明の指定はページ単位であったが、検索用語ドキ
ュメント関連付けファイル５に開始行、開始カラム、終
了行、終了カラム等を登録しておけば、取り出し単位を
より細かく指定することができる。また、上記実施例に
おいては、検索結果をプリンタ装置３に出力したが、デ
ィスプレイを有する端末装置２や、磁気ディスク装置な
どのファイル装置に出力するようにしてもよい。DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below in detail with reference to the drawings. FIG. 1 is a diagram showing the overall configuration of the present invention. C.P.
A terminal device 2, a printer device 3, a document file 4, and a search term document association file 5 are connected to U1. The CPU 1 performs conversational processing with the terminal device 2, outputs search results to the printer device 3, and extracts explanation pages for search terms from the document file 4. The search term document association file 5 is made up of a magnetic disk drive or the like, and is a file that stores, for all search terms, the start page and end page of the portion where the explanation is described in each document. FIG. 2 is a diagram for explaining the search term document association file 5 in detail. In FIG. 2, the start page and end page of the part explained in document l regarding "search term 1" are first stored as a pair, and then the same "search term 1" is stored in document m. The start page and end page of the portion described in 2 are stored as a pair. Next, the start page and end page of the portion explained in document n regarding "search term 2" are stored as a pair. For example, if an explanation regarding "search term 1" also exists in document n, it is sufficient to store it after the start page and end page of document m, and similarly, an explanation regarding "search term 2" is stored in document l. or document m, it may be stored before the start page and end page of document n. Note that the storage order of the start page and end page in each document in the search term document association file 5 does not need to be 1, m, n, . . . . The document file 4 is made up of a magnetic disk device, etc., and stores all pages of each document. FIG. 3 is a diagram for explaining the relationship between the search term document association file 5 and the document file 4. As shown in FIG. In Figure 3, regarding the search term "file", pages 3 to 5 of document A contain information about "how to estimate file capacity", pages 13 to 18 contain information about "file compatibility", and pages 13 to 18 of document A contain information about "file compatibility"; Pages 10 to 10 contain instructions on how to operate files. Similarly, "line connection"
Regarding the search term, "line operation method" is described on pages 18 to 20 of document B. Next, the flow of processing in this embodiment will be explained using FIG. 4. When a search term is input from the terminal device 2, the CPU 1 reads the search term document association file 5, searches whether the search term exists therein (step 30), and checks whether the search term exists or not. A determination is made (step 31). If the search term does not exist in the search term document association file 5, an error message indicating that the search term is not registered in the search term document association file 5 is displayed on the terminal device 2 (step 39), and the process ends. do. On the other hand, if the search term exists in the search term document association file 5, the document number of the next record is read (step 32), and the read record is "END".
” (step 33). If the record read in step 33 is not “END”, read the start page of the next record (step 34).
, it is determined whether the read record is "END" (step 35). If the record read in step 35 is not "END", the end page of the next record is read (step 36). Next step 32
Data from the start page to the end page obtained in steps 34 and 36 is read from the document file 4 corresponding to the document number obtained in step 37 (step 37). The data obtained in step 37 is then output to the printer device 3 (step 38), and the processing from step 34 is repeated. If the record read in step 35 is "END", the process returns to step 32 and reads the next document number. Then, in step 33, the processes from step 32 to step 38 are repeated until the read record becomes "END", and when the read record becomes "END" in step 33, the process ends. According to this embodiment, by simply inputting a search term, explanations related to the search term can be automatically retrieved from each document in order, so that the operator can read each document and start from the start page of the explanation related to the search term. This has the effect of eliminating the need to judge the end page. In the above embodiment, the explanation regarding the search term was specified in page units, but if the start line, start column, end line, end column, etc. are registered in the search term document association file 5, the search terms can be specified in the retrieval unit. can be specified in more detail. Further, in the above embodiment, the search results are output to the printer device 3, but they may be output to a terminal device 2 having a display or a file device such as a magnetic disk device.

【発明の効果】本発明によれば、検索用語を一度指定す
るだけでドキュメントが複数あっても、各ドキュメント
中のその検索用語に関する説明箇所を、一括して出力で
きるという効果がある。また本発明によれば、各ドキュ
メント中のその検索用語に関する説明箇所の開始位置か
ら終了位置までを自動的に出力できるので、操作者は検
索用語についての知識がなくても、操作者自ら各ドキュ
メントを読んでその検索用語に関する説明箇所を判断す
る必要がなく、容易にその検索用語に関する説明を得る
ことができるという効果がある。According to the present invention, even if there are a plurality of documents, by specifying a search term once, the explanation portions related to the search term in each document can be output at once. Furthermore, according to the present invention, since it is possible to automatically output from the start position to the end position of the explanatory part related to the search term in each document, the operator can automatically output each document There is no need to read the search term to determine where it explains the search term, and the effect is that the explanation regarding the search term can be easily obtained.

[Brief explanation of the drawing]

【図１】本発明の全体構成を示すブロック図である。FIG. 1 is a block diagram showing the overall configuration of the present invention.

【図２】検索用語ドキュメント関連付けファイル５を詳
細に説明するための図である。FIG. 2 is a diagram for explaining a search term document association file 5 in detail.

【図３】検索用語ドキュメント関連付けファイル５とド
キュメントファイル４の関係を説明するための図である
。FIG. 3 is a diagram for explaining the relationship between a search term document association file 5 and a document file 4. FIG.

【図４】本実施例における処理の流れを示すフローチャ
ートである。FIG. 4 is a flowchart showing the flow of processing in this embodiment.

[Explanation of symbols]

１…ＣＰＵ、２…端末装置、３…プリンタ装置、４…ド
キュメントファイル、５…検索用語ドキュメント関連付
けファイル1...CPU, 2...Terminal device, 3...Printer device, 4...Document file, 5...Search term document association file

Claims

[Claims]

Claims: 1. A search device comprising input means for inputting a search term and output means for outputting search results based on the search term; a first storage means that stores description part information indicating a start point and an end point of the description part in the document for each of the search terms; and a second storage means that stores the contents of a plurality of related documents for each document. a storage means; the first storage means is searched for the search term inputted from the input means; document information and description part information corresponding to the search term are read; 2. A document retrieval device comprising: an extracting means for extracting data from a storage means.