JPH11134341A

JPH11134341A - System for displaying selection of descriptive information in hyper media description language

Info

Publication number: JPH11134341A
Application number: JP9292806A
Authority: JP
Inventors: Yoshikazu Arai; 良和新井
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1997-10-24
Filing date: 1997-10-24
Publication date: 1999-05-21

Abstract

PROBLEM TO BE SOLVED: To provide an HTML description information excerpt display system that automatically excerpts the parts of an HTML document considered necessary, combines them into one, and displays them, so that a page carrying the truly needed information can be found quickly and efficiently. SOLUTION: A character string search unit 13 searches for a character string containing a keyword in an HTML file that an HTML acquisition unit 12 has obtained from a URL entered through an input unit 11. A tag analysis unit 14 then finds the division tags that delimit this character string, and an information extraction unit 15 extracts the character string containing the keyword, which is displayed by an output unit 16.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical field of the invention] The present invention relates to an excerpt display system for information described in a hypermedia description language (hereinafter abbreviated as HTML: HyperText Markup Language), and in particular to an HTML description information excerpt display system that excerpts and displays the portions related to a given keyword from homepage information described in HTML on the WWW (World Wide Web).

[0002]

[Prior art] On the Internet, which has spread rapidly in recent years, an information retrieval system called the WWW has been built. On the WWW, various information and services are provided through homepages described in a language called HTML.

[0003] The number of homepages is now enormous, and it is difficult for a user to find the desired one among them unaided. The WWW therefore offers various search services that, for example, search for homepages containing a keyword when that keyword is entered.

[0004] Furthermore, the technique described in Japanese Patent Application Laid-Open No. Hei 9-171513 allows a user to easily recognize that a desired homepage has been opened.

[0005] The user performs such a keyword search, obtains the URL (Uniform Resource Locator, a kind of address) of a corresponding homepage, and uses it to display the homepage in a WWW browser and obtain the desired information.

[0006]

[Problem to be solved by the invention] The conventional homepage display system described above, that is, the HTML description information display system, displays the full text of the homepage, that is, of the HTML description information. When the keyword search returns a large number of URLs, the user must display every one of those homepages in a WWW browser and look by eye for the page that carries the truly needed information, which is a drawback.

[0007] An object of the present invention is to provide an HTML description information excerpt display system that can automatically excerpt the portions considered necessary from a homepage, that is, from HTML description information, and display them combined into one.

[0008]

[Means for solving the problem] The HTML description information excerpt display system of the first invention comprises: character string search means for searching description information described in a hypermedia description language for a keyword character string containing a specified keyword; tag analysis means for analyzing the tags in the description information and detecting pre-designated division tags; information extraction means for extracting the keyword character string from the description information based on the detected division tags and shaping it into excerpt information in the hypermedia description language; and display means for displaying the extracted keyword character string.

[0009] The HTML description information excerpt display system of the second invention is the system of the first invention, wherein the character string search means first deletes the tags from the description information, then searches for the keyword character string, and restores the deleted tags after the search.

[0010] The HTML description information excerpt display system of the third invention is the system of the first invention, wherein the information extraction means searches for the nearest designated division tag preceding the keyword and the nearest designated division tag following the keyword, and extracts the character string between them, including both the preceding and the following designated division tags.

[0011] The HTML description information excerpt display system of the fourth invention is the system of the third invention, wherein, when there are a plurality of extracted character string blocks containing the keyword, the information extraction means joins them into one character string block using character string delimiter tags.

[0012] The HTML description information excerpt display system of the fifth invention is the system of the third invention, wherein the information extraction means deletes an end tag found at the beginning of the extracted character string block and a start tag found at the end of the character string block, and otherwise adds the corresponding missing tag, except where a preceding start tag in the character string block already has a corresponding subsequent end tag.

[0013] The HTML description information excerpt display system of the sixth invention is the system of the third invention, wherein the information extraction means shapes the result into excerpt information using a tag indicating that the content is in the hypermedia description language, a tag indicating the header part of the information, and a tag indicating the body of the information.

[0014] The HTML description information excerpt display system of the seventh invention is the system of the first invention, wherein homepage information, which is description information in a hypermedia description language, is acquired via a network, and excerpt information containing the specified keyword is created from the homepage information and displayed.

[0015]

[Embodiments of the invention] Embodiments of the present invention are described next with reference to the drawings.

[0016] FIG. 1 is a block diagram showing one embodiment of the HTML description information excerpt display system of the present invention.

[0017] As shown in FIG. 1, the HTML description information excerpt display system of this embodiment comprises: an input unit 11 for entering a URL and a keyword; an HTML acquisition unit 12 that receives the URL from the input unit 11 and acquires the corresponding HTML file from a WWW server; a character string search unit 13 that receives the keyword from the input unit 11 and the HTML file from the HTML acquisition unit 12 and searches the HTML file for the keyword character string; a tag analysis unit 14 that receives from the character string search unit 13 the HTML file for which the keyword search has finished, scans its tags, and finds the tags around the keyword that are commonly used to delimit document structure; an information extraction unit 15 that extracts, as a document block, the portion that contains the keyword and is enclosed by the tags found by the tag analysis unit 14 and, when there are several such document blocks, merges them into one and puts the result into a displayable HTML file format; and an output unit 16 that displays the HTML file produced by the information extraction unit 15.

[0018] The HTML description information excerpt display system of this embodiment, configured as above, is implemented by the components shown in FIG. 2: an input device 21 such as a keyboard, a display 22 that displays information, and a computer 23 that performs the processing of the input unit 11, the HTML acquisition unit 12, the character string search unit 13, the tag analysis unit 14, the information extraction unit 15, the output unit 16, and so on.

[0019] FIG. 3 is a flowchart showing the operation of the HTML description information excerpt display system of this embodiment. The operation of the system is described below with reference to FIGS. 1 to 3.

[0020] First, the URL and the keyword supplied from the input device 21 are handled by the input unit 11 implemented on the computer 23, which passes the URL to the HTML acquisition unit 12 and the keyword to the character string search unit 13 (step 302).

[0021] On receiving the URL, the HTML acquisition unit 12 acquires the corresponding HTML file from the WWW server (step 303).

[0022] Next, the character string search unit 13 receives the keyword from the input unit 11 and the HTML file from the HTML acquisition unit 12, and performs a keyword search on the HTML file (step 304). It is preferable to remove the HTML tags and comments before searching and to put them back in their original positions once the search has finished. Tags and comments are not visible when the HTML file is viewed, so they are not targets of the keyword search, and leaving them in place can cause the search to misfire.
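The search of step 304 can be sketched roughly as follows. This is only an illustration under stated assumptions, not the implementation in the patent: instead of physically deleting and restoring the markup, it keeps an offset map from the visible text back to the original file, which has the same effect. The function name `find_keyword_offsets` and the simple `MARKUP` pattern are hypothetical.

```python
import re

# Assumed, deliberately simple pattern: an HTML comment or any tag.
MARKUP = re.compile(r"<!--.*?-->|<[^>]*>", re.DOTALL)


def find_keyword_offsets(html: str, keyword: str) -> list[int]:
    """Return offsets in the original HTML where each visible-text hit starts."""
    if not keyword:
        return []
    visible = []  # characters outside tags and comments
    origin = []   # origin[i] is the offset in html of visible[i]
    pos = 0
    for m in MARKUP.finditer(html):
        for i in range(pos, m.start()):
            visible.append(html[i])
            origin.append(i)
        pos = m.end()  # skip the tag or comment itself
    for i in range(pos, len(html)):
        visible.append(html[i])
        origin.append(i)

    text = "".join(visible)
    hits = []
    start = text.find(keyword)
    while start != -1:
        hits.append(origin[start])  # map back to a position in the HTML file
        start = text.find(keyword, start + 1)
    return hits
```

With this approach a keyword that appears only inside a comment or an attribute value is ignored, while a keyword split by inline markup (for example `key<b>word</b>`) is still found.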

[0023] If no part of the file matches the keyword (N branch of step 305), the output unit 16 is notified of this and displays a message to that effect (step 312).

[0024] If some part of the file matches the keyword (Y branch of step 305), the position of the keyword is stored (step 306).

[0025] Next, the tag analysis unit 14 receives the HTML file for which the keyword search has finished and searches it for the tags used as document delimiters (step 307). Based on the tags found, the information extraction unit 15 extracts the portion containing the keyword as a document block (step 308). The document block creation process is described later with reference to FIG. 4.

[0026] The system checks whether more than one document block has been created (step 309). If so (Y branch of step 309), the blocks are joined into a single document block with a delimiter tag such as the line break tag <BR> or the horizontal rule tag <HR> between them (step 310). An example is described later with reference to FIG. 5.

[0027] Next, the required HTML tags (shown in FIG. 6(b)) are added before and after the document block to create an HTML file (step 311).
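Step 311 might look like the sketch below. The contents of FIG. 6(b) are not reproduced in this text, so the exact tag set is an assumption: the sketch uses `<HTML>`, `<HEAD>`, and `<BODY>`, matching the three kinds of tag named in the sixth invention, and adds a `<TITLE>` element purely for illustration. The function name `wrap_as_html` is hypothetical.

```python
def wrap_as_html(block: str, title: str = "Excerpt") -> str:
    """Surround a merged document block with the tags a browser needs.

    Assumed tag set: <HTML>, <HEAD>, <BODY>. The <TITLE> element is an
    illustrative addition and is not mentioned in the source.
    """
    return (
        "<HTML>\n"
        "<HEAD><TITLE>" + title + "</TITLE></HEAD>\n"
        "<BODY>\n" + block + "\n</BODY>\n"
        "</HTML>\n"
    )
```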

[0028] The resulting HTML file is displayed on the computer 23 by the output unit 16 (step 312).

[0029] FIG. 4 is a flowchart showing the process of extracting a document block containing the keyword from the HTML file based on tags.

[0030] The system searches for tags, set in advance, that are commonly used as structural delimiters of a document. It first looks for such tags before the keyword in the HTML file and finds the one nearest to the keyword (step 402). It then looks in the same way for the same set of tags after the keyword and finds the one nearest to the keyword (step 403).

[0031] Next, the portion between the tags found before and after the keyword, including those tags, is extracted (step 404).
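Steps 402 to 404 can be illustrated with the following minimal sketch. The set of division tags in `DIVISION_TAGS` is an assumption (the actual set in FIG. 6(a) is not reproduced in this text), as are the function name `extract_block` and the fallback to the start or end of the file when no division tag is found on one side.

```python
import re

# Assumed set of structural delimiter tags; the real set is defined in FIG. 6(a).
DIVISION_TAGS = ("P", "DIV", "LI", "TABLE", "H1", "H2", "H3")

# Matches a start or end tag whose name is one of the division tags.
_TAG = re.compile(r"</?(%s)\b[^>]*>" % "|".join(DIVISION_TAGS), re.IGNORECASE)


def extract_block(html: str, keyword_pos: int, keyword_len: int) -> str:
    """Cut out the text from the nearest division tag before the keyword
    to the nearest division tag after it, including both tags."""
    before = None
    after = None
    for m in _TAG.finditer(html):
        if m.end() <= keyword_pos:
            before = m  # later matches overwrite earlier ones, so the nearest wins
        elif m.start() >= keyword_pos + keyword_len:
            after = m   # first tag after the keyword is the nearest one
            break
    # Assumption: fall back to the file boundaries when a side has no tag.
    start = before.start() if before else 0
    end = after.end() if after else len(html)
    return html[start:end]
```

Here `keyword_pos` is a position in the original HTML file, such as the one stored in step 306.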

[0032] As shown in FIG. 6(a), the HTML tags normally used for division come as a start tag and an end tag. After extraction, however, a block may have a missing or a surplus start or end tag, so this is checked (step 405). An end tag at the beginning of the document block, or a start tag at the end of the document block, is surplus and is removed. In any other case where start tags and end tags do not pair up, a tag is regarded as missing and is added (step 406).
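The check and repair of steps 405 and 406 could be sketched as below. The stack-based pairing is one possible reading of "start tags and end tags do not pair up", not necessarily the method of the patent, and `DIVISION_TAGS` and `balance_block` are hypothetical names with an assumed tag set.

```python
import re

# Assumed set of division tags; the real set is defined in FIG. 6(a).
DIVISION_TAGS = ("P", "DIV", "LI", "TABLE", "H1", "H2", "H3")
_NAMES = "|".join(DIVISION_TAGS)
_LEADING_END = re.compile(r"^\s*</(%s)\s*>" % _NAMES, re.IGNORECASE)
_TRAILING_START = re.compile(r"<(%s)\b[^>]*>\s*$" % _NAMES, re.IGNORECASE)
_ANY = re.compile(r"<(/?)(%s)\b[^>]*>" % _NAMES, re.IGNORECASE)


def balance_block(block: str) -> str:
    """Remove surplus division tags at the edges, then add missing ones."""
    # Surplus: an end tag at the very beginning, a start tag at the very end.
    block = _LEADING_END.sub("", block, count=1)
    block = _TRAILING_START.sub("", block, count=1)

    open_stack = []      # start tags still waiting for an end tag
    missing_starts = []  # end tags that had no start tag before them
    for m in _ANY.finditer(block):
        name = m.group(2).upper()
        if not m.group(1):
            open_stack.append(name)
        elif open_stack and open_stack[-1] == name:
            open_stack.pop()
        else:
            missing_starts.append(name)

    # Missing tags are added in nesting order on each side.
    prefix = "".join("<%s>" % n for n in reversed(missing_starts))
    suffix = "".join("</%s>" % n for n in reversed(open_stack))
    return prefix + block + suffix
```

For example, a block cut as `</P><P>has keyword inside` loses its leading `</P>` and gains a closing `</P>`.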

[0033] FIG. 5 is a flowchart showing an example of step 310, the process of joining document blocks. When there are several document blocks, they are merged into one. Specifically, the system checks whether a document block A is followed by a next document block B (step 502). If not (N branch of step 502), the operation ends. If so (Y branch of step 502), an <HR> tag is appended to the end of document block A (step 503), and the next document block B is appended after it (step 504). The blocks are joined with <HR> tags in this way until none remain, yielding a single document block.
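The loop of steps 502 to 504 amounts to the short sketch below. The function name `join_blocks` and the `separator` parameter are hypothetical; the loop is written out to mirror the flowchart, although in Python it is equivalent to `separator.join(blocks)`.

```python
def join_blocks(blocks: list[str], separator: str = "<HR>") -> str:
    """Merge document blocks into one, with a delimiter tag between them."""
    if not blocks:
        return ""
    merged = blocks[0]
    for next_block in blocks[1:]:  # step 502: is there a following block?
        merged += separator        # step 503: append <HR> to the current end
        merged += next_block       # step 504: append the next block
    return merged
```

A single block is returned unchanged, which corresponds to the N branch of step 502 being taken immediately.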

[0034] FIG. 7 shows an example of the result of extracting a document block containing the keyword from an HTML file according to FIG. 4.

[0035] As described above, the HTML description information excerpt display system of this embodiment can automatically excerpt the portions of a homepage considered necessary and display them combined into one, so that a page carrying the truly needed information can be found more quickly and efficiently than before.

[0036] This embodiment has been described using WWW homepages as an example, but the present invention is not limited to them. It is evident that the invention can also be applied to the excerpt display of any description information written in the HTML description language.

[0037]

[Effects of the invention] As described above, the HTML description information excerpt display system of the present invention can automatically excerpt the portions considered necessary from description information in the HTML description language and display them combined into one, with the effect that a page carrying the truly needed information can be found more quickly and efficiently than before.

[Brief description of the drawings]

FIG. 1 is a block diagram showing one embodiment of the HTML description information excerpt display system of the present invention.

FIG. 2 is a block diagram showing the configuration of one example of the HTML description information excerpt display system of this embodiment.

FIG. 3 is a flowchart showing an example of the operation of the HTML description information excerpt display system of this embodiment.

FIG. 4 is a detailed flowchart showing the information extraction operation of the HTML description information excerpt display system of this embodiment.

FIG. 5 is a detailed flowchart of the document block joining operation of the HTML description information excerpt display system of this embodiment.

FIG. 6: (a) is a tag diagram showing an example of division tags, and (b) is a tag diagram showing an example of required tags.

FIG. 7 is a diagram showing an example of the result of extracting a document block containing a keyword from an HTML file.

[Explanation of symbols]

11 Input unit
12 HTML acquisition unit
13 Character string search unit
14 Tag analysis unit
15 Information extraction unit
16 Output unit
21 Input device
22 Display
23 Computer

Claims

1. An excerpt display system for description information in a hypermedia description language, comprising: character string search means for searching description information described in a hypermedia description language for a keyword character string containing a specified keyword; tag analysis means for analyzing the tags in the description information and detecting pre-designated division tags; information extraction means for extracting the keyword character string from the description information based on the detected division tags and shaping it into excerpt information in the hypermedia description language; and display means for displaying the extracted keyword character string.

2. The excerpt display system for description information in a hypermedia description language according to claim 1, wherein the character string search means first deletes the tags from the description information, then searches for the keyword character string, and restores the deleted tags after the search.

3. The excerpt display system for description information in a hypermedia description language according to claim 1, wherein the information extraction means searches for the nearest designated division tag preceding the keyword and the nearest designated division tag following the keyword, and extracts the character string between them, including both the preceding and the following designated division tags.

4. The excerpt display system for description information in a hypermedia description language according to claim 3, wherein, when there are a plurality of extracted character string blocks containing the keyword, the information extraction means joins them into one character string block using character string delimiter tags.

5. The excerpt display system for description information in a hypermedia description language according to claim 3, wherein the information extraction means deletes an end tag found at the beginning of the extracted character string block and a start tag found at the end of the character string block, and otherwise adds the corresponding missing tag, except where a preceding start tag in the character string block already has a corresponding subsequent end tag.

6. The excerpt display system for description information in a hypermedia description language according to claim 3, wherein the information extraction means shapes the result into excerpt information using a tag indicating that the content is in the hypermedia description language, a tag indicating the header part of the information, and a tag indicating the body of the information.

7. The excerpt display system for description information in a hypermedia description language according to claim 1, wherein homepage information, which is description information in a hypermedia description language, is acquired via a network, and excerpt information containing the specified keyword is created from the homepage information and displayed.