JP5187064B2

JP5187064B2 - Web resource tracking management program, Web resource tracking management device, and Web resource tracking management method

Info

Publication number: JP5187064B2
Application number: JP2008210003A
Authority: JP
Inventors: 文人西野; 照宣粂
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2008-08-18
Filing date: 2008-08-18
Publication date: 2013-04-24
Anticipated expiration: 2028-08-18
Also published as: JP2010044708A

Description

本発明は、Ｗｅｂ資源追跡管理プログラム、Ｗｅｂ資源追跡管理装置及びＷｅｂ資源追跡管理方法に係り、特に１ページに１つ以上の資源を含む第一のＷｅｂ資源から他のＷｅｂ資源へ移動した資源を追跡する為のＷｅｂ資源追跡管理プログラム、Ｗｅｂ資源追跡管理装置及びＷｅｂ資源追跡管理方法に関する。 The present invention relates to a Web resource tracking management program, a Web resource tracking management apparatus, and a Web resource tracking management method, and more particularly, to a resource transferred from a first Web resource including one or more resources per page to another Web resource. The present invention relates to a Web resource tracking management program, a Web resource tracking management apparatus, and a Web resource tracking management method for tracking.

近年、インターネットやイントラネット等のネットワーク上のＷｅｂ資源へのアクセス技術が、よく利用されるようになっている。Ｗｅｂページ等のＷｅｂ資源へのアクセス技術は検索システム，Ｗｅｂアノテーションシステム，Ｗｅｂブラウザ，ブックマークシステムなどで利用されている。 In recent years, techniques for accessing Web resources on a network such as the Internet or an intranet are often used. Access technology to Web resources such as Web pages is used in search systems, Web annotation systems, Web browsers, bookmark systems, and the like.

検索システムは、ネットワーク上のＷｅｂページに対して、あらかじめインデックスを用意しておき、検索時に与えられたクエリー（処理要求）に対してインデックスを参照してＷｅｂページの所在情報（例えばＵＲＬ）を返すものである。 The search system prepares an index in advance for a Web page on the network, and returns the location information (for example, URL) of the Web page by referring to the index for a query (processing request) given at the time of search. Is.

一方、Ｗｅｂアノテーションシステムはネットワーク上のＷｅｂページに対して、そのＷｅｂページの作者と異なる別の第三者が、そのＷｅｂページに対し、二次情報として付箋，下線又はマーカなどのアノテーションを付与し、そのＷｅｂページの閲覧時に、付与したアノテーションをＷｅｂページと同時に表示したり、又は付与したアノテーションに基づいた処理をＷｅｂページに加えたりするものである。 On the other hand, in the Web annotation system, another third party different from the author of the Web page gives an annotation such as a tag, underline, or marker as secondary information to the Web page on the network. When the Web page is browsed, the added annotation is displayed at the same time as the Web page, or processing based on the added annotation is added to the Web page.

通常はＷｅｂページにアノテーションが付与されたとき、そのＷｅｂページのＵＲＬを保持しておき、そのＵＲＬで指定されるＷｅｂページの閲覧時にアノテーションを復元することで、時間をおいてのＷｅｂページの閲覧や、アノテーションを付与したアノテータと異なる第三者による閲覧であってもアノテーションを復元するものである。 Normally, when an annotation is added to a web page, the URL of the web page is stored, and the annotation is restored when the web page specified by the URL is browsed, so that the web page can be browsed over time. In addition, the annotation is restored even when browsing by a third party different from the annotator with the annotation.

これら検索システムやＷｅｂアノテーションシステム等は、Ｗｅｂページに含まれる記事などの資源の所在が時間を経ても変わらない場合、有効に作用する。しかし、ブログやニュース等のフロー型情報を扱うサイトでは、一つのＵＲＬで指定されるＷｅｂページに一つあるいは複数の記事を有する。また、ブログやニュース等のフロー型情報を扱うサイトでは、時間とともに別の記事が追加され、Ｗｅｂページ内の構成が変更される。 These search systems, Web annotation systems, and the like work effectively when the location of resources such as articles included in a Web page does not change over time. However, sites that handle flow-type information such as blogs and news have one or more articles on a Web page specified by a single URL. In addition, in sites that handle flow-type information such as blogs and news, other articles are added over time, and the configuration of the Web page is changed.

上述したように、検索システムやＷｅｂアノテーションシステム等は、Ｗｅｂページが必要となる度に、ＵＲＬで指定されるＷｅｂページを取得しているため、Ｗｅｂページ内の構成が変更されると、インデックスの内容やアノテーションの付与される位置が実際のＷｅｂページと対応しなくなることがあった。 As described above, since the search system, the Web annotation system, and the like acquire the Web page specified by the URL every time the Web page is required, if the configuration in the Web page is changed, the index In some cases, the position where the content or annotation is added does not correspond to the actual Web page.

このような事態に対応するため、特許文献１にはマーカ等を文書に付した後、その文書が変更された場合であっても、ある程度の追跡を可能とする内容が記載されている。特許文献１ではマーカ等の表示位置とその前後の文字列とを対応付けて記録しておき、マーカ等を表示する際に表示位置の前後の文字列を検査する。そして、特許文献１ではマーカ等の表示位置の前後の文字列が、記録されている文字列と異なる場合に、記録されている文字列の位置を文書中から探し出して、本来、表示する位置を取得してマーカ等を表示している。 In order to cope with such a situation, Patent Document 1 describes contents that enable tracking to some extent even if a document is changed after a marker or the like is attached to the document. In Patent Document 1, a display position of a marker or the like and a character string before and after the marker are recorded in association with each other, and the character string before and after the display position is inspected when the marker or the like is displayed. In Patent Document 1, when the character string before and after the display position of the marker or the like is different from the recorded character string, the position of the recorded character string is searched from the document, and the original display position is determined. The acquired marker is displayed.

特許文献１の技術を応用することで、検索システムやＷｅｂアノテーションシステム等はＷｅｂページ内の構成が変更されたとしても、インデックスの内容やアノテーションの付与される位置を実際のＷｅｂページと対応させることができた。
特開２０００−１６３４１４号公報 By applying the technology of Patent Document 1, the search system, the Web annotation system, and the like make the contents of the index and the position to which the annotation is attached correspond to the actual Web page even if the configuration in the Web page is changed. I was able to.
JP 2000-163414 A

従来、Ｗｅｂページ内の記事数が制限されている場合には「次のページ」というような形式で、最初のＵＲＬとは別のＵＲＬで指定されるＷｅｂページの中に記事が移動する場合もあった。 Conventionally, when the number of articles in a Web page is limited, an article may be moved into a Web page specified by a URL different from the first URL in a format such as “next page”. there were.

このような最初のＵＲＬとは別のＵＲＬで指定されるＷｅｂページの中に記事が移動する場合は、記事を閲覧しようとして元の所在（最初のＵＲＬ）にアクセスしたとき、自動的にその記事が移動した先（別のＵＲＬ）にジャンプ（移動した先の記事を表示）させたいという要求がある。 When an article moves into a Web page specified by a URL different from the first URL, when the original location (first URL) is accessed in order to browse the article, the article is automatically set. There is a request to jump (display the article of the moved destination) to the destination (another URL).

図１は最初のＵＲＬとは別のＵＲＬで指定されるＷｅｂページの中に記事が移動する例を表した一例の模式図である。ある時点で、１．ｈｔｍｌというファイル１０には、記事Ａ、記事Ｂ、記事Ｃがあったとする。一般的にニュースやブログなどは、ファイル１０に示すように１ページに複数の記事Ａ〜Ｃを含んでおり、新しい記事Ｃが上部に配置されることが多い。この時点で記事Ｂは、検索システムの場合、インデックスとして、Ｗｅｂアノテーションの場合、記事Bに付与されたアノテーションの位置として、１．ｈｔｍｌに存在すると記憶されている。 FIG. 1 is a schematic diagram illustrating an example in which an article moves into a Web page specified by a URL different from the initial URL. At some point: Assume that there are an article A, an article B, and an article C in the file 10 called html. In general, news, blogs, and the like include a plurality of articles A to C on one page as shown in the file 10, and a new article C is often arranged at the top. At this point, the article B is used as an index in the case of a search system, and in the case of a web annotation, as the position of the annotation given to the article B. It is stored as existing in html.

ここでは仮にｉｎｄｅｘ．ｈｔｍｌというファイル１２が各記事の所在を記憶しているものとする。ある時間が経過すると、１．ｈｔｍｌには記事Ｄ、記事Ｅが追加され、一定の条件を超えた記事が他のＷｅｂページに移動される。図１では、１ページ内に３記事までと制限されているとして、あふれた記事Ａと記事Ｂとが２．ｈｔｍｌというファイル１１に移動したことを意味している。このように記事が移動した場合は、記事の移動先を自動的に追跡し、移動した先の記事を表示させたいという要求があった。 Here, index. Assume that a file 12 called html stores the location of each article. When a certain time has passed, Article D and article E are added to html, and articles that exceed a certain condition are moved to other Web pages. In FIG. 1, it is assumed that the number of articles A and articles B is 2. This means that the file has been moved to the file 11 called html. When an article moves in this way, there has been a request to automatically track the destination of the article and display the article at the destination.

検索システムではロボットが関連する資源を再収集し、インデックスを再構築することにより、インデックスが正しい位置を示すようにしている。検索システムでは再収集する間隔を短くすることにより、正しくなるまでの期間を短くするようにしている。しかしながら、これは確率的な問題であり、正しくなるまでの期間を零にできない。 In the search system, the robot re-collects the related resources and rebuilds the index so that the index indicates the correct position. The search system shortens the period until it is correct by shortening the recollection interval. However, this is a probabilistic problem and the time to correct cannot be zero.

Ｗｅｂアノテーションシステムは、キーワード（あるいはｎｇｒａｍ）を抽出して注目アイテム識別情報とし、この注目アイテム識別情報を検索システムに与えることで、検索システムが指し示すところにアノテーションの付与先が移動したと認識できる。しかしながら、前述のとおり、これは検索システムが常に正しい位置を指し示していることが前提となっており、この前提が成り立たないこともある。 The Web annotation system extracts a keyword (or ngram) as attention item identification information, and gives this attention item identification information to the search system, thereby recognizing that the annotation assignment destination has moved to the location indicated by the search system. However, as described above, this is based on the assumption that the search system always points to the correct position, and this assumption may not be satisfied.

従来は、記事の移動とロボットによる関連するＷｅｂページの位置の再収集とが非同期であるため、インデックスを再構築するまでに時間が掛かり、タイミングによっては移動した記事を検索できないという問題があった。なお、検索のインデックスをオンデマンドで起動して収集することも可能であるが、インデックスの再構築は時間が掛かり、高速に注目している記事を追跡できないという問題があった。 Conventionally, the movement of articles and the recollection of the position of the related web page by the robot are asynchronous, so it takes time to rebuild the index, and there is a problem that the moved articles cannot be searched depending on the timing. . Although it is possible to start and collect search indexes on demand, it takes time to rebuild the index, and there is a problem that it is not possible to track articles focused on at high speed.

本発明の一実施形態は、上記の点に鑑みなされたもので、１ページに１つ以上の資源を含む第一のＷｅｂ資源から他のＷｅｂ資源へ移動した資源を容易且つ迅速に追跡できるＷｅｂ資源追跡管理プログラム、Ｗｅｂ資源追跡管理装置及びＷｅｂ資源追跡管理方法を提供することを目的とする。 One embodiment of the present invention has been made in view of the above points, and a Web that can easily and quickly track a resource that has moved from a first Web resource including one or more resources to one page to another Web resource. An object is to provide a resource tracking management program, a Web resource tracking management device, and a Web resource tracking management method.

上記課題を解決するため、本発明の一実施形態は、コンピュータに、１ページに１つ以上の資源を含む第一のＷｅｂ資源から他のＷｅｂ資源へ移動した前記資源を追跡させるＷｅｂ資源追跡管理プログラムであって、前記コンピュータを、前記第一のＷｅｂ資源の所在情報と、前記第一のＷｅｂ資源に含まれる注目資源を識別する為の注目資源識別情報との対応関係を受け取る受取手段と、利用者に選択された前記所在情報に対応する前記Ｗｅｂ資源を取り出す取出手段と、前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在するかを、前記注目資源識別情報により確認する存在確認手段と、前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在しなければ前記Ｗｅｂ資源に含まれる前記他のＷｅｂ資源へのリンクを抽出する抽出手段と、前記抽出手段が抽出した前記リンクで示される所在情報を次の所在情報として前記取出手段へ通知する追跡手段と、前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在すれば前記Ｗｅｂ資源を前記利用者の端末に送信する送信手段として機能させるためのＷｅｂ資源追跡管理プログラムであることを特徴とする。 In order to solve the above-described problem, an embodiment of the present invention is a Web resource tracking management that causes a computer to track the resource that has been moved from a first Web resource including one or more resources per page to another Web resource. A receiving means for receiving correspondence between the location information of the first Web resource and the target resource identification information for identifying the target resource included in the first Web resource; Extraction means for extracting the Web resource corresponding to the location information selected by the user, and existence confirmation for confirming whether the attention resource exists in the Web resource extracted by the extraction means, based on the attention resource identification information And a link to the other Web resource included in the Web resource if the resource of interest does not exist in the Web resource extracted by the extraction unit The extraction means to be output, the tracking means for notifying the location information indicated by the link extracted by the extraction means to the extraction means as the next location information, and the resource of interest exists in the Web resource extracted by the extraction means In this case, it is a Web resource tracking management program for causing the Web resource to function as a transmission unit that transmits the Web resource to the user terminal.

なお、本発明の一実施形態の構成要素、表現または構成要素の任意の組合せを、方法、装置、システム、コンピュータプログラム、記録媒体、データ構造などに適用したものも本発明の態様として有効である。 In addition, what applied the component, the expression, or arbitrary combinations of the component of one Embodiment of this invention to a method, an apparatus, a system, a computer program, a recording medium, a data structure, etc. is also effective as an aspect of this invention. .

上述の如く、本発明の一実施形態によれば、１ページに１つ以上の資源を含む第一のＷｅｂ資源から他のＷｅｂ資源へ移動した資源を容易且つ迅速に追跡できる。 As described above, according to an embodiment of the present invention, resources moved from a first Web resource including one or more resources per page to another Web resource can be easily and quickly tracked.

次に、本発明を実施するための最良の形態を、以下の実施例に基づき図面を参照しつつ説明していく。 Next, the best mode for carrying out the present invention will be described based on the following embodiments with reference to the drawings.

（本実施例の概要）
本実施例は、Ｗｅｂページ間の移動がおこる記事などの注目アイテム（注目資源）へのジャンプにおいて、指定された所在（ＵＲＬ）に注目アイテムがない場合、Ｗｅｂページ内の「次のページ」というような形式で、別のＵＲＬで指定されるＷｅｂページにジャンプするリンク（以下、ナビゲーションリンクという）を探し出し、そのナビゲーションリンクを辿って目的の注目アイテムを見つけるものである。したがって、本実施例では他のＷｅｂページへ移動した注目アイテムを追跡できる。 (Overview of this example)
In the present embodiment, when there is no item of interest at a specified location (URL) in a jump to an item of interest (attention resource) such as an article that moves between Web pages, it is referred to as “next page” in the Web page. In such a format, a link that jumps to a Web page specified by another URL (hereinafter referred to as a navigation link) is searched, and the target attention item is found by following the navigation link. Therefore, in this embodiment, it is possible to track the item of interest that has moved to another Web page.

本実施例は、インターネットやイントラネット等のネットワーク上の記事等の資源に対する検索やアノテーションにおいて、無秩序に（人の自由意志によって）行われる移動ではないシステマティックな記事の移動があった場合でも、検索結果やアノテーションされた資源の閲覧において、資源の移動を反映した移動先へのアクセス（閲覧など）を可能とするものである。 In this embodiment, in the search and annotation for resources such as articles on networks such as the Internet and Intranet, even if there is a systematic article movement that is not a disorderly movement (by human free will), the search result In addition, when browsing an annotated resource, access (viewing etc.) to a destination reflecting the movement of the resource is made possible.

例えば、"hoge"という文字列を含む記事Aが「http://foo/bar1.html」のＷｅｂページにあったとする。検索エンジンのクローラは、このＷｅｂページを収集して"hoge"という文字列の所在として「http://foo/bar1.htmlを記憶したとする。その後、「http://foo/」というサイトでは記事が追加され、記事Aが「http://foo/bar2.html」に移動したとする。このとき検索エンジンの利用者が、クエリーとして"hoge"を与えて検索した時、従来の検索エンジンでは文字列"hoge"の所在として「http://foo/bar1.html」を返す。一方、本実施例では文字列"hoge"の所在として、記事Aの移動先である「http://foo/bar2.html」を返す（あるいは返された所在をクリックした時に移動先である「http://foo/bar2.html」にジャンプする）ことができる。 For example, it is assumed that an article A including a character string “hoge” is on a web page “http: //foo/bar1.html”. The search engine crawler collects this Web page and stores “http: //foo/bar1.html as the location of the string“ hoge ”. After that, the site“ http: // foo / ” Then, an article is added, and article A is moved to "http: //foo/bar2.html". At this time, when the user of the search engine performs a search by giving “hoge” as a query, the conventional search engine returns “http: //foo/bar1.html” as the location of the character string “hoge”. On the other hand, in this embodiment, as the location of the character string “hoge”, “http: //foo/bar2.html”, which is the destination of the article A, is returned (or the destination “ http: //foo/bar2.html ”).

あるいはＷｅｂアノテーションという応用では、「http://foo/bar1.html」にあった記事Aにアノテーションをしたとき、Ｗｅｂアノテーションシステムがそのアノテーションの所在として「http://foo/bar1.html」を記憶したとする。その後、「http://foo/」というサイトでは記事が追加され、記事Aは「http://foo/bar2.html」に移動したとする。このとき、アノテーションの所在を記したところから当アノテーションを付与した記事にジャンプしようとすると、従来のＷｅｂアノテーションシステムでは「http://foo/bar1.html」にジャンプしてしまい、アノテーションを付与した記事Aがみつからない。一方、本実施例では記事Aの移動先である「http://foo/bar2.html」にジャンプすることで、記事Aと、そこに付与されたアノテーションとを表示することができる。 Or in an application called Web Annotation, when annotating Article A that was in "http: //foo/bar1.html", the Web annotation system uses "http: //foo/bar1.html" as the location of the annotation. Suppose you remember it. After that, an article is added to the site “http: // foo /”, and article A is moved to “http: //foo/bar2.html”. At this time, if you try to jump to the article with the annotation from the location of the annotation, the conventional Web annotation system jumps to “http: //foo/bar1.html” and added the annotation. Article A is not found. On the other hand, in this embodiment, by jumping to “http: //foo/bar2.html” which is the destination of the article A, the article A and the annotation attached thereto can be displayed.

次に、サンプルのＨＴＭＬテキストで説明する。例えば、「http://foo/bar1.html」のＷｅｂページが図２に示すようなものであるとする。図２はＷｅｂページの一例のイメージ図である。図２のＷｅｂページは記事が水平線で区切られ、各記事に日付、タイトル、記事内容が含まれている。そして、図２のＷｅｂページは新しい記事が登録されると上に追加される。図２のＷｅｂページには一定の記事数だけが掲載される。図２のＷｅｂページは底に「前のページ」というリンクがあって、古くなった記事がリンク先に掲載されている。例えば記事３０１２を含むインデックスを作成し、図３のようなリンクページが作成されたとする。 Next, sample HTML text will be described. For example, assume that the web page “http: //foo/bar1.html” is as shown in FIG. FIG. 2 is an image diagram of an example of a Web page. In the Web page of FIG. 2, articles are separated by a horizontal line, and each article includes a date, title, and article content. The Web page in FIG. 2 is added above when a new article is registered. Only a certain number of articles are posted on the Web page in FIG. The web page in FIG. 2 has a link “Previous Page” at the bottom, and an old article is posted at the link destination. For example, assume that an index including an article 3012 is created and a link page as shown in FIG. 3 is created.

図３はリンクページの一例のイメージ図である。図３のリンクページは、通常のＷｅｂにおいて、例えば図４のようなＨＴＭＬファイルで表現される。図４は、リンクページを表現するＨＴＭＬファイルの一例のイメージ図である。 FIG. 3 is an image diagram of an example of a link page. The link page of FIG. 3 is expressed by an HTML file as shown in FIG. FIG. 4 is an image diagram of an example of an HTML file representing a link page.

図３のリンクページの「記事３０１２」の部分をクリックすると、記事３０１２をもともと含んでいたＷｅｂページにジャンプすることになる。例えば図４のＨＴＭＬファイルの場合は、「http://foo/bar1.html」のＷｅｂページにジャンプする。図３のリンクページは文字列"hoge"の検索結果であっても、アノテーションした所在の一覧リストであってもよい。 When the “article 3012” portion of the link page in FIG. 3 is clicked, the page jumps to the Web page that originally contained the article 3012. For example, in the case of the HTML file shown in FIG. 4, a jump is made to the web page “http: //foo/bar1.html”. The link page in FIG. 3 may be a search result of the character string “hoge” or a list of annotated locations.

しかし、ニュースやブログなどのＷｅｂページは時間が経つと記事が追加される。例えば図２のＷｅｂページは時間が経つと記事が追加され、例えば図５のＷｅｂページのようになっているかもしれない。図５は、記事が追加されたＷｅｂページの一例のイメージ図である。 However, articles are added to web pages such as news and blogs over time. For example, an article may be added to the Web page in FIG. 2 over time, and may be like the Web page in FIG. 5, for example. FIG. 5 is an image diagram of an example of a Web page to which an article has been added.

図５のＷｅｂページは、図２のＷｅｂページとＵＲＬが同じであるが、記事３０１４及び３０１５が追加され、図２のＷｅｂページに掲載されていた記事３０１１及び３０１２が「前のページ」というナビゲーションリンク先の図６に示すようなＷｅｂページに移動している。図６はナビゲーションリンク先のＷｅｂページの一例のイメージ図である。例えば図６のＷｅｂページのＵＲＬは、「http://foo/bar2.html」であるとする。 The Web page in FIG. 5 has the same URL as the Web page in FIG. 2, but articles 3014 and 3015 are added, and the articles 3011 and 3012 posted on the Web page in FIG. It has moved to the Web page as shown in FIG. FIG. 6 is an image diagram of an example of a Web page as a navigation link destination. For example, assume that the URL of the Web page in FIG. 6 is “http: //foo/bar2.html”.

従来、図３のリンクページは注目アイテム（記事３０１２）の所在を、インデックスが作成された時点での物理的位置「http://foo/bar1.html」として記憶している。インデックスが作成された後で記事３０１２が「http://foo/bar2.html」のＷｅｂページに移動したとすると、図３のリンクページから記事３０１２へのジャンプでは、記事３０１２の無い物理的位置「http://foo/bar1.html」のＷｅｂページが表示されてしまう。 Conventionally, the link page of FIG. 3 stores the location of the item of interest (article 3012) as the physical location “http: //foo/bar1.html” at the time the index is created. If the article 3012 is moved to the web page “http: //foo/bar2.html” after the index is created, the physical position where the article 3012 is not found in the jump from the link page of FIG. The web page “http: //foo/bar1.html” is displayed.

そこで、本実施例では、リンクページから注目アイテムである記事３０１２にアクセスしようとした時、インデックスが作成された時点での物理的位置「http://foo/bar1.html」のＷｅｂページではなく、「前のページ」というアンカーで示されたナビゲーションリンクを利用して、現時点で注目アイテムである記事３０１２が存在する「http://foo/bar2.html」のＷｅｂページにジャンプしようとするものである。 Therefore, in this embodiment, when trying to access the article 3012 that is the item of interest from the link page, it is not the Web page of the physical position “http: //foo/bar1.html” at the time when the index is created. , Using the navigation link indicated by the anchor “Previous Page”, to jump to the web page of “http: //foo/bar2.html” where the article 3012 which is the current item of interest exists It is.

すなわち、検索システムでは図３のリンクページの記事３０１２をクリックすると図５のＷｅｂページではなく図６のＷｅｂページが表示される。また、Ｗｅｂアノテーションシステムでは図６のＷｅｂページにアノテーションが付与された図７に示すＷｅｂページが表示される。図７はナビゲーションリンク先のＷｅｂページに復元したアノテーションの一例のイメージ図である。 That is, in the search system, when the article 3012 on the link page in FIG. 3 is clicked, the Web page in FIG. 6 is displayed instead of the Web page in FIG. In the Web annotation system, the Web page shown in FIG. 7 in which the annotation is added to the Web page in FIG. 6 is displayed. FIG. 7 is an image diagram of an example of the annotation restored to the navigation link destination Web page.

なお、上記では説明を容易化する為、「記事３０１０」のように記事番号で記事が一意に特定できるようにしたが、実際、このような記事番号が付与されているのではなく、通常のテキストがタイトルになっている。 In the above, for ease of explanation, the article number can be uniquely identified by the article number as in “Article 3010”. The text is the title.

一般に記事の移動は無秩序に行われるのではなく、記事の移動元のＷｅｂページから移動先のＷｅｂページへナビゲーションリンクが張られている。なお、ナビゲーションリンクとは、「次に」とか「next」などの目印が付けられた、その記事（資源）に関連する連続的なＷｅｂページへのリンクである。従来は、単に所在情報（ＵＲＬ）あるいは特徴情報で記事にアクセスしていた。 In general, the movement of articles is not performed randomly, but a navigation link is provided from the Web page of the article movement source to the Web page of the movement destination. The navigation link is a link to a continuous Web page related to the article (resource) with a mark such as “next” or “next”. Conventionally, articles are accessed simply by location information (URL) or feature information.

本実施例は、記事の移動元のＷｅｂページから移動先のＷｅｂページへナビゲーションリンクが張られていることに注目し、注目アイテム（注目記事）が見つからなかったときにナビゲーションリンクを自動的に辿ることで注目アイテムにアクセスできるようにしている。 In this embodiment, attention is paid to the fact that a navigation link is extended from the Web page that is the source of the article to the Web page that is the destination, and the navigation link is automatically traced when the item of interest (the article of interest) is not found. This makes it possible to access attention items.

（本実施例の原理）
図８は、本実施例の原理を表した説明図である。図８の資源追跡部２０は、注目アイテム情報取出部２１，Ｗｅｂ情報取出部２２，注目アイテム存在確認部２３，ナビゲーションリンク抽出部２４，ナビゲーション実行部２５を有する構成である。 (Principle of this embodiment)
FIG. 8 is an explanatory diagram showing the principle of this embodiment. The resource tracking unit 20 of FIG. 8 has a configuration including an attention item information extraction unit 21, a Web information extraction unit 22, an attention item presence confirmation unit 23, a navigation link extraction unit 24, and a navigation execution unit 25.

注目アイテム情報取出部２１は、アドレス（ＵＲＬ）と共に、注目アイテム（記事、ブロックなど）の識別情報（キーワードや記事ＩＤ、記事のハッシュコード値など）を注目アイテム情報として受け取る。 The attention item information extraction unit 21 receives the identification information (keyword, article ID, article hash code value, etc.) of the attention item (article, block, etc.) as attention item information together with the address (URL).

Ｗｅｂ情報取出部２２は、注目アイテム情報取出部２１が受け取ったアドレスで指定されるＷｅｂ資源（Ｗｅｂページ）を取得する。注目アイテム存在確認部２３は、Ｗｅｂ情報取出部２２が取得したＷｅｂ資源に、注目アイテム情報取出部２１が受け取った注目アイテムの識別情報と一致する資源（記事）が存在するか否かを検査する。 The Web information extraction unit 22 acquires a Web resource (Web page) specified by the address received by the attention item information extraction unit 21. The attention item existence confirmation unit 23 checks whether the Web resource acquired by the Web information extraction unit 22 includes a resource (article) that matches the identification information of the attention item received by the attention item information extraction unit 21. .

注目アイテムの識別情報と一致する資源が存在しない場合、ナビゲーションリンク抽出部２４は、そのＷｅｂ資源のナビゲーションリンクをアンカー文字列あるいは文書構造知識に基づいて探し出す。ナビゲーション実行部２５は、ナビゲーションリンクを組織的に辿りながら、注目アイテムの識別情報と一致する資源を有するＷｅｂ資源を探す。 If there is no resource that matches the identification information of the item of interest, the navigation link extraction unit 24 searches for the navigation link of the Web resource based on the anchor character string or the document structure knowledge. The navigation execution unit 25 searches for a Web resource having a resource that matches the identification information of the item of interest while systematically following the navigation link.

資源追跡部２０は、ステップＳ２１で注目アイテム情報取出部２１が受け取ったアドレスについて、Ｗｅｂ情報取出部２２，注目アイテム存在確認部２３，ナビゲーションリンク抽出部２４，ナビゲーション実行部２５の処理を繰り返し行う。 The resource tracking unit 20 repeatedly performs the processes of the Web information extraction unit 22, the item of interest presence check unit 23, the navigation link extraction unit 24, and the navigation execution unit 25 for the address received by the item-of-interest information extraction unit 21 in step S21.

以上のように、資源追跡部２０は取得したＷｅｂ資源の注目アイテムの存在を確認しながら、注目アイテムがない時にナビゲーションリンクを辿って注目アイテムを探す。注目アイテムがオリジナルな所在位置からナビゲーションリンクで辿れる所在位置へ移動したのであれば、資源追跡部２０は、どのタイミングでアクセスしても目当ての注目アイテムにアクセスできる。 As described above, the resource tracking unit 20 searches for an attention item by following the navigation link when there is no attention item while confirming the presence of the attention item of the acquired Web resource. If the item of interest has moved from the original location to a location that can be followed by a navigation link, the resource tracking unit 20 can access the target item of interest at any timing.

これは、通常のニュースやブログなどのＷｅｂページは時間経過とともに記事が増えて行くタイプのＷｅｂページに対する検索システム、Ｗｅｂアノテーションシステム、Ｗｅｂブラウザ，ブックマークシステム等のインターネット関連製品において効果がある。 This is effective in Internet-related products such as a search system, a Web annotation system, a Web browser, and a bookmark system for a Web page of a type in which articles increase over time for normal Web pages such as news and blogs.

検索やアノテーションのインデックスから注目アイテムが真に存在するＷｅｂページへジャンプするようなシステムとしては、どのタイミングで上記の資源を探すかで、典型的には以下の３つがある。 There are typically the following three systems for jumping from a search or annotation index to a Web page where an item of interest really exists, depending on when the above resources are searched.

第１のシステムは、リンクページに一般のリンクを利用する。第１のシステムは、リンクがクリックされてジャンプするときに資源を追跡する。第１のシステムはブラウザ等の閲覧器の拡張あるいは専用閲覧器のような形態で実施される。 The first system uses a general link for the link page. The first system tracks resources when a link is clicked to jump. The first system is implemented in the form of an extension of a browsing device such as a browser or a dedicated browsing device.

第２のシステムは、リンクページにリンクを表示する際に資源を追跡し、追跡した結果をリンクとして表示する。第２のシステムは、例えば検索装置が検索結果を表示するときに実施される。 The second system tracks resources when displaying the link on the link page, and displays the tracked result as a link. The second system is implemented, for example, when the search device displays search results.

また、第３のシステムはリンクページにパラメータ付きの特定の所在へのリンクを利用する。第３のシステムは、そのリンクがクリックされると、特定の所在で資源の追跡が実施され、追跡した結果に基づき目的の所在へジャンプする。 The third system uses a link to a specific location with parameters on the link page. When the link is clicked, the third system performs resource tracking at a specific location, and jumps to the target location based on the tracked result.

図９は資源追跡部の一例の構成図である。本実施例では注目アイテムの存在を確認するための注目アイテム識別情報を同時に保有することを前提としている。本実施例では指定された注目アイテムに対してその特徴、具体的に検索では検索用インデックスが注目アイテム識別情報になり、Ｗｅｂアノテーションではアノテーションを付与した際に付与された場所の周辺情報（周辺のテキスト、記事タイトル、記事本文、それらのハッシュコード値など）が注目アイテム識別情報になる。 FIG. 9 is a configuration diagram of an example of the resource tracking unit. In this embodiment, it is assumed that attention item identification information for confirming the presence of the attention item is simultaneously held. In this embodiment, the feature of the specified attention item, specifically, the search index becomes the attention item identification information in the search, and in the Web annotation, the peripheral information (peripheral The text, article title, article body, hash code values thereof, etc.) are the item identification information.

図９の資源追跡部２０は、注目アイテム情報取出部２１，Ｗｅｂ情報取出部２２，注目アイテム存在確認部２３，ナビゲーションリンク抽出部２４，ナビゲーション実行部２５，Ｗｅｂ資源情報出力部２６，ナビゲーション管理部２７，ナビゲーションリンクＤＢ２８及びナビゲーションリンクテーブル２９を有する構成である。 The resource tracking unit 20 in FIG. 9 includes an attention item information extraction unit 21, a Web information extraction unit 22, an attention item presence confirmation unit 23, a navigation link extraction unit 24, a navigation execution unit 25, a Web resource information output unit 26, and a navigation management unit. 27, a navigation link DB 28 and a navigation link table 29.

注目アイテム情報受取部２１は、所在情報（ＵＲＬ）と共に、注目アイテム識別情報を受け取る。Ｗｅｂ情報取出部２２は、所在情報に対応するＷｅｂ資源をＷｅｂサーバ３０から取り出す。注目アイテム存在確認部２３は、取り出したＷｅｂ資源の中に注目アイテムが存在するかどうか、注目アイテム識別情報を使って確認する。 The attention item information receiving unit 21 receives attention item identification information together with location information (URL). The Web information extraction unit 22 extracts Web resources corresponding to the location information from the Web server 30. The attention item existence confirmation unit 23 confirms whether or not the attention item exists in the extracted Web resource by using attention item identification information.

取り出したＷｅｂ資源の中に注目アイテムが存在すれば、注目アイテム存在確認部２３はＷｅｂ資源情報出力部２６から、そのＷｅｂ資源の所在情報ないしはＷｅｂ資源自身を出力する。 If there is an item of interest in the extracted Web resource, the item-of-interest existence confirmation unit 23 outputs the location information of the Web resource or the Web resource itself from the Web resource information output unit 26.

取り出したＷｅｂ資源の中に注目アイテムが存在しなければ、ナビゲーションリンク抽出部２４は、そのＷｅｂ資源から移動してしまった注目アイテムを追跡するため、現在のＷｅｂ資源からナビゲーションリンクを抽出する。ナビゲーションリンク抽出部２４は図１０に示すようなナビゲーションリンクテーブル２９を参照し、現在のＷｅｂ資源からナビゲーションリンクを特定する。 If there is no item of interest in the extracted Web resource, the navigation link extraction unit 24 extracts the navigation link from the current Web resource in order to track the item of interest that has moved from the Web resource. The navigation link extraction unit 24 refers to a navigation link table 29 as shown in FIG. 10 and identifies a navigation link from the current Web resource.

図１０はナビゲーションリンクテーブルの一例の構成図である。ナビゲーションリンクテーブル２９は、ナビゲーションリンクに付けられる「前のページ」，「→」などの目印が記載されている。 FIG. 10 is a configuration diagram of an example of the navigation link table. The navigation link table 29 describes marks such as “previous page” and “→” attached to the navigation link.

ナビゲーションリンク抽出部２４は抽出したナビゲーションリンクをナビゲーション管理部２７に通知する。ナビゲーション管理部２７は一つのＷｅｂ資源中にナビゲーションリンクが一つとは限らないので、通知されたナビゲーションリンクをナビゲーションリンクＤＢ２８に保持させる。ナビゲーション管理部２７は、どういう順番でナビゲーションリンクを辿るか、あるいは一度辿ったところを重複して辿らないように管理しながらナビゲーションリンクＤＢ２８からナビゲーションリンクを一つずつ取り出す。 The navigation link extraction unit 24 notifies the navigation management unit 27 of the extracted navigation link. Since the navigation management unit 27 does not necessarily have one navigation link in one Web resource, the navigation management unit 27 stores the notified navigation link in the navigation link DB 28. The navigation management unit 27 takes out the navigation links from the navigation link DB 28 one by one while managing the navigation links so that the navigation links are not traced in order or not to be repeated.

そして、ナビゲーション実行部２５はナビゲーションリンクＤＢ２８から取り出されたナビゲーションリンクの示す所在情報を取り出し、取り出した所在情報をＷｅｂ情報取出部２２に通知する。 Then, the navigation execution unit 25 extracts the location information indicated by the navigation link extracted from the navigation link DB 28 and notifies the Web information extraction unit 22 of the extracted location information.

例えば資源追跡部２０のない通常の検索システムは図１１のようになる。図１１は資源追跡部のない通常の検索システムを表した構成図である。利用者はブラウザ５０を操作してキーワード等のクエリーを検索装置４０に入力する。検索装置４０は利用者に入力されたクエリーで検索を行う。 For example, a normal search system without the resource tracking unit 20 is as shown in FIG. FIG. 11 is a block diagram showing a normal search system without a resource tracking unit. The user operates the browser 50 to input a query such as a keyword into the search device 40. The search device 40 performs a search using a query input by the user.

具体的に、検索装置４０はクエリーからキーの論理式を構築し、インデックス表に基づいて所在情報を求める。図１２はインデックス表の一例の構成図である。インデックス表はキーと所在情報とを対応付けているものである。なお、クエリーからキーの論理式を構築する部分は本実施例の本質ではないので、ここでは単一のキーを例に説明する。 Specifically, the search device 40 constructs a logical expression of a key from the query and obtains location information based on the index table. FIG. 12 is a configuration diagram of an example of an index table. The index table associates keys with location information. Note that a part for constructing a logical expression of a key from a query is not the essence of the present embodiment, and therefore, a single key will be described as an example here.

例えば図１２のインデックス表の場合、検索装置４０はクエリーとして"hoge"が与えられたとすると、インデクス表から所在情報として「foo/bar1.html」、「foo/bar9.html」を得る。そして、検索装置４０は、それぞれの所在の要約を合わせて検索結果のＨＴＭＬ文書４１を作成し、ブラウザ５０に図１３の検索結果をリンクページとして表示する。 For example, in the case of the index table of FIG. 12, if “hoge” is given as a query, the search device 40 obtains “foo / bar1.html” and “foo / bar9.html” as location information from the index table. Then, the search device 40 creates a search result HTML document 41 by combining the location summaries, and displays the search result of FIG. 13 on the browser 50 as a link page.

図１３は検索結果画面の一例のイメージ図である。利用者は検索結果４１の中から、その実体のＷｅｂ資源にアクセスするためにＷｅｂ資源へのリンク（例えば「foo/bar1のタイトル」と表示されている部分）をクリックする。 FIG. 13 is an image diagram of an example of a search result screen. The user clicks a link to the Web resource (for example, a part displayed as “foo / bar1 title”) in the search result 41 in order to access the Web resource of the entity.

ブラウザ５０は、クリックされたリンク先「foo/bar1.html」のアドレスに対応するＷｅｂ資源３１をＷｅｂサーバ３０から取り出し、そのＷｅｂ資源３１を表示する。本実施例は検索であるかどうかは本質ではなく、注目アイテムへのリンクであればよい。以下の説明では、図３のリンクページによって説明する。 The browser 50 retrieves the Web resource 31 corresponding to the address of the clicked link destination “foo / bar1.html” from the Web server 30, and displays the Web resource 31. In the present embodiment, whether or not the search is a search is not essential, and may be a link to an item of interest. In the following description, the description will be given with reference to the link page of FIG.

図３のリンクページでは、記事３０１２のHTMLは図４に示すようであり、記事３０１２のリンクをクリックしたら「http://foo/bar1.html」のアドレスに対応する図５のようなＷｅｂページが表示されることになる。 In the link page of FIG. 3, the HTML of the article 3012 is as shown in FIG. 4. When the link of the article 3012 is clicked, the web page as shown in FIG. 5 corresponding to the address “http: //foo/bar1.html” Will be displayed.

例えば資源追跡部２０のある検索システムは例えば図１４のようになる。図１４は資源追跡部のある検索システムの第一実施例の構成図である。図１４の検索システムは、リンクをクリックしてＷｅｂ資源３１にアクセスしようとした時、資源の追跡を行い、真に注目アイテムが存在するＷｅｂ資源３１を表示するものである。 For example, a search system with the resource tracking unit 20 is as shown in FIG. FIG. 14 is a block diagram of a first embodiment of a search system having a resource tracking unit. The search system shown in FIG. 14 tracks resources when clicking on a link to access the Web resource 31, and displays the Web resource 31 in which the item of interest truly exists.

利用者は、ブラウザ５０を操作してキーワードを検索装置４０に入力する。検索装置４０は利用者に入力されたキーワードで検索を行う。検索装置４０は、ブラウザ５０に検索結果４１をリンクページとして表示する。ここまでの処理は、通常の検索システムと同じである。 The user operates the browser 50 and inputs a keyword to the search device 40. The search device 40 performs a search using a keyword input by the user. The search device 40 displays the search result 41 as a link page on the browser 50. The processing so far is the same as that of a normal search system.

図１４の検索システムは、利用者に入力されたキーワードを注目資源識別情報６０として保持しておく点で図１１の検索システムと異なる。利用者は検索結果４１の中から、その実体のＷｅｂ資源にアクセスするためにＷｅｂ資源へのリンクをクリックする。 The search system of FIG. 14 is different from the search system of FIG. 11 in that the keyword input by the user is held as the resource-of-interest identification information 60. The user clicks a link to the Web resource from the search result 41 in order to access the Web resource of the entity.

ブラウザ５０は、クリックされたリンク先のアドレスに対応するＷｅｂ資源３１をＷｅｂサーバ３０から単純に取り出すのではなく、リンク先のアドレス（所在情報）と注目資源識別情報６０から資源の追跡を行い、真に注目アイテムのあるＷｅｂ資源３１を表示するものである。なお、資源追跡部６０はブラウザ５０の拡張として実現してもよいし、資源追跡部６０を有する専用のブラウザ５０としてもよい。 The browser 50 does not simply retrieve the Web resource 31 corresponding to the clicked link destination address from the Web server 30, but performs resource tracking from the link destination address (location information) and the target resource identification information 60. The Web resource 31 that has a truly noticeable item is displayed. The resource tracking unit 60 may be realized as an extension of the browser 50, or may be a dedicated browser 50 having the resource tracking unit 60.

なお、図１４の検索システムは図１１の検索システムと検索結果４１のＨＴＭＬが全くかわらないが、記事３０１２のリンクをクリックしたら「http://foo/bar1.html」とあるにもかかわらず「http://foo/bar2.html」のＷｅｂページ、すなわち図６のＷｅｂページが表示されることになる。 The search system shown in FIG. 14 is completely different from the search system shown in FIG. 11 and the HTML of the search result 41, but when the link of the article 3012 is clicked, “http: //foo/bar1.html” is displayed. The web page “http: //foo/bar2.html”, that is, the web page of FIG. 6 is displayed.

図１５は、資源追跡部のある検索システムの第二実施例の構成図である。図１５の検索システムは検索結果を表示する際、単純に、そのリンクを表示するのではなく、資源の追跡を行い、注目アイテムが真に存在するアドレスにリンクを書き換えて表示するものである。 FIG. 15 is a configuration diagram of a second embodiment of a search system having a resource tracking unit. When displaying the search result, the search system of FIG. 15 does not simply display the link, but tracks resources and rewrites and displays the link at the address where the item of interest really exists.

利用者は、ブラウザ５０を操作してキーワードを検索装置４０に入力する。検索装置４０は利用者に入力されたキーワードで検索を行う。検索装置４０は、ブラウザ５０に検索結果４１をリンクページとして表示する。このとき、検索装置４０は単純に検索結果４１のリンクを表示するのではなく、資源追跡部２０に資源の追跡を行わせる。 The user operates the browser 50 and inputs a keyword to the search device 40. The search device 40 performs a search using a keyword input by the user. The search device 40 displays the search result 41 as a link page on the browser 50. At this time, the search device 40 does not simply display the link of the search result 41 but causes the resource tracking unit 20 to track the resource.

すなわち、資源追跡部２０は検索装置４０に与えられたキーワードを注目資源識別情報として、各リンクで示す先に注目アイテムが存在するかどうか確認し、存在しないならばナビゲーションリンクを辿って、注目アイテムが見つかったＷｅｂ資源３１の所在情報（ＵＲＬ）にリンクを書き換えて利用者に提示する。 That is, the resource tracking unit 20 uses the keyword given to the search device 40 as attention resource identification information to check whether or not the attention item exists at the destination indicated by each link. The link is rewritten to the location information (URL) of the Web resource 31 in which is found, and is presented to the user.

既にリンクが書き換えられているので、ブラウザ５０は、その後、利用者がリンクをクリックしたならば、そのクリックされたリンク先のアドレスに対応するＷｅｂ資源３１をＷｅｂサーバ３０から取り出し、真に注目アイテムのあるＷｅｂ資源３１を表示するものである。図１５の検索システムでは、各リンクで示す先に注目アイテムが存在するかどうか確認し、存在しないならばナビゲーションリンクを辿って、注目アイテムが見つかったＷｅｂ資源３１の所在情報（ＵＲＬ）にリンクを書き換えて利用者に提示する。 Since the link has already been rewritten, the browser 50 then retrieves the Web resource 31 corresponding to the clicked link destination address from the Web server 30 if the user clicks on the link, and the item of interest is truly the item of interest. A Web resource 31 with a certain number is displayed. In the search system of FIG. 15, it is confirmed whether or not the item of interest exists at the destination indicated by each link, and if not, the navigation link is followed to link the location information (URL) of the Web resource 31 where the item of interest is found. Rewrite and present it to the user.

例えば図４のＨＴＭＬファイルの場合、図１５の検索システムではナビゲーションリンクを辿って、注目アイテムが見つかったＷｅｂ資源３１の所在情報（ＵＲＬ）にリンクを書き換える。図１６は、注目アイテムが見つかったＷｅｂ資源の所在情報にリンクを書き換えたＨＴＭＬファイルの一例のイメージ図である。 For example, in the case of the HTML file of FIG. 4, the search system of FIG. 15 follows the navigation link and rewrites the link to the location information (URL) of the Web resource 31 where the item of interest is found. FIG. 16 is an image diagram of an example of an HTML file in which the link is rewritten to the location information of the Web resource in which the item of interest is found.

図１６のＨＴＭＬファイルの場合、図３のリンクページの「記事３０１２」の部分をクリックすると、「http://foo/bar2.html」のＷｅｂページ、すなわち図６のＷｅｂページが表示されることになる。 In the case of the HTML file in FIG. 16, when the “article 3012” portion of the link page in FIG. 3 is clicked, the web page “http: //foo/bar2.html”, that is, the web page in FIG. 6 is displayed. become.

図１７は、図１５の検索システムの処理手順を表したフローチャートである。ステップＳ１に進み、検索装置４０は利用者からキーワードを受け取る。ステップＳ２に進み、検索装置４０はインデックス表から所在集合を得る。ステップＳ３に進み、資源追跡部２０は所在集合の各要素に対して現在の所在を求める。資源追跡部２０は、インデックス表から得た所在情報と、注目アイテム識別情報としてのキーワードとを得ることで、所在集合の各要素に対して現在の所在を求めることができる。 FIG. 17 is a flowchart showing the processing procedure of the search system of FIG. In step S1, the search device 40 receives a keyword from the user. In step S2, the search device 40 obtains a location set from the index table. In step S3, the resource tracking unit 20 obtains the current location for each element of the location set. The resource tracking unit 20 can obtain the current location for each element of the location set by obtaining the location information obtained from the index table and the keyword as the noticed item identification information.

ステップＳ４に進み、資源追跡部２０は検索結果４１として、所在集合の各要素に対して現在の所在を付与して出力する。なお、ステップＳ４では例えば図１８に示すようにリンク先のタイトル名なども修正された図１８のような検索結果画面を出力するようにしてもよい。図１８は検索結果画面の他の例のイメージ図である。 In step S4, the resource tracking unit 20 assigns and outputs the current location to each element of the location set as the search result 41. In step S4, for example, as shown in FIG. 18, a search result screen as shown in FIG. 18 in which the title name of the link destination is also corrected may be output. FIG. 18 is an image diagram of another example of the search result screen.

図１９は、資源追跡部のある検索システムの第三実施例の構成図である。図１９の検索システムは検索結果を表示する際、単純に、そのリンクを表示するのではなく、特定のＷｅｂサーバ６２のアドレス＋パラメータをリンクに書き換えて表示するものである。 FIG. 19 is a configuration diagram of a third embodiment of a search system having a resource tracking unit. The search system of FIG. 19 does not simply display the link when displaying the search result, but rewrites the address + parameter of the specific Web server 62 into the link and displays it.

利用者は、ブラウザ５０を操作してキーワードを検索装置４０に入力する。検索装置４０は利用者に入力されたキーワードで検索を行う。検索装置４０は、ブラウザ５０に検索結果４１をリンクページとして表示する。このとき、検索装置４０が単純に検索結果４１のリンクを表示するのではないのは、図１５の検索システムと同じである。しかし、図１０の検索システムは、この時点で資源の追跡を行わず、リンク加工部６１が、単にリンクを特定のＷｅｂサーバ６２のアドレス＋パラメータの形に書き換えておく。 The user operates the browser 50 and inputs a keyword to the search device 40. The search device 40 performs a search using a keyword input by the user. The search device 40 displays the search result 41 as a link page on the browser 50. At this time, the search device 40 does not simply display the link of the search result 41 as in the search system of FIG. However, the search system of FIG. 10 does not track resources at this time, and the link processing unit 61 simply rewrites the link in the form of the address of the specific Web server 62 + the parameter.

例えばキーワード"hoge"の検索結果のＵＲＬが「foo/bar1.html」であり、特定のＷｅｂサーバ６２のＵＲＬが「http://tracking」であったとすると、リンク加工部６１はリンクを「http://tracking?kw=hoge&url=foo%2Fbar1.html」に書き換える。ブラウザ５０には検索結果がリンクページとして表示される。 For example, if the URL of the search result of the keyword “hoge” is “foo / bar1.html” and the URL of the specific Web server 62 is “http: // tracking”, the link processing unit 61 converts the link to “http : // tracking? kw = hoge & url = foo% 2Fbar1.html ”. The browser 50 displays the search result as a link page.

利用者が「foo.html」にアクセスしようとしてリンクをクリックすると、実際には「http://tracking?kw=hoge&url=foo%2Fbar1.html」がアクセスされる。即ち、特定のＷｅｂサーバ６２として「http://tracking」がアクセスされる。特定のＷｅｂサーバ６２には、特定アイテム識別情報としてキーワード"hoge"、所在情報「foo%2Fbar1.html」が資源追跡部２０へ渡され、資源の追跡が実行される。したがって、ブラウザ５０は真に注目アイテムのあるＷｅｂ資源３１を表示できる。 When a user clicks a link to access "foo.html", "http: // tracking? Kw = hoge & url = foo% 2Fbar1.html" is actually accessed. That is, “http: // tracking” is accessed as the specific Web server 62. The keyword “hoge” and the location information “foo% 2Fbar1.html” are passed to the resource tracking unit 20 as specific item identification information to the specific Web server 62, and resource tracking is executed. Therefore, the browser 50 can display the Web resource 31 with the item of interest truly.

例えば図４のＨＴＭＬファイルの場合、図１９の検索システムでは特定のＷｅｂサーバ６２のアドレス＋パラメータ「http://tracking?kw=hoge&url=foo%2Fbar1.html」にリンクが書き換えられる。 For example, in the case of the HTML file of FIG. 4, the link is rewritten to the address of the specific Web server 62 + parameter “http: // tracking? Kw = hoge & url = foo% 2Fbar1.html” in the search system of FIG.

図２０は、特定のＷｅｂサーバのアドレス＋パラメータにリンクを書き換えたＨＴＭＬファイルの一例のイメージ図である。 FIG. 20 is an image diagram of an example of an HTML file in which a link is rewritten to a specific Web server address + parameter.

図２０のＨＴＭＬファイルの場合、図３のリンクページの「記事３０１２」の部分をクリックすると、特定のＷｅｂサーバ６２として「http://tracking」がアクセスされ、特定アイテム識別情報としてキーワード"hoge"、所在情報「foo%2Fbar1.html」が特定のＷｅｂサーバ６２から資源追跡部２０へ渡され、資源の追跡が実行された結果、真に注目アイテムのある「http://foo/bar2.html」のＷｅｂ資源３１、すなわち図６のＷｅｂ資源３１が表示されることになる。 In the case of the HTML file of FIG. 20, when the “article 3012” portion of the link page of FIG. 3 is clicked, “http: // tracking” is accessed as the specific web server 62 and the keyword “hoge” is used as the specific item identification information. As a result of the location information “foo% 2Fbar1.html” being passed from the specific Web server 62 to the resource tracking unit 20 and the tracking of the resources being executed, “http: //foo/bar2.html” that is truly the item of interest ", That is, the Web resource 31 shown in FIG. 6 is displayed.

上記では検索システムを例に説明したが、Ｗｅｂアノテーションシステムの場合、検索システムの検索装置４０をＷｅｂアノテーションＤＢからの情報取得を行う装置に置き換えればよい。 Although the search system has been described above as an example, in the case of a Web annotation system, the search device 40 of the search system may be replaced with a device that acquires information from the Web annotation DB.

図２１は資源追跡部の処理手順を表した一例のフローチャートである。ステップＳ１１に進み、資源追跡部２０の注目アイテム情報受取部２１は、所在情報（ＵＲＬ）と、注目アイテム識別情報を受け取る。ステップＳ１２に進み、Ｗｅｂ情報取出部２２は所在情報に対応するＷｅｂ資源をＷｅｂサーバ３０から取り出す。 FIG. 21 is a flowchart illustrating an example of the processing procedure of the resource tracking unit. In step S11, the item-of-interest information receiving unit 21 of the resource tracking unit 20 receives location information (URL) and item-of-interest identification information. In step S 12, the Web information extraction unit 22 extracts Web resources corresponding to the location information from the Web server 30.

ステップＳ１３に進み、注目アイテム存在確認部２３は取り出したＷｅｂ資源の中に注目アイテム識別情報が存在するか（注目アイテムが存在するか）どうか確認する。取り出したＷｅｂ資源の中に注目アイテムが存在すれば（ステップＳ１４においてＹＥＳ）、注目アイテム存在確認部２３はステップＳ１６に進み、Ｗｅｂ資源情報出力部２６から、そのＷｅｂ資源の所在情報ないしはＷｅｂ資源自身を出力することで、ステップＳ１２で取り出したＷｅｂ資源に対する処理（一般的には表示）を行う。なお、取り出したＷｅｂ資源に対する処理としては単純な表示の他、例えばＷｅｂアノテーションシステムであればアノテーションの処理（アノテーションを付与しての表示など）や、指定されたＷｅｂ資源のダウンロードなどの処理がある。 In step S13, the attention item existence confirmation unit 23 confirms whether attention item identification information exists in the extracted Web resource (whether the attention item exists). If there is an item of interest in the extracted web resource (YES in step S14), the item of interest presence confirmation unit 23 proceeds to step S16, and the location information of the web resource or the web resource itself is sent from the web resource information output unit 26. Is output (in general, display) for the Web resource extracted in step S12. In addition to simple display as processing for the extracted Web resource, for example, in the case of a Web annotation system, there are processing of annotation (display with annotation added) and processing of downloading a specified Web resource. .

取り出したＷｅｂ資源の中に注目アイテムが存在しなければ（ステップＳ１４においてＮＯ）、ナビゲーションリンク抽出部２４はステップＳ１５に進み、そのＷｅｂ資源から移動してしまった注目アイテムを追跡するため、現在のＷｅｂ資源からナビゲーションリンクを探索する。現在のＷｅｂ資源にナビゲーションリンクがあれば（ステップＳ１７においてＹＥＳ）、ナビゲーションリンク抽出部２４はステップＳ１８に進み、ナビゲーションリンクを抽出する。ナビゲーション実行部２５は、ナビゲーションリンク抽出部２４が抽出したナビゲーションリンク先の所在情報を新しい所在情報としてナビゲーション管理部２７に通知したあと、ステップＳ１２の処理に戻る。 If the noticed item does not exist in the extracted Web resource (NO in step S14), the navigation link extracting unit 24 proceeds to step S15, and tracks the noticed item that has moved from the web resource. Search for navigation links from Web resources. If there is a navigation link in the current Web resource (YES in step S17), the navigation link extraction unit 24 proceeds to step S18 and extracts the navigation link. The navigation execution unit 25 notifies the navigation management unit 27 of the location information of the navigation link destination extracted by the navigation link extraction unit 24 as new location information, and then returns to the process of step S12.

なお、現在のＷｅｂ資源にナビゲーションリンクがなければ（ステップＳ１７においてＮＯ）、ナビゲーションリンク抽出部２４はステップＳ１９に進み、注目アイテムが見つからなかったと判断する。ナビゲーション実行部２５は見つからなかった時の処理（例外処理など）を行う。具体的には、例えばメッセージを出して、元々の所在情報（ＵＲＬ）で示されるＷｅｂ資源の表示などを行う。 If there is no navigation link in the current Web resource (NO in step S17), the navigation link extraction unit 24 proceeds to step S19 and determines that the item of interest has not been found. The navigation execution unit 25 performs processing (exception processing or the like) when it is not found. Specifically, for example, a message is output to display the Web resource indicated by the original location information (URL).

ナビゲーションリンクが常に一つ正確に見つかる場合には、図２１に示すような処理で良い。しかし、実際にはナビゲーションリンクの候補が複数ある場合もある。図２２はナビゲーションリンクの候補が複数ある画面例である。図２２の画面ではナビゲーションリンクの候補として「前のページ」と「次のページ」とがあるが、時間的過去へのリンクがどちらかがあいまいである。 If one navigation link is always found correctly, the processing shown in FIG. 21 is sufficient. However, there may actually be a plurality of navigation link candidates. FIG. 22 shows an example of a screen having a plurality of navigation link candidates. In the screen of FIG. 22, there are “previous page” and “next page” as navigation link candidates, but the link to the temporal past is ambiguous.

このように、ナビゲーションリンクの候補にあいまい性のある場合にも対処するための実施例としては、ナビゲーションリンクの候補を待ち行列に登録し、その待ち行列に登録されたナビゲーションリンクの候補を出し入れして、次に探索すべきナビゲーションリンクの候補を求める。上記処理のためには、ナビゲーション管理部２７と待ち行列を保存するキューとが必要になる。なお、待ち行列の管理は、深さ優先探索、最良優先探索などのアルゴリズムが良く知られている（これらはコンピュータの基本的な探索アルゴリズムであるので詳細は省略する）。 As described above, as an embodiment for dealing with the case where the navigation link candidates are ambiguous, the navigation link candidates are registered in the queue, and the navigation link candidates registered in the queue are taken in and out. The navigation link candidate to be searched next is obtained. For the above processing, a navigation management unit 27 and a queue for storing a queue are required. For queue management, algorithms such as depth-first search and best-first search are well known (the details are omitted because these are basic computer search algorithms).

注目アイテム存在確認部２３は、Ｗｅｂ情報取出部２２が取り出したＷｅｂ資源の中に注目アイテム識別情報が存在するかどうか確認するものである。典型的な注目アイテム存在確認部２３は、文字列検索（例えばUNIX（登録商標）のfgrepコマンド）によりＷｅｂ資源の中に注目アイテム識別情報が存在するかどうか確認する。 The attention item existence confirmation unit 23 confirms whether attention item identification information exists in the Web resource extracted by the Web information extraction unit 22. The typical attention item existence confirmation unit 23 confirms whether attention item identification information exists in the Web resource by a character string search (for example, a UNIX (registered trademark) fgrep command).

例えば、注目アイテム識別情報が文字列"記事３０１２のタイトル"であれば、Ｗｅｂ資源中に文字列"記事３０１２のタイトル"が存在するか否かで注目アイテムが存在するか否かを判定するものである。 For example, if the item of interest identification information is a character string “title of article 3012”, it is determined whether or not the item of interest exists depending on whether or not the character string “title of article 3012” exists in the Web resource. It is.

やや高度な注目アイテム存在確認部２３は、情報構造を利用するものである。例えば図２３のＨＴＭＬファイルではＷｅｂ資源が複数の記事から構成されており、各記事のタイトルがclass属性の値"entry-header"で指定されている。また、別の注目アイテム存在確認部２３は、記事タイトルでなく、記事（図２３ではclass属性の値"entry"内の本文）のハッシュ値を利用することもできる。 The somewhat advanced attention item existence confirmation unit 23 uses an information structure. For example, in the HTML file of FIG. 23, the Web resource is composed of a plurality of articles, and the title of each article is specified by the value “entry-header” of the class attribute. Another attention item existence confirmation unit 23 can use the hash value of the article (the text in the class attribute value “entry” in FIG. 23) instead of the article title.

注目アイテム存在確認部２３は、ＨＴＭＬファイルの特定の構造と構造内の照合情報とを利用するものであり、図２４のような処理を行う。図２４は注目アイテム存在確認部の処理手順を表した一例のフローチャートである。 The attention item existence confirmation unit 23 uses a specific structure of the HTML file and collation information in the structure, and performs the processing shown in FIG. FIG. 24 is a flowchart illustrating an example of a processing procedure of the attention item presence confirmation unit.

ステップＳ２１に進み、注目アイテム存在確認部２３はＷｅｂ情報取出部２２が取得したＷｅｂ資源（ＨＴＭＬファイル）を取り込む。ステップＳ２２に進み、注目アイテム存在確認部２３は取り込んだＷｅｂ資源から、指定された属性及び値のペアをもつタグを探索する。 Proceeding to step S21, the notable item presence confirmation unit 23 captures the Web resource (HTML file) acquired by the Web information extraction unit 22. In step S22, the notable item existence confirmation unit 23 searches the fetched Web resource for a tag having a specified attribute / value pair.

ステップＳ２３に進み、注目アイテム存在確認部２３は指定された属性及び値のペアをもつタグが見つからないと、ステップＳ２４に進み、取り込んだＷｅｂ資源の中に注目アイテムが存在しない旨をナビゲーションリンク抽出部２４に返す。なお、注目アイテム存在確認部２３は指定された属性及び値のペアをもつタグが見つかると、ステップＳ２５に進み、そのタグの内容が探している内容と一致するか否かを判定する。見つかったタグの内容が探している内容と一致すると、注目アイテム存在確認部２３はステップＳ２６に進み、取り込んだＷｅｂ資源の中に注目アイテムが存在した旨をＷｅｂ資源情報出力部２６に返す。見つかったタグの内容が探している内容と一致しなければ、注目アイテム存在確認部２３はステップＳ２２の処理に戻る。 The process proceeds to step S23, and if the target item existence confirmation unit 23 does not find a tag having the specified attribute / value pair, the process proceeds to step S24 to extract that the target item does not exist in the fetched Web resource. Return to section 24. When the tag having the specified attribute / value pair is found, the attention item existence confirmation unit 23 proceeds to step S25, and determines whether or not the content of the tag matches the content being searched. If the content of the found tag matches the content being searched, the attention item existence confirmation unit 23 proceeds to step S26, and returns to the Web resource information output unit 26 that the attention item exists in the fetched Web resource. If the content of the found tag does not match the content being searched for, the attention item existence confirmation unit 23 returns to the process of step S22.

また、図２５はナビゲーションリンク抽出部の処理手順を表した一例のフローチャートである。ステップＳ３１に進み、ナビゲーションリンク抽出部２４はＷｅｂ資源（Ｗｅｂページ）中のすべてのリンク情報を取り出す。 FIG. 25 is a flowchart illustrating an example of the processing procedure of the navigation link extraction unit. In step S31, the navigation link extraction unit 24 extracts all link information in the Web resource (Web page).

例えばＨＴＭＬファイルでは、ナビゲーションリンク抽出部２４が、"a"タグで囲まれたアンカーテキスト及び"a"タグのhref属性の値を取り出す。また、ナビゲーションリンク抽出部２４は必要に応じてalt属性を取り出す（アンカーがテキストではなくイメージの場合などalt属性がアンカーテキストの代替となる）。 For example, in the HTML file, the navigation link extracting unit 24 extracts the anchor text surrounded by the “a” tag and the value of the href attribute of the “a” tag. In addition, the navigation link extraction unit 24 extracts the alt attribute as necessary (the alt attribute is an alternative to the anchor text, such as when the anchor is not a text but an image).

ステップＳ３２に進み、ナビゲーションリンク抽出部２４は、取り出したリンク情報のナビゲーションリンクらしさを、アンカーテキストやリンク先、文書中の位置などの情報を使って求める。ステップＳ３３に進み、ナビゲーションリンク抽出部２４は、それぞれのリンク情報のナビゲーションリンクらしさに基づきリンク情報の取捨選択を行い、ナビゲーションリンクを求める。リンク情報の取捨選択は一つないし無しを選ぶようなものであってもよいし、ナビゲーションリンクらしさの優先度を与えて、探索アルゴリズムによってナビゲーションリンクらしいものから順にナビゲーションリンクとして辿るという方策をとってもよい。 Proceeding to step S32, the navigation link extracting unit 24 obtains the likelihood of the extracted link information using the information such as anchor text, link destination, position in the document, and the like. Proceeding to step S33, the navigation link extracting unit 24 selects link information based on the likelihood of the navigation link of each link information to obtain a navigation link. The selection of link information may be such that one or none is selected, or a priority may be given to the likelihood of navigation links, and a search algorithm may be used to follow navigation links in order from what seems to be navigation links. .

ナビゲーションリンクらしさのチェックは、例えば以下のように行う。リンク情報がナビゲーションリンクであるかどうかは明確な基準があるわけではないので、ヒューリスティックスによってナビゲーションリンクらしさを求める。 For example, the navigation link likelihood is checked as follows. Whether or not the link information is a navigation link does not have a clear standard, so the likelihood of a navigation link is obtained by heuristics.

一般的にナビゲーションリンクは、そのＷｅｂページの閲覧者が一連の情報をサーチするために利用するものなので、アンカーテキストの手がかり（「前のページ」「次のページ」「next」「previous」「→」など）、文書中の位置（一般に文書の末尾付近）や、移動がシステマティックに行われることからリンク先の手がかり（xxx/1.htmlに対してはxxx/2.htmlであるはずというURLパターン）が利用できる。 In general, navigation links are used by viewers of the Web page to search a series of information. ”), The position in the document (generally near the end of the document) and the URL pattern that the link destination clue (xxx / 1.html should be xxx / 2.html because movement is done systematically) ) Is available.

図２６は、資源追跡部２０を実現する装置の一例の構成図である。図２６の装置は、それぞれバスＢで相互に接続された入力装置７１，出力装置７２，ドライブ装置７３，補助記憶装置７４，主記憶装置７５，演算処理装置７６およびインターフェース装置７７を含むように構成される。 FIG. 26 is a configuration diagram of an example of an apparatus that implements the resource tracking unit 20. 26 includes an input device 71, an output device 72, a drive device 73, an auxiliary storage device 74, a main storage device 75, an arithmetic processing device 76, and an interface device 77 connected to each other via a bus B. Is done.

入力装置７１はキーボードやマウスなどで構成され、各種信号を入力するために用いられる。出力装置７２はディスプレイ装置などで構成され、各種ウインドウやデータ等を表示するために用いられる。インターフェース装置７７は、モデム，ＬＡＮカードなどで構成されており、ネットワークに接続する為に用いられる。 The input device 71 includes a keyboard and a mouse, and is used for inputting various signals. The output device 72 includes a display device and is used to display various windows and data. The interface device 77 includes a modem, a LAN card, and the like, and is used for connecting to a network.

本実施例に係るプログラムは、資源追跡部２０を制御する各種プログラムの少なくとも一部である。プログラムは例えば記録媒体７８の配布やネットワークからのダウンロードなどによって提供される。プログラムを記録した記録媒体７８は、ＣＤ−ＲＯＭ、フレキシブルディスク、光磁気ディスク等の様に情報を光学的，電気的或いは磁気的に記録する記録媒体、ＲＯＭ、フラッシュメモリ等の様に情報を電気的に記録する半導体メモリ等、様々なタイプの記録媒体を用いることができる。 The program according to the present embodiment is at least a part of various programs that control the resource tracking unit 20. The program is provided, for example, by distributing the recording medium 78 or downloading from the network. The recording medium 78 on which the program is recorded is a recording medium that records information optically, electrically, or magnetically, such as a CD-ROM, flexible disk, magneto-optical disk, etc., and information is electrically stored such as a ROM, flash memory, etc. Various types of recording media, such as a semiconductor memory for recording data, can be used.

また、プログラムを記録した記録媒体７８がドライブ装置７３にセットされると、プログラムは記録媒体７８からドライブ装置７３を介して補助記憶装置７４にインストールされる。ネットワークからダウンロードされたプログラムは、インターフェース装置７７を介して補助記憶装置７４にインストールされる。 When the recording medium 78 on which the program is recorded is set in the drive device 73, the program is installed from the recording medium 78 to the auxiliary storage device 74 via the drive device 73. The program downloaded from the network is installed in the auxiliary storage device 74 via the interface device 77.

補助記憶装置７４は、インストールされたプログラムを格納すると共に、必要なファイル，データ等を格納する。主記憶装置７５は、起動時に補助記憶装置７４からプログラムを読み出して格納する。そして、演算処理装置７６は主記憶装置７５に格納されたプログラムに従って、上記したような各種処理を実現している。 The auxiliary storage device 74 stores the installed program and stores necessary files, data, and the like. The main storage device 75 reads and stores the program from the auxiliary storage device 74 at the time of activation. The arithmetic processing unit 76 implements various processes as described above in accordance with a program stored in the main storage device 75.

本発明は、以下に記載する付記のような構成が考えられる。
（付記１）
コンピュータに、１ページに１つ以上の資源を含む第一のＷｅｂ資源から他のＷｅｂ資源へ移動した前記資源を追跡させるＷｅｂ資源追跡管理プログラムであって、
前記コンピュータを、
前記第一のＷｅｂ資源の所在情報と、前記第一のＷｅｂ資源に含まれる注目資源を識別する為の注目資源識別情報との対応関係を受け取る受取手段と、
利用者に選択された前記所在情報に対応する前記Ｗｅｂ資源を取り出す取出手段と、
前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在するかを、前記注目資源識別情報により確認する存在確認手段と、
前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在しなければ前記Ｗｅｂ資源に含まれる前記他のＷｅｂ資源へのリンクを抽出する抽出手段と、
前記抽出手段が抽出した前記リンクで示される所在情報を次の所在情報として前記取出手段へ通知する追跡手段と、
前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在すれば前記Ｗｅｂ資源を前記利用者の端末に送信する送信手段と
して機能させるためのＷｅｂ資源追跡管理プログラム。
（付記２）
前記抽出手段は、前記Ｗｅｂ資源に含まれる前記他のＷｅｂ資源への1種類以上のリンクを示したテーブル手段に基づき、前記Ｗｅｂ資源に含まれる前記他のＷｅｂ資源へのリンクを抽出することを特徴とする付記１記載のＷｅｂ資源追跡管理プログラム。
（付記３）
前記存在確認手段は、キーワード，資源番号，又は前記資源のハッシュコード値を前記注目資源識別情報として、前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在するかを確認することを特徴とする付記１又は２記載のＷｅｂ資源追跡管理プログラム。
（付記４）
前記第一のＷｅｂ資源は、１ページに含む前記資源の数が制限されており、前記資源の数が制限を超過しないよう古い前記資源から順に、前記リンクで示される前記所在情報に対応する前記他のＷｅｂ資源に移動されることを特徴とする付記１乃至３何れか一項記載のＷｅｂ資源追跡管理プログラム。
（付記５）
１ページに１つ以上の資源を含む第一のＷｅｂ資源から他のＷｅｂ資源へ移動した前記資源を追跡させるＷｅｂ資源追跡管理装置であって、
前記第一のＷｅｂ資源の所在情報と、前記第一のＷｅｂ資源に含まれる注目資源を識別する為の注目資源識別情報との対応関係を受け取る受取手段と、
利用者に選択された前記所在情報に対応する前記Ｗｅｂ資源を取り出す取出手段と、
前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在するかを、前記注目資源識別情報により確認する存在確認手段と、
前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在しなければ前記Ｗｅｂ資源に含まれる前記他のＷｅｂ資源へのリンクを抽出する抽出手段と、
前記抽出手段が抽出した前記リンクで示される所在情報を次の所在情報として前記取出手段へ通知する追跡手段と、
前記取出手段が取り出した前記Ｗｅｂ資源に前記注目資源が存在すれば前記Ｗｅｂ資源を前記利用者の端末に送信する送信手段と
を有するＷｅｂ資源追跡管理装置。
（付記６）
コンピュータによって実行される、１ページに１つ以上の資源を含む第一のＷｅｂ資源から他のＷｅｂ資源へ移動した前記資源を追跡させるＷｅｂ資源追跡管理方法であって、
前記コンピュータが、
前記第一のＷｅｂ資源の所在情報と、前記第一のＷｅｂ資源に含まれる注目資源を識別する為の注目資源識別情報との対応関係を受け取る受取手順と、
利用者に選択された前記所在情報に対応する前記Ｗｅｂ資源を取り出す取出手順と、
前記取出手順で取り出した前記Ｗｅｂ資源に前記注目資源が存在するかを、前記注目資源識別情報により確認する存在確認手順と、
前記取出手順で取り出した前記Ｗｅｂ資源に前記注目資源が存在しなければ前記Ｗｅｂ資源に含まれる前記他のＷｅｂ資源へのリンクを抽出する抽出手順と、
前記抽出手順で抽出した前記リンクで示される所在情報を次の所在情報として前記取出手段へ通知する追跡手順と、
前記取出手順で取り出した前記Ｗｅｂ資源に前記注目資源が存在すれば前記Ｗｅｂ資源を前記利用者の端末に送信する送信手順と
を実行することを特徴とするＷｅｂ資源追跡管理方法。 The present invention may have the following configurations as described below.
(Appendix 1)
A web resource tracking management program for causing a computer to track the resource moved from a first web resource including one or more resources per page to another web resource,
The computer,
Receiving means for receiving a correspondence relationship between the location information of the first Web resource and target resource identification information for identifying the target resource included in the first Web resource;
Retrieval means for retrieving the Web resource corresponding to the location information selected by the user;
Presence checking means for confirming whether or not the attention resource exists in the Web resource extracted by the extraction means, based on the attention resource identification information;
Extraction means for extracting a link to the other Web resource included in the Web resource if the resource of interest does not exist in the Web resource extracted by the extraction means;
Tracking means for notifying the take-out means of the location information indicated by the link extracted by the extraction means as the next location information;
A Web resource tracking management program for causing a Web resource to function as a transmission unit for transmitting the Web resource to the user terminal if the resource of interest exists in the Web resource extracted by the extraction unit.
(Appendix 2)
The extraction means extracts a link to the other Web resource included in the Web resource based on a table means indicating one or more types of links to the other Web resource included in the Web resource. The Web resource tracking management program according to appendix 1, which is characterized.
(Appendix 3)
The presence confirmation unit is configured to confirm whether the target resource exists in the Web resource extracted by the extraction unit, using a keyword, a resource number, or a hash code value of the resource as the target resource identification information. The Web resource tracking management program according to Supplementary Note 1 or 2.
(Appendix 4)
In the first Web resource, the number of the resources included in one page is limited, and the first Web resource corresponds to the location information indicated by the link in order from the old resource so that the number of the resources does not exceed the limit. The Web resource tracking management program according to any one of appendices 1 to 3, wherein the Web resource tracking management program is moved to another Web resource.
(Appendix 5)
A web resource tracking management apparatus for tracking the resource moved from a first web resource including one or more resources in one page to another web resource,
Receiving means for receiving a correspondence relationship between the location information of the first Web resource and target resource identification information for identifying the target resource included in the first Web resource;
Retrieval means for retrieving the Web resource corresponding to the location information selected by the user;
Presence checking means for confirming whether or not the attention resource exists in the Web resource extracted by the extraction means, based on the attention resource identification information;
Extraction means for extracting a link to the other Web resource included in the Web resource if the resource of interest does not exist in the Web resource extracted by the extraction means;
Tracking means for notifying the take-out means of the location information indicated by the link extracted by the extraction means as the next location information;
A Web resource tracking management apparatus comprising: a transmission unit configured to transmit the Web resource to the user terminal if the resource of interest exists in the Web resource extracted by the extraction unit.
(Appendix 6)
A web resource tracking management method for tracking the resource moved from a first web resource including one or more resources per page to another web resource, which is executed by a computer,
The computer is
A receiving procedure for receiving correspondence between the location information of the first Web resource and the target resource identification information for identifying the target resource included in the first Web resource;
An extraction procedure for retrieving the Web resource corresponding to the location information selected by the user;
A presence confirmation procedure for confirming whether or not the target resource exists in the Web resource extracted in the extraction procedure by the target resource identification information;
An extraction procedure for extracting a link to the other Web resource included in the Web resource if the resource of interest does not exist in the Web resource extracted in the extraction procedure;
A tracking procedure for notifying the extraction means of the location information indicated by the link extracted in the extraction procedure as the next location information;
A Web resource tracking management method comprising: executing a transmission procedure for transmitting the Web resource to the user terminal if the resource of interest exists in the Web resource extracted in the extraction procedure.

本発明は、具体的に開示された実施例に限定されるものではなく、特許請求の範囲から逸脱することなく、種々の変形や変更が可能である。 The present invention is not limited to the specifically disclosed embodiments, and various modifications and changes can be made without departing from the scope of the claims.

最初のＵＲＬとは別のＵＲＬで指定されるＷｅｂページの中に記事が移動する例を表した一例の模式図である。It is a schematic diagram of an example showing an example in which an article moves into a Web page specified by a URL different from the first URL. Ｗｅｂページの一例のイメージ図である。It is an image figure of an example of a web page. リンクページの一例のイメージ図である。It is an image figure of an example of a link page. リンクページを表現するＨＴＭＬファイルの一例のイメージ図である。It is an image figure of an example of the HTML file expressing a link page. 記事が追加されたＷｅｂページの一例のイメージ図である。It is an image figure of an example of the web page to which the article was added. ナビゲーションリンク先のＷｅｂページの一例のイメージ図である。It is an image figure of an example of the web page of a navigation link destination. ナビゲーションリンク先のＷｅｂページに復元したアノテーションの一例のイメージ図である。It is an image figure of an example of the annotation decompress | restored to the web page of the navigation link destination. 本実施例の原理を表した説明図である。It is explanatory drawing showing the principle of a present Example. 資源追跡部の一例の構成図である。It is a block diagram of an example of a resource tracking part. ナビゲーションリンクテーブルの一例の構成図である。It is a block diagram of an example of a navigation link table. 資源追跡部のない通常の検索システムを表した構成図である。It is a block diagram showing the normal search system without a resource tracking part. インデックス表の一例の構成図である。It is a block diagram of an example of an index table. 検索結果画面の一例のイメージ図である。It is an image figure of an example of a search result screen. 資源追跡部のある検索システムの第一実施例の構成図である。It is a block diagram of the 1st Example of the search system with a resource tracking part. 資源追跡部のある検索システムの第二実施例の構成図である。It is a block diagram of the 2nd Example of the search system with a resource tracking part. 注目アイテムが見つかったＷｅｂ資源の所在情報にリンクを書き換えたＨＴＭＬファイルの一例のイメージ図である。It is an image figure of an example of the HTML file which rewritten the link to the location information of the Web resource where the attention item was found. 図１５の検索システムの処理手順を表したフローチャートである。It is a flowchart showing the process sequence of the search system of FIG. 検索結果画面の他の例のイメージ図である。It is an image figure of the other example of a search result screen. 資源追跡部のある検索システムの第三実施例の構成図である。It is a block diagram of the 3rd Example of the search system with a resource tracking part. 特定のＷｅｂサーバのアドレス＋パラメータにリンクを書き換えたＨＴＭＬファイルの一例のイメージ図である。It is an image figure of an example of the HTML file which rewritten the link to the address + parameter of a specific Web server. 資源追跡部の処理手順を表した一例のフローチャートである。It is a flowchart of an example showing the process sequence of the resource tracking part. ナビゲーションリンクの候補が複数ある画面例である。It is an example of a screen with a plurality of navigation link candidates. ＨＴＭＬファイルの一例のイメージ図である。It is an image figure of an example of an HTML file. 注目アイテム存在確認部の処理手順を表した一例のフローチャートである。It is a flowchart of an example showing the process sequence of the attention item presence confirmation part. ナビゲーションリンク抽出部の処理手順を表した一例のフローチャートである。It is a flowchart of an example showing the process sequence of the navigation link extraction part. 資源追跡部２０を実現する装置の一例の構成図である。2 is a configuration diagram of an example of an apparatus that implements a resource tracking unit 20; FIG.

Explanation of symbols

２０資源追跡部
２１注目アイテム情報取出部
２２Ｗｅｂ情報取出部
２３注目アイテム存在確認部
２４ナビゲーションリンク抽出部
２５ナビゲーション実行部
２６Ｗｅｂ資源情報出力部
２７ナビゲーション管理部
２８ナビゲーションリンクＤＢ
２９ナビゲーションリンクテーブル
３０Ｗｅｂサーバ
３１Ｗｅｂ資源
４０検索装置
４１検索結果
５０ブラウザ
６０注目資源識別情報
６１リンク加工部
６２特定のＷｅｂサーバ
７１入力装置
７２出力装置
７３ドライブ装置
７４補助記憶装置
７５主記憶装置
７６演算処理装置
７７インターフェース装置 DESCRIPTION OF SYMBOLS 20 Resource tracking part 21 Attention item information extraction part 22 Web information extraction part 23 Attention item presence confirmation part 24 Navigation link extraction part 25 Navigation execution part 26 Web resource information output part 27 Navigation management part 28 Navigation link DB
DESCRIPTION OF SYMBOLS 29 Navigation link table 30 Web server 31 Web resource 40 Search apparatus 41 Search result 50 Browser 60 Attention resource identification information 61 Link processing part 62 Specific Web server 71 Input apparatus 72 Output apparatus 73 Drive apparatus 74 Auxiliary storage apparatus 75 Main storage apparatus 76 Arithmetic processing device 77 Interface device

Claims

A web resource tracking management program for causing a computer to track the resource moved from a first web resource including one or more resources per page to another web resource,
The computer,
Receiving means for receiving a correspondence relationship between the location information of the first Web resource and target resource identification information for identifying the target resource included in the first Web resource;
Retrieval means for retrieving the Web resource corresponding to the location information selected by the user;
Presence checking means for confirming whether or not the attention resource exists in the Web resource extracted by the extraction means, based on the attention resource identification information;
Extraction means for extracting a link to the other Web resource included in the Web resource if the resource of interest does not exist in the Web resource extracted by the extraction means;
Tracking means for notifying the take-out means of the location information indicated by the link extracted by the extraction means as the next location information;
A Web resource tracking management program for causing a Web resource to function as a transmission unit for transmitting the Web resource to the user terminal if the resource of interest exists in the Web resource extracted by the extraction unit.

The extraction means extracts a link to the other Web resource included in the Web resource based on a table means indicating one or more types of links to the other Web resource included in the Web resource. The Web resource tracking management program according to claim 1, wherein:

The presence confirmation unit is configured to confirm whether the target resource exists in the Web resource extracted by the extraction unit, using a keyword, a resource number, or a hash code value of the resource as the target resource identification information. The Web resource tracking management program according to claim 1 or 2.

In the first Web resource, the number of the resources included in one page is limited, and the first Web resource corresponds to the location information indicated by the link in order from the old resource so that the number of the resources does not exceed the limit. 4. The Web resource tracking management program according to claim 1, wherein the Web resource tracking management program is moved to another Web resource.

A web resource tracking management apparatus for tracking the resource moved from a first web resource including one or more resources in one page to another web resource,
Receiving means for receiving a correspondence relationship between the location information of the first Web resource and target resource identification information for identifying the target resource included in the first Web resource;
Retrieval means for retrieving the Web resource corresponding to the location information selected by the user;
Presence checking means for confirming whether or not the attention resource exists in the Web resource extracted by the extraction means, based on the attention resource identification information;
Extraction means for extracting a link to the other Web resource included in the Web resource if the resource of interest does not exist in the Web resource extracted by the extraction means;
Tracking means for notifying the take-out means of the location information indicated by the link extracted by the extraction means as the next location information;
A Web resource tracking management apparatus comprising: a transmission unit configured to transmit the Web resource to the user terminal if the resource of interest exists in the Web resource extracted by the extraction unit.

A web resource tracking management method for tracking the resource moved from a first web resource including one or more resources per page to another web resource, which is executed by a computer,
The computer is
A receiving procedure for receiving correspondence between the location information of the first Web resource and the target resource identification information for identifying the target resource included in the first Web resource;
An extraction procedure for retrieving the Web resource corresponding to the location information selected by the user;
A presence confirmation procedure for confirming whether or not the target resource exists in the Web resource extracted in the extraction procedure by the target resource identification information;
An extraction procedure for extracting a link to the other Web resource included in the Web resource if the resource of interest does not exist in the Web resource extracted in the extraction procedure;
A tracking procedure for notifying the extraction means of the location information indicated by the link extracted in the extraction procedure as the next location information;
A Web resource tracking management method comprising: executing a transmission procedure for transmitting the Web resource to the user terminal if the resource of interest exists in the Web resource extracted in the extraction procedure.