JP2002024227A

JP2002024227A - System and method for generating radio web page

Info

Publication number: JP2002024227A
Application number: JP2001153006A
Authority: JP
Inventors: Brett Matthew Keating; マシューキーティングブレット; Michael Scott Hohman; スコットホーマンマイケル; Arajjofu Ivan; アラッジョフイヴァン; Jose Fa Keating; ファカークランドホセ; Jacob Sullivan; サリバンヤコブ
Original assignee: TOUUROOMU Inc
Current assignee: TOUUROOMU Inc
Priority date: 2000-05-22
Filing date: 2001-05-22
Publication date: 2002-01-25
Also published as: AU2001264810A1; WO2001090873A1

Abstract

PROBLEM TO BE SOLVED: To provide a system and a method which enable a user to analyze a document to create a new document generally by decomposing the document into a hierarchical structure. SOLUTION: The system and method for radio page generation are prepared which enable the user to specify the format of an HTML page sent from a radio device. This system can automatically handle dynamic Web pages as well as static HTML Web pages. The system may includes a graphical user interface(GUI) which enables the user to interact with the system. Further, the system may includes a robustifier which automatically processes a dynamic Web page to generate an XSL style sheet making it possible to extract contents from the dynamic Web page.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、一般に、文書を階
層構造に分解して新しい文書を生成するために、ユーザ
が文書を分析することを可能にするシステム及び方法に
関し、更に詳しくは、オリジナルの情報源に対応する１
つ又はそれ以上の無線ウェブページを生成するために、
ユーザがハイパーテキスト・マークアップ言語（ＨＴＭ
Ｌ）のウェブページ、ＸＭＬ文書、ＩＣＥ文書（内容シ
ンジケーション書式）、又は、ロイターなどの情報源を
分析することを可能にするシステム及び方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates generally to systems and methods that allow a user to analyze a document in order to decompose the document into a hierarchical structure and create a new document, and more particularly, to an original document. 1 corresponding to the information source of
To generate one or more wireless web pages,
Users can use Hypertext Markup Language (HTM)
L) Web pages, XML documents, ICE documents (content syndication forms), or systems and methods that allow analysis of information sources such as Reuters.

【０００２】[0002]

【従来の技術】文書又はウェブページをその構成部分に
分解できることがますます必要になってきている。特
に、ＨＴＭＬ又は何らかの他の書式で書かれているウェ
ブページについては、ウェブページを閲覧することがで
きて、アトミックとして知られる１つ又はそれ以上の階
層的に関連する要素にウェブページを分割できることが
必要である。アトミックとは、ウェブページの細部又は
小さな部分である。アトミックは、各グループにクラス
タ化されてもよく、各グループは、次に、より大きなグ
ループにクラスタ化され得る。例えば、ウェブページの
各トップ記事がアトミックであってもよいし、一方、記
事の全てをまとめて１つのグループとして扱ってもよ
い。BACKGROUND OF THE INVENTION There is an increasing need to be able to decompose a document or web page into its constituent parts. In particular, for web pages written in HTML or some other format, the ability to view the web page and divide the web page into one or more hierarchically related elements known as atomics. is necessary. Atomics are details or small parts of a web page. Atomics may be clustered into groups, and each group may then be clustered into larger groups. For example, each top article of a web page may be atomic, while all articles may be treated as one group.

【０００３】ウェブページ及びアトミックの階層構造に
おけるアトミックの特定は、重要なタスクである。例え
ば、システムは、ウェブページを階層的なアトミックに
分解し、次に、階層的なアトミックを使用して１人又は
それ以上の異なるユーザのためにウェブページを再構築
してもよく、この場合、各ユーザは、ウェブページのわ
ずかに異なる部分を要求してもよいし、又は、各ユーザ
は、メモリ又はスクリーンサイズの制限のためにウェブ
ページのある断片にしか対応できない装置を使用してい
るかもしれない。分解したウェブページはまた、様々な
他の目的に使用してもよい。[0003] The identification of atomics in a web page and in the atomic hierarchy is an important task. For example, the system may decompose the web page into a hierarchical atomic and then use the hierarchical atomic to reconstruct the web page for one or more different users, in which case , Each user may request a slightly different portion of the web page, or each user is using a device that can only accommodate certain fragments of the web page due to memory or screen size limitations Maybe. The disassembled web page may also be used for various other purposes.

【０００４】[0004]

【発明が解決しようとする課題】会社が携帯電話、パー
ム（Ｐａｌｍ）装置、ポケットベル（登録商標）などの
多くの無線装置にそのウェブページの内容を配信したい
と考える、そのような一般的なウェブページについて
は、人々の集団が、内容のデータベースに戻って異なる
種類の各々の無線装置向けに各々の新しいページを改め
て作成しなければならないが、これは遅く時間のかかる
処理である。更に、オリジナルのウェブページが変化し
た場合、その人々の集団が生成した新しいページの各々
を改めて生成しなければならない。従って、ウェブペー
ジプロデューサーなどの１人のユーザが、ＨＴＭＬウェ
ブページ、ＸＭＬ文書、ＩＣＥ文書（内容シンジケーシ
ョン書式）、及び、ロイター配給など、情報源をそのア
トミックにもっと簡単に分解して、アトミックを生成
し、互いに関連づけ、特性を割当てることにより、１つ
又はそれ以上の異なる無線装置向けに新しく書式化され
た無線ページを自動的に生成することが可能になるシス
テムを準備することが必要であり、本発明が意図するの
は、まさにこの目的である。SUMMARY OF THE INVENTION Such a common practice is that a company wants to distribute the contents of its web page to many wireless devices, such as mobile phones, Palm devices, pagers. For web pages, a group of people must return to the content database and re-create each new page for each different type of wireless device, which is a slow and time-consuming process. In addition, if the original web page changes, each new page created by that group of people must be re-created. Thus, a single user, such as a web page producer, can more easily atomically decompose sources of information, such as HTML web pages, XML documents, ICE documents (content syndication forms), and Reuters distribution, to create atomics It is necessary to provide a system that can automatically generate a newly formatted wireless page for one or more different wireless devices by associating and assigning properties to each other, It is precisely this purpose that the present invention contemplates.

【０００５】更に、ウェブページは、ウェブページの内
容が絶えず変更又は更新され得るという点で、しばしば
動的である。例えば、ニュース、取り引き情報、買い物
情報などに関する情報を含むウェブページは、連続的に
更新して変化を反映しなければならない。ウェブページ
が動的である時、オリジナルのウェブページから特定の
書式を持つ新しいページを生成しようとする処理は、更
に難しくなる。例えば、オリジナルのニュース記事のウ
ェブページに２つのトップ記事がある時、デザイナーが
新しいページを作成した場合、その新しいページは、記
事の内容が変わるか、又は、トップ記事の番号が変わる
とすぐに陳腐なものになるが、理由は、そのページがも
はや正確でも最新のものでもなくなるからである。すな
わち、動的ウェブページを基本として新しいページを手
作業で生成しようという作業は、新しいページがすぐに
陳腐になるために、とてつもなく難しく、非常に時間が
かかる。すなわち、動的ウェブページの内容が変わる時
にデザイナーが間断なく新しいページを作成し直さなく
てもよいように、動的ウェブページを基本として自動的
又は半自動的に新しいページを生成するシステムを準備
することが必要であり、本発明が意図するのはまた、ま
さにこの目的である。[0005] Further, web pages are often dynamic in that the content of the web page can be constantly changed or updated. For example, web pages containing information about news, transaction information, shopping information, etc., must be continuously updated to reflect changes. When a web page is dynamic, the process of trying to create a new page with a particular format from the original web page becomes more difficult. For example, if the original news article web page has two top articles, and the designer creates a new page, the new page will change as soon as the content of the article changes or the number of the top article changes It's stale, because the page is no longer accurate or up to date. That is, the task of manually creating a new page based on a dynamic web page is enormously difficult and very time-consuming because the new page quickly becomes stale. That is, a system for automatically or semi-automatically generating a new page based on the dynamic web page is prepared so that the designer does not need to recreate a new page without interruption when the content of the dynamic web page changes. It is necessary that the present invention also contemplates exactly this purpose.

【０００６】[0006]

【課題を解決するための手段】本発明による無線ウェブ
ページを生成するシステム及び方法が準備され、その場
合、ＨＴＭＬウェブページ、ＸＭＬ文書、ＩＣＥ文書
（内容シンジケーション書式）、又は、ロイター）など
の情報源は、階層的書式の中でその構成部分に自動的に
分解されてもよく、それにより、書式又は内容が異なる
新しいページは、オリジナルの情報源を基にして自動的
に生成され得る。本発明は、各無線装置向けに書式が異
なる一連のページ（カードという）を生成することが必
要となる、各無線装置が異なるメモリ及びスクリーンサ
イズを持つ場合、１つ又はそれ以上の無線装置向けにＨ
ＴＭＬウェブページを再目的化する状況で特に役に立
つ。本発明はまた、以下で更に詳細に説明するスクリー
ンサイズが異なる１つ又はそれ以上の異なる無線装置向
けに、動的ウェブページから１つ又はそれ以上の異なる
無線ページを生成するのに特に役に立つ。SUMMARY OF THE INVENTION A system and method for generating a wireless web page according to the present invention is provided, wherein information such as an HTML web page, an XML document, an ICE document (content syndication format), or a Reuters is provided. The source may be automatically broken down into its components in a hierarchical format, so that new pages that differ in format or content may be automatically generated based on the original source. The present invention is directed to one or more wireless devices, where each wireless device has a different memory and screen size, which requires generating a series of pages (referred to as cards) in a different format for each wireless device. To H
It is especially useful in situations where you want to repurpose TML web pages. The present invention is also particularly useful for generating one or more different wireless pages from a dynamic web page for one or more different wireless devices having different screen sizes as described in more detail below.

【０００７】本発明による無線ページ生成システムは、
ウェブページの内容を１つ又はそれ以上の異なる無線装
置に再目的化することを希望するウェブサイトのプロデ
ューサーにより利用されてもよく、その場合、各々の異
なる無線装置のスクリーンサイズが異なっていてもよ
く、それにより、各無線ウェブページの書式は、若干異
なるはずである。本発明によるシステムを使用すれば、
プロデューサーは、手助けがなくても複数の異なる無線
装置向けにウェブページを再目的化し、無線ページを無
線ページ配信システムによって自動的に生成し得る。[0007] The wireless page generation system according to the present invention comprises:
It may be used by producers of websites wishing to repurpose the content of a web page to one or more different wireless devices, in which case the screen size of each different wireless device may be different Well, thereby, the format of each wireless web page should be slightly different. With the system according to the invention,
The producer may repurpose the web page for a plurality of different wireless devices without assistance and generate the wireless page automatically by a wireless page distribution system.

【０００８】ノーマッド（登録商標）ワイヤレス・ツー
ルキットで知られる本発明による無線ウェブページを生
成するシステム及び方法は、グラフィック・ユーザ・イ
ンタフェース（ＧＵＩ）部分を備え得る。本システム
は、ワイヤレス・ツールキットを使用するウェブページ
のプロデューサーが、無線装置に対してウェブサイト内
容の表示の仕方を指定し、次に、賢い収穫及び運行シス
テムに対する目標とするウェブページ仕様と無線ページ
を生成する方法とを直接通信することを可能にする。本
システムはまた、プロデューサーが無線装置でウェブサ
イト内容がどのように表示されることになるかをエミュ
レートする１つ又はそれ以上の無線ページ上で、彼等の
ウェブサイト内容を下見することを可能にする。[0008] A system and method for generating wireless web pages according to the present invention, known in the NoMad® Wireless Toolkit, may include a graphic user interface (GUI) portion. The system allows web page producers using the wireless toolkit to specify how website content should be displayed to wireless devices, and then to target web page specifications and wireless for smart harvesting and navigation systems. Allows direct communication with how to generate the page. The system also allows producers to preview their website content on one or more wireless pages that emulate how the website content will be displayed on wireless devices. enable.

【０００９】好ましい実施形態において、本システム
は、プロデューサー（本明細書ではユーザともいう）が
素早くウェブページを処理してページに包含されるアト
ミックの階層的なリストを生成し、オリジナル・ウェブ
ページからのアトミックの一部又は全てを持つその結果
得られるページを生成することを可能にする。例えば、
メモリ又はスクリーンサイズに限りがある無線装置にそ
のページが送られることになる場合、ウェブページのプ
ロデューサーは、通常、その装置での表示用として更に
限られたメモリ又はスクリーンサイズでウェブページを
再書式化しなければならない。プロデューサーはまた、
各無線装置が異なるスクリーンサイズを持つような数多
くの無線装置向けにウェブページを再書式化する必要が
あるかもしれず、そのため、各無線装置向けに生成され
たるページは独自のものとなる。しかし、本発明による
システムにおいて、プロデューサーは、無線ページを自
動的に生成し、動的ウェブサイト向けの無線ページもま
た自動的に生成し得るように、各無線装置に対して各無
線ページを形成し得る。In a preferred embodiment, the system allows a producer (also referred to herein as a user) to quickly process a web page to generate a hierarchical list of the atomics contained in the page, and from the original web page To generate the resulting page with some or all of the atomics of the For example,
If the page is to be sent to a wireless device with limited memory or screen size, the web page producer will typically reformat the web page with a more limited memory or screen size for display on that device. Must be transformed. The producer also
The web page may need to be reformatted for a number of wireless devices, where each wireless device has a different screen size, so that the pages generated for each wireless device are unique. However, in the system according to the present invention, the producer automatically generates wireless pages and forms each wireless page for each wireless device so that wireless pages for dynamic websites can also be automatically generated. I can do it.

【００１０】すなわち、本発明によれば、情報源を処理
する装置が準備され、該装置は、情報源を検索し、各要
素が情報源内の内容の断片を含むような情報源から１つ
又はそれ以上の要素を抽出する。該装置はまた、情報源
内の要素の階層構造を表すデータ構造を生成し、情報源
から所定の要素を検索するために該データ構造を処理す
る。情報源から要素を抽出する際、該装置は、要素が抽
出されているページを閲覧するページ閲覧部分と、ペー
ジから抽出された要素の階層的リストを閲覧するページ
・ナビゲータ部分と、ページから要素を抽出するために
ページ閲覧部分からページ・ナビゲータ部分に要素を引
っ張り出すユーザと、ページ・ナビゲータ部分のリスト
の要素の特性を閲覧する要素特性部分とを備えてもよ
く、ページ閲覧部分、ページ・ナビゲータ部分、及び、
要素特性部分は、ユーザが、ページと要素の階層的リス
トとを同時に閲覧することにより、ページから要素を素
早く抽出することを可能にする。That is, in accordance with the present invention, there is provided an apparatus for processing an information source, the apparatus searching for the information source and one or more of the information sources, each element including a fragment of the content in the information source. Extract more elements. The apparatus also generates a data structure representing a hierarchical structure of the elements in the information source and processes the data structure to retrieve a predetermined element from the information source. When extracting an element from an information source, the apparatus includes a page browsing part for browsing a page from which the element is extracted, a page navigator part for browsing a hierarchical list of elements extracted from the page, and an element from the page. A user that pulls an element from the page browsing part to the page navigator part to extract the element, and an element characteristic part that browses the characteristics of the elements in the list of the page navigator part. Navigator part, and
The element properties portion allows a user to quickly extract elements from a page by simultaneously viewing the page and a hierarchical list of elements.

【００１１】データ構造を生成する際、該装置は、情報
源を内容と階層構造とを包含する第１の階層構造に変換
し、次に、たとえ情報源が変化した場合でも要素が捜し
当てられるように、情報源において要素に至る一般化さ
れた経路を決める。更に詳しくは、第１の階層構造は、
各々が要素を含む１つ又はそれ以上のノードを備え、特
定の要素は、階層構造の第１のノードで捜し当てられ、
一般化経路の決定子は、固有ノード識別子を識別するた
めに、データを包含する第１のノードを階層構造の他の
各ノードと比較する。一般化経路決定子はまた、固有識
別子が比較の間に捜し当てられない場合に、第１のノー
ドに付随する転換点ノードを識別し、該転換点ノード
は、一意的に第１のノードを識別し、第１のノードに適
合するノードの子孫がない場合に子孫軸線を転換点ノー
ドと指定する階層構造のノードである。[0011] In generating the data structure, the apparatus converts the information source into a first hierarchical structure containing the content and the hierarchical structure, and then the elements are located even if the information source changes. As such, determine a generalized path to the element at the source. More specifically, the first hierarchical structure is:
Comprising one or more nodes, each containing an element, a particular element being located at a first node of the hierarchical structure;
The generalized path determinator compares the first node containing the data with each of the other nodes in the hierarchical structure to identify the unique node identifier. The generalized path determinator also identifies a turning point node associated with the first node if the unique identifier is not found during the comparison, wherein the turning point node uniquely identifies the first node. This is a node having a hierarchical structure in which the descendant axis is designated as a turning point node when there is no descendant of the node that is identified and matches the first node.

【００１２】本発明の別の様態によれば、アトミック及
びアトミックのグループが抽出されているページを閲覧
するページ閲覧部分と、ページから抽出されたアトミッ
クの階層的リストを閲覧するページ・ナビゲータ部分
と、ページからアトミックを抽出するページ閲覧部分か
らページ・ナビゲータ部分にアトミックを引っ張り出す
ユーザと、ページ・ナビゲータ部分のリストのアトミッ
クの特性を閲覧するアトミック特性部分とを備える、Ｈ
ＴＭＬウェブページから１つ又はそれ以上のアトミック
を抽出するグラフィック・ユーザ・インタフェースが準
備される。ページ閲覧部分、ページ・ナビゲータ部分、
及び、要素特性部分は、ユーザが、ページとアトミック
の階層的リストとを同時に閲覧することにより、ページ
からアトミックを素早く抽出することを可能にする。According to another aspect of the invention, a page browsing portion for browsing a page from which an atomic or atomic group is extracted, and a page navigator portion for browsing a hierarchical list of the atomics extracted from the page. Extracting an atomic from a page, a user who pulls an atomic from a page browsing part to a page navigator part, and an atomic property part which browses atomic properties of a list of the page navigator part.
A graphical user interface is provided for extracting one or more atomics from a TML web page. Page browsing part, page navigator part,
And the element property portion allows a user to quickly extract an atomic from a page by simultaneously browsing the page and the hierarchical list of atomics.

【００１３】本発明の別の様態によれば、ＨＴＭＬウェ
ブページから１つ又はそれ以上のアトミックを抽出する
グラフィック・ユーザ・インタフェースが準備され、該
グラフィック・ユーザ・インタフェースは、アトミック
が抽出されているページを閲覧して、ページから抽出さ
れたアトミックの階層的リストを閲覧することによって
ページを運行し、ユーザは、アトミックをページ・ビュ
アーからページ・ナビゲータに引っ張り出してページか
らアトミックを抽出し、ユーザによって選択されたアト
ミックから特性を抽出するアトミック特性ジェネレータ
を引っ張り出し、それにより、ユーザがアトミックの階
層的なリストと選択されたアトミックの特性とを同時に
見れるページを閲覧する。According to another aspect of the present invention, there is provided a graphic user interface for extracting one or more atomics from an HTML web page, wherein the graphic user interface has the atomics extracted. Navigating the page by browsing the page and browsing the hierarchical list of atomics extracted from the page, the user pulls the atomic from the page viewer to the page navigator, extracts the atomic from the page, Pulls an atomic property generator that extracts properties from the selected atomics, thereby browsing the page where the user can simultaneously view the hierarchical list of atomics and the selected atomic properties.

【００１４】本発明の更に別の様態によれば、ウェブペ
ージの内容の各断片に至る経路を決めることにより、異
なるスクリーン書式を持つ１つ又はそれ以上の無線装置
向けのウェブページを再目的化するウェブページの処理
方法が準備される。該方法は、ウェブページに基づい
て、ウェブページの構造及びウェブページの内容を含む
第１の階層構造を生成する。該方法は、次に、内容に至
る経路が示されるウェブページの構造を備えるウェブペ
ージの第２の階層構造を第１の階層構造から生成する。
該方法は、次に、第２の階層構造内に挿入され、ウェブ
ページの内容に至る相対的な経路を生成し、たとえウェ
ブページが変化したとしても内容に至る経路を使用する
内容検索が内容が探し当てるように、第２の階層構造の
経路を強化する。In accordance with yet another aspect of the present invention, rerouting a web page for one or more wireless devices having different screen formats by routing to each piece of web page content. A web page processing method is prepared. The method generates a first hierarchical structure including a structure of the web page and a content of the web page based on the web page. The method then generates, from the first hierarchical structure, a second hierarchical structure of the web page comprising a structure of the web page indicating a path to the content.
The method then generates a relative path to the content of the web page, which is inserted into the second hierarchical structure, wherein the content search using the path to the content even if the web page changes is performed by the content search. The second hierarchically structured path is strengthened so that

【００１５】[0015]

【発明の実施の形態】本発明は、特に、１つ又はそれ以
上の異なる無線装置向けに１つ又はそれ以上の新しい無
線ページを生成するウェブページの分解に適用可能であ
り、この関連において本発明が以下に説明される。しか
し、本発明によるシステム及び方法は、非限定的である
が、ＸＭＬ文書、ＩＣＥ文書（内容シンジケーション書
式）、ロイター配給、又は、他の任意のタイプの情報配
給を含む、新しいページを生成するために情報源を１つ
又はそれ以上の要素に分解できる他のタイプの情報源な
どに対する方が利用度が大きいことが理解されるであろ
う。本発明をより理解するために、本発明による無線ウ
ェブページ・ジェネレータと共に使用し得る無線ページ
配信システムについて以下に簡単に説明する。DETAILED DESCRIPTION OF THE INVENTION The present invention is particularly applicable to the decomposition of web pages to generate one or more new wireless pages for one or more different wireless devices, and in this context the book The invention is described below. However, the systems and methods according to the present invention may be used to generate new pages, including, but not limited to, XML documents, ICE documents (content syndication forms), Reuters distributions, or any other type of information distribution. It will be appreciated that applications are more useful for other types of sources, such as those that can decompose the source into one or more components. To better understand the present invention, a wireless page distribution system that can be used with the wireless web page generator according to the present invention is briefly described below.

【００１６】図１は、本発明による無線ページ生成シス
テムと共に使用し得る無線ページ配信システム１０を示
すブロック図である。本システムについて本明細書では
簡単に説明する。より詳細な説明は、本明細書において
参照文献として援用されている、２０００年２月１４日
出願で本発明と同じ出願人が所有する現在出願中の米国
特許出願シリアル番号第０９／５０３、７９７号に見る
ことができる。システム１０は、ウェブサイトから１つ
又はそれ以上の異なる無線装置にウェブページを配信で
きることを希望する会社など、１つ又はそれ以上の内容
プロバイダ又は情報源１１を含んでもよく、その場合、
無線装置のスクリーンのサイズ、無線装置のメモリ、又
は、無線装置とウェブサイトとの間の通信リンクのため
に、各無線装置は、ウェブページに対して特定の方式で
書式化するように要求してもよい。FIG. 1 is a block diagram illustrating a wireless page distribution system 10 that may be used with the wireless page generation system according to the present invention. This system is briefly described herein. A more detailed description may be found in U.S. patent application Ser. No. 09 / 503,797, filed Feb. 14, 2000, filed on Feb. 14, 2000, which is hereby incorporated by reference. Can be seen in the issue. The system 10 may include one or more content providers or sources 11, such as companies that want to be able to deliver web pages from a website to one or more different wireless devices, in which case:
Due to the size of the wireless device screen, the wireless device memory, or the communication link between the wireless device and the website, each wireless device requires a web page to be formatted in a specific manner. You may.

【００１７】該システムはまた、ゲートウェイ１２、ウ
ェブサーバ１３、無線装置との無線通信システム１４、
及び、無線ウェブページ配信部分１５を含み得る。ゲー
トウェイは、無線装置からの着信ＨＴＴＰ要求を傍受し
てウェブサーバ１３、更には、無線ページ配信部分１５
に要求を発信することができる。無線ページ配信部分１
５は、実際の要求されたＨＴＭＬページを検索し、特定
の無線装置向けに１つ又はそれ以上のカード及びデック
にページを再書式化し、ウェブサーバ１３及びゲートウ
ェイ１２を使用して無線装置に再書式化されたカード及
びデックを送信し得る。The system also includes a gateway 12, a web server 13, a wireless communication system 14 with wireless devices,
And a wireless web page delivery portion 15. The gateway intercepts an incoming HTTP request from the wireless device and intercepts the web server 13 and the wireless page delivery portion 15
You can send a request to Wireless page distribution part 1
5 retrieves the actual requested HTML page, reformats the page into one or more cards and decks for a particular wireless device, and reformats the wireless device using web server 13 and gateway 12. Formatted cards and decks may be sent.

【００１８】ＨＴＭＬページの再書式化及び他の機能を
実行するために、無線ページ配信部分１５は、電気機器
接続ハンドラ１６、内容接続ハンドラ１７、ＸＭＬエン
ジン１８、配置エンジン１９、ルールデータベース２
０、及び、ＸＳＬルールセット・データベース２１を更
に備えてもよい。簡単にいえば、該システムは、着信Ｈ
ＴＭＬページ要求を受信し、ウェブページを検索し、Ｈ
ＴＭＬページをＸＨＴＭＬに再書式化し、ＸＨＴＭＬ文
書からＲＭＬ文書を生成し、ＲＭＬ文書からの要素を１
つ又はそれ以上のカード及びデックに書式化して無線装
置に配信されるプレゼンテーション・シューを形成し得
る。無線ページ配信システムの各部分の相互作用を図１
でより詳細に示すが、本明細書に援用されている現在出
願中の上記特許出願で更に説明されている。従って、無
線ページ配信システムの作動については、これ以上の詳
細な説明は省略する。ここで、本発明による無線ウェブ
ページ生成システムについて以下に説明する。To perform HTML page reformatting and other functions, the wireless page delivery portion 15 includes an electrical device connection handler 16, a content connection handler 17, an XML engine 18, a placement engine 19, and a rule database 2.
0 and an XSL ruleset database 21. Briefly, the system uses the incoming H
Receives a TML page request, retrieves a web page,
Reformats the TML page into XHTML, generates an RML document from the XHTML document, and replaces elements from the RML document with one.
One or more cards and decks may be formatted to form a presentation shoe that is delivered to the wireless device. Figure 1 shows the interaction of each part of the wireless page distribution system
, And is further described in the above-mentioned patent application, which is hereby incorporated by reference herein. Therefore, further detailed description of the operation of the wireless page distribution system will be omitted. Here, the wireless web page generation system according to the present invention will be described below.

【００１９】図２は、本発明によるウェブページ生成シ
ステム２２を示すブロック図である。一般に、ウェブペ
ージ生成システムは、以下で詳細に説明するように無線
装置にウェブページがダウンロードされる時、ウェブペ
ージを持つプロデューサー又は会社がその１つ又はそれ
以上のウェブページの体裁を調整することを可能にす
る。ウェブページ生成システム２２は、後端部分２３と
前端部分２４とを備え得る。前端部分はまた、グラフィ
ック・ユーザ・インタフェース（ＧＵＩ）ツールと呼ば
れてもよい。本発明の好ましい実施形態において、後端
部分は、以下で更に詳細に説明するように、後端の機能
を実行する１つ又はそれ以上のコンパイルされたＪＡＶ
Ａ（登録商標）プログラム／モジュールを備えることが
でき、前端は、以下で更に詳細に説明するように、前端
（ＧＵＩツール）の機能を実行する１つ又はそれ以上の
ビジュアル・ベーシック・モジュール／プログラムであ
ってもよい。ＧＵＩツールと後端とは、よく知られてい
るように、ＡＰＩを使用して互いに接続してもよい。FIG. 2 is a block diagram showing a web page generation system 22 according to the present invention. In general, a web page generation system will allow a producer or company with a web page to adjust the appearance of one or more web pages when the web page is downloaded to a wireless device, as described in more detail below. Enable. Web page generation system 22 may include a rear end portion 23 and a front end portion 24. The front end portion may also be called a graphic user interface (GUI) tool. In a preferred embodiment of the present invention, the trailing end portion includes one or more compiled JAVAs that perform the trailing end functions, as described in more detail below.
A (R) program / module, wherein the front end includes one or more visual basic modules / programs that perform the functions of the front end (GUI tool), as described in more detail below. It may be. The GUI tool and the back end may be connected to each other using an API, as is well known.

【００２０】更に詳しくは、後端２３は、図１に示すウ
ェブページ配信部分１５、ＲＭＬビルダー・モジュール
２５、ＸＳＬジェネレータ・モジュール２６、及び、ス
タイルシート・データベース２７を更に備えてもよい。
各モジュールの機能は、ここで説明され、各モジュール
について更なる詳細は、以下で与えられる。上記の通
り、ウェブページ配信部分１５は、ＸＨＭＴＬを生成し
得る。ＲＭＬビルダー・モジュール２５は、本明細書に
援用されている現在出願中の上記特許出願で更に詳細に
説明されているように、生成されたルールセットに基づ
いてＲＭＬ文書を生成し、ＲＭＬ文書に基づいてＸＳＬ
スタイルシートを生成するＸＳＬジェネレータ２６内に
ＲＭＬ文書を出力し得る。生成されたスタイルシート
は、データベース２７に記憶することができる。ウェブ
ページが無線装置にダウンロードされて表示されるよう
に、ＸＳＬスタイルシートを使用してウェブページから
１つ又はそれ以上のカードを自動的に生成してもよい。More specifically, the trailing end 23 may further include the web page distribution section 15, the RML builder module 25, the XSL generator module 26, and the style sheet database 27 shown in FIG.
The function of each module is now described, and further details for each module are given below. As described above, the web page delivery portion 15 may generate XHMTL. The RML builder module 25 generates an RML document based on the generated rule set and converts the RML document into an RML document, as described in further detail in the above-referenced patent application incorporated by reference herein. XSL based on
The RML document may be output into an XSL generator 26 that generates a style sheet. The generated style sheet can be stored in the database 27. One or more cards may be automatically generated from the web page using the XSL stylesheet so that the web page is downloaded and displayed on the wireless device.

【００２１】ＧＵＩツール２４は、ルールセット構築ツ
ールセット２８、ルールセット・データベース２９、プ
ロジェクト構築ツールセット３０、及び、無線ウェブサ
イト・プロジェクト・データベース３１を更に備えても
よい。グラフィック・ユーザ・インタフェース（ＧＵ
Ｉ）ツールによって、ユーザは、アプリケーションと対
話することができる。特にＧＵＩツールを使用すれば、
ユーザは、無線ウェブサイト・プロジェクトに対して、
ウェブサイトの内容を包含する１つ又はそれ以上のカー
ドの形成段階を含む、内容選択、構成、及び、配置を実
行することができる。好ましい実施形態において、ＧＵ
Ｉは、標準ＭＳウインドウズ型アプリケーションの体裁
と感覚とを持っており、ＭＳウインドウズ用アプリケー
ション規格に準拠している。The GUI tool 24 may further include a ruleset construction toolset 28, a ruleset database 29, a project construction toolset 30, and a wireless website project database 31. Graphic User Interface (GU
I) The tool allows the user to interact with the application. Especially if you use the GUI tool,
Users are required to submit a wireless website project
Content selection, organization, and placement can be performed, including the step of forming one or more cards containing the content of the website. In a preferred embodiment, the GU
I has the look and feel of a standard MS Windows application and is compliant with the MS Windows application standard.

【００２２】ルールセット構築ツールセット２８は、ユ
ーザがルールセットを作成及び定義することを可能にし
得る。ルールセット１５は、カードに対する新規書式化
など、無線ページ配信システム１５がデスクトップ中心
のウェブページの内容やサービスを無線装置に行く予定
の１つ又はそれ以上のカード内にいかに変換すべきか、
及び、どの内容をどのカードに載せるかを表している。
更に詳しくは、ルールセットはまた、どのＵＲＬが特定
のルールセットを使用するかを定めてもよい。ルールセ
ットはまた、１つ又はそれ以上の無線ページ内にウェブ
ページを変換する方法を指定するＸＳＬスタイルシート
を備えてもよい。ルールセット構築ツールセットを使用
すれば、ユーザは以下のことを行うことができる。１．ルールセットを作成し、開き、記憶する。２．ルールセットの基本となるデスクトップ中心のウェ
ブページを選択する。３．ルールセットを形成する（無線配信向けにウェブペ
ージに対する内容及びサービスを選択してグループにす
る）。４．専門無線機能をルールセットに一体化する。５．ルールセットの「無線運行構造」をグラフィックで
閲覧する。６．無線装置エミュレータ又はインターネット可能な無
線装置を使用して、試験的にルールセットを配置する。A ruleset construction toolset 28 may allow a user to create and define rulesets. The ruleset 15 describes how the wireless page delivery system 15 should translate the content and services of the desktop-centric web page into one or more cards that are going to the wireless device, such as new formatting for the card.
And what content is to be placed on which card.
More specifically, a ruleset may also define which URLs use a particular ruleset. The ruleset may also include an XSL stylesheet that specifies how to convert the web page into one or more wireless pages. With the ruleset construction toolset, the user can: 1. Create, open, and remember rulesets. 2. Select a desktop-centric web page that is the basis for the ruleset. 3. Create a rule set (select and group content and services for web pages for wireless distribution). 4. Integrate specialized wireless functions into the ruleset. 5. Browse the "wireless operation structure" of the ruleset in graphic form. 6. Using a wireless device emulator or an internet enabled wireless device, deploy the rule set on a trial basis.

【００２３】ルールセット構築ツールセット２８は、ウ
ェブ配信部分１５からウェブページを表すＸＨＴＭＬ文
書を受け取り、ＸＨＴＭＬに基づくデータベース２９に
記憶し得る１つ又はそれ以上のルールセットを生成して
もよい。１つ又はそれ以上のルールセットは、以下で更
に詳細に説明するように、無線ウェブページがウェブペ
ージに変換された時に無線装置上でＨＴＭＬウェブペー
ジがどのように見えるかを判断する。データベース２９
のルールセットは、ＲＭＬ文書を生成するＲＭＬビルダ
ー２５に送られてもよく、また、以下で説明するように
着信ウェブページ向けに無線ウェブサイトプロジェクト
を生成するプロジェクト構築ツールセット３０に送られ
てもよい。終了したプロジェクトは、データベース３１
に記憶される。The ruleset construction toolset 28 may receive XHTML documents representing web pages from the web distribution portion 15 and generate one or more rulesets that may be stored in an XHTML-based database 29. One or more rule sets determine what an HTML web page looks like on a wireless device when the wireless web page is converted to a web page, as described in further detail below. Database 29
May be sent to the RML builder 25, which generates an RML document, and also to the project construction toolset 30, which generates a wireless website project for incoming web pages, as described below. Good. Finished projects are stored in the database 31
Is stored.

【００２４】作動中に、プロデューサーは、１つ又はそ
れ以上の無線装置上でのＨＴＭＬウェブページの体裁に
関する情報を含む無線ウェブサイトプロジェクトを生成
するためにＧＵＩツールと対話してもよい。プロデュー
サー又はユーザがウェブページを選択すると、無線配信
部分１５は、そのウェブページを検索してウェブページ
に対応するＸＨＭＴＬ文書を生成することができる。ユ
ーザは、図３及び図４を参照しながら以下で説明するよ
うに、ルールセット構築ツールセットを使用してウェブ
ページから１つ又はそれ以上の要素を抽出するか、又
は、自動的に抽出することができる。以下アトミックと
呼ぶ抽出された要素から、ユーザは、無線ページの体裁
を生成し、無線ページを検討してもよい。ユーザが無線
ページに満足すると、無線ページ配信システム１５（図
１を参照されたい）が、それがウェブページの要求を受
け取る時、生成したルールセット及びスタイルシートに
基づいて無線装置向けに適切な１つ又はそれ以上のカー
ドを自動的に生成するように、無線ページの体裁に関す
る情報を捕らえる１つ又はそれ以上のルールセットが生
成される。すなわち、一旦ユーザがルールセットとスタ
イルシートとを形成すると、無線ページ配信システム
は、スタイルシートに従って無線ページを自動的に生成
する。In operation, a producer may interact with a GUI tool to generate a wireless website project that includes information about the appearance of an HTML web page on one or more wireless devices. When a producer or user selects a web page, the wireless distribution portion 15 can search the web page and generate an XHMTL document corresponding to the web page. The user extracts or automatically extracts one or more elements from the web page using the ruleset construction toolset, as described below with reference to FIGS. 3 and 4. be able to. From the extracted elements, hereinafter referred to as atomic, the user may generate a wireless page appearance and review the wireless page. When the user is satisfied with the wireless page, the wireless page delivery system 15 (see FIG. 1), when it receives the request for the web page, sends the appropriate one for the wireless device based on the generated ruleset and style sheet. One or more rule sets that capture information about the appearance of the wireless page are generated to automatically generate one or more cards. That is, once a user forms a rule set and a style sheet, the wireless page distribution system automatically generates a wireless page according to the style sheet.

【００２５】生成ルールセットを使用して、ＲＭＬビル
ダー・モジュール２５とＸＳＬジェネレータ・モジュー
ル２６とは、ＲＭＬ文書を生成してもよく、次に、ルー
ルセット及びＲＭＬ文書に具体化されたプロデューサー
の要件を反映するＸＳＬスタイルシートを生成し得る。
また、ルールセットを使用して、ＸＳＬスタイルシート
と結合できるプロジェクト情報を生成し、図１に示すよ
うに無線ウェブページ配信システムを使用してその後に
配置し得る無線ウェブサイト・プロジェクトを生成して
もよい。無線ページ生成システムを使用して、ユーザ
は、無線装置上でそのウェブページの書式を指定しても
よい。Using the generation ruleset, the RML builder module 25 and the XSL generator module 26 may generate the RML document, and then the ruleset and the requirements of the producer embodied in the RML document. XSL stylesheets that reflect
Also, using the ruleset to generate project information that can be combined with an XSL stylesheet, and using a wireless webpage distribution system to generate a wireless website project that can be subsequently deployed using a wireless webpage distribution system as shown in FIG. Is also good. Using the wireless page generation system, a user may specify the format of the web page on the wireless device.

【００２６】ルールセット構築ツールセット２８は、図
４ａから図４ｃを参照して説明するページビュアー・モ
ジュール、及び、図５を参照して以下で説明する無線運
行ビュアー・モジュールを更に備えることができる。プ
ロジェクト構築ツールセット３０は、図６及び図７を参
照して以下で説明するプロジェクトマネージャ・モジュ
ール、図８を参照して以下で説明するＵＲＬ形成マネー
ジャ・モジュール、図９Ａ及び図９Ｂを参照して以下で
説明する無線形態マネージャ・モジュール、図１０を参
照して以下で説明する配置マネージャ・モジュール、及
び、図３２ａ及び図３２ｂを参照して以下で説明するエ
ミュレータ・モジュールを更に備えてもよい。ここで、
これらの各モジュールについて更に詳細に説明する。The ruleset construction toolset 28 can further include a page viewer module, described with reference to FIGS. 4a-4c, and a wireless operation viewer module, described below with reference to FIG. . The project construction toolset 30 includes a project manager module described below with reference to FIGS. 6 and 7, a URL formation manager module described below with reference to FIG. 8, and FIGS. 9A and 9B. It may further comprise a wireless configuration manager module described below, a deployment manager module described below with reference to FIG. 10, and an emulator module described below with reference to FIGS. 32a and 32b. here,
Each of these modules will be described in more detail.

【００２７】図３は、本発明による１つ又はそれ以上の
アトミック及びアトミックのグループに分解されるウェ
ブページ４０の一部分の例、特に、オンライン株式仲買
会社のＨＴＭＬウェブページ４０の一部分の例を示す図
である。図では、一番内側の点線の囲み部分はアトミッ
クを示し、点線の囲み部分の外側の囲み部分は、アトミ
ックのグループを構成する。アトミックは、単語、株式
の相場、株式名、見出し、パラグラフ、リンク、画像、
及び、ウェブページの最も基本的要素を形成するページ
の他の基本的要素など、ウェブページの基本的構築ブロ
ックである。該システムは、自動的にＸＨＴＭＬ文書に
基づいてウェブページのアトミックを識別し得る。グル
ープは、階層的構成内にネストすることができるユーザ
が定めるアトミックのセットとして形成される。これら
の論理的な階層的セットによって、ユーザに関する運行
経験が形成され、無線ウェブサイト構築の鍵となる。FIG. 3 illustrates an example of a portion of a web page 40 that is broken down into one or more atomics and atomic groups according to the present invention, and in particular, an example of an HTML web page 40 of an online stockbroker. FIG. In the drawing, the innermost portion surrounded by a dotted line indicates atomic, and the outer portion surrounded by the dotted line forms an atomic group. Atomics include words, stock quotes, stock names, headlines, paragraphs, links, images,
And basic building blocks of the web page, such as other basic elements of the page that form the most basic element of the web page. The system may automatically identify web page atomics based on XHTML documents. Groups are formed as a user-defined atomic set that can be nested within a hierarchical organization. These logical hierarchical sets form the operating experience for users and are key to wireless website construction.

【００２８】例えば、ページ４０上部にあるのは、相場
ルックアップ形式４１である。相場ルックアップ形式４
１は、３つのアトミック、つまり、「相場」タイトル部
分４１ａ、入力ボックス４１ｂ、及び、「ゴー（Ｇ
ｏ）」提出ボタン４１ｃから作られている。更に、市場
グラフ４２ａ、表４２ｂ、Ｆｏｏｌ．ｃｏｍ広告４２ｃ
が各々関連アトミックであり、グループ４２を構成する
ためにグループにまとめられている。更に、市場グラフ
４２ａの各々の要素はまた、アトミックであってもよ
く、その結果、「ナスダック（ＮＡＳＤＡＱ）」はアト
ミックであり、「２７５６．２７」もアトミックであ
り、下向き矢印もアトミックであり、そして、「−５．
４８」もアトミックであるFor example, at the top of page 40 is a quote lookup format 41. Market lookup format 4
1 has three atomics: a "quote" title portion 41a, an input box 41b, and a "go (G
o) "Submit" button 41c. Further, the market graph 42a, the table 42b, and the Pool. com advertisement 42c
Are each related atomic and are grouped together to form a group 42. Further, each element of the market graph 42a may also be atomic, so that "NASDAQ" is atomic, "275.27" is atomic, the down arrow is atomic, Then, “−5.
48 is also atomic

【００２９】最後に、ＴｈｅＳｔｒｅｅｔ．ｃｏｍロゴ
４３ａ、及び、ニュース記事４３ｂ及び４３ｃは、各々
関連アトミックであり、グループ４３としてまとめられ
ている。グループ４１、４２、及び、４３の全てがルー
トグループ４０を構成している。これらのグループによ
って、Ｅ−ＴＲＡＤＥウェブサイトのこの部分に対する
関連階層が構成される。従って、いかなるウェブページ
も、アトミックとアトミックのグループとに分解され、
それにより、アトミックとアトミックのグループとは、
異なるアトミックのサブセット又は異なる書式などを持
つ新しいウェブページに再構成され得る。ここで、ルー
ルセット構築ツールセットのページビュアー・モジュー
ルについて更に詳細に説明する。Finally, The Street. The com logo 43a and the news articles 43b and 43c are each related atomic and are grouped as a group 43. All of the groups 41, 42, and 43 constitute the root group 40. These groups make up the relevant hierarchy for this part of the E-TRADE website. Therefore, any web page can be broken down into atomic and atomic groups,
Thereby, atomic and atomic group,
It can be reconstructed into a new web page with a different atomic subset or a different format. Here, the page viewer module of the rule set construction tool set will be described in more detail.

【００３０】ウェブページなどの情報源のそのアトミッ
クへの分解には、様々な異なる用途があり得る。好まし
い実施形態では、アトミックへのウェブページの分解に
よって、スクリーンサイズが異なる１つ又はそれ以上の
異なる無線装置上で表示するためにウェブページを再目
的化することができる。特に、ウェブページは、既に個
々のアトミックに分解されているので、無線ページが生
成され得るように、それらのアトミックを１つ又はそれ
以上の異なる無線装置向けに自動的又は手作業で１つ又
はそれ以上の無線ページに割当てることが可能である。There may be a variety of different uses for breaking down a source, such as a web page, into its atomic form. In a preferred embodiment, the decomposition of the web page into atomics allows the web page to be repurposed for display on one or more different wireless devices with different screen sizes. In particular, web pages have already been broken down into individual atomics, so that those atomics can be automatically or manually targeted to one or more different wireless devices, such that wireless pages can be generated. It is possible to allocate more wireless pages.

【００３１】図４ａから図４ｃまでは、図２に示すシス
テム内にあるＧＵＩツール向けの一体式デスクトップ・
ユーザ・インタフェース５０の例を示す図である。図示
した例において、ユーザ・インタフェースは、ユーザが
ルールセット構築ツールセットのツールの全てを、図示
のように、同時に閲覧できるように設計されている。特
に、ユーザインタフェース５０は、ページビュアー部分
５２、ページ・ナビゲータ部分５４、アトミック特性部
分５６、及び、ページビュアー部分５２で閲覧中の項目
を変更する１つ又はそれ以上のタブ５８を備えてもよ
い。図示した例において、タブは、ユーザがＨＴＭＬコ
ードを閲覧できるＨＴＭＬタブ６０、ユーザが図３に示
すグラフィックページを閲覧できる構成タブ６２、及
び、ユーザがページソースを閲覧できるソースタブ６４
を備えてもよい。FIGS. 4a to 4c show an integrated desktop tool for the GUI tools in the system shown in FIG.
FIG. 3 is a diagram illustrating an example of a user interface 50. In the illustrated example, the user interface is designed to allow the user to simultaneously view all of the tools in the ruleset construction toolset, as shown. In particular, the user interface 50 may include a page viewer portion 52, a page navigator portion 54, an atomic feature portion 56, and one or more tabs 58 for changing the item being viewed in the page viewer portion 52. . In the illustrated example, the tabs are an HTML tab 60 where the user can view the HTML code, a configuration tab 62 where the user can view the graphic page shown in FIG. 3, and a source tab 64 where the user can view the page source.
May be provided.

【００３２】ページビュアー５２は、ルールセット構築
のための主要な作業環境である。ユーザが無線配信のた
めに形成したいと思うデスクトップ中心のウェブページ
が表示される。ユーザは、マウスで目標のウェブページ
から要素（アトミック又はアトミックのグループ）を選
択し、次に、各々の要素に特性を設定する。ページビュ
アーによって、ユーザは、アトミックが抽出されている
ページか、又は、タブによって制御された異なる項目閲
覧モードにおいて抽出されたアトミックを使用して作成
されているページかを閲覧することができる。図４に
は、ｅＢａｙ用の競売ページが示されている。ここで、
アトミック抽出処理について簡単に説明する。The page viewer 52 is a main work environment for constructing a rule set. A desktop-centric web page that the user wants to create for wireless distribution is displayed. The user selects elements (atomic or atomic group) from the target web page with the mouse, and then sets properties for each element. The page viewer allows the user to browse the page where the atomics are being extracted or created using the extracted atomics in different item viewing modes controlled by the tabs. FIG. 4 shows an auction page for eBay. here,
Atomic extraction processing will be briefly described.

【００３３】ＨＴＭＬであり得るウェブページからアト
ミックを抽出するために、マイクロソフト（登録商標）
ＤＨＴＭＬなどの変形ＨＴＭＬエディタを使用してもよ
い。一般のＨＴＭＬエディタは、通常作成されるように
ＨＴＭＬを表示することができるが、リンクに対してク
リック・スルーができるものではないし、また、特定の
ＨＴＭＬサブツリーが選択できるものでもない。例え
ば、テキストの断片が選択された場合、一般のＨＴＭＬ
エディタは、ＨＴＭＬツリーのどのノードが全ての選択
されたテキストを含むのかをユーザに教える。しかし、
本発明による変形ＨＴＭＬエディタは、ユーザクリック
に基づいて、選択された内容を持つＨＴＭＬノードを戻
すように改変されている。次に、アトミック抽出器は、
下層のツリーノード構造の中に掘り進み、有効な親を反
復することによって選択された要素に至る経路を決める
（制御装置がエディタの外に存在しないいくつかの設計
時間タグを付け加え、我々も同様にこの混合に付け加え
るいくつかの確認事項があるため）。アトミックによっ
て表される選択された内容に至る経路は、そのアトミッ
クに至る絶対経路であり、特定内容の位置を説明するた
めに後で使用される。To extract atomics from web pages, which may be HTML, Microsoft®
A modified HTML editor such as DHTML may be used. A typical HTML editor can display HTML as it would normally be created, but it does not allow click-throughs to links, nor does it allow a particular HTML subtree to be selected. For example, if a text fragment is selected, the general HTML
The editor tells the user which node of the HTML tree contains all the selected text. But,
The modified HTML editor according to the present invention has been modified to return an HTML node with the selected content based on a user click. Next, the atomic extractor:
Dig into the underlying tree node structure and determine the path to the selected element by iterating through the valid parents. Because there are some checks to add to this mix). The path to the selected content represented by the atomic is the absolute path to the atomic and will be used later to describe the location of the specific content.

【００３４】ページ・ナビゲータ５４は、ページ・ビュ
アーの構築ビューと非常に似ており、構築ビュアーの機
能性の全てを開始することができる。しかし、ページ・
ナビゲータ５４は、グラフィック書式よりもむしろツリ
ー構造で選択されたページ要素を表示する。ツリー構造
は、どのように要素がグループやアトミックに構成され
るかを階層的に表すものである。従って、ページ・ナビ
ゲータ部分５４は、ページの階層構造を示す。特にペー
ジは、図示のように、階層的な関係で配列されるグルー
プやアトミックとして表わされる。アトミックとアトミ
ックのグループとの階層的関係は、ページ・ビュアー部
分５２に示すページから要素を引っ張り出してナビゲー
タ部分５４内に落とすことによりユーザが手作業で生成
してもよい。更に、該システムは、上記の通り、ウェブ
ページからアトミックやグループを自動的に抽出し得
る。ユーザは、次に、正確にページを反映するためにナ
ビゲータ部分５４にアトミックやグループを配置しても
よい。The page navigator 54 is very similar to the construction view of the page viewer and can initiate all of the functionality of the construction viewer. But the page
The navigator 54 displays the selected page elements in a tree structure rather than a graphic format. The tree structure hierarchically represents how the elements are organized into groups or atomically. Accordingly, the page navigator portion 54 shows the hierarchical structure of the page. In particular, pages are represented as groups or atomics arranged in a hierarchical relationship as shown. The hierarchical relationship between atomic and atomic groups may be manually created by the user by pulling elements from the page shown in page viewer portion 52 and dropping them into navigator portion 54. In addition, the system can automatically extract atomics and groups from web pages, as described above. The user may then place an atomic or group in the navigator portion 54 to accurately reflect the page.

【００３５】ナビゲータ部分５４のアトミックが選択さ
れると、そのアトミックの特性をアトミック特性部分５
６に表示してもよい。アトミック特性部分５６によっ
て、ユーザは、無線ウェブサイトの各要素について特性
を閲覧及び形成することができる。特性は、各要素の特
定の属性を表わすことができる（例えば、特性は、どの
クラスの無線装置が要素を表示し得るか定めてもよ
い）。図示の例では、アトミックの特性は、その名称、
クラス、ＲＭＬ経路、ＨＴＭＬ経路、そのタグ、アトミ
ックのサンプル、及び、他のいかなる情報も含み得る。
本発明によれば、ＧＵＩツールインタフェースによっ
て、ユーザは、図４に示すように上記の部分の全てを同
時に閲覧することができ、その結果、ユーザは、ページ
を分解して新しいページを作成するにあたり、必要な情
報の全てを見るために各スクリーン又は各アプリケーシ
ョン間で切り替えたりしなくて済む。When the atomic of the navigator part 54 is selected, the atomic property is changed to the atomic property part 5.
6 may be displayed. The atomic properties portion 56 allows a user to view and form properties for each element of the wireless website. The characteristics may represent a particular attribute of each element (eg, the characteristics may define which class of wireless device may display the element). In the example shown, the properties of an atomic are its name,
It may include classes, RML paths, HTML paths, their tags, atomic samples, and any other information.
In accordance with the present invention, the GUI tool interface allows a user to simultaneously view all of the above portions, as shown in FIG. It is not necessary to switch between screens or applications to see all of the necessary information.

【００３６】図４ｂは、ＨＴＭＬコードに基づいて生成
されたグラフィックページが示されているＨＴＭＬビュ
アー６０を示す。図４ｃは、グラフィックページを生成
する実際のＨＴＭＬコードが示されているソース・ビュ
アー６４を示す。ここで、ＧＵＩツールの一部である本
発明による無線運行ビュアー・モジュールについて更に
詳細に説明する。FIG. 4b shows an HTML viewer 60 in which a graphic page generated based on the HTML code is shown. FIG. 4c shows the source viewer 64 in which the actual HTML code that creates the graphic page is shown. Here, the wireless operation viewer module according to the present invention, which is a part of the GUI tool, will be described in further detail.

【００３７】図５は、図２に示すシステム内の本発明に
よる無線運行ビュアー・モジュール７０の例を示す図で
ある。無線運行ビュアー７０によって、ユーザは、上記
の通り、ルールセット構築ツールセットを使用して作成
されたルールセットに基づいて生成された１つ又はそれ
以上のカードの無線運行構造をグラフィックに閲覧する
ことができる。特に、ほとんどの無線装置のスクリーン
の限界のために、通常１つウェブページが一連のプレゼ
ンテーションデックで無線装置に配信される。これらの
デックには、各無線装置のスクリーンについて適切に書
式化されたウェブページ内容を各々含む１つ又はそれ以
上のカードが含まれている。従って、無線運行ビュアー
・モジュールは、適切な方法でウェブページがカードに
分割されたことをユーザが確認することができるよう
に、ユーザにグラフィック形式でこれらのデックを呈示
する。すなわち、ユーザは、ルールセット構築ツールセ
ットを使用して生成されたルールセットの結果を検討し
得る。FIG. 5 is a diagram showing an example of the wireless operation viewer module 70 according to the present invention in the system shown in FIG. The wireless operation viewer 70 allows the user to graphically view the wireless operation structure of one or more cards generated based on the ruleset created using the ruleset construction toolset, as described above. Can be. In particular, due to the screen limitations of most wireless devices, typically one web page is delivered to the wireless device in a series of presentation decks. These decks include one or more cards, each containing web page content appropriately formatted for each wireless device screen. Thus, the wireless navigation viewer module presents these decks to the user in a graphical format so that the user can confirm that the web page has been split into cards in an appropriate manner. That is, the user may review the results of the ruleset generated using the ruleset construction toolset.

【００３８】図５に示す例では、ウェブページは、ユー
ザのルールセットにより、付け値統計、売り手情報、及
び、品目の説明を各々含む他の３枚のカード７４、７
６、及び、７８とリンクする競売品目を包含する第１の
カード７２に分解されている。付け値統計カード７４
は、付け値カード７９及び付け値カードとリンクする付
け値方法カード８０を持ち得る。説明カード７８は、説
明カードとリンクする品目の画像を示す画像カード８２
を持ち得る。すなわち、無線装置を持つユーザについて
は、そのユーザは、図５に示すようにカードを介して運
行してもよい。すなわち、システムのユーザは、カード
が論理的な方法で生成されることを確実にすることがで
き、その結果、カードを介した運行は論理的である。ユ
ーザが運行に関する問題を察知した場合、ページ・ビュ
アー部分５０に戻り、問題を事後処理するためにルール
セットをやり直すことができる。ここで、本発明による
プロジェクト構築ツールキットについて更に詳細に説明
する。In the example shown in FIG. 5, the web page contains, according to the user's rule set, three other cards 74, 7 each containing bid statistics, seller information, and a description of the item.
6, and 78 are broken down into a first card 72 containing auction items linked to 78. Bid statistics card 74
May have a bid card 79 and a bid method card 80 linked to the bid card. The explanation card 78 is an image card 82 showing an image of an item linked to the explanation card.
Can have That is, for a user having a wireless device, the user may operate via a card as shown in FIG. That is, the user of the system can ensure that the cards are generated in a logical manner, so that operation through the cards is logical. If the user notices a problem with the operation, they can return to the page viewer portion 50 and redo the ruleset to post-process the problem. Here, the project construction toolkit according to the present invention will be described in more detail.

【００３９】プロジェクト構築ツールセット３０によっ
て、ユーザは、１つ又はそれ以上のルールセットを無線
ウェブサイト・プロジェクト（ＷＷＰ）の中に組み合わ
せることができる。プロジェクト構築ツールセットを使
用すれば、ユーザは以下のことを実行できる。１．ルールセットを追加及び除去することによりＷＷＰ
を構成する（図７を参照されたい）。２．ＷＷＰのＵＲＬ定義表（様々なウェブページへのル
ールセットの適用の仕方を表す）を作成及び維持する
（図８を参照されたい）。３．専門無線機能をＷＷＰと一体化する（図９Ａ及び図
９Ｂを参照されたい）。４．無線装置エミュレータ又はインターネット可能な無
線装置を使用して試験的にＷＷＰを配置する。５．試験環境から生産環境にＷＷＰを配置換えする（図
１０を参照されたい）。The project building toolset 30 allows a user to combine one or more rulesets into a wireless website project (WWP). With the project building toolset, users can: 1. WWP by adding and removing rule sets
(See FIG. 7). 2. Create and maintain a WWP URL definition table (showing how to apply the ruleset to various web pages) (see FIG. 8). 3. Integrate specialized wireless functionality with WWP (see FIGS. 9A and 9B). 4. Deploy WWP on a trial basis using a wireless device emulator or internet enabled wireless device. 5. Relocate the WWP from the test environment to the production environment (see FIG. 10).

【００４０】上記の通り、プロジェクト構築ツールセッ
トは、更に、プロジェクト・マネージャ・モジュール、
ＵＲＬ形成マネージャ・モジュール、無線形態マネージ
ャ・モジュール、配置マネージャ・モジュール、及び、
エミュレータ・モジュールを含む。ここで、これらのモ
ジュールの各々について説明する。図６は、本発明によ
るプロジェクト・マネージャ向けのユーザ・インタフェ
ース９０の例を示す図である。プロジェクト・マネージ
ャによって、ユーザは、ＷＷＰを構成するルールセット
のセットを維持することができる。プロジェクト・マネ
ージャを使用すれば、ユーザは、図７を参照して以下で
説明するように、先に作成されたルールセットを追加す
るか、又は、取り除くことによって、ＷＷＰを変更する
ことができる。図６に示すように、特定のプロジェクト
に対して１つ又はそれ以上の異なるルールセットのリス
ト９２が示されている。各々のルールセットについて
は、ルールセットを作成したユーザ、最終更新日付、状
態（配置又は変更）、及び、ＵＲＬ規則を掲載する。更
に、ユーザ・インタフェース９０は、ユーザが、例え
ば、ルールセットを追加し、ルールセットを配置し、Ｕ
ＲＬを形成し、プロジェクト管理を終了し、又は、以前
のコマンドを取り消すことができる１つ又はそれ以上の
ボタン９４を備えてもよい。ここで、ルールセット追加
ユーザ・インタフェースの例について説明する。ルール
セットと、ルールセット及びＸＳＬスタイルシートに基
づいて生成される無線ページとの例について、以下で図
２２から図２５ｂまでを参照して説明する。As described above, the project construction toolset further includes a project manager module,
A URL formation manager module, a wireless configuration manager module, a location manager module, and
Includes emulator module. Here, each of these modules will be described. FIG. 6 is a diagram illustrating an example of a user interface 90 for a project manager according to the present invention. The project manager allows the user to maintain a set of rule sets that make up the WWP. Using the project manager, the user can modify the WWP by adding or removing previously created rulesets, as described below with reference to FIG. As shown in FIG. 6, a list 92 of one or more different rule sets for a particular project is shown. For each rule set, the user who created the rule set, the date of the last update, the status (arranged or changed), and the URL rule are listed. Further, the user interface 90 allows the user to, for example, add a rule set, place a rule set,
It may include one or more buttons 94 that can form an RL, exit project management, or cancel previous commands. Here, an example of the ruleset addition user interface will be described. Examples of a rule set and a wireless page generated based on the rule set and the XSL style sheet will be described below with reference to FIGS. 22 to 25B.

【００４１】図７は、本発明によるルールセット追加ビ
ュアー・ユーザ・インタフェース１００の例を示す図で
ある。ルールセット追加ビュアー・ユーザ・インタフェ
ースによって、ユーザは、以前にルールセット構築ツー
ルセットを使用して作成したルールセットを特定のプロ
ジェクトに追加することができる。図７に示すように、
ユーザは、１つ又はそれ以上の現存するルールセット
（この例では、ＳｔａｒｔＰａｇｅ．ｒｓ、Ｃａｔｅｇ
ｏｒｉｅｓ．ｒｓ、ＩｔｅｍＬｉｓｔ．ｒｓ、及び、Ｉ
ｔｅｍＤｅｓｃｒｉｐｔｉｏｎ．ｒｓ）から選択し、そ
れらをプロジェクトに追加してもよい。図示の例におい
て、ユーザは、カード、デック、及び、品目の説明をユ
ーザに呈示するのに使用される書式を形成するＩｔｅｍ
Ｄｅｓｃｒｉｐｔｉｏｎ．ｒｓルールセットを選択して
いる。ここで、本発明によるＵＲＬ形成マネージャにつ
いて以下に説明する。FIG. 7 is a diagram showing an example of a ruleset addition viewer user interface 100 according to the present invention. The Add Ruleset Viewer user interface allows a user to add rulesets previously created using the ruleset construction toolset to a particular project. As shown in FIG.
The user may select one or more existing rule sets (StartPage.rs, Categ in this example).
ories. rs, ItemList. rs and I
temDescription. rs) and add them to the project. In the illustrated example, the user forms an item that is used to present a description of the card, deck, and item to the user.
Description. The rs rule set has been selected. Here, the URL formation manager according to the present invention will be described below.

【００４２】図８は、本発明によるＵＲＬ形成マネージ
ャ・ユーザ・インタフェース１１０の例を示す図であ
る。ＵＲＬ形成マネージャによって、ユーザは、プロジ
ェクト向けのＵＲＬ定義表を維持することができる。Ｕ
ＲＬ定義表によって、図２に示す無線ページ配信システ
ム１５は、各々のＵＲＬ要求について適切なルールセッ
トを選択することができる。特に、ルールセットは、複
数のＵＲＬに適用されることが多いことから、このモジ
ュールによって、ユーザは、ＵＲＬがルールセットにマ
ップされる方法を定めることができる。例えば、図８に
示すように、ＩｔｅｍＤｅｓｃｒｉｐｔｉｏｎ．ｒｓが
選択されており、ＵＲＬ形成マネージャは、特定のルー
ルセットが特定のＵＲＬに適用されるか否かを確認する
ために使用される１つ又はそれ以上のトーケン１１２を
列挙する。更に、特定のトーケンがルールセットを使用
するのに適合している必要があるか（「必要」）、ＵＲ
Ｌのトーケンは該当しないか（「関係ない」）、又は、
そのトーケンがそのルールセットに対してＵＲＬに存在
できないか（「不必要」）を示す各トーケンについて、
丸１１４を埋めてもよい。図示の例において、ＵＲＬ
は、ルールセットを呼び出すために以下のトーケンを持
つ必要がある。すなわち、ｅＢａｙ、ｃｏｍ、ａｗ-ｃ
ｇｉ、及び、ｅＢａｙＩＳＡＰＩ．ｄｌｌ？ＶｉｅｗＩ
ｔｅｍ＆Ｉｔｅｍである。従って、例えば、「ｈｔｔ
ｐ：／／ｗｗｗ．ｅ１Ｂａｙ．ｃｏｍ．．．」というＵ
ＲＬでは、ｅＢａｙを包含しないために、ルールセット
を呼び出さないことになる。ここで、本発明による無線
形態マネージャについて以下に説明する。FIG. 8 is a diagram showing an example of the URL formation manager user interface 110 according to the present invention. The URL formation manager allows a user to maintain a URL definition table for a project. U
The RL definition table allows the wireless page distribution system 15 shown in FIG. 2 to select an appropriate rule set for each URL request. In particular, rulesets often apply to multiple URLs, so this module allows the user to define how URLs are mapped to rulesets. For example, as shown in FIG. rs has been selected, and the URL formation manager enumerates one or more tokens 112 that are used to check whether a particular rule set applies to a particular URL. In addition, whether a particular token needs to be adapted to use the ruleset ("necessary"),
L token is not applicable ("not relevant"), or
For each token that indicates whether the token cannot exist in the URL for the ruleset ("unnecessary"),
The circle 114 may be filled. In the example shown, the URL
Needs to have the following tokens to invoke the ruleset. That is, eBay, com, aw-c
gi and eBayISAPI. dll? ViewI
tem & Item. Therefore, for example, "http
p: // www. e1Bay. com. . . U
In the RL, a rule set is not called because eBay is not included. Here, the wireless configuration manager according to the present invention will be described below.

【００４３】図９Ａ及び図９Ｂは、本発明による無線形
態マネージャ向けのユーザ・インタフェース１２０及び
１２２の例を示す図である。無線形態マネージャは、デ
スクトップ中心のウェブサイト内容からでは利用できな
い特定の専門無線機能を備えるのに使用される。特に、
無線形態マネージャによって、ユーザは、無線ウェブサ
イトにこれらの機能を一体化することができる。例え
ば、ユーザは、専門メッセージ機能をＷＷＰ、又は、追
加し得る無線形態の例として以下に簡単に説明される収
益形態の中に含めることができる。FIGS. 9A and 9B show examples of user interfaces 120 and 122 for a wireless configuration manager according to the present invention. The wireless configuration manager is used to provide certain specialized wireless features not available from desktop-centric website content. In particular,
The wireless configuration manager allows the user to integrate these functions into a wireless website. For example, a user may include a professional messaging function in a WWP or revenue form that is briefly described below as an example of a wireless form that may be added.

【００４４】図９Ａ及び図９Ｂは、無線ページに対して
電子収益形態を含める場合の例を示す。図９Ａでは、無
線形態マネージャによって、ユーザは、電子商取引形態
（安全な受け口及び総合支払いを含む）を追加するか、
又は、促進機能（無線広告、クーポン券配布、及び、資
金提供を含む）を追加することができる。図９Ｂでは、
無線広告を追加する場合のユーザ・インタフェース１２
２を更に詳細に示す。ユーザは、例えば、無線装置ユー
ザに対して広告を表示する頻度に関する情報と共に、広
告パートナーとそのパートナーのＵＲＬを指定すること
ができる。図示するように、広告について、１つ又はそ
れ以上の無線装置が選ぶことができ、各々の異なる無線
装置は、異なるレベルの広告を持つことができる。例え
ば、「インターネット電話」や「ハンドヘルド」などの
いくつかの無線装置が選択され、広告が１セッションに
１度表示されるＷＡＰ電話とは対照的に、ユーザに提示
されるあらゆるデックで広告を繰り返すことができる。
従って、ユーザは、無線形態マネージャを使用して、無
線ページに付随する形態をカスタム化することができ
る。ここで、配置マネージャについて更に詳細に説明す
る。FIGS. 9A and 9B show an example of a case where an electronic profit form is included in a wireless page. In FIG. 9A, the wireless configuration manager allows the user to add an e-commerce configuration (including secure points of receipt and total payment),
Or, promotion features (including wireless advertising, coupon distribution, and funding) can be added. In FIG. 9B,
User interface 12 for adding a wireless advertisement
2 is shown in more detail. The user can, for example, specify an advertising partner and the URL of that partner, along with information about how often to display advertisements to wireless device users. As shown, one or more wireless devices can be selected for the advertisement, and each different wireless device can have a different level of advertisement. For example, some wireless devices such as "Internet Phone" and "Handheld" may be selected and repeat the advertisement on every deck presented to the user, as opposed to a WAP phone where the advertisement is displayed once per session. be able to.
Thus, the user can use the wireless configuration manager to customize the configuration associated with the wireless page. Here, the arrangement manager will be described in more detail.

【００４５】図１０は、本発明による配置マネージャ・
ユーザ・インタフェース１３０の例を示す図である。配
置マネージャによって、ユーザは、ＷＷＰの配置を制御
することができる。例えば、ユーザは、配置マネージャ
を使用して、試験環境又は生産環境のいずれかにＷＷＰ
を配置することができる。配置マネージャはまた、必要
に応じてユーザが前のバージョンに戻ることができる配
置バージョン制御を備える。図１０は、「ｗｈａｔｅｖ
ｅｒ．ｎｍｄ」というプロジェクト向けのバージョンを
示す配置マネージャ・ユーザ・インタフェースの例を示
す。ここで、本発明による無線ページ生成システムの後
端について更に詳細に説明する。FIG. 10 shows a configuration manager according to the present invention.
FIG. 3 is a diagram illustrating an example of a user interface 130. The configuration manager allows the user to control the configuration of the WWP. For example, a user may use a deployment manager to create a WWP in either a test environment or a production environment.
Can be arranged. The deployment manager also includes a deployment version control that allows the user to revert to a previous version if needed. FIG. 10 shows “whatev
er. 5 shows an example of a deployment manager user interface showing a version for a project called "nmd". Here, the rear end of the wireless page generation system according to the present invention will be described in more detail.

【００４６】上記の通り、後端は、図２に示すようにＲ
ＭＬビルダー２５、ＸＳＬジェネレータ２６、及び、ス
タイルシート・データベース２７を備える。ここで、バ
ックエンドのこれらのモジュールの各々について更に詳
細に説明する。ＲＭＬビルダー２５は、ＧＵＩによって
生成したルールセットに基づいてアゴノスチックＲＭＬ
文書を記憶し更新する。これらの文書は、ＧＵＩを通し
て収集されたユーザ指定式のプロジェクトデータを含
む。該データは、ユーザ指定のグループ及びアトミック
だけでなく、配置エンジンへの入力として使用されるＲ
ＭＬ文書の構造を映す内容アグノスチックＲＭＬ文書グ
ループデータを階層的に取り扱うためのユーザ定義の属
性及び追加ルールを含む。ここで、ウェブページ（この
例では動的）をＲＭＬコードに変換し、次に、強化アグ
ノスチックＲＭＬコードに変換し、最後に、動的ウェブ
ページから目標とする内容を抽出するＸＳＬスタイルシ
ートに変換する処理の例について説明する。As described above, the rear end has an R as shown in FIG.
An ML builder 25, an XSL generator 26, and a style sheet database 27 are provided. Now, each of these back-end modules will be described in more detail. The RML builder 25 uses an agonostic RML based on a ruleset generated by the GUI.
Store and update documents. These documents include user-specified project data collected through a GUI. The data is used as input to the placement engine as well as user specified groups and atomics.
Content that reflects the structure of the ML document Contains user-defined attributes and additional rules for hierarchically handling agnostic RML document group data. Here, the web page (dynamic in this example) is converted to RML code, then converted to enhanced agnostic RML code, and finally to an XSL stylesheet that extracts the desired content from the dynamic web page An example of the processing performed will be described.

【００４７】図１１は、ウェブページの２つのサンプル
１３１及び１３２の内容が変わる動的ウェブページの例
を示す。ウェブページは、それが第１のサンプル１３１
から以下で説明するように正確に内容を抽出するのが更
に困難となる第２のサンプル１３２に変わるので、動的
であり得る。第１のサンプル１３１は、グループ１３１
及びそのグループ内の複数のアトミック１３４を備える
ことができる。特に、アトミックは、「陰」と「陽」と
になり得る。図示するように、第２のサンプル１３２は
また、グループ１３３とアトミック１３４とを持つこと
ができる。しかし、第２のサンプルでは、グループ１３
３内のアトミックの数は、「陰」、「陽」、及び、
「竜」のアトミックになれるように、1つだけ増えてい
る。ウェブページのもう片方の部分のアトミックの数も
また、１つだけ増えている。例えば、第１のサンプルが
第１の所定の時間においてウェブページであって、内容
が変化した時、第２のサンプルが第２の所定の時間にお
いてそれと同じウェブページであってもよい。上記の通
り、通常のシステムではウェブページが変化すると、図
１１に示すように、無線ウェブページを生成している人
々は、無線ウェブページの全てをやり直す必要がある。
しかし、本発明によれば、ウェブページが第１のサンプ
ル３１又は第２のサンプル１３２に似ているか否かを問
わず、ウェブページから適切な内容を抽出することがで
きる。ウェブページは、構造的には図１２に示すように
そのＨＴＭＬツリーによって表される。図１２は、図１
１のウェブページサンプルに対応するＨＴＭＬツリーの
例を示す。特に、ウェブページの各サンプルの階層構造
が示されているが、２つのサンプル間の違いは明白であ
る。更に詳しくは、ＨＴＭＬツリーの上部は、ウェブペ
ージの内容と構造が同一なので同一である。第１のサン
プルのＨＴＭＬツリーにおいて、表構造は、アトミック
「陰」及び「陽」を包含する２つのＴＤノード１３４の
親であるＴＲグループノード１３３の親である。対照的
に、第２のサンプルのＨＴＭＬツリーは、同じ表・親ノ
ード及び同じＴＲグループノード１３３を持つが、３つ
のアトミック「陰」、「陽」、及び、「竜」を包含する
３つのＴＤノードがある。ここで、ウェブページの２つ
のサンプル上で生成されるＲＭＬコードの例について説
明する。FIG. 11 shows an example of a dynamic web page in which the contents of two samples 131 and 132 of the web page change. The web page is the first sample 131
To a second sample 132, which makes it more difficult to accurately extract the content, as described below, and may be dynamic. The first sample 131 is a group 131
And a plurality of atomics 134 in the group. In particular, an atomic can be "yin" and "yang". As shown, the second sample 132 can also have a group 133 and an atomic 134. However, in the second sample, group 13
The number of atomics in 3 is "Yin", "Yang", and
Only one has been added to be able to become atomic of the dragon. The number of atomics in the other part of the web page has also increased by one. For example, the first sample may be a web page at a first predetermined time and when the content changes, the second sample may be the same web page at a second predetermined time. As described above, when the web page changes in the normal system, as shown in FIG. 11, the people who are generating the wireless web page need to start over all of the wireless web page.
However, according to the present invention, appropriate contents can be extracted from a web page regardless of whether the web page resembles the first sample 31 or the second sample 132. A web page is structurally represented by its HTML tree as shown in FIG. FIG.
5 shows an example of an HTML tree corresponding to one web page sample. In particular, although the hierarchical structure of each sample of the web page is shown, the differences between the two samples are clear. More specifically, the upper part of the HTML tree is the same because the content and structure of the web page are the same. In the first example HTML tree, the table structure is the parent of the TR group node 133 that is the parent of the two TD nodes 134 that contain the atomic “Yin” and “Yang”. In contrast, the HTML tree of the second sample has the same table / parent node and the same TR group node 133 but three TDs containing three atomic "Yin", "Yang", and "Dragon". There are nodes. Here, an example of the RML code generated on the two samples of the web page will be described.

【００４８】図１３は、図１１のウェブページサンプル
に基づいて生成される関係マークアップ言語（ＲＭＬ）
コード１３５の例を示す。特に、該ＲＭＬコードは、そ
のコードがアグノスチックＲＭＬ（ＡＲＭＬ）と対照的
な内容を包含するので正統ＲＭＬと呼ばれてもよく、該
ＡＲＭＬは、ウェブページの構造の質問だけを包含する
が正統ＲＭＬと同じ関係階層を持っている。現行パラダ
イムにおけるＲＭＬコードは、通常、目標とする内容の
特定断片の選択を通してＨＴＭＬをＲＭＬ書式に変換す
るために、ＸＳＬスタイルシートをＨＴＭＬページに適
用することによって構築される。図示するように、各サ
ンプルのＲＭＬコードは、第２のサンプルが上記の通り
追加のアトミックを含んでいるので異なっている。グル
ープタグは、各グループ１３３に対して生成され、アト
ミックタグは、各アトミック１３４に対して生成され
る。すなわち、ＲＭＬは、ウェブページの目標とする内
容を捕らえ、アトミックやアトミックのグループの階層
構造内にそれを包含する。FIG. 13 shows a relational markup language (RML) generated based on the web page sample of FIG.
An example of the code 135 is shown. In particular, the RML code may be referred to as legitimate RML because the code contains content in contrast to Agnostic RML (ARML), which contains only questions about the structure of the web page but does not contain legitimate RML. Has the same relationship hierarchy as. RML code in the current paradigm is typically constructed by applying an XSL stylesheet to an HTML page to convert HTML to RML format through the selection of specific pieces of content to be targeted. As shown, the RML code for each sample is different because the second sample contains additional atomics as described above. A group tag is generated for each group 133, and an atomic tag is generated for each atomic 134. That is, RML captures the targeted content of a web page and includes it in a hierarchical structure of atomic or atomic groups.

【００４９】図１４は、各サンプルの未処理のアグノス
チックＲＭＬコード（ＡＲＭＬ）１３６の例を示す。特
にこれらは、いかなる処理であってもその以前のＡＲＭ
Ｌを表し、各アトミックは、完全指定経路を包含する。
次に、ＡＲＭＬコードは、前処理された後に、図１５及
び図１６を参照して以下で説明するように、強化されて
一般化される。ウェブページの各サンプルの強化された
ＡＲＭＬコード１３７のサンプルを図１５に示す。従っ
て、たとえウェブページが動的であって変化しても（図
１１に示す第１のサンプル１３１と第２のサンプル１３
２との間の変化など）、図１７に示すＸＳＬスタイルシ
ートがウェブページから正しい内容を抽出するように、
目標とする内容に至る経路が強化される。この例では、
内容の２つ又は３つの断片があったとしても、表の内容
の全てが抽出されるはずである。図１６は、本発明によ
る一般化されたＡＲＭＬコードを示す。図１７は、図１
６のＡＲＭＬコードに基づいて生成された、本発明によ
るウェブページサンプルの両方を正しく処理するＸＳＬ
スタイルシート１３９の例を示す。ここで、ＸＳＬジェ
ネレータ２６の２つの実施形態について更に詳細に説明
する。FIG. 14 shows an example of the raw agnostic RML code (ARML) 136 for each sample. In particular, these are the ARM
L, and each atomic contains a fully specified path.
Next, the ARML code is pre-processed and then enhanced and generalized as described below with reference to FIGS. A sample of the enhanced ARML code 137 for each sample of the web page is shown in FIG. Therefore, even if the web page is dynamic and changes (the first sample 131 and the second sample 13 shown in FIG. 11).
2), so that the XSL stylesheet shown in FIG. 17 extracts the correct content from the web page,
The route to the target content is strengthened. In this example,
Even if there are two or three pieces of content, all of the contents of the table should be extracted. FIG. 16 shows a generalized ARML code according to the present invention. FIG.
XSL that correctly processes both web page samples according to the invention, generated based on ARML code 6
An example of the style sheet 139 is shown. Here, two embodiments of the XSL generator 26 will be described in more detail.

【００５０】図１８Ａ及び図１８Ｂは、本発明によるＸ
ＳＬジェネレータ２６の２つの実施形態の更なる詳細を
示すブロック図である。特に、ＸＳＬジェネレータは、
ＲＭＬビルダーからアグノスチックＲＭＬを受け取り、
ＸＳＬスタイルシートの構築にそれを使用する。ＸＳＬ
スタイルシートは、ＨＴＭＬウェブページに基づいて自
動的に１つ又はそれ以上のカード及びデックを生成する
ために無線ページ配信システムによって使用される。無
線ページ配信システムのＸＳＬエンジン１５（カタリス
ト（登録商標）として知られている）は、後で無線装置
向けの無線ページにウェブページを再書式化するための
正統ＲＭＬにＸＨＴＭＬを変換する時、このスタイルシ
ートを参照する。本発明の１つの実施形態によれば、Ｘ
ＳＬジェネレータは、上記の通り、ユーザ入力に基づい
て自動的にスタイルシートを作成することができる。好
ましい実施形態において、ＸＳＬジェネレータは、ＲＭ
Ｌ文書内のＸＰＡＴＨを通して自動的に構文解釈し、た
とえオリジナルのウェブページが変化した時でもＸＳＬ
スタイルシートがまだ適切なアトミック又はグループを
抽出し得るように経路を一般化しようとする、以下で説
明されるＸＰＡＴＨロバスチファイアを更に含んでもよ
い。例えば、ウェブサイトが一般に各記事用のセルを持
つ表にそのトップ記事を持つ場合、ウェブサイトが表内
に更にトップ記事を追加すると、ＸＰＡＴＨロバスチフ
ァイアは、オリジナルの書式スタイルを変更し、それに
より、たとえ余分のトップ記事があっても以下で更に詳
細に説明するように、書式スタイルは、依然として正し
い内容を検索する。すなわち、内容が変わる動的ウェブ
ページでさえも、１つ又はそれ以上の無線ページ又はカ
ードを正しく生成するために、本発明に従って自動的に
処理されてもよい。FIG. 18A and FIG. 18B show the X according to the present invention.
FIG. 3 is a block diagram illustrating further details of two embodiments of the SL generator 26. In particular, the XSL generator
Receive Agnostic RML from RML builder,
Use it to build XSL stylesheets. XSL
Style sheets are used by wireless page distribution systems to automatically generate one or more cards and decks based on HTML web pages. The XSL engine 15 (also known as Catalyst®) of the wireless page distribution system, when later converting the XHTML to legitimate RML for reformatting the web page into a wireless page for the wireless device, Refer to the style sheet. According to one embodiment of the present invention, X
The SL generator can automatically create a style sheet based on user input, as described above. In a preferred embodiment, the XSL generator is RM
Automatically parse through XPATH in L documents, even if the original web page changes, XSL
It may further include an XPATH robustifier, described below, which attempts to generalize the path so that the style sheet can still extract the appropriate atomics or groups. For example, if a website generally has its top article in a table with a cell for each article, and the website adds more top articles in the table, XPATH Robustifier will change the original formatting style, Thus, even if there is an extra top article, the formatting style still searches for the correct content, as described in more detail below. That is, even dynamic web pages that change content may be automatically processed in accordance with the present invention to correctly generate one or more wireless pages or cards.

【００５１】好ましい実施形態である、図１８Ａに示す
１つの実施形態において、ＸＳＬジェネレータ２６は、
サーバ上のＣＰＵによって実行されるソフトウェア・ア
プリケーションである可能性がある１つ又はそれ以上の
モジュールを備えてもよい。ＸＳＬジェネレータ２６
は、ＸＰＡＴＨプリプロセッサモジュール１４０、ＸＰ
ＡＴＨロバスチファイア・モジュール１４２、及び、Ｘ
ＳＬ書込みモジュール１４４を含んでもよい。図１８Ｂ
に示す実施形態では、ＸＩＰＡＴＨロバスチファイア１
４２は取り除かれている。ＸＰＡＴＨロバスチファイア
が無ければ、システムは、ウェブサイトから適切な情報
を抽出するであろうＸＳＬスタイルシートを生成する
が、ウェブサイトが動的である場合、正しい情報を抽出
できない可能性がある。本発明の好ましい実施形態によ
るＸＰＡＴＨロバスチファイアを使用すれば、動的ウェ
ブページも適切に処理することができる。これは、以下
で更に詳細に説明するように、ウェブページの変化でス
タイルシートが混乱しないように、ＸＰＡＴＨロバスチ
ファイアがスタイルシートのＸＰＡＴＨを一般化しよう
とするからである。ここで、ＸＳＬジェネレータ２６の
各モジュールについて更に詳細に説明する。In one embodiment, shown in FIG. 18A, which is a preferred embodiment, XSL generator 26 includes
It may comprise one or more modules, which may be software applications executed by the CPU on the server. XSL generator 26
Is the XPATH preprocessor module 140, XP
ATH robustifier module 142 and X
An SL writing module 144 may be included. FIG. 18B
In the embodiment shown in FIG.
42 has been removed. Without the XPATH robustifier, the system would generate an XSL stylesheet that would extract the appropriate information from the website, but if the website was dynamic, it might not be able to extract the correct information. Using the XPATH robustifier according to the preferred embodiment of the present invention, dynamic web pages can also be properly handled. This is because the XPATH robustifier attempts to generalize the stylesheet XPATH so that the stylesheet is not confused by changes in the web page, as described in more detail below. Here, each module of the XSL generator 26 will be described in more detail.

【００５２】ＸＰＡＴＨプリプロセッサ１４０の役割
は、ＲＭＬ内の選択された内容に至る相対的な経路を形
成することである。ＸＨＴＭＬツリーでは、ルートノー
ドと選択された内容との間のあらゆるノードを記録する
選択された内容に至る絶対経路が使用される。しかし、
相対的な経路（ルートノードから選択された内容までの
全てのノードより少ないノードを記録する必要がある）
を使用すると、スタイルシートはかなり一般化される。
更に、相対的な経路は、図２０及び図２１を参照して以
下で説明する一般化方法に対してＸＰＡＴＨロバスチフ
ァイアにより使用されてもよい。[0052] The role of the XPATH preprocessor 140 is to form a relative path to selected content in the RML. The XHTML tree uses an absolute path to the selected content that records any nodes between the root node and the selected content. But,
Relative path (need to record less than all nodes from root node to selected content)
The use of style sheets is quite generalized.
Further, the relative path may be used by the XPATH robustifier for the generalization method described below with reference to FIGS.

【００５３】更に詳しくは、相対的経路を形成するため
に、ＸＰＡＴＨプリプロセッサは、まず、選択された内
容についてグループノードを決める。グループノード
は、グループ分けされ選択された内容の全てにリンクす
る最下位の親ノードである。次に、プリプロセッサは、
グループノードと相対的経路であるその子孫との間の経
路を形成する。図１９は、本発明によるグループノード
を識別する３つの例を示す図である。図示するように、
グループノード１５６及び選択された内容１５２を含む
ＲＭＬツリー１５４と共に、選択された内容１５２（図
では陰影をつけて示す）に至る絶対経路を伴うＸＨＴＭ
Ｌツリー１５０が示されている。第１の例では、選択さ
れた内容１５２は、ＸＨＴＭＬツリーにおいて絶対経路
「ＡＢＤＧ」と「ＡＢＤＨ」とを持ち、相対的経路がグ
ループノード「ＡＢＤ」から、選択された内容の各々
（アトミックＧとアトミックＨ）までになるように、選
択された内容の両方の断片の親であるグループノード
「ＡＢＤ」が配置される。第２の例では、選択された内
容Ｅ及びＩは、絶対経路「ＡＢＤＩ」と「ＡＢＥ」とを
持ち、それにより、図示するように、選択された内容の
親を包含するグループノードは「ＡＢ」であり、ＲＭＬ
ツリー１５４は、グループノード及び２つの選択された
内容（アトミックＤＩとアトミックＥ）を包含する。同
様に、第３の例では、選択された内容ＥとＪは、絶対経
路「ＡＢＥ」と「ＡＣＦＪ」とを持ち、図示するよう
に、選択された内容の親を包含するグループノードは
「Ａ」であり、ＲＭＬツリー１５４は、グループノード
及び２つの選択された内容（アトミックＢＥとアトミッ
クＣＦＪ）を含む。すなわち、ＸＰＡＴＨプリプロセッ
サは、選択された内容の親であるツリーの最下位グルー
プノードを決めることにより、選択された内容に至る経
路を簡素化しようとする。More specifically, to form a relative path, the XPATH preprocessor first determines a group node for the selected content. The group node is the lowest parent node that links to all of the grouped and selected content. Next, the preprocessor:
Form a path between the group node and its descendants that are relative paths. FIG. 19 is a diagram showing three examples for identifying a group node according to the present invention. As shown
XHTM with absolute path to selected content 152 (shown shaded), with group node 156 and RML tree 154 containing selected content 152
An L-tree 150 is shown. In the first example, the selected content 152 has absolute paths "ABDG" and "ABDH" in the XHTML tree, and the relative path is from each of the selected contents (atomic G and A) from the group node "ABD". A group node "ABD", which is the parent of both fragments of the selected content, is arranged so that it is up to atomic H). In the second example, the selected content E and I have the absolute paths "ABDI" and "ABE" so that the group node containing the parent of the selected content is "AB And RML
Tree 154 contains a group node and two selected contents (atomic DI and atomic E). Similarly, in the third example, the selected contents E and J have absolute paths “ABE” and “ACFJ”, and as shown, the group node including the parent of the selected contents is “A”. And the RML tree 154 includes a group node and two selected contents (atomic BE and atomic CFJ). That is, the XPATH preprocessor seeks to simplify the path to the selected content by determining the lowest group node of the tree that is the parent of the selected content.

【００５４】ＸＳＬ書込みモジュール１４４は、アグノ
スチックＲＭＬからＸＳＬスタイルシートを作成する。
ＸＳＬ書込みモジュール１４４は、これをツリーの各ノ
ード向けにテンプレートを作成することによって行う
（一般化された場合、いくつかのノードは、ツリーのい
くつかの異なる内容断片を指す可能性がある）。要素ハ
ンドラは、各テンプレートに対してコードを書き込む。
ノードが内容を含む場合、要素ハンドラは、特定の方法
で内容を扱うユーザ定義の方法を実行する必要がある。
例えば、画像表示はいくつかの方法で取り扱われてもよ
い。すなわち、画像は、可能になった装置で表示され、
それ以外の場合は、ＡＬＴ−ｔａｇテキストとして表示
される、画像は、可能になった装置で表示され、他の装
置ではＡＬＴ−ｔａｇは何も使用されない、又は、画像
は、全ての装置でＡＬＴ−ｔａｇテキストとして表示さ
れる。The XSL writing module 144 creates an XSL stylesheet from the agnostic RML.
The XSL writing module 144 does this by creating a template for each node of the tree (when generalized, some nodes may point to several different content fragments of the tree). The element handler writes code for each template.
If the node contains content, the element handler needs to perform a user-defined way of handling the content in a particular way.
For example, image display may be handled in several ways. That is, the image is displayed on the enabled device,
Otherwise, the image is displayed as ALT-tag text, the image is displayed on the enabled device, no ALT-tag is used on other devices, or the image is displayed on all devices as ALT-tag. -Displayed as tag text.

【００５５】ユーザが全ての装置でＡＬＴ−ｔａｇテキ
ストとして画像を表示させたい場合、要素ハンドラは、
その決定を実行するコードをＸＳＬで作成する。これら
の選択は、補助的な（すなわち、非属性の）好みである
と考えられる。ここで、ＸＰＡＴＨロバスチファイアで
実行される一般化方法について説明する。図２０は、本
発明による一般化方法１６０を示す流れ図である。一般
化の方法は、選択された内容に至る経路を一般化する本
発明による技法の１つの例である。別の技法は、以下で
説明するように、ロバスチファイアで具体化される。両
方の技法は、無線ページ生成システムと共に使用される
場合もあれば、使用されない場合もある。両方の技法
は、無線ページ生成システムが内容の変化する可能性が
ある動的ウェブページを処理し易いようにするために使
用してもよい。一般化とは、１つの要素の内容選択と書
式化とを他の類似の要素に適用する処理である。一般化
では、一般化の対象となる要素がＸＨＴＭＬページ内で
任意の回数だけ起こり得ることが考慮されている。一般
化は、これを考慮するために、ＸＳＬスタイルシートを
生成するユニット（ＸＳＬジェネレータという）に、類
似要素を同じ方法で処理するため類似要素にテンプレー
トを適用することを強制する。更に詳しくは、ＡＲＭＬ
における経路が一般化され、次に、ＡＲＭＬを使用して
ＸＳＬスタイルシートが作成され、ＸＳＬスタイルシー
トは、更に、ＲＭＬを作成するためにＸＨＴＭＬページ
に適用される。If the user wants to display an image as ALT-tag text on all devices, the element handler
Create the code to make that decision in XSL. These choices are considered to be auxiliary (ie, non-attribute) preferences. Here, a generalization method executed by the XPATH robustifier will be described. FIG. 20 is a flowchart illustrating the generalization method 160 according to the present invention. The generalization method is one example of a technique according to the invention for generalizing the path to a selected content. Another technique is embodied in a robustifier, as described below. Both techniques may or may not be used with the wireless page generation system. Both techniques may be used to help the wireless page generation system handle dynamic web pages with potentially changing content. Generalization is the process of applying the content selection and formatting of one element to other similar elements. Generalization takes into account that the element to be generalized can occur any number of times in an XHTML page. To take this into account, generalization forces the unit that generates the XSL stylesheet (referred to as the XSL generator) to apply a template to similar elements in order to process them in the same way. For more information, ARML
The XSL stylesheet is then created using ARML, and the XSL stylesheet is further applied to the XHTML page to create the RML.

【００５６】この文書のためにＡＲＭＬは、内容に関し
てＸＨＴＭＬに質問する経路を持ち、ＲＭＬは、質問中
にアクセスされた実際の内容を包含する。本開示のため
に「前処理」又は「プリプロセッサ」という用語は、Ａ
ＲＭＬの葉に記憶された絶対経路を取り、それらをＡＲ
ＭＬ内のグループ及びアトミックの階層に基づいて相対
化するソフトウェア・モジュールを意味する。一般化処
理は、実際の経路が一度に１つより多いノードに適合で
きるように（ページ上の変化する類似品目数を取り扱う
のに役立つ）、ＸＰａｔｈを使用して実際の経路を変更
する遥かに複雑な処理である。For this document, ARML has a path to query XHTML for content, and RML contains the actual content accessed during the query. For the purposes of this disclosure, the term “pre-processing” or “preprocessor” refers to A
Take the absolute paths stored in the RML leaves and AR them
A software module that is relativized based on the group and atomic hierarchy in the ML. The generalization process uses XPath to modify the actual path so that the actual path can fit more than one node at a time (helping to handle the changing number of similar items on a page). It is a complicated process.

【００５７】一般化方法１６０は、ユーザ入力と自動計
算との組み合わせを伴う場合がある。この方法では、段
階１６２で、ユーザが数が動的に変化する可能性がある
種類のグループ又はアトミック（グループ及びアトミッ
クは説明されない）の例を選択した後に、他の類似のア
トミック又はグループを選択するか、又は、取り除く。
また、一般化すべきＸＨＴＭＬページの内容の量を調整
して、他のアトミック又はグループを自動的に選択及び
作成することができる。例えば、ユーザは、新しく選択
された内容からある特別の要素を取り除くか、又は、内
容選択を多く又は少なくするために段階１６４でＸＨＴ
ＭＬツリーを更に上又は下に移動するかを選んでもよ
い。ユーザは、選択内容を閲覧し、変更内容を承認する
か、又は、更に入力を与える。次に、選択された内容の
最終的な量から、１セットの一般化されたアトミック又
はグループが形成される。本方法は、どの要素もＲＭＬ
ノードに付随し、ＲＭＬノードもまたＸＨＴＭＬツリー
に相当するノードを持つようなツリーノードに基づいて
いる。どのＸＨＴＭＬノードも、サブツリー（親ノード
の全ての子孫ノードのセット）の親である。ユーザは、
運行ボタンを使用して段階１６４でＸＨＴＭＬツリーを
上下に移動することにより、より広いか又は狭いＸＨＴ
ＭＬサブツリーを選択してＲＭＬノードを表す。ツリー
の上方にあるＸＨＴＭＬノードは、より大きなサブツリ
ーの親であり、より多くの内容を備える。段階１６６に
おいて、ユーザが更に多くの内容を選択したか判断さ
れ、更に多くの内容が一般化されている場合、本方法
は、段階１６２にループして戻る。それ以上一般化する
内容がない場合、一般化のための一般ＸＰＡＴＨ表式が
段階１６８で計算される。そして本方法は完了する。The generalization method 160 may involve a combination of user input and automatic calculations. In this method, at step 162, the user selects an example of a type of group or atomic (groups and atomics are not described) whose number may change dynamically, and then selects another similar atomic or group. Do or remove.
Also, the amount of XHTML page content to be generalized can be adjusted to automatically select and create other atomics or groups. For example, the user may remove some special elements from the newly selected content, or XHT at step 164 to increase or decrease the content selection.
You may choose to move the ML tree further up or down. The user views the selection and approves the change or provides further input. Next, a set of generalized atomics or groups is formed from the final quantities of the selected content. The method uses RML
Associated with the nodes, the RML nodes are also based on tree nodes with nodes corresponding to the XHTML tree. Every XHTML node is the parent of a subtree (the set of all descendant nodes of a parent node). The user
Move the XHTML tree up and down in step 164 using the navigation button to create a wider or narrower XHT
Select the ML subtree to represent the RML node. XHTML nodes above the tree are the parents of the larger subtree and have more content. At step 166, it is determined whether the user has selected more content, and if more content has been generalized, the method loops back to step 162. If there is no more generalized content, a general XPATH expression for generalization is calculated in step 168. Then the method is complete.

【００５８】図２１は、本発明による一般化方法の例を
示す図である。特に、ユーザがＸＨＴＭＬツリーで
「Ｄ」ノード（図では陰影付き）を表すアトミックを選
択したと仮定する（図のセクションＩを参照された
い）。この「Ｄ」ノードは、ＸＨＴＭＬ構造ではその親
として「Ｃ」タグを持つ。「上向き矢印」ボタンを押す
ことによって、ユーザは、「Ｃ」タグに１つのレベルだ
けアトミック経路を上に移動することができる。これに
より、アトミックは、「Ｃ」タグより下の全てがアトミ
ックの子になるグループに変換される（セクションＩＩ
を参照されたい）。「Ｃ」タグがその下に数個の「Ｄ」
タグを持つ場合、全ての「Ｄ」タグがアトミックに変換
され「一般化」と標記されることになる。本発明によれ
ば、「Ｃ」より下のいかなる数のアトミック「Ｄ」も検
索されるように経路が「Ｃ」に向けられるので、本方法
は、子の数の変化を取り扱える（セクションＩＩＩを参
照されたい）。例えば、「Ｃ」がニュースウェブページ
のトップ記事を表し、「Ｄ」が各トップ記事を表す場
合、ウェブサイト内に更なるトップ記事が追加されても
やはり検索される。更に、一般化されたノード「Ｃ」
は、新たに挿入された目標としない子を無視してもよい
（セクションＩＶを参照されたい）。同様の方法を使用
して、グループの一般化も処理することができる。ここ
で、ロバスチファイアについて更に詳細に説明する。FIG. 21 is a diagram showing an example of the generalization method according to the present invention. In particular, assume that the user has selected an atomic in the XHTML tree that represents a "D" node (shown shaded) (see section I of the figure). This "D" node has a "C" tag as its parent in the XHTML structure. By pressing the "up arrow" button, the user can move up the atomic path one level to the "C" tag. This converts the atomic into a group where everything below the “C” tag is a child of the atomic (Section II).
Please refer to). The "C" tag has several "D" under it
If there are tags, all “D” tags will be converted atomically and marked as “generalized”. In accordance with the present invention, the method can handle changes in the number of children (see Section III), since the path is directed to "C" so that any number of atomic "D" s below "C" are searched. Please see). For example, if "C" represents the top stories of a news web page and "D" represents each top story, it will still be searched for even if more top stories are added to the website. Furthermore, the generalized node "C"
May ignore newly inserted non-target children (see section IV). A similar method can be used to handle group generalization. Here, the robustifier will be described in more detail.

【００５９】図２２は、本発明によるＸＰＡＴＨロバス
チファイア１４２の更なる詳細を示すブロック図であ
る。一般に、ロバスチファイアは、階層構造を通る経路
を一般化し、それにより、たとえ下の階層構造又は階層
構造内の内容が変化した場合でも、検索中に内容を階層
構造内でまだ捜し当てることができる。ＨＴＭＬ又はＸ
ＭＬ構造内への質問のためのＸＳＬコード生成という関
連において、ロバスチファイアは、ＸＰＡＴＨロバスチ
ファイアと呼ばれてもよい。すなわち、ロバスチファイ
アは、たとえ新しいＨＴＭＬ又はＸＭＬノードが質問の
対象となる構造に挿入された後でも依然として有効な、
選択された内容に至る経路を作成する。これは、ＸＰａ
ｔｈノード選択をできるだけ非特定的にすることによっ
て達成される。ロバスチファイアは、すなわち、ＸＨＴ
ＭＬサブツリー又はＸＨＴＭＬ構造全体の全ての類似ノ
ードと比較すると、選択されたＸＨＴＭＬノードに特有
の情報を検索する。この種の情報に従ってノードを適合
させることによって、同じ独自の内容セットに対してよ
り特定的でない経路を作成する。FIG. 22 is a block diagram showing further details of the XPATH robustifier 142 according to the present invention. In general, robustifiers generalize paths through a hierarchical structure, so that content can still be located in the hierarchical structure during a search, even if the underlying hierarchical structure or the contents within the hierarchical structure have changed. . HTML or X
In the context of generating XSL code for interrogation in an ML structure, a robustifier may be referred to as an XPATH robustifier. That is, the robustifier is still valid even after a new HTML or XML node has been inserted into the structure being queried.
Create a route to the selected content. This is XPa
This is achieved by making the th node selection as non-specific as possible. Robustifiers are: XHT
When compared to all similar nodes in the ML subtree or the entire XHTML structure, it retrieves information specific to the selected XHTML node. By adapting nodes according to this kind of information, a less specific path is created for the same unique set of content.

【００６０】図２３は、このアプローチの必要性を示
す、動的プログラム構造に記憶された内容に至る経路の
例を示す図である。この例では、ＸＨＴＭＬツリーで陰
影つきのノードとして示される２つのアトミック１８０
及び１８２がグループ内への配置用に選択されている。
両方のアトミックに至る経路は、ＸＨＴＭＬの「Ａ」ノ
ードから始まる。その点以降は２つの経路が分岐して、
経路「Ｂ」と経路「ＣＤＥＦ」とには個々のアトミック
（セクションＩに示す）が見つかる。セクションＩＩで
は、ウェブページが、つまりＸＨＴＭＬの構造が変化
し、その構造変化のために「Ｘ」ノード１８４がその構
造内に導入されている。伝統的な「Ｂ」及び「ＣＤＥ
Ｆ」経路はもはや有効ではないが、それでも、強化され
た経路の記述によって選択された内容を捜し当てること
ができ、それにより、動的ウェブページによる内容抽出
に対する妨害がない。この例では、強化された経路は、
「ｄｅｓｃｅｎｄａｎｔ：：Ｂ」、及び、「ｄｅｓｃｅ
ｎｄａｎｔ：：Ｅ／ｄｅｓｃｅｎｄａｎｔ：：Ｆ／」で
ある。FIG. 23 is a diagram showing an example of the path to the contents stored in the dynamic program structure, showing the necessity of this approach. In this example, two atomic 180s are shown as shaded nodes in the XHTML tree.
And 182 have been selected for placement in the group.
The path to both atomics starts at the XHTML "A" node. After that point, the two paths will branch off,
Individual atomics (shown in section I) are found in path "B" and path "CDEF". In Section II, the structure of the web page, ie, XHTML, has changed, and an "X" node 184 has been introduced into the structure due to the structure change. Traditional "B" and "CDE"
The "F" path is no longer valid, but the selected content can still be located by the enhanced path description, so that there is no hindrance to content extraction by dynamic web pages. In this example, the enhanced route is
“Descendant :: B” and “desce
ndant :: E / descendant :: F / ".

【００６１】図２２に戻って参照すると、ＸＰＡＴＨロ
バスチファイアは、１つ又はそれ以上のソフトウェア・
アプリケーション又はモジュールを使用して実行される
１つ又はそれ以上のモジュールを備えることができる。
該モジュールは、比較モジュール１７０、転換点ノード
識別子モジュール１７２、及び、軸線検証子１７４とも
呼ばれる子孫軸線の使用の可否を判断するモジュールを
備えてもよい。これらのモジュールは、たとえ動的ウェ
ブページの場合のように階層的データ記憶の構造が変化
しても、動的ウェブページの適切な内容が捜し当てられ
るように選択された内容に至る強化経路を自動的に判断
するように機能する。Referring back to FIG. 22, the XPATH robustifier includes one or more software
One or more modules can be provided that are executed using an application or module.
The module may include a comparison module 170, a turning point node identifier module 172, and a module for determining whether to use a descendant axis, also called an axis verifier 174. These modules provide an enhanced path to the selected content so that the appropriate content of the dynamic web page can be located, even if the structure of the hierarchical data storage changes, as in the case of dynamic web pages. It works to determine automatically.

【００６２】この強化処理は、上記のモジュールによっ
て実行されるいくつかの個別の方法に分割することがで
きる。一般に、本方法は、関連あるノードを識別するた
めに１セットのＸＨＴＭＬノードが互いにいかに類似し
ているか、特に、目標とする内容を含むノードがＸＨＴ
ＭＬツリー又はサブツリーの同じ種類の他のノードとい
かに異なるかを判断する比較方法、ページをより小さい
領域に効果的に仕切ることによって検索を向上させるの
に役立つ、目標とする内容を包含するノードの親「転換
点」ノードの発見方法、及び、「ｄｅｓｃｅｎｄａｎ
ｔ：：」軸線を転換点ノードの前に置くことができるか
否かの判断方法を含む。This enrichment process can be divided into several individual methods performed by the modules described above. In general, the method determines how similar a set of XHTML nodes are to each other to identify relevant nodes, in particular, if the node containing the target content is XHT
A comparison method that determines how it differs from other nodes of the same type in the ML tree or sub-tree, the number of nodes containing target content that helps to improve the search by effectively partitioning the page into smaller areas. How to find parent "turning point" node and "descendan"
The method includes determining whether the t :: "axis can be placed before the turning point node.

【００６３】これらの方法の各々について、以下で更に
詳細に説明する。強化処理の目的のために、ＸＨＴＭＬ
文書の構造が動的ウェブページによってどのように変化
することになるかについていくつかの仮定がなされてい
る。第１の仮定は、転換点ノード、内容包含ノード、関
係グループノードなどの特定ノードは、種類を変更しな
いか、又は、ページから取り除かれないということであ
る（関係グループノードは、ＲＭＬグループノードの全
ての子を包含する最小のサブツリーを持つＸＨＴＭＬツ
リー内のノードである）。第２の仮定は、これらのノー
ドは、何がツリー内の何の子孫であるか関してオリジナ
ルの関係を保持することになるということである。最後
の仮定は、１つのＸＨＴＭＬページだけが本処理の入力
として与えられるが、本方法の他の具体化においては、
強化処理及び一般化処理中に使用される情報の有効性を
判断し易いように、追加入力として多数のＸＨＴＭＬペ
ージを含むことができるということである。それらの仮
定の下で、ロバスチファイアは、様々な動的ウェブペー
ジが本発明に従って動的ウェブページから生成される無
線ページを持ち得るように、ＸＨＴＭＬツリー構造に起
こり得る他のいかなる変化に対しても機能する経路を作
成することになる。例として、ウェブサイトは、各記事
が表のセルの中に含まれるような１つ又はそれ以上のニ
ュース記事を持ってもよい。ウェブサイトが表に別のセ
ルを作成することによって別の記事を追加する場合、ア
トミックの種類、つまり表が変化しなかったために、一
般化経路によってその追加記事が突き止められて抽出さ
れることになる。Each of these methods is described in further detail below. For the purpose of enhanced processing, XHTML
Some assumptions are made about how the structure of a document will change with dynamic web pages. The first assumption is that certain nodes, such as turning point nodes, content containing nodes, relation group nodes, etc., do not change type or are not removed from the page (relation group nodes are RML group nodes). Node in the XHTML tree with the smallest subtree that contains all children). The second assumption is that these nodes will retain the original relationship as to what is what descendant in the tree. The last assumption is that only one XHTML page is given as input to the process, but in another embodiment of the method,
Additional input may include multiple XHTML pages to help determine the validity of the information used during the enrichment and generalization processes. Under these assumptions, the Robustifier will respond to any other possible changes to the XHTML tree structure such that various dynamic web pages may have wireless pages generated from the dynamic web pages according to the present invention. Will create a working path. By way of example, a website may have one or more news articles such that each article is contained within a table cell. If a website adds another article by creating another cell in the table, the generalized path will identify and extract the additional article because of the atomic type, that is, the table has not changed. Become.

【００６４】ロバスチファイアの他の実施形態におい
て、この処理を再帰的にしてもよい。好ましい実施形態
においては、１つの転換点ノードがある可能性があるだ
けである。しかし、一旦転換点ノードが見つかると、関
係グループノードから転換点ノードまでの経路は、次に
同様に強化できる別のサブ経路である。更に、転換点ノ
ードに対して述語を見つけることができ、オリジナルの
転換点ノードと関係グループノードとの間で別の転換点
ノードを見つけることができる。次に、新しい転換点ノ
ードに対して本処理を繰り返すことができ、以下同様で
ある。この種の再帰を可能にすることによって、稀な状
況を除き、内容に至る完全特定経路を全く使用する必要
がなくなる可能性が高くなる。In another embodiment of the robustifier, the process may be recursive. In the preferred embodiment, there may only be one turning point node. However, once a turning point node is found, the path from the relationship group node to the turning point node is another subpath that can then be similarly enhanced. Further, a predicate can be found for the turning point node, and another turning point node can be found between the original turning point node and the relation group node. The process can then be repeated for a new turning point node, and so on. Allowing this type of recursion increases the likelihood that there is no need to use a completely specific path to the content, except in rare circumstances.

【００６５】ある種の結果を優先するように強化処理全
体は組織されている。最も好ましい結果は、その局所的
特性だけでノードが識別できることである。これは、ま
ず、以下で説明する比較方法を用いて試験される。これ
が失敗した場合、そのノードが特定サブツリーのルート
ノードが転換点ノードである関係グループのサブツリー
の特定サブツリー内にあることを条件として、そのノー
ドが局所的特性によって識別できるか確かめるためにそ
のノードが試験される。それが失敗した場合には、アル
ゴリズムの将来的な再帰的バージョンでは、関係グルー
プのサブツリーの別の特定サブツリーの内側のツリーの
特定サブツリーを見つけるように試みられることにな
り、その場合、１つより多い転換点ノードを見つけるこ
とによって内容を一意的に識別することができ、以下同
様に繰り返すことができる。この全てが失敗した場合、
唯一の可能性は、内容に至る経路を完全に特定すること
である。ここで、本発明によるノードの比較方法につい
て説明する。The entire enhancement process is organized so that certain results are prioritized. The most favorable result is that a node can be identified only by its local characteristics. This is first tested using the comparison method described below. If this fails, the node is checked to see if it can be identified by local characteristics, provided that the node is in a particular subtree of the subtree of the relationship group where the root node of the particular subtree is a turning point node. To be tested. If that fails, a future recursive version of the algorithm will attempt to find a particular subtree of the tree inside another particular subtree of the relationship group subtree, in which case more than one By finding many turning points nodes, the content can be uniquely identified, and so on. If all this fails,
The only possibility is to completely specify the path to the content. Here, a node comparison method according to the present invention will be described.

【００６６】図２４Ａ及び図２４Ｂは、関連あるノード
を識別する本発明によるノード比較方法１９０を示す流
れ図である。特に、ロバスチファイア及び比較方法によ
って、類似の種類であるが目標とする以外の内容と目標
とする内容とを区別する方法が判断される必要がある。
例えば、図２３では、「Ｆ」ノードの一方は目標とする
ものであるが、他方はそうではなく、確実に強化処理で
適切なノードが抽出されるように、２つのノードを区別
する方法を判断することが重要である。従って、目標と
する内容と目標とする以外の内容を区別するために、ロ
バスチファイアは、前記で目標とするノードと称したも
のである「関連あるノード」を選択して形成する。関連
あるノードは、関連あるノードを他の全ての「不適合」
ノードと区別するために、特定経路の代わりに使用でき
る１セットの指定子によって形成され、該指定子は、ノ
ードがどの属性又は子を持つかなどの特性を含み得る。
これらの指定子によって、ロバスチファイアは、親ノー
ドのサブツリー内の潜在的不適合の各々を関連あるノー
ドと比較することができ、その指定子については、以下
で更に詳細に説明する。FIGS. 24A and 24B are flowcharts illustrating a node comparison method 190 according to the present invention for identifying relevant nodes. In particular, it is necessary to determine a method of distinguishing the target content from the similar type but other than the target by the robustifier and the comparison method.
For example, in FIG. 23, one of the “F” nodes is the target, but the other is not, and a method of distinguishing the two nodes so as to ensure that the appropriate node is extracted by the enhancement processing. It is important to judge. Therefore, in order to distinguish between the target content and the content other than the target, the robustifier selects and forms a “related node”, which is referred to as the target node above. Relevant node sets related node to all other "non-conforming"
In order to distinguish it from a node, it is formed by a set of specifiers that can be used instead of a specific route, and the specifiers may include properties such as which attributes or children the node has.
These specifiers allow the robustifier to compare each of the potential mismatches in the parent node's subtree with the relevant nodes, which are described in further detail below.

【００６７】比較方法では、その比較の基準として、ノ
ードに関して以下の情報が使用される。その情報とは、
ノードの兄弟、ノードの子孫、ノードの直系の子、ノー
ドの属性、及び、その兄弟間でのノードの位置である。
この情報セットは、その直系の親、及び、子ノードの属
性など、他の情報を含めるように容易に変更することが
できる。比較に使用される実際の情報セットは、必ずし
も本方法の基本的様態ではなく、変わってもよい。本方
法で必要なのは、比較の基準として使用される１セット
の情報があることだけであり、この情報は、必要性の変
化に応じて容易に変えることができる。In the comparison method, the following information regarding the node is used as a reference for the comparison. The information is
The siblings of the node, the descendants of the node, the immediate children of the node, the attributes of the node, and the position of the node between the siblings.
This information set can be easily modified to include other information, such as the attributes of its immediate parent and child nodes. The actual information set used for the comparison is not necessarily a fundamental aspect of the method and may vary. All that is required in the method is that there is a set of information that is used as a basis for comparison, and this information can be easily changed as needs change.

【００６８】一般に、比較方法は、何らかの内容を含む
ノードである場合が多いが関係グループノードであって
もよい関連あるノードについて何が独特であるか判断し
ようとする。ＸＨＴＭＬＩｎｆｏｒｍｅｒという特定ク
ラスは、ノードに関する情報を含むために使用され、こ
のクラスは、比較を行うためにも使用される。例えば、
ちょうど数学の集合のように、２つのＸＨＴＭＬＩｎｆ
ｏｒｍｅｒを交差させることができ、２つのノードの間
に共有される情報だけを包含するＸＨＴＭＬＩｎｆｏｒ
ｍｅｒを戻す。他の操作には、差分及び和集合があり、
各々、一方のノードにあるが他方にはないもの、又は、
どちらのノードにもあるものを戻す。各々の関連あるＸ
ＨＴＭＬノードは、本発明が記述される現行パラダイム
のＲＭＬノードによって表され、各ＲＭＬノードは、親
ノードを持つ（ルート＜ｒｍｌ＞ノードを除く）。前処
理方法によって、ＲＭＬノードの親によって表されるＸ
ＨＴＭＬノードが、実際にそのＲＭＬノードのＸＨＴＭ
ＬノードのＸＨＴＭＬツリーにおける祖先であることが
保証される。In general, the comparison method seeks to determine what is unique about a related node, which is often a node containing some content but may be a relationship group node. A particular class, XHTMLInformer, is used to contain information about nodes, and this class is also used to make comparisons. For example,
Just like a set of mathematics, two XHTMLInf
XHTMLInform, which can intersect the orderer and contains only the information shared between the two nodes
Return mer. Other operations include difference and union,
Each at one node but not at the other, or
Returns what is on both nodes. Each relevant X
HTML nodes are represented by RML nodes in the current paradigm in which the present invention is described, and each RML node has a parent node (except for the root <rml> node). The X represented by the parent of the RML node by the preprocessing method
The HTML node is actually the XHTML of the RML node
L nodes are guaranteed to be ancestors in the XHTML tree.

【００６９】本方法は、段階１９２において、関連ある
ＲＭＬノードのＲＭＬ親に対応するＸＨＴＭＬノードの
サブツリー全体を使用する検索で始まる。段階１９４で
このサブツリーを横断し、段階１９６で、関連あるＲＭ
Ｌノードに対応する同種のＸＨＴＭＬノードである全て
のＸＨＴＭＬノードを見つける。横断中に見つかった各
ノードは、段階１９８で差分操作を使用して関連あるＸ
ＨＴＭＬノードと比較される。差分操作によって、各々
の見つけられたノードに関して、なぜ関連あるＸＨＴＭ
Ｌノードと異なるのかが解明される。差分の結果は、段
階２００で、関連あるＸＨＴＭＬノードにあって他のノ
ードにはない物を包含するＸＨＴＭＬＩｎｆｏｒｍｅｒ
に記憶される。本方法は、次に段階２０２で、捜し当て
られたノードが更にあるか判断し、他の捜し当てられた
ノードを処理するために段階１９８に戻る。ＸＨＴＭＬ
Ｉｎｆｏｒｍｅｒを生成するために全てのノードが処理
された場合、本方法は続行する。The method begins at step 192 with a search using the entire subtree of XHTML nodes corresponding to the RML parent of the relevant RML node. In step 194, the subtree is traversed, and in step 196, the relevant RM
Find all XHTML nodes that are the same kind of XHTML node corresponding to the L node. Each node found during the traversal is associated with an associated X using a difference operation at step 198.
Compared to HTML node. By difference operation, for each found node, why the relevant XHTM
It is clarified whether it is different from the L node. The result of the difference is obtained in step 200 by an XHTMLInformer that includes those at the relevant XHTML node but not at other nodes.
Is stored. The method then determines, at step 202, if there are more located nodes and returns to step 198 to process other located nodes. XHTML
If all nodes have been processed to generate the Informer, the method continues.

【００７０】上記の処理の結果、ＸＨＴＭＬＩｎｆｏｒ
ｍｅｒのリストがもたらされ、各ＸＨＴＭＬＩｎｆｏｒ
ｍｅｒは、ツリーの他の特定ノードとの差異をもたらす
関連あるノードに関する情報を包含する。段階２０４で
は、これらのＸＨＴＭＬＩｎｆｏｒｍｅｒの全てを交差
させて、関連あるノードを他の全てのノードと異ならせ
るものを記述する１セットの情報（別のＸＨＴＭＬＩｎ
ｆｏｒｍｅｒに記憶）が得られる。段階２０６ではこれ
を交差試験と呼び、すなわち、この比較終了時のＸＨＴ
ＭＬＩｎｆｏｒｍｅｒが空でない場合、段階２０８にお
いて、関連あるノードを一意的なものにする何かがある
ことになる。次に、この情報は、ＸＰａｔｈ表式内で一
意的にこのノードを指定するために、段階２０８の述語
に置くことができる。従って、この試験が成功すれば、
段階２１０で経路全体を「ｄｅｓｃｅｎｄａｎｔ：：ノ
ード（情報）」と置き換えることができ、その場合、そ
の情報とは、段階２０８において比較終了時のＸＨＴＭ
ＬＩｎｆｏｒｍｅｒに包含されているものであり、強化
方法は、この関連ある特定ノードの処理を終了する。As a result of the above processing, XHTMLInfo
A list of mers is provided, and each XHTMLInfo
mer contains information about relevant nodes that make a difference from other specific nodes in the tree. In step 204, all of these XHTMLInformers are crossed to set a set of information (another XHTMLInformer) that describes what makes the relevant node different from all other nodes.
(stored in the former). In step 206, this is called a cross test, ie, the XHT at the end of this comparison.
If the MLInformer is not empty, there will be something in step 208 that makes the relevant node unique. This information can then be placed in the predicate of step 208 to uniquely identify this node in the XPath expression. Therefore, if this test is successful,
In step 210, the entire route can be replaced with "descendant :: node (information)", in which case the XHTM at the end of the comparison in step 208
This is included in the LInformer, and the enhancement method ends the processing of the relevant specific node.

【００７１】段階２０６で交差試験が失敗した場合、関
連あるノードに一意的な単一の情報断片がないことにな
る。しかし、関連あるノードを一部のノードと区別する
１片の情報と、関連あるノードを残りのノードと区別す
る別の１片の情報とがあり得る。例として、関連あるノ
ードで差分処理される４つのノードがあると仮定する。
差分は、以下の４セットである可能性がある。すなわ
ち、（ＡＢＣ）、（ＢＣＤ）、（ＣＤＡ）、及び、（Ｄ
ＡＢ）である。尚、これらの４セットの交差は零集合で
あるが、集合の和集合は（ＡＢＣＤ）である。現在のノ
ードには（ＡＢＣＤ）の全てがあるが、他のノードのい
ずれもこれらの情報の全てを持っていないことに注意し
て、ノードを一意的に指定するために和集合（ＡＢＣ
Ｄ）を使用することができる。これを和集合試験といい
（段階２１２）、差分のいずれも空ではない場合に限り
行われる（差分が空であるとは、関連あるノードで差分
処理されたノードが、関連あるノードと区別できないこ
とを意味する）。If the cross test fails in step 206, there is no single unique piece of information at the relevant node. However, there can be one piece of information that distinguishes a relevant node from some nodes and another piece of information that distinguishes a relevant node from the rest. As an example, suppose that there are four nodes that are differentially processed at relevant nodes.
The differences can be the following four sets: That is, (ABC), (BCD), (CDA), and (D
AB). The intersection of these four sets is a zero set, but the union of the sets is (ABCD). Note that the current node has all of (ABCD), but none of the other nodes have all of this information, and to uniquely specify the node, the union (ABC)
D) can be used. This is called a union test (step 212), and is performed only when none of the differences is empty (a difference is empty if a node subjected to difference processing by a related node cannot be distinguished from a related node) That means).

【００７２】和集合試験が成功した場合（段階２１
４）、本方法は、経路が取り替えられる段階２０８及び
２１０に戻る。要約すると、差分試験又は和集合試験が
成功した場合、相対的な経路が必要ないことのみなら
ず、そのノードに関して特有なものが何かを判断したこ
とになる。その情報は、ＸＰａｔｈ表式で述語として直
接使用することができる。両方の試験が失敗した場合、
ノードを一意的に識別する他の属に関する指定が何もな
いので、相対的経路を関連あるノードに使用する必要が
ある。ここで、本発明による転換点ノード識別の方法に
ついて説明する。If the union test is successful (step 21)
4) The method returns to steps 208 and 210 where the path is replaced. In summary, if the difference test or union test is successful, not only is there no need for a relative path, but it also has determined what is unique about that node. That information can be used directly as a predicate in an XPath expression. If both tests fail,
Since there is no designation for any other genus that uniquely identifies the node, the relative path must be used for the relevant node. Here, a method of turning point node identification according to the present invention will be described.

【００７３】図２５Ａ及び図２５Ｂは、本発明による転
換点ノード識別方法２２０を示す流れ図である。特に、
ノードの指定子リストが大き過ぎるか、又は小さ過ぎる
場合、ロバスチファイアは、選択された内容に至る正し
い経路を識別し易いように比較方法よりも別の手法を使
用する必要がある。転換点ノードの発見は、そのような
識別方法のうちの１つである。転換点ノードは、ツリー
における内容に至る経路の重要な構成要素、又は、「最
重要転換点」として識別されたノードとして形成され
る。例えば、図２３では、一方を他方から区別する指定
子がない「Ｆ」ノードが２つある。しかし、経路が
「Ｅ」ノードを通過する必要がある場合、「Ｆ」ノード
選択セットは、１つの目標とする「Ｆ」ノードに狭めら
れる。従って、「Ｅ」ノードは、不要なノードを避けな
がら目標とするノードを識別するのに使用することがで
きる点において、ツリーにおける最重要転換点である。FIGS. 25A and 25B are flowcharts illustrating a turning point node identification method 220 according to the present invention. In particular,
If the node's specifier list is too large or too small, the robustifier must use a different approach than the comparison method to help identify the correct path to the selected content. Turning point node discovery is one such method of identification. Turning points nodes are formed as important components of the path to the content in the tree, or as nodes identified as "most important turning points". For example, in FIG. 23, there are two “F” nodes without a specifier that distinguishes one from the other. However, if the path needs to go through an "E" node, the "F" node selection set is narrowed to one target "F" node. Thus, the "E" node is the most important turning point in the tree in that it can be used to identify target nodes while avoiding unnecessary nodes.

【００７４】転換点ノードを識別することによって、ペ
ージ上の潜在的不適合を含む領域全体を除外することが
できる。例えば、図２６に示すように、ページ２１７
は、表の行列として呈示されている。ＸＨＴＭＬにおい
て、表は、内容を置くことができる領域にページを仕切
るために、表の行タグと表の列タグとを使用する。表の
特定の行及び列によって内容を識別する能力は、ＸＨＴ
ＭＬにおいて特定ノードを転換点として識別するのと本
質的に同等である。転換点のサブツリーに焦点を当てる
ことによって、ロバスチファイアは、与えられたページ
の特定領域の検索だけを行いながら選択された内容を捜
し当てることができる。すなわち、図２６のように転換
点ノード２１８及び転換点グループ２１９が示される。By identifying turning point nodes, the entire region of the page containing potential mismatches can be excluded. For example, as shown in FIG.
Are presented as a matrix in a table. In XHTML, tables use table row tags and table column tags to divide pages into areas where content can be placed. The ability to identify content by specific rows and columns of a table is an XHT
This is essentially equivalent to identifying a specific node as a turning point in the ML. By focusing on the turning point subtree, the robustifier can locate the selected content while only searching a particular area of a given page. That is, a turning point node 218 and a turning point group 219 are shown in FIG.

【００７５】図２５Ａ及び図２５Ｂに戻って参照する
と、比較方法の和集合試験が失敗した場合、転換点識別
方法を使用して、相対的経路沿いのどこかの転換点ノー
ドを捜し当てることができる。転換点方法によって、検
索間隔が小さくなり、また、関連あるノードに関して一
意的なものを発見する可能性が大きくなることが期待さ
れる。本方法においては、サブツリーの最上部から、つ
まり、段階２２２で関連あるＲＭＬノードで記憶された
相対的経路の先端のＸＨＴＭＬノードから開始された。
サブツリーは、ツリーにおいて次の親に対してレベル１
つだけ下に移動すれば小さくすることができ、その後、
比較アルゴリズムの各段階に相当する段階２２４から段
階２４４に示すように、交差／和集合試験段階を実行す
る処理全体は繰り返すことができ、その処理全体の説明
はここでは省略される。Referring back to FIGS. 25A and 25B, if the union test of the comparison method fails, the turning point identification method can be used to find a turning point node somewhere along the relative path. . The turning point method is expected to reduce the search interval and increase the likelihood of finding a unique one for the relevant node. The method started from the top of the subtree, i.e., the XHTML node at the head of the relative path stored at step 222 with the associated RML node.
Subtree is level 1 relative to the next parent in the tree
Move it down by one to make it smaller,
As shown in steps 224 to 244 corresponding to each step of the comparison algorithm, the entire process of performing the intersection / union test step can be repeated, and the description of the entire process is omitted here.

【００７６】交差／和集合試験がこの小さくなったサブ
ツリーでいきなり成功する場合、段階２４６で、次の親
が転換点ノードと指定される。転換点ノードは、関連あ
るノードを大きなサブツリー内から一意的に指定できな
い時に一意的に指定できる小さいサブツリーを指定する
ので重要となる。転換点が見つからない、すなわち、交
差/和集合試験でいかなるサブツリー内からも関連ある
ノードに関する特定なものを見つけるのに一度も成功し
ない場合、本方法によって、段階２４７において経路に
別のノードがあるかどうか判断され、経路の次のノード
を処理するために段階２２４にループして戻る。目標と
するノードに至る経路で転換点ノードが見つからない場
合、強化処理は失敗するので、段階２４８で、関連ある
ノードに至る完全特定経路を使用する必要がある。ここ
で、転換点ノードに関してＸＰａｔｈ表式にｄｅｓｃｅ
ｎｄａｎｔ：：軸線を有効に適用できるかどうか識別す
る手法について説明する。If the intersection / union test succeeds abruptly on this reduced subtree, then at step 246, the next parent is designated as a turning point node. The turning point node is important because it specifies a small subtree that can be uniquely specified when a related node cannot be uniquely specified from within a large subtree. If no turning point is found, i.e. the intersection / union test has never succeeded in finding a particular for the relevant node from within any subtree, the method causes another node in the path at step 247. If so, the process loops back to step 224 to process the next node on the path. If no turning point node is found on the path to the target node, the enrichment process fails, and in step 248 it is necessary to use a completely specific path to the relevant node. Here, regarding the turning point node, the descend
A method for identifying whether or not ndant :: axis can be effectively applied will be described.

【００７７】ＸＰＡＴＨ表式によって、単なるノード名
以外の経路情報が可能となるので、目標とする内容を識
別し易いように軸線や述語を使用することができる。軸
線は、ノードとの関係に基づいて、ツリーのどこで現在
の内容ノードを探したらよいかを定める。一般的な軸線
は、「ｄｅｓｃｅｎｄａｎｔ：：」、「ｓｉｂｌｉｎ
ｇ：：」、「ａｎｃｅｓｔｏｒ：：」、及び、「ｐａｒ
ｅｎｔ：：」である。例えば、経路が「Ａ／Ｂ／ｓｉｂ
ｌｉｎｇ：：Ｃ」である場合、経路は「Ａ」から始ま
り、「Ｂ」に移動し、「Ｂ」の兄弟全てを見て「Ｃ」を
見つける。図２３では、現在のノードは「Ａ」である。
「Ｂ」ノードは、経路「ｄｅｓｃｅｎｄａｎｔ：：Ｂ」
によって見つけることができ（ケースＩ及びケースＩＩ
の両方において）、その場合、直系の子孫（子ノード）
かツリーの更に下にあるかを問わず、ノード「Ａ」より
下の任意の「Ｂ」ノードが選択されることになる。同様
に、目標とする「Ｆ」ノードに至る経路を「ｄｅｓｃｅ
ｎｄａｎｔ：：Ｅ／ｄｅｓｃｅｎｄａｎｔ：：Ｆ」と書
き込むことができるであろう。この経路は、「Ｅ」ノー
ドである「Ａ」ノードの子孫を見つけて、次に、「Ｆ」
ノードである「Ｅ」ノードの子孫を見つける。そのよう
にして、「転換点」（「Ｅ」ノード）がＸＰａｔｈで実
行される。Since the XPATH expression enables path information other than a simple node name, an axis or a predicate can be used to easily identify a target content. The axis defines where in the tree to find the current content node based on its relationship to the node. Common axes are “descendant ::”, “siblin
g :: "," ancestor :: ", and" par
ent :: ". For example, if the route is “A / B / sib”
If ling :: C ", the path starts at" A ", moves to" B ", and looks at all siblings of" B "to find" C ". In FIG. 23, the current node is “A”.
The “B” node has the route “descendant :: B”
(Case I and Case II
In both cases), then the direct descendants (child nodes)
Any "B" node below node "A" will be selected, regardless of whether it is further down the tree. Similarly, the route to the target “F” node is “desce”.
ndant :: E / descendant :: F "could be written. This path finds the descendants of the "A" node, which is the "E" node, and then "F"
Find descendants of node "E". As such, "turning points"("E" nodes) are executed in XPath.

【００７８】上記の軸線に加えて、述語もまたあり得
る。特に、目標とするノードの特性を記述するために１
つ又はそれ以上のＸＰａｔｈ述語を使用することができ
る。従って、ＸＰａｔｈに沿った各段階は、「ａｘｉ
ｓ：：ノード＿名称（述語）」と表示することができ
る。これは、例えば、軸線がノードセットとして全ての
現行のノード子孫を選択したが内容をもっと具体的に識
別する必要がある場合に便利である。一般的な述語は、
とりわけ属性を含むことになる。例えば、目標「Ｃ」ノ
ードに「タコス」属性がある場合、経路は、「Ａ／Ｂ／
Ｃ（タコス）」となることが可能であろう。この述語
は、上記の通り、指定子の例である。In addition to the axes described above, predicates are also possible. In particular, to describe the characteristics of the target node,
One or more XPath predicates can be used. Therefore, each stage along the XPath is described as “axi
s :: node_name (predicate) ". This is useful, for example, when the axis has selected all current node descendants as a node set, but needs to identify the content more specifically. A common predicate is
In particular, it will include attributes. For example, if the target “C” node has the “tacos” attribute, the route is “A / B /
C (tacos) ". This predicate is an example of a specifier, as described above.

【００７９】軸線及び述語の両方によって、完全特定経
路がなくてもノードを一意的に識別することができるの
で、ＸＰａｔｈ表式を強化することができる。実際に、
関連ある内容ノードが最大のサブツリー内でそれを識別
する適切な数の指定子を持つ場合、経路全体は、明細が
述語であるような「ｄｅｓｃｅｎｄａｎｔ：：ノード
（明細）」と置き換えることができる。例えば、目標
「Ｆ」ノードに「ピザ」という単一の属性があり、他の
「Ｆ」ノードに「バーガー」という単一の属性があった
場合、「Ｅ」ノードを使用しなくてもそれらを区別する
ことができるであろう。「Ａ」ノードからの強化経路
は、「ｄｅｓｃｅｎｄａｎｔ：：Ｆ（ピザ）」となるで
あろう。述語は、関連あるノードを指定するのに使用さ
れる。これらの述語はまた、転換点ノードを一意的に決
めるために使用してもよい。The XPath expression can be enhanced because both the axis and the predicate can uniquely identify a node without a complete specific path. actually,
If the relevant content node has the appropriate number of specifiers to identify it in the largest subtree, the entire path can be replaced with "descendant :: node (description)" where the specification is a predicate. For example, if the target "F" node has a single attribute of "pizza" and the other "F" nodes have a single attribute of "burger", those attributes can be used without using the "E" node. Could be distinguished. The enhancement path from the “A” node would be “descendant :: F (pizza)”. Predicates are used to specify relevant nodes. These predicates may also be used to uniquely determine a turning point node.

【００８０】図２７は、本発明による軸線の確認方法２
６０を示す流れ図である。特に、ある条件が満たされた
場合に転換点ノードを識別するために、ｄｅｓｃｅｎｄ
ａｎｔ：：軸線を使用することができる。これによっ
て、転換点ノードと関係グループノードとの間で構造的
な変化が起きることが可能になる。尚、転換点ノードか
ら一意的に内容を指定する述語は、転換点ノードを持つ
ことの必要条件であることから、ｄｅｓｃｅｎｄａｎ
ｔ：：軸線が転換点ノードから内容まで自動的に使用で
きる点に注意されたい。FIG. 27 shows a method 2 for confirming the axis according to the present invention.
FIG. In particular, descend to identify a turning point node when certain conditions are met.
ant :: axis can be used. This allows a structural change to occur between the turning point node and the relationship group node. Since a predicate that uniquely specifies the content from the turning point node is a necessary condition for having a turning point node, descendan
Note that the t :: axis can be used automatically from the turning point node to the content.

【００８１】本方法では、関係グループノードのサブツ
リーである、段階２６２で識別された最大サブツリーか
ら、転換点ノードと同種の他の全てのノードが段階２６
４で見つけられる。段階２６６では、内容ノードに関す
る特定情報を与えられた内容ノードに適合する子孫を持
つかどうか確かめるために、各ノードが調べられる。子
孫のいずれかが現在のノードに適合する場合、ｄｅｓｃ
ｅｎｄａｎｔ：：軸線の名称が一意的に現行のノードを
識別しないため、ｄｅｓｃｅｎｄａｎｔ：：軸線の名称
を使用することができず、本方法は終了となる。段階２
６８では、それ以上のノードがあるか判断され、本方法
は、各々の追加ノードを試験するために段階２６６にル
ープして戻る。ノードの全てを試験した後、それらのノ
ードのいずれにもこのような子孫がない場合、段階２７
０で関係グループノードからの転換点ノードに対してｄ
ｅｓｃｅｎｄａｎｔ：：軸線を使用することが安全であ
る。すなわち、ｄｅｓｃｅｎｄａｎｔ：：軸線の情報
は、上記の条件が満たされた場合に限り、転換点を指定
するのに使用し得る。In the method, from the largest subtree identified in step 262, which is a subtree of relational group nodes, all other nodes of the same type as the turning point node are identified in step 26.
You can find it at 4. In step 266, each node is examined to see if it has descendants that match the given content node with specific information about the content node. Desc if any of the descendants match the current node
Since the end :: axis name does not uniquely identify the current node, the descendant :: axis name cannot be used, and the method ends. Stage 2
At 68, it is determined if there are any more nodes, and the method loops back to step 266 to test each additional node. After testing all of the nodes, if none of those nodes has such a descendant, step 27
0 for d at the turning point node from the relation group node
It is safe to use escendant :: axis. That is, the information descendant :: axis can be used to designate a turning point only if the above conditions are met.

【００８２】ロバスチファイアの別の実施形態によれ
ば、各ページがユーザの希望に従って標記された多重の
ＸＨＴＭＬページをロバスチファイアへの入力として使
用してもよく、それらは、多重のＸＨＴＭＬページから
新しい１ページを生成する単一のスタイルシートを形成
するために合併される。特に、同種の数ページ（例え
ば、数ページのｅＢａｙ競売品目ページ）を使用しても
よい。例えば、ユーザが無線装置上にｅＢａｙの競売品
目の全てを置きたいと思っていると仮定する。無線ペー
ジ生成システムの目標は、それらの類似のページの全て
を正しく変換することになる単一のスタイルシートを作
成することである。元来ユーザは、１ページの競売品目
ページ例を生成する必要があったので、作成されたスタ
イルシートは、次に、全ての競売品目ページで機能する
はずである。ＸＰＡＴＨロバスチファイアは、競売品目
ページの構造の考えられる変化に品目ごとに対処しよう
とするので、上記が起こることを補助する。According to another embodiment of the robustifier, multiple XHTML pages, each page labeled as desired by the user, may be used as input to the robustifier, which may be multiple XHTML pages. Are merged to form a single style sheet that creates a new page from In particular, several pages of the same type (eg, several eBay auction item pages) may be used. For example, assume that a user wants to place all of the eBay auctioned items on a wireless device. The goal of the wireless page generation system is to create a single style sheet that will correctly translate all of those similar pages. Originally, the user had to generate one example auction item page, so the style sheet created would then work on all auction item pages. The XPATH robustifier helps this to happen because it attempts to address possible changes in the structure of the auctioned item page on an item-by-item basis.

【００８３】しかし、ここで、ユーザがスタイルシート
に同種の数ページ上でどのように挙動してほしいか定め
ることができると仮定する。これらのページの各々は、
定性的には類似しているが、ＸＨＴＭＬにおけるそれら
のツリー構造は、わずかに異なっていてもよい。いくつ
かのページ例の間の構造の違いは、事実上、ページが実
際に変化する仕方のサンプリングである。このサンプリ
ングは、ロバスチファイアに追加の手がかりをもたら
し、その結果、ロバスチファイアは、ページが変化する
かも知れないその仕方を盲目的に推測しなくてもよくな
るが、実際には、付加的な情報として用意された変化例
を持っている。本発明による多重ページの強化を更に理
解するために、いくつかの例を以下に示す。However, it is assumed here that the user can determine how he wants the style sheet to behave on several pages of the same kind. Each of these pages
Although qualitatively similar, their tree structure in XHTML may be slightly different. The structural difference between some example pages is, in effect, a sampling of how the page actually changes. This sampling provides additional clues to the robustifier, so that it does not have to blindly guess how the page may change, but in fact, It has a variation example prepared as information. To better understand the multi-page enhancement according to the present invention, some examples are provided below.

【００８４】まず第１に、ロバスチファイアが通常は転
換点ノードとしてツリーで特定ノード「Ｅ」を選択する
であろうと仮定する。しかし、強化処理は、他のツリー
を見ることによって「Ｅ」ノードがいつも存在するわけ
ではないことに気付く。これによって、ロバスチファイ
アは、「Ｅ」ノードの転換点ノードとしての資格を取り
消して、代わりに別の転換点ノードを選択せざるを得な
い。すなわち、多重ページによって、ＲＭＬコードを強
化する方法に関する追加情報が強化処理に与えられる。
第２に、ロバスチファイアは、特定の属性「ピザ」が関
連あるノードを一意的に識別し易いとわかっていると仮
定する。次に、他の例のうちの少なくとも１つにおい
て、「ピザ」が同じ関連あるノードの属性として存在し
ないと仮定する。それによって、その関連あるノードの
有効識別子としての「ピザ」の資格が取り消されるの
で、ロバスチファイアは、検索を絞り込むために、他の
属性を見るか、又は、転換点ノードを見つけるかのいず
れかを行わざるを得ない。ここでもまた、多重ページに
よって強化処理に追加情報が与えられる。First, assume that the robustifier will typically select a particular node "E" in the tree as a turning point node. However, the enhancement process notices by looking at other trees that the "E" node is not always present. This forces the robustifier to revoke the "E" node as a tipping point node and instead select another tipping point node. That is, multiple pages provide additional information to the enhancement process on how to enhance the RML code.
Second, assume that the robustifier knows that a particular attribute "pizza" is likely to uniquely identify the relevant node. Next, in at least one of the other examples, assume that "pizza" does not exist as an attribute of the same related node. This would revoke the "pizza" qualification as a valid identifier for the relevant node, so that the robustifier would either look at other attributes or find a tipping point node to narrow the search. I have to do it. Again, multiple pages provide additional information to the enhancement process.

【００８５】本発明の更なる別の実施形態によれば、Ｘ
ＳＬスタイルシートと数ページのＸＨＴＭＬページとが
入力となり得る逆の強化処理があってもよい。スタイル
シートといくつかのＸＨＴＭＬ目標とに基づいて、ＸＨ
ＴＭＬページの各ＲＭＬが生成されてもよい。逆強化処
理を使用して、ユーザは、無線ページ生成システムによ
って生成されたスタイルシートを手作業で微調整し、次
に、引き続きＧＵＩ環境内からＸＳＬに追加変更を行っ
てもよい。ここで、本発明による無線ページ生成処理の
例について説明する。According to yet another embodiment of the present invention, X
There may be a reverse enhancement process in which the SL style sheet and several XHTML pages can be input. XH based on stylesheets and some XHTML goals
Each RML of a TML page may be generated. Using the reverse enhancement process, the user may manually fine-tune the stylesheet generated by the wireless page generation system, and then subsequently make additional changes to XSL from within the GUI environment. Here, an example of the wireless page generation processing according to the present invention will be described.

【００８６】図２８は、本発明による一体式デスクトッ
プＧＵＩインタフェース５０を使用してアトミックを新
しいページに追加するプロデューサーを示す図である。
特に、ユーザは、上記の通り、次にプロジェクトウイン
ドウ内に移動させる特定のウェブページを選択すること
ができる。ユーザは、次に、構成タブを選択することが
でき、その結果、選択されたウェブページがＸＨＴＭＬ
に変換される。ユーザは、次に、図２８に示すように選
択されたアトミックを強調することにより、アトミック
２８０を選択してページ運行部分５４に追加してもよ
い。ユーザ・インタフェースは、ユーザがルートグルー
プにアトミックを追加するか、メニューにアトミックを
追加するか、又は、形態にアトミックを追加するかを選
択し得るようにメニューを表示してもよい。図示の例に
おいて、選択されたアトミックは、導入ノードとしてペ
ージ運行部分５４のルートノードに追加されている。FIG. 28 illustrates a producer using the integrated desktop GUI interface 50 according to the present invention to add an atomic to a new page.
In particular, the user can select a particular web page to move into the project window, as described above. The user can then select the configuration tab, so that the selected web page is in XHTML
Is converted to The user may then select and add atomic 280 to page navigation portion 54 by highlighting the selected atomic as shown in FIG. The user interface may display a menu so that the user can select to add an atomic to the root group, add an atomic to the menu, or add an atomic to the form. In the illustrated example, the selected atomic has been added to the root node of the page navigation portion 54 as an introductory node.

【００８７】図２９ａから図２９ｃまでは、本発明によ
るルールセットを形成するプロデューサーを示す図であ
る。ルールセットは、どのように無線ページ配信システ
ムが内容及びサービスをデスクトップ・ウェブページか
ら無線ページに変換すべきかを定める。ルールセットが
１つより多いＵＲＬに適用される場合が多いことから、
プロデューサーは、ＵＲＬマネージャにより、各ＵＲＬ
要求に対して適切なものを形成することができる。図２
９ａに示すように、ユーザは、スタイルシートに付随す
るようにＵＲＬ２９０を選択している。図２９ｂに示す
ように、プロデューサーは、スタイルシートにマッピン
グすべきＵＲＬの要素２９２を選択するか、又は、ドロ
ップダウンメニューから要素を選択してもよい。図２９
ｃに示すように、プロデューサーは、特定のＵＲＬ要素
２９６に付随する設定値２９４を定めてもよい。図示の
例において、ＵＲＬ要素「これ＝あれ」は、ＵＲＬ要素
の中になければならず、要素の値（例えば、これ＝あ
れ）は重要であり、その結果、「これ＝他の」である要
素を持つＵＲＬは、特定のルールセット及びスタイルシ
ートを使用して処理されることはない。FIGS. 29a to 29c show the producers forming the rule set according to the invention. The ruleset defines how the wireless page delivery system should convert content and services from desktop web pages to wireless pages. Because rule sets often apply to more than one URL,
The producer uses the URL manager to provide each URL
It can be tailored to your needs. FIG.
As shown in 9a, the user has selected a URL 290 to accompany the style sheet. As shown in FIG. 29b, the producer may select an element 292 of the URL to be mapped to the style sheet, or select an element from a drop-down menu. FIG.
As shown in c, the producer may define a set value 294 associated with a particular URL element 296. In the example shown, the URL element “this = that” must be inside a URL element, and the value of the element (eg, this = that) is significant, so “this = other”. URLs with elements are not processed using a specific rule set and style sheet.

【００８８】図３０は、本発明によるルールセットを配
置するプロデューサーを示す図である。特に、運行部分
５４の運行ツリーが構築されると、プロデューサーは、
図３２ａ及び図３２ｂに示すように、電話又はパーム・
エミュレータ上で無線ページを閲覧するためにプロジェ
クトを配置し得る。配置マネージャは、ＸＳＬスタイル
シートが自動的に適切なウェブページを処理するために
使用し得るように、無線ページ配信システムにＸＳＬス
タイルシートを送信してもよい。図３１は、本発明によ
るＸＳＬスタイルシート３００の例を示す図である。ス
タイルシートは、自動的にウェブページを処理して本発
明による無線ページを生成するために使用してもよい。FIG. 30 is a diagram showing a producer that arranges a rule set according to the present invention. In particular, when the operation tree of the operation part 54 is constructed, the producer:
32a and 32b, as shown in FIG.
You can deploy a project to view wireless pages on an emulator. The placement manager may send the XSL stylesheet to the wireless page delivery system so that the XSL stylesheet can be used to automatically process the appropriate web page. FIG. 31 is a diagram showing an example of the XSL style sheet 300 according to the present invention. Style sheets may be used to automatically process web pages to generate wireless pages according to the present invention.

【００８９】図３２ａ及び図３２ｂは、各々、携帯電話
エミュレータ３０２及びパーム装置エミュレータ３０４
上での新しいページの例を示す図である。特に、無線ペ
ージ配信システムにＸＳＬスタイルシートを配置する前
に、エミュレータによって、プロデューサーは、得られ
る無線ページを検討することができる。図３２ａ及び図
３２ｂは、図３０に電話及び次にまたパーム装置に関し
て示すのと同じウェブページを示す。パーム装置が電話
より多くの情報を表示することができるので、示された
２つの無線ページの間の違いに注意されたい。FIGS. 32a and 32b show a cellular phone emulator 302 and a palm device emulator 304, respectively.
It is a figure showing an example of a new page above. In particular, the emulator allows the producer to review the resulting wireless page before placing the XSL stylesheet in the wireless page distribution system. 32a and 32b show the same web page as shown in FIG. 30 for the telephone and then also for the palm device. Note the difference between the two wireless pages shown, because the palm device can display more information than the phone.

【００９０】以上は本発明の特定の実施形態に関するも
のであるが、添付の請求項にその範囲が定められている
本発明の原理及び精神から逸脱せずに本実施形態の変更
が可能であることは当業者には理解されるであろう。例
えば、本明細書で説明されたシステムは、ＸＭＬ文書、
ＩＣＥ文書（内容シンジケーション書式）、又は、ロイ
ター配給を含む様々な異なる情報源からの情報を処理す
るために使用してもよい。While the above has been directed to specific embodiments of this invention, modifications may be made thereto without departing from the principles and spirit of the invention, the scope of which is set forth in the appended claims. It will be understood by those skilled in the art. For example, the system described herein is an XML document,
It may be used to process information from a variety of different sources, including ICE documents (content syndication forms) or Reuters distribution.

[Brief description of the drawings]

【図１】無線ページ配信システムを示すブロック図であ
る。FIG. 1 is a block diagram showing a wireless page distribution system.

【図２】本発明による無線ウェブページ生成システムを
示すブロック図である。FIG. 2 is a block diagram illustrating a wireless web page generation system according to the present invention.

【図３】本発明に従って１つ又はそれ以上のアトミック
に分解されるウェブページの一部分の例を示す図であ
る。FIG. 3 illustrates an example of a portion of one or more atomically decomposed web pages in accordance with the present invention.

【図４ａ】ＧＵＩツ−ル向けのユーザ・インタフェース
の例を示し、特に、図２に示すシステム内にある本発明
による一体式デスクトップを示す図である。4a shows an example of a user interface for a GUI tool, and in particular, shows an integrated desktop according to the invention in the system shown in FIG. 2;

【図４ｂ】ＧＵＩツ−ル向けのユーザ・インタフェース
の例を示し、特に、図２に示すシステム内にある本発明
によるＨＴＭＬビュアーを示す図である。FIG. 4b shows an example of a user interface for a GUI tool, in particular an HTML viewer according to the invention in the system shown in FIG. 2;

【図４ｃ】ＧＵＩツ−ル向けのユーザ・インタフェース
の例を示し、特に、図２に示すシステム内にある本発明
によるソースビュアーを示す図である。4c shows an example of a user interface for a GUI tool, in particular a source viewer according to the invention in the system shown in FIG. 2;

【図５】図２に示すシステム内にある本発明による無線
運行ビュアーの例を示す図である。FIG. 5 shows an example of a wireless operating viewer according to the invention in the system shown in FIG. 2;

【図６】図２に示すシステム内にある本発明によるプロ
ジェクト・マネージャの例を示す図である。FIG. 6 shows an example of a project manager according to the invention in the system shown in FIG. 2;

【図７】図２に示すシステム内にある本発明によるルー
ルセット追加ビュアーの例を示す図である。7 shows an example of a ruleset addition viewer according to the invention in the system shown in FIG. 2;

【図８】図２に示すシステム内にある本発明によるＵＲ
Ｌ形成マネージャの例を示す図である。FIG. 8 shows a UR according to the invention in the system shown in FIG.
It is a figure showing an example of L formation manager.

【図９Ａ】図２に示すシステム内にある本発明による無
線形態マネージャの例を示す図である。9A shows an example of a wireless configuration manager according to the present invention in the system shown in FIG. 2;

【図９Ｂ】図２に示すシステム内にある本発明による無
線形態マネージャの例を示す図である。9B illustrates an example of a wireless configuration manager according to the present invention in the system illustrated in FIG. 2;

【図１０】図２に示すシステム内にある本発明による配
置マネージャの例を示す図である。FIG. 10 shows an example of a deployment manager according to the invention in the system shown in FIG. 2;

【図１１】ウェブページの内容が２つのサンプル間で変
化した動的ウェブページの例を示す図である。FIG. 11 is a diagram illustrating an example of a dynamic web page in which the content of the web page has changed between two samples.

【図１２】図１１のウェブページの２つのサンプルにお
けるＨＴＭＬツリーの例を示す図である。FIG. 12 is a diagram illustrating an example of an HTML tree in two samples of the web page of FIG. 11;

【図１３】図１１のウェブページの各サンプルに対する
関係マークアップ言語（ＲＭＬ）コードの例を示す図で
ある。13 is a diagram illustrating an example of a relational markup language (RML) code for each sample of the web page of FIG. 11;

【図１４】図１１のウェブページの各サンプルに対する
未処理のアグノスチックＲＭＬコード（ＡＲＭＬ）を示
す図である。FIG. 14 illustrates an unprocessed agnostic RML code (ARML) for each sample of the web page of FIG. 11;

【図１５】図１１のウェブページの各サンプルに対する
前処理及び強化されたアグノスチックＲＭＬコード（Ａ
ＲＭＬ）を示す図である。FIG. 15 shows a pre-processed and enhanced agnostic RML code for each sample of the web page of FIG. 11 (A
FIG.

【図１６】図１１に示すウェブページのいずれのサンプ
ルからも適切な内容を検索できる一般化アグノスチック
ＲＭＬコード（ＡＲＭＬ）を示す図である。FIG. 16 is a diagram showing a generalized agnostic RML code (ARML) that can retrieve appropriate content from any of the samples of the web page shown in FIG. 11;

【図１７】本発明によるいずれのウェブページサンプル
からも内容を正しく検索するＸＳＬスタイルシートの例
を示す図である。FIG. 17 is a diagram showing an example of an XSL style sheet for correctly retrieving contents from any web page sample according to the present invention.

【図１８Ａ】本発明によるＸＳＬジェネレータの実施形
態を示すブロック図である。FIG. 18A is a block diagram illustrating an embodiment of an XSL generator according to the present invention.

【図１８Ｂ】本発明によるＸＳＬジェネレータの実施形
態を示すブロック図である。FIG. 18B is a block diagram illustrating an embodiment of an XSL generator according to the present invention.

【図１９】本発明によるグループノードの例を示す図で
ある。FIG. 19 is a diagram showing an example of a group node according to the present invention.

【図２０】本発明による一般化方法を示す流れ図であ
る。FIG. 20 is a flowchart showing a generalization method according to the present invention.

【図２１】本発明による一般化方法の例を示す図であ
る。FIG. 21 is a diagram showing an example of a generalization method according to the present invention.

【図２２】本発明によるＸＰＡＴＨロバスチファイアの
更なる詳細を示すブロック図である。FIG. 22 is a block diagram showing further details of an XPATH robustifier according to the present invention.

【図２３】動的な内容に至る経路の例を示す図である。FIG. 23 is a diagram illustrating an example of a path leading to dynamic content.

【図２４Ａ】本発明によるノード比較方法を示す流れ図
である。FIG. 24A is a flowchart illustrating a node comparison method according to the present invention.

【図２４Ｂ】本発明によるノード比較方法を示す流れ図
である。FIG. 24B is a flowchart illustrating a node comparison method according to the present invention.

【図２５Ａ】本発明による転換点ノード識別方法を示す
流れ図である。FIG. 25A is a flowchart illustrating a turning point node identification method according to the present invention.

【図２５Ｂ】本発明による転換点ノード識別方法を示す
流れ図である。FIG. 25B is a flowchart illustrating a turning point node identification method according to the present invention.

【図２６】本発明による転換点法の例を示す図である。FIG. 26 is a diagram showing an example of a turning point method according to the present invention.

【図２７】本発明による子孫識別方法を示す流れ図であ
る。FIG. 27 is a flowchart illustrating a progeny identification method according to the present invention.

【図２８】本発明に従って新しいページにアトミックを
追加するプロデューサーを示す図である。FIG. 28 illustrates a producer adding an atomic to a new page in accordance with the present invention.

【図２９ａ】本発明に従ってルールセットを形成するプ
ロデューサーを示す図である。FIG. 29a illustrates a producer forming a ruleset according to the present invention.

【図２９ｂ】本発明に従ってルールセットを形成するプ
ロデューサーを示す図である。FIG. 29b illustrates a producer forming a ruleset according to the present invention.

【図２９ｃ】本発明に従ってルールセットを形成するプ
ロデューサーを示す図である。FIG. 29c illustrates a producer forming a ruleset according to the present invention.

【図３０】本発明に従ってルールセットを配置するプロ
デューサーを示す図である。FIG. 30 illustrates a producer placing a rule set according to the present invention.

【図３１】本発明によるＸＳＬスタイルシートの例を示
す図である。FIG. 31 is a diagram showing an example of an XSL style sheet according to the present invention.

【図３２ａ】携帯電話上の新しいページの例を示す図で
ある。FIG. 32a shows an example of a new page on a mobile phone.

【図３２ｂ】パーム装置上の新しいページの例を示す図
である。FIG. 32b shows an example of a new page on the palm device.

[Explanation of symbols]

１５無線ウェブページ配信部分２２ウェブページ生成システム２３後端部分２４前端部分２５ＲＭＬビルダー・モジュール２６ＸＳＬジェネレータ・モジュール２７スタイルシート・データベース２８ルールセット構築ツールセット２９ルールセット・データベース３０プロジェクト構築ツールセット３１無線ウェブサイト・プロジェクト・データベース 15 Wireless Web Page Delivery Part 22 Web Page Generation System 23 Rear End Part 24 Front End Part 25 RML Builder Module 26 XSL Generator Module 27 Stylesheet Database 28 Rule Set Construction Tool Set 29 Rule Set Database 30 Project Construction Tool Set 31 Wireless Website Project Database

───────────────────────────────────────────────────── フロントページの続き (72)発明者マイケルスコットホーマンアメリカ合衆国カリフォルニア州 94108 サンフランシスコクレイストリート＃12 1160 (72)発明者イヴァンアラッジョフアメリカ合衆国カリフォルニア州 94114 サンフランシスコトゥエンティーサードストリート 4418 (72)発明者ホセファカークランドアメリカ合衆国カリフォルニア州 94117 サンフランシスコウェブスターストリート 76 (72)発明者ヤコブサリバンアメリカ合衆国カリフォルニア州 94122 サンフランシスコラプラヤ＃３ 1348 Ｆターム(参考） 5B075 ND34 NK43 5B082 GA02 HA05 ────────────────────────────────────────────────── ─── Continued on the front page (72) Inventor Michael Scott Homan United States of America 94108 San Francisco Clay Treat # 12 1160 (72) Inventor Ivan Arajov United States of America 94114 San Francisco Twenty-third Street 4418 (72) Inventor Jose Fa Kirkland United States 94117 San Francisco Webster Street 76 (72) Inventor Jacob Sullivan United States 94122 San Francisco La Playa # 3 1348 F-term (reference) 5B075 ND34 NK43 5B082 GA02 HA05

Claims

[Claims]

Means for retrieving an information source; means for extracting from the information source one or more elements each having a piece of content in the information source; An apparatus for processing an information source, comprising: means for generating a data structure representing a hierarchical structure; and means for processing the data structure to retrieve a predetermined element from the information source.

2. The method according to claim 1, wherein the extracting unit includes: a page browsing part for browsing a page from which the element is extracted; a page navigator part for browsing a hierarchical list of the elements extracted from the page; Further comprising: a user that pulls an element from the page browsing portion to the page navigator portion to extract the page browsing portion; and an element characteristic portion that browses the characteristics of the elements in the list of the page navigator portion. , The page navigator and portions of the element properties allow the user to quickly extract elements from the page by simultaneously browsing the page and the hierarchical list of elements. The device according to claim 1, characterized in that:

3. The apparatus of claim 2, wherein the page comprises an HTML web page, and wherein the elements further comprise atomic and atomic groups.

4. The method according to claim 1, further comprising: an HTML browsing portion indicating an HTML code of the web page; a component indicating a graphic configuration of the web page; and a source portion indicating a source code of the web page. Item 3. The apparatus according to Item 3.

5. A data structure generating means for converting the information source into a first hierarchical structure including the contents and the hierarchical structure, wherein the element is searched for even if the information source changes. Means for determining a generalized path to the element of the information source, as described in claim 1.

6. The method of claim 1, wherein the first hierarchical structure includes one or more nodes, each containing an element, where a particular element is located at a first node of the hierarchical structure; Children are means for comparing a target node containing the data with each other node of the hierarchy to construct a unique node identifier, and wherein the unique identifier is located during the comparison. Means for identifying a turning point node, which is a node of the hierarchical structure that uniquely identifies the target node, which is associated with the target node, if the target node does not exist, and the node matching the target node Means for discovering whether it is valid to apply a descendant axis to a turning point node, which occurs when there are no descendants of the apparatus.

7. The comparing means identifies a node of the same type as the target node in the sub-tree of the hierarchical structure having a first node of a complete specific path to the target node at a root node. Means and the target for each of the identified nodes to determine which set of node information fragments may be used to uniquely identify the target node from the identified nodes. 7. The method of claim 6, further comprising: generating a comparison with a node; and determining an actual node information fragment used to uniquely identify the target node. apparatus.

8. The intersection and union of the sets of node information fragments to determine the actual node information fragments used to uniquely identify the target node. The apparatus of claim 7, further comprising means for determining

9. The method according to claim 9, wherein the means for identifying the turning point is of the same type as the target node in the subtree of the hierarchical structure having the first node of the complete specific path to the target node at a root node. Means for identifying each of the identified nodes, and each of the identified nodes for determining which set of node information fragments may be used to uniquely identify the target node from the identified nodes. Means for generating a comparison between the target node and the target node; and means for determining the actual node information fragment used to uniquely identify the target node. An apparatus according to claim 6.

10. The intersection and sum of the set of node information fragments to determine the actual node information fragment used to uniquely identify the target node. The apparatus of claim 9, further comprising means for determining a set.

11. The means for finding the axis comprises means for identifying the largest subtree of the hierarchical structure having nodes from the relative path as a root containing the first node, and all nodes of the same type. Means for identifying as a turning point node, means for determining whether the descendants of each identified node match the descendants of the turning point node, and any descendants of the identified node being descendants of the turning point node. 11. The apparatus of claim 10, further comprising means for effectively and securely assigning the descendant axis to the turning point node if not.

12. The method according to claim 5, wherein the hierarchical structure includes a tree structure associated with the web page, and the data in the target node includes a piece of content associated with the web page. An apparatus according to claim 1.

13. The means for determining a generalized path includes means for traversing the hierarchical structure to determine a generalized path identifier that leads to the first node through the hierarchical structure. An apparatus according to claim 5.

14. The information source may include one or more H
The apparatus of claim 1, comprising web pages in TML and XML formats, wherein the hierarchical structure comprises a relational markup language.

15. The apparatus of claim 14, wherein said means for processing comprises an XSL stylesheet.

16. Searching for an information source; extracting one or more elements from the information source, each element having a piece of content in the information source; A method of processing an information source, comprising: generating a data structure representing a hierarchical structure; and processing the data structure to retrieve a predetermined element from the information source.

17. The extracting step includes: browsing a page from which an element is extracted; browsing a hierarchical list of elements extracted from the page; and extracting the element from the page. Further comprising: a user pulling an element from the page browsing portion to the page navigator portion; and browsing a property of an element in a list of the page navigator portion, wherein the page browsing, page navigator, and Wherein each part of the element characteristics enables the user to quickly extract elements from the page by simultaneously browsing the page and the hierarchical list of elements. The method according to item 16,

18. The method of claim 17, wherein the page comprises an HTML web page, and wherein the elements further comprise an atomic and an atomic group.

19. The method of claim 18, further comprising: indicating the HTML code of the web page; indicating a graphic configuration of the web page; and indicating a source code of the web page. The described method.

20. The data structure generating step includes: converting the information source into a first hierarchical structure including the content and the hierarchical structure; and searching for the element even if the information source changes. Determining a generalized path to the element of the information source, as described.

21. The first hierarchical structure includes one or more nodes, each containing an element, where a particular element is located at a first node of the hierarchical structure; The determinator comprises comparing a target node containing the data with each other node of the hierarchical structure to construct a unique node identifier, and locating the unique identifier during the comparison. If not,
Identifying a turning point node that is associated with the target node and that is a node of the hierarchical structure that uniquely identifies the target node; and determining a descendant of the node that matches the target node. Finding if it is valid to apply the descendant axis to the turning point node, which occurs if there is no such point.
The method described in.

22. The comparing step includes, in a root node, a node of the same type as the target node in the hierarchical subtree having the first node of the complete specific path to the target node. Identifying each of the identified nodes and the target to determine which set of node information fragments may be used to uniquely identify the target node from the identified nodes. 22. The method of claim 21, further comprising: generating a comparison with a node to be determined; and determining the actual node information fragment used to uniquely identify the target node. The method described in.

23. The determining step includes determining the intersection and sum of the set of node information fragments to determine the actual node information fragment used to uniquely identify the target node. The method of claim 22, further comprising determining a set.

24. The step of identifying the turning point comprises the same type as the target node in the hierarchical subtree having the first node of the complete specific path to the target node at a root node. Identifying each of the identified nodes to determine which set of node information fragments may be used to uniquely identify the target node from the identified nodes. Generating a comparison between the target node and the target node; and determining the actual node information fragment used to uniquely identify the target node. A method according to claim 21.

25. The determining step further comprises determining the intersection and sum of the set of node information fragments to determine the actual node information fragment used to uniquely identify the target node. The method of claim 24, further comprising determining a set.

26. The step of locating the axis comprises identifying a largest subtree of the hierarchy having nodes from the relative path as the root containing the first node. Identifying a node as the turning point node; determining whether a descendant of each identified node matches a descendant of the turning point node;
26. The method of claim 25, further comprising: if any of the descendants of the identified node does not match a descendant of the turning point node, effectively and securely assigning the descendant axis to the turning point node. Method.

27. The method according to claim 20, wherein the hierarchical structure includes a tree structure associated with a web page, and the target node data includes a piece of content associated with the web page. The described method.

28. The method of claim 20, wherein determining the generalized path includes traversing the hierarchical structure to determine a path identifier to reach the first node through the hierarchical structure. The method described in.

29. The information source comprising one or more H
17. The method of claim 16, comprising web pages in TML and XML formats, wherein the hierarchical structure comprises a relational markup language.

30. The method according to claim 29, wherein said processing step comprises generating an XSL stylesheet.

31. A page browsing portion for browsing a page from which an atomic and atomic group is extracted, and wherein the page pulls an atomic therefrom to extract the atomic from the page. A page navigator part for browsing a hierarchical list of atomics extracted from, and an atomic property part for browsing atomic properties in the list of page navigator parts, wherein the page browsing, page navigator, and element An HTML web page, wherein each part of a property allows the user to quickly extract an atomic from the page by simultaneously viewing the page and the hierarchical list of atomics. A graph that extracts one or more atomics from Fick user interface.

32. A means for browsing the page from which the atomic has been extracted, and wherein the user pulls the atomic therefrom from the page browsing means to extract the atomic from the page.
Means for navigating the page comprising means for viewing a hierarchical list of atomics extracted from the page; and wherein the user simultaneously views the page, the hierarchical list of atomics, and the property of the selected atomic. A graphical user for extracting one or more elements from an HTML web page, comprising: an atomic property generating means for extracting the property from the atomic selected by the user. ·interface.

33. A method for generating a hierarchical representation of a web page having atomic and atomic groups, the method comprising: selecting an atomic graphical representation from a page being viewed by the user; Pulling the graphical representation of the atomic into a page navigator portion, as shown in a hierarchical relationship with other atomics in the page; and so that the user can view the atomic properties.
Automatically extracting the atomic properties from the atomic when selected by the user.

34. A method for processing a web page that repurposes the web page for one or more wireless devices having different screen formats by determining a path to each piece of web page content. Generating a first hierarchical structure including the structure of the web page and the content of the web page based on the web page; and the structure of the web page indicating a route to the content. Generating a second hierarchical structure of the web page from the first hierarchical structure, and generating a relative path that leads to the content of the web page and is inserted into the second hierarchical structure. In the second hierarchical structure so that a content search using the path to the content will find the content even if the web page has changed. Method characterized by comprising the steps of strengthening the path, the.

35. A means for retrieving an information source, a page for browsing a page from which the element is extracted, extracting one or more elements each comprising a piece of content in the information source from the information source. A browsing portion, a page navigator portion for browsing a hierarchical list of elements extracted from the page, and a user pulling elements from the page browsing portion to the page navigator portion to extract elements from the page. Further comprising an element characteristic part for browsing the characteristics of the elements of the list of the page navigator part, wherein each of the page browsing, page navigator, and element characteristics is performed by the user, Extracting means for simultaneously extracting elements from the page by simultaneously browsing the hierarchical list of Means for transforming the information source into a first hierarchical structure representing the hierarchical structure of the element of the source, including the content and the hierarchical structure, wherein the element is located even if the information source changes. Means for generating a data structure, comprising: means for determining a generalized path to the element of the information source, wherein the first hierarchical structure is such that the specific element is a first element of the hierarchical structure.
And one or more nodes, each containing an element, located at a node of the first node containing the data to identify a unique node identifier. Means for comparing the first node with the other nodes of the hierarchical structure, and uniquely identifying the first node associated with the first node if a unique identifier is not found during the comparison. Means for identifying a turning point node, which is a node of a hierarchical structure, and means for designating a descendant axis as a turning point node when there is no descendant of the node that matches the first node. To process information sources.

36. The apparatus of claim 35, wherein the page comprises an HTML web page, and wherein the elements further comprise an atomic and an atomic group.

37. The web page further includes an HTML browsing portion indicating the HTML code, a component indicating the graphic configuration of the web page, and a source portion indicating the source code of the web page. 37. The apparatus of claim 36, wherein:

38. The determinator comprises: means for comparing the target node containing the data with each other node of the hierarchical structure to construct a unique node identifier; Means for identifying a turning point node, which if not found during the comparison, is a node of the hierarchical structure that uniquely identifies the target node, associated with the target node; Happens if there is no descendant of the node that matches the node
Means for discovering whether applying a descendant axis to a turning point node is valid.
An apparatus according to claim 1.

39. The comparing means identifies a node of the same type as the target node in the subtree of the hierarchical structure having a first node of the complete specific path to the target node at a root node. Means to
Each of the identified nodes and the target node are identified to determine which set of node information fragments may be used to uniquely identify the target node from the identified nodes. The apparatus of claim 38, further comprising: means for generating a comparison; and means for determining an actual node information fragment used to uniquely identify the target node.

40. The means for determining comprises determining the actual node information fragment used to uniquely identify the target node by determining the intersection and sum of the set of node information fragments. The apparatus of claim 39, further comprising means for determining a set.

41. The means for identifying a turning point is of the same type as the target node in the hierarchical subtree having the first node of the complete specific path to the target node at a root node. Means for identifying each of the identified nodes, and each of the identified nodes for determining which set of node information fragments may be used to uniquely identify the target node from the identified nodes. Means for generating a comparison between the target node and the target node; and means for determining the actual node information fragment used to uniquely identify the target node. An apparatus according to claim 38.

42. The means for determining comprises determining the actual node information fragment used to uniquely identify the target node by determining the intersection and sum of the set of node information fragments. 42. The apparatus of claim 41, further comprising means for determining a set.

43. The means for finding the axis comprises: means for identifying a largest subtree of the hierarchical structure having nodes from the relative path as a root containing the first node; Means for identifying as a turning point node, means for determining whether the descendants of each identified node match the descendants of the turning point node, and any descendants of the identified node being descendants of the turning point node. 43. The apparatus of claim 42, further comprising means for effectively and securely assigning said descendant axis to said turning point node if not.