JP2002169836A

JP2002169836A - Support system for integration and rearrangement of information

Info

Publication number: JP2002169836A
Application number: JP2000365373A
Authority: JP
Inventors: Akiyuki Fujino; 亮之藤野
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2000-11-30
Filing date: 2000-11-30
Publication date: 2002-06-14
Anticipated expiration: 2020-11-30
Also published as: JP3877957B2

Abstract

PROBLEM TO BE SOLVED: To provide a support system for integration and rearrangement of information which can precisely integrate and display information analog a layout form that a user generates. SOLUTION: This integration and rearrangement support system for information is equipped with a layout form generating means 21 which determines the layout form matching a style that the user requires and also saves a tag as a key for extracting necessary data in an item of the layout form at the same time, an information acquiring means 22 which acquires source data including the tag from the source information that the user specifies, an information integrating means 23 which extracts data corresponding to the tag from the saves source data and integrates the data, and an integrated information output means 61 which outputs the generated integrated data according to the layout form.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、データベースやＷ
ＷＷ上にある情報からユーザが望む特定の情報を抜粋し
て取得することで情報の整理を支援する情報の集約整理
支援システムに関する。[0001] The present invention relates to a database and W
The present invention relates to an information consolidation and support system that supports the organization of information by extracting and acquiring specific information desired by a user from information on the WW.

【０００２】[0002]

【従来の技術】近年、インターネットの発達により、イ
ンターネットのＷＷＷ（ＷｏｒｌｄＷｉｄｅＷｅｂ）
上から様々な情報やサービスを入手できるようになって
いる。しかし、ＷＷＷ上に存在する情報は、その量が膨
大であり、また情報の形式や編集スタイルも様々である
ため、ユーザが必要とする情報を抜粋して効率的に取得
し整理することは容易ではない。2. Description of the Related Art In recent years, with the development of the Internet, the WWW (World Wide Web) of the Internet has been developed.
Various information and services can be obtained from above. However, since the amount of information existing on the WWW is enormous and the information formats and editing styles are various, it is easy to extract and efficiently obtain and organize the information required by the user. is not.

【０００３】そこで、ＷＷＷ上の情報をユーザの望む条
件に沿って効率的に取得する技術が提案されている。特
開平１１−２０３１００号公報には、ＷＷＷ上のＨＴＭ
Ｌ（Hyper Text Markup Language）文書を取得し、その
中にある画像データなどの不要情報をＨＴＭＬタグから
判断して除去し、文書のレイアウトは保持したまま不要
情報の部分を空白にして出力する技術が開示されてい
る。また、特開平１１−１３４３４１号公報には、同じ
くＷＷＷ上のＨＴＭＬ文書を取得し、その中からあらか
じめ設定したキーワードを検索し、検索結果とＨＴＭＬ
タグを利用して、キーワードを含むひとまとまりの文字
列を抜粋して表示する技術が開示されている。[0003] Therefore, there has been proposed a technique for efficiently obtaining information on the WWW according to conditions desired by a user. Japanese Patent Application Laid-Open No. 11-203100 discloses an HTM on WWW.
A technology that acquires an L (Hyper Text Markup Language) document, removes unnecessary information such as image data from the HTML tag by judging it from an HTML tag, and outputs a blank portion of the unnecessary information while maintaining the document layout. Is disclosed. Japanese Patent Application Laid-Open No. 11-134341 also discloses that an HTML document on the WWW is acquired, a keyword set in advance is searched from the HTML document, and the search result and the HTML
There is disclosed a technique for extracting and displaying a group of character strings including a keyword using a tag.

【０００４】[0004]

【発明が解決しようとする課題】前記従来の情報取得技
術では、元情報の記述言語であるＨＴＭＬを手掛かりに
して必要な情報を抽出しているため、抽出対象となるＨ
ＴＭＬ文書にユーザの望む情報が記載されていない場合
には、そのＨＴＭＬ文書に埋め込まれたリンクをたどっ
て新たなＨＴＭＬ文書を探さなければならなかった。こ
のことから、ユーザが望む情報の一覧性が著しく低下し
てしまうという不便があった。In the conventional information acquisition technique, since necessary information is extracted by using HTML, which is a description language of the original information, as a clue, the H to be extracted is used.
If the information desired by the user is not described in the TML document, a new HTML document has to be searched for by following the link embedded in the HTML document. For this reason, there is an inconvenience that the list of information desired by the user is significantly reduced.

【０００５】また、特開平１１−２０３１００号公報に
開示された技術では、ユーザの望む情報がどのＨＴＭＬ
タグの範囲に記載されているか判断できないため、必要
な情報を取りこぼしてしまう可能性があった。また、例
えば画像など、必要のない情報を空白のまま表示するた
めに、表示段階でのレイアウト上の効率が悪かった。Further, in the technique disclosed in Japanese Patent Application Laid-Open No. 11-203100, information
Since it cannot be determined whether the tag is described in the range of the tag, there is a possibility that necessary information may be missed. In addition, since unnecessary information such as an image is displayed in a blank state, layout efficiency at the display stage is low.

【０００６】また、特開平１１−１３４３４１号公報に
開示された技術では、キーワードの検索のみによってユ
ーザの望む情報の有無を判断しているため、必要な情報
の取りこぼしや不要な情報の取り込みを避けられなかっ
た。また、表示段階でのレイアウトが統一されていない
ため、取得した情報を比較検討するのにはなはだ不便で
あった。In the technique disclosed in Japanese Patent Application Laid-Open No. H11-134341, the presence or absence of information desired by the user is determined only by searching for a keyword. I couldn't. Further, since the layout at the display stage is not unified, it is very inconvenient to compare and examine the acquired information.

【０００７】かかる事情から、ユーザの望む情報を精度
良く取得し、これをユーザの望むレイアウトで見やすく
表示することが求められている。具体的には、ユーザの
望む情報が複数箇所の元情報に分散している場合に、そ
れらの中から必要な情報を抽出し、同じレイアウトフォ
ームに編集して表示することにより、比較検討を容易に
する、といった要望である。[0007] Under such circumstances, it is required to obtain information desired by the user with high accuracy and to display the information in a layout desired by the user in a legible manner. More specifically, if the information desired by the user is distributed among multiple pieces of original information, the necessary information is extracted from the information and edited and displayed on the same layout form, making comparison and examination easier. It is a request to do.

【０００８】すなわち本発明は、ユーザが望む情報のレ
イアウトフォームをユーザ自身が作成し、そのレイアウ
トフォームに沿って情報を精度良く集約して表示するこ
とにより、集約された情報の一覧性を高めるとともに集
約情報の比較をしやすくできるような情報の集約整理支
援システムを提供することを解決課題とするものであ
る。That is, according to the present invention, a layout form of information desired by the user is created by the user himself, and the information is accurately aggregated and displayed according to the layout form, thereby improving the listability of the aggregated information. It is an object of the present invention to provide an information aggregation and support system that makes it easy to compare aggregated information.

【０００９】[0009]

【課題を解決するための手段】前記課題を解決するた
め、本発明の情報の集約整理支援システムは、ユーザが
求める情報を集約して表示するためのレイアウトフォー
ムを決定するとともに、そのレイアウトフォーム内の項
目に対応するデータを抽出するためのキーとなるタグを
保存するレイアウトフォーム作成手段と、ユーザが指定
した元情報の範囲内から前記タグを含む元データを取得
する情報取得手段と、前記元データから前記タグに対応
するデータを抽出して集約データを生成する情報集約手
段と、前記情報集約手段により生成された集約データを
前記レイアウトフォームに合わせて出力する集約情報出
力手段と、を備えることを特徴とする。この構成によれ
ば、大量の情報の中からユーザが求める情報を効率的
に、かつ精度良く抽出し、それをユーザの好みのレイア
ウトフォーム上に集約して表示することができるので、
情報の整理や閲覧、比較分析などが容易になる。SUMMARY OF THE INVENTION In order to solve the above-mentioned problems, an information summarizing and organizing support system according to the present invention determines a layout form for summarizing and displaying information required by a user. A layout form creating means for storing a tag serving as a key for extracting data corresponding to the item, an information acquiring means for acquiring original data including the tag from a range of the original information specified by the user, Information aggregation means for extracting data corresponding to the tag from data to generate aggregated data, and aggregated information output means for outputting the aggregated data generated by the information aggregation means in accordance with the layout form It is characterized by. According to this configuration, it is possible to efficiently and accurately extract information required by the user from a large amount of information and to collectively display the information on a layout form desired by the user.
Information can be easily organized, viewed, and compared.

【００１０】そして、前記レイアウトフォーム作成手段
において、キーとなるタグにＸＭＬタグを使用すること
により、ユーザが求める情報を、単なるデータの形式で
はなく、個々の意味を踏まえて的確に抽出することがで
きる。[0010] In the layout form creation means, by using an XML tag as a key tag, information required by a user can be accurately extracted based on individual meanings, not just a data format. it can.

【００１１】さらに、前記レイアウトフォーム作成手段
において、ＸＭＬタグとレイアウトフォーム内の項目と
を関連付けるにあたり、ＸＭＬ文書からＸＭＬタグの付
与された箇所をドラッグ・アンド・ドロップ操作により
入力できるように構成することもできる。この構成によ
れば、ユーザがＸＭＬタグについての詳しい知識を持た
ない場合でも、容易に本発明を利用することができる。[0011] Further, the layout form creation means may be configured such that, when associating an XML tag with an item in the layout form, a portion to which the XML tag is added can be input from the XML document by a drag-and-drop operation. Can also. According to this configuration, the present invention can be easily used even when the user does not have detailed knowledge of the XML tag.

【００１２】また、本発明における前記情報取得手段
は、レイアウトフォームに対応する元データを、ユーザ
がＵＲＬにより指定したＷＷＷ上のウェブページから取
得するように構成されたことを特徴とする。この構成に
よれば、多種多様で大量の情報を有しているＷＷＷ上か
ら、ユーザの求める情報を幅広く取得することができる
ので、取得される情報の質や量が充実する。Further, the information acquisition means of the present invention is characterized in that the information acquisition means is configured to acquire original data corresponding to a layout form from a web page on the WWW designated by a URL by a user. According to this configuration, information desired by the user can be acquired widely from the WWW having various and large amounts of information, so that the quality and quantity of acquired information are enhanced.

【００１３】前記情報取得手段は、レイアウトフォーム
に対応する元データをユーザが指定したＷＷＷ上のウェ
ブページから抽出できない場合に、前記ウェブページか
らリンクをたどって他のウェブページを探索することに
より、必要な元データを補充するように構成されてもよ
い。この構成よれば、あらかじめユーザが指定した情報
だけでなく、その情報と関連ある他の情報も自動的に探
索される。したがって、ユーザが全ての情報がある場所
を把握していない場合でも、断片的な情報を手掛かりに
して広範囲に情報を取得することができる。When the original data corresponding to the layout form cannot be extracted from the web page on the WWW specified by the user, the information obtaining means searches for another web page by following the link from the web page. It may be configured to supplement necessary original data. According to this configuration, not only information specified by the user in advance, but also other information related to the information is automatically searched. Therefore, even when the user does not know where all the information is located, it is possible to acquire information over a wide range using fragmentary information as a clue.

【００１４】また、本発明における前記情報集約手段
は、情報取得手段によって取得された元データから、共
通のレイアウトフォームで複数セットの集約データを生
成するように構成することができる。この構成によれ
ば、取得された大量の情報が共通のレイアウトフォーム
に集約整理されるので、情報の比較検討が容易になり、
情報の見落としも防止される。Further, the information aggregating means in the present invention can be configured to generate a plurality of sets of aggregated data in a common layout form from the original data acquired by the information acquiring means. According to this configuration, a large amount of acquired information is consolidated and arranged in a common layout form, so that comparison of information is facilitated,
Oversight of information is also prevented.

【００１５】また、前記情報集約手段は、情報取得手段
によって取得された複数セットの元データから、タグに
対応するデータを抽出して集約データを生成するように
構成することもできる。この構成によれば、ユーザの求
める情報の断片がさまざまな場所に分散して存在する場
合でも、それらをひとつのレイアウトフォームに集約す
ることで、情報の活用性を格段に向上させることができ
る。The information aggregating means may be configured to extract data corresponding to a tag from a plurality of sets of original data acquired by the information acquiring means to generate aggregated data. According to this configuration, even when pieces of information desired by the user are dispersedly present in various places, the pieces of information can be collected into one layout form, so that the usability of information can be significantly improved.

【００１６】さらに、本発明の情報の集約整理支援シス
テムは、前記情報集約手段においてレイアウトフォーム
内の全項目に対応するデータを元データから抽出できな
かった場合に、データの抽出ができなかった項目のタグ
を変更または追加して、元データの再取得ができるよう
に構成することができる。この構成によれば、ユーザが
求めるデータを十分に抽出できなかった場合でも、他の
タグをキーにして再度、必要な元データの取得をやり直
すことができるので、情報の取りこぼしを減らし、求め
る情報を確実に取得することができる。Further, in the information summarizing and organizing support system of the present invention, when the data corresponding to all the items in the layout form cannot be extracted from the original data by the information aggregating means, the data cannot be extracted. The tag can be changed or added so that the original data can be reacquired. According to this configuration, even when the data required by the user cannot be sufficiently extracted, the necessary original data can be obtained again using another tag as a key. Can be reliably obtained.

【００１７】また、本発明の情報の集約整理支援システ
ムは、前記情報集約手段においてレイアウトフォーム内
の全項目に対応するデータを元データから抽出できなか
った場合に、元データを取得する対象となる元情報の指
定範囲を変更または追加して、元データの再取得ができ
るように構成することができる。この構成によれば、元
データの情報量が不十分であった場合にこれを再取得し
て補充することができるので、情報の取りこぼしを減ら
し、求める情報を確実に取得することができる。Further, in the information summarizing and organizing support system of the present invention, when data corresponding to all items in the layout form cannot be extracted from the original data by the information summarizing means, the original data is acquired. The designated range of the original information can be changed or added so that the original data can be reacquired. According to this configuration, when the information amount of the original data is insufficient, it can be reacquired and supplemented, so that missing information can be reduced and the desired information can be acquired reliably.

【００１８】また、本発明における前記集約情報出力手
段は、情報集約手段において抽出したデータがどの元デ
ータから抽出されたものであるかを示す抽出元情報を表
示するように構成されたことを特徴とする。この構成に
よれば、集約整理された情報から、ユーザが更なる関連
情報を入手することが容易になる。In the present invention, the integrated information output means is configured to display extraction source information indicating from which original data the data extracted by the information collection means is extracted. And According to this configuration, it is easy for the user to obtain further related information from the collected and arranged information.

【００１９】また、前記集約情報出力手段は、共通のレ
イアウトフォームで生成された複数セットの集約データ
を一覧表形式で表示するように構成されたことを特徴と
する。この構成によれば、共通のレイアウトフォームで
集約された情報の視認性がさらに向上し、情報の比較検
討が容易になる。The integrated information output means is configured to display a plurality of sets of integrated data generated in a common layout form in a list form. According to this configuration, the visibility of the information aggregated by the common layout form is further improved, and comparison of the information is facilitated.

【００２０】[0020]

【発明の実施の形態】以下、本発明の実施の形態につい
て、図面を参照しつつ説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００２１】＜システム構成＞図１は、本発明の情報の
集約整理支援システムのシステム構成を示している。パ
ーソナルコンピュータ１０（以下、「ＰＣ」と記す。）
内には、本発明の動作に必要なデバイスの制御や管理を
行うデータ処理部２０、データ処理部２０が必要とする
データを保存する記憶部３０、データ処理部２０をＷＷ
Ｗに接続するためのネットワークインターフェース（Ｉ
／Ｆ）４０が設けられている。データ処理部２０は、Ｐ
Ｃ１０のメモリ上に格納されているプログラムの集まり
で、レイアウトフォーム作成部２１と、元データ取得部
２２と、取捨選択部２３とを有する。記憶部３０は、Ｐ
Ｃ１０のメモリ上に領域を確保されている保存用の空間
で、ウェブデータ記憶部３１と、ＸＭＬデータ記憶部３
２と、フォームデータ記憶部３３と、集約データ記憶部
３４とを有する。また、データ処理部２０には、ユーザ
からの各種入力を受け付けるための入力制御部５１及び
入力装置５２、ならびに各種処理結果をユーザに通知す
るための出力制御部６１および出力装置６２が接続され
ている。<System Configuration> FIG. 1 shows a system configuration of an information aggregation / arrangement support system according to the present invention. Personal computer 10 (hereinafter, referred to as “PC”)
Inside, a data processing unit 20 for controlling and managing devices necessary for the operation of the present invention, a storage unit 30 for storing data required by the data processing unit 20, and a data processing unit 20
Network interface (I
/ F) 40 are provided. The data processing unit 20
A collection of programs stored on the memory of the C10, which includes a layout form creation unit 21, an original data acquisition unit 22, and a selection unit 23. The storage unit 30 stores P
A storage space in which an area is secured in the memory of the C10, the web data storage unit 31 and the XML data storage unit 3
2, a form data storage unit 33, and an aggregated data storage unit 34. The data processing unit 20 is connected to an input control unit 51 and an input device 52 for receiving various inputs from the user, and an output control unit 61 and an output device 62 for notifying the user of various processing results. I have.

【００２２】＜情報の集約整理処理全体の流れ＞図２
は、本発明による情報の集約整理処理全体の流れを示
す。本発明による情報の集約整理処理は、ステップＳ１
０（以下、各ステップを「Ｓ１０」のように略記す
る。）のレイアウトフォーム作成処理、Ｓ２０の元デー
タ取得処理、Ｓ３０の集約データ生成処理、Ｓ４０の出
力処理、の順で行われる。以下、これらの各処理につい
て順に詳述する。<Overall Flow of Information Aggregation and Arrangement Processing> FIG. 2
Shows the flow of the entire information aggregation / arrangement process according to the present invention. The information consolidating and organizing process according to the present invention is performed in step S1.
0 (hereinafter, each step is abbreviated as “S10”), a layout form creation process, an original data acquisition process in S20, an aggregated data generation process in S30, and an output process in S40. Hereinafter, each of these processes will be described in detail in order.

【００２３】＜レイアウトフォーム作成処理＞Ｓ１０の
レイアウトフォーム作成処理では、ユーザがどのような
内容の情報を取得し、それをどのような形態に集約して
表示するのかをあらかじめ指定するためのレイアウトフ
ォームを作成する。このレイアウトフォーム作成処理
は、図１に示したデータ処理部２０の中のレイアウトフ
ォーム作成部２１にて行われる。図３に、このレイアウ
トフォーム作成処理の流れを示し、図４に、具体的なレ
イアウトフォーム７０の作成例を示す。<Layout Form Creation Processing> In the layout form creation processing in S10, a layout form for the user to specify in advance what kind of information is to be acquired and in what form the information is to be aggregated and displayed. Create This layout form creation processing is performed by the layout form creation unit 21 in the data processing unit 20 shown in FIG. FIG. 3 shows the flow of the layout form creation process, and FIG. 4 shows a specific example of creating a layout form 70.

【００２４】レイアウトフォーム７０を作成するために
は、まずＳ１１でキャプション入力を行い、情報の集約
項目となるキャプション部７１を作成する。続いて、Ｓ
１２の入力スペース作成で、取得した情報を集約して表
示するためのデータ入力部７２を作成する。続いて、Ｓ
１３の入力スペースの関連付けで、前記データ入力部７
２に取得した情報を表示させるのに必要な設定情報を入
力する。Ｓ１４の追加入力では、Ｓ１１からＳ１３まで
の作業を、ユーザが必要とするキャプション部７１およ
びデータ入力部７２の数だけ繰り返して行う。In order to create the layout form 70, first, a caption is input in S11, and a caption section 71 serving as an information aggregation item is created. Then, S
In the creation of the 12 input spaces, a data input section 72 for collecting and displaying the acquired information is created. Then, S
13, the data input unit 7
2. Enter the setting information necessary to display the acquired information. In the additional input in S14, the operations from S11 to S13 are repeated by the number of caption sections 71 and data input sections 72 required by the user.

【００２５】図４の例に沿って説明すると、レイアウト
フォーム７０は、情報を集約すべき項目となる複数のキ
ャプション部７１と、各キャプション部７１に対応して
設けられるデータ入力部７２とから構成される。ユーザ
は、レイアウトフォーム７０のページ上に自由にキャプ
ション部７１およびデータ入力部７２を配置できるが、
これらは必ず対になるように配置する。この例では、キ
ャプション部７１として、「製品名」、「型番」、「性
能」、「価格」、「オプション」が入力され、それぞれ
に対応するデータ入力部７２が矩形のスペースとして各
キャプション部７１の右横に確保されている。Referring to the example of FIG. 4, the layout form 70 is composed of a plurality of caption sections 71 as items to be aggregated and a data input section 72 provided corresponding to each caption section 71. Is done. The user can freely arrange the caption section 71 and the data input section 72 on the page of the layout form 70,
These are always arranged in pairs. In this example, “product name”, “model number”, “performance”, “price”, and “option” are input as the caption section 71, and the corresponding data input section 72 is formed as a rectangular space. Is secured to the right.

【００２６】データ入力部７２には、取得された情報
（元データ）の中から各キャプション部７１に対応する
個々のデータが抽出されて表示される。この抽出は、個
々の情報が保有する意味内容に基づいて行われる必要が
あり、そのための設定情報を入力するのが入力スペース
の関連付けである。本発明では、個々の情報を意味的に
抽出する鍵としてＸＭＬタグを利用する。以下、ＸＭＬ
タグを利用した入力スペースの関連付けについて説明す
る。In the data input section 72, individual data corresponding to each caption section 71 is extracted from the acquired information (original data) and displayed. This extraction needs to be performed based on the semantic content of each piece of information, and inputting the setting information for that is input space association. In the present invention, an XML tag is used as a key for semantically extracting individual information. Hereafter, XML
The association of the input space using the tag will be described.

【００２７】図５は、取得される元データの一例とし
て、ある製品のスペック表（ａ）および価格表（ｂ）の
表示形態を示した図で、図６は、図５（ａ）に示したス
ペック表の論理構造の一部をＸＭＬ形式の文書で表現し
たものである。図６に示すように、ＸＭＬ文書は、開始
タグと終了タグとが対になったＸＭＬタグにより、明確
な階層構造をなすように記述されている。そこで、あら
かじめデータ入力部７２に必要とするデータのＸＭＬタ
グを指定しておき、指定されたＸＭＬタグを元データか
ら検索することで、必要なデータをデータ入力部７２に
抽出することができる。FIG. 5 is a diagram showing a display form of a specification table (a) and a price table (b) of a certain product as an example of the obtained original data. FIG. A part of the logical structure of the specification table is expressed in an XML format document. As shown in FIG. 6, the XML document is described so as to form a clear hierarchical structure by an XML tag in which a start tag and an end tag are paired. Therefore, the required data can be extracted to the data input unit 72 by previously specifying the XML tag of the required data in the data input unit 72 and searching the specified XML tag from the original data.

【００２８】ただし、あらかじめ必要な情報に係るＸＭ
Ｌタグを指定するには、元データの中で使用されている
ＸＭＬタグの種類や階層構造がすべて判明している必要
がある。しかし、ＸＭＬ文書ではその作成者がＸＭＬタ
グを自由に定義できるので、本発明のユーザがそのＸＭ
Ｌタグの定義をすべて確認し適切に指定するのは容易で
はない。そこで、本発明では、ＸＭＬタグの入力を支援
する方法として、ＸＭＬ文書からデータ入力部７２に関
連付けしたい文字列を選択し、ドラッグ・アンド・ドロ
ップ操作で入力する方法を採用する。この入力方法につ
いて、図７を参照しつつ説明する。However, XM related to necessary information in advance
In order to specify an L tag, it is necessary to know all types and hierarchical structures of the XML tags used in the original data. However, since the creator of the XML document can freely define the XML tag, the user of the present invention can use the XML tag.
It is not easy to check all the definitions of L tags and specify them appropriately. Therefore, in the present invention, as a method of supporting input of an XML tag, a method of selecting a character string to be associated with the data input unit 72 from an XML document and inputting the character string by a drag-and-drop operation is employed. This input method will be described with reference to FIG.

【００２９】ユーザは、まず所望の情報を保有している
元データのサンプルをＸＭＬ文書で用意する。そして、
まずＳ１３１で、データ入力部７２に抽出したいデータ
が記載されたＸＭＬ文書中の関連部分を選択する。図４
に示したレイアウトフォーム７０および図５（ａ）〜図
６に示したスペック表を例にとると、レイアウトフォー
ム７０の「型番」というキャプション部７１に対応する
データ入力部７２に、図５（ａ）のスペック表から「型
番」の意味を持つ文字列を取り込みたい場合は、例えば
型番のひとつを示す「８２４４」という文字列を選択す
る。そして、Ｓ１３２で、この選択された文字列の前後
を囲むＸＭＬタグの抽出を行う。この例では、図６に示
したＸＭＬ文書から、「８２４４」という文字列が＜型
番＞・＜／型番＞というＸＭＬタグによって指定されて
おり、このＸＭＬタグがデータ入力部７２に入力され
る。このＸＭＬタグを抽出する処理は、図５（ａ）のス
ペック表の上ではユーザには見えないが、＜型番＞・＜
／型番＞というＸＭＬタグがどういう意味を持っている
かをユーザが知る必要はない。ＸＭＬタグが抽出できた
場合は、Ｓ１３３にてＸＭＬタグの保存を行い、抽出で
きなかった場合は、Ｓ１３４で再入力指示を行う。この
手順で、すべてのデータ入力部７２にＸＭＬタグを入力
する。A user first prepares a sample of original data holding desired information in an XML document. And
First, in S131, the data input unit 72 selects a related part in the XML document in which data to be extracted is described. FIG.
5A and the specification tables shown in FIGS. 5A to 6 as examples, the data input unit 72 corresponding to the caption unit 71 of “model number” of the layout form 70 is provided in FIG. If it is desired to take in a character string having the meaning of "model number" from the specification table of (1), for example, a character string "8244" indicating one of the model numbers is selected. Then, in step S132, an XML tag surrounding the selected character string is extracted. In this example, from the XML document shown in FIG. 6, the character string “8244” is specified by the XML tags of <model number> and </ model number>, and the XML tag is input to the data input unit 72. The process of extracting the XML tag is not visible to the user on the specification table of FIG.
The user does not need to know what the XML tag “/ model number” has. If the XML tag can be extracted, the XML tag is stored in S133. If the XML tag cannot be extracted, a re-input instruction is performed in S134. In this procedure, the XML tags are input to all the data input units 72.

【００３０】こうして、各キャプション部７１とそれに
対応する各データ入力部７２、および各データ入力部７
２に対応して意味的に関連付けされた各ＸＭＬタグが、
それぞれ入力される。このデータは、図８に示すような
構造のフォームデータとして、図１に示した記憶部３０
のフォームデータ記憶部３３に保存される。また、抽出
された各ＸＭＬタグは、記憶部３０のＸＭＬデータ記憶
部３２にも保存される。Thus, each caption section 71, each corresponding data input section 72, and each data input section 7
2, each XML tag semantically associated with
Each is entered. This data is stored as form data having a structure as shown in FIG.
Is stored in the form data storage unit 33. The extracted XML tags are also stored in the XML data storage unit 32 of the storage unit 30.

【００３１】＜元データ取得処理＞Ｓ１０のレイアウト
フォーム作成処理によって所望のレイアウトフォーム７
０が準備できると、続いて、Ｓ２０の元データ取得処理
を行う。元データ取得処理とは、ユーザが作成したレイ
アウトフォーム７０を埋めるのに必要な情報を、ユーザ
が指定したデータセット（情報を取得する対象となる元
データの集合）あるいはＷＷＷ上のウェブページから取
得する処理であり、レイアウトフォーム７０上の全ての
データ入力部７２を埋めるのに必要な元データを取得す
ることを最終目的とする。この元データ取得処理は、図
１に示したデータ処理部２０の中の元データ取得部２２
にて行われる。この元データ取得処理の流れを図９に示
す。まず、Ｓ２１で、元データ取得部２２は前記レイア
ウトフォーム作成処理で作成されたフォームデータをフ
ォームデータ記憶部３３から取得する。次に、Ｓ２２
で、レイアウトフォーム７０を埋めるのに必要な元デー
タを取得する。さらに、このレイアウトフォーム７０に
対し複数のデータセットを与える場合は、Ｓ２３で元デ
ータの追加入力を行う。<Original Data Acquisition Processing> The desired layout form 7 is obtained by the layout form creation processing in S10.
When 0 is ready, the original data acquisition process of S20 is performed. The original data acquisition processing is to acquire information necessary for filling the layout form 70 created by the user from a data set designated by the user (a set of original data from which information is to be acquired) or a web page on the WWW. The final purpose is to acquire the original data necessary to fill all the data input sections 72 on the layout form 70. This original data acquisition processing is performed by the original data acquisition unit 22 in the data processing unit 20 shown in FIG.
It is performed in. FIG. 9 shows the flow of the original data acquisition processing. First, in S21, the original data acquisition unit 22 acquires form data created in the layout form creation processing from the form data storage unit 33. Next, S22
Then, original data necessary for filling the layout form 70 is acquired. Further, when a plurality of data sets are given to the layout form 70, additional input of original data is performed in S23.

【００３２】Ｓ２２の元データ取得処理の流れについ
て、図１０を参照しつつ詳述する。まず、ユーザはＳ２
２１でデータセットを指定し、そのファイル名やＵＲＬ
を入力する。すると、元データ取得部２２はＳ２２２
で、指定されたデータセット内にフォームデータで指定
されたＸＭＬタグがすべてあるかどうかを検索する。デ
ータセット内にすべてのＸＭＬタグが見つかった場合
は、Ｓ２２３で、そのデータセットを保存する。ＸＭＬ
タグが見つからなかった場合は、Ｓ２２４で、このデー
タセット内に他文書へのリンクがあるかどうかを調べ
る。リンクがない場合は、Ｓ２２３で、ここまでのデー
タセットを保存する。リンクがある場合は、Ｓ２２５
で、まずリンク探索の上限であるかを判断する。リンク
探索の上限の決定は、特開２０００−９０１１１に開示
された技術を利用する。探索上限にかかっていれば探索
を終了し、Ｓ２２３で、ここまでのデータセットを保存
する。探索可能であれば、Ｓ２２６で、リンクをたどっ
て他の元データを取得し、Ｓ２２２で、新たに取得した
元データ内に必要なＸＭＬタグがあるかどうかの判断を
繰り返す。このような手順で、できるかぎりレイアウト
フォーム７０内のすべてのデータ入力部７２を埋めるの
に必要な元データを取得する。The flow of the original data acquisition process in S22 will be described in detail with reference to FIG. First, the user enters S2
21. Specify the data set, its file name and URL
Enter Then, the original data acquisition unit 22 proceeds to S222
Search for all the XML tags specified by the form data in the specified data set. If all the XML tags are found in the data set, the data set is saved in S223. XML
If no tag is found, it is checked in step S224 whether there is a link to another document in this data set. If there is no link, the data set so far is saved in S223. If there is a link, S225
First, it is determined whether the upper limit of the link search is reached. The determination of the upper limit of the link search utilizes a technique disclosed in JP-A-2000-90111. If the upper limit of the search is reached, the search is terminated, and in S223, the data set up to this point is stored. If the search is possible, in S226, another original data is obtained by following the link, and in S222, the determination as to whether or not a necessary XML tag is included in the newly obtained original data is repeated. With such a procedure, original data necessary to fill all the data input sections 72 in the layout form 70 is acquired as much as possible.

【００３３】＜集約データ生成処理＞Ｓ２０の元データ
取得処理によって必要な元データが取得されると、続い
て、Ｓ３０の集約データ生成処理を行う。この集約デー
タ生成処理は、Ｓ１０で作成されたレイアウトフォーム
７０の各データ入力部７２に、Ｓ２０で取得した元デー
タの中からそれぞれ対応するデータを抽出して入力する
処理である。この集約データ生成処理は、図１に示した
データ処理部２０の中の取捨選択部２３にて行われる。
この集約データ生成処理の流れを図１１に示す。まず、
Ｓ３１で、レイアウトフォーム７０の各データ入力部７
２に関連付けられたＸＭＬタグを取得し、このＸＭＬタ
グによって元データを検索する。図４に示したレイアウ
トフォーム７０および図５（ａ）〜図６に示したスペッ
ク表を例にとると、レイアウトフォーム７０上の「製品
名」に対応するデータ入力部７２には、ＸＭＬ文書中の
＜シリーズ＞・＜／シリーズ＞というＸＭＬタグが対応
し、これらのＸＭＬタグで検索することにより、「ＭＮ
−３６０」という文字列が抽出される。これをデータ入
力部７２の数だけ繰り返し、各ＸＭＬタグに対応するデ
ータを元データから順次抽出して、集約データを生成す
る。<Aggregated Data Generation Process> When the required original data is acquired by the original data acquisition process of S20, subsequently, the aggregated data generation process of S30 is performed. This aggregated data generation process is a process of extracting and inputting corresponding data from the original data acquired in S20 to each data input unit 72 of the layout form 70 created in S10. This aggregated data generation process is performed by the selection unit 23 in the data processing unit 20 shown in FIG.
FIG. 11 shows the flow of the aggregated data generation process. First,
In S31, each data input unit 7 of the layout form 70
Then, an XML tag associated with No. 2 is acquired, and the original data is searched by using the XML tag. Taking the layout form 70 shown in FIG. 4 and the specification tables shown in FIGS. 5A to 6 as examples, the data input unit 72 corresponding to the “product name” on the layout form 70 includes XML tags such as <series> and </ series> correspond to each other, and by searching using these XML tags, "MN"
The character string "-360" is extracted. This is repeated by the number of data input units 72, and data corresponding to each XML tag is sequentially extracted from the original data to generate aggregated data.

【００３４】このとき、元データの内容によっては、複
数の集約データを生成する場合がある。前記の例では、
元データ中に複数個の型番とそれに対応する各スペック
が記載されているので、ユーザとしては図１２に示すよ
うに、複数枚のレイアウトフォーム７０に型番別にスペ
ックを集約したいとする。ここで、各型番と各スペック
とが互いに対応したデータとして抽出できるかどうかと
いうことが問題になる。これについて、図６に示したＸ
ＭＬ文書を例にとり説明する。At this time, depending on the contents of the original data, a plurality of aggregated data may be generated. In the above example,
Since a plurality of model numbers and respective specifications corresponding to the model numbers are described in the original data, it is assumed that the user wants to aggregate the specifications by model number into a plurality of layout forms 70 as shown in FIG. Here, it is a problem whether each model number and each specification can be extracted as data corresponding to each other. In this regard, X shown in FIG.
This will be described using an ML document as an example.

【００３５】図６のＸＭＬ文書を、＜型番＞というＸＭ
Ｌタグ（以下、終了タグは省略する。）および＜スペッ
ク＞というＸＭＬタグで検索すると、＜型番＞というＸ
ＭＬタグでは「８２４４」、「８１２８」、「４３２
２」…という文字列が得られ、＜スペック＞というＸＭ
Ｌタグでは「○○○○」、「□□□□」、「△△△△」
…という文字列が得られる。このままでは、どの型番と
どのスペックが対応しているのかわからない。しかし、
ＸＭＬ文書は明快な階層構造を有するという特長があ
り、慣例として同種のデータや意味的に関連するデータ
は同じＸＭＬタグで括られ、同一階層にまとめられる。
この原則によれば、＜機種＞というＸＭＬタグで括られ
た同じ階層にある＜型番＞と＜スペック＞とが互いに対
応するものであると判別できる。このように、ＸＭＬタ
グの階層構造を手掛かりにすれば、型番「８２４４」と
スペック「○○○○」、型番「８１２８」とスペック
「□□□□」…のように、互いに対応するデータを同一
のレイアウトフォーム７０に集約することができる。The XML document shown in FIG.
When an L tag (hereinafter, an end tag is omitted) and an XML tag of <specification> are searched, an X of <model number> is obtained.
For the ML tag, “8244”, “8128”, “432”
2 ”... is obtained, and the XM
For the L tag, "○○○○", "□□□□", "△△△△"
... is obtained. It is not clear which model numbers correspond to which specifications. But,
XML documents have the feature of having a clear hierarchical structure, and the same type of data and semantically related data are conventionally enclosed in the same XML tag and put together in the same hierarchy.
According to this principle, it can be determined that <model number> and <specification> in the same hierarchy enclosed by an XML tag of <model> correspond to each other. As described above, if the hierarchical structure of the XML tag is used as a clue, data corresponding to each other such as a model number “8244” and a specification “XXXXX”, and a model number “8128” and a specification “□□□□”. They can be combined into the same layout form 70.

【００３６】なお、ＸＭＬ文書における階層構造やタグ
定義については、業界で標準化を図る動きもある。その
標準化が進めば、レイアウトフォーム作成処理において
も、ＸＭＬ文書の基本的な階層構造を記憶し、あらかじ
め上位階層のＸＭＬタグをサンプルとなる元データから
取得しておくなどして、より汎用性の高い情報の集約整
理が可能になる。There is a movement in the industry to standardize the hierarchical structure and tag definitions in XML documents. As the standardization progresses, even in the layout form creation process, the basic hierarchical structure of the XML document is stored, and the XML tags of the upper hierarchy are acquired in advance from the original data as a sample, thereby increasing the versatility. High-level information can be aggregated and organized.

【００３７】また、データを抽出するための元データが
複数にわたっている場合も、前記と同様に、抽出したデ
ータ間の対応関係を判別する必要がある。この場合は、
異なる元データから抽出した集約データを対比し、集約
データの一部が一致するもの同士を合成することによ
り、ひとまとまりの集約データとして結合することがで
きる。図５（ａ）のスペック表と同（ｂ）の価格表の例
では、スペック表から［型番「８２４４」・スペック
「○○○○」］という集約データを抽出し、価格表から
［型番「８２４４」・価格「ＸＸＸ」］という集約デー
タを抽出した後、型番「８２４４」というデータをキー
にして二つの集約データを結合する。これにより、［型
番「８２４４」・スペック「○○○○」・価格「ＸＸ
Ｘ」］という集約データを生成することができる。この
ように複数組の集約データを結合するためには、レイア
ウトフォーム作成処理において、ユーザの注目している
項目をあらかじめキー項目として指定しておく必要があ
る。このキー項目を適切に設定することにより、ユーザ
が求める情報をより有用な形態で集約することができ
る。Also, when there are a plurality of original data for extracting data, it is necessary to determine the correspondence between the extracted data in the same manner as described above. in this case,
By comparing the aggregated data extracted from the different original data and synthesizing those in which a part of the aggregated data matches, it is possible to combine the aggregated data as a set of aggregated data. In the example of the price table shown in FIG. 5A and the price table shown in FIG. 5B, aggregated data of [model number “8244” and specification “xxxxxx”] is extracted from the specification table, and [model number “ After extracting the aggregated data of “8244” and price “XXX”], the two aggregated data are combined using the data of the model number “8244” as a key. As a result, [model number “8244”, specification “XXXXX”, price “XX
X "]. In order to combine a plurality of sets of aggregated data in this manner, it is necessary to previously specify an item of interest of the user as a key item in the layout form creation processing. By appropriately setting the key items, information required by the user can be collected in a more useful form.

【００３８】次に、図１１の中でＳ３２で示した空欄処
理について説明する。空欄処理は、元データから必要な
データの抽出ができず、レイアウトフォーム７０上のデ
ータ入力部７２のいずれかに空欄が残ったとき、その空
欄を埋めるかどうかをユーザの追加入力によって判断
し、必要がある場合は再度、データの抽出を行う処理で
ある。Next, the blank processing shown in S32 in FIG. 11 will be described. In the blank processing, when necessary data cannot be extracted from the original data and a blank remains in any of the data input units 72 on the layout form 70, it is determined whether or not to fill the blank by an additional input by the user. If necessary, the data is extracted again.

【００３９】レイアウトフォーム７０上にデータの抽出
ができなかったデータ入力部７２がある場合、そのデー
タ入力部７２は、一旦、空欄で表示されユーザに通知さ
れる。このときユーザは、データ処理部２０に対し、以
下の２種類の命令を行うことができる。If there is a data input section 72 on the layout form 70 from which data could not be extracted, the data input section 72 is displayed once in a blank space and notified to the user. At this time, the user can issue the following two types of commands to the data processing unit 20.

【００４０】第一は、抽出条件を変更しての再抽出であ
る。これは、空欄となったデータ入力部７２について、
データ抽出の際の検索キーとなるＸＭＬタグを変更また
は追加入力して抽出条件を変更し、同じ元データを再
度、検索することにより、最初に取りこぼしたデータを
抽出し直す処理である。The first is re-extraction after changing the extraction conditions. This is because, for the data input section 72 which is blank,
This is a process of changing or additionally inputting an XML tag serving as a search key at the time of data extraction to change an extraction condition, and retrieving the same original data again, thereby extracting data that was initially dropped.

【００４１】第二は、元情報の範囲を拡大しての元デー
タの再取得である。これは、元情報の指定範囲を変更ま
たは追加して、より広い範囲から元データを再取得した
後、再び集約データ生成処理を行うことにより、新たな
データを補充する処理である。The second is re-acquisition of the original data by expanding the range of the original information. This is a process of changing or adding the designated range of the original information, reacquiring the original data from a wider range, and performing the aggregated data generation process again to supplement the new data.

【００４２】これらの処理の流れを図１３に示す。まず
Ｓ３２１で、データの再抽出を行うべき空欄を選択す
る。次にＳ３２２で、この空欄のデータ入力部７２に指
定するＸＭＬタグの再入力を行うべきか否かを判断し、
必要に応じてＳ３２３で再入力およびそのＸＭＬタグの
保存を行う。再入力を行わない場合は、そのままＳ３２
４に移り、元データを再取得するためのデータセットの
追加入力を行う。ここでは、新たに元データを取得しな
おすための元情報の範囲をＵＲＬ等により追加する。か
かるＵＲＬは、複数件でも可とする。データセットの追
加入力を行った場合は、Ｓ３２５で再取得されたデータ
セットを元データ取得部２２に保存する。データセット
の再取得を行わない場合は、そのままＳ３２６に移る。
そして、新たに入力されたＸＭＬタグまたは新たに取得
された元データに基づき、既述の集約データ生成処理
（Ｓ３０）と同じ手順にて、この空欄に対する集約デー
タ生成処理を行う。FIG. 13 shows the flow of these processes. First, in S321, a blank to be re-extracted is selected. Next, in S322, it is determined whether or not the XML tag specified in the blank data input section 72 should be re-input.
If necessary, re-input and storage of the XML tag are performed in S323. If re-input is not performed, S32 is left as it is.
Then, the process proceeds to step 4 to additionally input a data set for reacquiring the original data. Here, the range of the original information for newly acquiring the original data is added by a URL or the like. Such URLs may be plural. When the additional input of the data set is performed, the data set reacquired in S325 is stored in the original data acquisition unit 22. If the data set is not to be reacquired, the process proceeds to S326.
Then, based on the newly input XML tag or the newly acquired original data, aggregate data generation processing for this blank is performed in the same procedure as the above-described aggregated data generation processing (S30).

【００４３】こうして生成した集約データを、図１１の
Ｓ３３で、データ記憶部３０内の集約データ記憶部３４
に保存する。保存される集約データのデータ構造を図１
４に例示する。集約データ８０は、フォームデータ名８
１、ＸＭＬタグ名８２、抽出データ８３、抽出元データ
８４、の４要素で構成される。フォームデータ名８１
は、この集約データ８０を生成する元になったレイアウ
トフォーム７０の名称、ＸＭＬタグ名８２は、データを
抽出するための検索に用いたＸＭＬタグの名称、抽出デ
ータ８３は、前記ＸＭＬタグでの検索によって抽出した
文字列などのデータ要素、抽出元データ８４は、前記抽
出データ８３を取得したデータセットのファイル名やＵ
ＲＬを、それぞれ示す。フォームデータ名８１以外のデ
ータ要素は、必要な数だけ、互いに対応づけられて同時
に保存される。The aggregated data generated in this way is stored in the aggregated data storage unit 34 in the data storage unit 30 in S33 of FIG.
To save. Figure 1 shows the data structure of the aggregated data to be stored
4 is illustrated. The aggregate data 80 is the form data name 8
1, XML tag name 82, extraction data 83, and extraction source data 84. Form data name 81
Is the name of the layout form 70 from which the aggregated data 80 was generated, the XML tag name 82 is the name of the XML tag used in the search for extracting data, and the extracted data 83 is the name of the XML tag. A data element such as a character string extracted by the search and the extraction source data 84 are the file name or U
RL are indicated respectively. A required number of data elements other than the form data name 81 are simultaneously stored in association with each other.

【００４４】＜出力処理＞Ｓ３０の集約データ生成処理
によって集約データが生成・保存されると、続いて、Ｓ
４０の出力処理を行う。この出力処理では、フォームデ
ータ記憶部３３に保存されたフォームデータと、集約デ
ータ記憶部３４に保存された集約データとの組み合わせ
により、レイアウトフォーム７０に合わせた集約データ
の表示が行われる。前記した図１２は、図４に例示した
レイアウトフォーム７０を元にして、図５に例示した元
情報から必要なデータを集約整理したときの最終的な表
示形態を例示したものである。<Output Process> When the aggregated data is generated and stored by the aggregated data generation process of S30, the process proceeds to S30.
40 output processing is performed. In this output process, the combined data displayed in the layout form 70 is displayed by a combination of the form data stored in the form data storage unit 33 and the combined data stored in the combined data storage unit 34. FIG. 12 described above illustrates a final display form when necessary data is consolidated and arranged from the original information illustrated in FIG. 5 based on the layout form 70 illustrated in FIG.

【００４５】なお、出力の形態としては、抽出されたデ
ータが含まれていた元情報へのアクセスを容易にするた
めに、抽出されたデータが記載されていた元データのフ
ァイル名やＵＲＬを同時に表示することも考えられる。
これによれば、ユーザが集約情報を見て、それに関連す
る情報を追加的に探索・収集することが容易になる。ま
た、ＷＷＷ上のＵＲＬであれば、自動的にリンクを生成
することでアクセスがより簡単になる。In order to facilitate access to the original information containing the extracted data, a file name and a URL of the original data in which the extracted data is described are simultaneously output. It may be displayed.
According to this, it becomes easy for the user to look at the aggregated information and additionally search and collect information related thereto. In the case of a URL on the WWW, access is made easier by automatically generating a link.

【００４６】また、図１２のようなデータシート形式で
蓄積された複数セットの集約データを、図１５に示すよ
うな一覧表形式で１ページ（１画面）に表示することに
より、集約情報の一覧性や比較性を高めることも可能で
ある。By displaying a plurality of sets of aggregated data stored in a data sheet format as shown in FIG. 12 on one page (one screen) in a list format as shown in FIG. 15, a list of aggregated information is displayed. It is also possible to enhance sexuality and comparability.

【００４７】[0047]

【発明の効果】本発明の情報の集約整理支援システム
は、ユーザが求める情報の表示形態を決定するとともに
抽出すべきデータのキーとなるタグを指定して保存する
レイアウトフォーム作成手段と、元情報から前記タグを
含む元データを取得する情報取得手段と、前記元データ
から前記タグに対応するデータを抽出して集約データを
生成する情報集約手段と、前記情報集約手段により生成
された集約データを前記レイアウトフォームに合わせて
出力する集約情報出力手段とを備えて構成されるので、
大量の情報の中からユーザが求める情報を効率的に、か
つ精度良く抽出し、それをユーザの好みの表示形態で表
示することが可能になる。したがって、情報の整理や閲
覧、比較分析などを迅速かつ効率的に行うことができ
る。According to the present invention, there is provided an information summarizing and organizing support system which determines a display mode of information required by a user, specifies a tag which is a key of data to be extracted, and saves the layout form. Information acquisition means for acquiring the original data including the tag from, information aggregation means for extracting data corresponding to the tag from the original data to generate aggregated data, and aggregated data generated by the information aggregation means Since it is configured to include an integrated information output unit that outputs in accordance with the layout form,
It is possible to efficiently and accurately extract information required by a user from a large amount of information, and display the information in a display form desired by the user. Therefore, it is possible to quickly and efficiently organize and browse information and perform comparative analysis.

[Brief description of the drawings]

【図１】本発明の実施の形態にかかる情報の集約整理支
援システムのシステム構成図である。FIG. 1 is a system configuration diagram of an information aggregation / arrangement support system according to an embodiment of the present invention.

【図２】本発明による情報の集約整理処理の流れを示す
フローチャートである。FIG. 2 is a flowchart showing a flow of an information aggregation / arrangement process according to the present invention.

【図３】図２中のレイアウトフォーム作成処理の流れを
示すフローチャートである。FIG. 3 is a flowchart showing a flow of a layout form creation process in FIG. 2;

【図４】具体的なレイアウトフォームの作成例を示す図
である。FIG. 4 is a diagram showing a specific example of creating a layout form.

【図５】元情報の一例として、ある製品のスペック表
（ａ）および価格表（ｂ）の表示形態を示した図であ
る。FIG. 5 is a diagram showing a display form of a specification table (a) and a price table (b) of a certain product as an example of original information.

【図６】図５（ａ）に示したスペック表の論理構造の一
部をＸＭＬ形式で表現した図である。6 is a diagram expressing a part of the logical structure of the specification table shown in FIG. 5A in an XML format.

【図７】図３中の入力スペースの関連付けにおいて、Ｘ
ＭＬタグの入力を支援する方法を示すフローチャートで
ある。FIG. 7 shows an example of X in association of the input space in FIG.
9 is a flowchart illustrating a method for supporting input of an ML tag.

【図８】レイアウトフォームのデータ構造を模式的に例
示した図である。FIG. 8 is a diagram schematically illustrating a data structure of a layout form.

【図９】図２中の元データ取得処理の流れを示すフロー
チャートである。FIG. 9 is a flowchart showing a flow of an original data acquisition process in FIG. 2;

【図１０】図９中の元データ取得処理における元データ
の探索・保存処理の流れを示すフローチャートである。FIG. 10 is a flowchart showing a flow of an original data search and storage process in the original data acquisition process in FIG. 9;

【図１１】図２中の集約データ生成処理の流れを示すフ
ローチャートである。FIG. 11 is a flowchart showing a flow of an aggregated data generation process in FIG. 2;

【図１２】本発明によって最終的に出力される集約情報
の編集例を示す図である。FIG. 12 is a diagram showing an example of editing aggregated information finally output by the present invention.

【図１３】図１１中の集約データ生成処理における空欄
処理の流れを示すフローチャートである。13 is a flowchart showing a flow of a blank process in the aggregated data generation process in FIG.

【図１４】集約データのデータ構造を模式的に例示した
図である。FIG. 14 is a diagram schematically illustrating a data structure of aggregated data.

【図１５】集約データを一覧表形式で表示したときの表
示例を示す図である。FIG. 15 is a diagram illustrating a display example when aggregated data is displayed in a list format.

[Explanation of symbols]

２１レイアウトフォーム作成部（レイアウトフォーム
作成手段）２２元データ取得部（情報取得手段）２３取捨選択部（情報集約手段）６１出力制御部（集約情報出力手段）７０レイアウトフォーム８０集約データ21 layout form creation unit (layout form creation unit) 22 original data acquisition unit (information acquisition unit) 23 sorting unit (information aggregation unit) 61 output control unit (aggregated information output unit) 70 layout form 80 aggregated data

Claims

[Claims]

1. A layout form creation means for deciding a layout form for collecting and displaying information required by a user and storing a tag serving as a key for extracting data corresponding to an item in the layout form. And information acquisition means for acquiring original data including the tag from within the range of the original information specified by the user; information aggregation means for extracting aggregated data by extracting data corresponding to the tag from the original data; An integrated information output unit that outputs the integrated data generated by the information aggregating unit in accordance with the layout form.

2. A layout form creation means, comprising:
2. The information aggregation / arrangement support system according to claim 1, wherein an XML tag is used as a key tag.

3. The layout form creation means,
3. The information according to claim 2, wherein, when associating the XML tag with an item in the layout form, a portion to which the XML tag is added can be input from the XML document by a drag-and-drop operation. Aggregation and sorting support system.

4. An information acquisition unit according to claim 1, wherein the original data corresponding to the layout form is designated by a W
2. The information aggregation and consolidation support system according to claim 1, wherein the system is configured to acquire the information from a web page on the WW.

5. The information acquiring means, when the original data corresponding to the layout form cannot be extracted from the web page specified by the user on the WWW, by following the link from the web page and searching for another web page. 5. The information aggregation and consolidation support system according to claim 4, wherein the system is configured to supplement necessary original data.

6. The information consolidating and organizing support according to claim 1, wherein the information consolidating means generates a plurality of sets of consolidated data in a common layout form from the original data acquired by the information acquiring means. system.

7. The information according to claim 1, wherein the information aggregating unit extracts aggregated data by extracting data corresponding to the tag from a plurality of sets of original data acquired by the information acquiring unit. Aggregation and sorting support system.

8. When data corresponding to all items in the layout form cannot be extracted from the original data by the information aggregating means, the tag of the item for which data could not be extracted is changed or added, and 2. The information aggregation / arrangement support system according to claim 1, wherein the system is configured to enable reacquisition.

9. When the information aggregating means cannot extract data corresponding to all items in the layout form from the original data, the designated range of the original information from which the original data is to be obtained is changed or added. 2. The system according to claim 1, wherein the original data can be re-acquired.

10. The integrated information output means is configured to display extraction source information indicating from which original data the data extracted by the information integration means is extracted. 1. The information aggregation and support system described in 1.

11. The aggregation of information according to claim 6, wherein the aggregation information output unit is configured to display a plurality of sets of aggregation data generated in a common layout form in a list format. Organizing support system.