JP2006235942A

JP2006235942A - Apparatus for processing structured document

Info

Publication number: JP2006235942A
Application number: JP2005048904A
Authority: JP
Inventors: Wataru Shimizu; 渉清水
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2005-02-24
Filing date: 2005-02-24
Publication date: 2006-09-07

Abstract

<P>PROBLEM TO BE SOLVED: To solve the problem with structured documents capable of describing links wherein there is the possibility that some of the links may not be necessary to the documents, in view of the fact that a document group consisting of a plurality of such structured documents, which are often bundled together and handled as one document, requires the links to be traced starting from the first document of the group during such processes as sending, receiving and printing the group. <P>SOLUTION: In the first document of a document group consisting of a plurality of documents, a list of documents included in the document group is stored so as to prevent processing of unnecessary documents during such a process as printing and to prevent the failure to process necessary documents, so as to achieve efficient processing. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、情報処理装置に関する。 The present invention relates to an information processing apparatus.

ウェブサイトの記述言語として使われているＨＴＭＬ（ＨｙｐｅｒＴｅｘｔＭａｒｋｕｐＬａｎｇｕａｇｅ：ｈｔｔｐ：／／ｗｗｗ．ｗ３．ｏｒｇ／Ｍａｒｋｕｐ／）や、汎用データ記述言語として近年広く用いられているＸＭＬ（ＥｘｔｅｎｓｉｂｌｅＭａｒｋｕｐＬａｎｇｕａｇｅ：ｈｔｔｐ：／／ｗｗｗ．ｗ３．ｏｒｇ／ＸＭＬ／）などの構造化文書においては、しばしば他のファイルやデータを参照するためのリンク記述の方法が用意されている。例えば、ＨＴＭＬにおいては、ｉｍｇ要素を使用することにより画像ファイルを表示の一部に使用したり、ａ要素を使用することにより他ファイルへのハイパーリンク機能を実現したりすることができる。さらに今後はＸＭＬやＳＶＧ等の普及により、多くのデータが構造化文書の形式で処理されることになる。このような構造化文書は、一つのファイルではなく、複数のファイル全体で一つの文書として成り立っているものが多い。そのため、このような構造化文書を保存や印刷などの処理をする際は、複数のファイルをまとめて処理をする必要がある。そのため、ＨＴＭＬには指定された文書がどのような意味を持つかを明示するＬｉｎｋタグがある。ＨＴＭＬのＬｉｎｋタグの使用例を図１１に示す。先頭文書である１１０１（ｉｎｄｅｘ．ｈｔｍｌ）には＜ｌｉｎｋｒｅｌ＝“ｎｅｘｔ”ｈｒｅｆ＝“ｃｈａｐ１．ｈｔｍｌ”＞と記述されてあり、これはこの先頭文書の次の文書が１１０２（ｃｈａｐ１．ｈｔｍｌ）であることを示す。１１０２（ｃｈａｐ１．ｈｔｍｌ）、１１０３（ｃｈａｐ２．ｈｔｍｌ）、１１０４（ｃｈａｐ３．ｈｔｍｌ）にも同様にＬｉｎｋタグがあり、ｎｅｘｔ、ｐｒｅｖ、ｉｎｄｅｘという属性で、それぞれ次の文書、前の文書、先頭文書を表している。このような構造の場合、次の文書をたどっていくことにより文書の全体をたどることができる（例えば〔特許文献１〕参照）。
特開２００３−３１６７６７号公報 HTML (Hyper Text Markup Language: http://www.w3.org/Markup/), which is used as a description language for websites, and XML (Extensible Markup Language: http: //), which has been widely used as a general-purpose data description language in recent years. In structured documents such as: //www.w3.org/XML/), link description methods for referring to other files and data are often prepared. For example, in HTML, an image file can be used as a part of display by using an img element, and a hyperlink function to another file can be realized by using an a element. In the future, with the spread of XML, SVG, etc., a lot of data will be processed in the form of structured documents. In many cases, such a structured document is not a single file, but is formed as a single document by using a plurality of files as a whole. Therefore, when processing such a structured document as storage or printing, it is necessary to process a plurality of files together. Therefore, HTML has a Link tag that clearly indicates what the designated document has. An example of using the HTML Link tag is shown in FIG. The first document 1101 (index.html) is described as <link rel = “next” href = “chap1.html”>, and the next document after this first document is 1102 (chap1.html). Indicates that there is. Similarly, 1102 (chap1.html), 1103 (chap2.html), and 1104 (chap3.html) also have Link tags, and the attributes of next, prev, and index are used for the next document, the previous document, and the first document, respectively. Represents. In the case of such a structure, the entire document can be traced by following the next document (see, for example, [Patent Document 1]).
JP 2003-316767 A

従来の構造化文書の形式では、印刷や保存、データの受け渡しなど機器間でデータを送受信する際、それぞれのファイルごとに操作を行う必要がある。リンク先をたどる方法もあるが、重要性の低いデータを処理してしまうことがある。ＨＴＭＬのＬｉｎｋタグの技術では、文書全体の構造を知るには文書を一つずつたどる必要があるため文書の数が多い場合や、文書がネットワーク上の他の機器にある場合などに非常に時間と手間がかかるという問題がある。 In the conventional structured document format, when data is transmitted and received between devices such as printing, storing, and data transfer, it is necessary to perform operations for each file. There is a way to follow the link destination, but it may process less important data. In HTML Link tag technology, it is necessary to trace one document at a time in order to know the structure of the entire document. Therefore, when the number of documents is large or when the documents are in other devices on the network, it takes a very long time. There is a problem that it takes time and effort.

本発明に係わる構造化文書処理装置は、上記目的を達成するためのもので、構造化文書のうち、一括して処理すべきリンク先を、先頭ページに埋め込む構成になっている。上記構成からなる構造化文書処理装置においては、構造化文書の先頭に処理の対象となる文書の一覧が格納されていることにより、構造化文書に関連付けられたデータのうち、処理すべきものとそうでないものが明確に区別できるため、不必要な文書を処理したり、必要な文書を処理し損ねたりすることがなくなる。また、先頭ページに含まれているため、文書をやり取りした後にもリンク先一覧情報が残り、文書を受け渡した後も正確な処理を行うことが可能になる。 The structured document processing apparatus according to the present invention is for achieving the above-described object, and is configured to embed, in the first page, link destinations to be collectively processed in the structured document. In the structured document processing apparatus configured as described above, a list of documents to be processed is stored at the beginning of the structured document, so that data to be processed among the data associated with the structured document is likely to be processed. Since those that are not can be clearly distinguished, unnecessary documents will not be processed or necessary documents will not be missed. In addition, since it is included in the first page, link destination list information remains even after the document is exchanged, and an accurate process can be performed after the document is delivered.

以上説明したように、本発明によれば、リンクにより複数のファイルからなる構造化文書の範囲を明確にすることで、保存、印刷などの処理や、他の機器との送受信を製作者の意図どおりに、かつ無駄なく行うことが可能になる。 As described above, according to the present invention, the scope of a structured document consisting of a plurality of files is clarified by a link, so that the process of storage, printing, etc., and transmission / reception with other devices are intended by the producer. It is possible to perform as usual and without waste.

（実施形態１）
図１は、本発明をパーソナルコンピュータ等からなるコンピュータ装置に適用した第１の実施形態を示す図である。本実施例は、ネットワーク上の構造化文書を印刷する際に、あらかじめ定められたデータのみを印刷することにより、必要な文書を漏らさず、かつ、不必要な文書を除いて印刷するものである。 (Embodiment 1)
FIG. 1 is a diagram showing a first embodiment in which the present invention is applied to a computer apparatus composed of a personal computer or the like. In this embodiment, when printing a structured document on a network, only predetermined data is printed, so that necessary documents are not leaked and unnecessary documents are excluded. .

図１では、構造化文書を作成するコンピュータ装置１０１と、コンピュータ装置１０１によって作成された構造化文書を保存するファイルサーバ１０３と、ファイルサーバ１０３に保存された構造化文書を閲覧するコンピュータ装置１０４と、コンピュータ装置１０４が構造化文書を印刷するためのプリンタ１０５がＬＡＮ１０２によって接続されている。 In FIG. 1, a computer device 101 that creates a structured document, a file server 103 that stores a structured document created by the computer device 101, and a computer device 104 that browses a structured document stored in the file server 103. A printer 105 for the computer device 104 to print a structured document is connected by a LAN 102.

図２は本発明に係るコンピュータ装置１０１の構成を示すブロック図である。同図において、ＣＰＵ２０１は、システム制御部であり、装置全体を制御する。ＲＯＭ２０２は、ＣＰＵの制御プログラムや各種固定データを格納するものである。ＲＡＭ２０３は、ＳＲＡＭ、ＤＲＡＭ等で構成され、プログラム制御変数等を格納するものである。また、各種設定パラメータ、各種ワーク用バッファもＲＡＭ２０３に格納されるものである。記憶部２０４はハードディスク等で構成され、ファイルデータを格納するためのものである。操作部２０５は、キーボード、マウス等で構成され、オペレータが各種入力操作を行うためのものである。表示部２０６は、ディスプレイ等でオペレータに表示通知するためのものである。ＬＡＮｉ／ｆ２０７はＬＡＮ回線２０８に接続するためのインターフェイスである。 FIG. 2 is a block diagram showing the configuration of the computer apparatus 101 according to the present invention. In the figure, a CPU 201 is a system control unit and controls the entire apparatus. The ROM 202 stores CPU control programs and various fixed data. The RAM 203 is composed of SRAM, DRAM, and the like, and stores program control variables and the like. Various setting parameters and various work buffers are also stored in the RAM 203. The storage unit 204 is composed of a hard disk or the like, and is for storing file data. The operation unit 205 includes a keyboard, a mouse, and the like, and is used by the operator to perform various input operations. The display unit 206 is used to notify the operator of the display using a display or the like. A LAN i / f 207 is an interface for connecting to the LAN line 208.

図３はコンピュータ装置１０１によって作成される、構造化文書の概念図である。３０１（ｉｎｄｅｘ．ｈｔｍｌ）中の＜ａｈｒｅｆ＝”“ａｂｏｕｔ．ｈｔｍｌ”＞、＜ａｈｒｅｆ＝“ｍａｎｕａｌ．ｍｉｄ”＞、＜ａｈｒｅｆ＝“ｒｅｆｅｒｅｎｃｅ”＞はそれぞれ３０２（ａｂｏｕｔ．ｈｔｍｌ）、３０４（ｍａｎｕａｌ．ｈｔｍｌ）、３０６（ｒｅｆｅｒｅｎｃｅ．ｈｔｍｌ）というファイルにリンクしていることを示している。また、３０２（ａｂｏｕｔ．ｈｔｍｌ）中の＜ＥＭＢＥＤＳＲＣ＝“ｂｇｍ．ｍｐ３＞はｂｇｍ．ｍｐ３に、３０４（ｍａｎｕａｌ．ｈｔｍｌ）中の＜ｉｍｇｓｒｃ＝“ｓａｍｐｌｅ．ｊｐｇ”＞はｓａｍｐｌｅ．ｊｐｇに、それぞれリンクしていることを示す。図３においては、ＨＴＭＬファイルや画像、音声ファイル等の複数のオブジェクトが、文書内の文字列により関連付けられている。ここでは、リンクの設定は文字列によってなされているが、画像などにリンクを設定することも可能である。また、ここでは、リンク先は一つのファイルになっているが、ファイルの一部分のみを指すことも可能である。 FIG. 3 is a conceptual diagram of a structured document created by the computer apparatus 101. 301 (index.html) <a href = ”“ about. html ”>, <a href =“ manual. “mid”> and <a href=“reference”> indicate that they are linked to files 302 (about.html), 304 (manual.html), and 306 (reference.html), respectively. (EMBED SRC = “bgm. mp3> is bgm. <img src = “sample.jpg”> in 304 (manual.html) is sample.mp3. jpg indicates that each is linked. In FIG. 3, a plurality of objects such as an HTML file, an image, and an audio file are associated by a character string in the document. Here, the link is set by a character string, but it is also possible to set a link to an image or the like. Here, the link destination is a single file, but it is also possible to indicate only a part of the file.

また図３において、この文書のトップページであるｉｎｄｅｘ．ｈｔｍｌには、製作者が作成した関連文書リストが２種類記されてある。最初の＜ｃｏｌｌｅｃｔｉｏｎ＞要素にはｉｎｄｅｘ．ｈｔｍｌ、ａｂｏｕｔ．ｈｔｍｌ、ｍａｎｕａｌ．ｈｔｍｌのみ記されてある。これは製作者がこの文書の概要としてテキストのみを扱うために記したものである。２番目の＜ｃｏｌｌｅｃｔｉｏｎ＞要素にはｒｅｆｅｒｅｎｃｅ．ｈｔｍｌを除くすべてのファイルが記されてある。これは製作者がこの文書に必要なものをすべてを扱うために記述したものである。また、ｒｅｆｅｒｅｎｃｅ．ｈｔｍｌはこの文書からリンクされているが、この文書にとって重要性は薄いと製作者が判断したためである。 Also, in FIG. 3, the index. Two types of related document lists created by the producer are written in html. The first <collection> element has an index. html, about. html, manual. Only html is marked. This is because the producers only treated the text as an overview of this document. The second <collection> element has a reference. All files except html are listed. This is what the producer has written to handle everything that this document needs. Also, reference. This is because html has been linked from this document, but the producer has determined that it is not important to this document.

関連文書リストを作成する手順を図９に示す流れ図に沿って説明する。製作者はあらかじめ関連文書群のうち関連文書リストに含める条件を指定する（Ｓ９０１）。なお本発明において「関連文書群」という言葉は構造化文書のリンクを無条件にたどることで得られる、ツリー構造を持つ文書の集合である。関連文書リストに含める条件として、階層数による指定する方法、ファイルの種類または拡張子で判断する方法、リンクの記述が相対パスなら関連文書リストに含め、絶対パスで記述されたものは除外する方法、ファイルサイズが大きいもののみ除外する方法、等が考えられる。 The procedure for creating the related document list will be described with reference to the flowchart shown in FIG. The producer designates the conditions to be included in the related document list in the related document group in advance (S901). In the present invention, the term “related document group” is a set of documents having a tree structure obtained by unconditionally following links of structured documents. As a condition to be included in the related document list, specify by the number of layers, determine by file type or extension, include in the related document list if the link description is a relative path, and exclude the one described in the absolute path A method of excluding only a file having a large file size can be considered.

次に先頭文書を読み込み（Ｓ９０２）、読み込んだ構造化文書を解析する（Ｓ９０３）。現在読んでいる文書に、まだたどったことのないリンクの記述があれば（Ｓ９０４）、そのリンク先の文書が関連文書リストに含める条件に合うを判断する（Ｓ９０５）。条件に合わなければ現在読んでいる文書に、他にたどったことのないリンクの記述があるか調べる。条件に合えば関連文書一覧に加え（Ｓ９０６）、そのリンク先を読み（Ｓ９０７）、その文書からさらにリンクされている文書を調べる。現在読んでいる文書にたどったことのないリンクの記述がなければ、現在読んでいる文書が先頭文書でなければ（Ｓ９０８）、現在の文書をリンクしているリンク元に戻る（Ｓ９０９）。現在読んでいる文書が先頭文書なら（Ｓ９０８）、さらに手動で関連文書リストに含める、あるいは除外する文書があれば（Ｓ９１０）、手動での作業を行い（Ｓ９１１）、なければ処理は終了となる。 Next, the first document is read (S902), and the read structured document is analyzed (S903). If there is a description of a link that has not been traced in the document that is currently being read (S904), it is determined whether the linked document meets the conditions to be included in the related document list (S905). If it doesn't meet the requirements, check the document you are reading for a link description you have never followed. If the condition is met, in addition to the related document list (S906), the link destination is read (S907), and the document further linked from the document is examined. If there is no description of a link that has not been traced to the document currently being read, if the currently read document is not the first document (S908), the process returns to the link source linking the current document (S909). If the currently read document is the first document (S908), and if there is a document to be manually included in or excluded from the related document list (S910), the manual operation is performed (S911). If not, the process ends. .

Ｓ９０１からＳ９０９までの過程によりツリー構造を持つ構造化文書群のうち、関連文書リストに自動的に含めるべき文書をすべて加えることができる。Ｓ９１０とＳ９１１の過程によって、関連文書リストに含める条件とは例外的に含める、または除外することができる。 Through the processes from S901 to S909, all the documents that should be automatically included in the related document list can be added from the structured document group having a tree structure. Through the processes of S910 and S911, the conditions included in the related document list can be included or excluded as exceptions.

本実施例では自動的な方法と手動の方法を組み合わせているが、完全に自動化する、あるいは手動で行うことも可能である。 In the present embodiment, an automatic method and a manual method are combined, but it is possible to completely automate or manually.

コンピュータ装置１０１は図３の文書をファイルサーバ１０３に保存しておくものとする。このファイルサーバの構成を図４に示す。 Assume that the computer apparatus 101 stores the document in FIG. 3 in the file server 103. The configuration of this file server is shown in FIG.

同図において、ＣＰＵ４０１は、システム制御部であり、装置全体を制御する。ＲＯＭ４０２は、ＣＰＵの制御プログラムや各種固定データを格納するものである。ＲＡＭ４０３は、ＳＲＡＭ、ＤＲＡＭ等で構成され、プログラム制御変数等を格納するものである。また、各種設定パラメータ、各種ワーク用バッファもＲＡＭ４０３に格納されるものである。記憶部４０４はハードディスク等で構成され、文書や画像などのファイルデータを格納するためのものである。操作パネル４０５は、キーボード、タッチパネル等で構成され、オペレータが各種入力操作を行うためのものである。表示部４０６は、ＬＣＤ、ＬＥＤ等でオペレータに表示通知するためのものである。ＬＡＮｉ／ｆ４０７はＬＡＮ回線４０８に接続するためのインターフェイスである。 In the figure, a CPU 401 is a system control unit and controls the entire apparatus. The ROM 402 stores a CPU control program and various fixed data. The RAM 403 is composed of SRAM, DRAM, and the like, and stores program control variables and the like. Various setting parameters and various work buffers are also stored in the RAM 403. The storage unit 404 is composed of a hard disk or the like, and stores file data such as documents and images. The operation panel 405 includes a keyboard, a touch panel, and the like, and is used by an operator to perform various input operations. The display unit 406 is used to notify the operator of display using an LCD, LED, or the like. A LAN i / f 407 is an interface for connecting to a LAN line 408.

次いで、コンピュータ装置１０４が図３の文書を印刷する手順を説明する。コンピュータ装置１０２はファイルサーバ１０３からこの文書のトップページであるｉｎｄｅｘ．ｈｔｍｌを読み出し、関連文書リストがあるかを判断する。本例では＜ｃｏｌｌｅｃｔｉｏｎ＞要素の中のそれぞれの＜Ｏｂｊｅｃｔ＞要素がリンク先となっている。そこで「印刷」のメニューを選ぶと図５のようなメニューが表示される。このとき対象となるファイルを既に持っている、あるいは対象となるファイルの情報を持っている場合は、この段階でページ数などの情報を表示することも可能である。「リンク先一覧にあるもの（ａｂｓｔｒａｃｔ）」を選択すると３０１（ｉｎｄｅｘ．ｈｔｍｌ）、３０２（ａｂｏｕｔ．ｈｔｍｌ）、３０４（ｍａｎｕａｌ．ｈｔｍｌ）のみを読み出し、それらをプリンタ１０５に送信する。「関連文書リストにあるもの（ｄｅｔａｉｌ）」を選択した場合は、３０４（ｒｅｆｅｒｅｎｃｅ．ｈｔｍｌ）以外のすべての文書を読み出し、プリンタ１０５に送信する。するとプリンタ１０５は受信したオブジェクトを印刷し、印刷は完了する。もしリンク先の一覧がなければ通常どおりの処理となる。 Next, a procedure for the computer device 104 to print the document in FIG. 3 will be described. The computer apparatus 102 sends an index.index, which is the top page of this document, from the file server 103. Read html and determine whether there is a related document list. In this example, each <Object> element in the <collection> element is a link destination. Therefore, when the “print” menu is selected, a menu as shown in FIG. 5 is displayed. At this time, if the target file already exists, or if the target file information is present, information such as the number of pages can be displayed at this stage. When “link list (abstract)” is selected, only 301 (index.html), 302 (about.html), and 304 (manual.html) are read and transmitted to the printer 105. If “what is in the related document list (detail)” is selected, all documents other than 304 (reference.html) are read and transmitted to the printer 105. Then, the printer 105 prints the received object, and printing is completed. If there is no list of link destinations, the process is as usual.

ここではコンピュータ装置１０４がリンク先の一覧に含まれるオブジェクトを読み出したが、トップページのみをプリンタ１０５に送信し、プリンタ１０５がリンク先の一覧にあるオブジェクトを読み出す、という方法も可能である。 Here, the computer device 104 reads out the objects included in the link destination list, but it is also possible to send only the top page to the printer 105 and the printer 105 reads out the objects in the link destination list.

図８は本発明の他の実施例が適用される構造化文書処理システムの概略構成図である。本実施例は、相対リンクと絶対リンクが含まれている文書を、別の機器に送信する際に、リンク情報を適切に変換するものである。 FIG. 8 is a schematic configuration diagram of a structured document processing system to which another embodiment of the present invention is applied. In the present embodiment, when a document including a relative link and an absolute link is transmitted to another device, the link information is appropriately converted.

本システムにおいては、コンピュータ装置装置８０４とファイルサーバ８０１と８０２がネットワーク８０３を介して接続されている。コンピュータ装置とファイルサーバの構成は前記構成例のものと同様である。 In this system, a computer apparatus 804 and file servers 801 and 802 are connected via a network 803. The configurations of the computer device and the file server are the same as those in the above configuration example.

このファイルサーバ８０１に図７に示す構造化文書が格納されている。この文書からリンクされたファイルのうち、７０３（ｐｉｃ．ｊｐｇ）はリンク先の一覧に含まれてなく、また７０４（ｄｏｃ．ｈｔｍｌ）は関連文書リストに含まれているが、別のファイサーバ８０２（ｂａｒ．ｏｒｇ）に格納されている。先頭文書を格納しているファイルサーバは８０１（ｆｏｏ．ｃｏｍ）である。 The file server 801 stores the structured document shown in FIG. Of the files linked from this document, 703 (pic.jpg) is not included in the linked list, and 704 (doc.html) is included in the related document list, but another file server 802 is included. (Bar.org). The file server storing the first document is 801 (foo.com).

ここでファイルサーバ８０１からコンピュータ装置８０４に図８の文書を送信する。その際、関連文書リストにあるファイルをそのまま転送すると、コンピュータ装置８０４からその文書を読む際に、リンクをたどれない、もしくは不必要なファイルの受信を行うことがため、受信時に適切な変換が必要になる。 Here, the document in FIG. 8 is transmitted from the file server 801 to the computer device 804. At that time, if a file in the related document list is transferred as it is, when reading the document from the computer device 804, a link is not followed or an unnecessary file is received. I need it.

リンクの記述の変換手順を図１０に示す流れ図に沿って説明する。まず先頭文書を受信するし（Ｓ１００１）、関連文書リストがなければ終了し（Ｓ１００２）、あれば図６のウィンドウを出し、処理範囲を入力する（Ｓ１００３）。先頭ページのみの場合は処理を終了する。関連文書リストにあるものを含める場合は、関連文書リストに含まれるファイルをすべて受信し（Ｓ１００４）、先頭文書を読み込み（Ｓ１００５）、文書の構造を解析する（Ｓ１００６）。現在読んでいる文書にたどったことのないリンクの記述があれば（Ｓ１００７）、リンク先が関連文書リストに含まれているか確認する（Ｓ１００８）。含まれている場合は、そのリンクの記述が絶対パスであるか確認し（Ｓ１００９）、絶対パスになっている場合はその記述を相対パスに変換し（Ｓ１０１０）、リンク先の文書を読み込む（Ｓ１０１１）。リンク先文書が関連文書リストに含まれていない場合は、その記述が相対パスであるか確認し（Ｓ１０１２）、相対パスであればリンクの記述を絶対パスに変換する（Ｓ１０１３）。現在読んでいる文書にたどったことのないリンクの記述が存在しなければ、現在読んでいる文書が先頭文書か確認する（Ｓ１０１４）。先頭文書でなければリンク元に戻り、まだたどったことのないリンクの記述を調べ、先頭文書であれば処理を終了する。 The procedure for converting the link description will be described with reference to the flowchart shown in FIG. First, the first document is received (S1001), and if there is no related document list, the process ends (S1002). If there is, the window shown in FIG. 6 is displayed and the processing range is input (S1003). If there is only the first page, the process ends. When including the files in the related document list, all the files included in the related document list are received (S1004), the first document is read (S1005), and the structure of the document is analyzed (S1006). If there is a description of a link that has not been traced to the currently read document (S1007), it is confirmed whether the link destination is included in the related document list (S1008). If it is included, it is confirmed whether the description of the link is an absolute path (S1009), and if it is an absolute path, the description is converted to a relative path (S1010), and the linked document is read (S1010). S1011). If the linked document is not included in the related document list, it is checked whether the description is a relative path (S1012). If the link destination document is a relative path, the link description is converted to an absolute path (S1013). If there is no description of the link that has never been traced in the currently read document, it is confirmed whether the currently read document is the first document (S1014). If it is not the first document, the process returns to the link source, and the description of the link that has not been followed is checked. If it is the first document, the process is terminated.

本実施例においては、コンピュータ装置８０４は７０２（ｆｉｇ．ｓｖｇ）の＜ａｘｌｉｎｋ：ｈｒｅｆ＝“ｐｉｃ．ｊｐｇ”＞を＜ａｘｌｉｎｋ：ｈｒｅｆ＝ｈｔｔｐ：／／ｆｏｏ．ｃｏｍ／ｐｉｃ．ｊｐｇ＞に、７０１（ｔｏｐ．ｘｍｌ）の＜ａｈｒｅｆ＝ｈｔｔｐ：／／ｂａｒ．ｏｒｇ／ｄｏｃ．ｈｔｍｌ＞を＜ａｈｒｅｆ＝“ｄｏｃ．ｈｔｍｌ”＞に置き換える。 In this embodiment, the computer apparatus 804 sets <a xlink:href=“pic.jpg”> of 702 (FIG. SVG) to <a xlink: href = http: // foo. com / pic. jpg>, 701 (top.xml) <a href = http: // bar. org / doc. Replace html> with <a href=“doc.html”>.

本実施例では受信者がリンクの記述の変換を行ったが、送信者が行うことも可能である。 In this embodiment, the receiver converts the link description, but the sender can also perform the conversion.

このようにリンクの記述を適切に変換することで、文書を別の機器に転送した後もリンク先を正確に参照し、また不必要な受信を行うことなく文書を閲覧することができる。 By appropriately converting the link description in this way, it is possible to accurately refer to the link destination even after the document is transferred to another device, and to browse the document without unnecessary reception.

本発明の構造化文書処理装置に係るシステム構成を示す図。The figure which shows the system configuration | structure which concerns on the structured document processing apparatus of this invention. 本発明の構造化文書処理を行うコンピュータ装置のハードウェア構成を示すブロック図。The block diagram which shows the hardware constitutions of the computer apparatus which performs the structured document process of this invention. 本発明の構造化文書処理装置によって生成された構造化文書の例を示す図。The figure which shows the example of the structured document produced | generated by the structured document processing apparatus of this invention. ファイルサーバの構成を示すブロック図。The block diagram which shows the structure of a file server. 印刷処理の範囲を指定を行うウィンドウの図。The figure of the window which designates the range of print processing. 構造化文書作成時にリンク関係を入力する画面の例を示す図。The figure which shows the example of the screen which inputs link relation at the time of structured document preparation. 本発明のリンク変換を説明するための構造化文書概念図。The structured document conceptual diagram for demonstrating the link conversion of this invention. 本発明のリンク変換機能を有する構造化文書処理装置に係るシステム構成を示す図。The figure which shows the system configuration | structure which concerns on the structured document processing apparatus which has a link conversion function of this invention. 本発明の関連文書一覧格納手段を説明するための流れ図。The flowchart for demonstrating the related document list storage means of this invention. 本発明のリンクの記述の変換を説明するための流れ図。The flowchart for demonstrating conversion of the description of the link of this invention. 従来技術であるＨＴＭＬのＬｉｎｋタグを使用例を示す図。The figure which shows the usage example of the Link tag of HTML which is a prior art.

Claims

Link information description means for describing document data related to document data as link information, and for one related document group, a list of document data associated with the first document and the first document is stored in the first document as a related document list. A related document list storage means, a structured document creation device, a document analysis means for analyzing a related document list stored in the first document of one related document group, and processing of the related document group A structured document processing system comprising a structured document processing apparatus having a document processing means for performing only the documents described in the related document list of the related document group.

2. The structured document processing system according to claim 1, wherein the related document list storage means is limited to only requested documents.

2. The structured document according to claim 1, wherein said related document list storage means selects necessary documents automatically or semi-automatically from a related document group and stores them in a related document list with a certain criterion. Document processing system.

2. The structure according to claim 1, wherein the related document list storage unit includes a document associated with the first document and a document at a lower hierarchy than the related document in the related document list of the first document. Document processing system.

2. The structured document processing system displays detailed information such as the size and the number of pages of an entire related document group when processing related documents included in a related document list. Structured document processing system.

The structured document processing system is characterized in that when a description including a link destination is a description by an absolute path when transmitting including data in a related document list, the description is rewritten to a description by a relative path. The structured document processing system according to claim 1.

When the structured document processing system transmits the data including the data in the link destination list, the description indicating the link destination is a relative path and not included in the link destination list. The structured document processing system according to claim 1, wherein the structured document processing system is rewritten to a description by

The structured document processing system according to claim 1, wherein the structured document processing system is capable of describing a plurality of related document lists in accordance with uses and selecting a processing range when processing.