JP7000709B2

JP7000709B2 - Information processing equipment and programs

Info

Publication number: JP7000709B2
Application number: JP2017114415A
Authority: JP
Inventors: 季史山田; 隆直佐々木
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2017-06-09
Filing date: 2017-06-09
Publication date: 2022-01-19
Anticipated expiration: 2037-06-09
Also published as: JP2018206303A

Description

本発明は、情報処理装置及びプログラムに関する。 The present invention relates to an information processing apparatus and a program.

今日、様々な情報がウェブページの形態でネットワーク上に公開されている。ウェブページは、ＨＴＭＬ（HyperText Markup Language）文書とも呼ばれ、データ記述言語の一例であるＨＴＭＬにより記述されている。
ＨＴＭＬ文書を直接生成することも可能であるが、多くの場合、ＸＭＬ（Extensible Markup Language）文書を一旦生成し、その後、スタイルシートを用いてＸＭＬ文書をＨＴＭＬ文書に変換する手法が用いられる。その理由は、文書とその表示の方法とを分離できるためである。 Today, various information is published on the network in the form of web pages. Web pages are also called HTML (HyperText Markup Language) documents and are described by HTML, which is an example of a data description language.
Although it is possible to directly generate an HTML document, in many cases, a method of generating an XML (Extensible Markup Language) document once and then converting the XML document into an HTML document using a style sheet is used. The reason is that the document and its display method can be separated.

ただし、ＸＭＬ文書からＨＴＭＬ文書への変換には多くのリソースを必要とする。このため、例えば特許文献１には、大容量のＸＭＬ文書を複数の小容量のＸＭＬ文書に機械的に分割することで、各文書の変換処理に要するリソースの軽減を図る技術が記載されている。 However, conversion from an XML document to an HTML document requires a lot of resources. Therefore, for example, Patent Document 1 describes a technique for reducing resources required for conversion processing of each document by mechanically dividing a large-capacity XML document into a plurality of small-capacity XML documents. ..

特開２００７－１７９４９２号公報Japanese Unexamined Patent Publication No. 2007-179492

ところが、予め定めた行数単位で機械的にＸＭＬ文書を小容量のＸＭＬ文書に分割する手法では、変換されるデータ量自体は不変であり、結局のところ変換に要する時間の短縮にはつながらない。 However, in the method of mechanically dividing an XML document into small-capacity XML documents in units of a predetermined number of lines, the amount of data to be converted does not change, and the time required for conversion is not shortened after all.

本発明は、変換後の構造化文書に与える影響の有無とは無関係に変換元の構造化文書を変換対象とする場合に比して、変換に要する時間の短縮を目的とする。 An object of the present invention is to reduce the time required for conversion as compared with the case where the converted structured document is the conversion target regardless of whether or not the converted structured document is affected.

請求項１に記載の発明は、コンピュータに、変換元である第１の構造化文書を構成する複数の部分データであって、当該部分データの表現形式の変換に使用する規則で参照されない情報を含まない中間文書を記憶部から読み出し、当該中間文書を前記規則に基づいて、それぞれ第２の構造化文書に変換する機能と、前記第２の構造化文書への変換に先立って、当該第２の構造化文書への今回の変換が２回目以降にあたる前記中間文書について第１のハッシュ値を計算するとともに、名称が共通する中間文書についての前回の変換時に計算された第２のハッシュ値を前記記憶部から読み出し、当該第１のハッシュ値と当該第２のハッシュ値を比較する機能と、前記第１のハッシュ値が前記第２のハッシュ値と相違する場合、当該第１のハッシュ値に対応する前記中間文書を新たな第２の構造化文書に変換し、当該新たな第２の構造化文書により、前記記憶部に記憶されている対応する既存の第２の構造化文書を差し替える機能と、を実行させるためのプログラムである。
請求項２に記載の発明は、前記第１のハッシュ値を計算する度に、算出された当該第１のハッシュ値と当該第１のハッシュ値に対応する前記第２の構造化文書を特定する情報とを紐付けたリストを生成して前記記憶部に記憶する機能を更に有し、前記リストに記憶された前記第１のハッシュ値は、前記中間文書の次回の変換時に、前記第２のハッシュ値として使用される、請求項１に記載のプログラムである。
請求項３に記載の発明は、新たな中間文書に対応する名称が、既存の前記リストに存在しない場合、当該新たな中間文書を前記第２の構造化文書に変換すると共に、当該新たな中間文書について計算された前記第１のハッシュ値と当該第２の構造化文書を特定する情報を当該リストに追加する機能を更に有する、請求項２に記載のプログラムである。
請求項４に記載の発明は、既存の前記リストに存在する名称が、新たな中間文書について生成される新たなリストに存在しない場合、当該名称を削除リストに登録すると共に、当該名称に対応する前記第２の構造化文書を削除する機能を更に有する、請求項２に記載のプログラムである。
請求項５に記載の発明は、前記規則で参照される情報を、前記第１の構造化文書から抽出して前記中間文書を生成する機能を更に有する、請求項１に記載のプログラムである。
請求項６に記載の発明は、前記中間文書は、前記規則を使用して変換される前記第２の構造化文書を単位に生成される、請求項５に記載のプログラムである。
請求項７に記載の発明は、変換元である第１の構造化文書を構成する複数の部分データであって、当該部分データの表現形式の変換に使用する規則で参照されない情報を含まない中間文書を記憶部から読み出し、当該中間文書を前記規則に基づいて、それぞれ第２の構造化文書に変換する変換部と、前記第２の構造化文書への変換に先立って、当該第２の構造化文書への今回の変換が２回目以降にあたる前記中間文書について第１のハッシュ値を計算するとともに、名称が共通する中間文書についての前回の変換時に計算された第２のハッシュ値を前記記憶部から読み出し、当該第１のハッシュ値と当該第２のハッシュ値を比較する比較部と、前記第１のハッシュ値が前記第２のハッシュ値と相違する場合、当該第１のハッシュ値に対応する前記中間文書を新たな第２の構造化文書に変換し、当該新たな第２の構造化文書により、前記記憶部に記憶されている対応する既存の第２の構造化文書を差し替える差替部と、を有する情報処理装置である。 The invention according to claim 1 presents to a computer information that is a plurality of partial data constituting the first structured document that is a conversion source and is not referred to in the rules used for conversion of the representation format of the partial data. A function of reading an intermediate document that is not included from the storage unit and converting the intermediate document into a second structured document based on the above rule, and a second function prior to the conversion into the second structured document. The first hash value is calculated for the intermediate document for which the current conversion to the structured document of is the second and subsequent times, and the second hash value calculated at the time of the previous conversion for the intermediate document having the same name is used as described above. The function of reading from the storage unit and comparing the first hash value with the second hash value, and when the first hash value is different from the second hash value, corresponds to the first hash value. With the function of converting the intermediate document to a new second structured document and replacing the corresponding existing second structured document stored in the storage unit with the new second structured document. , Is a program to execute.
The invention according to claim 2 specifies the calculated first hash value and the second structured document corresponding to the calculated first hash value each time the first hash value is calculated. It further has a function of generating a list associated with information and storing it in the storage unit, and the first hash value stored in the list is used for the second conversion of the intermediate document at the next conversion. The program according to claim 1 , which is used as a hash value .
The invention according to claim 3 converts the new intermediate document into the second structured document and the new intermediate document when the name corresponding to the new intermediate document does not exist in the existing list . The program according to claim 2, further comprising a function of adding the first hash value calculated for the document and the information identifying the second structured document to the list.
The invention according to claim 4 registers the name in the deletion list and corresponds to the name when the name existing in the existing list does not exist in the new list generated for the new intermediate document. The program according to claim 2 , further having a function of deleting the second structured document.
The invention according to claim 5 is the program according to claim 1, further comprising a function of extracting the information referred to by the rule from the first structured document to generate the intermediate document.
The invention according to claim 6 is the program according to claim 5, wherein the intermediate document is generated in units of the second structured document converted using the rule.
The invention according to claim 7 is an intermediate piece of data that constitutes a first structured document that is a conversion source and does not include information that is not referred to in the rules used for conversion of the representation format of the partial data. A conversion unit that reads a document from a storage unit and converts the intermediate document into a second structured document based on the above rules, and a second structure prior to conversion into the second structured document. The first hash value is calculated for the intermediate document that is converted to the converted document for the second time or later, and the second hash value calculated at the time of the previous conversion for the intermediate document having the same name is stored in the storage unit. A comparison unit that compares the first hash value with the second hash value, and when the first hash value is different from the second hash value, corresponds to the first hash value. A replacement unit that converts the intermediate document into a new second structured document and replaces the corresponding existing second structured document stored in the storage unit with the new second structured document. It is an information processing device having.

請求項１記載の発明によれば、第２の構造化文書に与える影響の有無とは無関係に第１の構造化文書を変換対象とする場合に比して、変換に要する時間を短縮できる。
請求項２記載の発明によれば、変換が必要な第１の構造化文書と変換が必要でない第１の構造化文書を区別できる。
請求項３記載の発明によれば、第２の構造化文書に変更が反映されるまでに要する時間を短縮できる。
請求項４記載の発明によれば、第２の構造化文書に変更が反映されるまでに要する時間を短縮できる。
請求項５記載の発明によれば、中間文書の生成を自動化でき、第２の構造化文書に変更が反映されるまでに要する総時間を短縮できる。
請求項６記載の発明によれば、中間文書の生成を自動化でき、第２の構造化文書に変更が反映されるまでに要する総時間を短縮できる。
請求項７記載の発明によれば、第２の構造化文書に与える影響の有無とは無関係に第１の構造化文書を変換対象とする場合に比して、変換に要する時間を短縮できる。 According to the first aspect of the invention, the time required for conversion can be shortened as compared with the case where the first structured document is the conversion target regardless of the presence or absence of the influence on the second structured document.
According to the invention of claim 2, it is possible to distinguish between a first structured document that requires conversion and a first structured document that does not require conversion.
According to the third aspect of the invention, the time required for the change to be reflected in the second structured document can be shortened.
According to the invention of claim 4, the time required for the change to be reflected in the second structured document can be shortened.
According to the invention of claim 5, the generation of the intermediate document can be automated, and the total time required for the change to be reflected in the second structured document can be shortened.
According to the invention of claim 6, the generation of the intermediate document can be automated, and the total time required for the change to be reflected in the second structured document can be shortened.
According to the invention of claim 7, the time required for conversion can be shortened as compared with the case where the first structured document is the conversion target regardless of the presence or absence of the influence on the second structured document.

本実施の形態に係るウェブシステムの構成例を説明する図である。It is a figure explaining the configuration example of the web system which concerns on this embodiment. 本実施の形態に係る情報処理装置のハードウェア構成の例を説明する図である。It is a figure explaining the example of the hardware composition of the information processing apparatus which concerns on this embodiment. 実施の形態１に係る情報処理装置のソフトウェア構成の例を説明する図である。It is a figure explaining the example of the software structure of the information processing apparatus which concerns on Embodiment 1. FIG. 変更有無判断部による処理のイメージを説明する図である。It is a figure explaining the image of the process by the change presence / absence determination unit. スタイルシートの構成例を説明する図である。It is a figure explaining the structural example of a style sheet. 端末装置のブラウザ画面に表示される本文ＨＴＭＬ文書、目次ＨＴＭＬ文書、図ＨＴＭＬ文書を例示する図である。It is a figure exemplifying the text HTML document, the table of contents HTML document, and the figure HTML document displayed on the browser screen of a terminal apparatus. 文書処理装置を構成する信号処理部が実行する処理動作の例を説明するフローチャートである。It is a flowchart explaining the example of the processing operation executed by the signal processing unit which constitutes a document processing apparatus. 全量変換の場合に実行される処理の流れを概略的に示す図である。It is a figure which shows the flow of the process executed in the case of the total amount conversion. ２回目以降のＨＴＭＬ変換の場合に実行される処理の流れを概略的に示す図である。It is a figure which shows the flow of the process executed in the case of the HTML conversion for the second and subsequent times. 実施の形態２に係る情報処理装置のソフトウェア構成の例を説明する図である。It is a figure explaining the example of the software structure of the information processing apparatus which concerns on Embodiment 2. FIG. 文書処理装置を構成する信号処理部が実行する処理動作の例を説明するフローチャートである。It is a flowchart explaining the example of the processing operation executed by the signal processing unit which constitutes a document processing apparatus. 変換データ判定部によるパス式の抽出処理を説明する図である。It is a figure explaining the extraction process of the path type by the conversion data determination unit. 中間ＸＭＬ文書生成部による中間ＸＭＬ文書の生成処理を説明する図である。It is a figure explaining the generation process of the intermediate XML document by the intermediate XML document generation part. スタイルシートの具体例と変換パス表に保存される値との関係を説明する図である。It is a figure explaining the relationship between the concrete example of a style sheet and the value stored in the conversion path table. ＸＭＬ文書と生成された中間ＸＭＬ文書の関係を説明する図である。It is a figure explaining the relationship between the XML document and the generated intermediate XML document. 図１５に例示したＸＭＬ文書の一部分にＨＴＭＬ文書では利用されない情報が追加された場合の中間ＸＭＬ文書の生成例を説明する図である。FIG. 15 is a diagram illustrating an example of generating an intermediate XML document when information not used in an HTML document is added to a part of the XML document illustrated in FIG. 15. ＨＴＭＬ文書に新たな情報を付加するような修正がＸＭＬ文書に加えられた場合における新旧２つの中間ＸＭＬ文書の具体例を示す図である。It is a figure which shows the specific example of the intermediate XML document between old and new when the modification which adds new information to the HTML document is made to the XML document. 新しいＨＴＭＬ文書のための記述がＸＭＬ文書に追加されている場合を説明する図である。It is a figure explaining the case where the description for a new HTML document is added to the XML document. 旧ＨＴＭＬ文書に対応する記述がＸＭＬ文書から削除された場合を説明する図である。It is a figure explaining the case where the description corresponding to the old HTML document is deleted from the XML document.

以下、添付図面を参照して、実施の形態について詳細に説明する。 Hereinafter, embodiments will be described in detail with reference to the accompanying drawings.

＜実施の形態１＞
＜システム全体の構成＞
図１は、本実施の形態に係るウェブシステム１の構成例を説明する図である。
ウェブシステム１は、ネットワーク上にＨＴＭＬ文書を閲覧可能に公開するサーバシステム１０と、ブラウザプログラム（以下「ブラウザ」という）が実行される端末装置２０とで構成される。本実施の形態におけるネットワークは、ローカルエリアネットワークでもインターネットでも構わない。
端末装置２０は、ユーザによって操作され、ネットワーク上に公開されている情報の閲覧に使用される。端末装置２０は、ネットワーク通信機能を有するコンピュータであり、据え置き型か携帯型かは問わない。 <Embodiment 1>
<Overall system configuration>
FIG. 1 is a diagram illustrating a configuration example of the web system 1 according to the present embodiment.
The web system 1 is composed of a server system 10 that publishes HTML documents on a network so that they can be viewed, and a terminal device 20 that executes a browser program (hereinafter referred to as "browser"). The network in this embodiment may be a local area network or the Internet.
The terminal device 20 is operated by the user and is used for browsing information published on the network. The terminal device 20 is a computer having a network communication function, and may be a stationary type or a portable type.

サーバシステム１０は、ＸＭＬ文書からＨＴＭＬ文書を生成する情報処理装置１００と、端末装置２０からアクセスがあったＵＲＬ（Uniform Resource Locator）に保存されているＨＴＭＬ文書を端末装置２０に送信するウェブサーバ２００とを有し、これらはローカルネットワーク１１を介して互いに接続されている。もっとも、情報処理装置１００とウェブサーバ２００がインターネットで接続されていてもよい。
ＸＭＬ文書は第１の構造化文書の一例であり、ＨＴＭＬ文書は第２の構造化文書の一例である。 The server system 10 is an information processing device 100 that generates an HTML document from an XML document, and a web server 200 that transmits an HTML document stored in a URL (Uniform Resource Locator) accessed from the terminal device 20 to the terminal device 20. And these are connected to each other via the local network 11. However, the information processing apparatus 100 and the web server 200 may be connected via the Internet.
The XML document is an example of a first structured document, and the HTML document is an example of a second structured document.

本実施の形態に係るウェブサーバ２００には、情報処理装置１００で生成されたＨＴＭＬ文書が特定のＵＲＬに紐付けられて保存されている。ＨＴＭＬ文書は、ウェブサーバ２００に内蔵の記憶装置に記憶されていても、外部の記憶装置に記憶されていてもよい。すなわち、ウェブサーバ２００と記憶装置は一体構成でも、別構成でもよい。 In the web server 200 according to the present embodiment, the HTML document generated by the information processing apparatus 100 is associated with a specific URL and stored. The HTML document may be stored in a storage device built in the web server 200, or may be stored in an external storage device. That is, the web server 200 and the storage device may have an integrated configuration or a separate configuration.

＜情報処理装置の構成＞
図２は、本実施の形態に係る情報処理装置１００のハードウェア構成の例を説明する図である。
情報処理装置１００は、ＸＭＬ文書をＨＴＭＬ文書に変換する各種の処理動作を実行する信号処理部１０１と、各種のプログラムやデータが記憶される記憶装置１０２と、作業者による操作の受け付けに使用される操作受付装置１０３と、外部装置との間でデータを受け渡しする通信装置１０４とを有している。 <Configuration of information processing device>
FIG. 2 is a diagram illustrating an example of a hardware configuration of the information processing apparatus 100 according to the present embodiment.
The information processing device 100 is used for a signal processing unit 101 that executes various processing operations for converting an XML document into an HTML document, a storage device 102 that stores various programs and data, and reception of operations by an operator. It has an operation receiving device 103 and a communication device 104 for transferring data to and from an external device.

信号処理部１０１は、プログラムを実行するＣＰＵ（Central Processing Unit）１０１Ａと、起動プログラム等が保存されるＲＯＭ（Read Only Memory）１０１Ｂと、作業領域として使用されるＲＡＭ（Random Access Memory）１０１Ｃとを有している。
なお、記憶装置１０２は、ハードディスク装置等で構成される。記憶装置１０２は、情報処理装置１００に内蔵の装置に限らず、通信装置１０４を通じて接続される外部装置でもよい。
操作受付装置１０３は、例えばキーボード、マウスである。
通信装置１０４は、例えばＵＳＢ（Universal Serial Bus）インターフェース、ネットワークインターフェースである。 The signal processing unit 101 has a CPU (Central Processing Unit) 101A for executing a program, a ROM (Read Only Memory) 101B for storing a startup program, and a RAM (Random Access Memory) 101C used as a work area. Have.
The storage device 102 is composed of a hard disk device or the like. The storage device 102 is not limited to the device built in the information processing device 100, but may be an external device connected through the communication device 104.
The operation receiving device 103 is, for example, a keyboard or a mouse.
The communication device 104 is, for example, a USB (Universal Serial Bus) interface or a network interface.

図３は、実施の形態１に係る情報処理装置１００のソフトウェア構成の例を説明する図である。
図３に示すソフトウェア構成は、信号処理部１０１によるプログラムの実行を通じて実現される。
本実施の形態における信号処理部１０１は、記憶装置１０２から中間ＸＭＬ文書１２１を読み出す文書読出し部１１０と、中間ＸＭＬ文書１２１毎に変更の有無を判断する変更有無判断部１１１と、スタイルシート１２３を用いて中間ＸＭＬ文書１２１をＨＴＭＬ文書に変換する文書変換部１１２と、変換後のＨＴＭＬ文書を既存のＨＴＭＬ文書にマージして新しいＨＴＭＬ文書を生成するデータマージ部１１３として機能する。 FIG. 3 is a diagram illustrating an example of a software configuration of the information processing apparatus 100 according to the first embodiment.
The software configuration shown in FIG. 3 is realized through the execution of a program by the signal processing unit 101.
The signal processing unit 101 in the present embodiment includes a document reading unit 110 that reads the intermediate XML document 121 from the storage device 102, a change presence / absence determination unit 111 that determines whether or not there is a change for each intermediate XML document 121, and a style sheet 123. It functions as a document conversion unit 112 that converts the intermediate XML document 121 into an HTML document and a data merging unit 113 that merges the converted HTML document into an existing HTML document to generate a new HTML document.

中間ＸＭＬ文書１２１は、ＨＴＭＬ文書の生成に必要な記述をＸＭＬ文書から抽出した部分データであり、表示単位であるＨＴＭＬ文書別に用意される。
従って、ＨＴＭＬ文書が複数であれば（換言すると、スタイルシート１２３が複数であれば）、中間ＸＭＬ文書１２１も複数である。ここでの中間ＸＭＬ文書１２１は、中間文書の一例である。
本実施の形態の場合、中間ＸＭＬ文書１２１は既に生成され、記憶装置１０２に記憶されている。もっとも、中間ＸＭＬ文書１２１は、情報処理装置１００を用いて生成しても良いし、他の装置を用いて生成してもよい。
中間ＸＭＬ文書１２１は、ＸＭＬ文書に変更が加えられるたびに生成される。 The intermediate XML document 121 is partial data obtained by extracting the description necessary for generating the HTML document from the XML document, and is prepared for each HTML document as a display unit.
Therefore, if there are a plurality of HTML documents (in other words, if there are a plurality of style sheets 123), there are also a plurality of intermediate XML documents 121. The intermediate XML document 121 here is an example of an intermediate document.
In the case of this embodiment, the intermediate XML document 121 has already been generated and stored in the storage device 102. However, the intermediate XML document 121 may be generated by using the information processing apparatus 100 or may be generated by using another apparatus.
The intermediate XML document 121 is generated each time a change is made to the XML document.

以下では、必要に応じ、変更が加えられる前のＸＭＬ文書を「旧ＸＭＬ文書」とよび、変更が加えられた後のＸＭＬ文書を「新ＸＭＬ文書」とよぶことがある。
ここで、新旧の違いは、特定の変更に対する前後関係を表しているのにすぎない。従って、ある変更との関係では新ＸＭＬ文書であったしても、追加の変更が加えられた場合には、追加の変更との関係では旧ＸＭＬ文書として扱われる。
また、旧ＸＭＬ文書から生成された中間ＸＭＬ文書１２１を「旧中間ＸＭＬ文書」とよび、新ＸＭＬ文書から生成された中間ＸＭＬ文書１２１を「新中間ＸＭＬ文書」とよぶことがある。
中間ＸＭＬ文書１２１は、作業者が手作業で生成してもよいし、後述する実施の形態２で説明するようにプログラム処理を通じて生成してもよい。 In the following, if necessary, the XML document before the change may be referred to as the "old XML document", and the XML document after the change may be referred to as the "new XML document".
Here, the difference between old and new only represents the context of a particular change. Therefore, even if it is a new XML document in relation to a certain change, when an additional change is made, it is treated as an old XML document in relation to the additional change.
Further, the intermediate XML document 121 generated from the old XML document may be referred to as an "old intermediate XML document", and the intermediate XML document 121 generated from the new XML document may be referred to as a "new intermediate XML document".
The intermediate XML document 121 may be generated manually by the operator, or may be generated through a program process as described in the second embodiment described later.

文書読出し部１１０は、変換対象として作業者から指定された新中間ＸＭＬ文書１２１Ｎを記憶装置１０２から読み出して変更有無判断部１１１に出力する機能を提供する。
旧中間ＸＭＬ文書１２１Ｏについて算出されたハッシュ値と、新中間ＸＭＬ文書１２１Ｎについて算出されたハッシュ値は、いずれも記憶装置１０２のハッシュ値リスト１２２に記憶されている。
以下では、旧中間ＸＭＬ文書１２１Ｏについて算出されたハッシュ値を「旧ハッシュ値」といい、新中間ＸＭＬ文書１２１Ｎについて算出されたハッシュ値を「新ハッシュ値」ということがある。
以下では、旧ハッシュ値を記憶するハッシュ値リストを「旧ハッシュ値リスト」といい、新ハッシュ値を記憶するハッシュ値リストを「新ハッシュ値リスト」ということがある。
記憶装置１０２に新ハッシュ値リスト１２２Ｎが存在しない場合、変更有無判断部１１１が新ハッシュ値を計算して記憶装置１０２に記憶する。
ここでの変更有無判断部１１１は、比較部の一例である。 The document reading unit 110 provides a function of reading the new intermediate XML document 121N designated by the operator as a conversion target from the storage device 102 and outputting it to the change presence / absence determining unit 111.
The hash value calculated for the old intermediate XML document 121O and the hash value calculated for the new intermediate XML document 121N are both stored in the hash value list 122 of the storage device 102.
In the following, the hash value calculated for the old intermediate XML document 121O may be referred to as an “old hash value”, and the hash value calculated for the new intermediate XML document 121N may be referred to as a “new hash value”.
In the following, the hash value list that stores the old hash value may be referred to as an "old hash value list", and the hash value list that stores the new hash value may be referred to as a "new hash value list".
If the new hash value list 122N does not exist in the storage device 102, the change presence / absence determination unit 111 calculates the new hash value and stores it in the storage device 102.
The change presence / absence determination unit 111 here is an example of a comparison unit.

変更有無判断部１１１は、変換対象である個々の新中間ＸＭＬ文書１２１Ｎについて算出された新ハッシュ値と、対応関係にある旧中間ＸＭＬ文書１２１Ｏについて記憶されている旧ハッシュ値とを比較する機能を提供する。
ここで、対応関係があるとは個々の中間ＸＭＬ文書１２１に紐付けられたＨＴＭＬ文書の名称が共通することをいう。
このため、変更有無判断部１１１は、まず新中間ＸＭＬ文書１２１Ｎに対応するＨＴＭＬ文書の名称が旧中間ＸＭＬ文書１２１Ｏについて保存されている旧ハッシュ値リスト１２２Ｏに存在するか否かを判断し、存在する場合にハッシュ値の比較処理を実行する。 The change presence / absence determination unit 111 has a function of comparing the new hash value calculated for each new intermediate XML document 121N to be converted with the old hash value stored for the old intermediate XML document 121O having a corresponding relationship. offer.
Here, having a correspondence relationship means that the names of the HTML documents associated with the individual intermediate XML documents 121 are common.
Therefore, the change presence / absence determination unit 111 first determines whether or not the name of the HTML document corresponding to the new intermediate XML document 121N exists in the old hash value list 122O stored for the old intermediate XML document 121O, and exists. When this is done, the hash value comparison process is executed.

例えば新中間ＸＭＬ文書１２１Ｎに対応するＨＴＭＬ文書の名称が旧中間ＸＭＬ文書１２１Ｏについて保存されている旧ハッシュ値リスト１２２Ｏに存在し、かつ、新旧のハッシュ値が一致する場合、変更有無判断部１１１は、新中間ＸＭＬ文書１２１Ｎの内容と旧中間ＸＭＬ文書１２１Ｏの内容とが同じであると判断する。
例えば新中間ＸＭＬ文書１２１Ｎに対応するＨＴＭＬ文書の名称が旧中間ＸＭＬ文書１２１Ｏについて保存されている旧ハッシュ値リスト１２２Ｏに存在しても、新旧のハッシュ値が異なる場合、変更有無判断部１１１は、新中間ＸＭＬ文書１２１Ｎには内容の変更又は修正が含まれると判断する。この場合、変更有無判断部１１１は、変更又は修正が検知された新中間ＸＭＬ文書１２１Ｎを差異リストに登録する。 For example, if the name of the HTML document corresponding to the new intermediate XML document 121N exists in the old hash value list 122O stored for the old intermediate XML document 121O, and the old and new hash values match, the change presence / absence determination unit 111 , It is determined that the content of the new intermediate XML document 121N and the content of the old intermediate XML document 121O are the same.
For example, even if the name of the HTML document corresponding to the new intermediate XML document 121N exists in the old hash value list 122O stored for the old intermediate XML document 121O, if the old and new hash values are different, the change presence / absence determination unit 111 may be used. It is determined that the new intermediate XML document 121N contains changes or amendments to the contents. In this case, the change presence / absence determination unit 111 registers the new intermediate XML document 121N in which the change or modification is detected in the difference list.

例えば新中間ＸＭＬ文書１２１Ｎに対応するＨＴＭＬ文書の名称が旧中間ＸＭＬ文書１２１Ｏについて保存されている旧ハッシュ値リスト１２２Ｏに存在しない場合、変更有無判断部１１１は、処理対象とする新中間ＸＭＬ文書１２１Ｎは新規に生成されたと判断する。この場合も、変更有無判断部１１１は、変更又は修正が検知された新中間ＸＭＬ文書１２１Ｎを差異リストに登録する。
例えば旧中間ＸＭＬ文書１２１Ｏについて保存されている旧ハッシュ値リスト１２２Ｏには存在するＨＴＭＬ文書の名称が、新中間ＸＭＬ文書１２１Ｎについて保存される新ハッシュ値リスト１２２Ｎ（今回の文書変換処理が完了した後は新たな文書変換処理との関係で旧ハッシュ値リスト１２２Ｏとして使用される）に存在しない場合、変更有無判断部１１１は、旧中間ＸＭＬ文書１２１Ｏが削除されたと判断する。この場合、変更有無判断部１１１は、削除されたと判断された旧中間ＸＭＬ文書１２１Ｏを削除リストに登録する。 For example, if the name of the HTML document corresponding to the new intermediate XML document 121N does not exist in the old hash value list 122O stored for the old intermediate XML document 121O, the change presence / absence determination unit 111 performs the new intermediate XML document 121N to be processed. Judges that it was newly generated. In this case as well, the change presence / absence determination unit 111 registers the new intermediate XML document 121N in which the change or modification is detected in the difference list.
For example, the name of the HTML document existing in the old hash value list 122O stored for the old intermediate XML document 121O is the new hash value list 122N stored for the new intermediate XML document 121N (after the current document conversion process is completed). Is not present in the old hash value list 122O in relation to the new document conversion process), the change presence / absence determination unit 111 determines that the old intermediate XML document 121O has been deleted. In this case, the change presence / absence determination unit 111 registers the old intermediate XML document 121O determined to be deleted in the deletion list.

図４は、変更有無判断部１１１による処理のイメージを説明する図である。
図４には、新ＸＭＬ文書から生成された複数の新中間ＸＭＬ文書＃１、＃２、＃３が、いずれもＨＴＭＬ文書＃１、＃２、＃３と対応関係を有することが示されている。
変更有無判断部１１１は、例えばＨＴＭＬ文書＃１に対応する新中間ＸＭＬ文書＃１について計算された新ハッシュ値＃１と、対応する旧中間ＸＭＬ文書＃１について計算された旧ハッシュ値＃１とを比較する。
勿論、変更有無判断部１１１は、新ハッシュ値＃２と旧ハッシュ値＃２とを比較し、新ハッシュ値＃３と旧ハッシュ値＃３とを比較する。 FIG. 4 is a diagram illustrating an image of processing by the change presence / absence determination unit 111.
FIG. 4 shows that a plurality of new XML documents # 1, # 2, and # 3 generated from the new XML document all have a correspondence relationship with HTML documents # 1, # 2, and # 3. There is.
The change presence / absence determination unit 111 has, for example, a new hash value # 1 calculated for the new intermediate XML document # 1 corresponding to the HTML document # 1 and an old hash value # 1 calculated for the corresponding old intermediate XML document # 1. To compare.
Of course, the change presence / absence determination unit 111 compares the new hash value # 2 with the old hash value # 2, and compares the new hash value # 3 with the old hash value # 3.

図３の説明に戻る。
文書変換部１１２は、差異リストに登録されている新中間ＸＭＬ文書１２１Ｎに限り（すなわち、内容の変更又は修正を含む新中間ＸＭＬ文書１２１Ｎと新規に生成された新中間ＸＭＬ文書１２１Ｎに限り）、対応するスタイルシート１２３を用いてＨＴＭＬ文書に変換する機能と、生成されたＨＴＭＬ文書を差分変換データ１２４として記憶装置１０２に保存する機能とを提供する。
ここで、スタイルシート１２３とは、ＸＳＬＴ（Extensible Stylesheet Language Transformations）文書のことであり、ＨＴＭＬ文書の表示形式の制御に使用される。スタイルシート１２３は、表現上の規則の一例である。 Returning to the description of FIG.
The document conversion unit 112 is limited to the new intermediate XML document 121N registered in the difference list (that is, limited to the new intermediate XML document 121N including the change or modification of the contents and the newly generated new intermediate XML document 121N). It provides a function of converting into an HTML document using the corresponding style sheet 123 and a function of storing the generated HTML document as difference conversion data 124 in the storage device 102.
Here, the style sheet 123 is an XSLT (Extensible Stylesheet Language Transformations) document, and is used to control the display format of the HTML document. The style sheet 123 is an example of an expression rule.

図５は、スタイルシート１２３の構成例を説明する図である。スタイルシート１２３は、生成対象であるＨＴＭＬ文書単位で用意されている。
本例の場合、スタイルシート１２３は、本文ＨＴＭＬ文書に対応するスタイルシート（すなわち本文ＨＴＭＬ文書用スタイルシート）１２３Ａと、目次ＨＴＭＬ文書に対応するスタイルシート（すなわち目次ＨＴＭＬ文書用スタイルシート）１２３Ｂと、図ＨＴＭＬ文書に対応するスタイルシート（すなわち図ＨＴＭＬ文書用スタイルシート）１２３Ｃとを含んでいる。 FIG. 5 is a diagram illustrating a configuration example of the style sheet 123. The style sheet 123 is prepared for each HTML document to be generated.
In the case of this example, the style sheet 123 includes a style sheet 123A corresponding to the text HTML document (that is, a style sheet for the text HTML document) 123A, a style sheet corresponding to the table HTML document (that is, a style sheet for the table HTML document) 123B, and the style sheet 123. A style sheet corresponding to the HTML document (that is, a style sheet for the HTML document) 123C is included.

図６は、端末装置２０のブラウザ画面２５に表示される本文ＨＴＭＬ文書、目次ＨＴＭＬ文書、図ＨＴＭＬ文書を例示する図である。
本文ＨＴＭＬ文書の表示ウインドウはブラウザ画面２５の右側に配置され、目次ＨＴＭＬ文書はブラウザ画面２５の左側に配置され、図ＨＴＭＬ文書は本文ＨＴＭＬ文書の表示ウインドウ内の左下隅位置に重ねて配置されている。 FIG. 6 is a diagram illustrating a text HTML document, a table of contents HTML document, and a FIG. HTML document displayed on the browser screen 25 of the terminal device 20.
The display window of the text HTML document is arranged on the right side of the browser screen 25, the table of contents HTML document is arranged on the left side of the browser screen 25, and the figure HTML document is arranged on the lower left corner position in the display window of the text HTML document. There is.

ここで、本文ＨＴＭＬ文書の表示ウインドウには、「１．２画像の挿入」とのタイトルが付された文書が表示されている。
また、本文ＨＴＭＬ文書の左隣に表示される目次ＨＴＭＬ文書のウインドウには、本文ＨＴＭＬ文書には「ご利用上の注意」とのタイトルが付されていること、本文ＨＴＭＬ文書は３つの章で構成されること等が表示されている。
また、本文ＨＴＭＬ文書に重ねて表示される図ＨＴＭＬ文書のウインドウには、本文ＨＴＭＬ文書に埋め込まれたリンク関係を通じて読み出されたサンプル画像が表示されている。 Here, in the display window of the text HTML document, a document with the title "1.2 Inserting an image" is displayed.
In addition, in the window of the table of contents HTML document displayed on the left side of the text HTML document, the title of the text HTML document is "Precautions for use", and the text HTML document has three chapters. It is displayed that it is configured.
Further, in the window of the figure HTML document displayed overlaid on the text HTML document, a sample image read through the link relationship embedded in the text HTML document is displayed.

図３の説明に戻る。
データマージ部１１３は、差分変換データ１２４を前回の変換データ１２５（すなわち、旧中間ＸＭＬ文書１２１Ｏを用いて生成された既存のＨＴＭＬ文書）にマージして今回の変換データ１２６を生成する機能を提供する。
例えばデータマージ部１１３は、新たに作成されたＨＴＭＬ文書を既存のＨＴＭＬ文書に追加し、又は、差し替える。
また例えばデータマージ部１１３は、削除リストに基づいて既存のＨＴＭＬ文書から対応する文書を削除する。
ここでのデータマージ部１１３は差替部の一例である。 Returning to the description of FIG.
The data merging unit 113 provides a function of merging the difference conversion data 124 with the previous conversion data 125 (that is, an existing HTML document generated using the old intermediate XML document 121O) to generate the current conversion data 126. do.
For example, the data merge unit 113 adds or replaces the newly created HTML document to the existing HTML document.
Further, for example, the data merge unit 113 deletes the corresponding document from the existing HTML document based on the deletion list.
The data merge unit 113 here is an example of a replacement unit.

＜処理動作＞
続いて、情報処理装置１００において実行される処理動作について説明する。
図７は、情報処理装置１００を構成する信号処理部１０１が実行する処理動作の例を説明するフローチャートである。
まず、信号処理部１０１は、新中間ＸＭＬ文書１２１Ｎを受け付ける（ステップ１０１）。具体的には、文書読出し部１１０が対象とする新中間ＸＭＬ文書１２１Ｎを記憶装置１０２から読み出す。 <Processing operation>
Subsequently, the processing operation executed by the information processing apparatus 100 will be described.
FIG. 7 is a flowchart illustrating an example of a processing operation executed by the signal processing unit 101 constituting the information processing apparatus 100.
First, the signal processing unit 101 receives the new intermediate XML document 121N (step 101). Specifically, the new intermediate XML document 121N targeted by the document reading unit 110 is read from the storage device 102.

次に、信号処理部１０１は、新中間ＸＭＬ文書１２１ＮのＨＴＭＬ文書への変換が初めてか否かを判定する（ステップ１０２）。この判定は、変更有無判断部１１１が実行する。
例えば変更有無判断部１１１は、受け付けた新中間ＸＭＬ文書１２１Ｎに対応するＨＴＭＬ文書の名称が旧ハッシュ値リスト１２２Ｏに含まれている否かにより初めての変換か否かを判定する。
対応するＨＴＭＬ文書の名称が存在する場合、変更有無判断部１１１は、初めての変換でないと判断する。
一方、対応するＨＴＭＬ文書の名称が存在しない場合、変更有無判断部１１１は、初めての変換であると判断する。
初めての変換であると判定された場合（ステップ１０２で肯定結果が得られた場合）、変更有無判断部１１１は、変換処理の対象になった複数の新中間ＸＭＬ文書１２１Ｎのそれぞれについて新ハッシュ値を計算し、記憶装置１０２の新ハッシュ値リスト１２２Ｎに登録する。 Next, the signal processing unit 101 determines whether or not the conversion of the new intermediate XML document 121N into an HTML document is the first time (step 102). This determination is executed by the change presence / absence determination unit 111.
For example, the change presence / absence determination unit 111 determines whether or not the conversion is the first time depending on whether or not the name of the HTML document corresponding to the received new intermediate XML document 121N is included in the old hash value list 122O.
If the name of the corresponding HTML document exists, the change presence / absence determination unit 111 determines that the conversion is not the first time.
On the other hand, when the name of the corresponding HTML document does not exist, the change presence / absence determination unit 111 determines that the conversion is the first time.
When it is determined that the conversion is the first time (when an affirmative result is obtained in step 102), the change presence / absence determination unit 111 has a new hash value for each of the plurality of new intermediate XML documents 121N targeted for the conversion process. Is calculated and registered in the new hash value list 122N of the storage device 102.

さらに、信号処理部１０１は、複数の新中間ＸＭＬ文書１２１Ｎの全てをそれぞれ対応するスタイルシート１２３を用いてＨＴＭＬ文書に変換する（ステップ１０３）。この処理は、文書変換部１１２が実行する。
本実施の形態では、全ての新中間ＸＭＬ文書１２１ＮをＨＴＭＬ文書に変換することを、全量変換という。
図８は、全量変換の場合に実行される処理の流れを概略的に示す図である。
図８に示すように、全量変換の場合、入力された全ての新中間ＸＭＬ文書１２１Ｎについて新ハッシュ値が計算され、算出された新ハッシュ値が新ハッシュ値リスト１２２Ｎに保存される。その後、全ての新中間ＸＭＬ文書１２１Ｎに対してＨＴＭＬ変換が実行され、それぞれに対応するＨＴＭＬ文書が生成される。 Further, the signal processing unit 101 converts all of the plurality of new intermediate XML documents 121N into HTML documents using the corresponding style sheets 123 (step 103). This process is executed by the document conversion unit 112.
In the present embodiment, the conversion of all the new intermediate XML documents 121N into HTML documents is referred to as total quantity conversion.
FIG. 8 is a diagram schematically showing a flow of processing executed in the case of total amount conversion.
As shown in FIG. 8, in the case of total amount conversion, a new hash value is calculated for all the input new intermediate XML documents 121N, and the calculated new hash value is stored in the new hash value list 122N. After that, HTML conversion is executed for all the new intermediate XML documents 121N, and HTML documents corresponding to each are generated.

初めての変換でないと判定された場合（ステップ１０２で否定結果が得られた場合）、変更有無判断部１１１は、新中間ＸＭＬ文書１２１Ｎのそれぞれについて新ハッシュ値を計算する（ステップ１０４）。
次に、変更有無判断部１１１は、新ハッシュ値と旧ハッシュ値を比較する（ステップ１０５）。
続いて、変更有無判断部１１１は、対応する旧ハッシュ値が無いか否かを判定する（ステップ１０６）。換言すると、変更有無判断部１１１は、新中間ＸＭＬ文書１２１Ｎが対応するＨＴＭＬ文書の名称が旧中間ＸＭＬ文書１２１Ｏに対応する旧ハッシュ値リスト１２２Ｏに存在するか否か判定する。 When it is determined that the conversion is not the first time (when a negative result is obtained in step 102), the change presence / absence determination unit 111 calculates a new hash value for each of the new intermediate XML documents 121N (step 104).
Next, the change presence / absence determination unit 111 compares the new hash value with the old hash value (step 105).
Subsequently, the change presence / absence determination unit 111 determines whether or not there is a corresponding old hash value (step 106). In other words, the change presence / absence determination unit 111 determines whether or not the name of the HTML document corresponding to the new intermediate XML document 121N exists in the old hash value list 122O corresponding to the old intermediate XML document 121O.

ステップ１０６で肯定結果が得られた場合（対応する旧ハッシュ値が存在せず、新規生成の場合）、変更有無判断部１１１は、差異リストに対応するＨＴＭＬ文書の名称を登録し、文書変換部１１２は、新中間ＸＭＬ文書１２１Ｎを対応するスタイルシート１２３でＨＴＭＬ変換する（ステップ１０８）。
ステップ１０６で否定結果が得られた場合（対応する旧ハッシュ値が存在する場合）、変更有無判断部１１１は、新ハッシュ値と旧ハッシュ値が異なっているか否かを判定する（ステップ１０７）。
ステップ１０７で肯定結果が得られた場合（新中間ＸＭＬ文書１２１Ｎが新規生成の場合）、変更有無判断部１１１はステップ１０８に進み、ＨＴＭＬ変換と名称の登録とを実行する。 When an affirmative result is obtained in step 106 (when the corresponding old hash value does not exist and is newly generated), the change presence / absence determination unit 111 registers the name of the HTML document corresponding to the difference list, and the document conversion unit 112 converts the new intermediate XML document 121N into HTML with the corresponding style sheet 123 (step 108).
When a negative result is obtained in step 106 (when the corresponding old hash value exists), the change presence / absence determination unit 111 determines whether or not the new hash value and the old hash value are different (step 107).
When an affirmative result is obtained in step 107 (when the new intermediate XML document 121N is newly generated), the change presence / absence determination unit 111 proceeds to step 108 to execute HTML conversion and name registration.

一方、ステップ１０７で否定結果が得られた場合（旧中間ＸＭＬ文書１２１Ｏが削除されている場合）、変更有無判断部１１１は、削除リストに旧中間ＸＭＬ文書１２１Ｏに対応するＨＴＭＬ文書の名称を登録する（ステップ１０９）。
ステップ１０８又はステップ１０９の実行後、データマージ部１１３は、差分変換データ１２４を前回の変換データ１２５にマージして今回の変換データ１２６を生成する（ステップ１１０）。
また、データマージ部１１３は、削除リストに登録されているＨＴＭＬ文書を今回の変換データ１２６から削除する（ステップ１１１）。 On the other hand, when a negative result is obtained in step 107 (when the old intermediate XML document 121O is deleted), the change presence / absence determination unit 111 registers the name of the HTML document corresponding to the old intermediate XML document 121O in the deletion list. (Step 109).
After executing step 108 or step 109, the data merging unit 113 merges the difference conversion data 124 with the previous conversion data 125 to generate the current conversion data 126 (step 110).
Further, the data merge unit 113 deletes the HTML document registered in the deletion list from the current conversion data 126 (step 111).

図９は、２回目以降のＨＴＭＬ変換の場合に実行される処理の流れを概略的に示す図である。
図９に示すように、２回目以降のＨＴＭＬ変換の場合も、入力された全ての新中間ＸＭＬ文書１２１Ｎについて新ハッシュ値が計算され、算出された新ハッシュ値が新ハッシュ値リスト１２２Ｎに保存される。
その後は、新中間ＸＭＬ文書１２１Ｎについての新ハッシュ値リスト１２２Ｎと旧中間ＸＭＬ文書１２１Ｏについての旧ハッシュ値リスト１２２Ｏとの比較処理が実行され、差異リストや削除リストが生成される。また、生成された差異リストや削除リストに基づいて、変更部分に限ってＨＴＭＬ文書が生成される。 FIG. 9 is a diagram schematically showing the flow of processing executed in the case of the second and subsequent HTML conversions.
As shown in FIG. 9, in the case of the second and subsequent HTML conversions, a new hash value is calculated for all the input new intermediate XML documents 121N, and the calculated new hash value is saved in the new hash value list 122N. To.
After that, the comparison process between the new hash value list 122N for the new intermediate XML document 121N and the old hash value list 122O for the old intermediate XML document 121O is executed, and the difference list and the deletion list are generated. Also, based on the generated difference list and deletion list, the HTML document is generated only for the changed part.

＜実施の形態１の効果＞
本実施の形態で説明した手法では、新中間ＸＭＬ文書１２１Ｎ毎に新ハッシュ値を計算し、新ハッシュ値リスト１２２Ｎと旧ハッシュ値リスト１２２Ｏとの比較により変化が検出された新中間ＸＭＬ文書１２１Ｎに限り、ＨＴＭＬ変換が実行される。
このため、本実施の形態によれば、変更の有無によらず全ての新中間ＸＭＬ文書１２１ＮをＨＴＭＬ変換の処理対象とする従前の手法に比して、ＨＴＭＬ文書が生成されるまでに要する時間を短縮することができる。
また、本実施の形態によれば、２回目以降のＨＴＭＬ変換には、リソースの少ない情報処理装置１００を用いることが可能になる。 <Effect of Embodiment 1>
In the method described in this embodiment, a new hash value is calculated for each new intermediate XML document 121N, and a change is detected in the new intermediate XML document 121N by comparing the new hash value list 122N with the old hash value list 122O. As long as the HTML conversion is performed.
Therefore, according to the present embodiment, the time required for the HTML document to be generated is compared with the conventional method in which all the new intermediate XML documents 121N are processed for HTML conversion regardless of whether or not they are changed. Can be shortened.
Further, according to the present embodiment, it is possible to use the information processing apparatus 100 having less resources for the second and subsequent HTML conversions.

＜実施の形態２＞
前述の実施の形態１の場合には、ＨＴＭＬ変換の処理対象となる新中間ＸＭＬ文書１２１Ｎが記憶装置１０２に記憶されている場合について説明したが、本実施の形態では、プログラム処理を通じ、ＸＭＬ文書から新中間ＸＭＬ文書１２１Ｎを生成する場合について説明する。
図１０は、実施の形態２に係る情報処理装置１００のソフトウェア構成の例を説明する図である。
図１０には、図３との対応部分に対応する符号が付されている。なお、図１０に示すソフトウェア構成は、信号処理部１０１によるプログラムの実行を通じて実現される。 <Embodiment 2>
In the case of the first embodiment described above, the case where the new intermediate XML document 121N to be processed for HTML conversion is stored in the storage device 102 has been described, but in the present embodiment, the XML document is stored through the program processing. A case where a new intermediate XML document 121N is generated from the above will be described.
FIG. 10 is a diagram illustrating an example of a software configuration of the information processing apparatus 100 according to the second embodiment.
In FIG. 10, a reference numeral corresponding to a portion corresponding to that in FIG. 3 is attached. The software configuration shown in FIG. 10 is realized through the execution of a program by the signal processing unit 101.

図１０と図３を比較して分かるように、本実施の形態における信号処理部１０１は、新中間ＸＭＬ文書１２１Ｎを生成するための機能を有する点で、実施の形態１における信号処理部１０１（図３）と異なっている。
すなわち、本実施の形態に係る信号処理部１０１は、スタイルシート１２３からパス式を抽出して変換パス表１３２を生成する変換データ判定部１３１と、ＸＭＬ文書１３０と変換パス表１３２から新中間ＸＭＬ文書１２１Ｎを生成する中間ＸＭＬ文書生成部１３３とを更に有している。
ここで、パス式とは、要素の位置を特定する情報の一例である。 As can be seen by comparing FIGS. 10 and 3, the signal processing unit 101 in the present embodiment has a function for generating the new intermediate XML document 121N, and the signal processing unit 101 in the first embodiment ( It is different from Fig. 3).
That is, the signal processing unit 101 according to the present embodiment has a conversion data determination unit 131 that extracts a path expression from the style sheet 123 to generate a conversion path table 132, and a new intermediate XML from the XML document 130 and the conversion path table 132. It further has an intermediate XML document generation unit 133 that generates the document 121N.
Here, the path expression is an example of information for specifying the position of an element.

＜処理動作＞
続いて、本実施の形態に係る情報処理装置１００において実行される処理動作について説明する。
図１１は、情報処理装置１００を構成する信号処理部１０１が実行する処理動作の例を説明するフローチャートである。
図１１には、図７との対応部分に対応する符号を付して示している。図１１に示すフローチャートと図７に示すフローチャートとの違いは、ステップ１０１（図７）に代えてステップ２０１とステップ２０２が実行される点である。 <Processing operation>
Subsequently, the processing operation executed by the information processing apparatus 100 according to the present embodiment will be described.
FIG. 11 is a flowchart illustrating an example of a processing operation executed by the signal processing unit 101 constituting the information processing apparatus 100.
FIG. 11 is shown with reference numerals corresponding to the portions corresponding to those in FIG. 7. The difference between the flowchart shown in FIG. 11 and the flowchart shown in FIG. 7 is that steps 201 and 202 are executed instead of step 101 (FIG. 7).

以下では、相違点であるステップ２０１とステップ２０２について説明する。
ステップ２０１では、変換データ判定部１３１（図１０参照）が、スタイルシート１２３からパス式（例えばＸＰａｔｈ式）を抽出し、パス式に含まれる値を変換パス表１３２に登録する。
スタイルシート１２３は、ＨＴＭＬ文書単位で用意されている。従って、変換データ判定部１３１は、スタイルシート１２３毎にパス式を抽出する。 Hereinafter, the differences between step 201 and step 202 will be described.
In step 201, the conversion data determination unit 131 (see FIG. 10) extracts a path expression (for example, an XPath expression) from the style sheet 123, and registers the value included in the path expression in the conversion path table 132.
The style sheet 123 is prepared for each HTML document. Therefore, the conversion data determination unit 131 extracts the path expression for each style sheet 123.

図１２は、変換データ判定部１３１によるパス式の抽出処理を説明する図である。
図１２の場合、スタイルシート１２３は、ＨＴＭＬ文書＃１～＃３に対応する３つである。図中では、スタイルシート＃１～＃３で示す。
図１２には、スタイルシート＃１の一部が示されており、パス式として「select="/INDEX/@NODETITLE"」と「match="/INDEX"」が抽出されている。
変換パス表１３２には、パス式に含まれる値、すなわち「/INDEX/@NODETITLE」と「/INDEX」が、対応するＨＴＭＬ文書に紐付けられた状態で保存される。 FIG. 12 is a diagram illustrating a pass-type extraction process by the conversion data determination unit 131.
In the case of FIG. 12, there are three style sheets 123 corresponding to HTML documents # 1 to # 3. In the figure, it is shown by style sheets # 1 to # 3.
FIG. 12 shows a part of the style sheet # 1, and "select =" / INDEX / @ NODETITLE "" and "match =" / INDEX "" are extracted as path expressions.
In the conversion path table 132, the values included in the path expression, that is, "/ INDEX / @ NODETITLE" and "/ INDEX" are stored in a state of being associated with the corresponding HTML document.

図１３は、中間ＸＭＬ文書生成部１３３（図１０参照）による中間ＸＭＬ文書１２１の生成処理を説明する図である。
図１３は、ＨＴＭＬ文書＃１に対応する中間ＸＭＬ文書＃１の生成する例を示している。
ここで、中間ＸＭＬ文書生成部１３３は、変換パス表１３２を参照し、ＸＭＬ文書１３０の中からＨＴＭＬ文書＃１に対応する値「/INDEX/@NODETITLE」と「/INDEX」に関連する記述を抽出し、中間ＸＭＬ文書＃１を生成する。
ここでの抽出される記述は、ＸＭＬ文書１３０のうちＨＴＭＬ文書＃１に関連するパス情報の一例である。
ＨＴＭＬ文書＃２に対応する中間ＸＭＬ文書＃２やＨＴＭＬ文書＃３に対応する中間ＸＭＬ文書＃３も同様の手順で生成される。 FIG. 13 is a diagram illustrating the generation process of the intermediate XML document 121 by the intermediate XML document generation unit 133 (see FIG. 10).
FIG. 13 shows an example of generating an intermediate XML document # 1 corresponding to the HTML document # 1.
Here, the intermediate XML document generation unit 133 refers to the conversion path table 132, and describes the values “/ INDEX / @ NODETITLE” and “/ INDEX” corresponding to the HTML document # 1 in the XML document 130. Extract and generate intermediate XML document # 1.
The description extracted here is an example of the path information related to HTML document # 1 in the XML document 130.
The intermediate XML document # 2 corresponding to the HTML document # 2 and the intermediate XML document # 3 corresponding to the HTML document # 3 are also generated by the same procedure.

以下では、より具体的な文書を用いて、変換データ判定部１３１（図１０参照）と中間ＸＭＬ文書生成部１３３（図１０参照）の処理動作を説明する。
図１４は、スタイルシート１２３の具体例と変換パス表１３２に保存される値との関係を説明する図である。
図１４におけるスタイルシート１２３は、図目次ＨＴＭＬ文書用のスタイルシート（fig_index．xsl）の例である。
図１４の場合、スタイルシート１２３から「Ｈ３」、「ＴＩＴＬＥ」、「ＦＩＧ」が抽出されている。図中、「Ｈ３」、「ＴＩＴＬＥ」、「ＦＩＧ」は太文字で示している。 Hereinafter, the processing operations of the conversion data determination unit 131 (see FIG. 10) and the intermediate XML document generation unit 133 (see FIG. 10) will be described using a more specific document.
FIG. 14 is a diagram illustrating a relationship between a specific example of the style sheet 123 and the values stored in the conversion path table 132.
The style sheet 123 in FIG. 14 is an example of a style sheet (fig_index.xsl) for a figure table of contents HTML document.
In the case of FIG. 14, "H3", "TITLE", and "FIG" are extracted from the style sheet 123. In the figure, "H3", "TITLE", and "FIG" are shown in bold letters.

図１１の説明に戻る。
ステップ２０２では、中間ＸＭＬ文書生成部１３３（図１０参照）が、変換パス表１３２を参照し、変換対象として指定されたＸＭＬ文書１３０から、ＨＴＭＬ文書別の中間ＸＭＬ文書１２１を生成する。具体的には、中間ＸＭＬ文書生成部１３３は、ＸＭＬ文書１３０の中からパス式に合致していないデータを削除し、中間ＸＭＬ文書１２１を生成する。 Returning to the description of FIG.
In step 202, the intermediate XML document generation unit 133 (see FIG. 10) refers to the conversion path table 132 and generates the intermediate XML document 121 for each HTML document from the XML document 130 designated as the conversion target. Specifically, the intermediate XML document generation unit 133 deletes data that does not match the path formula from the XML document 130, and generates the intermediate XML document 121.

図１５は、ＸＭＬ文書１３０と生成された中間ＸＭＬ文書１２１の関係を説明する図である。
図１５におけるＸＭＬ文書１３０は、図目次ＨＴＭＬ文書に関連する記述部分であり、ＸＭＬ文書１３０のうち「Ｈ３／ＴＩＴＬＥ」、「Ｈ３／／ＦＩＧ／ＴＩＴＬＥ」に関連する記述部分を抽出したものが中間ＸＭＬ文書１２１である。図１５では、抽出部分を太線で囲んで示している。 FIG. 15 is a diagram illustrating the relationship between the XML document 130 and the generated intermediate XML document 121.
The XML document 130 in FIG. 15 is a description part related to the HTML document of the figure table of contents, and the description part related to “H3 / TITLE” and “H3 // FIG / TITLE” is extracted from the XML document 130 in the middle. XML document 121. In FIG. 15, the extracted portion is surrounded by a thick line.

この場合において、ＸＭＬ文書１３０に変更が加えられた場合を検討する。
図１６は、図１５に例示したＸＭＬ文書１３０の一部分にＨＴＭＬ文書では利用されない情報が追加された場合の中間ＸＭＬ文書１２１の生成例を説明する図である。
図１６に示すＸＭＬ文書１３０の例では、太線で囲んだ部分で「画像を扱う。」が「画像を挿入することができます。」と変更されている。
ただし、この「画像を挿入することができます。」との文は、スタイルシート１２３によって生成されるＨＴＭＬ文書には影響しない情報である。
このため、スタイルシート１２３を用いて生成された変換パス表１３２に基づいて生成された中間ＸＭＬ文書１２１の内容は、修正前の中間ＸＭＬ文書１２１の内容（図１５）と同じになる。 In this case, consider the case where the XML document 130 is modified.
FIG. 16 is a diagram illustrating an example of generating an intermediate XML document 121 when information not used in an HTML document is added to a part of the XML document 130 illustrated in FIG. 15.
In the example of the XML document 130 shown in FIG. 16, "Handle an image" is changed to "An image can be inserted" in the portion surrounded by a thick line.
However, this sentence "an image can be inserted" is information that does not affect the HTML document generated by the style sheet 123.
Therefore, the content of the intermediate XML document 121 generated based on the conversion path table 132 generated using the style sheet 123 is the same as the content of the intermediate XML document 121 before modification (FIG. 15).

従って、「画像を挿入することができます。」との修正が加えられた後のＸＭＬ文書１３０について生成された新中間ＸＭＬ文書１２１Ｎについて計算される新ハッシュ値と、修正前のＸＭＬ文書１３０について生成された旧中間ＸＭＬ文書１２１Ｏについて計算される旧ハッシュ値とは一致する。
このため、変更有無判断部１１１は、この新中間ＸＭＬ文書１２１Ｎに対応するＨＴＭＬ文書の名称（すなわち、図目次ＨＴＭＬ文書）を差異リストに登録しない。結果的に、この新中間ＸＭＬ文書１２１Ｎについては、文書変換部１１２によるＨＴＭＬ変換の対象から除外される。 Therefore, about the new hash value calculated for the new intermediate XML document 121N generated for the XML document 130 after the modification "The image can be inserted" and the XML document 130 before the modification. It matches the old hash value calculated for the generated old intermediate XML document 121O.
Therefore, the change presence / absence determination unit 111 does not register the name of the HTML document corresponding to the new intermediate XML document 121N (that is, the HTML document in the table of contents) in the difference list. As a result, this new intermediate XML document 121N is excluded from the target of HTML conversion by the document conversion unit 112.

以下では、新中間ＸＭＬ文書１２１Ｎに対応する新ハッシュ値と旧中間ＸＭＬ文書１２１Ｏに対応する旧ハッシュ値とが異なる場合の具体例を説明する。
図１７は、ＨＴＭＬ文書に新たな情報を付加するような修正がＸＭＬ文書１３０に加えられた場合における新旧２つの中間ＸＭＬ文書１２１の具体例を示す図である。（Ａ）は新中間ＸＭＬ文書１２１Ｎの具体例であり、（Ｂ）は旧中間ＸＭＬ文書１２１Ｏの具体例である。
新中間ＸＭＬ文書１２１Ｎのうち太枠で囲んだ記述が追加部分である。なお、修正は記述の追加だけでなく削除も含まれる。
いずれにしても新中間ＸＭＬ文書１２１Ｎは、旧中間ＸＭＬ文書１２１Ｏとは内容が異なるため、文書変換部１１２によるＨＴＭＬ変換の対象となる。 Hereinafter, a specific example will be described in which the new hash value corresponding to the new intermediate XML document 121N and the old hash value corresponding to the old intermediate XML document 121O are different from each other.
FIG. 17 is a diagram showing a specific example of two intermediate XML documents 121, old and new, when a modification such as adding new information to an HTML document is added to the XML document 130. (A) is a specific example of the new intermediate XML document 121N, and (B) is a specific example of the old intermediate XML document 121O.
The description enclosed in a thick frame in the new intermediate XML document 121N is an additional part. The modification includes not only the addition of the description but also the deletion.
In any case, since the content of the new intermediate XML document 121N is different from that of the old intermediate XML document 121O, it is subject to HTML conversion by the document conversion unit 112.

図１８は、新しいＨＴＭＬ文書のための記述がＸＭＬ文書１３０に追加されている場合を説明する図である。（Ａ）は新中間ＸＭＬ文書１２１Ｎの具体例であり、（Ｂ）は旧中間ＸＭＬ文書１２１Ｏの具体例である。
この場合、図１８に示すように、旧中間ＸＭＬ文書１２１Ｏのデータは存在しない。従って、新中間ＸＭＬ文書１２１Ｎは、文書変換部１１２によるＨＴＭＬ変換の対象となる。 FIG. 18 is a diagram illustrating a case where a description for a new HTML document is added to the XML document 130. (A) is a specific example of the new intermediate XML document 121N, and (B) is a specific example of the old intermediate XML document 121O.
In this case, as shown in FIG. 18, the data of the old intermediate XML document 121O does not exist. Therefore, the new intermediate XML document 121N is subject to HTML conversion by the document conversion unit 112.

図１９は、既存のＨＴＭＬ文書に対応する記述がＸＭＬ文書１３０から削除された場合を説明する図である。（Ａ）は新中間ＸＭＬ文書１２１Ｎの具体例であり、（Ｂ）は旧中間ＸＭＬ文書１２１Ｏの具体例である。
この場合、図１９に示すように、旧中間ＸＭＬ文書１２１Ｏに対応する新中間ＸＭＬ文書１２１Ｎのデータは存在しない。このことは、削除リストによって確認されるので、データマージ部１１３は、マージ後のＨＴＭＬ文書の中から対応するＨＴＭＬ文書を削除する。 FIG. 19 is a diagram illustrating a case where the description corresponding to the existing HTML document is deleted from the XML document 130. (A) is a specific example of the new intermediate XML document 121N, and (B) is a specific example of the old intermediate XML document 121O.
In this case, as shown in FIG. 19, there is no data of the new intermediate XML document 121N corresponding to the old intermediate XML document 121O. Since this is confirmed by the deletion list, the data merging unit 113 deletes the corresponding HTML document from the merged HTML documents.

＜実施の形態２の効果＞
本実施の形態によれば、予め中間ＸＭＬ文書１２１を手作業で準備する必要がなくなり、作業者によって編集されたＸＭＬ文書１３０と各ＨＴＭＬ文書に対応するスタイルシート１２３を情報処理装置１００に与えれば、対応するＨＴＭＬ文書を情報処理装置１００の信号処理により生成することができる。
このため、本実施の形態の場合には、ＨＴＭＬ文書が生成されるまでに要する時間を、実施の形態１の場合よりも短縮することができる。 <Effect of Embodiment 2>
According to this embodiment, it is not necessary to manually prepare the intermediate XML document 121 in advance, and if the XML document 130 edited by the operator and the style sheet 123 corresponding to each HTML document are provided to the information processing apparatus 100. , The corresponding HTML document can be generated by the signal processing of the information processing apparatus 100.
Therefore, in the case of the present embodiment, the time required for the HTML document to be generated can be shortened as compared with the case of the first embodiment.

＜他の実施の形態＞
以上、本発明の実施の形態について説明したが、本発明の技術的範囲は上記実施の形態に記載の範囲には限定されない。上記実施の形態に、種々の変更又は改良を加えたものも、本発明の技術的範囲に含まれることは、特許請求の範囲の記載から明らかである。
例えば前述の実施の形態では、構造化文書の一例であるＸＭＬ文書をＨＴＭＬ文書に変換する場合について説明したが、他の構造化文書の変換処理にも用いることができる。例えばＳＧＭＬ（Standard Generalized Markup Language）で記述された文書、ＸＨＴＭＬ（Extensible HyperText Markup Language）で記述された文書、ＭａｔｈＭＬ（Mathematical Markup Language)で記述された文書等の構造変換にも使用できる。 <Other embodiments>
Although the embodiments of the present invention have been described above, the technical scope of the present invention is not limited to the scope described in the above embodiments. It is clear from the description of the claims that the above-described embodiment with various modifications or improvements is also included in the technical scope of the present invention.
For example, in the above-described embodiment, the case of converting an XML document, which is an example of a structured document, into an HTML document has been described, but it can also be used for conversion processing of other structured documents. For example, it can be used for structural conversion of a document described in SGML (Standard Generalized Markup Language), a document described in XHTML (Extensible HyperText Markup Language), a document described in MathML (Mathematical Markup Language), and the like.

１…ウェブシステム、１０…サーバシステム、２０…端末装置、２５…ブラウザ画面、１００…情報処理装置、１１０…文書読出し部、１１１…変更有無判断部、１１２…文書変換部、１１３…データマージ部、１２１、１２１Ｎ、１２１Ｏ…中間ＸＭＬ文書、１２２１２２Ｎ、１２２Ｏ…ハッシュ値リスト、１２３…スタイルシート、１２３Ａ…本文ＨＴＭＬ文書用スタイルシート、１２３Ｂ…目次ＨＴＭＬ文書用スタイルシート、１２３Ｃ…図ＨＴＭＬ文書用スタイルシート、１２４…差分変換データ、１２５…前回の変換データ、１２６…今回の変換データ、１３０…ＸＭＬ文書、１３１…変換データ判定部、１３２…変換パス表、１３３…中間ＸＭＬ文書生成部 1 ... Web system, 10 ... Server system, 20 ... Terminal device, 25 ... Browser screen, 100 ... Information processing device, 110 ... Document reading unit, 111 ... Change presence / absence determination unit, 112 ... Document conversion unit, 113 ... Data merging unit , 121, 121N, 121O ... Intermediate XML document, 122122N, 122O ... Hash value list, 123 ... Style sheet, 123A ... Text HTML document style sheet, 123B ... Table of contents HTML document style sheet, 123C ... Figure HTML document style sheet , 124 ... Difference conversion data, 125 ... Previous conversion data, 126 ... Current conversion data, 130 ... XML document, 131 ... Conversion data determination unit, 132 ... Conversion path table, 133 ... Intermediate XML document generation unit

Claims

On the computer
An intermediate document that is a plurality of partial data constituting the first structured document that is the conversion source and does not include information that is not referred to by the rules used for conversion of the representation format of the partial data is read from the storage unit, and the intermediate document is read. A function to convert a document into a second structured document based on the above rules, and
Prior to the conversion to the second structured document, the first hash value is calculated for the intermediate document for which the current conversion to the second structured document is the second and subsequent times, and the intermediate with the same name. A function of reading the second hash value calculated at the time of the previous conversion of the document from the storage unit and comparing the first hash value with the second hash value .
When the first hash value is different from the second hash value , the intermediate document corresponding to the first hash value is converted into a new second structured document, and the new second structure is obtained. A function to replace the corresponding existing second structured document stored in the storage unit by the converted document, and
A program to execute.

Each time the first hash value is calculated, a list is generated in which the calculated first hash value is associated with the information that identifies the second structured document corresponding to the first hash value. Further has a function of storing in the storage unit .
The program according to claim 1 , wherein the first hash value stored in the list is used as the second hash value at the next conversion of the intermediate document .

If the name corresponding to the new intermediate document does not exist in the existing list, the new intermediate document is converted to the second structured document and the first calculated for the new intermediate document. The program according to claim 2, further comprising a function of adding a hash value and information identifying the second structured document to the list.

If the existing name in the list does not exist in the new list generated for the new intermediate document, the name is added to the deletion list and the second structured document corresponding to the name is deleted. The program according to claim 2 , further comprising a function of performing.

The program according to claim 1, further comprising a function of extracting the information referred to in the rule from the first structured document to generate the intermediate document.

The program of claim 5, wherein the intermediate document is generated in units of the second structured document that is converted using the rule.

An intermediate document that is a plurality of partial data constituting the first structured document that is the conversion source and does not include information that is not referred to by the rules used for conversion of the representation format of the partial data is read from the storage unit, and the intermediate document is read. A conversion unit that converts a document into a second structured document based on the above rules, and
Prior to the conversion to the second structured document, the first hash value is calculated for the intermediate document for which the current conversion to the second structured document is the second and subsequent times, and the intermediate with the same name. A comparison unit that reads the second hash value calculated at the time of the previous conversion of the document from the storage unit and compares the first hash value with the second hash value .
When the first hash value is different from the second hash value , the intermediate document corresponding to the first hash value is converted into a new second structured document, and the new second structure is obtained. A replacement unit that replaces the corresponding existing second structured document stored in the storage unit by the converted document.
Information processing device with.