JP5176539B2

JP5176539B2 - Structured document file processing apparatus and method

Info

Publication number: JP5176539B2
Application number: JP2007338728A
Authority: JP
Inventors: ひとみ日向; 健一長島; 慎吾川崎; 祥正川又
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2007-12-28
Filing date: 2007-12-28
Publication date: 2013-04-03
Anticipated expiration: 2027-12-28
Also published as: JP2009157875A

Description

本発明は、ＸＭＬ等の構造化文書ファイルを高速に処理するための技術に関する。 The present invention relates to a technique for processing a structured document file such as XML at high speed.

近年、コンピュータネットワークの発展により、膨大な量の文書データが、やりとりされている。文書データの一つの形態として、ＸＭＬ（eXtensible Markup Language）で記述された構造化文書が用いられている。ＸＭＬは、文書やデータの意味や構造を記述するためのマークアップ言語の一つであり、「タグ」と呼ばれる特定の文字列で構造を埋め込んでいく。 In recent years, an enormous amount of document data has been exchanged with the development of computer networks. As one form of document data, a structured document described in XML (eXtensible Markup Language) is used. XML is one of markup languages for describing the meaning and structure of documents and data, and the structure is embedded with a specific character string called “tag”.

ＸＭＬは利用者が自由にタグを定義でき、文書中の文字列に意味付けができる言語構造を持っており、その操作性の良さから文書処理から電子商取引に至るまでネットワーク上のデータ処理の多くの局面で活用されている。このようなＸＭＬの解析・処理手段としては、ＤＯＭ（Document Object Model）とＳＡＸ（Simple API for XML）が広く用いられている。 XML has a language structure that allows users to freely define tags and give meaning to character strings in documents. Many of the data processing on the network from document processing to electronic commerce due to its good operability. It is utilized in the aspect of. As such XML analysis and processing means, DOM (Document Object Model) and SAX (Simple API for XML) are widely used.

ＤＯＭは、ＸＭＬ文書をメモリ上で要素のツリー構造として扱い、ツリーをランダムに行き来したり削除したりする。このため、任意のツリー要素にランダムアクセスするようなＸＭＬ操作に向いている。一方、ＳＡＸは、ＸＭＬ文書を読み込み、文書中からタグ等のイベントを検出して、イベント毎に設けられたメソッドを実行する。このため、ＳＡＸは、順序立てた要素処理を行うようなＸＭＬ操作に向いている。 DOM treats an XML document as a tree structure of elements in memory, and traverses and deletes the tree at random. For this reason, it is suitable for an XML operation that randomly accesses an arbitrary tree element. On the other hand, the SAX reads an XML document, detects an event such as a tag from the document, and executes a method provided for each event. For this reason, SAX is suitable for XML operations that perform ordered element processing.

しかしながら、ＤＯＭでは、全データをメモリに保有するので大容量のＸＭＬ文書を処理する場合、処理が重くなる傾向にある。また、ＳＡＸでは、ＸＭＬ文書の編集や複雑な操作ができない。一方、ＸＭＬの構造的特徴を利用して、ＸＭＬを、複数のファイルを含むデジタル封筒としての活用も行われている。デジタル封筒として活用するためには、デジタル封筒内の任意のファイルにアクセスし、追加・編集・削除等の処理を行うこと、複数のファイルデータを内包することが必要となるが、ＤＯＭ、ＳＡＸでは、これらの要求に応えられないという問題がある。また、構造化文書ファイルを効率的に処理する技術も存在するが（特許文献１参照）、位置情報取得用に別途特別な前処理が必要となるという問題がある。
特開２００５−２３４８３７号公報 However, in DOM, since all data is held in a memory, when processing a large-capacity XML document, the processing tends to be heavy. In SAX, XML documents cannot be edited and complicated operations cannot be performed. On the other hand, using the structural features of XML, XML is also used as a digital envelope including a plurality of files. In order to use it as a digital envelope, it is necessary to access any file in the digital envelope, perform processing such as addition / editing / deletion, etc., and include multiple file data, but in DOM and SAX, There is a problem that these requests cannot be met. In addition, there is a technique for efficiently processing a structured document file (see Patent Document 1), but there is a problem that special pre-processing is separately required for acquiring position information.
Japanese Patent Laid-Open No. 2005-234837

そこで、本発明は、特別な前処理を行うことなく、複数のファイルを包含した大容量の構造化文書を高速に処理することが可能な構造化文書ファイル、およびその処理装置および方法を提供することを課題とする。 Therefore, the present invention provides a structured document file that can process a large-capacity structured document including a plurality of files at high speed without performing special preprocessing, and a processing apparatus and method therefor. This is the issue.

上記課題を解決するため、本発明では、記録されるデータごとに区分けされ、各データの絶対パス名が記録されることにより、複数のデータが記録されたデータ保持領域と、前記データ保持領域に含まれる前記データの名称であるデータ名および階層構造を成すディレクトリ名を有するデータ管理領域と、を有し、タグにより階層構造が表現された構造化文書ファイルを処理する装置であって、記録されているデータである記録データが指定された際に、当該記録データのデータ名で前記データ管理領域を探索し、当該データ名が含まれるタグを取得し、取得されたタグを最上位階層まで辿って階層構造を含むディレクトリ名を取得することにより前記指定された記録データの絶対パス名を生成する手段と、前記生成された絶対パス名で前記データ保持領域を参照し、前記絶対パス名に対応付けて記録されている記録データを抽出する手段を有する構造化文書ファイルの処理装置を提供する。 In order to solve the above-described problems, in the present invention , each data to be recorded is divided, and the absolute path name of each data is recorded, so that a plurality of data is recorded in the data holding area and the data holding area. A data management area having a data name that is the name of the data included and a directory name that forms a hierarchical structure, and a device that processes a structured document file in which the hierarchical structure is expressed by a tag, and is recorded When the recorded data, which is the recorded data, is specified, the data management area is searched with the data name of the recorded data, the tag including the data name is acquired, and the acquired tag is traced to the highest layer. Means for generating an absolute path name of the designated recording data by obtaining a directory name including a hierarchical structure, and the generated absolute path name Referring to over data holding area, to provide a processing apparatus of a structured document file having a means for extracting the data recorded in association with the absolute path name.

また、本発明では、ディレクトリの階層構造を示したツリー画面上でディレクトリが指定され、新規データのデータ名が入力された際に、最上位階層から入力されたデータ名までの絶対パス名を生成する手段と、前記生成された絶対パス名を含むタグを前記データ保持領域内の対応する位置に記録し、前記入力された新規データのデータ名を含むタグを前記データ管理領域内の対応する位置に記録する手段と、をさらに有する構造化文書ファイルの処理装置を提供する。 In the present invention, when a directory is specified on the tree screen showing the directory hierarchy and a data name of new data is input, an absolute path name from the highest level to the input data name is generated. And a tag that includes the generated absolute path name in a corresponding position in the data holding area, and a tag that includes the data name of the input new data corresponds to a position in the data management area. A structured document file processing apparatus further comprising :

また、本発明では、記録されるデータごとに区分けされ、各データの絶対パス名が記録されることにより、複数のデータが記録されたデータ保持領域と、前記データ保持領域に含まれる前記データの名称であるデータ名および階層構造を成すディレクトリ名を有するデータ管理領域と、を有し、タグにより階層構造が表現された構造化文書ファイルをコンピュータが処理する方法であって、格納データが指定された際に、コンピュータが、記録されているデータである記録データが指定された際に、当該記録データのデータ名で前記データ管理領域を探索し、当該データ名が含まれるタグを取得し、取得されたタグを最上位階層まで辿って階層構造を含むディレクトリ名を取得することにより前記指定された記録データの絶対パス名を生成する段階と、コンピュータが、前記生成された絶対パス名で前記データ保持領域を参照し、前記絶対パス名に対応付けて記録されている記録データを抽出する段階を有する構造化文書ファイルの処理方法を提供する。 Further, in the present invention , each data to be recorded is divided, and the absolute path name of each data is recorded, so that a data holding area in which a plurality of data is recorded and the data included in the data holding area are recorded. A data management area having a data name that is a name and a directory name that forms a hierarchical structure, and a computer that processes a structured document file in which the hierarchical structure is expressed by a tag , in which storage data is designated When recording data that is recorded data is designated, the computer searches the data management area by the data name of the recording data, acquires a tag including the data name, and acquires to generate the full path name of the record data the specified by acquiring the name of the directory that contains the hierarchy the tag by following to the highest hierarchy is A method, computer, by referring to the data retention area in absolute pathname said generated processing method of a structured document file including the step of extracting the data recorded in association with the full path name provide.

本発明によれば、複数のデータを保持する構造化文書ファイルを、保持されるデータごとに区分けされ、各データの絶対パスを含むデータ名が記録されたデータ保持領域と、データ保持領域に含まれる前記データ名および階層構造を成すディレクトリ名を有するデータ管理領域により構成されるようにしたので、大容量の構造化文書を高速に処理することが可能となる。 According to the present invention, a structured document file that holds a plurality of data is divided into data to be held, and a data holding area in which a data name including the absolute path of each data is recorded, and the data holding area includes Since the data management area has the data name and the directory name having a hierarchical structure, a large-capacity structured document can be processed at high speed.

本発明によれば、特別な前処理を行うことなく、複数のファイルを包含した大容量の構造化文書を高速に処理することが可能となるという効果を奏する。 According to the present invention, there is an effect that a large-capacity structured document including a plurality of files can be processed at high speed without performing special preprocessing.

以下、本発明の好適な実施形態について、図面を参照して詳細に説明する。
（１．ファイル構造）
図１は、本発明に係る構造化文書ファイルの構造を示す概略図である。図１に示すように、本発明の構造化文書ファイルは、データ管理領域とデータ保持領域を有している。データ保持領域においては、複数のデータを保持することが可能であり、構造化文書ファイルがデジタル封筒としての役割を果たしている。構造化文書ファイルの形式としては、特に限定はされないが、内部に格納されているデータ以外は、全てテキストデータであることが必要となる。本実施形態では、全てテキストデータで表現されたＸＭＬ形式で、構造化文書ファイルを実現している。 DESCRIPTION OF EMBODIMENTS Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the drawings.
(1. File structure)
FIG. 1 is a schematic diagram showing the structure of a structured document file according to the present invention. As shown in FIG. 1, the structured document file of the present invention has a data management area and a data holding area. In the data holding area, it is possible to hold a plurality of data, and the structured document file serves as a digital envelope. The format of the structured document file is not particularly limited, but all data other than the data stored inside must be text data. In the present embodiment, the structured document file is realized in the XML format that is all expressed in text data.

図２は、データ管理領域の詳細を示す図である。本実施形態の構造化文書ファイルでは、データ管理領域およびデータ保持領域内のディレクトリをコンテナという仮想の単位で管理している。図２のデータ管理領域の１行目と最終の１９行目のタグでコンテナ（Container）の開始と終了を示している。１行目のタグの中身に示すように、このコンテナのコンテナＩＤは、“EVP3HeaderContainer”となっている。 FIG. 2 is a diagram showing details of the data management area. In the structured document file of this embodiment, directories in the data management area and the data holding area are managed in virtual units called containers. The first and last 19th line tags in the data management area in FIG. 2 indicate the start and end of the container (Container). As shown in the contents of the tag in the first line, the container ID of this container is “EVP3HeaderContainer”.

図２のデータ管理領域の２行目と１８行目のタグは、コンテナの中身の開始と終了を示している。３行目と１７行目のタグは、ルート（root）ディレクトリを示している。４行目と６行目のタグは、EVPHeaderディレクトリを示し、５行目のタグは、EVPHeaderディレクトリの下位にFlowCtrlInfoディレクトリがあり、FlowCtrlInfoディレクトリの中身が空であることを示している。７行目と１３行目のタグは、EVPBodyディレクトリを示し、８行目と１２行目のタグは、EVPBodyディレクトリの下位のJobsディレクトリを示し、９、１０、１１行目のタグは、Jobsディレクトリ内にファイル“Job1.txt”“Job2.txt”“Job3.txt”が存在することを示している。１４行目と１６行目のタグは、CDStageInfoディレクトリを示し、１５行目のタグは、CDStageInfoディレクトリ内にファイル“XMLFAT.xml”が存在することを示している。 Tags in the second and 18th lines of the data management area in FIG. 2 indicate the start and end of the contents of the container. The tags on the 3rd and 17th lines indicate the root directory. The tags on the 4th and 6th lines indicate the EVPHeader directory, and the tag on the 5th line indicates that the FlowCtrlInfo directory is below the EVPHeader directory and that the contents of the FlowCtrlInfo directory are empty. The tags on the 7th and 13th lines indicate the EVPBody directory, the tags on the 8th and 12th lines indicate the Jobs directory below the EVPBody directory, and the tags on the 9th, 10th, and 11th lines indicate the Jobs directory. Indicates that the files “Job1.txt”, “Job2.txt”, and “Job3.txt” exist. The tags on the 14th and 16th lines indicate the CDStageInfo directory, and the tag on the 15th line indicates that the file “XMLFAT.xml” exists in the CDStageInfo directory.

図３は、データ保持領域の詳細を示す図である。図３のデータ保持領域においては、５行目と下から１２行目のタグで、root、EVPBodyの下位のJobsディレクトリを示し、６行目から２４行目は、Jobsディレクトリの中身を示している。このうち、６行目から１４行目は、データである“Job1.txt”に関し、１５行目から２３行目は、データである“Job2.txt”に関するものである。“Job1.txt”に関する６行目から１４行目のうち、６行目から８行目は、“Job1.txt”のデータそのものを示し、９行目から１４行目は、“Job1.txt”に関する情報を示す。同様に、“Job2.txt”に関する１５行目から２３行目のうち、１５行目から１７行目は、“Job2.txt”のデータそのものを示し、１８行目から２３行目は、“Job2.txt”に関する情報を示す。２４行目からは、“Job2.txt”に関する記述がなされるが、図が繁雑になるため省略している。また、下から１１行目と下から１行目のタグで、rootの下位のCDStageInfoディレクトリを示し、下から１０行目から下から２行目は、CDStageInfoディレクトリの中身を示している。このうち、下から１０行目から下から８行目は、“XMLFAT.xml”のデータそのものを示し、下から７行目から下から２行目は、“XMLFAT.xml”に関する情報を示す。 FIG. 3 is a diagram showing details of the data holding area. In the data holding area of FIG. 3, the tags in the 5th line and the 12th line from the bottom indicate the Jobs directory below root and EVPBody, and the 6th to 24th lines indicate the contents of the Jobs directory. . Among these lines, the 6th to 14th lines relate to the data “Job1.txt”, and the 15th to 23rd lines relate to the data “Job2.txt”. Of the 6th to 14th lines related to “Job1.txt”, the 6th to 8th lines indicate the data of “Job1.txt” itself, and the 9th to 14th lines indicate “Job1.txt”. Indicates information about. Similarly, of the 15th to 23rd lines relating to “Job2.txt”, the 15th to 17th lines indicate the data of “Job2.txt” itself, and the 18th to 23rd lines indicate “Job2”. Indicates information about “.txt”. From the 24th line, description about “Job2.txt” is made, but it is omitted because the figure becomes complicated. Further, the tags on the 11th line from the bottom and the 1st line from the bottom indicate the CDStageInfo directory below the root, and the 10th line from the bottom to the 2nd line indicate the contents of the CDStageInfo directory. Among these, the 10th line from the bottom to the 8th line from the bottom indicate the data of “XMLFAT.xml” itself, and the 7th line from the bottom to the 2nd line from the bottom indicate information related to “XMLFAT.xml”.

（２．処理装置および方法）
続いて、本発明に係る構造化文書ファイルの処理装置および方法について説明する。本発明に係る構造化文書ファイルの処理装置は、専用のプログラムを汎用のコンピュータに組み込むことにより実現される。そして、本発明に係る構造化文書ファイルの処理方法は、本発明に係る構造化文書ファイルの処理装置、すなわち専用のプログラムを組み込んだコンピュータにより実行される。まず、汎用のコンピュータに、本発明の構造化文書ファイルの処理方法を実現するための専用のプログラムを組み込んで構造化文書ファイルの処理装置を構成する。このプログラムは、従来のＤＯＭ機能を一部の機能として有している。 (2. Processing apparatus and method)
Subsequently, a structured document file processing apparatus and method according to the present invention will be described. The structured document file processing apparatus according to the present invention is realized by incorporating a dedicated program into a general-purpose computer. The structured document file processing method according to the present invention is executed by the structured document file processing apparatus according to the present invention, that is, a computer incorporating a dedicated program. First, a structured document file processing apparatus is configured by incorporating a dedicated program for realizing the structured document file processing method of the present invention into a general-purpose computer. This program has a conventional DOM function as a partial function.

上述のようなプログラムを組み込んだ状態で、コンピュータが、構造化文書ファイルを読み込む。そして、それがＸＭＬファイルであることを認識すると、コンピュータは、まず、データ管理領域内のタグを解析して、ディレクトリ構造を認識し、画面に表示する。このようなＸＭＬのタグを解析してディレクトリ構造を画面に表示する機能については、上述のように、公知のＤＯＭ機能を採用している。この際、データ管理領域の情報がメモリに展開されるが、データ保持領域に比べてデータ量が小さいため、コンピュータの処理負荷は小さい。 The computer reads the structured document file in a state where the program as described above is incorporated. When recognizing that it is an XML file, the computer first analyzes the tag in the data management area, recognizes the directory structure, and displays it on the screen. As the function of analyzing the XML tag and displaying the directory structure on the screen, a known DOM function is employed as described above. At this time, the information in the data management area is expanded in the memory, but since the amount of data is smaller than that in the data holding area, the processing load on the computer is small.

ディレクトリ構造を表示した状態の画面を図４に示す。コンピュータは、データ管理領域のタグを解析し、ディレクトリについては、ディレクトリを示す図形にディレクトリ名を割り当て、ファイルについては、ファイルを示す図形にファイル名を割り当てて、上下関係を有するディレクトリ、ファイル間を線分で結ぶ処理を行う。その結果、コンピュータの画面には、図４に示すようなツリー構造が表示されることになる。
A screen showing the directory structure is shown in FIG. The computer analyzes the tags in the data management area, and for directories, assigns a directory name to the graphic that indicates the directory, and for files, assigns a file name to the graphic that indicates the file, Perform processing to connect with line segments. As a result, the computer screen would have tree structure as shown in FIG. 4 is displayed.

図４に示した画面が表示された状態で、利用者が画面上のファイルを示す図形を選択すると、コンピュータは、選択された図形の位置を検知して対応するファイルを認識する。そして、コンピュータは、認識したファイルのファイル名で、構造化文書ファイル内のデータ管理領域を参照し、そのファイルのパスを取得する。 When the user selects a graphic representing a file on the screen while the screen shown in FIG. 4 is displayed, the computer detects the position of the selected graphic and recognizes the corresponding file. Then, the computer refers to the data management area in the structured document file with the file name of the recognized file, and acquires the path of the file.

例えば、図４の画面で“Job2.txt”が選択された場合、コンピュータは、構造化文書ファイル内のデータ管理領域を、“Job2.txt”で探索する。すると、図２に示したように、データ管理領域の１０行目に“Job2.txt”が見つかる。そして、コンピュータは、その上位のタグを辿ることにより、データ管理領域の８行目にJobsディレクトリを発見する。さらに上位のタグを辿ることにより、データ管理領域の７行目にEVPBodyディレクトリ、データ管理領域の３行目にrootディレクトリを発見する。rootディレクトリまで辿りついたら、コンピュータは、rootディレクトリから“Job2.txt”までのパスを作成する。この結果、Ｉｄ＝“/ root/ EVPBody/ Jobs/ Job2.txt”というパスが得られる。 For example, when “Job2.txt” is selected on the screen of FIG. 4, the computer searches the data management area in the structured document file with “Job2.txt”. Then, as shown in FIG. 2, “Job2.txt” is found in the 10th line of the data management area. Then, the computer finds the Jobs directory on the 8th line of the data management area by tracing the upper tag. By tracing the higher-level tags, the EVPBody directory is found on the seventh line of the data management area, and the root directory is found on the third line of the data management area. After reaching the root directory, the computer creates a path from the root directory to “Job2.txt”. As a result, a path of Id = “/ root / EVPBody / Jobs / Job2.txt” is obtained.

この時点までは、コンピュータは、データ量の大きいデータ保持領域を全く参照していない。このため、処理負荷の削減と、処理の高速化が図れるのである。そして、選択されたデータへのパスが得られたら、このパスを利用して、コンピュータは、初めてデータ保持領域を参照する。Ｉｄ＝“/ root/ EVPBody/ Jobs/ Job2.txt”というパスを利用すると、図３に示したデータ保持領域の１５行目のタグと一致するので、コンピュータは、その次の１６行目のデータ内容を取得する。 Up to this point, the computer has not referred to the data holding area having a large amount of data. For this reason, the processing load can be reduced and the processing speed can be increased. When a path to the selected data is obtained, the computer refers to the data holding area for the first time using this path. If the path Id = “/ root / EVPBody / Jobs / Job2.txt” is used, it matches the 15th line tag of the data holding area shown in FIG. Get the contents.

コンピュータは取得したデータが暗号化されている場合には、それを認識し、復号化する。図３の例では、“Job2.txt”は暗号化されているので、コンピュータは、“Job2.txt”を復号化することになる。そして、復号化したデータのファイルを開き、画面に表示する。ここまでを、利用者の視点から見ると、図４のツリー画面で“Job2.txt”を選択すると、すぐにそのファイルの中身が画面に表示された状態となる。 If the acquired data is encrypted, the computer recognizes it and decrypts it. In the example of FIG. 3, since “Job2.txt” is encrypted, the computer decrypts “Job2.txt”. Then, the decrypted data file is opened and displayed on the screen. From the user's point of view, when “Job2.txt” is selected on the tree screen in FIG. 4, the contents of the file are immediately displayed on the screen.

表示された内容に対して、利用者が編集を行った場合、更新した内容を保存することができる。利用者が内容を更新した状態で、コンピュータに対して保存指示を行うと、コンピュータは、既に取得しているＩｄ＝“/ root/ EVPBody/ Jobs/ Job2.txt”というパスで、データ保持領域内を検索し、見つかったタグの直後に、修正後の内容を格納する。 When the user edits the displayed content, the updated content can be saved. When the user gives a save instruction to the computer with the updated contents, the computer uses the already acquired Id = “/ root / EVPBody / Jobs / Job2.txt” in the data holding area. The modified content is stored immediately after the found tag.

次に、新規に作成したデータを構造化文書ファイル内に格納する場合について説明する。新規に作成したデータを格納する場合も、まず、構造化文書ファイルをコンピュータで読み込み、ツリー情報を画面に表示する。この状態で、利用者は、新規データを格納する指示をコンピュータに対して行うとともに、格納するディレクトリを示す図形をツリー画面で指定し、ファイル名を入力する。例えば、Jobsディレクトリの図形を指定し、ファイル名“Job4.txt”を入力したとすると、コンピュータは、ツリー画面において、Jobsディレクトリの図形の下位に“Job4.txt”ファイルの図形を生成する。このときのツリー画面の状態を図５に示す。 Next, a case where newly created data is stored in a structured document file will be described. When storing newly created data, the structured document file is first read by a computer and the tree information is displayed on the screen. In this state, the user instructs the computer to store new data, specifies a figure indicating the directory to be stored on the tree screen, and inputs a file name. For example, if the figure of the Jobs directory is specified and the file name “Job4.txt” is input, the computer generates a figure of the “Job4.txt” file below the figure of the Jobs directory on the tree screen. The state of the tree screen at this time is shown in FIG.

さらに、コンピュータは、ツリー画面を解析して、新規ファイルまでのパスを作成する。この結果、Ｉｄ＝“/ root/ EVPBody/ Jobs/ Job4.txt”というパスが得られる。そして、利用者が格納するファイルを指定すると、コンピュータは、指定されたファイルを暗号化した後、構造化文書ファイルのデータ保持領域における、上記作成したパスで特定される位置に指定されたファイルを格納する。この格納処理が終わった後の、データ保持領域の状態を図６に示す。図６において、下線を付した部分が新規に追加されたタグである。“Job4.txt”ファイルを新規追加することにより、図６に示した９行分のタグが追加されることになる。 Further, the computer analyzes the tree screen and creates a path to the new file. As a result, a path of Id = “/ root / EVPBody / Jobs / Job4.txt” is obtained. When the user specifies a file to be stored, the computer encrypts the specified file, and then the specified file at the position specified by the created path in the data holding area of the structured document file. Store. FIG. 6 shows the state of the data holding area after this storage processing is completed. In FIG. 6, the underlined portion is a newly added tag. By adding a new “Job4.txt” file, the tags for nine lines shown in FIG. 6 are added.

続いて、コンピュータは、構造化文書ファイルのデータ管理領域内に、新規データのタグを追加する。上記のように、データ保持領域において、Ｉｄ＝“/ root/ EVPBody/ Jobs/ Job4.txt”というパスで特定される位置に新規データを格納したので、データ管理領域においても対応する位置にタグを追加する。この追加処理が終わった後の、データ管理領域の状態を図７に示す。図７において、下線を付した部分が新規に追加されたタグである。“Job4.txt”ファイルを新規追加することにより、図７に示した１行分のタグが追加されることになる。なお、図６、図７においては、新規追加されたタグに下線を付したが、これは、説明の便宜のためであり、実際には下線は付されない。 Subsequently, the computer adds a tag for new data in the data management area of the structured document file. As described above, since new data is stored at the location specified by the path Id = “/ root / EVPBody / Jobs / Job4.txt” in the data holding area, a tag is added to the corresponding position in the data management area. to add. FIG. 7 shows the state of the data management area after this addition processing is completed. In FIG. 7, the underlined portion is a newly added tag. By adding a new “Job4.txt” file, the tag for one line shown in FIG. 7 is added. 6 and 7, the newly added tag is underlined. However, this is for convenience of explanation, and the underline is not actually added.

本発明に係る構造化文書ファイルは、複数のデータファイルを包含したデジタル封筒として利用することができる。例えば、複数の者が文書ファイルのやりとりするために利用することが可能である。一例として、起票者とユーザＡ、ユーザＢの計３人で報告書の収集を行う場合について説明する。 The structured document file according to the present invention can be used as a digital envelope including a plurality of data files. For example, a plurality of persons can use it for exchanging document files. As an example, a case will be described in which reports are collected by a total of three people, a drafter, user A, and user B.

この場合、起票者、ユーザＡ、ユーザＢが有するコンピュータには、本発明の構造化文書ファイルの処理方法を実現するためのプログラムが組み込まれており、各コンピュータは、互いにデータ送受信が可能なように接続されているものとする。まず、起票者は、自身のコンピュータで報告書のテンプレートファイルを作成し、このテンプレートファイルを、上記新規データの格納で説明したような処理により、構造化文書ファイルに格納する。格納後、起票者は、構造化文書ファイルを、ユーザＡに送信する。 In this case, the computer of the drafter, user A, and user B incorporates the program for realizing the structured document file processing method of the present invention, and each computer can transmit and receive data to and from each other. It shall be connected as follows. First, the drafter creates a report template file on his computer, and stores this template file in the structured document file by the processing described in the above-mentioned storage of new data. After the storage, the drafter transmits the structured document file to the user A.

ユーザＡは、受信した構造化文書ファイルを起動する指示を行うと、ユーザＡのコンピュータは、構造化文書ファイルのデータ管理領域を解析してツリー構造を画面に表示させる。ユーザＡが、ツリー構造の中から報告書のテンプレートファイルを選択すると、コンピュータは、指定されたテンプレートファイルを構造化文書ファイルから抽出する。テンプレートファイルに所定の事項を書き込んで、報告書ファイルを作成した後、絶対パスを指定して、構造化文書ファイルに格納する。格納後、ユーザＡは、構造化文書ファイルを、ユーザＢに送信する。ユーザＢは、ユーザＡと同様に処理を行って、報告書を作成し、構造化文書ファイルに格納する。格納後、ユーザＢは、構造化文書ファイルを、起票者に送信する。起票者は、構造化文書ファイルからツリー構造を表示し、ユーザＡ、ユーザＢの報告書ファイルを選択する。すると、ユーザＡ、ユーザＢの報告書ファイルが抽出され、画面に表示される。以上のようにして、起票者は、自分が指定したテンプレートに従った各ユーザの報告書を受け取ることができる。 When the user A gives an instruction to activate the received structured document file, the computer of the user A analyzes the data management area of the structured document file and displays the tree structure on the screen. When the user A selects a report template file from the tree structure, the computer extracts the designated template file from the structured document file. After a predetermined item is written in the template file and a report file is created, an absolute path is designated and stored in the structured document file. After storage, user A transmits a structured document file to user B. User B performs the same process as user A, creates a report, and stores it in a structured document file. After the storage, the user B transmits the structured document file to the drafter. The drafter displays a tree structure from the structured document file, and selects the report files of user A and user B. Then, report files of user A and user B are extracted and displayed on the screen. As described above, the drafter can receive the report of each user according to the template designated by the drafter.

本発明に係る構造化文書ファイルの構造を示す概略図である。It is the schematic which shows the structure of the structured document file based on this invention. データ管理領域の詳細を示す図である。It is a figure which shows the detail of a data management area | region. データ保持領域の詳細を示す図である。It is a figure which shows the detail of a data holding area. データ管理領域を解析して、ディレクトリ構造を表示した状態の画面を示す図である。It is a figure which shows the screen of the state which analyzed the data management area | region and displayed the directory structure. 新規データ格納時のツリー画面の状態を示す図である。It is a figure which shows the state of the tree screen at the time of new data storage. 新規データ格納時の、データ保持領域を示す図である。It is a figure which shows the data holding area at the time of new data storage. 新規データ格納時の、データ管理領域を示す図である。It is a figure which shows the data management area | region at the time of new data storage.

Claims

The data is divided for each data to be recorded, and by recording the absolute path name of each data, a data holding area in which a plurality of data is recorded, a data name that is the name of the data included in the data holding area, and A data management area having a directory name having a hierarchical structure, and processing a structured document file in which the hierarchical structure is expressed by a tag ,
When recording data that is recorded data is specified , the data management area is searched with the data name of the recording data, a tag including the data name is acquired, and the acquired tag is stored in the highest hierarchy. Means for generating an absolute path name of the specified recording data by tracing up to obtain a directory name including a hierarchical structure;
Means for referring to the data holding area by the generated absolute path name and extracting recorded data recorded in association with the absolute path name ;
An apparatus for processing a structured document file, comprising:

When a directory is specified on the tree screen showing the directory hierarchy and a data name of new data is input, a means for generating an absolute path name from the top level to the input data name;
Means for recording a tag including the generated absolute path name in a corresponding position in the data holding area and recording a tag including a data name of the input new data in a corresponding position in the data management area; When,
The structured document file processing apparatus according to claim 1, further comprising:

The data is divided for each data to be recorded, and by recording the absolute path name of each data, a data holding area in which a plurality of data is recorded, a data name that is the name of the data included in the data holding area, and A data management area having a directory name having a hierarchical structure, and a computer processing a structured document file in which the hierarchical structure is expressed by a tag ,
When storage data is specified,
When recording data that is recorded data is designated, the computer searches the data management area by the data name of the recording data, acquires a tag including the data name, and acquires the acquired tag Generating an absolute path name of the specified recording data by tracing up to the highest hierarchy and obtaining a directory name including a hierarchical structure;
The computer refers to the data holding area with the generated absolute path name, and extracts recorded data recorded in association with the absolute path name ;
A method for processing a structured document file, comprising: