JP2017102496A

JP2017102496A - Document review support program, document review support device, and document review support method

Info

Publication number: JP2017102496A
Application number: JP2015232603A
Authority: JP
Inventors: 正夫井跡; Masao Iseki; 慧杉山; Satoshi Sugiyama; 光樹蓬田; Mitsuki Yomogida
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2015-11-30
Filing date: 2015-11-30
Publication date: 2017-06-08

Abstract

PROBLEM TO BE SOLVED: To provide a document review support program, a document review support device and a document review support method capable of improving the convenience when checking differences between a plurality of document files each having different file format.SOLUTION: With respect to a plurality of document files each having different file format which are generated on the basis of an identical generation source data used for generating document files, a piece of attribute information is generated for each line unit of a document which is included in a document file. With respect to a plurality of document files, a comparison result of the attribute information is output for each line in the document.SELECTED DRAWING: Figure 5

Description

本発明は、文書校閲支援プログラム、文書校閲支援装置及び文書校閲支援方法に関する。 The present invention relates to a document review support program, a document review support apparatus, and a document review support method.

利用者に対してサービスを提供する事業者（以下、単に事業者とも呼ぶ）は、例えば、利用者に対して各種サービスを提供するために、業務システムの構築を行う。具体的に、事業者は、例えば、文書ファイルの生成元となるデータ（以下、生成元データとも呼ぶ）から、利用者が指定したファイル形式の文書ファイルを生成する機能を有する業務システムを構築する。 A provider that provides services to users (hereinafter, also simply referred to as a provider) constructs a business system in order to provide various services to users, for example. Specifically, for example, a business operator constructs a business system having a function of generating a document file in a file format designated by a user from data that is a generation source of a document file (hereinafter also referred to as generation source data). .

そして、利用者は、この場合、例えば、生成する文書ファイルのファイル形式（例えば、複数のファイル形式）を指定し、業務システムに対して生成元データを入力する。その後、業務システムは、例えば、生成元データの入力に応じて、その生成元データの変換し、利用者が指定したファイル形式の文書ファイルを生成する。これにより、利用者は、用途に応じた文書ファイルを取得することが可能になる（例えば、特許文献１乃至３参照）。 In this case, for example, the user designates the file format (for example, a plurality of file formats) of the document file to be generated, and inputs the generation source data to the business system. Thereafter, the business system, for example, converts the generation source data in accordance with the input of the generation source data, and generates a document file in a file format designated by the user. As a result, the user can acquire a document file corresponding to the application (see, for example, Patent Documents 1 to 3).

特開平２−２５５９６４号公報JP-A-2-255964 特開平１−２１６４６８号公報JP-A-1-216468 特開２００６−９２３４４号公報JP 2006-92344 A

利用者が同一の生成元データから異なるファイル形式の文書ファイルを複数生成する場合、各文書ファイルは、全て同一の内容である必要がある。しかしながら、生成元データから文書ファイルを生成するための変換プロセスは、ファイル形式によって異なる。そのため、業務システムが生成した複数の文書ファイルの内容には、差分が存在する場合がある。したがって、事業者は、複数の文書ファイルを生成した場合、生成した複数の文書ファイル間に存在する差分を抽出するために、例えば、生成した複数の文書ファイルの比較を行う。 When a user generates a plurality of document files having different file formats from the same generation source data, all the document files need to have the same contents. However, the conversion process for generating the document file from the generation source data differs depending on the file format. Therefore, there may be a difference in the contents of a plurality of document files generated by the business system. Therefore, when a plurality of document files are generated, the business operator compares, for example, the generated plurality of document files in order to extract differences existing between the generated plurality of document files.

具体的に、事業者は、この場合、生成した複数の文書ファイルをそれぞれ出力する。そして、事業者は、例えば、画像データを比較する機能を有する装置（以下、画像比較装置とも呼ぶ）を用いて、複数の文書ファイルの出力結果を画像データとして比較する。これにより、事業者は、生成した複数の文書ファイルに存在する差分を抽出することが可能になる。 Specifically, in this case, the business operator outputs the generated plurality of document files. Then, for example, the business operator compares the output results of a plurality of document files as image data using an apparatus having a function of comparing image data (hereinafter also referred to as an image comparison apparatus). As a result, the business operator can extract differences existing in the plurality of generated document files.

しかしながら、画像比較装置は、内容が同一であっても画像データ上の位置のみが異なる部分を差分として認識する場合がある。そのため、事業者は、この場合、生成した複数の文書ファイルに存在する差分の抽出を効率的に行うことができない。 However, the image comparison apparatus may recognize a difference in only the position on the image data even if the contents are the same. Therefore, in this case, the business operator cannot efficiently extract differences existing in the generated plurality of document files.

一方、事業者は、例えば、書面等から文字情報の読み込みを行い、読み込んだ文字情報を比較する機能を有する装置（以下、情報比較装置とも呼ぶ）を用いて、複数の文書ファイルの出力結果から文字情報の読み込みを行い、読み込んだ文字情報の比較を行う場合がある。 On the other hand, for example, the business operator reads character information from a document or the like, and uses a device having a function of comparing the read character information (hereinafter also referred to as an information comparison device), from the output results of a plurality of document files. There are cases where character information is read and the read character information is compared.

しかしながら、情報比較装置は、文書ファイルの出力結果に含まれる文字情報に対して行われている文字装飾（例えば、文字色の変更）等の有無を認識することができない場合がある。そのため、事業者は、この場合についても、生成した複数の文書ファイルに存在する差分の抽出を効率的に行うことができない。 However, the information comparison apparatus may not be able to recognize the presence / absence of character decoration (for example, change of character color) performed on the character information included in the output result of the document file. Therefore, even in this case, the business operator cannot efficiently extract differences existing in the generated plurality of document files.

そこで、一つの側面では、本発明は、ファイル形式の異なる複数の文書ファイル間における差分確認の利便性向上を図ることを可能とする文書校閲支援プログラム、文書校閲支援装置及び文書校閲支援方法を提供することを目的とする。 Accordingly, in one aspect, the present invention provides a document review support program, a document review support device, and a document review support method that can improve the convenience of checking differences between a plurality of document files having different file formats. The purpose is to do.

実施の形態の一態様では、コンピュータに、文書ファイルの生成に用いられる同一の生成元データに基づいて生成された、ファイル形式の異なる複数の文書ファイルについて、文書ファイルに含まれる文書の行単位の属性情報を生成し、前記複数の文書ファイルについて、文書内の行それぞれについての前記属性情報を比較した結果を出力する、処理を実行させる。 In one embodiment of the present invention, a plurality of document files having different file formats generated based on the same generation source data used for generating a document file are stored in a computer in line units of documents included in the document file. A process of generating attribute information and outputting a result of comparing the attribute information for each line in the document for the plurality of document files is executed.

一つの側面によれば、ファイル形式の異なる複数の文書ファイル間における差分確認の利便性向上を図ることを可能とする。 According to one aspect, it is possible to improve the convenience of checking differences between a plurality of document files having different file formats.

情報処理システム１０の構成を示す図である。1 is a diagram illustrating a configuration of an information processing system 10. 文書ファイル生成処理の具体例を説明する図である。It is a figure explaining the specific example of a document file production | generation process. 情報処理装置１のハードウエア構成を示す図である。2 is a diagram illustrating a hardware configuration of the information processing apparatus 1. FIG. 情報処理装置１の機能ブロック図である。2 is a functional block diagram of the information processing apparatus 1. FIG. 第１の実施の形態における文書校閲支援処理の概略を説明するフローチャート図である。It is a flowchart figure explaining the outline of the document review assistance process in 1st Embodiment. 第１の実施の形態における文書校閲支援処理の概略を説明する図である。It is a figure explaining the outline of the document review assistance process in 1st Embodiment. 第１の実施の形態における文書校閲支援処理の詳細を説明するフローチャート図である。It is a flowchart figure explaining the detail of the document review assistance process in 1st Embodiment. 第１文書ファイル情報１３２の具体例を説明する図である。It is a figure explaining the specific example of the 1st document file information 132. FIG. 図８に示す第１文書ファイル情報１３２の出力結果を説明する図である。It is a figure explaining the output result of the 1st document file information 132 shown in FIG. 第２文書ファイル情報１３３の具体例を説明する図である。It is a figure explaining the specific example of the 2nd document file information 133. FIG. 図１０に示す第２文書ファイル情報１３３の出力結果を説明する図である。It is a figure explaining the output result of the 2nd document file information 133 shown in FIG. フォーマット情報１３４の具体例を説明する図である。It is a figure explaining the specific example of the format information. 図８で説明した第１文書ファイル情報１３２から生成された第１属性情報１３５の具体例を説明する図である。It is a figure explaining the specific example of the 1st attribute information 135 produced | generated from the 1st document file information 132 demonstrated in FIG. 図１０で説明した第２文書ファイル情報１３３から生成された第２属性情報１３６の具体例を説明する図である。It is a figure explaining the specific example of the 2nd attribute information 136 produced | generated from the 2nd document file information 133 demonstrated in FIG. 第１属性情報１３５の具体例を説明する図である。It is a figure explaining the specific example of the 1st attribute information 135. FIG. 第２属性情報１３６の具体例を説明する図である。It is a figure explaining the specific example of the 2nd attribute information 136. FIG. 比較結果情報１３７の具体例を説明する図である。It is a figure explaining the specific example of the comparison result information 137. FIG. 第１属性情報１３５の具体例を説明する図である。It is a figure explaining the specific example of the 1st attribute information 135. FIG. 第２属性情報１３６の具体例を説明する図である。It is a figure explaining the specific example of the 2nd attribute information 136. FIG. 第１属性情報１３５の具体例を説明する図である。It is a figure explaining the specific example of the 1st attribute information 135. FIG. 第２属性情報１３６の具体例を説明する図である。It is a figure explaining the specific example of the 2nd attribute information 136. FIG. 比較結果情報１３７の具体例を説明する図である。It is a figure explaining the specific example of the comparison result information 137. FIG.

［情報処理システムの構成］
図１は、情報処理システム１０の構成を示す図である。図１に示す情報処理システム１０は、情報処理装置１（以下、文書校閲支援装置１またはコンピュータ１とも呼ぶ）と、記憶部２とを有する。そして、利用者端末１１は、インターネットやイントラネット等からなるネットワークＮＷを介して情報処理装置１にアクセスすることが可能である。 [Configuration of information processing system]
FIG. 1 is a diagram illustrating a configuration of the information processing system 10. An information processing system 10 illustrated in FIG. 1 includes an information processing apparatus 1 (hereinafter also referred to as a document review support apparatus 1 or a computer 1) and a storage unit 2. The user terminal 11 can access the information processing apparatus 1 via the network NW including the Internet and an intranet.

情報処理装置１は、例えば、生成元データから各ファイル形式の文書ファイルを生成する機能を有する業務システムが構築されている。具体的に、利用者は、例えば、利用者端末１１を介して、文書ファイルの生成を行うファイル形式を指定し、業務システムに対して生成元データの入力を行う。そして、業務システムは、例えば、生成元データの入力があった場合に、その生成元データから、利用者が指定したファイル形式の文書ファイルを生成する処理（以下、文書ファイル生成処理とも呼ぶ）を実行する。 In the information processing apparatus 1, for example, a business system having a function of generating a document file of each file format from generation source data is constructed. Specifically, for example, the user designates a file format for generating a document file via the user terminal 11 and inputs the generation source data to the business system. For example, when the generation source data is input, the business system performs a process of generating a document file in a file format designated by the user from the generation source data (hereinafter also referred to as a document file generation process). Run.

また、情報処理装置１は、例えば、同一の生成元データから複数の文書ファイルを生成した場合に、生成された複数の文書ファイルの内容を比較し、差分が存在するか否かを判定する処理（以下、文書校閲支援処理とも呼ぶ）を実行する。文書校閲支援処理については後述する。 In addition, for example, when a plurality of document files are generated from the same generation source data, the information processing apparatus 1 compares the contents of the plurality of generated document files and determines whether there is a difference. (Hereinafter also referred to as document review support processing). The document review support process will be described later.

記憶部２は、利用者端末１１を介して入力された生成元データや、情報処理装置１が生成した文書ファイル等を記憶する。 The storage unit 2 stores generation source data input via the user terminal 11 and a document file generated by the information processing apparatus 1.

なお、情報処理装置１は、生成済の文書ファイルが記憶部２に記憶されている場合、文書ファイル生成処理を実行せずに、文書校閲支援処理のみを実行するものであってよい。 Note that when the generated document file is stored in the storage unit 2, the information processing apparatus 1 may execute only the document review support process without executing the document file generation process.

［文書ファイル生成処理の具体例］
次に、文書ファイル生成処理の具体例について説明を行う。図２は、文書ファイル生成処理の具体例を説明する図である。 [Specific example of document file generation processing]
Next, a specific example of the document file generation process will be described. FIG. 2 is a diagram illustrating a specific example of the document file generation process.

情報処理装置１は、例えば、利用者が利用者端末１１を介して生成元データを入力した場合、その生成元データから、予め利用者が指定したファイル形式の文書ファイルを生成する。具体的に、情報処理装置１は、図２に示すように、例えば、ＨＴＭＬ（ＨｙｐｅｒＴｅｘｔＭａｒｋｕｐＬａｎｇｕａｇｅ）形式の文書ファイル（以下、単にＨＴＭＬファイルとも呼ぶ）を生成する。また、情報処理装置１は、図２に示すように、例えば、ＰＤＦ（ＰｏｒｔａｂｌｅＤｏｃｕｍｅｎｔＦｏｒｍａｔ）形式の文書ファイル（以下、単にＰＤＦファイルとも呼ぶ）を生成する。 For example, when the user inputs generation source data via the user terminal 11, the information processing apparatus 1 generates a document file in a file format designated in advance by the user from the generation source data. Specifically, as illustrated in FIG. 2, the information processing apparatus 1 generates a document file (hereinafter, also simply referred to as an HTML file) in, for example, an HTML (Hyper Text Markup Language) format. Further, as illustrated in FIG. 2, the information processing apparatus 1 generates a document file (hereinafter, also simply referred to as a PDF file) in a PDF (Portable Document Format) format, for example.

上記のように、同一の生成元データから異なるファイル形式の文書ファイルを複数生成する場合、各文書ファイルの内容は、全て同一である必要がある。しかしながら、生成元データから文書ファイルを生成するための変換プロセスは、ファイル形式によって異なる。そのため、業務システムが生成した複数の文書ファイルの内容には、差分が含まれる可能性がある。したがって、事業者は、複数の文書ファイルを生成した場合、生成した複数の文書ファイル間に存在する差分を抽出するために、例えば、生成した複数の文書ファイルの比較を行う。 As described above, when a plurality of document files having different file formats are generated from the same generation source data, the contents of each document file must all be the same. However, the conversion process for generating the document file from the generation source data differs depending on the file format. Therefore, there is a possibility that differences are included in the contents of a plurality of document files generated by the business system. Therefore, when a plurality of document files are generated, the business operator compares, for example, the generated plurality of document files in order to extract differences existing between the generated plurality of document files.

具体的に、事業者は、この場合、生成した複数の文書ファイルをそれぞれ出力する。そして、事業者は、例えば、画像比較装置を用いて、複数の文書ファイルの出力結果を画像データとして比較する。これにより、事業者は、生成した複数の文書ファイルに存在する差分を抽出することが可能になる。 Specifically, in this case, the business operator outputs the generated plurality of document files. Then, the business operator compares the output results of a plurality of document files as image data using, for example, an image comparison device. As a result, the business operator can extract differences existing in the plurality of generated document files.

一方、事業者は、例えば、情報比較装置を用いて、複数の文書ファイルの出力結果から文字情報の読み込みを行い、読み込んだ文字情報の比較を行う場合がある。 On the other hand, for example, an operator may use an information comparison device to read character information from the output results of a plurality of document files and compare the read character information.

そこで、本実施の形態における情報処理装置１は、ファイル形式の異なる複数の文書ファイル（同一の生成元データに基づいて生成された複数の文書ファイル）について、文書ファイルに含まれる文書の属性情報を生成する。そして、情報処理装置１は、複数の文書ファイルについて、属性情報を比較した結果を出力する。 Therefore, the information processing apparatus 1 according to the present embodiment uses the attribute information of documents included in a document file for a plurality of document files having different file formats (a plurality of document files generated based on the same generation source data). Generate. Then, the information processing apparatus 1 outputs the result of comparing the attribute information for a plurality of document files.

すなわち、本実施の形態における情報処理装置１は、比較を行う複数の文書ファイルからそれぞれ抽出した情報から、同一のフォーマットからなる属性情報をそれぞれ生成する。具体的に、情報処理装置１は、例えば、文書ファイルに含まれる文字に対して文字修飾が行われているか否かを示す情報や、文書ファイルに空白文字（空白）が存在するか否かを示す情報等、事業者が事前に指定した情報（差分確認を行う必要がある情報）が含まれるように、属性情報を生成する。 That is, the information processing apparatus 1 according to the present embodiment generates attribute information having the same format from information extracted from a plurality of document files to be compared. Specifically, the information processing apparatus 1 determines, for example, information indicating whether or not character modification is performed on characters included in the document file, and whether or not a blank character (blank) exists in the document file. The attribute information is generated so as to include information designated in advance by the operator (information that needs to be checked for difference) such as information to be shown.

これにより、情報処理装置１は、各属性情報の比較を行うことで、事業者が指定した情報に関する差分が複数の文書ファイル間において存在するか否かの判定を行うことが可能になる。そのため、情報処理装置１は、複数の文書ファイル間において差分が存在するか否かの判定を容易に行うことが可能になる。したがって、情報処理装置１は、ファイル形式の異なる複数の文書ファイル間における差分確認の利便性向上を図ることを可能にする。 Thereby, the information processing apparatus 1 can determine whether or not a difference regarding information specified by the operator exists between a plurality of document files by comparing the attribute information. Therefore, the information processing apparatus 1 can easily determine whether there is a difference between a plurality of document files. Therefore, the information processing apparatus 1 can improve the convenience of checking differences between a plurality of document files having different file formats.

［情報処理装置のハードウエア構成］
次に、情報処理装置１のハードウエア構成について説明する。図３は、情報処理装置１のハードウエア構成を示す図である。 [Hardware configuration of information processing device]
Next, the hardware configuration of the information processing apparatus 1 will be described. FIG. 3 is a diagram illustrating a hardware configuration of the information processing apparatus 1.

情報処理装置１は、プロセッサであるＣＰＵ１０１と、メモリ１０２と、外部インターフェース（Ｉ／Ｏユニット）１０３と、記憶媒体１０４とを有する。各部は、バス１０５を介して互いに接続される。 The information processing apparatus 1 includes a CPU 101 that is a processor, a memory 102, an external interface (I / O unit) 103, and a storage medium 104. Each unit is connected to each other via a bus 105.

記憶媒体１０４は、例えば、記憶媒体１０４内のプログラム格納領域（図示しない）に、文書校閲支援処理等を行うためのプログラム１１０を記憶する。また、記憶媒体１０４は、例えば、文書校閲支援処理等を行う際に用いられる情報を記憶する情報格納領域１３０（以下、記憶部１３０とも呼ぶ）を有する。 The storage medium 104 stores, for example, a program 110 for performing document review support processing or the like in a program storage area (not shown) in the storage medium 104. In addition, the storage medium 104 includes an information storage area 130 (hereinafter also referred to as a storage unit 130) that stores information used when performing document review support processing or the like.

ＣＰＵ１０１は、図３に示すように、プログラム１１０の実行時に、プログラム１１０を記憶媒体１０４からメモリ１０２にロードし、プログラム１１０と協働して文書校閲支援処理等を行う。また、外部インターフェース１０３は、利用者端末１１と通信を行う。 As shown in FIG. 3, when executing the program 110, the CPU 101 loads the program 110 from the storage medium 104 to the memory 102, and performs document review support processing in cooperation with the program 110. The external interface 103 communicates with the user terminal 11.

［情報処理装置の機能］
次に、情報処理装置１の機能について説明する。図４は、情報処理装置１の機能ブロック図である。 [Functions of information processing device]
Next, functions of the information processing apparatus 1 will be described. FIG. 4 is a functional block diagram of the information processing apparatus 1.

情報処理装置１のＣＰＵ１０１は、プログラム１１０と協働することにより、例えば、文書ファイル生成部１１１と、属性情報生成部１１２（以下、単に情報生成部１１２とも呼ぶ）と、情報管理部１１３と、情報比較部１１４と、結果出力部１１５として動作する。また、情報格納領域１３０には、生成元データ１３１（以下、生成元情報１３１とも呼ぶ）と、文書ファイル１３２（以下、第１文書ファイル情報１３２とも呼ぶ）と、文書ファイル１３３（以下、第２文書ファイル情報１３３とも呼ぶ）と、フォーマット情報１３４とが記憶されている。さらに、情報格納領域１３０には、第１属性情報１３５と、第２属性情報１３６と、比較結果情報１３７とが記憶されている。情報格納領域１３０は、例えば、図１で説明した記憶部２に対応するものであってよい。 The CPU 101 of the information processing apparatus 1 cooperates with the program 110, for example, a document file generation unit 111, an attribute information generation unit 112 (hereinafter also simply referred to as an information generation unit 112), an information management unit 113, It operates as the information comparison unit 114 and the result output unit 115. Further, the information storage area 130 includes generation source data 131 (hereinafter also referred to as generation source information 131), document file 132 (hereinafter also referred to as first document file information 132), and document file 133 (hereinafter referred to as second information). Document file information 133) and format information 134 are stored. Furthermore, in the information storage area 130, first attribute information 135, second attribute information 136, and comparison result information 137 are stored. The information storage area 130 may correspond to, for example, the storage unit 2 described with reference to FIG.

第１文書ファイル情報１３２及び第２文書ファイル情報１３３は、生成元情報１３１を変換することにより生成された文書ファイルである。具体的に、第１文書ファイル情報１３２は、例えば、ＰＤＦファイルであり、第２文書ファイル情報１３３は、例えば、ＨＴＭＬファイルである。第１文書ファイル情報１３２及び第２文書ファイル情報１３３の具体例については後述する。 The first document file information 132 and the second document file information 133 are document files generated by converting the generation source information 131. Specifically, the first document file information 132 is a PDF file, for example, and the second document file information 133 is an HTML file, for example. Specific examples of the first document file information 132 and the second document file information 133 will be described later.

なお、以下、第１文書ファイル情報１３２及び第２文書ファイル情報１３３の比較を行う場合について説明を行うが、情報処理装置１は、３つ以上の異なるファイル形式の文書ファイルを比較するものであってもよい。 Hereinafter, a case where the first document file information 132 and the second document file information 133 are compared will be described. However, the information processing apparatus 1 compares document files of three or more different file formats. May be.

文書ファイル生成部１１１は、文書ファイル生成処理を実行する。すなわち、文書ファイル生成部１１１は、例えば、利用者端末１１から生成元情報１３１の入力があった場合に、入力された生成元情報１３１から、第１文書ファイル情報１３２及び第２文書ファイル情報１３３を生成する。 The document file generation unit 111 executes document file generation processing. That is, for example, when the generation source information 131 is input from the user terminal 11, the document file generation unit 111 receives the first document file information 132 and the second document file information 133 from the input generation source information 131. Is generated.

なお、文書ファイル生成部１１１は、例えば、所定のタイミングになったときに、予め情報格納領域１３０に記憶された生成元情報１３１を取得し、第１文書ファイル情報１３２及び第２文書ファイル情報１３３を生成するものであってもよい。 For example, the document file generation unit 111 acquires the generation source information 131 stored in advance in the information storage area 130 at a predetermined timing, and the first document file information 132 and the second document file information 133 are acquired. May be generated.

属性情報生成部１１２は、フォーマット情報１３４に従って、第１文書ファイル情報１３２から第１属性情報１３５を生成する。また、属性情報生成部１１２は、フォーマット情報１３４に従って、第２文書ファイル情報１３３から第２属性情報１３６を生成する。フォーマット情報１３４、第１属性情報１３５及び第２属性情報１３６の具体例については後述する。 The attribute information generation unit 112 generates the first attribute information 135 from the first document file information 132 according to the format information 134. Further, the attribute information generation unit 112 generates the second attribute information 136 from the second document file information 133 according to the format information 134. Specific examples of the format information 134, the first attribute information 135, and the second attribute information 136 will be described later.

情報管理部１１３は、利用者端末１１から生成元情報１３１が入力された場合に、入力された生成元情報１３１を情報格納領域１３０に記憶する。また、情報管理部１１３は、文書ファイル生成部１１１が第１文書ファイル情報１３２または第２文書ファイル情報１３３を生成した場合に、生成された第１文書ファイル情報１３２または第２文書ファイル情報１３３を情報格納領域１３０に記憶する。さらに、情報管理部１１３は、属性情報生成部１１２が第１属性情報１３５または第２属性情報１３６を生成した場合に、生成された第１属性情報１３５または第２属性情報１３６を情報格納領域１３０に記憶する。 When the generation source information 131 is input from the user terminal 11, the information management unit 113 stores the input generation source information 131 in the information storage area 130. Further, when the document file generation unit 111 generates the first document file information 132 or the second document file information 133, the information management unit 113 stores the generated first document file information 132 or the second document file information 133. The information is stored in the information storage area 130. Further, when the attribute information generation unit 112 generates the first attribute information 135 or the second attribute information 136, the information management unit 113 stores the generated first attribute information 135 or the second attribute information 136 in the information storage area 130. To remember.

情報比較部１１４は、属性情報生成部１１２が生成した第１属性情報１３５及び第２属性情報１３６を比較し、比較結果情報１３７を生成する。そして、情報管理部１１３は、この場合、生成された比較結果情報１３７を情報格納領域１３０に記憶する。 The information comparison unit 114 compares the first attribute information 135 and the second attribute information 136 generated by the attribute information generation unit 112, and generates comparison result information 137. In this case, the information management unit 113 stores the generated comparison result information 137 in the information storage area 130.

結果出力部１１５は、情報比較部１１４が生成した比較結果情報１３７を出力する。結果出力部１１５は、例えば、比較結果情報１３７を利用者端末１１に出力する。 The result output unit 115 outputs the comparison result information 137 generated by the information comparison unit 114. For example, the result output unit 115 outputs the comparison result information 137 to the user terminal 11.

［第１の実施の形態］
次に、第１の実施の形態について説明する。図５は、第１の実施の形態における文書校閲支援処理の概略を説明するフローチャート図である。また、図６は、第１の実施の形態における文書校閲支援処理の概略を説明する図である。図６を参照しながら図５の文書校閲支援処理の概略について説明する。 [First Embodiment]
Next, a first embodiment will be described. FIG. 5 is a flowchart for explaining the outline of the document review support process in the first embodiment. FIG. 6 is a diagram for explaining the outline of the document review support process in the first embodiment. The outline of the document review support process of FIG. 5 will be described with reference to FIG.

情報処理装置１は、図５に示すように、第１文書ファイル情報１３２及び第２文書ファイル情報１３３の比較を行うタイミング（以下、文書校閲タイミングとも呼ぶ）になるまで待機する（Ｓ１のＮＯ）。文書校閲タイミングは、例えば、情報処理装置１が文書ファイル生成処理を実行し、第１文書ファイル情報１３２及び第２文書ファイル情報１３３が生成した直後のタイミングであってよい。また、文書校閲タイミングは、例えば、第１文書ファイル情報１３２及び第２文書ファイル情報１３３の比較を行う旨の入力が利用者端末１１からあったタイミングであってよい。 As shown in FIG. 5, the information processing apparatus 1 waits until the timing for comparing the first document file information 132 and the second document file information 133 (hereinafter also referred to as document review timing) (NO in S1). . The document review timing may be, for example, a timing immediately after the information processing apparatus 1 executes the document file generation process and the first document file information 132 and the second document file information 133 are generated. The document review timing may be, for example, a timing at which an input for comparing the first document file information 132 and the second document file information 133 is received from the user terminal 11.

そして、文書校閲タイミングになった場合（Ｓ１のＹＥＳ）、情報処理装置１は、図６に示すように、同一の生成元情報１３１に基づいて生成された第１文書ファイル情報１３２及び第２文書ファイル情報１３３について、文書ファイルに含まれる文書の行単位の第１属性情報１３５及び第２属性情報１３６をそれぞれ生成する（Ｓ２）。 When the document review timing is reached (YES in S1), the information processing apparatus 1, as shown in FIG. 6, the first document file information 132 and the second document generated based on the same generation source information 131 For the file information 133, first attribute information 135 and second attribute information 136 for each line of the document included in the document file are generated (S2).

具体的に、情報処理装置１は、情報格納領域１３０に記憶されたフォーマット情報１３４に従って、情報格納領域１３０に記憶された第１文書ファイル情報１３２を第１属性情報１３５に変換する。また、情報処理装置１は、情報格納領域１３０に記憶されたフォーマット情報１３４に従って、情報格納領域１３０に記憶された第２文書ファイル情報１３３を第２属性情報１３６に変換する。 Specifically, the information processing apparatus 1 converts the first document file information 132 stored in the information storage area 130 into first attribute information 135 according to the format information 134 stored in the information storage area 130. Further, the information processing apparatus 1 converts the second document file information 133 stored in the information storage area 130 into the second attribute information 136 according to the format information 134 stored in the information storage area 130.

すなわち、情報処理装置１は、ファイル形式の異なる文書ファイルである第１文書ファイル情報１３２及び第２文書ファイル情報１３３から、事業者が差分確認を行うために指定した情報を含む第１属性情報１３５及び第２属性情報１３６を生成する。 That is, the information processing apparatus 1 uses the first attribute information 135 including information designated by the operator to confirm the difference from the first document file information 132 and the second document file information 133 which are document files having different file formats. And second attribute information 136 is generated.

これにより、情報処理装置１は、第１属性情報１３５及び第２属性情報１３６の比較を行うことで、事業者が指定した情報に関する差分が第１文書ファイル情報１３２及び第２文書ファイル情報１３３の間に存在するか否かの判定を行うことが可能になる。そのため、事業者は、第１文書ファイル情報１３２及び第２文書ファイル情報１３３の間に差分が存在するか否かを容易に判定することが可能になる。 As a result, the information processing apparatus 1 compares the first attribute information 135 and the second attribute information 136 so that the difference regarding the information specified by the operator is the first document file information 132 and the second document file information 133. It is possible to determine whether or not there exists between them. Therefore, the business operator can easily determine whether or not there is a difference between the first document file information 132 and the second document file information 133.

また、情報処理装置１は、第１属性情報１３５及び第２属性情報１３６を、文書ファイルに含まれる文書の行単位に生成する。これにより、情報処理装置１は、各文書ファイルの出力結果に含まれる改行の数が異なる等、出力結果の形式が異なる場合であっても、文書の内容に関する差分のみを抽出することが可能になる。 In addition, the information processing apparatus 1 generates the first attribute information 135 and the second attribute information 136 for each line of the document included in the document file. As a result, the information processing apparatus 1 can extract only the difference regarding the content of the document even when the output result format is different, for example, the number of line breaks included in the output result of each document file is different. Become.

その後、情報処理装置１は、図６に示すように、第１文書ファイル情報１３２及び第２文書ファイル情報１３３について、第１属性情報１３５及び第２属性情報１３６を比較する（Ｓ３）。具体的に、情報処理装置１は、第１文書ファイル情報１３２及び第２文書ファイル情報１３３について、文書内の行それぞれについての属性情報（第１属性情報１３５及び第２属性情報１３６）を比較する。そして、情報処理装置１は、Ｓ３の処理で比較した結果を利用者端末１１に出力する（Ｓ４）。 Thereafter, the information processing apparatus 1 compares the first attribute information 135 and the second attribute information 136 for the first document file information 132 and the second document file information 133, as shown in FIG. 6 (S3). Specifically, the information processing apparatus 1 compares the attribute information (first attribute information 135 and second attribute information 136) for each line in the document for the first document file information 132 and the second document file information 133. . And the information processing apparatus 1 outputs the result compared by the process of S3 to the user terminal 11 (S4).

このように、情報処理装置１は、同一の生成元情報１３１に基づいて生成されたファイル形式の異なる複数の文書ファイル（第１文書ファイル情報１３２及び第２文書ファイル情報１３３）について、例えば、各文書ファイルに含まれる文書の行単位の属性情報（第１属性情報１３５及び第２属性情報１３６）を生成する。そして、情報処理装置１は、複数の文書ファイルについて、文書内の行毎に各属性情報を比較した結果を出力する。 In this way, the information processing apparatus 1 uses, for example, each of a plurality of document files (first document file information 132 and second document file information 133) having different file formats generated based on the same generation source information 131. Attribute information (first attribute information 135 and second attribute information 136) in units of lines of the document included in the document file is generated. Then, the information processing apparatus 1 outputs a result of comparing the attribute information for each line in the document for a plurality of document files.

［第１の実施の形態の詳細］
次に、第１の実施の形態の詳細について説明する。図７は、第１の実施の形態における文書校閲支援処理の詳細を説明するフローチャート図である。また、図８から図２２は、第１の実施の形態における文書校閲支援処理の詳細を説明する図である。図８から図２２を参照しながら、図７の文書校閲支援処理の詳細を説明する。なお、以下、第１文書ファイル情報１３２がＰＤＦファイルであり、第２文書ファイル情報１３３がＨＴＭＬファイルであるものとして説明を行う。 [Details of First Embodiment]
Next, details of the first embodiment will be described. FIG. 7 is a flowchart for explaining the details of the document review support processing in the first embodiment. FIGS. 8 to 22 are diagrams for explaining the details of the document review support process in the first embodiment. Details of the document review support process of FIG. 7 will be described with reference to FIGS. In the following description, it is assumed that the first document file information 132 is a PDF file and the second document file information 133 is an HTML file.

情報処理装置１の属性情報生成部１１２は、図７に示すように、文書校閲タイミングになるまで待機する（Ｓ１１のＮＯ）。そして、文書校閲タイミングになった場合（Ｓ１１のＹＥＳ）、属性情報生成部１１２は、情報格納領域１３０に記憶されている第１文書ファイル情報１３２、第２文書ファイル情報１３３及びフォーマット情報１３４を取得する（Ｓ１２）。以下、第１文書ファイル情報１３２、第２文書ファイル情報１３３及びフォーマット情報１３４の具体例について説明を行う。 As shown in FIG. 7, the attribute information generation unit 112 of the information processing apparatus 1 waits until the document review timing comes (NO in S11). When the document review timing comes (YES in S11), the attribute information generation unit 112 acquires the first document file information 132, the second document file information 133, and the format information 134 stored in the information storage area 130. (S12). Hereinafter, specific examples of the first document file information 132, the second document file information 133, and the format information 134 will be described.

［第１文書ファイルの具体例］
初めに、第１文書ファイル情報１３２の具体例について説明を行う。図８は、第１文書ファイル情報１３２の具体例を説明する図である。また、図９は、図８に示す第１文書ファイル情報１３２の出力結果を説明する図である。なお、図８に示す第１文書ファイル情報１３２は、図９に示す文書を利用者端末１１等に表示するためのＰＤＦファイルである。 [Specific example of the first document file]
First, a specific example of the first document file information 132 will be described. FIG. 8 is a diagram for explaining a specific example of the first document file information 132. FIG. 9 is a diagram for explaining the output result of the first document file information 132 shown in FIG. Note that the first document file information 132 shown in FIG. 8 is a PDF file for displaying the document shown in FIG. 9 on the user terminal 11 or the like.

図８に示す第１文書ファイル情報１３２には、利用者端末１１等に出力する際に、１行目として出力される文書を定義する部分情報１３２ａと、２行目として出力される文書を定義する部分情報１３２ｂと、３行目として出力される文書を定義する部分情報１３２ｃとが含まれる。なお、図８では、第１文書ファイル情報１３２に含まれるヘッダやトレーラ等の情報の記載を省略している。 The first document file information 132 shown in FIG. 8 defines partial information 132a that defines a document that is output as the first line and a document that is output as the second line when output to the user terminal 11 or the like. Partial information 132b to be output, and partial information 132c defining a document to be output as the third line are included. In FIG. 8, description of information such as a header and a trailer included in the first document file information 132 is omitted.

具体的に、図８に示す例において、「BT」は、出力する文書を定義する部分であるテキストオブジェクトの開始位置を示している。そして、「/F1 24 Tf」は、Ｆ１によって識別されるフォントを使用し、２４ポイントの大きさで文書を出力する旨を示している。また、「ET」は、テキストオブジェクトの終了位置を示している。 Specifically, in the example shown in FIG. 8, “BT” indicates the start position of a text object that is a part that defines a document to be output. “/ F1 24 Tf” indicates that the font identified by F1 is used and the document is output in a size of 24 points. “ET” indicates the end position of the text object.

次に、図８に示す例において、部分情報１３２ａに含まれる「1 0 0 1 0 648 Tm」は、座標（０，６４８）から文書の出力を行う旨を示している。そして、部分情報１３２ａに含まれる「( Hello World) Tj」は、図９に示すように、「 Hello World」を出力する旨を示している。なお、「 Hello World」の「H」の前には、半角２文字分の空白が存在する。 Next, in the example shown in FIG. 8, “1 0 0 1 0 648 Tm” included in the partial information 132a indicates that the document is output from the coordinates (0,648). Then, “(Hello World) Tj” included in the partial information 132a indicates that “Hello World” is output as shown in FIG. Note that there is a space of two characters before “H” in “Hello World”.

続いて、図８に示す例において、部分情報１３２ｂに含まれる「1 0 0 1 72 612 Tm」は、座標（７２，６１２）から文書の出力を行う旨を示している。そして、部分情報１３２ｂに含まれる「<20204D53835383568362834E3234> Tj」は、図９に示すように、「 MSゴシック24」を出力する旨を示している。なお、「 MSゴシック24」の「M」の前には、半角２文字分の空白が存在する。 Subsequently, in the example shown in FIG. 8, “1 0 0 1 72 612 Tm” included in the partial information 132b indicates that the document is output from the coordinates (72, 612). Then, “<20204D53835383568362834E3234> Tj” included in the partial information 132b indicates that “MS Gothic 24” is output, as shown in FIG. Note that there is a space of two characters before “M” in “MS Gothic 24”.

また、図８に示す例において、部分情報１３２ｂに含まれる「0.5 0 0 rg」は、赤色の文字の出力を開始する旨を示している。そして、部分情報１３２ｂに含まれる「<837C834383938367> Tj」は、図９に示すように、「ポイント」を出力する旨を示している。 In the example shown in FIG. 8, “0.5 0 0 rg” included in the partial information 132b indicates that output of red characters is started. Then, “<837C834383938367> Tj” included in the partial information 132b indicates that “point” is output as shown in FIG.

さらに、図８に示す例において、部分情報１３２ｃに含まれる「1 0 0 1 72 576 Tm」は、座標（７２，５７６）から文書の出力を行う旨を示している。また、図８に示す例において、部分情報１３２ｃに含まれる「0 g」は、黒色の文字の出力を開始する旨を示している。そして、部分情報１３２ｃに含まれる「<82BB82EA82F08A44904682C982B582BD82E082CC> Tj」は、図９に示すように、「それを灰色にしたもの」を出力する旨を示している。 Further, in the example shown in FIG. 8, “1 0 0 1 72 576 Tm” included in the partial information 132c indicates that the document is output from the coordinates (72,576). In the example shown in FIG. 8, “0 g” included in the partial information 132c indicates that the output of black characters is started. Then, “<82BB82EA82F08A44904682C982B582BD82E082CC> Tj” included in the partial information 132c indicates that “what is grayed out” is output as shown in FIG.

［第２文書ファイルの具体例］
次に、第２文書ファイル情報１３３の具体例について説明を行う。図１０は、第２文書ファイル情報１３３の具体例を説明する図である。また、図１１は、図１０に示す第２文書ファイル情報１３３の出力結果を説明する図である。なお、図１０に示す第２文書ファイル情報１３３は、図１１に示す文書を利用者端末１１等に出力するためのＨＴＭＬファイルである。 [Specific example of the second document file]
Next, a specific example of the second document file information 133 will be described. FIG. 10 is a diagram for explaining a specific example of the second document file information 133. FIG. 11 is a diagram for explaining the output result of the second document file information 133 shown in FIG. Note that the second document file information 133 illustrated in FIG. 10 is an HTML file for outputting the document illustrated in FIG. 11 to the user terminal 11 or the like.

図１０に示す第２文書ファイル情報１３３には、利用者端末１１等に出力する際に、１行目として出力される文書を定義する部分情報１３３ａと、２行目として出力される文書を定義する部分情報１３３ｂと、３行目として出力される文書を定義する部分情報１３３ｃとが含まれる。なお、図１０では、第２文書ファイル情報１３３に含まれるｂｏｄｙ部の情報についてのみ記載している。 The second document file information 133 shown in FIG. 10 defines partial information 133a that defines a document that is output as the first line and a document that is output as the second line when output to the user terminal 11 or the like. Partial information 133b to be output and partial information 133c defining a document to be output as the third line are included. In FIG. 10, only the information of the body part included in the second document file information 133 is described.

具体的に、図１０に示す例において、部分情報１３３ａは、２４ポイントの空白（半角２文字分）と、フォントがＭＳゴシックである２４ポイントの「Hello World」とを出力する旨を示している。 Specifically, in the example illustrated in FIG. 10, the partial information 133 a indicates that 24 points of white space (for two characters) and 24 points of “Hello World” whose font is MS Gothic are output. .

続いて、図１０に示す例において、部分情報１３３ｂは、２４ポイントの空白（半角２文字分）と、半角６文字分の空白（「&nbsp」によって指定された半角５文字分の空白と、「&nbsp」の後のスペースによって指定された半角１文字分の空白）とを出力する旨を示している。 Subsequently, in the example illustrated in FIG. 10, the partial information 133 b includes 24 points of space (2 characters), 6 spaces (5 spaces specified by “& nbsp”), “ "& nbsp" followed by one space specified by the space).

また、部分情報１３３ｂは、フォントがＭＳゴシックである２４ポイントの「MS」、「ゴシック」及び「24」を出力する旨を示している。そして、部分情報１３３ｂは、フォントがＭＳゴシックであって２４ポイントの赤色の文字である「ポイント」を出力する旨を示している。 Further, the partial information 133b indicates that “MS”, “Gothic”, and “24” of 24 points whose font is MS Gothic are output. The partial information 133b indicates that the font is MS Gothic and that “point”, which is a 24-point red character, is output.

さらに、図１０に示す例において、部分情報１３３ｃは、２４ポイントの空白（半角２文字分）と、半角４文字分の空白（「&nbsp」によって指定された半角３文字分の空白と、「&nbsp」の後のスペースによって指定された半角１文字分の空白）とを出力する旨を示している。そして、部分情報１３３ｃは、フォントがＭＳゴシックである２４ポイントの「それを灰色にしたもの」を出力する旨を示している。図１０に示す他の情報については説明を省略する。 Further, in the example shown in FIG. 10, the partial information 133 c includes 24 points of space (2 characters), 4 characters of space (spaces of 3 characters specified by “& nbsp”, “& nbsp” ", A space of one half-width character designated by the space after") is output. Then, the partial information 133c indicates that 24 point “what is made gray” is output in which the font is MS Gothic. Description of other information shown in FIG. 10 is omitted.

［フォーマット情報の具体例］
次に、フォーマット情報１３４の具体例について説明を行う。図１２は、フォーマット情報１３４の具体例を説明する図である。 [Specific examples of format information]
Next, a specific example of the format information 134 will be described. FIG. 12 is a diagram for explaining a specific example of the format information 134.

図１２に示すフォーマット情報１３４は、第１文書ファイル情報１３２または第２文書ファイル情報１３３の出力結果におけるページ数を示す「ページ数」を含む。また、図１２に示すフォーマット情報１３４は、第１文書ファイル情報１３２または第２文書ファイル情報１３３の出力結果における行数を示す「行数」を含む。 The format information 134 shown in FIG. 12 includes “page number” indicating the number of pages in the output result of the first document file information 132 or the second document file information 133. 12 includes “number of lines” indicating the number of lines in the output result of the first document file information 132 or the second document file information 133.

そして、図１２に示すフォーマット情報１３４は、第１文書ファイル情報１３２または第２文書ファイル情報１３３の出力結果に含まれる各行において文字の出力が開始される位置を示す「文字開始列」と、各行において文字が出力される文字数を示す「全体文字数」とを含む。また、図１２に示すフォーマット情報１３４は、各行の文字について文字修飾が行われている回数を示す「修飾数」を含む。文字修飾は、例えば、文字色の変更、文字の太文字化、文字の斜体化及び文字の下線追加等を含むものであってよい。 The format information 134 shown in FIG. 12 includes a “character start string” indicating a position where character output is started in each line included in the output result of the first document file information 132 or the second document file information 133, and each line. And the “total number of characters” indicating the number of characters to be output. Further, the format information 134 shown in FIG. 12 includes “modification number” indicating the number of times character modification is performed on the characters in each line. The character modification may include, for example, changing the character color, making the character bold, making the character italic, and adding a character underline.

さらに、図１２に示すフォーマット情報１３４は、各行において文字修飾された文字の表示が開始される位置を示す「修飾開始列」を、「修飾数」が示す数（図１２に示す例では「ｎ」）だけ含む。また、図１２に示すフォーマット情報１３４は、各行において文字修飾された文字が表示される文字数を示す「修飾文字数」を、「修飾数」が示す数（図１２に示す例では「ｎ」）だけ含む。そして、図１２に示すフォーマット情報１３４は、フォーマット情報１３４に含まれる各情報の終端を示す情報が設定される「ＥＮＤコード」を含む。 Further, the format information 134 shown in FIG. 12 includes a “modification start column” indicating a position where display of a character-modified character in each line is started, a number indicated by “qualification number” (in the example shown in FIG. 12, “n” ]) Only. Further, the format information 134 shown in FIG. 12 includes the “number of modified characters” indicating the number of characters displayed with the characters modified in each line, by the number indicated by the “qualified number” (“n” in the example shown in FIG. 12). Including. The format information 134 shown in FIG. 12 includes an “END code” in which information indicating the end of each information included in the format information 134 is set.

図７に戻り、属性情報生成部１１２は、Ｓ１２の処理で取得したフォーマット情報１３４に従って、Ｓ１２の処理で取得した第１文書ファイル情報１３２から第１属性情報１３５を生成する（Ｓ１３）。また、属性情報生成部１１２は、Ｓ１２の処理で取得したフォーマット情報１３４に従って、Ｓ１２の処理で取得した第２文書ファイル情報１３３から第２属性情報１３６を生成する（Ｓ１４）。以下、第１属性情報１３５及び第２属性情報１３６の具体例について説明を行う。 Returning to FIG. 7, the attribute information generation unit 112 generates the first attribute information 135 from the first document file information 132 acquired in the process of S12 according to the format information 134 acquired in the process of S12 (S13). Further, the attribute information generation unit 112 generates the second attribute information 136 from the second document file information 133 acquired in the process of S12 according to the format information 134 acquired in the process of S12 (S14). Hereinafter, specific examples of the first attribute information 135 and the second attribute information 136 will be described.

［第１属性情報の具体例］
図１３は、図８で説明した第１文書ファイル情報１３２から生成された第１属性情報１３５の具体例を説明する図である。図１３に示す第１属性情報１３５は、図１２で説明したフォーマット情報１３４に含まれる各情報を項目として有している。なお、図１３に示す第１属性情報１３５は、説明の便宜上、第１属性情報１３５に含まれる各情報を識別する「項番」を項目として有している。 [Specific example of first attribute information]
FIG. 13 is a diagram illustrating a specific example of the first attribute information 135 generated from the first document file information 132 described with reference to FIG. The first attribute information 135 illustrated in FIG. 13 includes items of information included in the format information 134 described with reference to FIG. Note that the first attribute information 135 illustrated in FIG. 13 has “item number” as an item for identifying each piece of information included in the first attribute information 135 for convenience of explanation.

具体的に、図８に示す第１文書ファイル情報１３２の部分情報１３２ａは、「Hello world」の先頭に、半角２文字分の空白が存在する旨の情報を含んでいる。そのため、この場合、「Hello world」の出力の開始位置は、行の先頭から半角３文字目である。 Specifically, the partial information 132a of the first document file information 132 shown in FIG. 8 includes information indicating that a space of two half-width characters exists at the head of “Hello world”. Therefore, in this case, the output start position of “Hello world” is the third character from the beginning of the line.

また、「Hello world」は、半角１１文字からなる。そのため、「Hello world」は、行の先頭から半角３文字目から半角１３文字目までに出力される。したがって、属性情報生成部１１２は、図１３に示すように、「ページ数」が「１」であって「行数」が「１」である情報の「文字開始列」に「３」を設定し、「全体文字数」に「１３」を設定する。 “Hello world” consists of 11 single-byte characters. Therefore, “Hello world” is output from the third half-width character to the 13th half-width character from the beginning of the line. Therefore, as shown in FIG. 13, the attribute information generation unit 112 sets “3” to “character start string” of the information where “number of pages” is “1” and “number of lines” is “1”. Then, “13” is set in the “total number of characters”.

そして、図８に示す第１文書ファイル情報１３２の部分情報１３２ａには、文字修飾についての情報が含まれていない。そのため、属性情報生成部１１２は、図１３に示すように、「ページ数」が「１」であって「行数」が「１」である情報の「修飾数」、「修飾開始列」及び「修飾文字数」に、情報が存在しないことを示す「−」を設定する。さらに、属性情報生成部１１２は、例えば、図１３に示すように、「ＥＮＤコード」に、各行に設定される情報の終端を示す「ＥＮＤ」を設定する。 The partial information 132a of the first document file information 132 shown in FIG. 8 does not include information on character modification. Therefore, the attribute information generation unit 112, as shown in FIG. 13, the “modification number”, “modification start column” and “modification start column” of the information whose “number of pages” is “1” and “number of rows” is “1”. “-” Indicating that no information exists is set in “number of qualified characters”. Further, for example, as illustrated in FIG. 13, the attribute information generation unit 112 sets “END” indicating the end of information set in each row in the “END code”.

次に、図８に示す第１文書ファイル情報１３２の部分情報１３２ｂは、文書の出力の開始が座標（７２，６１２）から行われる旨の情報を含んでいる。また、図８に示す第１文書ファイル情報１３２において、文字のフォントは２４ポイントである。そのため、属性情報生成部１１２は、Ｘ座標である７２をフォントの２４で除算することにより、文書の出力が開始される位置の前には、全角３文字分（半角６文字分）の空白が存在するものと判定する。そして、「MSゴシック24ポイント」の文字列の先頭には、半角２文字分の空白が存在する。すなわち、「MSゴシック24ポイント」の前には、半角８文字分の空白が存在する。 Next, the partial information 132b of the first document file information 132 shown in FIG. 8 includes information indicating that the output of the document is started from the coordinates (72, 612). In the first document file information 132 shown in FIG. 8, the character font is 24 points. Therefore, the attribute information generation unit 112 divides 72, which is the X coordinate, by 24 of the font, so that a space of 3 full-width characters (6 half-width characters) is left before the position where the document output is started. It is determined that it exists. And, there is a space for two characters at the beginning of the character string of “MS Gothic 24 points”. In other words, there is a space of 8 half-width characters before “MS Gothic 24 points”.

また、「MSゴシック24ポイント」は、半角２０文字からなる。そのため、「MSゴシック24ポイント」は、行の先頭から半角９文字目から半角２８文字目までに出力される。したがって、属性情報生成部１１２は、図１３に示すように、「ページ数」が「１」であって「行数」が「２」である情報の「文字開始列」に「９」を設定し、「全体文字数」に「２８」を設定する。 “MS Gothic 24-point” consists of 20 single-byte characters. Therefore, “MS Gothic 24-point” is output from the 9th character to the 28th character from the beginning of the line. Therefore, as shown in FIG. 13, the attribute information generation unit 112 sets “9” in the “character start string” of the information where “number of pages” is “1” and “number of lines” is “2”. Then, “28” is set in the “total number of characters”.

そして、図８に示す第１文書ファイル情報１３２の部分情報１３２ｂには、「ポイント」が赤色の文字である旨の情報が含まれている。そのため、属性情報生成部１１２は、図１３に示すように、「修飾数」に「１」を設定し、「修飾開始列」に「ポイント」の文字開始列である「２１」を設定し、「修飾文字数」に「ポイント」の文字数である「８」を設定する。さらに、属性情報生成部１１２は、例えば、「ＥＮＤコード」に「ＥＮＤ」を設定する。 The partial information 132b of the first document file information 132 shown in FIG. 8 includes information indicating that “point” is a red character. Therefore, as shown in FIG. 13, the attribute information generation unit 112 sets “1” to “modification number”, sets “21” that is the character start string of “point” to “modification start string”, and “8”, which is the number of characters of “point”, is set in “number of qualified characters”. Furthermore, the attribute information generation unit 112 sets “END” to “END code”, for example.

続いて、図８に示す第１文書ファイル情報１３２の部分情報１３２ｃにおいて、文書の出力の開始が座標（７２，５７６）から行われる旨の情報を含んでいる。また、図８に示す第１文書ファイル情報１３２において、文字のフォントは２４ポイントである。そのため、属性情報生成部１１２は、Ｘ座標である７２をフォントの２４で除算することにより、「それを灰色にしたもの」の前には、全角３文字分（半角６文字分）の空白が存在するものと判定する。 Subsequently, the partial information 132c of the first document file information 132 shown in FIG. 8 includes information indicating that the output of the document is started from the coordinates (72, 576). In the first document file information 132 shown in FIG. 8, the character font is 24 points. Therefore, the attribute information generation unit 112 divides 72 which is the X coordinate by 24 of the font so that a space of 3 full-width characters (6 half-width characters) is preceded by “the grayed-out”. It is determined that it exists.

そして、「それを灰色にしたもの」は、半角２０文字からなる。そのため、「それを灰色にしたもの」は、行の先頭から半角７文字目から半角２６文字目までに出力される。そのため、属性情報生成部１１２は、図１３に示すように、「ページ数」が「１」であって「行数」が「３」である情報の「文字開始列」に「７」を設定し、「全体文字数」に「２６」を設定する。 And “what made it gray” consists of 20 half-width characters. For this reason, “the color in gray” is output from the 7th half-width character to the 26th half-width character from the beginning of the line. Therefore, as shown in FIG. 13, the attribute information generation unit 112 sets “7” in the “character start string” of the information where the “number of pages” is “1” and the “number of lines” is “3”. Then, “26” is set in the “total number of characters”.

さらに、図８に示す第１文書ファイル情報１３２の部分情報１３２ｃには、文字修飾についての情報が含まれていない。そのため、属性情報生成部１１２は、図１３に示すように、「ページ数」が「１」であって「行数」が「３」である情報の「修飾数」、「修飾開始列」及び「修飾文字数」に「−」を設定する。さらに、属性情報生成部１１２は、例えば、図１３に示すように、「ＥＮＤコード」に「ＥＮＤ」を設定する。 Further, the partial information 132c of the first document file information 132 shown in FIG. 8 does not include information on character modification. Therefore, the attribute information generation unit 112, as shown in FIG. 13, the “modification number”, “modification start column”, and “modification start column” of the information where “number of pages” is “1” and “number of rows” is “3” Set “-” to “Number of modifiers”. Further, the attribute information generation unit 112 sets “END” to “END code” as illustrated in FIG. 13, for example.

［第２属性の具体例］
次に、第２属性情報１３６の具体例について説明を行う。図１４は、図１０で説明した第２文書ファイル情報１３３から生成された第２属性情報１３６の具体例を説明する図である。図１４に示す第２属性情報１３６は、図１２で説明したフォーマット情報１３４に含まれる各情報を項目として有している。なお、図１４に示す第２属性情報１３６は、説明の便宜上、図１３で説明した場合と同様に、第２属性情報１３６に含まれる各情報を識別する「項番」を項目として有している。 [Specific example of the second attribute]
Next, a specific example of the second attribute information 136 will be described. FIG. 14 is a diagram illustrating a specific example of the second attribute information 136 generated from the second document file information 133 described with reference to FIG. The second attribute information 136 illustrated in FIG. 14 includes items of information included in the format information 134 described with reference to FIG. The second attribute information 136 shown in FIG. 14 has “item number” as an item for identifying each piece of information included in the second attribute information 136, as in the case described with reference to FIG. Yes.

具体的に、図１０に示す第２文書ファイル情報１３３の部分情報１３３ａは、「Hello world」の先頭に、半角２文字分の空白が存在する旨の情報を含んでいる。そのため、この場合、「Hello world」の出力の開始位置は、行の先頭から半角３文字目である。 Specifically, the partial information 133a of the second document file information 133 illustrated in FIG. 10 includes information indicating that a space of two half-width characters exists at the head of “Hello world”. Therefore, in this case, the output start position of “Hello world” is the third character from the beginning of the line.

また、「Hello world」は、半角１１文字からなる。そのため、「Hello world」は、行の先頭から半角３文字目から半角１３文字目までに出力される。したがって、属性情報生成部１１２は、図１４に示すように、「ページ数」が「１」であって「行数」が「１」である情報の「文字開始列」に「３」を設定し、「全体文字数」に「１３」を設定する。 “Hello world” consists of 11 single-byte characters. Therefore, “Hello world” is output from the third half-width character to the 13th half-width character from the beginning of the line. Therefore, as shown in FIG. 14, the attribute information generation unit 112 sets “3” to “character start string” of the information where “number of pages” is “1” and “number of lines” is “1”. Then, “13” is set in the “total number of characters”.

そして、図１０に示す第２文書ファイル情報１３３の部分情報１３３ａには、文字修飾についての情報が含まれていない。そのため、属性情報生成部１１２は、図１４に示すように、「ページ数」が「１」であって「行数」が「１」である情報の「修飾数」、「修飾開始列」及び「修飾文字数」に「−」を設定する。さらに、属性情報生成部１１２は、例えば、図１４に示すように、「ＥＮＤコード」に「ＥＮＤ」を設定する。 The partial information 133a of the second document file information 133 shown in FIG. 10 does not include information on character modification. Therefore, as shown in FIG. 14, the attribute information generation unit 112 includes “modification number”, “modification start column”, and “modification start column” of information in which “number of pages” is “1” and “number of rows” is “1”. Set “-” to “Number of modifiers”. Further, the attribute information generation unit 112 sets “END” to “END code” as illustrated in FIG. 14, for example.

次に、図１０に示す第２文書ファイル情報１３３の部分情報１３３ｂは、「MSゴシック24ポイント」の前に、半角８文字分の空白が存在する旨の情報を含んでいる。また、「MSゴシック24ポイント」は、半角２０文字からなる。そのため、「MSゴシック24ポイント」は、行の先頭から半角９文字目から半角２８文字目までに出力される。したがって、属性情報生成部１１２は、図１４に示すように、「ページ数」が「１」であって「行数」が「２」である情報の「文字開始列」に「９」を設定し、「全体文字数」に「２８」を設定する。 Next, the partial information 133b of the second document file information 133 shown in FIG. 10 includes information indicating that there is a space of 8 half-width characters before “MS Gothic 24 points”. “MS Gothic 24-point” consists of 20 single-byte characters. Therefore, “MS Gothic 24-point” is output from the 9th character to the 28th character from the beginning of the line. Accordingly, as shown in FIG. 14, the attribute information generation unit 112 sets “9” in the “character start string” of the information where “number of pages” is “1” and “number of lines” is “2”. Then, “28” is set in the “total number of characters”.

そして、図１０に示す第２文書ファイル情報１３３の部分情報１３３ｂには、「ポイント」が赤色の文字である旨の情報が含まれている。そのため、属性情報生成部１１２は、図１４に示すように、「修飾数」に「１」を設定し、「修飾開始列」に「ポイント」の文字開始列である「２１」を設定し、「修飾文字数」に「ポイント」の文字数である「８」を設定する。さらに、属性情報生成部１１２は、例えば、「ＥＮＤコード」に「ＥＮＤ」を設定する。 The partial information 133b of the second document file information 133 shown in FIG. 10 includes information indicating that “point” is a red character. Therefore, as shown in FIG. 14, the attribute information generation unit 112 sets “1” to the “qualification number”, sets “21” that is the character start sequence of “point” to the “modification start sequence”, “8”, which is the number of characters of “point”, is set in “number of qualified characters”. Furthermore, the attribute information generation unit 112 sets “END” to “END code”, for example.

続いて、図１０に示す第２文書ファイル情報１３３の部分情報１３３ｃは、「それを灰色にしたもの」の前に、半角６文字分の空白が存在する旨の情報を含んでいる。また、「それを灰色にしたもの」は、半角２０文字からなる。そのため、「それを灰色にしたもの」は、行の先頭から半角７文字目から半角２６文字目までに出力される。したがって、属性情報生成部１１２は、図１４に示すように、「ページ数」が「１」であって「行数」が「３」である情報の「文字開始列」に「７」を設定し、「全体文字数」に「２６」を設定する。 Subsequently, the partial information 133c of the second document file information 133 illustrated in FIG. 10 includes information indicating that there is a space of 6 half-width characters before “the grayed-out information”. “Grayed it” consists of 20 single-byte characters. For this reason, “the color in gray” is output from the 7th half-width character to the 26th half-width character from the beginning of the line. Therefore, as shown in FIG. 14, the attribute information generation unit 112 sets “7” in the “character start string” of the information where the “number of pages” is “1” and the “number of lines” is “3”. Then, “26” is set in the “total number of characters”.

そして、図１０に示す第２文書ファイル情報１３３の部分情報１３３ｃには、文字修飾についての情報が含まれていない。そのため、属性情報生成部１１２は、図１４に示すように、「ページ数」が「１」であって「行数」が「３」である情報の「修飾数」、「修飾開始列」及び「修飾文字数」に「−」を設定する。さらに、属性情報生成部１１２は、例えば、図１４に示すように、「ＥＮＤコード」に「ＥＮＤ」を設定する。 The partial information 133c of the second document file information 133 shown in FIG. 10 does not include information on character modification. Therefore, as shown in FIG. 14, the attribute information generation unit 112 includes “modification number”, “modification start column”, and “modification start column” of information in which “number of pages” is “1” and “number of rows” is “3”. Set “-” to “Number of modifiers”. Further, the attribute information generation unit 112 sets “END” to “END code” as illustrated in FIG. 14, for example.

すなわち、属性情報生成部１１２は、Ｓ１３及びＳ１４の処理において、ファイル形式の異なる第１文書ファイル情報１３２及び第２文書ファイル情報１３３から、同一のフォーマットである第１属性情報１３５及び第２属性情報１３６を生成する。これにより、属性情報生成部１１２は、第１属性情報１３５と第２属性情報１３６とを比較することで、第１文書ファイル情報１３２及び第２文書ファイル情報１３３に差分が存在するか否かの判定を容易に行うことが可能になる。 That is, the attribute information generation unit 112 uses the first attribute information 135 and the second attribute information in the same format from the first document file information 132 and the second document file information 133 having different file formats in the processes of S13 and S14. 136 is generated. Thereby, the attribute information generation unit 112 compares the first attribute information 135 and the second attribute information 136 to determine whether there is a difference between the first document file information 132 and the second document file information 133. Judgment can be made easily.

具体的に、図１３に示す第１属性情報１３５及び図１４に示す第２属性情報１３６は、同一の内容である。そのため、属性情報生成部１１２は、後述するように、図８に示す第１文書ファイル情報１３２と、図１０に示す第２文書ファイル情報１３３との間（図９に示す出力結果と、図１１に示す出力結果との間）には、差分が存在しない旨の比較結果情報１３７を出力する。これにより、事業者は、第１文書ファイル情報１３２及び第２文書ファイル情報１３３が、差分を発生させることなく生成されたことを確認することが可能になる。第１属性情報１３５と第２属性情報１３６との間に差分が存在する場合の具体例については後述する。 Specifically, the first attribute information 135 shown in FIG. 13 and the second attribute information 136 shown in FIG. 14 have the same contents. Therefore, the attribute information generation unit 112, as will be described later, between the first document file information 132 shown in FIG. 8 and the second document file information 133 shown in FIG. 10 (the output result shown in FIG. The comparison result information 137 indicating that there is no difference is output between the output results shown in FIG. Accordingly, the business operator can confirm that the first document file information 132 and the second document file information 133 are generated without causing a difference. A specific example in the case where there is a difference between the first attribute information 135 and the second attribute information 136 will be described later.

図７に戻り、情報処理装置１の情報比較部１１４は、第１属性情報１３５及び第２属性情報１３６に含まれる情報のうち、比較を行わない情報の削除を行うか否かを判定する（Ｓ１５）。具体的に、情報比較部１１４は、例えば、事業者が情報処理装置１に対して予め行った設定の内容を参照し、比較を行わない情報の削除を行うか否かを判定するものであってよい。 Returning to FIG. 7, the information comparison unit 114 of the information processing apparatus 1 determines whether or not to delete information that is not compared among the information included in the first attribute information 135 and the second attribute information 136 ( S15). Specifically, the information comparison unit 114 determines whether or not to delete information that is not compared, for example, by referring to the contents of the settings previously made by the business operator for the information processing apparatus 1. It's okay.

そして、比較を行わない情報の削除を行わない場合（Ｓ１５のＮＯ）、情報比較部１１４は、Ｓ１３の処理で生成した第１属性情報１３５と、Ｓ１４の処理で生成した第２属性情報１３６とを比較する（Ｓ１８）。その後、結果出力部１１５は、例えば、Ｓ１８の処理で情報比較部１１４が比較した結果である比較結果情報１３７を利用者端末１１等に出力する（Ｓ１９）。以下、Ｓ１８及びＳ１９の処理の具体例について説明する。 If the information not to be compared is not deleted (NO in S15), the information comparison unit 114 includes the first attribute information 135 generated in the process of S13 and the second attribute information 136 generated in the process of S14. Are compared (S18). Thereafter, the result output unit 115 outputs, for example, the comparison result information 137, which is the result of comparison by the information comparison unit 114 in the process of S18, to the user terminal 11 or the like (S19). Hereinafter, a specific example of the processing of S18 and S19 will be described.

［Ｓ１８及びＳ１９の処理の具体例（１）］
図１５は、第１属性情報１３５の具体例を説明する図である。また、図１６は、第２属性情報１３６の具体例を説明する図である。さらに、図１７は、比較結果情報１３７の具体例を説明する図である。なお、図１５から図１７は、図８から図１４で説明した具体例とは異なる具体例を説明する図である。 [Specific Example of Processing in S18 and S19 (1)]
FIG. 15 is a diagram illustrating a specific example of the first attribute information 135. FIG. 16 is a diagram illustrating a specific example of the second attribute information 136. Further, FIG. 17 is a diagram for explaining a specific example of the comparison result information 137. 15 to 17 are diagrams illustrating specific examples different from the specific examples described in FIGS. 8 to 14.

具体的に、図１５に示す第１属性情報１３５において、「ページ数」が「１」であって「行数」が「２」である情報には、「修飾数」として「２」が設定され、「修飾１開始列」として「８」が設定され、「修飾１文字数」として「３」が設定されている。さらに、図１５に示す第１属性情報１３５において、「ページ数」が「１」であって「行数」が「２」である情報には、「修飾２開始列」として「１２」が設定され、「修飾２文字数」として「４」が設定されている。すなわち、図１５に示す第１属性情報１３５における「ページ数」が「１」であって「行数」が「２」である情報には、文字修飾に関する情報が２つ含まれている。 Specifically, in the first attribute information 135 shown in FIG. 15, “2” is set as the “qualification number” in the information where “number of pages” is “1” and “number of rows” is “2”. As a result, “8” is set as the “qualification 1 start column”, and “3” is set as the “number of characters per modification”. Further, in the first attribute information 135 shown in FIG. 15, “12” is set as “qualification 2 start column” in the information where “number of pages” is “1” and “number of rows” is “2”. Then, “4” is set as “the number of two qualified characters”. That is, in the first attribute information 135 shown in FIG. 15, the information “number of pages” is “1” and the number of lines is “2” includes two pieces of information related to character modification.

一方、図１６に示す第２属性情報１３６において、「ページ数」が「１」であって「行数」が「２」である情報には、「修飾数」として「１」が設定され、「修飾１開始列」として「８」が設定され、「修飾１文字数」として「３」が設定されている。すなわち、図１６に示す第１属性情報１３５における「ページ数」が「１」であって「行数」が「２」である情報には、文字修飾に関する情報が１つのみ含まれている。 On the other hand, in the second attribute information 136 shown in FIG. 16, “1” is set as the “qualification number” in the information where “number of pages” is “1” and “number of rows” is “2”. “8” is set as the “modification 1 start column”, and “3” is set as the “number of 1 modification character”. That is, in the first attribute information 135 shown in FIG. 16, the information “number of pages” is “1” and the number of lines is “2” includes only one piece of information related to character modification.

そのため、情報比較部１１４は、この場合、Ｓ１８の処理において、第１属性情報１３５及び第２属性情報１３６のうち、第１属性情報１３５にのみ存在する文字修飾の情報が存在すると判定する。 Therefore, in this case, the information comparison unit 114 determines that character modification information that exists only in the first attribute information 135 exists in the first attribute information 135 and the second attribute information 136 in the process of S18.

さらに、図１５に示す第１属性情報１３５において、「ページ数」が「１」であって「行数」が「４」である情報には、「文字開始列」として「３」が設定され、「全体文字数」として「１０」が設定されている。一方、図１６に示す第２属性情報１３６において、「ページ数」が「１」であって「行数」が「４」である情報には、「文字開始列」として「１」が設定され、「全体文字数」として「８」が設定されている。 Further, in the first attribute information 135 shown in FIG. 15, “3” is set as the “character start column” in the information where “number of pages” is “1” and “number of lines” is “4”. , “10” is set as the “total number of characters”. On the other hand, in the second attribute information 136 shown in FIG. 16, “1” is set as the “character start string” in the information where “number of pages” is “1” and “number of lines” is “4”. , “8” is set as the “total number of characters”.

そのため、情報比較部１１４は、この場合、Ｓ１８の処理において、図１５に示す第１属性情報１３５に情報が含まれる空白の数と、図１６に示す第２属性情報１３６に情報が含まれる空白の数とが異なっていると判定する。 Therefore, in this case, in the processing of S18, the information comparison unit 114 determines the number of blanks that contain information in the first attribute information 135 shown in FIG. 15 and the blank that contains information in the second attribute information 136 shown in FIG. It is determined that the number is different.

そして、結果出力部１１５は、Ｓ１９の処理において、Ｓ１８の処理における比較結果から比較結果情報１３７を生成し、利用者端末１１に出力する。具体的に、結果出力部１１５は、例えば、図１７に示すように、「ＰＤＦ文書の１ページ目の２行目」に「文字修飾の差異が存在」する旨の情報を比較結果情報１３７に含める。また、結果出力部１１５は、例えば、図１７に示すように、「ＰＤＦ文書の１ページ目の５行目」に「表示位置の差異が存在」する旨の情報を比較結果情報１３７に含める。 And the result output part 115 produces | generates the comparison result information 137 from the comparison result in the process of S18 in the process of S19, and outputs it to the user terminal 11. Specifically, for example, as shown in FIG. 17, the result output unit 115 adds information indicating that “a character modification difference exists in the second line of the first page of the PDF document” to the comparison result information 137. include. Further, for example, as illustrated in FIG. 17, the result output unit 115 includes information indicating that “there is a difference in display position” in “the fifth line of the first page of the PDF document” in the comparison result information 137.

これにより、結果出力部１１５は、事業者に対し、比較結果情報１３７に情報が含まれる箇所についての見直しを行う契機を与えることが可能になる。 As a result, the result output unit 115 can give the operator an opportunity to review the location where the information is included in the comparison result information 137.

なお、図１６に示す第２属性情報１３６において、「修飾２開始列」及び「修飾２文字数」には、「−」が設定されている。そのため、属性情報生成部１１２は、図１６に示す第２属性情報１３６を生成する際に、「修飾２開始列」及び「修飾２文字数」の項目を含めないものであってもよい。 In the second attribute information 136 shown in FIG. 16, “-” is set in “Modification 2 start column” and “Modification 2 character count”. Therefore, the attribute information generation unit 112 may not include the items “modification 2 start column” and “number of modification 2 characters” when generating the second attribute information 136 illustrated in FIG. 16.

図７に戻り、Ｓ１５の処理において、比較を行わない情報の削除を行わない場合（Ｓ１５のＮＯ）、情報比較部１１４は、例えば、第１属性情報１３５及び第２属性情報１３６から、ページ数を示す情報及び行を示す情報を削除する（Ｓ１６）。 Returning to FIG. 7, in the process of S <b> 15, when the information not to be compared is not deleted (NO in S <b> 15), the information comparison unit 114 determines the number of pages from the first attribute information 135 and the second attribute information 136, for example. And the information indicating the row are deleted (S16).

すなわち、例えば、ＰＤＦファイルには、ページ数の概念があるのに対し、ＨＴＭＬファイルには、ページ数の概念がない。そのため、ＨＴＭＬファイル（第２文書ファイル情報１３３）から生成した属性情報（第２属性情報１３６）は、全ての情報が１ページに含まれるものとして生成される。そのため、この場合、第１属性情報１３５及び第２属性情報１３６において、同じ内容の情報を含む行に対応する「ページ数」及び「行数」に、それぞれ異なる情報が設定される可能性がある。したがって、情報比較部１１４は、この場合、第１属性情報１３５及び第２属性情報１３６の比較を行うことができない。 That is, for example, a PDF file has a concept of the number of pages, whereas an HTML file does not have a concept of the number of pages. Therefore, the attribute information (second attribute information 136) generated from the HTML file (second document file information 133) is generated assuming that all information is included in one page. Therefore, in this case, in the first attribute information 135 and the second attribute information 136, different information may be set in the “number of pages” and “number of lines” corresponding to the lines including the same information. . Therefore, in this case, the information comparison unit 114 cannot compare the first attribute information 135 and the second attribute information 136.

そこで、情報比較部１１４は、第１属性情報１３５及び第２属性情報１３６の比較を行う前に、例えば、「ページ数」及び「行数」（ファイル形式間において単純に比較することができない項目）に対応する情報を、比較を行う情報から除外するものであってよい。これにより、情報比較部１１４は、第１属性情報１３５及び第２属性情報１３６に、単純に比較することができない項目に対応する情報が含まれている場合であっても、第１属性情報１３５及び第２属性情報１３６の比較を行うことが可能になる。 Therefore, before comparing the first attribute information 135 and the second attribute information 136, the information comparison unit 114, for example, “number of pages” and “number of lines” (items that cannot be simply compared between file formats). ) May be excluded from the information to be compared. As a result, the information comparison unit 114 is configured such that even if the first attribute information 135 and the second attribute information 136 include information corresponding to items that cannot be simply compared, the first attribute information 135 And the second attribute information 136 can be compared.

さらに、情報比較部１１４は、Ｓ１６の処理の後、第１属性情報１３５及び第２属性情報１３６から、改行が含まれる旨を示す情報を削除する（Ｓ１７）。 Further, after the process of S16, the information comparison unit 114 deletes information indicating that a line break is included from the first attribute information 135 and the second attribute information 136 (S17).

すなわち、第１文書ファイル情報１３２の出力結果に含まれる改行の数と、第２文書ファイル情報１３３の出力結果に含まれる改行の数とが異なる場合がある。この場合、第１属性情報１３５に含まれる情報の行数と第２属性情報１３６に含まれる情報の行数とが異なることになるため、情報比較部１１４は、第１属性情報１３５及び第２属性情報１３６の比較を行うことができない。 That is, the number of line breaks included in the output result of the first document file information 132 may differ from the number of line breaks included in the output result of the second document file information 133. In this case, since the number of lines of information included in the first attribute information 135 is different from the number of lines of information included in the second attribute information 136, the information comparison unit 114 performs the first attribute information 135 and the second attribute information 135. The attribute information 136 cannot be compared.

そこで、情報比較部１１４は、第１属性情報１３５及び第２属性情報１３６の比較を行う前に、例えば、第１属性情報１３５及び第２属性情報１３６のうち、改行が含まれる旨を示す情報を削除するものであってよい。これにより、情報比較部１１４は、第１属性情報１３５に含まれる情報の行数と第２属性情報１３６に含まれる情報の行数とを合わせることが可能になる。そのため、情報比較部１１４は、第１属性情報１３５に含まれる情報の行数と第２属性情報１３６に含まれる情報の行数とが異なる場合であっても、第１属性情報１３５及び第２属性情報１３６の比較を行うことが可能になる。 Therefore, before comparing the first attribute information 135 and the second attribute information 136, the information comparison unit 114, for example, information indicating that a line break is included in the first attribute information 135 and the second attribute information 136. May be deleted. Thereby, the information comparison unit 114 can match the number of lines of information included in the first attribute information 135 with the number of lines of information included in the second attribute information 136. Therefore, even if the number of lines of information included in the first attribute information 135 is different from the number of lines of information included in the second attribute information 136, the information comparison unit 114 does not include the first attribute information 135 and the second attribute information 135. It becomes possible to compare the attribute information 136.

そして、Ｓ１７の処理の後、情報比較部１１４は、Ｓ１８以降の処理を実行する。なお、情報比較部１１４は、Ｓ１６及びＳ１７の処理のうち、一方の処理のみを行うものであってもよい。以下、Ｓ１８及びＳ１９の処理の他の具体例について説明する。 And after the process of S17, the information comparison part 114 performs the process after S18. The information comparison unit 114 may perform only one of the processes in S16 and S17. Hereinafter, another specific example of the processing of S18 and S19 will be described.

［Ｓ１８及びＳ１９の処理の具体例（２）］
図１８及び図２０は、第１属性情報１３５の具体例を説明する図である。また、図１９及び図２１は、第２属性情報１３６の具体例を説明する図である。さらに、図２２は、比較結果情報１３７の具体例を説明する図である。なお、図１８及び図２２は、図８から図１４で説明した具体例、及び図１５から図１７で説明した具体例とは異なる具体例を説明する図である。 [Specific Example of Processing in S18 and S19 (2)]
18 and 20 are diagrams for explaining a specific example of the first attribute information 135. FIG. 19 and 21 are diagrams for explaining a specific example of the second attribute information 136. Further, FIG. 22 is a diagram for explaining a specific example of the comparison result information 137. 18 and 22 are diagrams illustrating specific examples different from the specific examples described in FIGS. 8 to 14 and the specific examples described in FIGS. 15 to 17.

具体的に、図１８に示す第１属性情報１３５において、「ページ数」には、「１」または「２」が設定されている。一方、図１９に示す第２属性情報１３６において、「ページ数」には、「１」のみが設定されている。 Specifically, in the first attribute information 135 illustrated in FIG. 18, “1” or “2” is set in the “number of pages”. On the other hand, in the second attribute information 136 shown in FIG. 19, only “1” is set in the “number of pages”.

また、図１９に示す第２属性情報１３６において、「ページ数」が「１」であって「行数」が「３」及び「５」である情報には、「項番」、「ページ数」、「行数」及び「ＥＮＤコード」にのみ情報が設定されている。すなわち、図１９に示す第２属性情報１３６において、「ページ数」が「１」であって「行数」が「３」及び「５」である情報は、改行が存在することを示す情報である。 Further, in the second attribute information 136 shown in FIG. 19, information with “number of pages” being “1” and “number of rows” being “3” and “5” includes “item number”, “number of pages”. "," Number of lines "and" END code "are set only. That is, in the second attribute information 136 shown in FIG. 19, the information where “number of pages” is “1” and “number of lines” is “3” and “5” is information indicating that a line break exists. is there.

そのため、情報比較部１１４は、図１８に示す第１属性情報１３５に含まれる情報に、図２０に示す第２属性情報１３６に含まれる情報と単純に比較することができない情報が存在するものと判定する。 Therefore, the information comparison unit 114 includes information that cannot be simply compared with the information included in the second attribute information 136 illustrated in FIG. 20 in the information included in the first attribute information 135 illustrated in FIG. judge.

そこで、情報比較部１１４は、図２０に示すように、図１８に示す第１属性情報１３５に含まれる項目のうち、「ページ数」及び「行数」に対応する情報を削除する。 Therefore, as illustrated in FIG. 20, the information comparison unit 114 deletes information corresponding to “number of pages” and “number of rows” from the items included in the first attribute information 135 illustrated in FIG.

また、情報比較部１１４は、図２１に示すように、図１９に示す第２属性情報１３６に含まれる項目のうち、「ページ数」及び「行数」に対応する情報を削除する。さらに、情報比較部１１４は、改行コードのみを含む行に対応する情報である「ページ数」が「１」であって「行数」が「３」及び「５」である情報を削除する。 Further, as illustrated in FIG. 21, the information comparison unit 114 deletes information corresponding to “number of pages” and “number of rows” from the items included in the second attribute information 136 illustrated in FIG. 19. Further, the information comparison unit 114 deletes information corresponding to the line including only the line feed code, the number of “pages” being “1” and the “number of lines” being “3” and “5”.

これにより、図１９及び図２１に示すように、第１属性情報１３５に含まれる情報に対応する行の数は、第２属性情報１３６に含まれる情報に対応する行の数と一致する。そのため、情報比較部１１４は、Ｓ１８の処理において、第１属性情報１３５及び第２属性情報１３６の比較を行うことが可能になる。 Accordingly, as shown in FIGS. 19 and 21, the number of lines corresponding to the information included in the first attribute information 135 matches the number of lines corresponding to the information included in the second attribute information 136. Therefore, the information comparison unit 114 can compare the first attribute information 135 and the second attribute information 136 in the process of S18.

なお、図１９及び図２１に示す例において、図１９に示す第１属性情報１３５における「項番」が「５」である情報には、「文字開始列」として「５」が設定され、「全体文字数」として「１０」が設定されている。一方、図２１に示す第２属性情報１３６における「項番」が「５」である情報には、「文字開始列」として「１」が設定され、「全体文字数」として「６」が設定されている。 In the example shown in FIG. 19 and FIG. 21, “5” is set as the “character start string” in the information whose “item number” is “5” in the first attribute information 135 shown in FIG. “10” is set as the “total number of characters”. On the other hand, in the information whose “item number” is “5” in the second attribute information 136 shown in FIG. 21, “1” is set as the “character start string” and “6” is set as the “total number of characters”. ing.

そのため、情報比較部１１４は、この場合、Ｓ１８の処理において、図１９に示す第１属性情報１３５に情報が存在する空白の数と、図２１に示す第２属性情報１３６に情報が存在する空白の数とが異なっていると判定する。 Therefore, in this case, in the processing of S18, the information comparison unit 114 determines the number of blanks in which information exists in the first attribute information 135 illustrated in FIG. 19 and the blanks in which information exists in the second attribute information 136 illustrated in FIG. It is determined that the number is different.

そして、結果出力部１１５は、Ｓ１９の処理において、例えば、図２２に示すように、「ＰＤＦ文書の２ページ目の２行目」に「表示位置の差異が存在」する旨の情報を比較結果情報１３７に含める。 Then, the result output unit 115 compares the information indicating that “there is a difference in display position” in “the second line of the second page of the PDF document”, as shown in FIG. It is included in the information 137.

これにより、結果出力部１１５は、第１属性情報１３５及び第２属性情報１３６に、単純に比較することができない項目に対応する情報が含まれている場合であっても、事業者に対し、比較結果情報１３７が示す情報の見直しを行う契機を与えることが可能になる。 As a result, even if the result output unit 115 includes information corresponding to items that cannot be simply compared in the first attribute information 135 and the second attribute information 136, It becomes possible to give an opportunity to review the information indicated by the comparison result information 137.

以上の実施の形態をまとめると、以下の付記のとおりである。 The above embodiment is summarized as follows.

（付記１）
コンピュータに、
文書ファイルの生成に用いられる同一の生成元データに基づいて生成された、ファイル形式の異なる複数の文書ファイルについて、文書ファイルに含まれる文書の行単位の属性情報を生成し、
前記複数の文書ファイルについて、文書内の行それぞれについての前記属性情報を比較した結果を出力する、
処理を実行させることを特徴とする文書校閲支援プログラム。 (Appendix 1)
On the computer,
For each of a plurality of document files having different file formats generated based on the same generation source data used for generating the document file, the attribute information for each line of the document included in the document file is generated,
Outputting a result of comparing the attribute information for each line in the document for the plurality of document files;
A document review support program characterized by causing processing to be executed.

（付記２）
付記１において、
前記属性情報は、前記文書ファイルに含まれる文書の修飾方法を示す情報を含む、
ことを特徴とする文書校閲支援プログラム。 (Appendix 2)
In Appendix 1,
The attribute information includes information indicating a modification method of a document included in the document file.
Document review support program characterized by this.

（付記３）
付記１において、
前記属性情報は、前記文書ファイルに含まれる文書に空白文字が含まれる旨を示す情報を含む、
ことを特徴とする文書校閲支援プログラム。 (Appendix 3)
In Appendix 1,
The attribute information includes information indicating that the document included in the document file includes a blank character,
Document review support program characterized by this.

（付記４）
付記１において、
前記属性情報は、前記文書ファイルに含まれる文書の出力時における前記文書のページを示す情報及び行を示す情報を含み、
前記出力では、前記属性情報のうち、前記ページを示す情報及び前記行を示す情報以外の情報について比較を行う、
ことを特徴とする文書校閲支援プログラム。 (Appendix 4)
In Appendix 1,
The attribute information includes information indicating a page of the document and information indicating a line when the document included in the document file is output,
In the output, the attribute information is compared for information other than information indicating the page and information indicating the row.
Document review support program characterized by this.

（付記５）
付記１において、
前記属性情報は、前記文書ファイルに含まれる文書に改行が含まれる旨を示す情報を含み、
前記出力では、前記属性情報のうち、前記改行が含まれる旨を示す情報以外の情報について比較を行う、
ことを特徴とする文書校閲支援プログラム。 (Appendix 5)
In Appendix 1,
The attribute information includes information indicating that the document included in the document file includes a line feed,
In the output, the attribute information is compared with information other than information indicating that the line feed is included.
Document review support program characterized by this.

（付記６）
付記１において、
前記生成では、同一の特定フォーマットに従って、前記複数の文書ファイルについての前記属性情報を生成する、
ことを特徴とする文書校閲支援プログラム。 (Appendix 6)
In Appendix 1,
In the generation, the attribute information for the plurality of document files is generated according to the same specific format.
Document review support program characterized by this.

（付記７）
付記１において、
前記属性情報は、前記文書ファイルに含まれる文書の文字数を示す情報を含む、
ことを特徴とする文書校閲支援プログラム。 (Appendix 7)
In Appendix 1,
The attribute information includes information indicating the number of characters of a document included in the document file.
Document review support program characterized by this.

（付記８）
文書ファイルの生成に用いられる同一の生成元データに基づいて生成された、ファイル形式の異なる複数の文書ファイルについて、文書ファイルに含まれる文書の行単位の属性情報を生成する情報生成部と、
前記複数の文書ファイルについて、文書内の行それぞれについての前記属性情報を比較した結果を出力する結果出力部と、を有する、
ことを特徴とする文書校閲支援装置。 (Appendix 8)
An information generation unit for generating line-by-line attribute information of a document included in the document file for a plurality of document files having different file formats generated based on the same generation source data used for generating the document file;
A result output unit that outputs a result of comparing the attribute information for each line in the document for the plurality of document files;
Document review support device characterized by the above.

（付記９）
文書ファイルの生成に用いられる同一の生成元データに基づいて生成された、ファイル形式の異なる複数の文書ファイルについて、文書ファイルに含まれる文書の行単位の属性情報を生成し、
前記複数の文書ファイルについて、文書内の行それぞれについての前記属性情報を比較した結果を出力する、
ことを特徴とする文書校閲支援方法。 (Appendix 9)
For each of a plurality of document files having different file formats generated based on the same generation source data used for generating the document file, the attribute information for each line of the document included in the document file is generated,
Outputting a result of comparing the attribute information for each line in the document for the plurality of document files;
Document review support method characterized by this.

１：情報処理装置２：記憶部
１１：利用者端末ＮＷ：ネットワーク 1: Information processing device 2: Storage unit 11: User terminal NW: Network

Claims

On the computer,
For each of a plurality of document files having different file formats generated based on the same generation source data used for generating the document file, the attribute information for each line of the document included in the document file is generated,
Outputting a result of comparing the attribute information for each line in the document for the plurality of document files;
A document review support program characterized by causing processing to be executed.

In claim 1,
The attribute information includes information indicating a modification method of a document included in the document file.
Document review support program characterized by this.

In claim 1,
The attribute information includes information indicating that the document included in the document file includes a blank character,
Document review support program characterized by this.

In claim 1,
The attribute information includes information indicating a page of the document and information indicating a line when the document included in the document file is output,
In the output, the attribute information is compared for information other than information indicating the page and information indicating the row.
Document review support program characterized by this.

In claim 1,
The attribute information includes information indicating that the document included in the document file includes a line feed,
In the output, the attribute information is compared with information other than information indicating that the line feed is included.
Document review support program characterized by this.

In claim 1,
In the generation, the attribute information for the plurality of document files is generated according to the same specific format.
Document review support program characterized by this.

An information generation unit for generating line-by-line attribute information of a document included in the document file for a plurality of document files having different file formats generated based on the same generation source data used for generating the document file;
A result output unit that outputs a result of comparing the attribute information for each line in the document for the plurality of document files;
Document review support device characterized by the above.

For each of a plurality of document files having different file formats generated based on the same generation source data used for generating the document file, the attribute information for each line of the document included in the document file is generated,
Outputting a result of comparing the attribute information for each line in the document for the plurality of document files;
Document review support method characterized by this.