JP2005122324A

JP2005122324A - Apparatus, method, and program for document electronization, and document electronization system equipped with same apparatus

Info

Publication number: JP2005122324A
Application number: JP2003354407A
Authority: JP
Inventors: Tetsuya Hamada; 徹也濱田; Taiji Yoneyama; 泰司米山
Original assignee: Canon Marketing Japan Inc
Current assignee: Canon Marketing Japan Inc
Priority date: 2003-10-14
Filing date: 2003-10-14
Publication date: 2005-05-12
Anticipated expiration: 2023-10-14
Also published as: JP4544508B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide an apparatus, a method, and a program for document electronization that can securely classify images of documents, and a document electronization system equipped with the same apparatus. <P>SOLUTION: A document electronization server 4 acquires a PDF file from a folder after OCR processing and extracts a document management number as a file name from the acquired PDF file; when document property data including the same document management number as the extracted document management number are registered in a registered slip DB, the document electronization server 4 acquires specified data of a document kind, a comment, a manager code, etc., among the document property data registered in the registered slip DB, imparts the acquired specified data to properties including the file size and the number of pages of the PDF file, and retrieves a folder to be stored with the PDF file from storage-destination folders in another root directory on the document electronization server 4 according to the acquired manager code to move the PDF file to the corresponding folder. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、書類電子化装置、方法、プログラム、及び該装置を備える書類電子化システムに関し、特に、画像読取装置により複数枚の書類の画像を電子化して作成された電子化ファイルを分類する書類電子化装置、方法、プログラム、及び該装置を備える書類電子化システムに関する。 BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document digitizing apparatus, method, program, and document digitizing system including the apparatus, and in particular, a document for classifying digitized files created by digitizing images of a plurality of documents by an image reading apparatus. The present invention relates to an electronic device, a method, a program, and a document electronic system including the device.

従来の書類電子化システムでは、スキャナ、複合機等の画像読取装置が、紙書類の画像を読取って、ＰＤＦ（Portable Document Format）、ＪＰＥＧ（Joint Photographic Exports group）、ＴＩＦＦ（Tagged Image File Format）、Ｍｕｌｔｉ−ＴＩＦＦ等のファイル形式に画像を電子化した電子化ファイルを作成する。また、ＡＤＦ機能（自動原稿送り機能）を有する画像読取装置を利用すれば、複数枚の紙書類の画像を一括して電子化することができる。 In a conventional document digitization system, an image reading device such as a scanner or a multi-function device reads an image of a paper document, PDF (Portable Document Format), JPEG (Joint Photographic Exports group), TIFF (Tagged Image File Format), An electronic file is created by digitizing an image in a file format such as Multi-TIFF. Further, if an image reading apparatus having an ADF function (automatic document feeding function) is used, images of a plurality of paper documents can be digitized at once.

しかしながら、紙書類を画像データとして電子化しただけでは、画像データに対しては自動文字検索等を行うことができないので、作成した電子化ファイルから所望の文字情報を視認により検索しなければならない。 However, if a paper document is only digitized as image data, automatic character search or the like cannot be performed on the image data. Therefore, it is necessary to visually search desired character information from the created digitized file.

そこで、紙書類を電子化する際にＯＣＲ（Optical Character Reader）処理を施すことにより、電子化ファイル内の自動文字検索を可能にする方法や、電子化ファイルに文書属性情報を付与することにより、該文書属性情報を参照してファイルの分類作業を容易にする方法が考えられる。 Therefore, by applying OCR (Optical Character Reader) processing when digitizing paper documents, enabling automatic character search within the digitized file, or adding document attribute information to the digitized file, A method for facilitating file classification work with reference to the document attribute information is conceivable.

上記方法の一例として、電子化ファイルに付与すべき文書属性情報の項目を指定するための文書属性抽出シートを紙書類の表紙にして画像読取装置に読取らせることにより、紙書類自体に記載された文書属性情報を読取って、該文書属性情報を電子化ファイルに付与する技術が提案されている（例えば、特許文献１）。 As an example of the above method, the document attribute extraction sheet for designating the item of the document attribute information to be added to the digitized file is used as a cover of the paper document, and is read by the image reading device, so that it is described in the paper document itself. A technique has been proposed in which document attribute information is read and the document attribute information is added to an electronic file (for example, Patent Document 1).

この技術によれば、ユーザが、紙書類の文書属性情報の記載された領域、及び電子化ファイルに付与すべき文書属性情報の項目を指定することにより、文書属性情報を電子化ファイルに付与することができるので、ユーザの文書属性情報付与の作業負担を軽減することができる。
特開２００２−５５９８５号公報 According to this technique, the user assigns document attribute information to the digitized file by designating an area in which the document attribute information of the paper document is described and an item of the document attribute information to be given to the digitized file. As a result, it is possible to reduce the work burden of the user on adding document attribute information.
JP 2002-55985 A

しかしながら、上記技術によって紙書類から文書属性情報を読取る場合、読取るべき文書属性情報の項目が、例えば５つある場合に、そのうちの１つの項目で文書属性情報の読取りミスが発生すれば、電子化ファイルに間違った文書属性情報が付与される。したがって、文書属性情報に基づいて紙書類を正確に分類することができない。 However, when document attribute information is read from a paper document by the above technique, if there are, for example, five items of document attribute information to be read, if an error in reading the document attribute information occurs in one of the items, digitization is performed. Incorrect document attribute information is given to the file. Therefore, the paper document cannot be classified accurately based on the document attribute information.

また、上記技術では、ユーザが電子化ファイルに付与すべき文書属性情報の項目を指定するための文書属性抽出シートの記入を誤ると、文書属性情報を適切に読取ることができない。 Further, in the above technique, if the user makes a mistake in filling in the document attribute extraction sheet for designating the item of the document attribute information to be added to the digitized file, the document attribute information cannot be read appropriately.

さらに、上記技術では、ユーザは、紙書類を読取って電子化ファイルを作成した時点では、電子化ファイルに付与された文書属性情報が正確であるか否かを確認することができない。 Furthermore, in the above technique, the user cannot confirm whether or not the document attribute information given to the digitized file is accurate when the digitized file is created by reading the paper document.

本発明の目的は、書類の画像を確実に分類することができる書類電子化装置、方法、プログラム、及び該装置を備える書類電子化システムを提供することにある。 An object of the present invention is to provide a document digitizing apparatus, a method, a program, and a document digitizing system including the apparatus capable of reliably classifying document images.

上述の目的を達成するために、請求項１記載の書類電子化装置は、所定の識別子が所定の領域に記載された識別用書類を含む書類の画像から前記記載された識別子を抽出する書類電子化装置において、前記書類を識別するための識別子を格納する格納手段と、前記識別用書類の画像における前記所定の領域に対応する領域から前記記載された識別子を抽出する抽出手段と、前記抽出された識別子及び前記格納されている識別子が互いに一致するか否かを判別する判別手段と、前記判別手段による判別の結果に基づいて前記書類の画像を保存する保存手段とを備えることを特徴とする。 In order to achieve the above object, the document digitizing apparatus according to claim 1 is a document electronic device for extracting the described identifier from an image of a document including an identification document in which a predetermined identifier is described in a predetermined area. In the conversion apparatus, storage means for storing an identifier for identifying the document, extraction means for extracting the described identifier from an area corresponding to the predetermined area in the image of the identification document, and the extracted A discriminating unit that discriminates whether or not the stored identifier and the stored identifier coincide with each other; and a storage unit that saves the image of the document based on a result of discrimination by the discriminating unit. .

請求項２記載の書類電子化装置は、請求項１記載の書類電子化装置において、前記抽出された識別子及び前記格納されている識別子が互いに一致しないときは、前記保存手段は、前記書類の画像を不一致フォルダに保存することを特徴とする。 The document digitizing apparatus according to claim 2, wherein, in the document digitizing apparatus according to claim 1, when the extracted identifier and the stored identifier do not coincide with each other, the storage unit stores the image of the document. Is stored in the mismatch folder.

請求項３記載の書類電子化装置は、請求項２記載の書類電子化装置において、前記不一致フォルダに格納された書類に含まれる識別用書類の画像を表示する表示手段と、当該識別用書類の画像が表示されている識別用書類の識別子を入力する入力手段とを備える端末装置に接続され、前記書類の画像に前記保存手段は、前記入力された識別子に基づいて保存フォルダに保存することを特徴とする。 The document digitizing apparatus according to claim 3 is the document digitizing apparatus according to claim 2, wherein display means for displaying an image of the identification document included in the document stored in the mismatched folder, and the identification document And an input unit that inputs an identifier of an identification document on which an image is displayed. The storage unit stores an image of the document in a storage folder based on the input identifier. Features.

請求項４記載の書類電子化装置は、請求項１記載の書類電子化装置において、前記抽出された識別子及び前記格納されている識別子が互いに一致するときは、前記保存手段は、前記書類の画像に前記一致した識別子に基づいて識別名を付与して保存フォルダに保存することを特徴とする。 The document digitizing apparatus according to claim 4, wherein, in the document digitizing apparatus according to claim 1, when the extracted identifier and the stored identifier coincide with each other, the storage unit stores the image of the document. And assigning an identification name to the storage folder based on the matched identifier.

請求項５記載の書類電子化装置は、請求項３又は４記載の書類電子化装置において、前記格納手段は、前記識別子を含む前記書類の属性を示す書類属性情報を格納し、前記保存手段は、前記識別子が付与された書類の画像に前記格納されている書類属性情報を付与して前記保存フォルダに保存することを特徴とする。 The document digitizing apparatus according to claim 5 is the document digitizing apparatus according to claim 3 or 4, wherein the storage unit stores document attribute information indicating an attribute of the document including the identifier, and the storage unit includes The stored document attribute information is added to the image of the document to which the identifier is assigned, and the image is saved in the save folder.

請求項６記載の書類電子化装置は、請求項５記載の書類電子化装置において、前記保存手段は、前記書類属性情報に基づいて前記書類属性情報が付与された書類の画像を前記保存フォルダのサブフォルダに保存することを特徴とする。 The document digitizing apparatus according to claim 6 is the document digitizing apparatus according to claim 5, wherein the storage unit stores an image of the document to which the document attribute information is assigned based on the document attribute information in the storage folder. It is stored in a subfolder.

請求項７記載の書類電子化装置は、請求項１乃至６のいずれか１項に記載の書類電子化装置において、前記抽出された識別子及び前記格納されている識別子が互いに一致した一致回数をカウントするカウント手段とを備えることを特徴とする。 The document digitizing apparatus according to claim 7, wherein the document digitizing apparatus according to any one of claims 1 to 6 counts the number of times the extracted identifier and the stored identifier match each other. And a counting means.

請求項８記載の書類電子化装置は、請求項１乃至７のいずれか１項に記載の書類電子化装置において、前記識別用書類を含む書類を読み取ってその画像を出力する画像出力装置に接続され、前記所定の識別子が前記所定の領域に記載されるように、該所定の識別子が記載されていない識別用書類のフォーマットを前記画像出力装置に送信する送信手段と、前記画像出力装置により出力された書類の画像を受信する受信手段とを備えることを特徴とする。 The document digitizing apparatus according to claim 8 is connected to an image output apparatus that reads the document including the identification document and outputs the image in the document digitizing apparatus according to any one of claims 1 to 7. So that the predetermined identifier is described in the predetermined area, and a format of an identification document not including the predetermined identifier is transmitted to the image output device, and output by the image output device Receiving means for receiving an image of the document that has been read.

請求項９記載の書類電子化装置は、請求項８記載の書類電子化装置において、前記画像出力装置は、前記識別用書類のフォーマットを受信して当該フォーマットに従って前記格納手段に格納すべき識別子を前記所定の領域に入力する端末装置と、前記識別子が入力された識別用書類を印刷する印刷する印刷装置と、前記印刷された識別用書類を含む書類の画像を読取る画像読取装置とを備えることを特徴とする。 The document digitizing apparatus according to claim 9 is the document digitizing apparatus according to claim 8, wherein the image output apparatus receives an identifier to be stored in the storage means according to the format of the identification document. A terminal device for inputting to the predetermined area; a printing device for printing the identification document to which the identifier is input; and an image reading device for reading an image of the document including the printed identification document. It is characterized by.

上述の目的を達成するために、請求項１０記載の書類電子化方法は、所定の識別子が所定の領域に記載された識別用書類を含む書類の画像から前記記載された識別子を抽出する書類電子化方法において、前記書類を識別するための識別子を格納する格納ステップと、前記識別用書類の画像における前記所定の領域に対応する領域から前記記載された識別子を抽出する抽出ステップと、前記抽出された識別子及び前記格納されている識別子が互いに一致するか否かを判別する判別ステップと、前記判別ステップでの前記判別の結果に基づいて前記書類の画像を保存する保存ステップとを有することを特徴とする。 In order to achieve the above-described object, a document digitizing method according to claim 10 is a document electronic method for extracting the described identifier from an image of a document including an identification document in which a predetermined identifier is described in a predetermined area. In the conversion method, a storing step of storing an identifier for identifying the document, an extracting step of extracting the described identifier from an area corresponding to the predetermined area in the image of the identification document, and the extraction A determination step for determining whether or not the stored identifier and the stored identifier match each other, and a storage step for storing the image of the document based on the determination result in the determination step. And

上述の目的を達成するために、請求項１１記載のプログラムは、所定の識別子が所定の領域に記載された識別用書類を含む書類の画像から前記記載された識別子を抽出する書類電子化方法をコンピュータに実行させるプログラムであって、前記書類を識別するための識別子を格納する格納モジュールと、前記識別用書類の画像における前記所定の領域に対応する領域から前記記載された識別子を抽出する抽出モジュールと、前記抽出された識別子及び前記格納されている識別子が互いに一致するか否かを判別する判別モジュールと、前記判別モジュールによる判別の結果に基づいて前記書類の画像を保存する保存モジュールとを備えることを特徴とする。 In order to achieve the above object, a program according to claim 11 is a document digitizing method for extracting an identifier described from a document image including an identification document in which a predetermined identifier is described in a predetermined area. A storage module for storing an identifier for identifying the document, and an extraction module for extracting the described identifier from an area corresponding to the predetermined area in the image of the identification document. A determination module that determines whether the extracted identifier and the stored identifier match each other, and a storage module that stores an image of the document based on a determination result by the determination module It is characterized by that.

上述の目的を達成するために、請求項１２記載の書類電子化システムは、所定の識別子が所定の領域に記載された識別用書類を含む書類を読み取ってその画像を出力する画像出力装置と、前記出力された画像から前記記載された識別子を抽出する書類電子化装置とから成る書類電子化システムにおいて、前記書類電子化装置は、前記書類を識別するための識別子を格納する格納手段と、前記識別用書類の画像における前記所定の領域に対応する領域から前記記載された識別子を抽出する抽出手段と、前記抽出された識別子及び前記格納されている識別子が互いに一致するか否かを判別する判別手段と、前記判別手段による判別の結果に基づいて前記書類の画像を保存する保存手段とを備えることを特徴とする。 In order to achieve the above-described object, an electronic document output system according to claim 12, an image output device that reads a document including an identification document whose predetermined identifier is described in a predetermined area, and outputs the image, and In a document digitizing system comprising a document digitizing device that extracts the described identifier from the output image, the document digitizing device includes a storage unit that stores an identifier for identifying the document, and Discriminating means for extracting the described identifier from an area corresponding to the predetermined area in the image of the identification document and determining whether or not the extracted identifier and the stored identifier match each other Means and a storage means for storing the image of the document based on the result of determination by the determination means.

請求項１３記載の書類電子化システムは、請求項１２記載の書類電子化システムにおいて、前記書類電子化装置は、前記識別用書類に記載される識別子が前記所定の領域となるように識別子が記載されていない識別用書類のフォーマットを前記画像出力装置に送信する送信手段を備え、前記画像出力装置は、前記識別用書類のフォーマットを受信して当該フォーマットに従って前記格納手段に格納すべき識別子を前記所定の領域に入力する端末装置と、前記識別子が入力された識別用書類を印刷する印刷する印刷装置と、前記印刷された識別用書類を含む書類の画像を読取る画像読取装置とを備えることを特徴とする。 The document digitizing system according to claim 13 is the document digitizing system according to claim 12, wherein the document digitizing apparatus describes the identifier so that the identifier described in the identification document is the predetermined area. Transmitting means for transmitting a format of an unidentified identification document to the image output device, the image output device receiving an identifier to be stored in the storage means according to the format after receiving the format of the identification document A terminal device for inputting to a predetermined area; a printing device for printing the identification document to which the identifier is input; and an image reading device for reading an image of the document including the printed identification document. Features.

請求項１記載の書類電子化装置、請求項１０記載の方法、請求項１１記載のプログラム、又は請求項１２記載のシステムによれば、抽出された識別子及び格納されている識別子が互いに一致するか否かの判別の結果に基づいて前記書類の画像を保存するので、書類の画像を確実に分類することができる。また、識別用書類の画像における所定の領域に対応する領域から記載された識別子を抽出するので、ユーザが所定の領域を指定することをなくすことができ、ユーザの操作性を向上させることができる。 According to the document digitizing apparatus according to claim 1, the method according to claim 10, the program according to claim 11, or the system according to claim 12, whether the extracted identifier and the stored identifier match each other. Since the image of the document is stored based on the result of determination of whether or not, the image of the document can be reliably classified. Further, since the identifier described from the area corresponding to the predetermined area in the image of the identification document is extracted, the user can be prevented from designating the predetermined area, and the operability for the user can be improved. .

請求項２記載の書類電子化装置によれば、抽出された識別子及び格納されている識別子が互いに一致しないときは、書類の画像を不一致フォルダに保存するので、例えば抽出エラーがあったときに書類の画像に識別子を付与することを停止することができる。 According to the document digitizing apparatus of claim 2, when the extracted identifier and the stored identifier do not match each other, the image of the document is stored in the mismatch folder. It is possible to stop assigning an identifier to the image.

請求項３記載の書類電子化装置によれば、不一致フォルダに格納された書類に含まれる識別用書類の画像を表示して、表示されている識別用書類に記載された識別子を入力させることにより、入力された識別子に基づいて保存フォルダに保存するので、例えば抽出エラーがあったときでも入力された識別子に基づいて書類の画像に入力された識別子を付与して確実に保存フォルダにおいて分類することができる。 According to the document digitizing apparatus of the third aspect, by displaying an image of the identification document included in the document stored in the mismatched folder and inputting the identifier described in the displayed identification document Since it is stored in the storage folder based on the input identifier, for example, even if there is an extraction error, the input identifier is given to the image of the document based on the input identifier, and it is surely classified in the storage folder Can do.

請求項４記載の書類電子化装置によれば、抽出された識別子及び格納されている識別子が互いに一致するときは、書類の画像に一致した識別子に基づいて識別名を付与して保存フォルダに保存するので、識別子に基づいて書類の画像を確実に保存フォルダにおいて分類することができる。 According to the document digitizing apparatus of claim 4, when the extracted identifier and the stored identifier match each other, an identification name is given based on the identifier that matches the image of the document and saved in the save folder. Therefore, the image of the document can be surely classified in the storage folder based on the identifier.

請求項５記載の書類電子化装置によれば、識別子が付与された書類の画像に、格納されている書類属性情報を付与して保存フォルダに保存するので、書類属性情報に基づいて書類の画像を確実に保存フォルダにおいて分類することができる。 According to the document digitizing apparatus of claim 5, since the stored document attribute information is added to the image of the document to which the identifier is assigned and saved in the save folder, the image of the document is based on the document attribute information. Can be reliably classified in the storage folder.

請求項６記載の書類電子化装置によれば、書類属性情報に基づいて書類属性情報が付与された書類の画像を保存フォルダのサブフォルダに保存するので、書類属性情報に基づいて書類の画像を確実にサブフォルダにおいて分類することができる。 According to the document digitizing apparatus of the sixth aspect, since the image of the document to which the document attribute information is assigned based on the document attribute information is stored in the subfolder of the storage folder, the document image is surely determined based on the document attribute information. Can be categorized in subfolders.

請求項７記載の書類電子化装置によれば、抽出された識別子及び格納されている識別子が互いに一致した一致回数をカウントするので、例えば既に保存されている書類の画像に対して上書き保存した回数をカウントすることができる。また、書類が複数部であるときは、どの部の書類において抽出エラーが発生したのかを容易に特定することができ、ユーザの操作性を向上させることができる。 According to the document digitizing apparatus according to claim 7, the number of times that the extracted identifier and the stored identifier match each other is counted. Can be counted. Further, when there are a plurality of documents, it is possible to easily identify in which document the extraction error has occurred, and to improve the operability for the user.

請求項８記載の書類電子化装置によれば、所定の識別子が所定の領域に記載されるように、該所定の識別子が記載されていない識別用書類のフォーマットを画像出力装置に送信するので、書類電子化装置は所定の領域を容易に特定することができ、確実に抽出エラーを低減させることができる。 According to the document digitizing apparatus according to claim 8, since the format of the identification document in which the predetermined identifier is not described is transmitted to the image output device so that the predetermined identifier is described in the predetermined area, The document digitizing apparatus can easily specify a predetermined area, and can reliably reduce extraction errors.

請求項９記載の書類電子化装置、又は請求項１３記載のシステムによれば、端末装置が、書類電子化装置から識別用書類のフォーマットを受信して当該フォーマットに従って識別子を所定の領域に入力し、印刷装置が、識別子が入力された識別用書類を印刷し、画像読取装置が、印刷された識別用書類を含む書類の画像を読取り、その画像が書類電子化装置に出力されるので、書類電子化装置はより確実に抽出エラーを低減させることができる。 According to the document digitizing device according to claim 9 or the system according to claim 13, the terminal device receives the format of the identification document from the document digitizing device and inputs the identifier into a predetermined area according to the format. The printing device prints the identification document to which the identifier is inputted, the image reading device reads the image of the document including the printed identification document, and the image is output to the document digitizing device. The electronic apparatus can more reliably reduce extraction errors.

以下、本発明の実施の形態を図面を参照しながら説明する。 Embodiments of the present invention will be described below with reference to the drawings.

図１は、本発明の実施の形態に係る書類電子化システムの構成を示す図である。 FIG. 1 is a diagram showing a configuration of a document digitizing system according to an embodiment of the present invention.

図１において、書類電子化システム１は、書類電子化システム１のユーザが操作する利用者端末２と、紙書類から画像を読取って、例えばＰＤＦ形式の画像データとしてＰＤＦファイル（電子化ファイル）を作成する画像読取装置３と、作成されたＰＤＦファイルの画像データにＯＣＲ処理を行い、紙書類の文書属性を示す文書属性データを付与する書類電子化サーバ４（書類電子化装置）と、書類電子化サーバ４においてエラーが検出されたＰＤＦファイルにエラー処理を行うために管理者が操作する管理者端末５とを備え、これらの各装置はＬＡＮ６を介して互いに接続されている。 In FIG. 1, a document digitizing system 1 reads an image from a user terminal 2 operated by a user of the document digitizing system 1 and a paper document, and converts a PDF file (digitized file) as, for example, PDF format image data. An image reading device 3 to be created, a document digitizing server 4 (document digitizing device) that performs OCR processing on the image data of the created PDF file and assigns document attribute data indicating the document attribute of the paper document, and document electronics An administrator terminal 5 that is operated by an administrator to perform error processing on the PDF file in which an error is detected in the conversion server 4 is connected to each other via a LAN 6.

なお、各装置が互いに接続される通信回線は、当然のことながらＬＡＮのみには限られない。 Of course, the communication line to which the devices are connected to each other is not limited to the LAN.

利用者端末２は、ユーザが入力した所定の文書属性データから後述する登録票（識別用書類）を作成する（印刷する）と共に、文書管理番号を含む文書属性データを書類電子化サーバ４のＨＤＤ等の記録媒体に記録されている登録票ＤＢに登録する。なお、登録票ＤＢを管理するサーバ（不図示）は、別のコンピュータであってもよい。画像読取装置３は、登録票を表紙にした紙書類を画像データとして読取って、ＰＤＦファイルを作成し、書類電子化サーバ４は、ＰＤＦファイルの画像データにＯＣＲ処理を行い、登録票から読取った文書管理番号に対応する文書属性データをＰＤＦファイルに付与する。登録票から読取った文書管理番号に対応する文書属性データが書類電子化サーバ４に登録されていないときは、管理者端末５はエラー処理を行う。 The user terminal 2 creates (prints) a registration slip (identification document), which will be described later, from predetermined document attribute data input by the user, and sends document attribute data including a document management number to the HDD of the document digitizing server 4. Registered in the registration form DB recorded on the recording medium. Note that the server (not shown) for managing the registration form DB may be another computer. The image reading device 3 reads a paper document with a registration form as a cover as image data to create a PDF file, and the document digitizing server 4 performs OCR processing on the image data of the PDF file and reads it from the registration form. Document attribute data corresponding to the document management number is assigned to the PDF file. When the document attribute data corresponding to the document management number read from the registration form is not registered in the document digitizing server 4, the administrator terminal 5 performs error processing.

図２は、図１における利用者端末２の構成を示すブロック図である。なお、管理者端末５も利用者端末２と同様の構成を有し、また、書類電子化サーバ４も利用者端末２の部分的な構成と同様の構成を有するので、それらの説明を省略する。 FIG. 2 is a block diagram showing the configuration of the user terminal 2 in FIG. The administrator terminal 5 has the same configuration as the user terminal 2, and the document digitizing server 4 also has the same configuration as the partial configuration of the user terminal 2, so that the description thereof is omitted. .

図２において、利用者端末２は、プログラムを実行するＣＰＵ１１と、ＣＰＵ１１の主メモリやワークエリア等として機能するＲＡＭ１２と、ＣＰＵ１１が動作するためのプログラムが格納されているＲＯＭ１３と、記憶装置としてのＨＤＤ１４と、脱着自在な記録媒体にデータを記録可能な記録装置１５と、表示機能を提供するビデオアダプタ１６及び表示装置（不図示）と、入力装置としてのキーボード（Ｋ／Ｂ）１７及びマウス１８と、ネットワークに接続すると共に通信するデータを電気信号に変換するためのネットワークインターフェースカード（ＮＩＣ：Network Interface Card）１９と、登録票を印刷する印刷装置（不図示）とを備える。なお、１台の印刷装置が複数の利用者端末２でＬＡＮ６を介して共有されてもよい。 2, a user terminal 2 includes a CPU 11 that executes a program, a RAM 12 that functions as a main memory and a work area of the CPU 11, a ROM 13 that stores a program for operating the CPU 11, and a storage device. HDD 14, recording device 15 capable of recording data on a removable recording medium, video adapter 16 and display device (not shown) for providing a display function, keyboard (K / B) 17 and mouse 18 as input devices And a network interface card (NIC: Network Interface Card) 19 for converting data to be communicated to a network and an electrical signal, and a printing device (not shown) for printing a registration form. One printing apparatus may be shared by a plurality of user terminals 2 via the LAN 6.

図３は、図２における利用者端末２によって実行される登録票作成処理のフローチャートである。 FIG. 3 is a flowchart of a registration form creation process executed by the user terminal 2 in FIG.

図３において、まず、ユーザは、利用者端末２の表示装置に表示される図４に示すログイン画面に、該ユーザを特定するＩＤと、ユーザがログインするためのパスワードとから成る認証情報を入力し、ログインボタンを押下すると、利用者端末２は、入力された認証情報を書類電子化サーバ４に送信し（ステップＳ１０１）、書類電子化サーバ４から正当な利用者ではない旨を示すエラーメッセージを取得したか否かを判別する（ステップＳ１０２）。 In FIG. 3, first, the user inputs authentication information including an ID for identifying the user and a password for logging in to the login screen shown in FIG. 4 displayed on the display device of the user terminal 2. When the login button is pressed, the user terminal 2 transmits the input authentication information to the document digitizing server 4 (step S101), and an error message indicating that the user is not a valid user from the document digitizing server 4. Is determined (step S102).

なお、認証手段は、ユーザを識別・認証できるのであれば、ＩＤ、パスワードを用いた方法以外であってもよい。 Note that the authentication unit may be other than a method using an ID and a password as long as the user can be identified and authenticated.

ステップＳ１０２の判別の結果、書類電子化サーバ４に正当な利用者であると認証されず、書類電子化サーバ４からエラーメッセージを取得したときは、ステップＳ１０１に戻り、エラーメッセージを取得しなかったときは、書類電子化サーバ４から文書属性データ入力画面情報を取得し（ステップＳ１０３）、図５に示す文書属性データ入力画面を表示する（ステップＳ１０４）。 As a result of the determination in step S102, if the document digitizing server 4 is not authenticated as a valid user and an error message is obtained from the document digitizing server 4, the process returns to step S101 and no error message is obtained. When the document attribute data input screen information is obtained from the document digitizing server 4 (step S103), the document attribute data input screen shown in FIG. 5 is displayed (step S104).

なお、書類電子化サーバ４から取得する文書属性データ入力画面情報は、例えば、ＨＴＭＬ形式のデータであり、利用者端末においては、そのＨＴＭＬ形式のデータをＩｎｔｅｒｎｅｔＥｘｐｌｏｒｅｒ（商標）等のブラウザにより読み込み、図５に示す文書属性データ入力画面が表示される。 The document attribute data input screen information acquired from the document digitizing server 4 is, for example, HTML format data, and the user terminal reads the HTML format data with a browser such as the Internet Explorer (trademark), The document attribute data input screen shown in FIG. 5 is displayed.

ユーザは、自己の所属部課情報、管理者コード（管理者番号）（又は作成者コード（作成者番号））、文書種類、保存期間、作成日、コメントなどの文書属性データを文書属性データ入力画面上の各項目に入力し、登録ボタンの押下により文書属性データの送信を指示すると（ステップＳ１０５でＹＥＳ）、入力された文書属性データを書類電子化サーバ４に送信し（ステップＳ１０６）、送信した文書属性データの項目のうち少なくとも１つの項目に入力されていないことを示す入力漏れが、書類電子化サーバ４から通知されたか否かを判別し（ステップＳ１０７）、入力漏れが通知されたときは、図６に示すような入力漏れエラーメッセージを表示して（ステップＳ１０８）、ステップＳ１０４に戻り、ユーザに入力漏れの項目に対する文書属性データの入力を促す。 The user can enter the document attribute data such as his / her department information, administrator code (manager number) (or creator code (creator number)), document type, storage period, creation date, and comments. When each of the above items is entered and transmission of document attribute data is instructed by pressing the registration button (YES in step S105), the input document attribute data is transmitted to the document digitizing server 4 (step S106). It is determined whether or not an input omission indicating that at least one of the items of the document attribute data has not been inputted is notified from the document digitizing server 4 (step S107). 6, an input omission error message as shown in FIG. 6 is displayed (step S108), and the process returns to step S104. It prompts you to enter the book attribute data.

なお、文書属性データ入力画面上における各項目は、ユーザにより選択又は入力されるが、文書属性データ入力画面を表示する画面において（ステップＳ１０４）、本システムとは別で管理しているデータベースからログインユーザに関する該当項目を取得できるのであれば、既に選択又は入力されている状態にすることも可能である。例えば、グループウェアで管理しているユーザに関する所属部課情報が取得できるのであれば、書類電子化サーバは、ログインユーザの所属部課情報を取得し、文書属性データ入力画面情報に取得したログインユーザの所属部課情報を付加してユーザ端末に送信する。 Each item on the document attribute data input screen is selected or input by the user. On the screen that displays the document attribute data input screen (step S104), the user logs in from a database managed separately from the system. As long as the corresponding item regarding the user can be acquired, it is also possible to make the state already selected or input. For example, if the department information related to the user managed by the groupware can be acquired, the document digitization server acquires the department information of the login user and belongs to the login user acquired in the document attribute data input screen information. Add the department information and send it to the user terminal.

また、後述する書類電子化サーバ４の登録票データベース（ＤＢ）に既に登録されている文書属性データから選択又は入力されてもよい。 Further, it may be selected or inputted from document attribute data already registered in a registration form database (DB) of the document digitizing server 4 described later.

なお、書類を電子化する場合、書類の所有者自身が登録票を作成・印刷し、当該登録票を用いて、画像読取装置３から書類の電子化を行うケースが、一般に考えられるが、登録票を用いて画像読取装置３から書類の電子化を行うのは煩雑であるため、他の者に電子化作業の分担を依頼できれば便宜である。このように電子化作業を分担する場合は、整理された登録票と書類を渡して電子化を依頼すれば済むが、更に登録票の作成も他の者に依頼できればより作業分担を図ることができるようになる。そのため、作業分担を依頼した者が登録票を作成する時に、電子化したファイルを利用する者（通常は、作成依頼をした者）の管理番号を入力するようにし，最終的に電子化したファイルを所定の利用者のフォルダに格納することにしている。図５及び図９において、管理番号（管理者コード）は、電子化ファイルを利用する者（＝ファイルの中身を理解して入用する者）を意味し、作成者（作成者コード）は、登録票を作成する者を意味する。 In the case of digitizing a document, it is generally considered that the owner of the document creates and prints a registration slip and digitizes the document from the image reading device 3 using the registration slip. Since it is cumbersome to digitize a document from the image reading device 3 using a vote, it is convenient if another person can be requested to share the digitization work. In this way, when sharing electronic work, it is sufficient to request the digitization by handing out the organized registration forms and documents, but if you can also ask other people to create registration forms, it is possible to share the work more become able to. Therefore, when the person who requested the work sharing creates a registration slip, the management number of the person who uses the digitized file (usually the person who requested the creation) is entered, and finally the digitized file Is stored in a folder of a predetermined user. 5 and 9, the management number (administrator code) means a person who uses the digitized file (= a person who understands and uses the contents of the file), and the creator (creator code) is It means a person who creates a registration form.

なお、登録票を作成する者は、登録票を作成する際に、少なくとも電子化するファイルの管理番号を知る必要があるが、管理者から直接聞くか、管理番号が社員コードである場合は、グループウェアの社員検索機能等を利用して調べる必要がある。 The person who creates the registration slip needs to know at least the management number of the file to be digitized when creating the registration slip, but if the management number is an employee code, It is necessary to investigate using the employee search function of groupware.

なお、図５において「ファイル名」の項目に「自動設定（文書種類−コメント−文書管理番号）」と記載されているが、これは、最終的な電子化ファイルに付与される名前（識別名）が、書類電子化サーバにより自動的に設定され、その名前の付け方として（文書種類−コメント−文書管理番号）が付与されることを意味しているに過ぎず、その他の名前の付与方法及び付与法則を採ることができるのはいうまでもない。 In FIG. 5, “automatic setting (document type−comment−document management number)” is described in the “file name” item. This is the name (identification name) given to the final digitized file. ) Is automatically set by the document computerization server, and only means that (document type-comment-document management number) is given as the name assignment method. It goes without saying that the law of grant can be adopted.

なお、書類電子化サーバにおいて項目に対する入力漏れのチェックが行われるが、１つでも入力漏れがあったら通知するような仕組にする必要はなく、例えば、「コメント」についての入力がなかった場合には、入力漏れの通知をしないというように、入力漏れの通知をしない項目を設けてもよい。 Note that the document digitization server checks for input omissions for items, but there is no need to provide a mechanism to notify if there is any omission, for example, when there is no input for “comment” May provide an item that does not notify input leakage such as not reporting input leakage.

ステップＳ１０７の判別の結果、入力漏れが通知されなかったときは、ユーザが不図示の印刷ボタンを介して印刷の指示を行うと（ステップＳ１０９でＹＥＳ）、利用者端末２は、図７に示すような登録票を作成し（ステップＳ１１０）、作成された登録票の印刷指示をその印刷装置に送信し（ステップＳ１１１）、該印刷指示に従って印刷装置は、図７に示す所定の領域に文書管理番号が記載されるように登録票を印刷し、そして本処理を終了する。 If the input omission is not notified as a result of the determination in step S107, when the user gives a print instruction via a print button (not shown) (YES in step S109), the user terminal 2 is shown in FIG. Such a registration slip is created (step S110), and a printing instruction for the created registration slip is transmitted to the printing apparatus (step S111). According to the printing instruction, the printing apparatus manages a document in a predetermined area shown in FIG. The registration slip is printed so that the number is written, and the process is terminated.

なお、図７における「文書管理番号」は、利用者端末２から文書属性データが送信されたときに書類電子化サーバが付与するユニークな番号である。ユニーク（一意）な番号にするため、例えば、「個人コード＋個人コード毎の累積番号」といった番号を付与することが考えられる。図７における「文書管理番号」は「１２３４５６０００００１」であるが、これは「１２３４５６」は個人コードで、「０００００１」はユーザ「１２３４５６」が登録した文書属性データの累積数を示している。 The “document management number” in FIG. 7 is a unique number assigned by the document digitizing server when document attribute data is transmitted from the user terminal 2. In order to obtain a unique number, for example, it is possible to give a number such as “personal code + accumulated number for each personal code”. The “document management number” in FIG. 7 is “123456000001”, where “123456” is a personal code and “000001” indicates the cumulative number of document attribute data registered by the user “123456”.

なお、「文書管理番号」は、ユニーク（一意）な番号でさえあればよく、一意性を確保できるのであれば各利用者端末２側において付与する仕組みであってもよい。 The “document management number” only needs to be a unique number, and may be assigned on each user terminal 2 side as long as uniqueness can be ensured.

なお、利用者端末２における登録票の作成・印刷は、１つの例として、ＲＡＭ１２若しくはＨＤＤ１４等に記憶されている書類電子化サーバに送信した文書属性データ又は書類電子化サーバから受信した文書属性データから、登録票に印刷する項目を取り出し、取り出した項目を雛形（フォーム）の所定のエリアに印刷されるように表計算ソフト等を介して作成・印刷する。ここで、「文書管理番号」は、書類電子化サーバが付与するため、書類電子化サーバから受信したデータを利用することになる。ただし、「文書管理番号」を各利用者端末において管理し付与する場合においては書類電子化サーバからデータを受信しなくてもよい。 Note that creation / printing of a registration form in the user terminal 2 is, for example, document attribute data transmitted to the document digitizing server stored in the RAM 12 or the HDD 14 or document attribute data received from the document digitizing server. Then, an item to be printed on the registration form is taken out, and the taken out item is created and printed via spreadsheet software or the like so as to be printed in a predetermined area of the template. Here, since the “document management number” is assigned by the document digitizing server, the data received from the document digitizing server is used. However, when the “document management number” is managed and assigned at each user terminal, data need not be received from the document digitizing server.

図３の処理によれば、利用者端末２は、登録票を作成し（ステップＳ１１０）、作成された登録票の印刷指示を印刷装置に送信する（ステップＳ１１１）ので、印刷された登録票を用いて、画像読取装置３を用いて読取るべき紙書類を分類することができる。 According to the processing of FIG. 3, the user terminal 2 creates a registration slip (step S110), and transmits a print instruction for the created registration slip to the printing apparatus (step S111). It is possible to classify paper documents to be read using the image reading device 3.

また、文書属性データを書類電子化サーバ４に送信する（ステップＳ１０６）ので、書類電子化サーバ４に送信された文書属性データと、画像読取装置３によって読取られた登録票の画像とを容易に比較することができる。 Since the document attribute data is transmitted to the document digitizing server 4 (step S106), the document attribute data transmitted to the document digitizing server 4 and the image of the registration slip read by the image reading device 3 can be easily obtained. Can be compared.

図８は、図１における書類電子化サーバ４によって実行される文書属性データ登録処理のフローチャートである。 FIG. 8 is a flowchart of the document attribute data registration process executed by the document digitizing server 4 in FIG.

図８の処理は、図４の登録票作成処理と並行して行われる。 The process of FIG. 8 is performed in parallel with the registration form creation process of FIG.

図８において、書類電子化サーバ４は、利用者端末２がステップＳ１０１で送信した認証情報を取得したときは（ステップＳ２０１でＹＥＳ）、取得した認証情報から正当な利用者であるか否かを判別する（ステップＳ２０２）。 In FIG. 8, when the document electronic server 4 acquires the authentication information transmitted by the user terminal 2 in step S101 (YES in step S201), the document electronic server 4 determines whether or not the user is a valid user from the acquired authentication information. It discriminate | determines (step S202).

ステップＳ２０２の判別の結果、正当な利用者でないときは、その旨をエラーメッセージを利用者端末２に送信して（ステップＳ２０３）、ステップＳ２０１に戻り、正当な利用者であるときは、利用者端末２に文書属性データ入力画面情報（ＨＴＭＬ等のフォーマット）を送信する（ステップＳ２０４）。 If it is determined in step S202 that the user is not a valid user, an error message is transmitted to the user terminal 2 (step S203), and the process returns to step S201. Document attribute data input screen information (format such as HTML) is transmitted to the terminal 2 (step S204).

次いで、ステップＳ１０６で利用者端末２から送信された文書属性データを取得したときは（ステップＳ２０５でＹＥＳ）、取得した文書属性データの各項目のうち少なくとも１つの項目に入力漏れがあるか否かを判別し（ステップＳ２０６）、入力漏れがあるときは、利用者端末２に入力漏れ通知を送信して（ステップＳ２０７）、ステップＳ２０５に戻る。 Next, when the document attribute data transmitted from the user terminal 2 is acquired in step S106 (YES in step S205), whether or not there is an input omission in at least one item among the items of the acquired document attribute data. (Step S206), and when there is an input omission, an input omission notice is transmitted to the user terminal 2 (step S207), and the process returns to step S205.

なお、図３の説明でしたように、項目漏れの通知をする項目と通知をしない項目を分ける仕組みであってもよいのはいうまでもない。 As described with reference to FIG. 3, it goes without saying that a mechanism may be used in which items that are notified of item omission are separated from items that are not notified.

ステップＳ２０６の判別の結果、入力漏れがないときは、取得した文書属性データを図９に示すような登録票ＤＢに登録して（ステップＳ２０８）、本処理を終了する。 If there is no omission as a result of the determination in step S206, the acquired document attribute data is registered in the registration form DB as shown in FIG. 9 (step S208), and this process is terminated.

なお、「文書管理番号」を書類電子化サーバ４側で付与する場合、取得した文書属性データを登録票ＤＢに登録する際、ユニーク（一意）な「文書管理番号」を付与し登録する。 In addition, when the “document management number” is assigned on the document digitizing server 4 side, when the acquired document attribute data is registered in the registration form DB, a unique (unique) “document management number” is assigned and registered.

なお、「文書管理番号」を書類電子化サーバ４側で付与した場合、登録票は利用者端末２側で印刷がされるため、登録票の印刷に必要な「文書管理番号」を利用者端末２に通知する必要がある。 When the “document management number” is assigned on the document digitizing server 4 side, the registration slip is printed on the user terminal 2 side. Therefore, the “document management number” necessary for printing the registration slip is used on the user terminal. 2 need to be notified.

登録票ＤＢには、図９に示すように、文書管理番号、作成者コード、管理者コード、処理カウンタなどの文書属性情報（文書属性データ）が登録票データとして登録されている（格納手段）。処理カウンタの値は、登録票ＤＢに登録された登録票データがＰＤＦファイルに付与されると、カウントアップされる。 In the registration form DB, as shown in FIG. 9, document attribute information (document attribute data) such as a document management number, creator code, administrator code, and processing counter is registered as registration form data (storage means). . The value of the processing counter is counted up when the registration form data registered in the registration form DB is added to the PDF file.

ここで、上述したように、「作成者コード」には、登録票を作成した者を識別するための番号が登録され、「管理者コード」には、当該文書管理番号を有する登録票により電子化されたファイルの利用をする者を識別するための番号が登録される。１つの例として、図９において、文書管理番号「１２３４５７０００００１」を有する登録票を作成した者は、「２３４５６７８」を識別コードに持つ者であり、当該文書管理番号を有する登録票により電子化されたファイルを利用する者は、「１２３４５７」を識別コードに持つ者であることを示している。 Here, as described above, a number for identifying the person who created the registration slip is registered in the “creator code”, and the “administrator code” is electronically registered with the registration slip having the document management number. A number for registering the person who uses the converted file is registered. As an example, in FIG. 9, the person who created the registration slip having the document management number “123457000001” is the person who has “2345678” as the identification code, and was digitized by the registration slip having the document management number. The person who uses the file indicates that the person has “123457” in the identification code.

ステップＳ１０２，Ｓ２０２の処理で認証されたユーザは、登録票ＤＢに登録されている登録票データのうち該ユーザに対応する登録票データを、図１０に示すような登録票一覧画面を利用者端末２の表示装置に一覧表示させることが可能である。 The user who is authenticated in the processes of steps S102 and S202 displays the registration form data corresponding to the user among the registration form data registered in the registration form DB, and displays the registration form list screen as shown in FIG. It is possible to display a list on two display devices.

なお、一覧表示させる登録票データとして、図９に示す登録票ＤＢ中の「管理者コード」と該ユーザを識別するコード（当該コードは、例えばログイン情報等から取得する。）とが合致するものを一覧表示させてもよいし、「管理者コード」又は「作成者コード」のいずれかのコードと該ユーザを識別するコードが合致するものを一覧表示させてもよい。 As the registration form data to be displayed as a list, the “administrator code” in the registration form DB shown in FIG. 9 matches the code for identifying the user (the code is acquired from, for example, login information). May be displayed as a list, or a list of codes in which either the “manager code” or the “creator code” matches the code for identifying the user may be displayed.

また、一覧表示させる登録票のデータを該ユーザに限らず、登録票ＤＢの全ての登録票データ又は特定の所属部課に属する登録票のデータを一覧表示させるようにしてもよい。 The registration form data to be displayed in a list is not limited to the user, and all registration form data in the registration form DB or registration form data belonging to a specific department may be displayed as a list.

さらに、登録票ＤＢに保存されている各項目に関する検索条件を入力するための条件入力画面（不図示）を設けることも可能である。このような条件入力画面を設けることにより、例えば、「文書種類」が契約書であり「作成日」が２００３年１月１日以降である登録票データを一覧表示させるといったことも可能になる。 Furthermore, it is possible to provide a condition input screen (not shown) for inputting a search condition regarding each item stored in the registration form DB. By providing such a condition input screen, for example, it is possible to display a list of registration form data whose “document type” is a contract and whose “creation date” is January 1, 2003 or later.

なお、図１０において、ファイル名には、少なくとも文書管理番号が付与されており、これにより、ユーザは該ユーザに対応する登録票の文書管理番号を容易に視認することが可能である。 In FIG. 10, at least a document management number is assigned to the file name, so that the user can easily recognize the document management number of the registration form corresponding to the user.

また、印刷された登録票を用いて書類の電子化が行われたか否かの判断をするため「処理有無」の項目も設けられている。これにより「未」であれば、まだ書類の電子化が行われていないと判断でき、「済」であれば、既に書類の電子化が行われていると判断することができる。なお、この項目は、登録票ＤＢの処理カウンタの値が、「Ｎｕｌｌ」の場合は「未」と、「１」以上の場合は「済」と表示されるようになっているが、「済」の場合は、カウント値をそのまま表示するようにしてもよい。そうすることにより何度電子化をしたかを判断することができるようになる。 In addition, an item “presence / absence of processing” is also provided to determine whether or not the document has been digitized using the printed registration form. Accordingly, if “not yet”, it can be determined that the document has not been digitized yet, and if “completed”, it can be determined that the document has already been digitized. This item is displayed as “not yet” when the value of the processing counter in the registration slip DB is “Null”, and “done” when it is “1” or more. ", The count value may be displayed as it is. By doing so, it becomes possible to determine how many times the computer has been digitized.

また、「未」又は「済」の文字にリンク情報を埋め込み、ユーザが当該文字をマウス１８等でクリック等した場合は、当該登録票に関するより詳細な情報を登録票ＤＢから取得して利用者端末２において表示されるようにしてもよいし、当該登録票に関する登録票ＤＢの登録内容を修正する画面が表示されるようにしてもよい。ここで、修正を認める項目としては、通常、ユーザが最初に登録する「文書種類」「コメント」等となる。書類電子化サーバ側で付与される「文書管理番号」や「ファイル名」を修正できるようにしたのでは、同一「文書管理番号」や「ファイル名」が既に存在した場合に、上書きされてしまうからである。ただし、上書きは後述するエラー処理で防止することができるため、このような場合は、当然のことながら「文書管理番号」や「ファイル名」についてもユーザが修正することができるようにしてもよい。 In addition, when link information is embedded in the characters “uncompleted” or “completed” and the user clicks the character with the mouse 18 or the like, more detailed information on the registration slip is obtained from the registration slip DB, and the user You may make it display on the terminal 2, and you may make it display the screen which corrects the registration content of the registration form DB regarding the said registration form. Here, the items that are permitted to be corrected are usually “document type”, “comment”, etc., which the user first registers. If the "document management number" and "file name" assigned on the document digitizing server can be modified, they will be overwritten if the same "document management number" or "file name" already exists. Because. However, since overwriting can be prevented by error processing to be described later, in such a case, it is natural that the user can also correct the “document management number” and “file name”. .

なお、リンク情報を埋め込むのは「処理有無」の項目に限られないのはいうまでもない。例えば、「ファイル名」の項目にリンク情報を埋め込んでもよい、また、リンク情報を表示するための専用の項目を別に設けて、書類電子化サーバ４又は利用者端末２が所定の番号を付与して、該番号にリンク情報を埋め込んでもよい。 Needless to say, embedding the link information is not limited to the “process presence / absence” item. For example, link information may be embedded in the “file name” item, or a dedicated item for displaying the link information is provided separately, and the document digitizing server 4 or the user terminal 2 assigns a predetermined number. Thus, link information may be embedded in the number.

図８の処理によれば、ステップＳ１０６で利用者端末２から送信された文書属性データを取得したときは（ステップＳ２０５でＹＥＳ）、取得した文書属性データを登録票ＤＢに登録する（ステップＳ２０８）ので、登録票ＤＢに登録されている文書属性データ（登録票データ）に基づいて、画像読取装置３を用いてそれらの画像を読取るべき紙書類を分類することができると共に、登録票ＤＢに登録されている登録票データと、画像読取装置３によって読取られた登録票の画像とを容易に比較することができる。 According to the process of FIG. 8, when the document attribute data transmitted from the user terminal 2 is acquired in step S106 (YES in step S205), the acquired document attribute data is registered in the registration form DB (step S208). Therefore, based on the document attribute data (registration form data) registered in the registration form DB, it is possible to classify paper documents whose images should be read using the image reading device 3, and register them in the registration form DB. The registered registration slip data can be easily compared with the registration slip image read by the image reading device 3.

次に、画像読取装置３で紙書類の画像を読取って作成されたＰＤＦファイルの電子化処理を説明する。 Next, digitization processing of a PDF file created by reading an image of a paper document with the image reading device 3 will be described.

図１１は、図１における書類電子化サーバ４のディレクトリ構成の一例を示す図である。 FIG. 11 is a diagram showing an example of the directory structure of the document digitizing server 4 in FIG.

書類電子化サーバ４は、図１１に示すように、監視先フォルダ、ＯＣＲ処理後フォルダ、及びエラーフォルダを有する。画像読取装置３は、登録票を先頭ページとする複数枚の紙書類の画像を読取って作成したＰＤＦファイルを、書類電子化サーバ４に送信して書類電子化サーバ４の監視先フォルダに格納する。なお、紙書類が複数部であるときは、各部の先頭ページに登録票が挿入されるが、画像読取装置３は、複数部の紙書類を一括して読取って１つのＰＤＦファイルを作成することになる。このように作成されたＰＤＦファイルには、例えば紙書類の画像を読取ったときの西暦年・月日・時刻に基づいて図１１に示す「２００３０７０１０９１７２７．ｐｄｆ」といったファイル名が付与される。 As shown in FIG. 11, the document digitizing server 4 has a monitoring destination folder, a folder after OCR processing, and an error folder. The image reading device 3 transmits a PDF file created by reading images of a plurality of paper documents having a registration slip as the first page to the document digitizing server 4 and stores it in a monitoring destination folder of the document digitizing server 4. . When there are a plurality of paper documents, a registration slip is inserted on the first page of each copy. However, the image reading device 3 reads a plurality of paper documents at a time and creates one PDF file. become. The PDF file created in this way is given a file name such as “20030701091727.pdf” shown in FIG. 11 based on the year, month, day, and time when the image of the paper document is read.

図１２は、図１における書類電子化サーバ４によって実行されるファイル分類処理のフローチャートである。 FIG. 12 is a flowchart of the file classification process executed by the document digitizing server 4 in FIG.

図１２において、画像読取装置３が、紙書類の画像を読取ってＰＤＦファイルを作成し書類電子化サーバの監視先フォルダに送信した後、書類電子化サーバ４は、監視先フォルダにＰＤＦファイルがあるか否かの判断を行う。ＰＤＦファイルがあるときは（ステップＳ３０１でＹＥＳ）、書類電子化サーバ４は監視先フォルダからＰＤＦファイルを取得する（ステップＳ３０２）。 In FIG. 12, after the image reading device 3 reads an image of a paper document to create a PDF file and transmits it to the monitoring destination folder of the document digitizing server, the document digitizing server 4 has the PDF file in the monitoring destination folder. Judge whether or not. If there is a PDF file (YES in step S301), the document digitizing server 4 acquires the PDF file from the monitoring destination folder (step S302).

なお、書類電子化サーバ４が、取得したファイルは、ＲＡＭ１２等のワークエリアに格納され、プログラムにより利用されることになる。以下、同様とする。 The file acquired by the document digitizing server 4 is stored in a work area such as the RAM 12 and used by a program. The same shall apply hereinafter.

なお、ＰＤＦファイルがあるか否かの監視の方法の一例として、定期的に（例えば１０秒）に一度と監視先フォルダにＰＤＦファイルがあるか否かを検索する方法が考えられる。なお、定期的に行う検索の間隔が短い場合、画像読取装置３から送信されるＰＤＦファイルが監視先フォルダに完全に保存される前に、書類電子化サーバ４がＰＤＦファイルを取得してしまう虞があるので、このような場合は、ＰＤＦファイルサイズの変動を監視して完全なＰＤＦファイルを取得する手法を採ることが考えられる。 As an example of a method for monitoring whether or not there is a PDF file, a method of searching whether or not there is a PDF file in the monitoring destination folder at regular intervals (for example, 10 seconds) can be considered. Note that if the interval between periodic searches is short, the document digitizing server 4 may acquire the PDF file before the PDF file transmitted from the image reading device 3 is completely stored in the monitoring destination folder. Therefore, in such a case, it is conceivable to take a method of obtaining a complete PDF file by monitoring a change in the PDF file size.

なお、ＰＤＦファイルであるか否かの検索方法として、例えば、ファイルの拡張子（．ｐｄｆ）を検索する方法がある。 As a search method for determining whether the file is a PDF file, for example, there is a method of searching for the extension (.pdf) of the file.

ステップＳ３０３で、取得したＰＤＦファイルに天地補正等のような所定の補正処理を行い、補正処理したＰＤＦファイル内から「登録票」という文字を検索する（ステップ３０４）。登録票には、「登録票」という文字が図７に示すような座標に常に印刷されるので、書類電子化サーバ４に、図１３に示すように、予め「登録票」という文字が印刷される座標を含む所定の領域６００を指定しておき、ステップＳ３０４の検索処理では、例えば、図１３に示すような領域６００の画像データに対してＯＣＲ処理を行い、「登録票」という文字があるか否かを検索する。 In step S303, the acquired PDF file is subjected to predetermined correction processing such as top-and-bottom correction, and the character “registration card” is searched from the corrected PDF file (step 304). Since the characters “registration slip” are always printed on the coordinates as shown in FIG. 7 on the registration slip, the characters “registration slip” are printed in advance on the document electronic server 4 as shown in FIG. For example, in the search processing in step S304, OCR processing is performed on the image data in the region 600 as shown in FIG. Search whether or not.

なお、所定の領域６００の画像データに対してＯＣＲ処理を行い、最終的に当該領域６００のＯＣＲ後の文字コードを取得することができればよいので、例えば、所定の領域の画像データを切り出して、その切り出した画像データに対してＯＣＲ処理を行って、文字コードを取得してもよいし、全ページにＯＣＲ処理を行った後に、所定の領域の文字コードを取得するようにしてもよい。なお、各部の先頭ページに登録票が挿入された複数部の紙書類に対応するＰＤＦファイルの場合には、所定の領域に「登録票」という文字があるか否かを検索することにより、画像読取装置３によって読取られた登録票以外の紙文書の画像データと登録票の画像データ（以下、「登録票画像」という）とをほぼ確実に区別することができる。 Note that it is only necessary to perform OCR processing on the image data of the predetermined area 600 and finally obtain the character code after the OCR of the area 600. For example, the image data of the predetermined area 600 is cut out, The extracted image data may be subjected to OCR processing to acquire a character code, or may be acquired after performing OCR processing on all pages. In the case of a PDF file corresponding to a plurality of paper documents in which a registration slip is inserted in the first page of each copy, an image is obtained by searching whether or not there is a character “registration slip” in a predetermined area. The image data of the paper document other than the registration slip read by the reading device 3 and the image data of the registration slip (hereinafter referred to as “registration slip image”) can be almost certainly distinguished.

次いで、登録票画像が複数あるか否かを判別する（ステップＳ３０６）。 Next, it is determined whether or not there are a plurality of registration form images (step S306).

ステップＳ３０６の判別の結果、登録票画像が複数あるときは、登録票画像ごとに、即ち紙文書の各部に対応するようにＰＤＦファイルを分割し（ステップＳ３０７）、分割ファイルをＲＡＭ１２等に格納する。そして、分割後のＰＤＦファイルから登録票画像を１つ取得して（ステップＳ３０８）、取得した登録票画像に対してＯＣＲ処理を行う（ステップＳ３０９）。一方、登録票画像が１つであるときは、ステップＳ３０７，ステップＳ３０８の処理をスキップして、登録票画像に対してＯＣＲ処理を行う（ステップＳ３０９）。 If there are a plurality of registration form images as a result of the determination in step S306, the PDF file is divided for each registration form image, that is, corresponding to each part of the paper document (step S307), and the divided file is stored in the RAM 12 or the like. . Then, one registration form image is acquired from the divided PDF file (step S308), and OCR processing is performed on the acquired registration form image (step S309). On the other hand, when there is only one registration form image, the processing of steps S307 and S308 is skipped, and the OCR process is performed on the registration form image (step S309).

次いで、図１４に示すような登録票の画像の所定の領域７００の所定の位置（座標）７１０ａに対応する、ＯＣＲ処理後の登録票画像から文書管理番号を取得し（ステップＳ３１０）（抽出手段）、ＯＣＲ処理後のＰＤＦファイルのファイル名を、取得した文書管理番号に置換して（ステップＳ３１１）、そのファイル名に置換されたＰＤＦファイルを図１５に示すようにＯＣＲ処理後フォルダに移動する（ステップＳ３１２）。 Next, a document management number is acquired from the registration form image after OCR processing corresponding to a predetermined position (coordinates) 710a of a predetermined area 700 of the registration form image as shown in FIG. 14 (step S310) (extraction means) Then, the file name of the PDF file after the OCR process is replaced with the acquired document management number (step S311), and the PDF file replaced with the file name is moved to the folder after the OCR process as shown in FIG. (Step S312).

なお、文書管理番号が印刷される所定の領域７００（座標７１０ａ）は、書類電子化サーバ４に指定されている。 A predetermined area 700 (coordinates 710a) where the document management number is printed is designated by the document digitizing server 4.

なお、重複処理を防止するため、ＯＣＲ処理後フォルダに移動する際、監視先フォルダ内にある元のＰＤＦファイルは削除する。 In order to prevent duplication processing, when moving to the folder after OCR processing, the original PDF file in the monitoring destination folder is deleted.

ステップＳ３０７の処理でＰＤＦファイルを分割した場合は、全ての分割されたＰＤＦファイルに対してステップＳ３０８〜Ｓ３１２の各処理が済んだか否かを判別し（ステップＳ３１３）、全ての分割されたＰＤＦファイルに対して各処理が済んでないないときは、ステップＳ３０８以降の処理を繰り返し、全ての分割されたＰＤＦファイルに対して各処理が済んだときは、本処理を終了する。したがって、紙書類に登録票が複数枚ある場合、即ちＰＤＦファイルが複数個に分割された場合には、図１６に示すように、ＯＣＲ処理後フォルダに複数個のＰＤＦファイルが格納されることになる。 When the PDF file is divided in the process of step S307, it is determined whether or not each process of steps S308 to S312 has been completed for all the divided PDF files (step S313), and all the divided PDF files are determined. However, when the processes are not completed, the processes after step S308 are repeated, and when the processes are completed for all the divided PDF files, the process is terminated. Therefore, when there are a plurality of registration slips in the paper document, that is, when the PDF file is divided into a plurality of pieces, a plurality of PDF files are stored in the folder after the OCR processing as shown in FIG. Become.

図１２の処理によれば、登録票の画像の所定の領域７００の所定の位置７１０ａ（図１４）に対応する、ＯＣＲ処理後の登録票画像から文書管理番号を抽出し（ステップＳ３１０）、ＯＣＲ処理後のＰＤＦファイルのファイル名を、取得した文書管理番号に置換して（ステップＳ３１１）、そのＰＤＦファイルをＯＣＲ処理後フォルダに移動する（ステップＳ３１２）ので、書類電子化サーバ４の画像読取後のＰＤＦファイルを、場合によっては分割して、文書管理番号に基づいてＯＣＲ処理後フォルダに保存することができ、ＯＣＲ処理後フォルダに保存されたＰＤＦファイルの文書管理番号と、登録票ＤＢに格納されている文書管理番号とを容易に比較することができる。 12, the document management number is extracted from the registration form image after the OCR process corresponding to the predetermined position 710a (FIG. 14) in the predetermined area 700 of the registration form image (step S310), and the OCR is performed. The file name of the PDF file after processing is replaced with the acquired document management number (step S311), and the PDF file is moved to the folder after OCR processing (step S312). PDF files can be divided in some cases and stored in a folder after OCR processing based on the document management number. The document management number of the PDF file stored in the folder after OCR processing is stored in the registration form DB. It can be easily compared with the document management number.

上述したように、書類電子化サーバ４は、書類電子化サーバ４に固有の登録票（フォーマット）をＬＡＮ６上の各利用者端末２に作成させ（ステップＳ２０４）、且つ、その印刷物を含む紙書類を画像読取装置３が読取って作成したＰＤＦファイルを書類電子化サーバ４に送信させる（ステップＳ３０１）。したがって、書類電子化サーバ４は、文書管理番号を抽出する対象領域７００を書類電子化サーバ４に固有の登録票の印刷物の画像データとすることができ、もって、ユーザが文書管理番号の読取り領域７００を指定することをなくすことができると共に、書類電子化サーバ４は文書管理番号の読取りエラーの発生を低減することができる。 As described above, the document digitizing server 4 causes each user terminal 2 on the LAN 6 to create a registration form (format) unique to the document digitizing server 4 (step S204), and the paper document including the printed matter. Is transmitted to the document digitizing server 4 (step S301). Therefore, the document digitizing server 4 can set the target area 700 from which the document management number is extracted as the image data of the printed matter of the registration slip unique to the document digitizing server 4, and the user can read the document management number reading area. It is possible to eliminate the designation of 700, and the document digitizing server 4 can reduce the occurrence of a document management number reading error.

図１７は、図１における書類電子化サーバ４によって実行される文書属性データ付与処理のフローチャートである。 FIG. 17 is a flowchart of the document attribute data adding process executed by the document digitizing server 4 in FIG.

図１７において、書類電子化サーバ４は、ＯＣＲ処理後フォルダにＰＤＦファイルがあるときは（ステップＳ４０１でＹＥＳ）、ＯＣＲ処理後フォルダからＰＤＦファイルを取得し（ステップＳ４０２）、取得したＰＤＦファイルからファイル名である文書管理番号を抽出する（ステップＳ４０３）（抽出手段）。 In FIG. 17, when there is a PDF file in the folder after the OCR process (YES in step S401), the document digitizing server 4 acquires the PDF file from the folder after the OCR process (step S402), and the file from the acquired PDF file A document management number as a name is extracted (step S403) (extraction means).

なお、ＯＣＲ処理後フォルダにＰＤＦファイルがあるか否かの検索手法は、図１２におけるステップＳ３０１と同様である。 Note that the search method for determining whether or not there is a PDF file in the folder after OCR processing is the same as in step S301 in FIG.

次いで、抽出した文書管理番号をキーにして、登録票ＤＢに問い合わせを行い（ステップＳ４０４）、抽出した文書管理番号と同一の文書管理番号を含む文書属性データが登録票ＤＢに登録されているか否かを判別し（ステップＳ４０５）（判別手段）、抽出した文書管理番号と同一の文書管理番号を含む文書属性データが登録票ＤＢに登録されていないとき、例えば、図１８に示すように、登録票の文書管理番号「１２３４５６０００００１」の領域７１０ｂにおける「１２３４５６０」と「００００１」との間に汚れ「．」（ノイズ）がある（図１８）ために図１２のステップＳ３１０で文書管理番号の読取りエラーが発生したときは、読取りエラーがあったままのファイル名に置換されているため、このファイル名から取得した文書管理番号をキーにして登録票ＤＢを検索しても、合致する文書管理番号を検出することはできない。このような場合は、図１９に示すように、そのＰＤＦファイルをエラーフォルダ（不一致フォルダ）内に移動して（保存手段）、後述するエラー処理を管理用端末５が実行し（ステップＳ４０６）、本処理を終了する。 Next, an inquiry is made to the registration form DB using the extracted document management number as a key (step S404), and whether or not document attribute data including the same document management number as the extracted document management number is registered in the registration form DB. (Step S405) (discriminating means), and when document attribute data including the same document management number as the extracted document management number is not registered in the registration form DB, for example, as shown in FIG. There is a dirt “.” (Noise) between “1234560” and “00001” in the area 710b of the document management number “123456000001” of the vote (FIG. 18). Occurs, it is replaced with the file name with the reading error. A search of the registration form DB and the number to the key, it is not possible to detect the document management number matching. In such a case, as shown in FIG. 19, the PDF file is moved into the error folder (mismatch folder) (save means), and the management terminal 5 executes error processing to be described later (step S406). This process ends.

なお、重複処理を防止するため、エラーフォルダに移動する際、ＯＣＲ処理後フォルダ内にある元のＰＤＦファイルは削除する。 In order to prevent duplication processing, when moving to the error folder, the original PDF file in the folder after OCR processing is deleted.

ステップＳ４０５の判別の結果、抽出した文書管理番号と同一の文書管理番号を含む文書属性データが登録票ＤＢに登録されているときは、登録票ＤＢに登録されている文書属性データのうち、文書種類、コメント、管理者コードなどの所定のデータを取得し（ステップＳ４０７）、図２０に示すように、ＰＤＦファイルのファイルサイズやページ数を含むプロパティに取得した所定のデータを付与する（ステップＳ４０８）。そして、登録票ＤＢ中の「文書種類」「コメント」「文書管理番号」をハイフンで繋げて、例えば、「契約書−○○○株式会社基本契約書−１２３４５６０００００１．ｐｄｆ」といったＰＤＦファイルのファイル名にリネームする（ステップＳ４０９）。なお、リネームするＰＤＦファイル名は任意であり、書類電子化サーバにおいてその組み合わせを設定しておくことができる。 As a result of the determination in step S405, when document attribute data including the same document management number as the extracted document management number is registered in the registration slip DB, the document attribute data registered in the registration slip DB Predetermined data such as type, comment, administrator code, etc. is acquired (step S407), and as shown in FIG. 20, the acquired predetermined data is assigned to properties including the file size and the number of pages of the PDF file (step S408). ). Then, the “document type”, “comment”, and “document management number” in the registration form DB are connected with a hyphen, and the file name of the PDF file such as “contract document-XXX Inc. basic contract document-123456000001.pdf”, for example. (Step S409). The name of the PDF file to be renamed is arbitrary, and the combination can be set in the electronic document server.

次いで、書類電子化サーバ４の他のルートディレクトリにある保存先フォルダ又は他のサーバにある保存先フォルダからＰＤＦファイルを保存すべきフォルダを検索し（ステップＳ４１０）、該当するフォルダ内にリネームされたファイル名と同一のファイル名のＰＤＦファイルが存在するか否かを判別する（ステップＳ４１１）。 Next, a folder in which the PDF file is to be saved is searched from a saving destination folder in another root directory of the document digitizing server 4 or a saving destination folder in another server (step S410), and the folder is renamed in the corresponding folder. It is determined whether a PDF file having the same file name as the file name exists (step S411).

なお、保存先フォルダを検索するキーとして、登録票ＤＢにおける「管理者コード」や「所属本部」「所属部」「所属課」等をキーとすることができる。「管理者コード」をキーとすれば、当該「管理者コード」を持つ者の個人フォルダである保存先フォルダに保存されるため、電子化ファイルを個人で管理することができ、「所属本部」「所属部」「所属課」等をキーとすれば、当該「所属本部」「所属部」「所属課」等で管理する共有フォルダである保存先フォルダに保存されるため、電子化ファイルを複数人で管理することができる。 As keys for searching for a storage destination folder, “manager code”, “affiliation headquarters”, “affiliation department”, “affiliation section”, etc. in the registration form DB can be used as keys. If the “administrator code” is used as a key, it is stored in the save destination folder, which is the personal folder of the person who has the “administrator code”. If "Affiliation Department", "Affiliation Department", etc. are used as keys, it is saved in the save destination folder, which is a shared folder managed by the "Affiliation Headquarters", "Affiliation Department", "Affiliation Department", etc. It can be managed by people.

なお、保存すべきフォルダが存在しない場合においては、電子化ファイルの保存を中止することもできるし、例えば、検索に使ったキーを含む名称を新たなフォルダ名として作成し、当該フォルダに電子化ファイルを保存させることもできる。 If there is no folder to save, you can stop saving the digitized file. For example, create a name that includes the key used for the search as a new folder name and digitize the folder. You can also save the file.

なお、このように電子化ファイルの保存が中止されたり新たなフォルダに保存されたりした場合、当該文書を管理している者にとっては、画像読取装置３で読込んだ文書データの行方がわからなくなるため、書類電子化サーバが電子メール等により処理の結果を通知するようにしてもよい。 If the saving of the digitized file is canceled or saved in a new folder in this way, the person managing the document cannot know where the document data read by the image reading device 3 is. Therefore, the document digitizing server may notify the processing result by e-mail or the like.

該当するフォルダ内に同一ファイル名のＰＤＦファイルが存在しないときは、ＰＤＦファイルを該当するフォルダ内に移動して（ステップＳ４１２）（保存手段）、一方、該当するフォルダ内に同一ファイル名のＰＤＦファイルが存在するときは、ＰＤＦファイルを該当するフォルダ内に移動すると共に該当するフォルダ内に存在する同一ファイル名のＰＤＦファイルを消去（上書き）して（ステップＳ４１３）（保存手段）、登録票ＤＢの処理カウンタ用の処理フラグをカウントアップし（ステップＳ４１４）、本処理を終了する。 If there is no PDF file with the same file name in the corresponding folder, the PDF file is moved into the corresponding folder (step S412) (save means), while the PDF file with the same file name is in the corresponding folder. If the file exists, the PDF file is moved into the corresponding folder and the PDF file with the same file name existing in the corresponding folder is deleted (overwritten) (step S413) (save means), and the registration form DB The process flag for the process counter is counted up (step S414), and this process ends.

なお、重複処理を防止するため、保存先フォルダに移動する際、ＯＣＲ処理後フォルダ内にある元のＰＤＦファイルは削除する。 In order to prevent duplication processing, when moving to the save destination folder, the original PDF file in the folder after the OCR processing is deleted.

上記ステップ４１０〜４１２の処理において、例えば、取得した管理者コードが「１２３４５６」であるときは、図２１に示すように、ＰＤＦファイルは、上記ＰＤＦファイルを保存すべきフォルダに該当する「１２３４５６フォルダ」内に保存される。 In the processing of steps 410 to 412, for example, when the acquired administrator code is “123456”, as shown in FIG. 21, the PDF file is “123456 folder corresponding to the folder in which the PDF file is stored. "Is stored in.

図１７の処理によれば、登録票ＤＢに登録されている文書属性データのうち所定のデータを取得し（ステップＳ４０７）、ＰＤＦファイルのプロパティに取得した所定のデータを付与する（ステップＳ４０８）ので、紙書類を画像読取装置によって読取って作成したＰＤＦファイルに確実に文書属性データを付与することができ、もって所定のデータに基づいてＰＤＦファイルを確実に分類することができる。 According to the process of FIG. 17, predetermined data is acquired from the document attribute data registered in the registration form DB (step S407), and the acquired predetermined data is assigned to the property of the PDF file (step S408). Document attribute data can be surely assigned to a PDF file created by reading a paper document with an image reading apparatus, so that the PDF file can be reliably classified based on predetermined data.

また、例えば、取得した管理者コードに基づいて、ＰＤＦファイルを保存すべきフォルダを検索する場合（ステップＳ４１０）、最終的にＰＤＦファイルは管理者の所有するフォルダに移動する（ステップＳ４１２）ので、ＰＤＦファイルの管理者と登録票の作成者とは別であってもよく、もって作業分担を図ることができる。 For example, when searching for a folder in which a PDF file is to be stored based on the acquired administrator code (step S410), the PDF file is finally moved to a folder owned by the administrator (step S412). The administrator of the PDF file and the creator of the registration form may be different, and work sharing can be achieved.

さらに、登録票ＤＢにＰＤＦファイルから抽出した文書管理番号と同一の文書管理番号が存在しないときは（ステップＳ４０５でＮＯ）、ＰＤＦファイルをエラーフォルダに移動する（ステップＳ４０６）ので、後からエラーフォルダを参照することにより、文書管理番号について読取りエラーがあったことを容易に知ることができる。 Furthermore, when the same document management number as the document management number extracted from the PDF file does not exist in the registration form DB (NO in step S405), the PDF file is moved to the error folder (step S406). By referring to, it is possible to easily know that there was a reading error for the document management number.

さらに、登録票ＤＢの処理カウンタ用の処理フラグをカウントアップする（ステップＳ４１４）ので、ＰＤＦファイルが上書きされた回数、即ち、当該「文書管理番号」が付与された登録票を含む書類の電子化処理が行われた合計回数を記録されることになる。 Further, since the processing flag for the processing counter in the registration form DB is counted up (step S414), the number of times the PDF file is overwritten, that is, the digitization of the document including the registration form to which the “document management number” is assigned. The total number of times processing has been performed will be recorded.

なお、画像読取装置３において読取りエラーが発生した紙書類については、同じ登録票を用いて画像読取装置３によるＰＤＦファイルの作成を再度行わなくてもよい。後述するエラー処理を行うことにより、書類電子化処理を継続させることができるからである。 For a paper document in which a reading error has occurred in the image reading device 3, it is not necessary to create a PDF file again by the image reading device 3 using the same registration slip. This is because document digitization processing can be continued by performing error processing described later.

ところで、１０組からなる書類を１０枚の登録票を用いて電子化を行った場合において、「文書管理番号」の読取りエラーがあったものについては、エラーフォルダに移動するため、どの組の登録票について読取りエラーがあったかを特定することができる。しかし、文書管理番号に含まれる、例えば「５」が「６」というような数字間の読取りミスがあった場合において、読取りミスの文書管理番号と同一のＰＤＦファイルがあったときは、このＰＤＦファイルは読取りエラーのＰＤＦファイルで上書きされることとなるため、このような読取りエラーがあったＰＤＦファイルは、エラーフォルダに移動されず、エラーフォルダを参照することによっては、どの登録票について読取りエラーがあったかを特定することはできない。このように読取りエラーのＰＤＦファイルで上書きされた場合、ファイル名とファイルの中身の整合性が合致しないという問題が生じるが、ユーザは、例えば、図９から処理カウンタの値を参照することにより、上書きされたＰＤＦファイルを推定することができる。例えば、１度しか電子化していないはずの登録票についての処理回数が２回となっていた場合、当該ファイルが上書きされたと推定することができる。 By the way, when 10 documents are digitized using 10 registration slips, if there is an error in reading the “document management number”, it is moved to the error folder, so which set of registrations It is possible to specify whether or not there is a reading error for the vote. However, when there is a reading error between numbers included in the document management number, for example, “5” is “6”, and there is a PDF file identical to the document management number of the reading error, this PDF Since the file is overwritten with the PDF file with the reading error, the PDF file having such a reading error is not moved to the error folder. It is not possible to specify whether or not there was. When overwritten with a PDF file having a reading error in this way, there arises a problem that the consistency between the file name and the contents of the file does not match, but the user can refer to the value of the processing counter from FIG. An overwritten PDF file can be estimated. For example, when the number of times of processing for a registration form that should be digitized only once is two, it can be estimated that the file has been overwritten.

図２２は、図１７のステップＳ４０６において図１における管理者端末５によって実行されるエラー処理のフローチャートである。なお、本処理は、管理者であるユーザが管理者端末５を用いて任意の時間に実行する。なお、本エラー処理は、後述するエラー処理を行うことができるプログラムがあれば、利用者端末２又は書類電子化サーバ４において実行されてもよい。 FIG. 22 is a flowchart of error processing executed by the administrator terminal 5 in FIG. 1 in step S406 in FIG. This process is executed at an arbitrary time by the user who is the administrator using the administrator terminal 5. Note that this error processing may be executed in the user terminal 2 or the document digitizing server 4 if there is a program capable of performing error processing described later.

図２２において、まず、管理者端末５は、書類電子化サーバ４の登録票ＤＢにアクセス可能に接続されているか否かを判別し（ステップＳ５０１）、登録票ＤＢとアクセス可能に接続されていないときは、エラーメッセージを表示し（ステップＳ５０２）、ステップＳ５０１に戻る一方、登録票ＤＢとアクセス可能に接続されているときは、ステップＳ５０４に進み、エラーフォルダにＰＤＦファイルがあるか否かを判別する。 In FIG. 22, first, the administrator terminal 5 determines whether or not the registration form DB of the document electronic server 4 is accessible (step S501), and is not connected to the registration form DB so as to be accessible. When an error message is displayed (step S502), the process returns to step S501. On the other hand, when the registration form DB is accessible, the process proceeds to step S504 to determine whether there is a PDF file in the error folder. To do.

ステップＳ５０４の判別の結果、エラーフォルダにＰＤＦファイルがないときは、本処理を終了し、エラーフォルダにＰＤＦファイルがあるときは、図２３に示す文書管理番号修正用画面を利用者端末２に表示する（ステップＳ５０５）。 If the result of the determination in step S504 is that there is no PDF file in the error folder, this process is terminated. If there is a PDF file in the error folder, the document management number correction screen shown in FIG. (Step S505).

文書管理番号修正用画面は、図２３に示すように、画像読取装置３で読取った登録票の画像データに対応する画像と、文書管理番号入力用の画像とを表示しているので、ユーザは登録票の画像を視認しながら文書管理番号を修正することができる。 As shown in FIG. 23, the document management number correction screen displays an image corresponding to the image data of the registration slip read by the image reading device 3 and an image for inputting the document management number. The document management number can be corrected while visually recognizing the image of the registration form.

なお、図２３に示す文書管理番号修正用画面は、例えば、エラー処理を実行するプログラムを起動した際、図２３に示す文書管理番号を修正するための「文書管理番号修正」入力画面を表示するためのオブジェクトを読込み、その背景にエラーフォルダにあるＰＤＦファイルを読込むことにより、管理者端末５の表示装置に表示される。 The document management number correction screen shown in FIG. 23 displays, for example, a “document management number correction” input screen for correcting the document management number shown in FIG. 23 when a program for executing error processing is started. The object for reading is read and the PDF file in the error folder is read in the background, so that it is displayed on the display device of the administrator terminal 5.

なお、図２３に示す文書管理番号修正用画面には、読取りエラーがあった「文書管理番号」は非表示にして、画像読取装置３において読取られた画像データを表示する。画像読取装置３において読取られた画像データであれば、ユーザは、汚れはあるものの「文書管理番号」を視認することができるからである。 In the document management number correction screen shown in FIG. 23, the “document management number” in which the reading error has occurred is not displayed, and the image data read by the image reading device 3 is displayed. This is because the image data read by the image reading device 3 allows the user to visually recognize the “document management number” although there is dirt.

なお、エラーフォルダにあるＰＤＦファイルは既にＯＣＲ処理がされているので、エラー処理を実行するプログラムを起動した際、図２３に示す文書管理番号を修正するための「文書管理番号修正」入力画面を表示するためのオブジェクト内の「文書管理番号」の欄に、誤認識をした文字列を格納して表示することもできる。こうすることにより、誤認識した文字の修正を行うだけで、修正作業が済むことになる。 Since the PDF file in the error folder has already been subjected to OCR processing, when the program for executing error processing is started, the “document management number correction” input screen for correcting the document management number shown in FIG. 23 is displayed. A misrecognized character string can be stored and displayed in the “document management number” field in the object to be displayed. By doing so, the correction work is completed only by correcting the erroneously recognized character.

図２２に戻り、文書管理番号修正用画面上で、背景に表示されているエラーフォルダにあるＰＤＦファイルの文書管理番号（入力値）が入力され、検索ボタンが押下されると（ステップＳ５０６でＹＥＳ）、入力値をキーにして、登録票ＤＢに問い合わせ（ステップＳ５０８）、登録票ＤＢに入力値と一致する文書管理番号が存在するか否かを判別する（ステップＳ５１１）。 Returning to FIG. 22, when the document management number (input value) of the PDF file in the error folder displayed in the background is input on the document management number correction screen and the search button is pressed (YES in step S506). ) Using the input value as a key, an inquiry is made to the registration form DB (step S508), and it is determined whether or not a document management number that matches the input value exists in the registration form DB (step S511).

ステップＳ５１１の判別の結果、登録票ＤＢに入力値と一致する文書管理番号が存在しないときは、エラーメッセージを表示して（ステップＳ５１２）、ステップＳ５０５に戻り、修正後の文書管理番号の再入力を促す一方、登録票ＤＢに入力値と一致する文書管理番号が存在するときは、登録票ＤＢの文書属性情報データから所定のデータを取得し（ステップＳ５１３）、図２４に示すように取得した各所定のデータを文書管理番号修正用画面上の対応する項目に表示し（ステップＳ５１４）、ユーザが、登録ボタンを押下したときは（ステップＳ５１５でＹＥＳ）、ＰＤＦファイルのファイル名を所定のファイル名に置換する（ステップＳ５１６）。所定の項目の組み合わせは任意であり、管理者端末５又は書類電子化サーバに指示に基づいて所定のファイル名作成することになる。 If it is determined in step S511 that there is no document management number that matches the input value in the registration form DB, an error message is displayed (step S512), and the process returns to step S505 to re-enter the document management number after correction. On the other hand, if there is a document management number that matches the input value in the registration form DB, predetermined data is acquired from the document attribute information data in the registration form DB (step S513), and is acquired as shown in FIG. Each predetermined data is displayed in a corresponding item on the document management number correction screen (step S514), and when the user presses the registration button (YES in step S515), the file name of the PDF file is set to the predetermined file. The name is replaced (step S516). The combination of the predetermined items is arbitrary, and a predetermined file name is created based on an instruction to the administrator terminal 5 or the document digitizing server.

なお、所定のファイル名が、登録票ＤＢ内に保存される項目を組み合せたものである場合、管理者端末５がステップ５１３で一旦取得した情報に基づいて所定のファイル名を作成してもよいし、再度書類電子化サーバに所定の項目を要求して取得した項目に基づいて
所定のファイル名を作成してもよい。 If the predetermined file name is a combination of items stored in the registration form DB, the predetermined file name may be created based on the information once acquired by the administrator terminal 5 in step 513. Then, a predetermined file name may be created based on an item acquired by requesting the document electronic server again for a predetermined item.

なお、ステップＳ５１５で登録ボタンが押下されると、管理者端末５は、修正後の文書管理番号と同一の文書管理番号のＰＤＦファイルを保存先フォルダの該当するフォルダ内を検索すると共に、同一文書管理番号のＰＤＦファイルがある場合には登録票ＤＢに保存されているその処理カウンタの値を参照する。 When the registration button is pressed in step S515, the administrator terminal 5 searches for the PDF file having the same document management number as the corrected document management number in the corresponding folder in the save destination folder, and the same document. If there is a PDF file with a management number, the value of the processing counter stored in the registration form DB is referred to.

次いで、管理者端末５は、登録票ＤＢの処理カウンタを参照することにより、既に同一文書管理番号のＰＤＦファイルを少なくとも１回作成したことがあるか否かを判別し（ステップＳ５１７）、作成済みと判断したときは、図２５に示すようなメッセージ画面を表示し（ステップＳ５１８）、作成済みと判断しなかったときは、ステップＳ５１８の処理をスキップして、本処理を終了し、そして図１７のステップＳ４０９にリターンする。 Next, the administrator terminal 5 refers to the processing counter in the registration form DB to determine whether or not a PDF file having the same document management number has been created at least once (step S517). 25 is displayed (step S518). If it is not determined that the message has been created, the process of step S518 is skipped, the process is terminated, and FIG. The process returns to step S409.

ステップＳ５１８の処理では、管理者端末５が、登録票ＤＢを参照することにより同一文書管理番号のＰＤＦファイルを既に所定回数、例えば３回作成したことを示す情報を取得したときは、図２６に示すようなメッセージ画面を表示する。 In the process of step S518, when the administrator terminal 5 acquires information indicating that a PDF file having the same document management number has already been created a predetermined number of times, for example, three times by referring to the registration form DB, FIG. Display the message screen as shown.

その後、書類電子化サーバ４で実行される図１７におけるステップＳ４０８，Ｓ４１０からステップＳ４１４までの処理と同じ処理が管理者端末５で行われ、書類電子化処理が終了することになる。 Thereafter, the same processing as the processing from step S408, S410 to step S414 in FIG. 17 executed in the document digitizing server 4 is performed in the administrator terminal 5, and the document digitizing processing is ended.

なお、本エラー処理を実行するプログラムは、読取りエラーがない場合において、書類電子化サーバで実行されるステップＳ４０８、Ｓ４１０からステップＳ４１４までの処理と同じ処理を管理者端末５で行っているが、当然のことながら、書類電子化サーバ４側で実行されてもよい。この場合、管理者端末５は、書類電子化サーバ４に処理の指示を与えることになる。 Note that the program that executes this error processing performs the same processing as the processing from steps S408, S410 to S414 executed by the document digitizing server in the administrator terminal 5 when there is no reading error. As a matter of course, it may be executed on the document digitizing server 4 side. In this case, the manager terminal 5 gives a processing instruction to the document digitizing server 4.

図２２の処理によれば、文書管理番号修正用画面上でユーザが背景に表示されているエラーフォルダにあるＰＤＦファイルの文書管理番号（入力値）を目視して確認しながら入力する（ステップＳ５０６）ことができるので、文書管理番号の読み取りエラーがあったＰＤＦファイルに確実に正当な文書管理番号を付与することができる。登録票ＤＢの文書属性データから所定のデータを取得することにより（ステップＳ５１３）、ユーザは、入力した文書管理番号に対する所定のデータが登録票ＤＢに登録されていることを確認した後、登録ボタンを押下することができる（ステップＳ５１５でＹＥＳ）。管理者端末５は、登録ボタンが押下されたことを検知した場合、所定のファイル名に変更する（ステップＳ５１６）。 According to the processing of FIG. 22, the user inputs the document management number (input value) of the PDF file in the error folder displayed in the background on the document management number correction screen while visually confirming it (step S506). Therefore, a valid document management number can be surely assigned to a PDF file having a document management number reading error. By acquiring predetermined data from the document attribute data in the registration slip DB (step S513), the user confirms that the predetermined data for the input document management number is registered in the registration slip DB, and then presses the registration button. Can be pressed (YES in step S515). When the administrator terminal 5 detects that the registration button has been pressed, the administrator terminal 5 changes the file name to a predetermined file name (step S516).

このように、エラーフォルダにあるＰＤＦファイルについての正当な文書管理番号を入力することができるようにしているため、読取りエラーがあったＰＤＦファイルを再度画像読取装置３で読取る必要がなくなる。 In this way, since a valid document management number for a PDF file in the error folder can be input, it is not necessary to read again the PDF file in which the reading error has occurred by the image reading device 3.

また、管理者端末５が作成済みであると判断したときは（ステップＳ５１７でＹＥＳ）、メッセージ画面を表示する（ステップＳ５１８）ので、ユーザは、同一ファイル名のＰＤＦファイルがメッセージの対象であるＰＤＦで上書き（更新）されることを理解することができる。これにより、一度電子化処理が完了したものであることを確認することができる。また、電子化処理が完了しているはずがないファイルについてメッセージ画面が表示された場合、エラー処理におけるユーザ自身による「文書管理番号」の入力に誤りがあったと判断することもできる。こうすることにより、ファイル名とファイルの中身が不一致となる電子化ファイルの作成を防止することができる。 When it is determined that the administrator terminal 5 has been created (YES in step S517), a message screen is displayed (step S518), so that the user can use a PDF file having the same file name as the message target. Can be overwritten (updated). Thereby, it can be confirmed that the digitization process has been completed once. When a message screen is displayed for a file that cannot be digitized, it can be determined that there is an error in the input of the “document management number” by the user himself / herself in error processing. By doing so, it is possible to prevent creation of an electronic file in which the file name and the file content do not match.

上記実施の形態では、書類電子化システム１を構成する各装置はＬＡＮ６を介して接続されているとしたが、ＬＡＮ６はデータ通信が可能なネットワークであればよい。 In the above embodiment, each device constituting the document digitizing system 1 is connected via the LAN 6, but the LAN 6 may be any network capable of data communication.

上記実施の形態において、画像読取装置３は、読取った紙書類の画像からＰＤＦファイルを作成するとしたが、作成されるＰＤＦファイルに限るものではなく、ＴＩＦＦファイル、Ｍｕｌｔｉ−ＴＩＦＦファイル、ＪＰＥＧファイルなどであってもよい。 In the above embodiment, the image reading device 3 creates a PDF file from the image of the read paper document. However, the image reading device 3 is not limited to the created PDF file, but is a TIFF file, a Multi-TIFF file, a JPEG file, or the like. There may be.

また、書類電子化サーバ４は、文書管理番号を識別子として画像読取装置３からの画像を分類したが、識別子は、書類電子化サーバ４がＯＣＲ処理により識別可能な文字であれば、番号等の数字、アルファベット等の文字、所定のマーク、又はそれらの組み合わせであってもよい。 Further, the document digitizing server 4 classifies the images from the image reading device 3 using the document management number as an identifier. If the identifier is a character that the document digitizing server 4 can identify by OCR processing, the document digitizing server 4 is a number or the like. Numbers, letters such as alphabets, predetermined marks, or combinations thereof may be used.

本発明は、上述した実施の形態の機能を実現するソフトウェアのプログラム（図９の登録票ＤＢ、並びに図３、図８、図１２、図１７、及び図２２のフローチャート）をコンピュータ又はＣＰＵに供給し、そのコンピュータ又はＣＰＵが該供給されたプログラムを読出して実行することによって、達成することができる。 In the present invention, a software program (the registration form DB of FIG. 9 and the flowcharts of FIGS. 3, 8, 12, 17, and 22) for realizing the functions of the above-described embodiments is supplied to a computer or CPU. This can be achieved by the computer or CPU reading and executing the supplied program.

この場合、上記プログラムは、該プログラムを記録した記憶媒体から直接供給されるか、又はインターネット、商用ネットワーク、若しくはローカルエリアネットワーク等に接続される不図示の他のコンピュータやデータベース等からダウンロードすることにより供給される。 In this case, the program is directly supplied from a storage medium storing the program, or downloaded from another computer or database (not shown) connected to the Internet, a commercial network, a local area network, or the like. Supplied.

上記プログラムの形態は、オブジェクトコード、インタプリタにより実行されるプログラムコード、ＯＳ（オペレーティングシステム）に供給されるスクリプトデータ等の形態から成ってもよい。 The form of the program may be in the form of object code, program code executed by an interpreter, script data supplied to an OS (operating system), and the like.

また、本発明は、上述した実施の形態の機能を実現するソフトウェアのプログラムを記憶した記憶媒体をコンピュータ又はＣＰＵに供給し、そのコンピュータ又はＣＰＵが記憶媒体に記憶されたプログラムを読出して実行することによっても、達成することができる。 The present invention also supplies a computer or CPU with a storage medium storing a software program that implements the functions of the above-described embodiments, and the computer or CPU reads and executes the program stored in the storage medium. Can also be achieved.

この場合、格納媒体から読出されたプログラムコード自体が上述した各実施の形態の機能を実現すると共に、そのプログラムコードを記憶した記憶媒体は本発明を構成する。 In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiments, and the storage medium storing the program code constitutes the present invention.

プログラムコードを記憶する記憶媒体としては、例えば、ＲＯＭ、ＲＡＭ、ＮＶ−ＲＡＭ、フロッピー（登録商標）ディスク、ハードディスク、光ディスク、光磁気ディスク、ＣＤ−ＲＯＭ、ＭＯ、ＣＤ−Ｒ、ＣＤ−ＲＷ、ＤＶＤ−ＲＯＭ、ＤＶＤ−ＲＡＭ、ＤＶＤ−ＲＷ、ＤＶＤ＋ＲＷ、磁気テープ、不揮発性のメモリカード等がある。 As a storage medium for storing the program code, for example, ROM, RAM, NV-RAM, floppy (registered trademark) disk, hard disk, optical disk, magneto-optical disk, CD-ROM, MO, CD-R, CD-RW, DVD -ROM, DVD-RAM, DVD-RW, DVD + RW, magnetic tape, nonvolatile memory card, and the like.

上述した実施の形態の機能は、コンピュータから読出されたプログラムコードを実行することによるばかりでなく、コンピュータ上で稼動するＯＳ等がプログラムコードの指示に基づいて実際の処理の一部又は全部を行うことによっても実現することができる。 The function of the above-described embodiment is not only by executing the program code read from the computer, but the OS or the like running on the computer performs part or all of the actual processing based on the instruction of the program code. Can also be realized.

本発明の実施の形態に係る書類電子化システムの構成を示す図である。It is a figure which shows the structure of the document digitization system which concerns on embodiment of this invention. 図１における利用者端末２の構成を示すブロック図である。It is a block diagram which shows the structure of the user terminal 2 in FIG. 図２における利用者端末２によって実行される登録票作成処理のフローチャートである。3 is a flowchart of a registration form creation process executed by a user terminal 2 in FIG. 図３のステップＳ１０１で表示されるログイン画面の一例を示す図である。It is a figure which shows an example of the login screen displayed by step S101 of FIG. 図３のステップＳ１０４で表示される文書属性データ入力画面の一例を示す図である。It is a figure which shows an example of the document attribute data input screen displayed by FIG.3 S104. 図３のステップＳ１０８で表示される入力漏れエラーメッセージの一例を示す図である。It is a figure which shows an example of the input omission error message displayed by step S108 of FIG. 図３のステップＳ１１０で作成される登録票の一例を示す図である。It is a figure which shows an example of the registration form produced by step S110 of FIG. 図１における書類電子化サーバ４によって実行される文書属性データ登録処理のフローチャートである。It is a flowchart of the document attribute data registration process performed by the document digitization server 4 in FIG. 図８のステップＳ２０８で文書属性データを登録される登録票ＤＢの一例を示す図である。FIG. 9 is a diagram illustrating an example of a registration form DB in which document attribute data is registered in step S208 of FIG. 図１における利用者端末２の表示装置で表示可能な登録票一覧画面の一例を示す図である。It is a figure which shows an example of the registration form list screen which can be displayed with the display apparatus of the user terminal 2 in FIG. 図１における書類電子化サーバ４のディレクトリ構成の一例を示す図である。It is a figure which shows an example of the directory structure of the document digitization server 4 in FIG. 図１における書類電子化サーバ４によって実行されるファイル分類処理のフローチャートである。It is a flowchart of the file classification | category process performed by the document digitization server 4 in FIG. 図１２のステップＳ３０４において登録票検索を説明するために用いられる登録票画像の部分図である。FIG. 13 is a partial view of a registration form image used for explaining registration form search in step S304 of FIG. 12. 図１２のステップＳ３１０において文書管理番号取得を説明するために用いられる登録票画像の部分図である。FIG. 13 is a partial view of a registration form image used for explaining acquisition of a document management number in step S310 of FIG. 図１１に示すような書類電子化サーバ４のディレクトリ構成において、図１２のステップＳ３１２でＰＤＦファイルがＯＣＲ処理後フォルダ内に格納された状態の一例を示す図である。FIG. 13 is a diagram showing an example of a state in which a PDF file is stored in a folder after OCR processing in step S312 of FIG. 12 in the directory structure of the document digitizing server 4 as shown in FIG. 図１１に示すような書類電子化サーバ４のディレクトリ構成において、図１２のステップＳ３１２でＰＤＦファイルがＯＣＲ処理後フォルダ内に格納された状態の他の例を示す図である。FIG. 13 is a diagram showing another example of the state in which the PDF file is stored in the folder after the OCR process in step S312 of FIG. 12 in the directory structure of the document digitizing server 4 as shown in FIG. 図１における書類電子化サーバ４によって実行される文書属性データ付与処理のフローチャートである。It is a flowchart of the document attribute data provision process performed by the document digitization server 4 in FIG. 図７に示すような登録票において文書管理番号の領域に汚れがある場合の一例を示す図である。FIG. 8 is a diagram illustrating an example of a case where a document management number area is contaminated in the registration slip as illustrated in FIG. 7. 図１１に示すような書類電子化サーバ４のディレクトリ構成において、図１８のステップＳ４０６でＰＤＦファイルがエラーフォルダ内に格納された状態の一例を示す図である。FIG. 19 is a diagram showing an example of a state in which a PDF file is stored in an error folder in step S406 of FIG. 18 in the directory structure of the document digitizing server 4 as shown in FIG. 図１８のステップＳ４０８で文書属性データが付与されたＰＤＦファイルのプロパティの一例を示す図である。It is a figure which shows an example of the property of the PDF file to which the document attribute data was provided in step S408 of FIG. 図１における書類電子化サーバ４の他のディレクトリ構成の一例を示す図であって、図１８のステップＳ４１２，Ｓ４１３でＰＤＦファイルが保存先フォルダ内に格納された状態の一例を示す。It is a figure which shows an example of the other directory structure of the document digitization server 4 in FIG. 1, Comprising: An example of the state by which the PDF file was stored in the preservation | save destination folder by FIG.18 S412 and S413 is shown. 図１７のステップＳ４０６において図１における管理者端末５によって実行されるエラー処理のフローチャートである。18 is a flowchart of error processing executed by the administrator terminal 5 in FIG. 1 in step S406 in FIG. 図２２のステップＳ５０５で表示される文書管理番号修正用画面の一例を示す図である。It is a figure which shows an example of the screen for document management number correction displayed by step S505 of FIG. 図２２のステップＳ５１４で表示される文書管理番号修正用画面の一例を示す図である。It is a figure which shows an example of the screen for document management number correction displayed by step S514 of FIG. 図２２のステップＳ５１８で表示されるメッセージ画面の一例を示す図である。It is a figure which shows an example of the message screen displayed by step S518 of FIG. 図２２のステップＳ５１８で表示されるメッセージ画面の他の例を示す図である。It is a figure which shows the other example of the message screen displayed by FIG.22 S518.

Explanation of symbols

１書類電子化システム
２利用者端末
３画像読取装置
４書類電子化サーバ
５管理者端末 DESCRIPTION OF SYMBOLS 1 Document digitization system 2 User terminal 3 Image reading device 4 Document digitization server 5 Administrator terminal

Claims

In a document digitizing apparatus for extracting the described identifier from an image of a document including an identification document in which a predetermined identifier is described in a predetermined area, storage means for storing an identifier for identifying the document, Discriminating means for extracting the described identifier from an area corresponding to the predetermined area in the image of the identification document and determining whether or not the extracted identifier and the stored identifier match each other And a storage means for storing an image of the document based on a result of determination by the determination means.

2. The document digitizing apparatus according to claim 1, wherein when the extracted identifier and the stored identifier do not match each other, the storage unit stores the image of the document in a mismatch folder.

A terminal device comprising: display means for displaying an image of an identification document included in a document stored in the mismatched folder; and input means for inputting an identifier of an identification document on which the image of the identification document is displayed 3. The document digitizing apparatus according to claim 2, wherein the storage unit stores the image of the document in a storage folder based on the input identifier.

When the extracted identifier and the stored identifier match each other, the storage unit assigns an identification name to the image of the document based on the matched identifier and stores it in a storage folder. The document digitizing apparatus according to claim 1.

The storage means stores document attribute information indicating an attribute of the document including the identifier, and the storage means assigns the stored document attribute information to an image of the document to which the identifier is assigned. 5. The document digitizing apparatus according to claim 3, wherein the document digitizing apparatus stores the data in a save folder.

6. The document digitizing apparatus according to claim 5, wherein the storage unit stores an image of the document to which the document attribute information is assigned based on the document attribute information in a subfolder of the storage folder.

7. The document digitizing apparatus according to claim 1, further comprising a counting unit that counts the number of times that the extracted identifier and the stored identifier match each other.

An identification document that is connected to an image output device that reads a document including the identification document and outputs the image, and does not include the predetermined identifier so that the predetermined identifier is described in the predetermined area. 8. The apparatus according to claim 1, further comprising: a transmission unit that transmits the format of the document to the image output device; and a reception unit that receives an image of the document output by the image output device. Document electronic equipment.

The image output device receives a format of the identification document, and prints an identification document to which the identifier is input, a terminal device that inputs an identifier to be stored in the storage unit in the predetermined area according to the format 9. The document digitizing apparatus according to claim 8, further comprising: a printing device that performs printing; and an image reading device that reads an image of the document including the printed identification document.

In a document digitizing method for extracting the described identifier from an image of a document including an identification document in which a predetermined identifier is described in a predetermined area, a storing step of storing an identifier for identifying the document; An extraction step for extracting the described identifier from an area corresponding to the predetermined area in the image of the identification document, and a determination for determining whether the extracted identifier and the stored identifier match each other And a storing step of storing an image of the document based on a result of the determination in the determination step.

A program for causing a computer to execute a document digitizing method for extracting the described identifier from an image of a document including an identification document in which a predetermined identifier is described in a predetermined area, the identifier for identifying the document A storage module that stores the identifier, the extraction module that extracts the described identifier from an area corresponding to the predetermined area in the image of the identification document, and the extracted identifier and the stored identifier match each other A program comprising: a discrimination module for discriminating whether or not to perform; and a storage module for saving an image of the document based on a discrimination result by the discrimination module.

An image output device that reads a document including an identification document in which a predetermined identifier is described in a predetermined region and outputs the image, and a document electronic device that extracts the described identifier from the output image In the document digitizing system, the document digitizing apparatus includes a storage unit that stores an identifier for identifying the document, and the identifier described from an area corresponding to the predetermined area in the image of the identification document An extracting means for extracting the image, a determining means for determining whether or not the extracted identifier and the stored identifier match each other, and storing the image of the document based on the determination result by the determining means A document digitization system comprising storage means.

The document digitizing apparatus includes a transmission unit that transmits a format of an identification document that does not include an identifier to the image output device so that an identifier described in the identification document is the predetermined area, The image output apparatus receives the format of the identification document, and prints the identification document to which the identifier is input, and a terminal device that inputs the identifier to be stored in the storage unit in the predetermined area according to the format 13. The document digitizing system according to claim 12, further comprising: a printing device that prints; and an image reading device that reads an image of the document including the printed identification document.