JP5542414B2

JP5542414B2 - Information processing apparatus, document management method, and document management program

Info

Publication number: JP5542414B2
Application number: JP2009250543A
Authority: JP
Inventors: 正仁西
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2009-10-30
Filing date: 2009-10-30
Publication date: 2014-07-09
Anticipated expiration: 2029-10-30
Also published as: JP2011096070A

Description

本発明は、文書が記された画像データからテキストデータを生成し、また作成されたテキストデータの編集作業および確認作業を支援する技術に関する。 The present invention relates to a technique for generating text data from image data in which a document is written, and for supporting editing and confirmation work of the created text data.

紙媒体で管理された文書を修正する場合、スキャナで紙媒体を読み取り画像データを作成した後、ＯＣＲ（Optical Character Reader）を使用してテキストデータ化したものを修正することになる。修正内容の確認作業は、図１５に示すように、修正前原本と修正後の出力物（プリントアウトされた紙媒体やコンピュータの画面上に表示されたデータ）とを確認者が交互に目視し、対比することで行われる。 When correcting a document managed by a paper medium, the paper medium is read by a scanner to create image data, and then the text data is converted using an OCR (Optical Character Reader). As shown in FIG. 15, the checker confirms the original contents before correction and the output data after correction (printed paper media or data displayed on the computer screen) alternately as shown in FIG. , By contrast.

関連する技術として、以下の技術が開示されている。 The following technologies are disclosed as related technologies.

特開２００５−５００９４号公報Japanese Patent Laid-Open No. 2005-50094

原本が紙媒体の場合、修正後との比較確認は目視確認にならざるを得ない。また、修正前原本と修正後の出力物を目視確認する場合、修正箇所が明確に判別できる訳ではないため確認漏れが発生する可能性がある。また、確認作業は修正前原本と修正後の出力物を見比べるという作業となり、作業の手間も大きい。 In the case where the original is a paper medium, the comparison confirmation after the correction must be a visual confirmation. In addition, when the original before correction and the output after correction are visually checked, there is a possibility that a check omission may occur because the correction portion cannot be clearly identified. Also, the confirmation work is a work of comparing the original document before correction with the output product after correction, which requires a lot of work.

さらに、修正前原本と修正後の出力物との対比は、新旧対比表を作成しこの表を用いて確認することが有効であるが、従来技術においては、新旧対比表を別途生成する必要がある。 In addition, it is effective to create a new / old comparison table and check the comparison between the original data before correction and the output after correction, but in the conventional technology, it is necessary to generate a new / old comparison table separately. is there.

本発明は、上述した問題点を解決するためになされたものであり、修正前原本と修正後の出力物との確認作業を一つのイメージにマージすることで、修正前のデータに対して修正箇所を明示することが可能になり、確認漏れを低減させるとともに修正前のデータと修正後のデータとの比較確認も容易となる技術を提供することを目的とする。 The present invention has been made in order to solve the above-described problems, and the data before correction is corrected by merging the confirmation work between the original data before correction and the output data after correction into one image. It is an object of the present invention to provide a technique that makes it possible to clearly indicate a location, reduce confirmation omissions, and facilitate comparison and confirmation of data before correction and data after correction.

上述した課題を解決するため、本発明の一態様に係る情報処理装置は、一つの文字または複数の文字で構成された文字列が記された画像データから、前記文字列を読み取りテキストデータを生成する認識部と、前記認識部によって生成されたテキストデータを取得し、該テキストデータに対しての編集作業によって削除された文字列には削除情報を付与し、挿入された文字列には挿入情報を付与し、これら削除情報、挿入情報が付与されたテキストデータを生成する編集支援部と、前記画像データと前記編集支援部によって生成されたテキストデータとを取得し、前記画像データと挿入情報が付与された文字列とを表示するとともに、削除情報が付与された文字列に対応する前記画像データの文字列を、他の文字列とは異なる形式で表示し、且つ前記挿入情報が付与された文字列が前記画像データのいずれに挿入されるかを、視認可能な形式で表示する表示部とを有する。 In order to solve the above-described problem, an information processing apparatus according to an aspect of the present invention generates text data by reading the character string from image data in which a character string including one character or a plurality of characters is written. A recognition unit that acquires the text data generated by the recognition unit, adds deletion information to the character string deleted by editing the text data, and inserts information into the inserted character string. An edit support unit that generates text data to which the deletion information and the insert information are added, the image data and the text data generated by the edit support unit, and the image data and the insert information are Displaying the assigned character string and displaying the character string of the image data corresponding to the character string to which the deletion information is given in a format different from other character strings, One or the insertion information string is granted is inserted into any of the image data, and a display unit for displaying in a visible form.

また、上述した課題を解決するため、本発明の一態様に係る文書管理方法は、コンピュータが、一つの文字または複数の文字で構成された文字列が記された画像データから、前記文字列を読み取りテキストデータを生成し、生成されたテキストデータを取得し、該テキストデータに対しての編集作業によって削除された文字列には削除情報を付与し、挿入された文字列には挿入情報を付与し、これら削除情報、挿入情報が付与されたテキストデータを生成し、前記画像データと、削除情報、挿入情報が付与されることで生成されたテキストデータとを取得し、前記画像データと挿入情報が付与された文字列とを表示するとともに、削除情報が付与された文字列に対応する前記画像データの文字列を、他の文字列とは異なる形式で表示し、且つ前記挿入情報が付与された文字列が前記画像データのいずれに挿入されるかを、視認可能な形式で表示する。 In order to solve the above-described problem, in a document management method according to one aspect of the present invention, a computer uses the character string from image data in which a character string including one character or a plurality of characters is written. Generates read text data, acquires the generated text data, gives deletion information to the character string deleted by editing the text data, and gives insertion information to the inserted character string And generating the text data to which the deletion information and the insertion information are added, obtaining the image data and the text data generated by adding the deletion information and the insertion information, and acquiring the image data and the insertion information. And a character string of the image data corresponding to the character string to which the deletion information is added, in a format different from other character strings, and Or serial insertion information string granted is inserted into any of the image data is displayed in a visible form.

上述した課題を解決するため、本発明の一態様に係る文書管理プログラムは、一つの文字または複数の文字で構成された文字列が記された画像データから、前記文字列を読み取りテキストデータを生成し、生成されたテキストデータを取得し、該テキストデータに対しての編集作業によって削除された文字列には削除情報を付与し、挿入された文字列には挿入情報を付与し、これら削除情報、挿入情報が付与されたテキストデータを生成し、前記画像データと、削除情報、挿入情報が付与されることで生成されたテキストデータとを取得し、前記画像データと挿入情報が付与された文字列とを表示するとともに、削除情報が付与された文字列に対応する前記画像データの文字列を、他の文字列とは異なる形式で表示し、且つ前記挿入情報が付与された文字列が前記画像データのいずれに挿入されるかを、視認可能な形式で表示する処理をコンピュータに実行させる。 In order to solve the above-described problem, a document management program according to an aspect of the present invention generates text data by reading the character string from image data in which a character string composed of one character or a plurality of characters is written. The generated text data is acquired, deletion information is given to the character string deleted by editing the text data, insertion information is given to the inserted character string, and the deletion information The text data to which the insertion information is added is generated, the image data and the text data generated by the deletion information and the insertion information are acquired, and the character to which the image data and the insertion information are assigned are obtained. A character string of the image data corresponding to the character string to which the deletion information is given, in a format different from other character strings, and the insertion information is attached. Or string is inserted into any of the image data, to execute the processing for displaying in a visible form to the computer.

編集前原本の画像データと修正後のデータとの比較確認作業を一つの修正確認用イメージ上で表示することができ、確認漏れが低減するとともに、目視比較作業の手間を大幅に削減することができる。 Comparing and confirming the original image data before editing and the corrected data can be displayed on a single image for confirmation of confirmation, reducing check errors and greatly reducing the labor of visual comparison. it can.

本実施の形態に係る文書管理システムの構成の一例を示す図である。It is a figure which shows an example of a structure of the document management system which concerns on this Embodiment. 本実施の形態に係るイメージスキャニング部の一例を説明する図である。It is a figure explaining an example of the image scanning part which concerns on this Embodiment. 本実施の形態に係る文字認識部の一例を説明する図である。It is a figure explaining an example of the character recognition part which concerns on this Embodiment. 本実施の形態に係る文字認識部が出力するテキストデータの一例を説明する図である。It is a figure explaining an example of the text data which the character recognition part which concerns on this Embodiment outputs. 本実施の形態に係る文字編集部の一例を説明する図である。It is a figure explaining an example of the character edit part which concerns on this Embodiment. 本実施の形態に係る文字編集部による編集前後のデータ例を説明する図である。It is a figure explaining the example of data before and behind the edit by the character edit part which concerns on this Embodiment. 本実施の形態に係る出力部の一例を説明する図である。It is a figure explaining an example of the output part which concerns on this Embodiment. 本実施の形態に係る修正確認用データのレイアウトの一例を示す図である。It is a figure which shows an example of the layout of the data for a correction confirmation which concerns on this Embodiment. 本実施の形態に係る出力部のマージ処理の一例を説明する図である。It is a figure explaining an example of the merge process of the output part which concerns on this Embodiment. 本実施の形態に係る修正確認用データの具体例を示す図である。It is a figure which shows the specific example of the data for correction confirmation which concerns on this Embodiment. 本実施の形態に係る出力部の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of the output part which concerns on this Embodiment. 本実施の形態に係る新旧対比表生成部の一例を説明する図である。It is a figure explaining an example of the old and new comparison table production | generation part which concerns on this Embodiment. 本実施の形態に係る新旧対比表生成部によるテキストデータから新旧対比表を作成する処理の一例を説明する図である。It is a figure explaining an example of the process which creates an old and new comparison table from the text data by the old and new comparison table production | generation part which concerns on this Embodiment. 本実施の形態に係る文書管理システムの動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of the document management system which concerns on this Embodiment. 従来の修正前原本と修正後の出力物との比較確認を説明する模式図である。It is a schematic diagram explaining the comparison confirmation of the conventional uncorrected original and the corrected output.

以下、本発明の実施の形態について図面を参照しつつ説明する。 Embodiments of the present invention will be described below with reference to the drawings.

図１に、本実施の形態に係る文書管理システムの構成を示す。文書管理システム３００は、スキャナ２００、文書管理端末１００（情報処理装置）を有する。 FIG. 1 shows a configuration of a document management system according to the present embodiment. The document management system 300 includes a scanner 200 and a document management terminal 100 (information processing apparatus).

スキャナ２００は、イメージスキャニング部２０を有する。文書管理端末１００は、文字認識部１（認識部）、文字編集部２（編集支援部）、出力部３（表示部）、新旧対比表生成部４（対比表生成部）を有する。 The scanner 200 has an image scanning unit 20. The document management terminal 100 includes a character recognition unit 1 (recognition unit), a character editing unit 2 (editing support unit), an output unit 3 (display unit), and an old and new comparison table generation unit 4 (contrast table generation unit).

文書管理端末１００は、演算処理装置であるＣＰＵ（Central Processing Unit）、主記憶装置であるメモリ、不揮発性記憶装置（フラッシュメモリ、ハードディスクドライブ等）を有し、またキーボード、マウス、ディスプレイ、プリンタ等の入出力装置を有するコンピュータである。文書管理端末１００内の各機能ブロックは、不揮発性記憶装置に予め記憶されている文書管理プログラムが、メモリ上にロードされ、ＣＰＵによって文書管理プログラムが演算実行されることで実現される。 The document management terminal 100 includes a CPU (Central Processing Unit) that is an arithmetic processing device, a memory that is a main storage device, a nonvolatile storage device (flash memory, a hard disk drive, etc.), a keyboard, a mouse, a display, a printer, and the like. The computer having the input / output device. Each functional block in the document management terminal 100 is realized by loading a document management program stored in advance in a non-volatile storage device onto the memory and calculating and executing the document management program by the CPU.

次に、これら各機能ブロックの詳細説明をする。
イメージスキャニング部２０の詳細を図２を参照しつつ説明する。イメージスキャニング部２０は、紙媒体で管理された修正前原本５１（修正前原本５１には少なくとも一つの文字または複数の文字が記載されている）に光を照射し、その反射光を用いてデジタルデータに変換することで、イメージデータ５２を生成する。生成されたイメージデータ５２は、文書管理端末１００の文字認識部１に送信される。 Next, a detailed description of each of these functional blocks will be given.
Details of the image scanning unit 20 will be described with reference to FIG. The image scanning unit 20 irradiates light to an uncorrected original 51 (at least one character or a plurality of characters written on the uncorrected original 51) managed by a paper medium, and digitally uses the reflected light. Image data 52 is generated by converting the data. The generated image data 52 is transmitted to the character recognition unit 1 of the document management terminal 100.

文字認識部１の詳細を、図３を参照しつつ説明する。文字認識部１は、イメージスキャニング部２０から送信されたイメージデータ５２を解析し、文字列と思われる箇所をテキストデータ（所定の文字コードによって構成されるデータ）に変換するとともに、その文字列がイメージデータ５２のどの場所に存在するかの座標情報を行単位で変換後のテキストデータに付与する。 Details of the character recognition unit 1 will be described with reference to FIG. The character recognizing unit 1 analyzes the image data 52 transmitted from the image scanning unit 20 and converts a portion that seems to be a character string into text data (data constituted by a predetermined character code). Coordinate information indicating where the image data 52 exists is added to the converted text data in units of lines.

また文字認識部１は、変換後のテキストデータにイメージデータ５２のファイル名およびイメージデータ５２の存在場所（いずれのフォルダに格納されるか）を付与することで、テキストデータとイメージデータ５２の関連付けを行う。このテキストデータは、ＸＭＬ形式でテキストデータ５３として生成される。文字認識部１は、処理が完了した際に、生成したテキストデータ５３と認識に使用されたイメージデータ５２を所定の格納場所にファイルとして格納する。 Further, the character recognition unit 1 associates the text data with the image data 52 by assigning the file name of the image data 52 and the location where the image data 52 exists (in which folder) to the converted text data. I do. This text data is generated as text data 53 in the XML format. When the process is completed, the character recognition unit 1 stores the generated text data 53 and the image data 52 used for recognition as a file in a predetermined storage location.

ここで、文字認識部１が生成するテキストデータ５３の例を図４に示す。図４のように、テキストデータ５３上にイメージデータ５２の情報（イメージデータ情報）を設定することで、イメージデータ５２とテキストデータ５３とが出力部３によるマージ処理の際にマッチングされる。尚、図４で示したテキストデータ５３で、「<image path>/usr/DAT/A00001/genpon.jpg</>」から「<image height>296.7</>」までがイメージデータ情報であり、「xs」、「ys」、「xe」、「ye」のそれぞれの値がイメージデータ５２の左上端を基準とした行ごとの座標情報である。 Here, an example of the text data 53 generated by the character recognition unit 1 is shown in FIG. As shown in FIG. 4, by setting the information (image data information) of the image data 52 on the text data 53, the image data 52 and the text data 53 are matched during the merge process by the output unit 3. In the text data 53 shown in FIG. 4, the image data information from “<image path> /usr/DAT/A00001/genpon.jpg </>” to “<image height> 296.7 </>” Each value of “xs”, “ys”, “xe”, and “ye” is coordinate information for each row with the upper left corner of the image data 52 as a reference.

文字編集部２の詳細を図５を参照しつつ説明する。文字編集部２は、文字認識部１が生成したテキストデータ５３を呼び出し、自己のエディタ上に文字列を表示することで編集者による編集作業を支援する。編集結果は再度テキストデータ５３に書き込まれる。尚、文字編集部２のエディタによって表示されるデータは、ＸＭＬ形式のタグやイメージデータ情報、座標情報が付与されていない状態（すなわち、文字認識部１によって認識された直後のテキストデータ）であるものとする。 Details of the character editing unit 2 will be described with reference to FIG. The character editing unit 2 supports the editing work by the editor by calling the text data 53 generated by the character recognition unit 1 and displaying the character string on its own editor. The editing result is written to the text data 53 again. The data displayed by the editor of the character editing unit 2 is in a state in which no XML tag, image data information, or coordinate information is attached (that is, text data immediately after being recognized by the character recognition unit 1). Shall.

文字編集部２は、格納されたテキストデータ５３を呼び出し、エディタに渡す（ステップＳ１）。エディタは、渡されたテキストデータ５３を入出力装置（例えばディスプレイ）の画面上に表示し、編集者はその画面上でキーボード、マウスを使用しながら編集作業を行う（Ｓ２）。エディタでの編集作業中では、メモリ内にロードされているテキストデータに対して修正が行われている。 The character editing unit 2 calls the stored text data 53 and passes it to the editor (step S1). The editor displays the passed text data 53 on the screen of the input / output device (for example, display), and the editor performs editing work using the keyboard and mouse on the screen (S2). During the editing work in the editor, the text data loaded in the memory is corrected.

編集者が編集結果を保存指示したタイミングで、エディタはメモリ内にロードされている編集後のテキストデータと編集前のテキストデータ５３とを比較し、編集前後のテキスト情報とともに、修正情報タグ（後述）を付与したデータをテキストデータ５３として新たに作成する（Ｓ３）。作成されたテキストデータ５３は不揮発性記憶装置に書き込まれる。本実施の形態では、編集前のテキストデータを編集後のテキストデータで上書き保存するものとするが、それぞれ別ファイルとなるように保存してもよい。 At the timing when the editor instructs to save the editing result, the editor compares the text data after editing loaded in the memory with the text data 53 before editing, and the correction information tag (described later) together with the text information before and after editing. ) Is newly created as text data 53 (S3). The created text data 53 is written into the nonvolatile storage device. In this embodiment, the text data before editing is overwritten and saved with the text data after editing. However, the text data may be saved in separate files.

文字編集部２による修正情報タグの設定方法例を図６に示す。本実施の形態では、修正情報タグとは、挿入された文字列であることを示す<ins/>タグ（以下、挿入タグ（挿入情報）と称す）、および削除された文字列であることを示す<del/>タグ（以下、削除タグ（削除情報）と称す）の総称である。テキストデータの編集結果は全て「挿入」と「削除」で表現することができる。文字列の置き換えは「削除」と「挿入」の組み合わせで表現可能である。文字編集部２は、エディタ上で編集された結果をファイルに書き込む際、修正情報タグをテキストデータ５３内にXML形式で設定する。 An example of a method for setting the correction information tag by the character editing unit 2 is shown in FIG. In the present embodiment, the correction information tag is an <ins /> tag (hereinafter referred to as an insertion tag (insertion information)) indicating an inserted character string, and a deleted character string. <Del /> tag (hereinafter referred to as deletion tag (deletion information)). All edit results of text data can be expressed by “insertion” and “deletion”. The replacement of a character string can be expressed by a combination of “delete” and “insert”. The character editing unit 2 sets a correction information tag in the text data 53 in the XML format when writing the result edited on the editor to a file.

図６の例のように、ユーザが「今日の天気は雨でしたが午後から晴れました。」の文字列を、「今日の天気は晴れでした。」と修正した場合（図６（Ａ）参照）、文字編集部２は、挿入された文字列には挿入タグを付与し、削除された文字列には、実際にデータを削除するのではなく削除タグを付与したテキストデータ５３を作成する（図６（Ｂ）参照）。 As in the example of FIG. 6, when the user modifies the character string “Today's weather was rainy but sunny from the afternoon” to “Today's weather was sunny” (FIG. 6 (A )), The character editing unit 2 assigns an insertion tag to the inserted character string, and creates text data 53 to which the deleted character string is assigned a deletion tag instead of actually deleting data. (See FIG. 6B).

次に、出力部３の詳細を、図７に基づき説明する。出力部３は、文字編集部２からの修正確認用データの出力指示を受けて（ステップＳ１０）、不揮発性記憶装置にファイルとして記憶されている編集後のテキストデータ５３、およびイメージデータ５２を取り込み（Ｓ１１、Ｓ１２）、これらをマージした修正確認用データ５４を生成し（Ｓ１３）、不揮発性記憶装置にファイルとして出力または入出力装置（例えばディスプレイ）に表示する（Ｓ１４）。 Next, the detail of the output part 3 is demonstrated based on FIG. Upon receiving an instruction to output correction confirmation data from the character editing unit 2 (step S10), the output unit 3 captures the edited text data 53 and image data 52 stored as files in the nonvolatile storage device. (S11, S12), the correction confirmation data 54 obtained by merging them is generated (S13), and output as a file to the nonvolatile storage device or displayed on the input / output device (for example, display) (S14).

修正確認用データ５４のレイアウトの一例を図８に示す。修正確認用データ５４は、いずれの位置にイメージデータ、文字列を配置するかを定義したレイアウト情報を少なくとも有するデータ構造である。本実施の形態において、修正確認用データ５４は、イメージデータ５２を配置する領域（イメージデータ配置領域）、および挿入された文字列を配置する領域（修正情報配置領域）とで領域が分けられて入出力装置（例えばディスプレイ）に表示される形式のデータである。イメージデータ５２は、必要に応じてファイルに格納されているデータが縮小されて配置される。 An example of the layout of the correction confirmation data 54 is shown in FIG. The correction confirmation data 54 has a data structure having at least layout information that defines where image data and character strings are to be arranged. In the present embodiment, the correction confirmation data 54 is divided into an area where the image data 52 is arranged (image data arrangement area) and an area where the inserted character string is arranged (correction information arrangement area). Data in a format displayed on an input / output device (for example, a display). The image data 52 is arranged by reducing the data stored in the file as necessary.

図９を参照しつつ、出力部３によるマージ処理（修正確認用データ５４の作成方法）について説明する。出力部３は、イメージデータ５２を修正確認用データ５４のイメージデータ配置領域に配置し、テキストデータ５３内の修正情報タグと座標情報から、イメージデータ５２のいずれの文字列が削除されたかを算出し、確認者が修正内容を視認することができる注意情報（削除箇所の網掛け）を該当文字列上に乗せる。また、出力部３は、イメージデータ５２内の挿入箇所をポイントした吹き出しを修正情報配置領域に配置し、その吹き出しの中に挿入文字列を設定する。図９の例では、出力部３は、イメージデータ５２に削除タグが付与されている文字列に網掛けの注意情報を乗せ、イメージデータ５２の挿入箇所をポイントする吹き出しを修正情報配置領域に配置し、吹き出しの中に挿入タグが付与されている文字列を配置する。修正確認用データ５４の、より具体的な例を図１０に示す。 With reference to FIG. 9, a merge process (a method for creating the correction confirmation data 54) by the output unit 3 will be described. The output unit 3 arranges the image data 52 in the image data arrangement area of the correction confirmation data 54, and calculates which character string of the image data 52 has been deleted from the correction information tag and the coordinate information in the text data 53. Then, attention information (shading of the deleted part) that allows the confirmer to visually recognize the correction contents is put on the corresponding character string. Further, the output unit 3 arranges a balloon pointing to the insertion position in the image data 52 in the correction information arrangement area, and sets an insertion character string in the balloon. In the example of FIG. 9, the output unit 3 places shaded attention information on a character string with a deletion tag attached to the image data 52, and places a balloon that points to the insertion position of the image data 52 in the correction information arrangement area. Then, the character string to which the insertion tag is attached is placed in the balloon. A more specific example of the correction confirmation data 54 is shown in FIG.

図１１のフローチャートを参照しつつ、出力部３によるマージ処理の動作を説明する。 The operation of the merge process by the output unit 3 will be described with reference to the flowchart of FIG.

出力部３は、テキストデータ５３を不揮発性記憶装置から受け取り（Ｓ２０）、イメージデータ５２を不揮発性記憶装置から受け取る（Ｓ２１）。出力部３は、イメージデータ５２を図８で示したイメージデータ配置領域に配置し（Ｓ２２）、テキストデータ５３を読み込む（Ｓ２３）。 The output unit 3 receives the text data 53 from the nonvolatile storage device (S20), and receives the image data 52 from the nonvolatile storage device (S21). The output unit 3 arranges the image data 52 in the image data arrangement area shown in FIG. 8 (S22), and reads the text data 53 (S23).

出力部３は、ここで、最後までテキストデータ５３を読み込んだかを判定し（Ｓ２４）、最後まで読み込んでいない場合（Ｓ２４、ＮＯ）、次に現在の読み込み箇所が修正情報タグであるかを判定する（Ｓ２５）。ここで、修正情報タグでない場合（Ｓ２５、ＮＯ）、処理はステップＳ２３に戻りテキストデータ５３内の次の文字を読み込む。一方、修正情報タグである場合（Ｓ２５、ＹＥＳ）、出力部３は、修正情報タグの種別を判定する（Ｓ２６）。 Here, the output unit 3 determines whether or not the text data 53 has been read to the end (S24), and if it has not been read to the end (S24, NO), determines whether or not the current read location is a correction information tag. (S25). If it is not a correction information tag (S25, NO), the process returns to step S23 to read the next character in the text data 53. On the other hand, when it is a correction information tag (S25, YES), the output unit 3 determines the type of the correction information tag (S26).

修正情報タグが削除タグである場合（Ｓ２６、削除タグ）、出力部３は削除範囲を算出する（Ｓ２７）。本実施の形態では、出力部３は、テキストデータ５３の削除タグが付与されている文字列の最初の文字が、行の端から何文字目（値Ａとする）にあるのか、および削除タグが付与されている文字列の文字数（値Ｂとする）をカウントし、削除範囲を決定する。 When the correction information tag is a deletion tag (S26, deletion tag), the output unit 3 calculates a deletion range (S27). In the present embodiment, the output unit 3 determines the number of characters (value A) from the end of the line where the first character of the character string to which the deletion tag of the text data 53 is assigned, and the deletion tag. Is counted to determine the deletion range.

出力部３は、イメージデータ５２上での削除箇所を設定し、該当箇所に注意情報を乗せる（Ｓ２８）。出力部３は、ステップＳ２８で以下の処理を実行する。
（Ｓ２８−１）削除タグが付与されている文字列が存在する行の座標情報（行の起点（xs, ys）、終点(xe, ye)の座標）をテキストデータ５３から取得する。
（Ｓ２８−２）取得した座標情報を用いて、イメージデータ５２の該当行を特定する。
（Ｓ２８−３）イメージデータ５２上で、特定された該当行の修正前総文字数（値Ｃとする）で座標情報の文字方向のサイズを割り（横書きの場合：（xe − xs）／Ｃ、縦書きの場合：（ye − ys）／Ｃ）、１文字ごとの座標位置を算出し、値Ａが該当する座標位置から値Ｂが該当する座標位置の範囲に網掛けを乗せる。
尚、Ｓ２７では、テキストデータ５３のカウント値とＳ２８−３の網掛けをする座標位置とにずれが生じないようにするため、ＸＭＬ形式のタグ文字および挿入タグが付与されている文字列はカウントしないものとする。 The output unit 3 sets a deletion location on the image data 52 and puts attention information on the corresponding location (S28). The output unit 3 executes the following process in step S28.
(S28-1) The coordinate information (coordinates of the starting point (xs, ys) and ending point (xe, ye) of the line) of the line in which the character string to which the deletion tag is assigned is acquired from the text data 53.
(S28-2) The corresponding line of the image data 52 is specified using the acquired coordinate information.
(S28-3) On the image data 52, the size in the character direction of the coordinate information is divided by the total number of characters before correction (value C) of the identified line (in the case of horizontal writing: (xe−xs) / C, In the case of vertical writing: (ye−ys) / C) The coordinate position for each character is calculated, and the range of the coordinate position corresponding to the value B is shaded from the coordinate position corresponding to the value A.
In S27, in order to prevent a shift between the count value of the text data 53 and the coordinate position to be shaded in S28-3, the character string to which the XML format tag character and the insertion tag are attached is counted. Shall not.

ステップＳ２８では、上記方法以外にも、イメージデータ５２で行が特定された後に、削除タグが付与されている文字列を取得し、ＯＣＲ機能を用いてイメージデータ５２の該当行に対して、削除タグが付与されている文字列であるか否かサーチする方法も考えられる。しかし、イメージデータ５２の同一行の中に同じ文字列が複数あり、一方は削除タグが付与されており、他方は付与されてない場合、イメージデータ５２内をサーチし検索対象文字列が見つかったときに、削除対象の文字列なのか否かの判断が困難となる。本実施の形態では、かかる点を考慮して上述のような実装としている。 In step S28, in addition to the above method, after a line is specified in the image data 52, a character string to which a deletion tag is attached is acquired, and the corresponding line in the image data 52 is deleted using the OCR function. A method of searching for whether or not a character string has a tag is also conceivable. However, when there are a plurality of the same character strings in the same line of the image data 52, one of them is assigned a deletion tag, and the other is not attached, a search target character string is found by searching the image data 52. Sometimes, it is difficult to determine whether the character string is to be deleted. In the present embodiment, the above-described mounting is performed in consideration of such points.

ステップＳ２６の処理に説明を戻す。修正情報タグが挿入タグである場合（Ｓ２６、挿入タグ）、出力部３は挿入箇所を算出する（Ｓ２９）。出力部３は、ここでテキストデータ５３の挿入タグが付与されている文字列の一つ前の文字が、行の端から何文字目にあるのかをカウントする。 The description returns to the process of step S26. When the correction information tag is an insertion tag (S26, insertion tag), the output unit 3 calculates the insertion location (S29). The output unit 3 counts the number of characters from the end of the line where the character immediately before the character string to which the insertion tag of the text data 53 is assigned.

次に出力部３は、挿入タグが付与されている文字列をテキストデータ５３から抽出し（Ｓ３０）、イメージデータ５２への挿入箇所を設定する（Ｓ３１）。 Next, the output unit 3 extracts the character string to which the insertion tag is assigned from the text data 53 (S30), and sets the insertion location in the image data 52 (S31).

出力部３は、Ｓ３１で以下の処理を実行する。
（Ｓ３１−１）挿入タグが付与されている文字列が存在する行の座標情報（行の起点、終点の座標）をテキストデータ５３から取得する。
（Ｓ３１−２）取得した座標情報を用いて、イメージデータ５２の該当行を特定する。
（Ｓ３１−３）イメージデータ５２上で、特定された該当行の修正前総文字数（値Ｄとする）で座標情報の文字方向のサイズを割り（横書きの場合：（xe − xs）／Ｄ、縦書きの場合：（ye − ys）／Ｄ）、１文字ごとの座標位置を行の基点から順に算出し、算出した座標位置の数がステップＳ２９で得られたカウント値になった場合、その座標位置をイメージデータ５２上の挿入箇所として特定する。
尚、Ｓ２９では、テキストデータ５３のカウント値とＳ３１−３の座標位置とにずれが生じないようにするため、ＸＭＬ形式のタグ文字および挿入タグが付与されている文字列はカウントしないものとする。 The output unit 3 executes the following process in S31.
(S31-1) The coordinate information (coordinates of the start point and end point of the line) of the line in which the character string to which the insertion tag is attached is obtained from the text data 53.
(S31-2) The corresponding line of the image data 52 is specified using the acquired coordinate information.
(S31-3) On the image data 52, the size in the character direction of the coordinate information is divided by the total number of characters before correction (value D) in the specified line (in the case of horizontal writing: (xe−xs) / D, For vertical writing: (ye−ys) / D) When the coordinate position for each character is calculated in order from the base point of the line, and the number of calculated coordinate positions becomes the count value obtained in step S29, A coordinate position is specified as an insertion location on the image data 52.
In S29, in order to prevent a shift between the count value of the text data 53 and the coordinate position of S31-3, the character string to which the XML format tag character and the insertion tag are attached is not counted. .

次に出力部３は、修正情報配置領域（図８参照）に、ステップＳ３１で得られたイメージデータ５２上の挿入箇所をポイントする吹き出しを配置し、この吹き出し内の領域に挿入タグが付与されている文字列を配置する。（Ｓ３２）。 Next, the output unit 3 arranges a balloon that points to the insertion location on the image data 52 obtained in step S31 in the correction information arrangement area (see FIG. 8), and an insertion tag is assigned to the area in the balloon. Place the character string. (S32).

ステップＳ２８、ステップＳ３２の後に、処理はＳ２３へ戻り、次の文字に対しての処理が実行される。また、出力部３は、ステップＳ２４で最後までテキストデータ５３を読み込んだと判定した場合（Ｓ２４、ＹＥＳ）、終了処理（修正確認データ５４のファイル作成やディスプレイ上への表示、使用したメモリの開放等）を実行し（Ｓ３３）、処理は終了する。 After step S28 and step S32, the process returns to S23, and the process for the next character is executed. If the output unit 3 determines in step S24 that the text data 53 has been read to the end (YES in S24), the output unit 3 creates a file of the correction confirmation data 54, displays it on the display, and releases the used memory. Etc.) is executed (S33), and the process ends.

次に、新旧対比表生成部４の詳細を図１２を参照しつつ説明する。新旧対比表生成部４は、文字編集部２からの新旧対比表の出力指示を受けて（Ｓ４０）、文字編集部２が生成したテキストデータ５３を取り込み（Ｓ４１）、テキストデータ５３内の修正情報タグを元に新旧対比表５５を生成しファイル出力する（Ｓ４２、Ｓ４３）。新旧対比表５５として出力されるデータはＣＳＶ（Comma Separated Values）形式であるものとする。 Next, details of the new and old comparison table generator 4 will be described with reference to FIG. The old and new comparison table generation unit 4 receives an instruction to output the old and new comparison table from the character editing unit 2 (S40), takes in the text data 53 generated by the character editing unit 2 (S41), and corrects information in the text data 53 The new and old comparison table 55 is generated based on the tag and output as a file (S42, S43). The data output as the old and new comparison table 55 is assumed to be in CSV (Comma Separated Values) format.

新旧対比表５５の生成例を図１３に示す。新旧対比表生成部４は、テキストデータ５３から行単位で文字列情報を抽出し、「挿入」、「削除」に応じて修正前と修正後の新旧対比表５５を生成する。本実施の形態では、図１３の例のように、ページ番号を示す「頁」、修正前の行数を示す「行数」、（挿入／削除）または削除の別を示す「修正タイプ」、修正前の文字列を示す「修正前原本」、修正後の文字列を示す「修正後」を１つのレコードとしたＣＳＶデータが生成される。「修正前原本」には、挿入タグが付与されている文字列が取り除かれたテキストデータが抽出され、「修正後」には、削除タグが付与されている文字列が取り除かれたテキストデータが抽出される。 A generation example of the old and new comparison table 55 is shown in FIG. The old and new comparison table generation unit 4 extracts character string information in line units from the text data 53, and generates the old and new comparison table 55 before and after correction according to “insertion” and “deletion”. In the present embodiment, as in the example of FIG. 13, “page” indicating the page number, “number of lines” indicating the number of lines before correction, “modification type” indicating (insertion / deletion) or deletion, CSV data is generated with “record before modification” indicating a character string before correction and “after correction” indicating a character string after correction as one record. The text data from which the character string with the insertion tag is removed is extracted in the “original document”, and the text data from which the character string to which the deletion tag is added is removed in “After modification”. Extracted.

ここで、新旧対比表を作成する理由について説明する。例えば、文書を修正した後に、その修正内容を通知するための通達文書を作成することがある。この通達文書には、文書のどの箇所がどのように修正したかを記載する必要がある。従来、修正箇所を目視確認し通達文書を新規で作成していたが、本実施の形態のように新旧対比表を装置が自動で作成すれば、作成された表をそのまま貼り付けることで通達文書を作成することができる。よって、本実施の形態によって作業効率を向上させることができる。 Here, the reason for creating the old and new comparison table will be described. For example, after a document is corrected, a notification document for notifying the correction content may be created. This notification document must describe which part of the document has been modified and how. Conventionally, a notification document has been created by visually checking the correction location, but if the device automatically creates a new and old comparison table as in this embodiment, the notification document can be created by pasting the created table as it is. Can be created. Therefore, working efficiency can be improved by this embodiment.

また、例えば監督官庁等の機関からの行政指導により、文書修正が指示された場合、修正結果を当該機関に報告する必要がある。この報告文書に記載する修正内容に新旧対比表を使用することが可能となる。 For example, when a document correction is instructed by administrative guidance from an organization such as a supervisory government office, it is necessary to report the correction result to the organization. It is possible to use the old and new comparison tables for the correction contents described in this report document.

最後に、文書管理システム３００の全体動作を図１４のフローチャートを参照しつつ説明する。 Finally, the overall operation of the document management system 300 will be described with reference to the flowchart of FIG.

イメージスキャニング部２０は、修正前原本５１を読み取り、イメージデータ５２（例えばＪＰＥＧ形式の画像データ）を生成する（Ｓ５１）。このイメージデータ５２には、少なくとも一つの文字または複数の文字で構成された文字列が記されているものとする。 The image scanning unit 20 reads the uncorrected original 51 and generates image data 52 (for example, image data in JPEG format) (S51). It is assumed that the image data 52 includes a character string composed of at least one character or a plurality of characters.

文字認識部１は、イメージデータ５２から、文字列を読み取りテキストデータ５３を生成する（Ｓ５２）。このテキストデータ５３には、イメージデータ５２に記された文字列の行ごとに、その行がイメージデータ５２のいずれの位置にあるかを示す座標情報が付与される。 The character recognition unit 1 reads a character string from the image data 52 and generates text data 53 (S52). The text data 53 is given coordinate information indicating the position of the line in the image data 52 for each line of the character string written in the image data 52.

文字編集部２は、文字認識部１によって生成されたテキストデータ５３を取得し、テキストデータ５３に対してのユーザによる編集作業に応じて、削除された文字列には削除タグを付与し、挿入された文字列には挿入タグを付与する。文字編集部２は、これら削除タグ、挿入タグが付与されたテキストデータ５３を生成する（Ｓ５３）。 The character editing unit 2 acquires the text data 53 generated by the character recognition unit 1, adds a deletion tag to the deleted character string, and inserts it according to the editing operation by the user on the text data 53. An insertion tag is assigned to the character string that has been set. The character editing unit 2 generates text data 53 to which these deletion tags and insertion tags are assigned (S53).

次に、文字編集部２から修正確認用データの出力指示があった場合（Ｓ５４、修正確認用データ）の処理について説明する。 Next, a description will be given of processing when there is an instruction to output correction confirmation data from the character editing unit 2 (S54, correction confirmation data).

出力部３は、修正確認用データを作成し、ファイル出力やディスプレイに表示する（Ｓ５５）。Ｓ５５について説明する。出力部３は、イメージデータ５２と文字編集部２によって生成されたテキストデータ５３とを取得する。出力部３は、イメージデータ５２と挿入タグが付与された文字列とを表示する。さらに、出力部３は、削除タグが付与された文字列に対応するイメージデータ５２の文字列を、他の文字列とは異なる形式（例えば網掛け形式）で表示する。加えて出力部３は、挿入タグが付与された文字列がイメージデータ５２のいずれに挿入されるかをユーザが視認可能な形式（例えば挿入箇所をポイントしている吹き出し）で表示する。 The output unit 3 creates correction confirmation data and displays it on a file output or display (S55). S55 will be described. The output unit 3 acquires the image data 52 and the text data 53 generated by the character editing unit 2. The output unit 3 displays the image data 52 and the character string to which the insertion tag is attached. Further, the output unit 3 displays the character string of the image data 52 corresponding to the character string to which the deletion tag is assigned in a format different from other character strings (for example, a shaded format). In addition, the output unit 3 displays in which format the character string to which the insertion tag is added is inserted into the image data 52 in a format that the user can visually recognize (for example, a balloon pointing to the insertion location).

イメージデータ５２上での削除箇所、挿入箇所の特定について説明する。出力部３は、削除タグが付与されている文字列の最初の文字が、行の端から何文字目にあるのか、および前記文字列の文字数を、文字編集部２によって生成されたテキストデータ５３を用いてカウントし、これらカウント値と座標情報とに基づき、削除タグが付与された文字列に対応するイメージデータ５２の文字列を特定する。また、挿入タグが付与されている文字列の最初の文字が、行の端から何文字目にあるのかを、文字編集部２によって生成されたテキストデータ５３を用いてカウントし、このカウント値と座標情報とに基づき、挿入タグが付与された文字列がイメージデータ５２のいずれに挿入されるかを特定する。 A description will be given of how to specify a deletion location and an insertion location on the image data 52. The output unit 3 uses the text data 53 generated by the character editing unit 2 to determine the number of characters from the end of the line, and the number of characters in the character string, to which the first character of the character string to which the deletion tag is assigned. The character string of the image data 52 corresponding to the character string to which the deletion tag is assigned is specified based on the count value and the coordinate information. In addition, the number of characters from the end of the line at which the first character of the character string to which the insertion tag is attached is counted using the text data 53 generated by the character editing unit 2, and this count value and Based on the coordinate information, it is specified in which of the image data 52 the character string to which the insertion tag is attached is inserted.

Ｓ５４の判定処理に説明を戻す。文字編集部２から新旧対応表の出力指示があった場合（Ｓ５４、新旧対応表）の処理について説明する。 The description returns to the determination process of S54. The processing when the character editing unit 2 instructs to output the old and new correspondence table (S54, old and new correspondence table) will be described.

新旧対比表生成部４は、文字編集部２によって生成されたテキストデータ５３の削除タグ、挿入タグに基づき、ユーザの編集作業によって削除された文字列または挿入された文字列を特定し、編集作業の前と後との新旧対比表５５を生成する（Ｓ５６）。 The old and new comparison table generation unit 4 identifies a character string deleted or inserted by a user editing operation based on the deletion tag and insertion tag of the text data 53 generated by the character editing unit 2, and the editing operation The old and new comparison table 55 before and after is generated (S56).

本実施の形態では、文書管理端末１００は画像データを取得し、画像データに対して処理するものとしたが、画像データに限らず文字列を有するデータであれば本実施の形態の態様を適用することができる。 In the present embodiment, the document management terminal 100 acquires image data and processes the image data. However, the present embodiment is not limited to image data, and any data having a character string can be applied. can do.

また、本実施の形態では、スキャナ２００、文書管理端末１００の２つのユニットに分かれた文書管理システム３００について説明したが、これら２つのユニットを１つのＯＣＲ装置（光学式文字読取装置）とすることも可能である。 In the present embodiment, the document management system 300 divided into two units of the scanner 200 and the document management terminal 100 has been described. However, these two units are assumed to be one OCR device (optical character reader). Is also possible.

本実施の形態のように、修正前原本と修正後の出力物との確認作業を一つのイメージにマージすることで、修正前原本に対して修正箇所を明示することが可能になり、確認漏れを無くすとともに比較確認も容易となる。また、修正前原本のテキスト情報と修正後のテキスト情報とが対比された表を作成することで、新旧対比を容易に作成可能となる。 As in this embodiment, by merging the confirmation work of the uncorrected original and the corrected output into a single image, it becomes possible to clearly indicate the correction location for the uncorrected original, and omission of confirmation This makes it easier to confirm the comparison. In addition, by creating a table in which the text information of the original before correction and the text information after correction are compared, it is possible to easily create a comparison between old and new.

本実施の形態において、文書管理プログラムは上述した文書管理端末の内部に予めインストールされているものとして記載したが、本発明における文書管理プログラムは記憶媒体に記憶されたものも含まれる。ここで記憶媒体とは、磁気テープ、磁気ディスク（ハードディスクドライブ等）、光ディスク（ＣＤ−ＲＯＭ、ＤＶＤディスク等）、光磁気ディスク（ＭＯ等）、フラッシュメモリ等、文書管理端末に対し脱着可能な媒体や、さらにネットワークを介することで伝送可能な媒体等、上述した文書管理端末におけるコンピュータで読み取りや実行が可能な全ての媒体をいう。 In the present embodiment, the document management program is described as being installed in advance in the above-described document management terminal. However, the document management program in the present invention includes one stored in a storage medium. Here, the storage medium is a medium that is detachable from the document management terminal, such as a magnetic tape, a magnetic disk (hard disk drive, etc.), an optical disk (CD-ROM, DVD disk, etc.), a magneto-optical disk (MO, etc.), and a flash memory. Further, it refers to all media that can be read and executed by a computer in the document management terminal described above, such as media that can be transmitted via a network.

本発明は、その精神または主要な特徴から逸脱することなく、他の様々な形で実施することができる。そのため、前述の実施の形態は、あらゆる点で単なる例示に過ぎず、限定的に解釈してはならない。本発明の範囲は、特許請求の範囲によって示すものであって、明細書本文には、何ら拘束されない。更に、特許請求の範囲の均等範囲に属する全ての変形、様々な改良、代替および改質は、全て本発明の範囲内のものである。 The present invention can be implemented in various other forms without departing from the spirit or main features thereof. Therefore, the above-described embodiment is merely an example in all respects and should not be interpreted in a limited manner. The scope of the present invention is shown by the scope of claims, and is not restricted by the text of the specification. Moreover, all modifications, various improvements, substitutions and modifications belonging to the equivalent scope of the claims are all within the scope of the present invention.

１文字認識部、２文字編集部、３出力部、４、新旧対比表生成部、２０イメージスキャニング部、５１修正前原本、５２イメージデータ、５３テキストデータ、５４修正確認用データ、５５新旧対比表、１００文書管理端末、２００スキャナ、３００文書管理システム。 1 character recognition unit, 2 character editing unit, 3 output unit, 4 old and new comparison table generation unit, 20 image scanning unit, 51 original document before correction, 52 image data, 53 text data, 54 correction confirmation data, 55 old and new comparison table , 100 document management terminal, 200 scanner, 300 document management system.

Claims

From the image data in which a character string composed of one character or a plurality of characters is written, the character string is read to generate text data, and the information about the storage location and size of the image data is used as image data information. A recognition unit to be assigned to
The text data is acquired, deletion information is given to the character string deleted by the editing operation on the text data, insertion information is given to the inserted character string, and the deletion information and insertion information are An editing support unit for generating the given text data;
The image data and the text data generated by the editing support unit are acquired, the image data and the character string to which insertion information is added are displayed, and the image corresponding to the character string to which deletion information is assigned A display unit that displays a character string of data in a format different from other character strings, and displays in which form the character string to which the insertion information is added is inserted into the image data in a visually recognizable format When,
I have a,
For each line of the character string written in the image data, the recognition unit gives coordinate information indicating the position of the line in the image data to the text data,
The display unit displays the number of characters from the end of the line where the first character of the character string to which deletion information is assigned, and the number of column characters of the character string to which the deletion information is assigned by the editing support unit. Count using the generated text data, and based on these count values and the coordinate information, specify the character string of the image data corresponding to the character string to which the deletion information is assigned, and is provided with insertion information The character immediately before the character string is counted from the end of the line using the text data generated by the editing support unit, and based on the count value and the coordinate information, An information processing apparatus that identifies in which of the image data the character string to which the insertion information is added is inserted .

The information processing apparatus according to claim 1 ,
The display unit
The coordinate information of the starting point and the ending point of the line in which the character string to which the deletion information is attached or the character string to which the insertion information is attached is obtained from the text data, and the image is obtained using the starting point and ending point coordinate information. The line in the data is specified, and the coordinate position of each character on the image data is calculated by dividing the difference of the coordinate information of the starting point from the coordinate information of the end point by the total number of characters before correction of the line. ,
When the character string of the image data corresponding to the character string to which the deletion information is added is specified, the first character of the character string to which the deletion information is added at the calculated coordinate position of each character is a line end. From the coordinate position corresponding to the count value indicating the number of characters from, to the range on the image data from the coordinate position corresponding to the number of column characters,
When the character string of the image data corresponding to the character string to which the insertion information is assigned is specified, the character immediately before the character string to which the insertion information is assigned out of the calculated coordinate position of each character An information processing apparatus, wherein information relating to the character string to which the insertion information is added is arranged at a coordinate position corresponding to the count value indicating the character number from the line end.

A recognition unit of the computer reads the character string from image data in which a character string composed of one character or a plurality of characters is written, generates text data, and stores information on the storage location and size of the image data. Attached to the text data as image data information,
The editing support unit of the computer obtains the text data, gives deletion information to a character string deleted by editing the text data, and gives insertion information to the inserted character string Then, generate the text data with the deletion information and insertion information.
The display unit of the computer acquires the image data and text data generated by the editing support unit , displays the image data and a character string to which insertion information is added, and is given deletion information. The character string of the image data corresponding to the character string displayed is displayed in a format different from that of the other character strings, and it is visually recognized in which of the image data the character string provided with the insertion information is inserted. Display in a possible format ,
For each line of the character string written in the image data, the recognition unit gives coordinate information indicating the position of the line in the image data to the text data,
The display unit determines the number of characters from the end of the line where the first character of the character string to which deletion information has been assigned, and the number of column characters of the character string to which the deletion information has been assigned, by the editing support unit. Count using the generated text data, and based on these count values and the coordinate information, specify the character string of the image data corresponding to the character string to which the deletion information is assigned, and is provided with insertion information The character immediately before the character string is counted from the end of the line using the text data generated by the editing support unit, and based on the count value and the coordinate information, A document management method for specifying in which of the image data the character string to which the insertion information is added is inserted .

From the image data in which a character string composed of one character or a plurality of characters is written, the character string is read to generate text data, and the information about the storage location and size of the image data is used as image data information. A recognition unit to be assigned to
The text data is acquired, deletion information is given to the character string deleted by the editing operation on the text data, insertion information is given to the inserted character string, and the deletion information and insertion information are An editing support unit for generating the given text data;
The image data and the text data generated by the editing support unit are acquired, the image data and the character string to which insertion information is added are displayed, and the image corresponding to the character string to which deletion information is assigned display the character string data, it is displayed in a different format from the other strings, and whether the insertion information is imparted string is inserted into any of the image data is displayed in a visible form
Function as a computer
For each line of the character string written in the image data, the recognition unit gives coordinate information indicating the position of the line in the image data to the text data,
The display unit displays the number of characters from the end of the line where the first character of the character string to which deletion information is assigned, and the number of column characters of the character string to which the deletion information is assigned by the editing support unit. Count using the generated text data, and based on these count values and the coordinate information, specify the character string of the image data corresponding to the character string to which the deletion information is assigned, and is provided with insertion information The character immediately before the character string is counted from the end of the line using the text data generated by the editing support unit, and based on the count value and the coordinate information, A document management program for specifying in which of the image data the character string to which the insertion information is added is inserted .