JP2017156982A - Image conversion program, image conversion device, and image conversion method

Info

Publication number: JP2017156982A
Application number: JP2016039559A
Authority: JP
Inventors: 純黒木; Jun Kuroki
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 2016-03-02
Filing date: 2016-03-02
Publication date: 2017-09-07
Anticipated expiration: 2036-03-02
Also published as: JP6662108B2

Abstract

PROBLEM TO BE SOLVED: To generate structured data that ensures reproducibility of the original document, maintains the searchability and re-editability of the document, and suppresses degradation of the document's visibility.
SOLUTION: An image conversion device (image conversion program) acquires first image data obtained by having an image reading unit read a document or an imaging unit image the document; applies vectorization processing to the first image data to convert it into structured data; analyzes the structured data to obtain object information; applies rasterization processing to the structured data to reconvert it into second image data; compares the first image data with the second image data to extract a difference portion; specifies, based on the object information, an object region of a predetermined range to which an object arranged in the difference portion belongs; acquires image information corresponding to the object region from the first image data; updates the structured data using the acquired image information; and outputs the updated structured data.
SELECTED DRAWING: Figure 9

Description

The present invention relates to an image conversion program, an image conversion device, and an image conversion method, and more particularly to an image conversion program, image conversion device, and image conversion method for generating structured data from image data.

In recent years, to save resources, documents printed on paper have been read with a scanner or the like, converted into image data, and stored. Because image data does not allow searching for a specific object in a document or re-editing the document, the image data is also converted into vector data and the vector data is stored. However, misrecognition may occur in the process of converting image data into vector data (referred to as vector conversion or vectorization).

As a way of dealing with such misrecognition, Patent Document 1 below, for example, discloses an image processing apparatus that performs image processing on a document image obtained by reading a document, the apparatus comprising: reading means for reading the document; first conversion means for converting the document image read by the reading means into vector data; second conversion means for converting the vector data into image data; comparison means for comparing first image data of the document image with second image data generated by the second conversion means; and selection means for selecting, based on the comparison result of the comparison means, either the first image data or the vector data as the electronic file corresponding to the document image.

Patent Document 1: JP 2005-157450 A

In Patent Document 1, whether to store the entire document as image data or as vector data is selected based on the result of comparing the original image data of the document with the converted image data. When the document is stored as image data, however, text information and graphics information are lost, so the searchability and re-editability of the document are lost.

Another conceivable method is to replace only the portions that may have been misrecognized during vectorization with image data. With this method, however, vector data and raster data are mixed within a given region, so the visibility of the document may deteriorate.

The present invention has been made in view of the above problems, and its main object is to provide an image conversion program, an image conversion device, and an image conversion method capable of generating structured data that ensures reproducibility of the original document, maintains the searchability and re-editability of the document, and suppresses deterioration of the document's visibility.

One aspect of the present invention is an image conversion program that runs on a device capable of executing vectorization processing and rasterization processing, the program causing the device to execute: a first process of acquiring first image data obtained by having an image reading unit read a document or first image data obtained by having an imaging unit image the document; a second process of performing the vectorization processing on the first image data to convert it into structured data; a third process of analyzing the structured data to acquire object information; a fourth process of performing the rasterization processing on the structured data to reconvert it into second image data; a fifth process of comparing the first image data with the second image data to extract a difference portion; a sixth process of specifying, based on the object information, an object region of a predetermined range to which an object arranged in the difference portion belongs; a seventh process of acquiring image information corresponding to the object region from the first image data; and an eighth process of updating the structured data using the acquired image information and outputting the updated structured data.

An image conversion device according to one aspect of the present invention comprises: an image reading unit or an imaging unit; a data acquisition unit that acquires first image data obtained by the image reading unit reading a document or first image data obtained by the imaging unit imaging the document; a vectorization processing unit that performs vectorization processing on the first image data to convert it into structured data; an analysis unit that analyzes the structured data to acquire object information; a rasterization processing unit that performs rasterization processing on the structured data to reconvert it into second image data; a comparison unit that compares the first image data with the second image data to extract a difference portion; and a data update unit that specifies, based on the object information, an object region of a predetermined range to which an object arranged in the difference portion belongs, acquires image information corresponding to the object region from the first image data, updates the structured data using the acquired image information, and outputs the updated structured data.

One aspect of the present invention is an image conversion method in a system including a control device capable of executing vectorization processing and rasterization processing and a device having an image reading unit or an imaging unit, wherein the control device executes: a first process of acquiring first image data obtained by having the image reading unit read a document or first image data obtained by having the imaging unit image the document; a second process of performing the vectorization processing on the first image data to convert it into structured data; a third process of analyzing the structured data to acquire object information; a fourth process of performing the rasterization processing on the structured data to reconvert it into second image data; a fifth process of comparing the first image data with the second image data to extract a difference portion; a sixth process of specifying, based on the object information, an object region of a predetermined range to which an object arranged in the difference portion belongs; a seventh process of acquiring image information corresponding to the object region from the first image data; and an eighth process of updating the structured data using the acquired image information and outputting the updated structured data.

According to the image conversion program, image conversion device, and image conversion method of the present invention, it is possible to generate structured data that ensures reproducibility of the original document, maintains the searchability and re-editability of the document, and suppresses deterioration of the document's visibility.

This is because the image conversion device (image conversion program) acquires first image data obtained by having an image reading unit read a document or first image data obtained by having an imaging unit image the document; performs vectorization processing on the first image data to convert it into structured data; analyzes the structured data to acquire object information; performs rasterization processing on the structured data to reconvert it into second image data; compares the first image data with the second image data to extract a difference portion; specifies, based on the object information, an object region of a predetermined range to which an object arranged in the difference portion belongs; acquires image information corresponding to the object region from the first image data; updates the structured data using the acquired image information; and outputs the updated structured data.

FIG. 1 is a schematic diagram showing an example of an image conversion system according to an embodiment of the present invention.
FIG. 2 is a schematic diagram showing another example of the image conversion system according to an embodiment of the present invention.
FIG. 3 is a schematic diagram showing another example of the image conversion system according to an embodiment of the present invention.
FIG. 4 is a schematic diagram showing another example of the image conversion system according to an embodiment of the present invention.
FIG. 5 is a schematic diagram showing another example of the image conversion system according to an embodiment of the present invention.
FIG. 6 is a schematic diagram showing another example of the image conversion system according to an embodiment of the present invention.
FIG. 7 is a block diagram showing the configuration of an image conversion device according to an embodiment of the present invention.
FIG. 8 is a block diagram showing the configuration of an image forming apparatus according to an embodiment of the present invention.
FIG. 9 is a flowchart showing the operation of the image conversion device according to an embodiment of the present invention.
FIG. 10 is an example of an input image according to an embodiment of the present invention.
FIG. 11 is an example of structured data according to an embodiment of the present invention as displayed by a document creation application.
FIG. 12 is a diagram explaining how character strings are recognized from a text object according to an embodiment of the present invention.
FIG. 13 is a diagram explaining how shapes are recognized from a graphics object according to an embodiment of the present invention.
FIG. 14 is a diagram showing an example of the classification of object regions in the structured data of FIG. 11.
FIG. 15 is a diagram showing the result of rasterizing structured data according to an embodiment of the present invention.
FIG. 16 is an example of the comparison result for graphic 1 according to an embodiment of the present invention.
FIG. 17 is a diagram showing the comparison result for graphic 1 at the pixel level according to an embodiment of the present invention.
FIG. 18 is a diagram explaining the process of specifying the object region to be updated according to an embodiment of the present invention, in which (a) shows the difference portion, (b) shows the position of the object arranged in the difference portion, and (c) shows the object region to be updated.
FIG. 19 is an example of the updated structured data according to an embodiment of the present invention as displayed by a document creation application.
FIG. 20 is another example of an input image according to an embodiment of the present invention.
FIG. 21 is a diagram showing an example of the classification of object regions in the structured data of FIG. 20.

As described in the background art, documents printed on paper are read with a scanner or the like and converted into image data, and the image data is converted into vector data and stored; however, misrecognition may occur in the vectorization process that converts image data into vector data. To address this, Patent Document 1 selects whether to store the entire document as image data or as vector data based on the result of comparing the original image data with the converted image data, but when the document is stored as image data, its searchability and re-editability are lost. There is also a method of replacing only the portions misrecognized during vectorization with image data, but with this method vector data and raster data are mixed within a given region, so the visibility of the document may deteriorate.

Therefore, in one embodiment of the present invention, the input image data is vectorized and converted into structured data; the structured data is rasterized and converted back into image data; the input image data and the reconverted image data are compared to extract a difference portion; an object region of a predetermined range to which an object arranged in the difference portion belongs is specified; and the entire specified object region is updated using the input image data (either the specified object region of the structured data is replaced with image data, or instruction data for overwriting the specified object region of the structured data with image data is created).
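The flow of this embodiment can be sketched in Python as follows. This is a minimal illustration, not the implementation described in the patent: the function `convert`, the helpers `overlaps` and `crop`, the box convention, and the five injected callables are all hypothetical names introduced here, and the vectorizer, analyzer, rasterizer, comparator, and updater are assumed to be supplied by the caller.

```python
from typing import Any, Callable, List, Tuple

Image = List[List[int]]            # grayscale raster: rows of pixel values
Box = Tuple[int, int, int, int]    # (left, top, right, bottom), right/bottom exclusive


def overlaps(a: Box, b: Box) -> bool:
    """True if the two boxes share at least one pixel."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def crop(image: Image, box: Box) -> Image:
    """Copy the pixels of `image` that lie inside `box`."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def convert(
    first_image: Image,
    vectorize: Callable[[Image], Any],
    analyze: Callable[[Any], List[Box]],
    rasterize: Callable[[Any], Image],
    extract_differences: Callable[[Image, Image, List[Box]], List[Box]],
    update: Callable[[Any, Box, Image], Any],
) -> Any:
    """Run the vectorize / rasterize / compare / update loop (hypothetical sketch)."""
    structured = vectorize(first_image)        # input image -> structured data
    regions = analyze(structured)              # object regions from the structured data
    second_image = rasterize(structured)       # structured data -> image again
    diffs = extract_differences(first_image, second_image, regions)
    # Select every object region that a difference portion falls into.
    targets = [r for r in regions if any(overlaps(r, d) for d in diffs)]
    for region in targets:
        patch = crop(first_image, region)      # pixels taken from the input image
        structured = update(structured, region, patch)  # refresh the whole region
    return structured
```

The point of the sketch is that the update is applied to the whole object region containing a difference, not only to the differing pixels, so vector and raster data are not mixed inside one region.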

By updating with the input image data not only the portion that was misconverted during vectorization of the input image data but the entire object region of the predetermined range to which the object arranged in that portion belongs, it is possible to generate structured data that ensures reproducibility of the original document, maintains the searchability and re-editability of the document, and suppresses deterioration of the document's visibility.

To describe the embodiment outlined above in more detail, an image conversion program, an image conversion device, and an image conversion method according to one embodiment of the present invention are described with reference to FIGS. 1 to 21. FIGS. 1 to 6 are schematic diagrams showing examples of the image conversion system of this embodiment, FIG. 7 is a block diagram showing the configuration of the image conversion device, and FIG. 8 is a block diagram showing the configuration of the image forming apparatus. FIG. 9 is a flowchart showing the operation of the image conversion device of this embodiment, and FIGS. 10 to 21 are diagrams explaining the image conversion method of this embodiment.

As shown in FIG. 1, the image conversion system 10 of this embodiment comprises an image conversion device 20 capable of executing vectorization processing and rasterization processing, and an image forming apparatus 30, such as an MFP (Multi-Functional Peripheral), that has an image reading unit for reading image data from a document printed on paper. These are connected using IEEE 1394, a parallel interface, or the like, or via a network such as a LAN (Local Area Network) or WAN (Wide Area Network) defined by standards such as Ethernet (registered trademark), Token Ring, and FDDI (Fiber-Distributed Data Interface).

Although in FIG. 1 the image conversion system 10 is composed of the image conversion device 20 and the image forming apparatus 30, it may instead be composed of the image conversion device 20 and an image reading device 40 such as a scanner, as shown in FIG. 2. When image data is acquired by imaging a document printed on paper with a camera or the like, the image conversion system 10 may be composed of the image conversion device 20 and an imaging device 50, as shown in FIG. 3. When the image forming apparatus 30, the image reading device 40, or the imaging device 50 can itself execute vectorization processing and rasterization processing, the image conversion system 10 may consist of the image forming apparatus 30, the image reading device 40, or the imaging device 50 alone, as shown in FIGS. 4 to 6 (that is, a configuration in which the image forming apparatus 30, the image reading device 40, or the imaging device 50 functions as the image conversion device). Each device is described in detail below, assuming the configuration of FIG. 1.

[Image conversion device]
The image conversion device 20 is a computer device such as a personal computer, a control device that controls the image forming apparatus 30, the image reading device 40, or the imaging device 50, a mobile terminal such as a smartphone or tablet, or the like. As shown in FIG. 7(a), it comprises a control unit 21, a storage unit 25, a network I/F unit 26, a display unit 27, an operation unit 28, and so on.

The control unit 21 comprises a CPU (Central Processing Unit) 22 and memory such as a ROM (Read Only Memory) 23 and a RAM (Random Access Memory) 24. The CPU 22 controls the overall operation of the image conversion device 20 by loading a control program (including a document display application capable of displaying structured data) stored in the ROM 23 or the storage unit 25 into the RAM 24 and executing it. As shown in FIG. 7(b), the control unit 21 functions as a data acquisition unit 21a, a vectorization processing unit 21b, an analysis unit 21c, a rasterization processing unit 21d, a comparison unit 21e, a data update unit 21f, and so on.

The data acquisition unit 21a acquires from the image forming apparatus 30, as an input image, the image data (first image data) obtained by the image reading unit 38 scanning a document. In the system configuration of FIG. 2, it acquires image data obtained by the image reading device 40 scanning a document; in the system configuration of FIG. 3, it acquires image data obtained by the imaging device 50 imaging a document.

The vectorization processing unit 21b performs known region discrimination processing on the acquired input image to classify the input image into text regions, graphics regions, and image regions, and then performs known vectorization processing to convert it into vector data. Specifically, text regions are converted into text codes by known OCR (Optical Character Recognition) processing, and graphics regions are converted into drawing commands for shapes. Based on the result of the vectorization processing, the vectorization processing unit 21b then converts the vector data into structured data that can be displayed by a document display application. Examples of document display applications include Microsoft (registered trademark) Word (registered trademark), Excel (registered trademark), and PowerPoint (registered trademark), and Adobe (registered trademark) Acrobat (registered trademark). Structured data is data described in a format such as PDF (Portable Document Format), ODF (OpenDocument Format), or OOXML (Office Open XML), in which attributes are described for each object.

The analysis unit 21c analyzes the structured data and acquires object information. Specifically, it acquires the attribute and drawing position of each object included in the structured data and, based on the relationships among the objects, sets object regions that divide the drawing area of the objects of each attribute. For text regions, it identifies the positions of spaces, commas, periods, and the like from the text codes, recognizes character strings from the identified positions, and acquires the position information of each character string. It then sets text regions from the positional relationships of the recognized character strings (vertical position information and the types of the objects to the left and right). For graphics regions, it recognizes shapes from the drawing commands and acquires the position information of each shape. It then sets graphics regions based on the connectivity, proximity, and the like of the recognized shapes.
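As one way to picture how graphics regions could be set from the proximity of recognized shapes, the following Python sketch merges shape bounding boxes that lie closer together than a given gap. The source does not specify the grouping rule; the functions `near`, `union`, and `group_into_regions`, the `gap` parameter, and the box convention are assumptions made for this illustration only.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), right/bottom exclusive


def near(a: Box, b: Box, gap: int) -> bool:
    """True if the boxes overlap or are separated by less than `gap` pixels."""
    return (a[0] - gap < b[2] and b[0] - gap < a[2]
            and a[1] - gap < b[3] and b[1] - gap < a[3])


def union(a: Box, b: Box) -> Box:
    """Smallest box that contains both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def group_into_regions(boxes: List[Box], gap: int) -> List[Box]:
    """Merge nearby shape boxes repeatedly until no two regions are near each other."""
    regions = list(boxes)
    changed = True
    while changed:
        changed = False
        for i in range(len(regions)):
            for j in range(i + 1, len(regions)):
                if near(regions[i], regions[j], gap):
                    regions[i] = union(regions[i], regions[j])
                    del regions[j]
                    changed = True
                    break
            if changed:
                break  # restart the scan, since a merged box may reach new neighbors
    return regions
```

Restarting the scan after each merge keeps the grouping transitive: a region that grows by absorbing one shape is rechecked against all remaining shapes.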

The rasterization processing unit 21d uses a RIP (Raster Image Processor) to perform known rasterization processing on the structured data generated by the vectorization processing unit 21b, reconverting it into image data (second image data).

The comparison unit 21e compares the input image data (first image data) with the reconverted image data (second image data) and extracts difference portions. The image data is compared pixel by pixel for each object region set by the analysis unit 21c; if there is a portion in which the number of differing pixels (or the ratio of the number of differing pixels to the total number of pixels in the object region) exceeds a predetermined threshold, that portion is extracted as a difference portion. The threshold may be set to different values depending on the attribute of the object. As necessary, the comparison unit 21e also displays the result of comparing the input image data with the reconverted image data (an image that highlights the difference portions) on the display unit 27 and accepts an operation selecting whether to perform the structured data update described below on each difference portion.
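The per-region comparison can be sketched as follows, using the ratio variant of the test with a separate threshold per object attribute. This is an illustrative sketch under stated assumptions: the names `difference_ratio` and `extract_difference_regions`, the `(attribute, box)` region representation, and the threshold values used in any call are hypothetical and do not come from the source.

```python
from typing import Dict, List, Tuple

Image = List[List[int]]            # grayscale raster: rows of pixel values
Box = Tuple[int, int, int, int]    # (left, top, right, bottom), right/bottom exclusive
Region = Tuple[str, Box]           # (attribute such as "text" or "graphics", box)


def difference_ratio(first: Image, second: Image, box: Box) -> float:
    """Fraction of pixels inside `box` whose values differ between the two images."""
    left, top, right, bottom = box
    total = (right - left) * (bottom - top)
    if total == 0:
        return 0.0
    differing = sum(
        1
        for y in range(top, bottom)
        for x in range(left, right)
        if first[y][x] != second[y][x]
    )
    return differing / total


def extract_difference_regions(
    first: Image,
    second: Image,
    regions: List[Region],
    thresholds: Dict[str, float],
) -> List[Region]:
    """Return the regions whose difference ratio exceeds the threshold for their attribute."""
    return [
        (attribute, box)
        for attribute, box in regions
        if difference_ratio(first, second, box) > thresholds[attribute]
    ]
```

Keeping the thresholds in a dictionary keyed by attribute mirrors the remark that the threshold may differ by object attribute; a count-based test would only change the return value of `difference_ratio`.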

The data update unit 21f determines whether a difference portion has been extracted. If so, it specifies, based on the object information, the object region to which the object arranged in the difference portion belongs, acquires the image information corresponding to the specified object region from the input image, updates the structured data using the acquired image information, and outputs the updated structured data (for example, by saving it in the storage unit 25). Specifically, it replaces the specified object region of the structured data with the corresponding part of the input image, or creates instruction data for overwriting the specified object region of the structured data with the corresponding part of the input image. As necessary, the data update unit 21f also displays the result of updating the specified object region with the input image (a display image of the updated structured data as shown by the document display application) on the display unit 27 and accepts an operation selecting whether to adopt the update to the structured data.
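The replacement variant of the update can be sketched as follows. The structured data is modeled here as a flat list of objects, which is a simplification assumed for illustration; real PDF, ODF, or OOXML data is considerably richer. The class `PageObject` and the functions `inside`, `crop`, and `replace_region` are hypothetical names, not part of the source.

```python
from dataclasses import dataclass
from typing import Any, List, Tuple

Image = List[List[int]]            # grayscale raster: rows of pixel values
Box = Tuple[int, int, int, int]    # (left, top, right, bottom), right/bottom exclusive


@dataclass
class PageObject:
    attribute: str   # "text", "graphics", or "image"
    box: Box         # drawing position on the page
    content: Any     # text code, drawing command, or pixel data


def inside(inner: Box, outer: Box) -> bool:
    """True if `inner` lies entirely within `outer`."""
    return (outer[0] <= inner[0] and outer[1] <= inner[1]
            and inner[2] <= outer[2] and inner[3] <= outer[3])


def crop(image: Image, box: Box) -> Image:
    """Copy the pixels of `image` that lie inside `box`."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def replace_region(
    objects: List[PageObject], region: Box, first_image: Image
) -> List[PageObject]:
    """Drop every object inside `region` and add one image object cut from the input."""
    kept = [obj for obj in objects if not inside(obj.box, region)]
    patch = PageObject("image", region, crop(first_image, region))
    return kept + [patch]
```

Objects outside the region, such as the text object in a typical call, are left as they are, so they stay searchable and editable. The overwrite variant mentioned above would instead keep the original objects and append an instruction to draw the cropped image on top of them.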

The data acquisition unit 21a, the vectorization processing unit 21b, the analysis unit 21c, the rasterization processing unit 21d, the comparison unit 21e, and the data update unit 21f may be configured as hardware. Alternatively, they may be implemented as an image conversion program that causes the control unit 21 to function as the data acquisition unit 21a, the vectorization processing unit 21b, the analysis unit 21c, the rasterization processing unit 21d, the comparison unit 21e, and the data update unit 21f, with the CPU 22 executing the image conversion program.

The storage unit 25 comprises an HDD (Hard Disk Drive), an SSD (Solid State Drive), or the like, and stores programs with which the CPU 22 controls each unit, information on the processing functions of the device itself, the input image data, the structured data converted by the vectorization processing unit 21b, the image data reconverted from the structured data by the rasterization processing unit 21d, the comparison results of the comparison unit 21e, the structured data updated by the data update unit 21f, and so on.

The network I/F unit 26 is a NIC (Network Interface Card), a modem, or the like. It connects the image conversion apparatus 20 to the image forming apparatus 30 and acquires image data from the image forming apparatus 30.

The display unit 27 is a liquid crystal display (LCD), an organic EL (electroluminescence) display, or the like. It displays the comparison result of the comparison unit 21e (an image that clearly shows the difference portion), the update result of the data update unit 21f (a display image of the updated structured data as rendered by a document display application), and the like.

The operation unit 28 is a mouse, a keyboard, or the like. It enables selection operations on the comparison result of the comparison unit 21e, selection operations on the update result of the data update unit 21f, and the like.

[Image forming apparatus]
The image forming apparatus 30 is an MFP or the like and, as shown in FIG. 8, includes a control unit 31, a storage unit 35, a network I/F unit 36, a display operation unit 37, an image reading unit 38, a print processing unit 39, and the like.

The control unit 31 includes a CPU 32 and memories such as a ROM 33 and a RAM 34. The CPU 32 loads a control program stored in the ROM 33 or the storage unit 35 into the RAM 34 and executes it, thereby controlling the operation of the entire image forming apparatus 30.

The storage unit 35 is an HDD, an SSD, or the like. It stores the programs with which the CPU 32 controls each unit, information on the processing functions of the apparatus itself, the image data read by the image reading unit 38, and the like.

The network I/F unit 36 is a NIC, a modem, or the like. It connects the image forming apparatus 30 to the image conversion apparatus 20 and transmits image data and the like to the image conversion apparatus 20.

The display operation unit 37 is a touch panel or the like in which a pressure-sensitive operation unit (touch sensor), with transparent electrodes arranged in a grid, is provided on a display unit. It displays various screens related to print processing and enables various print-related operations.

The image reading unit 38 optically reads image data from a document placed on a document table. It includes a light source that scans the document, an image sensor such as a CCD (Charge Coupled Device) or CMOS (Complementary Metal Oxide Semiconductor) sensor that converts the light reflected by the document into an electrical signal, an A/D converter that A/D-converts the electrical signal output from the image sensor, and the like.

The print processing unit 39 is a print engine that executes print processing. Specifically, an exposure device irradiates a photosensitive drum charged by a charging device with light corresponding to the image to form an electrostatic latent image, a developing device develops the latent image by attaching charged toner to it, the toner image is primarily transferred to a transfer belt and secondarily transferred from the transfer belt to paper, and a fixing device then fixes the toner image on the paper.

[Image reading device]
The image reading device 40 is a scanner or the like that has an image reading unit. Like the image reading unit 38 of the image forming apparatus 30, this image reading unit optically reads image data from a document placed on a document table. Specifically, it includes a light source that scans the document, an image sensor such as a CCD or CMOS sensor that converts the light reflected by the document into an electrical signal, an A/D converter that A/D-converts the electrical signal, and the like.

[Imaging device]
The imaging device 50 is a digital camera or the like that has an imaging unit. The imaging unit captures an image of a document and optically reads image data. Specifically, it includes an optical system such as a lens and a finder, an image sensor such as a CCD or CMOS sensor, an A/D converter that A/D-converts the electrical signal output from the image sensor, and the like.

FIGS. 1 to 8 show one example of the image conversion system 10 of this embodiment, and its configuration can be changed as appropriate. For example, FIG. 7 shows the case where the control unit 21 of the image conversion apparatus 20 functions as the data acquisition unit 21a, the vectorization processing unit 21b, the analysis unit 21c, the rasterization processing unit 21d, the comparison unit 21e, and the data update unit 21f. Instead, the control unit 31 of the image forming apparatus 30 (or the control unit of the image reading device 40 or the imaging device 50) may function as the data acquisition unit, vectorization processing unit, analysis unit, rasterization processing unit, comparison unit, and data update unit; that is, these units may be provided in the image forming apparatus 30, the image reading device 40, or the imaging device 50.

The following describes an image conversion method that uses the image conversion apparatus 20 of the image conversion system 10 configured as in FIG. 1. The CPU 22 loads the image conversion program stored in the ROM 23 or the storage unit 25 into the RAM 24 and executes it, thereby performing the processing of each step shown in the flowchart of FIG. 9.

First, the control unit 21 (data acquisition unit 21a) acquires an input image via the network I/F unit 26 (S101). The input image is either image data obtained when the image reading unit 38 of the image forming apparatus 30 or the image reading device 40 scans a document, or image data obtained when the imaging device 50 captures an image of a document. FIG. 10 shows an example of the input image 60, which contains a text object, a graphics object, and an image object.

Next, the control unit 21 (vectorization processing unit 21b) performs region discrimination on the input image, classifying it into a text region 60a, a graphics region 60b, and an image region 60c as shown in FIG. 10, and then performs vectorization processing to convert it into vector data (S102). Specifically, a region judged to be a text region 60a is converted into text codes by known OCR processing, and a region judged to be a graphics region 60b is converted into figure drawing commands.

Next, based on the result of the vectorization processing, the control unit 21 (vectorization processing unit 21b) converts the vector data into structured data that can be displayed by a document display application (S103). As described above, the structured data is data described in a format such as PDF, ODF, or OOXML. FIG. 11 shows a display image 61 of the structured data (the result displayed by a document display application). In this example, the two locations enclosed by thick broken lines were erroneously converted during vectorization: the characters "B&W" became "BBW", and an ellipse became a perfect circle.

Next, the control unit 21 (analysis unit 21c) analyzes the structured data and acquires object information (S104). Specifically, it acquires the attribute and drawing position of each object contained in the structured data and, based on the interrelationship of the objects, sets object regions (text regions and graphics regions) that divide the drawing area of the objects of each attribute. An object region defines the range that is replaced with the input image in the data update described later; in other words, it is the unit of update.

FIG. 12 shows an example of setting text regions. Taking the sentences in FIG. 12(a) as an example, OCR processing first converts each character into a text code. Character strings are then recognized from the positions of spaces, commas, periods, and the like. FIG. 12(b) shows the result; in this example, nine character strings are recognized. The text regions to be updated (for example, one text region per line) are then set from the positional relationship of the recognized character strings (their vertical position information and the types of the objects to their left and right).
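The per-line grouping described above can be sketched as follows. This is a minimal illustration, not the implementation used by the analysis unit 21c: the `Word` type, the `group_into_lines` helper, and the rule that strings whose vertical extents overlap belong to the same line are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """One recognized character string with its bounding box (hypothetical type)."""
    text: str
    x0: int
    y0: int
    x1: int
    y1: int


def group_into_lines(words):
    """Group character strings into per-line text regions.

    Assumption: two strings share a line when their vertical extents overlap.
    Returns a list of (bounding_box, [texts in left-to-right order]).
    """
    lines = []  # each entry: [x0, y0, x1, y1, [Word, ...]]
    for w in sorted(words, key=lambda w: (w.y0, w.x0)):
        for line in lines:
            if w.y0 < line[3] and w.y1 > line[1]:  # vertical overlap
                line[0] = min(line[0], w.x0)
                line[1] = min(line[1], w.y0)
                line[2] = max(line[2], w.x1)
                line[3] = max(line[3], w.y1)
                line[4].append(w)
                break
        else:
            lines.append([w.x0, w.y0, w.x1, w.y1, [w]])

    result = []
    for x0, y0, x1, y1, members in lines:
        members.sort(key=lambda w: w.x0)
        result.append(((x0, y0, x1, y1), [w.text for w in members]))
    return result
```

Grouping by paragraph or by sentence, which the description also permits, would only change the merge rule inside the loop.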

FIG. 13 shows an example of setting a graphics region. The figure in FIG. 13(a) is the lower-left object in the display image 61 of the structured data in FIG. 11 and, as shown in FIG. 13(b), it is represented by five figures, figure 1 to figure 5. The graphics region to be updated (here, a graphics region containing the five figures) is set from the connectivity and proximity of the figures.
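One way to merge figures by connectivity and proximity is to join bounding boxes that touch or lie within a small gap of each other. The sketch below does this with a union-find structure; the function names, the bounding-box representation, and the `gap` tolerance are hypothetical, since the description does not specify how connectivity or proximity is measured.

```python
def boxes_are_near(a, b, gap=0):
    """True if boxes a and b, given as (x0, y0, x1, y1), overlap or lie within `gap` pixels."""
    return (a[0] - gap <= b[2] and b[0] - gap <= a[2] and
            a[1] - gap <= b[3] and b[1] - gap <= a[3])


def group_figures(boxes, gap=2):
    """Merge figure bounding boxes into graphics regions.

    Figures that are near each other, directly or through a chain of
    other figures, end up in the same region. Returns the sorted list
    of region bounding boxes.
    """
    parent = list(range(len(boxes)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(boxes)):
        for j in range(i + 1, len(boxes)):
            if boxes_are_near(boxes[i], boxes[j], gap):
                parent[find(j)] = find(i)

    regions = {}
    for i, b in enumerate(boxes):
        r = regions.setdefault(find(i), list(b))
        r[0], r[1] = min(r[0], b[0]), min(r[1], b[1])
        r[2], r[3] = max(r[2], b[2]), max(r[3], b[3])
    return sorted(tuple(r) for r in regions.values())
```

With this rule, the five figures of FIG. 13(b) would form one region as long as each is within `gap` pixels of at least one other figure in the group.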

FIG. 14 shows the display image 61 of the structured data in FIG. 11 classified into object regions. In this example, it is classified into 12 text regions (Text1 to Text12), one image region (Image1), and two graphics regions (Graphics1 and Graphics2). The classification in FIG. 14 is only an example and can be changed as appropriate. For example, FIG. 14 classifies text objects line by line, but they may instead be classified paragraph by paragraph, so that Text1 to Text7 and Text8 to Text12 each form a single text region, or the text up to each period may form a single text region.

Next, the control unit 21 (rasterization processing unit 21d) rasterizes the structured data into an image, reconverting it into image data (second image data) (S105). FIG. 15 shows the rasterized result 62 of the structured data. Its appearance is essentially the same as the display image 61 of the structured data in FIG. 11 (the result displayed by a document display application).

Next, the control unit 21 (comparison unit 21e) compares the input image 60 of FIG. 10 (first image data) with the rasterized result 62 of the structured data in FIG. 15 (second image data) and extracts difference portions (S106). This comparison of image data is performed for each object region shown in FIG. 14.

FIG. 16 shows a comparison result focusing on figure 1 of FIG. 13: comparing the input image 60 of FIG. 10 with the rasterized result 62 of FIG. 15 yields the overlay shown in FIG. 16. FIG. 17 shows the comparison of FIG. 16 at the pixel level. In this example, the squares drawn with thick solid lines are the pixels that make up figure 1 (the perfect circle) in FIG. 15, and the hatched portion is the pixels that make up the horizontal curve of figure 1 (the ellipse) in FIG. 10. In the example of FIG. 17, the perfect circle consists of 100 pixels. If pixels that intersect the horizontal curve are counted as matching and pixels that do not are counted as mismatching, 13 pixels match and 87 do not, so the mismatch rate for the pixels that make up the perfect circle is 87%. If the threshold for the ratio of mismatching pixels to all pixels is 20%, the ratio for this region exceeds the threshold and the region is judged to be a difference portion. Although the ratio of mismatching pixels to all pixels is compared with a threshold here, the number of mismatching pixels may be compared with a threshold instead. These thresholds can be set individually according to the attribute of the object.
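The threshold test in this example can be written out as a short sketch. The function names and the binary-bitmap representation are assumptions for illustration; the 20% default is the threshold used in the example above. Following FIG. 17, the denominator is taken to be the pixels drawn in the rasterized result.

```python
def mismatch_ratio(rendered, original):
    """Fraction of pixels drawn in `rendered` that are not drawn in `original`.

    Both arguments are equally sized 2D lists of 0/1 values (1 = drawn pixel).
    `rendered` is the rasterized structured data; `original` is the input image.
    """
    drawn = mismatched = 0
    for row_r, row_o in zip(rendered, original):
        for r, o in zip(row_r, row_o):
            if r:
                drawn += 1
                if not o:
                    mismatched += 1
    return mismatched / drawn if drawn else 0.0


def is_difference(rendered, original, threshold=0.20):
    """True if the region's mismatch ratio exceeds the threshold."""
    return mismatch_ratio(rendered, original) > threshold
```

For the circle of FIG. 17 (100 drawn pixels, 13 matching), the ratio is 87/100 = 0.87, which exceeds 0.20. A different `threshold` could be passed per object attribute, as the description allows.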

Next, the control unit 21 (data update unit 21f) determines whether a difference portion has been extracted (S107). If no difference portion has been extracted, the control unit 21 (data update unit 21f) outputs the structured data as is (for example, by saving it in the storage unit 25) (S111). If a difference portion has been extracted, the control unit 21 (data update unit 21f) identifies the object region to which the object located in the difference portion belongs (S108). FIG. 18 illustrates this identification process: FIG. 18(a) shows the extracted difference portions, and FIG. 18(b) shows the positions of the objects located in them. The object region to which each object belongs is identified from the positions in FIG. 18(b). FIG. 18(c) shows the identified object regions (Text9 and Graphics1) with hatching.
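Step S108 amounts to mapping each difference portion back to the object region that covers it. A minimal sketch, assuming that regions and difference portions are both axis-aligned boxes `(x0, y0, x1, y1)` and that a region is selected when it intersects any difference portion (the names and the intersection rule are hypothetical):

```python
def intersects(a, b):
    """True if boxes a and b, given as (x0, y0, x1, y1), share any area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def regions_to_update(object_regions, difference_boxes):
    """Return the names of object regions that contain a difference portion.

    `object_regions` maps a region name (e.g. "Text9") to its bounding box.
    """
    return [name for name, box in object_regions.items()
            if any(intersects(box, d) for d in difference_boxes)]
```

An empty result corresponds to the "no difference portion" branch of S107, in which the structured data is output unchanged.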

Next, the control unit 21 (data update unit 21f) acquires the image information corresponding to the identified object regions from the input image (S109) and updates the structured data using the acquired image information (S110). FIG. 19 shows an example of a display image 63 of the updated structured data (the result displayed by a document display application), in which the identified object regions (Text9 and Graphics1) have been replaced with image information from the input image. To make the updated portions of the structured data easy to see, the portions replaced with image information from the input image are highlighted with thick lines. The control unit 21 (data update unit 21f) then outputs the updated structured data (for example, by saving it in the storage unit 25) (S111).
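Steps S109 and S110 can be illustrated with a toy model of structured data. The representation below (a list of dictionaries tagged with a `"region"` key, and an image stored as a 2D list) is purely an assumption for the example; real PDF, ODF, or OOXML data would need a format-specific library.

```python
def crop(image, box):
    """Cut the rectangle (x0, y0, x1, y1) out of a 2D list of pixel values."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def update_structured_data(objects, regions, names, input_image):
    """Replace every object in the named regions with one image object per region.

    `objects` is a list of dicts that each carry a "region" key, `regions`
    maps region names to bounding boxes, and `names` lists the regions
    identified in S108. The input list is not modified.
    """
    updated = [o for o in objects if o["region"] not in names]
    for name in names:
        box = regions[name]
        updated.append({
            "region": name,
            "type": "image",
            "box": box,
            "pixels": crop(input_image, box),  # S109: image information from the input image
        })
    return updated
```

Objects outside the named regions keep their text or drawing commands, which is what preserves searchability and editability for the rest of the document.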

In this way, the update covers not only the portion that was erroneously converted when the input image data was vectorized but the entire object region to which the object located in that portion belongs, so that the related region is unified into objects of the same attribute. This makes it possible to generate structured data that ensures reproducibility of the original document, maintains the searchability and re-editability of the document, and suppresses deterioration of its visibility.
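The overall flow of FIG. 9 (S101 to S111) can be summarized as the following skeleton. It is a sketch only: the function name `convert_image` and the injected callables are hypothetical stand-ins for the processing of units 21b to 21f, not an API defined in this document.

```python
def convert_image(input_image, vectorize, analyze, rasterize,
                  compare, identify, update):
    """Run the conversion flow on an already acquired input image (S101)."""
    structured = vectorize(input_image)                        # S102-S103
    object_info = analyze(structured)                          # S104
    rendered = rasterize(structured)                           # S105
    differences = compare(input_image, rendered, object_info)  # S106
    if not differences:                                        # S107: nothing to fix
        return structured                                      # S111
    targets = identify(differences, object_info)               # S108
    return update(structured, targets, input_image)            # S109-S111
```

Passing the stages in as parameters keeps the skeleton independent of any particular OCR engine, renderer, or document format.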

In the flow above, the image conversion program automatically replaces object regions of the structured data with the input image. Alternatively, the control unit 21 (comparison unit 21e) may display the comparison result (an image that clearly shows the difference portions) on the display unit 27 and accept a selection operation in which the user chooses whether to update the structured data for each difference portion. Likewise, the control unit 21 (data update unit 21f) may display the update result (a display image of the updated structured data as rendered by a document display application) on the display unit 27 and accept a selection operation in which the user chooses whether to adopt the update of the structured data.

In identifying the object region to be updated, if a character string contained in an image region has been judged to be a text region, the entire image region including that text region can be replaced with the input image. FIG. 20 shows an example of an input image whose image region contains a text region (the character string "bizhub PRESS C8000"), and FIG. 21 shows the structured data obtained by vectorizing this input image, classified into object regions. In this example, the areas of Text8 and Text9 (text regions) are contained in the area of Image1 (image region), so if a difference portion is extracted from Text8 or Text9, the structured data can be updated by replacing Image1 in the structured data.
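The containment rule of FIG. 21 can be sketched as a small lookup. The helper names and the `kinds` mapping from region name to attribute are assumptions made for the example.

```python
def contains(outer, inner):
    """True if box `inner` lies entirely inside box `outer` (both x0, y0, x1, y1)."""
    return (outer[0] <= inner[0] and outer[1] <= inner[1] and
            outer[2] >= inner[2] and outer[3] >= inner[3])


def promote_to_enclosing_image(name, regions, kinds):
    """Return the enclosing image region for a text region, if there is one.

    `regions` maps region names to boxes and `kinds` maps region names to
    "text", "graphics", or "image". Any region that is not a text region
    inside an image region is returned unchanged.
    """
    if kinds[name] != "text":
        return name
    for other, box in regions.items():
        if kinds[other] == "image" and contains(box, regions[name]):
            return other
    return name
```

Applied to FIG. 21, a difference found in Text8 or Text9 would resolve to Image1, so the whole image region is replaced in one piece.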

The present invention is not limited to the embodiment described above, and its configuration and control can be changed as appropriate without departing from the spirit of the invention.

For example, the embodiment above was described using a document containing a text object, a graphics object, and an image object, and a document containing a text object and an image object. The image conversion method of the present invention can be applied in the same way to any document that contains at least a text object or a graphics object.

In addition, the embodiment above showed the case where the updated structured data is saved in the storage unit 25 of the image conversion apparatus 20 or the like, but the updated structured data may instead be saved in an external storage device connected to the image conversion apparatus 20 (such as a cloud server).

The present invention is applicable to an image conversion program that generates structured data from image data, a recording medium on which the image conversion program is recorded, an image conversion apparatus on which the image conversion program runs, and an image conversion method in an image conversion system that includes the image conversion apparatus.

DESCRIPTION OF SYMBOLS
10 Image conversion system
20 Image conversion apparatus
21 Control unit
21a Data acquisition unit
21b Vectorization processing unit
21c Analysis unit
21d Rasterization processing unit
21e Comparison unit
21f Data update unit
22 CPU
23 ROM
24 RAM
25 Storage unit
26 Network I/F unit
27 Display unit
28 Operation unit
30 Image forming apparatus
31 Control unit
32 CPU
33 ROM
34 RAM
35 Storage unit
36 Network I/F unit
37 Display operation unit
38 Image reading unit
39 Print processing unit
40 Image reading device
50 Imaging device
60, 70 Input image
60a Text region
60b Graphics region
60c Image region
61 Display image of structured data
62 Rasterization result of structured data
63 Display image of structured data after update

Claims

An image conversion program that operates on an apparatus capable of executing vectorization processing and rasterization processing,
The program causing the apparatus to execute:
A first process for acquiring first image data obtained by causing an image reading unit to read a document or first image data obtained by causing an imaging unit to capture an image of a document;
A second process for performing the vectorization processing on the first image data and converting it into structured data;
A third process for analyzing the structured data and obtaining object information;
A fourth process for performing the rasterization processing on the structured data and reconverting it into second image data;
A fifth process for comparing the first image data with the second image data and extracting a difference portion;
A sixth process for identifying, based on the object information, a predetermined range of the object region to which the object arranged in the difference portion belongs;
A seventh process for acquiring image information corresponding to the object region from the first image data; and
An eighth process for updating the structured data using the acquired image information and outputting the updated structured data.

In the third process, the attribute and drawing position of each object included in the structured data, and information on the object regions obtained by dividing the drawing area of the objects of each attribute, are acquired.
The image conversion program according to claim 1.

In the third process, the object regions are set based on the interrelationship of the objects included in the structured data.
The image conversion program according to claim 2.

In the fifth process, for each object region, a portion where the number or ratio of differing pixels exceeds a predetermined threshold is extracted.
The image conversion program according to claim 2 or 3.

In the fifth process, the threshold is set according to the attribute of the object.
The image conversion program according to claim 4.

In the sixth process, when the difference portion is a text region and the text region is contained in an image region, the image region is identified as the object region.
The image conversion program according to any one of claims 2 to 5.

In the fifth process, an image clearly showing the difference portion is displayed on a display unit, and a selection operation as to whether to update the structured data is accepted.
The image conversion program according to any one of claims 1 to 6.

In the eighth process, a display image of the updated structured data is displayed on the display unit, and a selection operation as to whether to adopt the update of the structured data is accepted.
The image conversion program according to any one of claims 1 to 7.

The structured data is data described in PDF (Portable Document Format), ODF (Open Document Format), or OOXML (Office Open XML) format.
The image conversion program according to any one of claims 1 to 8.

An image conversion apparatus comprising:
An image reading unit or an imaging unit;
A data acquisition unit that acquires first image data obtained by the image reading unit reading a document or first image data obtained by the imaging unit capturing an image of a document;
A vectorization processing unit that performs vectorization processing on the first image data and converts it into structured data;
An analysis unit that analyzes the structured data and obtains object information;
A rasterization processing unit that performs rasterization processing on the structured data and reconverts it into second image data;
A comparison unit that compares the first image data with the second image data and extracts a difference portion; and
A data update unit that identifies, based on the object information, a predetermined range of the object region to which the object arranged in the difference portion belongs, acquires image information corresponding to the object region from the first image data, updates the structured data using the acquired image information, and outputs the updated structured data.

The analysis unit acquires the attribute and drawing position of each object included in the structured data, and information on the object regions obtained by dividing the drawing area of the objects of each attribute.
The image conversion apparatus according to claim 10.

The analysis unit sets the object regions based on the interrelationship of the objects included in the structured data.
The image conversion apparatus according to claim 11.

The comparison unit extracts, for each object region, a portion where the number or ratio of differing pixels exceeds a predetermined threshold.
The image conversion apparatus according to claim 11 or 12.

The comparison unit sets the threshold according to the attribute of the object.
The image conversion apparatus according to claim 13.

The data update unit identifies the image region as the object region when the difference portion is a text region and the text region is contained in an image region.
The image conversion apparatus according to claim 11.

The structured data is data described in PDF (Portable Document Format), ODF (Open Document Format), or OOXML (Office Open XML) format.
The image conversion apparatus according to claim 10.

An image conversion method in a system including a control device capable of executing vectorization processing and rasterization processing and a device including an image reading unit or an imaging unit,
The control device executing:
A first process for acquiring first image data obtained by causing the image reading unit to read a document or first image data obtained by causing the imaging unit to capture an image of the document;
A second process for performing the vectorization processing on the first image data and converting it into structured data;
A third process for analyzing the structured data and obtaining object information;
A fourth process for performing the rasterization processing on the structured data and reconverting it into second image data;
A fifth process for comparing the first image data with the second image data and extracting a difference portion;
A sixth process for identifying, based on the object information, a predetermined range of the object region to which the object arranged in the difference portion belongs;
A seventh process for acquiring image information corresponding to the object region from the first image data; and
An eighth process for updating the structured data using the acquired image information and outputting the updated structured data.

In the third process, the attribute and drawing position of each object included in the structured data, and information on the object regions obtained by dividing the drawing area of the objects of each attribute, are acquired.
The image conversion method according to claim 17.

In the third process, the object regions are set based on the interrelationship of the objects included in the structured data.
The image conversion method according to claim 18.

In the fifth process, for each object region, a portion where the number or ratio of differing pixels exceeds a predetermined threshold is extracted.
The image conversion method according to claim 18 or 19.

In the fifth process, the threshold is set according to the attribute of the object.
The image conversion method according to claim 20.

In the sixth process, when the difference portion is a text region and the text region is contained in an image region, the image region is identified as the object region.
The image conversion method according to any one of claims 18 to 21.

The structured data is data described in PDF (Portable Document Format), ODF (Open Document Format), or OOXML (Office Open XML) format.
The image conversion method according to any one of claims 17 to 22.