JP2010009146A

JP2010009146A - Document processing method and document processor

Info

Publication number: JP2010009146A
Application number: JP2008165069A
Authority: JP
Inventors: Taisuke Ishiguro; 泰輔石黒
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2008-06-24
Filing date: 2008-06-24
Publication date: 2010-01-14

Abstract

<P>PROBLEM TO BE SOLVED: To restore a state before the change of layout. <P>SOLUTION: This document processing method includes: a document input process for inputting a document; a document analyzing process for analyzing a document input by the document input process, and for acquiring document configuring blocks configuring a document and the layout information of the document configuring blocks; a layout changing process for changing the layout of the document configuring blocks acquired by the document analyzing process; and an embedding process for embedding the layout information acquired by the document analyzing process in the layout-changed document configuring blocks. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明はコンテンツのレイアウトを決定するためのレイアウト技術に関する。 The present invention relates to a layout technique for determining a layout of content.

スキャンした文書から画像や文字列などの領域（以下、文書構成データ）を抽出する技術が知られている。抽出された文書構成データは、文書のレイアウト変更などに応用されている（例えば、特許文献１）。
特開２００５−３５２６９６ A technique for extracting an area (hereinafter, document configuration data) such as an image or a character string from a scanned document is known. The extracted document configuration data is applied to document layout change (for example, Patent Document 1).
JP 2005-352696 A

しかしながら、レイアウト変更を行うとレイアウト変更前の情報は破棄されてしまう。したがって、レイアウト変更後の文書からレイアウト変更前の状態を復元することが困難であるという問題があった。 However, if the layout is changed, the information before the layout change is discarded. Therefore, there is a problem that it is difficult to restore the state before the layout change from the document after the layout change.

そこで、レイアウト変更後の文書から変更前の状態を復元可能とすることを目的とする。 Therefore, it is an object to make it possible to restore the state before the change from the document after the layout change.

上記の課題を解決するために、本発明に係る文書処理方法は、文書を入力する文書入力工程と、前記文書入力工程により入力された文書を解析して、文書を構成する文書構成ブロックと前記文書構成ブロックのレイアウト情報とを取得する文書解析工程と、前記文書解析工程により取得された文書構成ブロックのレイアウトを変更するレイアウト変更工程と、前記レイアウト変更された文書構成ブロックに対して、前記文書解析工程で取得したレイアウト情報を埋め込む埋め込み工程とを備える。 In order to solve the above problems, a document processing method according to the present invention includes a document input step for inputting a document, a document composition block for analyzing the document input by the document input step, A document analysis step for acquiring document configuration block layout information; a layout change step for changing the layout of the document configuration block acquired by the document analysis step; and the document for the layout-changed document configuration block And an embedding process for embedding the layout information acquired in the analysis process.

本発明によれば、レイアウトを変更した後の文書から変更前の状態を復元することができる。 According to the present invention, the state before the change can be restored from the document after the layout is changed.

＜実施形態１＞
以下、図面を参照して本発明の好適な実施形態を詳細に説明する。 <Embodiment 1>
Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the drawings.

図１は、本発明に係る情報処理装置より構成されるネットワークシステムを示す図である。ネットワークシステムは各種データの伝送媒体となるネットワーク１０２上に複数の情報処理装置が接続されている。ネットワーク１０２は例えばＥｔｈｅｒｎｅｔ（登録商標）のようなＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）あるいはインターネットのような広域情報通信網であってもよい。各情報処理装置１乃至情報処理装置５は図２において後述する通信部２０８を介して接続されている。 FIG. 1 is a diagram showing a network system including information processing apparatuses according to the present invention. In the network system, a plurality of information processing apparatuses are connected to a network 102 serving as a transmission medium for various data. The network 102 may be, for example, a LAN (Local Area Network) such as Ethernet (registered trademark) or a wide area information communication network such as the Internet. Each information processing apparatus 1 to information processing apparatus 5 is connected via a communication unit 208 described later in FIG.

図２は、本発明に係る情報処理装置（文書処理装置）のハードウェア構成を示す図である。図２において２０１は、マイクロプロセッサＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）であり、各処理のための演算、論理判断等を行い、バス２０９を介してそれらのバスに接続された各構成要素を制御する。 FIG. 2 is a diagram showing a hardware configuration of the information processing apparatus (document processing apparatus) according to the present invention. In FIG. 2, reference numeral 201 denotes a microprocessor CPU (Central Processing Unit), which performs operations and logical determinations for each process, and controls each component connected to those buses via a bus 209.

２０２は読み出し専用の固定メモリＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）であり、実行される処理プログラム等の制御プログラムコードを記憶する。 Reference numeral 202 denotes a read-only fixed memory ROM (Read Only Memory), which stores control program codes such as processing programs to be executed.

２０３は書き込み可能なＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）であり、各構成要素からの各種データの一時記憶に用いられる。 Reference numeral 203 denotes a writable RAM (Random Access Memory), which is used for temporary storage of various data from each component.

２０４は入力部であり、情報（データ）の入力に用いられる。 An input unit 204 is used to input information (data).

２０５は陰極線管ＣＲＴ（Ｃａｔｈｏｄ−ＲａｙＴｕｂｅ）や液晶パネル等の表示部であり、その表示部におけるドット構成の表示パターンおよびカーソルの表示を表示コントローラ２０６で制御する。 Reference numeral 205 denotes a display unit such as a cathode ray tube CRT (Cathod-Ray Tube) or a liquid crystal panel. The display controller 206 controls the display pattern of the dot configuration and the display of the cursor in the display unit.

２０７は記憶部であり、種々の情報が格納される。また、これらのデータおよびプログラムを格納する記憶媒体としては、ＲＯＭ、フロッピー（登録商標）ディスク、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、メモリカード、光磁気ディスクなどを用いることができる。 A storage unit 207 stores various information. As a storage medium for storing these data and programs, ROM, floppy (registered trademark) disk, CD-ROM, DVD-ROM, memory card, magneto-optical disk, and the like can be used.

２０８は通信部であり、Ｅｔｈｅｒｎｅｔ（登録商標）などのネットワークに接続し、ネットワークシステムにおいて複数の情報処理装置同士を接続する役割を担う。 A communication unit 208 is connected to a network such as Ethernet (registered trademark) and plays a role of connecting a plurality of information processing apparatuses in a network system.

２１０はスキャナなどの画像読取部であり、原稿やフィルムなどを読み取って、画像信号を取得する。 An image reading unit 210 such as a scanner reads an original or a film and acquires an image signal.

係る各構成要素からなる情報処理装置においては、入力部２０４からの各種の入力および通信部２０８から提供されるネットワーク経由の各種入力に応じて作動するものである。入力部２０４からの入力および通信部２０８からの入力が供給されると、まず、インタラプト信号がＣＰＵ２０１に送られる。そして、そのＣＰＵ２０１が記憶部２０７内に記憶してある各種の制御信号を読み出し、それらの制御信号に従って、各種の制御が行われる。 The information processing apparatus including such constituent elements operates in response to various inputs from the input unit 204 and various inputs via the network provided from the communication unit 208. When an input from the input unit 204 and an input from the communication unit 208 are supplied, an interrupt signal is first sent to the CPU 201. Then, the CPU 201 reads out various control signals stored in the storage unit 207, and various controls are performed according to the control signals.

次に、本実施形態における処理全体の概要を図３により説明する。 Next, the outline of the entire processing in this embodiment will be described with reference to FIG.

図３は本発明の実施形態における処理全体の概要を示すフローチャートである。 FIG. 3 is a flowchart showing an outline of the entire processing according to the embodiment of the present invention.

ステップＳ３００では、文書入力工程であり、入力部２０４により紙原稿をラスタ状に走査して読み取り、画像信号を得る。たとえば、スキャナなどの画像読取部２１０で原稿を読み取り、６００ＤＰＩ−８ビットの画像信号を得る。この画像信号は、ＲＡＭ２０３もしくは記憶部２０７に画像データとして記憶される。 In step S300, a document input process is performed. The input unit 204 scans and reads a paper document in a raster shape to obtain an image signal. For example, a document is read by an image reading unit 210 such as a scanner to obtain a 600 DPI-8 bit image signal. This image signal is stored as image data in the RAM 203 or the storage unit 207.

ステップＳ３０１では、入力原稿に対するコピー処理を行うか判定を行う。コピー処理の指示は、ポインティングデバイスなどの入力部２０４を用いてユーザにより明示的に行われる。ユーザによる指示がコピーである場合、ステップＳ３１１へ移行する。ユーザによりコピーを行わないと指示があった場合は、ステップＳ３０３へ移行する。 In step S301, it is determined whether or not to perform copy processing on the input document. The copy processing instruction is explicitly given by the user using the input unit 204 such as a pointing device. When the instruction by the user is copying, the process proceeds to step S311. If the user instructs not to copy, the process proceeds to step S303.

ステップＳ３０２では、ステップＳ３００で取得した画像データを解析し、文書を構成するデータとレイアウト情報を取得する。具体的には、文字／線画部分とハーフトーン画像部分に領域分割する。文字／線画部分は段落でまとまっているブロック、あるいは線で構成された表、図表に分割処理を行う。一方、ハーフトーン画像部分は、矩形に分離されたブロックの画像部分、背景部分等、ブロック毎に独立したオブジェクトに分割する。 In step S302, the image data acquired in step S300 is analyzed, and data constituting the document and layout information are acquired. Specifically, the area is divided into a character / line drawing part and a halftone image part. The character / line drawing part is divided into blocks organized in paragraphs, or tables and charts composed of lines. On the other hand, the halftone image portion is divided into independent objects for each block, such as an image portion of a block separated into a rectangle and a background portion.

ステップＳ３０３では、入力原稿のレイアウトを変更するか判定する。レイアウト変更処理の指示は、ステップＳ３０１のコピー処理と同様にポインティングデバイスなどの入力部２０４を用いてユーザにより明示的に行われる。ユーザによる指示がレイアウト変更実施である場合には、ステップＳ３０４へ移行する。レイアウト変更を実行しないと指示があった場合は、ステップＳ３０７へ移行する。 In step S303, it is determined whether to change the layout of the input document. The layout change processing instruction is explicitly given by the user using the input unit 204 such as a pointing device, similarly to the copy processing in step S301. If the user instruction is layout change execution, the process proceeds to step S304. If there is an instruction not to execute layout change, the process proceeds to step S307.

ステップＳ３０４では、ステップＳ３０２において抽出した構成データをもとに、どのようなレイアウト変更が可能かユーザに提示し、ユーザに選択させる。具体的には、構成データの種別（文字（列）、画像）とその個数から、当てはめることができるテンプレートを判断し、検索結果のテンプレートをユーザに提示する。ユーザは、提示されたテンプレートから希望するテンプレートを選択する。テンプレートはＲＡＭ２０３もしくは記憶部２０７に記憶されている。 In step S304, based on the configuration data extracted in step S302, the user is presented with what layout changes are possible, and the user is allowed to select. Specifically, a template that can be applied is determined from the type (character (string), image) of the configuration data and the number thereof, and the search result template is presented to the user. The user selects a desired template from the presented templates. The template is stored in the RAM 203 or the storage unit 207.

ステップＳ３０５では、ステップＳ３０４でユーザにより指示されたテンプレートにしたがって、ステップＳ３０２において取得した文書構成データのレイアウトを変更する。 In step S305, the layout of the document configuration data acquired in step S302 is changed according to the template instructed by the user in step S304.

ステップＳ３０６では、各文書構成データに対してステップＳ３０５によりレイアウトが変更される前のレイアウト情報を埋め込む。言い換えると、ステップＳ３０２において取得し、記憶したレイアウト情報を各文書構成データに埋め込む。 In step S306, the layout information before the layout is changed in step S305 is embedded in each document configuration data. In other words, the layout information acquired and stored in step S302 is embedded in each document configuration data.

ステップＳ３０４からステップＳ３０６までのレイアウト変更処理に関する詳細は後述する。 Details regarding the layout change processing from step S304 to step S306 will be described later.

ステップＳ３０７では、入力原稿のレイアウトを復元するか判定する。レイアウト復元処理の指示は、ステップＳ３０１、ステップＳ３０３と同様にポインティングデバイスなどの入力部２０４を用いてユーザにより明示的に行われる。ユーザによる指示がレイアウト復元である場合、ステップＳ３０８へ移行する。レイアウト復元を実行しないと指示があった場合、処理を終了する。 In step S307, it is determined whether to restore the layout of the input document. The layout restoration processing instruction is explicitly given by the user using the input unit 204 such as a pointing device as in steps S301 and S303. When the user instruction is layout restoration, the process proceeds to step S308. If there is an instruction not to execute layout restoration, the process ends.

ステップＳ３０８では、ステップＳ３０２において取得した各文書構成データからレイアウト変更前のレイアウト情報を抽出する。 In step S308, layout information before the layout change is extracted from each document configuration data acquired in step S302.

ステップＳ３０９では、ステップＳ３０８においてレイアウト情報が抽出されたかを判定する。レイアウト情報が抽出できてたと判定された場合、ステップＳ３１０へ移行する。レイアウト情報が抽出されなかった場合は、処理を終了する。 In step S309, it is determined whether layout information has been extracted in step S308. If it is determined that the layout information has been extracted, the process proceeds to step S310. If layout information has not been extracted, the process ends.

ステップＳ３１０では、ステップＳ３０８において抽出したレイアウト情報に基づき各文書構成データの配置を決定し、復元する。 In step S310, the arrangement of each document configuration data is determined based on the layout information extracted in step S308, and is restored.

ステップＳ３０８からステップＳ３１０までのレイアウト復元処理に関する詳細は、後述する。 Details regarding the layout restoration processing from step S308 to step S310 will be described later.

ステップＳ３１１では、各文書構成データの位置情報に基づき印刷処理を行う。単純なコピー処理の場合は、ステップＳ３００において取得した画像データをそのまま印刷する。レイアウト変更、復元処理が行われていた場合は、各構成文書データの位置情報から文書内の配置位置を決定し、当該配置位置に基づき文書画像を生成し、印刷処理を行う。 In step S311, print processing is performed based on the position information of each document configuration data. In the case of simple copy processing, the image data acquired in step S300 is printed as it is. If layout change or restoration processing has been performed, an arrangement position in the document is determined from position information of each component document data, a document image is generated based on the arrangement position, and printing processing is performed.

このように、本システムでは入力原稿を解析し、各文書構成データに対して、オリジナルのレイアウト情報を埋め込み印刷する。したがって、印刷物から得られる情報のみで元の状態へ復元することができる。このため、ネットワーク障害などデータへアクセスできない場合も処理への影響がなくなるという効果がある。 In this way, the system analyzes the input document and embeds and prints the original layout information in each document configuration data. Therefore, it is possible to restore the original state only with information obtained from the printed matter. For this reason, there is an effect that there is no influence on processing even when data cannot be accessed such as a network failure.

以下、各処理の詳細について説明する。 Details of each process will be described below.

まず、図３のステップＳ３０２の文書画像解析処理について説明する。文書画像解析処理とは、画像から意味のあるブロックをかたまりとして認識して、該ブロック各々の属性を判定し、異なる属性を持つブロックに分割する処理である。たとえば、図４（ａ）のラスタ画像に対して文書画像解析処理を行うと、図４（ｂ）のように文字、画像などのブロックとして分割される。 First, the document image analysis process in step S302 of FIG. 3 will be described. The document image analysis process is a process of recognizing a meaningful block from an image as a block, determining the attribute of each block, and dividing the block into blocks having different attributes. For example, when the document image analysis process is performed on the raster image of FIG. 4A, it is divided into blocks such as characters and images as shown in FIG. 4B.

文書画像解析処理は、既存の文書画像解析処理技術を用いて行う。以下、簡単に文書画像解析技術の一例を示す。 The document image analysis processing is performed using an existing document image analysis processing technique. An example of a document image analysis technique will be briefly described below.

入力画像を受け取ると、白黒画像に二値化する。そして、輪郭線追跡を行い黒画素輪郭で囲まれる画素のかたまりを抽出する。一定面積以上の黒画素の場合は、内部にある白画素に対しても輪郭線追跡を行い白画素のかたまりを抽出する。抽出した白画素のかたまりが一定面積以上であれば、さらに黒画素のかたまりを抽出する。当該抽出処理は、抽出されたかたまりが一定面積以上であれば、再帰的に実行する。 When an input image is received, it is binarized into a black and white image. Then, outline tracing is performed to extract a block of pixels surrounded by a black pixel outline. In the case of a black pixel having a certain area or more, contour tracing is performed for white pixels inside to extract a block of white pixels. If the extracted white pixel block is larger than a certain area, a black pixel block is further extracted. The extraction process is recursively executed if the extracted cluster is larger than a certain area.

上記の処理で得られた黒画素のかたまりを大きさおよび形状により様々な属性を持つブロックとして分類する。たとえば、縦横比が１に近いブロックは文字相当の画素のかたまりとし、隣接する文字相当の画素のかたまりが整列されていてグループ化可能な場合は文字列ブロックとする。また、不定形の画素のかたまりが散在する場合は、写真ブロック、それ以外は図面ブロックなどに分類する。 The block of black pixels obtained by the above processing is classified as a block having various attributes according to size and shape. For example, a block whose aspect ratio is close to 1 is a block of pixels corresponding to a character, and a block of pixels corresponding to a character is aligned and can be grouped to be a character string block. In addition, when a block of irregular pixels is scattered, the block is classified into a photograph block, and the others are classified into drawing blocks.

文書画像解析処理では、分類したブロックの位置情報や属性などを文書構成データ情報（レイアウト情報）としてＲＡＭ２０３もしくは記憶部２０７に記憶する。 In the document image analysis process, the position information and attributes of the classified blocks are stored in the RAM 203 or the storage unit 207 as document configuration data information (layout information).

図５を用いて、文書構成データ情報の一例を示す。 An example of document configuration data information is shown using FIG.

図５に示すように、文書構成データ情報は、各構成データを一意に決定するためのＩＤ、構成データの属性を示すデータ属性（１：文字列、２：図画、３：写真）、構成データの位置座標（Ｘ，Ｙ）、構成データの外接矩形の幅Ｗおよび高さＨで構成される。 As shown in FIG. 5, the document configuration data information includes an ID for uniquely determining each configuration data, a data attribute indicating an attribute of the configuration data (1: character string, 2: drawing, 3: photograph), and configuration data. Position coordinates (X, Y), the width W and the height H of the circumscribed rectangle of the configuration data.

構成データの位置座標は、原稿画像の左上を原点（０，０）とした場合の位置座標である。構成データの幅Ｗおよび高さＨは画素数で表現される。 The position coordinates of the configuration data are the position coordinates when the upper left corner of the document image is the origin (0, 0). The width W and the height H of the configuration data are expressed by the number of pixels.

次に、本実施形態の情報処理装置の機能ブロック図を図６に示す。 Next, a functional block diagram of the information processing apparatus of this embodiment is shown in FIG.

図６に示すように、本実施形態の情報処理装置は、入力部６０１、文書解析部６０２、レイアウト変更部６０３、埋め込み情報処理部６０４、出力部６０５から構成される。 As illustrated in FIG. 6, the information processing apparatus according to the present exemplary embodiment includes an input unit 601, a document analysis unit 602, a layout change unit 603, an embedded information processing unit 604, and an output unit 605.

入力部６０１は、原稿の入力を受け付けるスキャナやユーザの指示を受け付けるキーボード、ポインティングデバイスなどである。 The input unit 601 is a scanner that accepts input of a document, a keyboard that accepts user instructions, a pointing device, or the like.

文書解析部６０２は、入力部６０１に入力された入力原稿画像を解析し、文書構成データを抽出する。 The document analysis unit 602 analyzes the input document image input to the input unit 601 and extracts document configuration data.

レイアウト変更部６０３は、文書解析部６０２により抽出された文書構成データを任意のレイアウトへ変更する。 The layout change unit 603 changes the document configuration data extracted by the document analysis unit 602 to an arbitrary layout.

埋め込み情報処理部６０４は、文書解析部６０２により抽出された文書構成データに対して、情報の埋め込み、抽出処理を行う。 The embedded information processing unit 604 performs information embedding and extraction processing on the document configuration data extracted by the document analysis unit 602.

出力部６０５は、レイアウト変更部６０３によりレイアウト変更された文書の出力やユーザインタフェースを表示するプリンタやディスプレイである。 An output unit 605 is a printer or a display that displays an output of a document whose layout has been changed by the layout changing unit 603 and a user interface.

次に、図３のステップＳ３０４からステップＳ３０６までのレイアウト変更処理の詳細について、図８を用いて説明する。 Next, details of the layout change processing from step S304 to step S306 in FIG. 3 will be described with reference to FIG.

図８はレイアウト変更処理の詳細を示すフローチャートである。 FIG. 8 is a flowchart showing details of the layout change process.

ステップＳ８００では、テンプレート表示処理を行う。テンプレート表示処理では、文書画像解析処理で取得された構成データ数に基づきレイアウト変更可能なテンプレートを表示部２０５に表示する。テンプレートは、記憶部２０７に格納されており、各テンプレートには対応可能なデータ数が関連付けられている。 In step S800, template display processing is performed. In the template display process, a template whose layout can be changed based on the number of configuration data acquired in the document image analysis process is displayed on the display unit 205. The templates are stored in the storage unit 207, and the number of data that can be handled is associated with each template.

ここで、入力原稿が、図７（ａ）に示した文書であった場合を説明する。図７（ｂ）は、入力原稿から抽出された文書構成データ（文書構成ブロック）のイメージ図である。矩形で囲まれた部分が文書画像解析処理により抽出された各文書構成データであり、入力原稿は、４つの構成データから構成されている。したがって、４つの構成データに対応するテンプレートが表示部２０５に表示されることとなる（図７（ｃ））。 Here, a case where the input document is the document shown in FIG. 7A will be described. FIG. 7B is an image diagram of the document configuration data (document configuration block) extracted from the input document. A portion surrounded by a rectangle is each piece of document configuration data extracted by the document image analysis process, and the input original is composed of four pieces of configuration data. Therefore, templates corresponding to the four pieces of configuration data are displayed on the display unit 205 (FIG. 7C).

ステップＳ８０１では、テンプレートを選択する。テンプレート選択処理では、ステップＳ８００において提示されたテンプレートをユーザにより選択させる。ユーザによる選択時には、キーボードやポインティングデバイスなど入力部２０４を用いて行う。 In step S801, a template is selected. In the template selection process, the template presented in step S800 is selected by the user. The selection by the user is performed using the input unit 204 such as a keyboard or a pointing device.

ステップＳ８０２では、レイアウト変更処理を行う。レイアウト変更処理では、ステップＳ８０１で選択されたテンプレートに対して、どのように文書構成データを配置するか決定する。ステップＳ８０１で、図７（ｃ）のテンプレート３が選択された場合は、ステップＳ８０２で、選択されたテンプレート３にあわせて文書構成データを配置される（図７（ｄ））。 In step S802, layout change processing is performed. In the layout change process, it is determined how to arrange the document configuration data for the template selected in step S801. If the template 3 in FIG. 7C is selected in step S801, the document configuration data is arranged in accordance with the selected template 3 in step S802 (FIG. 7D).

具体的には、入力原稿から抽出した文書構成データとステップＳ８０１において決定したテンプレートをユーザに提示する。そして、マウスによるドラッグ＆ドロップなどの操作により各文書構成データと当該データを配置したいテンプレート中のブロックを関連付けることで配置を決定する。 Specifically, the document configuration data extracted from the input document and the template determined in step S801 are presented to the user. Then, the arrangement is determined by associating each document configuration data with a block in the template where the data is to be arranged by an operation such as drag and drop with the mouse.

テンプレートに定義される領域が文書構成データを配置するのに不十分な場合、文書構成データの拡大・縮小などの形状変更を行うことでテンプレート中の領域内に文書構成データを配置する。すなわち、文書構成データは、テンプレートに定義される領域に収まるように形状が変更される。 When the area defined in the template is insufficient for arranging the document configuration data, the document configuration data is arranged in the area in the template by changing the shape such as enlargement / reduction of the document configuration data. That is, the shape of the document configuration data is changed so as to fit in the area defined in the template.

ステップＳ８０３では、変更内容判定処理を行う。変更内容判定処理では、ステップＳ８０２におけるレイアウト変更処理の結果、各文書構成データにどのような変更が加えられたのか判定する。 In step S803, a change content determination process is performed. In the change content determination process, it is determined what change has been made to each document configuration data as a result of the layout change process in step S802.

具体的には、文書構成データ毎に、レイアウト前の文書構成データと、レイアウト変更後の文書構成データとを比較する。そして、文書構成データ毎に何も変化がないのか、位置のみに変更があったのか、位置および形状に変更があったのかという３タイプに分類する。分類結果は、ＲＡＭ２０３に一時的に記憶する。記憶した分類結果は、後述するステップＳ８０４０の埋め込み情報生成処理により用いられる。 Specifically, the document configuration data before layout is compared with the document configuration data after layout change for each document configuration data. Then, it is classified into three types: whether there is no change for each document configuration data, whether only the position has changed, or whether the position and shape have changed. The classification result is temporarily stored in the RAM 203. The stored classification result is used by the embedded information generation process in step S8040 described later.

図９を用いて、変更内容種別の一例を説明する。 An example of the change content type will be described with reference to FIG.

図９（ａ）は、レイアウト変更前の状態、すなわち入力原稿の画像を示す。矩形で囲まれた部分が文書画像解析処理により抽出された各文書構成データである。４つの文書構成データ（９０１Ａ、９０１Ｂ、９０１Ｃ、９０１Ｄ）から構成されることがわかる。 FIG. 9A shows the state before the layout change, that is, the image of the input document. A portion surrounded by a rectangle is each piece of document configuration data extracted by the document image analysis process. It can be seen that it is composed of four pieces of document configuration data (901A, 901B, 901C, 901D).

図９（ｂ）は、レイアウト変更内容を示すテンプレートである。テンプレートには、文書構成データを関連付ける矩形領域９０２〜９０５が４つ定義されている。 FIG. 9B is a template showing layout change contents. The template defines four rectangular areas 902 to 905 for associating document configuration data.

図９（ｃ）は、図９（ａ）の入力原稿を図９（ｂ）のテンプレートに基づきレイアウト変更を行った例である。文書構成データＡに注目すると、図９（ａ）の９０１Ａと図９（ｃ）の９０６Ａとでは、位置および形状に変化がないので、何も変更が加えられていないと判定される。次に、文書構成データＢに注目すると、形状に変化はないが、位置が変わっているので、位置のみに変更があったと判定される。文書構成データＣ、Ｄは、位置および形状に変更が加えられているので、位置および形状に変更があったと判定される。 FIG. 9C shows an example in which the layout of the input document in FIG. 9A is changed based on the template in FIG. 9B. Focusing on the document configuration data A, since there is no change in position and shape between 901A in FIG. 9A and 906A in FIG. 9C, it is determined that no change has been made. Next, paying attention to the document configuration data B, the shape is not changed, but the position is changed, so that it is determined that only the position is changed. Since the document configuration data C and D have been changed in position and shape, it is determined that the position and shape have been changed.

ステップＳ８０４では、各文書構成データに埋め込む情報を生成する。埋め込み情報生成処理では、ステップＳ８０３において判定された分類結果に応じて生成する情報を変更する。具体的には、位置および形状に変化がない、すなわち何も変更されてない場合、埋め込み情報を生成しない。位置のみに変更がある場合は、変更前の位置情報のみを埋め込み情報として生成する。位置情報は、文書構成データの外接矩形の左上の座標とする。位置および形状に変更がある場合は、変更前の位置情報と、形状の変更割合を埋め込み情報として生成する。位置情報は、文書構成データの外接矩形の左上の座標であり、変更割合は、変更前の矩形から幅および高さが変更した割合（％）とする。 In step S804, information to be embedded in each document configuration data is generated. In the embedded information generation process, information to be generated is changed according to the classification result determined in step S803. Specifically, when there is no change in position and shape, that is, when nothing has been changed, no embedding information is generated. When only the position is changed, only the position information before the change is generated as the embedded information. The position information is the upper left coordinates of the circumscribed rectangle of the document configuration data. When there is a change in the position and shape, the position information before the change and the change rate of the shape are generated as embedded information. The position information is the upper left coordinates of the circumscribed rectangle of the document configuration data, and the change ratio is a ratio (%) in which the width and height are changed from the rectangle before the change.

このように、レイアウト変更処理による文書構成データの変更種別に応じて生成する埋め込み情報を変えることで、埋め込み情報量を削減できる。 As described above, the amount of embedded information can be reduced by changing the embedded information to be generated according to the change type of the document configuration data by the layout change processing.

生成した埋め込み情報が文書構成データに埋め込むことができないと判断した場合、文書構成データを任意の記憶装置に格納しておき、当該格納場所を埋め込み情報として生成しても良い。文書構成データに十分なデータ量を埋め込めない場合に有効である。 When it is determined that the generated embedded information cannot be embedded in the document configuration data, the document configuration data may be stored in an arbitrary storage device, and the storage location may be generated as embedded information. This is effective when a sufficient amount of data cannot be embedded in the document configuration data.

文書構成データを任意の記憶装置に格納しておくと同時に、文書構成データの格納場所を管理する管理テーブルを別途作成しておき、当該テーブル上を一意に決定可能な情報のみを埋め込み情報としても良い。この場合も、文書構成データに十分なデータ量を埋め込めない場合に有効である。 The document configuration data is stored in an arbitrary storage device, and at the same time, a management table for managing the storage location of the document configuration data is separately created, and only information that can be uniquely determined on the table may be used as embedded information. good. This case is also effective when a sufficient amount of data cannot be embedded in the document configuration data.

文書構成データを任意の記憶装置に格納し、アクセスする手段を埋め込む手法は、文書構成データの形状が変更された場合にも応用できる。具体的には、変更割合に許容範囲を定めておき、許容範囲外の場合は文書構成データを任意の記憶装置に格納する。こうすることで、写真など拡大・縮小により大きく画質が変更され、元の状態に戻すことが難しい場合にも対応することができる。 The technique of storing the document configuration data in an arbitrary storage device and embedding the means for accessing can be applied even when the shape of the document configuration data is changed. Specifically, an allowable range is set for the change ratio, and if it is out of the allowable range, the document configuration data is stored in an arbitrary storage device. In this way, it is possible to cope with a case where the image quality is greatly changed due to enlargement / reduction, such as a photograph, and it is difficult to return to the original state.

ステップＳ８０５では、各文書構成データにステップＳ８０４で生成した情報を埋め込む。情報埋め込み処理では、各文書構成データの種別に応じて電子透かし埋め込み手法を変更することが望ましい。文書構成データの種別と埋め込み手法との対応は、別途埋め込み技術対応テーブルに定めておく（不図示）。たとえば、文書構成データが文字列の場合、文字列に対する埋め込み技術を使用し、図画・写真であれば、図画・写真に対する埋め込み技術を使用するという対応付けを行っておく。 In step S805, the information generated in step S804 is embedded in each document configuration data. In the information embedding process, it is desirable to change the digital watermark embedding method according to the type of each document configuration data. The correspondence between the type of document configuration data and the embedding method is determined separately in an embedding technology correspondence table (not shown). For example, when the document configuration data is a character string, an embedding technique for the character string is used, and for a drawing / photo, the embedding technique for the drawing / photo is used.

以上、説明したように本実施形態によるレイアウト変更処理では、文書構成データの変更内容種別に応じて、埋め込む情報を変更することで、埋め込み情報量を削減できる。 As described above, in the layout change process according to the present embodiment, the amount of embedded information can be reduced by changing the embedded information in accordance with the change content type of the document configuration data.

続いて、図３のステップＳ３０８からステップＳ３１０までのレイアウト復元処理の詳細について、図１０を用いて説明する。 Next, details of the layout restoration processing from step S308 to step S310 in FIG. 3 will be described with reference to FIG.

レイアウト復元処理は、レイアウト変更処理により変更された文書から変更前の文書を生成する処理である。 The layout restoration process is a process for generating a document before change from a document changed by the layout change process.

図１０はレイアウト復元処理の詳細を示すフローチャートである。 FIG. 10 is a flowchart showing details of the layout restoration process.

ステップＳ１０００では、各文書構成データから埋め込まれているレイアウト情報を抽出する。埋め込み情報抽出処理では、文書構成データの種別に応じて抽出手法を変更する。文書構成データの種別と埋め込み情報抽出手法との対応は、別途埋め込み情報抽出技術対応テーブルに定めておく（不図示）。たとえば、文書構成データが文字列の場合、文字列に対する抽出技術を使用し、図画・写真であれば、図画・写真に対する抽出技術を使用するという対応付けを行っておく。 In step S1000, the embedded layout information is extracted from each document configuration data. In the embedded information extraction process, the extraction method is changed according to the type of document configuration data. The correspondence between the type of document configuration data and the embedded information extraction method is determined separately in an embedded information extraction technology correspondence table (not shown). For example, when the document configuration data is a character string, an extraction technique for the character string is used, and when the document structure data is a drawing / photo, the extraction technique for the drawing / photo is used.

ステップＳ１００１では、ステップＳ１０００においてレイアウト情報が抽出されたかを判定する。レイアウト情報が抽出できたと判定された場合、ステップＳ３１０へ移行する。レイアウト情報が抽出されなかった場合は、処理を終了する。 In step S1001, it is determined whether layout information has been extracted in step S1000. If it is determined that the layout information has been extracted, the process proceeds to step S310. If layout information has not been extracted, the process ends.

ステップＳ１００２では、ステップＳ１０００で抽出したレイアウト情報に基づきレイアウト変更前の各文書構成データを生成する。抽出したレイアウト情報が、位置情報のみの場合は、文書構成データ生成処理は行わない。この場合の文書構成データは、レイアウト復元時に文書画像解析処理により抽出されたものと同じである。ステップＳ１０００にて抽出されたレイアウト情報が、変更割合もしくは文書構成データへの参照情報を含む場合、当該埋め込み情報に基づき文書構成データを生成する。たとえば、変更割合が幅９０％、高さ１００％という情報が埋め込まれていた場合、レイアウト復元時に文書画像解析処理により抽出された文書構成データの幅×０．９の幅を持つ文書構成データを生成する。また、抽出した情報がＵＲＬである場合は、当該ＵＲＬに示される場所に格納されるデータを取得し、レイアウト復元用の文書構成データとする。 In step S1002, each document configuration data before the layout change is generated based on the layout information extracted in step S1000. When the extracted layout information is only the position information, the document configuration data generation process is not performed. The document configuration data in this case is the same as that extracted by the document image analysis process at the time of layout restoration. When the layout information extracted in step S1000 includes change ratio or reference information to the document configuration data, the document configuration data is generated based on the embedded information. For example, when the information that the change ratio is 90% width and 100% height is embedded, the document structure data having a width of 0.9 times the width of the document structure data extracted by the document image analysis process at the time of layout restoration is obtained. Generate. If the extracted information is a URL, the data stored at the location indicated by the URL is acquired and used as document configuration data for layout restoration.

ステップＳ１００３では、レイアウト処理を行う。レイアウト処理では、ステップＳ１０００にて抽出された位置情報をもとにステップＳ１００２で生成された文書構成データを配置する。配置結果が、レイアウト変更前の文書となる。 In step S1003, layout processing is performed. In the layout process, the document configuration data generated in step S1002 is arranged based on the position information extracted in step S1000. The arrangement result is the document before the layout change.

以上、説明したように本実施形態によるレイアウト復元処理では、入力原稿から文書画像解析により文書構成データを抽出する。抽出された文書構成データから元のレイアウト情報を取得することで、入力原稿のみで元のレイアウトを復元することができる。 As described above, in the layout restoration process according to the present embodiment, document configuration data is extracted from an input original by document image analysis. By acquiring the original layout information from the extracted document configuration data, the original layout can be restored using only the input document.

前述した実施形態では、レイアウト変更時にテンプレートにあわせて文書構成データの形状変更を行った。文書構成データが文字列の場合も文字列領域の画像の形状変更を行っていたが、文字列の場合は個々の文字画像を切り出すことで文字の形状を変更することなくレイアウト変更が可能である。 In the above-described embodiment, the shape of the document configuration data is changed according to the template when the layout is changed. Even when the document structure data is a character string, the shape of the image in the character string area was changed. However, in the case of a character string, the layout can be changed without changing the character shape by cutting out individual character images. .

たとえば、縦長の領域の文書構成データ（文字列）をテンプレート内の横長の領域に関連付けた場合、文書構成データの形状変更を行うだけでは、文字列画像は縦につぶされ、横に引き伸ばされることになる（図１１（ａ））。一方、文書構成データ（文字列）から個々の文字画像を切り出し、テンプレート内の領域にあわせて文字画像を並び替えることで、個々の文字画像の形状を崩すことなくレイアウト変更が可能である（図１１（ｂ））。 For example, when document composition data (character strings) in a vertically long area is associated with a horizontally long area in a template, the character string image is crushed vertically and stretched horizontally only by changing the shape of the document structure data. (FIG. 11A). On the other hand, by cutting out individual character images from the document configuration data (character string) and rearranging the character images according to the area in the template, the layout can be changed without breaking the shape of the individual character images (see FIG. 11 (b)).

このように文字列画像を個々の文字画像に分割した場合も、前述した実施形態によりレイアウト変更および復元が可能である。 Thus, even when the character string image is divided into individual character images, the layout can be changed and restored by the above-described embodiment.

具体的には、並び替えられた個々の文字画像に対して、図８におけるステップＳ８０３およびＳ８０４の処理を行うことで、個々の文字画像に情報を埋め込むことができる。 Specifically, information can be embedded in each character image by performing the processing of steps S803 and S804 in FIG. 8 on the rearranged individual character images.

レイアウト復元時は、文書構成データが文字列であれば、個々の文字画像を切り出し、個々の文字画像から情報を抽出することによりレイアウト復元が可能である。 At the time of layout restoration, if the document configuration data is a character string, layout restoration is possible by cutting out individual character images and extracting information from the individual character images.

ここでは、個々の文字画像単位で扱ったが、文字列から単語を切り出し単語画像単位で情報埋め込み、抽出を行ってもよい。 Here, each character image is handled, but a word may be cut out from the character string and information may be embedded and extracted in word image units.

また、行単位や任意の文字数単位で画像を切り出し、情報埋め込み、抽出を行ってもよい。 In addition, an image may be cut out in units of lines or in an arbitrary number of characters, and information may be embedded and extracted.

＜その他の実施形態＞
本発明の目的は前述した実施例の機能を実現するソフトウエアのプログラムコードを記録した記録媒体を装置に供給し、その装置のコンピュータが記録媒体に格納されたプログラムコード（手順）を読み出し実行することによっても実現可能である。この場合、記憶媒体から読み出されたプログラムコード自体が前述した実施形態の機能を実現することとなり、そのプログラムコードを記憶した記憶媒体は本発明を構成することになる。 <Other embodiments>
An object of the present invention is to supply a recording medium on which a program code of software for realizing the functions of the above-described embodiments is recorded to a device, and the computer of the device reads and executes the program code (procedure) stored on the recording medium. Can also be realized. In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention.

また、コンピュータが読み出したプログラムコードの指示に基づいて、コンピュータ上で稼働しているオペレーティングシステム（ＯＳ）等が実際の処理の一部又は全部を実行することによって、前述した実施形態の機能が実現される場合も含まれる。 Further, the functions of the above-described embodiments are realized by an operating system (OS) running on the computer executing part or all of the actual processing based on the instruction of the program code read by the computer. This is also included.

さらに、記録媒体から読み出されたプログラムコードが、コンピュータに挿入された機能拡張カードやコンピュータに接続された機能拡張ユニットに備わるメモリに書き込まれる。その後、プログラムコードの指示に基づいて、その機能拡張カードや機能拡張ユニットに備わるＣＰＵ等が実際の処理の一部又は全部を実行することによって、前述した実施形態の機能が実現される場合も含まれる。 Further, the program code read from the recording medium is written in a memory provided in a function expansion card inserted into the computer or a function expansion unit connected to the computer. Thereafter, the function of the above-described embodiment is realized by executing part or all of the actual processing by the CPU or the like provided in the function expansion card or function expansion unit based on the instruction of the program code. It is.

本発明を上記記録媒体に適用する場合には、その記録媒体には、先に説明したフローチャートに対応するプログラムコードが格納されることになる。 When the present invention is applied to the recording medium, program code corresponding to the flowchart described above is stored in the recording medium.

本発明の実施形態におけるネットワークシステム構成図を示す図である。It is a figure which shows the network system block diagram in embodiment of this invention. 本発明の実施形態における装置構成図を示す図である。It is a figure which shows the apparatus block diagram in embodiment of this invention. 本発明の実施形態において実行する処理全体の概要を示すフローチャートである。It is a flowchart which shows the outline | summary of the whole process performed in embodiment of this invention. 本発明の実施形態における文書画像解析処理の概念を説明するための図である。It is a figure for demonstrating the concept of the document image analysis process in embodiment of this invention. 本発明の実施形態における文書構成データ情報の一例を示す図である。It is a figure which shows an example of the document structure data information in embodiment of this invention. 本発明の実施形態における機能ブロック図を示す図である。It is a figure which shows the functional block diagram in embodiment of this invention. 本発明の実施形態におけるレイアウト変更処理の概要を示す図である。It is a figure which shows the outline | summary of the layout change process in embodiment of this invention. 本発明の実施形態におけるレイアウト変更処理を示すフローチャートである。It is a flowchart which shows the layout change process in embodiment of this invention. 本発明の実施形態におけるレイアウト変更種別の一例を示す図である。It is a figure which shows an example of the layout change classification in embodiment of this invention. 本発明の実施形態におけるレイアウト復元処理を示すフローチャートである。It is a flowchart which shows the layout restoration process in embodiment of this invention. 本発明の実施形態における文字画像分割処理を適用したレイアウト変更の一例である。It is an example of the layout change which applied the character image division | segmentation process in embodiment of this invention.

Claims

A document input process for inputting a document;
A document analysis step of analyzing the document input in the document input step to obtain a document configuration block constituting the document and layout information of the document configuration block;
A layout change step for changing the layout of the document constituent block acquired by the document analysis step;
A document processing method comprising: an embedding step of embedding layout information acquired in the document analysis step in the document composition block whose layout has been changed.

The document processing method according to claim 1, wherein the layout information is position information of the document configuration block.

A document input means for inputting a document;
A document analysis unit that analyzes a document input to the document input unit and obtains a document configuration block constituting the document and layout information of the document configuration block;
A layout change step for changing the layout of the document constituent block acquired by the document analysis means;
A document processing apparatus comprising: an embedding step of embedding layout information acquired by the document analysis means in the document configuration block whose layout has been changed.

A program for causing a computer to execute the procedure of the document processing method according to claim 1 or 2.

A computer-readable storage medium storing the program according to claim 4.