JP2008191948A

JP2008191948A - Layout processor, layout processing method, program and recording medium

Info

Publication number: JP2008191948A
Application number: JP2007026094A
Authority: JP
Inventors: Toshio Miyazawa; 利夫宮澤; Koichi Ejiri; 公一江尻
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2007-02-05
Filing date: 2007-02-05
Publication date: 2008-08-21
Anticipated expiration: 2027-02-05
Also published as: JP5085147B2

Abstract

<P>PROBLEM TO BE SOLVED: To facilitate screening by a reader by disassembling many pieces of information composing an input image to areas and arranging the areas by category. <P>SOLUTION: The processor comprises a document attribute determination/disassembling part 2 for extracting an area from the input image; a category division means for dividing each area by category according to the beginning of the area extracted by the document attribute determination/disassembling part 2; and an area arrangement means for arranging areas belonging to different categories and areas belonging to the same category in different directions respectively. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、レイアウト処理装置、レイアウト処理方法、プログラムおよび記録媒体に関するものである。 The present invention relates to a layout processing apparatus, a layout processing method, a program, and a recording medium.

従来、新聞やビジネス文書のように、文書全体の背景の理解や、要点の把握が重要である文書では、読者による「スクリーニング（＝ふるいにかける。選抜、選別すること）」が必要となり、たとえば「目次、索引（キーワード）、オンライン検索、対極的な文書構造の並列表示」などを行なっていた。また、一方で、大型の表示装置や、新聞などでは、いわゆる「段組」といわれる、文章全体を小さな領域に分割して、２次元化の表示を行うことで、文章の２次元的把握を容易にする方法も知られている。 Traditionally, for documents such as newspapers and business documents where it is important to understand the background of the entire document and to understand the main points, readers need to be screened (= sieved, selected, selected), for example, "Table of contents, index (keyword), online search, parallel display of opposite document structure" and so on. On the other hand, in large display devices and newspapers, so-called “columns” are divided into small areas and the two-dimensional display is performed by dividing the entire sentence into two-dimensional displays. Methods for facilitating it are also known.

このような従来の技術として、携帯電話など表示領域の限られた表示デバイスへの表示方法として、タイトルのみを表示したり、文章を選択して、レイアウト見出しの上に選択した文章を表示する方法が開示されている（たとえば、特許文献１参照）。また、文書データ内のワードから関連する画像を検索して、その画像からその文書のイラストを作成出力することにより、文書の内容を簡単かつ迅速に把握することのできるシステムが開示されている（たとえば、特許文献２参照）。 As such a conventional technique, as a display method on a display device having a limited display area such as a mobile phone, a method of displaying only the title or selecting a sentence and displaying the selected sentence on the layout heading Is disclosed (for example, see Patent Document 1). In addition, a system is disclosed that can easily and quickly grasp the contents of a document by retrieving related images from words in the document data and creating and outputting illustrations of the documents from the images ( For example, see Patent Document 2).

特開２００１−１９５４１２号公報JP 2001-195212 A 特開２００６−２１５６８１号公報JP 2006-215681 A

しかしながら、上記に示されるような従来の技術にあっては、特許文献１では、タイトルのみの表示では、読者がスクリーニングを行う際の判断材料がタイトルのみとなってしまい、適切なスクリーニングができないという問題点があった。また、特許文献２では、イラストのみでは、読者がスクリーニングを行う際の判断情報が不足してしまい、適切なスクリーニングができないという問題点があった。 However, in the conventional technique as described above, in Patent Document 1, if only the title is displayed, the judgment material when the reader performs screening is only the title, and appropriate screening cannot be performed. There was a problem. Further, in Patent Document 2, there is a problem that, with only an illustration, there is a lack of judgment information when a reader performs screening, and appropriate screening cannot be performed.

本発明は、上記に鑑みてなされたものであって、入力画像を構成する多くの情報を領域に分解しこれをカテゴリー別に配置することにより、読者によるスクリーニングを容易にすることを目的の一つとする。 The present invention has been made in view of the above, and an object of the present invention is to facilitate screening by a reader by disassembling a large amount of information constituting an input image into regions and arranging them into categories. To do.

上述した課題を解決し、目的を達成するために、請求項１にかかる発明は、入力画像から領域を抽出する画像抽出手段と、前記画像抽出手段で抽出された領域の文頭にしたがって各領域をカテゴリー別に分割するカテゴリー分割手段と、異なるカテゴリーに属する領域と同じカテゴリーに属する領域についてそれぞれ異なる方向に配列する領域配列手段と、を備えることを特徴とする。 In order to solve the above-described problems and achieve the object, the invention according to claim 1 is directed to an image extracting unit that extracts a region from an input image, and each region according to a sentence head of the region extracted by the image extracting unit. Category dividing means for dividing each category, and area arranging means for arranging areas belonging to the same category as areas belonging to different categories, respectively.

また、請求項２にかかる発明は、さらに、前記カテゴリー分割手段で分割された領域の接続関係を解析する領域解析手段を備えることを特徴とする。 The invention according to claim 2 is characterized by further comprising area analysis means for analyzing the connection relation of the areas divided by the category dividing means.

また、請求項３にかかる発明は、さらに、前記領域解析手段で領域を解析した後、当該入力画像に含まれる図、表、写真を関連する領域の横に配置する配置手段を備えることを
特徴とする。 The invention according to claim 3 further includes an arrangement unit that arranges a diagram, a table, and a photograph included in the input image next to a related area after the area is analyzed by the area analysis unit. And

また、請求項４にかかる発明は、さらに、前記配置手段は、前記領域を囲み記事の形式に配置することを特徴とする。 Further, the invention according to claim 4 is characterized in that the arrangement means arranges the area in the form of an enclosed article.

また、請求項５にかかる発明は、前記配置手段は、表示装置の表示サイズにあわせて前記領域の配置する大きさを調整することを特徴とする。 The invention according to claim 5 is characterized in that the arranging means adjusts the size of the area arranged in accordance with the display size of the display device.

また、請求項６にかかる発明は、さらに、画像読取装置またはネットワーク上のサーバから前記入力画像を受信する受信手段を備えることを特徴とする。 The invention according to claim 6 further includes receiving means for receiving the input image from an image reading apparatus or a server on a network.

また、請求項７にかかる発明は、前記画像抽出手段は、句読点、段落下げ、空行、空白文字、の特徴情報を用いて領域の分解を行なうことを特徴とする。 Further, the invention according to claim 7 is characterized in that the image extracting means decomposes a region using feature information of punctuation marks, paragraph lowering, blank lines, and blank characters.

また、請求項８にかかる発明は、前記領域解析手段は、各領域間の接続関係を、接続詞によって判断することを特徴とする。 The invention according to claim 8 is characterized in that the region analysis means determines a connection relation between the regions by a conjunction.

また、請求項９にかかる発明は、入力画像から領域を抽出する画像抽出工程と、前記画像抽出工程で抽出された領域の文頭にしたがって各領域をカテゴリー別に分割するカテゴリー分割工程と、異なるカテゴリーに属する領域と同じカテゴリーに属する領域についてそれぞれ異なる方向に配列する領域配列工程と、を含むことを特徴とする。 The invention according to claim 9 includes an image extraction step of extracting a region from the input image, a category division step of dividing each region into categories according to the beginning of the region extracted in the image extraction step, and a different category. A region arrangement step of arranging regions belonging to the same category as the region to which the regions belong in different directions.

また、請求項１０にかかる発明は、さらに、前記カテゴリー分割工程で分割された領域の接続関係を解析する領域解析工程を含むことを特徴とする。 The invention according to claim 10 further includes a region analyzing step of analyzing a connection relation of the regions divided in the category dividing step.

また、請求項１１にかかる発明は、さらに、前記領域解析工程で領域を解析した後、当該入力画像に含まれる図、表、写真を関連する領域の横に配置する配置工程を含むことを特徴とする。 The invention according to claim 11 further includes an arrangement step of arranging a diagram, a table, and a photograph included in the input image next to the related region after the region is analyzed in the region analysis step. And

また、請求項１２にかかる発明は、前記配置工程は、前記領域を囲み記事の形式に配置するを特徴とする。 The invention according to claim 12 is characterized in that the arranging step arranges the area in a form of an enclosed article.

また、請求項１３にかかる発明は、前記配置工程は、表示装置の表示サイズにあわせて前記領域の配置する大きさを調整することを特徴とする。 The invention according to claim 13 is characterized in that in the arranging step, the size of the area arranged is adjusted in accordance with the display size of the display device.

また、請求項１４にかかる発明は、画像読取装置またはネットワーク上のサーバから前記入力画像を受信する受信工程を含むことを特徴とする。 The invention according to claim 14 includes a receiving step of receiving the input image from an image reading apparatus or a server on a network.

また、請求項１５にかかる発明は、前記画像抽出工程は、句読点、段落下げ、空行、空白文字、の特徴情報を用いて領域の分解を行なうことを特徴とする。 The invention according to claim 15 is characterized in that in the image extraction step, the region is decomposed using feature information of punctuation marks, paragraph lowering, blank lines, and blank characters.

また、請求項１６にかかる発明は、前記領域解析工程は、各領域間の接続関係を、接続詞によって判断することを特徴とする。 The invention according to claim 16 is characterized in that, in the region analysis step, a connection relation between the regions is determined by a conjunction.

また、請求項１７にかかる発明は、請求項９〜１６のいずれか一つに記載のレイアウト処理方法をコンピュータに実現させることを特徴とする。 The invention according to claim 17 is characterized by causing a computer to implement the layout processing method according to any one of claims 9 to 16.

また、請求項１８にかかる発明は、コンピュータが読み取り可能な記録媒体であって、請求項１７に記載のプログラムを記録したことを特徴とする。 The invention according to claim 18 is a computer-readable recording medium in which the program according to claim 17 is recorded.

本発明のレイアウト処理装置(請求項１)によれば、入力画像から画像抽出手段により領域を抽出し、この抽出された領域からカテゴリー分割手段によりカテゴリー別に分割した後、領域配列手段により、異なるカテゴリーに属する領域と同じカテゴリーに属する領域についてそれぞれ異なる方向に配列することにより、入力画像を構成する多くの情報を領域に分解しこれをカテゴリー別に再配置することが実現するため、読者によるスクリーニングを容易にすることができるという効果を奏する。 According to the layout processing apparatus of the present invention (Claim 1), an area is extracted from an input image by an image extracting means, and the extracted area is divided into categories by a category dividing means. By arranging regions belonging to the same category as regions belonging to different directions, it is possible to decompose a lot of information constituting the input image into regions and rearrange them into categories, making it easy for readers to screen There is an effect that can be made.

また、本発明のレイアウト処理装置(請求項２)によれば、請求項１において、さらに、分割された領域の接続関係を解析する領域解析手段を備えることにより、領域の関係を判断することができるという効果を奏する。 Further, according to the layout processing apparatus of the present invention (claim 2), in claim 1, the relationship between the regions can be determined by further comprising region analysis means for analyzing the connection relationship of the divided regions. There is an effect that can be done.

また、本発明のレイアウト処理装置(請求項３)によれば、請求項２において、領域解析手段で領域を解析した後、当該入力画像に含まれる図、表、写真を関連する領域の近くに配置することにより、領域と関連する図、表、写真とを容易に閲覧することができるという効果を奏する。 Further, according to the layout processing apparatus of the present invention (Claim 3), after analyzing the area by the area analysis means in Claim 2, the diagram, table, and photograph included in the input image are located near the related area. By arranging, it is possible to easily view diagrams, tables, and photographs related to the area.

また、本発明のレイアウト処理装置(請求項４)によれば、請求項２または３において、さらに、前記領域を囲み記事の形式に配置する配置手段を備えることにより、領域毎に識別しやすくなるという効果を奏する。 In addition, according to the layout processing apparatus of the present invention (Claim 4), in addition to Claim 2 or 3, the layout processing apparatus further includes an arrangement unit that arranges the area in the form of an enclosed article, thereby facilitating identification for each area. There is an effect.

また、本発明のレイアウト処理装置(請求項５)によれば、請求項２、３または４において、表示装置の表示サイズにあわせて前記領域を配置する大きさを調整することにより、表示装置が有する表示画面を最大限に利用して大きく表示されるので、見やすい画像が出力されるという効果を奏する。 According to the layout processing apparatus of the present invention (Claim 5), the display apparatus according to Claim 2, 3 or 4 is adjusted by adjusting the size of the area according to the display size of the display apparatus. Since the display screen is displayed in a large size by making the maximum use of the display screen it has, there is an effect that an easy-to-see image is output.

また、本発明のレイアウト処理装置(請求項６)によれば、請求項１において、入力画像を画像読取装置またはネットワーク上のサーバから取り込むことにより、多くの画像ソースを容易に入力することができるという効果を奏する。 Further, according to the layout processing apparatus of the present invention (Claim 6), it is possible to easily input a large number of image sources by capturing an input image from the image reading apparatus or a server on the network. There is an effect.

また、本発明のレイアウト処理装置(請求項７)によれば、請求項１において、句読点、段落下げ、空行、空白文字、の特徴情報を用いて領域の分解を行なうことにより、細かな領域に分解することができるという効果を奏する。 According to the layout processing apparatus of the present invention (Claim 7), the area is decomposed by using the feature information of punctuation marks, paragraph lowering, blank lines, and blank characters according to Claim 1, so that a detailed area is obtained. The effect that it can be decomposed | disassembled is produced.

また、本発明のレイアウト処理装置(請求項８)によれば、請求項２または３において、各領域間の接続関係を接続詞によって解析することができるという効果を奏する。 According to the layout processing apparatus of the present invention (Claim 8), in Claim 2 or 3, there is an effect that the connection relation between the regions can be analyzed by a conjunction.

また、本発明のレイアウト処理方法(請求項９)によれば、入力画像から画像抽出工程で領域を抽出し、この抽出された領域からカテゴリー分割工程でカテゴリー別に統合し分割した後、領域配列工程において、異なるカテゴリーに属する領域と同じカテゴリーに属する領域についてそれぞれ異なる方向に配列することにより、入力画像を構成する多くの情報を領域に分解しこれをカテゴリー別に再配置することが実現するため、読者によるスクリーニングを容易にすることができるという効果を奏する。 In addition, according to the layout processing method of the present invention (claim 9), an area is extracted from an input image by an image extraction process, and the extracted areas are integrated and divided by category by a category division process. In order to realize that it is possible to disassemble a lot of information composing the input image into regions and rearrange them by category by arranging the regions belonging to the same category as the regions belonging to different categories in different directions. There is an effect that screening by can be facilitated.

また、本発明のレイアウト処理方法(請求項１０)によれば、請求項９において、さらに、分割された領域の接続関係を解析する領域解析工程を含むことにより、領域の関係を判断することができるという効果を奏する。 Further, according to the layout processing method of the present invention (claim 10), in claim 9, the relationship between the regions can be determined by further including a region analysis step of analyzing the connection relationship of the divided regions. There is an effect that can be done.

また、本発明のレイアウト処理方法(請求項１１)によれば、請求項１０において、領域解析工程で領域を解析した後、当該入力画像に含まれる図、表、写真を関連する領域の近
くに配置することにより、領域と関連する図、表、写真とを容易に閲覧することができるという効果を奏する。 According to the layout processing method (claim 11) of the present invention, in claim 10, after analyzing the region in the region analysis step, the figure, table, and photograph included in the input image are located near the related region. By arranging, it is possible to easily view diagrams, tables, and photographs related to the area.

また、本発明のレイアウト処理方法(請求項１２)によれば、請求項９または１０において、さらに、前記領域を囲み記事の形式に配置する配置工程を含むことにより、領域毎に識別しやすくなるという効果を奏する。 In addition, according to the layout processing method of the present invention (claim 12), in claim 9 or 10, the layout processing method further includes an arrangement step of arranging the area in the form of an enclosed article, thereby facilitating identification for each area. There is an effect.

また、本発明のレイアウト処理方法(請求項１３)によれば、請求項１０、１１または１２において、表示装置の表示サイズにあわせて前記領域を配置する大きさを調整することにより、表示装置が有する表示画面を最大限に利用して大きく表示されるので、見やすい画像が出力されるという効果を奏する。 According to the layout processing method of the present invention (claim 13), the display device according to claim 10, 11 or 12 is adjusted by adjusting the size of the area according to the display size of the display device. Since the display screen is displayed in a large size by making the maximum use of the display screen it has, there is an effect that an easy-to-see image is output.

また、本発明のレイアウト処理方法(請求項１４)によれば、請求項９おいて、入力画像を画像読取装置またはネットワーク上のサーバから取り込むことにより、多くの画像ソースを容易に入力することができるという効果を奏する。 According to the layout processing method (claim 14) of the present invention, in claim 9, a large number of image sources can be easily input by fetching an input image from an image reading apparatus or a server on a network. There is an effect that can be done.

また、本発明のレイアウト処理方法(請求項１５)によれば、請求項９において、句読点、段落下げ、空行、空白文字、の特徴情報を用いて領域の分解を行なうことにより、細かな領域に分解することができるという効果を奏する。 Further, according to the layout processing method (claim 15) of the present invention, in the claim 9, the region is decomposed by using the feature information of the punctuation marks, the paragraph lowering, the blank line, and the blank character, thereby subdividing the region. The effect that it can be decomposed | disassembled is produced.

また、本発明のレイアウト処理方法(請求項１６)によれば、請求項９、１０または１１において、各領域間の接続関係を接続詞によって解析することができるという効果を奏する。 In addition, according to the layout processing method of the present invention (claim 16), the connection relationship between the areas can be analyzed by a conjunction in claim 9, 10 or 11.

また、本発明のプログラム(請求項１７)によれば、請求項９〜１６のいずれか一つに記載のレイアウト処理方法をコンピュータに実現させることができるという効果を奏する。 Moreover, according to the program of this invention (Claim 17), there exists an effect that the layout processing method as described in any one of Claims 9-16 can be implement | achieved by a computer.

また、本発明の記録媒体(請求項１８)によれば、請求項１７に記載のプログラムを記録したことにより、コンピュータ上で請求項９〜１６のいずれか一つに記載のレイアウト処理方法を実行することができるという効果を奏する。 According to the recording medium of the present invention (Claim 18), the layout processing method according to any one of Claims 9 to 16 is executed on a computer by recording the program according to Claim 17. There is an effect that can be done.

以下に添付図面を参照して、この発明にかかるレイアウト処理装置、レイアウト処理方法、プログラムおよび記録媒体の最良な実施の形態を詳細に説明する。 Exemplary embodiments of a layout processing apparatus, a layout processing method, a program, and a recording medium according to the present invention are explained in detail below with reference to the accompanying drawings.

（実施の形態）
本発明は、新聞やビジネス文書のように、文書全体の背景の理解や、要点の把握が重要である文書では、読者による「スクリーニング（＝ふるいにかける。選抜、選別すること。）」が必要で、「スクリーニング」をする上で読みやすいレイアウト方法を提供するものである。また、大型の表示装置や、新聞などでは、いわゆる「段組」といわれる、文章全体を小さな領域に分割して、２次元化の表示を行うことで、文章の２次元的把握を容易にする方法において、「段組」のレイアウトを自動化する方法を提供するものである。以下、具体的に説明する。 (Embodiment)
The present invention requires “screening (= sieving. Selection, selection)” by the reader for documents such as newspapers and business documents where it is important to understand the background of the entire document and to understand the main points. Therefore, it provides a layout method that is easy to read for “screening”. Also, in large display devices and newspapers, so-called “columns” are divided into small areas and the two-dimensional display is performed to facilitate the two-dimensional grasp of the sentence. In the method, a method for automating the layout of “columns” is provided. This will be specifically described below.

図１は、本発明の実施の形態にかかるレイアウト自動化装置の構成を示すブロック図である。この図１に示すように、本発明のレイアウト処理装置としてのレイアウト自動化装置１００は、本装置をマイクロコンピュータシステムにより後述するような動作をプログラムにしたがって本装置を制御する制御部１、処理対象となる図や表・テキストなどの文書属性を判定してそれぞれの文書属性に分解し、属性がテキストの領域をパラグラフに分解する画像抽出手段としての文書属性判定・分解部２、分解したパラグラフを１ページにたとえば１０個前後となるように統合するカテゴリー分割手段としてのパラグラフ統合・
分割部３、パラグラフの階層を解析する領域解析手段としてのパラグラフ階層解析部４、パラグラフの解析結果から１ページ内に配置する配置手段としてのブロック配置部５、各ブロックに、統合・分割後のパラグラフのテキストをコラム（囲み記事）構成で配置する領域配列手段の機能を備えるコラム作成部６、を備えている。また、制御部１は、ネットワーク（図２参照）上のサーバやスキャナ(図２参照)から処理対象の画像を入力する機能を有している。 FIG. 1 is a block diagram showing a configuration of a layout automation apparatus according to an embodiment of the present invention. As shown in FIG. 1, a layout automation apparatus 100 as a layout processing apparatus according to the present invention includes a control unit 1 that controls the apparatus according to a program in accordance with a program for the operation of the apparatus as will be described later. A document attribute determination / decomposition unit 2 serving as an image extracting unit that determines a document attribute such as a figure, a table, and a text, and decomposes the document attribute into each document attribute. Paragraph integration as a category division means to integrate about 10 pages on a page
The division unit 3, the paragraph hierarchy analysis unit 4 as an area analysis unit for analyzing the paragraph hierarchy, the block arrangement unit 5 as an arrangement unit to arrange within one page from the analysis result of the paragraph, A column creation unit 6 having a function of region arrangement means for arranging paragraph text in a column (boxed article) configuration is provided. Further, the control unit 1 has a function of inputting an image to be processed from a server or a scanner (see FIG. 2) on a network (see FIG. 2).

図２は、図１に示したレイアウト自動化装置を含むシステム構成を示すブロック図である。この図２において、符号１００は図１に示すように構成されたレイアウト自動化装置、符号１０１は本発明によるレイアウトの処理状態をたとえば液晶パネルに表示する表示部、符号１０２は本発明のレイアウト時における必要な操作を行なう操作部、符号１０３は処理対象の文書を入力するためのスキャナ、符号１０４は本発明による文書や処理後のデータ（たとえばパラグラフの分解に用いた、句読点、段落下げ、空行、空白文字などの特徴情報や、分解後のパラグラフ）などを必要に応じて記憶しておくための大容量記憶装置、符号１０５はネットワーク１０７とのインターフェイス処理を行なうネットワークＩ／Ｆ（インターフェイス）、符号１０６は本システムを統括的に制御するシステム制御部、符号１０７はパーソナルコンピュータやサーバなどが接続され、処理対象の文書情報を入力するためのインターネットなどのネットワークである。 FIG. 2 is a block diagram showing a system configuration including the layout automation apparatus shown in FIG. In FIG. 2, reference numeral 100 denotes a layout automation apparatus configured as shown in FIG. 1, reference numeral 101 denotes a display unit for displaying a layout processing state according to the present invention on, for example, a liquid crystal panel, and reference numeral 102 denotes a layout in the present invention. An operation unit for performing necessary operations, a reference numeral 103 is a scanner for inputting a document to be processed, a reference numeral 104 is a document according to the present invention and processed data (for example, punctuation marks, paragraph lowering, blank lines used for paragraph decomposition) , A large capacity storage device for storing feature information such as blank characters and a disassembled paragraph) as necessary, a reference numeral 105 denotes a network I / F (interface) for performing an interface process with the network 107, Reference numeral 106 denotes a system control unit for overall control of the system, and reference numeral 107 denotes a personal computer. Etc. over data and servers are connected, a network such as the Internet for inputting document information to be processed.

以上のように構成されたレイアウト自動化装置１００は基本的につぎのような処理を実行する。まず、処理の流れとして、まず、入力画像を各パラグラフ（≒文字列などの領域）に分解し、さらに各パラグラフ（領域）の文頭に基づいて、各パラグラフ（領域）をカテゴリー別に分割する。 The layout automation device 100 configured as described above basically executes the following processing. First, as a processing flow, first, an input image is decomposed into each paragraph (≈an area such as a character string), and each paragraph (area) is divided into categories based on the head of each paragraph (area).

この処理例は以下の通りである。
１．『導入部』 … 『まず』等
２．『展開部』 … 『次に』、『また』、『さらに』等
３．『しかし文』… 『しかし』
４．『結論』 … 『まとめると』、『したがって』等 An example of this processing is as follows.
1. “Introduction”… “First” etc. “Development”… “Next”, “Mata”, “More”, etc. “But Sentence”… “But”
4). “Conclusion”… “To summarize”, “So”, etc.

上記のように入力画像をパラグラフに分解し、カテゴリー別に分類する処理を行なった後、各パラグラフのレイアウトを変更する場合、『異なるカテゴリー』に属するパラグラフは垂直方向に配置し、『同じカテゴリー』に属するパラグラフは水平方向に配置する(図４−４参照）。 When the input image is decomposed into paragraphs and classified by category as described above, and the layout of each paragraph is changed, the paragraphs belonging to `` different categories '' are arranged vertically and placed in the `` same category '' The paragraphs to which they belong are arranged in the horizontal direction (see Fig. 4-4).

また、パラグラフ中に[図１]や[写真１]といった文字が存在する場合には、該パラグラフの横に、該当する図や写真を並べて配置する(図４−２参照）。また、表示部１０１の画面サイズに併せて１ページの表示の大きさを変更する(図４−３参照）。さらに囲み記事（コラム）を作成する(図４−４参照）。 In addition, when characters such as [FIG. 1] and [Photo 1] are present in a paragraph, the corresponding figure or photo is arranged next to the paragraph (see FIG. 4-2). Further, the display size of one page is changed in accordance with the screen size of the display unit 101 (see FIG. 4-3). Further, a boxed article is created (see Fig. 4-4).

つぎに図３に示すフローチャートおよび図４−１〜図４−５を参照し、上述したレイアウト自動化装置の一連の処理について説明する。図３に示すフローチャートにおいて、まず、文書属性判定・分解部２は、対象文書を、図や表、テキストに分解し、テキストはさらにパラグラフ（＝文章の節または段落、新聞・雑誌などの短い記事）に分解する（ステップＳ１）。続いて、パラグラフ統合・分割部３は、１ページにたとえば１０個前後になるように、パラグラフの統合・分割を行い（ステップＳ２）、さらにパラグラフ階層解析部４は、パラグラフの階層を解析する（ステップＳ３）。続いて、さらにブロック配置部５は、文書に含まれる図や表は引用部の近くに配置し（ステップＳ４）、パラグラフの階層を表現するブロックを１ページに配置する（ステップＳ５）。続いて、コラム作成部６は、各ブロックに、統合・分割後のパラグラフ内のテキストを、コラム（＝囲み記事）構
成で配置する（ステップＳ６）。その後、各ブロック内の最初の行の強調処理を実行する（ステップＳ７）。 Next, a series of processes of the layout automation device described above will be described with reference to the flowchart shown in FIG. 3 and FIGS. In the flowchart shown in FIG. 3, the document attribute determination / decomposition unit 2 first decomposes the target document into figures, tables, and texts, and the text is further converted into paragraphs (= sentences or paragraphs of sentences, short articles such as newspapers / magazines). (Step S1). Subsequently, the paragraph integration / division unit 3 integrates and divides the paragraphs so that there are, for example, about 10 on one page (step S2), and the paragraph hierarchy analysis unit 4 analyzes the hierarchy of the paragraphs (step S2). Step S3). Subsequently, the block arrangement unit 5 arranges the figures and tables included in the document near the citation unit (step S4), and arranges the blocks representing the paragraph hierarchy on one page (step S5). Subsequently, the column creation unit 6 arranges the text in the paragraph after integration and division in each block in a column (= enclosed article) configuration (step S6). Thereafter, the emphasis process for the first line in each block is executed (step S7).

さらに、上述したそれぞれの処理内容について詳述する。ステップＳ１では、対象文書を、図や表、テキストに分解し、テキストはさらにパラグラフ（＝文章の節または段落、新聞・雑誌などの短い記事）に分解する。対象文書が、いわゆる電子ファイル（ワープロソフトで作成されたファイルや、ＰＤＦ（ＰｏｒｔａｂｌｅＤｏｃｕｍｅｎｔＦｏｒｍａｔ：米Ａｄｏｂｅ社）、ＨＴＭＬ（ＨｙｐｅｒＴｅｘｔＭａｒｋｕｐＬａｎｇｕａｇｅ）、ＸＭＬ（ＥｘｔｅｎｓｉｂｌｅＭａｒｋｕｐＬａｎｇｕａｇｅ）など）テキストをＯＣＲ（ｏｐｔｉｃａｌｃｈａｒａｃｔｅｒｒｅａｄｅｒ）処理を必要とせずに取り出すことが可能な文書の場合は、その文書を解析することにより、図や表、テキストに分解することができる。このような構造的文書は、図２に示すようにインターネット上に接続されたサーバから取り込むことにより、多くの画像ソースを容易に入力することができる。 Furthermore, each processing content mentioned above is explained in full detail. In step S1, the target document is decomposed into figures, tables, and texts, and the text is further decomposed into paragraphs (= short articles such as sentence sections or paragraphs, newspapers and magazines). The target document is a so-called electronic file (a file created by word processing software, PDF (Portable Document Format: Adobe Corporation), HTML (Hyper Text Markup Language), XML (Extensible Markup Language Text), etc. In the case of a document that can be extracted without requiring a reader process, the document can be decomposed into a figure, a table, or a text by analyzing the document. By importing such a structural document from a server connected on the Internet as shown in FIG. 2, it is possible to easily input many image sources.

具体的には、“タグ”と呼ばれるものがついていることが多く、たとえば、テキスト部分には＜ＴＥＸＴ＞、図には＜ＦＩＧ＞などのタグ付の文書形式である。一方、対象文書が、いわゆる画像ファイルや紙に印刷された文書の場合は、ＯＣＲ処理により図や表、テキストに分解する。ＯＣＲ処理の方法は、これまでにも非常に多くの方式が提案されており、特段方法を限定するものではない。 Specifically, what is called a “tag” is often attached, for example, a document format with a tag such as <TEXT> in the text portion and <FIG> in the figure. On the other hand, if the target document is a so-called image file or a document printed on paper, it is decomposed into a figure, a table, and text by OCR processing. A great number of methods for the OCR processing have been proposed so far, and the method is not particularly limited.

また、システム構成の簡単化のために、いわゆる電子ファイルの場合であっても、一旦画像ファイル形式に変換を行い、画像ファイルや紙に印刷された文書と同様に、ＯＣＲ処理により図や表、テキストに分解してもよい。これは電子ファイルの場合は、様々なファイル形式があり、１つのファイル形式でもバージョンによって、互換性が保たれていない場合など、解析を行うためには、数多くのケースに対応する必要があり、システムが大きくなってしまう場合があるためである。画像ファイル形式への変換は、プリンタドライバを用いて行うことが可能である。 In order to simplify the system configuration, even in the case of a so-called electronic file, it is once converted into an image file format, and as with an image file or a document printed on paper, a chart, table, It may be broken down into text. In the case of electronic files, there are various file formats, and even if one file format is not compatible depending on the version, it is necessary to deal with many cases in order to perform analysis. This is because the system may become large. Conversion to the image file format can be performed using a printer driver.

つぎに、テキストをパラグラフに分解する。これは、日本語の場合、句読点や、段落下げ、空行、空白文字などを情報としてパラグラフに分解する。このステップでは、まず細かく分解を行い、後段のステップで適切な数に統合する。そこで、句読点、段落下げ、空行、空白文字などパラグラフに分解可能なすべての特徴情報を利用してパラグラフを分解する。このように、句読点、段落下げ、空行、空白文字、の特徴情報を用いて領域の分解を行なうことにより、細かな領域に分解することができる。なお、後段でパラグラフの統合を行なうために、どの特徴情報でパラグラフ分解を行なったのかをあわせて大容量記憶装置１０４などの記憶装置に保存する。処理例を以下に示す。 Next, break the text into paragraphs. In the case of Japanese, punctuation marks, paragraph lowering, blank lines, blank characters, etc. are decomposed into paragraphs as information. In this step, first, a fine decomposition is performed, and an appropriate number is integrated in a later step. Therefore, the paragraph is decomposed using all feature information that can be decomposed into paragraphs such as punctuation marks, paragraph lowering, blank lines, and blank characters. In this way, by dividing the region using the feature information of punctuation marks, paragraph lowering, blank lines, and blank characters, it is possible to break down into fine regions. It should be noted that in order to integrate paragraphs at a later stage, the feature information used for the paragraph decomposition is stored in a storage device such as the mass storage device 104. An example of processing is shown below.

パラグラフ１私の名前は、読点
パラグラフ２理光太郎です。句点
パラグラフ３このたび異動することになりました。句点、空白行
パラグラフ４移動先は海老名事業所で、読点
・・・・・・・・・ Paragraph 1 My name is Taro Rikko. Punctuation paragraph 3 This time, it will be changed. Punctuation, blank line paragraph 4 Move to the Ebina office, and read ...

ステップＳ２では、１ページに１０個前後になるように、パラグラフの統合・分割を行う。パラグラフの統合・分割は、ステップＳ１で行った、パラグラフを分解した結果を用いて行う。まず、１ページに１０個前後になるようにするために、パラグラフを分解した結果から、読点以外で分解されたパラグラフの個数を数える。上記ステップＳ１の例では、パラグラフ１とパラグラフ２とを１つのパラグラフとして計算する。この数が１０個前後のあらかじめユーザーが指定したパラグラフ数の範囲内であれば、統合・分割は行わな
い。ここで読点を除くのは、一般にパラグラフは、文章の節や段落を１つの単位とするためで、読みやすさの観点からも読点ごとにコラム形式でレイアウトするのは適さないためである。 In step S2, the paragraphs are integrated and divided so that there are about 10 per page. The integration / division of paragraphs is performed using the result of disassembling the paragraphs performed in step S1. First, the number of paragraphs decomposed other than the punctuation is counted from the result of disassembling the paragraphs so that there are about 10 per page. In the example of step S1, paragraph 1 and paragraph 2 are calculated as one paragraph. If this number is within the range of about 10 paragraphs specified by the user in advance, the integration / division is not performed. The reason why reading marks are excluded here is that paragraphs are generally composed of sections and paragraphs of sentences as one unit, and from the viewpoint of readability, it is not suitable to lay out in column format for each reading point.

もし、個数があらかじめユーザーが指定した範囲より少ない場合は、読点での分割を考える。分割を行ったあとのパラグラフの文字数が、ユーザーがあらかじめ指定した文字数よりも多い場合はパラグラフを分割する。分割した結果、１０個前後のパラグラフ数に達した場合は、さらに分割を行わない。 If the number is less than the range specified by the user in advance, division by reading marks is considered. If the number of characters in the paragraph after division is greater than the number of characters specified by the user in advance, the paragraph is divided. As a result of the division, if the number of paragraphs reaches around 10, no further division is performed.

一方、個数があらかじめユーザーが指定した範囲より多い場合は、統合を考える。統合は、ステップＳ１でのパラグラフ分解をどの情報で行ったのかの情報をたとえば大容量記憶装置１０４に記憶されている情報から活用する。具体的には、句点→段落→空白行の順に統合を行い、句点で統合した結果、１０個前後のパラグラフ数に達した場合は、さらに統合は行わない。同様に段落で統合した結果、１０個前後のパラグラフ数に達した場合は、さらに統合は行わない。 On the other hand, if the number is greater than the range specified by the user in advance, consider integration. For the integration, information indicating which information was used for the paragraph decomposition in step S1 is utilized from information stored in the mass storage device 104, for example. More specifically, if the number of paragraphs is about 10, as a result of integration in the order of the paragraphs → paragraphs → blank lines, and the integration of the paragraphs, no further integration is performed. Similarly, when the number of paragraphs reaches about 10 as a result of integration in paragraphs, no further integration is performed.

ステップＳ３は、パラグラフの階層を解析する。この処理例を図４−１に示す。パラグラフの階層の解析は、ステップＳ２で実行したパラグラフの統合・分割を行った結果を用いて行なう。パラグラフ統合・分割された各パラグラフをさらに形態素解析を行い、各パラグラフの接続関係を抽出する。具体的には、各パラグラフの形態素解析結果のはじめの形態素が所定の接続詞かどうかで判断する。このように、分割された領域の接続関係を解析することにより、領域の関係を判断することができる。 Step S3 analyzes the paragraph hierarchy. An example of this processing is shown in FIG. The analysis of the paragraph hierarchy is performed using the result of the integration / division of the paragraph executed in step S2. The morphological analysis is further performed on each paragraph integrated and divided, and the connection relation between the paragraphs is extracted. Specifically, it is determined whether or not the first morpheme of the morpheme analysis result of each paragraph is a predetermined conjunction. In this way, by analyzing the connection relation of the divided areas, the relation of the areas can be determined.

具体例としては、「また」「しかし」「まとめると」「あるいは」「次に」「さらに」などの所定の接続詞をあらかじめ登録しておく。また、接続詞によって、縦に配置する関係と、横に配置する関係にあるものを区別しておく。 As a specific example, predetermined conjunctions such as “also” “but” “collectively” “or” “next” “further” and the like are registered in advance. In addition, the relationship between the vertical arrangement and the horizontal arrangement is distinguished by the conjunction.

ステップＳ４では、文書に含まれる図や表は引用部の近くに配置する。すなわち、ここでは、図４−２に示すように、「まとめると・・・図１に示すように」という引用部のとなりに「図１」を配置する。このように、領域を解析した後、当該入力画像に含まれる図、表、写真を関連する領域の近くに配置することにより、領域と関連する図、表、写真とを容易に閲覧することができる。 In step S4, the figures and tables included in the document are arranged near the citation section. That is, here, as shown in FIG. 4B, “FIG. 1” is arranged next to the quoted part “when combined, as shown in FIG. 1”. In this way, after analyzing the region, it is possible to easily view the diagram, table, and photograph related to the region by arranging the figure, table, and photo included in the input image near the related region. it can.

ステップＳ５では、パラグラフの階層を表現するブロックを１ページに配置する。この処理例を図４−３に示す。ここでは図４−３に示すようにステップＳ４で作成したパラグラフの解析結果を受けて、１ページ内に収まるように、パラグラフの階層を表現するブロックの大きさを調整する。すなわち、表示装置の表示サイズにあわせて領域を配置する大きさを調整するため、表示装置が有する表示画面を最大限に利用して配置後の領域が大きく表示されて、見やすい画像が出力されることになる。 In step S5, a block expressing a paragraph hierarchy is arranged on one page. An example of this processing is shown in FIG. Here, as shown in FIG. 4C, the size of the block representing the paragraph hierarchy is adjusted so that the result of the analysis of the paragraph created in step S4 is received within one page. In other words, in order to adjust the size of the area to be arranged in accordance with the display size of the display device, the display area of the display device is used to the maximum, the area after the arrangement is displayed large, and an easy-to-view image is output. It will be.

ステップＳ６では、図４−４の処理例にように、パラグラフの階層を表現する各ブロックを、統合・分割後のパラグラフ内のテキストを、コラム（＝囲み記事）構成で配置する。このように、領域を囲み記事の形式に再配置する再配置手段を備えることにより、領域毎に識別しやすくなる。ままた、各領域間の接続関係を接続詞によって解析することができる。 In step S6, as shown in the processing example of FIG. 4-4, each block expressing the paragraph hierarchy is arranged in a column (= enclosed article) text in the paragraph after integration / division. In this way, by providing the rearrangement means for rearranging the area into the form of the article, it becomes easy to identify each area. In addition, the connection relationship between each region can be analyzed by a conjunction.

ステップＳ７では、パラグラフの階層を表現する各ブロック内の最初の行を強調する。強調の方法には、たとえば、太字にする、フォントを大きくする、色を変える、あるいは図４−５に示すように下線を引くなど様々な方法がある。 In step S7, the first line in each block representing the paragraph hierarchy is highlighted. There are various emphasis methods such as bolding, enlarging the font, changing the color, or underlining as shown in FIG. 4-5.

したがって、この実施の形態によれば、入力画像から領域を抽出し、この抽出された各領域をカテゴリー別に分割した後、異なるカテゴリーに属する領域と同じカテゴリーに属する領域についてそれぞれ異なる方向に配列することにより、入力画像を構成する多くの情報を領域に分解しこれをカテゴリー別に配置することが実現するため、読者によるスクリーニングを容易にすることができる。 Therefore, according to this embodiment, after extracting an area from the input image and dividing each extracted area into categories, the areas belonging to the same category as the areas belonging to different categories are arranged in different directions. Therefore, it is possible to decompose a large amount of information constituting the input image into regions and arrange them into categories, thereby facilitating screening by the reader.

なお、これまで説明してきた実施の形態では、入力画像を横書きあるいは縦書きについて説明してきたが、これに限らず横書きと縦書きが混在している場合にも適用することが可能である。 In the embodiments described so far, the input image has been described for horizontal writing or vertical writing. However, the present invention is not limited to this, and the present invention can be applied to a case where horizontal writing and vertical writing are mixed.

ところで、これまで説明してきた実施の形態におけるレイアウト処理方法（動作）を、プログラム化し、コンピュータ読み取り可能な記録媒体に記録し、コンピュータ上で実行することもできる。また、レイアウト処理方法の一部をネットワーク上に有し、通信回線を通して実現することもできる。 By the way, the layout processing method (operation) in the embodiment described so far can be programmed, recorded on a computer-readable recording medium, and executed on the computer. Also, a part of the layout processing method can be provided on a network and realized through a communication line.

すなわち、この実施の形態で説明したレイアウト処理方法は、図５に示すように、あらかじめ用意されたプログラムをパーソナルコンピュータやワークステーションなどのコンピュータ（ＣＰＵ２００）で実行することにより実現される。このプログラムは、キーボードの操作などにより、メモリ２０１、ハードディスク２０４、フレキシブルディスク２０７、ＣＤ−ＲＯＭ（Ｃｏｍｐａｃｔ−ＤｉｓｃＲｅａｄＯｎｌｙＭｅｍｏｒｙ）２０６、ＭＯ（ＭａｇｎｅｔｏＯｐｔｉｃａｌ）、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）などのコンピュータで読み取り可能な記録媒体に記録され、コンピュータ（ＣＰＵ２００）によって記録媒体から読み出し、必要に応じて表示装置に表示することによって実行される。また、必要に応じてこのレイアウト処理方法のデータを通信装置から外部装置に送受信することも可能である。 That is, the layout processing method described in this embodiment is realized by executing a program prepared in advance on a computer (CPU 200) such as a personal computer or a workstation as shown in FIG. This program is read by a computer such as a memory 201, a hard disk 204, a flexible disk 207, a CD-ROM (Compact-Disc Read Only Memory) 206, an MO (Magneto Optical), a DVD (Digital Versatile Disc) by operating the keyboard. It is recorded on a possible recording medium, read from the recording medium by a computer (CPU 200), and displayed on a display device as necessary. Further, the data of this layout processing method can be transmitted and received from the communication device to the external device as necessary.

また、このプログラムは、上記記録媒体を介して、インターネットなどのネットワークによってパーソナルコンピュータなどの装置に配布することができる。 The program can be distributed to a device such as a personal computer via the recording medium via a network such as the Internet.

すなわち、このプログラムは、図６に示すように、たとえばコンピュータに内蔵されている記録媒体としてのハードディスクに、あらかじめインストールした状態で提供することができる。プログラムは記録媒体に一時的あるいは永続的に格納し、コンピュータにユニットとして組み込んだり、あるいは着脱式の記録媒体として利用することで、パッケージソフトウェアとして提供することができる。 That is, as shown in FIG. 6, this program can be provided in a state of being installed in advance on a hard disk as a recording medium built in the computer, for example. The program can be temporarily or permanently stored in a recording medium, and can be provided as packaged software by being incorporated in a computer as a unit or being used as a removable recording medium.

記録媒体としては、たとえば、フレキシブルディスク、ＣＤ−ＲＯＭ、ＭＯディスク、ＤＶＤ、磁気ディスク、半導体メモリなどが利用できる。 As the recording medium, for example, a flexible disk, a CD-ROM, an MO disk, a DVD, a magnetic disk, a semiconductor memory, and the like can be used.

プログラムは、ダウンロードサイトから、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）やインターネットといったネットワーク２１０を介して、有線または無線でコンピュータに転送し、そのコンピュータ２１１〜２１３において、内蔵するハードディスクなどの記憶装置にダウンロードさせるようにすることができる。 The program is transferred from a download site to a computer in a wired or wireless manner via a network 210 such as a LAN (Local Area Network) or the Internet, and is downloaded to a storage device such as a built-in hard disk in the computers 211 to 213. can do.

以上のように、本発明にかかるレイアウト処理装置、レイアウト処理方法、プログラムおよび記録媒体は、新聞やビジネス文書などにおいて文書全体の背景の理解や要点を理解しやすいようにするレイアウトに有用であり、特に、新聞などにおける段組のレイアウトを自動化するシステムや方法に適している。 As described above, the layout processing device, the layout processing method, the program, and the recording medium according to the present invention are useful for a layout that makes it easy to understand the background and main points of the entire document in newspapers and business documents. In particular, it is suitable for a system or method for automating column layout in newspapers.

本発明の実施の形態にかかるレイアウト自動化装置の構成を示すブロック図である。It is a block diagram which shows the structure of the layout automation apparatus concerning embodiment of this invention. 図１に示したレイアウト自動化装置を含むシステム構成を示すブロック図である。It is a block diagram which shows the system configuration | structure containing the layout automation apparatus shown in FIG. 本発明の実施の形態にかかるレイアウト自動化装置の処理動作を示すフローチャートである。It is a flowchart which shows the processing operation of the layout automation apparatus concerning embodiment of this invention. 図３におけるステップＳ３の処理例を示す説明図である。It is explanatory drawing which shows the process example of step S3 in FIG. 図３におけるステップＳ４の処理例を示す説明図である。It is explanatory drawing which shows the process example of step S4 in FIG. 図３におけるステップＳ５の処理例を示す説明図である。It is explanatory drawing which shows the process example of step S5 in FIG. 図３におけるステップＳ６の処理例を示す説明図である。It is explanatory drawing which shows the process example of step S6 in FIG. 図３におけるステップＳ７の処理例を示す説明図である。It is explanatory drawing which shows the process example of step S7 in FIG. 本発明の実施の形態にかかるレイアウト自動化方法をコンピュータに実行させる例を示すブロック図である。It is a block diagram which shows the example which makes a computer perform the layout automation method concerning embodiment of this invention. 本発明の実施の形態にかかるレイアウト自動化方法をネットワーク上からダウンロードして実行させる例を示すブロック図である。It is a block diagram which shows the example which downloads and performs the layout automation method concerning embodiment of this invention from a network.

Explanation of symbols

１制御部
２文書属性判定・分解部
３パラグラフ統合・分割部
４パラグラフ階層解析部
５ブロック配置部
６コラム作成部
１００レイアウト自動化装置
１０１表示部
１０２操作部
１０３スキャナ
１０４大容量記憶装置
１０５ネットワークＩ／Ｆ
１０６システム制御部
１０７ネットワーク DESCRIPTION OF SYMBOLS 1 Control part 2 Document attribute determination / decomposition | disassembly part 3 Paragraph integration / division | segmentation part 4 Paragraph hierarchy analysis part 5 Block arrangement part 6 Column creation part 100 Layout automation apparatus 101 Display part 102 Operation part 103 Scanner 104 Mass storage device 105 Network I / F
106 System control unit 107 Network

Claims

Image extracting means for extracting a region from the input image;
Category dividing means for dividing each area into categories according to the beginning of the area extracted by the image extracting means,
Area arrangement means for arranging areas belonging to the same category as areas belonging to different categories in different directions;
A layout processing apparatus comprising:

The layout processing apparatus according to claim 1, further comprising a region analysis unit that analyzes a connection relationship between the regions divided by the category division unit.

The layout processing according to claim 2, further comprising an arrangement unit that arranges a diagram, a table, and a photograph included in the input image next to a related region after the region is analyzed by the region analysis unit. apparatus.

The layout processing apparatus according to claim 2, wherein the arrangement unit arranges the area in a form of an enclosed article.

The layout processing apparatus according to claim 2, wherein the arrangement unit adjusts a size of the area in accordance with a display size of the display apparatus.

The layout processing apparatus according to claim 1, further comprising receiving means for receiving the input image from an image reading apparatus or a server on a network.

The layout processing apparatus according to claim 1, wherein the image extraction unit decomposes a region using feature information of punctuation marks, paragraph lowering, blank lines, and blank characters.

The layout processing apparatus according to claim 2, wherein the area analysis unit determines a connection relation between the areas based on a conjunction.

An image extraction step of extracting a region from the input image;
A category dividing step of dividing each region into categories according to the beginning of the region extracted in the image extracting step;
A region arrangement step of arranging regions belonging to the same category as regions belonging to different categories in different directions;
A layout processing method comprising:

The layout processing method according to claim 9, further comprising a region analysis step of analyzing a connection relationship between the regions divided in the category division step.

The layout processing according to claim 10, further comprising an arrangement step of arranging a figure, a table, and a photograph included in the input image next to a related region after analyzing the region in the region analysis step. Method.

The layout processing method according to claim 9, wherein the arranging step arranges the area in a form of an enclosed article.

13. The layout processing method according to claim 10, 11 or 12, wherein a size of the area is adjusted in accordance with a display size of the display device.

The layout processing method according to claim 9, further comprising a receiving step of receiving the input image from an image reading device or a server on a network.

The layout processing method according to claim 9, wherein the image extracting step performs region decomposition using feature information of punctuation marks, paragraph lowering, blank lines, and blank characters.

The layout processing method according to claim 10, wherein the region analysis step determines a connection relation between the regions based on a conjunction.

A program for causing a computer to implement the layout processing method according to any one of claims 9 to 16.

A computer-readable recording medium on which the program according to claim 17 is recorded.