JP2008305088A

JP2008305088A - Document processor, document processing method, and document processing program

Info

Publication number: JP2008305088A
Application number: JP2007150621A
Authority: JP
Inventors: Yoshio Komaki; 由夫小巻
Original assignee: Konica Minolta Business Technologies Inc
Current assignee: Konica Minolta Business Technologies Inc
Priority date: 2007-06-06
Filing date: 2007-06-06
Publication date: 2008-12-18
Anticipated expiration: 2027-06-06
Also published as: JP5125238B2

Abstract

PROBLEM TO BE SOLVED: To efficiently generate browsing navigation information according to a content region included in a document image, and to efficiently generate browsing navigation information according to the characteristics of a display means for displaying the document image in generating an electronic document including the document image. SOLUTION: A bookmark data generation part classifies a whole content region into at least one group (step S114), and evaluates the degree of adaptation as the bookmark of each group on the basis of the position of the content region belonging to each of those classified groups in a document image (step S116). A bookmark generation part 17 selects at least one group as the generation object of the bookmark data from among the most significant evaluation results on the basis of the evaluation result (step S118), and generates the bookmark data showing the position of the content region belonging to the group selected as mentioned above in the document image on the basis of the attribute information about the content region belonging to the group (step S122). COPYRIGHT: (C)2009,JPO&INPIT

Description

この発明は文書画像を含む電子化文書を扱う文書処理装置、文書処理方法および文書処理プログラムに関し、特に文書画像に含まれる内容領域に対して閲覧ナビゲート情報を生成する技術に関する。 The present invention relates to a document processing apparatus, a document processing method, and a document processing program that handle an electronic document including a document image, and more particularly to a technique for generating browsing navigation information for a content area included in a document image.

省資源や省スペースの観点から、紙原稿などに記載された文書を電子化文書に変換して管理する文書管理システムが実用化されている。このような文書管理システムでは、スキャナ装置などを用いて原稿を読取ることで文書画像を生成し、これらの文書画像から電子化文書を生成する。 From the viewpoint of resource saving and space saving, a document management system that converts and manages a document described on a paper manuscript or the like into an electronic document has been put into practical use. In such a document management system, a document image is generated by reading a document using a scanner device or the like, and an electronic document is generated from these document images.

このような文書画像は、文書を画素の集合である画像（イメージ）として格納するので、原稿文書に含まれる文字列や図表などの内容を特定するためのデータ（代表的に、テキストデータなど）を元来含んでいない。また、紙原稿などに記載された文書が電子化して利用できるようにデザインされているとは限らない。そのため、全ての文書画像を一度には表示できないコンピュータ上の閲覧ソフト（ビューア）を用いて電子化文書を閲覧しようとする場合には、ユーザは探索的にスクロール（表示画面切替え）を行なう必要があり、非常に手間のかかる作業であった。 Since such a document image stores the document as an image (image) that is a set of pixels, data (typically, text data, etc.) for specifying contents such as character strings and charts included in the original document Does not contain. In addition, a document described on a paper manuscript or the like is not necessarily designed so that it can be used electronically. Therefore, when browsing an electronic document by using browsing software (viewer) on a computer that cannot display all document images at once, the user needs to perform scrolling (display screen switching) in an exploratory manner. There was a lot of work.

このような電子化文書に対して、文書画像上の注目すべき箇所へ閲覧ナビゲート情報（代表的に、「しおり」もしくは「電子しおり」）をユーザが対話的に設定可能なアプリケーションソフトが実用化されている。このようなしおりを注目すべき箇所に予め設定しておくことで、ユーザは探索的なスクロールを行なうことなく、注目すべき箇所を素早く閲覧することができる。 Application software that allows users to interactively set browsing navigation information (typically "bookmarks" or "electronic bookmarks") to places of note on document images for such electronic documents is practical. It has become. By setting such a bookmark in a place to be noticed in advance, the user can quickly browse the place to be noticed without performing exploratory scrolling.

ここで、ユーザが文書画像を確認しながら対話的にしおりを設定することは、非常に手間のかかる作業であり、多数の紙原稿などから電子化文書を生成する場合などに適用するのは現実的ではない。そこで、たとえば特開平０９−１３４４０６号公報（特許文献１）などに記載されるような、文書画像から必要とする部分領域を取り出して認識する技術を用いることで、しおりを設定すべき箇所を探す作業を支援することも考えられる。
特開平０９−１３４４０６号公報 Here, setting a bookmark interactively while checking a document image is a very time-consuming work, and it is actually applied when generating an electronic document from a large number of paper originals. Not right. In view of this, for example, a technique for extracting and recognizing a necessary partial area from a document image as described in Japanese Patent Application Laid-Open No. 09-134406 (Patent Document 1) is used to search for a place where a bookmark should be set. Supporting the work can also be considered.
JP 09-134406 A

しかしながら、特開平０９−１３４４０６号公報（特許文献１）にはしおりを設定する構成については開示がなく、このような認識技術を用いてもユーザが対話的にしおりを設定する作業は依然として存在する。そのため、多数の紙原稿などから電子化文書を生成する際に、注目すべき箇所に効率的にしおりを設定することは困難であった。 However, Japanese Patent Laid-Open No. 09-134406 (Patent Document 1) does not disclose a configuration for setting a bookmark, and there is still work for a user to set a bookmark interactively even if such a recognition technique is used. . For this reason, when generating an electronic document from a large number of paper originals, it has been difficult to efficiently set bookmarks at points to be noted.

また、予めしおりが設定された電子化文書を閲覧しようとする場合において、元原稿のページ領域と閲覧ソフトとして表示される領域（サイズ）とが大きく異なる場合には、表示領域内に全くしおりが含まれない状態や、逆に表示領域内に多数のしおりが含まれる状態が生じ、ユーザが電子化文書を閲覧する効率が低下するという課題がある。 Further, when an attempt is made to browse a digitized document in which bookmarks are set in advance, if the page area of the original document and the area (size) displayed as the browsing software are greatly different, there is no bookmark in the display area. There is a problem that a state in which the display area is not included or a state in which a large number of bookmarks are included in the display area occurs, and the efficiency of the user viewing the digitized document is reduced.

そこで、この発明は、かかる問題を解決するためになされたものであり、その第１の目的は、文書画像を含む電子化文書を生成する際に、文書画像に含まれる内容領域に応じて効率的に閲覧ナビゲート情報を生成できる文書処理装置、文書処理方法および文書処理プログラムを提供することである。また、第２の目的は、文書画像を含む電子化文書を表示する際に、当該文書画像を表示する表示手段の特性に応じて効率的な閲覧ナビゲート情報を生成する文書処理装置、文書処理方法および文書処理プログラムを提供することである。 Accordingly, the present invention has been made to solve such a problem, and a first object of the present invention is to generate an electronic document including a document image in accordance with the content area included in the document image. It is to provide a document processing apparatus, a document processing method, and a document processing program that can generate browsing navigation information. A second object of the present invention is to provide a document processing apparatus and document processing for generating efficient browsing navigation information according to the characteristics of display means for displaying a document image when displaying an electronic document including the document image. A method and document processing program are provided.

この発明のある局面に従えば、文書画像を含む電子化文書を生成する文書処理装置であって、文書画像から少なくとも１つの内容領域を抽出し、内容領域について属性情報を取得する取得手段を備え、属性情報は、内容領域の文書画像内での位置を示す位置情報を含み、さらに内容領域の文書画像内での位置を特定するための閲覧ナビゲート情報を生成する情報生成手段を備える。情報生成手段は、属性情報に基づいて、少なくとも１つの内容領域を少なくとも１つのグループに分類する分類手段と、各グループに所属する内容領域の文書画像内での位置に基づいて、グループの各々を評価する評価手段と、評価手段による評価結果に基づいて、少なくとも１つのグループの中から閲覧ナビゲート情報の生成対象とするグループを選択する選択手段とを含む。 According to an aspect of the present invention, there is provided a document processing apparatus that generates an electronic document including a document image, and includes an acquisition unit that extracts at least one content area from the document image and acquires attribute information about the content area. The attribute information includes position information indicating the position of the content area in the document image, and further includes information generation means for generating browsing navigation information for specifying the position of the content area in the document image. The information generation means classifies at least one content area into at least one group based on the attribute information, and classifies each group based on the position of the content area belonging to each group in the document image. Evaluation means for evaluation, and selection means for selecting a group for generating browsing navigation information from at least one group based on an evaluation result by the evaluation means.

好ましくは、原稿を読取ることで文書画像を生成する画像読取手段と、文書画像に閲覧ナビゲート情報を付加することで電子化文書を生成する文書生成手段とをさらに備える。 Preferably, an image reading unit that generates a document image by reading a document and a document generation unit that generates a digitized document by adding browsing navigation information to the document image are further provided.

好ましくは、文書画像は、ページ単位で区分されており、評価手段は、各グループに所属する内容領域のページ毎の出現数に基づいてグループの各々を評価する。 Preferably, the document image is divided in units of pages, and the evaluation unit evaluates each group based on the number of appearances of the content area belonging to each group for each page.

さらに好ましくは、評価手段は、所属する内容領域がより多くのページに出現するグループに対して相対的に高い評価を与え、選択手段は、相対的に高い評価を与えられたグループを選択する。 More preferably, the evaluation unit gives a relatively high evaluation to a group in which the content region to which the user belongs appears on more pages, and the selection unit selects a group given a relatively high evaluation.

さらに好ましくは、評価手段は、さらに、所属する内容領域のページ毎の出現数の最大値が所定範囲内であるグループに対して、出現数の最大値が所定範囲外であるグループに比較して相対的に高い評価を与える。 More preferably, the evaluation means further compares the group whose maximum number of appearances per page of the content area to which it belongs is within a predetermined range with a group whose maximum number of appearances is outside the predetermined range. Give a relatively high rating.

好ましくは、情報生成手段は、選択手段が複数のグループを選択する場合に、複数のグループに含まれる内容領域の文書画像内での位置に基づいて、グループ間の従属関係を決定する従属関係決定手段をさらに含む。 Preferably, when the selection unit selects a plurality of groups, the information generation unit determines a dependency relationship between the groups based on the position of the content area included in the plurality of groups in the document image. Means are further included.

好ましくは、内容領域は、文字列、段落、図、表、写真、の少なくともいずれかを含む。 Preferably, the content area includes at least one of a character string, a paragraph, a figure, a table, and a photograph.

この発明の別の局面に従えば、文書画像を含む電子化文書を生成する文書処理方法であって、文書画像から少なくとも１つの内容領域を抽出し、内容領域について属性情報を取得するステップを備え、属性情報は、内容領域の文書画像内での位置を示す位置情報を含み、さらに内容領域の文書画像内での位置を特定するための閲覧ナビゲート情報を生成するステップを備える。属性情報を取得するステップは、属性情報に基づいて、少なくとも１つの内容領域を少なくとも１つのグループに分類するステップと、各グループに所属する内容領域の文書画像内での位置に基づいて、グループの各々を評価するステップと、評価するステップによる評価結果に基づいて、少なくとも１つのグループの中から閲覧ナビゲート情報の生成対象とするグループを選択するステップとを含む。 According to another aspect of the present invention, there is provided a document processing method for generating an electronic document including a document image, comprising: extracting at least one content area from the document image and acquiring attribute information about the content area. The attribute information includes position information indicating the position of the content area in the document image, and further includes a step of generating browsing navigation information for specifying the position of the content area in the document image. The step of acquiring the attribute information includes a step of classifying at least one content area into at least one group based on the attribute information, and a position of the group based on the position of the content area belonging to each group in the document image. And a step of evaluating each, and a step of selecting a group for which browsing navigation information is to be generated from at least one group based on an evaluation result of the step of evaluating.

この発明のさらに別の局面に従えば、上記の文書処理方法をコンピュータに実行させる文書処理プログラムである。 According to still another aspect of the present invention, there is provided a document processing program for causing a computer to execute the above document processing method.

この発明のさらに別の局面に従えば、文書画像を含む電子化文書を処理する文書処理装置であって、電子化文書は、文書画像に含まれる内容領域に対応付けて、当該内容領域の所属するグループの種類および当該内容領域の文書画像内での位置が規定されている属性情報を含み、内容領域の文書画像内での位置を特定するための閲覧ナビゲート情報を生成する情報生成手段と、閲覧ナビゲート情報とともに文書画像を表示する表示手段とを備える。情報生成手段は、表示手段の表示特性を取得する表示特性取得手段と、表示手段の表示特性に応じて少なくとも１つの閲覧ページ領域を設定する領域設定手段と、各グループに所属する内容領域の閲覧ページ毎の出現数に基づいてグループの各々を評価する評価手段と、評価手段による評価結果に基づいて、少なくとも１つのグループの中から閲覧ナビゲート情報の生成対象とするグループを選択する選択手段とを含む。 According to still another aspect of the present invention, there is provided a document processing apparatus for processing an electronic document including a document image, the electronic document being associated with a content area included in the document image and belonging to the content area Information generating means for generating browsing navigation information for specifying the position of the content area in the document image, including attribute information in which the type of the group to be performed and the position of the content area in the document image are defined Display means for displaying a document image together with browsing navigation information. The information generation means includes display characteristic acquisition means for acquiring display characteristics of the display means, area setting means for setting at least one browsing page area according to the display characteristics of the display means, and browsing of content areas belonging to each group An evaluation unit that evaluates each of the groups based on the number of appearances per page, and a selection unit that selects a group for which browsing navigation information is to be generated from at least one group based on an evaluation result by the evaluation unit; including.

好ましくは、評価手段は、所属する内容領域がより多くの閲覧ページに出現するグループに対して相対的に高い評価を与え、選択手段は、相対的に高い評価を与えられたグループを選択する。 Preferably, the evaluation unit gives a relatively high evaluation to a group in which the content region to which the user belongs appears in more browsing pages, and the selection unit selects a group given a relatively high evaluation.

さらに好ましくは、評価手段は、所属する内容領域の閲覧ページ毎の出現数の最大値が所定範囲内であるグループに対して、出現数の最大値が所定範囲外であるグループに比較して相対的に高い評価を与える。 More preferably, the evaluation means is relative to a group in which the maximum number of appearances per viewing page of the content area to which the user belongs is within a predetermined range compared to a group in which the maximum number of appearances is outside the predetermined range. Highly evaluated.

好ましくは、情報生成手段は、閲覧ナビゲート情報として、閲覧ページのうち内容領域が出現しない閲覧ページに対して、当該ページを特定するための情報を付加する付加手段をさらに含む。 Preferably, the information generation means further includes an addition means for adding information for specifying the page to the browsing page in which the content area does not appear as the browsing navigation information.

好ましくは、評価手段は、閲覧環境に応じて評価をするための基準を変更する。
この発明のさらに別の局面に従えば、文書画像を含む電子化文書を処理する文書処理方法であって、電子化文書は、文書画像に含まれる内容領域に対応付けて、当該内容領域の所属するグループの種類および当該内容領域の文書画像内での位置が規定されている属性情報を含み、内容領域の文書画像内での位置を特定するための閲覧ナビゲート情報を生成するステップと、閲覧ナビゲート情報とともに文書画像を表示部に表示するステップとを備える。閲覧ナビゲート情報を生成するステップは、表示部の表示特性を取得するステップと、表示部の表示特性に応じて少なくとも１つの閲覧ページ領域を設定するステップと、各グループに所属する内容領域の閲覧ページ毎の出現数に基づいてグループの各々を評価するステップと、グループの各々を評価するステップによる評価結果に基づいて、少なくとも１つのグループの中から閲覧ナビゲート情報の生成対象とするグループを選択する選択ステップとを含む。 Preferably, the evaluation unit changes a reference for evaluation according to a browsing environment.
According to still another aspect of the present invention, there is provided a document processing method for processing an electronic document including a document image, the electronic document being associated with the content area included in the document image and belonging to the content area Generating browsing navigation information for specifying the position of the content area in the document image, including attribute information that defines the type of group to be performed and the position of the content area in the document image; And displaying the document image on the display unit together with the navigation information. The steps of generating browsing navigation information include: acquiring display characteristics of the display unit; setting at least one browsing page area according to the display characteristics of the display unit; and browsing content areas belonging to each group Select a group for generating browsing navigation information from at least one group based on the evaluation result of the step of evaluating each group based on the number of appearances per page and the step of evaluating each group Selecting step.

この発明のさらに別の局面に従えば、上記の文書処理方法をコンピュータに実行させる、文書処理プログラムである。 According to still another aspect of the present invention, there is provided a document processing program for causing a computer to execute the above document processing method.

この発明によれば、文書画像を含む電子化文書を生成する際に、文書画像に含まれる内容領域に応じて効率的に閲覧ナビゲート情報を生成できる。また、この発明によれば、文書画像を含む電子化文書を表示する際に、当該文書画像を表示する表示手段の特性に応じて効率的な閲覧ナビゲート情報を生成できる。 According to the present invention, when an electronic document including a document image is generated, browsing navigation information can be efficiently generated according to the content area included in the document image. Further, according to the present invention, when an electronic document including a document image is displayed, efficient browsing navigation information can be generated according to the characteristics of the display means for displaying the document image.

この発明の実施の形態について、図面を参照しながら詳細に説明する。なお、図中の同一または相当部分については、同一符号を付してその説明は繰返さない。 Embodiments of the present invention will be described in detail with reference to the drawings. Note that the same or corresponding parts in the drawings are denoted by the same reference numerals and description thereof will not be repeated.

［実施の形態１］
（全体システム構成）
図１は、この発明の実施の形態１に従う文書処理装置を含むシステムの概略構成図である。本実施の形態においては、代表的に、本発明に係る文書処理装置を搭載するＭＦＰ（Multi Function Peripheral）について説明する。なお、本発明に係る文書処理装置は、ＭＦＰに限らず、複写機、ファクシミリ装置、スキャナ装置などにも適用可能である。 [Embodiment 1]
(Overall system configuration)
FIG. 1 is a schematic configuration diagram of a system including a document processing apparatus according to the first embodiment of the present invention. In the present embodiment, an MFP (Multi Function Peripheral) equipped with the document processing apparatus according to the present invention will be typically described. The document processing apparatus according to the present invention is not limited to an MFP, and can be applied to a copying machine, a facsimile apparatus, a scanner apparatus, and the like.

図１を参照して、本実施の形態に従うＭＦＰ１は、原稿３００を読取るための画像読取部１０４と、紙媒体などへの印刷処理を行なうためのプリント部１０６とを含んで構成される。 Referring to FIG. 1, MFP 1 according to the present embodiment is configured to include an image reading unit 104 for reading a document 300 and a printing unit 106 for performing a printing process on a paper medium or the like.

特に、本実施の形態に従うＭＦＰ１は、画像読取部１０４で原稿３００を読取ることで文書画像を取得し、この文書画像を含む電子化文書４００を生成する。代表的に、電子化文書４００にはＰＤＦ（Portable Document Format）などのフォーマットを採用できる。この際、ＭＦＰ１は、文書画像から少なくとも１つの内容領域を抽出し、各内容領域について属性情報を取得するとともに、抽出した内容領域のうち特定の内容領域に対して文書画像内での位置を特定するための閲覧ナビゲート情報を生成する。 In particular, MFP 1 according to the present embodiment obtains a document image by reading document 300 by image reading unit 104 and generates an electronic document 400 including the document image. Typically, the electronic document 400 can employ a format such as PDF (Portable Document Format). At this time, the MFP 1 extracts at least one content area from the document image, acquires attribute information for each content area, and specifies a position in the document image with respect to a specific content area among the extracted content areas. Browsing navigation information to generate.

本明細書において「内容領域」とは、文書に含まれる情報資源であり、たとえば文字列や段落・図・表・写真などの内容要素（コンテンツ）である。また、本明細書において「閲覧ナビゲート情報」とは、ユーザによる電子化文書に含まれる文書画像の閲覧を支援するための情報であり、より具体的には、当該文書画像に含まれる内容領域のうち所定のものが存在する位置を特定するための情報である。このような閲覧ナビゲート情報は、一例として「しおり（bookmark）」、「注釈」、「スレッド」、「リンク」などを含み、内容領域の位置を特定するための情報に加えて、対応する内容領域のサムネイル（縮小画像）などを含めてもよい。本実施の形態においては、特に「閲覧ナビゲート情報」の代表例として「しおり」を用いる構成について説明する。 In this specification, a “content area” is an information resource included in a document, and is, for example, a content element (content) such as a character string, paragraph, figure, table, or photograph. Further, in this specification, “browsing navigation information” is information for supporting browsing of a document image included in an electronic document by a user, and more specifically, a content area included in the document image. This is information for specifying a position where a predetermined one exists. Such browsing navigation information includes "bookmark", "annotation", "thread", "link", etc. as an example, and in addition to information for specifying the position of the content area, the corresponding content An area thumbnail (reduced image) or the like may be included. In the present embodiment, a configuration using “bookmark” as a representative example of “browsing navigation information” will be described.

ＭＦＰ１は、生成した電子化文書４００を自身の記憶部（図示しない）に格納したり、ネットワークを介してパーソナルコンピュータＰＣ１，ＰＣ２，ＰＣ３（以下、「パーソナルコンピュータＰＣ」とも総称する）や携帯端末ＭＴに送信したりする。代表的な使用形態として、ＭＦＰ１が設置されている同一のオフィス内に敷設されたネットワークであるＬＡＮ（Local Area Network）に接続されているパーソナルコンピュータＰＣ１，ＰＣ２に対しては、ＭＦＰ１から電子化文書４００が直接的に送信される。一方、ＬＡＮとＷＡＮ（Wide Area Network）との接続点には、サーバ装置ＳＲＶが設けてあり、ＭＦＰ１とは離れたオフィスにあるパーソナルコンピュータＰＣ３などに対しては、ＭＦＰ１からサーバ装置ＳＲＶを介して電子化文書４００が送信される。さらに、携帯端末ＭＴには、ＷＡＮおよび公衆携帯電話網や無線ＬＡＮなどの無線ネットワーク回線（図示しない）を介して、ＭＦＰ１から電子化文書４００が送信される。ここで、サーバ装置ＳＲＶは代表的に、メールサーバ、ＦＴＰ（File Transfer Protocol）サーバ、Ｗｅｂサーバ、ＳＭＢサーバなどからなる。 The MFP 1 stores the generated electronic document 400 in its own storage unit (not shown), or personal computers PC1, PC2, and PC3 (hereinafter also collectively referred to as “personal computer PC”) and a portable terminal MT via a network. Or send to. As a typical usage mode, the personal computer PC1 and PC2 connected to a LAN (Local Area Network) that is a network laid in the same office where the MFP 1 is installed are transferred from the MFP 1 to an electronic document. 400 is sent directly. On the other hand, a server SRV is provided at a connection point between a LAN and a WAN (Wide Area Network). A personal computer PC3 or the like located in an office remote from the MFP 1 is connected from the MFP 1 via the server SRV. An electronic document 400 is transmitted. Further, the electronic document 400 is transmitted from the MFP 1 to the mobile terminal MT via a WAN, a public mobile phone network, a wireless network line (not shown) such as a wireless LAN. Here, the server SRV typically includes a mail server, an FTP (File Transfer Protocol) server, a Web server, an SMB server, and the like.

画像読取部１０４は、原稿をセットするための戴荷台と、原稿台ガラスと、戴荷台にセットされた原稿を原稿台ガラスに自動的に一枚ずつ搬送する搬送部と、読取られた原稿を排出するための排出台とを含む（いずれも図示しない）。これにより、複数枚の原稿を連続的に読取って、一つの電子化文書４００として生成することができる。 The image reading unit 104 includes a loading table for setting a document, a document table glass, a conveyance unit that automatically conveys the documents set on the loading table one by one to the document table glass, and a scanned document. And a discharge stand for discharging (both not shown). As a result, a plurality of documents can be continuously read and generated as one electronic document 400.

（ＭＦＰの概略構成）
図２は、この発明の実施の形態１に従うＭＦＰ１における概略構成を示すブロック図である。 (Schematic configuration of MFP)
FIG. 2 is a block diagram showing a schematic configuration in MFP 1 according to the first embodiment of the present invention.

図２を参照して、ＭＦＰ１は、制御部１００と、メモリ部１０２と、画像読取部１０４と、プリント部１０６と、通信インターフェイス部１０８と、データ格納部１１０とを含む。 Referring to FIG. 2, MFP 1 includes a control unit 100, a memory unit 102, an image reading unit 104, a printing unit 106, a communication interface unit 108, and a data storage unit 110.

制御部１００は、代表的にＣＰＵ（Central Processing Unit）などの演算装置から構成され、プログラムを実行することで本実施の形態に従う文書処理を実現する。メモリ部１０２は、代表的にＤＲＡＭ（Dynamic Random Access Memory）などの揮発性の記憶装置であり、制御部１００で実行されるプログラムやプログラムの実行に必要なデータなどを保持する。通信インターフェイス部１０８は、代表的に、ネットワーク（たとえば、図１に示すＬＡＮ）を介してパーソナルコンピュータＰＣ（図１）や携帯端末ＭＴとの間でデータを送受信するための部位であり、たとえば、ＬＡＮアダプタおよびそれを制御するドライバソフトなどを含む。プリント部１０６は、プリント処理を行なうための部位であり、プリント処理に係るハードウェア構成に加えて、各部の作動を制御するための制御装置をも含む。データ格納部１１０は、代表的にハードディスク装置やフラッシュメモリなどの不揮発性の記憶装置であり、制御部１００で生成された電子化文書４００などを格納する。 The control unit 100 is typically configured by an arithmetic device such as a CPU (Central Processing Unit), and implements document processing according to the present embodiment by executing a program. The memory unit 102 is typically a volatile storage device such as a DRAM (Dynamic Random Access Memory), and holds a program executed by the control unit 100 and data necessary for executing the program. The communication interface unit 108 is typically a part for transmitting and receiving data to and from the personal computer PC (FIG. 1) and the portable terminal MT via a network (for example, the LAN shown in FIG. 1). Includes a LAN adapter and driver software for controlling it. The print unit 106 is a part for performing print processing, and includes a control device for controlling the operation of each unit in addition to the hardware configuration related to print processing. The data storage unit 110 is typically a non-volatile storage device such as a hard disk device or a flash memory, and stores the electronic document 400 generated by the control unit 100.

（パーソナルコンピュータの構成）
図３は、この発明の実施の形態１に従うパーソナルコンピュータＰＣの概略構成を示すブロック図である。 (Configuration of personal computer)
FIG. 3 is a block diagram showing a schematic configuration of the personal computer PC according to the first embodiment of the present invention.

図３を参照して、パーソナルコンピュータＰＣは、オペレーティングシステム（ＯＳ：Operating System）を含む各種プログラムを実行するＣＰＵ（Central Processing Unit）２０１と、ＣＰＵ２０１でのプログラムの実行に必要なデータを一時的に記憶するメモリ部２１３と、ＣＰＵ２０１で実行されるプログラムを不揮発的に記憶するハードディスク部（ＨＤＤ：Hard Disk Drive）２１１とを含む。また、ハードディスク部２１１には、ＭＦＰ１で生成された電子化文書を表示するための閲覧アプリケーションが記憶されており、このようなプログラムは、ＦＤＤドライブ２１７またはＣＤ−ＲＯＭドライブ２１５によって、それぞれフレキシブルディスク２１７ａまたはＣＤ−ＲＯＭ（Compact Disk-Read Only Memory）２１５ａなどから読取られる。 Referring to FIG. 3, personal computer PC temporarily stores a CPU (Central Processing Unit) 201 that executes various programs including an operating system (OS) and data necessary for the CPU 201 to execute the program. A memory unit 213 that stores data and a hard disk unit (HDD: Hard Disk Drive) 211 that stores programs executed by the CPU 201 in a nonvolatile manner are included. The hard disk unit 211 stores a browsing application for displaying an electronic document generated by the MFP 1, and such a program is stored in the flexible disk 217 a by the FDD drive 217 or the CD-ROM drive 215, respectively. Alternatively, it is read from a CD-ROM (Compact Disk-Read Only Memory) 215a or the like.

ＣＰＵ２０１は、キーボードやマウスなどからなる入力部２０９を介してユーザからの指示を受取るとともに、プログラムの実行によって生成される画面出力をディスプレイ部２０５へ出力する。また、ＣＰＵ２０１は、ＬＡＮカードなどからなる通信インターフェイス部２０７を介して、ＬＡＮやＷＡＮに接続されたＭＦＰ１やサーバ装置ＳＲＶ（図１）から電子化文書を取得し、ハードディスク部２１１などに格納する。また、上述の各部は、内部バス２０３を介して相互にデータを授受する。 The CPU 201 receives an instruction from the user via the input unit 209 including a keyboard and a mouse, and outputs a screen output generated by executing the program to the display unit 205. Further, the CPU 201 acquires a digitized document from the MFP 1 or server SRV (FIG. 1) connected to the LAN or WAN via the communication interface unit 207 including a LAN card and stores the digitized document in the hard disk unit 211 or the like. Further, the above-described units exchange data with each other via the internal bus 203.

なお、携帯端末ＭＴについては、図３においてＦＤＤドライブ２１７やＣＤ−ＲＯＭドライブ２１５などを取り除いたものとほぼ等価であるので、詳細な説明は繰返さない。 Since mobile terminal MT is substantially equivalent to the mobile terminal MT with FDD drive 217 and CD-ROM drive 215 removed in FIG. 3, detailed description thereof will not be repeated.

（パーソナルコンピュータＰＣにおける電子化文書の表示画面）
ＣＰＵ２０１がハードディスク部２１１に記憶された閲覧アプリケーションを実行することで、ディスプレイ部２０５上には図４に示すような形態で電子化文書が表示される。 (Display screen of digitized document on personal computer PC)
When the CPU 201 executes the browsing application stored in the hard disk unit 211, the digitized document is displayed on the display unit 205 in the form as shown in FIG.

図４は、この発明の実施の形態１に従うパーソナルコンピュータＰＣにおける電子化文書の表示画面の一例を模式的に示した図である。 FIG. 4 schematically shows an example of a display screen of an electronic document in personal computer PC according to the first embodiment of the present invention.

図４を参照して、ディスプレイ部２０５（図３）上には一例として、文書表示領域５００と、閲覧ナビゲート情報表示領域５１０とが形成される。文書表示領域５００には、電子化文書に含まれる文書画像のうち所定範囲が表示され、閲覧ナビゲート情報表示領域５１０には、電子化文書に含まれる閲覧ナビゲート情報に基づいてアイコン５１２，５１４，５１６が表示される。 Referring to FIG. 4, as an example, a document display area 500 and a browsing navigation information display area 510 are formed on display 205 (FIG. 3). A predetermined range of the document image included in the digitized document is displayed in the document display area 500, and icons 512 and 514 are displayed in the browse navigation information display area 510 based on the browse navigation information included in the digitized document. , 516 are displayed.

これらの閲覧ナビゲート情報は、内容領域の文書画像内での位置を示す位置情報を含んでおり、ユーザがアイコン５１２を選択（代表的には、図示しないマウスなどによるクリック動作）すれば、文書表示領域５００では、しおり位置５０２が文書表示領域５００内の所定位置（代表的に、文書表示領域５００の最上部）と一致するように、文書画像の表示領域が変化（スクロール）する。同様に、ユーザがアイコン５１４および５１６を選択すれば、それぞれしおり位置５０４および５０６が文書表示領域５００内の所定位置となるように、文書画像の表示領域が変化する。 The browsing navigation information includes position information indicating the position of the content area in the document image. If the user selects the icon 512 (typically, a click operation using a mouse or the like not shown), the document is displayed. In the display area 500, the display area of the document image changes (scrolls) so that the bookmark position 502 coincides with a predetermined position in the document display area 500 (typically, the uppermost part of the document display area 500). Similarly, when the user selects icons 514 and 516, the display area of the document image changes so that bookmark positions 504 and 506 are predetermined positions in document display area 500, respectively.

また、アイコン５１２，５１４，５１６の間には、それぞれ対応する内容領域の階層構造に応じた従属関係が規定されている。すなわち、図４ではアイコン５１２にアイコン５１４および５１６が従属する例を示す。なお、図４に示すアイコン５１２，５１４，５１６には文字などの表示はなされていないが、対応する内容領域の種別（一例として、「大見出し」および「中見出し」など）を付加的に表示してもよく、さらにアイコンとして対応する内容領域の縮小画像（サムネイル画像）などを用いてもよい。 Further, between the icons 512, 514, and 516, a dependency relationship is defined according to the hierarchical structure of the corresponding content area. That is, FIG. 4 shows an example in which icons 514 and 516 are subordinate to the icon 512. Note that the icons 512, 514, and 516 shown in FIG. 4 do not display characters or the like, but additionally display the type of the corresponding content area (for example, “large headline” and “medium headline”). Alternatively, a reduced image (thumbnail image) of the corresponding content area may be used as an icon.

このように、ユーザは、文書画像内の特定の内容領域と対応付けたしおりアイコンを参照して、必要な内容領域を効率的に検索することができる。 Thus, the user can efficiently search for a necessary content area by referring to the bookmark icon associated with a specific content area in the document image.

（ＭＦＰの機能的構成）
図５は、この発明の実施の形態１に従うＭＦＰ１における機能構成を示すブロック図である。これらの機能は、主としてＭＦＰ１の制御部１００やメモリ部１０２（図２）などによって実現される。 (Functional configuration of MFP)
FIG. 5 is a block diagram showing a functional configuration in MFP 1 according to the first embodiment of the present invention. These functions are mainly realized by the control unit 100 and the memory unit 102 (FIG. 2) of the MFP 1.

図５を参照して、ＭＦＰ１の機能構成としては、画像読取部１０４と、画像前処理部１２と、画像バッファ部１３と、圧縮処理部１４と、電子化文書生成部１５と、画像解析部１６と、しおりデータ生成部１７と、送信部１８と、画像処理部１９と、プリント部１０６とを含む。 Referring to FIG. 5, the functional configuration of MFP 1 includes an image reading unit 104, an image preprocessing unit 12, an image buffer unit 13, a compression processing unit 14, a digitized document generation unit 15, and an image analysis unit. 16, a bookmark data generation unit 17, a transmission unit 18, an image processing unit 19, and a printing unit 106.

画像読取部１０４は、原稿３００を読取って文書画像を取得し、その文書画像を画像前処理部１２へ出力する。画像前処理部１２は、主としてパーソナルコンピュータＰＣなどでの表示に適するように、文書画像の表示特性などを調整する。さらに、画像前処理部１２が文書画像に含まれるノイズを除去してもよい。そして、画像前処理部１２で画像処理が施された文書画像は、画像バッファ部１３へ送られる。画像バッファ部１３は、取得された文書画像のデータを一時的に格納する部位であり、一旦格納した文書画像を圧縮処理部１４、画像解析部１６および画像処理部１９へ出力する。圧縮処理部１４は、画像バッファ部１３から出力される文書画像を圧縮処理して、電子化文書生成部１５へ出力する。この圧縮処理による圧縮度合いは、生成される電子化文書の大きさや、要求される文書画像の解像度などに応じて変化させてもよく、また圧縮処理はＪＰＥＧ（Joint Photographic Experts Group）などの非可逆変換であってもよい。なお、高解像度が要求される場合などには、圧縮処理を省略してもよい。 The image reading unit 104 reads the document 300 to acquire a document image, and outputs the document image to the image preprocessing unit 12. The image preprocessing unit 12 adjusts the display characteristics of the document image so as to be suitable mainly for display on a personal computer PC or the like. Further, the image preprocessing unit 12 may remove noise included in the document image. Then, the document image subjected to the image processing by the image preprocessing unit 12 is sent to the image buffer unit 13. The image buffer unit 13 is a part for temporarily storing the acquired document image data, and outputs the stored document image to the compression processing unit 14, the image analysis unit 16, and the image processing unit 19. The compression processing unit 14 compresses the document image output from the image buffer unit 13 and outputs the compressed document image to the digitized document generation unit 15. The degree of compression by this compression process may be changed according to the size of the generated electronic document and the required resolution of the document image, and the compression process is irreversible such as JPEG (Joint Photographic Experts Group). It may be a conversion. Note that the compression process may be omitted when high resolution is required.

画像解析部１６は、画像バッファ部１３から出力される文書画像から内容領域を抽出し、さらに抽出した内容領域についての属性情報を取得する。ここで属性情報には、内容領域毎に、文書画像内での位置、当該領域に含まれる文字の大きさ、当該領域に含まれる文字の色、当該領域の背景の色などが含まれる。これらの属性情報は、しおりデータ生成部１７へ送られる。しおりデータ生成部１７は、画像解析部１６から出力される内容領域の属性情報に基づいて、抽出された内容領域のうち特定のものに対してしおりデータを生成する。そして、しおりデータ生成部１７は、生成したしおりデータを電子化文書生成部１５へ出力する。 The image analysis unit 16 extracts a content area from the document image output from the image buffer unit 13 and acquires attribute information about the extracted content area. Here, the attribute information includes, for each content area, the position in the document image, the size of the character included in the area, the color of the character included in the area, the background color of the area, and the like. These pieces of attribute information are sent to the bookmark data generation unit 17. The bookmark data generation unit 17 generates bookmark data for a specific one of the extracted content areas based on the attribute information of the content area output from the image analysis unit 16. Then, the bookmark data generation unit 17 outputs the generated bookmark data to the digitized document generation unit 15.

電子化文書生成部１５は、圧縮処理部１４で圧縮された文書画像に、しおりデータ生成部１７からのしおりデータを付加することで、電子化文書を生成する。そして、この生成された電子化文書は、ユーザによる設定などに応じて、データ格納部１１０へ格納され、もしくは送信部１８へ出力される。送信部１８は、通信インターフェイス部１０８によって実現され、ＬＡＮなどのネットワークを介してパーソナルコンピュータＰＣ（図１）などへ電子化文書生成部１５で生成された電子化文書を送信する。 The digitized document generation unit 15 generates a digitized document by adding the bookmark data from the bookmark data generation unit 17 to the document image compressed by the compression processing unit 14. Then, the generated electronic document is stored in the data storage unit 110 or output to the transmission unit 18 according to the setting by the user. The transmission unit 18 is realized by the communication interface unit 108, and transmits the digitized document generated by the digitized document generation unit 15 to a personal computer PC (FIG. 1) or the like via a network such as a LAN.

一方、画像処理部１９は、ユーザ操作に応じて、画像バッファ部１３から出力される文書画像をプリント部１０６でのプリント動作に適した画像に変換する。代表的に、ＲＧＢ表示系で規定された文書画像をカラープリントに適したＣＭＹＫ表示系の画像データなどに変換する。このとき、プリント部１０６の特性に応じた色調整を行なってもよい。プリント部１０６は、画像処理部１９から出力される画像データに基づいて紙媒体などへの印刷処理を行なう。 On the other hand, the image processing unit 19 converts the document image output from the image buffer unit 13 into an image suitable for the printing operation in the printing unit 106 according to a user operation. Typically, a document image defined by an RGB display system is converted into image data of a CMYK display system suitable for color printing. At this time, color adjustment according to the characteristics of the print unit 106 may be performed. The printing unit 106 performs a printing process on a paper medium or the like based on the image data output from the image processing unit 19.

なお、図５に示す各機能ブロックと本願発明との対応関係については、画像解析部１６が「取得手段」に相当し、しおりデータ生成部１７が「情報生成手段」、「分類手段」、「評価手段」、「選択手段」に相当し、画像読取部１０４が「画像読取手段」に相当し、電子化文書生成部１５が「文書生成手段」に相当する。 For the correspondence between each functional block shown in FIG. 5 and the present invention, the image analysis unit 16 corresponds to “acquisition unit”, and the bookmark data generation unit 17 performs “information generation unit”, “classification unit”, “ The image reading unit 104 corresponds to an “image reading unit”, and the digitized document generation unit 15 corresponds to a “document generation unit”.

（電子化文書の生成処理手順）
図６は、この発明の実施の形態１に従う電子化文書の生成処理の具体例を示すフローチャートである。図６のフローチャートに示される処理は、制御部１００（図２）がメモリ部１０２（図２）にプログラムを読出して実行し、図５に示される各機能を制御することで実現される。 (Digitized document generation procedure)
FIG. 6 is a flowchart showing a specific example of the digitized document generation process according to the first embodiment of the present invention. The processing shown in the flowchart of FIG. 6 is realized by the control unit 100 (FIG. 2) reading and executing a program in the memory unit 102 (FIG. 2) and controlling each function shown in FIG. 5.

図５および図６を参照して、まず、画像読取部１０４がユーザ設定などに応じて原稿３００を読取って文書画像を生成する（ステップＳ１００）。次に、画像前処理部１２がこの生成された文書画像を調整する（ステップＳ１０２）。そして、調整後の文書画像は、画像バッファ部１３に格納される。 Referring to FIGS. 5 and 6, first, image reading unit 104 reads document 300 according to a user setting or the like to generate a document image (step S 100). Next, the image preprocessing unit 12 adjusts the generated document image (step S102). The adjusted document image is stored in the image buffer unit 13.

続いて、圧縮処理部１４が、画像バッファ部１３に格納された文書画像を圧縮処理して、電子化文書生成部１５へ出力する（ステップＳ１０４）。 Subsequently, the compression processing unit 14 compresses the document image stored in the image buffer unit 13 and outputs the compressed document image to the digitized document generation unit 15 (step S104).

一方、画像解析部１６が、画像バッファ部１３に格納された文書画像から内容領域を行単位で抽出する（ステップＳ１０６）。そして、画像解析部１６が、１ページ目の文書画像に含まれる内容領域に応じて、各内容領域の位置を特定するための基準となる閲覧パスを文書画像内に設定する（ステップＳ１０８）。さらに、画像解析部１６が、抽出された各内容領域の閲覧パスを基準とする位置（「閲覧パスからの距離」および「閲覧パス上位置」）を取得する（ステップＳ１１０）。同時に、画像解析部１６が、抽出された各内容領域の「文字の大きさ」、「文字の色」、「背景の色」の代表値を取得する（ステップＳ１１２）。そして、各内容領域の「閲覧パスからの距離」、「閲覧パス上位置」、「文字の大きさ」、「文字の色」および「背景の色」は、属性情報としてしおりデータ生成部１７へ出力される。 On the other hand, the image analysis unit 16 extracts the content area from the document image stored in the image buffer unit 13 in units of lines (step S106). Then, the image analysis unit 16 sets a browsing path serving as a reference for specifying the position of each content area in the document image in accordance with the content area included in the document image of the first page (step S108). Furthermore, the image analysis unit 16 acquires a position (“distance from the browsing path” and “position on the browsing path”) based on the browsing path of each extracted content area (step S110). At the same time, the image analysis unit 16 acquires representative values of “character size”, “character color”, and “background color” of each extracted content area (step S112). Then, the “distance from the browsing path”, “position on the browsing path”, “size of character”, “character color”, and “background color” of each content area are sent to the bookmark data generation unit 17 as attribute information. Is output.

この属性情報を受けて、しおりデータ生成部１７が、内容領域の全体を少なくとも１つのグループに分類する（ステップＳ１１４）。その後、しおりデータ生成部１７が、分類された各グループに所属する内容領域の文書画像内での位置に基づいて、各グループのしおりとしての適合度を評価する（ステップＳ１１６）。この評価結果に基づいて、しおりデータ生成部１７が評価結果の最上位のものから少なくとも１つのグループをしおりデータの生成対象として選択する（ステップＳ１１８）。さらに、複数のグループを選択した場合には、しおりデータ生成部１７が、選択された各グループに所属する内容領域の文書画像内での位置に基づいて、グループ間の従属関係を決定する（ステップＳ１２０）。 Upon receiving this attribute information, the bookmark data generation unit 17 classifies the entire content area into at least one group (step S114). Thereafter, the bookmark data generation unit 17 evaluates the suitability of each group as a bookmark based on the position in the document image of the content area belonging to each classified group (step S116). Based on the evaluation result, the bookmark data generation unit 17 selects at least one group from among the highest evaluation results as a bookmark data generation target (step S118). Further, when a plurality of groups are selected, the bookmark data generation unit 17 determines a dependency relationship between the groups based on the position in the document image of the content area belonging to each selected group (step S120).

しおりデータ生成部１７は、このように選択されたグループに所属する内容領域についての属性情報に基づいて、当該グループに所属する内容領域の文書画像内での位置を示すしおりデータを生成する（ステップＳ１２２）。なお、このとき複数のグループ間の従属関係が規定されている場合には、当該従属関係を含んでしおりデータが生成される。 The bookmark data generation unit 17 generates bookmark data indicating the position in the document image of the content area belonging to the group based on the attribute information about the content area belonging to the group selected in this way (step S122). At this time, if a dependency relationship between a plurality of groups is defined, bookmark data including the dependency relationship is generated.

続いて、電子化文書生成部１５が、圧縮処理部１４からの（圧縮された）文書画像に、しおりデータ生成部１７からのしおりデータを付加することで、電子化文書を生成する（ステップＳ１２４）。そして、電子化文書の生成処理は終了する。 Subsequently, the digitized document generation unit 15 adds the bookmark data from the bookmark data generation unit 17 to the (compressed) document image from the compression processing unit 14 to generate a digitized document (step S124). ). Then, the digitized document generation process ends.

以下、上記の各ステップの詳細な処理について説明する。
（内容領域の抽出処理）
図７は、図６のステップＳ１０６における内容領域の抽出処理を説明するための図である。 Hereinafter, detailed processing of each step will be described.
(Content area extraction processing)
FIG. 7 is a diagram for explaining the content region extraction processing in step S106 of FIG.

図７（ａ）は、原稿３００から生成される文書画像４２０の一例を示す図であり、図７（ｂ）は、図７（ａ）に示す文書画像４２０に対して内容領域の抽出処理が実行された結果の一例を示す図である。たとえば、３ページ分の原稿３００が画像読取部１０４（図２，図５）で読取られると、画像バッファ部１３には、図７（ａ）に示すような文書画像４２０が格納される。この文書画像４２０は、ページ領域４２１，４２２，４２３を含み、各ページ領域では「タイトルＡ」、「見出しＡ１」、「見出しＡ１．１」、「内容Ａ１．１は・・・」などにように、その種別に応じて「インデント」および「段落分け」されて記述されている。 FIG. 7A is a diagram illustrating an example of a document image 420 generated from the document 300. FIG. 7B illustrates a content area extraction process performed on the document image 420 illustrated in FIG. It is a figure which shows an example of the result performed. For example, when a three-page document 300 is read by the image reading unit 104 (FIGS. 2 and 5), a document image 420 as shown in FIG. 7A is stored in the image buffer unit 13. This document image 420 includes page areas 421, 422, and 423. In each page area, “title A”, “heading A1”, “heading A1.1”, “content A1.1 is... According to the type, “indent” and “paragraph division” are described.

画像解析部１６は、このような文書画像に対して、文字列を含む矩形領域を行単位で順次抽出する。すると、図７（ｂ）に示すように、文書画像４２０に含まれるページ領域４２１，４２２，４２３の各々において複数の内容領域４３０が抽出される。このような内容領域４３０の抽出処理については、たとえば特開平０９−１３４４０６号公報（特許文献１）に開示されているような公知の方法を用いることができる。ここで、抽出対象とする内容領域の種別は予め任意に設定することが可能であり、一例として「文字列」、「段落」、「図」、「表」、「写真」などを抽出対象にできる。なお、このような抽出対象の種別の選択についても上述したような公知の技術を用いることで実現できる。また、図７（ａ）および図７（ｂ）には、横書き原稿に対して内容領域４３０を抽出する構成について例示するが、縦書き原稿に対しても同様に内容領域４３０を抽出することが可能である。この場合、画像解析部１６は、紙面上下方向を「行方向」みなして内容領域４３０を抽出する。なお、「横書き原稿」と「縦書き原稿」との区別は、文書中の内容要素の密度に基づいて判断することができる。具体的には、一般的に「横書き原稿」では紙面左側に内容要素が集中する一方、紙面右側の内容要素が位置する密度は低い。これに対して、「縦書き原稿」では紙面上側に内容要素が集中する一方、紙面下側の内容要素が位置する密度は低い。このような、内容要素の偏在性に基づいて、「横書き原稿」と「縦書き原稿」とを区別することができる。 The image analysis unit 16 sequentially extracts a rectangular area including a character string in line units from such a document image. Then, as shown in FIG. 7B, a plurality of content areas 430 are extracted in each of the page areas 421, 422, and 423 included in the document image 420. For such extraction processing of the content area 430, a known method disclosed in, for example, Japanese Patent Application Laid-Open No. 09-134406 (Patent Document 1) can be used. Here, the type of content area to be extracted can be arbitrarily set in advance. For example, “character string”, “paragraph”, “figure”, “table”, “photograph”, etc. can be extracted. it can. Note that such selection of the type of extraction target can also be realized by using a known technique as described above. 7A and 7B exemplify a configuration for extracting the content area 430 for a horizontally written document, the content area 430 can be similarly extracted for a vertically written document. Is possible. In this case, the image analysis unit 16 regards the vertical direction of the paper as the “row direction” and extracts the content region 430. The distinction between “horizontal document” and “vertical document” can be determined based on the density of content elements in the document. Specifically, in a “horizontal writing document”, content elements are generally concentrated on the left side of the paper, while the density of content elements on the right side of the paper is low. On the other hand, in the “vertically written document”, the content elements are concentrated on the upper side of the paper, while the density of the content elements on the lower side of the paper is low. Based on such uneven distribution of content elements, “horizontal writing original” and “vertical writing original” can be distinguished.

以下では、便宜上抽出された内容領域４３０に対して「行１」〜「行２５」の識別番号を割当てて説明するが、画像解析部１６はこのような識別番号を必ずしも割当てる必要はなく、内容領域４３０を出現順（抽出順）に並べておくことで各内容領域を識別（特定）するようにしてもよい。 In the following, description will be made by assigning identification numbers of “row 1” to “row 25” to the content region 430 extracted for convenience, but the image analysis unit 16 does not necessarily need to assign such an identification number. Each content area may be identified (specified) by arranging the areas 430 in the order of appearance (extraction order).

（閲覧パスの設定処理）
図８は、図６のステップＳ１０８における閲覧パスの設定処理を説明するための図である。 (Browsing path setting process)
FIG. 8 is a diagram for explaining browsing path setting processing in step S108 of FIG.

図８を参照して、画像解析部１６は、抽出した内容領域４３０の位置に応じて閲覧パス４４０を設定する。より詳細には、画像解析部１６は、文書画像の行の始点側にあって、行と直行する方向に延びる閲覧パス４４０を設定する。そして、文書画像内に配置される行の先頭側に閲覧パス４４０の始点（基準点）を設定する。すなわち、閲覧パス４４０は、原稿の記述順序に対応した方向に延びる。代表的に、「横書き原稿」であれば、図８に示すように紙面左端を紙面上から紙面下に向かう閲覧パス４４０が設定される。なお、文書画像に含まれる各ページ領域に対して同一の位置に閲覧パス４４０が設定される。そして、この閲覧パス４４０の基準点は紙面左上に設定される。 With reference to FIG. 8, the image analysis unit 16 sets a browsing path 440 according to the position of the extracted content area 430. More specifically, the image analysis unit 16 sets a browsing path 440 that is on the start point side of the line of the document image and extends in a direction perpendicular to the line. Then, the start point (reference point) of the browsing path 440 is set on the head side of the line arranged in the document image. That is, the browsing path 440 extends in a direction corresponding to the document description order. Typically, in the case of a “horizontal writing document”, as shown in FIG. 8, a browsing path 440 is set so that the left edge of the paper faces from the top of the paper to the bottom of the paper. The browsing path 440 is set at the same position for each page area included in the document image. Then, the reference point of the browsing path 440 is set at the upper left of the page.

代替的に、「縦書き原稿」であれば紙面上端を紙面右から紙面左に向かう閲覧パスが設定される。そして、この場合の閲覧パスの基準点は紙面右上に設定される。その他については、上述の「横書き原稿」の場合と同様である。 Alternatively, in the case of “vertical writing document”, a browsing path is set in which the upper end of the sheet is directed from the right side to the left side. In this case, the reference point of the browsing path is set at the upper right of the page. Others are the same as in the case of the “horizontal writing document” described above.

ここで、閲覧パス４４０は「行の始点側」に設定されるが、この「行の始点側」は抽出した内容領域４３０のうち最も始点側に位置する内容領域に応じて決定される。すなわち、閲覧パス４４０を設定するためには、文書画像に含まれる内容領域４３０のうち最も始点側に位置するものを抽出する必要がある。しかしながら、多数の原稿を画像読取部１０４（図２，５）で読取って電子化文書を生成する場合などには、対象となる原稿の枚数を予め知ることができない。そのため、すべてのページに含まれる内容領域４３０を抽出した後に閲覧パス４４０を設定しようとすると、効率が低下するおそれがある。そこで、本実施の形態に従う画像解析部１６は、文書画像４２０の１ページ目のページ領域４２１に含まれる内容領域４３０に基づいて閲覧パス４４０を設定する。具体的には、画像解析部１６は、ページ領域４２１内に存在する内容領域４３０を囲む領域４５０を取得し、この領域４５０に基づいて閲覧パス４４０を設定する。 Here, the browsing path 440 is set to the “starting point side of the line”, and this “starting point side of the line” is determined according to the content area located closest to the starting point in the extracted content area 430. That is, in order to set the viewing path 440, it is necessary to extract the content area 430 included in the document image that is located closest to the starting point. However, when a digitized document is generated by reading a large number of originals with the image reading unit 104 (FIGS. 2 and 5), the number of target originals cannot be known in advance. Therefore, if an attempt is made to set the browsing path 440 after extracting the content area 430 included in all pages, the efficiency may decrease. Therefore, the image analysis unit 16 according to the present embodiment sets the browsing path 440 based on the content area 430 included in the page area 421 of the first page of the document image 420. Specifically, the image analysis unit 16 acquires a region 450 surrounding the content region 430 existing in the page region 421 and sets a browsing path 440 based on the region 450.

（位置取得処理）
図６のステップＳ１１０における各内容領域についての閲覧パス４４０を基準とした距離の取得処理について、図８を参照して説明する。 (Location acquisition processing)
The distance acquisition process based on the browsing path 440 for each content area in step S110 in FIG. 6 will be described with reference to FIG.

図８を参照して、本明細書では、閲覧パス４４０と各内容領域４３０との間の行方向の距離を「閲覧パスからの距離」と規定し、閲覧パス４４０上の各内容領域４３０に対応する行位置を「閲覧パス上位置」と規定する。たとえば、内容領域４３０Ａについての「閲覧パスからの距離」は符号４５４で示される距離であり、「閲覧パス上位置」は符号４５２で示される距離となる。なお、「閲覧パス上位置」としては、１ページ目の始点を基準として算出した「絶対値」、および対応するページの始点を基準として算出した「相対値」とを用いる。 With reference to FIG. 8, in this specification, the distance in the row direction between the browsing path 440 and each content area 430 is defined as “distance from the browsing path”, and each content area 430 on the browsing path 440 is defined. The corresponding line position is defined as “position on the browsing path”. For example, the “distance from the browsing path” for the content area 430A is a distance indicated by reference numeral 454, and the “position on the browsing path” is a distance indicated by reference numeral 452. As the “position on the browsing path”, an “absolute value” calculated based on the starting point of the first page and a “relative value” calculated based on the starting point of the corresponding page are used.

このように、画像解析部１６は、抽出した内容領域４３０の各々について、「閲覧パスからの距離」と「閲覧パス上位置（相対値）」および「閲覧パス上位置（相対値）」とを取得する。 As described above, the image analysis unit 16 calculates “distance from the browsing path”, “position on the browsing path (relative value)”, and “position on the browsing path (relative value)” for each of the extracted content areas 430. get.

（その他の属性情報の取得処理）
図６のステップＳ１１２における各内容領域の「文字の大きさ」、「文字の色」、「背景の色」の代表値の取得処理は、公知の文字認識技術などを用いて実現される。本実施の形態に従う画像解析部１６は、各内容領域の文字認識を行なって「文字の大きさ」および「文字の色」を取得する。ここで、各内容領域に文字の大きさや文字色が複数の種類だけ含まれる場合には、最も頻度の高いものの値、もしくはすべての値についての平均値を採用することができる。なお、この処理は対象となる内容領域が「文字列」である場合のみ有効である。 (Other attribute information acquisition processing)
The process of obtaining representative values of “character size”, “character color”, and “background color” of each content area in step S112 of FIG. 6 is realized using a known character recognition technique or the like. Image analysis unit 16 according to the present embodiment performs character recognition of each content area and acquires “character size” and “character color”. Here, when each of the content areas includes only a plurality of types of character sizes and character colors, the most frequently used value or an average value of all values can be employed. This process is effective only when the target content area is “character string”.

また、画像解析部１６は、各内容領域を構成する画素のヒストグラムに基づいて、一例として最も頻度の高い色を「背景の色」として抽出する。 Further, the image analysis unit 16 extracts, as an example, the most frequently used color as the “background color” based on the histogram of the pixels constituting each content area.

（属性情報）
図９は、図７に示す文書画像４２０から取得される各内容領域の属性情報の具体例を示す図である。 (Attribute information)
FIG. 9 is a diagram showing a specific example of attribute information of each content area acquired from the document image 420 shown in FIG.

図９を参照して、画像解析部１６は、文書画像４２０から抽出した各内容領域４３０について、図９のデータ欄４６１，４６２，４６３，４６４，４６５，４６６に記述されているようなデータを属性情報として出力する。ここで、この図９におけるデータ欄４６７に格納されている「ＴＹＰＥ値」については、後述するしおりデータ生成部１７が決定するため、画像解析部１６が出力する属性情報には含まれない。 Referring to FIG. 9, the image analysis unit 16 obtains data as described in the data columns 461, 462, 463, 464, 465, 466 of FIG. 9 for each content area 430 extracted from the document image 420. Output as attribute information. Here, the “TYPE value” stored in the data column 467 in FIG. 9 is not included in the attribute information output by the image analysis unit 16 because it is determined by the bookmark data generation unit 17 described later.

（グループへの分類処理）
図１０は、図６のステップＳ１１４におけるグループへの分類処理を説明するための図である。図１０（ａ）は、閲覧パスからの距離に基づく分類処理の一例を示す。図１０（ｂ）は、文字の大きさに基づく分類処理の一例を示す。 (Classification process into groups)
FIG. 10 is a diagram for explaining the group classification processing in step S114 of FIG. FIG. 10A shows an example of the classification process based on the distance from the browsing path. FIG. 10B shows an example of classification processing based on character size.

図１０（ａ）を参照して、しおりデータ生成部１７（図５）は、図９に示すデータ欄４６１に格納されている「閲覧パスからの距離」の値を用いて、すべての内容領域についての度数分布（ヒストグラム）を算出する。そして、しおりデータ生成部１７は、この度数分布に現れるピーク（出現頻度の高い部分）の位置に応じて、「閲覧パスからの距離」を少なくとも１つの区分（この例では、区分１〜区分４）に分類する。 Referring to FIG. 10A, the bookmark data generation unit 17 (FIG. 5) uses the value of “distance from browsing path” stored in the data column 461 shown in FIG. A frequency distribution (histogram) is calculated for. Then, the bookmark data generation unit 17 classifies the “distance from the browsing path” into at least one section (in this example, the sections 1 to 4 according to the position of the peak appearing in the frequency distribution (the part where the appearance frequency is high). ).

同様に、図１０（ｂ）を参照して、しおりデータ生成部１７は、図９に示すデータ欄４６２に格納されている「文字の大きさ」の値を用いて、すべての内容領域についての度数分布（ヒストグラム）を算出する。そして、しおりデータ生成部１７は、この度数分布に現れるピーク（出現頻度の高い部分）の位置に応じて、「文字の大きさ」を少なくとも１つの区分（この例では、区分小、区分中、区分大）に分類する。 Similarly, with reference to FIG. 10B, the bookmark data generation unit 17 uses the “character size” value stored in the data column 462 shown in FIG. Calculate the frequency distribution (histogram). Then, the bookmark data generation unit 17 assigns the “character size” to at least one section (in this example, the small section, the middle section, Classify).

このように、しおりデータ生成部１７は、属性値の各々について分類を行ない、これらの分類結果を統合して「ＴＹＰＥ値」を決定する。すなわち、しおりデータ生成部１７は、各属性値の分類結果に応じた分岐処理に従って、内容領域の全体を少なくとも１つのグループ（ＴＹＰＥ）に分類する。一例として、しおりデータ生成部１７は、「閲覧パスからの距離」が「区分１」であり「文字の大きさ」が「区分大」である内容領域を「ＴＹＰＥ１」と区分し、「閲覧パスからの距離」が「区分２」であり「文字の大きさ」が「区分中」である内容領域を「ＴＹＰＥ２」と区分することができる。 In this way, the bookmark data generation unit 17 classifies each attribute value and integrates these classification results to determine a “TYPE value”. That is, the bookmark data generation unit 17 classifies the entire content area into at least one group (TYPE) according to a branching process according to the classification result of each attribute value. As an example, the bookmark data generation unit 17 classifies the content area whose “distance from the browsing path” is “category 1” and “character size” is “large category” as “TYPE1”, The content area whose “distance from” is “category 2” and whose “character size” is “under classification” can be classified as “TYPE 2”.

なお、グループ数は２〜８が好ましく、上述のような分類処理を行なうことでグループ数が多くなり過ぎる場合には、グループの区分に用いる属性値の種類を適宜選択することが望ましい。 The number of groups is preferably 2 to 8, and when the number of groups becomes too large by performing the classification process as described above, it is desirable to appropriately select the type of attribute value used for group classification.

上述したようなグループへの分類処理は、文書内に現れる種別（たとえば、「タイトル」、「大見出し」、「中見出し」など）を共通にする内容領域同士をグルーピングするための処理である。すなわち、このような種別を共通にする内容領域同士は、いずれも類似した「文字の大きさ」や「閲覧パスからの距離」を有していると考えられるから、上述のように近似した属性情報を有する内容領域同士をグルーピングすることで、文書内の種別に応じた分類処理を実現できる。 The grouping process as described above is a process for grouping content areas that share the same type (for example, “title”, “large headline”, “medium headline”, etc.) appearing in the document. In other words, the content areas that share the same type are considered to have similar “character size” and “distance from the browsing path”, so the attributes approximated as described above. By grouping content areas having information, classification processing according to the type in the document can be realized.

このようなグループへの分類処理の結果、図９のデータ欄４６７に格納されているような「ＴＹＰＥ値」が決定される。 As a result of such grouping processing, the “TYPE value” stored in the data column 467 of FIG. 9 is determined.

（しおりとしての適合度の評価処理およびグループの選択処理）
図１１は、図６のステップＳ１１６およびＳ１１８におけるしおりとしての適合度の評価処理およびグループの選択処理を説明するための図である。 (Evaluation process for fitness as a bookmark and group selection process)
FIG. 11 is a diagram for explaining the fitness evaluation processing and group selection processing as bookmarks in steps S116 and S118 of FIG.

図１１を参照して、しおりデータ生成部１７は、上述の分類処理によって分類した各グループについて、しおりとしての適合度を評価する。具体的には、しおりデータ生成部１７は、各グループに所属する内容領域のページ毎の出現数に基づいて各グループを評価する。本実施の形態では、一例として、各グループの「出現網羅度」および「最大出現数」を評価指標として用いる。 Referring to FIG. 11, bookmark data generation unit 17 evaluates the degree of fitness as a bookmark for each group classified by the above-described classification process. Specifically, the bookmark data generation unit 17 evaluates each group based on the number of appearances of the content area belonging to each group for each page. In the present embodiment, as an example, “appearance coverage” and “maximum number of appearances” of each group are used as evaluation indexes.

ここで、「出現網羅度」とは、文書画像を構成する全ページのうち、各グループに所属する内容領域がどの程度それらのページに出現しているかを示す指標である。すなわち、各グループに所属する内容領域が文書画像を構成するページをどの程度網羅しているかを示す指標である。たとえば、文書画像に１０ページ分のページ領域が含まれている場合に、グループに所属する内容領域が５ページ分のページ領域に存在していれば、網羅度は「０．５」となる。この網羅度が「１」に近いグループほど、当該グループに所属する内容領域が文書画像の全体に出現することになり、しおりの対象として適当であると考えられる。 Here, the “appearance coverage” is an index indicating how many content areas belonging to each group appear in those pages among all pages constituting the document image. That is, it is an index indicating how much the content area belonging to each group covers the pages constituting the document image. For example, when a page area for 10 pages is included in a document image, if the content area belonging to the group exists in the page area for 5 pages, the coverage is “0.5”. The closer the coverage level is to “1”, the more the content area belonging to the group appears in the entire document image, which is considered appropriate as a bookmark target.

また、「最大出現数」とは、各グループに所属する内容領域のページ毎の出現数の最大値を示す指標である。すなわち、文書画像を構成する各ページについて見たときに、各グループに所属する内容領域がどの程度集中的に存在しているかを示す指標である。たとえば、１ページ目にあるグループに所属する内容領域が「５」回出現しており、他のページには全く出現しなければ、最大出現数は「５」となる。この最大出現数がしおりとして適切な値（たとえば、１〜２回）に近いグループほど、しおりとして選択するのが適当であると考えられる。 The “maximum number of appearances” is an index indicating the maximum number of appearances for each page of the content area belonging to each group. That is, this is an index indicating how concentrated the content area belonging to each group is when viewing each page constituting the document image. For example, if the content area belonging to the group on the first page appears “5” times and does not appear at all on the other pages, the maximum number of appearances is “5”. It is considered that it is appropriate to select a group whose maximum appearance number is close to a value appropriate for a bookmark (for example, 1 to 2 times) as a bookmark.

図１１は、図９に示す各内容領域の属性情報を用いて「出現網羅度」および「最大出現数」の具体例を算出した結果である。この図１１に示すように、図９において「ＴＹＰＥ３」および「ＴＹＰＥ４」のそれぞれに所属する内容領域は、すべてのページに出現している一方、「ＴＹＰＥ１」および「ＴＹＰＥ２」のそれぞれに所属する内容領域は、一部出現していないページがある。この「出現網羅度」において、その所属する内容領域がより多くのページに出現するグループに対して相対的に高い評価が与えられる。具体的には、すべてのページにその所属する内容領域が出現している「ＴＹＰＥ３」および「ＴＹＰＥ４」には、評価点として「２」点が与えられる。一方で、「ＴＹＰＥ１」および「ＴＹＰＥ２」には、それぞれお評価点として「０」点および「１」点が与えられる。 FIG. 11 shows the results of calculating specific examples of “appearance coverage” and “maximum number of appearances” using the attribute information of each content area shown in FIG. As shown in FIG. 11, the content areas belonging to “TYPE3” and “TYPE4” in FIG. 9 appear on all pages, while the contents belonging to “TYPE1” and “TYPE2” respectively. Some areas do not appear in the area. In this “appearance coverage”, a relatively high evaluation is given to a group in which the content area to which it belongs appears on more pages. More specifically, “TYPE 3” and “TYPE 4” in which the content areas belonging to all pages appear are given “2” points as evaluation points. On the other hand, “TYPE 1” and “TYPE 2” are given “0” points and “1” points as evaluation points, respectively.

また、「最大出現数」については、最大出現数が所定範囲内（一例として、１〜２回）であるグループに対して、その所定範囲外であるグループに比較して相対的に高い評価が与えられる。具体的には、その最大出現数が「１」または「２」回である「ＴＹＰＥ１」、「ＴＹＰＥ２」、「ＴＹＰＥ３」には、評価点として「２」点が与えられる。一方で、その最大出現数が「６」回である「ＴＹＰＥ４」には、評価点として「０」点が与えられる。 In addition, regarding the “maximum number of appearances”, a group whose maximum number of appearances is within a predetermined range (for example, 1 to 2 times) has a relatively high evaluation compared to a group that is outside the predetermined range. Given. Specifically, “TYPE 1”, “TYPE 2”, and “TYPE 3” whose maximum number of appearances is “1” or “2” times are given “2” points as evaluation points. On the other hand, “TYPE 4” whose maximum number of appearances is “6” is given “0” as an evaluation score.

さらに、しおりデータ生成部１７は、「出現網羅度」および「最大出現数」についての評価点の合計点を総合適合度として評価し、評価点が最上位のものから少なくとも１つのグループをしおりデータの生成対象として選択する。図１１に示す例では、「ＴＹＰＥ２」および「ＴＹＰＥ３」が総合適合度の上位２つであり、これらがしおりデータの生成対象として選択される。 Further, the bookmark data generation unit 17 evaluates the total score of the evaluation scores for the “appearance coverage” and the “maximum number of appearances” as the overall fitness, and bookmark data includes at least one group with the highest evaluation score. Select as the generation target. In the example illustrated in FIG. 11, “TYPE 2” and “TYPE 3” are the top two of the total fitness levels, and these are selected as bookmark data generation targets.

なお、上述の例では「出現網羅度」および「最大出現数」を総合した結果に基づいて、しおりデータの生成対象を選択したが、いずれか一方の評価結果を用いてしおりデータの生成対象を選択してもよく、さらに別の評価結果を用いてもよい。また、「出現網羅度」が非常に低いグループについては特徴的な記述である場合も想定されるため、これらのグループについてもしおりの生成対象としてもよい。 In the above example, the generation target of the bookmark data is selected based on the result of combining the “appearance coverage” and the “maximum number of appearances”, but the generation target of the bookmark data is selected using one of the evaluation results. You may select and you may use another evaluation result. In addition, since groups with very low “appearance coverage” are assumed to be characteristic descriptions, these groups may be generated as bookmarks.

（グループ間の従属関係の決定処理）
図１２は、図６のステップＳ１２０におけるグループ間の従属関係を決定する処理を説明するための図である。図１２（ａ）は、しおりデータの生成対象を選択しただけの状態を示す。図１２（ｂ）は、グループ間の従属関係を決定した後の状態を示す。 (Process to determine the dependency between groups)
FIG. 12 is a diagram for explaining the process of determining the dependency between groups in step S120 of FIG. FIG. 12A shows a state where the bookmark data generation target is simply selected. FIG. 12B shows a state after determining the dependency relationship between the groups.

しおりデータ生成部１７は、上述の選択処理によってしおりデータの生成対象として選択したグループが複数ある場合に、グループ間の従属関係を決定する。この従属関係は、原稿の記述の階層構造（たとえば、「大見出し」と「中見出し」との関係）を反映できるように決定される。 The bookmark data generation unit 17 determines a dependency relationship between groups when there are a plurality of groups selected as bookmark data generation targets by the selection process described above. This dependency relationship is determined so as to reflect the hierarchical structure of the description of the manuscript (for example, the relationship between “large heading” and “medium heading”).

具体的には、図１２（ａ）に示すように、しおりデータ生成部１７は、しおりデータの生成対象として選択したグループ（図１２の例では、「ＴＹＰＥ２」および「ＴＹＰＥ２」）に所属する内容領域の閲覧パス４４０上位置を相互に比較していく。そして、しおりデータ生成部１７は、異なるグループの内容領域（しおり）のうち、閲覧パス４４０上で互いに近接しているものを抽出する。 Specifically, as illustrated in FIG. 12A, the bookmark data generation unit 17 includes contents belonging to a group selected as a bookmark data generation target (“TYPE 2” and “TYPE 2” in the example of FIG. 12). The positions on the browsing path 440 of the areas are compared with each other. Then, the bookmark data generation unit 17 extracts content areas (bookmarks) of different groups that are close to each other on the browsing path 440.

そして、しおりデータ生成部１７は、このような異なるグループに所属するしおり同士を比較し、いずれのしおりが閲覧パス４４０上のより基準位置に近いかを判断する。図１２（ａ）に示す例では、「ＴＹＰＥ２」に所属するしおりが「ＴＹＰＥ３」に所属するしおりに比較して基準位置に近い（閲覧パス４４０上で先に出現している）ので、「ＴＹＰＥ３」に所属するしおりが「ＴＹＰＥ２」に所属するしおりに従属すると判断する。すなわち、しおりデータ生成部１７は、「ＴＹＰＥ２」に所属するしおりが「主」で、「ＴＹＰＥ３」に所属するしおりが「従」であると判断する。 Then, the bookmark data generation unit 17 compares bookmarks belonging to such different groups, and determines which bookmark is closer to the reference position on the browsing path 440. In the example shown in FIG. 12A, the bookmark belonging to “TYPE2” is closer to the reference position than the bookmark belonging to “TYPE3” (appears first on the browsing path 440). It is determined that the bookmark belonging to “” is subordinate to the bookmark belonging to “TYPE 2”. That is, the bookmark data generation unit 17 determines that the bookmark belonging to “TYPE2” is “main” and the bookmark belonging to “TYPE3” is “subordinate”.

このような手順によって、しおりデータ生成部１７はグループ間の従属関係を決定する。 By such a procedure, the bookmark data generation unit 17 determines the dependency relationship between groups.

（しおりデータおよび電子化文書の生成処理）
上述のような処理によって得られた情報に基づいて、しおりデータ生成部１７は、選択された内容領域の文書画像内での位置を示すしおりデータを生成する。さらに、電子化文書生成部１５がしおりデータ生成部１７が生成したしおりデータを文書画像に付加することで電子化文書４００を生成する。 (Booklet data and digitized document generation process)
Based on the information obtained by the processing as described above, the bookmark data generation unit 17 generates bookmark data indicating the position of the selected content area in the document image. Further, the digitized document 400 is generated by adding the bookmark data generated by the bookmark data generator 17 to the document image by the digitized document generator 15.

図１３は、電子化文書生成部１５が生成する電子化文書４００のデータ構造の一例を示す図である。 FIG. 13 is a diagram illustrating an example of a data structure of the digitized document 400 generated by the digitized document generation unit 15.

図１３を参照して、電子化文書４００は、ヘッダ部４０２と、文書画像部４０４と、しおりデータ部４０６と、フッタ部４０８とからなる。ヘッダ部４０２およびフッタ部４０８には、電子化文書４００の属性についての情報、たとえば作成日時・作成者・著作権情報などが格納される。文書画像部４０４には、各ページに対応する文書画像が格納される。なお、この文書画像は、上述したように圧縮された状態で格納されてもよい。しおりデータ部４０６には、しおりデータ生成部１７が生成したしおりデータが格納される。 Referring to FIG. 13, the digitized document 400 includes a header part 402, a document image part 404, a bookmark data part 406, and a footer part 408. The header part 402 and the footer part 408 store information about the attributes of the digitized document 400, such as creation date / time / creator / copyright information. A document image corresponding to each page is stored in the document image unit 404. The document image may be stored in a compressed state as described above. In the bookmark data unit 406, the bookmark data generated by the bookmark data generation unit 17 is stored.

図１４は、しおりデータ部４０６のデータ構造の一例を示す図である。
図１４を参照して、しおりデータ部４０６には、選択されたグループの内容領域の文書画像内での位置を示す位置情報が格納される。たとえば、「しおり１」として格納される位置情報である「ページ１，（１０，１２）」は、対象となる内容領域が、文書画像の「１」ページ目で、「閲覧パスからの距離」が「１０」で、かつ「閲覧パス上位置」が「１２」に存在することを示している。さらに、しおりデータ部４０６には、しおり同士の従属関係が規定されており、たとえば「しおり１」には、「しおり４」および「しおり５」が「しおり１」に従属することが規定される。 FIG. 14 is a diagram illustrating an example of the data structure of the bookmark data unit 406.
Referring to FIG. 14, bookmark data portion 406 stores position information indicating the position of the content area of the selected group in the document image. For example, “page 1, (10, 12)”, which is position information stored as “bookmark 1”, has a target content area “page 1” of the document image, and “distance from browsing path”. Is “10” and “position on the browsing path” is “12”. Further, the bookmark data unit 406 defines a dependency relationship between bookmarks. For example, “bookmark 1” defines that “bookmark 4” and “bookmark 5” are subordinate to “bookmark 1”. .

上述のような処理により、文書画像に含まれる内容領域に応じて効率的にしおりデータを生成することができる。 Through the processing as described above, bookmark data can be efficiently generated according to the content area included in the document image.

この発明の実施の形態１によれば、原稿を読込んで文書画像を含む電子化文書を生成する際に、文書画像に含まれる内容領域に応じて効率的にしおりなどの閲覧ナビゲート情報を生成できる。 According to the first embodiment of the present invention, when an electronic document including a document image is generated by reading an original, browsing navigation information such as a bookmark is efficiently generated according to the content area included in the document image. it can.

（変形例）
なお、上述の説明では、本発明を一段組の文書に適用した場合について例示したが、複数に段組された文書についても適用することができる。 (Modification)
In the above description, the case where the present invention is applied to a one-column document is illustrated, but the present invention can also be applied to a plurality of documents.

図１５は、この発明の実施の形態１の変形例に従う処理を模式的に示した図である。
図１５を参照して、一例として、二段組された電子化文書４００Ａに対して本発明を適用する場合には、各段組に対応させて閲覧パス４４０Ａ，４４０Ｂを設定する。ここで、閲覧パス４４０Ａおよび４４０Ｂは、電子化文書４００Ａにおける内容領域の抽出処理の結果に基づいて設定される領域４５０Ａおよび４５０Ｂに対応付けて設定される。すなわち、領域４５０Ａ内に存在する内容領域の各々は、閲覧パス４４０Ａとの間で「閲覧パスからの距離」（符号４５４Ａ）および「閲覧パス上位置」（符号４５２Ａ）を規定され、領域４５０Ｂ内に存在する内容領域の各々は、閲覧パス４４０Ｂとの間で「閲覧パスからの距離」および「閲覧パス上位置」を規定される。 FIG. 15 is a diagram schematically showing processing according to the modification of the first embodiment of the present invention.
Referring to FIG. 15, as an example, when the present invention is applied to a two-column digitized document 400A, browsing paths 440A and 440B are set corresponding to each column. Here, the browsing paths 440A and 440B are set in association with the areas 450A and 450B set based on the result of the content area extraction processing in the digitized document 400A. That is, each of the content areas existing in the area 450A is defined with a “distance from the browsing path” (reference 454A) and a “position on the browsing path” (reference 452A) with the browsing path 440A. Each of the content areas existing in is defined with a “distance from the browsing path” and a “position on the browsing path” with the browsing path 440B.

以下の処理は、上述したこの発明の実施の形態１に従う処理と同様であるので、詳細な説明は繰返さない。 Since the following processing is the same as the processing according to the first embodiment of the present invention described above, detailed description will not be repeated.

［実施の形態２］
この発明の実施の形態１では、ＭＦＰ１が予めしおりデータを付加した電子化文書を生成する構成について例示した。このような電子化文書は、その中に含まれる文書画像の１ページ分を一度に表示可能なパーソナルコンピュータなどには適している。しかしながら、特に携帯型端末などでは、ディスプレイ（表示領域）の大きさに制約があるため文書画像の１ページ分を一度に表示できる場合も多い。そのため、予め付加されたしおりが表示領域内に全く含まれない場合などもある。そこで、この発明の実施の形態２では、電子化文書の閲覧環境に応じてしおりを動的に生成する構成について例示する。 [Embodiment 2]
In the first embodiment of the present invention, the configuration in which the MFP 1 generates an electronic document to which bookmark data has been added in advance has been exemplified. Such an electronic document is suitable for a personal computer or the like that can display one page of a document image contained therein at a time. However, especially in a portable terminal or the like, there are many cases where one page of a document image can be displayed at a time because the size of the display (display area) is limited. For this reason, a bookmark added in advance may not be included in the display area at all. Therefore, Embodiment 2 of the present invention exemplifies a configuration that dynamically generates bookmarks according to the browsing environment of an electronic document.

この発明の実施の形態２に従うシステムは、図１に示すシステムと同様であり、ＭＦＰ１＃が電子化文書を携帯端末ＭＴやパーソナルコンピュータＰＣ（以下、「クライアント端末」とも総称する）へ送信し、送信先の装置がしおりデータをそれぞれ動的に生成する。 The system according to the second embodiment of the present invention is the same as the system shown in FIG. 1, and MFP 1 # transmits an electronic document to portable terminal MT or personal computer PC (hereinafter also collectively referred to as “client terminal”). Each destination device dynamically generates bookmark data.

図１６は、この発明の実施の形態２に従う携帯端末ＭＴにおける表示領域を模式的に示す図である。 FIG. 16 is a diagram schematically showing a display area in portable terminal MT according to the second embodiment of the present invention.

図１６を参照して、携帯端末ＭＴの文書表示領域５００（ディスプレイの大きさ）が電子化文書に含まれる文書画像のページ領域４２１に比較して小さければ、ユーザが一度に閲覧できる範囲は、文書画像の一部分に限定されたものとなってしまう。そのため、本実施の形態では、文書画像のページ単位とは別に、携帯端末ＭＴなどの表示領域の大きさに基づいて閲覧ページ領域を設定し、この閲覧ページ領域の単位でしおりデータを生成する。 Referring to FIG. 16, if the document display area 500 (display size) of the portable terminal MT is smaller than the page area 421 of the document image included in the digitized document, the range that the user can view at one time is The result is limited to a part of the document image. For this reason, in the present embodiment, apart from the page unit of the document image, a browsing page area is set based on the size of the display area of the mobile terminal MT or the like, and bookmark data is generated in units of the browsing page area.

特に、この発明の実施の形態２では、汎用の閲覧アプリケーションがインストールされているクライアント端末に対して、本発明に係る文書処理装置として機能させるためのプログラムを添付した電子化文書を送信する構成について説明する。より具体的には、汎用の閲覧アプリケーション（代表的に、米国カリフォルニア州サンノゼにあるアドビシステムズ社の「Ｒｅａｄｅｒ」）には、所定のスクリプト言語（代表的に、ＪａｖａＳｃｒｉｐｔ（登録商標））の実行環境を提供するモジュールを含めることができる。そのため、電子化文書にスクリプト形式のプログラムを添付することで、このような閲覧アプリケーションがインストールされている一般的なクライアント端末において、本発明に係る電子化文書の処理を実現することができる。 Particularly, in the second embodiment of the present invention, a configuration in which an electronic document attached with a program for causing a client terminal installed with a general-purpose browsing application to function as a document processing apparatus according to the present invention is transmitted. explain. More specifically, a general-purpose browsing application (typically “Reader” of Adobe Systems Inc. in San Jose, California, USA) includes an execution environment of a predetermined script language (typically JavaScript (registered trademark)). Modules can be included. Therefore, by attaching a script format program to the digitized document, the processing of the digitized document according to the present invention can be realized in a general client terminal in which such a browsing application is installed.

（ＭＦＰの構成）
この発明の実施の形態２に従うＭＦＰ１＃における構成は、図２と同様であるので詳細な説明は繰返さない。 (MFP configuration)
Configuration in MFP 1 # according to the second embodiment of the present invention is the same as that in FIG. 2, and therefore detailed description will not be repeated.

図１７は、この発明の実施の形態２に従うＭＦＰ１＃における機能構成を示すブロック図である。これらの機能は、主としてＭＦＰ１の制御部１００やメモリ部１０２（図２）などによって実現される。 FIG. 17 is a block diagram showing a functional configuration in MFP 1 # according to the second embodiment of the present invention. These functions are mainly realized by the control unit 100 and the memory unit 102 (FIG. 2) of the MFP 1.

図１７を参照して、ＭＦＰ１＃は、図５に示すこの発明の実施の形態１に従うＭＦＰ１の機能構成において、電子化文書生成部１５およびしおりデータ生成部１７に代えて電子化文書生成部１５＃および行属性情報生成部２０を設け、さらにクライアントプログラム格納部２１を加えたものに等しい。 Referring to FIG. 17, MFP 1 # has an electronic document generation unit 15 in place of electronic document generation unit 15 and bookmark data generation unit 17 in the functional configuration of MFP 1 according to the first embodiment of the present invention shown in FIG. # And line attribute information generation unit 20 are provided, and a client program storage unit 21 is further added.

行属性情報生成部２０は、画像解析部１６から出力される内容領域の属性情報に基づいて、内容領域の全体を少なくとも１つのグループに分類する。そして、行属性情報生成部２０は、抽出された各内容領域に対応付けて、少なくとも所属グループを示す「ＴＹＰＥ値」と、対応する内容領域の位置を示す「閲覧パス上位置」とを含む行属性情報を電子化文書生成部１５＃へ出力する。 The line attribute information generation unit 20 classifies the entire content region into at least one group based on the content region attribute information output from the image analysis unit 16. Then, the row attribute information generation unit 20 associates with each extracted content area, and includes a “TYPE value” indicating at least a group and a “position on the browsing path” indicating the position of the corresponding content area. The attribute information is output to the digitized document generation unit 15 #.

クライアントプログラム格納部２１は、電子化文書に付加して送付するクライアントプログラムを格納している。このクライアントプログラムは、汎用の閲覧アプリケーションがインストールされている携帯端末ＭＴやパーソナルコンピュータＰＣに対して、本発明に係る文書処理装置として機能させるためのものである。電子化文書生成部１５＃は、圧縮処理部１４で圧縮された文書画像に、行属性情報生成部２０からの行属性情報とクライアントプログラム格納部２１からのクライアントプログラムとを付加することで、電子化文書４００＃を生成する。 The client program storage unit 21 stores a client program that is sent in addition to an electronic document. This client program is for causing a portable terminal MT or a personal computer PC in which a general-purpose browsing application is installed to function as a document processing apparatus according to the present invention. The digitized document generation unit 15 # adds the line attribute information from the line attribute information generation unit 20 and the client program from the client program storage unit 21 to the document image compressed by the compression processing unit 14, thereby generating an electronic document. Generated document 400 #.

その他の機能ブロックについては、上述した図５の対応する機能ブロックと同様であるので、詳細な説明は繰返さない。 Other functional blocks are the same as the corresponding functional blocks in FIG. 5 described above, and thus detailed description will not be repeated.

図１８は、この発明の実施の形態２に従うＭＦＰ１＃が生成する電子化文書４００＃のデータ構造の一例を示す図である。図１８を参照して、電子化文書４００＃は、図１３に示す電子化文書４００において、しおりデータ部４０６に代えて行属性情報４１０を配置し、さらにクライアントプログラム４０９を付加したものである。 FIG. 18 shows an exemplary data structure of digitized document 400 # generated by MFP 1 # according to the second embodiment of the present invention. Referring to FIG. 18, an electronic document 400 # is obtained by arranging line attribute information 410 in place of the bookmark data portion 406 and adding a client program 409 in the electronic document 400 shown in FIG.

図１９は、図１８に示す行属性情報４１０のデータ構造の具体例を示す図である。
図１９を参照して、本実施の形態に従う行属性情報４１０には、図９に示すこの発明の実施の形態１に従う属性情報のうち、データ欄４６７および４６６に格納されているデータが格納される。すなわち、行属性情報４１０には、文書画像に含まれる内容領域のグループを示す「ＴＹＰＥ値」と、対応する「閲覧パス上位置（絶対値）」とが格納される。このように、行属性情報４１０には、しおりデータを生成するための元になるデータが格納されるだけでよい。 FIG. 19 is a diagram showing a specific example of the data structure of the row attribute information 410 shown in FIG.
Referring to FIG. 19, row attribute information 410 according to the present embodiment stores the data stored in data columns 467 and 466 among the attribute information according to the first embodiment of the present invention shown in FIG. The That is, the line attribute information 410 stores a “TYPE value” indicating a group of content areas included in the document image and a corresponding “position on browsing path (absolute value)”. In this way, the row attribute information 410 need only store data that is the basis for generating bookmark data.

（電子化文書の生成処理手順）
図２０は、この発明の実施の形態２に従う電子化文書の生成処理の具体例を示すフローチャートである。図２０のフローチャートに示される処理は、ＭＦＰ１＃の制御部１００がプログラムをメモリ部１０２などに読出して実行し、図１７に示される各機能を実現する。 (Digitized document generation procedure)
FIG. 20 is a flowchart showing a specific example of the digitized document generation process according to the second embodiment of the present invention. The processing shown in the flowchart of FIG. 20 is implemented by the control unit 100 of the MFP 1 # reading out the program to the memory unit 102 and executing the functions shown in FIG.

図１７および図２０を参照して、まず、画像読取部１０４がユーザ設定などに応じて原稿３００を読取って文書画像を生成する（ステップＳ２００）。次に、画像前処理部１２がこの生成された文書画像を調整する（ステップＳ２０２）。そして、調整後の文書画像は、画像バッファ部１３に格納される。 Referring to FIGS. 17 and 20, first, the image reading unit 104 reads the document 300 according to user settings and the like to generate a document image (step S 200). Next, the image preprocessing unit 12 adjusts the generated document image (step S202). The adjusted document image is stored in the image buffer unit 13.

続いて、圧縮処理部１４が、画像バッファ部１３に格納された文書画像を圧縮処理して、電子化文書生成部１５へ出力する（ステップＳ２０４）。 Subsequently, the compression processing unit 14 compresses the document image stored in the image buffer unit 13 and outputs the compressed document image to the digitized document generation unit 15 (step S204).

一方、画像解析部１６が、画像バッファ部１３に格納された文書画像から内容領域を行単位で抽出する（ステップＳ２０６）。そして、画像解析部１６が、１ページ目の文書画像に含まれる内容領域に応じて、各内容領域の位置を特定するための基準となる閲覧パスを文書画像内に設定する（ステップＳ２０８）。さらに、画像解析部１６が、抽出された各内容領域の閲覧パスを基準とする位置（「閲覧パスからの距離」および「閲覧パス上位置」）を取得する（ステップＳ２１０）。同時に、画像解析部１６が、抽出された各内容領域の「文字の大きさ」、「文字の色」、「背景の色」の代表値を取得する（ステップＳ２１２）。そして、各内容領域の「閲覧パスからの距離」、「閲覧パス上位置」、「文字の大きさ」、「文字の色」、「背景の色」は、属性情報として行属性情報生成部２０へ出力される。 On the other hand, the image analysis unit 16 extracts the content area from the document image stored in the image buffer unit 13 line by line (step S206). Then, the image analysis unit 16 sets a browsing path serving as a reference for specifying the position of each content area in the document image according to the content area included in the document image of the first page (step S208). Further, the image analysis unit 16 acquires a position (“distance from the browsing path” and “position on the browsing path”) based on the browsing path of each extracted content area (step S210). At the same time, the image analysis unit 16 acquires representative values of “character size”, “character color”, and “background color” of each extracted content area (step S212). Then, the “distance from the browsing path”, “position on the browsing path”, “size of character”, “character color”, and “background color” of each content area are set as attribute information in the row attribute information generation unit 20. Is output.

この属性情報を受けて、行属性情報生成部２０が、内容領域の全体を少なくとも１つのグループに分類する（ステップＳ２１４）。そして、行属性情報生成部２０が、各内容領域に対応付けて、その所属するグループを示す「ＴＹＰＥ値」と、対応する内容領域の位置を示す「閲覧パス上位置（絶対値）」とが格納された行属性情報４１０を生成する（ステップＳ２１６）。 Receiving this attribute information, the row attribute information generation unit 20 classifies the entire content area into at least one group (step S214). Then, the row attribute information generation unit 20 associates each content area with a “TYPE value” indicating the group to which the line attribute information belongs, and a “browsing path position (absolute value)” indicating the position of the corresponding content area. The stored row attribute information 410 is generated (step S216).

続いて、電子化文書生成部１５＃が、圧縮処理部１４からの（圧縮された）文書画像に、行属性情報生成部２０からの行属性情報４１０およびクライアントプログラム格納部２１からのクライアントプログラムを付加することで、電子化文書を生成する（ステップＳ２１８）。そして、電子化文書の生成処理は終了する。 Subsequently, the digitized document generation unit 15 # adds the line attribute information 410 from the line attribute information generation unit 20 and the client program from the client program storage unit 21 to the (compressed) document image from the compression processing unit 14. By adding, an electronic document is generated (step S218). Then, the digitized document generation process ends.

なお、上記の各ステップの詳細な処理については、上述したこの発明の実施の形態１と同様であるので、詳細な説明は繰返さない。 Since detailed processing of each step is the same as that of the first embodiment of the present invention described above, detailed description will not be repeated.

（クライアント端末の構成）
クライアント端末であるパーソナルコンピュータＰＣや携帯端末ＭＴの概略のハードウェア構成は図３と同様であるので、詳細な説明は繰返さない。 (Configuration of client terminal)
Since the general hardware configuration of personal computer PC and mobile terminal MT which are client terminals is the same as that of FIG. 3, detailed description will not be repeated.

図２１は、この発明の実施の形態２に従うクライアント端末における機能構成を示すブロック図である。図２１（ａ）は、携帯端末ＭＴにおける機能構成を示し、図２１（ｂ）は、パーソナルコンピュータＰＣにおける機能構成を示す。なお、これらの機能は、図３に示すＣＰＵ２０１がメモリ部２１３に記憶されたプログラムを実行することで実現される。 FIG. 21 is a block diagram showing a functional configuration in the client terminal according to the second embodiment of the present invention. FIG. 21A shows a functional configuration in the mobile terminal MT, and FIG. 21B shows a functional configuration in the personal computer PC. Note that these functions are realized by the CPU 201 shown in FIG. 3 executing a program stored in the memory unit 213.

図２１（ａ）を参照して、携帯端末ＭＴにおける機能構成は、受信部４０と、閲覧用アプリケーション４１と、表示部４２と、クライアントプログラム実行環境４３と、機器情報格納部４４とを含む。 With reference to FIG. 21A, the functional configuration of the mobile terminal MT includes a receiving unit 40, a browsing application 41, a display unit 42, a client program execution environment 43, and a device information storage unit 44.

受信部４０は、代表的に通信インターフェイス部１０８（図２）によって実現され、ＭＦＰ１＃から送信される電子化文書４００＃を受信して、閲覧用アプリケーション４１へ渡す。閲覧用アプリケーション４１は、ハードディスク部２１１（図２）などに格納されているプログラムコードがメモリ部２１３（図２）に展開されて、ＣＰＵ２０１（図２）で実行されることで実現される。閲覧用アプリケーション４１は、電子化文書４００＃に含まれる文書画像の表示データを生成し、表示部４２へ出力する。並行して、閲覧用アプリケーション４１は、電子化文書４００＃に含まれる付属情報および添付プログラムを抽出し、それをクライアントプログラム実行環境４３に渡す。 The receiving unit 40 is typically realized by the communication interface unit 108 (FIG. 2), receives the digitized document 400 # transmitted from the MFP 1 #, and passes it to the browsing application 41. The browsing application 41 is realized by developing a program code stored in the hard disk unit 211 (FIG. 2) in the memory unit 213 (FIG. 2) and executing it by the CPU 201 (FIG. 2). The browsing application 41 generates display data of the document image included in the digitized document 400 # and outputs it to the display unit 42. In parallel, the browsing application 41 extracts the attached information and the attached program included in the digitized document 400 # and passes them to the client program execution environment 43.

クライアントプログラム実行環境４３は、閲覧用アプリケーション４１から渡された添付プログラムを実行することで、電子化文書４００＃の行属性情報に基づいてしおりデータを生成し、当該しおりデータを文書画像とともに表示部４２に表示する。ここで、クライアントプログラム実行環境４３で実行されるクライアントプログラムは、機器情報格納部４４から機器属性や表示部４２の表示特性（代表的に、表示領域の大きさ）を取得し、これらの情報に応じてしおりデータを動的に生成する。ここで、機器情報格納部４４は、携帯端末ＭＴの機器属性を予め格納する部位である。 The client program execution environment 43 generates bookmark data based on the line attribute information of the digitized document 400 # by executing the attached program passed from the browsing application 41, and displays the bookmark data together with the document image. 42. Here, the client program executed in the client program execution environment 43 acquires the device attributes and the display characteristics (typically, the size of the display area) of the display unit 42 from the device information storage unit 44, and includes these information. In response, bookmark data is dynamically generated. Here, the device information storage unit 44 is a part that stores device attributes of the mobile terminal MT in advance.

このように、閲覧用アプリケーション４１およびクライアントプログラム実行環境４３が協働することで、本発明に係るしおりデータの生成処理および表示処理が実現される。なお、閲覧用アプリケーション４１とクライアントプログラム実行環境４３との間の機能分担については適宜設計することが可能である。 As described above, the browsing application 41 and the client program execution environment 43 cooperate to realize bookmark data generation processing and display processing according to the present invention. The function sharing between the browsing application 41 and the client program execution environment 43 can be designed as appropriate.

図２１（ｂ）を参照して、パーソナルコンピュータＰＣにおける機能構成は、受信部５０と、閲覧用アプリケーション５１と、表示部５２と、クライアントプログラム実行環境５３と、機器情報格納部５４と、ＧＵＩ（Graphical User Interface）部５５とを含む。受信部５０と、閲覧用アプリケーション５１と、表示部５２と、クライアントプログラム実行環境５３と、機器情報格納部５４とは、それぞれ上述した受信部４０と、閲覧用アプリケーション４１と、表示部４２と、クライアントプログラム実行環境４３と、機器情報格納部４４と同様であるので、詳細な説明は繰返さない。 Referring to FIG. 21B, the functional configuration of the personal computer PC includes a receiving unit 50, a browsing application 51, a display unit 52, a client program execution environment 53, a device information storage unit 54, a GUI ( Graphical User Interface) section 55. The receiving unit 50, the browsing application 51, the display unit 52, the client program execution environment 53, and the device information storage unit 54 are the receiving unit 40, the browsing application 41, the display unit 42, and the display unit 42, respectively. Since it is similar to client program execution environment 43 and device information storage unit 44, detailed description will not be repeated.

ここで、本実施の形態に従うパーソナルコンピュータＰＣは、代表的に、複数のアプリケーションを同時に実行可能なオペレーティングシステム（ＯＳ：Operating System）を搭載しており、表示部５２には複数のアプリケーションによって生成される画面が表示される。ＧＵＩ部５５は、このような複数のアプリケーションによる表示を制御しており、クライアントプログラム実行環境５３で実行されるクライアントプログラムからの要求に応答して、閲覧用アプリケーションの表示サイズ（ウィンドウサイズ）を返答する。このＧＵＩ部５５からの表示サイズの情報に応じて、クライアントプログラム実行環境５３で実行されるクライアントプログラムは、しおりデータを動的に生成する。 Here, personal computer PC according to the present embodiment typically includes an operating system (OS) that can simultaneously execute a plurality of applications, and display unit 52 generates a plurality of applications. Appears. The GUI unit 55 controls display by such a plurality of applications, and responds to the request from the client program executed in the client program execution environment 53 and returns the display size (window size) of the browsing application. To do. In accordance with the display size information from the GUI unit 55, the client program executed in the client program execution environment 53 dynamically generates bookmark data.

（しおりデータの生成処理手順）
図２２は、この発明の実施の形態２に従うしおりデータの生成処理の具体例を示すフローチャートである。図２２のフローチャートに示される処理は、図３に示すＣＰＵ２０１がハードディスク部２１１などに予め格納された閲覧用アプリケーションをメモリ部２１３に展開して実行するとともに、ハードディスク部２１１またはメモリ部２１３に予め取得されている電子化文書４００＃中のクライアントプログラムを並行的に実行することで実現される。 (Bookmark data generation processing procedure)
FIG. 22 is a flowchart showing a specific example of bookmark data generation processing according to the second embodiment of the present invention. The process shown in the flowchart of FIG. 22 is executed by the CPU 201 shown in FIG. This is realized by executing the client program in the digitized document 400 # in parallel.

図２２を参照して、ＣＰＵ２０１が、機器情報格納部４４または５４からクライアント端末の機器属性を取得する（ステップＳ３００）。そして、ＣＰＵ２０１が、取得した機器属性に基づいて、クライアント端末が携帯端末ＭＴであるか、パーソナルコンピュータＰＣであるかを判断する（ステップＳ３０２）。すなわち、ＣＰＵ２０１は、クライアント端末が、単一のアプリケーションだけを実行可能であるか、もしくは複数のアプリケーションを同時に実行可能であるかを判断する。 With reference to FIG. 22, CPU201 acquires the apparatus attribute of a client terminal from the apparatus information storage part 44 or 54 (step S300). Then, the CPU 201 determines whether the client terminal is the portable terminal MT or the personal computer PC based on the acquired device attribute (step S302). That is, the CPU 201 determines whether the client terminal can execute only a single application or can simultaneously execute a plurality of applications.

クライアント端末が携帯端末ＭＴである場合（ステップＳ３０２においてＭＴ）には、ＣＰＵ２０１が、機器情報格納部４４から表示部４２の表示領域の大きさを取得する（ステップＳ３０４）。一方、クライアント端末がパーソナルコンピュータＰＣである場合（ステップＳ３０２においてＰＣ）には、ＣＰＵ２０１が、ＧＵＩ部５５からアクティブになっている閲覧アプリケーションのウィンドウサイズを取得する（ステップＳ３０６）。 When the client terminal is the mobile terminal MT (MT in step S302), the CPU 201 acquires the size of the display area of the display unit 42 from the device information storage unit 44 (step S304). On the other hand, if the client terminal is a personal computer PC (PC in step S302), the CPU 201 acquires the window size of the browsing application that is active from the GUI unit 55 (step S306).

次に、ＣＰＵ２０１が、表示部４２の表示領域の大きさ、またはアクティブになっている閲覧アプリケーションのウィンドウサイズに応じて、少なくとも１つの閲覧ページ領域を設定する（ステップＳ３０８）。ここで、閲覧ページ領域とは、文書画像を表示部４２または５２に表示するために設定される便宜上のページ領域であり、元の原稿におけるページ領域とは独立に設定される。代表的に、閲覧ページ領域は、電子化文書に含まれる文書画像のうち表示部４２または５２で表示可能な最大領域に設定される。 Next, the CPU 201 sets at least one browsing page area according to the size of the display area of the display unit 42 or the window size of the active browsing application (step S308). Here, the browsing page area is a convenient page area set for displaying the document image on the display unit 42 or 52, and is set independently of the page area in the original document. Typically, the browsing page area is set to the maximum area that can be displayed on the display unit 42 or 52 among the document images included in the digitized document.

そして、ＣＰＵ２０１が、電子化文書４００＃に添付される行属性情報に規定される各グループに所属する内容領域の閲覧ページ毎の出現数に基づいて、各グループのしおりとしての適合度を評価する（ステップＳ３１０）。この評価結果に基づいて、ＣＰＵ２０１が評価結果の最上位のものから少なくとも１つのグループをしおりデータの生成対象として選択する（ステップＳ３１２）。ここで、ＣＰＵ２０１が、閲覧ページのうちしおりデータの生成対象となる内容領域が存在しない閲覧ページに対して、当該閲覧ページを特定するためのしおりデータを付加する（ステップＳ３１４）。 Then, the CPU 201 evaluates the fitness of each group as a bookmark based on the number of appearances of each content page belonging to each group defined in the line attribute information attached to the digitized document 400 #. (Step S310). Based on the evaluation result, the CPU 201 selects at least one group from among the highest evaluation results as a bookmark data generation target (step S312). Here, the CPU 201 adds bookmark data for specifying the browsing page to a browsing page in which there is no content area for which bookmark data is to be generated (step S314).

さらに、複数のグループを選択した場合には、ＣＰＵ２０１が、選択した各グループに所属する内容領域の文書画像内での位置に基づいて、グループ間の従属関係を決定する（ステップＳ３１６）。その後、ＣＰＵ２０１が、しおりデータの生成対象として選択されたグループに所属する内容領域についての属性情報に基づいて、しおりデータを生成する（ステップＳ３１８）。 Further, when a plurality of groups are selected, the CPU 201 determines the dependency relationship between the groups based on the position of the content area belonging to each selected group in the document image (step S316). Thereafter, the CPU 201 generates bookmark data based on the attribute information about the content area belonging to the group selected as the bookmark data generation target (step S318).

最終的に、ＣＰＵ２０１が、表示部４２または５２に表示される閲覧ナビゲート情報表示領域５１０（図４）の表示を更新する（ステップＳ３２０）。そして、しおりデータの生成処理は終了する。 Finally, the CPU 201 updates the display of the browsing navigation information display area 510 (FIG. 4) displayed on the display unit 42 or 52 (step S320). Then, the bookmark data generation process ends.

以下、上記の主要なステップの詳細な処理について説明する。
（閲覧ページ領域の設定処理）
図２３は、図２２のステップＳ３０８における閲覧ページ領域の設定処理を説明するための図である。 Hereinafter, detailed processing of the main steps will be described.
(Browsing page area setting process)
FIG. 23 is a diagram for explaining the browsing page area setting process in step S308 of FIG.

図２３を参照して、ＣＰＵ２０１は、代表的に閲覧パス上の「行数」の単位で「閲覧ページ領域」を設定する。すなわち、ＣＰＵ２０１は、表示部４２の表示領域の大きさ、またはアクティブになっている閲覧アプリケーションのウィンドウサイズに応じて、「閲覧ページ領域」の１ページに相当する閲覧パス上の「行数」を決定する。たとえば、閲覧パス上の「２９行」が１ページ分の閲覧ページ領域に相当する場合には、ＣＰＵ２０１は、図１９に示す行属性情報４１０に対して、図２３に示すような閲覧ページを設定する。図１９は、元来３ページ分の原稿から生成された文書画像についての行属性情報であったが、図２３ではクライアント端末の表示特性に応じて８ページ分の閲覧ページが設定されている（データ欄４８０）。そして、以下の処理はこの設定された閲覧ページの単位で行なわれる。 Referring to FIG. 23, CPU 201 typically sets “browsing page area” in units of “number of rows” on the browsing path. That is, the CPU 201 sets the “number of lines” on the browsing path corresponding to one page of the “browsing page area” according to the size of the display area of the display unit 42 or the window size of the active browsing application. decide. For example, when “29 lines” on the browsing path corresponds to a browsing page area for one page, the CPU 201 sets a browsing page as shown in FIG. 23 for the line attribute information 410 shown in FIG. To do. FIG. 19 originally shows line attribute information about a document image generated from a document of three pages. In FIG. 23, eight pages of browsing pages are set according to the display characteristics of the client terminal ( Data column 480). The following processing is performed in units of the set browsing page.

（しおりとしての適合度の評価処理およびグループの選択処理）
図２２のステップＳ３１０およびＳ３１２におけるしおりとしての適合度の評価処理およびグループの選択処理は、上述したこの発明の実施の形態１における図１１と同様に、各グループの「出現網羅度」および「最大出現数」に基づいて行なわれる。特に本実施の形態では、「出現網羅度」および「最大出現数」は閲覧ページの単位で算出される。 (Evaluation process for fitness as a bookmark and group selection process)
Similar to the above-described FIG. 11 in the first embodiment of the present invention, the evaluation process of the fitness as a bookmark and the group selection process in steps S310 and S312 of FIG. This is based on the “number of occurrences”. In particular, in the present embodiment, “appearance coverage” and “maximum number of appearances” are calculated in units of browsing pages.

図２４は、図２３に示す行属性情報を用いて「出現網羅度」および「最大出現数」の具体例を求めた結果である。図２４に示すように、「出現網羅度」については、その所属する内容領域がより多くのページに出現するグループに対して相対的に高い評価が与えられる。具体的には、その所属する内容領域が８ページ中のそれぞれ５ページおよび７ページに出現している「ＴＹＰＥ３」および「ＴＹＰＥ４」には、評価点として「２」点が与えられる。一方で、「ＴＹＰＥ１」および「ＴＹＰＥ２」には、評価点として「０」点が与えられる。 FIG. 24 shows the result of obtaining specific examples of “appearance coverage” and “maximum number of appearances” using the row attribute information shown in FIG. As shown in FIG. 24, regarding the “appearance coverage”, a relatively high evaluation is given to a group in which the content area to which it belongs appears on more pages. Specifically, “TYPE 3” and “TYPE 4” in which the content area to which the page belongs appear on page 5 and page 7 of 8 pages are given “2” points as evaluation points. On the other hand, “TYPE 1” and “TYPE 2” are given “0” points as evaluation points.

また、「最大出現数」については、最大出現数が所定範囲内（一例として、１〜２回）であるグループに対して、その所定範囲外であるグループに比較して相対的に高い評価が与えられる。具体的には、その最大出現数が「１」回である「ＴＹＰＥ１」、「ＴＹＰＥ２」および「ＴＹＰＥ３」には、評価点として「２」点が与えられる。一方で、その最大出現数が「３」回である「ＴＹＰＥ４」には、評価点として「０」点が与えられる。 In addition, regarding the “maximum number of appearances”, a group whose maximum number of appearances is within a predetermined range (for example, 1 to 2 times) has a relatively high evaluation compared to a group that is outside the predetermined range. Given. Specifically, “TYPE 1”, “TYPE 2”, and “TYPE 3” whose maximum number of appearances is “1” are given “2” points as evaluation points. On the other hand, “TYPE 4” whose maximum number of appearances is “3” is given “0” as an evaluation score.

そして、ＣＰＵ２０１は、「出現網羅度」および「最大出現数」についての評価点の合計点を総合適合度として評価し、評価点が最上位のものから少なくとも１つのグループをしおりデータの生成対象として選択する。図２４に示す例では、「ＴＹＰＥ３」がしおりデータの生成対象として選択される。 Then, the CPU 201 evaluates the total score of the evaluation points for the “appearance coverage” and “maximum number of appearances” as the overall fitness, and sets at least one group from the highest evaluation score as a bookmark data generation target. select. In the example illustrated in FIG. 24, “TYPE3” is selected as a bookmark data generation target.

なお、上述の例では、「出現網羅度」および「最大出現数」を総合した結果に基づいて、しおりデータの生成対象を選択したが、いずれか一方の評価結果を用いてしおりデータの生成対象を選択してもよく、さらに別の評価を用いてもよい。また、「出現網羅度」が非常に低いグループについては特徴的な記述である場合も想定されるため、これらのグループをしおりデータの生成対象としてもよい。 In the above example, the bookmark data generation target is selected based on the result of combining the “appearance coverage” and the “maximum number of occurrences”, but one of the evaluation results is used to generate the bookmark data. May be selected, and further evaluations may be used. In addition, since groups with very low “appearance coverage” may be characteristic descriptions, these groups may be used as bookmark data generation targets.

（しおりデータの付加処理）
図２５は、図２２のステップＳ３１４におけるしおりデータの付加処理を説明するための図である。図２５（ａ）は、選択されたグループに所属する内容領域に対してしおりデータ５２０が生成された状態を示す。図２５（ｂ）は、しおりデータ５２２を付加した後の状態を示す。 (Additional processing of bookmark data)
FIG. 25 is a diagram for explaining bookmark data addition processing in step S314 of FIG. FIG. 25A shows a state in which bookmark data 520 is generated for the content area belonging to the selected group. FIG. 25B shows a state after bookmark data 522 is added.

図２５（ａ）を参照して、一例として図２３に示す行属性情報のうち「ＴＹＰＥ３」のグループについての「出現網羅度」は「１」ではないので、「ＴＹＰＥ３」に設定されている内容領域に対してしおりデータ５２０を生成しただけでは、しおりデータが付加されていない閲覧ページが存在する（図２５（ａ）に示す例では、「ページ３」、「ページ５」、「ページ８」）。 Referring to FIG. 25A, as an example, the “appearance coverage” for the group “TYPE 3” in the row attribute information shown in FIG. 23 is not “1”, so the content set to “TYPE 3” By only generating bookmark data 520 for the area, there are browsing pages to which no bookmark data is added (in the example shown in FIG. 25A, “page 3”, “page 5”, “page 8”). ).

そこで、ＣＰＵ２０１は、閲覧ページのうちしおりデータの生成対象となる内容領域が存在しない閲覧ページに対して、閲覧ページを特定するためのしおりデータを付加する。本実施の形態では、代表的に、対応の閲覧ページの先頭を示すしおりデータ（ｐａｇｅｔｏｐ）を付加する。 Therefore, the CPU 201 adds bookmark data for specifying a browsing page to a browsing page that does not have a content area for which bookmark data is to be generated. In the present embodiment, typically, bookmark data (pagetop) indicating the head of the corresponding browsing page is added.

（グループ間の従属関係の決定処理）
図２２のステップＳ３１４におけるグループ間の従属関係を決定する処理は、上述したこの発明の実施の形態１における図１２と同様であり、異なるグループの内容領域（しおり）のうち閲覧パス上で互いに近接しているものを抽出することで、グループ間の従属関係が決定される。 (Process to determine the dependency between groups)
The process of determining the dependency relationship between groups in step S314 in FIG. 22 is the same as that in FIG. 12 in the first embodiment of the present invention described above, and is close to each other on the browsing path among the content areas (bookmarks) of different groups. By extracting what is being performed, the dependency between groups is determined.

なお、上述のように、しおりデータの生成対象となる内容領域が存在しない閲覧ページに対して付加されたしおりデータについては、直近のしおりデータの最下層に従属するように従属関係が決定される。 As described above, with respect to bookmark data added to a browse page for which no content area for which bookmark data is to be generated exists, the dependency relationship is determined so as to be subordinate to the lowest layer of the latest bookmark data. .

（しおりデータの生成処理および閲覧ナビゲート情報表示領域の表示更新処理）
上述のような処理によって得られた情報に基づいて、ＣＰＵ２０１は、選択された内容領域の文書画像内での位置を示すしおりデータを生成し、表示部４２または５２に表示される閲覧ナビゲート情報表示領域５１０の表示を更新する。 (Bookmark data generation processing and browsing navigation information display area display update processing)
Based on the information obtained by the processing as described above, the CPU 201 generates bookmark data indicating the position of the selected content area in the document image, and the browsing navigation information displayed on the display unit 42 or 52. The display in the display area 510 is updated.

この発明の実施の形態２によれば、文書画像を表示するパーソナルコンピュータＰＣや携帯端末ＭＴの表示手段の特性に応じて、効率的な閲覧ナビゲート情報を生成することができる。 According to the second embodiment of the present invention, efficient browsing navigation information can be generated according to the characteristics of the display means of the personal computer PC or the portable terminal MT that displays the document image.

［その他の実施の形態］
上述の実施の形態１および２においては、本発明に係る処理がＭＦＰ１またはＭＦＰ１＃で実行される場合について説明したが、原稿３００を読取るための画像読取機能を備えたコンピュータにおいて上記処理が実行されてもよい。この場合には、コンピュータを文書処理装置として機能させるための図２や図１７に示された処理機能を実行させるプログラムを提供することもできる。このようなプログラムは、コンピュータに付属するフレキシブルディスク、ＣＤ−ＲＯＭ（Compact Disk-Read Only Memory）、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）およびメモリカードなどのコンピュータ読取り可能な記憶媒体にて記憶させて、プログラム製品として提供することもできる。あるいは、コンピュータに内蔵するハードディスクなどの記憶媒体にて記憶させて、プログラムを提供することもできる。また、ネットワークを介したダウンロードによって、プログラムを提供することもできる。 [Other embodiments]
In the first and second embodiments described above, the case where the processing according to the present invention is executed by the MFP 1 or MFP 1 # has been described, but the above processing is executed by a computer having an image reading function for reading the document 300. May be. In this case, a program for executing the processing functions shown in FIG. 2 and FIG. 17 for causing the computer to function as a document processing apparatus can be provided. Such a program is stored in a computer-readable storage medium such as a flexible disk attached to the computer, a CD-ROM (Compact Disk-Read Only Memory), a ROM (Read Only Memory), a RAM (Random Access Memory), and a memory card. And stored as a program product. Alternatively, the program can be provided by being stored in a storage medium such as a hard disk built in the computer. A program can also be provided by downloading via a network.

また、画像読取機能を他の装置またはコンピュータで実現した上で、生成された文書画像を受取って、上記のような処理に従って閲覧ナビゲート情報のみを生成してもよい。また、文書画像と閲覧ナビゲート情報とが同一の電子化文書に含まれる構成について例示したが、必ずしも同一の電子化文書に閲覧ナビゲート情報を付加しなくてもよく、別のファイルとして出力してもよい。 Alternatively, the image reading function may be realized by another device or a computer, and the generated document image may be received and only the browsing navigation information may be generated according to the above processing. In addition, the configuration in which the document image and the browsing navigation information are included in the same digitized document has been illustrated, but the browsing navigation information may not necessarily be added to the same digitized document, and is output as a separate file. May be.

なお、本発明にかかるプログラムは、コンピュータのオペレーティングシステム（ＯＳ）の一部として提供されるプログラムモジュールのうち、必要なモジュールを所定の配列で所定のタイミングで呼出して処理を実行させるものであってもよい。その場合、プログラム自体には上記モジュールが含まれずＯＳと協働して処理が実行される。このようなモジュールを含まないプログラムも、本発明にかかるプログラムに含まれ得る。 The program according to the present invention is a program module that is provided as a part of a computer operating system (OS) and calls necessary modules in a predetermined arrangement at a predetermined timing to execute processing. Also good. In that case, the program itself does not include the module, and the process is executed in cooperation with the OS. A program that does not include such a module can also be included in the program according to the present invention.

また、本発明にかかるプログラムは他のプログラムの一部に組込まれて提供されるものであってもよい。その場合にも、プログラム自体には上記他のプログラムに含まれるモジュールが含まれず、他のプログラムと協働して処理が実行される。このような他のプログラムに組込まれたプログラムも、本発明にかかるプログラムに含まれ得る。 The program according to the present invention may be provided by being incorporated in a part of another program. Even in this case, the program itself does not include the module included in the other program, and the process is executed in cooperation with the other program. Such a program incorporated in another program can also be included in the program according to the present invention.

提供されるプログラム製品は、ハードディスクなどのプログラム格納部にインストールされて実行される。なお、プログラム製品は、プログラム自体と、プログラムが記憶された記憶媒体とを含む。 The provided program product is installed in a program storage unit such as a hard disk and executed. Note that the program product includes the program itself and a storage medium in which the program is stored.

今回開示された実施の形態はすべての点で例示であって制限的なものではないと考えられるべきである。本発明の範囲は上記した説明ではなくて特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内でのすべての変更が含まれることが意図される。 The embodiment disclosed this time should be considered as illustrative in all points and not restrictive. The scope of the present invention is defined by the terms of the claims, rather than the description above, and is intended to include any modifications within the scope and meaning equivalent to the terms of the claims.

この発明の実施の形態１に従う文書処理装置を含むシステムの概略構成図である。1 is a schematic configuration diagram of a system including a document processing device according to a first embodiment of the present invention. この発明の実施の形態１に従うＭＦＰにおける概略構成を示すブロック図である。FIG. 2 is a block diagram showing a schematic configuration in the MFP according to the first embodiment of the present invention. この発明の実施の形態１に従うパーソナルコンピュータの概略構成を示すブロック図である。It is a block diagram which shows schematic structure of the personal computer according to Embodiment 1 of this invention. この発明の実施の形態１に従うパーソナルコンピュータにおける電子化文書の表示画面の一例を模式的に示した図である。It is the figure which showed typically an example of the display screen of the digitized document in the personal computer according to Embodiment 1 of this invention. この発明の実施の形態１に従うＭＦＰにおける機能構成を示すブロック図である。It is a block diagram showing a functional configuration in the MFP according to the first embodiment of the present invention. この発明の実施の形態１に従う電子化文書の生成処理の具体例を示すフローチャートである。It is a flowchart which shows the specific example of the production | generation process of the digitized document according to Embodiment 1 of this invention. 図６のステップＳ１０６における内容領域の抽出処理を説明するための図である。It is a figure for demonstrating the extraction process of the content area | region in FIG.6 S106. 図６のステップＳ１０８における閲覧パスの設定処理を説明するための図である。It is a figure for demonstrating the setting process of the browsing path in step S108 of FIG. 図７に示す文書画像から取得される各内容領域の属性情報の具体例を示す図である。It is a figure which shows the specific example of the attribute information of each content area acquired from the document image shown in FIG. 図６のステップＳ１１４におけるグループへの分類処理を説明するための図である。It is a figure for demonstrating the classification process to the group in step S114 of FIG. 図６のステップＳ１１６およびＳ１１８におけるしおりとしての適合度の評価処理およびグループの選択処理を説明するための図である。It is a figure for demonstrating the evaluation process of the adaptation as a bookmark in FIG.6 and the selection process of a group in S118. 図６のステップＳ１２０におけるグループ間の従属関係を決定する処理を説明するための図である。It is a figure for demonstrating the process which determines the dependency relationship between the groups in step S120 of FIG. 電子化文書生成部が生成する電子化文書のデータ構造の一例を示す図である。It is a figure which shows an example of the data structure of the digitized document which an digitized document production | generation part produces | generates. しおりデータ部のデータ構造の一例を示す図である。It is a figure which shows an example of the data structure of a bookmark data part. この発明の実施の形態１の変形例に従う処理を模式的に示した図である。It is the figure which showed typically the process according to the modification of Embodiment 1 of this invention. この発明の実施の形態２に従う携帯端末における表示領域を模式的に示す図である。It is a figure which shows typically the display area in the portable terminal according to Embodiment 2 of this invention. この発明の実施の形態２に従うＭＦＰにおける機能構成を示すブロック図である。FIG. 11 is a block diagram showing a functional configuration in the MFP according to the second embodiment of the present invention. この発明の実施の形態２に従うＭＦＰが生成する電子化文書のデータ構造の一例を示す図である。It is a figure which shows an example of the data structure of the digitized document which MFP produces according to Embodiment 2 of this invention. 図１８に示す行属性情報のデータ構造の具体例を示す図である。It is a figure which shows the specific example of the data structure of the row attribute information shown in FIG. この発明の実施の形態２に従う電子化文書の生成処理の具体例を示すフローチャートである。It is a flowchart which shows the specific example of the production | generation process of the digitized document according to Embodiment 2 of this invention. この発明の実施の形態２に従うクライアント端末における機能構成を示すブロック図である。It is a block diagram which shows the function structure in the client terminal according to Embodiment 2 of this invention. この発明の実施の形態２に従うしおりデータの生成処理の具体例を示すフローチャートである。It is a flowchart which shows the specific example of the production | generation process of the bookmark data according to Embodiment 2 of this invention. 図２２のステップＳ３０８における閲覧ページ領域の設定処理を説明するための図である。It is a figure for demonstrating the setting process of the browsing page area | region in step S308 of FIG. 図２３に示す行属性情報を用いて「出現網羅度」および「最大出現数」の具体例を求めた結果である。This is a result of obtaining specific examples of “appearance coverage” and “maximum number of appearances” using the row attribute information shown in FIG. 図２２のステップＳ３１４におけるしおりデータの付加処理を説明するための図である。It is a figure for demonstrating the addition process of bookmark data in step S314 of FIG.

Explanation of symbols

１，１＃ＭＦＰ、１２画像前処理部、１３画像バッファ部、１４圧縮処理部、１５電子化文書生成部、１６画像解析部、７データ生成部、１８送信部、１９画像処理部、２０行属性情報生成部、２１クライアントプログラム格納部、４０受信部、４１閲覧用アプリケーション、４２表示部、４３クライアントプログラム実行環境、４４機器情報格納部、５０受信部、５１閲覧用アプリケーション、５２表示部、５３クライアントプログラム実行環境、５４機器情報格納部、５５ＧＵＩ部、１００制御部、１０２メモリ部、１０４画像読取部、１０６プリント部、１０８通信インターフェイス部、１１０データ格納部、２０１ＣＰＵ、２０３内部バス、２０５ディスプレイ部、２０７通信インターフェイス部、２０９入力部、２１１ハードディスク部（ＨＤＤ）、２１３メモリ部、２１５ＣＤ−ＲＯＭドライブ、２１５ａＣＤ−ＲＯＭ、２１７ＦＤＤドライブ、２１７ａフレキシブルディスク、３００原稿、４００，４００Ａ電子化文書、４０２ヘッダ部、４０４文書画像部、４０６データ部、４０８フッタ部、４０９クライアントプログラム、４１０行属性情報、４２０文書画像、４２１，４２２，４２３ページ領域、４３０，４３０Ａ内容領域、４４０，４４０Ａ，４４０Ｂ閲覧パス、４６１，４６２，４６３，４６４，４６５，４６６，４６７，４８０データ欄、５００文書表示領域、５１０閲覧ナビゲート情報表示領域、５１２，５１４，５１６アイコン、ＰＣ，ＰＣ１，ＰＣ２，ＰＣ３パーソナルコンピュータ、ＳＲＶサーバ装置、ＭＴ携帯端末。 1, 1 # MFP, 12 image preprocessing unit, 13 image buffer unit, 14 compression processing unit, 15 digitized document generation unit, 16 image analysis unit, 7 data generation unit, 18 transmission unit, 19 image processing unit, 20 lines Attribute information generation unit, 21 Client program storage unit, 40 reception unit, 41 browsing application, 42 display unit, 43 client program execution environment, 44 device information storage unit, 50 reception unit, 51 browsing application, 52 display unit, 53 Client program execution environment, 54 device information storage unit, 55 GUI unit, 100 control unit, 102 memory unit, 104 image reading unit, 106 print unit, 108 communication interface unit, 110 data storage unit, 201 CPU, 203 internal bus, 205 Display unit, 207 Communication interface Face section, 209 input section, 211 hard disk section (HDD), 213 memory section, 215 CD-ROM drive, 215a CD-ROM, 217 FDD drive, 217a flexible disk, 300 manuscript, 400, 400A digitized document, 402 header section 404 document image portion, 406 data portion, 408 footer portion, 409 client program, 410 line attribute information, 420 document image, 421, 422, 423 page area, 430, 430A content area, 440, 440A, 440B browsing path, 461 , 462, 463, 464, 465, 466, 467, 480 Data column, 500 document display area, 510 browsing navigation information display area, 512, 514, 516 icon, PC, PC1, PC2, PC3 Naru computer, SRV server device, MT mobile terminal.

Claims

A document processing apparatus for generating an electronic document including a document image,
An acquisition means for extracting at least one content area from the document image and acquiring attribute information about the content area;
The attribute information includes position information indicating a position of the content area in the document image, and information generation means for generating browsing navigation information for specifying the position of the content area in the document image With
The information generating means
Classification means for classifying the at least one content area into at least one group based on the attribute information;
Evaluation means for evaluating each of the groups based on the position in the document image of the content area belonging to each group;
A document processing apparatus comprising: a selecting unit that selects a group that is the generation target of the browsing navigation information from the at least one group based on an evaluation result by the evaluating unit.

Image reading means for generating the document image by reading a document;
The document processing apparatus according to claim 1, further comprising: a document generation unit that generates the digitized document by adding the browsing navigation information to the document image.

The document image is divided into pages,
The document processing apparatus according to claim 1, wherein the evaluation unit evaluates each of the groups based on the number of appearances of the content area belonging to each group for each page.

The evaluation means gives a relatively high evaluation to a group in which the content area to which it belongs belongs to more pages,
The document processing apparatus according to claim 3, wherein the selection unit selects a group given a relatively high evaluation.

The evaluation unit is further configured to compare a group in which the maximum value of the number of appearances per page of the content area to which the user belongs is within a predetermined range with respect to a group in which the maximum value of the number of appearances is outside the predetermined range. The document processing apparatus according to claim 4, which gives a relatively high evaluation.

The information generation unit determines a dependency relationship between groups based on positions of the content areas included in the plurality of groups in the document image when the selection unit selects the plurality of groups. The document processing apparatus according to claim 1, further comprising a dependency relationship determining unit.

The document processing apparatus according to claim 1, wherein the content area includes at least one of a character string, a paragraph, a figure, a table, and a photograph.

A document processing method for generating an electronic document including a document image,
Extracting at least one content area from the document image and obtaining attribute information for the content area;
The attribute information includes position information indicating a position of the content area in the document image, and further includes generating browsing navigation information for specifying a position of the content area in the document image. ,
The step of acquiring the attribute information includes
Classifying the at least one content region into at least one group based on the attribute information;
Evaluating each of the groups based on a position within the document image of the content region belonging to each group;
Selecting a group for which the browsing navigation information is to be generated from the at least one group based on an evaluation result obtained by the evaluating step.

A document processing program for causing a computer to execute the document processing method according to claim 8.

A document processing apparatus for processing an electronic document including a document image,
The digitized document includes attribute information in which the type of the group to which the content area belongs and the position of the content area in the document image are defined in association with the content area included in the document image,
The document processing apparatus includes:
Information generating means for generating browsing navigation information for specifying the position of the content area in the document image;
Display means for displaying the document image together with the browsing navigation information,
The information generating means
Display characteristic acquisition means for acquiring display characteristics of the display means;
Area setting means for setting at least one browsing page area according to the display characteristics of the display means;
Evaluation means for evaluating each of the groups based on the number of appearances of the content area belonging to each group for each browsing page;
A document processing apparatus comprising: a selecting unit that selects a group that is the generation target of the browsing navigation information from the at least one group based on an evaluation result by the evaluating unit.

The evaluation means gives a relatively high evaluation to a group in which the content area to which it belongs belongs to more browsing pages,
The document processing apparatus according to claim 10, wherein the selection unit selects a group given a relatively high evaluation.

The evaluation means further compares the group in which the maximum number of appearances of the content area to which the content area belongs belongs within a predetermined range with a group in which the maximum number of appearances is outside the predetermined range The document processing apparatus according to claim 11, which gives a relatively high evaluation.

The information generation means further includes an adding means for adding information for specifying the page to the browsing page in which the content area does not appear as the browsing navigation information. The document processing apparatus according to any one of -12.

The information generation unit determines a dependency relationship between groups based on positions of the content areas included in the plurality of groups in the document image when the selection unit selects the plurality of groups. The document processing apparatus according to claim 10, further comprising a dependency relationship determining unit.

The document processing apparatus according to claim 10, wherein the evaluation unit changes a reference for evaluation according to a browsing environment.

A document processing method for processing an electronic document including a document image,
The digitized document includes attribute information in which the type of the group to which the content area belongs and the position of the content area in the document image are defined in association with the content area included in the document image,
The document processing method includes:
Generating browsing navigation information for specifying a position of the content area in the document image;
Displaying the document image together with the browsing navigation information on a display unit,
The step of generating the browsing navigation information includes:
Obtaining display characteristics of the display unit;
Setting at least one viewing page area according to the display characteristics of the display unit;
Evaluating each of the groups based on the number of appearances of each of the browsing pages of the content region belonging to each group;
A document processing method comprising: a selection step of selecting a group for which the browsing navigation information is to be generated from the at least one group based on an evaluation result obtained by the step of evaluating each of the groups.

A document processing program for causing a computer to execute the document processing method according to claim 16.