JP2000293671A

JP2000293671A - Method and device for image processing and storage medium

Info

Publication number: JP2000293671A
Application number: JP11101973A
Authority: JP
Inventors: Shigeo Nara; 茂雄奈良
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1999-04-09
Filing date: 1999-04-09
Publication date: 2000-10-20

Abstract

PROBLEM TO BE SOLVED: To analyze an inputted image and to acquire information which makes it possible to perform document editing by extracting a character string area from inputted image information and deciding whether the character string area is a character string area of a text according to the layout information of the extracted character string area. SOLUTION: Image data are read from an image reader (S11-1). The read image data are converted into a PAF(page analysis format) by using an OCR engine (S11-2). For instance, the read image data are subjected to area division, and a block shown by a rectangular frame are extracted. The separation processing of blocks for a text area from the other area blocks is performed (S11-3). Here, text blocks other than a block for a frame area are separated as blocks for a text area. Preparation processing of a text format is performed (S11-4). The format prepared here is the same as a document format prepared by normal word processor software, etc.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、入力した画像を解
析して文書編集に用い得る情報を取得する技術に関する
ものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a technique for analyzing an input image to obtain information that can be used for document editing.

【０００２】本発明は、画像からテキストへの変換に関
する技術である。[0002] The present invention relates to a technique for converting an image into text.

【０００３】[0003]

【従来の技術】近年、画像読み取り装置、ＯＣＲ（光学
的文字認識。ＯｐｔｉｃａｌＣｈａｒａｃｔｅｒＲ
ｅｃｏｇｎｉｔｉｏｎ）などの普及により、紙の原稿か
ら文字を読み取って文字コード化して文書ファイルを作
成し、さらにその文書ファイルを編集するという機会が
増えてきた。2. Description of the Related Art In recent years, image reading apparatuses, OCR (optical character recognition, Optical Character R).
Due to the spread of “ecognition” and the like, opportunities to read a character from a paper manuscript, convert it into a character code, create a document file, and further edit the document file have been increased.

【０００４】[0004]

【発明が解決しようとしている課題】しかし、画像読み
取り装置から読み込まれた画像データが、領域分割さ
れ、OCRされて作成された文書ファイルは、分割された領域が、それぞれ枠に変換されてしまって
いる。枠と枠の間で、文書としてのつながりがない。というものであった。従って、編集を行う際には、各領
域単位に領域内のみで編集を行わなければならず、非常
に効率が悪かった。However, the image data read from the image reading device is divided into regions, and the document file created by OCR is converted into a frame in which each divided region is converted into a frame. I have. There is no document connection between frames. It was that. Therefore, when performing editing, it is necessary to perform editing only in the area for each area, which is very inefficient.

【０００５】[0005]

【課題を解決するための手段】上記課題を解決する為
に、本発明は、入力した画像情報から文字列領域を抽出
し、前記抽出した文字列領域のレイアウト情報に従っ
て、当該文字列領域が本文の文字列領域であるか否かを
判断する画像処理方法、装置及び記憶媒体を提供する。According to the present invention, a character string region is extracted from input image information, and the character string region is converted into a text according to the layout information of the extracted character string region. An image processing method, apparatus, and storage medium for determining whether or not a character string area is included.

【０００６】上記課題を解決する為に、本発明は、好ま
しくは前記本文の文字列領域であると判断された領域
と、本文の文字列領域でないと判断された領域とを区別
して記憶する。In order to solve the above-mentioned problem, the present invention preferably stores the area determined to be the character string area of the text and the area determined not to be the character string area of the text separately.

【０００７】上記課題を解決する為に、本発明は、好ま
しくは前記記憶は、各領域から抽出した特徴情報を記憶
する。[0007] In order to solve the above problem, according to the present invention, preferably, the storage stores feature information extracted from each area.

【０００８】上記課題を解決する為に、本発明は、好ま
しくは前記記憶は、各々を識別し得る識別情報を付加し
て記憶する。[0008] In order to solve the above-mentioned problems, according to the present invention, the storage is preferably performed by adding identification information capable of identifying each of the storages.

【０００９】上記課題を解決する為に、本発明は、好ま
しくは前記文字列領域のレイアウト情報は、異なる領域
との位置関係とする。In order to solve the above-mentioned problem, in the present invention, preferably, the layout information of the character string area has a positional relationship with different areas.

【００１０】上記課題を解決する為に、本発明は、好ま
しくは前記文字列領域のレイアウト情報は、その文字列
領域に含まれる文字列の行数とする。In order to solve the above problem, in the present invention, preferably, the layout information of the character string area is the number of lines of the character string included in the character string area.

【００１１】上記課題を解決する為に、本発明は、入力
した画像情報から複数の画像領域を抽出し、前記抽出し
た画像領域から、本文領域とそれ以外の領域とに分類
し、前記分類に従って第一の記憶領域と第二の記憶領域
に分けて各領域から抽出した情報を記憶する画像処理方
法、装置及び記憶媒体を提供する。In order to solve the above problem, the present invention extracts a plurality of image regions from input image information, classifies the extracted image regions into a text region and other regions, and according to the classification. Provided are an image processing method, an apparatus, and a storage medium that store information extracted from each area separately into a first storage area and a second storage area.

【００１２】上記課題を解決する為に、本発明は、好ま
しくは前記画像情報はスキャナにより読み取った画像と
する。In order to solve the above-mentioned problems, the present invention is preferably arranged such that the image information is an image read by a scanner.

【００１３】上記課題を解決する為に、本発明は、好ま
しくは前記画像情報は回線を介して接続された他端末に
おいて入力した画像とする。[0013] In order to solve the above-mentioned problems, the present invention is preferably arranged such that the image information is an image input at another terminal connected via a line.

【００１４】上記課題を解決する為に、本発明は、好ま
しくは前記画像情報は回線を介して接続された他端末に
おいてスキャナにより読み取られた画像とする。In order to solve the above-mentioned problem, the present invention is preferably arranged such that the image information is an image read by a scanner at another terminal connected via a line.

【００１５】上記課題を解決する為に、本発明は、入力
した画像情報を解析して文字の配列を定める書式を作成
し、前記作成された書式に従って、前記入力した画像情
報を認識して得た文字を配置する文書処理方法、装置及
び記憶媒体を提供する。In order to solve the above-mentioned problems, the present invention analyzes input image information, creates a format for determining a character arrangement, and recognizes the input image information according to the created format. A document processing method, apparatus, and storage medium for arranging characters are provided.

【００１６】上記課題を解決する為に、本発明は、好ま
しくは前記解析する画像情報は、入力した画像から抽出
した本文文字列領域とする。In order to solve the above-mentioned problems, the present invention is preferably arranged such that the image information to be analyzed is a text string region extracted from an input image.

【００１７】上記課題を解決する為に、本発明は、好ま
しくは前記解析する画像情報は、入力した画像情報を領
域分割して得た複数の領域のうちの一つの領域とする。In order to solve the above-mentioned problem, the present invention preferably sets the image information to be analyzed to be one of a plurality of regions obtained by dividing input image information into regions.

【００１８】上記課題を解決する為に、本発明は、好ま
しくは前記書式は、入力した画像から抽出した領域の中
の文字列の配置を定めるものとする。In order to solve the above-mentioned problems, the present invention is preferably arranged such that the format defines an arrangement of character strings in an area extracted from an input image.

【００１９】上記課題を解決する為に、本発明は、好ま
しくは前記文字を配置した文書を表示画面に表示し、前
記表示画面上で編集を行なう。In order to solve the above-mentioned problems, the present invention preferably displays a document in which the characters are arranged on a display screen, and performs editing on the display screen.

【００２０】上記課題を解決する為に、本発明は、好ま
しくは前記文字を配置した文書を表示画面に表示する。In order to solve the above problems, the present invention preferably displays a document on which the characters are arranged on a display screen.

【００２１】上記課題を解決する為に、本発明は、好ま
しくは前記文字を配置した文書をプリンタにより印字す
る。In order to solve the above problems, the present invention preferably prints a document in which the characters are arranged by a printer.

【００２２】上記課題を解決する為に、本発明は、好ま
しくは前記配置する文字はフォントとする。In order to solve the above problem, according to the present invention, preferably, the characters to be arranged are fonts.

【００２３】上記課題を解決する為に、本発明は、好ま
しくは前記文字を配置した文書を編集し、前記編集され
た文書を前記文字を配置した書式に従って再配列する。In order to solve the above problems, the present invention preferably edits a document in which the characters are arranged, and rearranges the edited document according to a format in which the characters are arranged.

【００２４】上記課題を解決する為に、本発明は、好ま
しくは前記書式は、段組情報を含む。[0024] In order to solve the above problem, the present invention preferably provides that the format includes column information.

【００２５】上記課題を解決する為に、本発明は、好ま
しくは前記書式は、用紙サイズを含む。In order to solve the above-mentioned problem, the present invention preferably provides that the form includes a paper size.

【００２６】上記課題を解決する為に、本発明は、好ま
しくは前記書式は、余白情報を含む。In order to solve the above-mentioned problem, according to the present invention, preferably, the format includes margin information.

【００２７】上記課題を解決する為に、本発明は、好ま
しくは前記書式は、組み方向を含む。In order to solve the above-mentioned problem, the present invention preferably provides that the form includes an assembling direction.

【００２８】上記課題を解決する為に、本発明は、好ま
しくは前記書式は、文字サイズを含む。In order to solve the above problem, the present invention preferably provides that the format includes a character size.

【００２９】上記課題を解決する為に、本発明は、好ま
しくは前記書式は、１行あたりの文字数情報を含む。In order to solve the above problem, according to the present invention, preferably, the format includes information on the number of characters per line.

【００３０】上記課題を解決する為に、本発明は、好ま
しくは前記書式は、１領域あたりの行数情報を含む。In order to solve the above problem, according to the present invention, preferably, the format includes information on the number of lines per area.

【００３１】上記課題を解決する為に、本発明は、好ま
しくは前記書式は、書体情報を含む。In order to solve the above-mentioned problems, the present invention preferably provides that the format includes typeface information.

【００３２】上記課題を解決する為に、本発明は、入力
した画像情報から複数の画像領域を抽出し、前記抽出し
た複数の画像領域から枠領域を選択し、前記選択した枠
領域の画像情報から枠書式を作成する。In order to solve the above-mentioned problem, the present invention extracts a plurality of image areas from input image information, selects a frame area from the extracted plurality of image areas, and outputs image information of the selected frame area. Create a frame format from.

【００３３】上記課題を解決する為に、本発明は、好ま
しくは前記枠領域は、他の領域に重ねて配置するものと
する。In order to solve the above-mentioned problem, according to the present invention, preferably, the frame region is arranged so as to overlap another region.

【００３４】上記課題を解決する為に、本発明は、好ま
しくは前記枠領域は、本文領域の文字の組方向と異なる
組方向のテキスト領域とする。In order to solve the above-mentioned problem, according to the present invention, preferably, the frame region is a text region in a set direction different from the set direction of characters in the body region.

【００３５】[0035]

【発明の実施の形態】図1は本発明に係る装置のシステ
ムブロック図である。FIG. 1 is a system block diagram of an apparatus according to the present invention.

【００３６】システムバス１は、後述する各構成間での
データの授受を可能にするものである。ＣＰＵ（Ｃｅｎ
ｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）２は、後
述するＰＭＥＭ３に格納されている制御プログラムに従
って、本発明に係る各種処理を実行するものである。後
述するフローチャートに示す処理も、ＣＰＵ２により実
行される。ＰＭＥＭ（メモリ）３は、本発明に係る各種
処理の制御プログラムを、適宜ハードディスク１０から
選択して読み込み、格納するメモリである。後述するフ
ローチャートに示す処理の制御プログラムもこのＰＭＥ
Ｍ３に格納される。また、ＰＭＥＭ３はテキストメモリ
としても機能し、キーボード12から入力されたテキスト
データや、外部記憶制御部８の制御により外部記憶媒体
９やハードディスク１０から読み取ったテキストデータ
も格納する。通信制御部４は、通信ポート５における入
出力データの制御を行なう。通信ポート５は、通信制御
部４による制御の基、ＬＡＮやＷＡＮ等の通信回線６を
介してネットワーク上の他の装置の通信ポート７と接続
し、データの送信及び受信を行なう。ネットワークに接
続されているプリンタへの印字データの送出や、同じく
ネットワークに接続されているスキャナからのデータの
入力も、この通信ポート５を介して行なう。The system bus 1 enables data to be exchanged between components described below. CPU (Cen
The true processing unit 2 executes various processes according to the present invention in accordance with a control program stored in the PMEM 3 described later. The processing shown in the flowchart described later is also executed by the CPU 2. The PMEM (memory) 3 is a memory for appropriately reading, reading, and storing control programs for various processes according to the present invention from the hard disk 10 as appropriate. The control program for the processing shown in the flowchart described later is
It is stored in M3. The PMEM 3 also functions as a text memory, and stores text data input from the keyboard 12 and text data read from the external storage medium 9 or the hard disk 10 under the control of the external storage control unit 8. The communication control unit 4 controls input / output data at the communication port 5. The communication port 5 is connected to a communication port 7 of another device on the network via a communication line 6 such as a LAN or a WAN under the control of the communication control unit 4 to transmit and receive data. Transmission of print data to a printer connected to the network and input of data from a scanner also connected to the network are also performed through the communication port 5.

【００３７】外部記憶制御部８は、データファイル用の
メモリ、例えばハードディスク（ＨＤ）１０や本装置に
着脱可能な外部記憶媒体（例えばフロッピーディスクや
ＭＯ、ＣＤ−Ｒ等）９からのデータの読み込み及びデー
タの書きこみを制御する。入力制御部１１は、キーボー
ド１２やマウス１３等の入力装置からのデータの入力を
せいぎょする。ここで、マウス１３はＣＲＴ１６の表示
画面上で指示される位置の座標を入力する座標入力手段
として機能するものであれば良く、タブレットやタッチ
パネル等であっても良い。更に、指示した座標位置で、
ボタンの押下やタップ等により、選択指示を可能とする
ものであれば良い。マウスを操作することによって、マ
ウスカーソルで表示される指示位置を所望の位置とし、
例えばコマンドメニュー上のコマンドアイコン上に位置
させて、ボタンを押下することによって、そのコマンド
アイコンで表されているコマンドを入力することもでき
る。また、編集対象の指示及び描画位置の指示もマウス
13により可能である。操作者はキーボード１２を操作す
ることにより、文字コードの入力及び各種動作命令の入
力を行なう。The external storage control unit 8 reads data from a memory for data files, for example, a hard disk (HD) 10 or an external storage medium (for example, a floppy disk, MO, CD-R, etc.) that is removable from the apparatus. And control the writing of data. The input control unit 11 inputs data from input devices such as a keyboard 12 and a mouse 13. Here, the mouse 13 only needs to function as a coordinate input unit for inputting the coordinates of the position indicated on the display screen of the CRT 16, and may be a tablet, a touch panel, or the like. Furthermore, at the designated coordinate position,
Any device can be used as long as a selection instruction can be made by pressing a button, tapping, or the like. By operating the mouse, the designated position displayed by the mouse cursor is set to a desired position,
For example, a command represented by a command icon can be input by pressing the button while positioning the command icon on the command menu. Instructions for editing and drawing positions can also be made with the mouse.
13 is possible. The operator operates the keyboard 12 to input a character code and various operation commands.

【００３８】ビデオイメージメモリ（ＶＲＡＭ）１４
は、例えばＣＲＴ等の表示器に表示する画像を保持する
メモリであり、表示出力制御部１５の制御により、表示
データがビットマップデータに展開して書きこまれ、Ｃ
ＲＴ１６に表示される。表示器は、ＣＲＴに限定するも
のではなく、液晶表示器であっても良い。プリンタ制御
部１７は、接続されているプリンタ１８に対するデータ
の出力制御を行なう。プリンタ１８は、画像を印字する
手段として機能するものであって、ＬＢＰ、インクジェ
ットプリンタ等である。画像入力機器制御部１Ａは、接
続されている画像入力機器１Ｂの制御を行なう。画像入
力機器１Ｂは、載置した原稿を光学的に読みこむスキャ
ナであっても、或はフィルムスキャナであっても良い。
或は画像データを記憶したメモリから読み込む機能を実
行するものであっても良い。図1においては、プリンタ
制御部１７とプリンタ１８、及び画像入力機器１Ｂと画
像入力機器制御部１Ａとを別の構成として記載したが、
これらは物理的に別々のコンポーネントであっても、一
つのコンポーネントであっても良いことは勿論である。Video image memory (VRAM) 14
Is a memory for holding an image to be displayed on a display such as a CRT, for example. Under the control of the display output control unit 15, display data is developed and written into bitmap data.
Displayed on RT16. The display is not limited to the CRT, but may be a liquid crystal display. The printer control unit 17 controls output of data to the connected printer 18. The printer 18 functions as a unit for printing an image, and is an LBP, an inkjet printer, or the like. The image input device controller 1A controls the connected image input device 1B. The image input device 1B may be a scanner that optically reads a placed document or a film scanner.
Alternatively, it may execute a function of reading image data from a memory storing the image data. In FIG. 1, the printer control unit 17 and the printer 18 and the image input device 1B and the image input device control unit 1A are described as different configurations.
Of course, these may be physically separate components or one component.

【００３９】尚、本発明は、単体で構成される装置によ
り実現されても良いし、或はネットワークを介して接続
される複数の装置からなるシステムで構成されても良い
ことは勿論である。このネットワークのシステムの一例
を図２８に示し、以下説明する。It is to be noted that the present invention may, of course, be realized by a single device, or may be configured by a system including a plurality of devices connected via a network. An example of this network system is shown in FIG. 28 and will be described below.

【００４０】図２８は、プリンタをネットワークに接続
するためのネットワークボード（ＮＢ）１０１を、開放
型アーキテクチャをも持つプリンタ１０２へつなげた場
合のネットワークシステム構成図である。ＮＢ１０１は
ローカルエリアネットワーク（ＬＡＮ）１００へ、例え
ば同軸コネクタをもつＥｔｈｅｒｎｅｔインタフェース
１０Ｂａｓｅ−２や、ＲＪ−４５を持つ１０Ｂａｓｅ−
Ｔ等のＬＡＮインタフェースを介して接続されている。FIG. 28 is a block diagram of a network system in which a network board (NB) 101 for connecting a printer to a network is connected to a printer 102 having an open architecture. The NB 101 connects to a local area network (LAN) 100, for example, an Ethernet interface 10Base-2 having a coaxial connector or a 10Base- having an RJ-45.
It is connected via a LAN interface such as T.

【００４１】ＰＣ１０３、ＰＣ１０４、ＰＣ１１１、Ｐ
Ｃ１１２やＰＣ１１５等の複数のパーソナルコンピュー
タ（ＰＣ）がＬＡＮに接続されており、ネットワークオ
ペレーティングシステムの制御の下、これらのＰＣはＮ
Ｂ１０１と通信し、ネットワークに接続された各デバイ
スとして機能するようにすることができる。また、例え
ばＰＣ１０３を、ネットワークデバイス管理用ＰＣとし
て使用するように指定することができ、このＰＣ１０３
によってプリンタ１０２やＰＣ１０４にローカル接続さ
れたプリンタ１０５におけるプリント処理を制御しても
良い。PC 103, PC 104, PC 111, P
A plurality of personal computers (PCs) such as C112 and PC115 are connected to the LAN, and under the control of the network operating system, these PCs
It can communicate with B101 and function as each device connected to the network. Also, for example, the PC 103 can be designated to be used as a network device management PC.
The print processing in the printer 105 locally connected to the printer 102 or the PC 104 may be controlled.

【００４２】また、ＬＡＮ１００にファイルサーバＰＣ
１０４を接続し、このＰＣ１０４の表示画面上で入力さ
れた指示に応じてＬＡＮ１００を介してファイルサーバ
１０６にアクセスし、大容量（例えば１００億バイト）
のネットワークディスク１０７に記憶されたファイルか
らのデータの読み込み及びデータの書き込み、記憶を管
理する。ファイルサーバＰＣ１０４は、ファイル管理部
として、ＬＡＮメンバ間でデータのファイルの受信や、
記憶、キューイング、キャッシング、及び送信を行な
う。例えば、ＰＣ１０４自身やＰＣ１０３によって作ら
れたデータファイルは、ファイルサーバＰＣ１０４の制
御のもとファイルサーバ１０６へ送られ、ファイルサー
バ１０６はこれらのデータファイルを順に並べ、そして
プリントサーバ１０４からのコマンドに従って、並べら
れたデータファイルをプリンタ１１０へ送信する。A file server PC is connected to the LAN 100.
104, and accesses the file server 106 via the LAN 100 in accordance with an instruction input on the display screen of the PC 104, and has a large capacity (for example, 10 billion bytes).
The management of reading data from a file stored in the network disk 107 and writing and storing the data is managed. The file server PC 104 serves as a file management unit to receive data files between LAN members,
It performs storage, queuing, caching, and transmission. For example, a data file created by the PC 104 itself or the PC 103 is sent to the file server 106 under the control of the file server PC 104, and the file server 106 arranges these data files in order, and according to a command from the print server 104, The arranged data file is transmitted to the printer 110.

【００４３】スキャナサーバ１１５は、ローカル接続さ
れたスキャナ１１７や、遠隔にあるスキャナ１１０を制
御して画像の入力を行なわせる。複写機１１８は、イメ
ージプロセッシングユニット１１９を介してＬＡＮ１１
０につながり、ネットワーク１１０を介して接続される
ＰＣの制御のもと、スキャナやプリンタとしても機能す
る。The scanner server 115 controls the locally connected scanner 117 and the remote scanner 110 to input an image. The copying machine 118 is connected to the LAN 11 via the image processing unit 119.
0, and also functions as a scanner or printer under the control of a PC connected via the network 110.

【００４４】またＰＣ１０３とＰＣ１０４はそれぞれ、
データファイルの生成や、生成したデータファイルのＬ
ＡＮ１００への送信や、また、ＬＡＮ１００からのファ
イルの受信や、更にそのようなファイルの表示及び処理
を行なうことのできる通常のＰＣで構成される。尚、図
２８ではパーソナルコンピュータ機器を図示したが、ネ
ットワークソフトウェアを実行するのに適切であるよう
な、他のコンピュータ機器であっても良い。通常、ＬＡ
Ｎ１００やＬＡＮ１１０などのＬＡＮは、一つの建物内
の一つの階又は接続した複数の階でのユーザグループ等
の、幾分ローカルなユーザグループにサービスを提供す
るが、例えば、ユーザが他の建物や他県にいるなど、あ
るユーザが他のユーザから離れるに従って、ワイドエリ
アネットワーク（ＷＡＮ）を作っても良い。ＷＡＮは、
基本的には、いくつかのＬＡＮを高速度サービス総合デ
ジタルネットワーク（ＩＳＤＮ）電話線等の高速度デジ
タルラインで接続して形成された集合体である。従っ
て、図**に示すように、ＬＡＮ１００と、ＬＡＮ１１０
と、ＬＡＮ１２０とはバックボーン１４０を介して接続
されてＷＡＮを形成する。これらの接続は、数本のバス
による単純な電気的接続である。それぞれのＬＡＮは専
用のＰＣを含み、また、必ずしも必要なわけではない
が、通常はファイルサーバ及びプリントサーバを含む。
ＬＡＮ１００と、ＬＡＮ１１０と、ＬＡＮ１２０とに接
続されている機器は、ＷＡＮ接続を介して、他のＬＡＮ
の機器の機能にアクセスすることができる。PC 103 and PC 104 are respectively
Generation of data file and L of generated data file
It comprises a normal PC capable of transmitting to the AN 100, receiving a file from the LAN 100, and displaying and processing such a file. Although FIG. 28 illustrates a personal computer device, other computer devices that are suitable for executing network software may be used. Usually LA
LANs, such as N100 and LAN 110, provide services to somewhat local user groups, such as user groups on one floor or multiple connected floors in one building. A wide area network (WAN) may be created as a user moves away from another user, such as in another prefecture. WAN is
Basically, it is an aggregate formed by connecting several LANs with a high-speed digital line such as an integrated high-speed service digital network (ISDN) telephone line. Therefore, as shown in FIG.
And the LAN 120 via the backbone 140 to form a WAN. These connections are simple electrical connections with several buses. Each LAN includes a dedicated PC and, although not necessarily, usually includes a file server and a print server.
Devices connected to the LAN 100, the LAN 110, and the LAN 120 communicate with other LANs via the WAN connection.
Can access the functions of the device.

【００４５】図２は、装置（２−１）に画像入力機器１
Ｂの一例である画像読み取り装置（２−２）をつなげた
システムの斜視図である。FIG. 2 shows an image input device 1 in the device (2-1).
FIG. 2 is a perspective view of a system to which an image reading device (2-2) as an example of B is connected.

【００４６】次に、後述する「本文領域と他の領域との
分別方法」に重要な役割を果たすファイル形式であるPA
F（Page Analysis Format）について説明する。Next, a file format, PA, which plays an important role in a “method of separating a text region from other regions” described later.
F (Page Analysis Format) will be described.

【００４７】PAFとは、画像データをその画像データを
解析して得られる画像特徴に従い、同一の属性を有する
ひとかたまりの領域を一つの矩形として領域分割し、そ
の分割されたブロック（矩形領域）毎にデータ（ブロッ
ク領域を特定する為のデータと、そのブロックの画像の
特徴を表すデータとを含む）を保持する形式のファイル
フォーマットであり、ブロックの種類（テキストブロッ
ク、画像ブロック、ラインブロック、テーブルブロッ
ク）が異なれば、それぞれの種類に適した必要なデータ
を保持している。例えば、図３に示す画像データを画像
入力機器１Ｂから入力すると、図４に示すような領域分
割が行われる。例えば、４−１、４−２に示すようなテ
キストブロック（全てがテキストと判断されたブロッ
ク）、４−３に示すような画像ブロック、４−４に示す
ようなラインブロック、４−５に示すようなテーブルブ
ロック（表ブロック）等に領域分割され、各ブロックの
種類が判定される。The PAF is a method of dividing image data into a single rectangular area having a same attribute according to an image characteristic obtained by analyzing the image data, and for each divided block (rectangular area). Is a file format in which data (including data for specifying a block area and data representing characteristics of an image of the block) is stored, and the type of the block (text block, image block, line block, table) Block) is different and holds necessary data suitable for each type. For example, when the image data shown in FIG. 3 is input from the image input device 1B, the area division as shown in FIG. 4 is performed. For example, text blocks such as 4-1 and 4-2 (blocks all of which are determined to be text), image blocks such as 4-3, line blocks such as 4-4, and 4-5 The area is divided into the following table blocks (table blocks) and the like, and the type of each block is determined.

【００４８】図５を用いて、PAFの構造を更に詳細に説
明する。このＰＡＦのデータは、画像入力機器１Ｂから
入力した画像データをＣＰＵ２により解析してＨＤ１０
に格納されるものである。５−１がＰＡＦの全体であっ
て、先頭に「ヘッダ」領域を持ち、次に「ブロック管理
データ」、そしてブロック管理データによって管理され
る「ブロック内データ」とメモリ領域が続く。５−２に
示すように、「ヘッダ」領域には、ファイル全体のファ
イルサイズ、ファイル全体から抽出されたブロックの総
数を表わすブロック保有数、そして読み込まれた画像
（イメージ）の幅／高さをピクセル数で保持し、さらに
どのようなDPI（ｄｏｔ／ｉｎｃｈ）サイズで読み込ま
れた画像データであったかも保持している。The structure of the PAF will be described in more detail with reference to FIG. This PAF data is obtained by analyzing image data input from the image input device
Is stored in 5-1 is the entire PAF, having a “header” area at the beginning, followed by “block management data”, and “in-block data” managed by the block management data, and a memory area. As shown in 5-2, in the “header” area, the file size of the entire file, the number of blocks held indicating the total number of blocks extracted from the entire file, and the width / height of the read image (image) are displayed. It holds the number of pixels, and also holds what DPI (dot / inch) size image data was read.

【００４９】「ブロック管理データ」領域には、各ブロ
ックごとにそのブロックの読み込まれたイメージにおけ
る位置を表わすレイアウト情報や、そのブロックの画像
の属性情報を格納する。即ち、ＰＡＦデータはヘッダ５
−１内のブロック数に相当する数のブロック管理データ
領域を有する。用紙左からの距離、用紙上からの距離、
及びそれぞれのブロックの領域の横／縦サイズをピクセ
ル数で保持し、さらにブロックがどのような種類のデー
タを保持しているかを示すテーブルも持っている。ここ
までがどの種類のブロックでも共通に格納しているデー
タであり、以降がブロックの種類によって異なる。例え
ば、５−３に示す「テキストブロック管理データ」の場
合は、ブロック内データの存在位置を示すオフセットを
持ち、さらにそのテキストの言語情報（ブロック内のテ
キストは、日本語主体なのか英語主体なのか）、テキス
トの組方向（横書きなのか縦書きなのか）を持ってい
る。５−４に示す「画像ブロック管理データ」の場合
は、ブロック内画像データの存在位置を示すオフセット
を持ち、５−５に示す「ラインブロック管理データ」の
場合は、ラインのスタートとエンド位置を示すそれぞれ
の（X、Y）ポジション、及びラインの種類や太さを示す
テーブルを持っている。The "block management data" area stores, for each block, layout information indicating the position of the block in the read image and attribute information of the image of the block. That is, the PAF data is header 5
The number of block management data areas is equal to the number of blocks within -1. Distance from the left of the paper, distance from the top of the paper,
In addition, the table holds the horizontal / vertical size of the area of each block in the number of pixels, and further has a table indicating what kind of data the block holds. The data up to this point is the data that is commonly stored in any type of block, and the following differs depending on the type of block. For example, in the case of “text block management data” shown in 5-3, an offset indicating the location of the data in the block is provided, and the language information of the text (text in the block is mainly Japanese or English) ), And the direction of the text (horizontal or vertical). In the case of "image block management data" shown in 5-4, an offset indicating the location of the image data in the block is provided. In the case of "line block management data" shown in 5-5, the start and end positions of the line are set. It has a table showing each (X, Y) position shown, and the type and thickness of the line.

【００５０】「ブロック内データ」領域は、それぞれの
ブロック内データの種類により異なっている。例えば
「テキストブロック内データ」領域の場合は、対応づけ
られているブロック解離データにより規定されているブ
ロックの画像データを文字認識した結果のテキストデー
タを格納する領域であって、文字コード列と、各文字画
像を抽出した時に得た文字や行等の配置を表わすレイア
ウト情報とを含むものである。５−６に示すように、先
頭にOCR管理データを持ち、次に１行目から最終行まで
の行情報領域、さらに１行目の先頭文字から最終行の最
終文字までの文字情報領域を保持している。OCR管理デ
ータ５−７は、ブロック内の全行数、平均された行間値
（それぞれの行間隔を足して、行間数で割ったもの）、
及びブロックの横／縦サイズをピクセル数で保持してい
る。行情報５−８は、各行ごとの文字のレイアウト情報
等を格納する領域であって、行内に存在する全文字数、
先頭文字までのインデント値、さらに平均された文字間
値（それぞれの文字間隔を足して、文字間数で割ったも
の）、及び行末に改行が存在しているか否かを示すフラ
グを保持している。この改行が存在しているか否かの判
定は、その行の末尾からブロック領域の端までの距離に
よって行なう。文字情報５−９は、それぞれの文字毎
に、その文字の文字画像を抽出した位置（ブロックの左
／上からの距離）、文字幅、文字高、フォントタイプ、
フォントスタイル、フォントサイズ、そしてOCRした結
果の文字コードを保持している。この結果の文字コード
は、認識処理時の類似度が最高のものを１つのみでも良
いし、或は複数の候補文字のコードであっても良い。The "data in block" area differs depending on the type of data in each block. For example, in the case of the “data in a text block” area, an area for storing text data as a result of character recognition of image data of a block specified by the associated block dissociation data, and a character code string; Layout information indicating the arrangement of characters, lines, and the like obtained when each character image is extracted. As shown in 5-6, it has OCR management data at the beginning, and then holds the line information area from the first line to the last line, and further holds the character information area from the first character to the last character of the last line of the first line are doing. The OCR management data 5-7 includes the total number of lines in the block, an average line spacing value (a value obtained by adding each line interval and dividing by the number of lines),
And the horizontal / vertical size of the block in the number of pixels. The line information 5-8 is an area for storing character layout information and the like for each line, and includes the total number of characters existing in the line,
Holds the indentation value up to the first character, the average character spacing value (the sum of each character spacing and dividing by the number of characters), and a flag indicating whether there is a line feed at the end of the line. I have. The determination as to whether or not a line break exists is made based on the distance from the end of the line to the end of the block area. The character information 5-9 includes, for each character, a position (distance from the left / top of the block) where the character image of the character is extracted, a character width, a character height, a font type,
Holds the font style, font size, and character code resulting from OCR. The resulting character code may be the one having the highest similarity at the time of the recognition processing, or may be the code of a plurality of candidate characters.

【００５１】次に原稿を画像読み取り装置２−２から読
み込んだ画像データをどのように本文領域と他の領域と
に分別するかの処理を図１１、図１２、図１３、図１
４、図１５、図１６のフローチャートに従って説明す
る。Next, a process for separating the image data read from the image reading apparatus 2-2 into a text region and other regions will be described with reference to FIGS.
4, description will be given according to the flowcharts of FIGS.

【００５２】まず、ステップ１１−１において、画像読
み取り装置２−２から画像データを読み込む。さらにス
テップ１１−２において、読み込んだ画像データをOCR
エンジンを用いてPAF(Page Analysis Format)に変換す
る。例えば、図６に示すような原稿を画像読み取り装置
２−２に載置して読み取った場合の画像データは、領域
分割が行われて図７に矩形枠で示すようなブロックが抽
出される。７−１、７−２、７−３、７−４、７−５に
示すブロックは、テキストブロック、即ち文字列のみを
含む領域であり、組方向は、横書きである。７−６に示
すブロックもテキストブロックであるが、組方向は縦書
きである。７−７、７−８に示すのが、ラインブロック
であると、領域分割処理において判定が行なわれ、図５
に示したようなＰＡＦデータを作成してＨＤ１０に格納
される。First, in step 11-1, image data is read from the image reading device 2-2. Further, in step 11-2, the read image data is
Convert to PAF (Page Analysis Format) using engine. For example, the image data obtained when a document as shown in FIG. 6 is placed on the image reading device 2-2 and read is divided into regions, and blocks as shown by rectangular frames in FIG. 7 are extracted. Blocks 7-1, 7-2, 7-3, 7-4, and 7-5 are text blocks, that is, areas including only character strings, and the set direction is horizontal writing. The block shown in 7-6 is also a text block, but the set direction is vertical writing. If the line blocks shown in 7-7 and 7-8 are line blocks, a determination is made in the area division processing.
Is created and stored in the HD 10.

【００５３】PAFに変換後、ステップ１１−３におい
て、本文領域用のブロックと他の領域ブロックの分別処
理を行うために「本文用ブロックと他のブロックの分別
処理」を実行する。After the conversion into the PAF, in step 11-3, "separation processing between text blocks and other blocks" is executed in order to discriminate the text area blocks from other area blocks.

【００５４】図１２に示すフローチャートは、「本文用
ブロックと他のブロックの分別処理」（ステップ１１−
３）の詳細な処理を示すものである。ステップ１２−１
において、まず全テキストブロックの全てのフォントサ
イズ（５−９に示す文字情報のフォントサイズ）を調査
し、最も数の多いフォントサイズを求め、ＨＤ１０に最
多フォントサイズ情報としてそのフォントサイズを記憶
する。そして、ステップ１２−２において、ヘッダ領域
用のテキストブロックを抽出するために「ヘッダ領域用
テキストブロックの抽出処理」を実行する。The flowchart shown in FIG. 12 shows a "separation process between a text block and another block" (step 11-).
It shows the detailed processing of 3). Step 12-1
First, all the font sizes of all the text blocks (the font sizes of the character information shown in 5-9) are examined, the font size having the largest number is obtained, and the font size is stored in the HD 10 as the most font size information. Then, in step 12-2, "header area text block extraction processing" is executed to extract a header area text block.

【００５５】図１３に示すフローチャートが、「ヘッダ
領域用テキストブロックの抽出処理」（ステップ１２−
２）の詳細な処理を示すものであり、ステップ１３−１
において、PAF内のブロックのうち、Ｙポジション（５
−３に示す「テキストブロック管理データ」の用紙上か
らの距離）の最も小さなテキストブロックを取得する。
同じＹポジションのテキストブロックが存在する場合
は、Ｘポジション（５−３に示す「テキストブロック管
理データ」の用紙左からの距離）の小さなブロックを取
得する。さらにステップ１３−２において、そのテキス
トブロックの組方向（５−３に示す「テキストブロック
管理データ」の組方向）を調査し、横書きかどうかを判
断する。横書きでなかった場合は、ヘッダ領域用ブロッ
クではないと判断し、「ヘッダ領域用テキストブロック
の抽出処理」を終了する。横書きであった場合は、ステ
ップ１３−３において、ブロック内の行数（５−７に示
すOCR管理データの行数）を調査し、１行のみのブロッ
クか否かを判断する。２行以上のブロックであった場合
は、ヘッダ領域用ブロックではないと判断し、「ヘッダ
領域用テキストブロックの抽出処理」を終了する。１行
のみのブロックであった場合は、ステップ１３−４にお
いて、図１２のステップ１２−１で求め記憶した最多フ
ォントサイズと比較し、そのブロックの平均フォントサ
イズが、全テキストブロックの最多フォントサイズの
１．５倍以下かどうかを判断する。１．５倍以下でなか
った場合は、ヘッダ領域用ブロックではないと判断し、
「ヘッダ領域用テキストブロックの抽出処理」を終了す
る。１．５倍以下であった場合は、そのテキストブロッ
クは、ヘッダ領域用テキストブロックであると判断し、
ＨＤ１０にヘッダ領域用テキストブロックとしてそのブ
ロックの識別情報を記憶する。これで、最初のヘッダ領
域用テキストブロック（ヘッダ領域用テキストブロック
の第１要素とする）が求められた。次にステップ１３−
６において、Ｙポジション（５−３に示す「テキストブ
ロック管理データ」の用紙上からの距離）の次に小さな
テキストブロック（同じＹポジションのブロックが存在
した場合は、Ｘポジションの小さなブロック）を取得
し、ステップ１３−７において、そのブロックが、ステ
ップ１３−５で記憶したテキストブロックの２行分の範
囲に存在しているブロックかを判断する。例えば、図８
の８−１に示すテキストブロックが、ヘッダ領域用テキ
ストブロックの第１要素であるとしてステップ１３−５
で記憶された場合、その２行分の範囲とは、テキストブ
ロック８−１のブロック縦サイズ５−７を２倍したサイ
ズをテキストブロック８−１の用紙上からの距離５−４
を起点にして定めた範囲であって、８−２と８−３に示
す線分間の範囲である。従って、８−４に示すテキスト
ブロックは、範囲内であり、８−５に示すテキストブロ
ックは、範囲外となる。即ち、ステップ１３−７では、
定めた範囲内にステップ１３−６で取得したブロックが
完全に含まれるものを「範囲内である」と判定するもの
である。ステップ１３−６において、取得したテキスト
ブロックが範囲外と判断された場合は、ヘッダ領域用の
テキストブロックであって、未判定のブロックはもう存
在しないと判断し、「ヘッダ領域用テキストブロックの
抽出処理」を終了する。ステップ１３−７において、ヘ
ッダ領域用テキストブロックの第１要素の２行分の範囲
内と判断された場合は、ステップ１３−８において、そ
のテキストブロックの組方向（５−３に示す「テキスト
ブロック管理データ」の組方向）を調査し、横書きかど
うかを判断する。横書きでなかった場合は、ヘッダ領域
用ブロックではないと判断し、再びステップ１３−６に
戻り、次に対象となるテキストブロックを取得する。横
書きであった場合は、ステップ１３−９において、ブロ
ック内の行数（５−７に示すOCR管理データの行数）を
調査し、１行のみのブロックかを判断する。２行以上の
ブロックであった場合は、ヘッダ領域用ブロックではな
いと判断し、再びステップ１３−６に戻り、次に対象と
なるテキストブロックを取得する。１行のみのブロック
であった場合は、ステップ１３−１０において、ステッ
プ１３−５にて記憶したヘッダ領域用テキストブロック
の平均フォントサイズと当テキストブロックの平均フォ
ントサイズを比較し、サイズの差が２０％以内かどうか
を判断する。サイズの差が２０％以内でなかった場合
は、再びステップ１３−６に戻り、次に対象となるテキ
ストブロックを取得する。サイズの差が２０％以内であ
った場合は、ステップ１３−１１において、そのテキス
トブロックをヘッダ領域用テキストブロックとして、Ｈ
Ｄ１０にそのブロックの識別情報を記憶する。The flowchart shown in FIG. 13 shows the "extraction process of the header area text block" (step 12-).
This shows the detailed processing of 2), and is performed in step 13-1.
At the Y position (5
The text block having the smallest “distance from the paper of the“ text block management data ”indicated by -3) is acquired.
If there is a text block at the same Y position, a small block at the X position (the distance from the left of the sheet of “text block management data” shown in 5-3) is acquired. Further, in step 13-2, the composition direction of the text block (the composition direction of the “text block management data” shown in 5-3) is checked to determine whether or not the text is written horizontally. If it is not horizontal writing, it is determined that the block is not a header area block, and the “header area text block extraction processing” ends. In the case of horizontal writing, in step 13-3, the number of lines in the block (the number of lines of the OCR management data shown in 5-7) is examined, and it is determined whether or not the block has only one line. If it is a block of two or more lines, it is determined that the block is not a header area block, and the “header area text block extraction processing” ends. If the block has only one line, in step 13-4, the average font size of the block is compared with the maximum font size obtained and stored in step 12-1 of FIG. Is determined to be 1.5 times or less. If not less than 1.5 times, it is determined that the block is not a header area block,
The “text block extraction processing for header area” ends. If it is less than 1.5 times, the text block is determined to be a header area text block,
The identification information of the block is stored in the HD 10 as a text block for the header area. Thus, the first header area text block (the first element of the header area text block) is obtained. Next, step 13-
In step 6, a text block next to the Y position (the distance from the sheet of the “text block management data” shown in 5-3) is acquired (if there is a block at the same Y position, a small block at the X position) is acquired. Then, in step 13-7, it is determined whether the block exists in the range of two lines of the text block stored in step 13-5. For example, FIG.
It is assumed that the text block shown in 8-1 is the first element of the text block for the header area and the step 13-5
The range of the two lines is the size obtained by doubling the block vertical size 5-7 of the text block 8-1 by the distance 5-4 from the paper of the text block 8-1.
And the range defined by the line segments 8-2 and 8-3. Therefore, the text block shown in 8-4 is within the range, and the text block shown in 8-5 is outside the range. That is, in step 13-7,
A block in which the block acquired in step 13-6 is completely included in the determined range is determined to be “within the range”. If it is determined in step 13-6 that the obtained text block is out of the range, it is determined that the text block is a header area text block and an undetermined block no longer exists. Processing ”ends. If it is determined in step 13-7 that the value is within the range of two lines of the first element of the header area text block, in step 13-8, the text block combination direction ("text block" shown in 5-3) is set. Investigate the direction of management data) and determine whether it is horizontal writing. If it is not horizontal writing, it is determined that the block is not a header area block, and the process returns to step 13-6 to acquire the next target text block. In the case of horizontal writing, in step 13-9, the number of lines in the block (the number of lines of the OCR management data shown in 5-7) is examined to determine whether the block has only one line. If the block is a block of two or more lines, it is determined that the block is not a header area block, and the process returns to step 13-6 again to acquire the next target text block. If the block has only one line, in step 13-10, the average font size of the header area text block stored in step 13-5 and the average font size of the text block are compared. Determine if it is within 20%. If the size difference is not within 20%, the process returns to step 13-6 again to acquire the next target text block. If the size difference is within 20%, in step 13-11, the text block is set
The identification information of the block is stored in D10.

【００５６】上記処理により、１つあるいは複数のヘッ
ダ領域用テキストブロックが求められるヘッダ領域用テ
キストブロックの抽出処理（ステップ１２−２）が完了
する。With the above processing, the extraction processing of the header area text block for obtaining one or a plurality of header area text blocks (step 12-2) is completed.

【００５７】次に図１２のステップ１２−３において、
フッタ領域用のテキストブロックを抽出するための「フ
ッタ領域用テキストブロックの抽出処理」を実行する。Next, in step 12-3 of FIG.
A "footer area text block extraction process" for extracting a footer area text block is executed.

【００５８】図１４に示すフローチャートが、「フッタ
領域用テキストブロックの抽出処理」（ステップ１２−
３）の詳細な処理を示すものであり、ステップ１４−１
において、PAF内のブロックのうち、Ｙポジション（５
−３に示す「テキストブロック管理データ」の用紙上か
らの距離）の最も大きなテキストブロックを取得する。
同じＹポジションのテキストブロックが存在する場合
は、Ｘポジション（５−３に示す「テキストブロック管
理データ」の用紙左からの距離）の小さなブロックを取
得する。さらにステップ１４−２において、そのテキス
トブロックの組方向（５−３に示す「テキストブロック
管理データ」の組方向）を調査し、横書きかどうかを判
断する。横書きでなかった場合は、フッタ領域用ブロッ
クではないと判断し、「フッタ領域用テキストブロック
の抽出処理」を終了する。横書きであった場合は、ステ
ップ１４−３において、ブロック内の行数（５−７に示
すOCR管理データの行数）を調査し、１行のみのブロッ
クかを判断する。２行以上のブロックであった場合は、
フッタ領域用ブロックではないと判断し、「フッタ領域
用テキストブロックの抽出処理」を終了する。１行のみ
のブロックであった場合は、ステップ１４−４におい
て、図１２のステップ１２−１で求め記憶した最多フォ
ントサイズと比較し、そのブロックの平均フォントサイ
ズが、全テキストブロックの最多フォントサイズの１．
５倍以下かどうかを判断する。１．５倍以下でなかった
場合は、フッタ領域用ブロックではないと判断し、「フ
ッタ領域用テキストブロックの抽出処理」を終了する。
１．５倍以下であった場合は、そのテキストブロック
は、フッタ領域用テキストブロックであると判断し、Ｈ
Ｄ１０にフッタ領域用テキストブロックとしてそのブロ
ックの識別情報を記憶する。これで、最初のフッタ領域
用テキストブロック（フッタ領域用テキストブロックの
第１要素とする）が求められた。次にステップ１４−６
において、Ｙポジション（５−３に示す「テキストブロ
ック管理データ」の用紙上からの距離）の次に大きなテ
キストブロック（同じＹポジションのブロックが存在し
た場合は、Ｘポジションの小さなブロック）を取得す
る。ただし、ステップ１４−１或は以前のステップ１４
−６においてＹポジションが同じで処理が保留されてい
るブロックがある場合は、そのブロックのを取得する。
ステップ１４−７において、そのブロックが、ステップ
１４−６で記憶したテキストブロックの２行分の範囲に
存在しているブロックかを判断する。この２行分の範囲
の求め方はステップ１３−７に準ずるものであるが、範
囲を定める起点を第１の要素のブロックの用紙上からの
距離にブロック縦サイズを加算した距離とする点が異な
る。例えば、図８の８−６に示すテキストブロックが、
フッタ領域用テキストブロックの第１要素であった場
合、その２行分の範囲とは、８−７と８−８に示す線分
間である。従って、８−９、８−１０に示すテキストブ
ロックは、範囲内であり、８−１１に示すテキストブロ
ックは、範囲外となる。ステップ１４−６において、取
得したテキストブロックが範囲外と判断された場合は、
フッタ領域用のテキストブロックであって、未判定のブ
ロックはもう存在しないと判断し、「フッタ領域用テキ
ストブロックの抽出処理」を終了する。ステップ１４−
７において、フッタ領域用テキストブロックの第１要素
の２行分の範囲内と判断された場合は、ステップ１４−
８において、そのテキストブロックの組方向（５−３に
示す「テキストブロック管理データ」の組方向）を調査
し、横書きかどうかを判断する。横書きでなかった場合
は、フッタ領域用ブロックではないと判断し、再びステ
ップ１４−６に戻り、次に対象となるテキストブロック
を取得する。横書きであった場合は、ステップ１４−９
において、ブロック内の行数（５−７に示すOCR管理デ
ータの行数）を調査し、１行のみのブロックか否かを判
断する。２行以上のブロックであった場合は、フッタ領
域用ブロックではないと判断し、再びステップ１４−６
に戻り、次に対象となるテキストブロックを取得する。
１行のみのブロックであった場合は、ステップ１４−１
０において、ステップ１４−５にて記憶したフッタ領域
用テキストブロックの平均フォントサイズと当テキスト
ブロックの平均フォントサイズを比較し、サイズの差が
２０％以内かどうかを判断する。サイズの差が２０％以
内でなかった場合は、再びステップ１４−６に戻り、次
に対象となるテキストブロックを取得する。サイズの差
が２０％以内であった場合は、ステップ１４−１１にお
いて、そのテキストブロックをフッタ領域用テキストブ
ロックとして、ＨＤ１０にそのブロックの識別情報を記
憶する。上記処理により、１つあるいは複数のフッタ領
域用テキストブロックが求めるフッタ領域用テキストブ
ロックの抽出処理（ステップ１２−３）が完了する。The flowchart shown in FIG. 14 shows the "texter area extraction processing" (step 12-).
This shows the detailed processing of 3), and is step 14-1.
At the Y position (5
The text block with the largest “distance from the paper of“ text block management data ”indicated by -3) is acquired.
If there is a text block at the same Y position, a small block at the X position (the distance from the left of the sheet of “text block management data” shown in 5-3) is acquired. Further, in step 14-2, the composition direction of the text block (the composition direction of the “text block management data” shown in 5-3) is checked to determine whether or not it is horizontal writing. If it is not horizontal writing, it is determined that the block is not a footer area block, and the “footer area text block extraction processing” ends. In the case of horizontal writing, in step 14-3, the number of lines in the block (the number of lines of the OCR management data shown in 5-7) is examined, and it is determined whether the block has only one line. If the block is more than one line,
It is determined that the block is not a footer area block, and the “footer area text block extraction processing” ends. If the block is a block of only one line, in step 14-4, the average font size of the block is compared with the maximum font size obtained and stored in step 12-1 of FIG. 1.
It is determined whether it is 5 times or less. If it is not 1.5 times or less, it is determined that the block is not a footer area block, and the “footer area text block extraction processing” ends.
If it is 1.5 times or less, the text block is determined to be a footer area text block, and H
D10 stores the identification information of the block as a text block for the footer area. Thus, the first footer area text block (the first element of the footer area text block) is obtained. Next, step 14-6
, A text block next to the Y position (the distance from the sheet of “text block management data” shown in 5-3) (the smaller block at the X position if there is a block at the same Y position) is acquired. . However, step 14-1 or the previous step 14
If there is a block whose processing is suspended at the same Y position in -6, the block of that block is acquired.
In step 14-7, it is determined whether the block exists in the range of two lines of the text block stored in step 14-6. The method of obtaining the range for two rows is similar to step 13-7, except that the starting point for defining the range is the distance obtained by adding the block vertical size to the distance of the first element block from the sheet. different. For example, the text block shown at 8-6 in FIG.
In the case of the first element of the footer area text block, the range of the two lines is the line segment indicated by 8-7 and 8-8. Therefore, the text blocks shown in 8-9 and 8-10 are within the range, and the text blocks shown in 8-11 are outside the range. If it is determined in step 14-6 that the acquired text block is out of the range,
It is determined that there is no more undetermined block for the footer area text block, and the “footer area text block extraction process” ends. Step 14-
In step 7, if it is determined that the value is within the range of two lines of the first element of the footer area text block, the process proceeds to step 14-.
In step 8, the grouping direction of the text block (the grouping direction of the "text block management data" shown in 5-3) is checked to determine whether or not it is horizontal writing. If it is not horizontal writing, it is determined that the block is not a footer area block, and the process returns to step 14-6 again to acquire the next target text block. If it is horizontal writing, step 14-9
In step (1), the number of lines in the block (the number of lines of the OCR management data shown in 5-7) is examined to determine whether or not the block has only one line. If the block is two or more rows, it is determined that the block is not a footer area block, and step 14-6 is performed again.
To get the next target text block.
If the block is only one line, step 14-1
At 0, the average font size of the footer area text block stored in step 14-5 is compared with the average font size of the text block, and it is determined whether the size difference is within 20%. If the size difference is not within 20%, the process returns to step 14-6 again to acquire the next target text block. If the size difference is within 20%, in step 14-11, the text block is set as the footer area text block, and the identification information of the block is stored in the HD 10. With the above processing, the extraction processing (step 12-3) of the footer area text block obtained by one or more footer area text blocks is completed.

【００５９】次に図１２のステップ１２−４において、
枠領域用のブロックを抽出するための「枠領域用ブロッ
クの抽出処理」を実行する。図１５に示すフローチャー
トが、「枠領域用ブロックの抽出処理」（ステップ１２
−４）の詳細な処理を示すものである。Next, in step 12-4 of FIG.
A "frame area block extraction process" for extracting a frame area block is executed. The flowchart shown in FIG. 15 is described as “frame area block extraction processing” (step 12
4) shows the detailed processing of the above.

【００６０】ステップ１５−１において、既に求めて記
憶したヘッダ領域用テキストブロック及びフッタ領域用
テキストブロック以外の全テキストブロックから、主と
なる組方向を求め、本文領域用の組方向とする。主とな
る組方向は、以下のようにして求める。ヘッダ領域用テ
キストブロック及びフッタ領域用テキストブロック以外
の全テキストブロックのそれぞれの面積（５−３に示す
テキストブロック管理データのブロック横／縦サイズか
ら求める）をそれぞれの組方向（５−３に示すテキスト
ブロック管理データの組方向）別に合計し、面積の大き
い方を主となる組方向とする。In step 15-1, a main set direction is determined from all the text blocks other than the header area text block and the footer area text block which have been obtained and stored, and is set as the main area set direction. The main group direction is obtained as follows. The respective areas (determined from the block horizontal / vertical sizes of the text block management data shown in 5-3) of all the text blocks other than the header area text block and the footer area text block are shown in respective set directions (shown in 5-3). The sum is separately calculated for each text block management data), and the larger area is set as the main set direction.

【００６１】次にステップ１５−２において、PAF内の
ブロックのうち、既に求めて記憶したヘッダ領域用テキ
ストブロック及びフッタ領域用テキストブロック以外の
全テキストブロックの中で、Ｙポジション（５−３に示
す「テキストブロック管理データ」の用紙上からの距
離）の最も小さなブロックを取得する。同じＹポジショ
ンのブロックが存在する場合は、Ｘポジション（５−
３、５−４、５−５に示す「ブロック管理データ」の用
紙左からの距離）の小さなブロックを取得する。もう該
当するブロックが存在しないのであれば、ステップ１５
−３においてそれを判断し、「枠領域用ブロックの抽出
処理」を終了する。該当するブロックが存在する場合
は、ステップ１５−４において、そのブロックの種類
（５−３、５−４、５−５に示す「ブロック管理デー
タ」のブロックの種類）が画像であるか否かを判定し、
画像ブロックであった場合は、ステップ１５−５におい
て、画像枠として抽出してＨＤ１０にそのブロックの識
別情報と記憶する。画像ブロックでないと判定された場
合は、ステップ１５−６において、さらにブロックの種
類（５−３、５−４、５−５に示す「ブロック管理デー
タ」のブロックの種類）がテーブルであるか否かを判定
し、テーブルブロックであった場合は、ステップ１５−
７において、テーブル枠として抽出してＨＤ１０にその
ブロックの識別情報と記憶する。テーブルブロックでな
いと判定された場合は、ステップ１５−８において、さ
らにブロックの種類（５−３、５−４、５−５に示す
「ブロック管理データ」のブロックの種類）がラインで
あるか否かを判定し、ラインブロックであった場合は、
ステップ１５−９において、ラインデータとして抽出し
てＨＤ１０にそのブロックの識別情報と記憶する。ライ
ンブロックでないと判定された場合は、ステップ１５−
１０において、さらにブロックの種類（５−３、５−
４、５−５に示す「ブロック管理データ」のブロックの
種類）がテキストであるか否かを判定し、テキストブロ
ックであった場合は、ステップ１５−１１において、
「テキストブロックチェック処理」を実行する。Next, in step 15-2, among the blocks in the PAF, the Y position (5-3) is set in all the text blocks other than the text block for the header area and the text block for the footer area which have already been obtained and stored. The block with the smallest “distance from the paper of the“ text block management data ”shown) is acquired. If there is a block at the same Y position, the X position (5-
The smaller blocks (distance from the left of the sheet of “block management data” shown in 3, 5-4, and 5-5) are acquired. If there is no corresponding block, step 15
In step -3, this is determined, and the "frame area block extraction process" ends. If the corresponding block exists, in step 15-4, it is determined whether the type of the block (the type of the block of “block management data” shown in 5-3, 5-4, and 5-5) is an image. Judge,
If the block is an image block, the block is extracted as an image frame and stored in the HD 10 together with the identification information of the block in step 15-5. If it is determined that the block is not an image block, it is determined in step 15-6 whether the block type (block type of “block management data” shown in 5-3, 5-4, and 5-5) is a table. Is determined, and if it is a table block, step 15-
In step 7, the information is extracted as a table frame and stored in the HD 10 together with the identification information of the block. If it is determined that the block is not a table block, it is determined in step 15-8 whether the block type (block type of “block management data” shown in 5-3, 5-4, and 5-5) is a line. And if it is a line block,
In step 15-9, the data is extracted as line data and stored in the HD 10 together with the identification information of the block. If it is determined that the block is not a line block, step 15-
10, the type of block (5-3, 5-
It is determined whether the “block management data” shown in 4, 5-5 is a text block, and if it is a text block, in step 15-11,
Execute "text block check processing".

【００６２】図１６に示すフローチャートが「テキスト
ブロックチェック処理」（ステップ１５−１２）の詳細
な処理を示すものである。The flowchart shown in FIG. 16 shows the detailed processing of the "text block check processing" (steps 15-12).

【００６３】ステップ１６−１において、ブロック管理
データを用いてそのテキストブロックは、ヘッダ領域に
跨っているかどうかを判断し、跨っている場合は本文領
域になり得ないので、テキスト枠として抽出した旨をリ
ターンする（ステップ１６−６）。ヘッダ領域に跨って
いない場合は、ステップ１６−２において、フッタ領域
に跨っているかどうかを判断し、跨っている場合は、本
文領域になり得ないので、テキスト枠として抽出した旨
をリターンする（ステップ１６−６）。フッタ領域にも
跨っていない場合は、ステップ１６−３において、ステ
ップ１５−１で求め記憶した組方向と当テキストブロッ
クの組方向とを比較し、異なっている場合は、本文領域
になり得ないので、テキスト枠として抽出した旨をリタ
ーンする（ステップ１６−６）。同じ組方向であった場
合は、ステップ１６−４において、そのテキストブロッ
クの最も多いフォントサイズが、ステップ１２−１にお
いて求め記憶した全体のテキストブロックの最多フォン
トサイズの２倍以上かを判断し、２倍以上であった場合
は、テキスト枠として抽出した旨をリターンする（ステ
ップ１６−６）。２倍以上でなかった場合は、本文領域
として抽出した旨をリターンする（ステップ１６−
５）。In step 16-1, it is determined whether or not the text block extends over the header area using the block management data. If the text block does extend over the header area, the text block cannot be a text area. Is returned (step 16-6). If it does not straddle the header area, it is determined in step 16-2 whether it straddles the footer area. If it straddles, it cannot return to the body area, and it is returned that it has been extracted as a text frame ( Step 16-6). If the text block does not extend over the footer area, in step 16-3, the composition direction obtained and stored in step 15-1 is compared with the composition direction of the text block. Therefore, the fact that the text frame has been extracted is returned (step 16-6). If the orientations are the same, in step 16-4, it is determined whether the largest font size of the text block is at least twice the largest font size of the entire text block obtained and stored in step 12-1. If it is twice or more, it returns that it has been extracted as a text frame (step 16-6). If it is not twice or more, it returns that it has been extracted as a text region (step 16-).
5).

【００６４】ステップ１５−１２において、「テキスト
ブロックチェック処理」におけるステップ１６−５或は
ステップ１６−６でリターンされたデータを判断し、テ
キスト枠である旨を示すデータならば、ステップ１５−
１３において、ステップ１５−２で取得したテキストブ
ロックをテキスト枠として抽出してＨＤ１０にそのブロ
ックの識別情報と記憶する。In step 15-12, the data returned in step 16-5 or step 16-6 in the "text block check process" is judged, and if the data indicates that the frame is a text frame, step 15-
In step 13, the text block acquired in step 15-2 is extracted as a text frame, and the HD 10 stores the block identification information.

【００６５】上記の処理により、各ブロックを分類して
本文領域用と他の領域用のブロックに分別した記憶が完
了し、図１１のステップ１１−４における、本文書式の
作成を開始する。By the above-described processing, storage of the blocks classified into the blocks for the text area and the blocks for other areas is completed, and the creation of the text format in step 11-4 in FIG. 11 is started.

【００６６】図１２〜図１６のフローチャートに沿って
説明してきた「本文領域と他の領域との分別処理」（ス
テップ１１−３）を図６に示す画像データに対して実行
した場合の結果を具体的に説明する。The result of executing the “separation process between text region and other region” (step 11-3) described with reference to the flowcharts of FIGS. 12 to 16 is performed on the image data shown in FIG. This will be specifically described.

【００６７】ステップ１１−２において読み取った画像
データをOCRエンジンを用いてPAFに変換した結果の各ブ
ロック（図７）に対して「本文用ブロックと他のブロッ
クの分別処理」を実行すると、まずステップ１２−２に
おいて、ヘッダ領域用テキストブロックとして、図７の
７−１、７−２に示すテキストブロックが分別される。
続いてステップ１２−３において、フッタ領域用テキス
トブロックとして、図７の７−３、７−４、７−５に示
すテキストブロックが分別される。そして、ステップ１
２−４において、枠領域用ブロックとして、図７の７−
６に示すテキストブロック、７−７、７−８に示すライ
ンブロックが分別され、これらの分別されたブロック以
外の図９に示すテキストブロックが、本文領域用として
分別される。この結果、図９に示す１８個の各領域が本
文領域用ブロックとして分別された領域である。In step 11-2, when "block discrimination between text block and other blocks" is executed for each block (FIG. 7) obtained by converting the image data read into PAF using the OCR engine, first, In step 12-2, the text blocks shown in 7-1 and 7-2 in FIG. 7 are classified as the header area text blocks.
Subsequently, in step 12-3, the text blocks indicated by 7-3, 7-4, and 7-5 in FIG. 7 are classified as the footer area text blocks. And step 1
In 2-4, as a block for the frame area, 7- in FIG.
The text block shown in FIG. 6 and the line blocks shown in 7-7 and 7-8 are classified, and the text blocks shown in FIG. 9 other than these classified blocks are classified for the body region. As a result, each of the 18 areas shown in FIG. 9 is an area that is classified as a text area block.

【００６８】尚、図１３のフローチャートに示したヘッ
ダ領域用テキストブロックの抽出処理では、判定対象と
しているブロックがヘッダ領域用テキストであるか否か
を判定する基準の一つとして、ステップ１３−４におい
てそのブロックの平均フォントサイズが全テキストブロ
ックの最多フォントサイズの１．５倍以下か否かという
判断を行なったが、そのブロックの平均フォントサイズ
をユーザにより指定された数値と比較するようにしても
良い。この場合のヘッダ領域用テキストブロックの抽出
処理のフローチャートを図１７に示す。ステップ１３−
４に替えて、ステップ１７−４において、そのブロック
の平均フォントサイズがユーザによって指定されたポイ
ントサイズ以下か否かを判断する。指定されたポイント
サイズ以下でなかった場合は、ヘッダ領域用ブロックで
ないと判断し、「ヘッダ領域用テキストブロックの抽出
処理」を終了する。指定されたポイントサイズ以下であ
ると判断された場合は、そのテキストブロックはヘッダ
領域用テキストブロックであると判断し、ＨＤ１０にヘ
ッダ領域用テキストブロックとしてそのブロックの識別
情報を記憶する。図１７のフローチャートにおいて、ス
テップ１７−４以外の処理は図１３のフローチャートに
示した処理と同じであり、同ステップには同ステップ番
号を付した。In the header area text block extraction processing shown in the flow chart of FIG. 13, one of the criteria for determining whether or not the block to be determined is the header area text is step 13-4. In the above, it was determined whether or not the average font size of the block was 1.5 times or less the maximum font size of all text blocks, but the average font size of the block was compared with a numerical value specified by the user. Is also good. FIG. 17 shows a flowchart of the header area text block extraction process in this case. Step 13-
In step 17-4, it is determined whether or not the average font size of the block is equal to or smaller than the point size specified by the user. If it is not smaller than the designated point size, it is determined that the block is not a header area block, and the “header area text block extraction processing” ends. If it is determined that the text block is smaller than the designated point size, it is determined that the text block is a header area text block, and the identification information of the block is stored in the HD 10 as a header area text block. In the flowchart of FIG. 17, the processing other than step 17-4 is the same as the processing shown in the flowchart of FIG. 13, and the same steps are denoted by the same step numbers.

【００６９】また、ヘッダ領域用テキストブロックの抽
出処理に合わせ、フッタ領域用テキストブロックの抽出
においてもユーザにより指定された数値との比較ステッ
プに置き替える。この処理を示すのが図１８のフローチ
ャートであって、図１４のフローチャートにおけるステ
ップ１４−４に替えてステップ１８−４を実行する。ス
テップ１８−４において、そのブロックの平均フォント
サイズが、ユーザによって指定されたポイントサイズ以
下か否かを判断する。ユーザによって指定されたポイン
トサイズ以下でなかった場合は、フッタ領域用ブロック
ではないと判断し、「フッタ領域用テキストブロックの
抽出処理」を終了する。ユーザによって指定されたポイ
ントサイズ以下であった場合は、そのテキストブロック
は、フッタ領域用ブロックであると判断し、ＨＤ１０に
フッタ領域用テキストブロックとしてそのブロックの識
別情報を記憶する。図１８のフローチャートにおいて、
ステップ１８−４以外の処理は図１４のフローチャート
に示した処理と同じであり、同ステップには同ステップ
番号を付した。尚、ステップ１７−４及びステップ１８
−４で用いるユーザにより指定されたポイントサイズと
は、これらの処理が実行される以前にＨＤ１０にユーザ
指定ポイントサイズのパラメータとして予め記憶された
データであって、キーボード１２或はマウス１３より入
力された数値である。In accordance with the header area text block extraction processing, the footer area text block extraction is replaced with a comparison step with a numerical value designated by the user. This processing is shown in the flowchart of FIG. 18, and step 18-4 is executed instead of step 14-4 in the flowchart of FIG. In step 18-4, it is determined whether or not the average font size of the block is equal to or smaller than the point size specified by the user. If it is not smaller than the point size designated by the user, it is determined that the block is not a footer area block, and the “footer area text block extraction process” ends. If it is smaller than the point size specified by the user, the text block is determined to be a footer area block, and the identification information of the block is stored in the HD 10 as a footer area text block. In the flowchart of FIG.
The processing other than Step 18-4 is the same as the processing shown in the flowchart of FIG. 14, and the same steps are denoted by the same step numbers. Steps 17-4 and 18
The point size designated by the user used in -4 is data stored in advance in the HD 10 as a parameter of the user designated point size before these processes are executed, and is input from the keyboard 12 or the mouse 13. It is a numerical value.

【００７０】次に、図１６のフローチャートに示した
「テキストブロックチェック処理」におけるステップ１
６−４において、本文領域として抽出するか否かを判定
する基準の一つとして２倍というパラメータを用いた処
理に替えて、ユーザにより指定された数値を用いる処理
について説明する。その処理を示すのが図１９のフロー
チャートであって、ステップ１９−４においてそのテキ
ストブロックの最多フォントサイズが、ステップ１２−
１において求めて記憶した全体のテキストブロックの最
多フォントサイズにユーザが指定した倍率をかけた値以
上か否かを判断する。ここで指定倍以上であると判断さ
れた場合は、テキスト領域として抽出した旨をリターン
し（ステップ１６−６）、指定倍以上でないと判断され
た場合は、本文領域として抽出した旨をリターンする
（ステップ１６−５）。Next, step 1 in the "text block check process" shown in the flowchart of FIG.
In 6-4, a description will be given of a process using a numerical value designated by a user instead of a process using a parameter of twice as one of the criteria for determining whether or not to extract a text region. FIG. 19 is a flowchart showing this processing. In step 19-4, the maximum font size of the text block is determined in step 12-
It is determined whether or not a value obtained by multiplying the maximum font size of the entire text block obtained and stored in step 1 by the magnification designated by the user is equal to or larger than the value. Here, if it is determined that the size is equal to or more than the specified size, the fact that the text area has been extracted is returned (step 16-6). If it is determined that the size is not more than the specified size, the fact that the text area has been extracted is returned. (Step 16-5).

【００７１】図１９のフローチャートにおいて、ステッ
プ１９−４以外の処理は図１６のフローチャートに示し
た処理と同じであり、同ステップには同ステップ番号を
付した。ステップ１９−４で用いるユーザにより指定さ
れた倍率とは、これらの処理が実行される前にＨＤ１０
にユーザ指定倍率のパラメータとして予め記憶されたデ
ータであって、キーボード１２或はマウス１３より入力
された数値である。In the flow chart of FIG. 19, the processing other than step 19-4 is the same as the processing shown in the flow chart of FIG. 16, and the same steps are denoted by the same step numbers. The magnification specified by the user used in step 19-4 is the value of HD10 before these processes are executed.
Is data stored in advance as a parameter of a user-specified magnification, and is a numerical value input from the keyboard 12 or the mouse 13.

【００７２】ここで、ステップ１１−４における本文書
式の作成処理について説明する。尚、ここで作成される
書式とは、通常のワープロソフト等で作成される文書の
書式と同様のものであって、その設定された位置から先
の文書のレイアウトを定めるものであり、後の編集作業
において改頁が挿入された場合には、次頁のレイアウト
も同じにするものである。即ち、その書式が作成された
時の領域サイズに関わらず、その後の文字の挿入や削除
に合わせて制御対象領域が伸縮するものである。Here, the process of creating the text format in step 11-4 will be described. Note that the format created here is the same as the format of a document created by ordinary word processing software, etc., and defines the layout of the preceding document from the set position. When a page break is inserted in the editing operation, the layout of the next page is made the same. That is, regardless of the area size when the format is created, the control target area expands and contracts in accordance with subsequent insertion or deletion of characters.

【００７３】図２０に示すフローチャートが、ステップ
１１−４における「本文書式の作成処理」の詳細な処理
ステップである。The flowchart shown in FIG. 20 is a detailed processing step of “text body format creation processing” in step 11-4.

【００７４】ステップ２０−１において、図５のPAFの
ヘッダ領域５−２のイメージ幅情報及びイメージ高さ情
報により表わされる処理対象としてスキャナより入力し
たイメージより大きくかつ最も近い定型用紙サイズ（A
３、A４、A５、Ｂ４、Ｂ５等であって、用紙の種類の識
別情報と各用紙の縦横サイズとを対応づけて予めＨＤ１
０に登録しておいたものとする。）を本文書式の用紙サ
イズに決定し、ＨＤ１０に記憶する。その際、イメージ
幅と決定した定型用紙の幅との差、及びイメージ高と定
型用紙の高さとの差も算出してＨＤ１０に記憶してお
く。In step 20-1, the standard paper size (A) larger and closest to the image input from the scanner as a processing target represented by the image width information and the image height information of the header area 5-2 of the PAF in FIG.
3, A4, A5, B4, B5, etc., in which the identification information of the paper type and the length and width
0 has been registered. ) Is determined as the paper size of the text format and stored in the HD 10. At this time, the difference between the image width and the determined width of the standard paper and the difference between the image height and the height of the standard paper are also calculated and stored in the HD 10.

【００７５】次にステップ２０−２において、本文領域
用として残ったテキストブロック群（図９）を包み込む
最小矩形を（図１０、枠９−１）本文領域として、左右
上下余白を決定する。この時、ステップ２０−１におい
て記憶したイメージ幅と決定した定型用紙幅との差、及
びイメージ高と決定した定型用紙高との差を考慮し、左
右上下余白値を算出し、ＨＤ１０に記憶する。例えば、
図１０の９−１に示す矩形が、本文領域用のテキストブ
ロック群を包み込む最小矩形であるので、この矩形領域
を本文領域とする。そして、９−２、９−３、９−４、
９−５に示すのが、それぞれ左右上下余白となるので、
本文書式として決定する左余白値には、９−２に示す入
力イメージにおける左端から、本文領域までの巾である
左余白＋（定型用紙幅−イメージ幅）／２を設定する。
同様に右余白値にも、９−３に示す入力イメージにおけ
る右端から本文領域までの巾である右余白＋（定型用紙
幅−イメージ幅）／２を設定する。また、上余白値に
は、９−４に示す入力イメージにおける上端から本文領
域までの高さである上余白＋（定型用紙高−イメージ
高）／２を設定する。同様に下余白値にも、９−５に示
す入力イメージにおける下端から本文領域までの高さで
ある下余白＋（定型用紙高−イメージ高）／２を設定す
る。Next, in step 20-2, the left, right, top and bottom margins are determined by setting the minimum rectangle enclosing the text block group (FIG. 9) remaining for the text area as the text area (FIG. 10, frame 9-1). At this time, in consideration of the difference between the image width stored in step 20-1 and the determined standard paper width and the difference between the image height and the determined standard paper height, left, right, top and bottom margin values are calculated and stored in the HD 10. . For example,
The rectangle shown at 9-1 in FIG. 10 is the minimum rectangle that wraps the text block group for the text region, and thus this rectangular region is defined as the text region. And 9-2, 9-3, 9-4,
9-5 shows the left, right, top and bottom margins, respectively.
As the left margin value determined as the text format, a left margin + (standard paper width−image width) / 2, which is the width from the left end of the input image shown in 9-2 to the text area, is set.
Similarly, the right margin value is set to right margin + (standard paper width−image width) / 2, which is the width from the right end to the body area in the input image shown in 9-3. Further, as the upper margin value, an upper margin + (standard paper height−image height) / 2, which is the height from the upper end to the body area in the input image shown in 9-4, is set. Similarly, for the lower margin value, lower margin + (standard paper height−image height) / 2, which is the height from the lower end to the body area in the input image shown in 9-5, is set.

【００７６】次にステップ２０−３において、図１５の
ステップ１５−１で記憶した主となる組方向を本文書式
の組方向としてＨＤ１０に記憶する。さらに、ステップ
２０−４において、本文領域用のテキストブロック全て
のフォントサイズ（５−９に示す文字情報のフォントサ
イズ）を調査し、最も数の多いフォントサイズを求め、
本文書式の文字サイズとしてＨＤ１０に記憶する。そし
て、ステップ２０−５において、本文領域用のテキスト
ブロック全ての平均文字間（５−８に示す行情報の平均
文字間）を調査し、最も数の多い平均文字間を求め、求
めた平均文字間と本文書式の文字サイズ（ステップ２０
−４においてＨＤ１０に記憶）とを足した値で本文領域
の幅を割った商を文字数／行に設定し、ＨＤ１０に記憶
する。さらに、ステップ２０−６において、書体に明朝
体を設定してＨＤ１０に記憶し、ステップ２０−７にお
いて、本文領域用のテキストブロック全ての平均行間
（５−７に示すOCR管理データの平均行間）を調査し、
最も数の多い平均行間を求め、求めた平均行間と本文書
式の文字サイズ（ステップ２０−４においてＨＤ１０に
記憶）を足した値で本文領域の高さを割った商を行数／
頁に設定し、ＨＤ１０に記憶する。Next, in step 20-3, the main set direction stored in step 15-1 in FIG. 15 is stored in the HD 10 as the set direction of the text format. Further, in step 20-4, the font size of all the text blocks for the body area (the font size of the character information shown in 5-9) is checked, and the font size with the largest number is obtained.
The text size is stored in the HD 10 as the character size. Then, in step 20-5, the average character spacing of all the text blocks for the body area (the average character spacing of the line information shown in 5-8) is checked, and the average character spacing with the largest number is determined. Character size of the space and text format (step 20
-4 is stored in the HD 10), and a quotient obtained by dividing the width of the body area by the value obtained by adding the value to the number of characters / line is stored in the HD 10. Further, in step 20-6, the font is set to the Mincho font and stored in the HD 10. In step 20-7, the average line spacing of all the text blocks for the body area (the average line spacing of the OCR management data shown in 5-7) is set. ),
The average line spacing with the largest number is obtained, and the quotient obtained by dividing the height of the body area by the value obtained by adding the obtained average line space and the character size of the body format (stored in the HD 10 in step 20-4) is obtained by dividing the number of lines /
The page is set and stored in the HD 10.

【００７７】次にステップ２０−８において、本文書式
の段組関係の設定を行う。まず段数の決定方法は、本文
領域用のテキストブロック群をどのテキストブロックも
分割せずに、かつ他の矩形を重ならずに最多のテキスト
ブロックを含むことのできる矩形で分割した時、各矩形
の最小Ｙ座標値を示すＹポジションの最も小さな矩形が
いくつ存在するかを求め、それを本文書式の段数として
ＤＨ１０に記憶する。また、複数段の場合には、それぞ
れの矩形の幅を段幅に、矩形間を段間に設定する。矩形
間にラインデータが存在すれば、段間罫線として設定す
る。例えば、図２２において点線枠で示す１０−３、１
０−４、１０−５の３つの矩形内に存在し、各矩形によ
り分割されているのが、先に説明した処理により判別し
た本文領域用のテキストブロック群である。１０−３に
示す矩形が、Ｙポジションの最も小さな矩形であるの
で、この場合は、１段組が設定される。また、この場合
は、１段組であるので、段幅には本文領域の幅と同じ値
が設定される。Next, in step 20-8, the column relation of the text format is set. First, the number of columns is determined by dividing the text block group for the body area into rectangles that can include the largest number of text blocks without overlapping other text blocks and without overlapping other rectangles. The number of the smallest rectangles at the Y position that indicates the minimum Y coordinate value is determined and stored in the DH 10 as the number of steps in the text format. In the case of a plurality of stages, the width of each rectangle is set to the stage width, and the space between the rectangles is set between the stages. If line data exists between rectangles, it is set as a ruled line between steps. For example, 10-3, 1 shown by a dotted frame in FIG.
The text blocks for the body area determined by the above-described processing are present in three rectangles 0-4 and 10-5 and divided by each rectangle. Since the rectangle shown in 10-3 is the smallest rectangle at the Y position, in this case, one column is set. In this case, since the column is a single column, the column width is set to the same value as the width of the text region.

【００７８】次に、ステップ１７−９において、途中段
組変更の設定を行う。ステップ２０−８で設定された矩
形を除いた矩形の中でステップ２０−８で行なったと同
じＹポジションの判定を行なって頁の途中から段組が変
わると判断された場合は、段組変更が生じるＹポジショ
ン、新たな段数、段幅、段間を設定する。例えば、図２
２の場合、ステップ２０−８で矩形１０−３が段組設定
されたので残りの１０−４、１０−５に示す矩形のＹポ
ジションを判定すると、この２つの矩形のＹポジション
が同じであると判定できることにより、この２つの矩形
から、頁の途中から変更になる段組の書式を設定すべき
であると判断し、１０−４に示す矩形のＹポジションを
変更ポジションに設定し、このＹポジションに矩形が２
つあることから段数には２段組、段幅にはそれぞれの矩
形の幅、段間には矩形間を設定する。また、１０−６、
１０−７に示すラインデータは、図１５のステップ１５
−９において、メモリに記憶されているもので、１０−
６、１０−７に示すラインデータのラインブロック管理
データ（図５の５−５に示す）を調査すると、それぞれ
のスタートポジションＸ／エンドポジションＸの値が同
じであり、線種も太さも同じことから、１本の段間罫線
として設定する。Next, in step 17-9, the setting for changing the middle set is performed. If the same Y-position determination as in step 20-8 is performed in a rectangle other than the rectangle set in step 20-8 and it is determined that the column changes from the middle of the page, the column change is performed. The resulting Y position, new number of steps, step width, and step spacing are set. For example, FIG.
In the case of 2, since the rectangle 10-3 is set in columns in step 20-8, when the Y positions of the remaining rectangles 10-4 and 10-5 are determined, the Y positions of these two rectangles are the same. From the two rectangles, it is determined that a column format that should be changed from the middle of the page should be set, and the Y position of the rectangle shown in 10-4 is set as the changed position. 2 rectangles in position
Since there are two steps, the number of steps is set to two steps, the step width is set to the width of each rectangle, and the space between steps is set to the space between rectangles. Also, 10-6,
The line data shown in 10-7 is stored in step 15 in FIG.
-9, stored in the memory, 10-
When the line block management data (shown at 5-5 in FIG. 5) of the line data shown at 6, 10-7 is examined, the values of the start position X / end position X are the same, and the line type and thickness are the same. For this reason, it is set as one ruled line between steps.

【００７９】以上図２０のフローチャートに従って述べ
た本文領域テキストブロック群からの本文書式作成処理
は、例えば、ヘッダ／フッタ領域用のテキストブロック
からヘッダ／フッタ書式を決定し、本文書式へ含めても
良いし、ヘッダ／フッタ領域用のテキストブロック内の
ひげ付き数字から頁番号を本文書式に設定しても良い。
以下、図２１に示す「本文書式の作成処理」の第二例の
フローチャートに従い、これらのヘッダ、フッタ及び頁
番号を本文書式に設定する処理について説明する。In the process of creating a text format from a text region text block group described above with reference to the flowchart of FIG. 20, for example, a header / footer format may be determined from a text block for a header / footer region and included in the text format. Alternatively, the page number may be set in the text format from the bearded numbers in the text block for the header / footer area.
Hereinafter, the process of setting the header, footer, and page number in the text format will be described with reference to the flowchart of the second example of the "text format creation process" shown in FIG.

【００８０】ただし、ステップ２０−１から２０−９ま
での各処理ステップは図２０のフローチャートにおいて
説明した処理ステップと同様なので、図２１のフローチ
ャートにおいても同じステップ番号を付し、ここでの説
明を省略する。However, since the processing steps from step 20-1 to step 20-9 are the same as the processing steps described in the flowchart of FIG. 20, the same step numbers are given in the flowchart of FIG. Omitted.

【００８１】次にステップ２１−１０において、図１３
に示す「ヘッダ領域用テキストブロックの抽出処理」で
抽出しＨＤ１０に記憶したヘッダ領域用テキストブロッ
ク群を包み込む最小の矩形をヘッダ領域とし、ヘッダ書
式を決定する。この時、ステップ２０−１において記憶
したイメージ幅と決定した定型用紙幅との差、及びイメ
ージ高と決定した定型用紙高との差を考慮し、ヘッダ領
域の左上余白値を決定し、ＨＤ１０に記憶する。例え
ば、図２２の１０−１に点線枠で示す矩形が、ヘッダ領
域用のテキストブロック群を包み込む最小矩形であるの
で、この矩形領域をヘッダ領域とする。そして、１０−
８、１０−９に示すのが、それぞれ左上余白となるの
で、左余白値には、１０−８に示す左余白＋（定型用紙
幅−イメージ幅）／２を設定し、上余白値には、１０−
９に示す上余白＋（定型用紙高−イメージ高）／２を設
定する。また、ヘッダ領域のサイズには、ヘッダ領域用
テキストブロックを包み込む最小矩形の幅と高さを設定
し、ＨＤ１０に記憶する。Next, at step 21-10, FIG.
The header format is determined by defining the minimum rectangle that wraps the header area text blocks extracted and stored in the HD 10 in the “header area text block extraction process” shown in FIG. At this time, in consideration of the difference between the image width stored in step 20-1 and the determined standard paper width and the difference between the image height and the determined standard paper height, the upper left margin value of the header area is determined. Remember. For example, the rectangle indicated by the dotted frame in 10-1 in FIG. 22 is the minimum rectangle that wraps the text block group for the header area, and thus this rectangular area is used as the header area. And 10-
8 and 10-9 are the upper left margins, respectively, so the left margin value is set to the left margin shown in 10-8 + (standard paper width−image width) / 2, and the upper margin value is , 10-
The upper margin + (standard paper height−image height) / 2 shown in FIG. 9 is set. In addition, the width and height of the minimum rectangle enclosing the header area text block are set as the size of the header area, and stored in the HD 10.

【００８２】ヘッダ書式の文字サイズは、ヘッダ領域内
のテキストブロック全てのフォントサイズ（５−９に示
す文字情報のフォントサイズ）を調査し、最も数の多い
フォントサイズを求め、ヘッダ書式の文字サイズとして
ＨＤ１０に記憶する。また、文字数／行には、ヘッダ領
域用のテキストブロック全ての平均文字間（５−８に示
す行情報の平均文字間）を調査し、最も数の多い平均文
字間を求め、求めた平均文字間とヘッダ書式の文字サイ
ズを足した値でヘッダ領域の幅を割った商を文字数／行
に設定し、ＨＤ１０に記憶する。The font size of the header format is determined by examining the font size of all the text blocks in the header area (the font size of the character information shown in 5-9), finding the font size with the largest number, In the HD 10. For the number of characters / line, the average character spacing (average character spacing of the line information shown in 5-8) of all the text blocks for the header area is investigated, and the average character spacing with the largest number is determined. The quotient obtained by dividing the width of the header area by the value obtained by adding the space and the character size of the header format is set to the number of characters / line, and stored in the HD 10.

【００８３】次にステップ２１−１１において、図１４
に示す「フッタ領域用テキストブロックの抽出処理」で
抽出し記憶したフッタ領域用テキストブロック群を包み
込む最小の矩形をフッタ領域とし、フッタ書式を決定す
る。この時、ステップ２０−１において記憶したイメー
ジ幅と決定した定型用紙幅との差、及びイメージ高と決
定した定型用紙高との差を考慮し、フッタ領域の左上余
白値を決定し、ＨＤ１０に記憶する。例えば、図２２の
１０−２に点線枠で示す矩形が、フッタ領域用のテキス
トブロック群を包み込む最小矩形であるので、この矩形
領域をフッタ領域とする。そして、１０−１０、１０−
１１に示すのが、それぞれ左余白及び下余白となるの
で、左余白値には、１０−１０に示す左余白＋（定型用
紙幅−イメージ幅）／２を設定し、下余白値には、１０
−１１に示す下余白＋（定型用紙高−イメージ高）／２
を設定する。また、フッタ領域のサイズには、フッタ領
域用テキストブロックを包み込む最小矩形の幅と高さを
設定し、ＨＤ１０に記憶する。Next, in step 21-11, FIG.
The footer area is determined by setting the smallest rectangle that wraps the footer area text block group extracted and stored in the “footer area text block extraction processing” shown in FIG. At this time, in consideration of the difference between the image width stored in step 20-1 and the determined standard paper width and the difference between the image height and the determined standard paper height, the upper left margin value of the footer area is determined. Remember. For example, the rectangle indicated by a dotted frame in 10-2 in FIG. 22 is the minimum rectangle that wraps the text block group for the footer area, and thus this rectangular area is set as the footer area. And 10-10, 10-
11 shows a left margin and a lower margin, respectively. Therefore, a left margin shown in 10-10 + (standard paper width−image width) / 2 is set as a left margin value, and a lower margin value is 10
Lower margin shown at -11 + (standard paper height-image height) / 2
Set. In addition, the width and height of the minimum rectangle enclosing the footer area text block are set in the size of the footer area, and stored in the HD 10.

【００８４】フッタ書式の文字サイズは、フッタ領域内
のテキストブロック全てのフォントサイズ（５−９に示
す文字情報のフォントサイズ）を調査し、最も数の多い
フォントサイズを求め、フッタ書式の文字サイズとして
ＨＤ１０に記憶する。また、文字数／行には、フッタ領
域用のテキストブロック全ての平均文字間（５−８に示
す行情報の平均文字間）を調査し、最も数の多い平均文
字間を求め、求めた平均文字間とフッタ書式の文字サイ
ズを足した値でフッタ領域の幅を割った商を文字数／行
に設定し、ＨＤ１０に記憶する。The font size of the footer format is determined by examining the font size of all the text blocks in the footer area (the font size of the character information shown in 5-9), finding the font size with the largest number, In the HD 10. For the number of characters / line, the average character spacing (average character spacing of the line information shown in 5-8) of all the text blocks for the footer area is investigated, and the average character spacing with the largest number is determined. A quotient obtained by dividing the width of the footer area by a value obtained by adding the space and the character size of the footer format is set to the number of characters / line, and stored in the HD 10.

【００８５】次にステップ２１−１２において、ヘッダ
領域内とフッタ領域内に”ひげ”付き数字が存在するか
を調査する。すなわち、”−１−”、”（１）”、”＜
１＞”等、ひげに挟まれた数字が存在した場合、その”
ひげ”の種類、存在した領域（ヘッダ領域かフッタ領
域）、存在した位置（各領域の左上からのポジション）
を本文書式の頁番号に設定し、ＨＤ１０に記憶する。Next, in step 21-12, it is checked whether a number with a "whisker" exists in the header area and the footer area. That is, "-1-", "(1)", "<
If there is a number between the beards, such as 1>"
Type of beard, existing area (header area or footer area), existing position (position from upper left of each area)
Is set to the page number of the text format and stored in the HD 10.

【００８６】以上、図２０或は図２１のフローチャート
に示す処理により作成された各書式データは、一つの原
稿画像から抽出されたものをまとめて、その画像と関連
づけてＨＤ１０に記憶する。これは、その画像から得た
ＰＡＦ内に記憶しても良いし、或は、ＰＡＦ内に書式デ
ータを記憶しているＨＤ１０内の位置を示すアドレスを
記憶させても良い。As described above, the format data created by the processing shown in the flowchart of FIG. 20 or FIG. 21 collectively collects data extracted from one document image and stores them in the HD 10 in association with the image. This may be stored in the PAF obtained from the image, or an address indicating the position in the HD 10 where the format data is stored may be stored in the PAF.

【００８７】図１６のステップ１６−６においてテキス
ト枠として抽出すべきと判断されたテキストブロックの
各々について、テキスト枠書式を作成する。A text frame format is created for each text block determined to be extracted as a text frame in step 16-6 of FIG.

【００８８】図２３に示すフローチャートが、「テキス
ト枠書式の作成処理」である。The flowchart shown in FIG. 23 is the “text frame format creation processing”.

【００８９】ステップ２３−１において、図５の５−３
に示す「テキストブロック管理データ」のブロック横／
縦サイズから、テキスト枠のサイズを決定し、ＨＤ１０
に記憶する。次にステップ２３−２において、テキスト
枠書式の左右上下余白を以下のように求め、ＨＤ１０に
記憶する。まず、左余白は、図５の５−８に示す行情報
のインデントを調査し、そのテキストブロック内の全て
の行の中で最も小さなインデントを左余白とする。右余
白は、図５の５−９に示す文字情報のブロック左からの
距離と文字幅を調査し、そのテキストブロック内全ての
行の最終文字の右端を比べ、最も大きな値とブロックの
右端との距離の差を右余白とする。上余白は、図５の５
−９に示す文字情報のブロック上からの距離を調査し、
そのテキスト内の１行目内全ての文字の中の最も大きな
値を上余白とする。下余白は、図５の５−９に示す文字
情報のブロック上からの距離と文字高を調査し、そのテ
キストブロック内の最終行の全ての文字の下端を比べ、
最も大きな値とブロックの下端との距離の差を下余白と
する。In step 23-1, 5-3 in FIG.
"Text block management data" shown in the side of the block /
From the vertical size, determine the size of the text frame, HD10
To memorize. Next, in step 23-2, the left, right, top and bottom margins of the text frame format are obtained as follows, and stored in the HD 10. First, as for the left margin, the indentation of the line information shown in 5-8 in FIG. 5 is checked, and the smallest indent of all the lines in the text block is set as the left margin. The right margin is determined by examining the distance and character width from the left of the block of character information shown in 5-9 in FIG. Is the right margin. The upper margin is 5 in FIG.
Investigate the distance from the block of character information shown in -9,
The largest value among all the characters in the first line in the text is set as the upper margin. The lower margin is obtained by examining the distance and the character height from the block of character information shown in 5-9 in FIG. 5, and comparing the lower ends of all the characters on the last line in the text block.
The difference between the largest value and the distance from the lower end of the block is defined as the lower margin.

【００９０】次にステップ２３−３において、図５の５
−３に示すテキストブロック管理データの組方向をテキ
スト枠書式の組方向としてＨＤ１０に記憶する。さら
に、ステップ２３−４において、そのテキストブロック
全てのフォントサイズ（５−９に示す文字情報のフォン
トサイズ）を調査し、最も数の多いフォントサイズを求
め、テキスト枠書式の文字サイズとしてＨＤ１０に記憶
する。そして、ステップ２３−５において、そのテキス
トブロック全ての平均文字間（５−８に示す行情報の平
均文字間）を調査し、最も数の多い平均文字間を求め、
求めた平均文字間とテキスト枠書式の文字サイズ（ステ
ップ１８−４においてＨＤ１０に記憶）を足した値でテ
キスト領域の横（枠の横サイズから左右余白を引いた
値）を割った商を文字数／行に設定し、ＨＤ１０に記憶
する。さらに、ステップ２３−６において、書体に明朝
体を設定しＨＤ１０に記憶し、ステップ２３−７におい
て、そのテキストブロック全ての平均行間（５−７に示
すOCR管理データの平均行間）を調査し、最も数の多い
平均行間を求め、求めた平均行間とテキスト枠書式の文
字サイズ（ステップ１８−４においてＨＤ１０に記憶）
を足した値でテキスト領域の縦（枠の縦サイズから上下
余白を引いた値）を割った商を行数／頁に設定し、ＨＤ
１０に記憶する。Next, in step 23-3, 5 in FIG.
-3 is stored in the HD 10 as the set direction of the text frame format. Further, in step 23-4, the font size of all the text blocks (the font size of the character information shown in 5-9) is checked, the font size with the largest number is obtained, and stored in the HD 10 as the character size of the text frame format. I do. Then, in step 23-5, the average character spacing of all the text blocks (the average character spacing of the line information shown in 5-8) is investigated, and the average character spacing with the largest number is obtained.
The quotient obtained by dividing the width of the text area (the value obtained by subtracting the left and right margins from the horizontal size of the frame) by the value obtained by adding the obtained average character spacing and the character size of the text frame format (stored in the HD 10 in step 18-4) is the number of characters / Line, and store it in the HD 10. Further, in step 23-6, the font is set to Mincho font and stored in the HD 10. In step 23-7, the average line spacing of all the text blocks (the average line spacing of the OCR management data shown in 5-7) is checked. , The most common average line spacing, and the calculated average line spacing and the character size of the text frame format (stored in HD 10 in step 18-4)
Set the quotient obtained by dividing the height of the text area (the value obtained by subtracting the upper and lower margins from the vertical size of the frame) by the value obtained by adding
10 is stored.

【００９１】以上図２３のフローチャートに従って述べ
たテキスト枠として抽出したテキストブロックからのテ
キスト枠書式作成処理はテキスト枠であると判断したテ
キストブロック内の文字についてのみ書式を作成するも
のであったが、例えば、そのテキスト枠内の元画像の中
の枠（罫線）の線種や太さまで抽出し、枠書式として設
定しても良い。この場合、図５で説明したPAF(Page Ana
lysis Format)の「ブロック管理データ」５−３に、も
う一つ、「フレームブロック管理データ」を加える。The text frame format creation processing from a text block extracted as a text frame described in accordance with the flowchart of FIG. 23 creates a format only for characters in a text block determined to be a text frame. For example, the line type and thickness of the frame (ruled line) in the original image in the text frame may be extracted and set as a frame format. In this case, the PAF (Page Ana
Another “frame block management data” is added to the “block management data” 5-3 of the “lysis format”.

【００９２】「フレームブロック管理データ」とは、図
２４の１９−２に示す構造体で、他のブロック管理デー
タ（テキスト、画像、ライン等）と同様に、用紙左から
の距離、用紙上からの距離、及びそれぞれのブロックの
横／縦サイズをピクセル数で保持し、さらにブロックが
どのような種類のデータを保持しているかを示すテーブ
ルも持っている。そして、「フレームブロック管理デー
タ」固有のデータとして、矩形の線種と太さを持ってい
る。例えば、図６に示す画像データをOCRエンジンによ
り領域分割すると、図７に示すような領域分割が行われ
る。この中で、７−９で示すのが、テキスト枠であると
ステップ１６−６で判断されたブロック内の文字群を囲
む枠であり、７−１０で示すのが、フレームブロックで
ある。このように元画像の中の矩形イメージ部分が領域
分割された際に抽出されるブロックをフレームブロック
とする。The "frame block management data" is a structure shown in 19-2 in FIG. 24, and is similar to other block management data (text, image, line, etc.) in the distance from the left side of the sheet and the top of the sheet. , And the horizontal / vertical size of each block in the number of pixels, and a table indicating what kind of data the block holds. The data specific to the “frame block management data” has a rectangular line type and a thickness. For example, when the image data shown in FIG. 6 is divided into regions by the OCR engine, the region is divided as shown in FIG. Among them, the frame indicated by 7-9 is a frame surrounding a character group in the block determined to be a text frame in step 16-6, and the frame block is indicated by 7-10. A block extracted when the rectangular image portion in the original image is divided into regions is referred to as a frame block.

【００９３】次に図１２のステップ１２−４における、
「枠領域用ブロックの抽出処理」において、枠データを
も識別するステップを含む場合の詳細処理を図２５の
「枠領域用ブロックの抽出処理」のフローチャートに従
って説明する。ただし、ステップ１５−１から１５−９
及びステップ１５−１０から１５−１３の各処理ステッ
プは図１５のフローチャートに従って説明した処理と同
様なので、同じステップ番号を付し、ここでの説明は省
略する。Next, in step 12-4 of FIG.
The detailed processing in the case where the "frame area block extraction processing" includes a step of also identifying frame data will be described with reference to the flowchart of "frame area block extraction processing" in FIG. However, steps 15-1 to 15-9
Since each processing step of steps 15-10 to 15-13 is the same as the processing described according to the flowchart of FIG. 15, the same step numbers are given and the description is omitted here.

【００９４】ステップ１５−８においてラインブロック
でないと判断された場合は、ステップ２５−１におい
て、さらにブロックの種類（５−３、５−４、５−５、
１９−２に示す「ブロック管理データ」のブロックの種
類）を調査し、フレームブロックであった場合は、ステ
ップ２５−２において、枠データとして抽出し、ＨＤ１
０に記憶する。ステップ２５−１においてフレームブロ
ックでないと判断された場合は、ステップ１５−１１に
進む。ステップ１５−１１における「テキストブロック
チェック処理」においてリターンされた値を判断し、テ
キスト枠であると判断されたならば、ステップ１５−１
３において、そのテキストブロックをテキスト枠として
抽出し、ＨＤ１０に記憶し、ステップ２５−３におい
て、そのテキストブロックの書式を作成するために、
「テキスト枠書式の作成処理」を呼び出す。図２６がス
テップ２５−３で実行される「テキスト枠書式の作成処
理」の詳細を示すフローチャートである。If it is determined in step 15-8 that the block is not a line block, in step 25-1, the type of block (5-3, 5-4, 5-5,
The type of the “block management data” shown in 19-2) is checked, and if it is a frame block, it is extracted as frame data in step 25-2, and HD1 is extracted.
Store to 0. If it is determined in step 25-1 that the frame block is not a frame block, the process proceeds to step 15-11. The value returned in the "text block check process" in step 15-11 is determined, and if it is determined that the frame is a text frame, step 15-1 is performed.
In step 3, the text block is extracted as a text frame, stored in the HD 10, and in step 25-3, to create the format of the text block,
Call "text frame format creation process". FIG. 26 is a flowchart showing details of the "text frame format creation process" executed in step 25-3.

【００９５】ステップ２６−１において、当テキストブ
ロックを包含するフレームブロック（図２５のステップ
２５−２において、枠データとしてＨＤ１０に記憶した
ブロック）が存在するかを調査する（例えば、図７の７
−９の示すテキストブロックは、７−１０の示すフレー
ムブロックに包含されている）。当テキストブロックを
包含するフレームブロックが存在するとステップ２６−
１において判断された場合は、ステップ２６−３におい
て、図２４の１９−２に示す「フレームブロック管理デ
ータ」のブロック横／縦サイズから、テキスト枠のサイ
ズを決定し、かつ「フレームブロック管理データ」の線
種／太さから枠線種と枠線の太さを決定し、ＨＤ１０に
記憶する。In step 26-1, it is checked whether there is a frame block including the text block (the block stored in the HD 10 as the frame data in step 25-2 in FIG. 25) (for example, 7 in FIG. 7).
The text block indicated by -9 is included in the frame block indicated by 7-10). If there is a frame block including the text block, step 26-
If the determination is made in step 1, in step 26-3, the size of the text frame is determined from the block horizontal / vertical size of “frame block management data” shown in 19-2 of FIG. Is determined from the line type / thickness of "." And stored in the HD 10.

【００９６】次にステップ２６−４において、当テキス
トブロックとこれを包含するフレームブロックの管理デ
ータからテキスト枠書式の左右上下余白を以下のように
求め、ＨＤ１０に記憶する。まず、左余白は、図２４の
１９−２に示す「フレームブロック管理データ」の用紙
左からの距離と図５の５−３に示す「テキストブロック
管理データ」の用紙左からの距離との差を左余白とす
る。右余白は、図２４の１９−２に示す「フレームブロ
ック管理データ」の用紙左からの距離にブロック横サイ
ズを加えた値と図５の５−３に示す「テキストブロック
管理データ」の用紙左からの距離にブロック横サイズを
加えた値との差を右余白とする。上余白は、図２４の１
９−２に示す「フレームブロック管理データ」の用紙上
からの距離と図５の５−３に示す「テキストブロック管
理データ」の用紙上からの距離との差を上余白とする。
下余白は、図２４の１９−２に示す「フレームブロック
管理データ」の用紙上からの距離にブロック縦サイズを
加えた値と図５の５−３に示す「テキストブロック管理
データ」の用紙上からの距離にブロック縦サイズを加え
た値との差を下余白とする。Next, in step 26-4, the left, right, top and bottom margins of the text frame format are obtained from the management data of the text block and the frame block containing the text block as follows, and stored in the HD 10. First, the left margin is the difference between the distance from the left of the “frame block management data” shown in 19-2 in FIG. 24 and the distance from the left of the “text block management data” shown in 5-3 in FIG. Is the left margin. The right margin is the value obtained by adding the block horizontal size to the distance from the left of the “frame block management data” of 19-2 in FIG. 24 and the left of the “text block management data” in 5-3 in FIG. The difference from the value obtained by adding the block horizontal size to the distance from is set as the right margin. The upper margin is 1 in FIG.
The difference between the distance from the sheet of “frame block management data” shown in 9-2 and the distance from the sheet of “text block management data” shown in 5-3 in FIG. 5 is defined as an upper margin.
The lower margin is a value obtained by adding the block vertical size to the distance from the sheet of “frame block management data” shown in 19-2 in FIG. 24 and the “text block management data” shown in 5-3 in FIG. The difference from the value obtained by adding the vertical size of the block to the distance from is set as the lower margin.

【００９７】次にステップ２６−５において、図５の５
−３に示すテキストブロック管理データの組方向をテキ
スト枠書式の組方向としてＨＤ１０に記憶する。さら
に、ステップ２６−６において、そのテキストブロック
全てのフォントサイズ（５−９に示す文字情報のフォン
トサイズ）を調査し、最も数の多いフォントサイズを求
め、テキスト枠書式の文字サイズとしてＨＤ１０に記憶
する。そして、ステップ２６−７において、そのテキス
トブロック全ての平均文字間（５−８に示す行情報の平
均文字間）を調査し、最も数の多い平均文字間を求め、
求めた平均文字間とテキスト枠書式の文字サイズ（ステ
ップ２２−６においてＨＤ１０に記憶した文字サイズ）
を足した値でテキストブロックのブロック横サイズ（５
−３に示すテキストブロック管理データのブロック横サ
イズ）を割った商を文字数／行に設定し、はＨＤ１０に
記憶する。さらに、ステップ２６−８において、書体に
明朝体を設定しＨＤ１０に記憶し、ステップ２６−９に
おいて、そのテキストブロック全ての平均行間（５−７
に示すOCR管理データの平均行間）を調査し、最も数の
多い平均行間を求め、求めた平均行間とテキスト枠書式
の文字サイズ（ステップ２６−６においてＨＤ１０に記
憶した文字サイズ）を足した値でテキストブロックのブ
ロック縦サイズ（５−３に示すテキストブロック管理デ
ータのブロック縦サイズ）を割った商を行数／頁に設定
し、ＨＤ１０に記憶する。Next, at step 26-5, 5 in FIG.
-3 is stored in the HD 10 as the set direction of the text frame format. Further, in step 26-6, the font size of all the text blocks (the font size of the character information shown in 5-9) is investigated, the font size with the largest number is obtained, and stored in the HD 10 as the character size of the text frame format. I do. Then, in step 26-7, the average character spacing of all the text blocks (the average character spacing of the line information shown in 5-8) is checked, and the average character spacing with the largest number is obtained.
The calculated average character spacing and character size of the text frame format (character size stored in HD 10 in step 22-6)
The horizontal size of the text block (5
The quotient obtained by dividing the block horizontal size of the text block management data shown in -3) is set to the number of characters / line, and stored in the HD 10. Further, in step 26-8, the font is set to Mincho font and stored in the HD 10, and in step 26-9, the average line spacing (5-7
The average line spacing of the OCR management data shown in (1) is checked, the average line spacing with the largest number is obtained, and the obtained average line spacing is added to the character size of the text frame format (the character size stored in the HD 10 in step 26-6). Then, the quotient obtained by dividing the block vertical size of the text block (the block vertical size of the text block management data shown in 5-3) is set to the number of lines / page, and stored in the HD 10.

【００９８】ステップ２６−１において、当テキストブ
ロックを包含するフレームブロック（図２６のステップ
２５−２において、枠データとしてメモリに記憶したブ
ロック）が存在しなかった場合は、ステップ２３−１に
進む。ステップ２３−１から２３−７に示す各処理ステ
ップは図２３のフローチャートの各処理と同様なので同
じステップ番号を付し、ここでの説明は省略する。ただ
し、ステップ２３−１において、この場合は当テキスト
ブロックを包含するフレームブロックがないことから、
枠線は”なし”として、ＨＤ１０に記憶する。If there is no frame block containing the text block in step 26-1 (the block stored in the memory as the frame data in step 25-2 in FIG. 26), the process proceeds to step 23-1. . Since each processing step shown in steps 23-1 to 23-7 is the same as each processing in the flowchart of FIG. 23, the same step numbers are given and the description thereof is omitted here. However, in step 23-1, in this case, since there is no frame block including the text block,
The frame is stored in the HD 10 as “none”.

【００９９】以上述べたように各ブロックの抽出処理
（図１２のフローチャートに示す各処理ステップ）を図
３に示した画像サンプルに施し、終了した結果の各領域
の大きさや位置を示したものが図２７である。２７−１
に示す領域がヘッダ領域であり、２７−２に示す領域が
フッタ領域である。２７−３に示す領域がテキスト枠領
域であり、２７−４に示す領域がライン領域である。２
７−５、２７−６、２７−７に示す各領域が本文領域で
ある。As described above, the extraction processing of each block (each processing step shown in the flowchart of FIG. 12) is performed on the image sample shown in FIG. 3, and the size and position of each area as a result of the completion are shown. FIG. 27-1
Is the header area, and the area 27-2 is the footer area. The area indicated by 27-3 is a text frame area, and the area indicated by 27-4 is a line area. 2
Each area shown in 7-5, 27-6, and 27-7 is a text area.

【０１００】ここで、枠とは、図２７に示すように、本
文領域に重ねて設定し得る領域であって（図２７の場
合、テキスト枠領域２７−３は本文領域２７−６及び本
文領域２７−７に一部重なって設定されている。）、本
文領域に入力される文章（本文）は枠を避けて配置され
る。即ち、本文領域のうち、枠領域として定められてい
る領域以外に順に文字を配置するものである。Here, as shown in FIG. 27, the frame is an area which can be set so as to overlap the body area (in FIG. 27, the text frame area 27-3 is the body area 27-6 and the body area). 27-7), the text (text) input to the text area is arranged avoiding the frame. That is, characters are sequentially arranged in the body region other than the region defined as the frame region.

【０１０１】また、各々作成された書式は、先に述べた
ようにＰＡＦと関連づけて記憶し、キーボード１２或は
マウス１３の呼び出し指示に応じて表示器１６に表示或
はプリンタ１８から印字する時にはその書式に応じてテ
キストブロック内データ５−６の文字コードに対応する
フォントを配置、或はラインブロック管理データ５−５
のデータに従ってラインを描画して出力する。これによ
り、スキャナ１Ｂより入力した画像データと同じレイア
ウトで、しかし文字や図形は清書した状態の文書を出力
し、更に表示器１６上でキーボード１２やマウス１３の
指示により編集することが可能となる。ここでテキスト
ブロックに新たに文字が挿入された場合にも、書式とし
て定められている態様で文字を再配列するので、オペレ
ータの望む文書が入力画像と同じレイアウト、書式で配
置されたものを得ることができる。As described above, each created format is stored in association with the PAF, and is displayed on the display 16 or printed from the printer 18 in response to a call instruction of the keyboard 12 or the mouse 13. A font corresponding to the character code of the data 5-6 in the text block is arranged according to the format, or the line block management data 5-5
Draws a line according to the data of and outputs it. As a result, it is possible to output a document with the same layout as the image data input from the scanner 1B, but in a state in which characters and graphics have been copied, and to edit the document on the display 16 by the instruction of the keyboard 12 and the mouse 13. . Here, even when a new character is inserted into the text block, the characters are rearranged in a manner defined as the format, so that the document desired by the operator is obtained in the same layout and format as the input image. be able to.

[Brief description of the drawings]

【図１】本発明の一実施例による文字認識文書処理装置
の概略構成を示すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of a character recognition document processing device according to an embodiment of the present invention.

【図２】本発明の一実施例による文字認識文書処理装置
と画像読み取り装置をつなげた図である。FIG. 2 is a diagram in which a character recognition document processing device and an image reading device according to an embodiment of the present invention are connected.

【図３】画像イメージを表した図である。FIG. 3 is a diagram showing an image image.

【図４】画像イメージを領域分割した図である。FIG. 4 is a diagram in which an image is divided into regions.

【図５】Page Analysis Format(PAF)のファイルフォー
マットを表した図である。FIG. 5 is a diagram showing a file format of Page Analysis Format (PAF).

【図６】画像イメージを表した図である。FIG. 6 is a diagram showing an image image.

【図７】画像イメージを領域分割した図である。FIG. 7 is a diagram obtained by dividing an image into regions.

【図８】ヘッダ及びフッタ領域付近のテキストブロック
を表した図である。FIG. 8 is a diagram illustrating a text block near a header and a footer area.

【図９】本文領域内のテキストブロックを表した図であ
る。FIG. 9 is a diagram showing a text block in a text region.

【図１０】本文領域と他の領域に分割された状態を表し
た図である。FIG. 10 is a diagram showing a state where the text region is divided into other regions.

【図１１】「本文領域と他の領域の分別処理」を示すフ
ローチャートである。FIG. 11 is a flowchart illustrating “separation processing between a text region and another region”.

【図１２】「本文用ブロックと他のブロックの分別処
理」を示すフローチャートである。FIG. 12 is a flowchart illustrating “separation processing between a text block and another block”;

【図１３】「ヘッダ領域用テキストブロックの抽出処
理」を示すフローチャートである。FIG. 13 is a flowchart illustrating “extraction processing of a header area text block”.

【図１４】「フッタ領域用テキストブロックの抽出処
理」を示すフローチャートである。FIG. 14 is a flowchart showing “footer area text block extraction processing”.

【図１５】「枠領域用ブロックの抽出処理」を示すフロ
ーチャートである。FIG. 15 is a flowchart illustrating “frame area block extraction processing”;

【図１６】「テキストブロックチェック処理」を示すフ
ローチャートである。FIG. 16 is a flowchart showing a “text block check process”.

【図１７】「ヘッダ領域用テキストブロックの抽出処
理」を示すフローチャートである。FIG. 17 is a flowchart illustrating “extraction processing of a header area text block”.

【図１８】「フッタ領域用テキストブロックの抽出処
理」を示すフローチャートである。FIG. 18 is a flowchart showing “footer area text block extraction processing”.

【図１９】「テキストブロックチェック処理」を示すフ
ローチャートである。FIG. 19 is a flowchart showing “text block check processing”.

【図２０】「本文書式の作成処理」の第一例を示すフロ
ーチャートである。FIG. 20 is a flowchart illustrating a first example of “text format creation processing”;

【図２１】「本文書式の作成処理」の第二例を示すフロ
ーチャートである。FIG. 21 is a flowchart illustrating a second example of “text format creation processing”.

【図２２】本文書式作成時のブロック分割を説明する図
ある。FIG. 22 is a diagram for describing block division when creating a text format.

【図２３】「テキスト枠書式の作成処理」を示すフロー
チャートである。FIG. 23 is a flowchart showing “text frame format creation processing”.

【図２４】ＰＡＦのデータ構造の第二例を示す図であ
る。FIG. 24 is a diagram showing a second example of the data structure of the PAF.

【図２５】「枠領域用ブロックの抽出処理」の第二例を
示すフローチャートである。FIG. 25 is a flowchart illustrating a second example of “frame area block extraction processing”.

【図２６】「テキスト枠書式の作成処理」の第二例を示
すフローチャートである。FIG. 26 is a flowchart illustrating a second example of “text frame format creation processing”.

【図２７】分別された各領域を例示する図である。FIG. 27 is a diagram exemplifying each divided area.

【図２８】ネットワークシステム図FIG. 28 is a network system diagram.

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 5B009 LA01 NB01 NB08 NB11 NB15 NC01 NC02 NC04 RB33 RC01 VC01 5B029 AA01 BB02 CC28 CC29 5B050 BA16 DA06 EA06 5L096 BA18 FA19 9A001 BB04 CC02 CC08 HH22 HH23 HH28 JJ12 JJ27 KK16 KK42 ──────────────────────────────────────────────────続き Continued on the front page F-term (reference)

Claims

[Claims]

1. A character string region is extracted from input image information, and it is determined whether or not the character string region is a character string region of a text according to the layout information of the extracted character string region. Image processing method.

2. The image processing apparatus according to claim 1, wherein an area determined to be a character string area of the body and an area determined to be not a character string area of the body are stored separately. Method.

3. The image processing method according to claim 1, wherein the storage stores feature information extracted from each area.

4. The image processing method according to claim 1, wherein the storage is performed by adding identification information capable of identifying each of the storages.

5. The layout information of a character string region is a positional relationship with a different region.
The image processing method according to 1.

6. The image processing method according to claim 1, wherein the layout information of the character string area is the number of lines of a character string included in the character string area.

7. A plurality of image areas are extracted from input image information, and classified into a text area and other areas from the extracted image areas, and a first storage area and a second storage area are classified according to the classification. An image processing method characterized by storing information extracted from each area divided into areas.

8. The image processing method according to claim 7, wherein the image information is an image read by a scanner.

9. The image processing method according to claim 7, wherein the image information is an image input from another terminal connected via a line.

10. The image processing method according to claim 7, wherein the image information is an image read by a scanner at another terminal connected via a line.

11. A format for analyzing input image information to determine a character arrangement, and arranging characters obtained by recognizing the input image information according to the generated format. Document processing method.

12. The image processing method according to claim 11, wherein the image information to be analyzed is a text string region extracted from an input image.

13. The image processing method according to claim 11, wherein the image information to be analyzed is one of a plurality of regions obtained by dividing input image information.

14. The image processing method according to claim 11, wherein the format defines an arrangement of character strings in an area extracted from an input image.

15. The image processing method according to claim 11, wherein a document in which the characters are arranged is displayed on a display screen, and editing is performed on the display screen.

16. The image processing method according to claim 11, wherein a document in which the characters are arranged is displayed on a display screen.

17. The image processing method according to claim 11, wherein the document in which the characters are arranged is printed by a printer.

18. The image processing method according to claim 11, wherein the characters to be arranged are fonts.

19. The image processing method according to claim 11, wherein the document in which the characters are arranged is edited, and the edited document is rearranged according to a format in which the characters are arranged.

20. The image processing method according to claim 11, wherein the format includes column information.

21. The image processing method according to claim 11, wherein the format includes a paper size.

22. The image processing method according to claim 11, wherein the format includes margin information.

23. The image processing method according to claim 11, wherein the format includes a combination direction.

24. The image processing method according to claim 11, wherein the format includes a character size.

25. The image processing method according to claim 11, wherein the format includes information on the number of characters per line.

26. The image processing method according to claim 11, wherein the format includes information on the number of lines per area.

27. The image processing method according to claim 11, wherein the format includes typeface information.

28. A method of extracting a plurality of image areas from input image information, selecting a frame area from the plurality of extracted image areas, and creating a frame format from the image information of the selected frame area. Image processing method.

29. The image processing method according to claim 28, wherein the frame area is arranged so as to overlap another area.

30. The image processing method according to claim 28, wherein the frame area is a text area in a set direction different from a set direction of characters in a body area.

31. A character string extracting unit for extracting a character string region from input image information, and the character string region is a character string region of a body according to layout information of the character string region extracted by the character string extracting unit. An image processing apparatus, comprising: a determination unit configured to determine whether the image processing is performed.

32. The apparatus according to claim 31, further comprising storage means for distinguishably storing an area determined to be a character string area of the text and an area determined not to be a text string area of the text. The image processing apparatus according to any one of the preceding claims.

33. The image processing apparatus according to claim 31, wherein said storage means stores feature information extracted from each area.

34. The storage device according to claim 31, wherein said storage means stores identification information capable of identifying each of them.
An image processing apparatus according to claim 1.

35. The layout information of the character string area includes:
The image processing apparatus according to claim 31, wherein the image processing apparatus has a positional relationship with different areas.

36. The layout information of the character string area includes:
32. The image processing apparatus according to claim 31, wherein the number of lines of the character string included in the character string area is set.

37. An extracting unit for extracting a plurality of image regions from input image information, a classifying unit for classifying the image regions extracted by the extracting unit into a text region and other regions, and An image processing apparatus comprising: a storage unit that stores information extracted from each of the first storage area and the second storage area according to the classification.

38. The image processing apparatus according to claim 37, further comprising a scanner that reads the image information.

39. The image processing apparatus according to claim 37, further comprising control means for controlling input of the image information from another terminal connected via a line.

40. The image processing apparatus according to claim 37, further comprising control means for controlling the other terminal connected via a line to read the image information by a scanner.

41. A format creating means for analyzing input image information to create a format for determining an arrangement of characters, and a character obtained by recognizing the input image information in accordance with the format created by the format creating means. A document processing apparatus having control means for controlling the placement of a document.

42. A body character string region extracting unit for extracting a body character string region from the input image information, wherein the format creating unit is configured to format the body character string region extracted by the body character string region extracting unit. 43. The image processing apparatus according to claim 42, wherein:

43. An area dividing unit that divides the input image information to obtain a plurality of areas, and wherein the format creating unit creates each format of the plurality of areas obtained by the area dividing unit. 43. The image processing device according to claim 42, wherein:

44. The image processing apparatus according to claim 42, wherein the format defines an arrangement of a character string in an area extracted from an input image.

45. The image processing apparatus according to claim 42, further comprising: display means for displaying a document on which the characters are arranged on a display screen; and editing means for performing editing specified on the display screen. apparatus.

46. The image processing apparatus according to claim 42, further comprising display means for displaying a document in which the characters are arranged.

47. The image processing apparatus according to claim 42, further comprising a printer for printing a document in which the characters are arranged.

48. The image processing apparatus according to claim 42, wherein the arranged characters are fonts.

49. An editing unit for editing a document in which the characters are arranged, wherein the control unit controls to rearrange the document edited by the editing unit in accordance with the format in which the characters are arranged. The image processing apparatus according to claim 42, wherein

50. An image area extracting means for extracting a plurality of image areas from input image information; a frame area selecting means for selecting a frame area from the plurality of image areas extracted by the image area extracting means; An image processing apparatus comprising: a frame format creation unit that creates a frame format from image information of a frame area selected by the selection unit.

51. The image processing apparatus according to claim 50, wherein the frame area is arranged so as to overlap another area.

52. The image processing apparatus according to claim 50, wherein the frame area selecting means selects a text area in a set direction different from the set direction of characters in the body area as a frame area.

53. A storage medium readable by a computer, comprising: a control program for extracting a character string area from input image information; and a character string area according to layout information of the extracted character string area. And a control program for determining whether or not the character string area is a character string area.

54. A control program for storing an area determined to be a character string area of the text and an area determined not to be a text string area of the text separately. Item 54. The storage medium according to Item 53.

55. A storage medium readable by a computer, comprising: a control program for extracting a plurality of image areas from input image information; and a text area and other areas from the extracted image areas. A storage medium storing a control program for classifying, and a control program for storing information extracted from each of the first storage area and the second storage area according to the classification. .

56. A storage medium readable by a computer, comprising: a control program for analyzing input image information to create a format for determining a character arrangement; and the input image according to the created format. A storage medium storing a control program for arranging characters obtained by recognizing information.

57. A control program according to claim 56, wherein a control program for displaying a document in which the characters are arranged on a display screen and a control program for editing on the display screen are stored. Storage medium.

58. The storage medium according to claim 56, wherein a control program for displaying a document in which the characters are arranged on a display screen is stored.

59. The storage medium according to claim 56, wherein a control program for printing a document in which the characters are arranged by a printer is stored.

60. A control program for editing a document in which the characters are arranged, and a control program for rearranging the edited document in accordance with a format in which the characters are arranged. 61. The storage medium according to item 56.

61. A computer-readable storage medium, comprising: a control program for extracting a plurality of image areas from input image information; and a control for selecting a frame area from the plurality of extracted image areas. A storage medium storing a program and a control program for creating a frame format from the image information of the selected frame area.