JPH11312231A

JPH11312231A - Recording medium with data processing program recorded therein, data processor and its method

Info

Publication number: JPH11312231A
Application number: JP10118420A
Authority: JP
Inventors: Hirobumi Fukumoto; 博文福本
Original assignee: Omron Corp; Omron Tateisi Electronics Co
Current assignee: Omron Corp
Priority date: 1998-04-28
Filing date: 1998-04-28
Publication date: 1999-11-09

Abstract

PROBLEM TO BE SOLVED: To provide a medium with a data processing program recorded therein, a data processor and its method, which simply editing of a layout for changing the arrangement of a text area, a table area, an image area, etc., for an existing document read by OCR, etc., and which automatically prepare a structured file in a hypertext form, etc., for a document whose layer is edited. SOLUTION: An area recognition apparatus 11 recognizes to which of a character area, a table area or an image area, an original image inputted from an image reader such as OCR belongs. When the original image belongs to the character area or the table area, characters within the area are recognized and made as a character code. When the original image belongs to the image area, an image within the area is segmented to prepare an image file. A layout editing part 15 displays a rectangle in each area to allow a user to edit layout by moving the rectangle, and a part 16 prepares a hypertext based on this edited layout. A storing part 17 retains a file prepared by a series of processings with a proper expander, and a managing part manages retained file.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、ＯＣＲ等で光学
的に読み取った文字列、図形、表等が混在する既存の文
書の画像からＨＴＭＬ（ハイパーテキスト）形式等の構
造化ファイルの作成および、ここで作成したファイルの
保存・管理を行うデータ処理プログラムを記録した記録
媒体、データ処理装置およびデータ処理方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to the creation of a structured file in an HTML (hypertext) format or the like from an image of an existing document in which character strings, figures, tables, and the like optically read by an OCR or the like are mixed. The present invention relates to a recording medium, a data processing device, and a data processing method in which a data processing program for storing and managing the created file is recorded.

【０００２】[0002]

【従来の技術】インターネット等のマルチメディア環境
（ＷＷＷ（ＷｏｒｌｄＷｉｄｅＷａｖｅ）等）で
は、文章、表、イメージ等を関連づけて記述したハイパ
ーテキスト形式等の構造化ファイルを利用することによ
り、ブラウザと呼ばれる検索表示ツールでテキスト、イ
メージ、図表等を関連づけて出力（表示等）させること
ができる。すなわち、文書、表、イメージ等の情報を統
合的に取り扱うことができる。このようなマルチメディ
ア環境の発展にともなって、最近ではユーザからハイパ
ーテキスト形式等の構造化ファイルを自動的に作成する
装置やソフトウェアの開発が要望されはじめている。2. Description of the Related Art In a multimedia environment such as the Internet (WWW (World Wide Wave)), a browser is called a browser by using a structured file such as a hypertext format in which sentences, tables, images, and the like are described in association with each other. The search display tool can output (display, etc.) text, images, charts, and the like in association with each other. That is, information such as documents, tables, and images can be handled in an integrated manner. With the development of such a multimedia environment, recently, users have begun to request devices and software for automatically creating a structured file such as a hypertext format.

【０００３】従来、この種の装置としては、既存の文書
をＯＣＲ等で光学的に読み取り、読み取った文書画像を
処理して自動的に構造化ファイルを作成するものが提案
されていた（特開平９−２２３２４０号公報）。Conventionally, as this type of apparatus, there has been proposed an apparatus which optically reads an existing document by OCR or the like, processes the read document image, and automatically creates a structured file (Japanese Patent Laid-Open No. Hei 9-260,197). No. 9-223240).

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上記特
開平９−２２３２４０号公報に記載されている装置は、
ＯＣＲで読み取った既存の文書を忠実に再現するハイパ
ーテキスト形式のファイルを作成するものであった。し
たがって、ユーザはＯＣＲで読み取った既存の文書に忠
実なハイパーテキスト形式のファイルを得ることはでき
たが、既存の文書における文書、表、イメージ等のレイ
アウトを自分の好みに応じて変更したものについてはハ
イパーテキスト形式のファイルを得ることができなかっ
た。また、既存の文書における不要な領域を削除したも
のについてもハイパーテキスト形式のファイルを得るこ
ともできなかった。However, the apparatus described in Japanese Patent Application Laid-Open No. 9-223240 is
A hypertext format file that faithfully reproduces an existing document read by OCR was created. Therefore, the user was able to obtain a file in a hypertext format faithful to the existing document read by the OCR, but changed the layout of the document, table, image, etc. in the existing document according to his / her preference. Could not get a file in hypertext format. Also, a hypertext format file cannot be obtained for an existing document in which an unnecessary area is deleted.

【０００５】また、ブラウザでハイパーテキスト形式の
ファイルを出力する場合、関連するファイルについては
ブラウザがファイルタイプを認識できるようにしなけれ
ばならない。このため、ファイル毎にその種別（テキス
ト、画像等）に応じた拡張子を付けたファイル名で保存
することが規定されている。したがって、ファイルを保
存するときには、１つ１つのファイルについてその種別
を確認し、該ファイルの種別に応じた拡張子を付けて保
存しなければならず、ファイルの保存・管理に手間がか
かるという問題もあった。When a browser outputs a hypertext format file, the browser must be able to recognize the file type of the associated file. For this reason, it is defined that each file is saved with a file name with an extension corresponding to the type (text, image, etc.). Therefore, when saving a file, it is necessary to check the type of each file and save the file with an extension corresponding to the type of the file, and it takes time and effort to save and manage the file. There was also.

【０００６】この発明の目的は、ＯＣＲ等で読み取った
既存の文書におけるテキスト領域、表領域、画像領域等
の配置を変更するレイアウトの編集が行え、ここで編集
されたレイアウトに基づいてハイパーテキスト形式等の
構造化ファイルを作成するデータ処理プログラムを記録
した記録媒体、データ処理装置およびデータ処理方法を
提供することにある。An object of the present invention is to edit a layout for changing the arrangement of a text area, a table area, an image area, and the like in an existing document read by an OCR or the like, and to perform a hypertext format based on the edited layout. It is an object of the present invention to provide a recording medium, a data processing device, and a data processing method in which a data processing program for creating a structured file such as a file is recorded.

【０００７】また、この発明はユーザがファイル名（拡
張子）を意識することなくファイルの保存・管理が行え
るようにすることで、ファイルの保存・管理に手間がか
からないデータ処理プログラムを記録した記録媒体、デ
ータ処理装置およびデータ処理方法を提供することを目
的とする。Further, the present invention allows a user to save and manage a file without being aware of the file name (extension), thereby recording a data processing program that saves time and effort in saving and managing the file. It is an object to provide a medium, a data processing device, and a data processing method.

【０００８】[0008]

【課題を解決するための手段】この発明は、ＯＣＲ等の
画像読取装置で読み取った原画像を処理して、分割され
た領域毎に文字領域、表領域またはイメージ領域のいず
れであるかを識別する第１のステップと、前記第１のス
テップで文字領域または表領域であると識別した領域に
対して、該領域内の文字を認識し、文字コードに変換す
る第２のステップと、前記第１のステップでイメージ領
域であると識別した領域に対して、該領域内のイメージ
を切り出してビットマップ形式等の適当な形式でイメー
ジファイルを作成する第３のステップと、前記第１のス
テップで識別した領域毎にその位置を置き換えるレイア
ウト編集を行う第４のステップと、前記第４のステップ
で編集されたレイアウトに基づいてハイパーテキスト形
式等で構造化ファイルを作成する第５のステップと、を
有する。According to the present invention, an original image read by an image reading apparatus such as an OCR is processed to identify, for each divided area, a character area, a table area, or an image area. A first step of recognizing a character in the area identified as a character area or a table area in the first step and converting the character into a character code; and A third step of cutting out an image in the area identified as an image area in step 1 and creating an image file in an appropriate format such as a bit map format; A fourth step of performing layout editing for replacing the position of each identified area, and a structured file in a hypertext format or the like based on the layout edited in the fourth step. It has a fifth step of creating a Le, a.

【０００９】この構成では、ＯＣＲ等の画像読取装置で
読み取った原画像に対して、分割された領域毎に文字領
域、表領域またはイメージ領域のいずれであるかを識別
する。文字領域または表領域については、該領域内の文
字を認識して文字コードに変換し、イメージ領域につい
ては該領域内のイメージをビットマップ形式等でイメー
ジファイルを作成する。第４のステップでは、識別され
た領域毎にその位置を置き換えるレイアウトの編集を受
け付け、編集されたレイアウトに基づいてハイパーテキ
スト形式等の構造化ファイルを作成する。In this configuration, the original image read by an image reading apparatus such as an OCR is identified as a character area, a table area, or an image area for each divided area. For a character area or a table area, a character in the area is recognized and converted to a character code, and for an image area, an image in the area is created in an image file in a bitmap format or the like. In a fourth step, editing of a layout that replaces the position of each identified area is accepted, and a structured file in a hypertext format or the like is created based on the edited layout.

【００１０】例えば、ＯＣＲ等の画像読取装置で図９
（Ａ）に示す原稿画像が読み取られる。第１のステップ
では原稿画像における文字領域、表領域およびイメージ
領域を識別し、図９（Ｂ）に示すような各領域を矩形で
表示したレイアウト編集画面を表示する。そして、ユー
ザがマウス等の操作でこの矩形を任意の位置に移動させ
て自分の好みに合うレイアウトに編集する（図９（Ｃ）
参照）。ここで編集されたレイアウトに基づいてハイパ
ーテキスト形式の構造化ファイルを作成する。ここで作
成された構造化ファイルをブラウザで出力すると、図９
（Ｄ）に示すように原画像と異なるレイアウトの表示を
得ることができる。For example, in an image reading apparatus such as an OCR, FIG.
The original image shown in FIG. In the first step, a character editing area, a table area and an image area in the document image are identified, and a layout editing screen is displayed in which each area is displayed as a rectangle as shown in FIG. 9B. Then, the user moves the rectangle to an arbitrary position by operating the mouse or the like and edits the rectangle to a layout that suits the user's preference (FIG. 9C).
reference). Here, a structured file in a hypertext format is created based on the edited layout. When the structured file created here is output with a browser, FIG.
As shown in (D), a display with a layout different from that of the original image can be obtained.

【００１１】また、前記原画像の画像データ、文字の認
識結果、イメージファイル、構造化ファイル等、一連の
処理で発生したデータ毎に、そのデータ種別に応じたフ
ァイル拡張子を付けて保存する第６のステップを有す
る。Further, for each data generated in a series of processing, such as the image data of the original image, the character recognition result, the image file, the structured file, etc., a file extension corresponding to the data type is added and saved. It has six steps.

【００１２】この構成では、ファイルの保存時にデータ
の種別に応じた適当な拡張子が自動的に付けられるた
め、保存するファイルのデータ種別を気にする必要がな
く、ファイルの保存・管理が容易に行える。In this configuration, an appropriate extension according to the type of data is automatically added at the time of saving a file, so that there is no need to worry about the data type of the file to be saved, and the file can be easily stored and managed. Can be done.

【００１３】また、前記第２のステップは、文字種別で
あるフォントおよび文字の大きさであるポイント数も認
識する処理を含むステップとしてもよいし、表領域にお
ける罫線を認識する処理を含むステップとすることもで
きる。The second step may include a step of recognizing a font as a character type and a number of points as a character size, or a step including a step of recognizing a ruled line in a table area. You can also.

【００１４】また、前記第６のステップで保存された構
造化ファイル毎に、前記原画像を縮小して表示する第７
のステップを有することによって、保存・管理されてい
る構造化ファイルの内容を簡単に確認できる。[0014] In addition, for each structured file stored in the sixth step, a seventh image for reducing and displaying the original image is provided.
With this step, the contents of the structured file stored and managed can be easily confirmed.

【００１５】さらに、前記第６のステップで保存された
ファイルを読み出し、レイアウトの再編集等の処理を開
始する第８のステップを有することによって、保存・管
理されているものについては原画像を再度ＯＣＲ等で読
み取ることなくレイアウト等の再編集が行える。Further, the method has an eighth step of reading out the file stored in the sixth step and starting processing such as re-editing of the layout. The layout and the like can be re-edited without reading by OCR or the like.

【００１６】[0016]

【発明の実施の形態】図１は、この発明の実施形態であ
るデータ処理装置の構成を示す図である。データ処理装
置１は、ＣＰＵ、ＲＯＭおよびＲＡＭ等を有する制御部
２と、入力操作を行う入力部３と、表示を行う表示部４
と、後述する処理で作成される構造化ファイル等を記憶
する記憶部５と、既存の文書を光学的に読み取るＯＣＲ
装置９とのインタフェース（Ｉ／Ｆ）部６と、を備えて
いる。データ処理装置１の制御部２には、予めＣＤ−Ｒ
ＯＭ等の記録媒体からプログラム（この発明で言うデー
タ処理プログラム）が供給されており、このプログラム
に従って後述する処理を実行する。FIG. 1 is a diagram showing a configuration of a data processing apparatus according to an embodiment of the present invention. The data processing device 1 includes a control unit 2 having a CPU, a ROM, a RAM, and the like, an input unit 3 for performing an input operation, and a display unit 4 for performing a display.
And a storage unit 5 for storing a structured file or the like created in a process described later, and an OCR for optically reading an existing document.
And an interface (I / F) unit 6 with the device 9. The control unit 2 of the data processing device 1 has a CD-R
A program (a data processing program according to the present invention) is supplied from a recording medium such as an OM, and the processing described below is executed according to the program.

【００１７】図２は、この発明の実施形態であるデータ
処理装置の機能ブロック図である。１１は、ＯＣＲ装置
９で読み取った既存の文書における文字領域、表領域お
よび画像領域を認識する領域認識部である。１２は、領
域認識部１１で認識された文字領域および表領域内に存
在する文字を周知のパターンマッチングで認識する文字
認識部である。１３は、文字認識部１２で認識した文字
を文字コードに変換する文字コード化部である。１４
は、領域認識部１１で認識された画像領域に対して該領
域内の画像を抽出してビットマップ形式等で画像ファイ
ルを作成する画像処理部である。１５は、ＯＣＲ９で読
み取った既存の文書における文字領域、表領域および画
像領域のレイアウトを編集するレイアウト編集部であ
る。１６は、レイアウト編集部１５でレイアウトが編集
されたものについてハイパーテキスト形式のファイル
（構造化ファイル）を作成するハイパーテキスト部であ
る。１７は、ハイパーテキスト部１６で作成されたハイ
パーテキストファイルや画像処理部１４で作成された画
像ファイル、ＯＣＲ９から入力された原画像ファイル等
を保存する保存部である。１８は、保存部１７に保存さ
れているファイルを管理する管理部である。FIG. 2 is a functional block diagram of the data processing device according to the embodiment of the present invention. An area recognition unit 11 recognizes a character area, a table area, and an image area in an existing document read by the OCR device 9. Reference numeral 12 denotes a character recognition unit that recognizes characters existing in the character region and the table region recognized by the region recognition unit 11 by well-known pattern matching. Reference numeral 13 denotes a character encoding unit that converts a character recognized by the character recognition unit 12 into a character code. 14
Is an image processing unit that extracts an image in the image area recognized by the area recognition unit 11 and creates an image file in a bitmap format or the like. Reference numeral 15 denotes a layout editing unit that edits a layout of a character area, a table area, and an image area in an existing document read by the OCR 9. Reference numeral 16 denotes a hypertext unit that creates a hypertext format file (structured file) for a layout edited by the layout editing unit 15. A storage unit 17 stores a hypertext file created by the hypertext unit 16, an image file created by the image processing unit 14, an original image file input from the OCR 9, and the like. Reference numeral 18 denotes a management unit that manages files stored in the storage unit 17.

【００１８】次に、この実施形態のデータ処理装置１の
処理について説明する。図３は、データ処理装置の処理
を示すフローチャートである。まず、ＯＣＲ９で既存の
文書が光学的に読み取られる。データ処理装置１は、Ｉ
／Ｆ部６を介してＯＣＲ９で読み取った文書の画像デー
タが入力されると（ｎ１）、表示部４に入力された画像
を表示する（ｎ２）。表示部４には、図４に示すように
３つのウィンドが表示されており、原画像表示ウィンド
ウ２１に原画像が表示される。なお、この時点では認識
文字表示ウィンドウ２２およびレイアウト表示ウィンド
ウ２３内には何も表示されない。Next, the processing of the data processing apparatus 1 of this embodiment will be described. FIG. 3 is a flowchart showing the processing of the data processing device. First, an existing document is optically read by the OCR 9. The data processing device 1
When image data of a document read by the OCR 9 is input via the / F unit 6 (n1), the input image is displayed on the display unit 4 (n2). As shown in FIG. 4, three windows are displayed on the display unit 4, and an original image is displayed in an original image display window 21. At this point, nothing is displayed in the recognition character display window 22 and the layout display window 23.

【００１９】この状態で、ユーザは原画像表示ウィンド
ウ２１に表示されている原画像に対して領域の指定を行
う。領域の指定は、入力部３に設けられたマウスを操作
して指定する領域をドラッグすればよい。データ処理装
置１は、領域の指定が行われると（ｎ３）、指定された
領域が文字領域、表領域またはイメージ領域のいずれの
領域であるかを識別する（ｎ４）。データ処理装置１
は、ｎ４で文字領域または表領域であると判定すると、
該領域内の文字を周知のパターンマッチングによって認
識し（ｎ５）、認識した文字を文字コード化（ＪＩＳコ
ードやＡＳＣＩＩコード等にコード化）し、該領域にお
ける文字列データを作成する（ｎ６）。ｎ５では、文字
種別であるフォントおよび文字の大きさ（ポイント数）
も認識される。一方、ｎ４でイメージ領域であると判定
すると、該領域内のイメージを切り出しビットマップ形
式やＧＩＦ形式等の画像ファイルを作成する（ｎ７）。
この後、領域指定の完了が指示されなければ（ｎ８）、
ｎ３に戻って新たな領域の指定を受け付けて上記の処理
を繰り返す。In this state, the user specifies a region for the original image displayed in the original image display window 21. The designation of the area may be performed by operating the mouse provided on the input unit 3 and dragging the designated area. When the area is specified (n3), the data processing device 1 identifies whether the specified area is a character area, a table area, or an image area (n4). Data processing device 1
Is determined to be a character area or a table area at n4,
Characters in the area are recognized by well-known pattern matching (n5), and the recognized character is converted into a character code (encoded into JIS code, ASCII code, or the like) to create character string data in the area (n6). In n5, font and character size (point number) which are character types
Is also recognized. On the other hand, if it is determined in n4 that the area is an image area, an image in the area is cut out and an image file in a bitmap format, a GIF format or the like is created (n7).
Thereafter, if the completion of the area designation is not instructed (n8),
Returning to n3, the designation of a new area is accepted, and the above processing is repeated.

【００２０】ユーザによる領域の指定が完了すると、ｎ
９でレイアウト編集処理が開始される。図５は領域の指
定が完了した時点における表示部４の表示例を示してい
る。図５に示す原画像表示ウィンド２１内において破線
で囲まれている領域が指定された領域である。なお、文
字領域は赤色の破線で囲まれ、表領域は青色の破線で囲
まれ、イメージ領域は緑色の破線で囲まれている。した
がって、ユーザはこの破線の色から指定した領域の種別
が正確に識別されたかどうかを簡単に認識することがで
きる。なお、指定された領域には認識番号が付されてい
る。この図では、２つの画像領域と１つの文字領域が指
定された状態を示している。なお、指定された文字領域
における認識番号の右横に付されている矢印は、文字列
の読み方向（認識方向）を示している。また、認識文字
ウィンドウ２２には、指定された文字領域に対してｎ
５、ｎ６で処理した認識結果が表示され、レイアウト表
示ウィンドウ２３には指定された領域毎に矩形が表示さ
れる。When the user specifies the area, n
At 9 the layout editing process is started. FIG. 5 shows a display example of the display unit 4 when the designation of the area is completed. A region surrounded by a broken line in the original image display window 21 shown in FIG. 5 is a designated region. The character area is surrounded by a red broken line, the table area is surrounded by a blue broken line, and the image area is surrounded by a green broken line. Therefore, the user can easily recognize whether or not the type of the designated area is correctly identified from the color of the broken line. The designated area is given a recognition number. This figure shows a state where two image areas and one character area are designated. An arrow attached to the right of the recognition number in the designated character area indicates the reading direction (recognition direction) of the character string. In addition, the recognition character window 22 displays n for the specified character area.
The recognition results processed in steps 5 and n6 are displayed, and a rectangle is displayed in the layout display window 23 for each designated area.

【００２１】ユーザは、認識文字ウィンドウ２２で、指
定した文字領域および表領域内において文字の誤認識が
生じていないかどうかを確認する。誤認識されている文
字があればこの認識文字ウィンドウ２２で修正する。誤
認識された文字を修正する方法としては、ユーザに誤認
識されている文字を指定させ、キーボードから正しい文
字を入力させてもよいし、指定された文字と形状が類似
する文字を候補として表示し、表示した候補の中からユ
ーザに正しい文字を選択させるようにしてもよい。レイ
アウト表示ウィンドウ２３には上述のように指定された
領域毎に矩形が表示されており、文字領域に対応する矩
形は赤色、表領域に対応する矩形は青色、イメージ領域
に対応する矩形は緑色で表示される。また、各矩形の左
上には認識番号が付されており、ユーザはどの矩形が原
画像において指定したどの領域と対応しているのかを容
易に認識できる。The user confirms in the recognition character window 22 whether or not erroneous character recognition has occurred in the designated character area and table area. If there is any erroneously recognized character, it is corrected in this recognized character window 22. As a method of correcting a misrecognized character, the user may specify the misrecognized character and input the correct character from the keyboard, or display a character having a shape similar to the specified character as a candidate. Then, the user may be caused to select a correct character from the displayed candidates. A rectangle is displayed in the layout display window 23 for each of the designated regions as described above. A rectangle corresponding to the character region is red, a rectangle corresponding to the table region is blue, and a rectangle corresponding to the image region is green. Is displayed. In addition, a recognition number is given to the upper left of each rectangle, so that the user can easily recognize which rectangle corresponds to which area specified in the original image.

【００２２】ユーザは、レイアウト編集ウィンドウ２３
に表示されている各矩形をマウスで任意の位置に移動す
ることで、レイアウトの編集を行う。データ処理装置１
は、ユーザによるレイアウトの編集が完了すると、レイ
アウト表示ウィンド２３における矩形の配置データ（レ
イアウトデータ）、文字領域や表領域における文字の認
識結果、表領域における表罫線の情報、画像領域におけ
るイメージデータ等に基づいてハイパーテキスト形式の
構造化ファイルを作成する（ｎ１１）。図６は、ｎ１１
で作成された構造化ファイルをブラウザで出力したとき
の表示例を示している。図６に示すように、既存の文書
から読み取った画像から、ユーザ自身が選択した領域の
みで且つ自分の好みに合ったレイアウトの出力を得るこ
とができる。The user edits the layout edit window 23
The layout is edited by moving each of the rectangles displayed in the column to an arbitrary position with the mouse. Data processing device 1
When layout editing by the user is completed, rectangular layout data (layout data) in the layout display window 23, a character recognition result in a character area or a table area, table ruled line information in a table area, image data in an image area, etc. Then, a structured file in a hypertext format is created based on (n11). FIG.
Shows a display example when the structured file created in step (1) is output by a browser. As shown in FIG. 6, from an image read from an existing document, it is possible to obtain an output of a layout that suits the user's preference only in the area selected by the user himself.

【００２３】データ処理装置１は、ｎ１１で構造化ファ
イルの作成を完了すると、ＯＣＲ９で読み取った画像フ
ァイル、文字領域や表領域において認識した文字列デー
タ、画像領域における画像ファイル等関連するファイル
を一括して保存する保存処理を実行する（ｎ１２）。図
７は、保存処理における表示部４の表示画面を示す図で
ある。図示するように、表示部４に保存ウィンドウ２５
が表示され、ユーザにファイル名の入力を促す。ここ
で、ユーザはファイル名としてオブジェクト名を入力す
る。データ処理装置１は、ＯＣＲ９で読み取った画像フ
ァイル、文字領域や表領域において認識した文字列デー
タ、画像領域における画像ファイル等を入力されたオブ
ジェクト名に適当なファイル拡張子を付けたファイル名
で同じディレクトリに保存する。例えば、入力されたオ
ブジェクト名が『ＯＭＲＯＮ』であった場合、データ処
理装置１ではＯＣＲで読み取った画像ファイルには『Ｏ
ＭＲＯＮ．ｂｍｐ』、イメージファイルには『ＯＭＲＯ
Ｎ．ｇｉｆ』、テキストファイルには『ＯＭＲＯＮ．ｔ
ｘｔ』、ハイパーテキストファイルには『ＯＭＲＯＮ．
ｈｔｍ』等ファイルの種別に応じた拡張子を付け、また
これらのファイルを同じディレクトリに保存する。な
お、同じ種別のファイルが複数あるとき（例えばイメー
ジファイルが複数あるとき）には、『ＯＭＲＯＮ１．ｇ
ｉｆ』『ＯＭＲＯＮ２．ｇｉｆ』・・・等のように入力
されたファイル名の後ろに認識番号を付して保存する。When the creation of the structured file is completed in n11, the data processing device 1 collectively stores the image file read by the OCR 9, the character string data recognized in the character area and the table area, and the related files such as the image file in the image area. Then, a saving process is performed (n12). FIG. 7 is a diagram illustrating a display screen of the display unit 4 in the storage processing. As shown, a save window 25 is displayed on the display unit 4.
Is displayed to prompt the user to enter a file name. Here, the user inputs an object name as a file name. The data processing device 1 uses an image file read by the OCR 9, character string data recognized in a character area or a table area, an image file in an image area, or the like as a file name obtained by adding an appropriate file extension to an input object name. Save to a directory. For example, if the input object name is “OMRON”, the image file read by OCR is
MRON. bmp ”and the image file is“ OMRO
N. gif ”and the text file is“ OMRON. t
xt ”and“ OMRON.xt ”in the hypertext file.
"htm", etc., and an extension corresponding to the type of file, and save these files in the same directory. When there are a plurality of files of the same type (for example, when there are a plurality of image files), “OMRON1. g
if ”“ OMRON2. gif ”and the like are appended with a recognition number after the input file name and stored.

【００２４】このように、データ処理装置１が保存する
ファイルの種別に応じて適当な拡張子を付けて保存する
ので、ユーザがファイルを保存する際にそのファイルの
種別を認識して適当な拡張子を付ける必要がなく、ま
た、関連する複数のファイルが一括して保存されるため
ファイルの保存が簡単に行える。As described above, since the data is saved with an appropriate extension in accordance with the type of the file to be stored by the data processing apparatus 1, when the user saves the file, the user recognizes the type of the file and appropriately expands the file. There is no need to attach a child, and since a plurality of related files are stored collectively, the files can be easily stored.

【００２５】上記の処理で記憶部５に保存されたファイ
ルは、管理部１８で管理される。以下、管理部１８にお
けるファイルの管理について説明する。データ処理装置
１は、管理部１８で保存・管理しているハイパーテキス
トファイル毎にＯＣＲ９で読み取った原稿画像の一部を
縮小表示（サムネール表示）を行う機能を備えている。
図８はサムネール表示が行われた表示部４の表示例を示
す図である。この図では、管理部１８が３つのハイパー
テキストファイルを管理しているときの例を示してい
る。図に示すように、管理部１８で管理されているハイ
パーテキストファイル毎にＯＣＲ９で読み取った原画像
が表示されるため、ユーザはこのサムネール表示を見る
ことによって、管理部１８で保存・管理されているハイ
パーテキストファイルの内容を容易に確認することがで
きる。なお、ウィンドウには原画像の一部が表示される
のであるが、ウィンドウに表示される原画像の位置は任
意に設定できる。したがって、ユーザはその内容が最も
判断しやすい原画像の位置を設定しておくことができ
る。また、ここで設定した位置は記憶され、ここで設定
された位置の画像がサムネール表示を行ったときに表示
される。なお、ここでは、ＯＣＲ９で読み取った原画像
をサムネール表示するとしたが、上記の処理でユーザが
指定したイメージ領域の画像としてもよい。The file stored in the storage unit 5 in the above processing is managed by the management unit 18. Hereinafter, file management in the management unit 18 will be described. The data processing apparatus 1 has a function of reducing (thumbnail display) a part of the document image read by the OCR 9 for each hypertext file stored and managed by the management unit 18.
FIG. 8 is a diagram illustrating a display example of the display unit 4 on which the thumbnail display is performed. This figure shows an example in which the management unit 18 manages three hypertext files. As shown in the figure, since the original image read by the OCR 9 is displayed for each hypertext file managed by the management unit 18, the user sees the thumbnail display and saves and manages it by the management unit 18. You can easily check the contents of the existing hypertext file. Although a part of the original image is displayed in the window, the position of the original image displayed in the window can be set arbitrarily. Therefore, the user can set the position of the original image whose content is most easily determined. The position set here is stored and displayed when the image at the position set here is displayed as a thumbnail. Although the original image read by the OCR 9 is displayed as a thumbnail here, the image may be an image in an image area designated by the user in the above processing.

【００２６】また、管理部１８で保存・管理している原
画像を領域認識部１１に出力し、この画像を用いて新た
なハイパーテキストファイルを作成することもできる。
これにより、ＯＣＲで一度読み取った文書についてレイ
アウトの再編集を行う場合、再度ＯＣＲでこの文書を読
み取る必要がなく、再編集における操作性を向上するこ
とができる。The original image stored and managed by the management unit 18 can be output to the area recognition unit 11 and a new hypertext file can be created using this image.
Thus, when re-editing the layout of a document once read by OCR, it is not necessary to read this document again by OCR, and operability in re-editing can be improved.

【００２７】[0027]

【発明の効果】以上のように、この発明によれば、ＯＣ
Ｒ等の画像読取装置で読み取った文書をユーザが自分の
好みに合ったレイアウトに編集でき、且つ、このレイア
ウトを編集した構造化ファイルを作成させることができ
る。したがって、ユーザの好みに合った構造化ファイル
を容易に作成することができる。As described above, according to the present invention, the OC
A user can edit a document read by an image reading device such as R into a layout that suits his / her preference, and can create a structured file in which the layout is edited. Therefore, it is possible to easily create a structured file that meets the user's preference.

【００２８】また、ファイルを保存するときには、デー
タ種別に応じたファイル拡張子が自動的に付けられるた
め、ファイルを保存するときにユーザがファイル拡張子
を入力する必要がない。したがって、ユーザは保存する
ファイルのデータ種別を意識することなくファイルを保
存することができる。When a file is saved, a file extension corresponding to the data type is automatically added, so that the user does not need to input the file extension when saving the file. Therefore, the user can save the file without being aware of the data type of the file to be saved.

[Brief description of the drawings]

【図１】この発明の実施形態であるデータ処理装置の構
成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a data processing device according to an embodiment of the present invention.

【図２】この発明の実施形態であるデータ処理装置の機
能を示す図である。FIG. 2 is a diagram showing functions of a data processing device according to an embodiment of the present invention.

【図３】この発明の実施形態であるデータ処理装置の処
理を示すフローチャートである。FIG. 3 is a flowchart illustrating a process of the data processing device according to the embodiment of the present invention;

【図４】表示部における表示例を示す図である。FIG. 4 is a diagram showing a display example on a display unit.

【図５】表示部における表示例を示す図である。FIG. 5 is a diagram illustrating a display example on a display unit.

【図６】表示部における表示例を示す図である。FIG. 6 is a diagram illustrating a display example on a display unit.

【図７】表示部における表示例を示す図である。FIG. 7 is a diagram illustrating a display example on a display unit.

【図８】表示部における表示例を示す図である。FIG. 8 is a diagram illustrating a display example on a display unit.

【図９】この発明の作用を説明する図である。FIG. 9 is a diagram illustrating the operation of the present invention.

[Explanation of symbols]

１−データ処理装置２−制御部３−入力部４−表示部５−記憶部６−インタフェース部９−ＯＣＲ１１−領域認識部１２−文字認識部１３−文字コード化部１４−画像処理部１５−レイアウト編集部１６−ハイパーテキスト部１７−保存部１８−管理部 1-Data processing device 2-Control unit 3-Input unit 4-Display unit 5-Storage unit 6-Interface unit 9-OCR 11-Area recognition unit 12-Character recognition unit 13-Character coding unit 14-Image processing unit 15 -Layout editing unit 16-Hypertext unit 17-Storage unit 18-Management unit

Claims

[Claims]

A first step of processing an original image read by an image reading apparatus such as an OCR to identify each of the divided areas as a character area, a table area, or an image area; A second step of recognizing a character in the area identified as a character area or a table area in the first step and converting the character into a character code; and an image area in the first step. A third step of cutting out an image in the area and creating an image file in an appropriate format such as a bitmap format for the area identified as above, and setting the position for each area identified in the first step. A fourth step of performing layout editing to be replaced; and a fifth step of creating a structured file in a hypertext format or the like based on the layout edited in the fourth step. Recording medium recording the data processing program having a flop, a.

2. The method according to claim 1, further comprising the step of saving the image data of the original image, the result of character recognition, an image file, a structured file, and the like with a file extension corresponding to the data type. A recording medium that records a data processing program.

3. The recording according to claim 1, wherein the second step includes a step of recognizing a font as a character type and a point number as a character size. Medium.

4. The method according to claim 1, wherein the second step includes a step of recognizing a ruled line in the table area.
Or a recording medium recording the data processing program according to any one of 3.

5. The data processing according to claim 1, further comprising a seventh step of reducing and displaying the original image for each structured file stored in the sixth step. A recording medium on which a program is recorded.

6. The original image file stored in the sixth step is read out, and the processing from the first step on is performed on the read original image.
Or a recording medium on which the data processing program according to any one of 5 is recorded.

7. An area identification unit that processes an original image read by an image reading apparatus such as an OCR and identifies each of the divided areas as a character area, a table area, or an image area; Character processing means for recognizing a character in the area identified as a character area or table area by the identification means and converting the character into a character code; and an area identified by the area identification means as an image area Image file creation means for creating an image file in an appropriate format such as a bitmap format by cutting out an image in the area, and editing the layout by replacing the position of each area identified by the area identification means. Layout editing means for creating a structured file in a hypertext format or the like based on the layout edited by the layout editing means. Data processing apparatus including of a file creation means.

8. A file storing means for storing image data of the original image, a result of character recognition, an image file, a structured file, etc. with a file extension corresponding to the data type. Data processing equipment.

9. An original image read by an image reading apparatus such as an OCR is processed to identify each of the divided areas as a character area, a table area, or an image area. For an area identified as being present, characters in the area are recognized and converted to character codes, and for an area identified as an image area, an image in the area is cut out and converted to a bitmap format or the like. A data processing method that creates an image file in an appropriate format, further accepts layout editing by replacing the position of each identified area, and creates a structured file in a hypertext format or the like based on the edited result of the layout. .

10. The data processing method according to claim 9, wherein image data of the original image, a result of character recognition, an image file, a structured file, and the like are saved with a file extension corresponding to the data type.