JPH11238053A

JPH11238053A - Document device, document generation method and recording medium recording document generation program

Info

Publication number: JPH11238053A
Application number: JP10041644A
Authority: JP
Inventors: Junzo Ikuta; 淳三生田; Yasuyuki Numata; 泰之沼田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1998-02-24
Filing date: 1998-02-24
Publication date: 1999-08-31

Abstract

PROBLEM TO BE SOLVED: To convert an existed document inputted as picture data into a document described by prescribed structured tag language. SOLUTION: Picture data inputted from an input means 1 is layout-recognized by a layout recognition means 3 and an HTML layout generation means 6 generates the control code of HTML based on respective recognized layouts and HTML(hyper text mark up language) layout knowledge 5. A character recognition means 4 character-recognizes input picture data and outputs a character code. An HTML document generation means 20 generates the HTML document by associating the control code of HTML with the character code and outputs it from an output means 7.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、例えばインターネ
ット（Internet）上で、ワールド・ワイド・ウェブ（Wo
rld Wide Web）のホームページを記述するための言語で
あるＨＴＭＬ（HyperText Markup Language）など、構
造化タグ言語で記述された文書を作成する文書作成装置
および文書作成方法ならびに文書作成プログラムを記録
した記録媒体に関するものである。[0001] The present invention relates to the World Wide Web (WoW), for example, on the Internet.
A document creation device and a document creation method for creating a document described in a structured tag language, such as HTML (HyperText Markup Language), which is a language for describing a home page of an rld Wide Web, and a recording medium storing a document creation program It is about.

【０００２】[0002]

【従来の技術】テキストファイル中にタグと呼ばれる文
字列（制御コード）を置くことで、文字の配置、大き
さ、使用フォントなどを指定することのできる構造化タ
グ言語が知られている。構造化タグ言語で記述された文
書は、ブラウザと呼ばれる専用のプログラムを用いるこ
とにより、文書に記述されている制御コードに従って表
示される。2. Description of the Related Art A structured tag language is known in which a character string (control code) called a tag is placed in a text file so that the arrangement, size, font and the like of characters can be specified. A document described in the structured tag language is displayed according to a control code described in the document by using a dedicated program called a browser.

【０００３】この構造化タグ言語の代表的なものとし
て、インターネット上のＷＷＷのホームページを記述す
る言語であるＨＴＭＬが知られている。特に、近年のイ
ンターネット技術やイントラネット技術の発達ならびに
普及により、様々な文書をＨＴＭＬ文書に変換する要求
が高まっている。当然ながら、従来より保持している紙
に印字された膨大な文書をＨＴＭＬ文書の形式に変換し
て利用したいという要求も大きい。As a typical example of the structured tag language, HTML, which is a language for describing a WWW homepage on the Internet, is known. In particular, with the development and spread of Internet technology and intranet technology in recent years, there has been an increasing demand for converting various documents into HTML documents. Naturally, there is a great demand to convert a huge document printed on paper held in the past into an HTML document format and use it.

【０００４】紙に印字された文書をＨＴＭＬ文書のよう
な構造化タグ言語で記述された文書に変換する場合に
は、次の２つの方法がある。１つ目の方法は、文書を光
学式スキャナ装置で読み取り、それをそのままイメージ
データとしてＨＴＭＬ文書内に組み込んで利用する方法
である。２つ目の方法は、文書を見ながら人手でＨＴＭ
Ｌ用の制御コードを付与していく方法である。There are two methods for converting a document printed on paper into a document described in a structured tag language such as an HTML document. The first method is a method in which a document is read by an optical scanner, and the document is used as it is incorporated into an HTML document as image data. The second method is HTM by hand while looking at the document
This is a method of giving control codes for L.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、従来の
方法では次のような問題点がある。まず、イメージデー
タとして利用する場合には、イメージデータが巨大にな
り、記憶する容量やユーザが閲覧するためにデータを取
得する時間が巨大になるという問題点がある。また、人
手でＨＴＭＬ用の制御コードを付与する場合には、膨大
な作業時間を必要とし、効率的ではない。However, the conventional method has the following problems. First, when used as image data, there is a problem that the image data becomes enormous, and the storage capacity and the time to acquire the data for the user to browse become enormous. In addition, when a control code for HTML is manually added, an enormous amount of work time is required, which is not efficient.

【０００６】本発明は、画像データとして入力される既
存の文書を、所定の構造化タグ言語で記述された文書に
変換することのできる文書作成装置および文書作成方法
ならびに文書作成プログラムを記録した記録媒体を提供
することを目的とする。SUMMARY OF THE INVENTION The present invention provides a document creating apparatus and a document creating method capable of converting an existing document input as image data into a document described in a predetermined structured tag language, and a recording in which a document creating program is recorded. The purpose is to provide a medium.

【０００７】[0007]

【課題を解決するための手段】この課題を解決するため
に本発明は、画像データとして入力される文書から、所
定の構造化タグ言語で記述された文書を作成する文書作
成装置であって、文書を画像データとして入力する入力
手段と、入力手段から入力される画像データを記憶する
入力画像記憶手段と、入力画像記憶手段に記憶される画
像データのレイアウトを認識するレイアウト認識手段
と、構造化タグ言語のレイアウトに関する変換ルールを
記憶したレイアウト知識記憶手段と、レイアウト知識記
憶手段の変換ルールに基づいて、レイアウト認識手段に
より認識される各レイアウトに対して構造化タグ言語の
制御コードを生成する制御コード生成手段と、入力画像
記憶手段に記憶される画像データ中の文字イメージを認
識し、文字コードに変換する文字認識手段と、文字認識
手段により出力される文字コードと、制御コード生成手
段により生成される制御コードとを関連付けて、構造化
タグ言語で記述された文書を作成する文書作成手段と、
文書作成手段により作成された文書を出力する出力手段
と、を有する構成とした。According to the present invention, there is provided a document creation apparatus for creating a document described in a predetermined structured tag language from a document input as image data, Input means for inputting a document as image data, input image storage means for storing image data input from the input means, layout recognition means for recognizing a layout of image data stored in the input image storage means, and structuring Layout knowledge storage means storing conversion rules related to a tag language layout, and control for generating a structured tag language control code for each layout recognized by the layout recognition means based on the conversion rules of the layout knowledge storage means Code generating means for recognizing a character image in the image data stored in the input image storage means, Character recognition means for conversion, a character code outputted by the character recognition unit, in association with the control code generated by the control code generating means, a document creation means for creating a document described in a structured tag language,
Output means for outputting a document created by the document creation means.

【０００８】これにより、画像データとして入力される
既存の文書を、所定の構造化タグ言語で記述された文書
に変換することのできる文書作成装置が得られる。As a result, a document creation device capable of converting an existing document input as image data into a document described in a predetermined structured tag language is obtained.

【０００９】[0009]

【発明の実施の形態】本発明の請求項１に記載の発明
は、画像データとして入力される文書から、所定の構造
化タグ言語で記述された文書を作成する文書作成装置で
あって、文書を画像データとして入力する入力手段と、
入力手段から入力される画像データを記憶する入力画像
記憶手段と、入力画像記憶手段に記憶される画像データ
のレイアウトを認識するレイアウト認識手段と、構造化
タグ言語のレイアウトに関する変換ルールを記憶したレ
イアウト知識記憶手段と、レイアウト知識記憶手段の変
換ルールに基づいて、レイアウト認識手段により認識さ
れる各レイアウトに対して構造化タグ言語の制御コード
を生成する制御コード生成手段と、入力画像記憶手段に
記憶される画像データ中の文字イメージを認識し、文字
コードに変換する文字認識手段と、文字認識手段により
出力される文字コードと、制御コード生成手段により生
成される制御コードとを関連付けて、構造化タグ言語で
記述された文書を作成する文書作成手段と、文書作成手
段により作成された文書を出力する出力手段と、を有す
る構成としたことにより、画像データとして入力される
文書のレイアウトに従って、所定の構造化タグ言語の制
御コードを付与し、構造化タグ言語で記述された文書に
変換することができる。DESCRIPTION OF THE PREFERRED EMBODIMENTS The invention according to claim 1 of the present invention is a document creating apparatus for creating a document described in a predetermined structured tag language from a document input as image data, Input means for inputting image data as image data;
Input image storage means for storing image data input from the input means, layout recognizing means for recognizing the layout of the image data stored in the input image storage means, and layout storing conversion rules for the layout of the structured tag language Knowledge storage means, control code generation means for generating a structured tag language control code for each layout recognized by the layout recognition means based on the conversion rules of the layout knowledge storage means, and storage in the input image storage means Character recognition means for recognizing a character image in the image data to be converted into a character code, character codes output by the character recognition means, and control codes generated by the control code generation means in association with each other. A document creation means for creating a document described in a tag language, and a document creation means Output means for outputting a document, a control code of a predetermined structured tag language is added in accordance with the layout of the document input as image data, and a document described in the structured tag language is added to the document. Can be converted.

【００１０】本発明の請求項２に記載の発明は、請求項
１に記載の発明において、レイアウト知識記憶手段は、
利用者により変換ルールを追加可能である構成としたこ
とにより、利用者の持つ文書に特有の表現に対応でき
る。According to a second aspect of the present invention, in the first aspect, the layout knowledge storage means includes:
By adopting a configuration in which a conversion rule can be added by a user, it is possible to cope with an expression unique to a document held by the user.

【００１１】本発明の請求項３に記載の発明は、請求項
１に記載の発明において、文書作成手段は、複数の入力
画像データから得られる各文書を連結し、構造化タグ言
語で記述された１つの文書として生成する構成としたこ
とにより、物理的に複数のページにわたる文書であって
も、構造化タグ言語で記述された１つのページとして利
用できる。According to a third aspect of the present invention, in the first aspect of the present invention, the document creating means connects each document obtained from a plurality of input image data and is described in a structured tag language. With such a configuration that a single document is generated, even a document physically extending over a plurality of pages can be used as one page described in a structured tag language.

【００１２】以下、本発明の実施の形態について、図面
を参照しながら説明する。なお、以下の各実施の形態で
は、構造化タグ言語の具体例として、ＨＴＭＬを例にと
り説明を行っている。Hereinafter, embodiments of the present invention will be described with reference to the drawings. In each of the following embodiments, HTML is described as a specific example of the structured tag language.

【００１３】（実施の形態１）図１は本発明の第１の実
施の形態における文書作成装置の機能ブロック図であ
り、機能手段による構成を示している。(Embodiment 1) FIG. 1 is a functional block diagram of a document creating apparatus according to a first embodiment of the present invention, and shows the configuration of functional units.

【００１４】図１において、１は画像イメージを入力す
る入力手段、２は画像イメージやレイアウト認識、文字
認識結果などを記憶するデータ記憶手段、３は入力され
た画像イメージのレイアウトを認識するレイアウト認識
手段、４は文字イメージを文字コードに変換する文字認
識手段と、５はＨＴＭＬレイアウトに関する変換ルール
を記憶したＨＴＭＬレイアウト知識ＤＢ（データベー
ス）、６はＨＴＭＬレイアウト知識ＤＢ５にしたがっ
て、ＨＴＭＬの制御コードを生成するＨＴＭＬレイアウ
ト生成手段、７は作成されたＨＴＭＬ文書をファイル等
に保存したりディスプレイに表示したりする出力手段、
２０は文字認識手段４より出力される文字コードと、Ｈ
ＴＭＬレイアウト生成手段６により生成された制御コー
ドとを関連付けてＨＴＭＬ文書を作成するＨＴＭＬ文書
作成手段である。In FIG. 1, 1 is an input means for inputting an image image, 2 is a data storage means for storing the image image, layout recognition, character recognition result, etc., 3 is a layout recognition apparatus for recognizing the layout of the input image image. Means 4, character recognition means for converting a character image into a character code, reference numeral 5, an HTML layout knowledge DB (database) storing conversion rules related to the HTML layout, and reference numeral 6, an HTML control code is generated according to an HTML layout knowledge DB 5 HTML layout generating means 7 for outputting the generated HTML document in a file or the like or displaying the generated HTML document on a display;
Reference numeral 20 denotes a character code output from the character recognition means 4 and H
HTML document creation means for creating an HTML document in association with the control code generated by the TML layout generation means 6.

【００１５】図２は本発明の第１の実施の形態における
文書作成装置の装置ブロック図であり、ハードウェアに
よる構成を示している。FIG. 2 is a device block diagram of the document creation device according to the first embodiment of the present invention, and shows a configuration by hardware.

【００１６】図２において、８はデータを読み込むスキ
ャナである。９は制御プログラムを実行することにより
装置を制御する中央処理演算装置（ＣＰＵ）である。１
０はデータを記憶するリード・オンリー・メモリ（ＲＯ
Ｍ）であり、ＣＰＵ９が実行するプログラムを格納する
領域を有する。１１はデータを一時的に記憶するランダ
ム・アクセス・メモリ（ＲＡＭ）である。１２は陰極線
管ディスプレイ（ＣＲＴ）などの表示装置である。In FIG. 2, reference numeral 8 denotes a scanner for reading data. Reference numeral 9 denotes a central processing unit (CPU) that controls the apparatus by executing a control program. 1
0 is a read only memory (RO) for storing data
M), and has an area for storing a program to be executed by the CPU 9. Reference numeral 11 denotes a random access memory (RAM) for temporarily storing data. Reference numeral 12 denotes a display device such as a cathode ray tube display (CRT).

【００１７】１３はフロッピー・ディスク（ＦＤ）など
の記録媒体にデータを読み書きするディスク・ドライブ
である。１４はキーボードやポインティング・デバイス
などの入力装置、１５はハード・ディスクなどの２次記
憶装置である。１６はデータ・バスである。Reference numeral 13 denotes a disk drive for reading and writing data on a recording medium such as a floppy disk (FD). Reference numeral 14 denotes an input device such as a keyboard or a pointing device, and 15 denotes a secondary storage device such as a hard disk. 16 is a data bus.

【００１８】ここで、図１の機能手段と図２のハードウ
ェアとの対応関係を説明する。図１および図２に示すよ
うに、入力手段１はスキャナ８により実現される。デー
タ記憶手段２はＲＡＭ１１により実現される。ＨＴＭＬ
レイアウト知識ＤＢ５はＲＯＭ１０あるいは２次記憶装
置１５により実現される。出力手段７はＣＲＴ１２ある
いはディスク・ドライブ１３により実現される。Here, the correspondence between the functional means of FIG. 1 and the hardware of FIG. 2 will be described. As shown in FIGS. 1 and 2, the input unit 1 is realized by a scanner 8. The data storage means 2 is realized by the RAM 11. HTML
The layout knowledge DB 5 is realized by the ROM 10 or the secondary storage device 15. The output means 7 is realized by a CRT 12 or a disk drive 13.

【００１９】レイアウト認識手段３、文字認識手段４、
ＨＴＭＬレイアウト生成手段６は、ＣＰＵ９がＲＯＭ１
０およびＲＡＭ１１とデータのやりとりを行いながらＲ
ＯＭ１０に記憶されている制御プログラムを実行するこ
とにより実現される。Layout recognition means 3, character recognition means 4,
The HTML layout generating means 6 includes a CPU 9
0 and R while exchanging data with RAM11.
This is realized by executing a control program stored in the OM 10.

【００２０】なお、本実施の形態では、ＣＰＵ９はＲＯ
Ｍ１０に格納された制御プログラムを実行する形態を示
しているが、記録媒体１７からディスク・ドライブ１３
を介して制御プログラムを読み込み、ＲＡＭ１１あるい
は２次記憶装置１５に展開したものをＣＰＵ９が実行す
る形態としても良い。このような形態とすることによ
り、本発明を汎用コンピュータでも容易に実現可能とす
ることができる。In this embodiment, the CPU 9 determines whether the RO
In this embodiment, the control program stored in the M10 is executed.
Alternatively, the CPU 9 may read the control program via the CPU 9 and develop the program on the RAM 11 or the secondary storage device 15 to execute the program. With such an embodiment, the present invention can be easily realized by a general-purpose computer.

【００２１】以上のように構成された文書作成装置につ
いて、以下にその動作について説明する。なお、以下の
フローチャートは、ＣＰＵ９がＲＯＭ１０に格納されて
いる制御プログラムを実行する様子について示したもの
である。The operation of the document creating apparatus configured as described above will be described below. Note that the following flowchart shows how the CPU 9 executes a control program stored in the ROM 10.

【００２２】図３は本発明の第１の実施の形態における
文書作成装置の動作フローチャートである。FIG. 3 is an operation flowchart of the document creating apparatus according to the first embodiment of the present invention.

【００２３】図３に示すように、まず、入力手段１から
画像が入力され、入力される画像データをデータ記憶手
段２に記憶させる（ステップ１）。ここでは、図５に示
す画像が入力されたものとする。As shown in FIG. 3, first, an image is input from the input means 1, and the input image data is stored in the data storage means 2 (step 1). Here, it is assumed that the image shown in FIG. 5 has been input.

【００２４】次に、レイアウト認識手段３は、入力され
た画像イメージの解析を行い、レイアウトの認識を行う
（ステップ２）。ここでは、文字領域が大きく２つに分
れ、「我が家のレシピ集」がセンタリングされ、比較的
大きな文字で書かれていること、および「昨日のメニュ
ー」から続く３行が箇条書きされていることが認識され
る。なお、ここで「我が家のレシピ集」という書き方を
行ったが、まだ文字認識は行われておらず、現時点では
「我が家のレシピ集」という文字列が生成されているわ
けではなく、そこにあるイメージの部分のことを示して
いる。Next, the layout recognizing means 3 analyzes the input image and recognizes the layout (step 2). In this case, the character area is divided into two large parts, the "home recipe collection" is centered, written in relatively large characters, and the three lines following "yesterday's menu" are listed. It will be recognized. In addition, here I wrote "my home recipe collection", but character recognition has not yet been performed, and at this time the character string "my home recipe collection" is not generated, it is there Indicates the image part.

【００２５】ＨＴＭＬレイアウト生成手段６は、ＨＴＭ
Ｌレイアウトの生成を行う（ステップ３）。このステッ
プ３は、図４を使って詳しく説明する。The HTML layout generating means 6 includes an HTML
An L layout is generated (step 3). Step 3 will be described in detail with reference to FIG.

【００２６】図４は本発明の第１の実施の形態における
レイアウト生成の動作フローチャートである。FIG. 4 is an operation flowchart of layout generation according to the first embodiment of the present invention.

【００２７】図４に示すように、まず、ステップ３０１
では、ＨＴＭＬレイアウト生成手段６は、ＨＴＭＬレイ
アウト知識ＤＢ５より、１つのルールを取り出す。図６
にＨＴＭＬレイアウト知識ＤＢ５の例を示す。ここで
は、「センタリングされている大きな文字→<CENTER><H
1>と、</H1></CENTER>で囲む」が取り出される。As shown in FIG. 4, first, at step 301
Then, the HTML layout generating means 6 extracts one rule from the HTML layout knowledge DB 5. FIG.
9 shows an example of the HTML layout knowledge DB5. Here, “Large centered character → <CENTER><H
1> and </ H1></CENTER>."

【００２８】次に、ステップ３０２では、ＨＴＭＬレイ
アウト生成手段６は、レイアウト認識手段３が出力した
レイアウト情報を参照し、取り出したルールに合致する
レイアウトがあるかどうか調べる。ここでは、「我が家
のレシピ集」の部分がルールに一致するのでステップ３
０３へ進む。Next, in step 302, the HTML layout generating means 6 refers to the layout information output by the layout recognizing means 3 and checks whether there is a layout that matches the extracted rule. Here, the step of "My home recipe collection" matches the rule, so step 3
Go to 03.

【００２９】ステップ３０３では、ルール中の「→」以
降の処理がなされ、「我が家のレシピ集」の部分が、<C
ENTER><H1>と、</H1></CENTER>で囲むことがデータ記憶
手段２に記憶される。In step 303, the processing after "→" in the rule is performed, and the part of "my home recipe collection" is changed to <C
ENTER><H1> and </ H1></CENTER> are stored in the data storage means 2.

【００３０】ステップ３０２に戻り、再度、ルールに合
致するレイアウトがあるかどうか調べる。ここでは存在
しないので、ステップ３０４に進む。Returning to step 302, it is checked again whether there is a layout that matches the rule. Here, since there is not, the process proceeds to step 304.

【００３１】ステップ３０４では、ＨＴＭＬレイアウト
生成手段６は、ＨＴＭＬレイアウト知識ＤＢ５に記憶さ
れているルールのうち、全てのルールの処理を行ったか
どうか調べる。図６に示すルールのうち、まだ１つしか
処理を終えていないのでステップ３０１に戻る。In step 304, the HTML layout generation means 6 checks whether or not all of the rules stored in the HTML layout knowledge DB 5 have been processed. Since only one of the rules shown in FIG. 6 has been processed, the process returns to step 301.

【００３２】ステップ３０１では、ＨＴＭＬレイアウト
生成手段６は、ＨＴＭＬレイアウト知識ＤＢ５より、別
のルールを取り出す。図６にＨＴＭＬレイアウト知識Ｄ
Ｂ５の例を示す。ここでは、「箇条書きになっている部
分→全体を<UL>と</UL>で囲み、それぞれの項目を<LI>
と</LI>で囲む」が取り出される。In step 301, the HTML layout generation means 6 extracts another rule from the HTML layout knowledge DB 5. FIG. 6 shows HTML layout knowledge D.
The example of B5 is shown. Here, "The bulleted part → Enclose the whole with <UL> and </ UL>, and each item is <LI>
And </ LI>).

【００３３】ステップ３０２では、ＨＴＭＬレイアウト
生成手段６は、レイアウト認識部３が出力したレイアウ
ト情報を参照し、ルールに合致するレイアウトがあるか
どうか調べる。ここでは、「昨日のメニュー」から「明
日のメニュー」までが箇条書き部分であるので、ルール
に一致し、ステップ３０３へ進む。In step 302, the HTML layout generating means 6 refers to the layout information output by the layout recognizing unit 3 and checks whether there is a layout that matches the rule. In this case, since the items from “the menu of yesterday” to “the menu of tomorrow” are the bulleted parts, they match the rules and go to step 303.

【００３４】ステップ３０３では、ルールの「→」以降
の処理がなされ、「昨日のメニュー」から「明日のメニ
ュー」までの全体を<UL>と</UL>で囲み、また、「昨日
のメニュー」、「今日のメニュー」、「明日のメニュ
ー」をそれぞれ<LI>と</LI>で囲むことがデータ記憶手
段２に記憶される。In step 303, the processing after "→" of the rule is performed, and the whole from "Yesterday's menu" to "Tomorrow's menu" is enclosed by <UL> and </ UL>. , "Menu of today" and "menu of tomorrow" are stored in the data storage means 2 in <LI> and </ LI>, respectively.

【００３５】ステップ３０２に戻り、再度、ルールに合
致するレイアウトがあるかどうか調べる。ここでは存在
しないので、ステップ３０４に進む。Returning to step 302, it is checked again whether there is a layout that matches the rule. Here, since there is not, the process proceeds to step 304.

【００３６】ステップ３０４では、ＨＴＭＬレイアウト
生成手段６は、ＨＴＭＬレイアウト知識ＤＢ５に記憶さ
れているルールのうち、全てのルールの処理を行ったか
どうか調べる。ここでは全てのルールの処理が終わった
ものとして、ステップ３０５へ進む。In step 304, the HTML layout generation means 6 checks whether or not all the rules among the rules stored in the HTML layout knowledge DB 5 have been processed. Here, it is assumed that all rules have been processed, and the process proceeds to step 305.

【００３７】ステップ３０５では、処理されていないレ
イアウトが存在するか調べる。ここでは全てのルールが
試されたが、処理されていないレイアウトが存在した場
合は、ステップ３０６に示すごとく、レイアウト処理が
できない部分があったことを出力手段７に出力する。In step 305, it is checked whether there is any unprocessed layout. Here, all the rules have been tried, but if there is a layout that has not been processed, as shown in step 306, the fact that there is a part where the layout processing cannot be performed is output to the output unit 7.

【００３８】なお、ステップ３０６で警告を表示する代
わりに、何らかのデフォルト的な処理を行ってもよい。
例えば、全ての文字を本文のデータとして何のレイアウ
ト情報なしに利用する、レイアウトが確認できなかった
データはコメントとして挿入する、などである。Instead of displaying a warning in step 306, some default processing may be performed.
For example, all characters are used as body data without any layout information, and data whose layout could not be confirmed is inserted as a comment.

【００３９】以上で、図４に示すＨＴＭＬレイアウト生
成が終了するので、図３に戻りステップ４に進む。As described above, the generation of the HTML layout shown in FIG. 4 is completed.

【００４０】図３に戻り、入力画像データの文字認識を
行う（ステップ４）。ステップ４では、各文字領域の文
字を認識し、文字コードを出力する。文字コードが出力
されると、ＨＴＭＬ文書作成手段２０は、ステップ３で
生成したＨＴＭＬレイアウト情報に従って文字コードを
埋め込み、ＨＴＭＬ文書を作成する（ステップ５）。ス
テップ５では、さらに、ＨＴＭＬ文書であることの必須
の制御コードとして、文書の先頭に<HTML><BODY>を、ま
た文書の最後に</BODY></HTML>をそれぞれ付与する。Returning to FIG. 3, character recognition of the input image data is performed (step 4). In step 4, the characters in each character area are recognized and a character code is output. When the character code is output, the HTML document creator 20 embeds the character code according to the HTML layout information generated in step 3 to create an HTML document (step 5). In step 5, <HTML><BODY> is added to the head of the document and </ BODY></HTML> is added to the end of the document as essential control codes for being an HTML document.

【００４１】作成されたＨＴＭＬ文書は、出力手段７に
より出力される。全てのデータが出力手段７に出力され
ている状態を図７に示す。この出力は、例えば、ディス
ク・ドライブ１３に保存したり、表示装置１２に表示し
たりすることである。The created HTML document is output by the output unit 7. FIG. 7 shows a state in which all data is output to the output means 7. This output is, for example, stored in the disk drive 13 or displayed on the display device 12.

【００４２】以上のように本実施の形態によれば、入力
される画像データのレイアウト認識を行い、レイアウト
認識した結果に基づいて、ＨＴＭＬの制御コードを生成
し、さらに、文字認識した結果と前記制御コードを関連
付けることで、ＨＴＭＬ文書を作成するので、操作者
は、既存の文書をＨＴＭＬ文書に変換する際、制御コー
ドを手動で付与する必要がなくなり、作業の効率化を図
ることができる。As described above, according to the present embodiment, the layout of input image data is recognized, an HTML control code is generated based on the layout recognition result, and the character recognition result and the HTML Since the HTML document is created by associating the control code, the operator does not need to manually assign the control code when converting the existing document into the HTML document, and thus the work efficiency can be improved.

【００４３】（実施の形態２）以下、本発明の第２の実
施の形態について説明する。(Embodiment 2) Hereinafter, a second embodiment of the present invention will be described.

【００４４】本実施の形態では、第１の実施の形態に示
すＨＴＭＬレイアウト知識ＤＢ５に利用者が必要に応じ
てルールを追加・修正することができるようにするもの
である。In this embodiment, the user can add and modify rules as needed in the HTML layout knowledge DB 5 shown in the first embodiment.

【００４５】例えば、図８に示すように、罫線で囲まれ
た３段組の表を含んだ文書をＨＴＭＬ文書化する場合、
ＨＴＭＬレイアウト知識ＤＢ５に、罫線で書かれた３段
組の表を処理するルールが存在しないとする。For example, as shown in FIG. 8, when a document including a three-column table surrounded by ruled lines is converted into an HTML document,
It is assumed that there is no rule in the HTML layout knowledge DB 5 for processing a three-column table written with ruled lines.

【００４６】このような場合、操作者は、キーボードな
どの入力装置を用い、図９に示すように、ＨＴＭＬレイ
アウト知識ＤＢ５に新たなルールを追加することによ
り、第１の実施の形態で示した手順により図１０に示す
ＨＴＭＬ文書を得ることができる。In such a case, the operator uses an input device such as a keyboard to add a new rule to the HTML layout knowledge DB 5 as shown in FIG. By the procedure, the HTML document shown in FIG. 10 can be obtained.

【００４７】以上のように本実施の形態では、このよう
にしてＨＴＭＬレイアウト知識ＤＢ５に用意していなか
ったようなレイアウトを持つ文書に対しても、操作者が
ルールを追加・修正することによりＨＴＭＬ文書を自動
的に作成することができる。As described above, in the present embodiment, even for a document having a layout that is not prepared in the HTML layout knowledge DB 5 in this way, the operator adds and corrects rules by using the HTML. Documents can be created automatically.

【００４８】（実施の形態３）以下、本発明の第３の実
施の形態について説明する。(Embodiment 3) Hereinafter, a third embodiment of the present invention will be described.

【００４９】本実施の形態では、第１の実施の形態で示
した文書作成装置において、入力される文書の複数のペ
ージを、１つのＨＴＭＬ文書として処理できるようにす
るものである。In this embodiment, in the document creation apparatus shown in the first embodiment, a plurality of pages of an input document can be processed as one HTML document.

【００５０】図１１は本発明の第３の実施の形態におけ
る文書作成装置の動作フローチャートである。FIG. 11 is an operation flowchart of the document creating apparatus according to the third embodiment of the present invention.

【００５１】図１１に示すように、入力手段１より複数
の画像データが入力され、データ記憶手段２に記憶され
る（ステップ５０１）。As shown in FIG. 11, a plurality of image data are inputted from the input means 1 and stored in the data storage means 2 (step 501).

【００５２】入力された画像データは、第１の実施の形
態に示したステップ２〜ステップ５の手順、すなわち、
図３に示す手順により、ＨＴＭＬ文書に変換される（ス
テップ５０２）。The input image data is stored in the procedure from step 2 to step 5 shown in the first embodiment, ie,
The document is converted into an HTML document by the procedure shown in FIG. 3 (step 502).

【００５３】この手順を、全てのページが処理されるま
で続ける（ステップ５０３）。ここでは図１２に示す２
ページの文書をＨＴＭＬ文書化することを例に取り説明
すると、図１３に示すように、それぞれのページがＨＴ
ＭＬ文書化される。This procedure is continued until all pages have been processed (step 503). Here, 2 shown in FIG.
Taking the document of the page as an HTML document as an example, as shown in FIG.
It is ML documented.

【００５４】次に、ＨＴＭＬ文書作成手段２０は、複数
ページの結合処理を行う（ステップ５０４）。具体的に
は、それぞれのページ間の<HTML><BODY>および</BODY><
/HTML>を削除して、それぞれのページをつなげる。その
結果を図１４に示す。Next, the HTML document creating means 20 performs a combining process of a plurality of pages (step 504). Specifically, <HTML><BODY> and </ BODY><
/ HTML> and connect each page. The result is shown in FIG.

【００５５】作成されたＨＴＭＬ文書は、出力手段７よ
り出力される（ステップ５０５）。この出力は、例え
ば、ディスク・ドライブ１３に保存したり、表示装置１
２に表示したりすることである。The created HTML document is output from the output means 7 (step 505). This output is stored in, for example, the disk drive 13 or the display device 1.
2 is displayed.

【００５６】以上のように本実施の形態によれば、論理
的には１つながりのページで、用紙の都合上複数ページ
に分割されているため、入力される画像データが複数に
分かれている場合でも、１つのＨＴＭＬ文書として作成
することが可能となる。As described above, according to the present embodiment, a logically continuous page is divided into a plurality of pages for convenience of paper, so that the input image data is divided into a plurality of pages. However, it can be created as one HTML document.

【００５７】以上、各実施の形態ではある形態を例にと
って説明を行ったが、本発明は、上記した各実施の形態
のみにとどまらないことは言うまでもない。例えば、入
力手段１としてスキャナ８を例にとり説明したが、スキ
ャナ８から入力するのではなく、画像データをディスク
・ドライブ１３やネットワークを介して入力する場合も
考えられる。As described above, each embodiment has been described by taking a certain embodiment as an example. However, it is needless to say that the present invention is not limited to each embodiment described above. For example, the scanner 8 has been described as an example of the input unit 1, but image data may be input via the disk drive 13 or a network instead of the scanner 8.

【００５８】また、出力結果を記憶媒体１７に出力する
例を説明したが、フラッシュ・メモリカードや、ネット
ワークを介した他のデバイスへ出力することなども考え
られる。Although an example in which the output result is output to the storage medium 17 has been described, output to a flash memory card or another device via a network may be considered.

【００５９】[0059]

【発明の効果】以上のように本発明によれば、画像デー
タとして入力される文書から、所定の構造化タグ言語で
記述された文書を作成する文書作成装置であって、文書
を画像データとして入力する入力手段と、入力手段から
入力される画像データを記憶する入力画像記憶手段と、
入力画像記憶手段に記憶される画像データのレイアウト
を認識するレイアウト認識手段と、構造化タグ言語のレ
イアウトに関する変換ルールを記憶したレイアウト知識
記憶手段と、レイアウト知識記憶手段の変換ルールに基
づいて、レイアウト認識手段により認識される各レイア
ウトに対して構造化タグ言語の制御コードを生成する制
御コード生成手段と、入力画像記憶手段に記憶される画
像データ中の文字イメージを認識し、文字コードに変換
する文字認識手段と、文字認識手段により出力される文
字コードと、制御コード生成手段により生成される制御
コードとを関連付けて、構造化タグ言語で記述された文
書を作成する文書作成手段と、文書作成手段により作成
された文書を出力する出力手段と、を有する構成とした
ことにより、画像データとして入力される文書のレイア
ウトに従って、所定の構造化タグ言語の制御コードを付
与し、構造化タグ言語で記述された文書に変換すること
ができるので、変換作業における操作者の操作負担が軽
減される。As described above, according to the present invention, there is provided a document creating apparatus for creating a document described in a predetermined structured tag language from a document input as image data, wherein the document is used as image data. Input means for inputting, input image storage means for storing image data input from the input means,
Layout recognition means for recognizing the layout of image data stored in the input image storage means, layout knowledge storage means for storing conversion rules relating to the layout of the structured tag language, and layout based on the conversion rules of the layout knowledge storage means A control code generation unit for generating a control code of a structured tag language for each layout recognized by the recognition unit; and a character image in image data stored in the input image storage unit is recognized and converted into a character code. A character recognition unit, a document creation unit that associates a character code output by the character recognition unit with a control code generated by the control code generation unit, and creates a document described in a structured tag language; And output means for outputting a document created by the means. According to the layout of a document input as data, a control code of a predetermined structured tag language can be added and converted into a document described in the structured tag language, so that the operation burden on the operator in the conversion work is reduced. It is reduced.

【００６０】また、レイアウト知識記憶手段は、操作者
により変換ルールを追加可能である構成としたことによ
り、利用者の持つ文書に特有の表現に対応でき、以後の
変換精度を向上させることができる。Further, the layout knowledge storage means is configured so that the conversion rule can be added by the operator, so that the layout knowledge storage means can cope with an expression peculiar to the document held by the user, and the subsequent conversion accuracy can be improved. .

【００６１】また、文書作成手段は、複数の入力画像デ
ータから得られる各文書を連結し、構造化タグ言語で記
述された１つのＨＴＭＬ文書として生成する構成とした
ことにより、物理的に複数のページにわたる文書であっ
ても、構造化タグ言語で記述された１つのページとして
利用でき、変換後の利用効率化を図ることができる。Further, the document creation means is configured to link documents obtained from a plurality of input image data to generate one HTML document described in a structured tag language, thereby physically providing a plurality of documents. Even a document spanning pages can be used as one page described in the structured tag language, and the efficiency of use after conversion can be improved.

[Brief description of the drawings]

【図１】本発明の第１の実施の形態における文書作成装
置の機能ブロック図FIG. 1 is a functional block diagram of a document creation device according to a first embodiment of the present invention.

【図２】本発明の第１の実施の形態における文書作成装
置の回路ブロック図FIG. 2 is a circuit block diagram of a document creation device according to the first embodiment of the present invention.

【図３】本発明の第１の実施の形態における文書作成装
置の動作フローチャートFIG. 3 is an operation flowchart of the document creation apparatus according to the first embodiment of the present invention.

【図４】本発明の第１の実施の形態におけるレイアウト
生成の動作フローチャートFIG. 4 is an operation flowchart of layout generation according to the first embodiment of the present invention;

【図５】本発明の第１の実施の形態における入力画像デ
ータを示す図FIG. 5 is a diagram showing input image data according to the first embodiment of the present invention.

【図６】本発明の第１の実施の形態におけるＨＴＭＬレ
イアウト知識ＤＢを示す図FIG. 6 is a diagram showing an HTML layout knowledge DB according to the first embodiment of the present invention.

【図７】本発明の第１の実施の形態における作成された
ＨＴＭＬ文書を示す図FIG. 7 is a view showing an HTML document created according to the first embodiment of the present invention;

【図８】本発明の第２の実施の形態における入力画像デ
ータを示す図FIG. 8 is a diagram showing input image data according to a second embodiment of the present invention.

【図９】本発明の第２の実施の形態におけるＨＴＭＬレ
イアウト知識ＤＢを示す図FIG. 9 is a diagram showing an HTML layout knowledge DB according to the second embodiment of the present invention.

【図１０】本発明の第２の実施の形態における作成され
たＨＴＭＬ文書を示す図FIG. 10 is a view showing an HTML document created according to the second embodiment of the present invention;

【図１１】本発明の第３の実施の形態における文書作成
装置の動作フローチャートFIG. 11 is an operation flowchart of a document creation device according to a third embodiment of the present invention.

【図１２】本発明の第３の実施の形態における入力画像
データを示す図FIG. 12 is a diagram showing input image data according to a third embodiment of the present invention.

【図１３】本発明の第３の実施の形態における処理途中
のＨＴＭＬ文書を示す図FIG. 13 is a diagram showing an HTML document being processed according to the third embodiment of the present invention;

【図１４】本発明の第３の実施の形態における作成され
たＨＴＭＬ文書を示す図FIG. 14 is a diagram showing an HTML document created according to the third embodiment of the present invention;

[Explanation of symbols]

１入力手段２データ記憶手段３レイアウト認識手段４文字認識手段５ＨＴＭＬレイアウト知識ＤＢ６ＨＴＭＬレイアウト生成手段７出力手段８スキャナ９中央処理装置（ＣＰＵ）１０リード・オンリー・メモリ（ＲＯＭ）１１ランダム・アクセス・メモリ（ＲＡＭ）１２表示装置１３ディスク・ドライブ１４入力装置１５２次記憶装置１６データ・バス２０ＨＴＭＬ文書作成手段 DESCRIPTION OF SYMBOLS 1 Input means 2 Data storage means 3 Layout recognition means 4 Character recognition means 5 HTML layout knowledge DB 6 HTML layout generation means 7 Output means 8 Scanner 9 Central processing unit (CPU) 10 Read only memory (ROM) 11 Random access -Memory (RAM) 12 Display device 13 Disk drive 14 Input device 15 Secondary storage device 16 Data bus 20 HTML document creation means

Claims

[Claims]

1. A document creation device for creating a document described in a predetermined structured tag language from a document input as image data, comprising: input means for inputting a document as image data; Input image storing means for storing input image data, layout recognizing means for recognizing a layout of image data stored in the input image storing means, and layout knowledge storing means for storing a conversion rule relating to a layout of a structured tag language And, based on the conversion rule of the layout knowledge storage means,
A control code generating means for generating a structured tag language control code for each layout recognized by the layout recognizing means; and a character code for recognizing a character image in image data stored in the input image storage means. A document recognizing means for converting a character code output by the character recognizing means and a control code generated by the control code generating means into a document described in a structured tag language Means for outputting a document created by the document creating means.

2. The document creation apparatus according to claim 1, wherein said layout knowledge storage means is capable of adding a conversion rule by a user.

3. The document according to claim 1, wherein said document creating means connects each document obtained from a plurality of input image data and generates a document described in one structured tag language. Creating device.

4. A document creation method for creating a document described in a predetermined structured tag language from a document input as image data, comprising: a step of storing input image data; Recognizing a layout of the data; generating a control code of the structured tag language for each recognized layout with reference to a conversion rule regarding the layout of the structured tag language; Recognizing a character image and converting it to a character code; associating the converted character code with the generated control code to create a document described in a structured tag language; and the created document. Outputting a document.

5. A recording medium on which a document creation program for creating a document described in a predetermined structured tag language from a document input as image data is stored, wherein the input image data is stored. Recognizing the layout of the input image data, generating a control code of the structured tag language for each recognized layout by referring to a conversion rule relating to the layout of the structured tag language, Recognizing a character image in the image data to be converted into a character code; and associating the converted character code with the generated control code to create a document described in a structured tag language. Outputting a created document. A storage medium storing a document creation program.