JP3085604B2

JP3085604B2 - Document processing device

Info

Publication number: JP3085604B2
Application number: JP04198971A
Authority: JP
Inventors: 石田祐介
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1992-07-03
Filing date: 1992-07-03
Publication date: 2000-09-11
Anticipated expiration: 2015-09-11
Also published as: JPH0622118A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、文書画像に対して自動
的に色づけを行ない、カラー文書として出力するたの文
書処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document processing apparatus for automatically coloring a document image and outputting it as a color document.

【０００２】[0002]

【従来の技術】紙の文書を読みとり、それを加工して出
力するという機能に対応する機器としては複写機があ
る。複写機の基本機能はあくまでも原稿を忠実に再現し
た文書を出力することにあるが、デジタル複写機の普及
とともに、例えば特開平２−２２３２７５号公報に開示
されている複写機のように文書の加工機能を持つものも
現れるようになってきた。しかし、これらの技術は、加
工しようとする箇所およびどのように加工するかという
点まで使用者が細かく指示する必要があり、使いこなす
には非常に複雑な操作を行わなければならないことや、
一般の利用者には結局どの箇所をどのように加工すれば
作成意図に合った高品位な文書を作り出すことができる
のかがわからないことなどから、結果的には入力文書を
より高品位なものにするためにはあまり役立つものでは
なかった。特に、近年のカラー複写機の普及とともに、
このような文書の色を変換したり白黒文書に色をつけた
りする機会が多くなってきている。2. Description of the Related Art There is a copying machine as a device corresponding to a function of reading a paper document, processing the document, and outputting the processed document. The basic function of a copying machine is to output a document in which a manuscript is faithfully reproduced. However, with the spread of digital copying machines, for example, the processing of documents such as the copying machine disclosed in Japanese Patent Application Laid-Open No. 223275/1990. Some of them have functions. However, these techniques require the user to give detailed instructions on the part to be machined and how to machine it, and to use it requires extremely complicated operations,
In the end, general users do not know how to process which part to produce a high-quality document that matches their intended purpose.As a result, the input document becomes higher-quality. It wasn't very helpful for me. In particular, with the spread of color copiers in recent years,
There are increasing opportunities to convert the color of such documents and add color to black and white documents.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、このよ
うな着色の操作は非常に煩雑で手間のかかる作業となる
上、配色に関する知見やデザインセンスの乏しい一般の
利用者には、出力文書を高品位なものとするのは至難の
技であった。本発明は以上のような事情を考慮してなさ
れたものであり、その目的とするところは、利用者から
の指示を受け取ることなく、入力された文書に対して自
動的に色づけを行ない、高品位な色づかいの文書に加工
して出力する機能を提供することである。However, such a coloring operation is very complicated and time-consuming, and a general user with little knowledge of the color scheme and design sense can provide a high-quality output document. It was an extremely difficult technique. The present invention has been made in view of the above circumstances, and its purpose is to automatically color an input document without receiving an instruction from a user, An object of the present invention is to provide a function of processing and outputting a document of high quality coloring.

【０００４】[0004]

【課題を解決するための手段および作用】本発明の文書
処理装置は、上記目的を達成するために、文書画像を保
持するための文書画像保持手段（１１１）と、その文書
画像保持手段に保持されている文書画像上の画素の集ま
りを、それぞれ文書要素として意味のある塊の領域に分
割する領域分割手段（１２）と、その領域分割手段が出
力する結果を参照して出力文書の色づけに関するパラメ
ータ値を領域別に決定する色パラメータ決定手段（１
３）と、前記文書画像保持手段に保持されている文書画
像から前記領域分割手段で分割された領域毎の各領域画
像を切り出し、前記色パラメータ決定手段によって決定
された領域別の色パラメータ値にしたがってその各領域
画像を加工して出力画像を生成する出力画像生成手段
（１４）とを備えた基本構成を有するものである。In order to achieve the above object, a document processing apparatus according to the present invention has a document image holding means (111) for holding a document image and a document image holding means for holding the document image. (12) that divides a set of pixels on a document image into regions each having a meaningful block as a document element, and relates to coloring of an output document by referring to a result output by the region dividing unit. Color parameter determining means (1) for determining parameter values for each area
3) cutting out each area image for each area divided by the area dividing means from the document image held in the document image holding means, and converting the image into color parameter values for each area determined by the color parameter determining means. Therefore, it has a basic configuration including output image generating means (14) for processing each area image to generate an output image.

【０００５】上記構成の文書処理装置において、文書画
像保持手段は、例えば紙の文書が白黒のディジタル画像
として入力された文書画像が保持され、その文書画像は
領域分割手段により文字ブロック、図形、けい線などの
文書要素として意味のある領域（レイアウト要素）に分
割される。色パラメータ決定手段は領域の種別ごとに予
め定められた色パラメータの情報を例えばテーブル（図
１の１３１、図７）の形式で保持しており、領域分割手
段が出力する結果を参照して出力文書の各領域の色パラ
メータ値を決定し、出力画像生成手段に渡す。出力画像
生成手段は、各領域画像を切り出し、前記色パラメータ
決定手段から受け取った領域別の色パラメータ値にした
がって、例えば図形領域の背景を色付けしたりけい線を
色付けしたりというように各領域画像を加工して出力画
像を生成する。本発明によれば、領域分割手段により文
書画像が領域に分割され、色パラメータ決定手段により
各領域ごとに色付けのパラメータが決定され、そのパラ
メータを基に出力画像生成手段で色付けされた文書画像
が生成されるので、利用者は文書色づけに関して何らの
操作や指示をする必要がなく、従って、文書色づけに関
する知識をもたなくても、紙等に書かれた既存の文書を
もとに、見栄えの良い文書を容易に得ることができる。In the above-structured document processing apparatus, the document image holding means holds a document image in which, for example, a paper document is input as a black-and-white digital image, and the document image is subjected to character block, graphic, and silicon by the area dividing means. It is divided into areas (layout elements) that are meaningful as document elements such as lines. The color parameter determining means holds information on color parameters predetermined for each type of area in the form of, for example, a table (131 in FIG. 1, FIG. 7), and outputs the information by referring to the result output from the area dividing means. The color parameter value of each area of the document is determined and passed to the output image generation means. The output image generating means cuts out each area image, and according to the color parameter value for each area received from the color parameter determining means, for example, colors the background of the graphic area or colors the ruled line, etc. Is processed to generate an output image. According to the present invention, the document image is divided into regions by the region dividing unit, the coloring parameter is determined for each region by the color parameter determining unit, and the document image colored by the output image generating unit based on the parameters is obtained. Since it is generated, the user does not need to perform any operation or instruction regarding document coloring, and therefore, without having to know the document coloring, it can be displayed on the basis of existing documents written on paper etc. Good documents can be easily obtained.

【０００６】また、本発明の他の態様による文書処理装
置は、文書画像を保持するための文書画像保持手段（図
１１の１１１１）と、その文書画像保持手段に保持され
ている文書画像上の画素の集まりを、それぞれ文書要素
として意味のある塊として領域に分割する領域分割手段
（図１１の１１２）と、その領域分割手段によって分割
された文書画像上の各領域に、それぞれ文書の論理的な
意味を表す論理識別子を付与する論理識別子付与手段
（図１１の１１３）と、その論理識別子付与手段が出力
する結果を参照して出力文書の色づけに関するパラメー
タ値を論理識別子別に決定する色パラメータ決定手段
（図１１の１１４）と、前記文書画像保持手段に保持さ
れている文書画像から前記論理識別子付与手段で付与さ
れた論理識別子毎の各領域画像を切り出し、前記色パラ
メータ決定手段によって決定された論理識別子別の色パ
ラメータ値にしたがってその各領域画像を加工して出力
画像を生成する出力画像生成手段（図１１の１１５）と
を備えている。この文書処理装置は、前述の本発明の基
本構成に論理識別子付与手段を付加した構成に相当す
る。領域分割手段により得られた各領域に対し、論理識
別子付与手段は論理識別子を付与するするので、領域の
みでなく各領域を同じ種類内でも論理識別子でさらに区
別できるので、さらに変化に富んだ色付けを行うことが
でき、一層見栄えの良い文書を得ることができる。A document processing apparatus according to another aspect of the present invention includes a document image holding unit (1111 in FIG. 11) for holding a document image, and a document image holding unit for holding a document image. An area dividing unit (112 in FIG. 11) that divides a set of pixels into regions each having a meaning as a document element, and a logical region of the document divided into each region on the document image divided by the region dividing unit. Parameter assigning means (113 in FIG. 11) for assigning logical identifiers representing various meanings, and color parameter determination for determining parameter values related to coloring of an output document for each logical identifier with reference to the result output by the logical identifier assigning means Means (114 in FIG. 11), and each of the logical identifiers assigned by the logical identifier assigning means from the document image held in the document image holding means. Output image generating means (115 in FIG. 11) for cutting out the area image, processing each area image in accordance with the color parameter value for each logical identifier determined by the color parameter determining means, and generating an output image. I have. This document processing device corresponds to a configuration in which a logical identifier assigning means is added to the above-described basic configuration of the present invention. The logical identifier assigning means assigns a logical identifier to each area obtained by the area dividing means, so that not only the area but also each area can be further distinguished by the logical identifier within the same type, so that more varied coloring is provided. Can be performed, and a more attractive document can be obtained.

【０００７】また、本発明の前記構成の文書処理装置
（図１１）の具体的態様において、前記領域分割手段
は、レイアイト識別子、領域種別、領域の寸法、位置を
含む領域情報と領域間の関係を表す情報とを含むレイア
ウト構造を出力するよう構成される。In a specific embodiment of the document processing apparatus having the above configuration according to the present invention (FIG. 11), the area dividing means includes area information including a layout identifier, an area type, an area size and a position, and a relation between the areas. And outputting a layout structure including information representing

【０００８】また、そのような領域分割手段を有する文
書処理装置の具体的態様において、前記論理識別子付与
手段は、レイアウト構造と論理構造との関係を表す複数
の構造モデルを記憶する構造モデル記憶手段（図１１の
１１３１）と、領域分割手段により得られた前記レイア
ウト構造を前記構造モデルと比較するマッチング処理手
段（図１１の１１３２）と、その比較の結果により一致
の取れた構造モデルを基に、各領域の論理識別子を付与
する論理構造決定手段（図１１の１１３３）からなる構
成とすことができる。この構成において、認識対象のレ
イアウト構造と論理構造の対応を表現する構造モデルを
あらかじめ複数個だけ構造モデル記憶部に登録してお
き、マッチング処理部によって領域分割結果と構造モデ
ルとのマッチング処理を行い、論理構造決定部によりレ
イアウト構造の要素のそれぞれに論理的な意味を表現す
る名称を付与する。このようにこの論理識別子付与手段
は、レイアウト構造を基に、論理構造を認識するので、
自動的にかつ確実に論理識別子を付与することができ
る。また同時に、複数個だけ登録された構造モデルのう
ち、どのモデルとマッチングが取れたかという情報か
ら、入力された文書の種類を推定できるので、色パラメ
ータ決定手段の色パラメータ情報を文書の種類ごとに用
意し、入力された文書の種類の情報により選択すること
により、文書に応じた豊富な色付けが可能である。In a specific embodiment of the document processing apparatus having such a region dividing means, the logical identifier assigning means stores a plurality of structural models representing a relationship between a layout structure and a logical structure. (1131 in FIG. 11), matching processing means (1132 in FIG. 11) for comparing the layout structure obtained by the area dividing means with the structural model, and a structural model which is matched by the result of the comparison. , A logical structure determining means (1133 in FIG. 11) for assigning a logical identifier to each area. In this configuration, only a plurality of structural models expressing the correspondence between the layout structure and the logical structure to be recognized are registered in advance in the structural model storage unit, and the matching processing unit performs a matching process between the region division result and the structural model. The logical structure determining unit assigns a name expressing a logical meaning to each of the elements of the layout structure. Thus, the logical identifier assigning unit recognizes the logical structure based on the layout structure,
Logical identifiers can be automatically and reliably assigned. At the same time, the type of the input document can be estimated from the information on which model was obtained from among the plurality of registered structural models, so that the color parameter information of the color parameter determining means was determined for each type of document. By preparing and selecting according to the input document type information, it is possible to perform abundant coloring according to the document.

【０００９】そのための色パラメータ決定手段は、一態
様によれば、前記論理識別子付与手段によって認識され
るような入力文書の種類とそれに対応するテーブル番号
との対応を示す第１のテーブルと、論理識別子と色付け
に関するパラメータとの対応を示す第２のテーブルとを
備えた色パラメータテーブル（図１１の１１４１）を有
するAccording to one embodiment, the color parameter determining means for that purpose comprises: a first table indicating the correspondence between the type of the input document recognized by the logical identifier assigning means and the corresponding table number; Identifiers and coloring
A color parameter table (1141 in FIG. 11) including a second table indicating correspondence with parameters related to

【００１０】前記色パラメータ決定手段は、他の態様に
よれば、文書の色付けに関する知識情報を格納した知識
ベース（図１８の１８４２）と、前記論理識別子付与手
段の出力に基づいて前記知識ベースに格納された知識情
報を用いて推論処理を実行することにより出力文書の文
書の色付けに関するパラメータ値を決定する推論処理手
段（図１８の１８４１）とを備える。これによれば、そ
れぞれの文書に適する、より高度な色づけを行うことが
できる。According to another aspect, the color parameter determining means stores the knowledge base (1842 in FIG. 18) storing the knowledge information on the coloring of the document and the knowledge base based on the output of the logical identifier assigning means. An inference processing means (1841 in FIG. 18) for determining a parameter value relating to coloring of the output document by executing an inference process using the stored knowledge information. According to this, more advanced coloring suitable for each document can be performed.

【００１１】[0011]

【実施例】（第１の実施例）以下、添付図面にそって実
施例の説明を行う。本発明を複写機に適用した実施例の
構成を図１に示す。この実施例の装置は、図１に示すよ
うに、原稿画像入力モジュール１１と、領域分割モジュ
ール１２と、色パラメータ決定モジュール１３と、出力
画像生成モジュール１４と、プリントアウト・モジュー
ル１５とを備えている。DESCRIPTION OF THE PREFERRED EMBODIMENTS (First Embodiment) An embodiment will be described below with reference to the accompanying drawings. FIG. 1 shows the configuration of an embodiment in which the present invention is applied to a copying machine. As shown in FIG. 1, the apparatus of this embodiment includes a document image input module 11, an area division module 12, a color parameter determination module 13, an output image generation module 14, and a printout module 15. I have.

【００１２】原稿画像入力モジュール１１は、イメージ
スキャナの機構そのものであり、副走査方向に光源とイ
メージセンサからなる読み取り部をずらしながら主走査
方向の細長い領域を光源で照射し、反射した光をレンズ
で集光してＣＣＤイメージセンサ等によって電気信号に
変換し、二値のデジタル画像として入力画像メモリに取
り込む。もちろん、反射光の強弱をＡＤ変換器を通して
多値の画像として入力しデジタル画像処理で二値化して
もかまわない。The original image input module 11 is a mechanism of an image scanner itself, and irradiates an elongated area in the main scanning direction with a light source while shifting a reading unit including a light source and an image sensor in a sub-scanning direction, and transmits reflected light to a lens. The light is condensed and converted into an electric signal by a CCD image sensor or the like, and is taken into an input image memory as a binary digital image. Of course, the intensity of the reflected light may be input as a multi-valued image through an AD converter and binarized by digital image processing.

【００１３】領域分割モジュール１２は文書の画像を入
力として、文字領域、図表領域などに分割する処理を行
うものであり、文書認識技術における領域分割処理に対
応する。これは公知の技術によって実現される。例えば
特開昭６４−１５８８９号公報あるいは特公昭６１−３
２７１２号公報に示されている技術がそれにあたる。こ
こでの領域分割モジュール１２はそれぞれの領域を矩形
に分割し、レイアウト識別子を付加して出力する。この
処理を行う前に傾き補正、ノイズ除去等の前処理を行っ
てもよいことはいうまでもない。この処理で分割された
領域は文字、図形、けい線などに識別され、これらの種
別も同時に付加されて出力される。分割された結果とし
てこの領域分割モジュールから出力される要素の列を文
書のレイアウト構造と呼ぶ。The area dividing module 12 performs processing of dividing a document image into a character area, a chart area, and the like by using the image of the document as an input, and corresponds to the area dividing processing in the document recognition technology. This is realized by a known technique. For example, JP-A-64-15889 or JP-B-61-3
A technique disclosed in Japanese Patent No. 2712 corresponds to this. The area dividing module 12 divides each area into rectangles, adds a layout identifier, and outputs the result. It goes without saying that pre-processing such as inclination correction and noise removal may be performed before performing this processing. The areas divided by this processing are identified as characters, figures, ruled lines, etc., and these types are also added and output at the same time. The sequence of elements output from the area division module as a result of division is called a document layout structure.

【００１４】色パラメータ決定モジュール１３は、レイ
アウト構造の各要素に対して、その領域の種別をキー
に、あらかじめ登録してある色パラメータテーブル１３
１を参照し、パラメータのデータを付与して、出力画像
生成モジュール１４に受け渡す。色パラメータテーブル
１３１はＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）
に記憶させておく。色パラメータテーブル１３１は文書
画像領域の種別に対してその前景色、背景色などが示さ
れたテーブルである。ただし、ここで前景色とは出力文
書画像における文字や図形の線など（すなわち、白黒の
画像における黒画素の部分）の色をいい、背景色は、出
力文書画像における各領域ごとの地の部分の色をいうも
のとする。これらは最終的な各文書要素がどのように色
づけされなければならないかを表現している。これらの
パラメータテーブルの保持するデータは、例えば『１９
９１年第４２回情報処理学会全国大会予稿集７Ｑ−７
「文書表現品質に関する評価尺度について」』で述べら
れているような主観評価等の方法によって得られた定量
的なデータや、『富士ゼロックス著「プレゼンテーショ
ン＆ドキュメンテーション」，富士ゼロックス，１９８
９年』や、『富士ゼロックスＤ推進グループ，コアデザ
イン制作部編「ビジネスドキュメントの演出技法−ＩＩ
カラー化」，日本経済新聞社，１９９２年』などに記
されているような定性的または定量的なデータ、あるい
は社内の規則による決めごとなどを表したデータなど、
文書の色づけを決定するのに必要な種々の知識をもとに
決定されている。The color parameter determination module 13 stores a color parameter table 13 registered in advance for each element of the layout structure using the type of the area as a key.
1, adding parameter data, and passing the data to the output image generation module 14. The color parameter table 131 is a ROM (Read Only Memory).
To be stored. The color parameter table 131 is a table indicating the foreground color, the background color, and the like for the type of the document image area. Here, the foreground color refers to the color of a character or graphic line in the output document image (that is, a black pixel portion in a black-and-white image), and the background color corresponds to the ground portion of each region in the output document image. Color. These represent how each final document element must be colored. The data held by these parameter tables is, for example, “19
Proceedings of the 42nd IPSJ Annual Conference 1991, 7Q-7
Quantitative data obtained by methods such as subjective evaluation as described in "Evaluation Scale for Document Expression Quality" and "Xerox Presentation and Documentation", Fuji Xerox, 198
Nine Years ”and“ Fuji Xerox D Promotion Group, Core Design Production Division, “Business Document Production Techniques-II
Qualitative or quantitative data as described in "Colorization", Nihon Keizai Shimbunsha, 1992, etc., or data expressing each rule according to company rules.
The determination is made based on various knowledge necessary for determining the coloring of the document.

【００１５】出力画像生成モジュール１４は、色パラメ
ータ決定モジュール１３から色パラメータのデータを受
け取り、領域分割モジュール１２からレイアウト構造デ
ータと、原稿入力モジュール１１から入力画像を受け取
る。そして、色パラメータ決定モジュール１３から受け
取ったパラメータに従って、それぞれレイアウト構造の
要素別に対応する入力原稿の部分画像を変換し、出力画
像を生成していき、最終的に指示された色づけイメージ
を持つ文書画像を生成する。プリントアウトモジュール
１５では出力画像メモリ１５１上の出力画像をレーザビ
ームプリント技術を用いて紙の上に転写し、排出する。
本実施例ではフルカラーで印字することが可能なレーザ
ビームプリンタを用いている。The output image generation module 14 receives color parameter data from the color parameter determination module 13, receives layout structure data from the area division module 12, and receives an input image from the document input module 11. Then, according to the parameters received from the color parameter determination module 13, the partial images of the input document corresponding to the respective elements of the layout structure are converted to generate an output image, and a document image having a finally designated coloring image Generate In the printout module 15, the output image on the output image memory 151 is transferred onto paper using a laser beam printing technique and discharged.
In this embodiment, a laser beam printer capable of printing in full color is used.

【００１６】以上のような構成を有する実施例の動作を
図３に示すようなレイアウトをもつ文書を例にとって説
明する。処理のフローチャートは図４に示す。なお、以
後座標系を図３中に示すようにページ右に向かう方向を
ｘ座標、ページ下に向かう方向をｙ方向とする。まず、
利用者は図３の文書をイメージスキャナ等のプラテンの
上に置き、図２に示したような操作パネル上で単なる複
写か、色づけ複写かを選択する。The operation of the embodiment having the above configuration will be described with reference to a document having a layout as shown in FIG. A flowchart of the process is shown in FIG. Hereinafter, as shown in FIG. 3, in the coordinate system, the direction toward the right of the page is referred to as x coordinate, and the direction toward the bottom of the page is referred to as y direction. First,
The user places the document shown in FIG. 3 on a platen such as an image scanner, and selects a simple copy or a color copy on an operation panel as shown in FIG.

【００１７】色づけ複写が選択された状態でスタートボ
タンが押下されると、画像入力モジュール１１は走査を
開始し、入力画像メモリ１１１であるイメージメモリに
原稿の画像を記憶する。次に領域分割モジュール１２が
処理を行い、その結果、図３に示す入力画像が入力画像
メモリ上で図５に示すように文字がまとまって並んでい
る領域（文字ブロック領域）とけい線が存在する領域
（けい線領域）と図形が存在する領域（図形領域）とそ
れ以外（余白）に分割される（ステップＳ４２）。領域
を表現する矩形は原稿画像のｘ、ｙ方向それぞれに平行
な辺を持ち、対象となる領域を囲む最小矩形とする。領
域を表現するデータは基本的に「レイアウト識別子、種
別、左上点ｘ座標、左上点ｙ座標、幅、高さ」の組で表
現される。領域分割結果は図６のような表形式で表現さ
れ、このデータ、すなわちレイアウト構造を色パラメー
タ決定モジュール１３および出力画像生成モジュール１
４に渡す。When the start button is pressed in a state where the color copying is selected, the image input module 11 starts scanning, and stores the image of the document in the input image memory 111. Next, the area dividing module 12 performs processing, and as a result, there is an area (character block area) in which the input image shown in FIG. 3 is arranged in the input image memory as shown in FIG. The image is divided into an area (line area), an area where the figure exists (graphic area), and the rest (margin) (step S42). The rectangle that represents the region has sides parallel to the x and y directions of the document image, and is the smallest rectangle that surrounds the target region. Data representing an area is basically represented by a set of “layout identifier, type, upper left point x coordinate, upper left point y coordinate, width, height”. The region division result is expressed in a table format as shown in FIG. 6, and the data, that is, the layout structure is represented by the color parameter determination module 13 and the output image generation module 1.
Pass to 4.

【００１８】色パラメータ決定モジュール１３では、領
域の種別にしたがい、あかじめＲＯＭ上の色パラメータ
テーブル１３１に記憶された色パラメータを取り出し、
各レイアウト識別子に対応する色パラメータを出力画像
生成モジュール１４に送る（ステップＳ４３）。出力画
像生成モジュール１４に送られる個別色パラメータテー
ブルは図７に示すように領域の種別ごとに前景色の色相
・彩度・明度・（効果）、背景色の色相・彩度・明度、
色づけ領域マージンなどがテーブルで管理されている。
色づけ領域マージンは、図８に示すように、出力画像に
おいて背景を着色する領域を、領域分割の結果得られる
矩形領域よりも外側に拡張する範囲を指定するものであ
る。本実施例では、上・下・左・右で独立に変更できる
ようにしてあるが、共通の色づけ領域マージンを用いる
ことも考えられる。The color parameter determination module 13 extracts the color parameters stored in the color parameter table 131 on the ROM in advance according to the type of the area.
The color parameters corresponding to each layout identifier are sent to the output image generation module 14 (Step S43). As shown in FIG. 7, the individual color parameter table sent to the output image generation module 14 includes the hue, saturation, brightness, (effect) of the foreground color, the hue, saturation, brightness of the background color,
The coloring area margin and the like are managed in a table.
As shown in FIG. 8, the coloring area margin designates a range in which the area for coloring the background in the output image is extended outside the rectangular area obtained as a result of the area division. In this embodiment, the upper, lower, left, and right sides can be independently changed, but a common coloring area margin may be used.

【００１９】出力画像生成モジュール１４、はレイアウ
ト構造と色パラメータを受け取り、そのパラメータに指
示された通りに入力原稿の部分画像を出力画像メモリ１
５１に写像していく（ステップＳ４４）。本実施例で
は、フルカラーの出力が行われるので、出力画像メモリ
１５１において、１画素ごとにシアン・マゼンタ・黄色
・黒の４色分の階調値情報を保持するものとする。The output image generation module 14 receives a layout structure and color parameters, and stores a partial image of the input document as specified by the parameters in the output image memory 1.
51 (step S44). In this embodiment, since full-color output is performed, it is assumed that the output image memory 151 holds gradation value information for four colors of cyan, magenta, yellow, and black for each pixel.

【００２０】出力画像生成モジュール１４における処理
のフローチャートは図９で示される通りである。すなわ
ち、最初に出力画像メモリの全ての画素の４色分の階調
値を全て０に初期化する（ステップＳ９１）。次にレイ
アウト構造から順にその要素を取り出す（ステップＳ９
３）。この例でいえば、レイアウト構造の最初の要素で
ある、レイアウト識別子が「文字ブロック−１」の要素
を取り出す。取り出したレイアウト要素の色パラメータ
取り出す（ステップＳ９４）。また、レイアウト要素の
領域に対応する部分画像を入力画像から切り出す（ステ
ップＳ９５）。このレイアウト構造要素「文字ブロック
−１」と、その要素に対応する色パラメータと、入力画
像上でこの要素に対応する部分画像の３つをパラメータ
として領域画像写像処理を呼出す（ステップＳ９６）。FIG. 9 shows a flowchart of the processing in the output image generation module 14. That is, first, the tone values for all four colors of all the pixels in the output image memory are initialized to 0 (step S91). Next, the elements are sequentially extracted from the layout structure (step S9).
3). In this example, an element whose layout identifier is "character block-1", which is the first element of the layout structure, is extracted. The color parameters of the extracted layout element are extracted (step S94). Further, a partial image corresponding to the area of the layout element is cut out from the input image (step S95). The area image mapping process is called using the layout structure element "character block-1", the color parameter corresponding to the element, and the partial image corresponding to the element on the input image as parameters (step S96).

【００２１】領域画像写像処理では、まず「文字ブロッ
ク−１」に対応する領域に背景色をつける。「文字ブロ
ック−１」の「左上点ｘ座標、左上点ｙ座標、幅、高
さ」を取り出し、それぞれｘ１、ｙ１、ｗ１、ｈ１と
する。また、色パラメータの中の「色づけ領域マージン
（上、下、左、右）」をそれぞれｍａｒｇｉｎ＿ｕ、ｍ
ａｒｇｉｎ＿ｄ、ｍａｒｇｉｎ＿ｌ、ｍａｒｇｉｎ＿ｒ
とする。さらに、（ｘ１−ｍａｒｇｉｎ＿ｌ、ｙ１−ｍ
ａｒｇｉｎ＿ｕ）を左上点とし、（ｗ１＋ｍａｒｇｉｎ
＿ｌ＋ｍａｒｇｉｎ＿ｒ）を幅、（ｈ１＋ｍａｒｇｉｎ
＿ｕ＋ｍａｒｇｉｎ＿ｄ）を高さとした領域を出力画像
中に設定する。そして、その設定した領域内の画素の値
を、色パラメータ中の背景色の色相・彩度・明度から公
知の方法によって計算した４色の階調値情報に設定す
る。次に、前景色の色相・彩度・明度から同様に４色の
階調値を求める。そして、与えられた入力画像の部分画
像に含まれるすべての黒画素に対応する位置の、出力画
像メモリ１５１の画素を、前景色に対応する階調値情報
に変更する。この処理の結果、色パラメータにしたがっ
て、その背景色で色づけされた領域内に前景色で内容が
書き込まれた出力画像が生成される。これで「文字ブロ
ック−１」については終了し、テーブルの次の要素であ
る「文字ブロック−２」について同様な処理を繰り返
す。こうしてレイアウト構造中のすべての要素の写像が
終了したかどうかを判定し（ステップＳ９２）、終了で
あれば処理は終わる。この出力画像は４回スキャンさ
れ、それぞれの色の階調値情報からレーザービームプリ
ント技術によって普通紙上に色を重ねて可視化され、図
１０のように出力される。In the area image mapping processing, first, a background color is given to an area corresponding to "character block-1". The “upper left point x coordinate, upper left point y coordinate, width, and height” of “character block-1” are extracted and defined as x1, y1, w1, and h1, respectively. Also, “coloring area margins (upper, lower, left, right)” in the color parameters are designated as margin_u, m, respectively.
argin_d, margin_l, margin_r
And Further, (x1-margin_l, y1-m
argin_u) as the upper left point, and (w1 + margin
_L + margin_r) as width, (h1 + margin
_U + margin_d) is set in the output image as the height. Then, the values of the pixels in the set area are set to the gradation value information of the four colors calculated by a known method from the hue, saturation, and brightness of the background color in the color parameters. Next, similarly, tone values of four colors are obtained from the hue, saturation, and brightness of the foreground color. Then, the pixels of the output image memory 151 at the positions corresponding to all the black pixels included in the partial image of the given input image are changed to the gradation value information corresponding to the foreground color. As a result of this processing, an output image is generated in which the content is written with the foreground color in the area colored with the background color according to the color parameter. This is the end of “character block-1”, and similar processing is repeated for “character block-2”, which is the next element in the table. In this way, it is determined whether or not the mapping of all the elements in the layout structure has been completed (step S92). The output image is scanned four times, the colors are visualized by superimposing the colors on plain paper by the laser beam printing technique from the gradation value information of each color, and output as shown in FIG.

【００２２】本実施例の文書処理装置によれば、領域分
割モジュール１２により文書画像が領域に分割され、色
パラメータ決定モジュール１３により各領域ごとに色付
けのパラメータが決定され、そのパラメータを基に出力
画像生成モジュール１４で色付けされた文書画像が生成
され、プリントアウトモジュール１５により可視化され
るので、利用者は文書色づけに関して何らの操作や指示
をする必要がなく、従って、文書色づけに関する知識を
もたなくても、紙等に書かれた既存の文書をもとに、見
栄えの良い文書を容易に得ることができる。According to the document processing apparatus of the present embodiment, the document image is divided into regions by the region dividing module 12, and the coloring parameters are determined for each region by the color parameter determining module 13, and output based on the parameters. Since the document image colored by the image generation module 14 is generated and visualized by the printout module 15, the user does not need to perform any operation or instruction regarding the document coloring, and thus has the knowledge about the document coloring. Even if not, a good-looking document can be easily obtained based on an existing document written on paper or the like.

【００２３】（第２の実施例）本発明を複写機に応用し
た第２の実施例を図１１に示す。この実施例の装置は、
図１１に示すように、原稿画像入力モジュール１１１
と、領域分割モジュール１１２と、論理識別子付与モジ
ュール１１３と、色パラメータ決定モジュール１１４
と、出力画像生成モジュール１１５と、プリントアウト
・モジュール１１６とを備えている。(Second Embodiment) FIG. 11 shows a second embodiment in which the present invention is applied to a copying machine. The device of this embodiment is
As shown in FIG. 11, the document image input module 111
, An area division module 112, a logical identifier assignment module 113, and a color parameter determination module 114
, An output image generation module 115, and a printout module 116.

【００２４】原稿画像入力モジュール１１１および領域
分割モジュール１１２は第１の実施例と同様のものが用
いられ、領域分割モジュール１１２から出力される文書
のレイアウト構造も同様の形式を採用する。The original image input module 111 and the area dividing module 112 are the same as those in the first embodiment, and the layout structure of the document output from the area dividing module 112 employs the same format.

【００２５】論理識別子付与モジュール１１３は、領域
分割モジュール１１２において得られた文書のレイアウ
ト構造を入力とし、それぞれに「タイトル」、「著
者」、「本文」などといった論理的な名称を識別子とし
て付与するものである。これはこれは本出願人によって
先に出願された特願平３−３１３１８６号（発明者尾崎
正治および石田祐介「文書論理構造認識および文書内容
認識のための装置および方法」）に記載されている技術
を用いるのがよい。この技術では、認識対象のレイアウ
ト構造と論理構造の対応を表現する構造モデルをあらか
じめ複数個だけ構造モデル記憶部１１３１に登録してお
き、マッチング処理部１１３２によって領域分割結果と
構造モデルとのマッチング処理を行い、論理構造決定部
１１３３によりレイアウト構造の要素のそれぞれに論理
的な意味を表現する名称を付与する。また同時に、複数
個だけ登録された構造モデルのうち、どのモデルとマッ
チングが取れたかという情報から、入力された文書の種
類を推定する。The logical identifier assigning module 113 receives the layout structure of the document obtained in the area dividing module 112 as an input, and assigns a logical name such as “title”, “author”, or “text” as an identifier. Things. This is described in Japanese Patent Application No. 3-313186, previously filed by the present applicant (inventors Shoji Ozaki and Yusuke Ishida, "Apparatus and method for document logical structure recognition and document content recognition"). It is better to use technology. In this technique, a plurality of structural models expressing the correspondence between the layout structure and the logical structure to be recognized are registered in advance in the structural model storage unit 1131, and the matching processing unit 1132 performs a matching process between the region division result and the structural model. Is performed, and a name expressing a logical meaning is given to each of the elements of the layout structure by the logical structure determination unit 1133. At the same time, the type of the input document is estimated from information on which of the plurality of registered structural models has been matched.

【００２６】色パラメータ決定モジュール１１４は、あ
らかじめ登録してあるパラメータテーブル１１４１か
ら、入力文書の種類に対応した論理識別子ごとの色パラ
メータのデータを出力画像生成モジュール１１５に受け
渡す。色パラメータテーブル１１４１は入力文書の種類
や出力紙サイズなどに分類してＲＯＭ（ＲｅａｄＯｎ
ｌｙＭｅｍｏｒｙ）に記憶させておく。量が多い場合
は磁気ディスク装置等で実現しても構わない。色パラメ
ータテーブル１１４１は２種類のテーブルから構成され
ており、１つは入力文書の種類とそれに対応する各テー
ブル番号のテーブルでもう１つは論理識別子（例えば
「タイトル」）を持つ各要素に対してその前景色、背景
色などが示されたテーブルである。ただし、ここで前景
色とは出力文書画像における文字や図形の線など（すな
わち、白黒の画像における黒画素の部分）の色をいい、
背景色は、出力文書画像における各領域ごとの地の部分
の色をいうものとする。これらは最終的な各文書要素が
どのように色づけされなければならないかを表現してい
る。これらのテーブルの保持するパラメータデータは、
第１の実施例と同様の方法で決定されている。The color parameter determination module 114 transfers color parameter data for each logical identifier corresponding to the type of the input document to the output image generation module 115 from the parameter table 1141 registered in advance. The color parameter table 1141 is classified into a type of an input document, an output paper size, and the like, and a ROM (Read On).
ly Memory). If the amount is large, it may be realized by a magnetic disk device or the like. The color parameter table 1141 is composed of two types of tables. One is a table of the type of the input document and the corresponding table number. The other is a table for each element having a logical identifier (for example, “title”). This is a table in which the foreground color, background color, and the like are shown. Here, the foreground color refers to the color of a character or figure line in an output document image (that is, a black pixel portion in a black and white image),
The background color refers to the color of the ground portion for each region in the output document image. These represent how each final document element must be colored. The parameter data stored in these tables is
It is determined in the same manner as in the first embodiment.

【００２７】出力画像生成モジュール１１５は色パラメ
ータ決定モジュール１１４から色パラメータのデータを
受け取り、論理識別子付与モジュール１１３から論理構
造データと、原稿画像入力モジュール１１１から入力画
像を受け取る。そして、色パラメータ決定モジュール１
１４から受け取ったパラメータに従って、それぞれ論理
識別子別に対応する入力原稿の部分画像を変換し、出力
画像を生成していき、最終的に指示された色づけイメー
ジを持つ文書画像を生成する。プリントアウトモジュー
ルは第１の実施例と同様のものが用いられる。The output image generation module 115 receives color parameter data from the color parameter determination module 114, receives logical structure data from the logical identifier assignment module 113, and receives an input image from the document image input module 111. Then, the color parameter determination module 1
In accordance with the parameters received from 14, each of the partial images of the input document corresponding to each logical identifier is converted, an output image is generated, and finally a document image having the designated coloring image is generated. The same printout module as in the first embodiment is used.

【００２８】第１の実施例に準じて図３の文書例をもと
に第２の実施例の動作を具体的に説明する。処理のフロ
ーチャートは図１２に示す。画像入力モジュールおよび
領域分割モジュールは第１の実施例と同様の処理を行な
う（ステップＳ１２２）。The operation of the second embodiment will be specifically described based on the document example of FIG. 3 according to the first embodiment. A flowchart of the processing is shown in FIG. The image input module and the area division module perform the same processing as in the first embodiment (step S122).

【００２９】論理識別子付与モジュール１１３において
は、マッチング処理部１１３２が、領域分割モジュール
１１２から受け取ったレイアウト構造と構造モデル記憶
部１１３１に保持している構造モデルとのマッチングを
行い、そのマッチングの結果を基に、論理構造決定部１
１３３が、レイアウト構造の各要素がどのような論理的
な意味合いを持つかを示す論理名称を識別子として与え
る（ステップＳ１２３）。論理名称は「タイトル」、
「著者」、「サブタイトル」、「本文」、「脚注」「脚
注罫」、「図」、「グラフ」などである。通常、ある範
囲で流通する文書は「タイトル」、「著者」は上下配置
になっているなど、ある程度固定的なデザインの文書が
多い。この性質を利用するのが構造モデルで、構造モデ
ルとはすなわち対象となる文書のデザインについてのテ
ンプレートである。これらテンプレートである構造モデ
ルはＲＯＭによって構成される構造モデル記憶部１１３
１にあらかじめ複数記憶しておく。構造モデルは図１３
に示すように要素をその種別（文字ブロック、文字行、
けい線、図形など）と対応する論理名称を持つノード、
これら要素間の相対的な位置関係をリンクとするような
グラフ構造で表現される。In the logical identifier assigning module 113, the matching processing unit 1132 performs matching between the layout structure received from the area dividing module 112 and the structural model stored in the structural model storage unit 1131, and outputs the result of the matching. Based on the logical structure determination unit 1
133 gives, as an identifier, a logical name indicating what logical meaning each element of the layout structure has (step S123). The logical name is "title",
"Author", "Subtitle", "Body", "Footnote", "Footnote Rule", "Figure", "Graph", and the like. Usually, there are many fixed design documents such as "title" and "author" arranged vertically in a certain range. The structure model utilizes this property, and the structure model is a template for the design of a target document. The structural models serving as these templates are stored in a structural model storage unit 113 constituted by a ROM.
One is stored in advance in one. The structure model is shown in FIG.
As shown in the element's type (character block, character line,
Nodes with logical names corresponding to
It is represented by a graph structure in which the relative positional relationship between these elements is used as a link.

【００３０】この論理識別子付与モジュール１１３にお
けるマッチング処理部１１３２のマッチング処理とは、
すなわち、登録してある１つの構造モデルとマッチする
かどうかの判定を行うことである。登録されている構造
モデルを１つずつ取り出し、マッチング処理が成功する
まで順にマッチング処理を起動する。もし、すべての構
造モデルとのマッチングが失敗したならば入力文書は認
識不可能というメッセージを機器のパネルに表示し、色
づけを行なわない単なる複写を行なって終了する。The matching processing of the matching processing unit 1132 in the logical identifier assignment module 113 is as follows.
That is, it is to determine whether or not it matches one registered structural model. The registered structural models are extracted one by one, and the matching processes are sequentially activated until the matching process is successful. If the matching with all the structural models fails, a message indicating that the input document is unrecognizable is displayed on the panel of the device, and a simple copy without coloring is performed and the processing is terminated.

【００３１】ある構造モデルとのマッチングが成功した
場合はレイアウト構造における要素と構造モデルのノー
ドとの対応がわかるため、これを利用してレイアウト構
造のマッチングの対象となった各要素に構造モデルのノ
ードに定義されている論理名称を付与する。このように
レイアウト構造の要素に論理名称が付与されたデータを
論理構造と呼ぶ。論理構造を表現する実際のデータは図
１４に示すように論理名称とそれに対応するレイアウト
要素の識別子の対の列で表現される。また、何番目の構
造モデルとマッチングが成功したかという情報から、入
力文書がどのような種類の文書であるかを判定すること
ができる。If the matching with a certain structural model is successful, the correspondence between the elements in the layout structure and the nodes of the structural model can be known. Assign a logical name defined to the node. Data in which a logical name is assigned to an element of the layout structure in this manner is called a logical structure. Actual data representing a logical structure is represented by a row of pairs of a logical name and an identifier of a layout element corresponding to the logical name as shown in FIG. Also, it is possible to determine what kind of document the input document is based on the information on the number of the structural model and the success of the matching.

【００３２】色パラメータ決定モジュール１１４では、
論理識別子付与モジュール１１３において推定された入
力文書の種類（すなわちマッチングに成功したモデルの
番号）に従い、あらかじめＲＯＭに分類して登録してあ
る複数の色パラメータテーブルの中から対応するものを
選び出し、この色パラメータテーブルに記憶された色パ
ラメータを取り出し、各論理名称に対応する色パラメー
タを出力画像生成モジュール１１５に送る（ステップＳ
１２４）。出力画像生成モジュール１１５に送られる個
別色パラメータテーブルは図１５に示すように論理名称
ごとに前景色の色相・彩度・明度・（効果）、背景色の
色相・彩度・明度、色づけ領域マージンなどがテーブル
で管理されている。In the color parameter determination module 114,
According to the type of the input document estimated by the logical identifier assignment module 113 (that is, the number of the model that has been successfully matched), a corresponding one is selected from a plurality of color parameter tables that are classified and registered in the ROM in advance, and The color parameters stored in the color parameter table are extracted, and the color parameters corresponding to each logical name are sent to the output image generation module 115 (Step S)
124). As shown in FIG. 15, the individual color parameter table sent to the output image generation module 115 has the hue / saturation / brightness / (effect) of the foreground color, the hue / saturation / brightness of the background color, and the coloring area margin for each logical name. Are managed in a table.

【００３３】出力画像生成モジュール１１５は、論理構
造と色パラメータを受け取り、そのパラメータに指示さ
れた通りに入力原稿の部分画像を出力画像メモリ１１６
１に写像していく（ステップＳ１２５）。出力画像メモ
リ１１６１は第１の実施例と同様に４色分（シアン・マ
ゼンタ・黄・黒）の階調値情報を保持する。出力画像生
成モジュール１１５における処理のフローチャートは図
１６で示される通りである。すなわち、出力画像を初期
化し（ステップＳ１６１）、その後、個別色パラメータ
テーブルの要素の順にパラメータを取り出すとともに
（ステップＳ１６３）、その論理名称を持つ領域分割結
果を取り出す（ステップＳ１６４）。この例でいえば色
パラメータテーブルの最初は「タイトル」であるので、
「タイトル」という名称を持つレイアウト構造要素を論
理構造から取り出す。また、入力画像からそのレイアウ
ト構造要素の領域に対応する部分画像を切り出す（ステ
ップＳ１６５）。そして、前記「タイトル」に対応する
レイアウト構造要素と個別色パラメータテーブル中の
「タイトル」の色パラメータと、入力画像上でこのレイ
アウト構造要素に対応する部分画像の３つをパラメータ
として領域画像写像処理を呼出す（ステップＳ１６
６）。領域画像写像処理では、第１の実施例と同様の処
理が行なわれる。一つの要素について処理が終了する
と、未処理の要素がテーブルにあるかを判定し（ステッ
プＳ１６２）、あれば次の要素について前記の処理を行
い、未処理の要素がなくなったとき処理を終了する。The output image generation module 115 receives the logical structure and the color parameters, and stores a partial image of the input document as specified by the parameters in the output image memory 116.
1 (step S125). The output image memory 1161 holds gradation value information for four colors (cyan, magenta, yellow, and black) as in the first embodiment. A flowchart of the process in the output image generation module 115 is as shown in FIG. That is, the output image is initialized (step S161), and thereafter, parameters are extracted in the order of the elements of the individual color parameter table (step S163), and a region division result having the logical name is extracted (step S164). In this example, the title of the color parameter table is "Title".
The layout structure element having the name "Title" is extracted from the logical structure. Further, a partial image corresponding to the area of the layout structuring element is cut out from the input image (step S165). The area image mapping process is performed by using the layout structure element corresponding to the title, the color parameter of the title in the individual color parameter table, and the partial image corresponding to the layout structure element on the input image as parameters. (Step S16)
6). In the area image mapping processing, the same processing as in the first embodiment is performed. When the processing is completed for one element, it is determined whether there is an unprocessed element in the table (step S162), and if so, the above-described processing is performed for the next element. .

【００３４】また、その出力結果は、第１の実施例と同
様に紙の上に可視化され、図１７に示すような色付けさ
れた文書が出力される。Further, the output result is visualized on paper as in the first embodiment, and a colored document as shown in FIG. 17 is output.

【００３５】以上に詳述した第２の実施例は、第１の実
施例と同様に、利用者は文書色づけに関して何らの操作
や指示をする必要がなく、従って、文書色づけに関する
知識をもたなくても、紙等に書かれた既存の文書をもと
に、見栄えの良い文書を容易に得ることができる。ま
た、第２の実施例では、論理識別子付与モジュール１１
３により、論理構造を識別して、文書の種類を推定する
とともに、各領域を同じ種類内でも論理識別子でさらに
区別できるので、さらに変化に富んだ色付けを行うこと
ができ、一層見栄えの良い文書を得ることができる。In the second embodiment described in detail above, similarly to the first embodiment, the user does not need to perform any operation or instruction regarding document coloring, and therefore, has a knowledge of document coloring. Even if not, a good-looking document can be easily obtained based on an existing document written on paper or the like. In the second embodiment, the logical identifier assigning module 11
3, the logical structure is identified, the type of the document is estimated, and each region can be further distinguished by the logical identifier even within the same type, so that a more varied coloring can be performed, and a document with a better appearance can be obtained. Can be obtained.

【００３６】以上に説明した第１および第２の実施例
は、プリントアウトモジュールにおいて、フルカラーで
印字するレーザビームプリンタを用いるためのものであ
ったが、そのようにフルカラーで印字することが可能な
レーザビームプリンタを用いる代わりに、黒を基本とし
てもう１色追加した「＋１色印字」することが可能なレ
ーザービームプリンタを用いて、本発明を実施すること
もできる。この場合には、より低コストでの実現と印字
速度の向上が期待できる。ただし、この場合には利用で
きる色の数が制限されるため色パラメータ等を変更する
必要がある。この際には、網かけ等の修飾パターンを変
化させて色数の少なさを補うこともできよう。Although the first and second embodiments described above use a laser beam printer that prints in full color in the printout module, printing in full color is possible. Instead of using a laser beam printer, the present invention can be implemented using a laser beam printer capable of performing “+1 color printing” by adding another color based on black. In this case, lower cost and higher printing speed can be expected. However, in this case, since the number of available colors is limited, it is necessary to change color parameters and the like. In this case, it is possible to compensate for the small number of colors by changing a decoration pattern such as shading.

【００３７】（第３の実施例）第３の実施例は、色パラ
メータ決定モジュールとして、第１および第２の実施例
のようにテーブルを参照する代わり、例えばＩＦ−ＴＨ
ＥＮルール形式で書かれた知識ベースと、それを用いて
前向きや後向きの推論を行なう推論エンジンを利用する
ものである。すなわち、図１８に示すように、色パラメ
ータ決定モジュール１８４は、論理識別子付与モジュー
ル１８３から得られる情報、色パラメータとそれら相互
の関係を規定する知識を保持する知識データベース１８
４２と、論理識別子付与モジュール１８３からの情報に
もとづいて知識ベース１８４２を参照しながら前向き推
論や後ろ向き推論などのなんらかの推論を行って色パラ
メータを決定する推論機構１８４１（推論エンジン）よ
り構成される。推論機構の動作手順例を図１９をもとに
説明する。色パラメータ決定モジュール１８５は、図１
９に示すように、推論開始の指示を受けると、推論エン
ジンの動きにより、まず知識ベース１８４２からＩＦ−
ＴＨＥＮ形式に書かれたすべてのルールを取り出し、こ
れをｒｕｌｅｓとする（ステップＳ１９１）。次に論理
識別子付与モジュール１８３から送られてきたデータを
基にＩＦ条件部に書かれた条件がすべて真となるルール
すべてを見つけ出し、これをＴｒｕｌｅｓとする（ステ
ップＳ１９３）。次にＴｒｕｌｅｓの中からルールをひ
とつだけ選び、このルールのＴＨＥＮ実行部に書かれた
実行内容を実行する（ステップＳ１９３）。これらルー
ルの選択および実行を、Ｔｒｕｌｅｓが空になるか（ス
テップＳ１９４により判定）またはＴＨＥＮ実行部によ
る推論停止指示が出るまで（ステップＳ１９２により判
定）繰り返す。これにより、色パラメータ決定モジュー
ル１８５はレイアウト構造に応じて適切な色パラメータ
を決定する。なお、色パラメータ決定モジュール１８４
以外の構成要素は、図１１の第２の実施例と同じもので
ある。この第３の実施例によれば、それぞれの文書に適
する、より高度な色づけを行うことができる。(Third Embodiment) In a third embodiment, as a color parameter determination module, for example, IF-TH is used instead of referring to a table as in the first and second embodiments.
It utilizes a knowledge base written in EN rule format and an inference engine that performs forward and backward inference using the knowledge base. That is, as shown in FIG. 18, the color parameter determination module 184 stores information obtained from the logical identifier assigning module 183, the knowledge database 18 holding the color parameters and the knowledge defining their mutual relations.
42 and an inference mechanism 1841 (inference engine) that determines color parameters by performing some inference such as forward inference or backward inference while referring to the knowledge base 1842 based on information from the logical identifier assignment module 183. An example of the operation procedure of the inference mechanism will be described with reference to FIG. The color parameter determination module 185 is provided in FIG.
As shown in FIG. 9, when an instruction to start inference is received, the operation of the inference engine first causes the knowledge base 1842 to send the IF-
All the rules written in the THEN format are extracted and set as rules (step S191). Next, based on the data sent from the logical identifier assigning module 183, all rules where all the conditions written in the IF condition section are true are found, and these are set as Trues (step S193). Next, only one rule is selected from the rules, and the execution content of the rule written in the THEN execution unit is executed (step S193). The selection and execution of these rules are repeated until Trues becomes empty (determined in step S194) or an inference stop instruction is issued by the THEN execution unit (determined in step S192). Thereby, the color parameter determination module 185 determines an appropriate color parameter according to the layout structure. The color parameter determination module 184
The other components are the same as those of the second embodiment shown in FIG. According to the third embodiment, more sophisticated coloring suitable for each document can be performed.

【００３８】なお、以上に説明した第１ないし第３の実
施例において、原稿画像入力モジュール、プリントアウ
トモジュール等は、既存のハードウェア技術によって構
成することができる。また、領域分割モジュール、論理
識別しモジュール、色パラメータ決定モジュール、画像
生成モジュール等の一部または全部を、ソフトウェアモ
ジュールとして構成し、それらのソフトウェアモジュー
ルをＲＯＭあるいはその他の適宜の記憶装置に格納し、
中央処理装置により実行するようにしてもよく、あるい
は、それらを個別の論理素子を組み合せた論理回路によ
るハードウェアモジュールとして構成してもよい。In the first to third embodiments described above, the original image input module, the printout module and the like can be constituted by existing hardware technology. In addition, a part or all of the area division module, the logical identification module, the color parameter determination module, the image generation module, and the like are configured as software modules, and those software modules are stored in a ROM or other appropriate storage device,
They may be executed by a central processing unit, or they may be configured as hardware modules by a logic circuit in which individual logic elements are combined.

【００３９】[0039]

【発明の効果】以上詳細に述べてきたように、本発明に
よれば使用者は文書色づけに関する知識をもたなくて
も、紙等に書かれた既存の文書をもとに、色付けされた
見栄えの良い文書を容易に作成することができる。As has been described in detail above, according to the present invention, a user can obtain a color based on an existing document written on paper or the like without having knowledge of the document coloring. Good looking documents can be easily created.

【００４０】また、本発明の論理識別子付与手段を設け
た態様によれば、論理構造を識別することにより、文書
の種類を推定するとともに、各領域を同じ種類内でも論
理識別子でさらに区別できるので、さらに変化に富んだ
高品位の色付けを行うことができ、一層見栄えの良い文
書を得ることができる。Further, according to the aspect of the present invention provided with the logical identifier assigning means, the type of the document can be estimated by identifying the logical structure, and each area can be further distinguished by the logical identifier within the same type. In addition, a variety of high-quality coloring can be performed, and a document with a better appearance can be obtained.

【００４１】また、色パラメータ決定手段に知識ベース
と推論機構を用いた本発明の態様によれば、それぞれの
文書に適する、より高度な色づけを行うことができる。Further, according to the aspect of the present invention in which the knowledge base and the inference mechanism are used for the color parameter determining means, more advanced coloring suitable for each document can be performed.

[Brief description of the drawings]

【図１】第１の実施例を示す構成図FIG. 1 is a configuration diagram showing a first embodiment.

【図２】色づけ指示入力モジュールの例を示す図FIG. 2 shows an example of a coloring instruction input module.

【図３】入力文書の例を示す図FIG. 3 shows an example of an input document.

【図４】第１の実施例における処理のフローチャートFIG. 4 is a flowchart of a process according to the first embodiment;

【図５】入力文書例による領域分割モジュールの処理結
果を示す図FIG. 5 is a diagram showing a processing result of a region dividing module according to an example of an input document;

【図６】領域分割モジュールの処理結果のレイアウト構
造を示す図FIG. 6 is a diagram showing a layout structure of a processing result of a region division module;

【図７】第１の発明における色パラメータテーブルの例
を示す図FIG. 7 is a diagram showing an example of a color parameter table according to the first invention.

【図８】色パラメータテーブル中の色づけマージンの利
用例を示す図FIG. 8 is a diagram showing an example of using a coloring margin in a color parameter table.

【図９】第１の発明の出力画像生成モジュールにおける
処理のフローチャートFIG. 9 is a flowchart of processing in the output image generation module of the first invention.

【図１０】第１の実施例のプリントアウトモジュールに
おいて出力された画像の例を示す図FIG. 10 is a diagram illustrating an example of an image output by the printout module according to the first embodiment;

【図１１】第２の実施例を示す構成図FIG. 11 is a configuration diagram showing a second embodiment.

【図１２】第２の実施例における処理のフローチャートFIG. 12 is a flowchart of a process in the second embodiment.

【図１３】構造文書モデルの例を示す図FIG. 13 shows an example of a structural document model.

【図１４】入力文書例と構造文書モデル例とのマッチン
グ結果を示す図FIG. 14 is a diagram illustrating a matching result between an input document example and a structural document model example.

【図１５】第２の実施例における色パラメータテーブル
の例を示す図FIG. 15 is a diagram illustrating an example of a color parameter table according to the second embodiment.

【図１６】第２の実施例の出力画像生成モジュールにお
ける処理のフローチャートFIG. 16 is a flowchart of processing in an output image generation module according to the second embodiment.

【図１７】第２の実施例のプリントアウトモジュールに
おいて出力された画像の例を示す図FIG. 17 is a diagram illustrating an example of an image output by the printout module according to the second embodiment.

【図１８】第３の実施例を示す構成図FIG. 18 is a configuration diagram showing a third embodiment.

【図１９】第３の実施例における推論処理のフローチャ
ートFIG. 19 is a flowchart of the inference processing in the third embodiment.

[Explanation of symbols]

１１，１１１…原稿画像入力モジュール、１２，１１２
…領域分割モジュール、１３，１１４…色パラメータ決
定モジュール、１４，１１５…出力画像生成モジュー
ル、１５，１１６…プリントアウト・モジュール、１１
３…論理識別子付与モジュール。11, 111: Original image input module, 12, 112
... Area division module, 13, 114 ... Color parameter determination module, 14, 115 ... Output image generation module, 15, 116 ... Printout module, 11
3 ... Logical identifier assigning module.

フロントページの続き (56)参考文献特開平１−315865（ＪＰ，Ａ) 特開平４−160981（ＪＰ，Ａ) 中野康明外、「自動ファイリングのための文書理解の一方式」電子情報通信学会論文誌、昭和63年、Ｖｏｌ．Ｊ71− Ｄ，Ｎｏ．10 Ｐ．2050−2058Continuation of the front page (56) References JP-A-1-315865 (JP, A) JP-A-4-160981 (JP, A) Yasuaki Nakano Outside, "A method of document understanding for automatic filing" Journal of the Society of Japan, 1988, Vol. J71-D, No. 10 P. 2050-2058

Claims

(57) [Claims]

1. A document image holding unit for holding a document image, and a set of pixels on the document image held by the document image holding unit are divided into chunk regions each having a meaning as a document element. , Area type and area size for each area
Area information including the law and position and information indicating the relationship between the areas.
Area dividing means for outputting a layout structure including the table, and table means for storing correspondence between the type of the area and parameter values relating to coloring, and coloring the output document by referring to the table means based on an output result of the area dividing means. A color parameter determining unit that determines a parameter value related to each region, a region image for each region divided by the region dividing unit is cut out from a document image held in the document image holding unit, and the color parameter determining unit A document processing device comprising: an output image generation unit configured to process each area image according to the determined color parameter value for each area to generate an output image.

2. A document image holding unit for holding a document image, and a set of pixels on the document image held by the document image holding unit are divided into blocks each having a meaning as a document element. , Including area, area type, area dimensions, and position
Layout structure including information and information indicating relationship between regions
, A logical identifier assigning means for assigning an identifier representing the logical meaning of the document to each area on the document image divided by the area dividing means, and a logical identifier assigning means. A color parameter determining unit that determines a parameter value related to coloring of an output document for each logical identifier by referring to a result; and a logical image identifier assigned by the logical identifier assigning unit from a document image held in the document image holding unit. A document that cuts out each area image and processes each area image in accordance with the color parameter value for each logical identifier determined by the color parameter determination means to generate an output image. Processing equipment.

3. A document image storage for storing a document image.
Means and an image on the document image held by the document image holding means.
A collection of elements is defined as a set of
Area divided into areas, including area type, area dimensions, and position
Layout structure including information and information indicating relationship between regions
And a region on the document image divided by the region dividing means.
Identifiers that indicate the logical meaning of each document
Logical identifier assigning means, and an output document with reference to the result output by the logical identifier assigning means.
Determine the parameter value related to the coloring of each logical identifier
Color parameter determining means, and a document image stored in the document image storing means.
Each area for each logical identifier assigned by the logical identifier assigning means
Cut out the area image, and
According to the determined color parameter value for each logical identifier
An output image that generates an output image by processing each area image
A document processing apparatus comprising: a generation unit; wherein the logical identifier assigning unit is obtained by a structure model storage unit that stores a plurality of structural models representing a relationship between a layout structure and a logical structure; and a region division unit. A document comprising matching processing means for comparing the layout structure with the structural model; and logical structure determining means for assigning a logical identifier for each area based on the structural model that has been matched based on the result of the comparison. Processing equipment.

4. The color parameter determining means includes: a first table indicating a correspondence between a type of an input document and each corresponding table number; and a second table indicating a correspondence between a logical identifier and a parameter relating to coloring. 4. The document processing apparatus according to claim 3, further comprising a color parameter table having the following.

5. The color parameter determining means performs inference processing using a knowledge base storing knowledge information relating to coloring of a document and knowledge information stored in the knowledge base based on an output of the logical identifier providing means. claim 3, characterized in that by executing and a inference processing means for determining the parameter values for coloring documents output document
Document processing device as described.