JPH07287767A

JPH07287767A - Document picture processor

Info

Publication number: JPH07287767A
Application number: JP6101732A
Authority: JP
Inventors: Masakazu Fujimoto; 正和藤本; Maki Oonishi; 麻希大西; Shinya Kogou; 慎也古郷
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1994-04-18
Filing date: 1994-04-18
Publication date: 1995-10-31
Anticipated expiration: 2017-02-12
Also published as: JP3254896B2

Abstract

PURPOSE:To provide a document picture processor capable of simplifying an indication method for an object to be edited and executing specific edition such as the reduction of density in a part of a document without requiring complicated operation. CONSTITUTION:When an out-of-edition area indicating means 102 indicates an area not to be edited on a document picture of an input document to be edited, an indication area discriminating means 103 adds an indication area flag indicating corresponding relation between the area indicated by the means 102 and the document picture of the inputted original. An out-of-edition area discriminating means 104 converts the indication area flag added by the means 103 into an edition area flag indicating the editing area of the document picture. A document picture converting means 105 changes the density of an area extracted from the editing area indicated by the editing area flag and outputs a document picture 106 for an output document.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、ディジタル複写機など
において、文書の一部分についての濃度変換を行なうこ
とができる文書画像処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document image processing apparatus capable of density conversion of a part of a document in a digital copying machine or the like.

【０００２】[0002]

【従来の技術】近年のディジタル複写機において、その
複写性能および諸機能は著じるしく向上している。この
ような複写機の諸機能を用いて、紙の原稿の文書を複写
する場合、例えば、文書中で目立たせたくない文字の濃
度を下げることにより、目立たせたい部分との区別をは
っきりさせたり、本文の文字濃度を下げることにより、
おしゃれなイメージを付加したり、また、長い文章に対
しては、背景とのコントラストを抑えて複写することに
より、目が疲れにくいような高品質の文書にして複写す
ることなどが所望される場合がある。2. Description of the Related Art In recent years, the copying performance and various functions of digital copying machines have been remarkably improved. When copying a document of a paper manuscript using various functions of such a copying machine, for example, by lowering the density of characters that are not desired to stand out in the document, the distinction from the part to stand out can be made clear. , By reducing the character density of the text,
In some cases, it may be desirable to add a fashionable image, or to copy long texts by suppressing the contrast with the background to make high-quality documents that are easy on the eyes. is there.

【０００３】こような文字に対する編集を行うには、従
来の技術において、複写機ではないが、デスクトップパ
ブリッシングシステムの技術を使用することにより、文
字列に対する文字修飾を行い対応することができる。こ
のような技術としては、例えば、特開平３−１５６６６
７号公報に記載されている「文書編集処理装置」の技術
が利用できる。ここでは、文書の構成に基づき、文書中
の見出し文字列に対して、簡単な操作で文字修飾が行え
る技術が提案されている。また、電子コード化されてい
ない文書に対しては、例えば、特開平２−２２３２７５
号公報に記載されている「画像処理装置の編集制御方
式」のように、編集機能付のデジタルカラー複写機の技
術を用いることができる。この技術を用いることによ
り、編集方法として色変換を指定し、編集範囲の指定お
よび色の指定の操作を行なうことにより、紙に出力され
た文書の文字についての濃度を下げることができる。In order to edit such a character, the conventional technique is not a copying machine, but the technique of the desktop publishing system can be used to perform the character modification on the character string. As such a technique, for example, Japanese Laid-Open Patent Publication No. 3-156666.
The technique of "document edit processing device" described in Japanese Patent No. 7 can be used. Here, a technique has been proposed in which the headline character string in a document can be modified by a simple operation based on the structure of the document. For documents that are not electronically encoded, for example, Japanese Patent Laid-Open No. 2-223275.
The technique of a digital color copying machine with an editing function can be used as in the "editing control system of image processing apparatus" described in Japanese Patent Publication No. By using this technique, it is possible to reduce the density of the characters of the document output on the paper by designating the color conversion as the editing method, and designating the editing range and the color.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、これま
での編集機能付デジタルカラー複写機のような編集方法
を用いる場合には、まず、編集する範囲を指定し、次
に、色（濃度）を指定する２回の操作が必要である。ま
た、この場合、文書の一部分を残して編集したい場合に
は対応できない。このため、例えば、文書の本文全体の
濃度を下げる際、その中に数か所の強調部分があり、そ
の強調部分については濃度を下げたくない場合、その強
調部分に対応して編集対象領域を複数に分割し、分割し
た各々の編集対象領域に対して、編集対象として編集指
示の操作を行なわなければならない。そのため、編集の
操作が非常に複雑になり、また、煩雑となるという問題
点がある。However, in the case of using the editing method such as the conventional digital color copying machine with the editing function, the range to be edited is designated first, and then the color (density) is designated. Two operations are required. Further, in this case, it is not possible to deal with a case where a part of the document is to be edited and left. For this reason, for example, when reducing the density of the entire text of a document, if there are several emphasized parts in it and you do not want to reduce the density of that emphasized part, the edit target area is selected in correspondence with the emphasized part. It is necessary to divide the image into a plurality of areas and perform an edit instruction operation as an edit object for each of the divided edit object areas. Therefore, there are problems that the editing operation becomes very complicated and complicated.

【０００５】本発明は、これらの問題を解決するために
なされたものであり、本発明の目的は、編集対象の指示
方法を簡略化でき、また、煩雑な操作を行なわずに、文
書の一部分について例えば濃度を下げる編集を行うこと
ができる文書画像処理装置を提供することにある。The present invention has been made in order to solve these problems, and an object of the present invention is to simplify the method of designating an object to be edited and to perform a part of a document without performing a complicated operation. With respect to the document image processing apparatus, it is possible to perform editing to reduce the density.

【０００６】[0006]

【課題を解決するための手段】上記のような目的を達成
するため、本発明において第１の特徴とする文書画像処
理装置は、文書画像上の編集されない領域を指示する編
集対象外領域指示手段（１０２）と、編集対象外領域指
示手段により指示された領域と入力原稿の文書画像との
対応関係を示す指示領域フラグを付加する指示領域判別
手段（１０３）と、指示領域判別手段により付加された
指示領域フラグにより、文書画像の編集対象領域を示す
編集領域フラグに変換する編集対象領域判別手段（１０
４）と、編集領域フラグにより指示された編集対象領域
に対して抽出された領域の濃度を変換する文書画像変換
手段（１０５）とを備えることを特徴とする。In order to achieve the above object, the document image processing apparatus, which is the first feature of the present invention, is a non-editing target area designating means for designating an unedited area on a document image. (102), an instruction area discriminating means (103) for attaching an instruction area flag indicating the correspondence between the area instructed by the non-editing area instruction means and the document image of the input document, and the instruction area discriminating means. Based on the designated area flag, the edit area determining unit (10) converts the edit area into the edit area flag indicating the edit area.
4) and a document image conversion unit (105) for converting the density of the extracted area with respect to the edit target area designated by the edit area flag.

【０００７】本発明の第２の特徴とする文書画像処理装
置は、文書画像上の画素の集まりをそれぞれ文書要素の
意味のある塊として小領域に分割する領域分割手段（２
０２）と、領域分割手段により分割された小領域から、
その中の最下位要素の数を計数する最下位要素計数手段
（２０３）と、最下位要素計数手段で計数された小領域
中の最下位要素の数から長い文章の領域を判別する長文
領域判別手段（２０４）と、長文領域判別手段によって
抽出された領域の長文の濃度を変換する文書画像変換手
段（２０５）とを備えることを特徴とする。A second feature of the present invention, a document image processing apparatus, is an area dividing means (2) for dividing a collection of pixels on a document image into small areas as meaningful blocks of document elements.
02) and the small area divided by the area dividing means,
The lowest element counting means (203) for counting the number of the lowest elements in it, and the long sentence area discrimination for discriminating a long sentence area from the number of the lowest elements in the small areas counted by the lowest element counting means It is characterized by comprising means (204) and document image conversion means (205) for converting the density of the long sentence of the region extracted by the long sentence region discrimination means.

【０００８】本発明の第３の特徴とする文書画像処理装
置は、文書画像上の画素の集まりをそれぞれ文書要素の
意味のある塊として小領域に分割する領域分割手段（３
０２）と、領域分割手段によって分割された小領域にそ
れぞれ文書の論理的な構造を意味付ける識別子を付与す
る論理識別子付与手段（３０３）と、論理識別子付与手
段により付与された識別子に対応して濃度変換対象とす
る領域を判定する編集領域判別手段（３０４）と、編集
領域判別手段によって判別された領域の濃度を変換する
文書画像変換手段（３０５）とを備えることを特徴とす
る。A document image processing apparatus as a third feature of the present invention is an area dividing means (3) for dividing a group of pixels on a document image into small areas as meaningful blocks of document elements.
02), a logical identifier assigning means (303) for assigning an identifier that gives a logical structure of a document to each of the small areas divided by the area dividing means, and an identifier assigned by the logical identifier assigning means. It is characterized by comprising an edit area discrimination means (304) for determining an area to be a density conversion target, and a document image conversion means (305) for converting the density of the area discriminated by the edit area discrimination means.

【０００９】[0009]

【作用】本発明の第１の特徴とする文書画像処理装置に
おいては、編集対象外領域指示手段が、文書画像上の編
集されない領域を指示すると、指示領域判別手段が、編
集対象外領域指示手段により指示された領域と入力原稿
の文書画像との対応関係を示す指示領域フラグを付加す
る。この指示領域判別手段により付加された指示領域フ
ラグにより、編集対象領域判別手段が、文書画像の編集
対象領域を示す編集領域フラグに変換する。そして、文
書画像変換手段が、編集領域フラグにより指示された編
集対象領域に対して抽出された領域の濃度を変換する。In the first embodiment of the document image processing apparatus of the present invention, when the non-editing area designating means designates an area which is not edited on the document image, the designated area determining means causes the non-editing area designating means. A designated area flag indicating the correspondence between the area designated by and the document image of the input document is added. Based on the designated area flag added by the designated area determination means, the edit target area determination means converts the edit area flag into the edit area flag indicating the edit target area of the document image. Then, the document image converting means converts the density of the extracted area with respect to the edit target area designated by the edit area flag.

【００１０】このように、利用者が、編集対象外領域指
示手段により領域を指定するが、その領域は編集しない
領域として指示される。編集対象領域判別手段が、逆に
指示されなかった領域を抽出して編集領域とする。そし
て、文書画像変換手段が編集領域された領域の濃度を変
更する。これにより、複雑な編集領域を指示する場合で
あっても、編集しない領域の指示を行うことにより、領
域の指定が簡略化される。また、更に、例えば、編集す
る領域の指定と組合せて領域を指定することにより、更
に領域の指定が簡略化される。As described above, the user designates the area by the non-editing area designating means, but the area is designated as the area not edited. On the contrary, the edit target area discriminating means extracts the area which is not instructed to be the edit area. Then, the document image converting means changes the density of the edited area. As a result, even when a complicated edit area is designated, the designation of the area is simplified by designating the area that is not edited. Further, for example, the designation of the region is further simplified by designating the region in combination with the designation of the region to be edited.

【００１１】本発明の第２の特徴とする文書画像処理装
置においては、領域分割手段が、文書画像上の画素の集
まりをそれぞれ文書要素の意味のある塊として小領域に
分割する。最下位要素計数手段が、領域分割手段により
分割された小領域からその中の最下位要素の数を計数す
ると、長文領域判別手段は、最下位要素計数手段で計数
された小領域の中の最下位要素の数から長い文章の領域
を判別する。そして、文書画像変換手段が、長文領域判
別手段によって抽出された領域の長文の濃度を変換す
る。In the document image processing apparatus which is the second feature of the present invention, the area dividing means divides a group of pixels on the document image into small areas as meaningful blocks of the document elements. When the lowest element counting means counts the number of lowest elements in the small areas divided by the area dividing means, the long sentence area determining means determines the lowest of the small areas counted by the lowest element counting means. The area of a long sentence is determined from the number of lower elements. Then, the document image converting means converts the long sentence density of the region extracted by the long sentence region discriminating means.

【００１２】このように、最下位要素計数手段が、分割
された小領域から、文書画像上の画素の集まりの文書要
素の意味のある塊の最下位要素（文字）の数を計数し、
つまり、文字領域のブロックごとの文字数を計数するの
で、計数した値によって、長文領域判別手段が、文字領
域の中から長い文章に相当するブロックを判別して抽出
する。抽出されたブロックの領域に対して、文書画像変
換手段が濃度を変更する。このため、例えば、画像の濃
度を変更する領域を、予じめ文字領域と定めておけば、
その領域の指示が省略できる。As described above, the least significant element counting means counts the number of the least significant elements (characters) of a meaningful block of the document element of the collection of pixels on the document image from the divided small areas,
That is, since the number of characters in each block of the character area is counted, the long-sentence area discriminating means discriminates and extracts a block corresponding to a long sentence from the character area based on the counted value. The document image conversion means changes the density of the extracted block area. Therefore, for example, if the area where the density of the image is changed is defined as the preliminary character area,
The instruction of the area can be omitted.

【００１３】また、本発明の第３の特徴とする文書画像
処理装置においては、同じく、領域分割手段が、文書画
像上の画素の集まりをそれぞれ文書要素の意味のある塊
として小領域に分割するので、論理識別子付与手段が、
領域分割手段によって分割された小領域にそれぞれ文書
の論理的な構造を意味付ける識別子を付与する。論理識
別子付与手段により付与された識別子に対応して、編集
領域判別手段が、濃度変換対象とする領域を判定する。
そして、文書画像変換手段が、編集領域判別手段によっ
て判別された領域の濃度を変換する。In the document image processing apparatus of the third aspect of the present invention, similarly, the area dividing means divides each group of pixels on the document image into small areas as meaningful blocks of the document elements. Therefore, the logical identifier assigning means
Each small area divided by the area dividing means is provided with an identifier that means the logical structure of the document. The editing area determination means determines the area to be the density conversion target, corresponding to the identifier assigned by the logical identifier assigning means.
Then, the document image converting means converts the density of the area discriminated by the editing area discriminating means.

【００１４】このように、本発明の第３の特徴とする文
書画像処理装置においては、論理識別子付与手段によっ
て、文字領域のブロックごとに「本文」，「見出し」，
または「注」などの文書の論理的な構造を意味付ける識
別子を付与する。ここで付与された識別子に応じて、編
集領域判別手段が、例えば、「注」の識別子が付与され
た主要ではない文字領域を判別し、この判別の結果をも
とづき、文書画像変換手段が該当の領域の濃度を変更す
る。この場合においても前述の場合と同様に、画像の濃
度を変更する領域を、予じめ識別子により定めておけ
ば、その領域の指示の省略が可能となる。As described above, in the document image processing apparatus, which is the third feature of the present invention, the "text", "heading",
Alternatively, an identifier such as "Note" that gives a logical structure of the document is given. In accordance with the assigned identifier, the editing area determination means determines, for example, a non-main character area to which the identifier “Note” is assigned, and based on the result of this determination, the document image conversion means determines Change the density of the area. Also in this case, as in the case described above, if the area in which the density of the image is to be changed is defined by the predetermined identifier, the instruction of the area can be omitted.

【００１５】[0015]

【実施例】以下、本発明の実施例を図面を参照して具体
的に説明する。図１は、本発明の文書画像処理装置の第
１の実施例の基本構成を示すブロック図である。図１に
おいて、１０１は編集対象文書画像、１０２は編集対象
外領域指示部、１０３は指示領域判別部、１０４は編集
対象領域判定部、１０５は文書画像変換部、１０６は編
集後文書画像である。Embodiments of the present invention will be specifically described below with reference to the drawings. FIG. 1 is a block diagram showing the basic configuration of the first embodiment of the document image processing apparatus of the present invention. In FIG. 1, 101 is an edit target document image, 102 is a non-edit target area designating section, 103 is a designated area determining section, 104 is an edit target area determining section, 105 is a document image converting section, and 106 is an edited document image. .

【００１６】編集対象外領域指示部１０２は、編集され
ない領域の座標を入力するため機能要素であり、デジタ
イザ，ライトペン，マウスなどポインティングデバイス
の座標入力装置が利用される。指示領域判別部１０３
は、編集対象外領域指示部１０２により指示された領域
が、編集対象画像１０１として入力された原稿の文書画
像データ中のどの領域に相当するかを判別するための処
理機能要素である。編集対象領域判別部１０４は、指示
された領域が編集対象外となるように、逆に編集対象領
域を抽出する処理機能要素である。また、文書画像変換
部１０５は、編集領域判別部１０４によって抽出された
編集対象となる領域の画像の濃度やコントラストを変換
し、出力画像を生成する処理機能要素である。文書画像
変換部１０５からは、編集後文書画像１０６が出力され
る。文書画像変換部１０５としては通常の画像形成処理
装置が利用される。The non-editing target area designating section 102 is a functional element for inputting coordinates of a non-editing area, and a coordinate input device such as a digitizer, a light pen, a mouse, or a pointing device is used. Designated area discrimination unit 103
Is a processing function element for determining which region in the document image data of the document input as the editing target image 101 corresponds to the region designated by the non-edition region designating unit 102. The edit target area determination unit 104 is a processing function element that conversely extracts the edit target area so that the designated area is excluded from the edit target. The document image conversion unit 105 is a processing function element that converts the density or contrast of the image of the area to be edited extracted by the editing area determination unit 104 and generates an output image. The edited document image 106 is output from the document image conversion unit 105. A normal image forming processing device is used as the document image converting unit 105.

【００１７】次に、このような各々の機能要素により構
成される文書画像処理装置を、デジタルカラー複写機に
適用した場合を例として説明する。装置を構成する各々
の機能要素は、前述した公知例となっている特開平２−
２２３２７５号公報に記載されているようなディジタル
カラー複写機における各々の機能要素が利用できるの
で、個別の各々の機能要素についての説明は省略し、以
下の説明では、文書画像処理の動作を順を追って説明す
る。Next, the case where the document image processing apparatus constituted by such respective functional elements is applied to a digital color copying machine will be described as an example. Each functional element that constitutes the device is a publicly known example described above.
Since each functional element in the digital color copying machine as described in Japanese Patent No. 223275 can be used, description of each individual functional element will be omitted, and in the following description, the operation of document image processing will be described in order. I will explain later.

【００１８】図３は、デジタルカラー複写機に適用した
文書画像処理装置の要部の構成を説明するブロック図で
ある。図３において、２０は文書画像処理装置、２１は
イメージスキャナ、２２は編集対象外領域指示モジュー
ル、２３は指示領域判別モジュール、２４は編集対象領
域判別モジュール、２５は文書画像変換モジュール、２
６はプリンタ機構、２７は制御モジュール、また、２８
はコントロールパネル部である。FIG. 3 is a block diagram for explaining the arrangement of the main part of the document image processing apparatus applied to the digital color copying machine. In FIG. 3, 20 is a document image processing device, 21 is an image scanner, 22 is a non-editing target area designation module, 23 is a designated area determination module, 24 is an edit target area determination module, 25 is a document image conversion module, 2
6 is a printer mechanism, 27 is a control module, and 28
Is the control panel section.

【００１９】コントロールパネル部２８は、利用者から
濃度変換などの指示を受け付ける機能要素であり、テン
キーおよびファンクションキーからなるキーボード、お
よびディスプレイなどから構成される。制御モジュール
２７はコントロールパネル部２８に対するデータの入出
力処理、イメージスキャナ２１の起動処理、プリンタ機
構２６の起動処理などの制御処理を行なう制御ユニット
である。制御ユニットは制御用のマイクロプロセッサが
搭載されて構成されており、これらの制御処理プログラ
ムが内部にプログラムされている。The control panel unit 28 is a functional element that receives an instruction such as density conversion from a user, and is composed of a keyboard including ten keys and function keys, a display, and the like. The control module 27 is a control unit that performs control processing such as data input / output processing for the control panel unit 28, image scanner 21 startup processing, and printer mechanism 26 startup processing. The control unit is configured by mounting a control microprocessor, and these control processing programs are programmed inside.

【００２０】また、図２はディジタル複写機におけるコ
ンソールパネルの一例を示す図である。図２に示すよう
に、コンソールパネル３０には、複写枚数指定用のテン
キー部３１と、設定された複写枚数の表示部３２と、通
常の複写モードを指示する複写ボタン３３と、部分的な
濃度変換モードを指示する濃度変換ボタン３４と、複写
スタートボタン３５と、状態表示部３６とが設けられて
いる。複写ボタン３３と濃度変換ボタン３４は内部にそ
の機能が指定されているかどうかを示すＬＥＤランプが
設けられており、図２に示すような状態では、濃度変換
のモードが選択されている状態を表示している。次に、
このような濃度変換のモードが選択されている場合の処
理の動作をフローチャートを参照して説明する。FIG. 2 is a diagram showing an example of a console panel in a digital copying machine. As shown in FIG. 2, the console panel 30 includes a ten-key unit 31 for designating the number of copies, a display unit 32 for the set number of copies, a copy button 33 for instructing a normal copy mode, and a partial density. A density conversion button 34 for instructing a conversion mode, a copy start button 35, and a status display section 36 are provided. The copy button 33 and the density conversion button 34 are internally provided with LED lamps indicating whether or not their functions are designated. In the state shown in FIG. 2, the state in which the density conversion mode is selected is displayed. is doing. next,
The operation of the processing when such a density conversion mode is selected will be described with reference to the flowchart.

【００２１】図４は、指定された領域に対する濃度変換
を行う場合の処理を流れを示すフローチャートである。
図４を参照して処理の概略を説明する。電源が投入さ
れ、処理が開始されると、まず、ステップ４０におい
て、立ち上げ処理を行う。次に、ステップ４１におい
て、濃度変換モードが指定されているか否かを判定す
る。濃度変換モードが指定されていない場合には、ステ
ップ４２に進み、通常の複写処理を行う。そして、再
び、ステップ４１に戻り、濃度変換モードの指定を判定
する。FIG. 4 is a flow chart showing the flow of processing when density conversion is performed on a specified area.
The outline of the processing will be described with reference to FIG. When the power is turned on and the process is started, first, in step 40, a startup process is performed. Next, in step 41, it is determined whether or not the density conversion mode is designated. If the density conversion mode is not designated, the routine proceeds to step 42, where normal copying processing is performed. Then, the process returns to step 41 again, and the designation of the density conversion mode is determined.

【００２２】ステップ４１の判定により、濃度変換モー
ドが指定されていることが判定されると、ステップ４３
からの一部の領域指定による濃度変換処理を行う。この
処理では、まず、ステップ４３でスタートが指示された
か否かを判定する。スタートの指示が判定されない場合
は、ステップ４４に進み、領域指示入力の受け付け処理
を行い、ステップ４３に戻って、再び、スタートが指示
されたか否かを判定する。つまり、スタートが指示され
るまでは、ステップ４４の領域指示入力の受け付け処理
を繰り返し行う。この領域指示入力では、編集対象外と
する領域の指定を行う。When it is determined in step 41 that the density conversion mode is designated, step 43
The density conversion process is performed by designating a part of the area. In this process, first, in step 43, it is determined whether or not the start is instructed. If the start instruction is not determined, the process proceeds to step 44, the area instruction input acceptance processing is performed, the process returns to step 43, and it is determined again whether the start instruction has been performed. That is, the process of accepting the area instruction input in step 44 is repeated until the start is instructed. In this area instruction input, an area to be excluded from editing is designated.

【００２３】領域指示入力の受け付け処理が終了し、更
に、スタートが指示されると、入力指示された領域に応
じて、ステップ４５からの濃度変換の処理を行う。この
処理では、ステップ４５において、原稿読み取りの処理
を行い、次に、ステップ４６において、指示領域判別の
処理を行う。ここで指示されている領域は画像編集を行
う編集対象外とする領域の指定なので、次のステップ４
７において、この指定の領域から編集領域抽出の処理を
行う。次に、ステップ４８において、抽出された編集領
域の文字画像の変換を行い、次のステップ４９におい
て、画像出力を行う。そして、ステップ４１に戻り、次
の文書画像に対して同様の処理の流れに従って、ステッ
プ４１から処理を繰り返し行う。When the area instruction input acceptance processing is completed and the start is further instructed, the density conversion processing from step 45 is performed in accordance with the input instruction area. In this process, the document reading process is performed in step 45, and then the designated area determination process is performed in step 46. Since the area specified here is the area to be excluded from the editing target for image editing, the next step 4
At 7, the editing area is extracted from the designated area. Next, in step 48, the character image of the extracted editing area is converted, and in the next step 49, the image is output. Then, the process returns to step 41, and the process is repeated from step 41 on the next document image according to the same process flow.

【００２４】次に、このような領域を指定した部分的な
濃度変換の処理について、具体的な文書画像の処理例に
ついて説明する。図５は、処理対象の文書画像として入
力する白黒の入力文書の一例を示す図であり、図６は、
入力文書において処理対象の領域を指定する場合の操作
例を説明する図である。また、図７は、２５６階調グレ
ースケールによる文書画像の画像データを部分的に示す
図であり、図８は、画像データに指示領域フラグが設け
られた場合の画像データを部分的に示す図であり、図９
は、指示領域フラグが反転されて編集対象領域フラグと
された状態の画像データを部分的に示す図である。ま
た、図１０は、指定された領域に対する濃度変換が行な
われた状態の画像データを部分的に示す図であり、図１
１は、最終的に濃度変換が行なわれた状態の出力文書の
文書画像の一例を示す図である。図１２は、領域が指示
された状態の領域テーブルの一例を示す図である。Next, a specific document image processing example of the partial density conversion processing in which such an area is designated will be described. FIG. 5 is a diagram showing an example of a monochrome input document input as a document image to be processed, and FIG.
FIG. 10 is a diagram illustrating an operation example when a region to be processed is specified in an input document. Further, FIG. 7 is a diagram partially showing image data of a document image in 256 gray scales, and FIG. 8 is a diagram partially showing image data when a designated area flag is provided in the image data. And FIG.
FIG. 6 is a diagram partially showing image data in a state in which a designated area flag is inverted to be an edit target area flag. Further, FIG. 10 is a diagram partially showing the image data in the state where the density conversion is performed on the designated area.
FIG. 1 is a diagram showing an example of a document image of an output document in which density conversion is finally performed. FIG. 12 is a diagram showing an example of an area table in a state where an area is designated.

【００２５】次に、これらの図５〜図１２を参照しなが
ら、指定された領域に対する濃度変換を行う場合の処理
を説明する。以降の説明では、図５に示す入力文書を濃
度変換する場合を例として、図６〜図１２を参照しなが
ら順次にその動作例を説明する。なお、文書画像の処理
を行う場合の位置の基準として、直交座標の座標値を用
いるが、この座標軸は、図５の中に示すように、文書画
像のページ右に向かってｘ軸、ページ下に向かってｙ軸
とする。Next, with reference to these FIGS. 5 to 12, the processing when the density conversion is performed for the designated area will be described. In the following description, an example of the operation will be sequentially described with reference to FIGS. 6 to 12 by taking the case of density conversion of the input document shown in FIG. 5 as an example. It should be noted that although coordinate values of Cartesian coordinates are used as a position reference when processing a document image, this coordinate axis is, as shown in FIG. Toward the y-axis.

【００２６】電源が投入されると、制御モジュール２７
が立ち上げ処理を行ない（ステップ４０：図４）、コン
トロールパネル部２８においてコンソールパネルの状態
表示部３６に初期画面を表示する。この初期画面が表示
された状態において、次に、利用者がコンソールパネル
上の濃度変換ボタン３４を押すと、濃度変換モードが指
示され、濃度変換モードとなる。濃度変換モードが指示
されていなければ、通常の複写処理を行なう。濃度変換
モードになった後は、複写スタートボタン３５が押され
るまで、領域指示入力を受け付ける（ステップ４１〜４
４）。When the power is turned on, the control module 27
Performs a startup process (step 40: FIG. 4), and the control panel unit 28 displays an initial screen on the status display unit 36 of the console panel. Next, when the user presses the density conversion button 34 on the console panel while the initial screen is displayed, the density conversion mode is instructed and the density conversion mode is set. If the density conversion mode is not designated, normal copying processing is performed. After entering the density conversion mode, the area instruction input is accepted until the copy start button 35 is pressed (steps 41 to 4).
4).

【００２７】領域指示入力の受け付けの処理（ステップ
４４）では、利用者は、編集対象外領域指示モジュール
２２のデジタイザの上に原稿を置き、ここでは編集を加
えない部分の領域の指定入力を行う。つまり、この領域
指示入力の処理では、領域指定方法と座標と入力する。
この領域指定方法としては、例えば矩形，正方形または
円などの領域の形状を指定し、これに対して、その形状
に対する座標を入力する。図１２に示すように、例え
ば、矩形の形状で領域を入力する場合には、領域指定方
法として矩形を指定し、その始点と終点として、矩形の
対角の２点を指示する。また、図示していないが、例え
ば、領域指定方法として円を指定した場合には、その始
点と終点として、円の中心点と円周上の１点を指示す
る。ここで指定した領域は、編集を加えない領域として
処理される。In the process of accepting the area instruction input (step 44), the user places an original on the digitizer of the non-editing area instruction module 22, and here, the user inputs the area of the area not edited. . That is, in this area instruction input processing, the area designation method and coordinates are input.
As the area specifying method, for example, the shape of an area such as a rectangle, a square, or a circle is specified, and the coordinates for the shape are input. As shown in FIG. 12, for example, when inputting a region in a rectangular shape, a rectangle is designated as the region designation method, and two points on the diagonal of the rectangle are designated as the start point and the end point. Although not shown, for example, when a circle is designated as the area designation method, the center point of the circle and one point on the circumference are designated as the start point and the end point. The area specified here is processed as an area not edited.

【００２８】例えば、図５に示すような入力文書の文書
画像５１に対して、ある文字列の領域のみ、濃度はその
ままとし、その他の領域については濃度を押える（濃度
を薄くする）ような画像編集を行う場合、編集を加えな
い部分の領域を、図６に示すように、その領域指定方法
と始点および終点の座標の入力を行う。この場合、入力
文書の文書画像５１に対して、第１の編集領域５２とし
て領域指定方法で矩形を指定し、その始点５３および終
点５４を指示し、また、第２の編集領域５５として領域
指定方法で矩形を指定し、その始点５６および終点５７
を指示する。For example, with respect to a document image 51 of an input document as shown in FIG. 5, an image in which the density remains unchanged only in the area of a certain character string and the density is suppressed (decreased density) in the other areas. In the case of editing, the area designating method and the coordinates of the start point and the end point are input to the area of the portion not to be edited, as shown in FIG. In this case, for the document image 51 of the input document, a rectangle is designated as the first edit area 52 by the area designation method, the start point 53 and the end point 54 thereof are designated, and the area is designated as the second edit area 55. Specify the rectangle by the method and its start point 56 and end point 57
Instruct.

【００２９】このように、指定された領域指定方法の内
容とその座標（始点，終点）の入力に応じて、制御モジ
ュール２７は、領域として指定された座標値を、図１２
に示すように、順次に領域テーブル６０に書き込む。こ
の領域テーブル６０に書き込まれた領域の例は、文書中
の２か所の矩形の領域を、編集しない領域として指示す
る例となっている。そして、文書画像に対して編集対象
外の領域の全てを指定し終えると、次に、原稿を原稿台
（プラテン）の上に置いて、複写スタートボタン３５を
押す。複写スタートボタン３５が押されると、制御モジ
ュール２７は、続いて、イメージスキャナに読み取り指
示信号を送り、原稿画像の読み取りが開始される（ステ
ップ４５：図４）。As described above, according to the contents of the designated area designating method and the input of the coordinates (start point, end point) thereof, the control module 27 sets the coordinate values designated as the area in FIG.
As shown in, the data is sequentially written in the area table 60. The example of the area written in the area table 60 is an example in which two rectangular areas in the document are designated as areas not to be edited. Then, when all areas not to be edited are designated for the document image, the original is placed on the original table (platen) and the copy start button 35 is pressed. When the copy start button 35 is pressed, the control module 27 subsequently sends a reading instruction signal to the image scanner to start reading the original image (step 45: FIG. 4).

【００３０】原稿画像の読み取りが行なわれると、図７
に示すように、読み取られた画像データ６１は表形式の
データで表現され、画像メモリ（図示せず）に格納され
る。この実施例では、入力文書が白黒文書であり、イメ
ージスキャナ２１が２５６階調のグレースケールで入力
文書からの画像データを読み取る場合を例にして説明し
ているが、カラー文書の場合には、Ｒ（赤），Ｇ
（緑），Ｂ（青）の３色のそれぞれについて、グレース
ケールの場合と同様に、２５６階調での画像処理とする
ことにより、同様に扱え、同様な効果が得られる。ま
た、２値画像の場合であっても、白い部分の階調を
“０”、黒い部分の階調を“２５５”とすることによ
り、グレースケールの場合と同様に扱うことができる。When the original image is read, FIG.
As shown in FIG. 5, the read image data 61 is represented by tabular data and stored in an image memory (not shown). In this embodiment, the case where the input document is a monochrome document and the image scanner 21 reads the image data from the input document in a gray scale of 256 gradations is described as an example. However, in the case of a color document, R (red), G
For each of the three colors of (green) and B (blue), the same processing can be performed and the same effect can be obtained by performing image processing with 256 gradations as in the case of gray scale. Further, even in the case of a binary image, by setting the gradation of the white part to "0" and the gradation of the black part to "255", it is possible to treat it in the same manner as the case of the gray scale.

【００３１】原稿画像の読み取りが終了すると、読み取
られた画像データ６１と、編集対象外として指示入力さ
れた領域の座標データ（領域テーブル６０）が、指示領
域判別モジュール２３に送られ、指示領域判別の処理が
行なわれる（ステップ４６）。つまり、指示領域判別モ
ジュール２３では、まず、画像データ６１に対し、編集
対象外として指示された領域と指示されなかった領域を
区別する。このため、画像データ６１を、図８に示すよ
うに、指示領域フラグ６３を追加した画像データ６２と
する。画像データ６２に指示領域フラグ６３を追加する
場合、まず、全ての指示領域フラグ６２に、初期値とし
て“０”を書き込み、更に、領域テーブル６０（図１
２）の座標データをもとに指示された領域だけにその指
示領域フラグを“１”とする書き込みを行う。例えば、
図１２に示す領域テーブル６０では、始点および終点の
座標（ｘ，ｙ）により、ｘ１≦ｘ＜ｘ２かつｙ１≦ｙ＜
ｙ２を満たす矩形の領域と、ｘ３≦ｘ＜ｘ４かつｙ３≦
ｙ＜ｙ４を満たす矩形の領域の２つの領域を指示してい
るので、この２つの領域の範囲内の画像データの点（ド
ット）の指示領域フラグ６３を“１”とする。この結
果、図８に示すように、指示領域フラグ６３がそれぞれ
“１”または“０”に設定された状態となる。When the reading of the original image is completed, the read image data 61 and the coordinate data (area table 60) of the area designated and input as the non-editing target are sent to the designated area determination module 23, and the designated area determination is performed. Is performed (step 46). That is, in the designated area determination module 23, first, in the image data 61, an area designated as a non-editing target area and a non-designated area are distinguished. Therefore, the image data 61 is set as the image data 62 to which the designated area flag 63 is added as shown in FIG. When adding the designated area flags 63 to the image data 62, first, “0” is written as an initial value in all the designated area flags 62, and further, the area table 60 (see FIG.
Writing is performed with the designated area flag set to "1" only in the designated area based on the coordinate data of 2). For example,
In the area table 60 shown in FIG. 12, x1 ≦ x <x2 and y1 ≦ y <depending on the coordinates (x, y) of the start point and the end point.
A rectangular area satisfying y2 and x3 ≦ x <x4 and y3 ≦
Since two areas of the rectangular area satisfying y <y4 are designated, the designated area flag 63 of the point (dot) of the image data within the range of these two areas is set to "1". As a result, as shown in FIG. 8, the designated area flag 63 is set to "1" or "0", respectively.

【００３２】なお、この実施例の説明では理解を容易に
するため、指示領域フラグ６３に設定される値としては
“０”と“１”の場合のみを示しているが、この指示領
域フラグ６３は、同時に、他の編集方法を併用するた
め、つまり、それぞれの編集方法に応じて指示する領域
を区別するため、指示領域フラグ６３の値として“０”
または“１”以外の値をとるようにしても良い。例え
ば、他の編集方法による領域を指示する場合（別の濃度
に変換するような場合など）、その領域の指定のため、
指示領域フラグの値を“２”とする。これにより、最初
の指定の領域と区別して、領域を指定することができ
る。この結果、その領域は、最初の濃度変換の編集の影
響を受けないようにすることができる。In order to facilitate understanding in the description of this embodiment, the values set in the designated area flag 63 are only "0" and "1", but this designated area flag 63 is shown. Simultaneously uses another editing method, that is, in order to distinguish the area to be designated according to each editing method, the value of the designated area flag 63 is “0”.
Alternatively, a value other than "1" may be set. For example, when designating a region by another editing method (such as when converting to another density), to designate that region,
The value of the designated area flag is set to "2". As a result, the area can be designated separately from the first designated area. As a result, the region can be unaffected by the editing of the first density conversion.

【００３３】このようにして、指示領域フラグ６３が追
加された画像データ６２は、編集対象領域判別モジュー
ル２４に送られ、編集領域抽出の処理が行われる（ステ
ップ４７）。すなわち、編集対象領域判別モジュール２
４は、指示領域フラグ６３が追加された画像データ６２
を受けとると、この画像データ６２の指示領域フラグ６
３を、図９に示すように、反転して、編集対象領域フラ
グ６４に変更し、画像データ６５とする処理を行う。具
体的には、指示領域フラグ６３が“１”の場合には編集
対象領域フラグ６４は“０”とし、指示領域フラグ６３
が“０”の場合には編集対象領域フラグ６４を“１”と
する変更を行う。なお、上述したように、他の編集方法
が併用されており、指示領域フラグの値が“１”または
“０”以外の別の値となっているような場合には、ここ
で指示領域フラグ６３の値は、編集対象領域フラグ６４
のような値に変更されないので、その領域を抽出する影
響を受けない。編集領域抽出の処理が行われた後の画像
データ６５は、文書画像変換モジュール２５に送られ
る。In this way, the image data 62 to which the designated area flag 63 has been added is sent to the editing area discrimination module 24, and the editing area extraction processing is performed (step 47). That is, the edit target area determination module 2
4 is the image data 62 to which the designated area flag 63 is added.
When receiving the instruction area flag 6 of the image data 62,
As shown in FIG. 9, 3 is inverted and changed to the edit target area flag 64, and the image data 65 is processed. Specifically, when the designated area flag 63 is “1”, the edit target area flag 64 is set to “0”, and the designated area flag 63 is set.
If the value is "0", the edit target area flag 64 is changed to "1". As described above, if another editing method is also used and the value of the designated area flag is another value other than “1” or “0”, the designated area flag is set here. The value of 63 is the edit target area flag 64.
Since it is not changed to a value such as, it is not affected by extracting that area. The image data 65 after the editing area extraction process is performed is sent to the document image conversion module 25.

【００３４】文書画像変換モジュール２５では、編集対
象領域フラグ６４を有する画像データ６５に基づいて、
文書画像変換の処理を行う（ステップ４８）。つまり、
文書画像変換モジュール２５は、編集対象領域フラグ６
４が付加されている画像データ６５を受けとると、編集
対象領域フラグ６４を１ドットづつ読み出し、編集対象
領域フラグ６４が立っている（編集対象領域フラグが
“１”である）場合に、そのドットの画像データの値を
読み出す。そして、読み出した画像データの値を編集対
象領域以外の領域（編集対象領域フラグが“０”である
領域）との濃度差をつけるように書きかえる。In the document image conversion module 25, based on the image data 65 having the edit target area flag 64,
A document image conversion process is performed (step 48). That is,
The document image conversion module 25 uses the edit target area flag 6
When the image data 65 to which 4 is added is received, the edit target area flag 64 is read dot by dot, and if the edit target area flag 64 is set (the edit target area flag is "1"), the dot The value of the image data of is read. Then, the value of the read image data is rewritten so as to have a density difference from the area other than the area to be edited (area where the area flag to be edited is “0”).

【００３５】濃度差をつける編集を行う具体例として、
例えば、図１０に示すように、編集対象領域フラグが
“１”である画素のドットに対してのみ、その画像デー
タの値（濃度の値）が０．６倍とする。このような画像
データの値の書き換えが行なわれ、編集された状態の画
像データ６６となる。この実施例では、濃度の変換を行
う場合に、係数０．６を掛けるようにしているが、トナ
ーを用いて記録する複写機の場合では、約０．３〜０．
８程度の係数を掛るようにする。As a specific example of editing for making a density difference,
For example, as shown in FIG. 10, the value (density value) of the image data is set to 0.6 times only for the dot of the pixel whose edit target area flag is “1”. The value of the image data is rewritten in this way to obtain the edited image data 66. In this embodiment, the coefficient is multiplied by 0.6 when the density is converted. However, in the case of a copying machine which records using toner, it is about 0.3 to 0.
A coefficient of about 8 is applied.

【００３６】また、画像変換の処理では、編集対象領域
のドットの濃度を下げる代わりに、編集対象領域の背景
部分をグレーにする編集方法も利用しても良い。その場
合には、係数を掛けるのではなく、編集対象領域の全て
の画素の画像データの濃度の値に、例えば係数２５を加
える（ただし、加算した値の最大値は２５５を越えな
い）処理を行う。係数２５を加える場合の処理では、背
景が１０％のグレー化される処理となる。更に、利用者
が個別に濃度を指定して、濃度を下げるようにしても良
い。また、背景をグレーにする方法などの個々の編集方
法をコントロールパネル部２８の操作により選択できる
ようにしても良い。このようにして、各々の画素の画像
データの値が書き換えられた画像データは、プリンタ機
構２６に送られて、プリントアウトされる（ステップ４
９）。Further, in the image conversion processing, an editing method may be used in which the background portion of the edit target area is gray instead of reducing the density of dots in the edit target area. In that case, instead of multiplying by a coefficient, for example, a coefficient 25 is added to the density value of the image data of all pixels in the editing target area (however, the maximum value of the added values does not exceed 255). To do. In the process of adding the coefficient 25, the background is grayed by 10%. Further, the user may individually specify the density to reduce the density. Further, an individual editing method such as a method of making the background gray may be selected by operating the control panel unit 28. The image data in which the value of the image data of each pixel is rewritten in this way is sent to the printer mechanism 26 and printed out (step 4).
9).

【００３７】このようにして、図５に示したような入力
文書の文書画像５１に対して、その文書中に２ヶ所の領
域の指定が行なわれ、その指定の領域に対しては編集を
行なわず、それ以外のの背景のみをグレー変換すると、
その結果、図１１に示すような出力文書の文書画像６７
が得られる。In this way, for the document image 51 of the input document as shown in FIG. 5, two areas are designated in the document, and the designated areas are edited. If you convert only the other background to gray,
As a result, the document image 67 of the output document as shown in FIG.
Is obtained.

【００３８】ところで、前述した第１の実施例において
は、利用者がポインティングデバイス，ディジタイザな
どの座標入力装置によって、入力文書の文書画像５１に
対して、例えば、濃度変換を行うための編集対象以外の
領域を指定し、その編集対象以外を指定した領域の指定
により、逆に編集対象領域を判定して、入力画像に対す
る編集処理を施すものとなっている。このように、その
編集対象以外を指定する領域の指定を行うことにより、
編集対象領域の指定を行い、文書の周辺部までの編集対
象とする領域を正確に指定することができる。しかし、
このような領域の指定は、利用者による手操作による領
域の指定であり、手操作の入力ではその操作が煩雑にな
る。これに対して、従来からの技術を利用して、文字領
域または図表領域などの文書画像の特徴を自動的に判定
して、編集対象の領域の指定（編集対象以外の指定をも
含めて）を行うように構成してもよい。このような変形
例を第２の実施例として説明する。By the way, in the above-described first embodiment, the user uses a coordinate input device such as a pointing device or a digitizer to edit the document image 51 of the input document, for example, except for an object to be edited for density conversion. The area to be edited is designated, and the area other than the editing target is designated to determine the editing target area, and the editing process is performed on the input image. In this way, by specifying the area other than the editing target,
By specifying the edit target area, the edit target area up to the peripheral portion of the document can be accurately specified. But,
The designation of such an area is the designation of the area by the user's manual operation, and the manual operation makes the operation complicated. On the other hand, using the conventional technology, the characteristics of the document image such as the character area or the chart area are automatically determined, and the area to be edited is specified (including the specification other than the object to be edited). May be configured to perform. Such a modification will be described as a second embodiment.

【００３９】（第２の実施例）図１３は、本発明の第２
の実施例の文書画像処理装置の基本構成を示すブロック
図である。図１３において、２０１は編集対象文書画
像、２０２は領域分割処理部、２０３は最下位要素計数
部、２０４は長文領域判定部、２０５は文書画像変換処
理部、２０６は編集後文書画像である。(Second Embodiment) FIG. 13 shows a second embodiment of the present invention.
3 is a block diagram showing the basic configuration of the document image processing apparatus of the embodiment. FIG. In FIG. 13, 201 is an edit target document image, 202 is a region division processing unit, 203 is a lowest element counting unit, 204 is a long sentence region determination unit, 205 is a document image conversion processing unit, and 206 is an edited document image.

【００４０】領域分割処理部２０２は、文書画像を入力
として、文字領域、図表領域などに分割する処理機能要
素である。すなわち、文書画像上の画素の集まりをそれ
ぞれの文書要素の意味のある塊として小領域に分割し、
その文書画像の領域の物理的性質を認識して文字領域お
よび図表領域などの領域に分割する処理を行う。このよ
うな機能要素の技術に関しては、つまり、文書画像を入
力し、その物理的な性質から文字領域または図形領域な
どに分割する技術に関しては、従来から公知となってい
る例えば特開昭６４−１５８８９号公報あるいは特公昭
６１−３２７１２号公報により提案されているような技
術を利用すれば良いので、ここでの詳細な説明は省略す
る。The area division processing unit 202 is a processing function element that receives a document image and divides it into a character area, a figure area and the like. That is, a collection of pixels on the document image is divided into small areas as meaningful blocks of each document element,
The physical property of the area of the document image is recognized and the processing is divided into areas such as a character area and a figure area. The technology of such a functional element, that is, the technology of inputting a document image and dividing it into a character area or a graphic area based on its physical properties is conventionally known, for example, Japanese Patent Laid-Open No. 64-64-. Since a technique such as that proposed in Japanese Patent No. 15889 or Japanese Patent Publication No. 61-32712 may be used, detailed description thereof will be omitted here.

【００４１】なお、ここでの領域分割処理部２０２で
は、それぞれの領域を矩形に分割して出力するが、文書
画像に対して、領域分割の処理を行う前に、傾き補正、
ノイズ除去等の前処理が行なわれる。また、この領域分
割処理部２０２においては、ここで分割された文書画像
の各領域において、更に、文字の要素、図形の要素、け
い線の要素などの種別が識別され、これらの要素の種別
も同時に付加して出力される。また、要素の種別のデー
タにより、更に、領域の特性を判定する。このような領
域が分割された結果として、領域分割処理部２０２から
出力される文書画像の要素の列は、文書のレイアウト構
造データと呼ばれる。The area division processing unit 202 divides each area into rectangular areas and outputs the rectangular areas. Before performing the area division processing on the document image, inclination correction,
Preprocessing such as noise removal is performed. Further, in the area division processing unit 202, types of character elements, graphic elements, ruled line elements, and the like are further identified in each area of the document image divided here, and the types of these elements are also identified. It is added and output at the same time. Further, the characteristic of the area is further determined by the data of the element type. The sequence of elements of the document image output from the region division processing unit 202 as a result of dividing the region is called layout structure data of the document.

【００４２】最下位要素計数部２０３は、領域分割処理
部２０２で得られたレイアウト構造データから、文字領
域の文字のブロックごとに最下位要素の領域（１文字に
対応する領域）の数を、文字数として計数する。各々の
領域について、その文字数が最下位要素計数部２０３に
より計数されると、計数された計数値により、長文領域
判定部２０４は、計数された領域数の平均値を計算し、
平均領域数より多い文字ブロックの領域を、編集対象領
域として判定する。この判定された結果の領域に対し
て、文書画像変換処理部２０５は、編集対象となる領域
の濃度やコントラストを変換し、出力画像を生成する。
編集後文書画像２０６として出力する処理機能要素であ
る。文書画像変換部２０５としては通常の画像形成処理
装置が利用される。From the layout structure data obtained by the area division processing section 202, the lowest element counting section 203 determines the number of lowest element areas (areas corresponding to one character) for each character block of the character area. Count as the number of characters. For each area, when the number of characters is counted by the lowest element counting unit 203, the long sentence area determination unit 204 calculates the average value of the counted number of areas, based on the counted value.
The area of the character block larger than the average number of areas is determined as the edit target area. The document image conversion processing unit 205 converts the density and contrast of the area to be edited with respect to the area of the result of this determination, and generates an output image.
It is a processing function element output as the edited document image 206. A normal image forming processing device is used as the document image converting unit 205.

【００４３】次に、このような各々の機能要素により構
成される第２の実施例にかかる文書画像処理装置を、デ
ジタルカラー複写機に適用した場合を例として説明す
る。前述した第１の実施例の説明と同様に、装置を構成
する各々の機能要素は、前述した公知例となっている特
開平２−２２３２７５号公報に記載されているディジタ
ルカラー複写機における各々の機能要素が利用できるの
で、個別の各々の機能要素についての説明は省略し、以
下の説明では、文書画像処理の動作に従い順を追って説
明する。Next, the case where the document image processing apparatus according to the second embodiment configured by each of the above functional elements is applied to a digital color copying machine will be described as an example. Similar to the description of the first embodiment described above, the respective functional elements constituting the apparatus are the same as those in the digital color copying machine described in the above-mentioned publicly known Japanese Patent Application Laid-Open No. 2-223275. Since the functional elements can be used, the description of each individual functional element is omitted, and in the following description, the operation will be sequentially described according to the operation of the document image processing.

【００４４】図１４はデジタルカラー複写機に適用した
第２の実施例の文書画像処理装置の要部の装置構成を説
明するブロック図である。図１４において、７０は文書
画像処理装置、７１はイメージスキャナ、７２は領域分
割モジュール、７３最下位要素計数モジュール、７４は
長文領域判別モジュール、７５は文書画像変換モジュー
ル、７６はプリンタ機構、７７は制御モジュール、７８
はコントロールパネル部である。FIG. 14 is a block diagram for explaining the arrangement of the essential parts of the document image processing apparatus of the second embodiment applied to a digital color copying machine. In FIG. 14, 70 is a document image processing apparatus, 71 is an image scanner, 72 is an area dividing module, 73 is the lowest element counting module, 74 is a long sentence area discrimination module, 75 is a document image converting module, 76 is a printer mechanism, and 77 is Control module, 78
Is the control panel section.

【００４５】コントロールパネル部７８は、利用者から
の変換指示を受け付ける機能要素であり、テンキーおよ
びファンクションキーからなるキーボード、およびディ
スプレイなどから構成される。制御モジュール７７はコ
ントロールパネル部７８に対するデータの入出力処理、
イメージスキャナ７１の起動処理、プリンタ機構７６の
起動処理などの制御処理を行なうため制御ユニットであ
る。制御ユニットには制御用のマイクロプロセッサが搭
載されており、マイクロプロセッサには、制御処理のプ
ログラムが内部にプログラムされている。これらは第１
の実施例で用いられているものと同様である。The control panel unit 78 is a functional element that receives a conversion instruction from the user, and is composed of a keyboard including ten keys and function keys, a display, and the like. The control module 77 inputs / outputs data to / from the control panel unit 78,
The control unit is for performing control processing such as activation processing of the image scanner 71 and activation processing of the printer mechanism 76. A microprocessor for control is installed in the control unit, and a program for control processing is internally programmed in the microprocessor. These are the first
Is the same as that used in the embodiment.

【００４６】図１５は、文書画像の文字領域を判別して
その領域に対する濃度変換を行う場合の処理を流れを示
すフローチャートである。図１５を参照して処理の概略
を説明する。電源が投入され、処理が開始されると、ま
ず、ステップ８０において、立ち上げ処理を行う。次
に、ステップ８１において、濃度変換モードが指定され
ているか否かを判定する。濃度変換モードが指定されて
いない場合には、ステップ８２に進み、通常の複写処理
を行う。そして、再び、ステップ８１に戻り、濃度変換
モードの指定を判定する。FIG. 15 is a flow chart showing the flow of processing in the case of discriminating a character area of a document image and performing density conversion for the area. The outline of the processing will be described with reference to FIG. When the power is turned on and the process is started, first, in step 80, a startup process is performed. Next, in step 81, it is determined whether the density conversion mode is designated. If the density conversion mode has not been designated, the process proceeds to step 82 and normal copying processing is performed. Then, the process returns to step 81 again, and the designation of the density conversion mode is determined.

【００４７】ステップ８１の判定により、濃度変換モー
ドが指定されていることが判定されると、ステップ８３
からの処理により、文書画像の文字領域を判別してその
領域に対する濃度変換処理を行う。この処理では、ま
ず、ステップ８３でスタートが指示されたか否かを判定
する。スタートの指示が判定されない場合は、再び、ス
テップ８３に戻り、再び、スタートが指示されたか否か
を判定する。つまり、スタートが指示されるまで待つダ
イナミックループに入る。When it is determined in step 81 that the density conversion mode is designated, step 83
By the processing from 1), the character area of the document image is discriminated and the density conversion processing is performed on the area. In this process, first, in step 83, it is determined whether or not the start is instructed. If the start instruction has not been determined, the process returns to step 83 again, and it is determined again whether or not the start instruction has been given. In other words, it enters a dynamic loop that waits until the start is instructed.

【００４８】ステップ８３において、スタートが指示さ
れたことが判定されると、次に、ステップ８４に進み、
原稿読み取りの処理を行う。次に、ステップ８５におい
て、文書画像の領域分割の処理を行い、レイアウト構造
データを取り出し、次のステップ８６において、分割し
た各々の領域に対して、最下位要素の領域を計数する文
字数計数の処理を行う。次に、ステップ８７において、
各々の領域毎に文字数として計数した計数値から長文領
域を判定する処理を行う。そして、次のステップ８８に
おいて、判定された長文領域を編集領域として文字画像
の濃度変換を行う文字画像変換の処理を行い、次のステ
ップ８９において、画像出力を行う。そして、ステップ
８１に戻り、次の文書画像に対して同様の処理の流れに
従って、ステップ８１から処理を繰り返し行う。When it is determined in step 83 that the start is instructed, the process proceeds to step 84,
Performs document scanning processing. Next, in step 85, the process of dividing the area of the document image is performed, the layout structure data is taken out, and in the next step 86, the process of counting the number of characters that counts the region of the lowest element for each divided region. I do. Then, in step 87,
A process of determining a long sentence area is performed from the count value counted as the number of characters for each area. Then, in the next step 88, a character image conversion process is performed in which the density of the character image is converted using the determined long text area as an editing area, and in the next step 89, an image is output. Then, the process returns to step 81, and the process is repeated from step 81 according to the same process flow for the next document image.

【００４９】次に、このようにして、編集対象の文書画
像の領域判定を行い、その領域判定の結果により部分的
に濃度変換を行う場合の処理について、具体的な文書画
像の処理例について説明する。図１６は、処理対象の文
書画像に対して領域判定を行なわれた結果の領域判別デ
ータの一例を示す図であり、図１７は、領域判定結果の
１つの文字ブロック領域における階層構造の判定結果と
入力文書との対応関係を説明する図である。また、図１
８は領域判別データの１つの領域における判定結果の階
層構造の領域データを示す図であり、図１９は各々の判
定領域毎に文字数として計数された計数データを格納す
る文字数テーブルを示す図である。また、図２０は、判
定された長文領域に対して最終的に濃度変換が行なわれ
た状態の文書画像の出力文書の一例を示す図である。な
お、文書画像（図１６および図２０）においては、前述
の場合と同様に、領域の位置を表わすため、位置の基準
として直交座標の座標値を用いるが、この座標軸は、図
中に示すように、ページ右に向かってｘ軸とし、ページ
下に向かってｙ軸とする。Next, with respect to the processing in the case where the area of the document image to be edited is determined and the density conversion is partially performed according to the result of the area determination, a specific processing example of the document image will be described. To do. FIG. 16 is a diagram showing an example of area determination data obtained as a result of performing area determination on a document image to be processed, and FIG. 17 is a determination result of a hierarchical structure in one character block area of the area determination result. It is a figure explaining the corresponding relationship between and an input document. Also, FIG.
8 is a diagram showing region data of a hierarchical structure of a determination result in one region of the region discrimination data, and FIG. 19 is a diagram showing a character number table storing count data counted as the number of characters for each determination region. . FIG. 20 is a diagram showing an example of an output document of a document image in a state in which the density conversion is finally performed on the determined long text area. In the document image (FIGS. 16 and 20), as in the case described above, since the position of the region is represented, the coordinate value of the Cartesian coordinate is used as the position reference, and this coordinate axis is as shown in the figure. In addition, the x-axis is set to the right of the page, and the y-axis is set to the bottom of the page.

【００５０】次に、これらの図１６〜図２０を参照して
説明する。電源が投入されると、制御モジュール７７が
立ち上げ処理を行ない、コントロールパネル部７８にお
いて初期画面を表示する（ステップ８０）。利用者が、
コントロールパネル部７８において所望する操作を行
い、例えば、濃度変換モードを指示する（「濃度変換」
ボタンを押す）と、濃度変換モードとなるが、そうでな
ければ、通常の複写処理を行なう。濃度変換モードにな
った後は、「スタート」を指示する複写スタートボタン
が押されるのを待つ（ステップ８１〜８３）。Next, description will be made with reference to FIGS. 16 to 20. When the power is turned on, the control module 77 performs a start-up process to display an initial screen on the control panel unit 78 (step 80). The user
A desired operation is performed on the control panel unit 78 to instruct, for example, a density conversion mode (“density conversion”).
When the button is pressed), the density conversion mode is entered. If not, normal copying processing is performed. After entering the density conversion mode, the process waits until the copy start button for instructing "start" is pressed (steps 81 to 83).

【００５１】次に、利用者が、原稿をプラテン上に置
き、複写スタートボタンを押すと、制御モジュール７７
がイメージスキャナ７１を起動する。イメージスキャナ
７１からは原稿画像が読み取られ、デジタル画像データ
として、文書画像処理装置７０の領域分割モジュール７
２に受け渡される。領域分割モジュール７２は、領域分
割の処理を行い、例えば、図５に示すような入力文書の
文書画像５１に対して、領域分割の処理を行う。その結
果、図１６に示すように、文書画像データ９１の各々の
領域に対して、文字がまとまって並んでいる領域（文字
ブロック領域）９２と、罫線が存在する領域（罫線領
域）９３と、これら以外の余白の領域（余白領域）９４
とに分割される。Next, when the user puts the original on the platen and presses the copy start button, the control module 77 is pressed.
Activates the image scanner 71. The original image is read from the image scanner 71, and as the digital image data, the area dividing module 7 of the document image processing apparatus 70.
Handed over to 2. The area division module 72 performs area division processing, for example, performs area division processing on the document image 51 of the input document as shown in FIG. As a result, as shown in FIG. 16, with respect to each area of the document image data 91, an area (character block area) 92 in which characters are arranged side by side, an area (ruled line area) 93 in which a ruled line exists, Other blank area (margin area) 94
Is divided into and

【００５２】ここで各々の領域を表現する矩形は、原稿
画像のｘ軸およびｙ軸の方向それぞれに平行な辺を持
ち、対象となる領域を囲む最小の矩形とする。この矩形
の領域を表現するデータは基本的に「種別，左上点ｘ座
標，左上点ｙ座標，幅，高さ」の５個のデータの組で表
現される。領域を表現する矩形のデータは、レイアウト
構造データのそれぞれの要素であり、それらの要素が階
層構造のデータとなっている。例えば、矩形の領域が文
字ブロック領域である場合は、図１７に示すように、文
字ブロック領域に対して、その領域内の下位の領域デー
タとして、レイアウト構造データの要素である文字行領
域のデータがあり、更に、文字行領域のデータに対して
は、その文字行領域内の下位の領域データとして、レイ
アウト構造データの最下位の要素である一文字ずつの文
字領域のデータがある。Here, the rectangle representing each area is the smallest rectangle that has sides parallel to the x-axis and y-axis directions of the original image and surrounds the target area. The data expressing this rectangular area is basically expressed by a set of five data of "type, upper left point x coordinate, upper left point y coordinate, width, height". The rectangular data representing the area is each element of the layout structure data, and these elements are hierarchical structure data. For example, when the rectangular area is the character block area, as shown in FIG. 17, the data of the character line area, which is an element of the layout structure data, is set as the lower area data in the character block area. Further, for the data in the character line area, there is character area data for each character, which is the lowest element of the layout structure data, as the lower area data in the character line area.

【００５３】例えば、図５に示すような入力文書の文書
画像５１に対して、領域分割により領域判定が行なわれ
た場合には、図１７に示すように、入力文書における文
字列“本報告は顧客の満足度を… ○○○○○”の領
域に対して、それぞれに分割された領域は、最上位の文
字ブロック領域１０１に対して、その下位の領域とし
て、文字行領域１０２があり、更に下位のレイアウト構
造データの要素として文字領域１０３の各々の領域が判
定され、それぞれの領域データに分割される。なお、図
１７では、階層構造１００となっている各々の領域デー
タに対応して、その本文の文字列部分と、その領域デー
タとを示している。For example, when the area determination is performed on the document image 51 of the input document as shown in FIG. 5 by the area division, as shown in FIG. 17, the character string "this report The customer satisfaction degree is divided into areas of "○○○○○", and the uppermost character block area 101 has a character line area 102 as a lower area thereof. Further, each area of the character area 103 is determined as an element of the lower layout structure data, and is divided into respective area data. Note that, in FIG. 17, the character string portion of the text and the area data thereof are shown corresponding to each area data having the hierarchical structure 100.

【００５４】このように領域分割された各々の階層構造
を有する領域データは、図１８に示すように、表形式の
レコードデータとして表現され、その各々の領域データ
が領域テーブル１０４に格納される。この領域テーブル
１０４の１つのレコードデータが１つの領域データを表
現している。ここでの各々の領域データのレコードデー
タは、前述したように「種別，左上点ｘ座標，左上点ｙ
座標，幅，高さ」の５個のデータの組から成る領域を表
わす各々のフィールドのデータに加えて、各々の領域に
対して、その階層構造を表現するため、更に、下位要素
個数フィールド１０５および下位要素開始番号フィール
ド１０６のフィールドデータが追加されている。この下
位要素個数フィールド１０５および下位要素開始番号フ
ィールド１０６の２つのフィールドデータによる階層構
造を順次に辿ることにより、１つの文字ブロック領域の
文字領域の個数が文字数として計数できる。The area data having each hierarchical structure thus divided into areas is expressed as tabular record data as shown in FIG. 18, and each area data is stored in the area table 104. One record data of this area table 104 represents one area data. As described above, the record data of each area data here is “type, upper left point x coordinate, upper left point y.
In addition to the data of each field that represents an area composed of a set of five data of "coordinate, width, and height", in order to express the hierarchical structure of each area, a subelement number field 105 Field data of the lower element start number field 106 is added. The number of character areas in one character block area can be counted as the number of characters by sequentially tracing the hierarchical structure of the two field data of the lower element number field 105 and the lower element start number field 106.

【００５５】つまり、最下位要素計数モジュール７３で
は、領域分割モジュール７２で得られたレイアウト構造
データの中から、文字ブロック領域ごとに最下位要素の
領域数を計数する処理を行う。ここの処理は、レイアウ
ト構造データから、その最上位の領域が文字ブロック領
域であるものについて、具体的には、図１８に示すよう
な領域テーブル１０４において、種別フィールドのデー
タが“文字ブロック”となっている領域のレコードデー
タについて、その階層構造から下位側の領域データの要
素を辿る。そして、最終的に、下位要素開始番号フィー
ルド１０６のデータが“０”となっている領域データ
（すなわち、それより下位にリンクする要素がない最下
位要素の領域データ）までの要素の数を計数する。That is, the lowest element counting module 73 performs a process of counting the number of lowest element areas for each character block area from the layout structure data obtained by the area dividing module 72. In this processing, the layout structure data whose topmost area is a character block area, specifically, in the area table 104 as shown in FIG. 18, the data of the type field is “character block” With respect to the record data in the specified area, the elements of the area data on the lower side are traced from the hierarchical structure. Then, finally, the number of elements up to the area data in which the data of the lower element start number field 106 is "0" (that is, the area data of the lowermost element that has no element linked below it) is counted. To do.

【００５６】図１８に示す領域テーブル１０４のデータ
の例で具体的に説明すると、番号フィールドの値が
“１”である文字ブロック領域の領域データは、下位要
素個数フィールド１０５および下位要素開始番号フィー
ルド１０６のデータにより、その下位の要素は、番号フ
ィールドの値が“１１”の文字行領域のみとなってい
る。更に、番号フィールドの値が“１１”の文字行領域
の領域データは、更に下位の要素が、番号フィールドの
値が“５０”から“５９”までの文字領域のデータであ
る。これらの文字領域の領域データは、それより下位に
リンクされる要素は存在しないため、最下位要素の数は
「１０個」と求められる。このようにして得られた各々
の文字ブロック領域毎に、文字数として計数された結果
は、そのレイアウト構造データと共に、長文領域判別モ
ジュール７４へと送られる。Explaining in detail with an example of the data of the area table 104 shown in FIG. 18, the area data of the character block area in which the value of the number field is "1" includes the lower element number field 105 and the lower element start number field. According to the data of 106, the lower element is only the character line area whose value in the number field is "11". Further, in the area data of the character line area having the value of the number field of "11", the lower-order elements are the data of the character area having the values of the number field of "50" to "59". In the area data of these character areas, since there are no elements linked below it, the number of lowest elements is calculated as "10". The result of counting the number of characters for each character block area thus obtained is sent to the long sentence area determination module 74 together with the layout structure data.

【００５７】長文領域判別モジュール７４では、送られ
てきた文字数のデータとそのレイアウト構造のデータ
を、図１９に示すように、文字数テーブル１０７に、各
々の文字ブロック領域毎にその文字数のデータを格納す
る。そして、得られた各領域毎の文字数の平均値を計算
し、計算した平均値よりも文字数が多い文字ブロック領
略を長文領域と判定する。図１９に示す例では、文字ブ
ロック番号が“５”および“７”である２つの領域が長
文領域と判定され、その判定結果フィールドには判定フ
ラグが立てられる。In the long sentence area discrimination module 74, the data of the number of characters and the data of its layout structure which have been sent are stored in the character number table 107 for each character block area, as shown in FIG. To do. Then, the average value of the number of characters obtained for each area is calculated, and the character block region having the larger number of characters than the calculated average value is determined as the long sentence area. In the example shown in FIG. 19, two areas having character block numbers “5” and “7” are determined to be long sentence areas, and a determination flag is set in the determination result field.

【００５８】このように判定された長文領域を画像編集
領域として、ここでの濃度変換が行われる。例えば、図
５に示すような入力文書の文書画像５１に対して、領域
分割が行なわれ、分割された結果の文字ブロック領域に
対して、２つの長文領域が判定され、図１９に示すよう
な判定結果のデータが得られる。そして、長文領域であ
ると判定された文字ブロック領域に対して、その領域デ
ータ「左上点ｘ座標，左上点ｙ座標，幅，高さ」で示さ
れる領域が、編集領域とされる。次に、第１の実施例の
場合と同様に、編集領域とされた領域の範囲内の画素の
画像データに、編集対象領域フラグを追加し、長文領域
であると判定された文字ブロック領域の編集対象領域フ
ラグを“１”とする処理を行なった後、その画像データ
を文書画像変換モジュール７５に受け渡す。The long sentence area thus determined is used as the image editing area, and the density conversion is performed here. For example, the document image 51 of the input document as shown in FIG. 5 is divided into regions, and two long sentence regions are determined for the divided character block region, as shown in FIG. Data of the judgment result is obtained. Then, for the character block area determined to be the long sentence area, the area indicated by the area data “upper left point x coordinate, upper left point y coordinate, width, height” is set as the editing area. Next, as in the case of the first embodiment, the edit target area flag is added to the image data of the pixels within the range of the edit area, and the character block area of the character block area determined to be the long sentence area is added. After performing the process of setting the edit target area flag to “1”, the image data is transferred to the document image conversion module 75.

【００５９】文書画像変換モジュール７５では、第１の
実施例と同様の処理を行ない、編集対象領域フラグが
“１”となっている領域の画像データに対して、前述し
たように所定の係数（０．６）を掛けて濃度を下げる
か、または、それぞれの領域の内の画像データの所定の
値を加算して、その背景をグレーにする濃度変換を行
う。このようにして、画像変換を行った後の画像データ
は、プリンタ機構７６に受け渡され、出力文書の文書画
像とし出力される。この結果、図５に示すような入力文
書の文書画像５１に対して、長文領域が判定され、その
領域の背景をグレーにする編集が行なわれた場合、図２
０に示すように、文字列のまとまった領域がグレーに編
集された結果の出力文書の文書画像１０８が出力され
る。The document image conversion module 75 performs the same processing as that of the first embodiment, and applies the predetermined coefficient (as described above) to the image data of the area in which the edit target area flag is "1". 0.6) is applied to reduce the density, or a predetermined value of the image data in each area is added to perform density conversion to make the background gray. In this way, the image data after the image conversion is transferred to the printer mechanism 76 and output as a document image of an output document. As a result, when the long sentence area is determined and the background image of the area is edited to be gray in the document image 51 of the input document as shown in FIG.
As shown in 0, the document image 108 of the output document obtained as a result of the area in which the character string is collected being edited in gray is output.

【００６０】以上に説明した第２の実施例においては、
文書画像の編集領域の指示を、従来からの技術を利用し
て、文字領域または図表領域などの文書画像の物理的な
特徴を自動的に判定し、編集対象の領域の指定（編集対
象以外の指定をも含めて）を行うように構成したもので
あったが、この文書画像の編集領域の指示を、文字領域
または図表領域などの文書画像の物理的な特徴を判定
し、更に、編集する入力文書の文書画像に特有の論理的
な特徴から、その領域指定を行うように変形しても良
い。その場合の論理的な特徴付けは、例えば、レイアウ
ト構造データの各々要素に対応して、予じめ「タイト
ル」，「著者」，「本文」などの論理的な意味から領域
を対応づけをしておく。次に、このような変形例の文書
画像処理装置を第３の実施例として説明する。In the second embodiment described above,
By using the conventional technology, the physical characteristics of the document image such as the character area or the chart area are automatically determined to specify the editing area of the document image, and the area to be edited is specified (other than the editing object). The specification of the editing area of the document image is performed, and the physical characteristics of the document image such as the character area or the chart area are determined and further edited. The area may be modified based on the logical characteristics peculiar to the document image of the input document. The logical characterization in that case is, for example, by associating regions with logical meanings such as “title”, “author”, “text”, etc., corresponding to each element of the layout structure data. Keep it. Next, a document image processing apparatus of such a modification will be described as a third embodiment.

【００６１】（第３の実施例）図２１は、本発明の第３
の実施例の文書画像処理装置の基本構成を示すブロック
図である。図２１において、３０１は編集対象文書画
像、３０２は領域分割処理部、３０３は論理識別子付与
部、３０４は編集領域判別部、３０５は文書画像変換処
理部、３０６は編集後文書画像である。(Third Embodiment) FIG. 21 shows the third embodiment of the present invention.
3 is a block diagram showing the basic configuration of the document image processing apparatus of the embodiment. FIG. In FIG. 21, reference numeral 301 is an edit target document image, 302 is an area dividing processing unit, 303 is a logical identifier assigning unit, 304 is an editing area determining unit, 305 is a document image conversion processing unit, and 306 is an edited document image.

【００６２】領域分割処理部３０２は、第２の実施例で
説明して領域分割処理部２０２と同様なものであり、文
書画像を入力として、文字領域、図表領域などに分割す
る処理機能要素である。この場合においても、領域分割
処理部３０２は、分割された文書画像の各領域におい
て、更に、文字の要素、図形の要素、けい線の要素など
を識別すると共に、これらの要素の種別も領域データに
付加したレイアウト構造データを出力する。The area division processing unit 302 is the same as the area division processing unit 202 described in the second embodiment, and is a processing function element for dividing a document image into a character area, a figure area and the like. is there. Also in this case, the area division processing unit 302 further identifies character elements, graphic elements, ruled line elements, and the like in each area of the divided document image, and the types of these elements are also area data. The layout structure data added to is output.

【００６３】論理識別子付与部３０３は、領域分割処理
部３０２で得られたレイアウト構造データの要素を入力
とし、それぞれに「タイトル」，「著者」，「本文」な
どの論理的に意味付ける識別子を付与する処理機能要素
である。具体的には、例えば特開平５−１５９１０１号
公報に記載されたような技術を利用する。この論理識別
子付与部３０３は、認識対象のレイアウト構造と論理構
造の対応を表わす文書構造モデルをあらかじめ登録して
おき、領域分割の結果データと構造モデルとのマッチン
グによってそれぞれの領域に対して論理的な意味を表現
する名称を付与する。The logical identifier assigning unit 303 receives the elements of the layout structure data obtained by the area division processing unit 302 as inputs, and assigns logically meaningful identifiers such as “title”, “author”, and “text” to each. It is a processing function element to be given. Specifically, for example, the technique described in Japanese Patent Laid-Open No. 5-159101 is used. The logical identifier assigning unit 303 registers in advance a document structure model that represents the correspondence between the layout structure to be recognized and the logical structure, and logically assigns a logical structure to each area by matching the area division result data with the structural model. Assign a name that expresses the meaning.

【００６４】編集領域判別部３０４は、論理識別子付与
部３０３において付与された論理識別子に基づいて編集
対象となる領域を決定する機能要素である。例えば、文
書画像におけるレイアウト構造から、タイトルなどの大
きな文字部分を除き、小さい文字部分だけ濃度を薄くす
る編集を行う場合、文書構造の「本文」，「ページ番
号」，「ヘッダ」，「脚注」，「著者」などの論理構造
の名称により、編集対象する要素を指定しておく。これ
により、編集領域判別部３０４は、それに対応して文書
画像中の編集領域を判定し、編集領域指示フラグを設定
する。このように判定された結果の編集領域に対して、
文書画像変換処理部３０５は、編集対象とされた領域の
濃度やコントラストを変換し、出力文書の文書画像を生
成し、編集後文書画像３０６として出力する機能要素と
なっている。これは第１の実施例および第２の実施例の
ものと同様である。The editing area discriminating unit 304 is a functional element that determines an area to be edited based on the logical identifier given by the logical identifier giving unit 303. For example, when editing the layout structure of a document image to remove large characters such as titles and lighten the density of only small characters, "text", "page number", "header", "footnote" of the document structure , The element to be edited is specified by the name of the logical structure such as "author". As a result, the edit area determination unit 304 determines the edit area in the document image correspondingly, and sets the edit area instruction flag. For the editing area of the result determined in this way,
The document image conversion processing unit 305 is a functional element that converts the density and contrast of the area to be edited, generates a document image of the output document, and outputs the edited document image 306. This is similar to that of the first and second embodiments.

【００６５】次に、このような各々の機能要素により構
成される第３の実施例の文書画像処理装置を、前述の場
合と同様に、デジタルカラー複写機に適用した場合を例
として、実際の動作例で説明する。前述した第１の実施
例の説明と同様に、装置を構成する各々の機能要素は、
前述した公知例となっている特開平２−２２３２７５号
公報に記載されているディジタルカラー複写機における
各々の機能要素が利用できるので、個別の各々の機能要
素についての説明は省略し、以下の説明では、文書画像
処理の動作に従い順を追って説明する。Next, as an example, the document image processing apparatus of the third embodiment constituted by the respective functional elements as described above is applied to a digital color copying machine as in the case described above. An operation example will be described. Similar to the above description of the first embodiment, each functional element constituting the device is
Since the respective functional elements in the digital color copying machine described in the above-mentioned publicly known Japanese Patent Application Laid-Open No. 2-223275 can be used, description of each individual functional element will be omitted, and the following description will be given. Now, description will be made step by step according to the operation of the document image processing.

【００６６】図２３はデジタルカラー複写機に適用した
第３の実施例の文書画像処理装置の要部の装置構成を説
明するブロック図である。図２３において、１２０は文
書画像処理装置、１２１はイメージスキャナ、１２２は
領域分割モジュール、１２３は論理識別子付与モジュー
ル、１２４は編集領域判定モジュール、１２５は文書画
像変換モジュール、１２６はプリンタ機構、１２７は制
御モジュール、１２８はコントロールパネル部である。FIG. 23 is a block diagram for explaining the arrangement of the essential parts of the document image processing apparatus of the third embodiment applied to a digital color copying machine. In FIG. 23, 120 is a document image processing device, 121 is an image scanner, 122 is an area dividing module, 123 is a logical identifier assigning module, 124 is an editing area determination module, 125 is a document image converting module, 126 is a printer mechanism, 127 is a The control module 128 is a control panel unit.

【００６７】コントロールパネル部１２８は、利用者か
らの変換指示を受け付ける機能要素であり、テンキーお
よびファンクションキーからなるキーボード、およびデ
ィスプレイなどから構成される。制御モジュール１２７
はコントロールパネル部１２８に対するデータの入出力
処理、イメージスキャナ１２１の起動処理、プリンタ機
構１２６の起動処理などの制御処理を行なうため制御ユ
ニットである。これらの第１の実施例で用いられている
ものと同様である。The control panel unit 128 is a functional element that receives a conversion instruction from the user, and is composed of a keyboard including ten keys and function keys, a display, and the like. Control module 127
Is a control unit for performing control processing such as data input / output processing for the control panel unit 128, image scanner 121 startup processing, and printer mechanism 126 startup processing. It is similar to that used in these first embodiments.

【００６８】図２４は、文書画像の各領域を論理識別子
により判別してその領域に対する濃度変換を行う場合の
処理を流れを示すフローチャートである。図２４を参照
して処理の概略を説明する。電源が投入され、処理が開
始されると、まず、ステップ１３０において、立ち上げ
処理を行う。次に、ステップ１３１において、濃度変換
モードが指定されているか否かを判定する。濃度変換モ
ードが指定されていない場合には、ステップ１３２に進
み、通常の複写処理を行う。そして、再び、ステップ１
３１に戻り、濃度変換モードの指定を判定する。FIG. 24 is a flow chart showing the flow of processing in the case where each area of a document image is discriminated by a logical identifier and density conversion is performed on that area. The outline of the processing will be described with reference to FIG. When the power is turned on and the process is started, first, in step 130, a startup process is performed. Next, in step 131, it is determined whether the density conversion mode is designated. If the density conversion mode is not designated, the routine proceeds to step 132, where normal copying processing is performed. And again, step 1
Returning to step 31, the designation of the density conversion mode is judged.

【００６９】ステップ１３１の判定により、濃度変換モ
ードが指定されていることが判定されると、ステップ１
３３からの処理により、文書画像の各領域を論理識別子
により判別してその領域に対する濃度変換処理を行う。
この処理では、まず、ステップ１３３でスタートが指示
されたか否かを判定する。スタートの指示が判定されな
い場合は、再び、ステップ１３３に戻り、再び、スター
トが指示されたか否かを判定し、スタートが指示される
まで待つ。If it is determined in step 131 that the density conversion mode is designated, step 1
By the processing from 33, each area of the document image is discriminated by the logical identifier and the density conversion processing is performed on that area.
In this process, first, it is determined in step 133 whether or not the start is instructed. If the start instruction is not determined, the process returns to step 133 again, it is determined again whether or not the start instruction is given, and the process waits until the start instruction is given.

【００７０】ステップ１３３において、スタートが指示
されたことが判定されると、次に、ステップ１３４に進
み、原稿読み取りの処理を行う。次に、ステップ１３５
において、文書画像の領域分割の処理を行い、レイアウ
ト構造データを取り出し、次のステップ１３６におい
て、取り出したレイアウト構造データに対して論理識別
子を付与する処理を行う。つまり、取り出されたレイア
ウト構造データと、保持している構造モデルとのマッチ
ングを行い、レイアウト構造の最上位の要素がどのよう
な文書構造における論理的な意味を持つかを示す識別子
を与える。この識別子は、「タイトル」，「著者」，
「サブタイトル」，「本文」，「脚注」などの文書構造
の要素を意味する識別子である。When it is determined in step 133 that the start is instructed, the process proceeds to step 134, and the document reading process is performed. Then, step 135
In step 1, the document image is divided into areas, the layout structure data is extracted, and in step 136, a process of assigning a logical identifier to the extracted layout structure data is performed. That is, the extracted layout structure data is matched with the held structure model, and an identifier indicating what kind of document structure the topmost element of the layout structure has a logical meaning is given. This identifier is "title", "author",
It is an identifier that means an element of the document structure such as “subtitle”, “text”, and “footnote”.

【００７１】次に、ステップ１３７において、各々の領
域に付与された識別子に応じて、編集する領域を判別す
る処理を行う。そして、次のステップ１３８において、
判定された各々の編集領域に対して文字画像の濃度変換
を行う文字画像変換の処理を行い、次のステップ１３９
において、画像出力を行う。そして、ステップ１３１に
戻り、次の文書画像に対して同様の処理の流れに従っ
て、ステップ１３１から処理を繰り返し行う。Next, in step 137, a process of discriminating the region to be edited is performed according to the identifier given to each region. Then, in the next step 138,
A character image conversion process for converting the density of the character image is performed on each of the determined editing areas, and the following step 139 is performed.
At, the image is output. Then, the process returns to step 131, and the process is repeated from step 131 on the next document image according to the same process flow.

【００７２】このようにして、編集対象の文書画像の領
域分割を行い、その分割した領域に付与した論理的な識
別子による判定した領域に対して濃度変換を行う場合の
処理について、具体的な文書画像の処理例について説明
する。図２２は、文書構造の論理識別子の一例を示す図
であり、また、図２５は、文書画像の各領域に対応付け
る構造文書モデルの一例を示す図である。図２６は、レ
イアウト構造データの要素に対応づけた論理識別子との
対応関係を示す図である。また、図２７は、編集対象の
候補として抽出する領域を論理識別子により指定する編
集対象判定データの一例を示す図である。図２８は、論
理識別子により判定された領域判定データの一例を示す
図であり、図２９は、領域判定データによる指定された
領域に対して最終的に濃度変換が行なわれた状態の文書
画像の出力文書の一例を示す図である。なお、前述の場
合と同様に、領域の位置を表わすため、位置の基準とし
て直交座標の座標値を用いるが、この座標軸は、図２０
に示すように、文書画像のページ右に向かってｘ軸と
し、ページ下に向かってｙ軸とする。In this manner, the process of dividing the area of the document image to be edited and performing the density conversion on the area determined by the logical identifier given to the divided area will be described below. An example of image processing will be described. FIG. 22 is a diagram showing an example of a logical identifier of the document structure, and FIG. 25 is a diagram showing an example of a structural document model associated with each area of the document image. FIG. 26 is a diagram showing the correspondence relationship with the logical identifiers associated with the elements of the layout structure data. In addition, FIG. 27 is a diagram illustrating an example of edit target determination data that specifies an area to be extracted as a candidate for an edit target by a logical identifier. FIG. 28 is a diagram showing an example of area determination data determined by a logical identifier, and FIG. 29 shows a document image in a state in which density conversion is finally performed on an area designated by the area determination data. It is a figure which shows an example of an output document. Note that, as in the case described above, in order to represent the position of the area, the coordinate values of the Cartesian coordinates are used as the reference of the position.
As shown in, the x-axis is set to the right of the page of the document image, and the y-axis is set to the bottom of the page.

【００７３】これらの図２２，図２５〜図２８を参照し
て説明する。電源が投入されると、制御モジュール１２
７が立ち上げ処理を行ない、コントロールパネル部１２
８で初期画面を表示する（ステップ１３０）。利用者が
コントロールパネル部１２８において操作を行い、濃度
変換モードを指示する（「濃度変換」ボタンを押す）
と、濃度変換モードとなるが、そうでなければ、通常の
複写処理を行なう。濃度変換モードになった後は、「ス
タート」ボタンが押されるのを待つ（ステップ１３１〜
１３３）。This will be described with reference to FIGS. 22 and 25 to 28. When the power is turned on, the control module 12
7 performs the startup process, and the control panel unit 12
The initial screen is displayed at 8 (step 130). The user operates the control panel unit 128 to instruct the density conversion mode (press the "density conversion" button).
Then, the density conversion mode is set, but if not, normal copy processing is performed. After entering the density conversion mode, wait until the "start" button is pressed (steps 131 to 131).
133).

【００７４】利用者が編集対象文書の原稿をプラテン上
に置き、「スタート」ボタンを押すと、制御モジュール
１２７がイメージスキャナ１２１を起動し、原稿の文書
画像が読み取られ、デジタル画像データとされて、領域
分割モジュール１２２に受け渡される（ステップ１３
４）。領域分割モジュール１２２では、第２の実施例と
同様に、文書画像に対して領域分割の処理を行ない、そ
の処理結果のレイアウト構造データを論理識別子付与モ
ジュール１２３に受け渡す（ステップ１３５）。When the user places the manuscript of the document to be edited on the platen and presses the "start" button, the control module 127 activates the image scanner 121, and the document image of the manuscript is read and converted into digital image data. , And is passed to the area division module 122 (step 13).
4). As in the second embodiment, the area division module 122 performs area division processing on the document image and transfers the layout structure data resulting from the processing to the logical identifier assignment module 123 (step 135).

【００７５】論理識別子付与モジュール１２３は、受け
取ったレイアウト構造データと保持している文書構造モ
デル（図２５）とのマッチングを行い、レイアウト構造
データの階層構造の分割領域データの最上位の要素が、
文書構造において、どのような論理的な意味を持つかを
示す識別子を与える（ステップ１３６）。The logical identifier assigning module 123 matches the received layout structure data with the document structure model (FIG. 25) held therein, and the highest element of the divided area data of the hierarchical structure of the layout structure data is
An identifier indicating what kind of logical meaning it has in the document structure is given (step 136).

【００７６】ここでの識別子は、例えば、図２２に示す
ように、文書構造に対する各々の構成要素を意味する
「タイトル」，「著者」，「サブタイトル」，「本
文」，「脚注」，「脚注罫」などに対して、それぞれを
区別するフラグのデータである。また、レイアウト構造
データにおける最上位の要素とは、レイアウト構造デー
タのどの要素に対しても下位要素となっていない要素で
あり、例えば、階層構造の領域データである文字ブロッ
ク領域は、最上位の文字ブロック全体でひとつの識別子
が付与される。したがって、その更に下位の要素である
文字行領域や、文字領域は個々には、論理識別子が付与
される対象とならない。The identifiers here are, for example, as shown in FIG. 22, "title", "author", "subtitle", "text", "footnote", "footnote" which means each constituent element for the document structure. It is data of a flag that distinguishes each of "ruled lines" and the like. The highest element in the layout structure data is an element that is not a lower element than any of the elements in the layout structure data. For example, the character block area that is the area data of the hierarchical structure is the highest element. One identifier is assigned to the entire character block. Therefore, the character line area and the character area, which are subordinate elements, are not the targets to which the logical identifiers are given individually.

【００７７】通常、ある範囲で流通する文書（所定形式
を有する論文，報告書，事務連絡文書など）では、「タ
イトル」，「著者」は上下配置になっているなど、ある
程度は固定的なデザインとなっている文書が多い。ここ
では、この性質を利用して文書構造に対する論理識別子
を付与する。つまり、文書構造モデルとして文書画像の
各々の領域データに対する構造（領域の配置）を登録し
ておき、文書画像の領域分割により得られたレイアウト
構造データの各要素の領域データと、文書構造モデルに
おける領域の配置とのマッチングを行い、対応が付けら
れれた場合に論理識別子を付与する。すなわち、この文
書構造モデルとは、処理対象となる文書のデザインにつ
いてのテンプレートであり、これらのテンプレートとな
る文書構造モデルはＲＯＭ等にあらかじめ複数の種類を
登録しておく。Usually, in a document distributed in a certain range (paper having a predetermined format, report, office communication document, etc.), the “title” and the “author” are vertically arranged, and the design is fixed to some extent. There are many documents. Here, this property is used to assign a logical identifier to the document structure. That is, the structure (arrangement of areas) for each area data of the document image is registered as the document structure model, and the area data of each element of the layout structure data obtained by area division of the document image and the area data of the document structure model are registered. It matches with the arrangement of the areas and gives a logical identifier when the correspondence is established. That is, the document structure model is a template for designing a document to be processed, and a plurality of types of document structure models serving as these templates are registered in a ROM or the like in advance.

【００７８】文書構造モデルは、具体的には、図２５に
示すように、文書画像の要素の種別（文字ブロック、文
字行、けい線など）と対応する論理名称を持つノード
と、これら要素間の相対的な位置関係を示すリンク情報
とにより表われるグラフ構造のデータとする。その場
合、識別子付与モジュール１２３におけるマッチング処
理では、登録してある１つの文書構造モデルとマッチす
るか否かを判定する処理を順次に行う。つまり、登録さ
れている文書構造モデルを１つずつ取り出し、マッチン
グ処理が成功するまで順次にマッチング処理を実行す
る。もし、全ての文書構造モデルとのマッチングが失敗
した場合には、入力文書に対する編集処理は処理不可能
である旨のメッセージをコントロールパネル部１２８に
表示し、以後の処理を何も行なわずに終了する。なお、
この場合、以降の処理では、例えば、第１の実施例で説
明したように、手動での領域指定による編集処理を行う
ようにしても良い。Specifically, as shown in FIG. 25, the document structure model is composed of nodes having logical names corresponding to the types of document image elements (character blocks, character lines, ruled lines, etc.), and between these elements. The data has a graph structure represented by link information indicating the relative positional relationship of In that case, in the matching process in the identifier assigning module 123, a process of determining whether or not it matches one registered document structure model is sequentially performed. That is, the registered document structure models are taken out one by one, and the matching process is sequentially executed until the matching process is successful. If the matching with all the document structure models fails, a message indicating that the edit processing for the input document cannot be displayed is displayed on the control panel unit 128, and the subsequent processing is ended without performing any processing. To do. In addition,
In this case, in the subsequent processing, for example, the editing processing by manually specifying the area may be performed as described in the first embodiment.

【００７９】ある文書構造モデルとのマッチング処理が
成功した場合は、レイアウト構造データにおける各々の
要素の領域データは、文書構造モデルの各ノードとの対
応が付けられるので、その処理結果は、図２６に示すよ
うに、論理識別子付与テーブル１４０に格納される。論
理識別子付与テーブル１４０は、要素番号フィールド１
４１と論理識別子フィールド１４２から構成されてお
り、レイアウト構造データの各々の要素番号のデータに
対応してその論理識別子が対応付けられたデータテーブ
ルである。ここでの論理識別子が付与された論理識別子
付与テーブル１４０のレイアウト構造データは、編集領
域判別モジュール１２４に受け渡される。When the matching process with a certain document structure model is successful, the area data of each element in the layout structure data is associated with each node of the document structure model, and the processing result is as shown in FIG. As shown in FIG. The logical identifier assignment table 140 has an element number field 1
41 and a logical identifier field 142, and is a data table in which the logical identifier is associated with the data of each element number of the layout structure data. The layout structure data of the logical identifier assignment table 140 to which the logical identifier is assigned here is transferred to the editing area determination module 124.

【００８０】編集領域判別モジュール１２４は、図２７
に示すように、各々の論理識別子に対応して編集を行な
う候補を判定する判定規準を示す編集対象判定データ１
５０を予じめ記憶しており、この編集対象判定データ１
５０に基づいて、編集対象となるレイアウト構造データ
の要素を判別する。例えば、図２６に示すようなレイア
ウト構造データの要素番号に対して論理識別子が付与さ
れた論理識別子付与テーブル１４０のデータが、編集領
域判別モジュール１２４に渡され、図２７に示すような
編集対象判定データ１５０に基づいて、編集対象とする
領域が判定された場合、図２８に示すような領域判定結
果データ１６０が得られる（ステップ１３７）。The editing area discrimination module 124 is shown in FIG.
As shown in FIG. 2, edit target judgment data 1 showing a judgment criterion for judging a candidate to be edited corresponding to each logical identifier.
50 is stored in advance, and this edit target judgment data 1
Based on 50, the element of the layout structure data to be edited is determined. For example, the data of the logical identifier assignment table 140 in which the logical identifier is assigned to the element number of the layout structure data as shown in FIG. 26 is passed to the edit area determination module 124, and the edit target determination as shown in FIG. 27 is made. When the area to be edited is determined based on the data 150, area determination result data 160 as shown in FIG. 28 is obtained (step 137).

【００８１】この領域判定結果データ１６０による判別
の結果により、レイアウト構造データの領域データに対
して、第２の実施例と同様に、編集対象となる文字ブロ
ック領域の「左上点ｘ座標、左上点ｙ座標、幅、高さ」
で示される領域データから、編集領域が抽出される。文
書画像の編集領域の指定の処理では、第１の実施例の場
合と同様に、画素の画像データに編集対象領域フラグを
追加し、編集対象となる領域の編集対象領域フラグを
“１”とする処理を行なった後、文書画像の画像データ
を文書画像変換モジュール１２５に受け渡す。Based on the result of the determination by the area determination result data 160, the "upper left point x coordinate, upper left point" of the character block area to be edited is set for the area data of the layout structure data, as in the second embodiment. y coordinate, width, height "
The edit area is extracted from the area data indicated by. In the process of designating the edit area of the document image, as in the case of the first embodiment, the edit area flag is added to the image data of the pixel, and the edit area flag of the area to be edited is set to "1". After performing the processing, the image data of the document image is transferred to the document image conversion module 125.

【００８２】文書画像変換モジュール１２５では、第１
の実施例の場合と同様に、編集対象領域フラグが“１”
である領域の画像データに対して係数を掛けて、濃度を
下げるか、または、画像データに所定数の値を加算して
背景をグレーにする（ステップ１３８）。この変換後の
画像データはプリンタ機構１２６に受け渡され、出力文
書の画像として出力される（ステップ１３９）。例え
ば、図５に示すような入力文書の文書画像５１に対し
て、その文書構造から編集対象の領域を指示して文書画
像変換を行った場合、その編集結果の出力文書の画像
は、図２９に示すような文書画像１７０として出力され
る。In the document image conversion module 125, the first
As in the case of the embodiment described above, the edit target area flag is "1".
The image data of the area is multiplied by a coefficient to reduce the density, or a predetermined number of values are added to the image data to make the background gray (step 138). The converted image data is delivered to the printer mechanism 126 and output as an image of an output document (step 139). For example, when a document image 51 of an input document as shown in FIG. 5 is subjected to document image conversion by designating an area to be edited from the document structure, the image of the output document of the edited result is as shown in FIG. The document image 170 is output as shown in FIG.

【００８３】図２９に示すような出力文書の文書画像１
７０では、図２７に示す編集対象判定データ１５０に基
づいて、編集対象とする領域を判定し、その判定された
領域に対して画像変換が行われた結果となっている。つ
まり、「本文」，「ページ番号」，「ヘッダ」，「脚
注」，「著者」の論理識別子を編集対象として指定する
指示内容を持つ編集対象判定データ１５０により、それ
ぞれの編集領域が判定され、その論理識別子を持つ要素
（レイアウト構造データの要素）の領域データに従っ
て、その背景がグレーにされた出力文書の文書画像の例
となっている。A document image 1 of an output document as shown in FIG.
At 70, the region to be edited is determined based on the edit target determination data 150 shown in FIG. 27, and the image conversion is performed on the determined region. That is, each edit area is determined by the edit target determination data 150 having the instruction content that specifies the logical identifiers of “text”, “page number”, “header”, “footnote”, and “author” as the edit target, This is an example of the document image of the output document whose background is grayed out according to the area data of the element having the logical identifier (element of the layout structure data).

【００８４】なお、この実施例の説明では、指定された
論理識別子を持つ文字ブロック領域を編集対象とした
が、編集対象領域を決定する場合、例えば、第１の実施
例と同様に、「タイトル」，「サブタイトル」などの編
集候補とならない論理識別子から、それらの論理識別子
を持つ要素以外を編集対象とするようにしても良い。ま
た、編集しない候補の論理識別子の指定により、それ以
外の論理識別子を持つ要素と余白部分を加えた領域を編
集対象とするようにも変形できる。In the description of this embodiment, the character block area having the designated logical identifier is set as the edit target. However, when the edit target area is determined, for example, as in the first embodiment, "title ], “Subtitle”, and other logical identifiers that are not candidates for editing, elements other than those having those logical identifiers may be edited. In addition, by designating a logical identifier of a candidate that is not edited, it is possible to modify the region to which an element having a logical identifier other than that and a margin portion is added as an editing target.

【００８５】[0085]

【発明の効果】以上に、説明したように、本発明の文書
画像処理装置によれば、簡単な操作によって、文書画像
の一部分についてその画像の濃度を下げたり、背景の明
度を下げたりすることができる。また、編集対象としな
い領域の指定により、編集する領域を指定することによ
り編集対象領域の指示が簡単になり、その他，編集処理
の指定が不要になり、操作が簡単になる。また、文書画
像における文字領域または図面領域などの物理的な特徴
から長文領域を自動判別して、編集対象の処理を行うよ
うに構成できるため、領域を指示する操作が不要にな
る。更に、予じめ論理識別子による指定により「本
文」，「注」などの編集対象領域を自動判別して、編集
対象の処理を行うため、領域を指示する操作が不要にな
る。このため、利用者は文書画像の操作を行う場合に
は、特に、複雑な操作を行うことなく、画像編集の操作
が容易に行える。As described above, according to the document image processing apparatus of the present invention, the density of an image of a part of a document image can be reduced or the brightness of the background can be reduced by a simple operation. You can Further, by designating an area not to be edited, designation of the area to be edited simplifies the designation of the area to be edited, and in addition, it becomes unnecessary to designate the editing process, which simplifies the operation. In addition, since it is possible to automatically determine the long sentence region from the physical characteristics such as the character region or the drawing region in the document image and perform the process of the edit target, the operation of designating the region becomes unnecessary. Further, since the edit target area such as "text" or "note" is automatically discriminated by the designation by the preliminary logical identifier and the edit target processing is performed, the operation of designating the area becomes unnecessary. Therefore, when the user operates the document image, the user can easily perform the image editing operation without performing a complicated operation.

[Brief description of drawings]

【図１】図１は本発明の文書画像処理装置の第１の実
施例の基本構成を示すブロック図、FIG. 1 is a block diagram showing a basic configuration of a first embodiment of a document image processing apparatus of the present invention,

【図２】図２はディジタル複写機におけるコンソール
パネルの一例を示す図、FIG. 2 is a diagram showing an example of a console panel in a digital copying machine,

【図３】図３はデジタルカラー複写機に適用した文書
画像処理装置の要部の構成を説明するブロック図、FIG. 3 is a block diagram illustrating a configuration of a main part of a document image processing apparatus applied to a digital color copying machine,

【図４】図４は指定された領域に対する濃度変換を行
う場合の処理を流れを示すフローチャート、FIG. 4 is a flowchart showing a flow of processing when density conversion is performed on a designated area,

【図５】図５は、処理対象の文書画像として入力する
白黒の入力文書の一例を示す図、FIG. 5 is a diagram showing an example of a monochrome input document to be input as a document image to be processed,

【図６】図６は入力文書において処理対象の領域を指
定する場合の操作例を説明する図、FIG. 6 is a diagram for explaining an operation example when designating an area to be processed in an input document;

【図７】図７は２５６階調グレースケールによる文書
画像の画像データを部分的に示す図、FIG. 7 is a diagram partially showing image data of a document image in 256 gradation gray scale,

【図８】図８は画像データに指示領域フラグが設けら
れた場合の画像データを部分的に示す図、FIG. 8 is a diagram partially showing image data in the case where a designated area flag is provided in the image data,

【図９】図９は指示領域フラグが反転され編集対象領
域フラグとされた状態の画像データを部分的に示す図、FIG. 9 is a diagram partially showing image data in a state in which a designated area flag is inverted to be an edit target area flag;

【図１０】図１０は指定された領域に対する濃度変換
が行なわれた状態の画像データを部分的に示す図、FIG. 10 is a diagram partially showing image data in a state in which density conversion is performed on a designated area,

【図１１】図１１は最終的に濃度変換が行なわれた状
態の出力文書の文書画像の一例を示す図、FIG. 11 is a diagram showing an example of a document image of an output document in a state where density conversion is finally performed;

【図１２】図１２は領域が指示された状態の領域テー
ブルの例を示す図、FIG. 12 is a diagram showing an example of a region table in a state where a region is designated;

【図１３】図１３は本発明の第２の実施例の文書画像
処理装置の基本構成を示すブロック図、FIG. 13 is a block diagram showing the basic arrangement of a document image processing apparatus according to the second embodiment of the present invention,

【図１４】図１４はデジタルカラー複写機に適用した
第２の実施例の文書画像処理装置の要部の装置構成を説
明するブロック図、FIG. 14 is a block diagram illustrating a device configuration of a main part of a document image processing apparatus according to a second embodiment applied to a digital color copying machine;

【図１５】図１５は文書画像の文字領域を判別してそ
の領域に対する濃度変換を行う場合の処理を流れを示す
フローチャート、FIG. 15 is a flowchart showing a flow of processing when a character area of a document image is identified and density conversion is performed on the area.

【図１６】図１６は処理対象の文書画像に対して領域
判定を行なわれた結果の領域判別データの一例を示す
図、FIG. 16 is a diagram showing an example of area determination data as a result of area determination performed on a document image to be processed;

【図１７】図１７は領域判定結果の１つの文字ブロッ
ク領域における階層構造の判定結果と入力文書との対応
関係を説明する図、FIG. 17 is a diagram illustrating a correspondence relationship between a determination result of a hierarchical structure in one character block area of the area determination result and an input document;

【図１８】図１８は領域判別データの１つの領域にお
ける判定結果の階層構造の領域データを示す図、FIG. 18 is a diagram showing region data of a hierarchical structure of a determination result in one region of the region determination data,

【図１９】図１９は各々の判定領域毎に文字数として
計数された計数データを格納する文字数テーブルを示す
図、FIG. 19 is a diagram showing a character number table storing count data counted as the number of characters for each determination area;

【図２０】図２０は、判定された長文領域に対して最
終的に濃度変換が行なわれた状態の文書画像の出力文書
の一例を示す図、FIG. 20 is a diagram showing an example of an output document of a document image in a state in which density conversion is finally performed on the determined long text area;

【図２１】図２１は本発明の第３の実施例の文書画像
処理装置の基本構成を示すブロック図、FIG. 21 is a block diagram showing the basic arrangement of a document image processing apparatus according to the third embodiment of the present invention,

【図２２】図２２は文書構造の論理識別子の一例を示
す図、FIG. 22 is a diagram showing an example of a logical identifier of a document structure,

【図２３】図２３はデジタルカラー複写機に適用した
第３の実施例の文書画像処理装置の要部の装置構成を説
明するブロック図、FIG. 23 is a block diagram illustrating a device configuration of a main part of a document image processing device according to a third embodiment applied to a digital color copying machine;

【図２４】図２４は文書画像の各領域を論理識別子に
より判別してその領域に対する濃度変換を行う場合の処
理を流れを示すフローチャート、FIG. 24 is a flowchart showing the flow of processing in the case where each area of a document image is discriminated by a logical identifier and density conversion is performed on that area;

【図２５】図２５は文書画像の各領域に対応付ける構
造文書モデルの一例を示す図、FIG. 25 is a diagram showing an example of a structural document model associated with each area of a document image,

【図２６】図２６はレイアウト構造データの要素に対
応づけた論理識別子との対応関係を示す図、FIG. 26 is a diagram showing a correspondence relationship with a logical identifier associated with an element of layout structure data;

【図２７】図２７は編集対象の候補として抽出する領
域を論理識別子により指定する編集対象判定データの一
例を示す図、FIG. 27 is a diagram showing an example of edit target determination data in which an area to be extracted as a candidate for edit target is designated by a logical identifier;

【図２８】図２８は論理識別子により判定された領域
判定データの一例を示す図、FIG. 28 is a diagram showing an example of area determination data determined by a logical identifier;

【図２９】図２９は領域判定データによる指定された
領域に対して最終的に濃度変換が行なわれた状態の文書
画像の出力文書の一例を示す図である。FIG. 29 is a diagram showing an example of an output document of a document image in which density conversion is finally performed on an area designated by area determination data.

[Explanation of symbols]

２０…文書画像処理装置、２１…イメージスキャナ、２
２…編集対象外領域指示モジュール、２３…指示領域判
別モジュール、２４…編集対象領域判別モジュール、２
５…文書画像変換モジュール、２６…プリンタ機構、２
７…制御モジュール、２８…コントロールパネル部、３
０…コンソールパネル、３１…テンキー部３１と、３２
…表示部、３３…複写ボタン、３４…濃度変換ボタン、
３５…複写スタートボタン、３６…状態表示部、５１…
入力文書の文書画像、５２…第１の編集領域、５３…始
点、５４…終点、５５…第２の編集領域、５６…始点、
５７…終点、６０…領域テーブル、６１…画像データ、
６２…画像データ、６３…指示領域フラグ、６４…編集
対象領域フラグ、６５…画像データ、６６…編集された
状態の画像データ、６７…出力文書の文書画像、７０…
文書画像処理装置、７１…イメージスキャナ、７２…領
域分割モジュール、７３最下位要素計数モジュール、７
４…長文領域判別モジュール、７５…文書画像変換モジ
ュール、７６…プリンタ機構、７７…制御モジュール、
７８…コントロールパネル部、９１…文書画像データ、
９２…文字ブロック領域、９３…罫線領域、９４…余白
領域、１００…階層構造、１０１…文字ブロック領域、
１０２…文字行領域、１０３…文字領域、１０４…領域
テーブル、１０５…下位要素個数フィールド、１０６…
下位要素開始番号フィールド、１０７…文字数テーブ
ル、１０８…出力文書の文書画像、１２０…文書画像処
理装置、１２１…イメージスキャナ、１２２…領域分割
モジュール、１２３…論理識別子付与モジュール、１２
４…編集領域判定モジュール、１２５…文書画像変換モ
ジュール、１２６…プリンタ機構、１２７…制御モジュ
ール、１２８…コントロールパネル部、１４０…論理識
別子付与テーブル、１４１…要素番号フィールド、１４
２…論理識別子フィールド、１５０…編集対象判定デー
タ、１６０…領域判定結果データ、１７０…出力文書の
文書画像、３０１…編集対象文書画像、３０２…領域分
割処理部、３０３…論理識別子付与部、３０４…編集領
域判別部、３０５…文書画像変換処理部、３０６…編集
後文書画像。20 ... Document image processing device, 21 ... Image scanner, 2
Reference numeral 2 ... Non-editing target area designation module, 23 ... Designated area determination module, 24 ... Edit target area determination module, 2
5 ... Document image conversion module, 26 ... Printer mechanism, 2
7 ... Control module, 28 ... Control panel section, 3
0 ... Console panel, 31 ... Numerical keys 31 and 32
... Display unit, 33 ... Copy button, 34 ... Density conversion button,
35 ... Copy start button, 36 ... Status display section, 51 ...
Document image of input document, 52 ... First edit area, 53 ... Start point, 54 ... End point, 55 ... Second edit area, 56 ... Start point,
57 ... end point, 60 ... area table, 61 ... image data,
62 ... Image data, 63 ... Instruction area flag, 64 ... Editing area flag, 65 ... Image data, 66 ... Image data in edited state, 67 ... Document image of output document, 70 ...
Document image processing device, 71 ... Image scanner, 72 ... Area dividing module, 73 Lowest element counting module, 7
4 ... Long sentence area discrimination module, 75 ... Document image conversion module, 76 ... Printer mechanism, 77 ... Control module,
78 ... control panel section, 91 ... document image data,
92 ... Character block area, 93 ... Ruled line area, 94 ... Margin area, 100 ... Hierarchical structure, 101 ... Character block area,
102 ... Character line area, 103 ... Character area, 104 ... Area table, 105 ... Lower element number field, 106 ...
Lower element start number field, 107 ... Character number table, 108 ... Document image of output document, 120 ... Document image processing device, 121 ... Image scanner, 122 ... Area dividing module, 123 ... Logical identifier assigning module, 12
4 ... Editing area determination module, 125 ... Document image conversion module, 126 ... Printer mechanism, 127 ... Control module, 128 ... Control panel section, 140 ... Logical identifier assignment table, 141 ... Element number field, 14
2 ... Logical identifier field, 150 ... Edit target determination data, 160 ... Region determination result data, 170 ... Output document document image, 301 ... Edit target document image, 302 ... Region dividing processing unit, 303 ... Logical identifier assigning unit, 304 ... edit area discrimination unit, 305 ... document image conversion processing unit, 306 ... edited document image.

───────────────────────────────────────────────────── フロントページの続き (72)発明者古郷慎也神奈川県横浜市保土ヶ谷区神戸町134番地横浜ビジネスパークイーストタワー富士ゼロックス株式会社内 ─────────────────────────────────────────────────── ─── Continuation of front page (72) Inventor Shinya Furusato 134 Kobe-cho, Hodogaya-ku, Yokohama-shi, Kanagawa Yokohama Business Park East Tower Fuji Xerox Co., Ltd.

Claims

[Claims]

1. A non-editing target area designating means for designating an unedited area on a document image, and a designation area flag indicating a correspondence relationship between the area designated by the non-editing target area designating means and a document image of an input document. Based on the designated area discriminating means to be added and the designated area flag added by the designated area discriminating means, an edit target area discriminating means for converting into an edit area flag indicating the edit target area of the document image, and the editing designated by the edit area flag A document image processing device, comprising: a document image conversion means for converting the density of the extracted region with respect to the target region.

2. A region dividing unit that divides a group of pixels on a document image into small regions as meaningful blocks of document elements, and a subregion divided by the region dividing unit from the smallest element among them. The lowest element counting means for counting the number, the long sentence area determining means for determining a long sentence area from the number of the lowest elements in the small areas counted by the lowest element counting means, and the long sentence area determining means. A document image processing device, comprising: a document image conversion means for converting the density of a long sentence of a region.

3. A region dividing unit that divides a group of pixels on a document image into small regions as meaningful blocks of document elements, and a logical structure of a document in each of the small regions divided by the region dividing unit. A logical identifier assigning means for assigning an identifier to give a meaning; an edit area determining means for determining an area to be a density conversion target corresponding to the identifier assigned by the logical identifier assigning means; and an area determined by the edit area determining means. A document image processing apparatus, comprising: a document image conversion unit for converting density.