JPH1125283A

JPH1125283A - Document picture processor

Info

Publication number: JPH1125283A
Application number: JP9179200A
Authority: JP
Inventors: Yutaka Nakamura; 豊中村
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1997-07-04
Filing date: 1997-07-04
Publication date: 1999-01-29

Abstract

PROBLEM TO BE SOLVED: To enlarge a character picture without any sense of incongruity by a relatively simple processing, and to output a character picture in a configuration easily readable for a reader. SOLUTION: A picture input processing part 1 inputs document picture information as digital data. A picture data storing part 2 stores the inputted picture data, a picture analyzed result and a picture processing result or the like. A picture area division processing part 3 divides the document picture according to attributes such as a character area, graphic area, and image area or the like. The character area is divided into each character unit. A character picture processing part 4 performs a proper character picture enlargement processing to a character picture divided for one character unit according to an enlargement ratio. Also, an emphasis processing and color processing such as the increase of a contrast or color conversion is performed. A document picture re-synthesis processing part 6 re-arranges each kind of picture processed for element units according to an algorithm.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to an image processing apparatus that divides document image data into element units, performs image processing such as enlargement on each element unit, and resynthesizes the result. In particular, it relates to a document image processing apparatus that avoids any sense of incongruity when the enlarged document image is resynthesized. To that end, the apparatus enlarges character elements using an enlargement method or character parameters suited to the size at which the character elements are to be reproduced; adjusts the character spacing and line spacing during resynthesis to values that are easy for the reader to read; automatically handles line overflow and page overflow caused by enlargement, inserting line breaks or page breaks where necessary; and also enlarges non-character areas appropriately before resynthesis.

[0002]

2. Description of the Related Art

Image processing apparatuses that handle images as digital information are becoming more common. Copiers, for example, used to offer only limited functions such as optical enlargement and reduction. As copiers have gone digital, they have gained functions beyond simple enlargement and reduction, such as image rotation, position correction, and mirror imaging, which are possible precisely because the image can be stored and processed digitally.

[0003] In this context, elderly and visually impaired people, for example, have difficulty seeing small characters and particular colors, so they can obtain an easier-to-read document by performing enlargement, color conversion, and similar processing on a copier. The image processing functions that copiers already provide can address this problem to some extent, because some recent copiers allow the enlargement ratio to be changed in units of one percent, and color copiers also have a color conversion function. However, these individual functions have to be combined by hand, so various problems remain once usability is taken into account. Moreover, the standard functions alone solve the problem only in cases such as going from one standard paper size to another, so their practical range of application is limited. In particular, when the enlargement ratio is larger than is generally assumed, the result no longer fits on the output page, which makes enlarged copying of a document impractical.

[0004] There are also image editing apparatuses in which, for example, the region to be enlarged is designated as a rectangle on a display device and is then enlarged or reduced to a desired size, for example so that it fits within a fixed region.

[0005] As described in Japanese Patent Application Laid-Open No. 5-81401, there is also an image editing apparatus in which the image region to be edited is designated on an image display device. Images are scaled to fit the paste destination region, and for text the number of characters per line can be changed by specifying the number of characters per line in the original image and the number of characters per line at the paste destination.

[0006]

PROBLEMS TO BE SOLVED BY THE INVENTION

As described above, conventional enlargement and reduction methods mainly scale an image while preserving the layout of its regions. When an A4 original is enlarged, for example, part of it overflows the page, so some correction work is needed on the output side. A copier can cope to some extent by changing the output paper size, but only when the enlargement corresponds to a standard size change (for example, A4 to B4 or A4 to A3); in other cases the remedy is incomplete. There is also a method, for 400% enlargement, of enlarging one original onto 16 sheets and pasting them together, but the result is inconvenient to read and of little practical use. In particular, conventional technology could not enlarge an entire A4 document and recompose it as a multi-page A4 document for viewing.

[0007] Conventional enlargement methods are also based on simply enlarging the entire image, with correction such as anti-aliasing applied to smooth the jagged pixels that result from enlarging digital data. When enlarging characters, however, compare creating a document in an editor with 12-point characters and creating it with 48-point characters: the characters are four times as large, but the stroke thickness, character spacing, and line spacing are not necessarily simply multiplied by four. Appropriate parameters exist for each character size. In other words, simply enlarging the image does not necessarily produce a result that is appropriate for the resulting character size.

[0008] To recompose a document, it is also possible to perform character recognition on the character area, treat the result as character data, and recompose the document with appropriate characters, character spacing, and line spacing. However, character recognition is time-consuming and requires a powerful processor, so it is difficult to adopt in a device such as a copier, where a certain speed is required and cost is important. In addition, although recognition rates have been improving, misrecognition still occurs. This is a real problem: if a character is recognized incorrectly and the document is resynthesized with it, the meaning intended by the original may not be conveyed or may become incomprehensible, which is undesirable in practice.

[0009] As described in Japanese Patent Application Laid-Open No. 5-81401, it is also possible to paste text without a sense of incongruity by designating the character area on a display device and specifying both the number of characters per line in the source and the number of characters per line at the paste destination. This method works only if the number of characters per line is constant. Because few recent documents use a fixed character size and a fixed character pitch, character segmentation does not work well. The procedure is also complicated and impractical, since the user must enclose the region on the display device and then enter the number of characters per line.

[0010]

SUMMARY OF THE INVENTION

The present invention has been made to solve the above problems. Its object is to provide a document image processing apparatus that, by relatively simple processing, enlarges character images without a sense of incongruity and outputs them in a form that is easy for the reader to read.

[0011]

MEANS FOR SOLVING THE PROBLEMS

A document image processing apparatus according to the present invention comprises: image input processing means for inputting image data; image data storage means for storing the input image data; image area dividing means for dividing the image data into image areas of different attributes and into element units; character image processing means for performing predetermined scaling, structure conversion, and color conversion on character elements; non-character image processing means for performing predetermined scaling and color conversion on non-character elements; document image resynthesis processing means for resynthesizing a document from the processed element images; and document image output means for outputting the resynthesized document image.

[0012] In this configuration, handling the document image in element units solves various problems that arise when a document image is enlarged and resynthesized. Specifically, element image conversion and resynthesis are performed while preferable parameters are assigned automatically or manually, and a page break function is provided, so that an image that is easy for the reader to read is reproduced. Furthermore, because all of this is achieved solely by rearranging image data with enlargement, the processing itself is not complicated. Even if the analysis used to divide the image into element units makes an error, the user can anticipate the result, so understanding of the document is not impaired.

[0013] In this configuration, the image area dividing means may analyze the entire input image, divide it into various attribute areas including character areas, graphics areas, and image areas, and further divide the characters into character elements that can be handled one character at a time.

[0014] Alternatively, the image area dividing means may extract only an area designated in advance, analyze only that area, divide it into various attribute areas including character areas, graphics areas, and image areas, and further divide the characters into character elements that can be handled one character at a time.

[0015] The image area dividing means may also extract only an area designated in advance, analyze only that area, and divide it into character elements that can be handled one character at a time.

[0016] The character image processing means may scale, deform, and color-convert characters in a manner suited to the target character size.

[0017] When the element images to be resynthesized have been prepared, the document image resynthesis means may resynthesize the document image while assigning appropriate character spacing, line spacing, and inter-element spacing according to the characteristics of the element images. It may have a concept of pages, performing appropriate resynthesis within a page and carrying the resynthesis across multiple pages.

[0018] When the element images to be resynthesized have been prepared, the document image resynthesis means may also resynthesize the document image while assigning appropriate character spacing, line spacing, and inter-element spacing according to the characteristics of the element images. In this case it may have a concept of a display area and perform appropriate resynthesis within that display area.

[0019] Another document image processing apparatus according to the present invention comprises: means for extracting character-unit areas from an input image; means for enlarging the character-unit areas while executing processing that depends on the enlargement ratio; means for compositing the enlarged character-unit areas into an output image in a predetermined arrangement; and means for outputting the composited output image.

[0020] This configuration likewise reproduces an enlarged image that is easy for the reader to read. The processing itself is not complicated, and even if the analysis used to divide the image into element units makes an error, the user can anticipate the result, so understanding of the document is not impaired.

[0021]

EMBODIMENTS OF THE INVENTION

Embodiments of the present invention are described below with reference to the drawings. FIG. 1 is a block diagram showing the overall configuration of a document image processing apparatus according to one embodiment of the present invention. In FIG. 1, reference numeral 1 denotes an image input processing unit, 2 an image data storage unit, 3 an image area division processing unit, 4 a character image processing unit, 5 a non-character image processing unit, 6 a document image resynthesis processing unit, and 7 a document image output unit.

[0022] The image input processing unit 1 is, for example, a scanner. It inputs document image information as digital data and converts it into a form suitable for image area division. The image data storage unit 2 stores the input image data, image analysis results, image processing results, and the like. The image area division processing unit 3 divides the document image into areas according to their attributes, for example character areas, graphics areas, and image areas. Division may be fully automatic, semi-automatic, or manual. Character areas are further divided into columns or lines of characters, and then into individual characters. The character image processing unit 4 applies character image enlargement appropriate to the enlargement ratio to each character image segmented in one-character units. It also performs enhancement and color processing, such as raising the contrast or converting colors. The non-character image processing unit 5 enlarges the non-character areas, that is, graphics areas and image areas, as whole areas, taking the output document into account. It may also perform enhancement and color processing such as raising the contrast or converting colors. The document image resynthesis processing unit 6 rearranges the images processed in element units according to an algorithm. For example, it sets the character spacing and line spacing to appropriate values, automatically or manually, and places the elements accordingly. Non-character images are likewise processed appropriately and then placed with suitable spacing from the character areas. If the result of these operations does not fit on a single output page, processing such as a page break is performed. The document image output unit 7 outputs the image obtained by document image resynthesis in an appropriate form.
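The rearrangement with line and page breaks performed by unit 6 can be illustrated roughly as follows. This is a minimal sketch, not the patent's algorithm: it assumes horizontal writing, that every enlarged element has been reduced to a (width, height) box, and that spacing is given as fixed pixel gaps. The name `reflow` and all of its parameters are hypothetical.

```python
def reflow(sizes, page_w, page_h, char_gap, line_gap):
    """Place element boxes left to right, top to bottom.

    sizes: list of (width, height) for each enlarged element (assumed input).
    Returns a list of (page_index, x, y) giving each element's top-left corner.
    """
    placements = []
    page, x, y, line_h = 0, 0, 0, 0
    for w, h in sizes:
        # Line overflow: start a new line below the tallest box of this line.
        if x > 0 and x + w > page_w:
            x = 0
            y += line_h + line_gap
            line_h = 0
        # Page overflow: continue at the top of the next output page.
        if y > 0 and y + h > page_h:
            page += 1
            x, y, line_h = 0, 0, 0
        placements.append((page, x, y))
        x += w + char_gap
        line_h = max(line_h, h)
    return placements


if __name__ == "__main__":
    # Five 40x40 characters on a 100x100 page with 10-pixel gaps.
    for p in reflow([(40, 40)] * 5, 100, 100, 10, 10):
        print(p)
```

Because each element keeps its own image, a segmentation error only shifts where a piece of the picture lands; it never substitutes a wrong character, which is the property the specification relies on.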

[0023] Next, the detailed configuration and operation of one embodiment of the image processing apparatus of the present invention are described with reference to the drawings.

[0024] FIG. 2 is a detailed block diagram of the image input processing unit 1. The input image may be a binary image, a monochrome multi-valued image, a color multi-valued image, or the like. First, as preprocessing to improve the condition of the original, there are a noise removal processing unit 11 and a skew correction processing unit 12. Because noise adversely affects area division, isolated points are removed, as is the black noise that appears at the binding when a book is scanned. This can, however, also be done as part of area division. If a document is input at an angle, the output image is also skewed. Moreover, severe skew can affect area division and prevent correct character resynthesis, so skew correction may be required. Skew correction can be implemented with, for example, a method based on the Hough transform or a method based on the projection profile (marginal distribution).
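As an illustration of the isolated-point removal mentioned above, the following minimal sketch clears black pixels that have no black neighbor. It assumes a binary image stored as a list of rows with 1 for black and 0 for white, and an 8-neighborhood test; the patent does not specify the neighborhood, and the function name is hypothetical.

```python
def remove_isolated_points(img):
    """Return a copy of img with black pixels that have no black 8-neighbour cleared."""
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(h):
        for x in range(w):
            if img[y][x] != 1:
                continue
            # Look at the up-to-8 surrounding pixels, clipped at the image border.
            has_neighbour = any(
                img[ny][nx] == 1
                for ny in range(max(0, y - 1), min(h, y + 2))
                for nx in range(max(0, x - 1), min(w, x + 2))
                if (ny, nx) != (y, x)
            )
            if not has_neighbour:
                out[y][x] = 0
    return out
```

Larger specks, such as the dark band at a book binding, would need a size or position test instead of this single-pixel rule.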

[0025] A marker detection processing unit 13 is a further preprocessing step. It is needed when the target area on the original has been enclosed in advance with a marker or the like, and it extracts only the area enclosed by the marker. For a color image the marker color is known, so the area can be extracted easily by extracting and processing only that color. For example, as shown in FIG. 3, obtaining the coordinate values of the connected pixels of the extracted marker color readily gives the coordinates of the rectangle that bounds the marker. Alternatively, the area may be indicated not on the paper but with a digitizer or similar device attached to the copier's platen. In that case the information reaches the device in advance from the digitizer, so only the required area can be extracted without any special processing.
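The bounding-rectangle step for a colored marker might look like the sketch below. It assumes the image is a list of rows of (R, G, B) tuples, that the marker color is known, and that a per-channel tolerance `tol` is enough to pick out marker pixels; these are illustrative assumptions, and the function name is hypothetical.

```python
def marker_bounding_box(rgb, marker, tol=40):
    """Return (left, top, right, bottom), inclusive, of pixels close to the marker colour.

    Returns None when no pixel matches.
    """
    xs, ys = [], []
    for y, row in enumerate(rgb):
        for x, (r, g, b) in enumerate(row):
            if (abs(r - marker[0]) <= tol
                    and abs(g - marker[1]) <= tol
                    and abs(b - marker[2]) <= tol):
                xs.append(x)
                ys.append(y)
    if not xs:
        return None
    return min(xs), min(ys), max(xs), max(ys)
```

This treats all matching pixels as one marker; separating several marked regions on the same page would additionally need connected-component grouping.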

[0026] Apart from image enlargement and resynthesis, area division is basically performed on a binary image, so a threshold processing unit 14 is needed to obtain a binary image. Methods of converting a color image into a monochrome multi-valued image include using only the G component of RGB, computing a luminance component, and deriving the required luminance component in light of color characteristics such as the background of the original. Thresholding itself may likewise use a fixed threshold, a single threshold derived from the density characteristics of the whole image, or a threshold that varies dynamically with the state of the image.
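The conversion and thresholding options can be sketched as follows. The patent names no specific formulas, so two assumptions are made here and should be read as such: the luminance weights are the common Rec. 601 values, and Otsu's method stands in for "a single threshold derived from the density characteristics of the whole image". All function names are hypothetical.

```python
def to_gray(rgb, mode="luma"):
    """Convert rows of (R, G, B) tuples to 0-255 gray values."""
    if mode == "green":                      # use only the G component
        return [[g for (_, g, _) in row] for row in rgb]
    # Assumed Rec. 601 luminance weights.
    return [[round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
            for row in rgb]


def otsu_threshold(gray):
    """Pick the threshold that maximises between-class variance (Otsu's method)."""
    hist = [0] * 256
    for row in gray:
        for v in row:
            hist[v] += 1
    total = sum(hist)
    sum_all = sum(i * c for i, c in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = sum0 = 0
    for t in range(256):
        w0 += hist[t]                        # pixels with value <= t
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0                      # pixels with value > t
        if w1 == 0:
            break
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(gray, t):
    """Dark pixels (value <= t) become 1 (black); the rest become 0 (white)."""
    return [[1 if v <= t else 0 for v in row] for row in gray]
```

A fixed threshold is simply `binarize(gray, 128)` or similar; a dynamically varying threshold would compute `t` per local window rather than once per page.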

[0027] Image analysis may be performed at the resolution of the input image, but a reduced image is sometimes used, in which case a reduction processing unit 15 that preserves line art is needed. OR reduction is one example of a line-preserving reduction method: when an M × M block is reduced to one pixel, the reduced pixel is made black if the block contains even one black pixel. Reduction may be applied to the binary image, or it may be applied to the multi-valued image and followed by thresholding.
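OR reduction as described above can be written directly. The sketch assumes a binary image stored as a list of rows with 1 for black, and it handles images whose size is not a multiple of M by using partial blocks at the right and bottom edges, which is a choice the patent does not address.

```python
def or_reduce(img, m):
    """Reduce each m x m block to one pixel that is black (1) if any pixel in the block is black."""
    h, w = len(img), len(img[0])
    out = []
    for by in range(0, h, m):
        row = []
        for bx in range(0, w, m):
            block_has_black = any(
                img[y][x] == 1
                for y in range(by, min(by + m, h))
                for x in range(bx, min(bx + m, w))
            )
            row.append(1 if block_has_black else 0)
        out.append(row)
    return out
```

Unlike plain subsampling, a one-pixel-wide stroke can never vanish under this rule, at the cost of strokes becoming relatively thicker in the reduced image.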

[0028] FIG. 4 is a block diagram showing details of the image area division processing unit 3. The area division methods used in the basic area division processing unit 31 fall into two groups: those that analyze from macro to micro and those that analyze from micro to macro. One example of macro-to-micro analysis applies the projection profile (marginal distribution) repeatedly. As shown in FIG. 5, the profile is used to capture the characteristics of the whole image, and the image is split into multiple areas at the peaks and valleys of the profile; this is repeated several times. In the example of FIG. 5, taking the vertical and horizontal profiles first separates the image broadly into an image area and a character area, and at this point the character area is already split into text lines. Next, a profile is taken again, along the horizontal direction, for each text line. The character size is then estimated from the width of the text line, and splitting on the basis of that value yields pieces that can be handled almost one character at a time. One example of micro-to-macro analysis uses labeling. As shown in FIG. 6, small areas are first obtained, each a set of connected pixels. These small areas are then grouped into single characters according to their positional relationship with neighboring areas, and subsequently grouped into larger units such as lines and paragraphs, which makes area division possible. Either technique ultimately yields image information for each individual character and image information for each line of consecutive characters.
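The macro-to-micro approach can be sketched for the simplest case: horizontal text on a clean binary image (1 = black). Rows are summed to find text lines, and then columns are summed within each line to find character candidates. This sketch omits the step in which the character size estimated from the line width is used to merge or split candidates, and all names are hypothetical.

```python
def runs(profile):
    """Return (start, end) pairs, end exclusive, of consecutive non-zero profile entries."""
    out, start = [], None
    for i, v in enumerate(profile):
        if v and start is None:
            start = i
        elif not v and start is not None:
            out.append((start, i))
            start = None
    if start is not None:
        out.append((start, len(profile)))
    return out


def split_lines_and_chars(img):
    """Return, per text line, a list of (left, top, right, bottom) boxes (right/bottom exclusive)."""
    width = len(img[0])
    row_profile = [sum(row) for row in img]          # black pixels per row
    result = []
    for top, bottom in runs(row_profile):            # each run of rows is one text line
        col_profile = [sum(img[y][x] for y in range(top, bottom))
                       for x in range(width)]        # black pixels per column within the line
        result.append([(left, top, right, bottom)
                       for left, right in runs(col_profile)])
    return result
```

Characters made of horizontally separated parts would be split into several boxes by this rule, which is where the size estimate described in the text comes in.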

[0029] The attribute discrimination processing unit 32 determines, from the image characteristics within each divided area, whether the area is a character area, a halftone area, a graphics area, and so on.

[0030] The vertical/horizontal writing discrimination processing unit 33 determines whether the characters in each area of the document are written vertically or horizontally. This can be determined from the intermediate results of area analysis. Both macro-to-micro and micro-to-macro analysis yield, partway through, areas corresponding to rows or columns of characters, that is, an analysis result like that of FIG. 7. For a horizontally written document, rows are extracted as wide rectangles; for a vertically written document, columns are extracted as tall rectangles. Because rows or columns are extracted, this information shows whether the text is written horizontally or vertically. The result is used to determine the order in which individual characters are segmented, and then the order of character enlargement and document image resynthesis.

FIG. 8 is a block diagram showing details of the character image processing unit 4. In ordinary enlargement, once the enlargement ratio for the input image is decided, it is applied to the entire image to obtain the enlarged image. Such enlargement alone would suffice, but here an enlargement method suited to the size of the characters being enlarged is used. The scaling processing unit 41 scales each character to the size required for output. FIG. 9 shows details of the image processing applied to element images. Consider, for example, enlarging characters of about 10 points to 40 points. Ordinary enlargement simply expands each pixel into a 4 × 4 block of 16 pixels. If the jagged contours that result are objectionable, the apparent character quality can be improved by interpolation as shown in FIG. 9. Interpolation differs for binary and multi-valued images: in a binary image each interpolated pixel is either white or black, whereas in a multi-valued image a density value must also be determined during interpolation. Commonly known methods include linear interpolation and nine-point quadratic interpolation.
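The "ordinary enlargement" in the 10-point to 40-point example, where each pixel becomes a 4 × 4 block, is plain pixel replication. A minimal sketch for an integer factor `k`, with the image stored as a list of rows (the function name is hypothetical):

```python
def replicate(img, k):
    """Enlarge img by an integer factor k, turning every pixel into a k x k block."""
    out = []
    for row in img:
        wide = [v for v in row for _ in range(k)]    # repeat each pixel k times horizontally
        out.extend(wide[:] for _ in range(k))        # repeat the widened row k times vertically
    return out
```

Calling `replicate(glyph, 4)` multiplies the pixel count by 16, matching the example in the text. The interpolation methods the paragraph mentions would replace this step when the staircase edges it produces are too visible.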

[0031] The structure conversion processing unit 42 simulates a change of typeface. This has two benefits. First, a 40-point character is four times the size of a 10-point character, but its stroke width is not necessarily four times as large. Adjusting the stroke width to suit the character size makes the resynthesized characters easier to read. Second, some people find characters hard to read unless the strokes are thick, and in such cases the stroke width may need to be made as thick as possible. That is, even characters with thin strokes, such as those set in a Mincho typeface, can be resynthesized as more legible characters by thickening their strokes to resemble a thick-stroked typeface such as Gothic. Stroke width can be changed by, for example, dilation and erosion.
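Dilation and erosion on a binary character image can be sketched as below. The 3 × 3 square neighborhood and the treatment of pixels outside the image as white are assumptions made for illustration; the patent does not fix either, and the function names are hypothetical.

```python
def _neighbourhood(img, y, x):
    """Values of the 3x3 window around (y, x); pixels outside the image count as white (0)."""
    h, w = len(img), len(img[0])
    return [
        img[ny][nx] if 0 <= ny < h and 0 <= nx < w else 0
        for ny in (y - 1, y, y + 1)
        for nx in (x - 1, x, x + 1)
    ]


def dilate(img):
    """Thicken strokes: a pixel becomes black if any pixel in its 3x3 window is black."""
    return [[1 if any(_neighbourhood(img, y, x)) else 0
             for x in range(len(img[0]))] for y in range(len(img))]


def erode(img):
    """Thin strokes: a pixel stays black only if its whole 3x3 window is black."""
    return [[1 if all(_neighbourhood(img, y, x)) else 0
             for x in range(len(img[0]))] for y in range(len(img))]
```

One call to `dilate` adds one pixel on each side of a stroke, so a one-pixel stroke becomes three pixels wide. Because this also grows the glyph's bounding box slightly, the size correction described in paragraph [0033] becomes necessary.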

[0032] The color conversion processing unit 43 performs color conversion to make characters easier to see. For example, processing can be added that raises the contrast of the output image for visually impaired readers, or that converts individual colors for color-blind readers so that the image becomes easier to see.
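
One possible reading of these two conversions is sketched below. The source does not say how contrast is raised or how colors are mapped, so both the linear contrast stretch and the lookup-table remapping are assumptions, and `stretch_contrast` and `remap_colors` are hypothetical names.

```python
def stretch_contrast(pixels, lo=0, hi=255):
    """Linearly stretch density values so that the darkest pixel maps
    to lo and the brightest to hi (assumed contrast-raising method)."""
    mn, mx = min(pixels), max(pixels)
    if mn == mx:
        return list(pixels)  # flat image: nothing to stretch
    return [round(lo + (p - mn) * (hi - lo) / (mx - mn)) for p in pixels]


def remap_colors(pixels, table):
    """Replace individual colors using a per-user lookup table.
    Colors missing from the table are left unchanged."""
    return [table.get(p, p) for p in pixels]


if __name__ == "__main__":
    print(stretch_contrast([50, 100, 250]))
    print(remap_colors(["red", "green"], {"red": "orange"}))
```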

[0033] Through these processes, appropriate character image processing is carried out for each element. The various processes may slightly change the size of a character relative to the original enlarged image. Before re-synthesis, such variations in size must be corrected in the coordinate system.

[0034] In this embodiment, the image is decomposed into elements before each image process is applied. However, when the same image processing is to be applied to an entire area, the processing may instead be applied to the whole image first, followed by decomposition into elements.

[0035] The non-character image processing unit 5 is basically the same as the character image processing unit 4, except that the structure conversion processing unit 42 is not required. Color conversion is unnecessary for natural images but is needed for graphics. In that case, however, if a converted color resembles the colors around it, the parts of a graph, for example, become indistinguishable, so the color must be converted to one that is not used in the surroundings.
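
A minimal sketch of choosing a replacement color that is not used in the surroundings is given below. The selection rule (pick the palette color whose nearest surrounding color is farthest away in RGB space) is an assumption made for illustration, as is the name `pick_distinct_color`; the source only states the requirement.

```python
def distance_sq(a, b):
    """Squared Euclidean distance between two RGB tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def pick_distinct_color(surrounding, palette):
    """Return the palette color whose nearest surrounding color is as
    far away as possible, so it stays distinguishable in a graph.
    Assumes both lists are non-empty."""
    return max(
        palette,
        key=lambda c: min(distance_sq(c, s) for s in surrounding),
    )


if __name__ == "__main__":
    around = [(255, 0, 0), (250, 10, 10)]          # reds already in use
    choices = [(255, 0, 0), (0, 0, 255), (240, 0, 0)]
    print(pick_distinct_color(around, choices))
```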

[0036] FIG. 10 is a block diagram showing details of the document image re-synthesis processing unit 6, and FIG. 11 shows an example of the actual processing. By the processing so far, the character elements have been separated one character at a time and transformed, through appropriate character processing, into the character shape best suited to the desired size. The character spacing and line spacing of the character area in the re-synthesized document are determined from the size of the enlarged characters. The non-character areas have also been enlarged at an appropriate ratio, so they are re-synthesized together with the characters. Using this information, the document can be re-synthesized by pasting the enlarged character images and non-character images.

The character element synthesis processing unit 61 synthesizes the character elements that have undergone the appropriate processing. One method of embedding such sub-character patterns is disclosed in, for example, Japanese Patent Application Laid-Open No. 5-266168. The procedure is explained with reference to FIG. 11, which shows a horizontally written document. The basic operation is to paste one character, leave a space equal to the character spacing, and paste the next character, continuing until no more characters fit on the line. When no more characters fit, the position is shifted by the specified line spacing and the same processing is performed on the next line. When lines are filled in this way, the right ends of the character lines are not necessarily aligned. If justification is required, pseudo justification is possible by finely adjusting the character spacing within each line. In addition, a punctuation mark may end up at the left end of a line. Ordinary document editors apply line-breaking prohibition rules so that this does not happen, but because this embodiment does not recognize elements as characters, such rules cannot be applied as they are in a document editor. However, punctuation marks are extremely small compared with ordinary characters, so when such a character (rectangle) arrives at the start of a line, pseudo prohibition processing that pushes it to the right end of the previous line is possible. Pseudo line-break processing is likewise possible.
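
The line-filling procedure above, including the pseudo prohibition rule for small rectangles and pseudo justification, might be sketched as follows. The names `layout_lines` and `justified_gap`, the `small_ratio` threshold of 0.5, and the use of the tallest rectangle as the reference height are all assumptions for illustration; the source gives no concrete thresholds.

```python
def layout_lines(boxes, line_width, char_gap, small_ratio=0.5):
    """Assign character rectangles (width, height) to lines in order.
    A rectangle that would overflow starts a new line, unless it is
    much smaller than an ordinary character (punctuation-like), in
    which case it is pushed onto the end of the current line.
    Returns a list of lines, each a list of indices into boxes."""
    typical = max(h for _, h in boxes)        # assumed reference height
    lines, current, x = [], [], 0
    for i, (w, h) in enumerate(boxes):
        needed = w if not current else char_gap + w
        overflow = bool(current) and x + needed > line_width
        punct_like = h < small_ratio * typical
        if overflow and not punct_like:
            lines.append(current)             # close the full line
            current, x = [], 0
            needed = w
        current.append(i)
        x += needed
    if current:
        lines.append(current)
    return lines


def justified_gap(boxes, line, line_width):
    """Character spacing that makes the line end flush with line_width
    (pseudo justification). Never returns a negative gap."""
    if len(line) < 2:
        return 0.0
    total = sum(boxes[i][0] for i in line)
    return max(0.0, (line_width - total) / (len(line) - 1))


if __name__ == "__main__":
    # three full-size characters, one small punctuation-like mark, one more
    boxes = [(10, 10), (10, 10), (10, 10), (3, 3), (10, 10)]
    print(layout_lines(boxes, line_width=34, char_gap=2))
```

In the usage example the small rectangle would otherwise begin the second line, so it is kept at the right end of the first line instead.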

[0037] The non-character element synthesis processing unit 62 performs the same kind of operation on non-character image elements rather than on characters. Because this processing only inserts an appropriate gap between a non-character image element and the other elements according to the size of the image and then re-synthesizes the element, it is simpler than the processing of the character element synthesis processing unit 61.

[0038] In such cases, the result does not necessarily fit on a single sheet, and the processing result may extend over multiple pages. The page processing unit 63 controls this by introducing the concept of a page into the re-synthesis. At the start of processing, an area for one page is secured, and both character elements and non-character elements are re-synthesized into it. If they do not fit on one page, an area for another page is secured and the pasting of character and non-character areas continues there. This is repeated until no elements of the target document remain. In other words, to control pagination, the page processing unit 63 monitors the results of the re-synthesis performed by the character element synthesis processing unit 61 and the non-character element synthesis processing unit 62.
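
The page control described above can be sketched as a simple fill loop. This is a minimal illustration under stated assumptions: the name `paginate` is hypothetical, each already-composed line or non-character element is represented only by its height, and elements are never split across pages.

```python
def paginate(element_heights, page_height, line_gap):
    """Distribute composed lines or elements over pages in order.
    A new page area is opened whenever the next element no longer
    fits on the current page. Returns a list of pages, each a list
    of indices into element_heights."""
    pages, current, y = [], [], 0
    for i, h in enumerate(element_heights):
        needed = h if not current else line_gap + h
        if current and y + needed > page_height:
            pages.append(current)             # current page is full
            current, y = [], 0
            needed = h
        current.append(i)
        y += needed
    if current:
        pages.append(current)
    return pages


if __name__ == "__main__":
    # five enlarged lines, each 40 units tall, on pages 100 units tall
    print(paginate([40] * 5, page_height=100, line_gap=10))
```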

[0039] FIG. 12 shows an example in which only a marked area is enlarged, and FIG. 13 shows an example in which a whole page of the image is enlarged. In both cases, the characters are enlarged to the desired size and converted to a suitable character shape and color. Re-synthesis is then performed in a form suited to the character size. If there are non-character areas, they are also enlarged at an appropriate ratio and re-synthesized.

[0040] In the description so far, character elements are embedded one at a time, so the character elements must in principle be separated into single-character units. The accuracy of this separation depends on the performance of the area division. Because of errors at the area division stage, characters may be split apart or joined together, as shown for example in FIG. 13, and in that case the re-synthesized result looks slightly different. This is a particular problem when it occurs between the right end of one line and the left end of the next. However, because people recognize patterns visually, they can reconstruct the characters in their heads even when such errors occur, so the text becomes harder to read but not unreadable. With character recognition, by contrast, a misrecognition converts the character into a different character, so correcting it requires not merely visual pattern recognition but an understanding of the meaning of the text, and it may lead to misinterpretation.

[0041] In this embodiment, processing such as character enlargement, character structure conversion, character color conversion, and enlargement of non-character areas is performed element by element, and pasting is performed page by page. Alternatively, in a character area for example, the elements may first be pasted at their original size using character spacing and line spacing that take the enlargement ratio into account. Each area that can be divided in the line direction is then treated as one processing unit, the enlargement, structure conversion, color conversion, and other processing is applied to that area, and the result is pasted into the page.

[0042] This embodiment describes re-synthesis for a horizontally written document, but a vertically written document can be handled by the same processing, differing only in the direction in which elements are embedded. The character spacing and line spacing are adjusted automatically here according to the character size, but these values may also be adjusted manually.

[0043] This embodiment has been described for a copying machine, but the function is not limited to copying machines. The same can be achieved on a PC (personal computer) equipped with a scanner or the like. Analysis processing can be performed on the entire image, or only on an area indicated by a marker or the like. A desired area can also be specified with a mouse or similar device on the original image displayed virtually on a display, and processing performed only on that area. The output may be reproduced on a display or on a printer. For a printer, the image synthesis processing is the same as for the copying machine described in this embodiment.

For a display, two approaches are possible: re-synthesizing with the screen treated as one page, or displaying without the concept of a page. For a horizontally written document, for example, the horizontal width of the display is fixed and the user scrolls vertically, which gives a display that is easy to read. For a vertically written document, the vertical direction is fixed and the user scrolls horizontally. On a display, the output can also be shown in a contrast state that is easier for the user to see (for example, black-and-white inversion). Furthermore, when the output destination is paper, the position of each non-character area must be determined for re-synthesis, but on a display device such as a CRT (cathode ray tube), the position need not be determined. Instead, for example, a pointer indicating that a non-image area exists can be kept on screen at all times, and the area called up and displayed when needed.

[0044]

[Effects of the Invention] As described above, the document image processing device of the present invention handles a document image in element units and can thereby solve the various problems that arise when a document image is enlarged and re-synthesized. Specifically, by handling the document image in element units, element image conversion and re-synthesis are performed while preferred parameters are assigned automatically or manually, and a page break function is provided, so that an image that is easy for the reader to read can be reproduced. Moreover, because the result is achieved solely by rearranging image data with enlargement, the user can anticipate the outcome even when an analysis error occurs, and understanding of the document is not impaired.

[Brief description of the drawings]

FIG. 1 is a block diagram showing the overall configuration of a document image processing device according to an embodiment of the present invention.

FIG. 2 is a block diagram showing the configuration of the image input processing unit of the present invention.

FIG. 3 is a diagram illustrating marker detection according to an embodiment of the present invention.

FIG. 4 is a block diagram showing the configuration of the image area division processing unit of the present invention.

FIG. 5 is a diagram illustrating area division according to an embodiment of the present invention.

FIG. 6 is a diagram illustrating area division according to an embodiment of the present invention.

FIG. 7 is a diagram illustrating vertical/horizontal writing discrimination processing according to an embodiment of the present invention.

FIG. 8 is a block diagram showing the configuration of the character image processing unit of the present invention.

FIG. 9 is a diagram illustrating enlargement processing according to an embodiment of the present invention.

FIG. 10 is a block diagram showing the configuration of the document image re-synthesis processing of the present invention.

FIG. 11 is a diagram illustrating document image re-synthesis processing according to an embodiment of the present invention.

FIG. 12 is a diagram illustrating an example of document image re-synthesis processing according to an embodiment of the present invention.

FIG. 13 is a diagram illustrating an example of document image re-synthesis processing according to an embodiment of the present invention.

FIG. 14 is a diagram illustrating an example of document image re-synthesis processing according to an embodiment of the present invention.

[Explanation of symbols]

1 Image input processing unit
2 Image data storage unit
3 Image area division processing unit
4 Character image processing unit
5 Non-character image processing unit
6 Document image re-synthesis processing unit
7 Document image output unit
11 Skew correction processing unit
12 Noise removal processing unit
13 Marker detection processing unit
14 Threshold processing unit
15 Reduction processing unit
31 Area division basic processing unit
32 Attribute determination processing unit
33 Vertical/horizontal writing determination processing unit
41 Enlargement/reduction processing unit
42 Structure conversion processing unit
43 Color conversion processing unit
51 Character element synthesis processing unit
52 Non-character element synthesis processing unit
53 Page processing unit

Claims

[Claims]

1. A document image processing device comprising: image input processing means for inputting image data; image data storage means for storing the input image data; image area dividing means for dividing the image data into image areas having different attributes to form element units; character image processing means for performing predetermined scaling, structure conversion, and color conversion processing on character elements; non-character image processing means for performing predetermined scaling and color conversion processing on non-character elements; document image re-synthesis processing means for re-synthesizing a document from the processed element images; and document image output means for outputting the re-synthesized document image.

2. The document image processing device according to claim 1, wherein the image area dividing means performs analysis processing on the entire input image, divides the input image into various attribute areas including a character area, a graphics area, and an image area, and further divides the character area into character elements that can be handled in units of one character.

3. The document image processing device according to claim 1, wherein the image area dividing means extracts only an area designated in advance, performs analysis processing only on that area, divides the area into various attribute areas including a character area, a graphics area, and an image area, and further divides the character area into character elements that can be handled in units of one character.

4. The document image processing device according to claim 1, wherein the image area dividing means extracts only an area designated in advance, performs analysis processing only on that area, and divides the area into character elements that can be handled in units of one character.

5. The document image processing device according to claim 1, wherein the character image processing means performs enlargement or reduction, deformation, and color conversion of a character in a manner suited to the size of the character after conversion.

6. The document image processing device according to claim 4 or 5, wherein the document image re-synthesizing means, when the element images to be re-synthesized are prepared, performs re-synthesis while assigning appropriate character spacing, line spacing, and inter-element spacing according to the characteristics of the element images, has the concept of a page, performs appropriate re-synthesis within a page, and carries out the re-synthesis over a plurality of pages.

7. The document image processing device according to claim 2, 3, 4, 5, or 6, wherein the document image re-synthesizing means, when the element images to be re-synthesized are prepared, performs re-synthesis while assigning appropriate character spacing, line spacing, and inter-element spacing according to the characteristics of the element images, has the concept of a display area, and performs appropriate re-synthesis within the display area.

8. A document image processing device comprising: means for extracting character unit areas from an input image; means for enlarging each character unit area while applying processing according to the enlargement ratio; means for composing an output image in which the enlarged character unit areas are arranged in a predetermined layout; and means for outputting the composed output image.