JP4080973B2

JP4080973B2 - Image output device, image output program, and recording medium on which program is recorded

Info

Publication number: JP4080973B2
Application number: JP2003290616A
Authority: JP
Inventors: 広文西田
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2003-08-08
Filing date: 2003-08-08
Publication date: 2008-04-23
Anticipated expiration: 2023-08-08
Also published as: JP2005064736A

Description

本発明は、スキャナ、デジタルカメラ等の画像入力機器により取り込まれた文書、図面の画像から、文字・線図を正確に再現した画像または文字・線図を改善して再現した画像を作成することができる画像出力装置、画像出力プログラムおよびこのプログラムが記録された記録媒体に関する。 The present invention creates an image that accurately reproduces a character / line diagram or an image that reproduces an improved character / line diagram from a document or drawing image captured by an image input device such as a scanner or a digital camera. The present invention relates to an image output apparatus, an image output program, and a recording medium on which the program is recorded.

近年、カラースキャナ、カラーデジタルカメラ（カラーデジタルカメラ付きの携帯型電話機を含む）等のカラー情報入力機器が広く普及している。カラースキャナでは、たとえば解像度４００ｄｐｉ、２５６階調で原稿画像を読み取る場合には、処理するべきデータ量が増化し、読み取りに時間がかかる等の不都合がある。また、カラーデジタルカメラでは、たとえば解像度４００ｄｐｉで撮影するとメモリに記憶できる画像数が少なくなるといった不都合がある。 In recent years, color information input devices such as a color scanner and a color digital camera (including a mobile phone with a color digital camera) are widely used. For example, when a document image is read at a resolution of 400 dpi and 256 gradations, the color scanner has a disadvantage that the amount of data to be processed increases and it takes time to read. In addition, the color digital camera has a disadvantage that the number of images that can be stored in the memory is reduced, for example, when shooting at a resolution of 400 dpi.

一般に、カラー画像では、たとえば２００ｄｐｉ程度であっても、ディスプレイに表示する場合に画質が劣って見えることはない。このため、この種のカラー情報入力機器では、カラーの文書、図面、地図等を、２００ｄｐｉ程度の解像度で取得し（すなわち読み取りまたは撮影し）、所定フォーマット（ＴＩＦ、ＪＰＥＧ、ＧＩＦ等）のイメージファイルとして他の装置に送信等している。 In general, even in the case of a color image, for example, about 200 dpi, the image quality does not look inferior when displayed on a display. For this reason, in this type of color information input device, color documents, drawings, maps, etc. are acquired (that is, read or photographed) with a resolution of about 200 dpi, and an image file in a predetermined format (TIF, JPEG, GIF, etc.). To other devices.

ところで、カラービットマップ画像に、写真、絵等のほか、文字・線図が含まれている場合において、これら文・線図を正確にまたは改善して出力したいことがある。たとえば、あるユーザは、カラービットマップ画像中の文字・線図を、ディスプレイに表示し、またはプリントしたい場合がある。 By the way, when a color bitmap image includes a character / diagram in addition to a photograph, a picture, etc., it may be desired to output the sentence / diagram with accuracy or improvement. For example, a user may want to display or print a character / line diagram in a color bitmap image on a display.

ところが、カラーの印刷文書、図面、地図等が２００ｄｐｉ程度の解像度で作成されている場合に、画像に含まれる文字・線図が小さいと（たとえば、３ミリ四方程度だと）、当該画像をディスプレイに表示出力したときや、印刷出力したときに文字・線図の視認性が低下する場合がある。 However, when a color print document, drawing, map, etc. are created with a resolution of about 200 dpi, if the character / line diagram included in the image is small (for example, about 3 mm square), the image is displayed. In some cases, the visibility of characters / diagrams may be reduced when the information is displayed and printed.

モノクロ画像からなる文書、図面、地図等は２００ｄｐｉ程度の解像度では、たとえばディスプレイに表示したときに画質が劣って見える。このため、モノクロ画像からな文書等は、通常は、解像度４００ｄｐｉ以上の解像度で作成される。すなわち、モノクロスキャナでは（あるいは、カラースキャナのモノクロ読み取りモードでは）、４００ｄｐｉ程度の解像度で、文書、図面、地図等を読み取り、２値画像としてディスプレイに表示したり印刷したりすることができる。また、デジタルカメラでは、モノクロで撮影した場合には、撮影画像のデータ量は、カラーで撮影した場合のデータ量に比べて圧倒的に小さいので、撮影画像は４００ｄｐｉ程度の解像度で、ディスプレイに表示したり印刷したりすることができる。 Documents, drawings, maps, etc. composed of monochrome images appear inferior in image quality when displayed on a display, for example, at a resolution of about 200 dpi. For this reason, a document or the like composed of a monochrome image is usually created with a resolution of 400 dpi or higher. That is, with a monochrome scanner (or in a monochrome reading mode of a color scanner), a document, drawing, map, etc. can be read at a resolution of about 400 dpi and displayed on a display or printed as a binary image. Also, with a digital camera, when shooting in monochrome, the data amount of the captured image is overwhelmingly smaller than the data amount when shooting in color, so the captured image is displayed on the display with a resolution of about 400 dpi. Can be printed.

モノクロビットマップ画像は、４００ｄｐｉ程度の解像度であれば、画像中の文字が小さい場合（たとえば、３ミリ四方程度）であっても、高い視認性でディスプレイに表示することができるし、またプリンタにより高解像度で印刷出力することができる。 If the monochrome bitmap image has a resolution of about 400 dpi, it can be displayed on the display with high visibility even if the characters in the image are small (for example, about 3 mm square). Can be printed out with high resolution.

従来、２００ｄｐｉ程度以下の解像度、多階調のカラー画像に含まれる線図等を、モノクロの４００ｄｐｉと同様に、正確に再現しまたは改善して作成し、これをディスプレイに表示し、印刷し、または所定装置に送信することが望まれている。 Conventionally, a resolution of about 200 dpi or less, a line drawing included in a multi-tone color image, etc. are created by accurately reproducing or improving the same as monochrome 400 dpi, and this is displayed on a display, printed, Alternatively, it is desired to transmit to a predetermined device.

従来のカラービットマップ画像に描かれている文字を正確に再現し、または改善して作成するためには、（１）フィルタリング、（２）コントラスト強調、（３）モデルベースの画像復元、（４）高解像度化が用いられる。 To accurately reproduce or improve the characters drawn in a conventional color bitmap image, (1) filtering, (2) contrast enhancement, (3) model-based image restoration, (4 ) Higher resolution is used.

（１）のフィルタリングには、たとえばモルフォルジー（ＭｏｒｐｈｏｌｏｇｉｃａｌＯｐｅｒａｔｉｏｎ）を用いたノイズ除去方法（非特許文献１，非特許文献２参照）、細部をぼかさずにノイズを除去する２次フィルタを用いた方法（非特許文献３参照）がある。 For the filtering of (1), for example, a noise removal method using Morphologic (see Non-Patent Document 1 and Non-Patent Document 2), a method using a secondary filter that removes noise without blurring details ( Non-Patent Document 3).

（２）のコントラスト強調には、たとえば処理ウィンドウ（たとえば、３×３画素，５×５画素の処理領域）内の局所的統計量をもとにした非線形の階調変換（非特許文献４参照）がある。 For the contrast enhancement in (2), for example, nonlinear tone conversion based on local statistics within a processing window (for example, a processing region of 3 × 3 pixels and 5 × 5 pixels) (see Non-Patent Document 4) )

（３）のモデルベースの画像復元には、たとえばＯＣＲの誤認識の原因をクラスタ分析によりモデル化して画像復元する方法（非特許文献５参照）がある。 In model-based image restoration (3), for example, there is a method of modeling the cause of erroneous recognition of OCR by cluster analysis and restoring the image (see Non-Patent Document 5).

（４）の高解像度化としては、ビットマップ画像中の文字・線図を構成する画素をクラスタリングして平均することにより、任意の解像度のアウトラインを生成する方法（非特許文献６参照）、分布の双峰性、滑らかさ、輝度の３つをパラメータとする評価関数に基づき、逆問題として定式化し、最適な高解像度画像を復元する方法（非特許文献７参照）、補間による高解像度化と２値化に基づく方法（特許文献１，特許文献２，特許文献３，非特許文献８，非特許文献９参照）などがある。 (4) As an increase in resolution, a method of generating an outline of an arbitrary resolution by clustering and averaging pixels constituting a character / line diagram in a bitmap image (see Non-Patent Document 6), distribution Based on an evaluation function with three parameters of bimodality, smoothness, and luminance as a parameter, it is formulated as an inverse problem, and an optimal high-resolution image is restored (see Non-Patent Document 7). There are methods based on binarization (see Patent Literature 1, Patent Literature 2, Patent Literature 3, Non-Patent Literature 8, and Non-Patent Literature 9).

人間にとってのテキストの読みやすさという点からの画質の改善の両方を考えるならば、低解像度のビットマップ画像から高解像度の２値ビットマップ画像を復元するというアプローチが自然である。なお、理論的には、低解像度・多階調（あるいは多値）画像と高解像度・２値画像とは、サンプリング間隔、量子化レベル数、ＰＳＦ（点広がり関数：ｐｏｉｎｔｓｐｒｅａｄｆｕｎｃｔｉｏｎ）のぼけ効果に関して、ある関係が成り立てば、同じ情報量を持つことが知られている（非特許文献１０）。 Considering both improvement in image quality in terms of human text readability, an approach of restoring a high-resolution binary bitmap image from a low-resolution bitmap image is natural. Theoretically, a low-resolution / multi-gradation (or multi-value) image and a high-resolution / binary image have a blurring effect of sampling interval, number of quantization levels, and PSF (point spread function). Is known to have the same amount of information if a certain relationship is established (Non-patent Document 10).

特許第３３４５３５０号Japanese Patent No. 3345350 特開平８‐３４０４４６号JP-A-8-340446 特開２００１‐１１８０３２JP 2001-118032 A L. Koskinen, H. Huttunen, and J.T. Astola, Text enhancement method based on soft morphological filters Proceedings of SPIE, vol. 2181, pp. 243-253, 1994.L. Koskinen, H. Huttunen, and J.T.Astola, Text enhancement method based on soft morphological filters Proceedings of SPIE, vol. 2181, pp. 243-253, 1994. J. Liang, R.M. Haralick, and I.T. Phillips, Document image restoration using binary morphological filters, Proceedings of SPIE, vol. 2660, pp. 274-285, 1996.J. Liang, R.M.Halalick, and I.T.Phillips, Document image restoration using binary morphological filters, Proceedings of SPIE, vol. 2660, pp. 274-285, 1996. G. Ramponi and P. Fontanot, Enhancing document images with a quadratic filter, Signal Processing, vol. 33, pp. 23-34. 1993.G. Ramponi and P. Fontanot, Enhancing document images with a quadratic filter, Signal Processing, vol. 33, pp. 23-34. 1993. Y.C. Shin, R. Sridhar, V. Demjanenko, P.W. Palumbo, and J.J. Hull、 Contrast enhancement of mail piece images, Proceedings of SPIE, vol. 1661, pp. 27-37, 1992.Y.C. Shin, R. Sridhar, V. Demjanenko, P.W.Palumbo, and J.J.Hull, Contrast enhancement of mail piece images, Proceedings of SPIE, vol. 1661, pp. 27-37, 1992. M.Y. Jaisimha, E.A. Riskin, R. Ladner, and S. Werner, Model-based restoration of document images for OCR, Proceedings of SPIE, vol. 2660, pp. 297-308, 1996.1997M.Y.Jaisimha, E.A.Riskin, R. Ladner, and S. Werner, Model-based restoration of document images for OCR, Proceedings of SPIE, vol. 2660, pp. 297-308, 1996.1997 US5930393(T.K. Ho and J.D. Hobby; Lucent Technologies), July 27, 1999.US5930393 (T.K.Ho and J.D.Hobby; Lucent Technologies), July 27, 1999. P.D. Thouin and C.-I. Chang, A method for restoration of low-resolution document images, International Journal on Document Analysis and Recognition， vol. 2， pp. 200-210， 2000.P.D.Thouin and C.-I.Chang, A method for restoration of low-resolution document images, International Journal on Document Analysis and Recognition, vol. 2, pp. 200-210, 2000. US5524070(Y.-C. Shin, R. Sridhar, S.N. Srihari and V. Demjamenko; State University of New York, Buffalo), April 6, 1996.US5524070 (Y.-C. Shin, R. Sridhar, S.N.Srihari and V. Demjamenko; State University of New York, Buffalo), April 6, 1996. US6347156 (H. Kamada and K. Fujimoto; Fujitsu), February 12, 2002.US6347156 (H. Kamada and K. Fujimoto; Fujitsu), February 12, 2002. D. Lee, T. Pavlidis, and G.W. Wasilkowski, A note on the trade-off between sampling and quantization in signal processing, Journal of Complexity, vol. 3, pp. 359-371, 1987.D. Lee, T. Pavlidis, and G.W. Wasilkowski, A note on the trade-off between sampling and quantization in signal processing, Journal of Complexity, vol. 3, pp. 359-371, 1987. N.B. Karayiannis and A.N. Venetsanopoulos, Image interpolation based on variational principles, Signal Processing, vol. 25、 pp. 259-288, 1991.N.B.Karayiannis and A.N.Venetsanopoulos, Image interpolation based on variational principles, Signal Processing, vol. 25, pp. 259-288, 1991. A.D. Kulkarni and K. Sivaraman, Interpolation of digital imagery using hyperspace approximation, Signal Processing, vol. 7, pp. 65-73, 1984.A.D.Kulkarni and K. Sivaraman, Interpolation of digital imagery using hyperspace approximation, Signal Processing, vol. 7, pp. 65-73, 1984.

しかし、上述した従来方法には次のような問題点がある。
まず、（１）のビットマップのクラスタリングと平均化の方法では、同一の文書画像上に同じ文字について十分な数のサンプルが存在することを仮定している。この仮定は、ヨーロッパ系の言語のように、文字種が少ない場合（アルファベット、数字、記号等を合計しても高々１００種類程度）には成立するが、東洋系の言語のように文字種が多い場合（たとえば漢字の場合には数千〜数万）には成立しない場合が多い。 However, the conventional method described above has the following problems.
First, in the bitmap clustering and averaging method (1), it is assumed that a sufficient number of samples exist for the same character on the same document image. This assumption is valid when the number of character types is small as in European languages (about 100 at most including alphabets, numbers, symbols, etc.), but when there are many character types as in Eastern languages In many cases (for example, several thousand to several tens of thousands in the case of kanji) is not established.

次に、（４）の逆問題として定式化して解く方法であるが、計算量が多く、さらに、漢字のようにストロークの密度が高い場合には、後述する図１３（Ａ），（Ｂ），（Ｃ）にも示すように、ストロークと背景の輝度が逆転することもあり、分布の双峰性や輝度によって復元することが難しい場合が生じる。また、補間して２値化するという単純な方法でも、フォント形状の特徴をある程度は復元できる。しかし、後述する図１１（Ｂ）に示すように、輪郭のがたつきやストロークの切れが生じやすく、十分な画質が得られない。 Next, it is a method of formulating and solving as an inverse problem of (4). However, when the amount of calculation is large and the stroke density is high like kanji, FIGS. 13A and 13B described later. , (C), the brightness of the stroke and the background may be reversed, and it may be difficult to restore due to the bimodality of the distribution and the brightness. Even a simple method of interpolation and binarization can restore the font shape characteristics to some extent. However, as shown in FIG. 11B, which will be described later, it is easy for outlines to be rattled and strokes to be cut off, and sufficient image quality cannot be obtained.

画像が低解像度（たとえば２００ｄｐｉ程度）である場合、小さい文字（例えば、２００ｄｐｉで３ミリ四方）では、ストローク幅がサンプリング間隔と同じ程度になる。したがって、ストロークの輝度のばらつきが、統計的変動で説明できる範囲を超えるため、通常の２値化ではストロークが抽出できないことがある。 When the image has a low resolution (for example, about 200 dpi), the stroke width is about the same as the sampling interval for a small character (for example, 3 mm square at 200 dpi). Therefore, since the variation in the brightness of the stroke exceeds the range that can be explained by the statistical variation, the stroke may not be extracted by normal binarization.

ぼやけた（すなわち、不鮮明な）ストロークを抽出するようにパラメータを調整すると、今度は潰れが生じる。また、補間には、双１次補間、３次スプライン補間などのほかに、変分原理に基づくもの（非特許文献１１.）や直交多項式基底に基づくもの（非特許文献１２）などの様々な技術があるが、これらの補間技術は、自然画像（写真）には効果を発揮するが、文字・線図に特に効果があるものは存在しない。 If the parameters are adjusted to extract blurry (ie, blurry) strokes, this time collapse will occur. In addition to bilinear interpolation, cubic spline interpolation, etc., there are various types of interpolation such as those based on the variational principle (Non-Patent Document 11) and those based on orthogonal polynomial bases (Non-Patent Document 12). Although there are techniques, these interpolation techniques are effective for natural images (photos), but there are no techniques that are particularly effective for text and diagrams.

また、（２），（３）のクラスタリングによる方法では、文字輪郭の微細修正ができるものの、文字のストローク抽出等の処理をすることができない。 Further, although the method of clustering (2) and (3) can finely correct the character outline, it cannot perform processing such as character stroke extraction.

本発明の目的は、アルファベット等のキャラクタ数が少ない文字のみならず、簡単なアルゴリズムにより、漢字のようにキャラクタ数が多くかつ複雑な形状構造の文字、さらには細かい構成の線図を、原画像から正確に再現して作成し、または改善して作成し、これをディスプレイに表示し、プリンタに出力し、あるいは通信回線を介してそうチンすることができる画像出力装置、画像出力プログラムおよびこのプログラムが記録された記録媒体を提供することである。 An object of the present invention is not only a character with a small number of characters such as alphabets, but also a simple algorithm that has a large number of characters such as kanji and a complicated shape structure, and further shows a line diagram with a fine structure. Image output device, image output program, and program that can be accurately reproduced from or created by improvement, displayed on a display, output to a printer, or connected via a communication line Is to provide a recording medium on which is recorded.

〔１〕本発明は、「多値ビットマップの原画像を取得する原画像取得手段と、
前記原画像取得手段が取得した前記原画像から、当該原画像よりも高解像度の多値画像を生成する高解像度多値画像生成手段と、
前記原画像取得手段が取得した前記原画像から、ｘｙ座標が画素座標、ｚ座標が輝度である曲面を生成する輝度曲面生成手段と、
前記輝度曲面生成手段が生成した輝度曲面の地形的特徴を抽出する地形的特徴抽出手段と、
前記高解像度多値画像生成手段が生成した前記多値画像に含まれる文字・線図の輪郭に、前記地形的特徴抽出手段が抽出した地形的特徴を組み込むことで、前記文字・線図を再現または改善した２値画像を生成する地形的特徴組み込み手段と、
前記地形的特徴組み込み手段が生成した前記２値画像、または当該２値画像を多値化した画像に含まれる文字・線図の輪郭を修正する輪郭修正手段と、
前記輪郭修正手段により文字・線図の輪郭が修正された画像を、表示装置、印刷装置またはネットワークを介した接続された外部装置に出力する画像出力手段と、
を備えたことを特徴とする画像出力装置。」
を要旨とする。 [1] The present invention provides: “Original image acquisition means for acquiring an original image of a multilevel bitmap;
From the original image acquired by the original image acquisition means, a high-resolution multi-value image generation means for generating a multi-value image having a higher resolution than the original image;
A luminance curved surface generating means for generating a curved surface in which the xy coordinates are pixel coordinates and the z coordinate is luminance from the original image acquired by the original image acquiring means;
Topographic feature extraction means for extracting topographic features of the luminance curved surface generated by the luminance curved surface generation means;
The character / line diagram is reproduced by incorporating the topographic feature extracted by the topographic feature extracting unit into the outline of the character / line diagram included in the multi-level image generated by the high-resolution multi-level image generating unit. Or a topographic feature embedding means for generating an improved binary image;
Contour correcting means for correcting a contour of a character / line diagram included in the binary image generated by the topographic feature incorporation means or an image obtained by multi-valued the binary image;
Image output means for outputting an image in which the contour of the character / line diagram is corrected by the contour correcting means to a display device, a printing device or an external device connected via a network;
An image output apparatus comprising: "
Is the gist.

〔２〕本発明は、「前記地形的特徴は、輝度曲面を実際の地形に対応させたときに、周囲よりも輝度が低い「谷または窪地」、周囲よりも輝度が高い「尾根または山頂」、「谷または窪地」と「尾根または山頂」との間に位置する「山腹または鞍部」であることを特徴とする〔１〕に記載の画像出力装置。」
を要旨とする。 [2] According to the present invention, “the topographical feature is a“ valley or depression ”whose brightness is lower than the surroundings when the brightness curved surface is made to correspond to the actual topography, and“ ridge or mountaintop ”whose brightness is higher than the surroundings. The image output device according to [1], wherein the image output device is a “hillside or ridge” located between a “valley or depression” and a “ridge or mountaintop”. "
Is the gist.

〔３〕本発明は、「さらに、前記画像出力手段が出力する画像の前記文字・線図または更に背景の色彩が、前記原画像取得手段が取得した前記原画像における色に近似させて生成されていることを特徴とする〔１〕または〔２〕に記載の画像出力装置。」
を要旨とする。 [3] According to the present invention, “the character / line diagram or further background color of the image output by the image output means is generated by approximating the color in the original image acquired by the original image acquisition means”. The image output device according to [1] or [2].
Is the gist.

〔４〕本発明は、「前記原画像取得手段は、前記原画像がカラー画像であるときは、当該原画像をグレイスケール多値画像に変換するカラー／グレイスケール変換手段を備えたことを特徴とする〔１〕から〔３〕の何れかに記載の画像出力装置。」
を要旨とする。 [4] The present invention is characterized in that "the original image acquisition means includes color / grayscale conversion means for converting the original image into a grayscale multilevel image when the original image is a color image. The image output device according to any one of [1] to [3].
Is the gist.

〔５〕本発明は、「前記地形的特徴組み込み手段は、前記輝度曲面のうち極小部分が線状または帯状に連続する領域および前記極小部分が局在する領域を、文字・線図の一部として前記元画像における文字・線図を再現または改善した文字・線図を再現または改善した前記２値画像を生成することを特徴とする〔１〕から〔４〕の何れかに記載の画像出力装置。」
を要旨とする。 [5] According to the present invention, “the topographic feature incorporation means includes a region of the luminance curved surface in which a minimal portion is continuous in a linear or belt shape and a region in which the minimal portion is localized. The image output according to any one of [1] to [4], wherein the binary image in which the character / diagram in the original image is reproduced or improved is reproduced or improved apparatus."
Is the gist.

〔６〕本発明は、「前記高解像度多値画像生成手段が生成した前記高解像度多値画像から、各画素の周囲画素を参照した統計情報に基づき、（ａ）文字・線図を構成する画素と、（ｂ）文字・線図を構成しない画素と、（ｃ）文字・線図を構成するか否かが確定されていない画素とからなる基本画像（地形的特徴の組み込みがなされる画像）を生成する基本画像生成手段を備え、
前記地形的特徴組み込み手段は、前記基本画像生成手段が生成した前記基本画像に基づき前記文字・線図を再現または改善した２値画像を生成することを特徴とする〔１〕から〔５〕の何れかに記載の画像出力装置。」
を要旨とする。 [6] According to the present invention, (a) a character / line diagram is configured based on statistical information referring to surrounding pixels of each pixel from the high-resolution multi-value image generated by the high-resolution multi-value image generation unit. A basic image (an image in which topographic features are incorporated) composed of pixels, (b) pixels that do not constitute a character / line diagram, and (c) pixels that are not determined whether to constitute a character / line diagram. ) To generate basic image generation means,
The topographic feature incorporation means generates a binary image that reproduces or improves the character / line diagram based on the basic image generated by the basic image generation means. The image output apparatus according to any one of the above. "
Is the gist.

〔７〕本発明は、「前記地形的特徴組み込み手段は、前記（ｃ）文字・線図を構成するか否かが確定されていない画素が、前記輝度曲面のうち極小部分が線状または帯状に連続する領域または前記極小部分が局在する領域に含まれるときは、当該画素が文字・線図を構成するものとして、文字・線図を再現または改善した前記２値画像を生成することを特徴とする〔６〕に記載の画像出力装置。」
を要旨とする。 [7] According to the present invention, “the topographic feature incorporation means has (c) a pixel for which it is not determined whether or not to constitute a character / line diagram, a minimum portion of the luminance curved surface is linear or band-shaped. If the pixel is included in a region that is continuous with the region or the region where the local minimum portion is localized, the binary image in which the pixel / line diagram is reproduced or improved is assumed to constitute the character / line diagram. The image output device according to [6], which is characterized.
Is the gist.

〔８〕本発明は、「前記高解像度多値画像生成手段が生成した高解像度多値画像の各画素における輝度勾配量を検出する輝度勾配量検出手段を備え、
前記輪郭修正手段は、前記輝度勾配量検出手段が検出した各画素の輝度勾配量により前記地形的特徴組み込み手段が作成した文字・線図を再現または改善した前記画像中の文字・線図の輪郭を修正する第１の輪郭修正手段を有する、
ことを備えたことを特徴とする〔１〕から〔７〕の何れかに記載の画像出力装置。」
を要旨とする。 [8] The present invention includes: “a luminance gradient amount detection unit that detects a luminance gradient amount in each pixel of the high resolution multilevel image generated by the high resolution multilevel image generation unit;
The contour correcting unit reproduces or improves the character / line diagram created by the topographic feature incorporation unit based on the luminance gradient amount of each pixel detected by the luminance gradient amount detecting unit. First contour correcting means for correcting
The image output device according to any one of [1] to [7], comprising: "
Is the gist.

〔９〕本発明は、「前記輪郭修正手段は、前記第１の輪郭修正手段が修正した前記画像中の文字・線図の輪郭を形成する画素列の曲率または方向変化を参照してさらに当該画像中の文字・線図の輪郭を修正する第２の輪郭修正手段を有することを特徴とする〔８〕に記載の画像出力装置。」
を要旨とする。 [9] According to the present invention, “the contour correcting unit further refers to a curvature or direction change of a pixel row forming a contour of a character / line diagram in the image corrected by the first contour correcting unit. [8] The image output apparatus according to [8], further including second contour correcting means for correcting a contour of a character / line diagram in an image.
Is the gist.

〔１０〕本発明は、「前記第２の輪郭修正手段は、前記第１の輪郭修正手段が修正した前記画像中の文字・線図の輪郭を構成する画素列を、接線方向に基づいてクラスタリングして、各クラスタについて前記輪郭の円滑化および／または角部の角度鮮明化により、前記画像中の文字・線図の輪郭を修正することを特徴とする〔９〕に記載の画像出力装置。」
を要旨とする。 [10] According to the present invention, “the second contour correcting unit clusters the pixel columns constituting the contour of the character / line diagram in the image corrected by the first contour correcting unit based on the tangent direction. Then, the image output device according to [9], wherein the contour of the character / line diagram in the image is corrected by smoothing the contour and / or sharpening the corner of each cluster. "
Is the gist.

〔１１〕本発明は、「多値ビットマップの原画像を取得する原画像取得手段と、
前記原画像取得手段が取得した前記原画像から、当該原画像よりも高解像度の多値画像を生成する高解像度多値画像生成手段と、
前記高解像度多値画像生成手段が生成した前記多値画像から、ｘｙ座標が画素座標、ｚ座標が輝度である曲面を生成する輝度曲面生成手段と、
前記輝度曲面生成手段が生成した輝度曲面の地形的特徴を抽出する地形的特徴抽出手段と、
前記地形的特徴抽出手段が抽出した前記地形的特徴を組み込むことで、前記原画像中の文字・線図を再現または改善した２値画像を作成する地形的特徴組み込み手段と、
前記地形的特徴組み込み手段が生成した前記２値画像、または当該２値画像を多値化した画像に含まれる文字・線図の輪郭を修正する輪郭修正手段と、
前記輪郭修正手段により文字・線図の輪郭が修正された画像を、表示装置、印刷装置またはネットワークを介した接続された外部装置に出力する画像出力手段と、
を備えたことを特徴とする画像出力装置。」
を要旨とする。 [11] The present invention provides: “Original image acquisition means for acquiring an original image of a multilevel bitmap;
From the original image acquired by the original image acquisition means, a high-resolution multi-value image generation means for generating a multi-value image having a higher resolution than the original image;
A luminance curved surface generating means for generating a curved surface in which the xy coordinates are pixel coordinates and the z coordinate is luminance from the multi-value image generated by the high-resolution multi-value image generating means;
Topographic feature extraction means for extracting topographic features of the luminance curved surface generated by the luminance curved surface generation means;
Topographic feature incorporation means for creating a binary image that reproduces or improves a character / line diagram in the original image by incorporating the topographic feature extracted by the topographic feature extraction means;
Contour correcting means for correcting a contour of a character / line diagram included in the binary image generated by the topographic feature incorporation means or an image obtained by multi-valued the binary image;
Image output means for outputting an image in which the contour of the character / line diagram is corrected by the contour correcting means to a display device, a printing device or an external device connected via a network;
An image output apparatus comprising: "
Is the gist.

〔１２〕本発明は、「前記地形的特徴は、輝度曲面を実際の地形に対応させたときに、周囲よりも輝度が低い「谷または窪地」、周囲よりも輝度が高い「尾根または山頂」、「谷または窪地」と「尾根または山頂」との間に位置する「山腹または鞍部」であることを特徴とする〔１１〕に記載の画像出力装置。」
を要旨とする。 [12] According to the present invention, “the topographic feature is a“ valley or depression ”having a lower luminance than the surroundings when the luminance curved surface is made to correspond to the actual terrain, and a“ ridge or mountaintop ”having a higher luminance than the surroundings. [11] The image output device according to [11], wherein the image output device is a “hillside or ridge” located between a “valley or depression” and a “ridge or mountaintop”. "
Is the gist.

〔１３〕本発明は、「さらに、前記画像出力手段が出力する画像の前記文字・線図または更に背景の色彩が、前記原画像取得手段が取得した前記原画像における色に近似させて生成されていることを特徴とする〔１１〕または〔１２〕に記載の画像出力装置。」
を要旨とする。 [13] According to the present invention, “the character / line diagram or further background color of the image output by the image output means is generated by approximating the color in the original image acquired by the original image acquisition means”. The image output device according to [11] or [12].
Is the gist.

〔１４〕本発明は、「前記原画像取得手段は、前記原画像がカラー画像であるときは、当該原画像をグレイスケール多値画像に変換するカラー／グレイスケール変換手段を備えたことを特徴とする〔１１〕から〔１３〕の何れかに記載の画像出力装置。」
を要旨とする。 [14] The present invention is characterized in that the original image acquisition means includes a color / grayscale conversion means for converting the original image into a grayscale multilevel image when the original image is a color image. The image output device according to any one of [11] to [13].
Is the gist.

〔１５〕本発明は、「前記地形的特徴組み込み手段は、前記輝度曲面のうち極小部分が線状または帯状に連続する領域および前記極小部分が局在する領域を、文字・線図の一部として前記元画像における文字・線図を再現または改善した文字・線図を再現または改善した前記２値画像を生成することを特徴とする〔１１〕から〔１４〕の何れかに記載の画像出力装置。」
を要旨とする。 [15] According to the present invention, “the topographic feature incorporation means includes a part of a character / diagram in which the minimum part of the luminance curved surface is continuous in a linear or belt-like form and the region in which the minimum part is localized. The image output according to any one of [11] to [14], wherein the binary image in which the character / line diagram in the original image is reproduced or improved is reproduced or improved apparatus."
Is the gist.

〔１６〕本発明は、「前記高解像度多値画像生成手段が生成した前記高解像度多値画像から、各画素の周囲画素を参照した統計情報に基づき、（ａ）文字・線図を構成する画素と、（ｂ）文字・線図を構成しない画素と、（ｃ）文字・線図を構成するか否かが確定されていない画素とからなる基本画像を生成する基本画像生成手段を備え、
前記地形的特徴組み込み手段は、前記基本画像生成手段が生成した前記基本画像に基づき前記文字・線図を再現または改善した２値画像を生成することを特徴とする〔１１〕から〔１５〕の何れかに記載の画像出力装置。」
を要旨とする。 [16] According to the present invention, “(a) a character / line diagram is constructed based on statistical information referring to surrounding pixels of each pixel from the high-resolution multi-value image generated by the high-resolution multi-value image generation means”. Basic image generation means for generating a basic image comprising pixels, (b) pixels that do not constitute a character / line diagram, and (c) pixels that are not determined whether to constitute a character / line diagram,
[11] to [15], wherein the topographic feature incorporation means generates a binary image in which the character / line diagram is reproduced or improved based on the basic image generated by the basic image generation means. The image output apparatus according to any one of the above. "
Is the gist.

〔１７〕本発明は、「前記地形的特徴組み込み手段は、前記（ｃ）文字・線図を構成するか否かが確定されていない画素が、前記輝度曲面のうち極小部分が線状または帯状に連続する領域または前記極小部分が局在する領域に含まれるときは、当該画素が文字・線図を構成するものとして、文字・線図を再現または改善した前記２値画像を生成することを特徴とする〔１６〕に記載の画像出力装置。」
を要旨とする。 [17] The present invention is as follows: "The topographic feature incorporation means has (c) a pixel for which it is not determined whether or not to constitute a character / line diagram, and a minimum portion of the luminance curved surface is linear or band-shaped. If the pixel is included in a region that is continuous with the region or the region where the local minimum portion is localized, the binary image in which the pixel / line diagram is reproduced or improved is assumed to constitute the character / line diagram. The image output device as set forth in [16], which is characterized.
Is the gist.

〔１８〕本発明は、「前記高解像度多値画像生成手段が生成した高解像度多値画像の各画素における輝度勾配量を検出する輝度勾配量検出手段を備え、
前記輪郭修正手段は、前記輝度勾配量検出手段が検出した各画素の輝度勾配量により前記地形的特徴組み込み手段が作成した文字・線図を再現または改善した前記画像中の文字・線図の輪郭を修正する第１の輪郭修正手段を有する、
ことを備えたことを特徴とする〔１１〕から〔１７〕の何れかに記載の画像出力装置。」
を要旨とする。 [18] The present invention includes: “a luminance gradient amount detection unit that detects a luminance gradient amount in each pixel of the high resolution multilevel image generated by the high resolution multilevel image generation unit;
The contour correcting unit reproduces or improves the character / line diagram created by the topographic feature incorporation unit based on the luminance gradient amount of each pixel detected by the luminance gradient amount detecting unit. First contour correcting means for correcting
The image output device according to any one of [11] to [17], wherein "
Is the gist.

〔１９〕本発明は、「前記輪郭修正手段は、前記第１の輪郭修正手段が修正した前記画像中の文字・線図の輪郭を形成する画素列の曲率または方向変化を参照してさらに当該画像中の文字・線図の輪郭を修正する第２の輪郭修正手段を有することを特徴とする〔１８〕に記載の画像出力装置。」
を要旨とする。 [19] The present invention provides: “The contour correcting means further refers to a curvature or direction change of a pixel row forming a contour of a character / line diagram in the image corrected by the first contour correcting means. The image output apparatus according to [18], further comprising second contour correcting means for correcting a contour of a character / line diagram in an image.
Is the gist.

〔２０〕本発明は、「前記第２の輪郭修正手段は、前記第１の輪郭修正手段が修正した前記画像中の文字・線図の輪郭を構成する画素列を、接線方向に基づいてクラスタリングして、各クラスタについて前記輪郭の円滑化および／または角部の角度鮮明化により、前記画像中の文字・線図の輪郭を修正することを特徴とする〔１９〕に記載の画像出力装置。」
を要旨とする。 [20] According to the present invention, “the second contour correcting unit clusters the pixel columns constituting the contour of the character / diagram in the image corrected by the first contour correcting unit based on the tangent direction. The image output device according to [19], wherein the contour of the character / line diagram in the image is corrected by smoothing the contour and / or sharpening the corners of each cluster. "
Is the gist.

〔２１〕本発明は、「コンピュータを、〔１〕から〔２０〕の何れかに記載の各手段として機能させる画像出力プログラム。」
を要旨とする。 [21] The present invention provides an “image output program that causes a computer to function as each means according to any one of [1] to [20].”
Is the gist.

〔２２〕本発明は、「コンピュータを、〔１〕から〔２０〕の何れかに記載の各手段として実行させるためのプログラムを記録したコンピュータ読み取り可能な記録媒体。」を要旨とする [22] The gist of the present invention is “a computer-readable recording medium on which a program for causing a computer to be executed as each means according to any of [1] to [20]” is recorded.

なお、本発明の画像出力装置における各手段の一部または全部を１つまたは複数のハードウェア（ＤＳＰ等）により構成することができる。 In addition, a part or all of each means in the image output apparatus of the present invention can be configured by one or a plurality of hardware (DSP or the like).

本発明によれば、漢字のような複雑な形状構造をもつ文字・線図を高い認識精度で再現することができる。また、簡単な計算で処理を行なうので、高速に文字・線図を再現して、ディスプレイに表示し、プリンタにより出力し、あるいは携帯型電話機糖の通信端末に送信することができる。 According to the present invention, it is possible to reproduce a character / line diagram having a complicated shape structure such as kanji with high recognition accuracy. Further, since the processing is performed by simple calculation, it is possible to reproduce a character / line diagram at high speed, display it on a display, output it by a printer, or transmit it to a communication terminal of a portable telephone sugar.

図１は本発明の画像出力装置の一構成例を示す図である。
図１において、パーソナルコンピュータ（ＰＣ）１１０は画像出力装置として動作するもので、ＣＰＵ１１１と、メモリ１１２（ＲＯＭ１１２１とＲＡＭ１１２２とからなる）と、ハードディスク装置１１３と、リムーバブルディスク装置１１４と、ディスプレイ・インタフェース１１５と、プリンタ・インタフェース１１６と、キーボード１１７と、ネットワーク・インタフェース１１８とがバス１１９に接続されて構成されている。 FIG. 1 is a diagram showing a configuration example of an image output apparatus according to the present invention.
In FIG. 1, a personal computer (PC) 110 operates as an image output device, and includes a CPU 111, a memory 112 (consisting of a ROM 1121 and a RAM 1122), a hard disk device 113, a removable disk device 114, and a display interface 115. A printer interface 116, a keyboard 117, and a network interface 118 are connected to a bus 119.

図１では、ディスプレイ１１５１はディスプレイ・インタフェース１１５に接続され、プリンタ１１６１はプリンタ・インタフェース１１６に接続され、ネットワーク・インタフェース１１８は通信回線１００に接続されている。 In FIG. 1, the display 1151 is connected to the display interface 115, the printer 1161 is connected to the printer interface 116, and the network interface 118 is connected to the communication line 100.

メモリ１１２には、後述する原画像取得手段１１，高解像度多値画像生成手段１２，基本画像生成手段１３，輝度曲面生成手段１４，地形的特徴抽出手段１５，地形的特徴組み込み手段１６，輝度勾配量検出手段１７，輪郭修正手段（第１の輪郭修正手段１８１，第２の輪郭修正手段１８２）として機能するプログラムが格納されている。なお、画像出力手段１９は、ディスプレイ・インタフェース１１５，プリンタ・インタフェース１１６，ネットワーク・インタフェース１１８として示されている。本発明の画像出力手段は、これらのインタフェース（ハードウェア）と、こららの駆動回路および駆動ソフトウェア（ドライバプログラム等）により構成される。上記のプログラムが本発明の画像出力プログラムを構成する。 In the memory 112, an original image acquisition unit 11, a high-resolution multilevel image generation unit 12, a basic image generation unit 13, a luminance curved surface generation unit 14, a topographic feature extraction unit 15, a topographic feature incorporation unit 16, a luminance gradient, which will be described later. A program that functions as the amount detection unit 17 and the contour correction unit (the first contour correction unit 181 and the second contour correction unit 182) is stored. The image output means 19 is shown as a display interface 115, a printer interface 116, and a network interface 118. The image output means of the present invention is constituted by these interfaces (hardware), these drive circuits, and drive software (driver program etc.). The above program constitutes the image output program of the present invention.

ここでは、画像出力の処理対象となることができるカラービットマップ画像がリムーバブルディスク装置１１４に格納されているものとする。また、本発明の画像出力プログラムがハードディスク装置１１３に格納されているものとし、ユーザが画像出力プログラムを起動すると、画像出力プログラム（「ＧＯＰ」で示す）はＲＡＭ１１２に読み込まれ、画像作成処理が可能になる。 Here, it is assumed that a color bitmap image that can be processed for image output is stored in the removable disk device 114. Further, it is assumed that the image output program of the present invention is stored in the hard disk device 113, and when the user starts the image output program, the image output program (indicated by “GOP”) is read into the RAM 112 and image creation processing is possible. become.

正確に原画像を再現するためあるいはまたは改善して原画像を再現するためには、様々な情報を多角的（トップダウン／ボトムアップ）に利用することが有効である。
本発明では、画像解析によって得られる多様な特徴を統合して、補間と局所統計量により生成した基本画像を、輝度曲面の地形的特徴を導入して修正する手法を採用することで再現性の高い画像、あるいは改善された画像を出力することができる。
本発明における処理は、（１）〜（５）により構成される。 In order to accurately reproduce the original image or to reproduce the original image with improvement, it is effective to use various information in various ways (top-down / bottom-up).
The present invention integrates various features obtained by image analysis and adopts a technique for correcting the basic image generated by interpolation and local statistics by introducing the topographic features of the luminance curved surface. A high image or an improved image can be output.
The process in this invention is comprised by (1)-(5).

（１）地形的特徴による欠落ストロークの補完
処理対象画像（多値ビットマップの原画像）から、輝度曲面ｚ＝ｆ（ｘ，ｙ）を形成する。この輝度曲面ｚ＝ｆ（ｘ，ｙ）の地形的特徴（尾根、峡谷、山頂、窪地、山腹、鞍部）を調べる。なお、原画像がカラービットマップ画像である場合には、原画像をグレイスケールの多値ビットマップ画像に変換し、これを処理対象画像とする。 (1) Interpolation of missing strokes by topographic features A luminance curved surface z = f (x, y) is formed from a processing target image (original image of a multi-value bitmap). The topographic features (ridge, canyon, summit, depression, hillside, ridge) of this luminance curved surface z = f (x, y) are examined. When the original image is a color bitmap image, the original image is converted into a grayscale multi-value bitmap image, which is used as a processing target image.

輝度曲面ｆ（ｘ，ｙ）上で，各画素について周囲との高さを比較すると，周囲より低い（暗い）画素の連なりであるストロークの部分（すなわち、「谷または窪地」）、周囲よりも高い（明るい）画素の連なりであるストローク間のギャップに相当する部分（すなわち、「尾根または山頂」）、その他（背景：すなわち、「山腹または鞍部」）の３レベルに分類できる。図１０は、輝度曲面ｚ＝ｆ（ｘ，ｙ）により，各画素をこのように分類した地形特徴図ＧｉｏＣであり、ここでは、多値ビットマップ画像中の日本語４文字（Ｃ１：「策」，Ｃ２：「基」，Ｃ３：「静」，Ｃ４：「か」）を例に挙げてある。
多値ビットマップ画像中の日本語４文字（Ｃ１，Ｃ２，Ｃ３，Ｃ４）を３レベル（高，中，低）で表した場合、当該多値ビットマップ画像が低解像度（たとえば、２００ｄｐｉ程度）でも、文字の特徴は欠落することなく保存されている。
したがって、特に、上述した漢字のように複雑な構造を持つ画像からストロークを抽出するような場合に、上記輝度曲面のレベルを参照することは、極めて効果的である。 Comparing the height of each pixel with the surroundings on the luminance curved surface f (x, y), a stroke portion (that is, “valley or depression”) that is a series of pixels that are lower (darker) than the surroundings, than the surroundings. It can be classified into three levels: a portion corresponding to a gap between strokes that is a series of high (bright) pixels (ie, “ridge or peak”), and other (background: ie, “hillside or ridge”). FIG. 10 is a topographic feature map GioC in which each pixel is classified in this way by the luminance curved surface z = f (x, y). Here, four Japanese characters (C1: “skill” in the multilevel bitmap image are shown. , C2: “group”, C3: “still”, C4: “ka”).
When four Japanese characters (C1, C2, C3, C4) in a multilevel bitmap image are represented by three levels (high, medium, low), the multilevel bitmap image has a low resolution (for example, about 200 dpi). But the character features are preserved without loss.
Therefore, it is extremely effective to refer to the level of the luminance curved surface, particularly when a stroke is extracted from an image having a complicated structure such as the above-described Chinese character.

そこで、後述する各実施形態では、低解像度の原画像で計算した地形的特徴を取り入れることで、当該原画像をそのまま２値化した場合に欠落してしまうストローク（図１１（Ｂ）参照）を補完している。
たとえば、図１０に示したような地形特徴図ＧｉｏＣの「峡谷」部分（黒の部分）は、極小部分が線状または帯状に連続する領域であり「ストロークの長い線分」に相当する。また、「窪地」部分は、極小部分が局在する領域であり「点あるいはストロークが短い線分」に相当する。
なお、「峡谷」の端部近傍に、他の「峡谷」や「窪地」が存在していれば、当該「峡谷」は他の「峡谷」や「窪地」と連続するはずであり、「窪地」の端部近傍に、他の「区窪地」や「峡谷」が存在していれば、当該「窪地」は他の「窪地」や「峡谷」と連続するはずである。したがって、「峡谷」や「峡谷」の周囲画素を参照することで、ストロークを正確に、または改善して作成することもできる。
Wang等の文献（L. Wang and T. Pavlidis、 Direct gray-scale extraction of features for character recognition、 IEEE Transactions on Pattern Analysis and Machine Intelligence、 vol. 15、 no. 10、 pp. 1053-1067、 1993.）で述べられているように、地形的特徴を考察することは、特に、低解像度の画像において有効である。 Therefore, in each of the embodiments described later, a stroke (see FIG. 11B) that is lost when the original image is binarized as it is by incorporating the topographic features calculated from the low-resolution original image. Complement.
For example, the “gorge” portion (black portion) of the topographic feature map GioC as shown in FIG. 10 is a region in which the minimal portion is continuous in a line shape or a belt shape, and corresponds to a “line segment with a long stroke”. Further, the “recess” portion is a region where the minimal portion is localized, and corresponds to “a line segment having a short point or stroke”.
If there are other “canyons” or “recesses” in the vicinity of the end of the “canyon”, the “canyons” should be connected to other “canyons” or “recesses”. If there is another “kubochi” or “gorge” in the vicinity of the end of “,” the “depression” should be continuous with the other “pit” or “canyon”. Therefore, the stroke can be created accurately or with improved by referring to the surrounding pixels of the “canyon” and “canyon”.
Wang et al. (L. Wang and T. Pavlidis, Direct gray-scale extraction of features for character recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 10, pp. 1053-1067, 1993.) As discussed in, it is particularly useful in low resolution images to consider topographic features.

（２）補間画像の生成と局所統計量による基本画像の生成処理
計算が簡単であり、扱う対象が限定されない（すなわち、どのような文字，線図をも扱うことができる）という一般性を考慮し、以下の処理を施す。
（ｉ）補間により原画像から高解像度多値ビットマップ画像を生成し、この画像に（１）の地形的特徴による欠落ストロークの補完処理を施すことができる。
（ｉｉ）原画像から局所統計量による基本画像を生成し、この画像に（１）の地形的特徴による欠落ストロークの補完処理を施すことができる。基本画像とは、図１２（Ａ）のように、各画素の輝度値は３レベルの値をとることができ、「ＯＮ」（文字・線図を構成する画素、ここでは、ｚ＝ｆ（ｘ，ｙ）＝１とする）を黒の領域、「ＯＦＦ」（文字・線図を構成しない画素、ここでは、ｚ＝ｆ（ｘ，ｙ）＝０とする）を白の領域、「ＴＢＤ」（文字・線図を構成するか否かが確定されていない画素、ここでは、ｚ＝ｆ（ｘ，ｙ）＝−１とする）をグレイの領域とした画像である。
（ｉｉｉ）補間により原画像から高解像度多値ビットマップ画像を生成し、この画像から局所統計量による基本画像を生成する。そして、この基本画像に（１）の地形的特徴による欠落ストロークの補完処理を施すことができる。 (2) Interpolated image generation and basic image generation processing based on local statistics Considering the generality that the calculation is simple and the objects to be handled are not limited (that is, any character or diagram can be handled). Then, the following processing is performed.
(I) A high-resolution multilevel bitmap image can be generated from the original image by interpolation, and the missing stroke complementing process according to the topographic feature of (1) can be applied to this image.
(Ii) A basic image based on local statistics is generated from the original image, and the missing stroke complementing process based on the topographic features of (1) can be performed on the image. With the basic image, as shown in FIG. 12A, the luminance value of each pixel can take a value of three levels, and “ON” (pixels constituting a character / line diagram, here z = f ( x, y) = 1) is a black region, “OFF” (a pixel not constituting a character / line diagram, here z = f (x, y) = 0) is a white region, “TBD” ”(Pixels for which it is not determined whether or not to constitute a character / line diagram, here z = f (x, y) = − 1) is a gray region.
(Iii) A high-resolution multilevel bitmap image is generated from the original image by interpolation, and a basic image based on local statistics is generated from this image. Then, the missing stroke complementing process based on the topographic feature (1) can be performed on the basic image.

（３）輪郭の第１の修正
（１），（２）により欠落したストロークを補完できたとしても、輪郭線がなめらかでないため、文字画像の品質としては、貧弱となることがある。
なめらかな輪郭線の基準として、曲率が小さい（曲率半径が大きい）ことが必要となる。また、輪郭線は、輝度勾配量が極大となるような画素（すなわち、白領域と黒領域との境界）を通過することが前提となる。
そこで、現在では画像処理の標準的手法となっている、Active Contour Model、またはSnakeの手法（D. Williams and M. Shah、 A fast algorithm for active contours and curvature estimation、 CVGIP: Image Understanding、 vol. 55、 no. 1、 pp. 14-26、 1992.）を用いて輪郭線を修正する（図１４の修正結果参照）。 (3) First correction of outline Even if the missing strokes can be complemented by (1) and (2), the outline is not smooth, and the quality of the character image may be poor.
As a reference for a smooth contour line, it is necessary that the curvature is small (the radius of curvature is large). Further, it is assumed that the contour line passes through a pixel having a maximum luminance gradient amount (that is, a boundary between the white area and the black area).
Therefore, the Active Contour Model or Snake method (D. Williams and M. Shah, A fast algorithm for active contours and curvature estimation, CVGIP: Image Understanding, vol. 55 No. 1, pp. 14-26, 1992.) to correct the contour line (see the correction result in FIG. 14).

（４）輪郭の第２の修正
人間の視覚にとって気になる点である、水平・垂直方向の線のがたつきや、垂直線と水平線の交差部分の鈍りを補正するために、輪郭の修正を行うこともできる。
（５）画像の出力
文字・線図の輪郭が復元または改善されている画像を、ディスプレイ（表示手段）や、プリンタに出力する。 (4) Second correction of the contour Correction of the contour in order to correct the shakiness of the horizontal and vertical lines and the dullness at the intersection of the vertical and horizontal lines, which are points of concern for human vision. Can also be done.
(5) Image output An image whose character / line diagram outline is restored or improved is output to a display (display means) or a printer.

以下、本発明の画像出力装置の実施形態を説明する。
《第１実施形態》
図２に第１実施形態の画像出力装置１Ａの構成を示す。
第１実施形態では、画像出力装置１Ａは、原画像取得手段１１、高解像度多値画像生成手段１２、基本画像生成手段１３、輝度曲面生成手段１４、地形的特徴抽出手段１５、地形的特徴組み込み手段１６、輝度勾配量検出手段１７、第1の輪郭修正手段１８１、および画像出力手段１９を備えている。
原画像取得手段１１は原画像取得プログラムにより、高解像度多値画像生成手段１２は高解像度多値画像生成プログラムにより、基本画像生成手段１３は基本画像生成プログラムにより、輝度曲面生成手段１４は輝度曲面生成プログラムにより、地形的特徴抽出手段１５は地形的特徴抽出プログラムにより、地形的特徴組み込み手段１６は地形的特徴組み込みプログラムにより、輝度勾配量検出手段１７は輝度勾配量検出プログラムにより、第1の輪郭修正手段１８１は第1の輪郭修正プログラムにより、それぞれ実現することができる。画像出力手段１９は、前述したようにハードウェアおよびソフトウェアにより実現することができる。 Hereinafter, embodiments of the image output apparatus of the present invention will be described.
<< First Embodiment >>
FIG. 2 shows the configuration of the image output apparatus 1A of the first embodiment.
In the first embodiment, the image output apparatus 1A includes an original image acquisition unit 11, a high-resolution multilevel image generation unit 12, a basic image generation unit 13, a luminance curved surface generation unit 14, a topographic feature extraction unit 15, and a topographic feature incorporation. Means 16, brightness gradient amount detection means 17, first contour correction means 181 and image output means 19 are provided.
The original image acquisition means 11 is an original image acquisition program, the high resolution multi-value image generation means 12 is a high-resolution multi-value image generation program, the basic image generation means 13 is a basic image generation program, and the luminance curved surface generation means 14 is a luminance curved surface. According to the generation program, the topographic feature extraction means 15 is a topographic feature extraction program, the topographic feature incorporation means 16 is a topographic feature incorporation program, and the luminance gradient amount detection means 17 is a first gradient contour detection program. The correction means 181 can be realized by the first contour correction program. The image output means 19 can be realized by hardware and software as described above.

原画像取得手段１１は、多値ビットマップの原画像を取得することができ、この原画像がカラーであるときは、当該原画像をグレイスケール多値画像に変換するカラー／グレイスケール変換手段１１１を備えている。
第１実施形態では、高解像度多値画像生成手段１２は、原画像取得手段１１が取得した原画像から、当該原画像の解像度よりも高い解像度の多値画像を生成することができる。
基本画像生成手段１３は、高解像度多値画像生成手段１２が生成した多値画像から、各画素の周囲画素を参照した統計情報に基づき、（ａ）文字・線図を構成する画素と、（ｂ）文字・線図を構成しない画素と、（ｃ）文字・線図を構成するか否かが確定されていない画素とから基本画像を生成することができる。
地形的特徴組み込み手段１６は、原画像取得手段１１が取得した多値画像に含まれる文字・線図の輪郭に、地形的特徴組み込み手段１６が抽出した地形的特徴を組み込むことで、文字・線図を再現または改善した２値画像を生成することができる。 The original image acquisition unit 11 can acquire an original image of a multi-value bitmap, and when this original image is in color, a color / gray scale conversion unit 111 that converts the original image into a gray-scale multi-value image. It has.
In the first embodiment, the high-resolution multi-value image generation unit 12 can generate a multi-value image having a resolution higher than the resolution of the original image from the original image acquired by the original image acquisition unit 11.
The basic image generation means 13 is based on statistical information referring to surrounding pixels of each pixel from the multi-value image generated by the high-resolution multi-value image generation means 12, (a) the pixels constituting the character / diagram, It is possible to generate a basic image from b) pixels that do not constitute a character / line diagram and (c) pixels that are not determined whether to constitute a character / line diagram.
The topographic feature incorporation means 16 incorporates the topographic feature extracted by the topographic feature incorporation means 16 into the outline of the character / line diagram included in the multivalued image acquired by the original image acquisition means 11, thereby A binary image in which the figure is reproduced or improved can be generated.

輝度曲面生成手段１４は、原画像取得手段１１が取得した原画像から、ｘｙ座標が画素座標、ｚ座標が輝度である曲面ｚ＝ｆ（ｘ，ｙ）を生成する。たとえば、輝度が高レベル，中レベル，低レベルの３値の何れかをとるようにでき、高輝度の場合にはｆ（ｘ，ｙ）＝１、中輝度の場合にはｆ（ｘ，ｙ）＝０、低輝度の場合にはｆ（ｘ，ｙ）＝−１とすることができる。
地形的特徴抽出手段１５は、輝度曲面生成手段１４が生成した輝度曲面ｆ（ｘ，ｙ）の地形的特徴（尾根、峡谷、山頂、窪地、山腹）を抽出することができる。 The luminance curved surface generation unit 14 generates a curved surface z = f (x, y) from the original image acquired by the original image acquisition unit 11 where the xy coordinates are pixel coordinates and the z coordinate is luminance. For example, the luminance can take one of three values: high level, medium level, and low level. F (x, y) = 1 for high luminance and f (x, y for medium luminance. ) = 0, and in the case of low luminance, f (x, y) =-1.
The topographic feature extraction unit 15 can extract the topographic features (ridge, canyon, summit, depression, hillside) of the luminance curved surface f (x, y) generated by the luminance curved surface generation unit 14.

地形的特徴組み込み手段１６は、輝度曲面ｆ（ｘ，ｙ）のうち極小部分が線状または帯状に連続する領域（峡谷）および極小部分が局在する領域（窪地）を文字・線図の一部として前記元画像における文字・線図を再現または改善した文字・線図を再現または改善した２値画像を生成することができる。たとえば、地形的特徴組み込み手段１６は、たとえば、高解像度の原画像から、適宜の手法（たとえば、適宜のフィルタ）により文字・線図を形成し、当該文字・線図を構成しないが、輝度曲面ｆ（ｘ，ｙ）の峡谷および窪地に相当する画素については、文字・線図を構成するように修正を加えることができる。
そして、地形的特徴組み込み手段１６は、（ｃ）文字・線図を構成するか否かが確定されていない画素が、輝度曲面ｆ（ｘ，ｙ）のうち「峡谷」（極小部分が線状または帯状に連続する領域）または「窪地」（極小部分が局在する領域）に含まれるときは、当該画素が文字・線図を構成するものとして、文字・線図を再現または改善した２値画像を生成することができる。
輝度勾配量検出手段１７は、高解像度多値画像生成手段１２が生成した高解像度多値画像の各画素における輝度勾配を検出することができる。
第１の輪郭修正手段１８１は、輝度勾配量検出手段１７が検出した各画素の輝度勾配量により地形的特徴組み込み手段１６が作成した文字・線図を再現または改善した２値画像中の文字・線図の輪郭を修正することができる。
画像出力手段１９は、第１の輪郭修正手段１８１により文字・線図の輪郭が修正された画像を、表示装置、印刷装置またはネットワークを介して接続された外部装置に出力することができる。 The topographical feature incorporation means 16 is a character / line diagram showing a region (canyon) in which the minimal portion continues in a linear or strip shape and a region (concave) in which the minimal portion is localized in the luminance curved surface f (x, y). As a part, it is possible to generate a binary image in which a character / line diagram in the original image is reproduced or improved. For example, the topographic feature incorporating means 16 forms a character / line diagram from a high-resolution original image by an appropriate method (for example, an appropriate filter) and does not constitute the character / line diagram. The pixels corresponding to the canyons and depressions of f (x, y) can be modified so as to form a character / line diagram.
Then, the topographic feature incorporating means 16 (c) the pixels for which it is not determined whether or not to constitute the character / line diagram are “gorges” (the minimum portion is linear in the luminance curved surface f (x, y)). Or a region that is continuous in the form of a band) or “recess” (region where the local minimum part is localized), a binary that reproduces or improves the character / diagram, assuming that the pixel constitutes the character / diagram An image can be generated.
The luminance gradient amount detection unit 17 can detect the luminance gradient in each pixel of the high resolution multilevel image generated by the high resolution multilevel image generation unit 12.
The first contour correcting unit 181 reproduces or improves the character / line diagram created by the topographic feature incorporating unit 16 based on the luminance gradient amount of each pixel detected by the luminance gradient amount detecting unit 17. The contour of the diagram can be modified.
The image output means 19 can output the image in which the outline of the character / line diagram is corrected by the first outline correction means 181 to an external device connected via a display device, a printing device or a network.

《第２実施形態》
図３に第２実施形態の画像出力装置１Ｂの構成を示す。
第２実施形態では、画像出力装置１Ｂは、画像出力装置１Ａの各構成要素に加えて、第２の輪郭修正手段１８２を備えている。第２の輪郭修正手段１８２は第２の輪郭修正プログラムにより実現することができる。 << Second Embodiment >>
FIG. 3 shows the configuration of the image output apparatus 1B of the second embodiment.
In the second embodiment, the image output apparatus 1B includes a second contour correcting unit 182 in addition to the components of the image output apparatus 1A. The second contour correcting means 182 can be realized by a second contour correcting program.

第２の輪郭修正手段１８２は、第１の輪郭修正手段１８１が修正した２値画像中の文字・線図の輪郭を構成する画素列を、接線方向に基づいてクラスタリングして、各クラスタについて輪郭の円滑化および／または角部の角度鮮明化により、２値画像中の文字・線図の輪郭を修正することができる。 The second contour correcting unit 182 clusters the pixel columns constituting the contour of the character / line diagram in the binary image corrected by the first contour correcting unit 181 based on the tangent direction, and contours each cluster. By smoothing and / or sharpening the corners, it is possible to correct the outline of the character / line diagram in the binary image.

《第３実施形態》
図４に第３実施形態の画像出力装置１Ｃの構成を示す。
第３実施形態では、画像出力装置１Ｃは、画像出力装置１Ｂの各構成要素に加えて、原画像保存手段２１および色復元手段２２を備えている。原画像保存手段２１は、ハードウェアとしての記憶装置と原画像保存プログラムとにより構成され、色復元手段２２は色復元プログラムにより実現することができる。 << Third Embodiment >>
FIG. 4 shows a configuration of an image output apparatus 1C according to the third embodiment.
In the third embodiment, the image output apparatus 1C includes an original image storage unit 21 and a color restoration unit 22 in addition to the components of the image output apparatus 1B. The original image storage unit 21 includes a storage device as hardware and an original image storage program, and the color restoration unit 22 can be realized by a color restoration program.

原画像保存手段２１は、原画像取得手段１１が取得した原画像を保存しており、色復元手段２２は、この原画像の文字・線図の色を特定し、この色を第２の輪郭修正手段１８２が認識した文字・線図に付与し、画像出力手段１９がこれを、ディスプレイ，プリンタに出力し、あるいは通信回線を介して外部機器に出力する。通常、画像出力手段１９が出力する画像の文字・線図の色は、原画像の文字・線図の色に近似させる。 The original image storage unit 21 stores the original image acquired by the original image acquisition unit 11, and the color restoration unit 22 specifies the color of the character / line diagram of the original image, and uses this color as the second contour. The correction means 182 adds the character / line diagram recognized, and the image output means 19 outputs it to a display, a printer, or outputs it to an external device via a communication line. Usually, the color of the character / diagram of the image output by the image output means 19 is approximated to the color of the character / diagram of the original image.

《第４実施形態》
図５に第４実施形態の画像出力装置１Ｄの構成を示す。
第３実施形態では、画像出力装置１Ｄは、画像出力装置１Ａと同様、原画像取得手段１１、高解像度多値画像生成手段１２、基本画像生成手段１３、輝度曲面生成手段１４、地形的特徴抽出手段１５、地形的特徴組み込み手段１６、輝度勾配量検出手段１７、第1の輪郭修正手段１８１、および画像出力手段１９を備えている。
ただし、本実施形態では、輝度曲面生成手段１４は、高解像度多値画像生成手段１２が生成した多値画像から、輝度曲面ｆ（ｘ，ｙ）を生成する。 << 4th Embodiment >>
FIG. 5 shows a configuration of an image output apparatus 1D according to the fourth embodiment.
In the third embodiment, the image output device 1D, like the image output device 1A, is an original image acquisition unit 11, a high-resolution multilevel image generation unit 12, a basic image generation unit 13, a luminance curved surface generation unit 14, a topographic feature extraction. Means 15, topographic feature incorporation means 16, luminance gradient amount detection means 17, first contour correction means 181, and image output means 19 are provided.
However, in the present embodiment, the luminance curved surface generation unit 14 generates the luminance curved surface f (x, y) from the multivalued image generated by the high resolution multilevel image generation unit 12.

《第５実施形態》
図６に第５実施形態の画像出力装置１Ｅの構成を示す。
第４実施形態では、画像出力装置１Ｅは、画像出力装置１Ｄの各構成要件に加えて、第２の輪郭修正手段１８２を備えている。
画像出力装置１Ｂと同様に、第２の輪郭修正手段１８２は、第１の輪郭修正手段１８１が修正した２値画像中の文字・線図の輪郭を構成する画素列を、接線方向に基づいてクラスタリングして、各クラスタについて輪郭の円滑化および／または角部の角度鮮明化により、２値画像中の文字・線図の輪郭を修正することができる。
ただし、本実施形態では、輝度曲面生成手段１４は、高解像度多値画像生成手段１２が生成した多値画像から、輝度曲面ｆ（ｘ，ｙ）を生成する。 << 5th Embodiment >>
FIG. 6 shows a configuration of an image output apparatus 1E according to the fifth embodiment.
In the fourth embodiment, the image output apparatus 1E includes a second contour correcting unit 182 in addition to the constituent elements of the image output apparatus 1D.
Similar to the image output apparatus 1B, the second contour correcting unit 182 determines the pixel sequence that forms the contour of the character / line diagram in the binary image corrected by the first contour correcting unit 181 based on the tangent direction. By clustering, the contour of the character / line diagram in the binary image can be corrected by smoothing the contour and / or sharpening the corner of each cluster.
However, in the present embodiment, the luminance curved surface generation unit 14 generates the luminance curved surface f (x, y) from the multivalued image generated by the high resolution multilevel image generation unit 12.

《第６実施形態》
図７に第６実施形態の画像出力装置１Ｆの構成を示す。
第６実施形態では、画像出力装置１Ｆは、画像出力装置１Ｅの各構成要素に加えて、原画像保存手段２１および色復元手段２２を備えている。
画像出力装置１Ｃと同様に、原画像保存手段２１は、原画像取得手段１１が取得した原画像を保存しており、色復元手段２２は、この原画像の文字・線図の色を特定し、この色を第２の輪郭修正手段１８２が認識した文字・線図に付与し、画像出力手段１９がこれを、ディスプレイ，プリンタに出力し、あるいは通信回線を介して外部機器に出力する。通常、画像出力手段１９が出力する画像の文字・線図の色は、原画像の文字・線図の色に近似させる。 << 6th Embodiment >>
FIG. 7 shows a configuration of an image output apparatus 1F according to the sixth embodiment.
In the sixth embodiment, the image output device 1F includes an original image storage unit 21 and a color restoration unit 22 in addition to the components of the image output device 1E.
Similar to the image output device 1C, the original image storage unit 21 stores the original image acquired by the original image acquisition unit 11, and the color restoration unit 22 specifies the color of the character / line diagram of the original image. The color is added to the character / diagram recognized by the second contour correcting means 182 and the image output means 19 outputs it to a display or printer, or outputs it to an external device via a communication line. Usually, the color of the character / diagram of the image output by the image output means 19 is approximated to the color of the character / diagram of the original image.

図８（Ａ）に示すように、携帯型電話機２は、ウェブサイト（画像出力装置１として示す）の簡易画像Ｇ１を表示している場合に、画像作成装置１に画像変換要求を発行することができる。この場合には、画像作成装置１は上述した原画像を持っているので、画像作成装置１から変換画像（実施形態１から６における処理後の画像）Ｇ２を受け取ることができる。 As shown in FIG. 8A, the mobile phone 2 issues an image conversion request to the image creating apparatus 1 when a simple image G1 of a website (shown as the image output apparatus 1) is displayed. Can do. In this case, since the image creating apparatus 1 has the above-described original image, the converted image (the image after processing in the first to sixth embodiments) G2 can be received from the image creating apparatus 1.

また、図８（Ｂ）に示すように、携帯型電話機２は、ウェブサイト３の簡易画像Ｇ１を表示している場合に、画像作成装置１に画像変換要求を発行することができる。この場合には、画像作成装置１はウェブサイト３から原画像Ｇ０を取得して画像変換を行なう。携帯型電話機２は、画像作成装置１から変換画像（実施形態１から６における処理後の画像）Ｇ２を受け取ることができる。
さらに、図８（Ｃ）に示すように、携帯型電話機２は、ウェブサイト３の簡易画像Ｇ１を表示している場合に、画像作成装置１に画像変換要求を発行するとともにその簡易画像（あるいは、適宜取得した２００ｄｐｉ程度の原画像）を画像作成装置１に送信することができる。この場合には、画像作成装置１は取得した画像の画像変換を行なう。そして、携帯型電話機２は、画像作成装置１から変換画像（実施形態１から６における処理後の画像）Ｇ２を受け取ることができる。 Further, as shown in FIG. 8B, the mobile phone 2 can issue an image conversion request to the image creating apparatus 1 when the simple image G1 of the website 3 is displayed. In this case, the image creating apparatus 1 acquires the original image G0 from the website 3 and performs image conversion. The mobile phone 2 can receive the converted image (the image after processing in the first to sixth embodiments) G2 from the image creating apparatus 1.
Further, as shown in FIG. 8C, when the mobile phone 2 displays the simple image G1 of the website 3, the mobile phone 2 issues an image conversion request to the image creating apparatus 1 and also displays the simple image (or , An appropriately acquired original image of about 200 dpi) can be transmitted to the image creating apparatus 1. In this case, the image creating apparatus 1 performs image conversion of the acquired image. Then, the mobile phone 2 can receive the converted image (the image after processing in the first to sixth embodiments) G2 from the image creating apparatus 1.

以下、本発明の実施例を、図９のフローチャートに沿って説明する。
《補間と局所統計量による基本画像の生成》
基本画像の生成は、次のような手順で行われる。まず、原画像取得手段１１は、原画像（多値ビットマップ画像）を取得し（Ｓ１０１）、この取得した画像（多値ビットマップ画像）がグレイスケール画像であるならば（Ｓ１０２の「ＮＯ」）、これをそのまま処理対象画像とする。取得した画像がカラー画像ならば（Ｓ１０２の「ＹＥＳ」）、原画像をグレイスケール画像ＩＯに変換する（Ｓ１０３）。 Hereinafter, an embodiment of the present invention will be described with reference to the flowchart of FIG.
<< Generation of basic image by interpolation and local statistics >>
The basic image is generated by the following procedure. First, the original image acquisition unit 11 acquires an original image (multi-level bitmap image) (S101), and if the acquired image (multi-level bitmap image) is a grayscale image ("NO" in S102). ), And this is used as the processing target image. If the acquired image is a color image (“YES” in S102), the original image is converted into a grayscale image IO (S103).

次に、補間により高解像度多値画像ＩＨを生成する（Ｓ１０４）。補間には様々な方法があるが、ここでは、計算が簡単な双１次補間を用いて、その結果に平滑化（たとえば３×３の線形フィルタ）処理を施す。また、後述する第１の輪郭修正で用いるために、高解像度多値画像ＩＨ上の各画素について、輝度勾配量ＧＨを計算しておき、計算結果を所定のメモリに保存しておく（Ｓ１０５）。
また、高解像度多値画像ＩＨから、局所統計量をもとに基本画像ＦＨを生成する（Ｓ１０６）。 Next, a high-resolution multilevel image IH is generated by interpolation (S104). There are various methods for interpolation. Here, bilinear interpolation, which is easy to calculate, is used, and the result is subjected to smoothing (for example, 3 × 3 linear filter) processing. Further, for use in the first contour correction described later, a luminance gradient amount GH is calculated for each pixel on the high-resolution multilevel image IH, and the calculation result is stored in a predetermined memory (S105). .
Further, a basic image FH is generated from the high-resolution multilevel image IH based on the local statistics (S106).

図１１（Ｂ）は、図１１（Ａ）に示した原画像をＮｉｂｌａｃｋによる２値化技術を用いて２値化した結果を示している。図１１（Ｂ）から明らかなように、このままでは、水平のストロークが欠落してしまう。このため、基本画像ＦＨとして、Ｎｉｂｌａｃｋによる２値化技術(W. Niblack, An introduction to image processing, pp. 115-116， Englewood Cliffs, NJ: Prentice Hall, 1986.)を拡張する。 FIG. 11B shows the result of binarization of the original image shown in FIG. 11A using the Niblack binarization technique. As is clear from FIG. 11B, the horizontal stroke is lost as it is. For this reason, as a basic image FH, a binary technique based on Niblack (W. Niblack, An introduction to image processing, pp. 115-116, Englewood Cliffs, NJ: Prentice Hall, 1986.) is expanded.

すなわち、基本画像ＦＨは、たとえば、１，０，−１の何れかの値をとるものとする。
ＦＨ（ｘ，ｙ）＝１：「ＯＮ」（前景、あるいは、文字のストローク）、
ＦＨ（ｘ，ｙ）＝０：「ＯＦＦ」（背景）、
ＦＨ（ｘ，ｙ）＝−１：「ＴＢＤ」（「ＯＮ」の可能性があり、後で地形的特徴によって決定する）、
の３値の何れかをとる画像を生成する。 That is, the basic image FH takes, for example, one of 1, 0, and -1.
FH (x, y) = 1: “ON” (foreground or character stroke),
FH (x, y) = 0: “OFF” (background),
FH (x, y) = − 1: “TBD” (may be “ON”, later determined by topographic features),
An image taking one of the three values is generated.

図１２（Ａ），（Ｂ）（（Ｂ）は（Ａ）の画像の部分Ｗの拡大図）に、基本画像ＦＨの例を示す。ここで、ＦＨ（ｘ，ｙ）の「ＯＮ」、「ＯＦＦ」、「ＴＢＤ」を、それぞれ、黒、白、グレイで表わしている。
具体的には、高解像度多値画像ＩＨ（ｘ，ｙ）について、処理対象となる画素を中心とするウィンドウ（処理領域：たとえば３×３画素，５×５画素等）内で計算される、輝度の平均μ（ｘ，ｙ）と標準偏差σ（ｘ，ｙ）をもとに、基本画像ＦＨ（ｘ，ｙ）を次のような規則で設定する。 FIGS. 12A and 12B (B is an enlarged view of the portion W of the image of FIG. 12A) show examples of the basic image FH. Here, “ON”, “OFF”, and “TBD” of FH (x, y) are represented by black, white, and gray, respectively.
Specifically, for the high-resolution multilevel image IH (x, y), calculation is performed within a window (processing region: for example, 3 × 3 pixels, 5 × 5 pixels, etc.) centered on the pixel to be processed. Based on the average luminance μ (x, y) and standard deviation σ (x, y), the basic image FH (x, y) is set according to the following rules.

（１）ＩＨ（ｘ，ｙ）≦μ（ｘ，ｙ）＋ｋ０σ（ｘ，ｙ）（ただし、ｋ０は既定のパラメータ）ならば、ＦＨ（ｘ，ｙ）＝１（「ＯＮ」に設定）とする。
（２）μ（ｘ，ｙ）＋ｋ０σ（ｘ，ｙ）＜ＩＨ（ｘ，ｙ）＜μ（ｘ，ｙ）＋ｋ１σ（ｘ，ｙ）（ただし、ｋ０＜ｋ１）で、かつ、（ｘ，ｙ）の近傍にＦＨ（ｘ，ｙ）が「ＯＮ」の画素が存在するならば、ＦＨ（ｘ，ｙ）＝−１（「ＴＢＤ」に設定）とする。
（３）その他の場合には、ＦＨ（ｘ，ｙ）＝０（「ＯＦＦ」に設定）とする。 (1) If IH (x, y) ≦ μ (x, y) + k0σ (x, y) (where k0 is a default parameter), FH (x, y) = 1 (set to “ON”) To do.
(2) μ (x, y) + k0σ (x, y) <IH (x, y) <μ (x, y) + k1σ (x, y) (where k0 <k1) and (x, y ) In the vicinity of FH (x, y) is “ON”, FH (x, y) = − 1 (set to “TBD”).
(3) In other cases, FH (x, y) = 0 (set to “OFF”).

「ＴＢＤ」は、漢字のようにストロークの密度が高い場合に、白とするか黒とするかは、地形的特徴を調べなければ決定できないことを表わす。
実際には、同じストロークでもサンプリング位置により、輝度が大きく異なる。図１３（Ｂ），（Ｃ）は図１３（Ａ）に示したグレイスケール原画像Ｉｏの垂直方向のスキャンラインＡ，Ｂに沿った輝度のプロファイルである。図１３（Ｂ），（Ｃ）に示されるように、特に、水平方向のストロークが密集している個所で、その傾向が著しい。したがって、（２）の場合（ＦＨ（ｘ，ｙ）＝−１（「ＴＢＤ」））には、ＩＨ（ｘ，ｙ）からＦＨ（ｘ，ｙ）を決定するための処理領域（ウィンドウ）の大きさは、その上下左右ｔ画素以内（ｔは解像度の拡大率に応じて決められるパラメータ）とすればよい。 “TBD” indicates that when the density of strokes is high, such as kanji, it is not possible to determine whether the color is white or black without examining topographic features.
Actually, the luminance varies greatly depending on the sampling position even in the same stroke. FIGS. 13B and 13C are luminance profiles along the scan lines A and B in the vertical direction of the grayscale original image Io shown in FIG. As shown in FIGS. 13B and 13C, the tendency is particularly remarkable in the places where the horizontal strokes are dense. Therefore, in the case of (2) (FH (x, y) = − 1 (“TBD”)), the processing area (window) for determining FH (x, y) from IH (x, y) is determined. The size may be within t pixels in the upper, lower, left, and right sides (t is a parameter determined according to the resolution enlargement ratio).

《地形的特徴による欠落ストロークの補完》
処理対象となるグレイスケール画像（多値画像）ＩＯから、輝度曲面ｚ＝ｆ（ｘ，ｙ）を生成して（Ｓ１０７）、輝度曲面ｚ＝ｆ（ｘ，ｙ）上の地形的特徴（尾根、峡谷、山頂、窪地、山腹）を抽出する（Ｓ１０８）。
ここでは、白が黒よりもｚの値が大きいものとして定義する。ストロークの補完にとって重要な特徴は、ｚの値が局所的に小さい部分、すなわち、峡谷（ｆ（ｘ，ｙ）が１方向で極小）と窪地（ｆ（ｘ，ｙ）が全ての方向で極小）である。《Completing missing strokes by topographic features》
A luminance curved surface z = f (x, y) is generated from the grayscale image (multi-valued image) IO to be processed (S107), and the topographic features (ridges) on the luminance curved surface z = f (x, y) are generated. , Gorge, summit, depression, hillside) are extracted (S108).
Here, white is defined as having a larger z value than black. An important feature for stroke interpolation is that the z value is locally small, that is, the gorge (f (x, y) is minimal in one direction) and the depression (f (x, y) is minimal in all directions). ).

これらの特徴の具体的な計算方法は、ＷａｎｇとＰａｖｌｉｄｉｓによる「文字認識のための特徴の直接的なグレイスケール抽出」（：L. Wang and T. Pavlidis、 Direct gray-scale extraction of features for character recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 10, pp. 1053-1067, 1993.）の技術、あるいは、Seong-Whan Lee と Young Joon Kim による「文字認識におけるグレイスケール画像からの直接的な形状特徴抽出」（Direct Extraction of Topographic Features for Gray Scale Character Recognition：IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 17, No. 7, pp.724-729, July 1995.）の技術を用いることができる。 The specific calculation method of these features is described in “Direct gray-scale extraction of features for character recognition” by Wang and Pavlidis (“L. Wang and T. Pavlidis, Direct gray-scale extraction of features for character recognition”). , IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 10, pp. 1053-1067, 1993.) or Seong-Whan Lee and Young Joon Kim Using the technology of “Direct Extraction of Topographic Features for Gray Scale Character Recognition: IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 17, No. 7, pp.724-729, July 1995.” be able to.

例えば、グレイスケール画像（多値画像）ＩＯから、図１０に示したような地形的特徴を得ることができる。次に、このようにして得られた地形的特徴を用いて、ストロークを補完して、項解像度修正画像ＪＨを作成する（Ｓ１０９）。
基本画像ＦＨ（ｘ，ｙ）が値「ＴＢＤ」を持つ画素について、グレイスケール画像（多値画像）ＩＯでの対応する画素が、「峡谷」または「窪地」であれば、基本画像ＦＨ（ｘ，ｙ）を「ＯＮ」に、そうでなければ「ＯＦＦ」に設定しなおす。すなわち、輝度曲面ｚ＝ｆ（ｘ，ｙ）から得られる特徴を、基本画像ＦＨ（ｘ，ｙ）から得られる画像に優先させている。 For example, a topographic feature as shown in FIG. 10 can be obtained from a gray scale image (multi-valued image) IO. Next, using the topographic features thus obtained, the stroke is complemented to create a term resolution corrected image JH (S109).
For the pixel having the value “TBD” in the basic image FH (x, y), if the corresponding pixel in the grayscale image (multi-valued image) IO is “the canyon” or “the depression”, the basic image FH (x , Y) is reset to “ON”, otherwise “OFF”. That is, the feature obtained from the luminance curved surface z = f (x, y) is prioritized over the image obtained from the basic image FH (x, y).

このようにして得られた２値画像を図１４に示す。基本画像ＦＨに輝度曲面ｚ＝ｆ（ｘ，ｙ）の特徴を取り入れる前の画像（図１１（Ｂ）参照）と比べると、文字Ｃ３（「静」）の中のＣ３１で示す部分（「月」）の水平ストロークや、文字Ｃ２（「基」）の水平ストロークが復元できていることがわかる。
しかし、図１４の画像では、文字のストロークは十分に復元されているが、輪郭がなめらかでないため、文字画像の品質としては、明らかに貧弱である。 The binary image thus obtained is shown in FIG. Compared with the image (see FIG. 11B) before incorporating the feature of the luminance curved surface z = f (x, y) into the basic image FH (see FIG. 11B), the portion indicated by C31 (“Month” in the character C3 (“static”) It can be seen that the horizontal stroke of “)” and the horizontal stroke of the character C2 (“base”) can be restored.
However, in the image of FIG. 14, the stroke of the character is sufficiently restored, but since the outline is not smooth, the quality of the character image is clearly poor.

《輪郭の第１の修正》
すでに述べたように、なめらかな輪郭線の基準として、曲率が小さい（曲率半径が大きい）ことが必要となる。また、輪郭線は、輝度勾配量が極大となるような画素（すなわち、白領域と黒領域との境界）を通過することが前提となる。 <First contour correction>
As already described, it is necessary that the curvature is small (the radius of curvature is large) as a reference for a smooth contour line. Further, it is assumed that the contour line passes through a pixel having a maximum luminance gradient amount (that is, a boundary between the white area and the black area).

そこで、１９８０年代終わりに考案され、現在では画像処理の標準的手法となっている、上述したActive Contour または、Snake アルゴリズムを用いて、輪郭線を修正する。前述した、このアルゴリズムでは、弧長ｓをパラメータとした初期曲線をｖ（ｓ）＝（ｘ（ｓ），ｙ（ｓ））とし、次の量が最小となるように、ｖ（ｓ）を修正する（Ｓ１１０）。 Therefore, the contour line is corrected by using the above-described Active Contour or Snake algorithm, which was conceived at the end of the 1980s and is now a standard method for image processing. In this algorithm described above, an initial curve with the arc length s as a parameter is set to v (s) = (x (s), y (s)), and v (s) is set so that the next quantity is minimized. It corrects (S110).

E＝∫（α（ｓ）Ｅｃｏｎｔ＋β（ｓ）Ｅｃｕｒｖ＋λ（ｓ）Ｅｉｍａｇｅ）ｄｓ
ここで、Ｅｃｏｎｔ（≧０）は曲線の収縮を防き点列が等間隔に配置されるようにするための項、Ｅｃｕｒｖ（≧０）は点列の曲率が小さくなるようにするための項、Ｅｉｍａｇｅ（≦０）は画像ＩＨ上の勾配量を大きくするための項で、初めに計算しておいた輝度勾配量ＧＨを使って、−ＧＨ（ｖ（ｓ））と表わすことができる。図１５に輝度勾配量ＧＨの例を示す。図１５では、輝度勾配量ＧＨが大きくなればなるほど高濃度となるように表示してある。図１５に示されるように、文字の輪郭部分の輝度勾配量が大きくなっていることがわかる。 E = ∫ (α (s) Econt + β (s) Ecurv + λ (s) Eimage) ds
Here, Econt (≧ 0) is a term for preventing the curve from shrinking and the point sequence is arranged at equal intervals, and Ecurv (≧ 0) is a term for reducing the curvature of the point sequence. , Image (≦ 0) is a term for increasing the gradient amount on the image IH, and can be expressed as −GH (v (s)) using the luminance gradient amount GH calculated first. FIG. 15 shows an example of the luminance gradient amount GH. In FIG. 15, the higher the luminance gradient amount GH, the higher the density is displayed. As shown in FIG. 15, it can be seen that the luminance gradient amount of the outline portion of the character is large.

なお、α，β，γの３つのパラメータは点ごとに異なる値に設定することもできるが、ここでは、固定した値を用いる（α＝β＝γ＝１．０）。
基本画像ＦＨから得られる輪郭線（「ＯＮ」画素の境界）のそれぞれにActive Contourアルゴリズムを使って、輪郭を修正する。全ての輪郭線について修正を行った後、ベクトル−ラスター変換によって、輪郭線から２値画像を生成する。 The three parameters α, β, and γ can be set to different values for each point, but here, fixed values are used (α = β = γ = 1.0).
The contour is corrected by using the Active Contour algorithm for each contour line (boundary of “ON” pixels) obtained from the basic image FH. After all the contour lines are corrected, a binary image is generated from the contour lines by vector-raster conversion.

図１６に、図１５に示した輝度勾配量ＧＨと、図１４に示した画像とを重ね合わせた画像を示す。また、図１７に輪郭修正の結果を示す。図１７の画像と、図１４の画像とを比較するれば明らかなように、輪郭線の滑らかさやストロークの太さの均一性が向上していることがわかる。 FIG. 16 shows an image obtained by superimposing the luminance gradient amount GH shown in FIG. 15 and the image shown in FIG. FIG. 17 shows the result of contour correction. As apparent from comparing the image of FIG. 17 with the image of FIG. 14, it can be seen that the smoothness of the contour line and the uniformity of the thickness of the stroke are improved.

《輪郭の第２の修正》
Active Contourのアルゴリズムによって文字画像の品質は格段に向上するが、水平・垂直方向の線のがたつきや、垂直線と水平線の交差部分の鈍りが観察される。《Second contour correction》
Active Contour's algorithm significantly improves the quality of character images, but horizontal and vertical line shakiness and dullness at the intersection of vertical and horizontal lines are observed.

これらは人間の知覚にとって非常に気になる点である。そのため、人間にとってきれいに見えるように、第２の修正により輪郭を整えることができる。特に、水平・垂直方向の線のがたつきや、垂直線と水平線の交差部分の鈍りを補正する（Ｓ１１１）。
いま、処理対象の閉輪郭線を点列Ｐ＝（ｐ（０），ｐ（１），・・・，ｐ（ｎ−１））で表わすものとする。ただし、点列Ｐのｐの括弧内の添え字は、便宜上付したものである。すなわち、点ｐ（ｉ）の添え字値ｉは、一般には、０〜ｎ−１とならならないことを考慮して、ｉ＞ｎならばｐ（ｉ）＝ｐ（ｉ−ｎ）、ｉ＜０、ならばｐ（ｉ）＝ｐ（ｉ＋ｎ））とする。 These are points of great concern for human perception. Therefore, the contour can be adjusted by the second correction so that it looks beautiful for humans. In particular, the shakiness of the horizontal and vertical lines and the dullness of the intersection of the vertical and horizontal lines are corrected (S111).
Now, the closed contour to be processed is represented by a point sequence P = (p (0), p (1),..., P (n−1)). However, the subscripts in parentheses for p in the point sequence P are given for convenience. That is, considering that the subscript value i of the point p (i) is generally not 0 to n-1, if i> n, then p (i) = p (i−n), i < If 0, p (i) = p (i + n)).

本実施形態では、この点列Ｐを接線方向でクラスタリングする。点ｐ（ｉ）の接線方向をθ（ｉ）として、点ｐ（ｉ）にラベルＬ（ｉ）を次のように与える。
（ｊπ／２）−δ≦θ≦（ｊπ／２）＋δ
（ただし、δはπ／２に比べて十分に小さいパラメータ、ｊ＝０，１，２，３）
ならば、Ｌ（ｉ）＝ｊとする。すなわち、点ｐ（ｉ）の接線方向が、ほぼ０°，９０°，１８０°，２７０°に近ければ、点ｐ（ｉ）にＬ（ｉ）＝０，１，２，３の値を与える。 In the present embodiment, this point sequence P is clustered in the tangential direction. With the tangential direction of the point p (i) as θ (i), a label L (i) is given to the point p (i) as follows.
(Jπ / 2) −δ ≦ θ ≦ (jπ / 2) + δ
(Where δ is a parameter sufficiently smaller than π / 2, j = 0, 1, 2, 3)
Then, L (i) = j. That is, if the tangential direction of the point p (i) is nearly 0 °, 90 °, 180 °, 270 °, the value of L (i) = 0, 1, 2, 3 is given to the point p (i). .

その他の場合、Ｌ（ｉ）＝−１とする。
すなわち、水平方向に近い接線を持つ点ｐはラベル０または２、垂直方向に近い接線を持つ点はラベル１または３、それ以外の点はラベル−１を持つ。
このようにして、点列Ｐから、レベルの系列（Ｌ（０），Ｌ（１），・・・，Ｌ（ｎ−１））が得られる。同じラベルを持つ一連の点を１つのクラスタとしてまとめることにより、図１９（Ａ）に示すように、点列Ｐをクラスタ分けできる。 In other cases, L (i) = − 1.
That is, a point p having a tangent close to the horizontal direction has label 0 or 2, a point having a tangent close to the vertical direction has label 1 or 3, and the other points have label -1.
In this way, a series of levels (L (0), L (1),..., L (n−1)) is obtained from the point sequence P. By collecting a series of points having the same label as one cluster, the point sequence P can be clustered as shown in FIG.

ｊ番目のクラスタについて、ｋｊをそのクラスタの開始点のインデックス、ｍｊ＞０をそのクラスタに属する点の数として、
Ｃｊ＝（ｐ（ｋｊ），ｐ（ｋｊ＋１），・・・，ｐ（ｋｊ＋ｍｊ−１））
とすると、
ｊ−１番目とｊ＋１番目のクラスタとの間には、
ｋｊ＝ｋｊ−１＋ｍｊ−１，ｋｊ＋１＝ｋｊ＋ｍｊ
という関係があり、
Ｌ（ｋｊ−１＋ｍｊ−１）≠Ｌ（ｋｊ）＝Ｌ（ｋｊ＋１）
＝・・・＝Ｌ（ｋｊ＋ｍｊ−１）≠Ｌ（ｋｊ＋１）
という性質を満たす。
クラスタＣｊについて、水平・垂直方向の線のがたつきを修正する。Ｌ（ｋｊ）＝０，または、２、すなわち、水平方向に近い接線をもつクラスタならば、Ｃｊの各点のｙ座標Ｙ（ｋｊ＋ｉ）（ｉ＝０，１，・・・，ｍｊ−１）の分布から、そのモードＭを求める。 For the jth cluster, let kj be the index of the starting point of that cluster, mj> 0 be the number of points belonging to that cluster,
Cj = (p (kj), p (kj + 1),..., P (kj + mj-1))
Then,
Between the (j-1) th and (j + 1) th clusters,
kj = kj-1 + mj-1, kj + 1 = kj + mj
There is a relationship
L (kj−1 + mj−1) ≠ L (kj) = L (kj + 1)
= ... = L (kj + mj-1) ≠ L (kj + 1)
Satisfies the property.
For the cluster Cj, the shakiness of the horizontal and vertical lines is corrected. If L (kj) = 0 or 2, that is, a cluster having a tangent line close to the horizontal direction, the y coordinate Y (kj + i) (i = 0, 1,..., Mj−1) of each point of Cj The mode M is obtained from the distribution of.

そして、｜Ｙ（ｋｊ＋ｉ）−Ｍ｜≦１ならば、Ｙ（ｋｊ＋ｉ）←Ｍと設定する。
Ｌ（ｋｊ）＝１、または、３の場合は、ｘ座標について同様の処理を行う。
次に、垂直線と水平線の交差部分の鈍りを整形する。接線方向が水平・垂直以外の点のクラスタＣｊが、互いに垂直な接線方向を持つ点のクラスタに囲まれている、すなわち、Ｌ（ｋｊ−１）≧０、Ｌ（ｋｊ）＜０、Ｌ（ｋｊ＋１）＞０、Ｌ（ｋｊ−１）≠（ｋｊ＋１）とする。 If | Y (kj + i) −M | ≦ 1, Y (kj + i) ← M is set.
When L (kj) = 1 or 3, the same processing is performed for the x coordinate.
Next, the dullness at the intersection of the vertical and horizontal lines is shaped. A cluster Cj of points whose tangent directions are not horizontal / vertical is surrounded by a cluster of points having tangential directions perpendicular to each other, that is, L (kj−1) ≧ 0, L (kj) <0, L ( kj + 1)> 0 and L (kj−1) ≠ (kj + 1).

もし、クラスタＣｊに属する点の数が十分に少ないならば、図１９（Ｂ）に示すように、クラスタＣｊ−１とＣｊ＋１を延長し、直角のコーナーを構成することにより、クラスタＣｊ内の点を修正する。
全ての輪郭線について修正を行った後、ベクトル−ラスター変換によって、輪郭線から２値画像を生成する。以上の処理フローを図１８に示す。 If the number of points belonging to the cluster Cj is sufficiently small, as shown in FIG. 19B, the points in the cluster Cj are formed by extending the clusters Cj−1 and Cj + 1 to form a right-angled corner. To correct.
After all the contour lines are corrected, a binary image is generated from the contour lines by vector-raster conversion. The above processing flow is shown in FIG.

漢字「回」（符号Ｃ５で示す）について第２の処理を施す前の状態を図２０（Ａ）に示し、第２の処理を施した後の状態を図２０（Ｂ）に示す。図２０（Ａ），（Ｂ）において黒塗りの画素ａ，ｃ，ｅ，ｇ，ｉは、クラスタを構成しない画素（接線が０°，９０°，１８０°，２７０°から外れている画素）を示し、ｂ，ｄ，ｆ，ｈはクラスタを構成する画素（接線が０°，９０°，１８０°，２７０°に近い画素）を示している。
図２０（Ａ）に示す処理前の輪郭線（画素列）の凹凸やコーナ部分（符号ＣＮＲで示す）の鈍りは、図２０（Ｂ）に示す輪郭線では緩和されていることがわかる。 FIG. 20A shows a state before the second process is performed on the Chinese character “times” (indicated by reference numeral C5), and FIG. 20B shows a state after the second process is performed. In FIGS. 20A and 20B, black pixels a, c, e, g, and i are pixels that do not constitute a cluster (pixels whose tangent lines are out of 0 °, 90 °, 180 °, and 270 °). , B, d, f, and h indicate pixels constituting the cluster (pixels whose tangent lines are close to 0 °, 90 °, 180 °, and 270 °).
It can be seen that the unevenness of the contour line (pixel row) before the processing shown in FIG. 20A and the dullness of the corner portion (indicated by the symbol CNR) are alleviated in the contour line shown in FIG.

図２１（Ｂ）に、図２１（Ａ）の２００ｄｐｉの原画像を、解像度４倍で作成した結果を示す。図２１（Ａ），（Ｂ）からわかるように、本実施例によれば多階調の原画像が鮮明でなくとも、再現性に優れた２値ビットマップ画像が再現される。なお、２００ｄｐｉで入力されたカラー画像に対して、特許文献１の方法では９７．２％の認識精度であったが、上記の方法では、９９．１％の認識精度が得られた。 FIG. 21B shows the result of creating the 200 dpi original image of FIG. 21A at a resolution of 4 times. As can be seen from FIGS. 21A and 21B, according to this embodiment, even if the multi-tone original image is not clear, a binary bitmap image having excellent reproducibility is reproduced. Note that the recognition accuracy of 97.2% was obtained with the method of Patent Document 1 for a color image input at 200 dpi, but the recognition accuracy of 99.1% was obtained with the above method.

本発明の画像認識装置の一構成例を示す図である。It is a figure which shows the example of 1 structure of the image recognition apparatus of this invention. 第１実施形態における各手段による処理の流れを示す機能ブロック図である。It is a functional block diagram which shows the flow of the process by each means in 1st Embodiment. 第２実施形態における各手段による処理の流れを示す機能ブロック図である。It is a functional block diagram which shows the flow of the process by each means in 2nd Embodiment. 第３実施形態における各手段による処理の流れを示す機能ブロック図である。It is a functional block diagram which shows the flow of the process by each means in 3rd Embodiment. 第４実施形態における各手段による処理の流れを示す機能ブロック図である。It is a functional block diagram which shows the flow of the process by each means in 4th Embodiment. 第５実施形態における各手段による処理の流れを示す機能ブロック図である。It is a functional block diagram which shows the flow of the process by each means in 5th Embodiment. 第６実施形態における各手段による処理の流れを示す機能ブロック図である。It is a functional block diagram which shows the flow of the process by each means in 6th Embodiment. （Ａ）はウェブサイトにおいて原画像の再現等を行なった画像受け取る具体例を示す図、（Ｂ）は同じく他の具体例を示す図、（Ｃ）は同じくさらに他の具体例を示す図である。(A) is a diagram showing a specific example of receiving an image obtained by reproducing an original image on a website, (B) is a diagram showing another specific example, and (C) is a diagram showing still another specific example. is there. 本発明の実施例を示すフローチャートである。It is a flowchart which shows the Example of this invention. 輝度曲面のｘｙ平面から抽出される地形的特徴を示す図である。It is a figure which shows the topographic feature extracted from xy plane of a luminance curved surface. （Ａ）は原画像を、（Ｂ）は（Ａ）の画像をＮｉｂｌａｃｋによる２値化技術を用いて２値化した結果を示す図である。(A) is a figure which shows the result which binarized the original image and (B) the image of (A) using the binarization technique by Niblack. （Ａ）は基本画像の例を示す図、（Ｂ）は（Ａ）の画像の部分拡大図である。(A) is a figure which shows the example of a basic image, (B) is the elements on larger scale of the image of (A). （Ａ）はグレイスケール原画像、（Ｂ）は（Ａ）のグレイスケール原画像の垂直方向のスキャンラインＡに沿った輝度のプロファイル、（Ｃ）は同じくスキャンラインＢに沿った輝度のプロファイルである。(A) is a gray scale original image, (B) is a luminance profile along the vertical scan line A of the gray scale original image of (A), and (C) is a luminance profile along the scan line B. is there. 輝度曲面ｚ＝ｆ（ｘ，ｙ）の地形的特徴を取り入れた２値画像を示す図である。It is a figure which shows the binary image which took in the topographical feature of the luminance curved surface z = f (x, y). 高解像度多値画像の各画素における輝度勾配の例を示す図である。It is a figure which shows the example of the brightness | luminance gradient in each pixel of a high resolution multi-value image. 図１３に示した輝度勾配量と、図１２に示した画像とを重ね合わせた画像を示す図である。It is a figure which shows the image which overlap | superposed the brightness | luminance gradient amount shown in FIG. 13, and the image shown in FIG. 図１４に示した画像について第２の輪郭修正手段による処理を行なった結果を示す図である。It is a figure which shows the result of having performed the process by the 2nd outline correction means about the image shown in FIG. 全ての輪郭線について修正を行った後、ベクトル−ラスター変換によって、輪郭線から２値画像を生成するときの処理フローを示す図である。It is a figure which shows the processing flow when producing | generating a binary image from an outline by vector-raster conversion, after correcting about all the outlines. （Ａ）輪郭画素列をクラスタ分けした例を示す図であり、（Ｂ）はクラスタ単位で凹凸を修正した例を示す図である。(A) It is a figure which shows the example which divided the outline pixel row into clusters, (B) is a figure which shows the example which corrected the unevenness | corrugation per cluster. 。（Ａ）は漢字についての整形処理を施す前の状態を示す図、（Ｂ）は整形処理を施した後の状態を示す図である。. (A) is a figure which shows the state before performing the shaping process about a Chinese character, (B) is a figure which shows the state after performing the shaping process. （Ａ）の２００ｄｐｉの原画像を示す図、（Ｂ）は解像度４倍で作成した結果を示す図である。FIG. 6A is a diagram illustrating a 200 dpi original image, and FIG. 5B is a diagram illustrating a result created at a resolution of 4 times.

Explanation of symbols

１Ａ〜１Ｆ画像出力装置
２携帯型電話機
３ウェブサイト
１１原画像取得手段
１２高解像度多値画像生成手段
１３基本画像生成手段
１４輝度曲面生成手段
１５地形的特徴抽出手段
１６地形的特徴組み込み手段
１７輝度勾配量検出手段
１９画像出力手段
２１原画像保存手段
２２色復元手段
１００通信回線
１１０ＰＣ（パーソナルコンピュータ）
１１１ＣＰＵ
１１２メモリ
１１３ハードディスク装置
１１４リムーバブルディスク装置
１１５ディスプレイ・インタフェース
１１６プリンタ・インタフェース
１１７キーボード
１１８ネットワーク・インタフェース
１１９バス
１８１第１の輪郭修正手段
１８２第２の輪郭修正手段
１１２１ＲＯＭ
１１２２ＲＡＭ
１１５１ディスプレイ
１１６１プリンタ
１１６１プリンタ
１１７１キーボード DESCRIPTION OF SYMBOLS 1A-1F Image output device 2 Mobile telephone 3 Website 11 Original image acquisition means 12 High resolution multi-value image generation means 13 Basic image generation means 14 Luminance curved surface generation means 15 Topographic feature extraction means 16 Topographic feature incorporation means 17 Brightness Gradient amount detection means 19 Image output means 21 Original image storage means 22 Color restoration means 100 Communication line 110 PC (personal computer)
111 CPU
112 Memory 113 Hard Disk Device 114 Removable Disk Device 115 Display Interface 116 Printer Interface 117 Keyboard 118 Network Interface 119 Bus 181 First Contour Correction Unit 182 Second Contour Correction Unit 1121 ROM
1122 RAM
1151 Display 1161 Printer 1161 Printer 1171 Keyboard

Claims

Original image acquisition means for acquiring an original image of a multi-valued bitmap;
From the original image acquired by the original image acquisition means, a high-resolution multi-value image generation means for generating a multi-value image having a higher resolution than the original image;
A luminance curved surface generating means for generating a curved surface in which the xy coordinates are pixel coordinates and the z coordinate is luminance from the original image acquired by the original image acquiring means;
Topographic feature extraction means for extracting topographic features of the luminance curved surface generated by the luminance curved surface generation means;
The character / line diagram is reproduced by incorporating the topographic feature extracted by the topographic feature extracting unit into the outline of the character / line diagram included in the multi-level image generated by the high-resolution multi-level image generating unit. Or a topographic feature embedding means for generating an improved binary image;
Contour correcting means for correcting a contour of a character / line diagram included in the binary image generated by the topographic feature incorporation means or an image obtained by multi-valued the binary image;
Image output means for outputting an image in which the contour of the character / line diagram is corrected by the contour correcting means to a display device, a printing device or an external device connected via a network;
An image output apparatus comprising:

The topographical features include a “valley or depression” having lower luminance than the surroundings when the luminance curved surface is made to correspond to actual terrain, “ridge or mountain peak”, “valley or depression” having higher luminance than the surroundings, and “ The image output apparatus according to claim 1, wherein the image output apparatus is a “mountain hill or a buttocks” located between the “ridge or the mountain top”.

Further, the character / line diagram or further background color of the image output by the image output means is generated by approximating the color in the original image acquired by the original image acquisition means. Item 3. The image output device according to Item 1 or 2.

4. The original image acquisition means comprises color / grayscale conversion means for converting the original image into a grayscale multilevel image when the original image is a color image. The image output apparatus according to any one of the above.

The topographic feature incorporation means includes a character / line in the original image as a part of a character / line diagram including a region where the minimal part of the luminance curved surface is continuous in a linear or belt shape and a region where the minimal part is localized. 5. The image output apparatus according to claim 1, wherein the binary image is generated by reproducing or improving a character / line diagram obtained by reproducing or improving a figure.

From the high-resolution multi-value image generated by the high-resolution multi-value image generating means, based on statistical information referring to surrounding pixels of each pixel, (a) pixels constituting a character / line diagram, and (b) characters / Basic image generation means for generating a basic image (an image in which topographic features are incorporated) composed of pixels that do not constitute a diagram and (c) pixels that are not determined whether or not to constitute a character / line diagram With
6. The topographic feature embedding unit generates a binary image in which the character / line diagram is reproduced or improved based on the basic image generated by the basic image generating unit. The image output device described in 1.

The topographic feature incorporation means is characterized in that (c) a pixel for which whether or not to constitute a character / line diagram is determined is a region in which a minimal portion of the luminance curved surface is continuous in a linear or belt shape or the minimal portion 7. The binary image obtained by reproducing or improving the character / diagram is generated on the assumption that the pixel constitutes the character / diagram when the pixel is included in the localized region. Image output device.

A luminance gradient amount detecting means for detecting a luminance gradient amount in each pixel of the high resolution multilevel image generated by the high resolution multilevel image generating unit;
The contour correcting unit reproduces or improves the character / line diagram created by the topographic feature incorporation unit based on the luminance gradient amount of each pixel detected by the luminance gradient amount detecting unit. First contour correcting means for correcting
The image output apparatus according to claim 1, further comprising:

The contour correcting means refers to a curvature or direction change of a pixel row that forms an outline of the character / diagram in the image corrected by the first contour correcting means, and further changes the character / diagram in the image. 9. The image output apparatus according to claim 8, further comprising second contour correcting means for correcting the contour.

The second contour correcting unit clusters the pixel columns constituting the contour of the character / diagram in the image corrected by the first contour correcting unit based on a tangent direction, and the contour for each cluster. The image output apparatus according to claim 9, wherein the contour of the character / line diagram in the image is corrected by smoothing and / or sharpening the corners.

Original image acquisition means for acquiring an original image of a multi-valued bitmap;
From the original image acquired by the original image acquisition means, a high-resolution multi-value image generation means for generating a multi-value image having a higher resolution than the original image;
A luminance curved surface generating means for generating a curved surface in which the xy coordinates are pixel coordinates and the z coordinate is luminance from the multi-value image generated by the high-resolution multi-value image generating means;
Topographic feature extraction means for extracting topographic features of the luminance curved surface generated by the luminance curved surface generation means;
Topographic feature incorporation means for creating a binary image that reproduces or improves a character / line diagram in the original image by incorporating the topographic feature extracted by the topographic feature extraction means;
Contour correcting means for correcting a contour of a character / line diagram included in the binary image generated by the topographic feature incorporation means or an image obtained by multi-valued the binary image;
Image output means for outputting an image in which the contour of the character / line diagram is corrected by the contour correcting means to a display device, a printing device or an external device connected via a network;
An image output apparatus comprising:

The topographical features include a “valley or depression” having lower luminance than the surroundings when the luminance curved surface is made to correspond to actual terrain, “ridge or mountain peak”, “valley or depression” having higher luminance than the surroundings, and “ The image output device according to claim 11, wherein the image output device is a “hillside or a buttocks” located between the “ridge or the mountaintop”.

Further, the character / line diagram or further background color of the image output by the image output means is generated by approximating the color in the original image acquired by the original image acquisition means. Item 13. The image output device according to Item 11 or 12.

14. The original image acquisition means comprises color / grayscale conversion means for converting the original image into a grayscale multilevel image when the original image is a color image. The image output apparatus according to any one of the above.

The topographic feature incorporation means includes a character / line in the original image as a part of a character / line diagram including a region where the minimal part of the luminance curved surface is continuous in a linear or belt shape and a region where the minimal part is localized. 15. The image output device according to claim 11, wherein the binary image is generated by reproducing or improving a character / diagram obtained by reproducing or improving a figure.

From the high-resolution multi-value image generated by the high-resolution multi-value image generating means, based on statistical information referring to surrounding pixels of each pixel, (a) pixels constituting a character / line diagram, and (b) characters / Comprising basic image generating means for generating a basic image consisting of pixels that do not constitute a diagram and (c) pixels that are not determined whether or not to constitute a character / diagram;
16. The topographic feature incorporation means generates a binary image in which the character / line diagram is reproduced or improved based on the basic image generated by the basic image generation means. The image output device described in 1.

The topographic feature incorporation means is characterized in that (c) a pixel for which whether or not to constitute a character / line diagram is determined is a region in which a minimal portion of the luminance curved surface is continuous in a linear or belt shape or the minimal portion 17. The binary image obtained by reproducing or improving a character / diagram is generated assuming that the pixel constitutes a character / diagram when the pixel is included in a localized region. Image output device.

A luminance gradient amount detecting means for detecting a luminance gradient amount in each pixel of the high resolution multilevel image generated by the high resolution multilevel image generating unit;
The contour correcting unit reproduces or improves the character / line diagram created by the topographic feature incorporation unit based on the luminance gradient amount of each pixel detected by the luminance gradient amount detecting unit. First contour correcting means for correcting
The image output apparatus according to claim 11, further comprising:

The contour correcting means refers to a curvature or direction change of a pixel row that forms an outline of the character / diagram in the image corrected by the first contour correcting means, and further changes the character / diagram in the image. The image output apparatus according to claim 18, further comprising second contour correcting means for correcting the contour.

The second contour correcting unit clusters the pixel columns constituting the contour of the character / diagram in the image corrected by the first contour correcting unit based on a tangent direction, and the contour for each cluster. 20. The image output apparatus according to claim 19, wherein the contour of the character / diagram in the image is corrected by smoothing and / or sharpening the corners.

An image output program for causing a computer to function as each means according to any one of claims 1 to 20.

A computer-readable recording medium recording a program for causing a computer to be executed as each means according to any one of claims 1 to 20.