JP2000322511A

JP2000322511A - Device and method for processing image and computer readable memory

Info

Publication number: JP2000322511A
Application number: JP11128908A
Authority: JP
Inventors: Hiroshi Tanioka; 宏谷岡; Izuru Horiuchi; 出堀内; Junnosuke Kataoka; 淳之介片岡; Makoto Kobayashi; 誠小林; Nagakazu Honda; 永和本田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1999-05-10
Filing date: 1999-05-10
Publication date: 2000-11-24

Abstract

PROBLEM TO BE SOLVED: To reproduce an image mixing a character image and an intermediate image with high definition by synthesizing an image signal corresponding to a character, an image signal corresponding to an area image disabled in character recognition and an image signal corresponding to an image excepting a character line. SOLUTION: It is discriminated whether each divided line is a recognizing line capable of character recognition or not (S105). Next, the character recognition is performed to a character segmented from each recognizing line (S106) and the recognized character is converted to font data (S107). The font data are synthesized with image data except the recognizing line (an image binarized by a binarizing processing part based on an error diffusion method) (S108). Namely, when a character within the segmented area can be recognized concerning the recognizing line on which a character pixel exists, the character is expanded into corresponding font data. Even when the character can not be recognized, the existence of a character within the area is recognized by image area separating processing and the simply binarized image signal of highest resolvability is selected and outputted so that the image of satisfactory resolution can be generated.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、入力された画像を
処理する画像処理装置及びその方法、コンピュータ可読
メモリに関するものである。[0001] 1. Field of the Invention [0002] The present invention relates to an image processing apparatus and method for processing an input image, and a computer readable memory.

【０００２】[0002]

【従来の技術】文字認識技術は、近年のＣＰＵ及び半導
体の高速化によって低速複写機の複写速度程度において
は、略リアルタイムでの処理が可能である。また、近
年、複写原稿が殆どワードプロセッサ等で作成されるた
め、その認識精度も高まり、実用レベルに到達した。2. Description of the Related Art The character recognition technology can perform almost real-time processing at a copying speed of a low-speed copying machine due to the recent increase in the speed of CPUs and semiconductors. Further, in recent years, since almost all copied originals are created by a word processor or the like, the recognition accuracy has been improved, and has reached a practical level.

【０００３】また、原稿中の文字サイズの認識及び文字
の認識技術に関して、例えば、特開昭６１−１０７８７
６等で開示される原稿の文字イメージ情報を認識したフ
ォントイメージ情報に変換して像再生する技術を用いれ
ば、原理的に繰り返される複写による画像劣化はなくな
る。[0003] Regarding the recognition of the character size in a document and the recognition technology of the character, see, for example, Japanese Patent Laid-Open No. 61-10787.
6 and the like, the technique of converting the character image information of the original document into recognized font image information and reproducing the image is used, so that image deterioration due to repeated copying in principle is eliminated.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上記従
来の文字認識技術では、２値化した原稿画像全体から直
行する２方向の射影像を求め、平均的文字サイズを予測
し、該サイズで原稿全体を文字サイズに相当するメッシ
ュに分割する。そして、この分割され、切り出された文
字に対して文字認識を実行するために、原稿画像中に中
間調画画像が混在している場合、文字列を示す射影像が
この中間調画像によって乱されるため、メッシュが精度
良く認識することができないという問題点があった。However, in the above-described conventional character recognition technology, a projected image in two directions orthogonal to each other is obtained from the entire binarized document image, an average character size is predicted, and the entire document is calculated using the size. Is divided into meshes corresponding to the character size. In order to perform character recognition on the divided and cut-out characters, if a halftone image is mixed in the original image, the projected image indicating the character string is disturbed by the halftone image. Therefore, there is a problem that the mesh cannot be accurately recognized.

【０００５】また、文字認識不能な文字と一般中間調の
画像の一部との区別がつかないために、両者に対する出
力画像信号は同一となり、中間調画像品位を重視すれ
ば、認識不能文字の解像性が低下し、逆に認識不能文字
を重視すれば中間調画像の階調性が低下するという問題
点があった。Further, since it is not possible to distinguish a character that cannot be recognized from a character and a part of a general halftone image, the output image signals for both are the same. There is a problem in that the resolution is reduced, and conversely, if emphasis is placed on unrecognizable characters, the gradation of the halftone image is reduced.

【０００６】本発明は上記の問題点に鑑みてなされたも
のであり、文字画像と中間調画像が混在する画像をより
高品位に再生することができる画像処理装置及びその方
法、コンピュータ可読メモリを提供することを目的とす
る。SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and has an image processing apparatus and method capable of reproducing an image in which a character image and a halftone image are mixed at a higher quality, and a computer readable memory. The purpose is to provide.

【０００７】[0007]

【課題を解決するための手段】上記の目的を達成するた
めの本発明による画像処理装置は以下の構成を備える。
即ち、入力された画像を処理する画像処理装置であっ
て、前記入力された画像中の文字行を抽出する抽出手段
と、前記抽出手段で抽出された文字行から領域画像を切
り出す切出手段と、前記切出手段で切り出された領域を
文字認識する文字認識手段と、前記文字認識手段で認識
された文字に対応する第１画像信号を生成する第１生成
手段と、前記文字認識手段で文字認識不能な領域画像に
対する第２画像信号を生成する第２生成手段と、前記画
像中の前記文字行以外の画像に対する第３画像信号を生
成する第３生成手段と、前記第１画像信号、前記第２画
像信号、前記第３画像信号を合成して出力する出力手段
とを備える。An image processing apparatus according to the present invention for achieving the above object has the following arrangement.
That is, an image processing apparatus that processes an input image, an extraction unit that extracts a character line in the input image, and a cutout unit that cuts out an area image from the character line extracted by the extraction unit. A character recognizing means for recognizing a character in the region cut out by the extracting means, a first generating means for generating a first image signal corresponding to the character recognized by the character recognizing means, A second generation unit configured to generate a second image signal for an unrecognizable region image, a third generation unit configured to generate a third image signal for an image other than the character line in the image, and the first image signal; Output means for combining and outputting the second image signal and the third image signal.

【０００８】また、好ましくは、前記第１画像信号は、
前記文字に対応するフォントデータである。Preferably, the first image signal is:
This is font data corresponding to the character.

【０００９】また、好ましくは、前記第２画像信号は、
単純２値化された画像信号である。Preferably, the second image signal is:
This is a simple binarized image signal.

【００１０】また、好ましくは、前記第３画像信号は、
擬似中間調処理された画像信号である。Preferably, the third image signal is:
This is an image signal that has undergone pseudo halftone processing.

【００１１】また、好ましくは、前記文字認識手段は、
注目領域画像に隣接する文字行の境界に画素が存在する
か否かを判定する第１判定手段を備え、前記第１判定手
段の判定結果に基づいて、前記注目領域画像を文字認識
不能な領域画像と判定する。Preferably, the character recognition means includes:
A first determining unit that determines whether a pixel exists at a boundary of a character line adjacent to the attention area image; and, based on a determination result of the first determination means, an area in which the attention area image cannot be recognized. It is determined as an image.

【００１２】また、好ましくは、前記文字認識手段は、
注目領域画像内に存在する画素数に基づいて、該注目領
域画像が文字認識可能であるか否かを判定する第２判定
手段を備え、前記第２判定手段の判定結果に基づいて、
前記注目領域画像を文字認識不能な領域画像と判定する
を備える。[0012] Preferably, the character recognizing means includes:
A second determination unit configured to determine whether or not the attention area image is character-recognizable based on the number of pixels present in the attention area image, and based on a determination result of the second determination means,
Determining the attention area image as an area image in which characters cannot be recognized.

【００１３】また、好ましくは、前記文字認識手段は、
注目領域画像に隣接する領域画像が文字認識可能でない
場合、前記注目領域画像を文字認識不能な領域画像と判
定する。Preferably, the character recognition means includes:
If the area image adjacent to the attention area image is not character recognizable, the attention area image is determined to be an area image incapable of character recognition.

【００１４】また、好ましくは、前記画像が多値画像信
号である場合、前記第２画像信号は、エッジ強調された
多値画像信号である。Preferably, when the image is a multi-valued image signal, the second image signal is an edge-enhanced multi-valued image signal.

【００１５】また、好ましくは、前記画像が多値画像信
号である場合、前記第３画像信号は、平滑化された多値
画像信号である。Preferably, when the image is a multi-level image signal, the third image signal is a smoothed multi-level image signal.

【００１６】上記の目的を達成するための本発明による
画像処理方法は以下の構成を備える。即ち、入力された
画像を処理する画像処理方法であって、前記入力された
画像中の文字行を抽出する抽出工程と、前記抽出工程で
抽出された文字行から領域画像を切り出す切出工程と、
前記切出工程で切り出された領域を文字認識する文字認
識工程と、前記文字認識工程で認識された文字に対応す
る第１画像信号を生成する第１生成工程と、前記文字認
識工程で文字認識不能な領域画像に対する第２画像信号
を生成する第２生成工程と、前記画像中の前記文字行以
外の画像に対する第３画像信号を生成する第３生成工程
と、前記第１画像信号、前記第２画像信号、前記第３画
像信号を合成して出力する出力工程とを備える。An image processing method according to the present invention for achieving the above object has the following arrangement. That is, an image processing method for processing an input image, comprising: an extraction step of extracting a character line in the input image; and a cutting step of cutting out an area image from the character line extracted in the extraction step. ,
A character recognition step of recognizing a character in the region cut out in the extraction step, a first generation step of generating a first image signal corresponding to the character recognized in the character recognition step, and a character recognition step of the character recognition step A second generation step of generating a second image signal for an impossible area image, a third generation step of generating a third image signal of an image other than the character line in the image, the first image signal, An output step of combining and outputting the two image signals and the third image signal.

【００１７】上記の目的を達成するための本発明による
コンピュータ可読メモリは以下の構成を備える。即ち、
入力された画像を処理する画像処理のプログラムコード
が格納されたコンピュータ可読メモリであって、前記入
力された画像中の文字行を抽出する抽出工程のプログラ
ムコードと、前記抽出工程で抽出された文字行から領域
画像を切り出す切出工程のプログラムコードと、前記切
出工程で切り出された領域を文字認識する文字認識工程
のプログラムコードと、前記文字認識工程で認識された
文字に対応する第１画像信号を生成する第１生成工程の
プログラムコードと、前記文字認識工程で文字認識不能
な領域画像に対する第２画像信号を生成する第２生成工
程のプログラムコードと、前記画像中の前記文字行以外
の画像に対する第３画像信号を生成する第３生成工程の
プログラムコードと、前記第１画像信号、前記第２画像
信号、前記第３画像信号を合成して出力する出力工程の
プログラムコードとを備える。A computer readable memory according to the present invention for achieving the above object has the following configuration. That is,
A computer readable memory storing an image processing program code for processing an input image, wherein a program code of an extraction step for extracting a character line in the input image, and a character extracted in the extraction step A program code of a cutting step for cutting out an area image from a line, a program code of a character recognizing step for character recognizing the area cut out in the cutting step, and a first image corresponding to the character recognized in the character recognizing step A program code of a first generation step of generating a signal, a program code of a second generation step of generating a second image signal for an area image in which characters cannot be recognized in the character recognition step, and a program code other than the character line in the image. A program code of a third generation step for generating a third image signal for the image, the first image signal, the second image signal, and the third image signal; And a program code for an output step of outputting a signal synthesized by.

【００１８】[0018]

【発明の実施の形態】以下、図面を参照して本発明の好
適な実施形態を詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Preferred embodiments of the present invention will be described below in detail with reference to the drawings.

【００１９】図１は本実施形態の画像処理装置の機能構
成を示すブロック図である。FIG. 1 is a block diagram showing a functional configuration of the image processing apparatus according to the present embodiment.

【００２０】原稿１００１は、画像読取部１００２のレ
ンズ１０１よりイメージセンサ１０２を通して画像信号
となる。次に、この画像信号がＡ／Ｄコンバータ１０３
に入力されてＡ／Ｄ変換された後、画像処理部１００３
に入力される。画像処理部１００３では、入力された画
像信号に対しシェーディング補正回路１０４でシェーデ
ィング補正を行った後、モード切替回路１０５へ入力す
る。The original 1001 is converted into an image signal through the image sensor 102 from the lens 101 of the image reading unit 1002. Next, this image signal is supplied to the A / D converter 103.
Is input to the A / D converter, and is subjected to A / D conversion.
Is input to In the image processing unit 1003, the input image signal is subjected to shading correction by the shading correction circuit 104, and then input to the mode switching circuit 105.

【００２１】モード切替回路１０５では、操作部１１６
による操作者の設定、つまり、認識複写モードであるか
通常複写モードであるかに基づいて出力先を切り替え
る。通常複写モードが設定された場合、光濃度変換回路
１０６で輝度データを記録濃度データに変換する。次
に、２値化部１２０で記録濃度データを２値化した後、
画像編集部１０７に出力する。画像編集部１０７では、
ＣＰＵ１１３内のＲＡＭ１１５に記憶されている画像処
理の設定内容に基づき、記録濃度データに対し編集処理
を行う。そして、その編集された記録濃度データを画像
記録部１００４に出力する。In the mode switching circuit 105, an operation unit 116
, The output destination is switched based on the recognition copy mode or the normal copy mode. When the normal copy mode is set, the light density conversion circuit 106 converts the luminance data into recording density data. Next, after the recording density data is binarized by the binarization unit 120,
Output to the image editing unit 107. In the image editing unit 107,
The editing process is performed on the recording density data based on the setting contents of the image processing stored in the RAM 115 in the CPU 113. Then, the edited recording density data is output to the image recording unit 1004.

【００２２】画像記録部１００４は、転写紙等の搬送を
行うモータ等の制御回路、画像処理部１００３より入力
されたビデオ信号を感光ドラムに書き込むレーザ記録回
路部、及び現像を行う現像制御回路で構成される記録部
１１７を有し、記録部１１７によって記録紙等の記録媒
体上に画像を記録する。An image recording unit 1004 includes a control circuit such as a motor for transporting transfer paper and the like, a laser recording circuit unit for writing a video signal input from the image processing unit 1003 to a photosensitive drum, and a development control circuit for performing development. The recording unit 117 is configured to record an image on a recording medium such as recording paper.

【００２３】一方、モード切替回路１０５で、認識複写
モードが設定された場合、モード切替回路１０５の出力
先は画像認識部４０００へと切り替えられる。画像認識
部４０００では、画像バッファメモリ１１８を使って、
原稿中の文字を認識し、認識結果である文字コードをフ
ォントデータに変換する。尚、画像認識部４の詳細につ
いては、後述する。On the other hand, when the recognition copy mode is set by the mode switching circuit 105, the output destination of the mode switching circuit 105 is switched to the image recognition unit 4000. The image recognition unit 4000 uses the image buffer memory 118 to
Characters in the document are recognized, and the character code as the recognition result is converted into font data. The details of the image recognition unit 4 will be described later.

【００２４】また、操作部１１６は、画像処理部１００
３に対する画像編集内容、コピー枚数、変倍率等の画像
複写動作等を指示する各種キー群、各種ＬＥＤ群と、操
作時の内容を表示する表示部などを有している。The operation unit 116 is connected to the image processing unit 100.
It has various key groups and various LED groups for instructing an image copy operation such as image editing contents, the number of copies, and a magnification ratio for 3 and a display unit for displaying the contents at the time of operation.

【００２５】次に、本実施形態の画像処理装置のハード
ウェア構成について、図２を用いて説明する。Next, a hardware configuration of the image processing apparatus according to the present embodiment will be described with reference to FIG.

【００２６】図２は本実施形態の画像処理装置のハード
ウェア構成を示す断面図である。FIG. 2 is a sectional view showing a hardware configuration of the image processing apparatus of the present embodiment.

【００２７】図２において、１は原稿給送手段となる原
稿給送装置であり、載置された原稿を１枚ずつ或いは２
枚連続に原稿台ガラス面２上の所定位置に給送する。４
はランプ３、走査ミラー５等で構成されるスキャナであ
り、原稿給送装置１により原稿台ガラス面２に載置され
る。そして、操作部１１６から例えば複写指示や文字認
識指示を与えると、スキャナ４が所定方向（図示の左右
方向）に走査して原稿反射光を走査ミラー５〜７を介し
てレンズ８を通過させ、イメージセンサ部９に結像させ
る。In FIG. 2, reference numeral 1 denotes a document feeder serving as a document feeder, which feeds the placed documents one by one or two.
The sheet is fed to a predetermined position on the platen glass surface 2 continuously. 4
Reference numeral denotes a scanner including a lamp 3, a scanning mirror 5, and the like, which is placed on the platen glass surface 2 by the document feeder 1. When, for example, a copy instruction or a character recognition instruction is given from the operation unit 116, the scanner 4 scans in a predetermined direction (left and right directions in the drawing) to pass the original reflected light through the lens 8 via the scanning mirrors 5 to 7. An image is formed on the image sensor unit 9.

【００２８】１００は、図１に示した各種回路を搭載し
た制御部である。１０は露光部であり、制御部１００内
の画像編集部１０７からの画像データに基づいて画像信
号を発生する画像信号制御部（不図示）より画像信号を
受け、その画像信号に応じた光ビームを感光体１１に照
射する。１２、１３は現像器であり、感光体１１に形成
された静電潜像を所定色の現像剤（トナー）で可視化す
る。１４、１５は被転写紙積載部であり、定形サイズの
記録媒体を積載収納する。記録媒体は、給送ローラの駆
動によりレジストローラ配設位置まで給送され、感光体
１１に形成される画像との画像先端合わせタイミングを
とられた状態で再給紙される。Reference numeral 100 denotes a control unit on which the various circuits shown in FIG. 1 are mounted. Reference numeral 10 denotes an exposure unit that receives an image signal from an image signal control unit (not shown) that generates an image signal based on image data from the image editing unit 107 in the control unit 100, and a light beam corresponding to the image signal. Is irradiated on the photoconductor 11. Developing devices 12 and 13 visualize the electrostatic latent image formed on the photoreceptor 11 with a developer (toner) of a predetermined color. Reference numerals 14 and 15 denote transfer paper stacking units, which stack and store recording media of a fixed size. The recording medium is fed to the position where the registration roller is provided by driving the feeding roller, and is re-fed with the timing of aligning the leading edge of the image with the image formed on the photoconductor 11.

【００２９】１６は転写分離帯電器であり、感光体１１
に現像されたトナー像を記録媒体に転写した後、感光体
１１より分離して搬送ベルトを介して定着部１７で定着
させる。１８は排紙ローラであり、画像形成が終了した
記録媒体をトレー２０に積載排紙する。１９は方向フラ
ッパであり、画像形成の終了した被転写紙の搬送方向を
排紙口と内部搬送方向に切り替え、多重／両面画像形成
プロセスに備える。Reference numeral 16 denotes a transfer separation charger, which is a photosensitive member 11
After the toner image developed on the recording medium is transferred to the recording medium, the toner image is separated from the photoreceptor 11 and fixed by the fixing unit 17 via the transport belt. Reference numeral 18 denotes a paper discharge roller, which stacks and discharges the recording medium on which the image formation has been completed on the tray 20. Reference numeral 19 denotes a directional flapper, which switches the transport direction of the transfer-receiving sheet on which the image formation has been completed between the paper discharge port and the internal transport direction, and prepares for the multiplex / double-side image forming process.

【００３０】次に、本実施形態の操作部１１６の外観に
ついて、図３を用いて説明する。Next, the appearance of the operation unit 116 of this embodiment will be described with reference to FIG.

【００３１】５００１は装置本体への通電を制御する電
源スイッチである。５００２はリセットキーであり、ス
タンバイ中は、標準モードに復帰させるキーとして動作
する。５００３はコピースタートキーである。５００４
はクリアキーであり、複写枚数等の数値をクリアすると
きに使用する。A power switch 5001 controls power supply to the apparatus main body. A reset key 5002 operates as a key for returning to the standard mode during standby. Reference numeral 5003 denotes a copy start key. 5004
Is a clear key used to clear numerical values such as the number of copies.

【００３２】５００５はＩＤキーであり、このＩＤキー
５００５により特定の操作者に対して複写動作を可能に
し、特定の操作者以外の操作者に対しては、ＩＤキー５
００５によりＩＤを入力しない限り複写動作を禁止する
ことが可能となる。５００６はストップキーであり、コ
ピーを中断したり、中止したりするときに用いるキーで
ある。５００７はガイドキーであり、各機能を知りたい
ときに使用するキーである。Reference numeral 5005 denotes an ID key. The ID key 5005 enables a specific operator to perform a copying operation.
005 makes it possible to prohibit a copying operation unless an ID is input. A stop key 5006 is used to interrupt or stop copying. Reference numeral 5007 denotes a guide key, which is used when the user wants to know each function.

【００３３】５００８は上カーソルキーであり、各機能
設定画面においてポインタを上に移動させるキーであ
る。５００９は下カーソルキーであり、各機能設定画面
においてポインタを下に移動させるキーである。５０１
０は右カーソルキーであり、各機能設定画面においてポ
インタを右に移動させるキーである。５０１１は左カー
ソルキーであり、各機能設定画面においてポインタを左
に移動させるキーである。５０１２はＯＫキーであり、
各機能設定画面において、処理の確定を指示するキーで
ある。An upper cursor key 5008 is used to move the pointer upward on each function setting screen. Reference numeral 5009 denotes a down cursor key which is used to move a pointer downward on each function setting screen. 501
A right cursor key 0 is a key for moving the pointer to the right on each function setting screen. A left cursor key 5011 is a key for moving the pointer to the left on each function setting screen. 5012 is an OK key,
This key is used to instruct the finalization of processing in each function setting screen.

【００３４】５０１３は指示キーであり、各機能設定画
面において画面５０５２の右下に出力された画像の内容
の実行を指示するキーである。５０１４は定形縮小キー
であり、定形サイズを他の定形サイズに縮小するときに
使用する。５０１５は等倍コピーキーであり、等倍コピ
ーを選択するときに使用する。５０１６は定形拡大キー
であり、定形サイズを他の定形サイズに拡大するときに
使用する。５０１７はカセット選択キーであり、コピー
するカセット段を選択するときに使用する。An instruction key 5013 is a key for instructing execution of the content of the image output to the lower right of the screen 5052 in each function setting screen. A standard size reduction key 5014 is used to reduce the standard size to another standard size. Reference numeral 5015 denotes a 1: 1 copy key, which is used to select 1: 1 copy. A standard enlargement key 5016 is used to enlarge the standard size to another standard size. A cassette selection key 5017 is used to select a cassette stage to be copied.

【００３５】５０１８はコピー濃度調整キーであり、コ
ピー濃度を薄くするときに使用する。５０１９はＡＥキ
ーであり、原稿の濃度に対しコピー濃度を自動的に調整
するときに使用する。５０２０はコピー濃度調整キーで
あり、コピー濃度を濃くする。５０２１はソータの動作
を指定するキーである。５０２２は予熱キーであり、予
熱モードのＯＮ／ＯＦＦに使用する。５０２３は割り込
みキーであり、コピー中に割り込みしてコピーを行いた
いときに使用する。５０２４はテンキーであり、数値の
入力を行うときに使用する。Reference numeral 5018 denotes a copy density adjustment key which is used to reduce the copy density. An AE key 5019 is used to automatically adjust the copy density with respect to the original density. A copy density adjustment key 5020 increases the copy density. 5021 is a key for specifying the operation of the sorter. Reference numeral 5022 denotes a preheating key, which is used to turn on / off a preheating mode. Reference numeral 5023 denotes an interrupt key, which is used when an interrupt is required to be performed during copying. Reference numeral 5024 denotes a numeric keypad, which is used to input numerical values.

【００３６】５０２５はマーカー処理キーであり、トリ
ミング，マスキング，部分処理（輪郭処理，網処理，影
付け処理，ネガポジ処理）を設定する。５０２６はパタ
ーン可処理キーであり、色をパターン化して表現した
り、色を濃度差で表現したりするときに使用する。５０
２７は色消去キーであり、特定色を消去したいときに使
用する。５０２８は画質キーであり、画質の設定を行い
たいときに使用する。５０２９はネガポジキーであり、
ネガポジ処理を行うときに使用する。５０３０はイメー
ジクリエイトキーであり、輪郭処理，影付け処理，網処
理，斜体，ミラー処理，リピート処理を行うときに使用
する。Reference numeral 5025 denotes a marker processing key for setting trimming, masking, and partial processing (contour processing, halftone processing, shadowing processing, negative / positive processing). Reference numeral 5026 denotes a pattern processable key which is used to express a color in a pattern or to express a color by a density difference. 50
A color erasing key 27 is used to erase a specific color. An image quality key 5028 is used to set the image quality. 5029 is a negative / positive key,
Used when performing negative / positive processing. Reference numeral 5030 denotes an image create key which is used when performing contour processing, shadowing processing, halftone processing, italic, mirror processing, and repeat processing.

【００３７】５０３１はトリミングキーであり、エリア
を指定し、トリミングをするときに使用する。５０３２
はマスキングキーであり、エリアを指定し、マスキング
を行うときに使用する。５０３３は部分処理キーであ
り、エリアを指定し、その後、部分処理（輪郭処理，網
処理，影付け処理，ネガポジ処理）を行うときに使用す
る。５０３４は枠消しキーであり、モードに合わせて枠
消しを行うときに使用する。モードには、シート枠消し
（シートサイズに対して枠を作成する）、原稿枠消し
（原稿サイズに合わせて枠を作成する。原稿サイズ指定
有り）、ブック枠消し（ブックの見開きサイズに合わせ
て枠と中央に空白を作成する。ブック見開きサイズ指定
有り）がある。５０３５は綴じ代キーであり、用紙の一
端に綴じ代を作成したいときに使用する。Reference numeral 5031 denotes a trimming key used to designate an area and perform trimming. 5032
A masking key is used to designate an area and perform masking. Reference numeral 5033 denotes a partial processing key, which is used to designate an area and then perform partial processing (contour processing, halftone processing, shadowing processing, negative / positive processing). Reference numeral 5034 denotes a frame erasing key, which is used when erasing a frame in accordance with a mode. The modes include sheet frame erasing (creating a frame for the sheet size), document frame erasing (creating a frame in accordance with the document size. Document size is specified), and book frame erasing (according to the spread size of the book). Create a blank in the frame and center. Book spread size specified). A binding margin key 5035 is used to create a binding margin at one end of a sheet.

【００３８】５０３６は所望とする領域の画像を移動さ
せる移動キーであり、移動を行いたいときに使用する。
移動には、平行移動（上下左右），センター移動，コー
ナー移動，指定移動（ポイント指定）がある。５０３７
はズームキーであり、複写倍率を２５％〜４００％ま
で、１％刻みで設定できる。また、主走査，副走査を独
立に設定できる。尚、画像の変倍は、原稿の副走査方向
に関しては、スキャナ４の移動速度を制御することで行
ない、原稿の主走査方向に関しては、スキャナ４より読
出した画像データの間引き、或は補間処理で行う。Reference numeral 5036 denotes a movement key for moving an image in a desired area, which is used when moving is desired.
The movement includes parallel movement (up, down, left and right), center movement, corner movement, and designated movement (point designation). 5037
Is a zoom key, which can set a copy magnification from 25% to 400% in increments of 1%. Further, the main scanning and the sub-scanning can be set independently. The magnification of the image is controlled by controlling the moving speed of the scanner 4 in the sub-scanning direction of the document, and the image data read out from the scanner 4 is thinned out or interpolated in the main scanning direction of the document. Do with.

【００３９】５０３８はオート変倍キーであり、複写紙
のサイズに合わせて自動的に拡大縮小する。また、主走
査，副走査を独立にオート変倍できる。５０３９は拡大
連写キーであり、１枚の原稿を複数枚に拡大して複写を
行うときに使用する。５０４０は縮小レイアウトキーで
あり、複数の原稿を１枚に拡大縮小して複写を行うとき
に使用する。５０４３は連写キーであり、原稿台ガラス
面の複写領域を左右に２分割し、自動的に２枚のコピー
をする連続複写（ページ連写，両面連写）を行いたいと
きに使用する。５０４４は両面キーであり、両面（片面
両面，ページ連写両面，両面両面）の出力を行いたいと
きに使用する。Reference numeral 5038 denotes an automatic scaling key, which automatically enlarges or reduces the size according to the size of the copy sheet. In addition, the main scanning and the sub-scanning can be automatically scaled independently. Reference numeral 5039 denotes an enlargement continuous shooting key, which is used when one original is enlarged to a plurality of copies and copied. Reference numeral 5040 denotes a reduced layout key, which is used when copying a plurality of originals by enlarging / reducing them. Reference numeral 5043 denotes a continuous shooting key which is used to divide a copy area on the glass surface of the original platen into two right and left sides and perform continuous copying (page continuous shooting, double-side continuous shooting) for automatically making two copies. Reference numeral 5044 denotes a double-sided key which is used to output both sides (single-sided double-sided, page continuous double-sided, double-sided double-sided).

【００４０】５０４５は多重キーであり、多重（多重，
ページ連写多重）を行いたいときに使用する。５０４６
はメモリキーであり、メモリを使用したモード（メモリ
合成，エリア合成，すかし合成）を行いたいときに使用
する。５０４７はプロジェクタキーであり、プロジェク
タを使用するときに使用する。５０４８はプリンタキー
であり、プリンタ時の設定を行うときに使用する。５０
５０は、原稿混載キーであり、フィーダを使用してコピ
ーを取るとき原稿サイズが混載しているときに使用す
る。５０５１はモードメモリキーであり、複写設定され
た複写モードを登録するため、登録された複写モードを
呼び出すときに使用する。Reference numeral 5045 denotes a multiplex key.
Use when you want to perform (page continuous multiplexing). 5046
Is a memory key, which is used to perform a mode using a memory (memory synthesis, area synthesis, watermark synthesis). A projector key 5047 is used when using the projector. Reference numeral 5048 denotes a printer key, which is used when setting for a printer. 50
Reference numeral 50 denotes a mixed original key, which is used when originals are mixed when taking a copy using a feeder. Reference numeral 5051 denotes a mode memory key, which is used for registering the copy mode set for copy, when calling the registered copy mode.

【００４１】５０５２はタッチパネル付きの液晶表示器
であり、装置の状態，複写枚数，複写倍率，複写用紙サ
イズを表示し、認識複写モード或いは通常複写モードの
モード設定及び設定された複写モードの内容を表示す
る。Reference numeral 5052 denotes a liquid crystal display with a touch panel, which displays the status of the apparatus, the number of copies, the copy magnification, and the copy paper size, and displays the mode settings of the recognition copy mode or the normal copy mode and the contents of the set copy mode. indicate.

【００４２】次に、本実施形態の画像認識部４０００の
詳細構成について、図１を用いて説明する。Next, a detailed configuration of the image recognition unit 4000 of the present embodiment will be described with reference to FIG.

【００４３】まず、認識複写モード時では、２値化部１
２０は誤差拡散法による２値化処理部と一定閾値を用い
た単純２値化処理部を備え、両処理部を用いて光濃度変
換回路１０６からの記録濃度データをそれぞれ２値化
し、それらの２値化信号は画像バッファメモリ１１８に
蓄積される。一方、モード切替回路１０５からの輝度デ
ータは、像域分離部１１９に入力され、入力された輝度
データを画素毎に、注目画素が文字を構成する画素か否
かを判定し、その判定結果を示す属性判定信号（１ビッ
ト）を生成する。この属性判定信号も、画像バッファメ
モリ１１８に蓄積される。特に、像域分離部１１９で
は、一般に文字のエッジ部画素を文字部画素として検出
するため、例えば、８ポイント以下の小さい文字は濃度
に関わらずその文字を構成する全画素に対して文字部画
素を示す属性判定信号“１”を生成する。また、線幅が
太い大きな文字や線分に対しては、その輪郭部を構成す
る画素のみが文字部画素を示す属性判定信号として
“１”が生成される。First, in the recognition copying mode, the binarizing unit 1
Numeral 20 includes a binarization processing unit using an error diffusion method and a simple binarization processing unit using a fixed threshold value. The recording density data from the light density conversion circuit 106 is binarized using both of the processing units. The binarized signal is stored in the image buffer memory 118. On the other hand, the luminance data from the mode switching circuit 105 is input to the image area separation unit 119, and the input luminance data is determined for each pixel to determine whether or not the pixel of interest is a pixel constituting a character. Attribute determination signal (1 bit) shown in FIG. This attribute determination signal is also stored in the image buffer memory 118. In particular, since the image area separation unit 119 generally detects an edge part pixel of a character as a character part pixel, for example, a small character of 8 points or less is replaced with a character part pixel regardless of density. Is generated. For a large character or line segment having a large line width, "1" is generated as an attribute determination signal indicating that only pixels constituting the outline indicate a character part pixel.

【００４４】ここで、像域分離部１１９は、例えば、特
開昭６２−１０７５７０等で開示される周知の像域分離
手法を用いることが可能である。具体的には、注目多値
画素信号をその周辺画素信号から得られる平均値で２値
化し、その２値化された２値化信号の２次元的連続性と
２値化時に得られる平均値と注目画素信号とのレベル差
から文字部を、他の網点中間調画像あるいは銀塩写真等
の連続中間調画像から識別分離する。Here, the image area separation section 119 can use a well-known image area separation technique disclosed in, for example, Japanese Patent Application Laid-Open No. 62-107570. More specifically, the target multi-valued pixel signal is binarized by an average value obtained from its peripheral pixel signals, the two-dimensional continuity of the binarized binarized signal, and the average value obtained at the time of binarization. The character portion is identified and separated from another halftone dot image or a continuous halftone image such as a silver halide photograph from the level difference between the halftone image and the pixel signal of interest.

【００４５】画像バッファメモリ１１８に格納された画
像信号は、像域分離結果である属性判定信号に基づい
て、文字切出部１０８で、画像信号中の黒画素を直交す
る２方向へ射影し、そのヒストグラムから画像信号中の
行或いは列の存在を検出する。そして、検出された行或
いは列の方向により、画像信号全域を一定幅の行或いは
列に分割し、各行毎にその行内の各文字を切り出し、各
切り出された文字に対して順次その特徴量をベクトル算
出部１０９で算出する。認識制御部１１１に切り出され
た文字の特徴量が入力されると、認識制御部１１１が有
する認識辞書を用いて、その文字を認識する。尚、この
認識制御部１１１の処理の詳細については、後述する。
そして、出力された認識文字は、イメージデータ変換部
１１０に渡され、認識文字に対応するフォントデータに
変換される。The image signal stored in the image buffer memory 118 is projected by a character cutout unit 108 in two orthogonal directions on a black pixel in the image signal based on an attribute determination signal as a result of image area separation. The presence of a row or a column in the image signal is detected from the histogram. Then, the whole area of the image signal is divided into rows or columns of a fixed width according to the direction of the detected row or column, each character in the row is cut out for each row, and the feature amount is sequentially determined for each cut out character. It is calculated by the vector calculation unit 109. When the feature amount of the extracted character is input to the recognition control unit 111, the character is recognized using the recognition dictionary of the recognition control unit 111. The details of the processing of the recognition control unit 111 will be described later.
Then, the output recognition character is passed to the image data conversion unit 110 and is converted into font data corresponding to the recognition character.

【００４６】次に、画像認識部４０００の処理フローに
ついて、図４を用いて説明する。Next, the processing flow of the image recognition unit 4000 will be described with reference to FIG.

【００４７】図４は本実施形態の画像認識部の処理フロ
ーを示すフローチャートである。FIG. 4 is a flowchart showing the processing flow of the image recognition unit of this embodiment.

【００４８】まず、ステップＳ１０９で、像域分離部１
１９は、入力された輝度データに対し、像域分離処理を
施し、その処理結果である属性判定信号を画像バッファ
メモリ１１８に蓄積する。尚、文字以外の画像部分で
は、属性判定信号“０”が各画素で得られるが、中間調
画像中のエッジ部等では、文字との誤判定画素が孤立し
て存在する。従って、ステップＳ１１０で、この文字と
孤立判定された誤判定画素（孤立判定画素）を除去す
る。これにより、略全ての画像に対して、その文字部の
みの画素に対して属性判定信号“１”が得られる。First, in step S109, the image area separating unit 1
Reference numeral 19 performs image area separation processing on the input luminance data, and accumulates an attribute determination signal as a processing result in the image buffer memory 118. In an image portion other than a character, an attribute determination signal “0” is obtained for each pixel. However, in an edge portion or the like in a halftone image, a pixel erroneously determined to be a character exists in isolation. Therefore, in step S110, the erroneous determination pixel (isolation determination pixel) determined to be isolated from the character is removed. As a result, for almost all images, the attribute determination signal “1” is obtained for pixels of only the character portion.

【００４９】ステップＳ１００で、ステップＳ１１０の
処理によって得られる１ページ分の画像を画像バッファ
メモリ１１８に格納する。次に、ステップＳ１０１で、
画像バッファメモリ１１８に格納された画像に対し、属
性判定信号に基づいて、Ｘ，Ｙ方向の黒画素の射影を求
めてヒストグラムを作成する。In step S100, the image for one page obtained by the processing in step S110 is stored in the image buffer memory 118. Next, in step S101,
For the image stored in the image buffer memory 118, the projection of black pixels in the X and Y directions is calculated based on the attribute determination signal to create a histogram.

【００５０】例えば、図５に示すような画像が画像バッ
ファメモリ１１８に格納されている場合、つまり、画像
の構成として、上部に絵があり、その下部に横書き文書
が存在する画像に対して、属性判定信号に基づいてヒス
トグラムを作成すると、紙面縦方向のヒストグラムから
は行／列に相当する分離部分は存在しないが、紙面横方
向のヒストグラムからは紙面下部に横方向に行の存在が
検出でき、図６に示すような検出結果を得ることができ
る。即ち、両方向の黒画素のヒストグラムを所定閾値で
２値化し、一定周期の規則性が検出できれば、紙面に対
して行／列の存在が判定できる。また、図６の文字行の
部分をより詳細に示すと、図７のようになる。For example, when an image as shown in FIG. 5 is stored in the image buffer memory 118, that is, as an image configuration, for an image having a picture at the top and a horizontal writing document at the bottom, If a histogram is created based on the attribute determination signal, the histogram in the vertical direction of the paper does not have a separation portion corresponding to a row / column, but the histogram in the horizontal direction of the paper can detect the presence of a horizontal line at the bottom of the paper. , A detection result as shown in FIG. 6 can be obtained. That is, the histogram of black pixels in both directions is binarized by a predetermined threshold value, and if regularity at a fixed period can be detected, the existence of a row / column with respect to the sheet can be determined. FIG. 7 shows the character line portion of FIG. 6 in more detail.

【００５１】次に、ステップＳ１０３で、判定した行／
列に対し、平均行高さＧ／平均文字高さＶを決定する。
つまり、図７に示すように、横方向の黒画素ヒストグラ
ムからその平均周期Ｇが得られ、これが行間空白部を含
めた行高さとなる。また、平均文字高さＶも同様に得ら
れる。そして、ステップＳ１０４で、得られた平均行高
さＧを用いて、画像全体を帯状に分割する。図６の例で
は、Ｌ１からＬ２７の２７本の行に分割できる。Next, in step S103, the line /
Determine the average row height G / average character height V for the columns.
That is, as shown in FIG. 7, the average period G is obtained from the horizontal black pixel histogram, and this is the line height including the space between lines. Also, the average character height V is obtained in the same manner. Then, in step S104, the entire image is divided into strips using the obtained average row height G. In the example of FIG. 6, the data can be divided into 27 rows from L1 to L27.

【００５２】次に、ステップＳ１０５で、分割された各
行が文字認識可能な認識行であるか否かを判定する。例
えば、図６の例では、行Ｌ１から行Ｌ１３及び行Ｌ２７
の行の場合、明らかに各行内には黒画素が存在しないの
で、文字以外の画像領域と判定する。一方、行Ｌ１６は
隣接する行Ｌ１５及び行Ｌ１７との境界に黒画素が存在
しないので、これらの行は認識行であると判定できる。
即ち、分割された帯状の領域が認識行であるか否かは、
その境界線上の黒画素の有無から容易に判定できる。こ
こで、行Ｌ１４，行Ｌ１５は平均的なサイズの文字行と
は異なる認識行であるが、この場合は認識対象から外
す。また、行Ｌ１，行Ｌ１３，行Ｌ２７は空白領域であ
ることも判定でき、この場合、白い画像領域として扱っ
ても良いし、あるいは空白文字の集合として扱うことも
可能である。Next, in step S105, it is determined whether or not each of the divided lines is a recognized line in which characters can be recognized. For example, in the example of FIG. 6, the rows L1 to L13 and L27
In the case of the row, there is obviously no black pixel in each row, so that it is determined that the area is an image area other than a character. On the other hand, since the row L16 has no black pixel at the boundary between the adjacent rows L15 and L17, these rows can be determined to be recognized rows.
That is, whether or not the divided strip-shaped area is a recognition line is
It can be easily determined from the presence or absence of a black pixel on the boundary line. Here, the lines L14 and L15 are recognition lines different from character lines of average size, but are excluded from recognition targets in this case. It is also possible to determine that the lines L1, L13, and L27 are blank areas. In this case, the rows L1, L13, and L27 may be handled as white image areas or as a set of blank characters.

【００５３】次に、ステップＳ１０６で、各認識行から
切り出される文字に対して文字認識を行う。次に、ステ
ップＳ１０７で、認識文字をフォントデータに変換す
る。次に、ステップＳ１０８で、フォントデータと認識
行以外のイメージデータ（誤差拡散法による２値化処理
部で２値化された画像）とを合成する。従って、図１に
おいてイメージデータ変換部１１０は、対象行が認識行
の場合、認識制御部１１１で得られた認識文字からフォ
ントデータを展開すると共に、対象行がイメージデータ
の場合は、画像バッファメモリ１１８より誤差拡散法に
よる２値化処理部で２値化された画像信号を、夫々切り
替えて画像編集部１０７に出力する。Next, in step S106, character recognition is performed on characters cut out from each recognition line. Next, in step S107, the recognized character is converted into font data. Next, in step S108, the font data and the image data other than the recognition line (the image binarized by the binarization processing unit using the error diffusion method) are synthesized. Therefore, in FIG. 1, the image data conversion unit 110 expands the font data from the recognized character obtained by the recognition control unit 111 when the target line is a recognition line, and also, when the target line is image data, the image buffer memory. From 118, the image signals binarized by the binarization processing unit using the error diffusion method are switched to each other and output to the image editing unit 107.

【００５４】以上の処理によって、図６の例の場合、行
Ｌ１３から行Ｌ２６までの各行が認識行となる。また、
認識行ではない各行内の画素に対しては、誤差拡散法で
２値化された画像信号を記録／表示のための画像信号と
して使う。By the above processing, in the case of the example of FIG. 6, each row from the row L13 to the row L26 becomes a recognized row. Also,
For pixels in each row that is not a recognition row, an image signal binarized by the error diffusion method is used as an image signal for recording / display.

【００５５】次に、ステップＳ１０６で、各認識行から
切り出される文字に対して文字認識を行う。次に、ステ
ップＳ１０７で、認識文字をフォントデータに変換す
る。次に、ステップＳ１０８で、フォントデータと認識
行以外のイメージデータ（誤差拡散法による２値化処理
部で２値化された画像）とを合成する。従って、図１に
おいてイメージデータ変換部１１０は、対象行が認識行
の場合、認識制御部１１１で得られた認識文字からフォ
ントデータを展開すると共に、対象行がイメージデータ
の場合は、画像バッファメモリ１１８より２値化部１２
０で誤差拡散法によって２値化された画像信号を、夫々
切り替えて画像編集部１０７に出力する。Next, in step S106, character recognition is performed on characters cut out from each recognition line. Next, in step S107, the recognized character is converted into font data. Next, in step S108, the font data and the image data other than the recognition line (the image binarized by the binarization processing unit using the error diffusion method) are synthesized. Therefore, in FIG. 1, the image data conversion unit 110 expands the font data from the recognized character obtained by the recognition control unit 111 when the target line is a recognition line, and also, when the target line is image data, the image buffer memory. From 118, the binarization unit 12
The image signals binarized by the error diffusion method at 0 are respectively switched and output to the image editing unit 107.

【００５６】次に、ステップＳ１０６の処理の詳細につ
いて、図８を用いて説明する。Next, details of the process in step S106 will be described with reference to FIG.

【００５７】図８は本実施形態の図４のステップＳ１０
６の処理の詳細を示すフローチャートである。FIG. 8 shows step S10 of FIG. 4 of the present embodiment.
6 is a flowchart illustrating details of a process 6;

【００５８】尚、図８の具体例として、図９を参照して
説明する。図９は、特に、図６の先頭行を更に拡大した
認識行（単純２値化した画像信号）を示している。A specific example of FIG. 8 will be described with reference to FIG. FIG. 9 particularly shows a recognition row (simple binarized image signal) obtained by further enlarging the top row of FIG.

【００５９】まず、ステップＳ２００で、紙面縦（Ｙ）
方向に黒画素を射影してヒストグラム５５０を作成す
る。次に、ステップＳ２０１で、作成したヒストグラム
５５０より、認識行の先頭文字の左先端部を検出する。
この場合、認識行の左端から所定数以上黒画素が存在す
る画素列５５１が検出される。First, in step S 200, the vertical direction (Y)
A histogram 550 is created by projecting black pixels in the direction. Next, in step S201, the left end of the first character of the recognition line is detected from the created histogram 550.
In this case, a pixel column 551 having a predetermined number or more of black pixels from the left end of the recognition row is detected.

【００６０】ステップＳ２０２で、文字Ｃｎの先端から
平均文字幅Ｈの列に黒画素があるか否かを判定する。黒
画素がある場合（ステップＳ２０２でＹＥＳ）、ステッ
プＳ２０４で、文字Ｃｎは半角であると判定する。一
方、黒画素がない場合（ステップＳ２０２でＮＯ）、文
字Ｃｎは全角であると判定する。In step S202, it is determined whether there is a black pixel in the column of the average character width H from the leading end of the character Cn. If there is a black pixel (YES in step S202), it is determined in step S204 that the character Cn is a half-width character. On the other hand, when there is no black pixel (NO in step S202), it is determined that the character Cn is a double-byte character.

【００６１】尚、ステップＳ２０２の判定は、以下のよ
うに行う。即ち、通常漢字は縦横の比が略１：１である
ため、平均文字高さＶを使って、平均全角文字幅Ｈ（＝
Ｖ＋α αは定数）を定義する。画素列５５１から平均
全角文字幅Ｈ右に位置する列５５２上の黒画素数５５３
を評価する。即ち、文字Ｃ１が全角文字であるなら、列
５５２上は次の文字との隙間に位置しているため、黒画
素は存在しない。従って、認識行内の文字をその左端か
ら夫々平均全角文字幅Ｈ毎右に位置する列の黒画素数を
評価して、その文字が全角文字か否かを判定する。列５
５２では、黒画素が存在しないため、文字Ｃ１は全角と
判定される。一方、次の文字Ｃ２も同様にその左端から
平均全角文字幅Ｈに位置する列５５４上の黒ドットを評
価する。この場合、明らかに列５５４上に黒ドットが存
在するため、文字Ｃ２は全角文字でないと判定できる。
特に、本実施形態では、文字を全角と半角のどちらかに
識別するため、文字Ｃ２は半角と判定される。The determination in step S202 is performed as follows. That is, since a normal kanji has an aspect ratio of about 1: 1, the average full-width character width H (=
V + α α is a constant). The number of black pixels 553 on the column 552 located to the right of the average full-width character width H from the pixel column 551
To evaluate. That is, if the character C1 is a double-byte character, no black pixel exists because the column 552 is located in the gap with the next character. Therefore, the number of black pixels in a column located on the right side of each character in the recognition row from the left end thereof to each average full-width character width H is evaluated to determine whether or not the character is a full-width character. Row 5
At 52, since no black pixel exists, the character C1 is determined to be full-width. On the other hand, the next character C2 similarly evaluates a black dot on the column 554 located at the average full-width character width H from the left end. In this case, since a black dot clearly exists on the column 554, it can be determined that the character C2 is not a full-width character.
In particular, in the present embodiment, the character C2 is determined to be a half-width because the character is identified as either a full-width or a half-width.

【００６２】次に、ステップＳ２１１で、文字の字体を
全角あるいは半角のどちらかに識別した各領域、即ち、
全角の場合Ｈ＊Ｇ、半角の場合（Ｈ／２）＊Ｇ領域内の
文字画素の存在を判定する。文字画素が存在する場合
（ステップＳ２１１でＹＥＳ）、ステップＳ２０５に進
み、認識された文字のサイズに応じて、文字Ｃｎの認識
を行う。一方、文字画素が存在しない場合（ステップＳ
２１１でＮＯ）、ステップＳ２１３に進み、文字をイメ
ージデータとして中間調を表現できる誤差拡散法で２値
化した画像信号を選択する。Next, in step S211, each area where the font of the character is identified as either full-width or half-width,
The presence of a character pixel in the full-width H * G and half-width (H / 2) * G areas is determined. If there is a character pixel (YES in step S211), the process advances to step S205 to perform recognition of the character Cn according to the size of the recognized character. On the other hand, if there is no character pixel (step S
In step S213, the process proceeds to step S213, where an image signal binarized by an error diffusion method capable of expressing halftones using characters as image data is selected.

【００６３】ステップＳ２０６で、文字Ｃｎが認識可能
であるか否かを判定する。認識可能である場合（ステッ
プＳ２０６でＹＥＳ）、ステップＳ２０７で、認識され
た文字をフォントデータに展開して、新たなイメージデ
ータとして置き換える。一方、認識不可能である場合
（ステップＳ２０６でＮＯ）、ステップＳ２１２に進
み、文字をイメージデータとして単純２値化した画像信
号を選択する。In step S206, it is determined whether or not the character Cn can be recognized. If recognizable (YES in step S206), in step S207, the recognized character is developed into font data and replaced with new image data. On the other hand, if recognition is not possible (NO in step S206), the flow advances to step S212 to select an image signal obtained by simply binarizing characters as image data.

【００６４】次に、ステップＳ２０８で、認識行内の次
の文字の先端を検出する。ステップＳ２０９で、認識行
の終了であるか否かを判定する。終了でない場合（ステ
ップＳ２０９でＮＯ）、ステップＳ２０２に戻る。一
方、終了である場合（ステップＳ２０９でＹＥＳ）、処
理を終了する。Next, in step S208, the leading end of the next character in the recognition line is detected. In step S209, it is determined whether the end of the recognition line has been reached. If the processing has not been completed (NO in step S209), the process returns to step S202. On the other hand, if the processing has ended (YES in step S209), the processing ends.

【００６５】例えば、図１０は、該当行の像域分離結果
を拡大したものであり、明らかに図９の文字Ｃ１から文
字Ｃ１０に対しては、その文字形状を保存した文字画素
判定結果が得られる。従って、この場合は、各文字に対
する認識は、その１文字分の領域内の画像データ（単純
２値化信号）を正規化し、認識辞書との比較によって行
う。一方、この行の後端部には絵が存在している。即
ち、図１０の後端部Ａ３０からＡ３４の部分には、孤立
判定画素を除去しても、数点の文字画素５６３が存在し
ている。従って、このような文字画素の画素数が所定値
以下である場合、、その１文字文の領域は非文字部分、
つまり、イメージデータとして中間調を表現できる誤差
拡散法で２値化した画像信号を選択する。尚、この領域
内に存在する図１１に示す単純２値化した画像信号であ
る画素５６２に対し、上述したステップＳ２０２の処理
を施した場合、殆どの領域が半角と判定されるが、中間
調の画像の濃度が淡い場合には、全角と判定される。こ
の場合、全角／半角のどちらかに判定する判定結果はど
ちらでも構わない。For example, FIG. 10 is an enlarged view of the image area separation result of the corresponding line. Obviously, for the characters C1 to C10 in FIG. Can be Therefore, in this case, recognition of each character is performed by normalizing the image data (simple binary signal) in the area for one character and comparing it with the recognition dictionary. On the other hand, a picture exists at the rear end of this row. That is, several character pixels 563 exist at the rear end portions A30 to A34 in FIG. 10 even if the isolation determination pixels are removed. Therefore, when the number of such character pixels is equal to or less than a predetermined value, the area of the one character sentence is a non-character part,
That is, an image signal binarized by an error diffusion method capable of expressing a halftone as image data is selected. It should be noted that when the above-described processing in step S202 is performed on the pixel 562 that is a simple binarized image signal shown in FIG. If the density of the image is light, it is determined to be full-width. In this case, the determination result for either full-width or half-width may be either.

【００６６】ここで、ステップＳ２０５の認識方法の一
例について、図１２を用いて説明する。Here, an example of the recognition method in step S205 will be described with reference to FIG.

【００６７】図１２は本実施形態の認識方法の一例を説
明するための図である。FIG. 12 is a diagram for explaining an example of a recognition method according to the present embodiment.

【００６８】図１２では、切出された文字［王］を認識
する場合の木構造の認識辞書を用いた例である。FIG. 12 shows an example using a tree-structured recognition dictionary for recognizing a cut-out character [king].

【００６９】ここで、木構造の認識辞書における１つの
段のｉ番目のグループの標準パターンベクトル（特徴
量）を＊ｖi、切出された文字の入力ベクトルを＊ｕと
すると、ベクトル＊ｖiとベクトル＊ｕとの距離は、ｒ
をベクトルの次元数として、Here, if the standard pattern vector (feature amount) of the i-th group in one stage in the tree-structured recognition dictionary is * vi and the input vector of the extracted character is * u, the vector * vi The distance from the vector * u is r
Is the number of dimensions of the vector,

【００７０】[0070]

【数１】で表される。そして、この木構造に沿い、前後の類似し
た標準パターン同士でグループ分けすることにより次段
が構成される複数段の各段において、各グループの標準
パターンと切出された文字との最短距離を求める。この
ようにして、最短距離を与えるグループを結ぶパスを形
成しながら、下り検索を行い、最終段で得られた最短距
離の文字候補（標準パターン）を認識文字として出力す
る。このようにして、各段において距離が最小となるグ
ループが選択されて、図１２に示す最終段の文字［王］
に至るパスが形成される。(Equation 1) It is represented by Then, along this tree structure, the shortest distance between the standard pattern of each group and the extracted character is determined in each of a plurality of stages that are formed by grouping similar standard patterns before and after. Ask. In this way, a down search is performed while forming a path connecting the groups giving the shortest distance, and the shortest distance character candidate (standard pattern) obtained in the final stage is output as a recognition character. In this way, the group with the smallest distance in each row is selected, and the character [King] in the last row shown in FIG.
Is formed.

【００７１】尚、得られた文字候補である標準パターン
ベクトルと切出された文字の入力ベクトルとの距離が一
定値を超える場合には、その文字候補を採用せず、認識
不能と判断して、切出された文字を、単純２値化された
イメージデータとして扱う。一方、認識できた場合、後
述する手順に従って、切り出された文字に対応するフォ
ントデータを新たな画像信号として置き換えると共に、
認識行内の次の文字の左端をヒストグラム５５０より検
出する。When the distance between the obtained standard pattern vector as a character candidate and the input vector of the extracted character exceeds a certain value, the character candidate is not adopted, and it is determined that the character cannot be recognized. , And the extracted characters are treated as simple binarized image data. On the other hand, if it is recognized, the font data corresponding to the extracted character is replaced with a new image signal according to a procedure described later,
The left end of the next character in the recognition line is detected from the histogram 550.

【００７２】次に、認識された文字のフォントデータへ
の変換方法について、図１０、図１１、図１３を用いて
説明する。Next, a method of converting recognized characters into font data will be described with reference to FIGS. 10, 11, and 13. FIG.

【００７３】図１０は単純２値化された画像信号、図１
１は像域分離された画像信号、図１３は本発明によって
認識結果をフォントデータに変換した結果の例である。FIG. 10 shows a simple binarized image signal.
1 is an example of an image signal whose image area has been separated, and FIG. 13 is an example of a result obtained by converting a recognition result into font data according to the present invention.

【００７４】図１３は、特に、図９に示す認識行から切
出された文字Ｃ１から文字Ｃ１０を文字Ｂ１から文字Ｂ
１０として変換した例である。説明のため、フォントは
字体を丸ゴシックとし、サイズも少し小さめで表してい
る。フォントデータの展開は、各切出された文字の左上
端の点５６０，５６１を基準に対応するフォントデータ
を展開して配置する。従って、原稿の文字サイズより大
きなサイズで展開すれば、隣接する文字間が縮まり、更
に大きい場合は重なることになる。FIG. 13 shows characters C1 to C10 extracted from the recognition line shown in FIG.
This is an example converted as 10. For the sake of explanation, the font is round Gothic and the font size is slightly smaller. The font data is developed by arranging the font data corresponding to the points 560 and 561 at the upper left corner of each cut character. Therefore, if the image is developed in a size larger than the character size of the document, the space between adjacent characters is reduced, and if the size is larger, the characters overlap.

【００７５】尚、被認識文字Ｃ６は認識不能であったと
想定した例であり、この場合、先に説明したように、切
出された文字Ｃ６がイメージデータとして扱われ、フォ
ントデータに展開された文字Ｂ１から文字Ｂ１０と共に
合成されることになる。また、行の後端部Ｃ３０、Ｃ３
１、Ｃ３２、Ｃ３３は誤差拡散法による２値化された画
像信号５６４が合成される。このように認識行中の切出
された文字の端部に、展開したフォントデータの端部を
合わせることにより、認識不能文字をイメージデータと
してそのまま用いてフォントデータ中に合成しても、自
然な文書として品位の高い画像信号を出力することがで
きる。It is assumed that the recognized character C6 cannot be recognized. In this case, as described above, the extracted character C6 is treated as image data and expanded into font data. The character B1 is combined with the character B10. Also, the rear end portions C30 and C3 of the row
1, C32, and C33 are combined with a binary image signal 564 by the error diffusion method. By aligning the end of the expanded font data with the end of the cut-out character in the recognition line in this way, even if the unrecognizable character is directly used as image data and synthesized into the font data, A high-quality image signal can be output as a document.

【００７６】以上説明したように、本実施形態によれ
ば、文字画素の存在する認識行に対して、切り出された
領域内の文字が認識可能であれば、対応するフォントデ
ータへ展開する。また、切り出された領域内の文字が認
識不能でも、像域分離処理によって領域内に文字が存在
することを識別し、その認識不能文字については最も解
像性の高い単純２値化された画像信号を選択し出力する
ことで、解像度が良好な画像を生成することができる。
更に、認識行の一部に存在する絵の部分は、像域分離処
理の処理結果に基づいて、中間調を表現できる誤差拡散
法で２値化された画像信号を選択し出力することで、良
好な階調表現を有する画像を生成することができる。As described above, according to the present embodiment, if a character in a cut-out region can be recognized in a recognition line in which a character pixel exists, the character line is developed into corresponding font data. Further, even if the characters in the cut-out area are not recognizable, the presence of characters in the area is identified by the image area separation processing, and the unrecognizable character is converted into a simple binary image having the highest resolution. By selecting and outputting a signal, an image with good resolution can be generated.
Further, the picture part existing in a part of the recognition line is selected and output by selecting an image signal binarized by an error diffusion method capable of expressing a halftone based on the processing result of the image area separation processing, An image having a good gradation expression can be generated.

【００７７】上記実施形態では、図６で示した行Ｌ１
３、行Ｌ１４にまたがって存在する文字は通常の文字と
して文字認識を行うが、標準パターンと切出された文字
との最短距離が所定値を超えるため、認識不能文字とし
て扱われる。そして、行Ｌ１３、行Ｌ１４にまたがって
存在する文字は、図８で示した処理では、認識不能文字
となることは明らかである。そこで、図８のステップＳ
２１１の処理で切り出された領域、すなわち、全角なら
Ｈ＊Ｇ、半角なら（Ｈ／２）*Ｇの領域内の文字画素を
評価するだけでなく、特に、その横方向の（例えば、行
Ｌ１３と行Ｌ１４）の境界線上に文字画素が存在するか
否か判定する処理（図１４のステップＳ２１４）を行
い、境界線上に文字画素が所定値を超えて存在する場合
には、その領域内の文字を認識不能文字として、ステッ
プＳ２１２へ分岐すれば、より高速に認識不能文字の判
定を行うことができる。In the above embodiment, the row L1 shown in FIG.
3. Characters existing over the line L14 are recognized as normal characters, but are treated as unrecognizable characters because the shortest distance between the standard pattern and the extracted character exceeds a predetermined value. Then, it is clear that a character existing over the line L13 and the line L14 becomes an unrecognizable character in the processing shown in FIG. Therefore, step S in FIG.
In addition to evaluating the character pixels in the area cut out in the process of 211, that is, the area of H * G for full-width and (H / 2) * G for half-width, it particularly evaluates the character pixels in the horizontal direction (for example, line L13). (Step S214 in FIG. 14) is performed to determine whether or not a character pixel exists on the boundary between the line and the line L14). If the character is determined as an unrecognizable character and the process branches to step S212, the determination of the unrecognizable character can be performed more quickly.

【００７８】また、画像内の文字領域を分離する像域分
離方法は、上記実施形態に限定されず、例えば、画像内
の２値化ドット数、濃度勾配、濃度ヒストグラム等を用
いた他の方法を用いても実施可能である。The image area separating method for separating a character area in an image is not limited to the above embodiment. For example, another method using the number of binarized dots in an image, a density gradient, a density histogram, or the like is used. It can also be implemented using.

【００７９】また、入力原稿がカラー原稿に対しては、
色分解した３色信号から輝度信号を生成すれば、白黒画
像信号と同様に文字領域の認識が可能である。When the input document is a color document,
If a luminance signal is generated from the three color signals obtained by color separation, it is possible to recognize a character area in the same manner as a monochrome image signal.

【００８０】また、画像内のイメージ領域を誤差拡散法
で２値化した画像信号で出力したが、中間調が良好に表
現できる、例えば、多値擬似中間調処理、組織ディザ
法、パルス幅変調を想定した多値の画像信号で出力する
ようにしても良い。特に、多値信号として出力する場合
には、フォントデータは０と１の１ｂｉｔデータを、例
えば、０と２５５の８ｂｉｔデータとして出力し、認識
不能文字は原稿画像を読み取った時に得られる多値信号
にエッジ強調処理を施して出力し、中間調の絵の部分は
原稿画像を読み取った時に得られる多値信号に平滑化処
理を施して出力すれば、より高品位な画像を出力するこ
とができるられる。Although the image area in the image is output as an image signal binarized by an error diffusion method, halftones can be expressed well. For example, multi-value pseudo halftone processing, tissue dithering, pulse width modulation May be output as a multi-valued image signal. In particular, when outputting as a multi-valued signal, the font data is output as 1-bit data of 0 and 1, for example, as 8-bit data of 0 and 255, and the unrecognizable character is a multi-valued signal obtained when an original image is read. If a halftone picture portion is subjected to a smoothing process on a multi-level signal obtained when a document image is read and then output, a higher quality image can be output. Can be

【００８１】また、単純２値化された画像信号から文字
認識を行わずに、像域分離結果で得られる文字領域に対
し文字認識を実行し、それ以外の空白部分をイメージ領
域として処理すれば、イメージ領域を直接判定でき、か
つ文字領域のみの文字認識処理を実行すれば良いので、
処理を高速化できる。If character recognition is performed on a character area obtained as a result of image area separation without performing character recognition from a simple binarized image signal, and other blank parts are processed as image areas, Since the image area can be directly determined and the character recognition processing only for the character area needs to be executed,
Processing can be accelerated.

【００８２】また、孤立判定画素を除去する構成とした
が、それ以外の画素で発生する文字画素の誤判定は、例
えば、認識行内から切り出された各領域に対する像域分
離処理において、隣接領域に挟まれる孤立した認識可能
文字領域は、文字以外の領域として判定を補正すれば良
い。例えば、図１０の領域Ａ３１は、その文字画素数に
よって文字と判定され、文字認識が実行されてしまう。
この場合、その文字認識結果には、該当文字が見つから
ず、結局、認識不能文字と判定される。その結果、図１
１のＣ３１の単純２値化された画像信号が出力される画
像信号、つまり、図１３のＢ３１の画像信号が選択され
てしまい、生成される画像が極めて見づらいものとなっ
てしまう。従って、図１０のＡ３１に隣接するＡ３０、
Ａ３２が共に非文字領域と判定された場合は、Ａ３０、
Ａ３２に挟まれるＡ３１も非文字領域と判定を補正す
る。Although the isolation determination pixel is removed, the erroneous determination of a character pixel occurring in other pixels may be caused, for example, in an image area separation process for each area cut out from the recognition line. The determination may be corrected for the isolated recognizable character area that is sandwiched between the areas as a non-character area. For example, the area A31 in FIG. 10 is determined to be a character based on the number of character pixels, and character recognition is performed.
In this case, the corresponding character is not found in the character recognition result, and is eventually determined to be an unrecognizable character. As a result, FIG.
The image signal from which the simple binarized image signal of C31 of FIG. 1 is output, that is, the image signal of B31 in FIG. 13 is selected, and the generated image becomes extremely difficult to see. Therefore, A30 adjacent to A31 in FIG.
If A32 is determined to be a non-character area, A30,
A31 sandwiched between A32 also corrects the determination as a non-character area.

【００８３】尚、本発明は、複数の機器（例えばホスト
コンピュータ、インタフェース機器、リーダ、プリンタ
など）から構成されるシステムに適用しても、一つの機
器からなる装置（例えば、複写機、ファクシミリ装置な
ど）に適用してもよい。Even if the present invention is applied to a system composed of a plurality of devices (for example, a host computer, an interface device, a reader, a printer, and the like), a device including one device (for example, a copying machine, a facsimile machine) Etc.).

【００８４】また、本発明の目的は、前述した実施形態
の機能を実現するソフトウェアのプログラムコードを記
録した記憶媒体を、システムあるいは装置に供給し、そ
のシステムあるいは装置のコンピュータ（またはＣＰＵ
やＭＰＵ）が記憶媒体に格納されたプログラムコードを
読出し実行することによっても、達成されることは言う
までもない。Further, an object of the present invention is to provide a storage medium storing a program code of software for realizing the functions of the above-described embodiments to a system or an apparatus, and a computer (or CPU) of the system or the apparatus.
And MPU) read and execute the program code stored in the storage medium.

【００８５】この場合、記憶媒体から読出されたプログ
ラムコード自体が前述した実施形態の機能を実現するこ
とになり、そのプログラムコードを記憶した記憶媒体は
本発明を構成することになる。In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention.

【００８６】プログラムコードを供給するための記憶媒
体としては、例えば、フロッピディスク、ハードディス
ク、光ディスク、光磁気ディスク、ＣＤ−ＲＯＭ、ＣＤ
−Ｒ、磁気テープ、不揮発性のメモリカード、ＲＯＭな
どを用いることができる。Examples of a storage medium for supplying the program code include a floppy disk, hard disk, optical disk, magneto-optical disk, CD-ROM, and CD.
-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

【００８７】また、コンピュータが読出したプログラム
コードを実行することにより、前述した実施形態の機能
が実現されるだけでなく、そのプログラムコードの指示
に基づき、コンピュータ上で稼働しているＯＳ（オペレ
ーティングシステム）などが実際の処理の一部または全
部を行い、その処理によって前述した実施形態の機能が
実現される場合も含まれることは言うまでもない。When the computer executes the readout program code, not only the functions of the above-described embodiment are realized, but also the OS (Operating System) running on the computer based on the instruction of the program code. ) May perform some or all of the actual processing, and the processing may realize the functions of the above-described embodiments.

【００８８】更に、記憶媒体から読出されたプログラム
コードが、コンピュータに挿入された機能拡張ボードや
コンピュータに接続された機能拡張ユニットに備わるメ
モリに書込まれた後、そのプログラムコードの指示に基
づき、その機能拡張ボードや機能拡張ユニットに備わる
ＣＰＵなどが実際の処理の一部または全部を行い、その
処理によって前述した実施形態の機能が実現される場合
も含まれることは言うまでもない。Further, after the program code read from the storage medium is written into a memory provided in a function expansion board inserted into the computer or a function expansion unit connected to the computer, based on the instruction of the program code, It goes without saying that the CPU included in the function expansion board or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【００８９】本発明を上記記憶媒体に適用する場合、そ
の記憶媒体には、先に説明した図４、図８、図１４に示
すフローチャートに対応するプログラムコードが格納さ
れることになる。When the present invention is applied to the storage medium, the storage medium stores program codes corresponding to the flowcharts shown in FIGS. 4, 8, and 14 described above.

【００９０】[0090]

【発明の効果】以上説明したように、本発明によれば、
文字画像と中間調画像が混在する画像をより高品位に再
生することができる画像処理装置及びその方法、コンピ
ュータ可読メモリを提供できる。As described above, according to the present invention,
It is possible to provide an image processing apparatus and method capable of reproducing an image in which a character image and a halftone image are mixed with higher quality, and a computer-readable memory.

[Brief description of the drawings]

【図１】本実施形態の画像処理装置の機能構成を示すブ
ロック図である。FIG. 1 is a block diagram illustrating a functional configuration of an image processing apparatus according to an embodiment.

【図２】本実施形態の画像処理装置のハードウェア構成
を示す断面図である。FIG. 2 is a cross-sectional view illustrating a hardware configuration of the image processing apparatus according to the embodiment.

【図３】本実施形態の操作部の外観を示す図である。FIG. 3 is a diagram illustrating an appearance of an operation unit according to the embodiment.

【図４】本実施形態の画像認識部の処理フローを示すフ
ローチャートである。FIG. 4 is a flowchart illustrating a processing flow of an image recognition unit according to the embodiment.

【図５】本実施形態の画像及びヒストグラム作成結果の
一例を示す図である。FIG. 5 is a diagram illustrating an example of an image and a histogram creation result according to the embodiment;

【図６】本実施形態の認識行の判定結果の一例を示す図
である。FIG. 6 is a diagram illustrating an example of a determination result of a recognition row according to the embodiment;

【図７】本実施形態のヒストグラム作成結果の詳細を示
す図である。FIG. 7 is a diagram illustrating details of a histogram creation result according to the present embodiment.

【図８】本実施形態の図４のステップＳ１０６の処理の
詳細を示すフローチャートである。FIG. 8 is a flowchart illustrating details of a process in step S106 in FIG. 4 of the present embodiment.

【図９】本実施形態の認識行に施す処理を説明するため
の図である。FIG. 9 is a diagram for explaining processing performed on a recognition row according to the embodiment;

【図１０】本実施形態の認識方法の一例を説明するため
の図である。FIG. 10 is a diagram illustrating an example of a recognition method according to the embodiment.

【図１１】本実施形態のフォントデータへの変換方法を
説明するための図である。FIG. 11 is a diagram for explaining a method of converting to font data according to the present embodiment.

【図１２】本実施形態の認識方法の一例を説明するため
の図である。FIG. 12 is a diagram illustrating an example of a recognition method according to the embodiment.

【図１３】本実施形態のフォントデータへの変換方法を
説明するための図である。FIG. 13 is a diagram for explaining a method of converting to font data according to the present embodiment.

【図１４】本実施形態の画像認識部の処理フローの他の
例を示すフローチャートである。FIG. 14 is a flowchart illustrating another example of the processing flow of the image recognition unit of the present embodiment.

[Explanation of symbols]

１０１レンズ１０２イメージセンサ１０３ＡＤコンバータ１０４シェーディング補正回路１０５モード切替回路１０６光濃度変換回路１０７画像編集部１０８文字切出部１０９ベクトル算出部１１０イメージデータ変換部１１１認識制御部１１３ＣＰＵ１１４ＲＯＭ１１５ＲＡＭ１１６操作部１１７記録部１１８画像バッファメモリ１１９像域分離部１２０２値化部１００１原稿１００２画像読取部１００３画像処理部１００４画像記録部４０００画像認識部 Reference Signs List 101 lens 102 image sensor 103 AD converter 104 shading correction circuit 105 mode switching circuit 106 light density conversion circuit 107 image editing unit 108 character cutout unit 109 vector calculation unit 110 image data conversion unit 111 recognition control unit 113 CPU 114 ROM 115 RAM 116 Operation unit 117 Recording unit 118 Image buffer memory 119 Image area separation unit 120 Binarization unit 1001 Document 1002 Image reading unit 1003 Image processing unit 1004 Image recording unit 4000 Image recognition unit

フロントページの続き (72)発明者片岡淳之介東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者小林誠東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者本田永和東京都大田区下丸子３丁目30番２号キヤノン株式会社内Ｆターム(参考） 5B029 AA01 BB02 CC21 CC29 DD03 EE11 5B064 AA01 BA01 CA03 CA08 DC17 5C077 MP01 NN11 PP27 TT06 Continuation of the front page (72) Inventor Junnosuke Kataoka 3-30-2 Shimomaruko, Ota-ku, Tokyo Inside Canon Inc. (72) Inventor Makoto Kobayashi 3-30-2 Shimomaruko, Ota-ku, Tokyo Inside Canon Inc. (72) Inventor Eiwa Honda 3-30-2 Shimomaruko, Ota-ku, Tokyo F-term in Canon Inc. (reference) 5B029 AA01 BB02 CC21 CC29 DD03 EE11 5B064 AA01 BA01 CA03 CA08 DC17 5C077 MP01 NN11 PP27 TT06

Claims

[Claims]

1. An image processing apparatus for processing an input image, comprising: extracting means for extracting a character line in the input image; and cutting out an area image from the character line extracted by the extracting means. Output means; character recognition means for character recognizing an area cut out by the extraction means; first generation means for generating a first image signal corresponding to the character recognized by the character recognition means; Means for generating a second image signal for an area image in which characters cannot be recognized by the means; third generating means for generating a third image signal for an image other than the character line in the image; and the first image An image processing apparatus comprising: an output unit that synthesizes and outputs a signal, the second image signal, and the third image signal.

2. The image processing apparatus according to claim 1, wherein the first image signal is font data corresponding to the character.

3. The image processing apparatus according to claim 1, wherein the second image signal is a simple binarized image signal.

4. The image processing apparatus according to claim 1, wherein the third image signal is an image signal subjected to pseudo halftone processing.

5. The character recognizing means includes first determining means for determining whether or not a pixel exists at a boundary of a character line adjacent to the attention area image, and based on a determination result of the first determining means. 2. The image processing apparatus according to claim 1, wherein the target area image is determined to be a non-character recognizable area image.

6. The character recognition means includes a second determination means for determining whether or not the attention area image is character recognizable, based on the number of pixels present in the attention area image, The image processing apparatus according to claim 1, further comprising: determining the target area image as an area image in which characters cannot be recognized based on a determination result of a determination unit.

7. The apparatus according to claim 1, wherein the character recognizing means determines that the region of interest image is a region image incapable of character recognition when a region image adjacent to the region of interest image is not character recognizable. Image processing device.

8. The image processing apparatus according to claim 1, wherein when the image is a multi-level image signal, the second image signal is an edge-emphasized multi-level image signal.

9. The image processing apparatus according to claim 1, wherein when the image is a multi-valued image signal, the third image signal is a smoothed multi-valued image signal.

10. An image processing method for processing an input image, comprising: an extraction step of extracting a character line in the input image; and an area image cutting out an area image from the character line extracted in the extraction step. An output step; a character recognition step of recognizing a character in the area cut out in the extraction step; a first generation step of generating a first image signal corresponding to the character recognized in the character recognition step; A second generation step of generating a second image signal for an area image in which characters cannot be recognized in the step; a third generation step of generating a third image signal for an image other than the character line in the image; and the first image An output step of combining and outputting a signal, the second image signal, and the third image signal.

11. The apparatus according to claim 10, wherein the first image signal is font data corresponding to the character.
The image processing method according to 1.

12. The image processing method according to claim 10, wherein the second image signal is a simple binarized image signal.

13. The image processing method according to claim 10, wherein the third image signal is an image signal subjected to pseudo halftone processing.

14. The character recognition step includes a first determination step of determining whether or not a pixel exists at a boundary of a character line adjacent to the attention area image, and based on a determination result of the first determination step. 11. The image processing method according to claim 10, wherein the attention area image is determined to be an area image in which characters cannot be recognized.

15. The character recognition step includes a second determination step of determining whether or not the attention area image is capable of character recognition based on the number of pixels present in the attention area image; The image processing method according to claim 10, further comprising: determining the attention area image as an area image in which characters cannot be recognized based on a determination result of the determination step.

16. The character recognition step according to claim 10, wherein, if an area image adjacent to the attention area image is not capable of character recognition, the attention area image is determined to be an area image incapable of character recognition. Image processing method.

17. When the image is a multi-valued image signal,
The image processing method according to claim 10, wherein the second image signal is a multi-valued image signal with edge enhancement.

18. When the image is a multi-valued image signal,
The image processing method according to claim 10, wherein the third image signal is a smoothed multi-valued image signal.

19. A computer-readable memory storing an image processing program code for processing an input image, wherein: a program code for an extraction step for extracting a character line in the input image; A program code of a cutting step of cutting out an area image from the character line extracted in the above, a program code of a character recognizing step of recognizing a character in the area cut out in the cutting step, and A program code of a first generation step of generating a corresponding first image signal; a program code of a second generation step of generating a second image signal for an area image in which characters cannot be recognized in the character recognition step; A program code of a third generation step of generating a third image signal for an image other than the character line, the first image signal, the second image signal, Image signal, a computer readable memory; and a program code of the third image signal combining and outputs the output process.