JP2000137765A

JP2000137765A - Processor and method for image processing and storage medium

Info

Publication number: JP2000137765A
Application number: JP10311464A
Authority: JP
Inventors: Hiroshi Tanioka; 宏谷岡; Izuru Horiuchi; 出堀内; Junnosuke Kataoka; 淳之介片岡; Makoto Kobayashi; 誠小林; Nagakazu Honda; 永和本田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1998-10-30
Filing date: 1998-10-30
Publication date: 2000-05-16

Abstract

PROBLEM TO BE SOLVED: To improve the reproducibility of the whole document image and to obtain an output image with high-quality characters, by outputting character codes for image portions that are highly likely to be characters and outputting the corresponding portions of the original document image for everything else. SOLUTION: When a recognition copy mode is selected at an operation part 116, an image read by an image read part 1002 is stored in an image buffer memory 118 and recognized by an image recognition part 4000. Here, each partial image to be recognized is cut out by a character segmentation part 108, and a vector calculation part 109 extracts its feature quantity. When the smallest difference between the extracted feature quantity and those of the standard characters held by a recognition control part 111 is less than a threshold, the partial image is judged to be outputtable as a character, its character code is output, and an image data conversion part 110 generates a character pattern. When the difference is not less than the threshold, the partial image is output as image data as it is.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an image processing apparatus, method, and storage medium, and more particularly to an image processing apparatus, method, and storage medium that recognize character images in a read image.

[0002]

[Prior Art] With the recent increase in the speed of CPUs and semiconductors, character recognition can now be performed in substantially real time at roughly the copying speed of a low-speed copying machine. In addition, since most originals copied in recent years are created with a word processor or the like and then printed, recognition accuracy has also risen and has reached a fully practical level.

[0003] As for recognizing the character size and the characters in a document, the applicant has already proposed a technique in, for example, Japanese Patent Application Laid-Open No. 61-107876. If that technique is used to convert the character image information of the document into recognized font image information before reproduction, image degradation caused by repeated copying is eliminated in principle.

[0004]

[Problem to Be Solved by the Invention] However, when the document contains not only characters but also figures and the like, part of a figure may be converted into a similar character or symbol, producing an extremely unnatural reproduced image. Furthermore, when the typeface is recognized so that the recognized characters can be reproduced in the typeface of the document, features of the figure portions are mixed in, so the typeface cannot be recognized with high accuracy.

[0005]

[Means for Solving the Problem] The present invention has been made in view of the above problems. It aims to provide an image processing apparatus and method that output character codes for image portions in a document image that are highly likely to be characters and output the corresponding portions of the document image for everything else, thereby improving the reproducibility of the document image as a whole while producing high-quality output for characters.

[0006] To solve this problem, an image processing apparatus according to the present invention has, for example, the following configuration. That is, an image processing apparatus that performs character recognition on character candidate image portions in a document image comprises: search means for searching for a character for which the difference between the feature quantity of a character candidate image portion and a standard feature quantity stored in a recognition dictionary is small; comparison means for comparing the difference finally obtained by the search means with a predetermined threshold; character code output means for, when the comparison shows that the difference is smaller than the threshold, treating the character candidate image portion of interest as a character and outputting the corresponding character code based on the recognition dictionary; and image output means for, when the comparison shows that the difference is equal to or greater than the threshold, treating the character candidate image portion of interest as a non-character and outputting it as image data.

[0007]

[Embodiments of the Invention] Embodiments of the present invention are described in detail below with reference to the accompanying drawings.

[0008] FIG. 1 shows a block diagram of the copying machine of the embodiment. Each component is described below together with its operation.

[0009] The image of a document 1001 passes through a lens 101 of an image reading unit 1002 onto a CCD 102, and the resulting signal enters an A/D converter 103. The signal after A/D conversion is input to an image processing unit 1003. In the image processing unit 1003, a shading correction circuit 104 applies the appropriate correction to the read image data and supplies the result to a mode switching circuit 105.

[0010] The mode switching circuit 105 switches its output destination according to whether the operator has selected the recognition copy mode or the normal copy mode on the operation unit 116. In the normal copy mode, a light/density conversion circuit 106 converts the luminance data into recording density data and outputs it to an image editing unit 107. The image editing unit 107 performs various editing processes, such as trimming and color conversion, based on data processed in accordance with the image processing settings stored in a RAM 115 in a CPU circuit 113. The image editing unit 107 also contains a memory for developing one page of image data. It uses this memory for the editing processes and, as described later, can also develop font patterns and the like in it and output the result to an image recording unit 1004.

[0011] The image recording unit 1004 comprises a control circuit for the motors and the like that convey transfer paper, a laser recording circuit that writes the video signal input from the image processing unit 1003 onto a photosensitive drum, and a development control circuit that performs development. It records an image on a recording medium such as recording paper.

[0012] When the recognition copy mode is set, on the other hand, the output destination of the mode switching circuit 105 is switched to an image recognition unit 4000. The image recognition unit 4000 first stores the incoming image data in an image buffer memory 118 and recognizes the characters in the document (generates character codes). The result is passed to an image data conversion unit 110, which generates an output font image for each recognized character. The image data conversion unit 110 has font memories for various typefaces (outline font data in this embodiment) and generates font patterns in the specified typeface.

[0013] The operation unit 112 has a group of keys for specifying image editing operations for the image processing unit 1003 and image copying operations such as the number of copies and the magnification, a group of LEDs, and a display unit that shows the current operation.

[0014] FIG. 2 is a sectional view showing the structure of the image copying apparatus of the embodiment.

[0015] In FIG. 2, reference numeral 1 denotes a document feeder serving as document feeding means, which feeds the loaded documents to a predetermined position on a platen glass surface 2 one sheet at a time or two sheets in succession. Reference numeral 3 denotes a scanner comprising a lamp, a scanning mirror 5, and the like. When a document has been placed on the platen glass surface 2 by the document feeder 1 and, for example, a copy instruction or a character recognition instruction is given from the operation unit 116, the scanner moves in a predetermined direction (left and right in the figure), and the light reflected from the document passes via scanning mirrors 5 to 7 through a lens 8 and forms an image on an image sensor unit 9.

[0016] Reference numeral 100 denotes a control unit (mainly printed circuit boards) carrying the circuits shown in FIG. 1. Reference numeral 10 denotes an exposure unit, which generates a light beam based on image data from the image editing unit 107 in the control unit 100. The light beam is swept horizontally by a polygon mirror and, via mirrors and the like, scans and exposes a photoconductor 11. Reference numerals 12 and 13 denote developing units, which make the electrostatic latent image formed on the photoconductor 11 visible with developer (toner) of a predetermined color. Reference numerals 14 and 15 denote transfer paper stacking units, in which recording media of standard sizes are stacked. A sheet is fed to the registration roller position by the feed rollers and is then fed again with its leading edge timed to match the image formed on the photoconductor 11.

[0017] Reference numeral 16 denotes a transfer/separation charger. After the toner image developed on the photoconductor 11 is transferred to the transfer paper, the paper is separated from the photoconductor 11 and carried by a conveyor belt to a fixing unit 17, where the image is fixed. Reference numeral 18 denotes discharge rollers, which discharge and stack the finished transfer paper on a tray 20. Reference numeral 19 denotes a direction flapper, which switches the conveying direction of the finished transfer paper between the discharge port and the internal conveying path in preparation for multiple or double-sided image formation.

[0018] FIG. 3 is a top view of the operation unit 116 used in this embodiment.

[0019] In the figure, reference numeral 5001 denotes a power switch that controls power supply to the apparatus main body. Reference numeral 5002 denotes a reset key, which during standby returns the apparatus to the standard mode (normal copy mode). Reference numeral 5003 denotes a copy start key. Reference numeral 5004 denotes a clear key, used to clear numerical values such as the number of copies.

[0020] Reference numeral 5005 denotes an ID key. The ID key 5005 makes it possible to allow copying for specific operators and to prohibit copying for all other operators unless an ID is entered with the ID key. Reference numeral 5006 denotes a stop key, used to interrupt or cancel copying. Reference numeral 5007 denotes a guide key, used to find out about each function. Reference numeral 5008 denotes an up cursor key, which moves the pointer up on each function setting screen. Reference numeral 5009 denotes a down cursor key, and 5010 denotes a right cursor key and a left cursor key; they work in the same way as the up cursor key 5008 except for the direction.

[0021] Reference numeral 5013 denotes a key pressed on each function setting screen to execute the item shown at the lower right of the screen 5052. Reference numeral 5014 denotes a fixed-ratio reduction key, used to reduce one standard size to another standard size. Reference numeral 5015 is used to select same-size copying. Reference numeral 5016 denotes a fixed-ratio enlargement key, used to enlarge one standard size to another standard size. Reference numeral 5017 denotes a cassette selection key, which selects the cassette tier to copy from. Reference numeral 5018 denotes a copy density adjustment key that makes the density lighter. Reference numeral 5019 denotes an AE key, which adjusts the copy density automatically to the density of the document. Reference numeral 5020 denotes a copy density adjustment key that makes the density darker. Reference numeral 5021 denotes a key that specifies the operation of the sorter. Reference numeral 5022 denotes a preheat key, used to turn the preheat mode on and off. Reference numeral 5023 denotes an interrupt key, pressed to interrupt a copy job in progress and make another copy. Reference numeral 5024 denotes a numeric keypad, used to enter numerical values.

[0022] Reference numeral 5025 denotes a marker processing key, which invokes various image editing processes; for example, it sets trimming, masking, and partial processing (contour processing, halftone processing, shadowing, negative/positive reversal). Reference numeral 5026 denotes a patterning key, used to express colors as patterns or as density differences. Reference numeral 5027 denotes a color erasing key, used to erase a specific color. Reference numeral 5028 denotes an image quality key, used to set the image quality. Reference numeral 5029 denotes a negative/positive key, used for negative/positive reversal. Reference numeral 5030 denotes an image create key, used for contour processing, shadowing, halftone processing, italicizing, mirroring, and repeat processing. Reference numeral 5031 denotes a trimming key, used to specify an area and trim it. Reference numeral 5032 denotes a masking key, used to specify an area and mask it. Reference numeral 5033 denotes a partial processing key, used to specify an area and then specify the partial processing (contour processing, halftone processing, shadowing, negative/positive reversal) to apply. Reference numeral 5034 denotes a frame erasing key, used to erase frames according to the mode. The modes are sheet frame erasing (a frame is created to fit the sheet size), document frame erasing (a frame is created to fit the document size; the document size is specified), and book frame erasing (a frame and a blank center strip are created to fit the size of an open book; the open-book size is specified). Reference numeral 5035 denotes a binding margin key, used to create a binding margin along one edge of the sheet.

[0023] Reference numeral 5036 denotes a move key, used to move the image in a desired area. The available moves are parallel movement (up, down, left, right), center movement, corner movement, and designated movement (point designation). Reference numeral 5037 denotes a zoom key, which sets the copy magnification from 25% to 400% in 1% steps. The magnification can also be set independently for main scanning and sub-scanning. In the sub-scanning direction of the document the image is scaled by controlling the moving speed of the scanner 3; in the main scanning direction it is scaled by thinning out or interpolating the image data read by the scanner 3.
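The main-scanning scaling described in [0023] can be illustrated with a minimal Python sketch. It is only an assumption about one possible realization: the text says "thinning out or interpolation" without naming a method, so the sketch uses nearest-neighbor selection, which drops pixels when reducing and repeats them when enlarging. The function name `scale_scanline` and the sample values are hypothetical.

```python
def scale_scanline(line, percent):
    """Scale one scanline to `percent` of its length (25 to 400).

    Assumption: nearest-neighbor selection stands in for the unspecified
    thinning/interpolation method. Reducing drops pixels, enlarging repeats them.
    """
    if not 25 <= percent <= 400:
        raise ValueError("magnification must be between 25 and 400 percent")
    if not line:
        return []
    n_out = max(1, round(len(line) * percent / 100))
    last = len(line) - 1
    # Output pixel i is taken from source position i * 100 / percent.
    return [line[min(last, i * 100 // percent)] for i in range(n_out)]


scanline = [10, 20, 30, 40]
reduced = scale_scanline(scanline, 50)    # every second pixel is thinned out
enlarged = scale_scanline(scanline, 200)  # every pixel is repeated once
```

Sub-scanning scaling has no counterpart in this sketch, because the text performs it mechanically through the scanner speed rather than on the image data.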

[0024] Reference numeral 5038 denotes an auto scaling key, which enlarges or reduces the image automatically to fit the size of the copy paper. Main scanning and sub-scanning can also be auto-scaled independently. Reference numeral 5039 denotes an enlargement multi-page key, used to enlarge one document across several sheets. Reference numeral 5040 denotes a reduction layout key, used to scale several documents so that they fit on one sheet. Reference numeral 5043 denotes a continuous copy key, used for continuous copying in which the copy area of the platen glass surface is divided into left and right halves and two copies are made automatically (page continuous copy, double-sided continuous copy). Reference numeral 5044 denotes a double-sided key, used for double-sided output (single-sided to double-sided, page continuous copy to double-sided, double-sided to double-sided). Reference numeral 5045 denotes a multiple key, used for multiple copying (multiple, page continuous copy multiple). Reference numeral 5046 denotes a memory key, used for modes that use memory (memory composition, area composition, watermark composition). Reference numeral 5047 denotes a projector key, used when a projector is used. Reference numeral 5048 denotes a printer key, used to make settings for printer operation. Reference numeral 5050 denotes a mixed-document key, used when documents of different sizes are copied together through the feeder. Reference numeral 5051 denotes a mode memory key, used to register a configured copy mode and to recall a registered copy mode. Reference numeral 5052 denotes a liquid crystal display with a touch panel, which shows the state of the apparatus, the number of copies, the copy magnification, and the copy paper size.

[0025] The recognition copy mode or the normal copy mode described above can also be selected here: touching the corresponding menu item shown on the liquid crystal display 5052 switches to that mode.

[0026] Next, the character recognition section that characterizes this embodiment is described in detail with reference to FIG. 1.

[0027] A character segmentation unit 108 binarizes the image stored in the image buffer memory 118, detects from its projections in two orthogonal directions (histograms of black dots) the average character mesh size that encloses one character, and divides the character image accordingly. The character image in each resulting mesh is normalized, and a vector analysis unit 109 computes its feature quantity as an unknown character vector. When this feature quantity (the feature quantity of the unknown character) is input to a recognition control unit 111, the unit searches downward through the tree structure of its dictionary: at each level it compares the feature quantity with those of the neighboring, similar standard patterns and follows the path of the closer one. The character whose feature quantity is at the shortest distance in the final level is output as the recognized character. The mesh size is reported to the CPU 113, which passes it on to the image data conversion unit 110.
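The projection step in [0027] can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the circuit of the character segmentation unit 108: the page is assumed to be already binarized into rows of 0/1 values, a run of non-empty columns (or rows) is taken as one character extent, and the helper names `projection`, `runs`, and `average_mesh_size` are hypothetical.

```python
def projection(bitmap, axis):
    """Count black pixels per row (axis=0) or per column (axis=1)."""
    if axis == 0:
        return [sum(row) for row in bitmap]
    return [sum(col) for col in zip(*bitmap)]


def runs(profile):
    """Return (start, end) pairs of consecutive non-zero entries, end exclusive."""
    spans, start = [], None
    for i, count in enumerate(profile):
        if count and start is None:
            start = i
        elif not count and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def average_mesh_size(bitmap):
    """Average (width, height) of the black runs, or None for a blank bitmap."""
    col_runs = runs(projection(bitmap, 1))
    row_runs = runs(projection(bitmap, 0))
    if not col_runs or not row_runs:
        return None
    width = sum(end - start for start, end in col_runs) / len(col_runs)
    height = sum(end - start for start, end in row_runs) / len(row_runs)
    return width, height


# Two 2-pixel-wide glyphs separated by one blank column.
bitmap = [
    [1, 1, 0, 1, 1],
    [1, 0, 0, 0, 1],
]
columns = projection(bitmap, 1)
mesh = average_mesh_size(bitmap)
```

A real page would also need noise handling and a pitch estimate that tolerates characters with internal gaps; the sketch leaves both out.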

[0028] FIG. 4 shows this recognition process using the tree-structured dictionary for an unknown character (here, the character 王).

[0029] Let $\mathbf{v}_i$ be the standard pattern vector (feature quantity) of the $i$-th group at one level of the tree-structured dictionary, and let $\mathbf{u}$ be the input vector of the unknown character (the feature quantity of the character image 王). With $r$ the number of dimensions of the vectors, the distance $d_i$ between $\mathbf{v}_i$ and $\mathbf{u}$ is

$$d_i = \sum_{k=0}^{r-1} \left| v_{ik} - u_k \right|$$

where $\mathbf{v}_i = \{v_{i0}, v_{i1}, v_{i2}, \ldots, v_{i(r-1)}\}$ and $\mathbf{u} = \{u_0, u_1, u_2, \ldots, u_{r-1}\}$.

[0030] The group with the smallest distance d_i calculated in this way is selected. The search then descends in the same way through the second level, the third level, and so on, and finally forms a path that arrives at the character 王.
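The descent described in [0029] and [0030] can be sketched in Python as follows. The sketch assumes a dictionary held as nested nodes, each carrying a standard pattern vector; the `Node` class, the `descend` function, and the three-element toy vectors are hypothetical and only illustrate the sum-of-absolute-differences distance and the greedy choice of the nearest group at each level.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    vector: list                      # standard pattern vector of this group or character
    char: str = ""                    # filled in only for leaves (actual characters)
    children: list = field(default_factory=list)


def l1_distance(v, u):
    """Sum of absolute component differences between two equal-length vectors."""
    return sum(abs(a - b) for a, b in zip(v, u))


def descend(root, u):
    """Follow the nearest child at every level; return (character, final distance)."""
    node, distance = root, None
    while node.children:
        node = min(node.children, key=lambda child: l1_distance(child.vector, u))
        distance = l1_distance(node.vector, u)
    return node.char, distance


# Toy dictionary: two groups on the first level, characters on the second.
dictionary = Node(vector=[], children=[
    Node(vector=[5, 5, 0], children=[
        Node(vector=[5, 5, 1], char="王"),
        Node(vector=[5, 5, 3], char="玉"),
    ]),
    Node(vector=[0, 0, 0], children=[
        Node(vector=[0, 1, 0], char="一"),
    ]),
])
result = descend(dictionary, [5, 5, 1])
```

Because only one branch is followed per level, the search cost grows with the depth of the tree rather than with the number of characters, at the price of possibly missing a closer character in a branch that was not taken.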

[0031] When the image in each mesh is recognized with the dictionary in this way, an image that is not a character, for example part of a line segment of a table or figure, is still matched to whichever character or symbol in the dictionary is nearest, regardless of the absolute value of the distance. For example, where part of a ruled line crosses the numeral 一 at right angles, that portion is converted into the numeral 十. Therefore, letting Ve be the distance to the character obtained at the final level, the following determination is made.

[0032]
Ve < K (K is a preset constant): the mesh is judged to contain a character.
Otherwise: the mesh is judged to contain something other than a character.

[0033] In other words, the content of a mesh is judged to be a character only when the distance Ve is smaller than the predetermined value K, that is, only when the likelihood of the first-candidate character is reasonably high; otherwise it is treated as image information.

[0034] Accordingly, if Ve < K in the case of FIG. 4, the image data conversion unit 110 generates dot data from its font ROM using the 2-byte code that represents the character 王. Conversely, if Ve ≥ K, the image data stored in the image buffer memory is sent to the image editing unit as it is.
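The per-mesh decision in [0032] to [0034] reduces to a single comparison, sketched here in Python. The function name `classify_mesh`, the tagged-tuple return value, and the sample threshold are hypothetical; the only rule taken from the text is that a distance below K yields a character code and a distance of K or more passes the original image data through.

```python
def classify_mesh(char_code, distance, threshold, mesh_image):
    """Return ("char", code) when distance < threshold, else ("image", pixels)."""
    if distance < threshold:
        return ("char", char_code)
    return ("image", mesh_image)


K = 8                                  # illustrative threshold value only
glyph = [[1, 1, 1], [0, 1, 0], [1, 1, 1]]
close_match = classify_mesh("王", 3, K, glyph)
borderline = classify_mesh("十", 8, K, glyph)   # equal to K counts as non-character
```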

[0035] By applying this processing to every mesh, the page can be recorded or displayed as a reproduced image in which character portions are rendered in the recognized character font and all other areas are rendered as image information, as in an ordinary copying machine.

[0036] When image information and font data are mixed for reproduction in this way, using font data that matches the typeface of the characters in the document gives a reproduced image of still higher quality. Typeface recognition in this embodiment is described with reference to the flowchart in FIG. 5.

[0037] In FIG. 5, as described above, step S10 determines the mesh for the character images in the input image, divides the character image using the determined mesh size, and computes a feature quantity from the image signal in each mesh. Step S11 then performs character recognition by referring to the dictionary.

[0038] Next, it is determined whether the distance (difference) between the feature quantity obtained from the image in the mesh of interest and the feature quantity (stored in the dictionary) of the first-candidate character obtained by recognition (the character at the lowest level in FIG. 4) is less than the threshold K. If the distance is less than K, the image in the mesh of interest can be regarded as a character image, so the candidate character code is stored, together with the position information of the mesh, in a memory (not shown) provided in the image data conversion unit 110 (step S13).

[0039] In this case the recognized image is clearly very likely to be a character, so a typeface feature quantity Sj is also calculated. If the feature quantity used for recognition already captures properties such as stroke width and direction, it can be used as the basis for this calculation. If it carries no information on stroke width and the like (in Gothic type all strokes have the same thickness, whereas in Mincho type they differ), the image data in the mesh is also used. The calculated typeface feature quantities are then accumulated (step S15). The figure shows a simple addition, but in practice Mincho and Gothic are mutually exclusive, while italic is a separate category. The typeface feature quantities obtained for each mesh are therefore added to two separate variables: one indicating the degree to which the type is Mincho (or Gothic), and one indicating the degree to which it is italic.

[0040] If the distance is equal to or greater than the threshold K, the image is not a character, or cannot reliably be accepted as one. The image data in that mesh, together with its position information, therefore passes through the image data conversion unit 110 to the image editing unit 107 and is handled as an image, as in normal copying. Steps S13 to S15 are skipped in this case, so the mesh has no effect on the typeface determination.

[0041] In step S16, the above processing is repeated for the images in all the meshes.

【００４２】さて、全メッシュに対する処理が終了する
と、ステップＳ１７に進み、上記のステップＳ１５で得
られた字体特徴量Ｓに基づいて字体を判定し、それをイ
メージデータ変換部１１０に通知する。たとえば、明朝
体の度合を示す総量を総メッシュ数で割ることで１文字
当たりの明朝体であることの平均的な度合、斜体である
度合を示す総量を総メッシュ数で割ることで平均的な斜
体の度合を算出する。そして、平均的な明朝体である度
合を示す値が閾値以上の場合には明朝体であると判断
し、そうでない場合にはゴシック体であると判断する。
斜体も同様である。When the processing for all the meshes is completed, the process proceeds to step S17, where the font is determined based on the font feature amount S obtained in step S15, and the result is notified to the image data conversion unit 110. For example, dividing the total indicating the degree of Mincho style by the total number of meshes gives the average degree of Mincho style per character, and dividing the total indicating the degree of italic style by the total number of meshes gives the average degree of italic style. If the value indicating the average degree of Mincho style is greater than or equal to a threshold, the font is determined to be Mincho; otherwise it is determined to be Gothic. The same applies to italics.
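The averaging in step S17 can be sketched as below. The function name `determine_font`, the pair representation of per-mesh feature amounts, and the 0.5 thresholds are assumptions for illustration; the sketch also assumes that only meshes accepted as characters are passed in, so the divisor is the number of contributing meshes.

```python
def determine_font(mesh_features, mincho_threshold=0.5, italic_threshold=0.5):
    """Decide typeface and italic flag from per-mesh font feature amounts.

    `mesh_features` is a list of (mincho_degree, italic_degree) pairs, one per
    mesh accepted as a character (hypothetical representation).
    """
    n = len(mesh_features)
    if n == 0:
        return None  # no character meshes, so nothing to decide
    mincho_total = sum(m for m, _ in mesh_features)
    italic_total = sum(i for _, i in mesh_features)
    # Mincho and Gothic are mutually exclusive; italic is an independent axis.
    typeface = "mincho" if mincho_total / n >= mincho_threshold else "gothic"
    italic = italic_total / n >= italic_threshold
    return typeface, italic
```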

【００４３】イメジデータ変換部１１０は、この通知を
受けると、対応する字体のフォントメモリを選択し、自
身のメモリに格納された文字コードに基づいてフォント
パターンを生成する。このとき、斜体であると判断され
た場合には、斜体文字パターンを生成するのは勿論であ
る。そして、生成したフォントパターンとその展開位置
情報を画像編集部１０７に出力する。Upon receiving this notification, the image data conversion unit 110 selects a font memory of the corresponding font and generates a font pattern based on the character code stored in its own memory. At this time, if it is determined that the character is italic, it goes without saying that an italic character pattern is generated. Then, the generated font pattern and its development position information are output to the image editing unit 107.

【００４４】画像編集部１０７は、先に説明したように
１ページ分の画像展開用のメモリを有しているので、こ
こにイメージデータ変換部１１０から供給された文字パ
ターンデータ及びその位置情報、文字として認定されて
なかった画像データとその位置情報に基づき、それらを
合成し、１ページ分の画像合成及び展開が完了してから
記録動作を行う。As described above, the image editing unit 107 has a memory for developing one page of image data. In this memory it combines the character pattern data and position information supplied from the image data conversion unit 110 with the image data that was not recognized as characters and its position information, and it performs the recording operation after image combination and development for one page are complete.

【００４５】以上の結果、字体の判定は、文字である可
能性が高いメッシュ内の画像データに基づいて判定され
るので、字体判定の精度を高めることができるようにな
る。As a result, the font is determined based on the image data in the mesh which is likely to be a character, so that the accuracy of the font determination can be improved.

【００４６】なお、上記の例では書体として明朝体とゴ
シック体の２通り、及び、斜体か否かの判断のみを示し
たが、これ以外、あるいは、これに追加する形態で字体
の判定を行ってもよいのは勿論である。In the above example, only the two typefaces Mincho and Gothic and the determination of whether the text is italic are shown; needless to say, other fonts may be determined instead of, or in addition to, these.

【００４７】次に実施形態で文字として判定するための
閾値Ｋの設定について説明する。Next, the setting of the threshold value K for determining as a character in the embodiment will be described.

【００４８】図６は図３で示した操作表示部のタッチパ
ネル付き表示器５０５２の画面を示したもので、閾値Ｋ
の設定に関わる部分のみ提示している。FIG. 6 shows the screen of the touch-panel display 5052 of the operation display unit shown in FIG. 3; only the portion related to setting the threshold value K is shown.

【００４９】通常複写可能時には、画面４２０が表示さ
れており、原稿の種類或いは複写目的に応じて認識複写
モード選択表示キー４２１、文字／写真モード選択表示
キー４２２、写真モード選択表示キー４２３が表示され
ている。When normal copying is possible, a screen 420 is displayed, showing a recognition copy mode selection display key 421, a character/photo mode selection display key 422, and a photo mode selection display key 423, to be chosen according to the type of document or the purpose of copying.

【００５０】実施形態では、キー４２１がタッチされ、
認識複写が選択されると、先に説明した認識複写モード
が実行されることになる。また、ユーザー設定キー４２
４を押下すれば、画面は４３０に移行し、この画面で
は、従来複写機でユーザーが設定可能な、例えば画質調
整、或いは各タイマー等の設定に加えて、認識複写モー
ド設定表示キー４３１を有する。In the embodiment, when the key 421 is touched and recognition copy is selected, the recognition copy mode described above is executed. When the user setting key 424 is pressed, the screen changes to screen 430, which provides, in addition to the settings a user can make on a conventional copying machine, such as image quality adjustment or the various timers, a recognition copy mode setting display key 431.

【００５１】今該認識複写モード設定表示キー４３１を
押下して、画面４４０に移行させると、レバー表示４４
２を左右に動かすキー４４１，４４３が表示される。す
なわちレバー表示４４２の位置に応じて定数Ｋの値を設
定する。今キー４４１の操作でレバー位置４４２を左に
移動させると、これは、原稿が殆ど文字だけで構成され
た場合を想定し、この位置では、メッシュ内に図形の一
部が存在する可能性が少ない為、閾値Ｋは十分大きな値
とする。すなわち、最短距離の値に関わらず、最短距離
で得られた文字が原稿の文字である確率が最も高い為
に、殆ど定数Ｋによる制限を加える事無く、人が読み難
い文字までも認識複写が可能である。When the recognition copy mode setting display key 431 is pressed to move to screen 440, keys 441 and 443 for moving the lever display 442 left and right are displayed. The value of the constant K is set according to the position of the lever display 442. Moving the lever position 442 to the left by operating the key 441 corresponds to a document composed almost entirely of characters. At this position a mesh is unlikely to contain part of a figure, so the threshold value K is set to a sufficiently large value. In other words, regardless of the value of the shortest distance, the character obtained at the shortest distance is the most likely to be the character in the document, so almost no restriction is imposed by the constant K, and even characters that are difficult for a person to read can be recognized and copied.

【００５２】逆に今キー４４１の操作でレバー位置４４
２を右に移動させると、これは、原稿中に文字が少なく
図形等他の属性の画像が混在する場合を想定し、この位
置では、メッシュ内に図形の一部が存在する可能性が高
い為、Ｋは逆に小さな値とする。Conversely, moving the lever position 442 to the right by operating the key 441 corresponds to a document with few characters in which images of other attributes, such as figures, are mixed. At this position a mesh is likely to contain part of a figure, so K is set to a small value.

【００５３】レバー位置４４２と定数Ｋは以上の様に対
応ずけて予め設定する。The lever position 442 and the constant K are set in advance in correspondence with each other as described above.
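The preset correspondence between lever position and K could look like the sketch below. The number of lever steps, the values `k_max` and `k_min`, and the linear spacing are all hypothetical; the source only states that the leftmost position gives a sufficiently large K and the rightmost a small one.

```python
def threshold_for_lever(position, k_max=1000.0, k_min=100.0, steps=5):
    """Map a lever position (0 = leftmost, steps - 1 = rightmost) to threshold K.

    Leftmost: document is almost all text, so K is large (few rejections).
    Rightmost: figures are mixed in, so K is small (strict acceptance).
    The values and the linear interpolation are illustrative assumptions.
    """
    if not 0 <= position < steps:
        raise ValueError("lever position out of range")
    return k_max - (k_max - k_min) * position / (steps - 1)
```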

【００５４】この表示、設定及び定数Ｋはバックアップ
電池等で保持されるＲＡＭ１１５に記憶保持する。The display, the settings, and the constant K are stored in the RAM 115, whose contents are retained by a backup battery or the like.

【００５５】更に画面４４０には書体複写設定表示キー
４４６と書体変換設定表示キー４４５を有し、通常どち
らかのモードに選択設定される。書体複写モードでは、
先に説明した様に原稿中の文字の書体まで認識し、その
書体でフォント展開した記録データで像再生する。この
とき、書体判定は原稿の１ページ単位にしても良い。Further, the screen 440 has a typeface copy setting display key 446 and a typeface conversion setting display key 445, and one of the two modes is normally selected. In the typeface copy mode, as described above, even the typeface of the characters in the document is recognized, and the image is reproduced from recording data in which the fonts are developed in that typeface. In this case, the typeface may be determined page by page.

【００５６】一方書体変換モードでは、原稿の書体に関
わらず予め設定された書体でフォント展開した記録デー
タで像再生する。これが選択された場合には、先に説明
したステップＳ１５、Ｓ１７は不要になるので、スキッ
プするようにしてもよい。On the other hand, in the typeface conversion mode, an image is reproduced with recording data obtained by developing a font in a predetermined typeface regardless of the typeface of the original. If this is selected, steps S15 and S17 described above become unnecessary and may be skipped.

【００５７】尚、各設定を終了するには夫々戻るキー４
４４，４３２の押下で画面４２０に戻る。To finish each setting, pressing the respective return key 444 or 432 returns to the screen 420.

【００５８】又、Ｋの設定は通常画面４２０に配置し、
夫々の操作者が原稿毎に設定する事も可能である。その
場合、複写動作が終了後所定時間経過後、予めＲＡＭ１
１５に記憶させた標準的Ｋの値、或いは設定された字体
設定モードに復帰させる。The setting of K may also be placed on the normal screen 420 so that each operator can set it for each document. In that case, when a predetermined time has elapsed after the copying operation ends, the apparatus returns to the standard value of K stored in advance in the RAM 115, or to the font setting mode that was set.

【００５９】＜変形例の説明＞定数Ｋを操作者が設定す
る例を述べたが、原稿の大まかな状態が把握出来れば原
稿毎に自動で可変設定可能で有る。すなわち図１の原稿
給送手段となる原稿給送装置１で原稿を原稿台に搬送す
る過程で画像を読み取りその原稿の濃度ヒストグラムか
らは、絵の様な中間調の画像の存在が判定出来る。<Explanation of Modification> An example in which the operator sets the constant K has been described. However, if the rough state of the document can be grasped, K can be set automatically and variably for each document. That is, the image is read while the document is conveyed to the platen by the document feeder 1, which serves as the document feeding means in FIG. 1, and the presence of a halftone image such as a picture can be determined from the density histogram of the document.

【００６０】図７は代表的な濃度分布の例である。同図
（ａ）の濃度分布は殆ど白いレベルと中間濃度部に夫々
１つの山を有する例で、前者の山は原稿の下地部分を示
し前画素数のたとえば９割を占め、他方は文字部を示し
ている。すなわち濃度分布Ａは典型的な文字線画のみの
原稿を示していることになる。FIG. 7 shows examples of typical density distributions. The density distribution in FIG. 7(a) has one peak at an almost white level and one in the intermediate density region. The former peak represents the background of the document and accounts for, for example, 90% of all pixels; the latter represents the character portion. That is, density distribution A indicates a typical document containing only characters and line drawings.

【００６１】一方、同図（ｂ）は濃度分布Ｂは中間調を
含む画像の例であり、白から黒にかけて小さな山が多数
存在し、明らかに文字が示す濃度分布以外に淡い濃度領
域が連続的に存在する事が認識できる。通常このように
濃度分布から原稿中に中間調部分を含む原稿の識別が可
能である。On the other hand, density distribution B in FIG. 7(b) is an example of an image including halftones. Many small peaks exist from white to black, and it can be seen that light density regions exist continuously, clearly apart from the density distribution produced by characters. A document containing halftone portions can usually be identified from the density distribution in this way.
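One way to turn the distinction between distributions A and B into a test is sketched below. The criterion used here (counting how many coarse density buckets hold a meaningful share of pixels) is an assumption chosen for illustration, as are the function name `looks_halftone` and all parameter values; the source does not specify how the histogram is evaluated.

```python
def looks_halftone(pixels, buckets=16, min_share=0.02, max_text_buckets=3):
    """Guess whether a page contains halftone content from its density histogram.

    `pixels` is a flat list of 8-bit density values (0 to 255). A text or
    line-art page concentrates its pixels in a few density buckets
    (background plus characters); a halftone page spreads them over many.
    """
    if not pixels:
        return False
    counts = [0] * buckets
    for p in pixels:
        counts[min(p * buckets // 256, buckets - 1)] += 1
    total = len(pixels)
    # Count buckets that hold at least `min_share` of all pixels.
    occupied = sum(1 for c in counts if c / total >= min_share)
    return occupied > max_text_buckets
```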

【００６２】さて、文字線画を含む、すなわち濃度分布
Ａに類似する原稿の中から文字が占める比率を認識する
には、画像を２値化し、その中で黒に２値化された画素
の位置を夫々原稿の紙面を直交する２方向に射影して、
黒画素の配置を評価する。To recognize the proportion occupied by characters in a document that contains characters and line drawings, that is, one resembling density distribution A, the image is binarized, the positions of the pixels binarized to black are projected onto two orthogonal directions of the document page, and the arrangement of the black pixels is evaluated.

【００６３】図８（ａ）は横書き原稿を各画素ライン横
方向に射影した例で夫々各行に相当する繰り返し山パタ
ーンが得られている。同図の射影で各山の中央が各行の
中央で、谷の部分が行間を表している。従って同図の射
影の谷間は行間であて、山の部分は文字部分であると判
断できる。換言すれば、図示の如く、互い違いに山と谷
がほぼ同じ間隔に存在すれば、その原稿の大部分は文字
で構成されていると判断出来る。FIG. 8(a) shows an example in which a horizontally written document is projected along the horizontal direction of each pixel line; a repeating pattern of peaks, one per text line, is obtained. In this projection, the center of each peak corresponds to the center of a text line, and the valleys represent the spaces between lines. The valleys of the projection can therefore be judged to be line spacing and the peaks to be character portions. In other words, if peaks and valleys alternate at roughly equal intervals as illustrated, it can be judged that most of the document consists of characters.

【００６４】同図（ｂ）の射影では、谷の部分にも黒ド
ットが存在しており、文字行以外の図形、或いは線、異
なる文字等が存在している可能性が高いと判断できる。In the projection of FIG. 8(b), black dots are also present in the valleys, so it can be judged that figures other than text lines, or lines, different characters, or the like are likely to be present.

【００６５】従って、本実施形態では、文字の占める率
として、図８に示す「谷の最大値Ｌ」で評価する。Therefore, in the present embodiment, the proportion occupied by characters is evaluated by the "maximum valley value L" shown in FIG. 8.

【００６６】すなわちＬの値が０に近い程文字比率が高
く逆にＬの値が大きい程図形が混在する比率が高いと判
断する。尚、射影は直交する２方向で評価すれば、縦
行、横行両者の存在が認識できる。That is, the closer the value of L is to 0, the higher the proportion of characters is judged to be; conversely, the larger the value of L, the higher the proportion of mixed-in figures. If the projection is evaluated in two orthogonal directions, the presence of both vertically and horizontally written lines can be recognized.
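The projection and the valley measure L can be sketched as follows for the horizontal direction. How a "valley" is delimited is not defined in the text; this sketch assumes a valley is any run of rows whose black-pixel count falls below half the tallest peak, and takes L as the largest of the valley floors. The function names are hypothetical.

```python
def horizontal_projection(binary):
    """Count black pixels (value 1) in each pixel row of a binarized page."""
    return [sum(row) for row in binary]


def max_valley_value(profile):
    """Return L: the largest valley floor in a projection profile.

    A valley is taken to be a run of consecutive rows below half of the
    tallest peak (an assumed definition). L near 0 suggests clean line
    spacing; a larger L suggests figures or ruled lines between text lines.
    """
    if not profile:
        return 0
    cut = max(profile) / 2
    floors, run = [], []
    for value in profile:
        if value < cut:
            run.append(value)
        elif run:
            floors.append(min(run))  # close the valley that just ended
            run = []
    if run:
        floors.append(min(run))
    return max(floors) if floors else 0
```

Running the same two functions on the transposed image gives the vertical projection for vertically written text.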

【００６７】以上により、原稿の濃度分布と射影から文
字の比率が予測出来、この両者から本発明による定数Ｋ
を自動で設定出来る。As described above, the proportion of characters can be estimated from the density distribution and the projection of the document, and the constant K according to the present invention can be set automatically from both.
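A possible way to combine the two cues into an automatic K is sketched below. The source states only that K can be set from the density distribution and the projection; the specific rule here (smallest K when halftones are detected, otherwise K falling linearly as L grows) and every parameter value are assumptions for illustration.

```python
def auto_threshold(valley_max, halftone, k_max=1000.0, k_min=100.0, valley_full=50.0):
    """Pick threshold K from the projection valley value L and a halftone flag.

    `valley_full` is the hypothetical L at which the page is treated as
    fully mixed content, so K bottoms out at `k_min`.
    """
    if halftone:
        return k_min  # pictures present: accept characters only when very close
    ratio = min(max(valley_max / valley_full, 0.0), 1.0)
    return k_max - (k_max - k_min) * ratio
```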

【００６８】尚、以上の処理は文字認識処理走査に先立
って行う事も可能であるが、特に射影処理は原稿をメッ
シュに分割する際に行う処理であり、文字認識処理走査
時に行う事も可能である。The above processing can be performed prior to the character recognition scan; the projection processing in particular, however, is performed when the document is divided into meshes, and can therefore also be performed during the character recognition scan.

【００６９】又、文字の比率を認識する手段は本実施例
で開示した以外に例えば、２値化画像中の連続する黒画
素のランレングスの分布等、他の手段を用いても本発明
による同様の効果が得られる事は言うまでもない。Needless to say, the means for recognizing the proportion of characters is not limited to the one disclosed in the present embodiment; the same effect of the present invention is obtained with other means, for example the distribution of run lengths of consecutive black pixels in the binarized image.
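The run-length alternative mentioned here can be sketched as below. Only the computation of the distribution is shown; how the distribution would be mapped to a character proportion is not described in the source and is left out. The function name is hypothetical.

```python
from collections import Counter
from itertools import groupby


def black_run_lengths(binary):
    """Histogram of horizontal run lengths of black pixels (value 1).

    Returns a Counter mapping run length to the number of runs of that
    length found across all rows of the binarized image.
    """
    hist = Counter()
    for row in binary:
        for value, run in groupby(row):
            if value == 1:
                hist[sum(1 for _ in run)] += 1
    return hist
```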

【００７０】また、実施形態では複写機に適用した例を
説明するが、出力対象あるいは認識対象の画像データは
スキャナから読み取られるものではなく、たとえばフロ
ッピー等の記録媒体に保存されているものであってもよ
いし、出力も印刷に限らず表示装置に表示する場合にも
適用できるので、これによって本願発明が限定されるも
のではない。Although the embodiment has been described as applied to a copying machine, the image data to be output or recognized need not be read by a scanner; it may, for example, be stored on a recording medium such as a floppy disk. Output is likewise not limited to printing, and the invention is applicable to display on a display device. The present invention is therefore not limited to the embodiment.

【００７１】さらにまた、認識対象の画像データの発生
源（スキャナや記憶媒体等）と、認識結果の出力装置
（プリンタや表示装置等）、及び、その間に介在する処
理装置でもってシステムを構築した場合にも本願発明を
適用できる。この場合、処理装置としては、たとえばパ
ーソナルコンピュータ等の汎用情報処理装置で構築でき
るであろう。Furthermore, the present invention is also applicable when a system is built from a source of the image data to be recognized (a scanner, a storage medium, or the like), an output device for the recognition result (a printer, a display device, or the like), and a processing device interposed between them. In that case, the processing device could be a general-purpose information processing device such as a personal computer.

【００７２】従って、本願は、複数の機器（例えばホス
トコンピュータ，インタフェイス機器，リーダ，プリン
タなど）から構成されるシステムに適用してもよいし、
本発明の目的は、前述した実施形態の機能を実現するソ
フトウェアのプログラムコードを記録した記憶媒体を、
システムあるいは装置に供給し、そのシステムあるいは
装置のコンピュータ（またはＣＰＵやＭＰＵ）が記憶媒
体に格納されたプログラムコードを読出し実行すること
によっても、達成されることは言うまでもない。Therefore, the present invention may be applied to a system composed of a plurality of devices (for example, a host computer, an interface device, a reader, and a printer). Needless to say, the object of the present invention is also achieved by supplying a storage medium, on which the program code of software that realizes the functions of the above-described embodiment is recorded, to a system or an apparatus, and having the computer (or CPU or MPU) of that system or apparatus read and execute the program code stored in the storage medium.

【００７３】この場合、記憶媒体から読出されたプログ
ラムコード自体が前述した実施形態の機能を実現するこ
とになり、そのプログラムコードを記憶した記憶媒体は
本発明を構成することになる。In this case, the program code itself read from the storage medium realizes the function of the above-described embodiment, and the storage medium storing the program code constitutes the present invention.

【００７４】プログラムコードを供給するための記憶媒
体としては、例えば、フロッピディスク，ハードディス
ク，光ディスク，光磁気ディスク，ＣＤ−ＲＯＭ，ＣＤ
−Ｒ，磁気テープ，不揮発性のメモリカード，ＲＯＭな
どを用いることができる。As a storage medium for supplying the program code, for example, a floppy disk, a hard disk, an optical disk, a magneto-optical disk, a CD-ROM, a CD-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

【００７５】また、コンピュータが読出したプログラム
コードを実行することにより、前述した実施形態の機能
が実現されるだけでなく、そのプログラムコードの指示
に基づき、コンピュータ上で稼働しているＯＳ（オペレ
ーティングシステム）などが実際の処理の一部または全
部を行い、その処理によって前述した実施形態の機能が
実現される場合も含まれることは言うまでもない。Needless to say, the invention covers not only the case where the functions of the above-described embodiment are realized by the computer executing the read program code, but also the case where an OS (operating system) or the like running on the computer performs part or all of the actual processing based on the instructions of the program code, and the functions of the embodiment are realized by that processing.

【００７６】さらに、記憶媒体から読出されたプログラ
ムコードが、コンピュータに挿入された機能拡張ボード
やコンピュータに接続された機能拡張ユニットに備わる
メモリに書込まれた後、そのプログラムコードの指示に
基づき、その機能拡張ボードや機能拡張ユニットに備わ
るＣＰＵなどが実際の処理の一部または全部を行い、そ
の処理によって前述した実施形態の機能が実現される場
合も含まれることは言うまでもない。Further, after the program code read from the storage medium is written in a memory provided in a function expansion board inserted into the computer or a function expansion unit connected to the computer, based on the instruction of the program code, It goes without saying that the CPU included in the function expansion board or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【００７７】以上説明したように本実施形態によれば、
誤認識による画像劣化を抑圧出来る。また、字体認識精
度を向上できる。更に又、原稿に応じて、適応的に良好
な再生画像が得られる。As described above, according to the present embodiment, image degradation due to erroneous recognition can be suppressed. In addition, the accuracy of font recognition can be improved. Furthermore, a good reproduced image is obtained adaptively according to the document.

【００７８】[0078]

【発明の効果】以上説明したように本発明によれば、原
稿画像中の文字としての可能性の高い画像部分について
文字コードを出力し、それ以外は原稿画像中の対応する
画像を出力することで、原稿画像全体に対する再現性を
高めると共に、文字については高品位の出力画像を得る
ことが可能になる。As described above, according to the present invention, a character code is output for an image portion likely to be a character in a document image, and a corresponding image in the document image is output otherwise. Thus, the reproducibility of the entire document image can be improved, and a high-quality output image of characters can be obtained.

【図面の簡単な説明】[Brief description of the drawings]

【図１】実施形態における装置のブロック構成図であ
る。FIG. 1 is a block diagram of an apparatus according to an embodiment.

【図２】実施形態が適用する画像複写装置の断面構造図
である。FIG. 2 is a sectional structural view of an image copying apparatus to which the embodiment is applied;

【図３】実施形態における操作部１１６の上面図であ
る。FIG. 3 is a top view of the operation unit 116 according to the embodiment.

【図４】実施形態における文字認識で参照する木構造の
辞書の概念図である。FIG. 4 is a conceptual diagram of a tree-structured dictionary referred to in character recognition in the embodiment.

【図５】実施形態における処理内容を示すフローチャー
トである。FIG. 5 is a flowchart showing processing contents in the embodiment.

【図６】操作表示部のタッチパネル付き表示器５０５２
の画面とその推移の一例を示す図である。FIG. 6 is a diagram showing an example of the screens of the touch-panel display 5052 of the operation display unit and their transitions.

【図７】原稿の種別毎のヒストグラムを示す図である。FIG. 7 is a diagram illustrating a histogram for each document type.

【図８】実施形態における横書き原稿を各画素ライン横
方向に射影したときのヒストグラムを示す図である。FIG. 8 is a diagram illustrating a histogram when a horizontally written document according to the embodiment is projected in a horizontal direction of each pixel line.

[Explanation of symbols]

１００２画像読取部１００３画像処理部１００４画像記録部４０００画像認識部 1002 Image reading unit 1003 Image processing unit 1004 Image recording unit 4000 Image recognition unit

フロントページの続き (72)発明者片岡淳之介東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者小林誠東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者本田永和東京都大田区下丸子３丁目30番２号キヤノン株式会社内Ｆターム(参考） 5B029 AA01 BB02 CC22 CC29 EE11 Continuation of front page: (72) Inventor Junnosuke Kataoka, c/o Canon Inc., 3-30-2 Shimomaruko, Ota-ku, Tokyo; (72) Inventor Makoto Kobayashi, c/o Canon Inc., 3-30-2 Shimomaruko, Ota-ku, Tokyo; (72) Inventor Nagakazu Honda, c/o Canon Inc., 3-30-2 Shimomaruko, Ota-ku, Tokyo. F-term (reference): 5B029 AA01 BB02 CC22 CC29 EE11

Claims

[Claims]

1. An image processing apparatus for recognizing a character in a character candidate image portion in a document image, comprising: search means for searching for a character having a small difference between a feature amount of the character candidate image portion and a standard feature amount stored in a recognition dictionary; comparison means for comparing the difference finally obtained by the search means with a predetermined threshold; character code output means for, when the comparison by the comparison means shows that the difference is smaller than the threshold, treating the target character candidate image portion as a character and outputting the corresponding character code based on the recognition dictionary; and image output means for, when the comparison by the comparison means shows that the difference is greater than or equal to the threshold, treating the target character candidate image portion as a non-character and outputting the character candidate image portion as image data.

2. The image processing apparatus according to claim 1, further comprising: character pattern generating means for generating a character pattern based on the character code output by the character code output means; and image forming means for forming a visible image by combining the character pattern generated by the character pattern generating means with the image data output by the image output means.

3. The image processing apparatus according to claim 1, further comprising font determining means for determining a font based on a plurality of character candidate image portions for which the comparison means has determined the difference to be smaller than the threshold.

4. The image processing apparatus according to claim 2, further comprising font determining means for determining a font based on a plurality of character candidate image portions for which the comparison means has determined the difference to be smaller than the threshold, wherein a character pattern corresponding to the determined font is generated.

5. The image processing apparatus according to claim 1, further comprising means for setting the value of the threshold value by a manual operation.

6. The image processing apparatus according to claim 1, further comprising: detecting means for detecting the proportion of the document image occupied by characters; and adjusting means for adjusting the threshold according to the proportion detected by the detecting means.

7. The image processing apparatus according to claim 5, wherein the detecting means includes histogram creating means for creating a histogram of dots in the horizontal and/or vertical direction, and the adjusting means adjusts the threshold based on a minimum frequency value in the histogram created by the histogram creating means.

8. The image processing apparatus according to claim 5, wherein the detecting means includes means for detecting a density distribution in the document image, and the adjusting means adjusts the threshold according to the density distribution.

9. An image processing method for recognizing a character in a character candidate image portion in a document image, comprising: a search step of searching for a character having a small difference between a feature amount of the character candidate image portion and a standard feature amount stored in a recognition dictionary; a comparison step of comparing the difference finally obtained in the search step with a predetermined threshold; a character code output step of, when the comparison in the comparison step shows that the difference is smaller than the threshold, treating the target character candidate image portion as a character and outputting the corresponding character code based on the recognition dictionary; and an image output step of, when the comparison in the comparison step shows that the difference is greater than or equal to the threshold, treating the target character candidate image portion as a non-character and outputting the character candidate image portion as image data.

10. A storage medium storing program code that, when read and executed by a computer, causes the computer to function as an image processing apparatus for recognizing a character in a character candidate image portion in a document image, the program code functioning as: search means for searching for a character having a small difference between a feature amount of the character candidate image portion and a standard feature amount stored in a recognition dictionary; comparison means for comparing the difference finally obtained by the search means with a predetermined threshold; character code output means for, when the comparison by the comparison means shows that the difference is smaller than the threshold, treating the target character candidate image portion as a character and outputting the corresponding character code based on the recognition dictionary; and image output means for, when the comparison by the comparison means shows that the difference is greater than or equal to the threshold, treating the target character candidate image portion as a non-character and outputting the character candidate image portion as image data.