JP2000032247A

JP2000032247A - Image recognition device

Info

Publication number: JP2000032247A
Application number: JP10195995A
Authority: JP
Inventors: Hiroshi Sugiura; 博杉浦; Shoji Imaizumi; 祥二今泉; Kazuhiro Ueda; 和弘上田
Original assignee: Minolta Co Ltd
Current assignee: Minolta Co Ltd
Priority date: 1998-07-10
Filing date: 1998-07-10
Publication date: 2000-01-28
Anticipated expiration: 2018-07-10
Also published as: JP3629962B2

Abstract

PROBLEM TO BE SOLVED: To provide an image recognition device capable of speeding up a processing without lowering the reliability of a judged result in the top/bottom recognition processing of an original. SOLUTION: An area division part 230 divides image data outputted from an image signal processing part 120 and binarized into plural areas. Then, for the respective divided areas, a reliability judgement part 240 obtains the reliability in the case of using them for top/bottom recognition. A top/bottom recognition part 250 segments character data from the area whose reliability is a highest value and executes the top/bottom recognition processing.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明が属する技術分野】本発明は、複写機などの画像
形成装置において読み取った原稿の向きを認識する画像
認識装置に関する。以下、原稿の向きを認識することを
「天地の認識」とする。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image recognition apparatus for recognizing the direction of a document read by an image forming apparatus such as a copying machine. Hereinafter, recognizing the orientation of a document is referred to as “top and bottom recognition”.

【０００２】[0002]

【従来の技術】複写機、特にデジタル複写機では、多数
の原稿を連続して複写する場合、原稿の向きにかかわら
ず同じ方向を向いて複写できるようする技術の開発研究
が進められている（特開平６−１０３４１０）。原稿の
向きが一定でなければ複写結果の向きも一定しないとい
うのでは、複写前あるいは複写後に、利用者が原稿また
は複写結果の並べ替えをしなければならないという不都
合が生じるからである。2. Description of the Related Art In a copying machine, in particular, in a digital copying machine, when a large number of originals are successively copied, research and development of a technology for enabling copying in the same direction regardless of the orientation of the originals has been progressed. JP-A-6-103410). If the orientation of the copy result is not constant unless the orientation of the original is constant, there is an inconvenience that the user must rearrange the original or the copy result before or after copying.

【０００３】そして、このように複写結果の向きをそろ
えるためには、原稿の天地認識および画像回転の処理を
行うことが必要となる。天地認識処理方法では、原稿の
画像データから切り出した文字の方向を判定して、文字
の方向を原稿の方向とするものが多い。画像回転処理
は、天地認識処理で求めた原稿の方向が所定の方向と一
致していない場合に、画像データを必要な角度だけ回転
処理して、所定方向に一致させるものである。回転処理
後の画像データから複写画像を形成すれば、複写結果の
方向は一定になる。In order to align the direction of the copy result, it is necessary to perform the process of recognizing the top and bottom of the document and rotating the image. In many of the upside-down recognition processing methods, the direction of a character cut out from image data of a document is determined, and the direction of the character is used as the direction of the document. The image rotation process is a process of rotating the image data by a required angle when the direction of the document obtained by the top / bottom recognition process does not match the predetermined direction, so that the image data matches the predetermined direction. If a copy image is formed from the image data after the rotation processing, the direction of the copy result becomes constant.

【０００４】この天地認識処理については、処理の効率
化や判定結果の信頼度向上のために様々な方法が考案さ
れている。その中に信頼度向上のための方法として、特
開平９−６９１３６公報記載のものがある。ここに公開
されている方法は、天地認識処理の基本的な前提「文字
の方向＝原稿の方向」の例外となる文字の存在を考慮し
て、こうした例外的な文字をもとに天地認識が行われる
ことで発生する誤認識を減らそうとするものである。Various methods have been devised for the top-bottom recognition process in order to increase the efficiency of the process and improve the reliability of the judgment result. Among them, a method for improving reliability is described in Japanese Patent Application Laid-Open No. 9-69136. The method disclosed here takes into account the existence of characters that are exceptions to the basic premise of the orientation recognition process “text direction = original direction”, and An attempt is made to reduce erroneous recognition caused by the operation.

【０００５】図８は、原稿と向きが一致しない文字の例
を示している。同図（ａ）の文字列８０１は、グラフ、
図表などで説明のために付加されるキャプション文字で
ある。上向きの原稿８１０において、左向きとなってい
る。同図（ｂ）の文字列８０２は、表中文字である。表
８２１が横向き（左向き）に掲載されているため、原稿
８２０が上向きなのに、文字列８０２は左向きになって
いる。FIG. 8 shows an example of a character whose orientation does not match that of a document. A character string 801 in FIG.
This is a caption character added for explanation in a chart or the like. In the upward document 810, the document is oriented leftward. A character string 802 in FIG. 8B is a character in the table. Since the table 821 is placed horizontally (leftward), the character string 802 is leftward even though the document 820 is upward.

【０００６】これらキャプション文字や表中文字をもと
に天地認識が行われれば、結果として原稿の向きが誤認
識されることは容易に理解できる。そこで、特開平９−
６９１３６公報記載の方法では、下記の手順で、キャプ
ション文字や表中文字による天地認識をなるべく行わな
いようにしている。先ず、原稿の文字部分を複数の領域
に分割する。次いで各領域の属性を判定する。属性は、
本文に当たる「テキスト属性」、表題を示す「タイトル
属性」、表中の記載であることを示す「表中文字属
性」、図やグラフに付随する説明文字であることを示す
「キャプション属性」などがある。さらに、属性をもと
に領域ごとの優先順位を設定する。優先順位は「テキス
ト」や「タイトル」が高く、「表中文字」、「キャプシ
ョン」は低いのが普通である。そして、優先順位の高い
領域から複数の文字を切り出して、各文字に天地認識を
行う。そして、これら複数文字の天地認識結果が一致す
れば、その結果を採用し、不一致の場合は次に優先順位
の高い領域から文字を切り出して天地認識処理を行う。It is easy to understand that if the top-bottom recognition is performed based on these caption characters and characters in the table, the orientation of the document is erroneously recognized as a result. Therefore, Japanese Patent Application Laid-Open
According to the method described in Japanese Patent No. 69136, the top and bottom recognition using caption characters and characters in the table is performed as little as possible in the following procedure. First, the character portion of the document is divided into a plurality of areas. Next, the attribute of each area is determined. The attributes are
"Text attribute", which corresponds to the text, "Title attribute", which indicates the title, "Character attribute in the table", which indicates the description in the table, "Caption attribute", which indicates the description character accompanying figures and graphs, etc. is there. Further, a priority is set for each area based on the attribute. The priority is generally high for "text" and "title", and low for "characters in table" and "caption". Then, a plurality of characters are cut out from the high priority area, and top and bottom recognition is performed on each character. If the result of the recognition of the plurality of characters matches, the result is adopted. If the result of the recognition does not match, the character is cut out from the area having the next highest priority and the top and bottom recognition processing is performed.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら、上記従
来技術では、領域ごとに属性の判定を行い、属性の優先
順位を考慮しながら優先順位に従って領域ごとに天地認
識処理処理を行うので負荷を大きい。もちろん、こうし
た処理は天地認識結果の精度を向上させるためのもので
あり、無用なものではないが、実際のところ優先順位は
固定的で、属性が「タイトル」あるいは「テキスト」で
ある部分をもとに天地認識が行われることがほとんどで
ある。「表中文字」あるいは「キャプション」で天地認
識を行うのは、原稿にこれらの文字しか存在しない場合
であり、こうした原稿はどのような天地認識方法を採っ
ても天地認識結果の信頼性は低い。「テキスト」と「キ
ャプション」が混在する原稿で、あえて「キャプショ
ン」の優先順位を上げて天地認識を行う場合は考えにく
く、あったとしても極めて特殊な場合であろう。よっ
て、属性ごとに分割した領域に優先順位まで設定して行
う天地認識処理は、効果に対して負荷が過大となる場合
が多い。本発明は上記課題に鑑み、より小さな負荷でし
かも結果の信頼性を落とすことなく原稿の天地認識を実
行できる画像認識装置を提供することを目的とする。However, in the above-mentioned prior art, the attribute is determined for each area, and the top / bottom recognition processing is performed for each area in accordance with the priority while considering the priority of the attribute. Of course, such processing is intended to improve the accuracy of the top-bottom recognition result, and is not useless, but in reality, the priority is fixed, and even if the attribute is "title" or "text", In most cases, top and bottom recognition is performed. Performing top / bottom recognition using “characters in table” or “captions” is when only these characters are present in the manuscript, and the reliability of the top / bottom recognition result is low regardless of the top / bottom recognition method used for such a manuscript. . It is difficult to imagine a case where a document in which “text” and “caption” are mixed and the recognition of the top and bottom is performed by intentionally increasing the priority of “caption”. Therefore, in the upside-down recognition processing performed by setting up the priorities in the areas divided for each attribute, the load on the effect is often excessive. SUMMARY OF THE INVENTION The present invention has been made in view of the above circumstances, and has as its object to provide an image recognition apparatus capable of performing top and bottom recognition of a document with a smaller load and without reducing the reliability of the result.

【０００８】[0008]

【課題を解決するための手段】上記の課題を解決するた
めに、本発明の画像認識装置は、原稿を読み取って画像
データを生成する画像読取手段と、画像データを複数の
領域に分割する分割手段と、前記複数の領域のそれぞれ
について、原稿の天地認識処理に用いる場合の信頼度を
算出する信頼度算出手段と、信頼度が最も高い領域の画
像データから読み取り対象となった原稿の天地を判定す
る天地認識手段とを備えることを特徴とし、この構成に
よって天地認識結果の確度を落とすことなく天地認識処
理速度を向上させることを可能としている。In order to solve the above-mentioned problems, an image recognition apparatus according to the present invention comprises an image reading means for reading an original and generating image data, and a dividing means for dividing the image data into a plurality of areas. Means, for each of the plurality of areas, a reliability calculation means for calculating the reliability when used in the top and bottom recognition processing of the document, and the top and bottom of the document to be read from the image data of the area with the highest reliability It is characterized by including a top and bottom recognition means for determining, and this configuration makes it possible to improve the top and bottom recognition processing speed without lowering the accuracy of the top and bottom recognition result.

【０００９】そして、信頼度については、前記信頼度算
出手段が、前記分割領域ごとに画像データのヒストグラ
ムを作成し、走査方向における度数の最大値と最小値と
の差に基づいて当該領域の信頼度を算出する。信頼度に
ついては更に、前記信頼度算出手段が、前記分割領域ご
とに画像データのヒストグラムを作成し、度数が走査方
向において増加する変加点の数と減少する変加点の数と
を求め、これら２つの値から当該領域の信頼度を求める
ということもできる。As for the reliability, the reliability calculating means creates a histogram of the image data for each of the divided areas, and determines the reliability of the area based on the difference between the maximum value and the minimum value of the frequency in the scanning direction. Calculate the degree. Regarding the reliability, the reliability calculating means creates a histogram of the image data for each of the divided areas, and obtains the number of change points whose frequency increases in the scanning direction and the number of change points whose frequency decreases. It can be said that the reliability of the area is obtained from the two values.

【００１０】そして、前記複数の分割領域において最も
高い信頼度を有する領域が複数あった場合でも、前記天
地認識手段は、これら複数の領域の天地認識結果に加え
て信頼度が次に高い領域の天地認識結果を参照して原稿
の天地を判定するので、認識結果の確度は高い。[0010] Even when there are a plurality of areas having the highest reliability in the plurality of divided areas, the top-and-bottom recognizing means adds the top / bottom recognition result of the plurality of areas to the area having the next highest reliability. Since the top and bottom of the document is determined with reference to the top and bottom recognition results, the accuracy of the recognition results is high.

【００１１】[0011]

【発明の実施の形態】以下、本発明の実施の形態を、デ
ジタル複写機を例にとって、図面を参照しながら説明す
る。（１）デジタル複写機全体の構成まず、本実施の形態におけるデジタル複写機１（以下、
単に「複写機１」という。）の全体の構成を図１により
説明する。同図に示すように、この複写機１は、原稿自
動搬送装置１０と、画像読取部３０と、プリンタ部５０
と、給紙部７０とからなる。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention will be described below, taking a digital copying machine as an example, with reference to the drawings. (1) Overall Configuration of Digital Copying Machine First, a digital copying machine 1 (hereinafter, referred to as a digital copying machine) according to the present embodiment.
Simply referred to as “copier 1”. 1) will be described with reference to FIG. As shown in FIG. 1, the copying machine 1 includes an automatic document feeder 10, an image reading section 30, a printer section 50,
And a paper feeding unit 70.

【００１２】原稿自動搬送装置１０は、原稿を自動的に
画像読取部３０に搬送する装置であって、原稿給紙トレ
イ１１に載置された原稿は、給紙ローラ１２、捌きロー
ラ１３により１枚ずつ分離されて下方に送られ、搬送ベ
ルト１４によって、プラテンガラス３１上の原稿読取位
置まで搬送される。原稿読取位置に搬送された原稿は、
画像読取部３０のスキャナ３２によりスキャンされた
後、再び、搬送ベルト１４により図の右方向に送られ、
排紙ローラ１５を経て原稿排紙トレイ１６上に排出され
る。The automatic document feeder 10 is a device for automatically feeding a document to the image reading section 30. The document placed on the document feed tray 11 is fed by a feed roller 12 and a separating roller 13. Each sheet is separated and sent downward, and is transported by the transport belt 14 to a document reading position on the platen glass 31. The document transported to the document reading position
After being scanned by the scanner 32 of the image reading section 30, the sheet is again sent rightward in the drawing by the transport belt 14,
The document is discharged onto a document discharge tray 16 via a discharge roller 15.

【００１３】画像読取部３０は、上記プラテンガラス３
１の原稿読取位置に搬送された原稿の画像を光学的に読
み取るものであって、スキャナ３２、ＣＣＤイメージセ
ンサ（以下、「ＣＣＤセンサ」という）３８などから構
成される。スキャナ３２には、露光ランプ３３とこの露
光ランプ３３の照射による原稿からの反射光をプラテン
ガラス３１に平行な方向に光路変更するミラー３４が設
置され、図の矢印方向に移動することによりプラテンガ
ラス３１上の原稿をスキャンする。原稿からの反射光は
ミラー３４に反射された後、さらにミラー３５、３６お
よび集光レンズ３７を介してＣＣＤイメージセンサ３８
まで導かれ、ここで電気信号に変換されて画像データが
生成される。The image reading section 30 includes the platen glass 3
The scanner optically reads an image of a document conveyed to one document reading position, and includes a scanner 32, a CCD image sensor (hereinafter, referred to as a “CCD sensor”) 38, and the like. The scanner 32 is provided with an exposure lamp 33 and a mirror 34 for changing the optical path of light reflected from the original by irradiation of the exposure lamp 33 in a direction parallel to the platen glass 31. The original on 31 is scanned. The reflected light from the original is reflected by a mirror 34 and then further transmitted through mirrors 35 and 36 and a condenser lens 37 to a CCD image sensor 38.
, Where it is converted to an electrical signal to generate image data.

【００１４】当該画像データは、制御部１００において
Ａ／Ｄ変換されてデジタル信号となり、さらにシェーデ
ィング補正や濃度変換処理等を加えられた後、公知の誤
差拡散処理を加えられた後、いったんメモリに格納され
る。そして、天地認識の結果に応じて回転処理され、プ
リンタ部５０のレーザダイオード５１の駆動信号とな
る。The image data is A / D-converted into a digital signal by the control unit 100, subjected to shading correction, density conversion, and the like, subjected to a known error diffusion process, and then temporarily stored in a memory. Is stored. Then, rotation processing is performed according to the result of the top and bottom recognition, and the rotation signal is used as a drive signal for the laser diode 51 of the printer unit 50.

【００１５】プリンタ部５０は、公知の電子写真方式に
より記録シート上に画像を形成するものであって、上記
駆動信号を受信するとレーザダイオード５１を駆動して
レーザ光を出射させる。レーザ光は、所定の角速度で回
転するポリゴンミラー５２側面のミラー面で反射され、
ｆθレンズ５３、ミラー５４、５５を介して、感光体ド
ラム５６の表面を露光走査する。この感光体ドラム５６
は、上記露光を受ける前にクリーニング部５７で感光体
表面の残留トナーを除去され、さらにイレーサランプ
（図示せず）の照射を受けて除電された後、帯電チャー
ジャ５８により一様に帯電されており、このように一様
に帯電した状態で上記露光を受けると、感光体ドラム５
６表面に静電潜像が形成される。現像器５９は、感光体
ドラム５６表面に形成された上記静電潜像を現像する。The printer section 50 forms an image on a recording sheet by a known electrophotographic method, and upon receiving the drive signal, drives the laser diode 51 to emit laser light. The laser light is reflected by a mirror surface on the side of the polygon mirror 52 rotating at a predetermined angular velocity,
The surface of the photosensitive drum 56 is exposed and scanned through the fθ lens 53 and the mirrors 54 and 55. This photosensitive drum 56
Before the exposure, the cleaning unit 57 removes the residual toner on the surface of the photoreceptor, and after being irradiated with an eraser lamp (not shown), is discharged, and then uniformly charged by the charging charger 58. When the above-described exposure is performed in a state of being uniformly charged, the photosensitive drum 5
An electrostatic latent image is formed on the six surfaces. The developing device 59 develops the electrostatic latent image formed on the surface of the photosensitive drum 56.

【００１６】一方、給紙部７０には、２つの用紙カセッ
ト７１、７２が設けられており、上述の感光体ドラム５
６における露光および現像の動作と同期して、必要なサ
イズの記録シートが、用紙カセット７１、７２のいずれ
かから、給紙ローラ７１１もしくは７２１の駆動により
給紙される。給紙された記録シートは、感光体ドラム５
６の下方で当該感光体ドラム５６の表面に接触し、この
時、転写チャージャ６０の静電力により、感光体ドラム
５６表面に形成されていたトナー像が当該記録シート表
面に転写される。On the other hand, the paper supply section 70 is provided with two paper cassettes 71 and 72,
In synchronism with the exposure and development operations in 6, a recording sheet of a required size is fed from one of the paper cassettes 71 and 72 by driving the paper feed roller 711 or 721. The fed recording sheet is a photosensitive drum 5
6, the surface of the photosensitive drum 56 contacts the surface of the photosensitive drum 56. At this time, the toner image formed on the surface of the photosensitive drum 56 is transferred to the surface of the recording sheet by the electrostatic force of the transfer charger 60.

【００１７】その後、記録シートは、分離チャージャ６
１の静電力によって感光体ドラム５６の表面から分離さ
れ、搬送ベルト６２により定着部６３に搬送される。記
録シートに転写されたトナー像は、定着部６３において
内部にヒータを備えた定着ローラ６４で加熱されながら
押圧されることにより定着される。定着後の記録シート
は、排出ローラ６５により排紙トレイ６６上に排出され
る。Thereafter, the recording sheet is separated from the separation charger 6.
The photosensitive drum 56 is separated from the surface of the photosensitive drum 56 by the electrostatic force of 1, and is transported to the fixing unit 63 by the transport belt 62. The toner image transferred to the recording sheet is fixed by being pressed while being heated by a fixing roller 64 having a heater inside in a fixing unit 63. The recording sheet after fixing is discharged onto a discharge tray 66 by a discharge roller 65.

【００１８】画像読取部３０の前面の操作しやすい位置
には、操作パネル９０が設けられており、コピー枚数を
入力するテンキーやコピー開始を指示するスタートキ
ー、各種のコピーモードを設定するための設定キー、上
記設定キーなどにより設定されたモードをメッセージで
表示する表示部などが設けられている。An operation panel 90 is provided at an easy-to-operate position on the front of the image reading section 30. The operation panel 90 has ten keys for inputting the number of copies, a start key for instructing the start of copying, and various copy modes. There are provided a setting key, a display unit for displaying a mode set by the setting key or the like by a message, and the like.

【００１９】（２）制御部１００の構成次に、複写機１の内部に設置されている制御部１００の
構成を図面に従って説明する。図２は、制御部１００の
構成を示すブロック図である。制御部１００は、画像読
取制御部１１０、画像信号処理部１２０、メモリ制御部
１３０、プリンタ制御部１４０、メイン制御部１５０、
原稿認識部２００などから成る。上記各構成部は、それ
ぞれＣＰＵを中心として構成されており、コマンドライ
ン（図中、点線で表示）を介して情報やコマンドを、画
像データバス（図中、実線で表示）を介して画像データ
を、相互にやり取りする。(2) Configuration of Control Unit 100 Next, the configuration of the control unit 100 installed inside the copying machine 1 will be described with reference to the drawings. FIG. 2 is a block diagram illustrating a configuration of the control unit 100. The control unit 100 includes an image reading control unit 110, an image signal processing unit 120, a memory control unit 130, a printer control unit 140, a main control unit 150,
The document recognition unit 200 is included. Each of the components described above is configured around a CPU, and transmits information and commands via a command line (indicated by a dotted line in the figure) to image data via an image data bus (indicated by a solid line in the figure). Interact with each other.

【００２０】画像読取制御部１１０は、原稿自動搬送装
置１０および画像読取部３０の動作を制御するものであ
る。すなわち、メイン制御部１５０からの実行指示を受
けて起動し、先ず原稿自動搬送装置１０に対し原稿の順
次搬送を行わせる。そして、搬送された原稿の読取りを
画像読取部３０に指示して、読み取った画像データを画
像信号処理部１２０に出力させる。The image reading control section 110 controls the operations of the automatic document feeder 10 and the image reading section 30. That is, it is started in response to an execution instruction from the main control unit 150, and first causes the automatic document feeder 10 to sequentially feed a document. Then, it instructs the image reading unit 30 to read the conveyed document, and causes the image signal processing unit 120 to output the read image data.

【００２１】画像信号処理部１２０は、ＣＣＤセンサ３
８から出力されてくる画像データについて、Ａ／Ｄコン
バータでデジタルの多値信号に変換し、シェーディング
補正部で露光ランプ３３の照度ムラやＣＣＤセンサ３８
の感度ムラを補正する。その後、ＭＴＦ補正部でエッジ
強調などの画質改善を施すなどの処理をした上で、原稿
認識部２００およびメモリ制御部１３０に出力する。The image signal processing unit 120 includes the CCD sensor 3
8 is converted into a digital multi-value signal by an A / D converter, and the shading correction unit causes uneven illuminance of the exposure lamp 33 and the CCD sensor 38.
To correct the sensitivity unevenness. After that, the MTF correction unit performs processing such as improving image quality such as edge enhancement, and outputs the processed image to the document recognition unit 200 and the memory control unit 130.

【００２２】原稿認識部２００は、上記画像データに基
づいて原稿の天地認識を行い、天地認識の結果、原稿の
向きの調整が必要となった場合には、メモリ制御部１３
０に指示して、画像データの回転処理を行わせる。原稿
認識部２００については、構成や処理内容の詳細を後述
する。The document recognizing unit 200 performs the top and bottom recognition of the document based on the image data, and if the orientation of the document needs to be adjusted as a result of the top and bottom recognition, the memory control unit 13
Instruct 0 to rotate the image data. Details of the configuration and processing of the document recognition unit 200 will be described later.

【００２３】メモリ制御部１３０は、画像信号処理部１
２０から出力されてくる画像データを２値化、さらに必
要な場合は圧縮した上で画像メモリ１３１にいったん格
納する。そして、メイン制御部１５０から指示を受ける
と、画像メモリ１３１から画像データを読み出し、多値
化、さらに圧縮されている場合は伸長を行って画像メモ
リ１３１格納前の画像データに戻す。さらに、上記原稿
認識部２００から画像回転処理の指示を受けていた場合
は、指示に応じた角度だけ画像データを回転させ、作像
処理のためにプリント制御部１４０に出力する。なお、
画像の回転処理については公知の技術（例えば、特開昭
６０−１２６７６９など）を用いて実行する。The memory control unit 130 includes the image signal processing unit 1
The image data output from 20 is binarized, and if necessary, compressed and stored in the image memory 131 once. Then, upon receiving an instruction from the main control unit 150, the image data is read from the image memory 131, multi-valued, and if compressed, decompressed to return to the image data before storage in the image memory 131. Further, when an instruction for image rotation processing has been received from the document recognizing unit 200, the image data is rotated by an angle corresponding to the instruction and output to the print control unit 140 for image forming processing. In addition,
The image rotation processing is performed using a known technique (for example, Japanese Patent Application Laid-Open No. 60-126679).

【００２４】プリンタ制御部１４０は、上記メモリ制御
部１３０から出力されてきた画像データを各再現色ごと
に、レーザーダイオード駆動信号に変換して、それぞれ
をレーザーダイオード５１に出力して、露光走査を行わ
せる。メイン制御部１５０は、利用者の指定（複写枚
数、片面／両面指定、複写開始指示など）を図外の操作
パネルから受け付けると、指定内容を制御部１００の構
成各部に通知する。また、構成各部の処理タイミングを
統一的に制御して、円滑な複写動作を実現する。The printer control section 140 converts the image data output from the memory control section 130 into laser diode drive signals for each reproduced color, outputs each to the laser diode 51, and performs exposure scanning. Let it do. When the main control unit 150 receives a user's designation (copy number, single / double-side designation, copy start instruction, etc.) from an operation panel (not shown), the main control unit 150 notifies the components of the control unit 100 of the designated contents. In addition, the processing timing of each component is uniformly controlled to realize a smooth copying operation.

【００２５】（３）原稿認識部２００の構成次に、制御部１００のうち、天地認識処理を実行する原
稿認識部２００について、構成と処理内容とを説明す
る。図３は、原稿認識部２００の構成を示すブロック図
である。原稿認識部２００は、認識制御部２１０、２値
化部２２０、領域分割部２３０、信頼度判定部２４０、
天地認識部２５０、作業用メモリ２６０などで構成され
る。(3) Configuration of Document Recognition Unit 200 Next, the configuration and processing of the document recognition unit 200 that performs the top-bottom recognition process in the control unit 100 will be described. FIG. 3 is a block diagram illustrating the configuration of the document recognition unit 200. The document recognition unit 200 includes a recognition control unit 210, a binarization unit 220, an area division unit 230, a reliability determination unit 240,
It comprises a top and bottom recognition unit 250, a work memory 260, and the like.

【００２６】２値化部２２０は、画像信号処理部１２０
から出力されてくる多階調画像データを所定階調レベル
のスレッシュレベルと比較して、２値データに変換す
る。そして、２値化した画像データを作業用メモリ２６
０に格納し、処理終了を認識制御部２１０に通知する。The binarizing section 220 includes an image signal processing section 120
Is converted to binary data by comparing the multi-tone image data output from the FD with a threshold level of a predetermined tone level. Then, the binarized image data is stored in the working memory 26.
0, and notifies the recognition control unit 210 of the end of the process.

【００２７】領域分割部２３０は、認識制御部２１０か
らの指示を受け、作業用メモリ２６０内の２値化画像デ
ータを複数の領域に分割する。図４は、領域分割の一例
を示す模式図である。ここでは、原稿を主走査方向と副
走査方向とでそれぞれ２等分し、Ａ，Ｂ，Ｃ，Ｄの４つ
の領域に分割している。領域分割部２３０は、分割した
領域の画像データについて識別情報（作業用メモリ２６
０におけるアドレス）を、信頼性判定部２４０に通知す
る。そして、認識制御部２１０に処理終了を通知する。The area dividing section 230 receives an instruction from the recognition control section 210 and divides the binarized image data in the working memory 260 into a plurality of areas. FIG. 4 is a schematic diagram illustrating an example of area division. Here, the original is divided into two equal parts in the main scanning direction and the sub-scanning direction, and divided into four areas A, B, C, and D. The region dividing unit 230 identifies the image data of the divided region with the identification information (work memory 26).
(Address at 0) to the reliability determination unit 240. Then, it notifies the recognition control unit 210 of the end of the process.

【００２８】信頼性判定部２４０は、領域分割部２３０
から通知されたアドレスをもとに作業用メモリ２６０内
の各領域について、ヒストグラムを作成し、ヒストグラ
ムから画像データ中の文字列の向き（行方向）が主走査
方向と副走査方向のいずれかを判定する。そして、行方
向のヒストグラムをもとに、各領域の画像データを天地
認識に使用した場合の信頼度を判定する。ここで言う信
頼度とは、具体的には画像データのヒストグラムから算
出されるＭＴＦ値である。ＭＴＦ値は、行方向のヒスト
グラムにおいてヒストグラム値の最大値（ｍａｘ）、最
小値（ｍｉｎ）を取り、以下の（式１）にあてはめるこ
とで求められる。ＭＴＦ値＝（ｍａｘ−ｍｉｎ）／（ｍａｘ＋ｍｉｎ） …（式１）信頼度算出部２４０は、当該領域の行方向のヒストグラ
ムをいくつかの区分に分けてＭＴＦ値を求め、各区分の
ＭＴＦ値の平均値を当該領域の信頼度とする。The reliability judging section 240 includes the area dividing section 230
A histogram is created for each area in the working memory 260 based on the address notified from the user, and the direction (line direction) of the character string in the image data is determined from the histogram in either the main scanning direction or the sub-scanning direction. judge. Then, based on the histogram in the row direction, the reliability when the image data of each area is used for the top and bottom recognition is determined. The reliability here is specifically an MTF value calculated from a histogram of image data. The MTF value is obtained by taking the maximum value (max) and the minimum value (min) of the histogram values in the histogram in the row direction and applying them to the following (Equation 1). MTF value = (max−min) / (max + min) (Equation 1) The reliability calculation unit 240 divides the row-direction histogram of the region into several sections to obtain MTF values, and calculates the MTF value of each section. The average value is used as the reliability of the area.

【００２９】図５は、信頼度（ＭＴＦ値）が高くなる画
像データの例を示す。ヒストグラム５１０は、６つの区
分５１１〜５１６に分けられており、各区分において、
ヒストグラム値の最大値は様々だが、最小値は０となっ
ている。結果として、全区分でＭＴＦ値は、ｍａｘ−０／ｍａｘ＋０＝ｍａｘ／ｍａｘ＝１となる。“１”はＭＴＦ値の最大値である。このように
ＭＴＦが最大値となるのは、ヒストグラム５１０に谷
（度数＝０の部分）があるためであり、これはつまり、
ヒストグラム５１０の元となる画像データ５２０には、
一列に並んだ文字データが間隔を置いて複数配置されて
いることを意味している。FIG. 5 shows an example of image data in which the reliability (MTF value) increases. The histogram 510 is divided into six sections 511 to 516, and in each section,
The maximum value of the histogram value varies, but the minimum value is 0. As a result, the MTF value for all sections is: max−0 / max + 0 = max / max = 1. “1” is the maximum value of the MTF value. The MTF has the maximum value as described above because the histogram 510 has a valley (portion where frequency = 0).
The image data 520 that is the basis of the histogram 510 includes
This means that a plurality of character data arranged in a line are arranged at intervals.

【００３０】ＭＴＦ値は、原稿に傾きがある場合のほか
に、表の罫線や図形など文字以外の情報が含まれていて
谷ができない画像データの場合に低くなる。キャプショ
ン文字や表中文字など、天地認識に用いるのに不適当な
文字データは、グラフや表罫線など文字以外の情報を伴
なうことが多いので、ＭＴＦ値が低い領域（文字以外の
情報を含む領域）に含まれる文字については、天地認識
に用いるのは不適当であると考えることができる。逆に
ＭＴＦ値の大きい領域には、上述の通り文字データが傾
きなしに一列に並んでいると考えることができ、天地認
識における信頼性が高い。以上のことが、ＭＴＦ値を天
地認識結果の信頼度とする根拠である。The MTF value becomes low in the case of image data which includes information other than characters, such as table ruled lines and figures, and in which a valley cannot be formed, in addition to the case where the original is skewed. Character data that is not suitable for use in top and bottom recognition, such as caption characters and characters in tables, often includes information other than characters such as graphs and table ruled lines. It can be considered that the characters included in the (contained area) are inappropriate to be used for the top and bottom recognition. Conversely, in an area having a large MTF value, it can be considered that character data is arranged in a line without inclination as described above, and the reliability in top-bottom recognition is high. The above is the basis for setting the MTF value as the reliability of the top-bottom recognition result.

【００３１】信頼度判定部２４０は、このように、２値
画像データからヒストグラムを作成し、作成したヒスト
グラムのＭＴＦ値の平均を信頼度として求める。そし
て、信頼度の算出を終えると、当該領域の２値画像デー
タのアドレス、これに対応するヒストグラムのアドレ
ス、そして信頼度の数値を対にして認識制御部２１０に
出力する。図４の例では、４つの領域のうち、領域Ｃ，
Ｄはグラフを含むため、信頼度が低くなる。また、領域
Ｂはグラフなどの図形は含まないものの、空白部分が多
い。信頼度が最も高いのは、データすべてが文字列であ
る領域Ａとなる。The reliability determination section 240 creates a histogram from the binary image data in this way, and obtains the average of the MTF values of the created histogram as the reliability. When the calculation of the reliability is completed, the address of the binary image data of the area, the address of the corresponding histogram, and the numerical value of the reliability are output to the recognition control unit 210 as a pair. In the example of FIG. 4, among the four areas, the areas C,
Since D includes a graph, the reliability is low. The region B does not include a graphic such as a graph, but has many blank portions. The highest reliability is the area A in which all data is a character string.

【００３２】天地認識部２５０は、認識制御部２１０か
らアドレスが出力されてくる領域（信頼度判定部２４０
が最も信頼度が高いと判定した領域）について公知の方
法で天地認識を行う。天地認識の方法については様々な
ものが公開されている（特開平４−２２９７６３、特開
平７−６５１２０など）ので詳細な説明は省くが、基本
的な手順は以下の通りである。先ず、処理対象領域の画
像データからヒストグラムに応じて１文字分のデータを
切り出し、この切り出しデータに対応する文字データ
（比較用文字）を図外のメモリ内のパターン辞書から見
つけ出す。それから、比較用文字を９０度ずつ回転させ
ては、切り出しデータと比較する。そして、一致した時
点での角度（０，９０，１８０または２７０度）を切り
出し文字の向きを示す情報として認識制御部２１０に出
力する。The top / bottom recognition unit 250 includes an area from which the address is output from the recognition control unit 210 (the reliability determination unit 240).
Is determined by using a known method. A variety of methods for recognizing the top and bottom are disclosed (JP-A-4-229763, JP-A-7-65120, etc.), so a detailed description will be omitted, but the basic procedure is as follows. First, data of one character is cut out from the image data of the processing target area in accordance with the histogram, and character data (characters for comparison) corresponding to the cut data is found from a pattern dictionary in a memory (not shown). Then, the character for comparison is rotated by 90 degrees and compared with the cut-out data. Then, the angle (0, 90, 180 or 270 degrees) at the time of matching is output to the recognition control unit 210 as information indicating the direction of the cut-out character.

【００３３】次いで、認識制御部２１０について説明す
るが、認識制御部２１０は原稿認識部２００全体の処理
の制御も行うので、原稿認識部２００の動作説明を兼ね
ることにする。図６は、原稿認識部２００による原稿の
天地判断処理の流れを示すフローチャート図である。原
稿認識部２００による処理は、画像信号処理部１２０か
ら補正済み画像データが出力されてきたタイミングで開
始される。Next, the recognition control unit 210 will be described. Since the recognition control unit 210 also controls the entire processing of the document recognition unit 200, the operation of the document recognition unit 200 is also described. FIG. 6 is a flowchart illustrating the flow of the document top / bottom determination process performed by the document recognition unit 200. The processing by the document recognition unit 200 is started at the timing when the corrected image data is output from the image signal processing unit 120.

【００３４】先ず認識制御部２１０は、２値化部２２０
に指示して画像データを２値化させてから作業用一時記
憶に格納し（Ｓ６０１）、領域分割部２３０に画像デー
タ分割を指示する。領域分割部２３０は、この２値化さ
れた画像データを分割する（Ｓ６０２）。First, the recognition control unit 210 includes a binarization unit 220
, The image data is binarized and stored in the temporary work storage (S601), and the area dividing unit 230 is instructed to divide the image data. The area dividing unit 230 divides the binarized image data (S602).

【００３５】認識制御部２１０は、領域分割部２３０か
ら分割領域の数と各領域のアドレスとを受け取ると、こ
れら情報を信頼度判定部２４０に出力し、各領域の信頼
度を求めさせる。信頼度判定部２４０は、領域ごとに画
素ヒストグラムを生成し（Ｓ６０４）、ＭＴＦ値を算出
して認識制御部２１０に通知する（Ｓ６０５）処理を、
未処理領域がなくなるまで繰り返す（Ｓ６０３）。When the recognition control unit 210 receives the number of divided regions and the address of each region from the region dividing unit 230, it outputs these information to the reliability determination unit 240 to determine the reliability of each region. The reliability determination unit 240 generates a pixel histogram for each region (S604), calculates the MTF value, and notifies the recognition control unit 210 (S605).
This is repeated until there is no unprocessed area (S603).

【００３６】認識制御部２１０は、信頼度判定部２４０
から出力されてくる領域ごとの信頼度情報を保持し、全
ての領域についての信頼度情報がそろった時点で、信頼
度の値をもとに天地認識に用いる領域を選択する。認識
制御部２１０は、信頼度の値が最大となる領域を選択す
る（Ｓ６０６）。それから、認識制御部２１０は、この
信頼度の値を所定の閾値と比較する。そして、信頼度の
最大値が閾値を下回る場合（Ｓ６０７:No）、天地認識
部２５０への処理実行指示は出さず、メモリ制御部１３
０に対しては、画像データの回転補正は不要とする情報
（回転角度＝０度）を出力する（Ｓ６１５、Ｓ６１
８）。これは、最大値があくまで相対的なものであり、
例えば２０％程度の信頼度の領域でも、他の領域の信頼
度が１０％などと低い値であれば最大値となってしまう
からである。信頼度の最大値が低い場合は、どの領域を
用いて天地認識を行っても信頼できる結果は得られない
と考えられるので、天地認識処理は行わず、操作者が原
稿を置いた向きのままにして複写するのである。The recognition control section 210 includes a reliability determination section 240
The reliability information for each area output from the above is held, and when the reliability information for all the areas is available, an area to be used for top-bottom recognition is selected based on the value of the reliability. The recognition control unit 210 selects an area where the value of the reliability is maximum (S606). Then, the recognition control unit 210 compares the value of the reliability with a predetermined threshold. When the maximum value of the reliability is lower than the threshold value (S607: No), the memory control unit 13 does not issue a process execution instruction to the top and bottom recognition unit 250.
For 0, information that the image data does not require rotation correction (rotation angle = 0 degrees) is output (S615, S61).
8). This is only a relative maximum,
This is because, for example, even in an area having a reliability of about 20%, the maximum value is obtained if the reliability of another area is a low value such as 10%. If the maximum value of the reliability is low, it is considered that reliable results cannot be obtained even if any area is used for top-bottom recognition. And copy it.

【００３７】一方、信頼度の最大値が所定の閾値以上で
あった場合（Ｓ６０７:Yes）、認識制御部２１０は、当
該領域の２値画像データとヒストグラムとのアドレスを
天地認識部２５０に出力し、これらを用いて天地認識を
行うよう指示する（Ｓ６０８）。そして、この指示に対
して天地認識部２５０から当該領域の天地認識結果が出
力されてくると、この結果をもとに、この原稿のコピー
を所定の向きに向けさせるのに必要な回転角度を算出
し、これをメモリ制御部１３０に出力する（Ｓ６１
７）。On the other hand, when the maximum value of the reliability is equal to or larger than the predetermined threshold (S607: Yes), the recognition control section 210 outputs the address of the binary image data and the histogram of the area to the top and bottom recognition section 250. Then, an instruction is given to perform upside down recognition using these (S608). Then, in response to this instruction, the top / bottom recognition unit 250 outputs the top / bottom recognition result of the area, and based on the result, determines the rotation angle required to orient the copy of the original in a predetermined direction. Calculation and outputs this to the memory control unit 130 (S61).
7).

【００３８】なお、信頼度の値が最も高い領域が複数あ
った場合（Ｓ６０９:Yes）、認識制御部２１０は、それ
ら全ての領域に対して上記の天地認識処理を行わせ（Ｓ
６０８）、結果が複数の領域で一致すれば（Ｓ６１０:Y
es）、その結果を採用する。領域間で結果が不一致とな
れば（Ｓ６１０:No）、２番目に高い信頼度を有する別
領域に対して、更に天地認識処理を行わせる（Ｓ６１
１）。この際、認識制御部２１０は、この２番目に高い
信頼度についても閾値との比較を行い、閾値以上である
場合に限って（Ｓ６１２:Yes）天地認識を行わせる（Ｓ
６１３）。閾値を下回っていれば（Ｓ６１２:No）、天
地認識不能として、メモリ制御部１３０に対して画像デ
ータの回転補正は不要とする情報（回転角度＝０度）を
出力して処理を終える（Ｓ６１５、Ｓ６１８）。When there are a plurality of areas having the highest reliability values (S609: Yes), the recognition control section 210 causes the above-described top / bottom recognition processing to be performed on all the areas (S609).
608), if the result matches in a plurality of areas (S610: Y
es), adopt the result. If the result does not match between the regions (S610: No), the other region having the second highest reliability is further subjected to the top-bottom recognition process (S61).
1). At this time, the recognition control unit 210 also compares the second highest reliability with the threshold, and performs the top / bottom recognition only when the reliability is equal to or higher than the threshold (S612: Yes) (S612).
613). If the difference is smaller than the threshold value (S612: No), information indicating that rotation correction of image data is unnecessary (rotation angle = 0 degrees) is output to the memory control unit 130, and the process ends (S615). , S618).

【００３９】２番目に信頼度の高い領域に天地認識処理
を行った場合、認識制御部２１０は、この結果を先に行
った２種類の結果と比較し、いずれかと一致すれば（Ｓ
６１４:Yes）、一致した結果をもとに必要な回転角度を
算出し、これをメモリ制御部１３０に出力する（Ｓ６１
６、Ｓ６１８）。先の天地認識結果のいずれもが後から
行った天地認識結果と一致しなかった場合（Ｓ６１４:N
o）、認識制御部２１０は天地認識不能と判定して、メ
モリ制御部１３０に対して画像データの回転補正は不要
とする情報（回転角度＝０度）を出力する（Ｓ６１５、
Ｓ６１８）。When the top / bottom recognition processing is performed on the area having the second highest reliability, the recognition control unit 210 compares this result with the two kinds of results obtained earlier, and if it matches one of the results (S
614: Yes), calculate the required rotation angle based on the matching result, and output this to the memory control unit 130 (S61).
6, S618). If none of the results of the top and bottom recognition match the results of the top and bottom recognition performed later (S614: N
o), the recognition control unit 210 determines that the orientation cannot be recognized, and outputs information (rotation angle = 0 degrees) to the memory control unit 130 that the rotation correction of the image data is unnecessary (S615,
S618).

【００４０】以上のように、本実施の形態ににおいて、
原稿認識部２００は画像データを領域に分割して、最も
信頼度の高い領域から切り出すデータで原稿の天地認識
を行うが、その際、領域分割は単純な規則に従って行
い、領域ごとの属性（テキスト、キャプション、表中文
字など）を判定することもしない。また、信頼度の判定
は画像データのヒストグラムのＭＴＦ値によって定める
ので、天地認識処理の負荷は従来技術に比べ大きく低減
される。しかも、ＭＴＦ値を信頼度の基準とすること
で、キャプション文字や表中文字など誤認識の原因とな
るデータは排除できるので、認識結果の信頼度が従来技
術に比べて低下することもない。As described above, in this embodiment,
The document recognizing unit 200 divides the image data into regions and recognizes the top and bottom of the document using data cut out from the region with the highest reliability. At this time, the region is divided according to a simple rule, and the attribute (text , Captions, characters in the table, etc.). In addition, since the determination of the reliability is determined by the MTF value of the histogram of the image data, the load of the top and bottom recognition processing is greatly reduced as compared with the related art. In addition, by using the MTF value as a reference for reliability, data that causes erroneous recognition, such as caption characters and characters in a table, can be eliminated, so that the reliability of the recognition result does not decrease as compared with the related art.

【００４１】なお、本実施の形態においては、ＭＴＦ値
によって天地認識に使用する場合の信頼度が高い領域
（一列に文字データが並んでいる領域）を判断している
が、ヒストグラムにおけるエッジ数を用いて、信頼度が
高い領域を判断することもできる。In the present embodiment, an area having a high degree of reliability (an area in which character data is arranged in a line) is determined based on the MTF value when used for upside-down recognition. By using this, it is also possible to determine a region having high reliability.

【００４２】図７にエッジ数と画像データの関係を示
す。同図（ａ）は、横書きの左詰めで傾きのないテキス
トの画像データと、この画像データについて、走査方向
のうち行方向に一致しない方向のヒストグラム７１０を
示す。ヒストグラム７１０には、ヒストグラム値が増加
する方向の変化点（増加エッジ：同図中では白丸で示
す）の数はヒストグラム値が減少する方向の変化点（減
少エッジ：同図中では黒丸で示す）の数より少なくな
る。（図７では、増加エッジは２個、減少エッジは４
個。）文字列の開始位置は改行部分を除いて左側で一致
するのに対し、文字列の終端は不特定だがらである。傾
きのある文字列や図表を含む画像データでは、エッジの
数は多くなり、増加エッジと減少エッジの数に差は出に
くい。よって、増加エッジ数と減少エッジ数、また両者
の差に着目すれば、文字列を多く含んだ画像データを見
つけ出すことができる。増加エッジ数と減少エッジ数と
の和は少ない方が、両者の差は大きい方が、画像データ
７２０のような文字列の画像データである可能性が高い
と判断できる。FIG. 7 shows the relationship between the number of edges and image data. FIG. 11A shows image data of horizontally written left-justified text without inclination, and a histogram 710 of the image data in the scanning direction that does not coincide with the row direction. In the histogram 710, the number of change points in the direction in which the histogram value increases (increasing edge: indicated by white circles in the figure) is the number of change points in the direction in which the histogram value decreases (decreasing edge: indicated by black circles in the figure). Less than the number of. (In FIG. 7, there are two increasing edges and four decreasing edges.
Pieces. The start position of the character string matches on the left except for the line feed, while the end of the character string is unspecified. In image data including an inclined character string or a chart, the number of edges increases, and it is difficult to make a difference between the numbers of increasing edges and decreasing edges. Therefore, by paying attention to the number of increasing edges and the number of decreasing edges, and the difference between the two, it is possible to find image data containing many character strings. It can be determined that the smaller the sum of the number of increasing edges and the number of decreasing edges, and the greater the difference between the two, the higher the possibility of character string image data such as image data 720.

【００４３】また、上記実施の形態においては、本発明
に係る画像認識装置をモノクロの複写機に適用した例を
説明したが、その他の原稿認識が必要な装置、例えばカ
ラー複写機やファクシミリ装置における画像認識装置と
しても適用される。ただし、その場合、画像データ中の
有彩色データを予めキャンセルする回路を組み込んでい
ることが必要である。有彩色データキャンセル回路につ
いては公知の技術なので、詳細な説明は省略する。In the above-described embodiment, an example in which the image recognition apparatus according to the present invention is applied to a monochrome copying machine has been described. However, in other apparatuses requiring document recognition, for example, a color copying machine or a facsimile machine. It is also applied as an image recognition device. However, in this case, it is necessary to incorporate a circuit for canceling chromatic data in the image data in advance. Since the chromatic data cancel circuit is a known technique, a detailed description is omitted.

【００４４】[0044]

【発明の効果】以上の説明から明らかなように、本発明
の画像認識装置によれば、原稿を読み取って画像データ
を生成する画像読取手段と、画像データを複数の領域に
分割する分割手段と、前記複数の領域のそれぞれについ
て、原稿の天地認識処理に用いる場合の信頼度を算出す
る信頼度算出手段と、信頼度が最も高い領域の画像デー
タから読み取り対象となった原稿の天地を判定する天地
認識手段とによって天地認識処理を行うので、従来のよ
うに領域の属性を判定する必要もなく、天地認識処理を
迅速に実行することができる。また、信頼度は画像デー
タのヒストグラムに表れる値を基に算出され、一列に並
んだ文字データを多く含む領域ほど高くなるので、信頼
度を基準に選んだ領域を用いて行った認識結果の確度も
高い。As is apparent from the above description, according to the image recognition apparatus of the present invention, there are provided image reading means for reading an original to generate image data, and dividing means for dividing the image data into a plurality of areas. A reliability calculating unit that calculates a reliability of each of the plurality of regions when the plurality of regions are used in the top and bottom recognition process of the document, and determines a top and bottom of the document to be read from the image data of the region having the highest reliability. Since the top / bottom recognition processing is performed by the top / bottom recognition means, it is not necessary to determine the attribute of the area as in the related art, and the top / bottom recognition processing can be executed quickly. In addition, the reliability is calculated based on the values appearing in the histogram of the image data, and the higher the area containing a large amount of character data arranged in a row, the higher the reliability. Therefore, the accuracy of the recognition result performed using the area selected based on the reliability is Is also expensive.

[Brief description of the drawings]

【図１】本発明に係る画像認識装置が適用される複写機
の全体の構成を示す断面図である。FIG. 1 is a cross-sectional view showing the overall configuration of a copying machine to which an image recognition device according to the present invention is applied.

【図２】上記複写機における制御部の構成を示すブロッ
ク図である。FIG. 2 is a block diagram showing a configuration of a control unit in the copying machine.

【図３】上記制御部における原稿認識部の構成を示すブ
ロック図である。FIG. 3 is a block diagram illustrating a configuration of a document recognition unit in the control unit.

【図４】上記原稿認識部による画像データの領域分割の
一例を示す図である。FIG. 4 is a diagram showing an example of area division of image data by the document recognition unit.

【図５】信頼度の具体的な目安であるＭＴＦ値が高くな
る種類の画像データとそのヒストグラムとの一例を示す
図である。FIG. 5 is a diagram illustrating an example of image data of a type in which an MTF value, which is a specific measure of reliability, is increased, and a histogram thereof;

【図６】上記原稿認識部による天地認識処理の流れを示
すフローチャート図である。FIG. 6 is a flowchart illustrating a flow of a top / bottom recognition process performed by the document recognition unit.

【図７】信頼度の別の目安であるエッジカウントを説明
するための図である。FIG. 7 is a diagram for explaining edge counting, which is another measure of reliability.

【図８】従来の天地認識処理において誤認識の原因とな
る文字データの例を示す図である。FIG. 8 is a diagram illustrating an example of character data that causes erroneous recognition in conventional top-down recognition processing.

[Explanation of symbols]

１複写機１００制御部１２０画像信号処理部１３０メモリ制御部１３１画像メモリ１５０メイン制御部２００原稿認識部２１０認識制御部２３０領域分割部２４０信頼度判定部２５０天地認識部 REFERENCE SIGNS LIST 1 copier 100 control unit 120 image signal processing unit 130 memory control unit 131 image memory 150 main control unit 200 document recognition unit 210 recognition control unit 230 area division unit 240 reliability determination unit 250 top and bottom recognition unit

フロントページの続き (72)発明者上田和弘大阪府大阪市中央区安土町二丁目３番13号大阪国際ビルミノルタ株式会社内Ｆターム(参考） 5C072 AA05 BA03 BA10 UA07 UA11 UA13 UA20 XA01 5C076 AA01 AA24 BA05 CA10 Continued on the front page (72) Inventor Kazuhiro Ueda 2-3-13 Azuchicho, Chuo-ku, Osaka-shi, Osaka F-term in Osaka International Building Minolta Co., Ltd. 5C072 AA05 BA03 BA10 UA07 UA11 UA13 UA20 XA01 5C076 AA01 AA24 BA05 CA10

Claims

[Claims]

An image reading unit that reads an original to generate image data; a dividing unit that divides the image data into a plurality of regions; and a reliability when each of the plurality of regions is used in a document direction recognition process. An image recognition apparatus comprising: a reliability calculation unit that calculates a degree; and a top and bottom recognition unit that determines the orientation of a document to be read based on image data of an area having the highest reliability.

2. The reliability calculating means creates a histogram of image data for each of the plurality of regions, and calculates the reliability of the region based on a difference between a maximum value and a minimum value of a frequency in a scanning direction. 2. The method according to claim 1, wherein
An image recognition device according to claim 1.

3. The reliability calculation means creates a histogram of image data for each of the plurality of regions, and obtains the number of change points whose frequency increases in the scanning direction and the number of change points whose frequency decreases. The image recognition device according to claim 1, wherein the reliability of the area is obtained from two values.

4. When there are a plurality of areas having the highest reliability among the plurality of areas and the results of the top and bottom recognition of these areas do not match, the top and bottom recognition means adds the reliability to the top and bottom recognition results of these areas. 4. The image recognition apparatus according to claim 1, wherein the direction of the document is determined by referring to a top and bottom recognition result of an area having the next highest degree.