JP2003085479A

JP2003085479A - Image processor

Info

Publication number: JP2003085479A
Application number: JP2001276365A
Authority: JP
Inventors: Hisatsugu Tawara; 久嗣田原
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2001-09-12
Filing date: 2001-09-12
Publication date: 2003-03-20

Abstract

PROBLEM TO BE SOLVED: To provide an image processor capable of preventing erroneous discrimination by enhancing character discrimination accuracy without discriminating only characters the orientations of which are difficult to be discriminated even in the case that time is restricted for discrimination of the orientation. SOLUTION: The image processor to perform an image processing of image data read by a reading means to read an original image, is provided with a character area detecting means to detect character areas in the image data, a discriminating means to discriminate the orientations of the prescribed numbers of characters for characters in the respective character areas detected by the character area detecting means and a document orientation discriminating means to discriminate the document orientation of the image data according to discrimination results of the respective character areas by the discriminating means.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、デジタル複写機等
の原稿画像を読み取る読み取り手段によって読み取られ
た画像データを画像処理する画像処理装置に関するもの
である。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image processing apparatus for performing image processing on image data read by a reading unit for reading a document image such as a digital copying machine.

【０００２】[0002]

【従来の技術】従来、この種の画像処理装置において、
読み取られた原稿画像の文書方向を検出し、原稿の文書
の方向が異なっていても画像データを回転させ、文書方
向を揃える機能を有するものが考案されている。又、こ
のような機能を有する画像処理装置においては、１枚の
原稿の中の文字について、文字を検出した順番に文字方
向の判別を行い、その原稿の文書方向を判別している。2. Description of the Related Art Conventionally, in this type of image processing apparatus,
A device having a function of detecting the document direction of a read document image and rotating the image data even if the document direction of the document is different to align the document direction has been devised. Further, in the image processing apparatus having such a function, for the characters in one document, the character direction is determined in the order in which the characters are detected, and the document direction of the document is determined.

【０００３】[0003]

【発明が解決しようとす課題】しかしながら、従来の技
術では、文字を検出した順番に文字方向の判別を行い、
その原稿の文書方向を判別していた。従って、文書方向
の判別に時間の制限がある場合には、同じ原稿の中に判
別が容易な文字が存在していても、たまたま判別の困難
なフォント、サイズの文字ばかりを処理する場合が考え
られ、その処理を行っているうちに判別時間が終了する
と、原稿の文書方向の検出が不能になったり、誤判別す
ることがあった。However, in the conventional technique, the character direction is determined in the order in which the characters are detected,
The document direction of the manuscript was determined. Therefore, if there is a time limit for determining the orientation of a document, it may happen that even if there are easily identifiable characters in the same manuscript, only fonts and sizes that are difficult to identify happen to be processed. However, if the determination time ends while the processing is being performed, the document direction of the document may not be detected or the document may be erroneously determined.

【０００４】本発明は上記問題に鑑みてなされたもの
で、その目的とする処は、方向判別に時間制限がある場
合であっても、文字方向判別の困難な文字ばかりを判別
してしまうことなく、文字判別精度を高めて誤判別を防
ぐことができる画像処理装置を提供することにある。The present invention has been made in view of the above problems, and an object of the present invention is to discriminate only characters whose character direction is difficult to discriminate even when the direction discrimination has a time limit. It is another object of the present invention to provide an image processing device capable of improving character discrimination accuracy and preventing erroneous discrimination.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するた
め、本発明は、原稿画像を読み取る読み取り手段によっ
て読み取られた画像データを画像処理する画像処理装置
において、前記画像データ中の文字領域を検出する文字
領域検出手段と、該文字領域検出手段によって検出され
た各文字領域内の文字について所定数の文字の方向を判
別する判別手段と、該判別手段の各文字領域の判別結果
に応じて前記画像データの文書方向を判別する文書方向
判別手段を設けたことを特徴とする。In order to achieve the above object, the present invention is an image processing apparatus for image-processing image data read by a reading means for reading an original image, and detects a character area in the image data. The character area detecting means, the determining means for determining the direction of a predetermined number of characters in the characters in each character area detected by the character area detecting means, and the determining means according to the determination result of each character area by the determining means. It is characterized in that a document orientation discriminating means for discriminating the document orientation of the image data is provided.

【０００６】[0006]

【発明の実施の形態】以下に本発明の実施の形態を添付
図面に基づいて説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the accompanying drawings.

【０００７】図３は本発明の一実施例を示す画像処理装
置の構成を説明する断面図であり、同図において、１０
１は原稿台ガラスであり、原稿自動送り装置１４２から
給送された原稿が順次所定位置に載置される。１０２は
例えばハロゲンランプから構成される原稿照明ランプで
あり、原稿台ガラス１０１に載置された原稿を露光す
る。FIG. 3 is a sectional view for explaining the structure of an image processing apparatus showing an embodiment of the present invention. In FIG.
Reference numeral 1 denotes a platen glass on which the documents fed from the automatic document feeder 142 are sequentially placed at predetermined positions. Reference numeral 102 denotes a document illumination lamp composed of, for example, a halogen lamp, which exposes a document placed on the platen glass 101.

【０００８】１０３，１０４，１０５は走査ミラーであ
り、これらは不図示の光学走査ユニットに収容され、往
復動しながら原稿からの反射光をＣＣＤユニット１０６
に導く。ＣＣＤユニット１０６は、ＣＣＤに原稿からの
反射光を結像させる結像レンズ１０７、例えばＣＣＤか
ら構成される撮像素子１０８、該撮像素子１０８を駆動
するＣＣＤドライバ１０９等から構成されている。撮像
素子１０８からの画像信号出力は、例えば８ビットのデ
ジタルデータに変換された後、コントローラ部１３９に
人力される。Scanning mirrors 103, 104, 105 are housed in an optical scanning unit (not shown), and the CCD unit 106 receives reflected light from the original while reciprocating.
Lead to. The CCD unit 106 is composed of an image forming lens 107 for forming an image of reflected light from a document on the CCD, an image pickup element 108 including, for example, a CCD, a CCD driver 109 driving the image pickup element 108, and the like. The image signal output from the image sensor 108 is converted into, for example, 8-bit digital data, and then manually input to the controller unit 139.

【０００９】又、１１０は感光ドラムであり、１１２の
前露光ランプによって画像形成に備えて除電される。１
１３は１次帯電器であり、これは感光ドラム１１０を一
様に帯電させる。１１７は露光手段であり、これは例え
ば半導体レーザー等で構成され、画像形成や装置全体の
制御を行うコントローラ部１３９で処理された画像デー
タに基づいて感光ドラム１１０を露光して静電潜像を形
成する。Further, 110 is a photosensitive drum, which is precharged by a pre-exposure lamp 112 to prepare for image formation. 1
13 is a primary charger, which uniformly charges the photosensitive drum 110. Reference numeral 117 denotes an exposure unit, which is composed of, for example, a semiconductor laser, and exposes the photosensitive drum 110 on the basis of image data processed by a controller unit 139 that controls image formation and the entire apparatus to form an electrostatic latent image. Form.

【００１０】１１８は現像器であり、これには黒色の現
像剤（トナー）が収容されている。１１９は転写前帯電
器であり、これは感光ドラム１１０上に現像されたトナ
ー像を用紙に転写する前に高圧を掛ける。１２０，１２
２，１２４は給紙ユニットであり、各給紙ローラ１２
１，１２３，１２５の駆動により転写用紙が装置内へ給
送され、転写用紙はレジストローラ１２６の配設位置で
一旦停止し、感光ドラム１１０に形成された画像との書
き出しタイミングが取られて再給送される。Reference numeral 118 denotes a developing device, which contains a black developer (toner). A pre-transfer charger 119 applies a high voltage before transferring the toner image developed on the photosensitive drum 110 to a sheet. 120, 12
Reference numerals 2 and 124 denote paper feeding units, and each paper feeding roller 12
The transfer paper is fed into the apparatus by driving 1, 123, and 125, and the transfer paper is temporarily stopped at the position where the registration roller 126 is provided, and the transfer timing with the image formed on the photosensitive drum 110 is taken and re-transferred. Be delivered.

【００１１】１２７は転写帯電器であり、これは感光ド
ラム１１０に現像されたトナー像を給送される転写用紙
に転写する。１２８は分離帯電器であり、これは転写動
作の終了した転写用紙を感光ドラム１１０より分離す
る。尚、転写されないで感光ドラム１１０上に残ったト
ナーは、クリーナー１１１によって回収される。Reference numeral 127 denotes a transfer charger, which transfers the toner image developed on the photosensitive drum 110 onto a transfer sheet to be fed. Reference numeral 128 denotes a separation charger, which separates the transfer sheet after the transfer operation from the photosensitive drum 110. The toner remaining on the photosensitive drum 110 without being transferred is collected by the cleaner 111.

【００１２】１２９は搬送ベルトであり、これは転写プ
ロセスの終了した転写用紙を定着器１３０に搬送し、例
えば転写用紙は熱によりトナー像の定着を受ける。１３
１はフラッパであり、これは定着プロセスの終了した転
写用紙の搬送パスをステイプルソーター１３２又は中間
トレイ１３７の配置方向の何れかに制御する。ステイプ
ルソーター１３２に排紙された用紙は、各ビンに仕分け
され、コントローラ部１３９からの指示によりステイプ
ル部１４１がステイプルを行う。A conveyor belt 129 conveys the transfer sheet after the transfer process to the fixing device 130. For example, the transfer sheet receives a toner image by heat. Thirteen
A flapper 1 controls the conveyance path of the transfer sheet for which the fixing process has been completed to either the staple sorter 132 or the arrangement direction of the intermediate tray 137. The sheets discharged to the staple sorter 132 are sorted into bins, and the stapling unit 141 performs stapling according to an instruction from the controller unit 139.

【００１３】又、１３３〜１３６は給送ローラであり、
これらは一度定着プロセスの終了した転写用紙を中間ト
レイ１３７に反転（多重）又は非反転（両面）して給送
する。１３８は再給送ローラであり、これは中間トレイ
１３７に載置された転写用紙を再度レジストローラ１２
６の配設位置まで搬送する。コントローラ部１３９には
後述するマイクロコンピュータ、画像処理部等を備えて
おり、操作パネル１４０からの指示に従って前述の画像
形成動作を行う。Further, 133 to 136 are feeding rollers,
In these, the transfer paper after the fixing process is once inverted (multiplexed) or non-inverted (double-sided) and fed to the intermediate tray 137. Reference numeral 138 denotes a re-feeding roller that re-transfers the transfer sheet placed on the intermediate tray 137 to the registration roller 12 again.
It is conveyed to the position of 6. The controller section 139 is provided with a microcomputer, an image processing section, and the like, which will be described later, and performs the above-mentioned image forming operation according to an instruction from the operation panel 140.

【００１４】図２は本発明に係る画像処理装置における
コントローラ部１３９の構成を示すブロック図である。FIG. 2 is a block diagram showing the configuration of the controller unit 139 in the image processing apparatus according to the present invention.

【００１５】図２において、２０１は画像処理装置全体
の制御を行うＣＰＵであり、これは装置本体の制御手順
（制御プログラム）を記憶した読み取り専用メモリ２０
３（ＲＯＭ）からプログラムを順次読み取って実行す
る。ＣＰＵ２０１のアドレスバス及びデータバスはバス
ドライバー回路２０２、アドレスデコーダ回路を経て各
負荷に接続されている。又、２０４は入力データの記憶
や作業用記憶領域等として用いる主記憶装置であるラン
ダムアクセスメモリ（ＲＡＭ）である。In FIG. 2, reference numeral 201 denotes a CPU for controlling the entire image processing apparatus, which is a read-only memory 20 storing a control procedure (control program) for the apparatus main body.
3 (ROM) to sequentially read and execute the program. The address bus and data bus of the CPU 201 are connected to each load via a bus driver circuit 202 and an address decoder circuit. Reference numeral 204 denotes a random access memory (RAM) which is a main storage device used as a storage area for input data, a work storage area, and the like.

【００１６】２０５はＩ／Ｏインターフェースであり、
操作者がキー入力を行い、装置の状態等を液晶、ＬＥＤ
を用いて表示する操作パネル１４０や給紙系、搬送系、
光学系の駆動を行うモーター類２０７、クラッチ類２０
８、ソレノイド類２０９、又、搬送される用紙を検知す
るための紙検知センサ類２１０等の装置の各負荷に接続
される。現像器１１８には該現像器１１８内のトナー量
を検知するトナー残検センサ２１１が配置されており、
その出力信号がＩ／Ｏポート２０５に入力される。205 is an I / O interface,
The operator performs key input and displays the status of the device such as liquid crystal, LED
The operation panel 140, the paper feed system, the transport system,
Motors 207 and clutches 20 for driving the optical system
8, solenoids 209, and paper detection sensors 210 for detecting the conveyed paper, etc. are connected to respective loads of the apparatus. In the developing device 118, a toner remaining detection sensor 211 for detecting the amount of toner in the developing device 118 is arranged,
The output signal is input to the I / O port 205.

【００１７】２１５は高圧ユニットであり、これはＣＰ
Ｕの指示に従って前記１次帯電器１１３、現像器１１
８、転写前帯電器１１９、転写帯電器１２７、分離帯電
器１２８へ高圧を出力する。215 is a high-voltage unit, which is a CP
According to the instruction of U, the primary charger 113, the developing device 11
8. High voltage is output to the pre-transfer charger 119, the transfer charger 127, and the separation charger 128.

【００１８】２０６は画像処理部であり、ＣＣＤユニッ
ト１０６から出力された画像信号が入力され、後述する
画像処理を行い、画像データに従ってレーザーユニット
１１７の制御信号を出力する。レーザーユニット１１７
から出力されるレーザー光は、感光ドラム１１０を照射
し、露光するとともに非画像領域において受光センサで
あるビーム検知センサ２１３によって発光状態が検知さ
れ、その出力信号がＩ／Ｏポート２０５に入力される。
又、Ｉ／Ｏポート２０５からは後述する画像処理部２０
６内のセレクタ３１２のセレクト信号が出力される。An image processing unit 206 receives the image signal output from the CCD unit 106, performs image processing described later, and outputs a control signal for the laser unit 117 according to the image data. Laser unit 117
The laser light output from the device irradiates and exposes the photosensitive drum 110, and the light emission state is detected by the beam detection sensor 213 which is a light receiving sensor in the non-image area, and the output signal is input to the I / O port 205. .
In addition, the I / O port 205 is connected to the image processing unit 20 described later.
The select signal of the selector 312 in 6 is output.

【００１９】図１は本発明に係る画像処理装置における
コントローラ部１３９内の画像処理部２０６のブロック
図である。FIG. 1 is a block diagram of the image processing unit 206 in the controller unit 139 in the image processing apparatus according to the present invention.

【００２０】ＣＣＤ１０８により電気信号に変換された
画像信号は、先ず、シェーディング回路３０１によって
画素間のばらつきの補正を行った後、変倍回路３０２に
おいて、縮小コピー時はデータの間引き処理を行い、拡
大コピー時はデータの補間を行う。The image signal converted into the electric signal by the CCD 108 is first corrected by the shading circuit 301 for the pixel-to-pixel variation, and then the scaling circuit 302 performs the data thinning process during the reduction copy to enlarge the image. Data is interpolated during copying.

【００２１】次に、エッジ強調回路３０３において、例
えば５×５のウィンドウで２次微分を行い、画像のエッ
ジを強調する。この画像データは輝度データであるた
め、プリンタに出力するための濃度データに変換するた
めγ変換回路３０４でテーブルサーチによりデータ変換
を行う。濃度データに変換された画像データは２値化処
理部３０５に入力される。ここでは、例えばＥＤ法によ
り多値データを２値データに変換する。２値に変換され
た画像データは合成回路３０７に入力される。合成回路
３０７では、入力された画像データと例えばＤＲＡＭに
より構成される画像用メモリ３１０内の画像データを選
択的に出力する、又はＯＲを取って出力する。この画像
用メモリ３１０に対するリードライト制御はメモリ制御
部３０９で行う。これらの画像データはレーザーの発光
強度の信号に変換するためＰＷＭ回路３０８へ入力さ
れ、画像の濃度に従ったパルス幅をレーザーユニットに
対して出力する。又、シェーディング回路３０１からの
画像出力は文書方向判別部３０６へ入力され、後述する
文書方向判別処理が行われる。Next, in the edge emphasizing circuit 303, for example, a second-order differentiation is performed in a 5 × 5 window to emphasize the edge of the image. Since this image data is luminance data, the γ conversion circuit 304 performs data conversion by table search in order to convert it to density data to be output to the printer. The image data converted into the density data is input to the binarization processing unit 305. Here, multivalued data is converted into binary data by the ED method, for example. The binary-converted image data is input to the combining circuit 307. The synthesizing circuit 307 selectively outputs the input image data and the image data in the image memory 310 configured by, for example, a DRAM, or outputs an ORed result. The read / write control for the image memory 310 is performed by the memory control unit 309. These image data are input to the PWM circuit 308 for conversion into a signal of laser emission intensity, and a pulse width according to the image density is output to the laser unit. Further, the image output from the shading circuit 301 is input to the document direction discriminating unit 306, and the document direction discriminating process described later is performed.

【００２２】次に、図４〜図９を用いて本発明における
文書方向判別動作について説明する。図４は文書方向判
別部３０６内のブロック図である。Next, the document direction discriminating operation according to the present invention will be described with reference to FIGS. FIG. 4 is a block diagram of the inside of the document orientation discriminating unit 306.

【００２３】シェーディング回路３０１から出力された
画像データは、ＣＰＵ／メモリ部４０１に入力され、画
像データを一時的に保存するとともに、各種制御を行
う。コントローラ部１３９内のＣＰＵ２０１とは例えば
不図示のデュアルポートＲＡＭによりバス接続されてお
り、データを送受信する。尚、シリアル通信でも良い。The image data output from the shading circuit 301 is input to the CPU / memory section 401 to temporarily store the image data and perform various controls. The CPU 201 in the controller unit 139 is bus-connected by, for example, a dual port RAM (not shown), and transmits / receives data. Note that serial communication may be used.

【００２４】文字認識／方向判別部４０２は、文書の方
向を一番正確に表しているのは文字であることに着目
し、文書中の数種類の文字領域を０°、９０°、１８０
°、２７０°の方向から文字認識を行い、それら各方向
における文字認識の精度（文字認識の自信度：文字の特
徴分布に対する距離）の中で一番精度の高い方向を文書
方向とする。The character recognizing / orienting discriminating unit 402 pays attention to the fact that it is the character that most accurately represents the orientation of the document, and it recognizes several kinds of character areas in the document at 0 °, 90 °, 180 °.
Character recognition is performed from the directions of 270 ° and 270 °, and the direction with the highest accuracy among the character recognition accuracies in each direction (confidence of character recognition: distance with respect to character feature distribution) is the document direction.

【００２５】領域分離部４０３は文字認識／方向判別部
４０２による文字認識・方向判別処理を行うための前処
理として、文書画像データより、文字部、図形部、自然
面部、表部等を矩形の領域に分離して、各領域の属性
（文字部等）を付加する処理を行うブロックである。The area separation unit 403 makes a character portion, a graphic portion, a natural surface portion, a front surface portion, etc. of the document image data into a rectangular shape as a preprocessing for performing the character recognition / direction determination processing by the character recognition / direction determination unit 402. This is a block for performing a process of separating an area and adding an attribute (character portion or the like) of each area.

【００２６】記憶装置４０４は、例えばハードディスク
や光磁気ディスク等により構成され、各種処理結果（画
像データ、領域分離結果、文字認識結果等）を保存する
ために利用される。Ｉ／Ｆ部４０５は、ＳＣＳＩやＲＳ
２３２Ｃ等により構成され、外部ヘデータを伝送するた
めに設けられている。コンピュータ４０６は、Ｉ／Ｆ部
４０５を介して情報を得たり、光磁気ディスク等の移動
可能の記憶装置よりデータを得て利用する。The storage device 404 is composed of, for example, a hard disk or a magneto-optical disk, and is used to store various processing results (image data, area separation result, character recognition result, etc.). The I / F unit 405 uses SCSI or RS.
232C or the like, and is provided for transmitting data to the outside. The computer 406 obtains information via the I / F unit 405 or obtains data from a movable storage device such as a magneto-optical disk for use.

【００２７】次に、本実施の形態における文書方向自動
判別・補正及び文字認識処理の概要を図５に示すフロー
チャートに従って説明する。Next, the outline of the document direction automatic discrimination / correction and character recognition processing in this embodiment will be described with reference to the flowchart shown in FIG.

【００２８】入力された画像データ（多値画像）は、先
ず領域分離部４０３により、文字部、図形部、自然画
部、表部等の属性別に矩形の領域に分離される（ステッ
プＳ１，Ｓ２）。ここでは、実際には、矩形で囲まれた
領域情報を作成する。The input image data (multi-valued image) is first separated by the area separating unit 403 into rectangular areas according to attributes such as a character portion, a graphic portion, a natural image portion, and a front portion (steps S1 and S2). ). Here, the area information enclosed by the rectangle is actually created.

【００２９】次に、各属性より文字領域の矩形情報を抽
出する（ステップＳ３）。ここで、文字領域とは、文章
部、タイトル部、表中の文字、図のキャプション部等で
ある。例えば、図６（ａ），（ｃ）に示す文書の場合
は、それぞれ図６（ｂ），（ｄ）に示したような文字領
域の矩形情報が抽出される。そして、これらの中の数ブ
ロックを用いて文書方向判別を行う（ステップＳ４）。
その結果、文書方向が正方向であれば、引き続き画像中
の文字ブロックに対して文字認識処理を行う（ステップ
Ｓ７）。Next, the rectangle information of the character area is extracted from each attribute (step S3). Here, the character area is a text part, a title part, characters in a table, a caption part of a figure, or the like. For example, in the case of the documents shown in FIGS. 6A and 6C, the rectangular information of the character area as shown in FIGS. 6B and 6D is extracted. Then, the document direction is discriminated using several blocks among these blocks (step S4).
As a result, if the document direction is the forward direction, the character recognition process is continuously performed on the character block in the image (step S7).

【００３０】一方、文書方向が不正方向であれば、画像
データを正しい方向に回転させる（ステップＳ５）。そ
して、回転画像に対して領域分離を行い、領域分離情報
の補正処理を行う（ステップＳ６）。これは、画像回転
に伴う領域分離情報の相違を補正するもので、１つの方
法としては、全回転画像データに対して再び領域分離処
理を行う方法。もう１つは、アドレス変換を領域分離結
果に掛ける方法がある。領域分離処理は、一般に画像が
正方向を想定しているため、初期の段階で行った領域分
離処理と回転画像データに対して行った領域分離処理は
結果が異なることが多い。このため、前者の方法が採ら
れるのが望ましい。On the other hand, if the document direction is incorrect, the image data is rotated in the correct direction (step S5). Then, the rotation image is subjected to region separation, and the region separation information is corrected (step S6). This is to correct the difference in area separation information due to image rotation, and one method is to perform area separation processing again on all rotated image data. The other method is to apply address translation to the area separation result. In the area separation processing, the image is generally assumed to be in the forward direction, and therefore the area separation processing performed in the initial stage and the area separation processing performed on the rotated image data often have different results. Therefore, it is desirable to adopt the former method.

【００３１】次に、ステップＳ７に進んで、回転画像デ
ータ中の文字領域ブロックは、文字認識処理系で文字認
識される。この結果、最終的に、回転なし／回転ありの
両方の場合とも、領域分離情報と文字認識情報が得られ
る（ステップＳ８）。この処理結果は、Ｉ／Ｆ部４０５
を介してコンピュータ４０６に伝送され、コンピュータ
４０６上のファイリングのアプリケーションソフト等で
利用される。又、コントローラ部１３９内のＣＰＵ２０
１へ各画像毎に送信される。Next, in step S7, the character area block in the rotated image data is recognized by the character recognition processing system. As a result, the area separation information and the character recognition information are finally obtained in both cases of no rotation / with rotation (step S8). This processing result is the I / F unit 405.
Is transmitted to the computer 406 via the computer and used by filing application software on the computer 406. In addition, the CPU 20 in the controller unit 139
1 is transmitted for each image.

【００３２】次に、文字認識処理を用いた文書方向判別
の手法について説明する。Next, a method of discriminating the document direction using the character recognition processing will be described.

【００３３】［領域分離処理］文書画像データの黒画素
を検出してゆき、輪郭線追跡又はラベリング方式によ
り、黒画素ブロックの矩形枠を作成する。次に、その矩
形の中の黒画素密度、隣接矩形ブロックの有無、矩形の
縦横比率等を判断基準にして、文字領域（タイトル、本
文、キャプション等）、図形領域、自然画領域、表領域
等を判別する。この処理結果により、文字領域の矩形領
域が判別される。[Region Separation Processing] The black pixels of the document image data are detected, and the rectangular frame of the black pixel block is created by the contour tracking or labeling method. Next, character areas (titles, text, captions, etc.), graphic areas, natural image areas, table areas, etc., based on the black pixel density in the rectangle, the presence or absence of adjacent rectangular blocks, the aspect ratio of the rectangle, etc. To determine. The rectangular area of the character area is discriminated from the processing result.

【００３４】［文字認識処理］文字認識処理の１つの方
法として、特徴ベクトル抽出、比較方式がある。例えば
図７（ａ）に示したように、「本」という文字を含む文
字領域が判別されたとする。第１段階として、この文字
領域について文字切り出し処理を行う（図７（ｂ）参
照）。これは、１つの文字の矩形を切り出す処理で、先
ず８ビットである画像データを白と黒の２値に変換し、
黒画素連続性の状態を検出していけば求められる。この
ときの２値変換の閾値は図３におけるビストグラム作成
部３１２で作成される画像の地肌データが用いられる。
第２段階として、１文字をｍ×ｎ（例えば６４×６４）
の画素ブロックに切り出す（図７（ｃ）参照）。そし
て、その中から３×３画素のウィンドウを用いて、黒画
素の分布方向を抽出する（方向ベクトル情報：図７
（ｄ）参照）。[Character Recognition Processing] One method of character recognition processing is the feature vector extraction and comparison method. For example, as shown in FIG. 7A, it is assumed that a character area including the character "book" is identified. As the first step, character cutting processing is performed on this character area (see FIG. 7B). This is the process of cutting out the rectangle of one character. First, the image data of 8 bits is converted into binary of white and black,
It is required if the state of black pixel continuity is detected. The background data of the image created by the bistogram creating unit 312 in FIG. 3 is used as the binary conversion threshold value at this time.
In the second stage, one character is m × n (eg 64 × 64)
The pixel block is cut out (see FIG. 7C). Then, the distribution direction of the black pixels is extracted from that using a window of 3 × 3 pixels (direction vector information: FIG. 7).
(See (d)).

【００３５】尚、図７（ｄ）は方向ベクトル情報の一部
を例示したものであり、上記３×３画素のウィンドウを
ずらしてゆき、方向ベクトル情報を数１０個得る。この
ベクトル情報が文字の特徴となる。この特徴ベクトルと
予め記憶されている文字認識辞書の内容と比較して、特
徴ベクトルに特徴が一番近い文字から順番に文字を抽出
する。この場合、特徴ベクトルに特徴が近い順番に第１
候補、第２候補…となる。この特徴ベクトルに対する特
徴の近さが、その文字に対する距離の近さ、即ち、文字
認識の自信度（精度）という数値になる。Incidentally, FIG. 7 (d) shows an example of a part of the direction vector information, and the window of 3 × 3 pixels is shifted to obtain several ten pieces of direction vector information. This vector information becomes a character feature. By comparing this feature vector with the contents of the character recognition dictionary stored in advance, characters are extracted in order from the character having the feature closest to the feature vector. In this case, the features are first in the order in which the features are closer to the feature vector.
Candidate, second candidate ... The closeness of the feature to the feature vector is the closeness of the distance to the character, that is, the confidence level (accuracy) of character recognition.

【００３６】［文字方向判別処理］このようにして文字
認識の自信度が求められるが、その自信度に基づいた文
字方向判別処理を図８に示した「本発明の名称」という
文例を用いて説明する。[Character Orientation Discrimination Processing] In this way, the degree of confidence in character recognition is obtained. The character orientation discrimination processing based on the degree of confidence is performed by using the sentence example "name of the present invention" shown in FIG. explain.

【００３７】図８（ａ）は正方向の文、図８（ｂ）は２
７０°回転した文である。ここで「本」に注目すると、
文字方向を判別する場合は、図８（ｃ）に示したよう
に、１つの文字「本」について０°、９０°、１８０
°、２７０°の４方向から文字認識を行ってみる。各回
転角度は、文字矩形の領域の読み出し方を変更すれば良
く、特に原稿を回転させる必要はない。FIG. 8 (a) is a forward sentence, and FIG. 8 (b) is 2
It is a sentence rotated by 70 °. Focusing on the “book” here,
When determining the character direction, as shown in FIG. 8C, 0 °, 90 °, 180 for one character “book”
Let's try character recognition from 4 directions of ° and 270 °. For each rotation angle, the reading method of the character rectangular area may be changed, and the original need not be rotated.

【００３８】各回転角度における文字認識結果は、図８
（ｃ）に示したように、互いに異なっている。尚、図８
（ｃ）には説明用の仮の文字認識結果及び自信度が示さ
れており、現実にこの通りになるとは限らない。The character recognition result at each rotation angle is shown in FIG.
As shown in (c), they are different from each other. Note that FIG.
In (c), the temporary character recognition result and the confidence level for the purpose of explanation are shown, and it is not always the case.

【００３９】図８（ｃ）において、正方向（０°）から
文字認識を行った場合は、「本」と正しく認識され、自
信度も０．９０と高い値となる。９０°回転した方向か
ら文字認識を行った場合は、「町」と誤認識され、自信
度も０．４０と低下する。このように誤認識が発生し、
自信度も低下するのは、回転した方向から見た場合の特
徴ベクトルに基づいて文字認識を行ったからである。同
様に１８０°、２７０°回転した方向から文字認識を行
った場合も、誤認識が発生し、自信度も低下する。尚、
文字認識の方向別の自信度は、複雑な文字であればある
程、その差が顕著に現れてくる。In FIG. 8C, when the character recognition is performed from the forward direction (0 °), the character is correctly recognized as "book", and the confidence is a high value of 0.90. When the character recognition is performed from the direction rotated by 90 °, it is erroneously recognized as “town” and the confidence level is also reduced to 0.40. False recognition occurs like this,
The reason for the decrease in confidence is that the character recognition is performed based on the feature vector when viewed from the rotated direction. Similarly, when character recognition is performed from a direction rotated by 180 ° or 270 °, erroneous recognition occurs and confidence is also reduced. still,
The difference in the degree of confidence in character recognition by direction becomes more remarkable as the character is more complicated.

【００４０】図８（ｃ）の結果は、正方向の場合に自信
度が一番高いため、文書は正方向に向いている可能性が
高いと判断される。文字方向判別の精度を向上させるた
め、同一ブロック内の複数の文字について、同様に４方
向から文字認識を行ってみる。更に、１つのブロックだ
けで文字方向を判別した場合、特殊な文字列について文
字方向を誤って判別する可能性があるため、複数のブロ
ックについて同様の文字認識を行ってみる。そして、各
ブロックについて、当該ブロック内の各認識対象文字の
４方向別の自信度の平均値を求め、更に、各ブロックで
の４方向別の自信度の平均値に対する平均値を求め、こ
の平均値が最も高い方向を文字方向（文書方向）として
認定する。The result shown in FIG. 8C has the highest degree of confidence in the case of the forward direction, and therefore it is judged that the document is highly likely to be in the normal direction. In order to improve the accuracy of character direction discrimination, character recognition is similarly performed from four directions for a plurality of characters in the same block. Furthermore, if the character direction is determined only by one block, the character direction may be erroneously determined for a special character string, so the same character recognition will be performed for a plurality of blocks. Then, for each block, the average value of the confidence levels of the respective recognition target characters in each block in each of the four directions is calculated, and the average value of the confidence values of each block in each of the four directions is calculated. The direction with the highest value is recognized as the character direction (document direction).

【００４１】このように、１文字だけの自信度で文字方
向を認定することなく、同一ブロック内の複数文字、更
には同一ブロック内の複数文字の自信度で文字方向を認
定することにより、文字（文書）方向を高精度に判別す
ることが可能となる。但し、１文字だけの自信度で文字
方向を判別したり、或は同一ブロック内の複数文字の自
信度で文字方向を判別しても、従来よりも高精度に文字
方向を判別できることは言うまでもない。As described above, the character direction is not recognized by the confidence level of only one character, but the character direction is recognized by the confidence levels of a plurality of characters in the same block, and further, a plurality of characters in the same block. It is possible to determine the (document) direction with high accuracy. However, it goes without saying that the character direction can be determined with higher accuracy than before even if the character direction is determined based on the confidence level of only one character, or the character direction is determined based on the confidence levels of multiple characters in the same block. .

【００４２】次に、文字方向（文書方向）の判別結果が
正方向以外の方向であるときは、文字方向が正方向にな
るように原画像を回転する。この回転は、図４に示すＣ
ＰＵ／メモリ４０１を用いて公知の技術により簡単に行
うことが可能であり、その説明は省略する。Next, when the result of discrimination of the character direction (document direction) is a direction other than the positive direction, the original image is rotated so that the character direction becomes the positive direction. This rotation is C shown in FIG.
This can be easily performed by a known technique using the PU / memory 401, and the description thereof will be omitted.

【００４３】以上のような処理により、図９（ａ）に示
した原画像データ、図９（ｂ）に示した領域分離デー
タ、図９（ｃ）に示した文字認識情報を得ることができ
る。これらの情報は前述のようにコントローラ部１３９
のＣＰＵ２０１へ送られ、各種画像処理、各種制御に使
用される。By the above processing, the original image data shown in FIG. 9A, the area separation data shown in FIG. 9B, and the character recognition information shown in FIG. 9C can be obtained. . As described above, this information is stored in the controller unit 139.
Is sent to the CPU 201 and used for various image processing and various controls.

【００４４】領域分離データの形式は、図９（ｂ）に示
したように、領域分離データである旨を示す「ｈｅａｄ
ｅｒ」と、分離した領域の識別子「ｒｅｃｔ１」〜「ｒ
ｅｃｔ４」により構成され、この識別子で区別された各
領域（ブロック）の情報は、ブロックの番号「ｏｒｄｅ
ｒ」、ブロックの属性（文字部、図形部等）「ａｒ
ｔ」、ブロックの左上の座標値「ｘ１」及び「ｙ１」、
ブロックの幅「ｗ」、ブロックの高さ「ｈ」、縦書き又
は横書きを示す「ｄｉｒｅｃｔｉｏｎ」、当該ブロック
のＩＤである「ｓｅｌｆＩＤ」、当該ブロックを包含す
る親ブロックのＩＤである「ｕｐｐｅｒＩＤ」、親ブロ
ックの属性「ｕｐｐｅｒＡｔｔ」、予備領域「ｒｅｓｅ
ｒｖｅ」により構成されている。The format of the area separation data is, as shown in FIG. 9B, "head" indicating that it is area separation data.
er "and the identifiers" rect1 "to" r "of the separated areas
The information of each area (block) that is configured by "ect4" and distinguished by this identifier is the block number "orde".
r ", block attribute (character part, graphic part, etc.)" ar
t ”, the coordinate values“ x1 ”and“ y1 ”at the upper left of the block,
The block width “w”, the block height “h”, the vertical or horizontal writing “direction”, the block ID “selfID”, the parent block ID including the block “upperID”, Parent block attribute "upperAtt", spare area "rese"
rve ”.

【００４５】又、文字認識情報は、図９（ｃ）に示した
ように、文字認識情報である旨を示す「ｈｅａｄｅｒ」
を有し、例えば「本」等の単一の文字に関する文字認識
情報「ＯＣＲ１」等と、当該文字が合まれているブロッ
クを示す上記「ｒｅｃｔ１」等に相当する「ｂｌｋｈ
ｅａｄｅｒ」との組み合わせ情報により構成されてい
る。As shown in FIG. 9C, the character recognition information is "header" indicating that it is the character recognition information.
Character recognition information “OCR1” or the like regarding a single character such as “book” and “blkh” corresponding to the above “rect1” or the like indicating a block in which the character is merged.
“Eader” and combination information.

【００４６】そして、「ＯＣＲ１」等の各文字認識情報
は、文字であるか或は空白であるかを示す「ｔｙｐ
ｅ」、前述の文字認識の自信度に従った第１〜第５候補
文字「文字１」〜「文字５」、当該文字の切り出し位置
「ｘ１」及び「ｙ１」、当該文字の幅「ｗ」、当該文字
の高さ「ｈ」、予備領域「ｒｅｓｅｒｖｅ」により構成
されている。文字認識ができない場合、例えば画像デー
タ全てに文字が含まれない等のときは予備領域「ｒｅｓ
ｅｒｖｅ」に「ｕｎｋｎｏｗｎ（検知不能）」を表すデ
ータを返す。Each character recognition information such as "OCR1" is "type" indicating whether it is a character or a blank.
e ”, the first to fifth candidate characters“ character 1 ”to“ character 5 ”according to the confidence level of the character recognition, the cutout positions“ x1 ”and“ y1 ”of the character, and the width“ w ”of the character. , The height of the character “h”, and the spare area “reserve”. If the characters cannot be recognized, for example, if the characters are not included in all the image data, the reserved area "res"
Data representing "unknown (undetectable)" is returned to "erve".

【００４７】次に、図１０及び図１１を用いて本発明の
画像処理装置の操作パネル１４０について説明する。Next, the operation panel 140 of the image processing apparatus of the present invention will be described with reference to FIGS.

【００４８】図１０は本発明における画像処理装置の操
作パネル１４０に表示される基本画面である。１００１
は拡張機能キーであり、このキーを押すことによって両
面複写、多重複写、移動、綴じ代の設定、枠消しの設定
等のモードに入る。１００２は画像モードキーであり、
複写画像に対して網掛け、影付け、トリミング、マスキ
ングを行うための設定モードに入る。１００３はユーザ
ーモードキーであり、モードメモリの登録、標準モード
画面の設定がユーザー毎に行うことができる。FIG. 10 is a basic screen displayed on the operation panel 140 of the image processing apparatus according to the present invention. 1001
Is an extended function key, and by pressing this key, a mode such as double-sided copying, multiple copying, moving, setting of binding margin, setting of frame erasing, etc. is entered. 1002 is an image mode key,
Enter the setting mode for shading, shadowing, trimming and masking the copied image. A user mode key 1003 can be used to register a mode memory and set a standard mode screen for each user.

【００４９】１００４は応用ズームキーであり、原稿の
Ｘ方向、Ｙ方向を独立に変倍するモード、原稿のサイズ
と複写サイズから変倍率を計算するズームプログラムの
モードに入る。１００５、１００６、１００７はＭ１キ
ー、Ｍ２キー、Ｍ３キーであり、それぞれのモードメモ
リを呼び出す際に押す。１００８はコールキーであり、
前回設定されていた複写モードを呼び出す際に押す。１
００９はオプションキーであり、フィルムから直接複写
するためのフィルムプロジェクター等のオプション機能
の設定を行う。An applied zoom key 1004 enters a mode for independently changing the magnification in the X and Y directions of a document, and a mode for a zoom program for calculating the magnification from the size of the document and the copy size. Reference numerals 1005, 1006 and 1007 denote M1 key, M2 key and M3 key, which are pressed when calling the respective mode memories. 1008 is a call key,
Press to call the previously set copy mode. 1
An option key 009 is used to set an optional function such as a film projector for directly copying from a film.

【００５０】１０１０はソーターキーであり、ソーター
のソート、グループ等のモード設定を行う。１０１１は
原稿混載キーであり、原稿フィーダーにＡ４サイズとＡ
３サイズ又はＢ５サイズとＢ４サイズの原稿を一緒にセ
ットする際に押す。１０１２は等倍キーであり、複写倍
率を１００％にする際に押す。１０１４，１０１５はそ
れぞれ縮小キー、拡大キーであり、定形の縮小、拡大を
行う際に押す。１０１６はズームキーであり、１％刻み
で非定形の縮小、拡大を行う際に押す。１０１３は用紙
選択キーであり、複写用紙の選択を行う際に押す。A sorter key 1010 is used to sort sorters and set modes such as groups. Reference numeral 1011 denotes a document mixed loading key, which has A4 size and A
Press to set 3 size or B5 size and B4 size documents together. Reference numeral 1012 denotes an equal magnification key, which is pressed when the copy magnification is 100%. Reference numerals 1014 and 1015 are a reduction key and an enlargement key, respectively, which are pressed when performing a fixed size reduction and enlargement. Reference numeral 1016 denotes a zoom key, which is pressed when performing non-standard size reduction / enlargement in 1% steps. A paper selection key 1013 is pressed when selecting a copy paper.

【００５１】１０１８，１０２０は濃度キーであり、濃
度キー１０１８を押す毎に濃く複写され、濃度キー１０
２０を押す毎に薄く複写される。１０１７は濃度表示で
あり、濃度キー１０１８，１０２０を押すと表示が左右
へ変化する。１０１９はＡＥキーであり、新聞のように
地肌の濃い原稿を自動濃度調整複写するときに押す。１
０２１はＨｉＦｉキーであり、写真原稿のように中間調
の濃度が多い原稿の複写の際に押す。Reference numerals 1018 and 1020 denote density keys. Each time the density key 1018 is pressed, a dark copy is made.
Each time you press 20, a light copy is made. Reference numeral 1017 denotes a density display, and when the density keys 1018 and 1020 are pressed, the display changes to the left and right. Reference numeral 1019 denotes an AE key, which is pressed when a document with a dark background such as a newspaper is copied for automatic density adjustment. 1
Reference numeral 021 denotes a HiFi key, which is pressed when a document such as a photographic document having a high halftone density is copied.

【００５２】１０２２は文字強調キーであり、文字原稿
の複写で文字を際立たせたい場合に押す。１０２３はガ
イドキーであり、キーの機能が分からないとき押すとそ
のキーの説明が表示される。１０２４はコピーモードキ
ーであり、複写を行うときに押す。１０２５はファクス
キーであり、ファクスを行うときに押す。１０２６はフ
ァイルキーであり、ファイルデータを出力したいときに
押す。１０２７はプリンターキーであり、コンピュータ
等の外部装置からの画像データをプリント出力したいと
きに押す。１０２８は原稿向き検知キーであり、原稿フ
ィーダーにセットされた原稿の文書方向を検知させると
きに押す。１０２９は原稿向き検知詳細設定キーであ
り、原稿向き検知に関する詳細な設定を行うときに押
す。Reference numeral 1022 denotes a character emphasizing key, which is pressed when a character is desired to be highlighted in copying a character original. Reference numeral 1023 denotes a guide key, which is displayed when the key is pressed when the function of the key is unknown. Reference numeral 1024 denotes a copy mode key, which is pressed when copying. Reference numeral 1025 denotes a fax key, which is pressed when performing a fax. Reference numeral 1026 denotes a file key, which is pressed to output file data. Reference numeral 1027 denotes a printer key, which is pressed to print out image data from an external device such as a computer. A document orientation detection key 1028 is pressed to detect the document direction of the document set in the document feeder. Reference numeral 1029 denotes a document orientation detection detailed setting key, which is pressed to make detailed settings relating to document orientation detection.

【００５３】図１１は図１０における原稿向き検知詳細
設定キー１０２９を押したときに操作パネル１４０に表
示される画面である。FIG. 11 is a screen displayed on the operation panel 140 when the document orientation detection detail setting key 1029 in FIG. 10 is pressed.

【００５４】図１０の基本画面の状態で原稿向き検知詳
細設定キー１０２９を押すと、図１１に示すように原稿
向き検知の詳細設定画面が表示される。この画面では原
稿内で検出された文字領域１つについて何文字を検出さ
せるかを設定する。例えば図示しないテンキーによって
１〜５０までの数を入力すると、文字数設定表示１０３
０に表示される。又、ここにはデフォルトの値として例
えば１０が予め入力されている。１０３１は戻るキー
で、ここでの設定を終了して図の画面に戻るときに押
す。When the document orientation detection detailed setting key 1029 is pressed in the state of the basic screen of FIG. 10, the document orientation detection detailed setting screen is displayed as shown in FIG. On this screen, the number of characters to be detected for one character area detected in the document is set. For example, if a number from 1 to 50 is input using a ten-key pad (not shown)
Displayed at 0. Also, for example, 10 is input in advance as a default value. Reference numeral 1031 denotes a return key, which is pressed to end the setting here and return to the screen in the figure.

【００５５】次に、図１２の説明図を用いて従来の装置
の文書方向決定の動作について説明する。Next, the operation for determining the document direction of the conventional apparatus will be described with reference to the explanatory view of FIG.

【００５６】例えば、図１２に示したような原稿の場
合、文字領域１には数字列で１２０文字、文字領域２に
は「ＣＯＮＦＩＤＥＮＴＩＡＬ」という斜めの文字列で
１２文字、文字領域３には「この明細書に記述された内
容」という文字列で１３文字、文字領域４には「・本発
明の解決する課題、・本発明の効果」という文字列で１
６文字の各文字領域が検出されたとする。For example, in the case of a manuscript as shown in FIG. 12, the character area 1 is 120 characters in the numerical string, the character area 2 is 12 characters in the diagonal character string "CONFIDENTIAL", and the character area 3 is " 13 characters in the character string "contents described in this specification", and 1 in the character string "・ Problems to be solved by the present invention, ・ Effect of the present invention"
It is assumed that each character area of 6 characters is detected.

【００５７】又、本実施の形態の文字判別に与えられた
時間が例えば１秒間であり、この時間内に判別可能な文
字数を３０文字とする。従来のものでは或る文字領域を
検出すると、その文字領域内の文字を順番に全て方向判
別しようとしていた。図１２の例では文字領域１を検出
した時点で文字領域１内の文字を所定時間内で順番に方
向判別する。文字領域１内の文字は１２０文字あり、全
ての文字が天地を逆にしても同じ文字になるような文字
が存在している。従って、文字領域１内の文字の方向判
別では０°と検出したり、１８０°と検出したりという
ことが考えられる。Further, the time given to the character discrimination of the present embodiment is, for example, one second, and the number of discriminable characters within this time is 30. In the conventional art, when a certain character area is detected, all the characters in the character area are tried to be discriminated in order. In the example of FIG. 12, when the character area 1 is detected, the direction of the characters in the character area 1 is sequentially determined within a predetermined time. There are 120 characters in the character area 1, and there is a character in which all the characters are the same even if they are turned upside down. Therefore, it is possible to detect 0 ° or 180 ° in the direction determination of the character in the character area 1.

【００５８】又、文字領域１には１２０文字あり、所定
時間内には３０文字の判別しか行えないため、文字領域
３、４に確実に判定できる文字領域があったとしても、
文字領域１内の文字のみの判別でこの原稿の文書方向を
決定しなければならない。本来、図１２に示した原稿は
０°の方向と判定されなければならないが、１８０°と
誤判定する可能性があった。又、文字領域２のように文
字が斜めになっている場合も同様である。Further, since there are 120 characters in the character area 1 and only 30 characters can be discriminated within a predetermined time, even if there is a character area that can be surely discriminated in the character areas 3 and 4,
The document direction of this original must be determined by discriminating only the characters in the character area 1. Originally, the original shown in FIG. 12 had to be judged to be in the direction of 0 °, but there was a possibility of misjudging to be 180 °. The same applies to the case where the characters are inclined as in the character area 2.

【００５９】次に、図１２の説明図及び図１３のフロー
チャートを用いて本発明の画像処理装置の詳細な動作に
ついて説明する。Next, the detailed operation of the image processing apparatus of the present invention will be described with reference to the explanatory view of FIG. 12 and the flowchart of FIG.

【００６０】先ず、ステップＳ１で操作パネル１４０上
のコピーキーがＯＮしたが否かを判断する。ＯＮされれ
ばステップＳ２で原稿台上の原稿を読み取り、ステップ
Ｓ３において、前述した方法により、原稿内の文字領域
を切り出し、或る文字領域内の文字について方向判別を
行う。First, in step S1, it is determined whether or not the copy key on the operation panel 140 is turned on. If turned on, the original on the original table is read in step S2, and in step S3, the character area in the original is cut out by the method described above, and the direction of the character in a certain character area is determined.

【００６１】例えば、図１０に示した様な原稿の場合、
文字領域１には数字列で１２０文字、文字領域２には
「ＣＯＮＦＩＤＥＮＴＩＡＬ」という斜めの文字列で１
２文字、文字領域３には「この明細書に記述された内
容」という文字列で１３文字、文字領域４には「・本発
明の解決する課題、・本発明の効果」という文字列で１
６文字の各文字領域が検出されたとする。For example, in the case of a manuscript as shown in FIG.
The character area 1 is 120 characters in a numerical string, and the character area 2 is 1 in a diagonal character string "CONFIDENTIAL".
Two characters, 13 characters in the character string "contents described in this specification" in the character area 3, and 1 in the character string "・ Problems to be solved by the present invention-Effects of the present invention" in the character area 4.
It is assumed that each character area of 6 characters is detected.

【００６２】又、本実施の形態の文字判別に与えられた
時間が例えば１秒間であり、この時間内に判別可能な文
字数を３０文字、１つの文字領域内で判別する文字数は
図１１に示した原稿向き検知詳細設定画面において８文
字が設定されたとする。ステップ３において、先ず文字
領域１について文字方向判別を行い、ステップＳ４で８
文字の判別が終了したか否かを判断する。８文字の判別
が終了していない場合はステップＳ３に戻り、文字方向
判別を行うが、ステップＳ４で所定文字数（８文字）の
判別が終了するとステップＳ５に進み、所定時間（１
秒）が終了か否かを判断する。所定時間が終了していな
ければ、ステップＳ６に進み、原稿内の全ての文字領域
について文字方向判別が終了したか否かを判断する。Further, the time given to the character discrimination of the present embodiment is, for example, 1 second, and the number of characters that can be discriminated within this time is 30 characters, and the number of characters discriminated in one character area is shown in FIG. It is assumed that 8 characters are set on the document orientation detection detail setting screen. In step 3, the character direction is first determined for the character area 1, and in step S4, 8 is determined.
It is determined whether or not the character discrimination is completed. If the determination of 8 characters has not been completed, the process returns to step S3 to determine the character direction. If the determination of the predetermined number of characters (8 characters) is completed in step S4, the process proceeds to step S5 for a predetermined time (1
Second) is finished. If the predetermined time has not ended, the process proceeds to step S6, and it is determined whether the character direction determination has ended for all the character areas in the document.

【００６３】全ての文字領域について文字方向判別が終
了していなければ、ステップＳ７に進んで別の文字領域
について文字方向判別を行う。前回、文字領域１の文字
について行っていたので、今度は文字領域２の文字につ
いて文字方向判別動作を行う。そして、ステップＳ８で
所定文字数（８文字）の判別が終了したか否かを判断
し、終了していなければステップＳ７に戻って文字方向
判別動作を行うが、ステップＳ８で所定文字数が終了し
たと判断されると、ステップＳ５に戻って所定時間が終
了したか否かを判断する。所定時間が終了していない場
合は先に説明したステップＳ６からステップＳ８までの
動作を繰り返し行い、文字領域３、文字領域４について
も同様に８文字の文字方向判別を行う。If the character direction determination is not completed for all the character areas, the process proceeds to step S7, and the character direction determination is performed for another character area. Since the previous operation was performed for the character in the character area 1, the character direction determination operation is performed for the character in the character area 2 this time. Then, in step S8, it is determined whether or not the predetermined number of characters (8 characters) has been determined. If not, the process returns to step S7 to perform the character direction determination operation, but in step S8, the predetermined number of characters is determined to have ended. When judged, it returns to step S5 and judges whether or not the predetermined time has ended. If the predetermined time has not ended, the operations from step S6 to step S8 described above are repeated, and the character direction determination of 8 characters is similarly performed for the character areas 3 and 4.

【００６４】ステップＳ５で所定時間が終了したと判断
されると、ステップＳ９に進み、文字領域１から文字領
域４までの文字方向判別結果の多数決を取る。例えば、
文字領域１では１８０°、文字領域２では９０°、文字
領域３では０°、文字領域４では０°と判別結果が出た
とすると多数決を取って０°となり、ステップＳ１０で
この原稿の文書方向は０°と決定される。When it is determined in step S5 that the predetermined time has ended, the process proceeds to step S9, and the majority of the character direction determination results from the character areas 1 to 4 is taken. For example,
If the determination result is 180 ° in the character area 1, 90 ° in the character area 2, 0 ° in the character area 3, and 0 ° in the character area 4, the majority decision is taken to be 0 °, and the document direction of this document is determined in step S10. Is determined to be 0 °.

【００６５】[0065]

【発明の効果】以上の説明で明らかなように、本発明に
よれば、原稿を読み取る読み取り手段によって読み取ら
れた画像データを画像処理する画像処理装置において、
前記画像データ中の文字領域を検出する文字領域検出手
段と、該文字領域検出手段によって検出された各文字領
域内の文字について所定数の文字の方向を判別する判別
手段と、該判別手段の各文字領域の判別結果に応じて前
記画像データの文書方向を判別する文書方向判別手段を
設けたため、方向判別に時間制限がある場合であって
も、文字方向判別の困難に文字ばかりを判別してしまう
ことなく、文字判別精度を高めて誤判別を防ぐことがで
きるという効果が得られる。As is apparent from the above description, according to the present invention, in the image processing apparatus for image-processing the image data read by the reading means for reading the original,
A character area detecting unit that detects a character area in the image data, a determining unit that determines the direction of a predetermined number of characters in each character area detected by the character area detecting unit, and each of the determining means. Since the document direction determining means for determining the document direction of the image data according to the result of the determination of the character area is provided, even if there is a time limit in the direction determination, it is difficult to determine the character direction and only the characters are determined. It is possible to obtain the effect that the accuracy of character discrimination can be improved and erroneous discrimination can be prevented without causing a mistake.

[Brief description of drawings]

【図１】本発明に係る画像処理装置におけるコントロー
ラ部内の画像処理部の構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of an image processing unit in a controller unit in an image processing apparatus according to the present invention.

【図２】本発明に係る画像処理装置におけるコントロー
ラ部の構成を示すブロック図である。FIG. 2 is a block diagram showing a configuration of a controller unit in the image processing apparatus according to the present invention.

【図３】本発明に係る画像処理装置の構成を示す断面図
である。FIG. 3 is a sectional view showing a configuration of an image processing apparatus according to the present invention.

【図４】図１に示す文書方向判別部の構成を示すブロッ
ク図である。FIG. 4 is a block diagram showing a configuration of a document orientation determination unit shown in FIG.

【図５】本発明の文書方向自動判別と文字認識処理を示
すフローチャートである。FIG. 5 is a flowchart showing automatic document orientation determination and character recognition processing of the present invention.

【図６】本発明の文書方向自動判別における領域分離状
態を示した図である。FIG. 6 is a diagram showing a region separation state in automatic document orientation determination according to the present invention.

【図７】文字認識処理の処理過程を説明するための図で
ある。FIG. 7 is a diagram illustrating a process of character recognition processing.

【図８】本発明の文書（文字）方向自動判別処理を説明
するための図である。FIG. 8 is a diagram for explaining a document (character) direction automatic determination process of the present invention.

【図９】領域分離及び文字認識情報のデータ形式を示す
図である。FIG. 9 is a diagram showing a data format of area separation and character recognition information.

【図１０】本発明に係る画像処理装置における操作パネ
ルの表示例を示す図である。FIG. 10 is a diagram showing a display example of an operation panel in the image processing apparatus according to the present invention.

【図１１】本発明に係る画像処理装置における操作パネ
ルの表示例を示す図である。FIG. 11 is a diagram showing a display example of an operation panel in the image processing apparatus according to the present invention.

【図１２】本発明及び従来例の説明のための原稿例を示
す図である。FIG. 12 is a diagram showing an example of a document for explaining the present invention and a conventional example.

【図１３】本発明に係る画像処理装置の動作を示すフロ
ーチャートである。FIG. 13 is a flowchart showing an operation of the image processing apparatus according to the present invention.

[Explanation of symbols]

１０１原稿台ガラス１０２原稿照明ランプ１０３〜１０５走査ミラー１０６ＣＣＤユニット１０７結像レンズ１０８撮像素子１０９ＣＣＤドライバ１１０感光ドラム１１２前露光ランプ１１３１次帯電器１１７露光手段１１８現像器１２７転写帯電器１３９コントローラ部２０１ＣＰＵ２０３ＲＯＭ２０４ＲＡＭ２０６画像処理部３０１シェーディング回路３０２変倍回路３０３エッジ強調回路３０４ γ変換回路３０５２値化処理部３０６文書方向判別部３０７合成回路３０８ＰＷＭ回路３０９メモリ制御部３１０画像用メモリ 101 Platen glass 102 Original illumination lamp 103-105 Scanning mirror 106 CCD unit 107 Imaging lens 108 image sensor 109 CCD driver 110 photosensitive drum 112 Pre-exposure lamp 113 Primary charger 117 exposure means 118 developing device 127 Transfer charger 139 Controller section 201 CPU 203 ROM 204 RAM 206 Image processing unit 301 Shading circuit 302 scaling circuit 303 Edge enhancement circuit 304 γ conversion circuit 305 Binarization processing unit 306 Document direction determination unit 307 Synthesis circuit 308 PWM circuit 309 memory control unit 310 Image memory

Claims

[Claims]

1. An image processing apparatus for image-processing image data read by reading means for reading an original image, wherein the character area detecting means detects a character area in the image data, and the character area detecting means detects the character area. Further, a discriminating means for discriminating the direction of a predetermined number of characters in each character area and a document direction discriminating means for discriminating the document direction of the image data according to the discrimination result of each character area of the discriminating means are provided. An image processing device characterized by the above.

2. The image processing apparatus according to claim 1, further comprising setting means for setting the predetermined number of characters.

3. The image processing apparatus according to claim 1, wherein the document orientation discriminating means discriminates the document orientation of the original image by a majority decision of the discrimination results of the respective character areas.

4. The image processing apparatus according to claim 1, wherein the determination unit sequentially performs each character area within a predetermined time.