JPH06103411A

JPH06103411A - Document reader

Info

Publication number: JPH06103411A
Application number: JP4254354A
Authority: JP
Inventors: Katsumi Marukawa; 勝美丸川; Kazuki Nakajima; 和樹中島; Masashi Koga; 昌史古賀; Yoshihiro Shima; 好博嶋
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1992-09-24
Filing date: 1992-09-24
Publication date: 1994-04-15
Anticipated expiration: 2016-07-11
Also published as: JPH11219409A; JP3186246B2

Abstract

PURPOSE:To read the contents of a document, and to exhibit a corrected picture to a user or to store it by detecting an angle of rotation, and correcting the input picture to a correct direction even in the case that the document is input ted while being rotated by an arbitrary angle to the set direction of a scanner. CONSTITUTION:This reader is provided with a picture input means 105 for inputting a document picture, a character line extracting means 110 for extracting the character line of the inputted document picture, an inclination extracting means 125 for extracting the inclination of the document, a character line coordinate rotating means 165 for rotating the extracted character line by the angles obtained by adding 0 deg., 90 deg., 180 deg., 270 deg. to the inclination of the document, and a document rotation angle judging means 150 for recognizing respectively four rotated character strings and judging the angle of rotation of the most correct one among them as the inclination of the document, and the inclination of the document is corrected.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文書読取装置および電
子ファイル装置およびファクシミリ装置および複写機お
よび計算機に入力する紙の文書を電子的なデータに変換
する装置に関し、特に、予め決まっているスキャナ（走
査線）の読み取り方向（移動方向）に対し、ユーザがこ
の方向を意識せず、紙の文書をスキャナ上に０度から３
６０度までのどのような角度で設定しても、文書に記載
されている内容を読み取ったり、あるいは、正しい方向
に入力画像を修正したりする等のユーザの使い勝手を改
善した文書読取装置および電子ファイル装置およびファ
クシミリ装置および複写機および計算機に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document reading device, an electronic file device, a facsimile device, a device for converting a paper document input to a copying machine and a computer into electronic data, and more particularly to a predetermined scanner. The user is not aware of the (scanning line) reading direction (moving direction), and the paper document is read on the scanner from 0 degrees to 3 degrees.
A document reading device and an electronic device with improved usability such as reading the contents described in a document or correcting an input image in the correct direction regardless of the angle set up to 60 degrees The present invention relates to a file device, a facsimile device, a copying machine, and a computer.

【０００２】[0002]

【従来の技術】元来、スキャナの読み取り方向と文書の
スキャナ上への設定方向が一致した状態でのみ、文書中
に記載された内容が読み取れる文書読取装置が知られて
いる。しかしながら、近年、文書読取装置が実現される
に及んで、文書が傾いて入力されたり、それと同時にス
キャナの読み取り方向と文書のスキャナ上への設定方向
が不一致であるという設定状態の不十分な場合において
も入力文書の読み取りを行なわなければならない状況が
発生している。2. Description of the Related Art Originally, there has been known a document reading device which can read the contents described in a document only when the reading direction of the scanner and the setting direction of the document on the scanner match. However, in recent years, with the realization of document reading devices, when a document is input with a tilt, and at the same time, the reading direction of the scanner and the setting direction of the document on the scanner do not match, the setting state is insufficient. There is a situation in which the input document must be read.

【０００３】上記の問題を解決する従来例としては、特
開昭６２−１４２７７号公報、特開平２−１０５２６６
号公報がある。As a conventional example for solving the above problems, Japanese Patent Laid-Open Nos. 62-14277 and 2-105266 are available.
There is a gazette.

【０００４】特開昭６２−１４２７７号公報に開示の装
置では、画像から輪郭抽出を行ない、抽出された輪郭か
ら罫線などの直線部分の傾きを抽出して、この直線部分
の傾きが水平（あるいは垂直）になるように傾きを補正
することにより、画像の傾きを補正可能とする。In the apparatus disclosed in Japanese Patent Laid-Open No. 62-14277, contour extraction is performed from an image, the inclination of a straight line portion such as a ruled line is extracted from the extracted contour, and the inclination of this straight line portion is horizontal (or By correcting the inclination so that it becomes (vertical), the inclination of the image can be corrected.

【０００５】特開平２−１０５２６６号公報に開示の装
置では、黒画素計数手段により計数した計数結果を比較
しその結果により入力されている文書画像を回転させる
ことにより、文書画像の上下関係の自動修正を図る。In the apparatus disclosed in Japanese Patent Application Laid-Open No. 2-105266, the counting results counted by the black pixel counting means are compared, and the document image input based on the comparison result is rotated, whereby the vertical relationship of the document images is automatically detected. Fix it.

【０００６】[0006]

【発明が解決しようとする課題】しかし、上記の従来の
装置では、上下関係が一致して傾いている状態（傾きが
０〜９０度）、あるいは、入力文書に傾きがなく上下関
係が異なっていた状態（傾きが１８０度）しか文書に記
載された内容を読み取ることができない。However, in the above-described conventional apparatus, the vertical relationship is the same, and the vertical relationship is the same (the inclination is 0 to 90 degrees), or the input document has no inclination and the vertical relationship is different. The contents described in the document can be read only in the open state (inclination is 180 degrees).

【０００７】つまり、上記の従来の装置では、スキャナ
の設定方向に対し任意の角度（０度から３６０度）回転
されて入力された場合、その内容を読み取ることができ
なかったり、正しい方向に入力画像を修正しユーザに画
像を提示できない等のユーザの使い勝手を考慮した機能
を持っていなかった。That is, in the above-mentioned conventional apparatus, when the input is rotated by an arbitrary angle (0 to 360 degrees) with respect to the setting direction of the scanner, the contents cannot be read or the input is made in the correct direction. It does not have a function that considers the usability of the user such as modifying the image and not presenting the image to the user.

【０００８】これは、図３に示すように、Ａ３スキャナ
２３００の走査線が移動する方向２３１０と文書を設定
する領域２３２０はあらかじめ決まっている。そのた
め、従来の装置ではシステムが処理する方向はスキャナ
の走査線が移動する方向と一致していなければ処理でき
ない。As shown in FIG. 3, the direction 2310 in which the scanning line of the A3 scanner 2300 moves and the area 2320 for setting a document are predetermined. Therefore, in the conventional apparatus, the processing direction of the system cannot be processed unless the scanning line of the scanner moves.

【０００９】したがって、処理できる許容範囲として
は、図４に示すように、文書２４００の上下関係が一致
して多少傾いたもの、あるいは、図５に示すように、文
書２５００に傾きが無く上下関係が反転したものであっ
た。図６に示すように、文書２６００が９０度あるいは
１８０度あるいは２７０度の回転に加えて傾きもある場
合、従来のシステムでは文書中に記載された内容を読み
取れなかったし、入力画像を修正しユーザに提示する機
能等を持っていなかった。Therefore, as the allowable range of processing, as shown in FIG. 4, the document 2400 has the same vertical relationship and is slightly tilted, or, as shown in FIG. 5, the document 2500 has no tilt and the vertical relationship. Was the flipped one. As shown in FIG. 6, when the document 2600 has a tilt in addition to the rotation of 90 degrees, 180 degrees, or 270 degrees, the conventional system cannot read the content described in the document and corrects the input image. It did not have a function to present to the user.

【００１０】ここで、回転角とはスキャナの読み取り方
向と文書の上下方向が指示する方向との角度の差として
定義する。例えば、図５の矢印２６０５は文書の上下方
向を指しており、回転角は文書の上下関係の概念を考慮
した角度である。Here, the rotation angle is defined as the difference between the reading direction of the scanner and the direction indicated by the vertical direction of the document. For example, the arrow 2605 in FIG. 5 indicates the vertical direction of the document, and the rotation angle is an angle considering the concept of the vertical relationship of the document.

【００１１】また、上記の装置では、ユーザが間違えて
文書の裏面を入力した場合とか、文書の読み取るべき部
分がスキャナの読み取り領域からはみ出した場合の検出
あるいはそのような文書の処理方法等のユーザの使い勝
手を考慮した機能を持っていない。Further, in the above apparatus, the user is required to detect when the user mistakenly inputs the back side of the document or when the portion to be read of the document is out of the reading area of the scanner or the method of processing such a document. It does not have a function considering the usability of.

【００１２】また、上記の装置では、文書に記載されて
いない文書に関わる著者、入手先、入手日時、メモ等の
付加情報を入力文書に関する情報に関連付けて入力した
り、文書間同志の関係を持たせる機能が無いため、文書
に記載されていない情報を登録することも検索すること
もできず、また、関連のある他の文書の情報から所望の
文書に関わる情報を検索することができない等のユーザ
の使い勝手を考慮した機能を持っていない。Further, in the above apparatus, additional information such as an author, a place of acquisition, a date and time of acquisition, a memo, etc. relating to a document not described in the document can be input in association with the information regarding the input document, and the relationship between documents can be established. Since there is no function to have it, it is not possible to register or search information that is not described in the document, and it is not possible to search information related to the desired document from information of other related documents, etc. Does not have a function that considers the usability of the user.

【００１３】さらに、上記装置では、データ登録時での
ファイル容量のチェック機能、大量に蓄積・管理された
画像データに対しての読み取り機能、画像回転修正機
能、あるいは、文字認識時での外字処理機能等のユーザ
の使い勝手を考慮していない。Further, in the above apparatus, a file capacity checking function at the time of data registration, a reading function for a large amount of image data stored and managed, an image rotation correction function, or external character processing at the time of character recognition The user-friendliness of functions and the like are not considered.

【００１４】以上のように、従来の装置ではユーザにと
って使い勝手が悪いと言う問題点があった。As described above, the conventional device has a problem that it is inconvenient for the user.

【００１５】そこで、本発明の第１の目的は、文書がス
キャナの設定方向に対し任意の角度（０度から３６０
度）で回転されて入力された場合でも、その内容を読み
取ることができたり、あるいは、正しい方向に入力画像
を修正しユーザに提示する機能等のユーザの使い勝手を
考慮した文書読取装置あるいは電子ファイル装置あるい
はファクシミリ装置あるいは複写機あるいは計算機を提
供することにある。Therefore, a first object of the present invention is to set a document at an arbitrary angle (from 0 degree to 360 degrees) with respect to the setting direction of the scanner.
Even if the image is rotated and input, the contents can be read, or a document reading device or an electronic file that considers the usability of the user such as the function of correcting the input image in the correct direction and presenting it to the user. To provide an apparatus, a facsimile machine, a copying machine, or a computer.

【００１６】また、本発明の第２の目的は、ユーザが間
違えて文書の裏面を入力した場合とか、文書の読み取る
べき部分がスキャナの読み取り領域からはみ出した場合
の検出、そのような文書の処理方法そしてユーザが再度
文書の設定を行うこと無く自動的にはみ出し領域の内容
を含め読み取る等のユーザの使い勝手を考慮した文書読
取装置あるいは電子ファイル装置あるいはファクシミリ
あるいは複写機あるいは計算機を提供することにある。A second object of the present invention is to detect when the user mistakenly inputs the back side of the document or when the portion to be read of the document is out of the reading area of the scanner, and to process such a document. A method and a document reading device, an electronic file device, a facsimile, a copying machine, or a computer, which considers the user's usability such as automatically reading the contents of the protruding area without the user having to set the document again. .

【００１７】また、本発明の第３の目的は、文書に記載
されていない文書に関わる著者、入手先、入手日時、メ
モ等の付加情報の登録や検索をするができ、また、関連
のある他の文書の情報から所望の文書に関わる情報を検
索できる等のユーザの使い勝手を考慮した機能を持った
文書読取装置あるいは電子ファイル装置あるいはファク
シミリ装置あるいは複写機あるいは計算機を提供するこ
とにある。The third object of the present invention is to register and retrieve additional information related to a document which is not described in the document, such as author, source, date and time of acquisition, memo, etc. Another object of the present invention is to provide a document reading device, an electronic file device, a facsimile device, a copying machine, or a computer having a function in consideration of user's usability, such as searching information related to a desired document from information of another document.

【００１８】さらに、本発明の第４の目的は、データ登
録時でのファイル容量のチェック機能、大量に蓄積・管
理された画像データに対しての読み取り機能、画像回転
修正機能、あるいは、文字認識時での外字処理機能等の
ユーザの使い勝手を考慮した文書読取装置あるいは電子
ファイル装置あるいはファクシミリ装置あるいは複写機
あるいは計算機を提供することにある。Further, a fourth object of the present invention is to check a file capacity at the time of data registration, a reading function for a large amount of image data stored and managed, an image rotation correction function, or character recognition. Another object of the present invention is to provide a document reading device, an electronic file device, a facsimile device, a copying machine, or a computer in consideration of user's usability such as an external character processing function.

【００１９】[0019]

【課題を解決するための手段】上記の第１の目的を達成
するために、文書画像を入力する手段と、入力された文
書画像の文字行を抽出する手段と、文書の傾きを抽出す
る手段と、上記の抽出された文字行を文書の傾きに０
度、９０度、１８０度、２７０度を加えた角度回転させ
る手段と、回転された４つの文字行をそれぞれ認識を行
ない、その中で最も正しいものの回転角を文書の傾きと
して画像を補正する。In order to achieve the above first object, means for inputting a document image, means for extracting character lines of the input document image, and means for extracting inclination of a document. And the above extracted character line to the inclination of the document as 0
A unit for rotating an angle of 90 °, 180 °, 270 ° and four rotated character lines are respectively recognized, and the most correct rotation angle among them is used as the inclination of the document to correct the image.

【００２０】第２の目的を達成するために、上記の文書
画像の文字行を抽出する手段で、文字行が抽出されない
場合は、文書が裏側で入力されたと判定する手段を備え
た。In order to achieve the second object, the means for extracting the character line of the document image described above is provided with means for determining that the document is input on the back side when the character line is not extracted.

【００２１】また、任意の位置にある文字行を抽出する
手段により得られた文字行の４つの頂点の２頂点以上が
スキャナ読み取り領域の４辺上に存在するか否かに従い
読み取るべき文字行がスキャナ読み取り領域外にあるか
否かを判定する手段を備えた。Further, the character line to be read depends on whether or not two or more of the four vertices of the character line obtained by the means for extracting the character line at an arbitrary position are present on the four sides of the scanner reading area. A means for determining whether or not it is outside the scanner reading area is provided.

【００２２】また、Ａ４スキャナ読み取り領域において
上記手段によりはみ出していると判定した場合、新たに
Ａ３スキャナで文書画像を採取することで自動的にはみ
出し領域であった内容も含め文書中の記載内容を漏らさ
ず読み取る手段とを備えた。Further, when it is determined that the document is protruding in the A4 scanner reading area by the above-mentioned means, a new document image is collected by the A3 scanner to automatically display the contents described in the document, including the contents of the protruding area. And means for reading without leaking.

【００２３】さらに、入力に不備があったと判定された
文書画像の文書番号をリジェクトファイルに登録する手
段や、エラーメッセージのウインド上への表示あるいは
音声での呼び掛けによる警告を促す手段とを備えても良
い。Further, the apparatus is provided with means for registering the document number of the document image determined to be incomplete in the reject file, and means for displaying an error message on the window or prompting a warning by voice call. Is also good.

【００２４】第３の目的を達成するために、入力された
文書画像に文書番号を登録する手段と、文書に関わる入
手日時や目的や入手先等の文書に書かれていない付加情
報を入力するための付加情報入力する手段と、入力され
た付加情報を文書番号や入力文書を処理した文書情報に
対応付けて電子的に記録する手段と、付加情報や文書情
報を検索する手段とを備えた。In order to achieve the third object, a means for registering a document number in the input document image and additional information which is not written in the document such as date and time of acquisition, purpose and destination of the document are input. Means for inputting additional information, electronically recording the input additional information in association with the document number or the document information obtained by processing the input document, and means for searching the additional information or the document information. .

【００２５】また、文書同志の関係情報を電子的に記録
する手段と、文書同志の関係情報を検索して所望の文書
についての文書情報や付加情報を検索する手段とを備え
ても良い。Further, there may be provided means for electronically recording related information between documents and means for searching for related information between documents and searching for document information or additional information about a desired document.

【００２６】第４の目的を達成するために、本発明は、
入力文書画像の処理結果をファイルに出力するための空
き容量を表示する手段と、空き容量が少なくなった場合
には警告をウインド上への表示あるいは音声で促す手段
あるいはネットワークを介しオペレータがいる他の装置
に警告を促す手段とを具備したことを特徴とする文書読
取装置あるいは電子ファイル装置あるいはファクシミリ
装置あるいは複写機あるいは計算機を提供する。In order to achieve the fourth object, the present invention provides
There is a means to display the free space to output the processing result of the input document image to a file, a means to display a warning on the window or a voice prompt when the free space is low, or an operator via the network. And a document reading device, an electronic file device, a facsimile device, a copying machine, or a computer.

【００２７】また、複数枚の文書をスキャナ入力した文
書画像をデータ蓄積装置に格納する手段と、格納時に文
書番号を付加する手段と、格納された文書画像を逐次ロ
ードし画像回転修正あるいは読み取り処理を行う手段と
を備えた。Further, a means for storing a document image in which a plurality of documents are scanner-input into the data storage device, a means for adding a document number at the time of storage, a stored document image is sequentially loaded to perform image rotation correction or reading processing. And means for performing.

【００２８】さらに、文書をディジタル画像として入力
する装置の読み取り処理において、認識対象文字コード
がシステム側に存在しない場合、文字画像を外字として
辞書に登録する手段や、登録された記号を読み取り結果
として割り当てて表示あるいはファイルに出力する手段
を備えても良い。Further, in the reading process of the apparatus for inputting a document as a digital image, if the character code to be recognized does not exist on the system side, a means for registering the character image as an external character in the dictionary or a registered symbol as a reading result. Means for allocating and displaying or outputting to a file may be provided.

【００２９】[0029]

【作用】上記の構成により、文書画像中の任意の位置に
ある文字行を抽出し、スキャナ設定方向に対する文書画
像の傾きを抽出し、スキャナ設定方向に対する文書画像
の回転角を求めるための適切な文字行を選択し、適切な
文字行の部分画像を検出した傾きに４種類の角度０度、
９０度、１８０度、２７０度を加えた角度だけそれぞれ
回転し、４種類のそれぞれの回転文字行部分画像から部
分画像中の文字を切り出し認識し入力文書画像のスキャ
ナ設定方向に対する回転角を評価する手段あるいはこの
手段に入力文書のレイアウト情報を用いて入力文書画像
のスキャナ設定方向に対する回転角を評価し、求められ
た文書の回転角だけ入力画像を回転修正することがで
き、従来不可能であった任意の回転角で入力された文書
をユーザが見やすいようにディスプレイ上に表示あるい
は蓄積出来、ユーザのデータ操作の使い勝手をはるかに
向上できる。また、回転修正画像に対して文字行を抽出
し直して記載された内容を読み取る手段あるいは求めた
回転角だけ文字行部分画像を回転修正しレイアウト情報
を利用することで回転修正文字行画像を処理する順番を
求め記載された内容を順次読み取ることで、従来不可能
であった任意の回転角で入力された文書中に記載された
文字画像のコード化が可能となり、オートフィーダー等
を用いた自動登録やユーザがマニュアルでデータ入力す
る際の再入力が不要になるため、入力作業の高効率化が
実現可能となる。With the above structure, a character line at an arbitrary position in the document image is extracted, an inclination of the document image with respect to the scanner setting direction is extracted, and an appropriate angle for obtaining the rotation angle of the document image with respect to the scanner setting direction is obtained. Select a character line, and select four angles of 0 degrees for the tilt that detected the partial image of the appropriate character line,
Rotate by 90 degrees, 180 degrees, and 270 degrees, respectively, and cut out characters in the partial image from each of the four types of rotated character line partial images, recognize them, and evaluate the rotation angle of the input document image with respect to the scanner setting direction. It is possible to evaluate the rotation angle of the input document image with respect to the scanner setting direction by using the means or the layout information of the input document in this means, and rotate and correct the input image by the obtained rotation angle of the document. Further, a document input at an arbitrary rotation angle can be displayed or stored on the display so that the user can easily see it, and the usability of the user's data operation can be greatly improved. In addition, the rotation correction character line image is processed by re-extracting the character line from the rotation correction image and reading the described content or by rotating the character line partial image by the obtained rotation angle and using the layout information. It is possible to encode the character image described in the document input at an arbitrary rotation angle, which was not possible in the past, by sequentially reading the described contents by determining the order to perform, and automatic registration using an auto feeder etc. Since there is no need to re-input data when the user manually inputs data, it is possible to improve the efficiency of input work.

【００３０】また、求めた回転角だけ入力画像を修正回
転し、修正画像を順次蓄積でき、ユーザはスキャナ入力
方向を意識せず文書画像の登録作業を行うことが出来
る。また、文書入力はスキャナのカバーを用いて行うた
め、雑誌等の見開き文書の入力作業は１頁おきにスキャ
ナのカーバーが邪魔になり入力作業が困難であった。し
かし、文書の回転角を判定し画像を修正・蓄積するた
め、ユーザは文書の設定方向を全く意識せずに気楽に文
書を反転させてでも入力することが出来るため、入力作
業の高効率化が実現可能となる。Further, the input image can be corrected and rotated by the obtained rotation angle and the corrected images can be sequentially stored, and the user can register the document image without being aware of the scanner input direction. Further, since the document input is performed by using the cover of the scanner, the input work of the spread document such as a magazine is difficult because the carver of the scanner interferes with every other page. However, since the rotation angle of the document is determined and the image is corrected / stored, the user can easily input the document by reversing the document without paying attention to the setting direction of the document. Can be realized.

【００３１】文書画像から抽出した文字行の有無に従
い、入力文書が表で正常に入力されたものか間違えて裏
で入力されたものかを自動的に判定することが出来るた
め、ユーザにエラーを指示することができ、入力作業の
効率化を実現できる。また、入力不備の文書番号をリジ
ェクトファイルに登録するため、マニュアルでのデータ
入力やオートフィダー等を用いた自動登録時に、入力状
況のチェックができ、目視等による人間の確認作業を大
幅に削減できる。Depending on the presence or absence of character lines extracted from the document image, it is possible to automatically determine whether the input document is normally input on the front side or is input by mistake on the back side. It is possible to give instructions, and the efficiency of input work can be realized. In addition, because the document number of the input error is registered in the reject file, it is possible to check the input status during manual data input or automatic registration using the auto feeder, etc., and it is possible to greatly reduce human confirmation work by visual inspection etc. .

【００３２】また、システムが任意の位置にある文字行
を抽出する手段により得られた文字行の４つの頂点の２
頂点以上がスキャナ読み取り領域の４辺上に存在するか
否かに従い、読み取るべき文字行がスキャナ読み取り領
域外にあるか否かを判定することが出来るため、ユーザ
にエラーを指示することができ、入力作業の効率化を実
現できる。また、入力不備の文書番号をリジェクトファ
イルに登録するため、マニュアルでのデータ入力やオー
トフィダー等を用いた自動登録時に、入力状況のチェッ
クができ、目視等による人間の確認作業を大幅に削減で
きる。Further, 2 of the four vertices of the character line obtained by the means for the system to extract the character line at an arbitrary position.
It is possible to judge whether or not the character line to be read is outside the scanner reading area according to whether or not the vertices and above are present on the four sides of the scanner reading area, and thus it is possible to instruct the user of an error. The efficiency of input work can be realized. In addition, because the document number of the input error is registered in the reject file, it is possible to check the input status during manual data input or automatic registration using the auto feeder, etc., and it is possible to greatly reduce human confirmation work by visual inspection etc. .

【００３３】また、上記手段により入力文書がＡ４スキ
ャナ読み取り領域をはみ出していることがわかった場
合、新たにＡ３スキャナで文書画像を採取し、これに対
し読み取り処理を行うことにより、ユーザが目視により
はみ出しを確認する必要がなく、かつ、文書を再設定し
て再度読み取り処理を行わなくてもシステムが自動的に
文書中の記載内容を漏らすこと無く読み取ることが出
来、入力作業の効率化を実現できる。When it is found by the above means that the input document is out of the A4 scanner reading area, a document image is newly picked up by the A3 scanner and the reading process is performed on the document image so that the user can visually check it. There is no need to check the protrusion, and the system can automatically read the contents of the document without leaking it even without resetting the document and performing the reading process again. it can.

【００３４】また、システムが入力に不備があったと判
定した場合、エラーメッセージのウインド上への表示あ
るいは音声での呼び掛けによる警告を促すことで、ユー
ザにエラーを指示することが出来るため、入力作業の効
率化を実現できる。When the system determines that the input is inadequate, the user can be informed of the error by displaying an error message on the window or prompting a warning by calling out by voice. The efficiency of can be realized.

【００３５】さらに、入力された文書画像に文書番号を
登録し、ユーザは文書に関わる著者、入手日時、入手先
うあメモ等の文書に書かれていない付加情報を入力する
ための付加情報入力し、入力された付加情報を文書番号
や入力文書を処理した文書情報に対応付けて電子的に記
録するため、ユーザは付加情報や文書情報を指定して、
対応する付加情報や文書情報を検索手段より効率良く検
索し、容易に情報を取り出すことが出来る。Further, the document number is registered in the input document image, and the user inputs additional information for inputting additional information not written in the document such as the author, date and time of acquisition, and memo of the user. Then, since the input additional information is electronically recorded in association with the document number or the document information obtained by processing the input document, the user specifies the additional information or the document information,
The corresponding additional information or document information can be retrieved more efficiently than the retrieval means, and the information can be retrieved easily.

【００３６】また、文書同志の関係情報を電子的に記録
するため、ユーザは検索手段により文書同志関係情報を
容易に検索して、ある文書から他の文書をたぐり、その
文書についての文書情報や付加情報を取り出すことがで
き、ユーザのおぼろげな記憶からでも他の文書に関する
情報を用いて所望の情報を入手することが出来る。Further, in order to electronically record the relationship information between documents, the user can easily search the document relationship information by the searching means, search for a document from another document, and obtain document information about the document. The additional information can be taken out, and the desired information can be obtained from the user's vague memory by using the information about other documents.

【００３７】入力文書画像の処理結果をファイルに出力
するための空き容量を表示し、空き容量が少なくなった
場合には警告をウインド上への表示あるいは音声で促し
たり、あるいは、ネットワークを介しオペレータがいる
他の装置に警告を促すことにより、入力作業のやり直し
やシステムへの弊害を回避することが出来る。The free space for outputting the processing result of the input document image to a file is displayed, and when the free space becomes small, a warning is displayed on the window or voice is urged, or an operator is requested via a network. It is possible to avoid the redo of the input work and the adverse effect on the system by urging the other device having a warning to warn.

【００３８】また、複数枚の文書をスキャナ入力した文
書画像をデータ蓄積装置に格納し、格納時に文書番号を
付加し、格納された文書画像を逐次ロードし読み取り処
理あるいは画像回転修正を行うことで、大量に入力され
た文書画像に対して文書に関する情報を管理しながら文
字画像をコード化でき、ユーザの修正作業を削減でき
る。Further, by storing a document image in which a plurality of documents are scanner-input into the data storage device, adding a document number at the time of storage, and sequentially loading the stored document images to perform reading processing or image rotation correction. A character image can be encoded while managing information about the document for a large number of input document images, and the correction work of the user can be reduced.

【００３９】さらに、文書をディジタル画像として入力
する装置の読み取り処理において、認識対象文字コード
がシステム側に存在しない場合、文字画像を外字として
辞書に登録し、ある記号を登録した外字の読み取り結果
として割り当てて表示あるいはファイルに出力すること
で、システムに存在しない認識不可文字が入力されても
対処できる。また、意味不明な認識結果を出力せず、ユ
ーザが容易に読み取り結果を処理することができ、読み
取り精度を向上させる。Further, in the reading process of the device for inputting a document as a digital image, if the character code to be recognized does not exist on the system side, the character image is registered in the dictionary as an external character, and a certain symbol is registered as the external character reading result. By allocating and displaying or outputting to a file, it is possible to deal with input of unrecognizable characters that do not exist in the system. In addition, the user can easily process the reading result without outputting the meaningless recognition result, and the reading accuracy is improved.

【００４０】[0040]

【実施例】以下、図に示す実施例により本発明を詳細に
説明する。なお、これにより本発明が限定されるもので
はない。The present invention will be described in detail below with reference to the embodiments shown in the drawings. The present invention is not limited to this.

【００４１】図２は本発明の一実施例の文書読取装置の
構成図である。FIG. 2 is a block diagram of a document reading apparatus according to an embodiment of the present invention.

【００４２】この文書読取装置はＣＰＵ２１１０と、主
メモリ２１２０と、画像メモリ２１３０と、ＣＲＴ２１
４０と、キーボード２１５０と、マウス２１６０と、ス
キャナ制御部２１７０と、スキャナ２１８０と、データ
蓄積部２１９０と、磁気ディスク２２００と、光ディス
ク２２１０と、光磁気ディスク２２２０と、プリントア
ウト装置２２３０と、スピーカ２２４０と、バス２２５
０とから構成される。This document reading device includes a CPU 2110, a main memory 2120, an image memory 2130, and a CRT 21.
40, a keyboard 2150, a mouse 2160, a scanner control unit 2170, a scanner 2180, a data storage unit 2190, a magnetic disk 2200, an optical disk 2210, a magneto-optical disk 2220, a printout device 2230, and a speaker 2240. And the bus 225
It consists of 0 and.

【００４３】図１は本発明の文書読取装置の一実施例の
ブロック図である。FIG. 1 is a block diagram of an embodiment of the document reading apparatus of the present invention.

【００４４】画像入力手段１０５は前記スキャナ２１０
８とスキャナ制御部２１７０とＣＰＵ２１１０と画像メ
モリ２１３０から構成され、文書１０１を読み取って文
書画像を得て、これを一時的に記憶し、これをＣＲＴ２
５５に表示する。The image input means 105 is the scanner 210.
8, a scanner control unit 2170, a CPU 2110, and an image memory 2130. The document 101 is read to obtain a document image, which is temporarily stored and stored in the CRT 2
Display at 55.

【００４５】文字行抽出手段１１０は前記ＣＰＵ２１１
０から構成され、画像メモリ２１３０上に記憶された文
書画像から文書中の文字行を抽出する。この文字行抽出
方法は、例えば、特開昭６２−１６５２８４号公報に開
示されている。The character line extracting means 110 is the CPU 211.
A character line in the document is extracted from the document image which is composed of 0 and is stored in the image memory 2130. This character line extracting method is disclosed in, for example, Japanese Patent Laid-Open No. 62-165284.

【００４６】表裏判定手段１１５は前記ＣＰＵ２１１０
から構成され、前記文字行抽出手段１１０の結果を用い
て、入力された文書１０１が間違えて裏面で入力されて
いないかどうかを判定する。裏面で入力されたと判定さ
れた場合、リジェクト警告手段２４５に信号を送り、こ
の手段２４５が入出力制御手段２５０を介してＣＲＴ２
５５上への表示あるいはスピーカ２８０を用いて、ユー
ザに裏面入力警告を促す。また、リジェクト登録手段２
０５により、文書番号登録手段１９５で付加された文書
番号をリジェクトファイル２１０に登録する。The front / back determination means 115 is the CPU 2110.
And the result of the character line extraction means 110 is used to determine whether or not the input document 101 is mistakenly input on the back side. When it is determined that the input is made on the back side, a signal is sent to the reject warning means 245, and this means 245 causes the CRT 2 via the input / output control means 250.
The display on the screen 55 or the speaker 280 is used to prompt the user for a back side input warning. Also, the reject registration means 2
05, the document number added by the document number registration means 195 is registered in the reject file 210.

【００４７】はみ出し判定手段１２０は前記ＣＰＵ２１
１０から構成され、前記文字行抽出手段１１０の結果を
用いて、入力された文書画像がスキャナ２１０８の読み
取り領域をはみ出しているかどうかを判定する。読み取
り領域をはみ出したと判定された場合、リジェクト警告
手段２４５に信号を送り、この手段２４５が入出力制御
手段２５０を介してＣＲＴ２５５上への表示あるいはス
ピーカ２８０を用いて、ユーザに裏面入力警告を促す。
また、リジェクト登録手段２０５により、文書番号登録
手段１９５で付加された文書番号をリジェクトファイル
２１０に登録する。The protrusion determination means 120 is the CPU 21.
It is constituted by 10 and using the result of the character line extraction means 110, it is determined whether or not the input document image is outside the reading area of the scanner 2108. When it is determined that the reading area has been pushed out, a signal is sent to the reject warning means 245, and this means 245 prompts the user to give a backside input warning using the display on the CRT 255 or the speaker 280 via the input / output control means 250. .
Further, the reject registration means 205 registers the document number added by the document number registration means 195 in the reject file 210.

【００４８】傾き抽出手段１２５は前記ＣＰＵ２１１０
から構成され、画像メモリ２１３０上に記憶された文書
画像から入力文書の傾きを抽出する。この傾き抽出方法
は、例えば、特開昭６２−１４２７７号公報に開示され
ている。The inclination extracting means 125 is the CPU 2110.
The inclination of the input document is extracted from the document image stored in the image memory 2130. This inclination extraction method is disclosed in, for example, Japanese Patent Application Laid-Open No. 62-14277.

【００４９】最適文字行選択手段１３０は前記ＣＰＵ２
１１０から構成され、文字認識ベース回転角評価手段１
４５での評価用文字行として高い精度でかつ高速な処理
を実現するため、前記文字行抽出手段１１０により得ら
れた文字行から最適な複数個の文字行を選択する。The optimum character line selection means 130 is the CPU 2
Character recognition based rotation angle evaluation means 1
In order to realize high-accuracy and high-speed processing as the character line for evaluation in 45, a plurality of optimum character lines are selected from the character lines obtained by the character line extracting means 110.

【００５０】文字行画像回転手段１４０は前記ＣＰＵ２
１１０から構成され、最適文字行選択手段１３０により
選択された複数個の文字行の画像を４種類の回転角、す
なわち、傾き抽出手段１２５により得られた傾きに０度
あるいは９０度あるいは１８０度あるい２７０度加えた
回転角だけ回転する。The character line image rotating means 140 is the CPU 2
An image of a plurality of character lines, which is composed of 110 and is selected by the optimum character line selection unit 130, has four types of rotation angles, that is, the inclination obtained by the inclination extraction unit 125 has 0 degree, 90 degrees, or 180 degrees. It rotates by the rotation angle of 270 degrees.

【００５１】文字認識ベース回転角評価手段１４５は前
記ＣＰＵ２１１０から構成され、最適文字行選択手段１
３０により選択された複数個の文字行を文字行画像回転
手段１４０により４種類の回転角で回転した回転文字行
画像に対し、それぞれの回転文字行画像に対し文字切り
出しおよび文字認識を行い文字認識結果の類似度を用い
て、４種類の回転角の評価を行う。The character recognition base rotation angle evaluation means 145 comprises the CPU 2110, and the optimum character line selection means 1
A plurality of character lines selected by 30 are rotated by the character line image rotating means 140 at four types of rotation angles, and character recognition is performed by performing character segmentation and character recognition on each rotated character line image. Four kinds of rotation angles are evaluated using the similarity of the result.

【００５２】また、文書の回転角の判定を高精度に求め
るため、文書のレイアウト情報を文字認識ベース回転角
評価手段１４５の結果に加えて利用する方法について説
明する。A method of utilizing the layout information of the document in addition to the result of the character recognition-based rotation angle evaluation means 145 to obtain the determination of the rotation angle of the document with high accuracy will be described.

【００５３】文字行座標回転手段１６５は前記ＣＰＵ２
１１０から構成され、文字行抽出手段１１０により得ら
れた文字行の座標を４種類の回転角、すなわち、傾き抽
出手段１２５により得られた傾きに０度あるいは９０度
あるいは１８０度あるい２７０度加えて考慮した回転角
だけ回転する。The character line coordinate rotating means 165 is the CPU 2
110, and the coordinates of the character line obtained by the character line extraction means 110 are added to four types of rotation angles, that is, the inclination obtained by the inclination extraction means 125, 0 degree, 90 degrees, 180 degrees or 270 degrees. Rotate by the rotation angle considered.

【００５４】レイアウト情報抽出手段１７０は前記ＣＰ
Ｕ２１１０から構成され、前記文字行座標回転手段１６
５により得た４種類の回転角で回転させて得た文字行座
標に対しレイアウト情報を抽出する。このレイアウト情
報抽出方法は、例えば、特開平１−１３０２９３号公報
に開示されている。The layout information extraction means 170 uses the CP
The character line coordinate rotating means 16 is composed of U2110.
Layout information is extracted with respect to the character line coordinates obtained by rotating at four types of rotation angles obtained in step 5. This layout information extraction method is disclosed in, for example, Japanese Patent Application Laid-Open No. 1-130293.

【００５５】レイアウトベース回転角評価手段１７５は
前記ＣＰＵ２１１０から構成され、レイアウト知識１８
０とレイアウト情報抽出手段１７０で抽出した４種類の
回転角での回転させて得たレイアウト情報を用いて評価
を行う。The layout base rotation angle evaluation means 175 is composed of the CPU 2110, and the layout knowledge 18
The evaluation is performed by using 0 and the layout information obtained by rotating the layout information extracting means 170 at four kinds of rotation angles.

【００５６】文書回転角判定手段１５０は前記ＣＰＵ２
１１０から構成され、文字認識ベース回転角評価手段１
４５、あるいは、この手段とレイアウトベース回転角評
価手段１７５で得られたそれぞれの４種類の回転角での
評価結果を基にして入力文書の回転角を判定する。この
判定手段により、回転角の判定結果が曖昧であった場
合、リジェクト警告手段２４５に信号を送り、この手段
２４５が入出力制御手段２５０を介してＣＲＴ２５５上
への表示あるいはスピーカ２８０を用いて、ユーザに回
転角判定不可の警告を促す。また、リジェクト登録手段
２０５により、文書番号登録手段１９５で付加された文
書番号をリジェクトファイル２１０に登録する。The document rotation angle determination means 150 is the CPU 2
Character recognition based rotation angle evaluation means 1
45, or the rotation angle of the input document is determined on the basis of the evaluation results of the four types of rotation angles obtained by this means and the layout-based rotation angle evaluation means 175. When the determination result of the rotation angle is ambiguous by this determination means, a signal is sent to the reject warning means 245, and this means 245 uses the display on the CRT 255 or the speaker 280 via the input / output control means 250. Prompt the user with a warning that the rotation angle cannot be determined. Further, the reject registration means 205 registers the document number added by the document number registration means 195 in the reject file 210.

【００５７】画像回転手段１５５は前記ＣＰＵ２１１０
から構成され、画像メモリ２１３０上に記憶された文書
画像を文書回転角判定手段１５０により得られた回転角
だけ回転する。The image rotation means 155 is the CPU 2110.
The document image stored in the image memory 2130 is rotated by the rotation angle obtained by the document rotation angle determination means 150.

【００５８】読取手段１６０は前記ＣＰＵ２１１０から
構成され、画像回転手段１５５により回転角だけ回転さ
れた修正文書画像に対して、修正画像中の文字画像を文
字コードに変換する。The reading means 160 is composed of the CPU 2110, and converts the character image in the corrected image into a character code for the corrected document image rotated by the rotation angle by the image rotating means 155.

【００５９】読取結果修正手段２４０は前記ＣＰＵ２１
１０から構成され、入出力制御手段２５０を介して、読
取手段１６０で処理した内容に対し、ＣＲＴ２５５に読
み取り結果や修正結果を表示したり、キーボード２６０
あるいはマウス２６５を用いて読み取り結果の修正を行
う。The reading result correction means 240 is the CPU 21.
10, the reading result and the correction result are displayed on the CRT 255 for the contents processed by the reading unit 160 via the input / output control unit 250, and the keyboard 260 is used.
Alternatively, the mouse 265 is used to correct the reading result.

【００６０】文書番号登録手段１９５は前記ＣＰＵ２１
１０から構成され、入力文書１０１に対し文書番号を付
け、文書番号ファイル２００に文書番号を登録する。The document number registration means 195 is the CPU 21.
The input document 101 is provided with a document number, and the document number is registered in the document number file 200.

【００６１】入力画像登録手段１８５は前記ＣＰＵ２１
１０から構成され、文書番号登録手段１９５によりつけ
られた文書番号と共に画像メモリ２１３０上に記憶され
た入力文書画像を入力画像ファイル１９０に登録する。The input image registration means 185 is the CPU 21.
The input document image which is composed of 10 and is stored in the image memory 2130 together with the document number assigned by the document number registration means 195 is registered in the input image file 190.

【００６２】修正画像登録手段２１５は前記ＣＰＵ２１
１０から構成され、画像回転手段１５５により修正され
た修正文書画像を文書番号と共に修正画像ファイル２２
０に登録する。The corrected image registration means 215 is the CPU 21.
The modified document image composed of 10 and the modified document image modified by the image rotation means 155 together with the document number.
Register to 0.

【００６３】読取結果登録手段２２５は前記ＣＰＵ２１
１０から構成され、読取手段１６０により読み取られた
結果を文書番号と共に読取結果ファイル２３０に登録す
る。The reading result registration means 225 is the CPU 21.
The result read by the reading unit 160 is registered in the read result file 230 together with the document number.

【００６４】付加情報登録手段２７５は前記ＣＰＵ２１
１０から構成され、スキャナ２１８０から入力した情報
ではなく、キーボード２６０あるいはマウス２６５から
入力した情報を付加、あるいは、関連づけて管理する。
そして、キーボード２６０あるいはマウス２６５等のス
キャナ２１８０以外から入力した情報を付加情報ファイ
ル２７０に登録する。The additional information registration means 275 is the CPU 21.
It is composed of 10 units, and the information input from the keyboard 260 or the mouse 265, not the information input from the scanner 2180, is added or associated and managed.
Then, information input from other than the scanner 2180 such as the keyboard 260 or the mouse 265 is registered in the additional information file 270.

【００６５】ファイル制御手段２３５は前記ＣＰＵ２１
１０から構成され、上記述べたような複数個のファイル
の登録・管理、あるいは、これらファイル間での情報を
関連づける。そして、複数個のファイル間に対し同一文
書での情報同志および異文書間同志での情報の関係を用
いて管理する。The file control means 235 is the CPU 21.
It is composed of 10 and registers or manages a plurality of files as described above, or associates information between these files. Then, the management is performed by using the relationship between the information in the same document and the information in the different documents between a plurality of files.

【００６６】上記ファイル群はデータ蓄積部２１９０を
介して磁気ディスク２２００あるいは光ディスク２２１
０あるいは光磁気ディスク２２３０に格納される。The above file group is stored in the magnetic disk 2200 or the optical disk 221 via the data storage unit 2190.
0 or stored on the magneto-optical disk 2230.

【００６７】次に、本システムの大まかな処理の流れに
ついて図７を用いて説明する。Next, a rough processing flow of this system will be described with reference to FIG.

【００６８】−クレームにあわせて訂正すること。-Correct according to the claim.

【００６９】まず、画像入力２７００にて紙の文書デー
タを電子的な画像データに変換する。そして、文書番号
登録２７０５にて変換された文書画像に文書番号を付加
する。そして、文書画像登録２７１０にて文書画像を登
録する。そして、文字行抽出２７１５にて文書画像中に
存在する文字行を抽出する。そして、表裏判定２７３５
にて文字行の有無に従い入力文書が間違えて裏面を入力
されたものか否かを判定する。そして、リジェクト判定
２７４０にて入力文書をリジェクトすべき否かを判定す
る。そして、もしリジェクトする場合、リジェクト警告
２７４５そして文書番号をリジェクト登録２７５０す
る。そして、はみ出し判定２７５５にて入力文書がスキ
ャナの読み取り領域をはみ出しているか否かを判定す
る。そして、リジェクト判定２７６０にて入力文書をリ
ジェクトすべき否かを判定する。そして、もしリジェク
トする場合、リジェクト警告２７６５そして文書番号を
リジェクト登録２７７０する。そして、傾き検出２７７
２にてスキャナ読み取り領域での水平線と入力文書水平
線との角度の差である傾きを検出する。そして、最適文
字行選択２７７４にて入力文書の回転角を求める文字認
識ベース回転角評価２７７８で評価対象とする最適な文
字行を複数個選択する。そして、文字行画像回転２７７
６にて選択した複数個の文字行画像を抽出した傾きに０
度、９０度、１８０度、、２７０度を加えた４種類の回
転角だけ回転する。そして、文字認識ベース回転角評価
２７７８にてそれぞれの回転角で回転させた文字行画像
から文字を切り出し、認識させ、その時の類似度により
４種類の回転角の評価を行う。そして、レイアウト解析
評価実行２７８０にて文字認識ベース回転角評価２７７
８にレイアウト情報を用いた回転角の評価を加えるか否
かにより分岐する。もしレイアウト情報を用いた回転角
の評価も加味させる場合、文字行座標回転２７８２にて
文字行座標を抽出した傾きに０度、９０度、１８０
度、、２７０度を加えた４種類の回転角だけ回転する。
そして、レイアウト情報抽出２７８４にて４種類の回転
角での文字行座標からレイアウト情報を抽出する。そし
て、レイアウトベース回転角評価８６５５にてそれぞれ
の回転角でのレイアウト情報とレイアウト知識を用いて
回転角の評価を行う。そして、文書回転角判定２７７９
にて先に求めた文字認識ベース回転角評価結果あるいは
これとレイアウトベース回転角評価結果から入力文書の
回転角を判定する。そして、もし画像回転が必要か否か
を画像回転判定２７９０にて判定し、もし画像回転が必
要な場合には画像回転２７９２にて判定した回転角を用
いて文書画像を回転する。そして、回転した修正画像を
登録する（２７９４）。そして、読み取り２７９６にて
回転した修正画像中の文字画像を文字コードへと変換す
る。そして、読み取り結果を登録（２７９８）し、ユー
ザの指示に従って読み取り結果の修正（２７９９）を行
う。First, the image input 2700 converts paper document data into electronic image data. Then, the document number is added to the document image converted in the document number registration 2705. Then, the document image is registered in the document image registration 2710. Then, in the character line extraction 2715, the character line existing in the document image is extracted. And front / back determination 2735
At, it is determined whether or not the input document is mistakenly input on the back side according to the presence or absence of character lines. Then, a rejection determination 2740 determines whether or not the input document should be rejected. If rejected, the reject warning 2745 and the document number are reject registered 2750. Then, it is determined whether or not the input document is out of the reading area of the scanner in the out-judgment determination 2755. Then, a rejection determination 2760 determines whether or not the input document should be rejected. If rejected, the reject warning 2765 and the document number are reject registered 2770. Then, the tilt detection 277
At 2, the inclination which is the difference between the horizontal line in the scanner reading area and the horizontal line of the input document is detected. Then, in the optimum character line selection 2774, a plurality of optimum character lines to be evaluated are selected in the character recognition base rotation angle evaluation 2778 for obtaining the rotation angle of the input document. Then, the character line image rotation 277
0 is added to the inclination extracted from the plurality of character line images selected in 6.
It rotates by four types of rotation angles including degrees, 90 degrees, 180 degrees, and 270 degrees. Then, in the character recognition base rotation angle evaluation 2778, characters are cut out from the character line image rotated at each rotation angle and recognized, and four types of rotation angles are evaluated according to the similarity at that time. Then, in layout analysis evaluation execution 2780, character recognition based rotation angle evaluation 277 is performed.
8 is branched depending on whether or not the evaluation of the rotation angle using the layout information is added. If the evaluation of the rotation angle using the layout information is also taken into consideration, the inclination of extracting the character line coordinates in the character line coordinate rotation 2782 is 0 °, 90 °, 180 °.
Rotate by 4 kinds of rotation angles, which are 270 degrees and 270 degrees.
Then, layout information extraction 2784 extracts layout information from the character line coordinates at four types of rotation angles. Then, the layout base rotation angle evaluation 8655 evaluates the rotation angle using the layout information and the layout knowledge at each rotation angle. Then, the document rotation angle determination 2779
The rotation angle of the input document is determined based on the character recognition-based rotation angle evaluation result previously obtained or in the layout-based rotation angle evaluation result. Then, whether the image rotation is necessary is determined by the image rotation determination 2790, and when the image rotation is required, the document image is rotated using the rotation angle determined by the image rotation 2792. Then, the rotated corrected image is registered (2794). Then, the character image in the corrected image rotated by reading 2796 is converted into a character code. Then, the read result is registered (2798), and the read result is corrected (2799) according to the instruction of the user.

【００７０】次に、入力画像から抽出した文字行の有無
に従い入力文書が表で正常に入力されたものか間違えて
裏で入力されたものかを判定する方法について説明す
る。Next, a method of determining whether the input document is normally input on the front side or mistakenly input on the back side according to the presence / absence of character lines extracted from the input image will be described.

【００７１】図９に示すように、まず、文字行を抽出す
る（９００）。そして、文字行が存在するか否かを判定
する（９１０）。もし裏面が入力された場合、入力文書
は白紙であるため、文字行が抽出されないので文字行が
存在しない。すなわち、文字行が存在すれば次処理を実
行し（９２０）、文字行が存在しなければ裏面で入力さ
れたものと判定しリジェクト処理を実行する（９３
０）。リジェクト処理では、ユーザにリジェクト入力で
あることを促すかあるいはリジェクト文書としてその文
書番号をリジェクト文書番号ファイルに登録する。As shown in FIG. 9, first, a character line is extracted (900). Then, it is determined whether a character line exists (910). If the back side is input, the input document is blank and the character line is not extracted, so there is no character line. That is, if the character line exists, the next process is executed (920), and if the character line does not exist, it is determined that the input is made on the back side and the reject process is executed (93).
0). In the reject processing, the user is prompted to reject input or the document number is registered as a reject document in the reject document number file.

【００７２】次に、読み取るべき文字行がスキャナ読み
取り領域外にあるか否かを判定する方法について説明す
る。Next, a method of determining whether the character line to be read is outside the scanner reading area will be described.

【００７３】図１０に示すように、まず、スキャナ読み
取り領域をはみ出した否かを示すはみ出しフラグをセッ
トする（１０００）。そして、文字行数回、次の処理を
繰り返す（１００５）。はみ出し文字行の回数を示すＣ
ＯＵＮＴを０にセットする（１０１０）。そして、カレ
ント文字行の頂点数である４回、次の処理を繰り返す
（１０１５）。１０２０でスキャナ読み取り領域境界線
である（＊、０）上に注目している頂点が存在するか否
かを判定する。もし１０２０を満たせばＣＯＵＮＴをイ
ンクリメントとする（１０２５）。ここで、＊は任意の
数値であることを示す。そして、１０３０でスキャナ読
み取り領域境界線である（０、＊）上に注目している頂
点が存在するか否かを判定する。もし１０３０を満たせ
ばＣＯＵＮＴをインクリメントとする（１０３５）。そ
して、１０４０でスキャナ読み取り領域境界線である
（Ｘｅ、＊）上に注目している頂点が存在するか否かを
判定する。もし１０４０を満たせばＣＯＵＮＴをインク
リメントとする（１０４５）。ここで、Ｘｅはスキャナ
読み取り領域の最大Ｘ座標である。そして、１０５０で
スキャナ読み取り領域境界線である（＊、Ｙｅ）上に注
目している頂点が存在するか否かを判定する。もし１０
５０を満たせばＣＯＵＮＴをインクリメントとする（１
０５５）。ここで、Ｙｅはスキャナ読み取り領域の最大
Ｙ座標である。そして、１０６０にてＣＯＵＮＴが２以
上であるか否かを判定する。ここで、ＣＯＵＮＴが２以
上であるということは文字行がスキャナの非読み取り領
域にあり、入力文書がはみ出していることを示す。も
し、ＣＯＵＮＴが２以上ならばその文字行の番号を登録
する（１０６５）。そして、はみ出しフラグを１にセッ
トする（１０７０）。以上の処理が終了して、はみ出し
フラグが１でセットされているか否かを判定する（１０
７５）。そして、もし、はみ出しフラグが１でセットさ
れていれば、リジェクト処理を実行する（１０８０）。As shown in FIG. 10, first, a protrusion flag indicating whether or not the scanner reading region is protruded is set (1000). Then, the next process is repeated several times for the character lines (1005). C, which indicates the number of overhanging character lines
OUNT is set to 0 (1010). Then, the next process is repeated four times, which is the number of vertices of the current character line (1015). At 1020, it is determined whether or not the vertex of interest exists on the scanner reading area boundary line (*, 0). If 1020 is satisfied, COUNT is incremented (1025). Here, * indicates an arbitrary numerical value. Then, at 1030, it is determined whether or not the vertex of interest exists on the scanner reading area boundary line (0, *). If 1030 is satisfied, COUNT is incremented (1035). Then, at 1040, it is determined whether or not the vertex of interest exists on the scanner reading area boundary line (Xe, *). If 1040 is satisfied, COUNT is incremented (1045). Here, Xe is the maximum X coordinate of the scanner reading area. Then, at 1050, it is determined whether or not the vertex of interest exists on the scanner reading area boundary line (*, Ye). If 10
If 50 is satisfied, COUNT is incremented (1
055). Here, Ye is the maximum Y coordinate of the scanner reading area. Then, at 1060, it is determined whether COUNT is 2 or more. Here, the fact that COUNT is 2 or more means that the character line is in the non-reading area of the scanner and the input document is protruding. If COUNT is 2 or more, the number of the character line is registered (1065). Then, the protrusion flag is set to 1 (1070). After the above processing is completed, it is determined whether or not the protrusion flag is set to 1 (10
75). If the protrusion flag is set to 1, reject processing is executed (1080).

【００７４】次に、はみ出し文字行が検出された場合の
画面表示について説明する。Next, the screen display when the protruding character line is detected will be described.

【００７５】図１１に示すように、画面上１１００に表
示されたウインド１１１０上に文字行が表示される。こ
の時、はみ出した文字行であることが容易に判り易いよ
うに、文字行がはみ出していないもの１１２０と文字行
がはみ出したもの１１３０の表示の色を違えて表示す
る。As shown in FIG. 11, character lines are displayed on the window 1110 displayed on the screen 1100. At this time, in order to make it easy to recognize that the character line is protruding, the display color of the object 1120 in which the character line does not protrude and the display 1130 in which the character line protrudes are displayed in different colors.

【００７６】次に、ユーザが入力文書の傾きを調整する
方法について説明する。Next, a method for the user to adjust the inclination of the input document will be described.

【００７７】図１２に示すように、画面上１２００に表
示されたウインド１２１０上に文字行１２２０、傾きイ
ンディケータ１２４０、その初期位置１２３０、およ
び、傾きインディケータを操作するポインティングデバ
イス１２５０を表示する。この場合、入力画像として傾
きがあり、それに加えて１８０度回転した文書である。
そして、ユーザは中央部に表示された傾きインディケー
タを表示された文字行に直接重ねることが出来、容易に
かつ高精度に傾きを調節することが出来る。そして、調
節が完了したら、操作パネル１２６０上の終了ボタン１
２７０あるいはキャンセルボタン１２８０により処理を
終える。As shown in FIG. 12, a character line 1220, a tilt indicator 1240, its initial position 1230, and a pointing device 1250 for operating the tilt indicator are displayed on a window 1210 displayed on a screen 1200. In this case, the input image has a tilt and is a document rotated by 180 degrees.
Then, the user can directly superimpose the tilt indicator displayed in the center on the displayed character line, and can easily and accurately adjust the tilt. Then, when the adjustment is completed, the end button 1 on the operation panel 1260
The process is terminated by pressing 270 or the cancel button 1280.

【００７８】次に、最適文字行選択手段１３０での文字
行の選択方法について説明する。Next, a method of selecting a character line by the optimum character line selection means 130 will be described.

【００７９】図８に示すように、まず、選択する文字行
のカウンタであるＣＯＵＮＴの初期化を行う（８０
０）。そして、カウンタＣＯＵＮＴが選択文字行数に至
るまで、以下の処理を繰り返す（８１０）。カレント文
字行の縦横比あるいは横縦比がある一定値以上の場合
（８２０）、その文字行を最適文字行の一つとする（８
３０）。そして、カウンタＣＯＵＮＴをインクリメント
とする（８４０）。そして、選択文字数に至った場合
（８５０）、最適文字行選択手段を終了する（８６
０）。ここで、カレント文字行の縦横比がある一定値以
上の場合は横書き文字行を示し、横縦比がある一定値以
上の場合は縦書き文字行を示す。このように、ある一定
値で判定した根拠として、一つには文字認識の類似度に
より回転角を判定する場合、高い精度で判定する必要が
あり、一文字行中に複数の文字が存在するようにするた
めである。その根拠は、一文字行中に含まれる文字が少
ない場合、日本語の「口」や漢数字「一」等は０度、９
０度、１８０度、２７０度と回転しても形状に大きな差
が無く文字認識の類似度による回転角判定が困難である
からである。例えば、章番号等は「１．１」のように書
くため、これが文字行として判定されると０度の回転角
なのか１８０度の回転角なのか判定は曖昧になる。As shown in FIG. 8, first, COUNT, which is a counter of selected character lines, is initialized (80).
0). Then, the following processing is repeated until the counter COUNT reaches the number of selected character lines (810). When the aspect ratio or the aspect ratio of the current character line is equal to or greater than a certain value (820), the character line is regarded as one of the optimum character lines (8
30). Then, the counter COUNT is incremented (840). When the number of selected characters is reached (850), the optimum character line selection means is terminated (86).
0). Here, when the aspect ratio of the current character line is equal to or greater than a certain value, it indicates a horizontally written character line, and when the aspect ratio is greater than a certain value, it indicates a vertically written character line. In this way, as a basis for judging with a certain constant value, one is that when judging the rotation angle based on the similarity of character recognition, it is necessary to judge with high accuracy, and it seems that there are multiple characters in one character line. This is because The reason is that if there are few characters in one character line, Japanese words such as "mouth" and Chinese numerals "1" are 0 degrees, 9
This is because there is no big difference in the shapes even if they are rotated by 0 degrees, 180 degrees, and 270 degrees, and it is difficult to determine the rotation angle based on the similarity of character recognition. For example, since the chapter number or the like is written as "1.1", if this is determined as a character line, the determination as to whether the rotation angle is 0 degrees or 180 degrees becomes ambiguous.

【００８０】次に、文字行画像回転手段１４０および文
字認識ベース回転角評価手段１４５について説明する。Next, the character line image rotation means 140 and the character recognition base rotation angle evaluation means 145 will be described.

【００８１】まず、図１３に示すように、最適文字行選
択手段１３０により選択された複数個の文字行の画像１
３０５と入力文書画像の傾き１３００を４種類の回転処
理部（１３１５〜１３２５）に入力させ、各回転処理部
にて複数個の文字行の画像を（−ａ）度、（９０−ａ）
度、（１８０−ａ）度、（２７０−ａ）度だけ回転させ
た文字行画像を得る。そして、１３３０において、各回
転角での文字行画像を対象に、文字切り出し（１３３５
〜１３５０）、文字認識（１３５５〜１３７０）を実行
する。そして、文字類似度評価処理部（１３７５〜１３
９０）にて、文字認識の類似度を用いそれぞれの回転角
での文字行画像に対する評価値を求める。ここで、文字
類似度評価処理部の評価関数としては全ての文字の類似
度の平均値あるいは中央値等が利用される。そして、最
良回転角判定部１３９７にて、各回転角で求められた評
価値から文書回転角１３９９を判定し、評価値が曖昧な
場合にはリジェクト情報１３９９を出力する。First, as shown in FIG. 13, an image 1 of a plurality of character lines selected by the optimum character line selection means 130.
305 and the inclination 1300 of the input document image are input to four types of rotation processing units (1315-1325), and images of a plurality of character lines are (-a) degrees, (90-a) in each rotation processing unit.
A character line image rotated by 180 degrees, (180-a) degrees, and (270-a) degrees is obtained. Then, in 1330, character segmentation (1335) is performed on the character line image at each rotation angle.
˜1350) and character recognition (1355-1370). Then, the character similarity evaluation processing unit (1375 to 13)
In 90), the evaluation value for the character line image at each rotation angle is obtained using the similarity of character recognition. Here, as the evaluation function of the character similarity evaluation processing unit, the average value or the median value of the similarity of all characters is used. Then, the best rotation angle determination unit 1397 determines the document rotation angle 1399 from the evaluation value obtained at each rotation angle, and outputs reject information 1399 when the evaluation value is ambiguous.

【００８２】次に、文字認識ベース回転角評価手段１４
５の一実施例について説明する。Next, the character recognition based rotation angle evaluation means 14
Example 5 will be described.

【００８３】図１４に示すように、入力文書の回転角を
判定する際、類似度による評価が曖昧な場合には対象文
書をリジェクト扱いにしようというものであり、まず、
得られた４種類の回転角での文字類似度評価値を入力す
る（１４００）。そして、最良の評価値（ａ）を求め
（１４１０）、次点の評価値（ｂ）を求める（１４２
０）。そして、ａ−ｂの絶対値がある値Ｋよりも大きい
か否かを判定する（１４３０）。もし条件を満足すれ
ば、文書の回転角を決定し（１４４０）、そうでない場
合にはリジェクト処理を行う（１４５０）。As shown in FIG. 14, when determining the rotation angle of an input document, if the evaluation based on the similarity is ambiguous, the target document is treated as rejected.
The character similarity evaluation values at the obtained four types of rotation angles are input (1400). Then, the best evaluation value (a) is calculated (1410), and the evaluation value (b) of the next point is calculated (142).
0). Then, it is determined whether the absolute value of ab is larger than a certain value K (1430). If the condition is satisfied, the rotation angle of the document is determined (1440), and if not, reject processing is performed (1450).

【００８４】次に、文字行座標回転手段１６５、レイア
ウト情報抽出手段１７０とレイアウトベース回転角評価
手段１７５について説明する。Next, the character line coordinate rotation means 165, the layout information extraction means 170 and the layout base rotation angle evaluation means 175 will be described.

【００８５】まず、図１５に示すように、入力文書画像
の傾き１５００と最適文字行選択手段１３０により選択
された複数個の文字行の座標１５０５を４種類の回転処
理部（１５１０〜１５２５）に入力させ、各回転処理部
にて複数個の文字行の座標を（−ａ）度、（９０−ａ）
度、（１８０−ａ）度、（２７０−ａ）度だけ回転させ
た文字行座標を得る。そして、レイアウト解析処理部
（１５３０〜１５４５）において、各回転角での文字行
座標からレイアウト解析を実行する。そして、レイアウ
ト照合部（１５５０〜１５６５）にて、レイアウト知識
（１５７０〜１５８５）を用い各回転角の評価値（１５
９０〜１５９７）を求める。First, as shown in FIG. 15, the inclination 1500 of the input document image and the coordinates 1505 of the plurality of character lines selected by the optimum character line selection means 130 are set in four types of rotation processing units (1510 to 1525). Input the coordinates of multiple character lines in each rotation processing unit (-a) degrees, (90-a)
The character line coordinates rotated by degrees, (180-a) degrees, and (270-a) degrees are obtained. Then, the layout analysis processing unit (1530 to 1545) executes layout analysis from the character line coordinates at each rotation angle. Then, the layout collation unit (1550 to 1565) uses the layout knowledge (1570 to 1585) to evaluate the rotation angle (15).
90-1597).

【００８６】次に、予め入力文書の縦書き・横書き情報
がわかっている場合のレイアウト情報を用いた回転角の
評価方法について説明する。Next, a method of evaluating the rotation angle using the layout information when the vertical writing / horizontal writing information of the input document is known in advance will be described.

【００８７】まず、図１６に示すように、図１５の場合
と異なるのはレイアウト照合時に予めレイアウト情報が
わかっているためレイアウト情報信号（１６９９）をレ
イアウト照合部（１６５０〜１６６５）に入力させ、照
合させるレイアウト知識（１６７０〜１６８５）を限定
させるものである。First, as shown in FIG. 16, the difference from the case of FIG. 15 is that the layout information signal (1699) is input to the layout collating section (1650 to 1665) because the layout information is known in advance at the time of layout collation. The layout knowledge (1670 to 1685) to be collated is limited.

【００８８】次に、レイアウト情報を照合させる方法に
ついて説明する。Next, a method of collating layout information will be described.

【００８９】文書は１つ以上の節（ブロック）から構成
され、図１７に文書を構成するブロックが必ず持つ属性
の縦書き、横書き情報を０度、９０度、１８０度、２７
０度回転した時のイメージを示す。この図からわかるよ
うに、横書き・縦書きを各回転させたもののどれも文字
が始まる字下げ座標と中途で終了する文末の座標に特徴
があり、この特徴を用いて照合することで入力文書の回
転角を求めることが出来る。レイアウト知識内に図１７
の（１）の（８）の情報を格納し、これとレイアウト解
析させて得られる図１７の（１）から（８）のどのパタ
ンとが近いかを調べることで入力文書の回転角の識別が
可能になる。A document is composed of one or more sections (blocks), and the vertical writing and horizontal writing information of the attributes that the blocks constituting the document necessarily have in FIG. 17 are 0 degrees, 90 degrees, 180 degrees, and 27 degrees.
The image when rotated 0 degrees is shown. As you can see from this figure, each of the horizontal and vertical writing rotated has a feature in the indentation coordinates at which the characters begin and the coordinates at the end of the sentence that ends midway. The rotation angle can be calculated. Figure 17 in the layout knowledge
The rotation angle of the input document is identified by storing the information of (8) of (1) of (1) and checking which pattern of (1) to (8) of FIG. Will be possible.

【００９０】次に、文字認識ベース回転角評価手段１４
５の結果とレイアウトベース回転角評価手段１７５の結
果とを合わせた文書回転角の判定方法について説明す
る。Next, the character recognition based rotation angle evaluation means 14
A method of determining the document rotation angle that combines the result of No. 5 and the result of the layout-based rotation angle evaluation unit 175 will be described.

【００９１】図１８に示すように、文字認識ベース回転
角評価値（ａ）を求める（１８００）。そして、レイア
ウトベース回転角評価値（ｂ）を求める（１８１０）。
そして、文書回転各判定（１８２０）にて、値ａとｂが
等しい場合には回転角を決定し（１８３０）、読み取り
処理を行う（１８４０）。また、値が等しくない場合に
はリジェクト処理を実行する（１８５０）。As shown in FIG. 18, a character recognition base rotation angle evaluation value (a) is obtained (1800). Then, the layout base rotation angle evaluation value (b) is obtained (1810).
Then, in each of the document rotation determinations (1820), when the values a and b are equal, the rotation angle is determined (1830) and the reading process is performed (1840). If the values are not equal, reject processing is executed (1850).

【００９２】次に、求めた入力文書の回転角を用いて文
書画像中の文字画像を文字コードに変換する処理方法に
ついて説明する。Next, a processing method for converting a character image in a document image into a character code by using the obtained rotation angle of the input document will be described.

【００９３】図１９に示すように、文字行座標１９０
５、文書回転角１９１０と入力画像１９１５を画像回転
部１９２０に入力する。そして、画像回転部１９２０に
て文書回転角だけ入力文書を回転修正する。そして、文
字行抽出部にて新たに修正画像中から文字行を抽出す
る。そして、文字切出部１９３５にて文字を切り出し、
文字認識部１９４０にて文字認識を行い、文字コード１
９５５に変換し出力する。As shown in FIG. 19, character line coordinates 190
5. Input the document rotation angle 1910 and the input image 1915 into the image rotation unit 1920. Then, the image rotation unit 1920 rotates and corrects the input document by the document rotation angle. Then, the character line extraction unit newly extracts a character line from the corrected image. Then, the character cutting unit 1935 cuts out the character,
Character recognition unit 1940 performs character recognition and character code 1
Converted to 955 and output.

【００９４】次に、求めた入力文書の回転角とレイアウ
ト情報を用いて文書画像中の文字画像を文字コードに変
換する処理方法について説明する。Next, a processing method for converting a character image in a document image into a character code by using the obtained rotation angle and layout information of the input document will be described.

【００９５】図２０に示すように、文書回転角２１１０
と文字行部分画像２１０５を部分画像回転部２１２５に
入力する。そして、画像回転部にて文書回転角だけ入力
文書を回転修正する。また、文字行番号２１１４とレイ
アウト情報２１１５を読み順決定部２１２０に入力す
る。この読み順決定部２１２０で文字コードに変換して
いく文字行の順序求める。そして、回転文字行画像と読
み順情報を文字切出部２１３０に入力する。そして、読
み順決定部２１２０で得た順番に従い、文字切出部２１
３０にて文字を切り出し、文字認識部２１３５にて文字
認識を実行し、文字コード１９５５を出力する。As shown in FIG. 20, the document rotation angle 2110.
And the character line partial image 2105 are input to the partial image rotation unit 2125. Then, the image rotation unit corrects the rotation of the input document by the document rotation angle. Further, the character line number 2114 and the layout information 2115 are input to the reading order determination unit 2120. The reading order determination unit 2120 obtains the order of character lines to be converted into character codes. Then, the rotated character line image and the reading order information are input to the character cutout unit 2130. Then, according to the order obtained by the reading order determining unit 2120, the character cutting unit 21
A character is cut out at 30, a character recognition unit 2135 executes character recognition, and a character code 1955 is output.

【００９６】次に、自動的に入力文書の頁番号を読み取
り、これを付加情報として登録する方法について説明す
る。Next, a method of automatically reading the page number of the input document and registering it as additional information will be described.

【００９７】図２１に示すように、回転処理部２２１０
に回転角２２００および入力画像２２０５を入力し文書
画像を回転修正する。そして、頁番号を認識するため
に、まず、頁番号を修正画像から抽出する必要から頁番
号レイアウト情報ファイル２２２０に格納されている知
識を用いて頁番号画像抽出部２２１５にて頁番号部分画
像を抽出する。次に、抽出した頁番号部分画像から頁番
号を一文字づつ文字切り出し部２２２５にて切り出し、
文字認識部２２３０にて画像データから文字コードに変
換する。最後に、認識した頁番号を付加情報として付加
情報登録部２２３５にてこれを付加情報ファイル２２４
０に登録する。As shown in FIG. 21, the rotation processing unit 2210.
The rotation angle 2200 and the input image 2205 are input to and the document image is rotated and corrected. In order to recognize the page number, the page number image extraction unit 2215 uses the knowledge stored in the page number layout information file 2220 to extract the page number partial image from the corrected image. Extract. Next, the page number is cut out from the extracted page number partial image by the character cutting unit 2225 one by one,
The character recognition unit 2230 converts the image data into a character code. Finally, the recognized page number is treated as additional information by the additional information registration unit 2235, and this is added to the additional information file 224.
Register to 0.

【００９８】次に、登録される情報について説明する。
登録されるデータは文書番号、入力画像、リジェクト番
号、修正画像、読取結果そして付加情報がある。この付
加情報には、さらに、図２２に示すように、文書名２３
０５、著者名１：２３１０、著者名２：２３１５、雑誌
名２３２０、入手先２３２５、入手日時２３３０等の情
報が登録され、同一文書同士内で関係情報が結び付けら
れる。そして、図２３に示すようにファイル制御手段２
４４０が文書同士の関係を抽出し、文書同士関係ファイ
ル２４４５に登録する。Next, the registered information will be described.
The registered data includes a document number, input image, reject number, corrected image, reading result, and additional information. As shown in FIG. 22, the additional information further includes a document name 23
05, author name 1: 2310, author name 2: 2315, magazine name 2320, source 2325, date and time of acquisition 2330, etc. are registered, and related information is linked within the same document. Then, as shown in FIG. 23, the file control means 2
440 extracts the relationship between the documents and registers it in the document relationship file 2445.

【００９９】図２４に、雑誌名に関する情報ファイルを
示す。このファイルには項目内容とその項目での文書番
号が登録されている。他の登録項目のファイルも同様な
構成である。文書同士関係を生成するファイル制御手段
は各登録項目のファイルの参照時に、同じ項目内容をも
つ文書番号は相互に関係があるものとし、文書番号から
順に関係のある文書番号をリスト化していく。これによ
り、図２５の示す如き文書同士関係ファイルが作成され
ることになる。FIG. 24 shows an information file relating to the magazine name. The content of the item and the document number of the item are registered in this file. Files of other registered items have the same structure. The file control means for generating a document relation assumes that document numbers having the same item contents are related to each other when referring to files of each registered item, and lists related document numbers in order from the document number. As a result, a document relationship file as shown in FIG. 25 is created.

【０１００】以上述べたように、上記システムでは同一
文書内の情報の検索はもちろん、登録項目からの関連文
書の検索、また、ユーザが所望の文書に関する情報がお
ぼろげである場合でも、文書同士関係を用いて他の文書
からでも所望の文書に関する情報を入手することが出来
る。As described above, in the system described above, not only the information in the same document is searched, but also the related document is searched from the registered item, and even if the information about the document desired by the user is vague, the documents are not matched. Relationships can be used to obtain information about a desired document from other documents as well.

【０１０１】次に、Ａ４スキャナでの読み取り時に生じ
ることがある読み取り領域からの文書のはみ出しを解決
する方法を図２７を用いて説明する。Next, a method for solving the protrusion of the document from the reading area which may occur at the time of reading with the A4 scanner will be described with reference to FIG.

【０１０２】電子ファイリング装置ではスキャナとして
Ａ３読み取り可能のものが多く、文書をＡ４で入力する
のでは無く、Ａ３の読み取り領域で入力し（２８０
０）、その後は図７で示した方法と同様な方法にて入力
文書画像中の内容を読み取る（２８０５−２８７０）。
ここで、図７と異なるのははみ出し判定が不必要になる
ことと、それに伴うリジェクト処理が不要になることで
ある。また、処理に関しては図７で示した処理領域を拡
張するのみで良いため容易に実現できる。Many electronic filing apparatuses are capable of reading A3 as a scanner, and a document is not input in A4 but is input in the reading area of A3 (280
0) and thereafter, the contents in the input document image are read by the same method as that shown in FIG. 7 (2805-2870).
Here, what is different from FIG. 7 is that the protrusion determination is unnecessary and the reject process accompanying it is unnecessary. Further, the processing can be easily realized because only the processing area shown in FIG. 7 needs to be expanded.

【０１０３】次に、先に示した方法では毎回Ａ３読み取
り領域を処理するため処理時間がＡ４対応の場合に比べ
て掛かってしまう。そのため、まず、Ａ４読み取り領域
で画像を入力して（２９００）、はみ出し処理を行ない
（２９０５）、その結果に基づきはみ出し判定する（２
９１０）。はみ出した場合、Ａ３読み取り領域で画像を
再入力し（２９１５）、図２７で示した処理（２８０５
−２８７０）ここでは処理２９を実行する（２９２
０）。また、読み取り領域を文書画像がはみ出していな
い場合、Ａ４読み取り領域の画像に対して処理２９を実
行する（２９２５）。以上述べた方法により、ユーザが
読み取りたい文書が読み取り領域をはみ出した場合、毎
回入力文書を整えて入力し直すこと無く、自動的に内容
を読み取ることが出来る。Next, in the method described above, since the A3 reading area is processed every time, the processing time is longer than that in the case of supporting A4. Therefore, first, an image is input in the A4 reading area (2900), a protrusion process is performed (2905), and a protrusion determination is performed based on the result (2900).
910). If it does, the image is re-input in the A3 reading area (2915) and the processing shown in FIG.
-2870) Here, the process 29 is executed (292).
0). If the document image does not extend beyond the reading area, processing 29 is performed on the image in the A4 reading area (2925). By the method described above, when the user wants to read the document out of the reading area, the contents can be automatically read without adjusting the input document and re-inputting each time.

【０１０４】次に、読み取るべき文字行がスキャナ読み
取り領域外にあるか否かを判定する方法について図２９
を用いて説明する。Next, a method for determining whether or not the character line to be read is outside the scanner reading area will be described with reference to FIG.
Will be explained.

【０１０５】図１０では文字行の４すみの座標により入
力文書が読み取り領域をはみ出したか否かを判定する方
法を示した。ここでは、もっと簡易な方法で入力文書が
読み取り領域をはみ出したか否かを判定する方法を示
す。図２９は、入力画像（３０００）に文字行（３００
５）が存在しそれがスキャナ読み取り領域をはみ出して
いる図を示している。ここでは、上辺、底辺、左辺、右
辺に対しＮビットの幅を持つ矩形（例えば、３０１０、
３０１５）に対し周辺分布あるいは累積黒ドット数を積
算する。入力文書が読み取り領域をはみ出した場合、必
ず４辺のどれかに接触するため４つのどれか一つ以上の
矩形の累積黒ドット数はある値Ｖを超える。図２９の場
合、矩形３０１５の累積黒ドット数が値Ｖを超えてしま
う。このように、４辺の矩形中の累積黒ドット数の値を
調べることにより容易に入力文書が読み取り領域をはみ
出したか否かを判定することが出来る。FIG. 10 shows a method of determining whether or not the input document is out of the reading area based on the coordinates of the four corners of the character line. Here, a method for determining whether or not the input document extends beyond the reading area by a simpler method will be shown. FIG. 29 shows that the input image (3000) has a character line (300
5) is present and it extends beyond the scanner reading area. Here, a rectangle having a width of N bits with respect to the top side, the bottom side, the left side, and the right side (for example, 3010,
3015), the peripheral distribution or the cumulative number of black dots is integrated. When the input document extends beyond the reading area, it always touches any of the four sides, so the cumulative number of black dots in any one of the four rectangles exceeds a certain value V. In the case of FIG. 29, the cumulative number of black dots in the rectangle 3015 exceeds the value V. In this way, by checking the values of the cumulative number of black dots in the four-sided rectangle, it is possible to easily determine whether or not the input document exceeds the reading area.

【０１０６】次に、読み取る入力文書の方向角を決定す
る際に文字認識の結果を利用するが、この時、文字画像
あるい文字行画像を任意の角度に回転させて文字認識を
行う必要がある。この回転の方法としてビットごとに回
転を行う方法が考えられるがこれは処理量が少なくは無
い、そのため、文字行画像中から文字画像を取り出し、
文字認識方法として文字の骨格あるいは輪郭を用いるも
のに対しては、文字の骨格あるいは輪郭を、例えば図３
１に示す８方向のチェーンコードで表現し、回転に必要
な角度だけチェーンコードの番号をずらすのみで処理量
を大幅に削減し容易に実現できる。回転角の文か伊能に
応じて１６方向あるいは３２方向と言うようにチェーン
コードの方向数を増せば容易に細かい角度での回転が行
える。例えば、「但」と言う文字の骨格データに対し８
方向のチェーンコードでこれを表現した図２９に示す。
このように、骨格データを８方向のチェーンで表現で
き、容易に４５度単位で回転が行える。Next, the result of character recognition is used in determining the direction angle of the input document to be read. At this time, it is necessary to rotate the character image or the character line image at an arbitrary angle for character recognition. is there. As a method of this rotation, a method of performing rotation for each bit is conceivable, but this processing amount is not small, so a character image is extracted from the character line image,
For a method that uses a character skeleton or contour as a character recognition method, the character skeleton or contour is described in, for example, FIG.
It is expressed by the 8-direction chain code shown in 1 and can be easily realized by greatly reducing the processing amount only by shifting the chain code number by the angle required for rotation. If the number of directions of the chain cord is increased to say 16 directions or 32 directions depending on the sentence of the rotation angle or Ino, it is possible to easily rotate at a fine angle. For example, 8 for the skeletal data of the character "Ta"
This is shown in FIG. 29 in which this is expressed by the chain code of the direction.
In this way, the skeleton data can be represented by a chain in 8 directions, and rotation can be easily performed in 45 degree units.

【０１０７】[0107]

【発明の効果】本発明の文書読取装置あるいは電子ファ
イル装置あるいはファクシミリあるいは複写機あるいは
計算機によれば、文書がスキャナの設定方向に対し任意
の角度（０度から３６０度）で回転されて入力された場
合でも、入力文書の回転角を検出し、正しい方向に入力
画像を修正し、その内容を読み取ることが出来る。ま
た、ユーザに修正した画像を提示あるいは蓄積すること
が出来る。According to the document reading device, the electronic file device, the facsimile, the copying machine, or the computer of the present invention, the document is input by being rotated at an arbitrary angle (0 to 360 degrees) with respect to the setting direction of the scanner. Even if the input document is rotated, the rotation angle of the input document can be detected, the input image can be corrected in the correct direction, and the content can be read. Further, the corrected image can be presented or stored to the user.

【０１０８】また、入力された文書がユーザが間違えて
裏面で入力されたものかを判定し、裏面入力時にはユー
ザにメッセージを促すことが出来る。また、入力された
文書の読み取るべき部分がスキャナの読み取り領域から
はみ出しているかを判定し、はみ出して入力された場合
にはユーザにメッセージを促すことが出来る。Further, it is possible to judge whether the input document is mistakenly input on the back side by the user and prompt the user for a message when inputting on the back side. In addition, it is possible to determine whether or not the portion of the input document to be read extends beyond the reading area of the scanner, and if the portion is input beyond the reading area, a message can be prompted to the user.

【０１０９】また、文書に記載されていない文書に関わ
る入手先、入手日時、メモ等の付加情報を入力画像やそ
の処理結果に対応づけて記録し、文書情報を指定して、
その文書に関連のある情報を検索し、取り出すことが出
来る。また、関連のある他の文書同志の関係情報を記録
し、その文書同志関係情報を検索して、ある文書からた
の文書をたぐり、所望の文書に関わる情報を検索し、取
り出すことが出来る。Further, additional information such as a source, a date and time of acquisition, a memo, etc. relating to a document which is not described in the document is recorded in association with the input image and the processing result thereof, and the document information is designated.
You can search and retrieve information related to the document. Further, it is possible to record the related information of other related documents, search for the related information of the documents, search for the document from a certain document, and search for and retrieve the information related to the desired document.

【０１１０】また、入力文書がＡ４スキャナ読み取り領
域をはみ出しても、はみ出し領域を判定し再度Ａ３スキ
ャナ読み取り領域で文書画像を自動的に入力することに
より、文書に記載された内容を漏らすこと無く読み取る
ことが出来る。Even if the input document extends beyond the A4 scanner reading area, the extension area is determined, and the document image is automatically input again in the A3 scanner reading area to read the content described in the document without leaking. You can

【０１１１】さらに、データ登録時にファイル容量をチ
ェックすることで、処理結果が格納か否かを判定し、ユ
ーザにメッセージを促すことが出来る。また、大量な文
書を入力し、入力画像の回転角を修正し、修正画像を蓄
積すると同時にその画像を管理することが出来る。Furthermore, by checking the file capacity at the time of data registration, it is possible to judge whether or not the processing result is stored and prompt the user for a message. Also, a large amount of documents can be input, the rotation angle of the input image can be corrected, the corrected images can be stored, and the images can be managed at the same time.

【０１１２】のユーザの使い勝手を考慮した文書読取装
置あるいは電子ファイル装置あるいはファクシミリ装置
あるいは複写機あるいは計算機を提供することにある。It is another object of the present invention to provide a document reading device, an electronic file device, a facsimile device, a copying machine, or a computer in consideration of user's usability.

[Brief description of drawings]

【図１】本発明の文書読取装置の一実施例のブロック図
である。FIG. 1 is a block diagram of an embodiment of a document reading device of the present invention.

【図２】本発明の文書読取装置の一実施例のブロック図
である。FIG. 2 is a block diagram of an embodiment of a document reading device of the present invention.

【図３】スキャナの捜査線の方向とシステムの処理方向
を示した図である。FIG. 3 is a diagram showing a scanning line direction of a scanner and a processing direction of a system.

【図４】文書がスキャナ上に傾いて設定された状態を示
す図である。FIG. 4 is a diagram showing a state in which a document is tilted on a scanner and set.

【図５】文書がシステムの処理方向と反転して設定され
た状態を示す図である。FIG. 5 is a diagram showing a state in which a document is set upside down with respect to the processing direction of the system.

【図６】文書がシステムの処理方向と任意の回転角で設
定された状態を示す図である。FIG. 6 is a diagram showing a state in which a document is set with a processing direction of the system and an arbitrary rotation angle.

【図７】本発明の文書読取装置の大まかな処理の過程を
示した図である。FIG. 7 is a diagram showing a rough process of the document reading apparatus of the present invention.

【図８】最適な文字行を選択するフローを示した図であ
る。FIG. 8 is a diagram showing a flow for selecting an optimum character line.

【図９】間違えて裏面で入力されたか否かを判定するフ
ローを示す図である。FIG. 9 is a diagram showing a flow of determining whether or not an input is made on the back side by mistake.

【図１０】スキャナの読み取り領域をはみ出したか否か
を判定するフローを示す図である。FIG. 10 is a diagram showing a flow of determining whether or not the reading area of the scanner is protruded.

【図１１】スキャナの読み取り領域をはみ出した部分の
表示方法を示した図である。FIG. 11 is a diagram showing a display method of a portion outside the reading area of the scanner.

【図１２】入力文書の傾きをユーザが設定する方法を示
した図である。FIG. 12 is a diagram showing a method for a user to set the inclination of an input document.

【図１３】文字認識を利用して文書の回転角の評価を示
す図である。FIG. 13 is a diagram showing evaluation of a rotation angle of a document using character recognition.

【図１４】４種類の候補文書回転角から回転角を決定す
る方法を示す図である。FIG. 14 is a diagram showing a method of determining a rotation angle from four types of candidate document rotation angles.

【図１５】レイアウト情報を利用して文書の回転角の評
価を示す図である。FIG. 15 is a diagram showing evaluation of a rotation angle of a document using layout information.

【図１６】予め設定されたレイアウト情報を利用して文
書の回転角の評価を示す図である。FIG. 16 is a diagram showing evaluation of a rotation angle of a document using preset layout information.

【図１７】縦書き・横書きのブロックを０、９０、１８
０、２７０度回転させた図である。FIG. 17: Vertical writing / horizontal writing blocks 0, 90, 18
It is the figure rotated 0,270 degree.

【図１８】文字認識を利用して求めた回転角とレイアウ
ト情報を利用して求めた回転角による文書の回転角の決
定方法を示す図である。FIG. 18 is a diagram showing a method of determining a rotation angle of a document based on a rotation angle obtained by using character recognition and a rotation angle obtained by using layout information.

【図１９】入力文書を回転角だけ修正し、記述された内
容の読み取りを示す図である。FIG. 19 is a diagram showing reading of the described contents by correcting the input document by the rotation angle.

【図２０】入力文書をレイアウト情報を用いて、回転角
だけ修正し、記述された内容の読み取り方法を示す図で
ある。FIG. 20 is a diagram showing a method of reading the described content by correcting only the rotation angle of the input document using layout information.

【図２１】文書画像を回転修正し頁番号を認識し、それ
を付加情報として登録する一実施例を示す図である。FIG. 21 is a diagram showing an embodiment in which a document image is rotated and corrected, a page number is recognized, and the page number is registered as additional information.

【図２２】付加情報として登録する内容の一例を示した
図である。FIG. 22 is a diagram showing an example of contents registered as additional information.

【図２３】複数の項目のファイルから文書間同志の情報
を抽出し登録する方法を示す図である。FIG. 23 is a diagram showing a method of extracting and registering information of inter-document documents from files of a plurality of items.

【図２４】雑誌名情報ファイルの例示図である。FIG. 24 is a view showing an example of a magazine name information file.

【図２５】文書同志関係情報ファイルの例示例である。FIG. 25 is an example of a document fellowship information file.

【図２６】本発明の文書読取装置の一実施例の処理フロ
ーを示した図である。FIG. 26 is a diagram showing a processing flow of an embodiment of the document reading apparatus of the present invention.

【図２７】Ａ３スキャナを用いることにより、Ａ４スキ
ャナの読み取り領域からはみ出すことがない読み取りフ
ローを示す図である。FIG. 27 is a diagram showing a reading flow in which an A3 scanner is used and the A4 scanner does not extend beyond the reading area.

【図２８】Ａ４スキャナの読み取り領域からのはみ出し
を検出した場合、更にＡ３スキャナの読み取り領域で画
像入力することにより、読み取りを実現する処理フロー
を示す図である。FIG. 28 is a diagram showing a processing flow for realizing reading by detecting an overflow from the reading area of the A4 scanner and further inputting an image in the reading area of the A3 scanner.

【図２９】スキャナの読み取り領域をはみ出したか否か
を判定するフローを示す図である。FIG. 29 is a diagram showing a flow for determining whether or not the reading area of the scanner is protruded.

【図３０】文字画像の回転を行なうこと無く、チェーン
コードにより文字認識対象を高速に回転する表現に用い
るチェーンコードを示す図である。FIG. 30 is a diagram showing a chain code used for an expression in which the character recognition target is rotated at high speed by the chain code without rotating the character image.

【図３１】チェーンコードにより表現されたもの骨格を
示す図である。FIG. 31 is a diagram showing a skeleton expressed by a chain code.

[Explanation of symbols]

１０１文書、１０５画像入力手段、１１０文字行抽出手段、１２５傾き抽出手段、１５０文書回転角判定手段、１６５文字行座標回転手段。 101 document, 105 image input means, 110 character line extraction means, 125 inclination extraction means, 150 document rotation angle determination means, 165 character line coordinate rotation means.

───────────────────────────────────────────────────── フロントページの続き (72)発明者嶋好博東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Yoshihiro Shima 1-280, Higashi Koigokubo, Kokubunji, Tokyo Inside the Central Research Laboratory, Hitachi, Ltd.

Claims

[Claims]

1. An input unit for inputting an image of a document, a character line extracting unit for extracting a character line of the input image, an inclination extracting unit for extracting an inclination of the document, and the extracted character line. To the inclination of the document 0 degrees, 90 degrees, 180 degrees, 270
A rotation means for rotating an angle added with a degree and a determination means for recognizing the rotated four types of character lines and determining the inclination of the document from the recognition are provided, and the inclination of the document is corrected to read. A document reading device characterized by performing.

2. The document reading apparatus according to claim 1, wherein the input means inputs all the document images by using an A3 input scanner.

3. The character line extracting means comprises means for adopting an aspect ratio of a character line or an aspect ratio of a certain value or more so that a plurality of characters are present in the character line. The document reading device according to claim 1, wherein

4. The determining means recognizes a character cut out from each of the character lines rotated by the four types of angles, and calculates an average value, a median value, and a filter of similarity degrees of a plurality of characters obtained by the recognition. The document reading device according to claim 1, wherein the document inclination is determined based on at least one of the values obtained by the multiplication.

5. The document reading device according to claim 1, wherein the determining means obtains layout information from the character line rotated by the rotating means and determines the inclination of the document based on the layout information. apparatus.

6. The document reading device according to claim 5, wherein the layout information is four types of information in which vertical writing / horizontal writing and vertical / horizontal writing information are combined.

7. A document reading apparatus according to claim 1, wherein first means for adding a document number to said document and additional information for inputting additional information not written in a document related to said document are inputted. A second means, a third means for electronically storing the input additional information in association with the document number or the document information obtained by processing the input document, and a fourth means for searching the additional information or the document information. A document reading device comprising:

8. The document reading apparatus according to claim 7, wherein the first means recognizes the page number image of the document, recognizes the extracted page number, and recognizes the extracted page number. A document reading device, comprising: means for registering a page number as additional information.

9. A document reading apparatus according to claim 7, wherein means for electronically recording relationship information between documents and means for retrieving the relationship information between the documents to display document information and additional information relating to a desired document. A document reading device comprising a searching unit.

10. The document reading device according to claim 1,
A document reading device comprising means for determining whether the input document is normally input on the front side or is input on the back side by mistake according to the presence / absence of character lines extracted from the input digital image.

11. The document reading apparatus according to claim 1,
It is determined whether the character line to be read is outside the scanner reading area according to whether two or more of the four vertices of the character row extracted by the character row extracting means are present on the four sides of the scanner reading area. A document reading device comprising a means.

12. The document reading device according to claim 11, wherein when it is determined that the character line to be read of the document image is outside the area, an error occurs in a window displaying the document image or another window. A document reading device, comprising: means for enlarging and displaying a portion or a means for displaying a character line in which an error has occurred by changing the color of the character line from that of another character line.

13. The document reading apparatus according to claim 11, wherein, when it is determined that the character line to be read of the document image is outside the area, whether the user continues to read the input image is set by a mode. A document reading device comprising means capable of performing.

14. The document reading device according to claim 11, wherein when it is determined that the character line to read the document image is outside the area, a means for registering the document number of the document image in a reject file or an error. A document reading device comprising: means for displaying a message on a window or urging a warning by voice call.

15. A device for inputting a document as a digital image, a means for displaying a free space for outputting the processing result of the digital image to a file, and a warning window when the free space becomes small. A document reading apparatus comprising means for prompting by means of a display or voice to a user and means for prompting a warning to another device in which an operator is present via a network.

16. A device for inputting a document as a digital image, wherein the cumulative number of black dots (N is an integer) of a rectangle corresponding to the upper side N bits of the inputted image data and the cumulative number of black dots of a rectangular region corresponding to the right side N bits and the left side. A means for determining the cumulative number of black dots in the rectangle for N bits and the cumulative number of black dots in the rectangle for the bottom N bits, and whether the values of the four types of cumulative black dots obtained above exceed the value V (V is an integer). A means for determining whether or not, and a document reading device, characterized in that when the above determination result exceeds a value V (V is an integer), it is determined that the written content to be read is outside the scanner reading area.

17. A character reading device for expressing a skeleton or outline of a character by a chain code in N directions (N is an integer), means for cutting out a character from an appropriate selected character line image, and the cut out character image. Means for extracting a skeleton or contour of a character from the expression and expressing it as a chain code in the N direction, and four kinds of angles 0, 90, 180 for the detected inclination.
A document reading apparatus comprising means for rotating the chain code by an angle corresponding to an angle of +270 degrees and means for recognizing the rotated chain code.

18. An electronic file device comprising: input means for inputting an image of a document; storage means for storing the input image; and output means for outputting the stored image. A character line extracting means for extracting a character line of an image, a tilt extracting means for extracting a document inclination, and the above-mentioned extracted character lines for a document inclination of 0 °, 90 °, 1
The inclination of the document is corrected by including a rotation unit that rotates an angle of 80 degrees and 270 degrees and a determination unit that recognizes the four types of rotated character lines and determines the inclination of the document from the recognition. An electronic filing device characterized by performing reading.

19. An electronic file device comprising input means for inputting an image of a document, storage means for storing the input image, and output means for outputting the stored image. A character line extracting means for extracting a character line of an image, a tilt extracting means for extracting a document inclination, and the above-mentioned extracted character lines for a document inclination of 0 °, 90 °, 1
The inclination of the document is corrected by including a rotation unit that rotates an angle of 80 degrees and 270 degrees and a determination unit that recognizes the four types of rotated character lines and determines the inclination of the document from the recognition. An electronic filing device characterized by performing reading.

20. A facsimile device comprising an input means for inputting an image of a document and a transmitting means for transmitting the input image, a character line extracting means for extracting a character line of the input image, An inclination extracting unit that extracts the inclination of the document, and the extracted character line to the inclination of the document is 0 degrees,
Rotating means for rotating an angle obtained by adding 90 degrees, 180 degrees, and 270 degrees, and determination means for recognizing the rotated four types of character lines and determining the inclination of the document from the recognition are provided. A facsimile machine characterized by correcting inclination and reading.

21. A character line extracting means for extracting a character line of the input image in a copying machine having an input means for inputting an image of a document and an output means for outputting the accumulated image, Inclination extraction means for extracting the inclination of the document, and the above extracted character lines to the inclination of the document are 0 °, 90 °, 1
The inclination of the document is corrected by including a rotation unit that rotates an angle of 80 degrees and 270 degrees and a determination unit that recognizes the four types of rotated character lines and determines the inclination of the document from the recognition. A copying machine characterized by scanning and reading.