JP3014257B2

JP3014257B2 - Document reading device

Info

Publication number: JP3014257B2
Application number: JP5268753A
Authority: JP
Inventors: 哲夫中村
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1993-10-27
Filing date: 1993-10-27
Publication date: 2000-02-28
Anticipated expiration: 2015-02-28
Also published as: JPH07121655A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、文書内に記載されてい
る文字、図形および写真等の読み取り処理を行う文書読
み取り装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document reading apparatus for reading characters, figures, photographs and the like described in a document.

【０００２】[0002]

【従来の技術】従来の、文字、図形あるいは写真等が混
在する文書の読み取りを行う文書読み取り装置は、ま
ず、入力部において文字認識を行うために文字、図形あ
るいは写真上を走査し、単純二値変換を行い画像データ
を入力する。ここで得られた画像データは、いったんメ
モリに格納される。続いて、識別部において前記メモリ
内に格納されている画像データに基づき、文字、図形あ
るいは写真等といったそれぞれの領域に識別する。そし
て、前記識別部で得られた識別結果のうち文字領域の部
分については、文字認識処理を行った後に文字コードを
出力し、残りの図形領域あるいは写真領域については、
単純二値の画像データとして出力している。2. Description of the Related Art A conventional document reading apparatus for reading a document in which characters, figures, photographs, etc. are mixed, first scans a character, figure, or photograph in order to perform character recognition in an input unit, and scans the document by a simple method. Perform value conversion and input image data. The image data obtained here is temporarily stored in the memory. Subsequently, the identification unit identifies each area such as a character, a figure, or a photograph based on the image data stored in the memory. Then, for the character area portion of the identification result obtained by the identification section, a character code is output after performing a character recognition process, and for the remaining graphic area or photograph area,
It is output as simple binary image data.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、上述し
た従来の文書読み取り装置には、次のような不都合な点
があった。従来の文書読み取り装置では、写真領域をも
文字領域あるいは図形領域と同じように単純二値変換に
より画像入力しているため、モアレ縞等の画像劣化が発
生していた。ここで、モアレ縞とは、網点画像のコピー
をとる際などに入る縦横の縞のことである。また、逆に
写真領域の画質を保つために、写真領域だけでなく文字
領域や図形領域をも疑似多値変換により画像入力してし
まうと、文字領域の認識率が低下し、図形領域の画質も
劣化させてしまっていた。However, the conventional document reading apparatus described above has the following disadvantages. In a conventional document reading apparatus, an image is also input to a photograph area by a simple binary conversion in the same manner as a character area or a graphic area, so that image deterioration such as moire fringes has occurred. Here, the moiré fringes are vertical and horizontal stripes that occur when a halftone dot image is copied. Conversely, if the image input of not only the photograph area but also the character area and the graphic area by pseudo multi-value conversion to maintain the image quality of the photograph area, the recognition rate of the character area decreases, and the image quality of the graphic area decreases. Had also deteriorated.

【０００４】本発明は、このような従来の不都合な点を
解決するためになされたもので、文字領域の認識率の低
下や図形領域の画質の劣化を招くことなく、写真領域に
おいても高画質の画像入力を行えるような文書読み取り
装置を提供することを目的とする。SUMMARY OF THE INVENTION The present invention has been made to solve such a conventional disadvantage, and it is possible to obtain a high quality image even in a photographic area without lowering the recognition rate of a character area or deteriorating the image quality of a graphic area. It is an object of the present invention to provide a document reading apparatus capable of inputting an image.

【０００５】[0005]

【課題を解決するための手段】本発明の文書読み取り装
置は、記録媒体上に記録された文字、図形あるいは写真
等を走査し画像信号に変換し、さらに該画像信号をデジ
タル変換する画像入力手段と、この画像入力手段の処理
により得られる画像データを格納しておく画像メモリ
と、画像メモリ内に格納されている画像データから文
字、図形あるいは写真等のそれぞれの領域を識別処理す
る領域識別手段と、このようにして、領域識別手段にて
得られた領域識別結果のうち、文字領域に対応する前記
画像メモリ内の前記画像データから行および該行内の文
字を切り出し、いま切り出した文字一つずつの画像デー
タを認識し、文字コードを得る文字認識手段と、前記領
域識別結果のうち、図形領域に対応する前記画像メモリ
内の前記画像データを図形データとして読み出す図形デ
ータ読み出し手段と、前記領域識別結果のうち、写真領
域に対応する前記画像メモリ内の前記画像データを、前
記画像入力手段においてデジタル変換して写真データと
し、該写真データを入力する写真データ入力手段と、前
記文字コードと、前記図形データ及び前記写真データと
を任意のフォーマットの文書データとして出力する出力
手段とを備え、前記写真データ入力手段は、前記領域識
別手段にて識別された写真領域よりも広い領域を、前記
画像入力手段においてデジタル変換して得た画像データ
を入力し、前記画像メモリに格納し、さらに、該画像デ
ータ内の写真領域を前記領域識別手段にて識別し、ここ
で識別された写真領域に対応する画像データを写真デー
タとすることを特徴とするものである。 According to the present invention, there is provided a document reading apparatus which scans a character, a figure, a photograph, or the like recorded on a recording medium, converts the scanned image into an image signal, and further converts the image signal into a digital signal. And an image memory for storing image data obtained by the processing of the image input means, and an area identifying means for identifying each area such as a character, a figure or a photograph from the image data stored in the image memory. A line and characters in the line are cut out from the image data in the image memory corresponding to the character region in the region identification result obtained by the region identification means in this way, and one character just cut out is cut out. Character recognition means for recognizing each image data and obtaining a character code; and, of the area identification results, the image data in the image memory corresponding to the graphic area. A graphic data reading means for reading as shape data; and, among the area identification results, the image data in the image memory corresponding to a photographic area are digitally converted by the image input means into photographic data, and the photographic data is input. and picture data input means for, with the character code, and output means for outputting said graphic data and the photo data as document data in any format, the photo data input means, said area identification
An area larger than the photograph area identified by another means is
Image data obtained by digital conversion in image input means
And stores the image data in the image memory.
The photograph area in the data is identified by the area
The image data corresponding to the photo area identified by
Data.

【０００６】また、本発明の文書読み取り装置は、前記
写真データ入力手段が、前記領域識別手段にて識別され
た写真領域よりも広い領域を、前記画像入力手段におい
てデジタル変換して得た画像データを入力し、前記画像
メモリに格納し、さらに、該画像データ内の写真領域を
前記領域識別手段にて識別し、ここで識別された写真領
域に対応する画像データを写真データとすることを特徴
とするものである。In the document reading apparatus according to the present invention, the image data obtained by the photo data input means by digitally converting, by the image input means, an area larger than the photo area identified by the area identification means. And storing the image data in the image memory. Further, a photographic region in the image data is identified by the region identification means, and image data corresponding to the identified photographic region is defined as photographic data. It is assumed that.

【０００７】[0007]

【作用】本発明の文書読み取り装置によれば、記録媒体
上に記録された文字、図形、写真等といったおのおのの
領域に応じて、画像データとして取り込む際の画像入力
の手段を変えるようにする。これにより、文字領域の認
識率の低下やモアレ縞等の画像の劣化が発生することな
く、高い文字認識率かつ高画質の画像データを入出力す
ることが可能になる。また、本発明の文書読み取り装置
によれば、写真領域の検出を行う際、領域識別され、一
旦、入力処理された画像データを、さらに広い領域につ
いて領域識別を行い、これにより得られた画像データを
写真領域の画像データとして出力するようにすること
で、二度の画像入力を行うことによって生じる文書自体
のずれや、写真領域の位置の変化等に影響されることな
く、正しく認識された写真領域の画像データを入出力す
ることができるようになる。According to the document reading apparatus of the present invention, the means for inputting an image when capturing as image data is changed in accordance with each area such as a character, a graphic, and a photograph recorded on a recording medium. As a result, it is possible to input and output image data of a high character recognition rate and high image quality without causing a reduction in the recognition rate of the character area and deterioration of the image such as moire fringes. Further, according to the document reading apparatus of the present invention, when detecting a photograph area, the image data that has been area-identified and once subjected to input processing is area-identified for a wider area, and the image data obtained thereby is obtained. Is output as image data of the photograph area, so that the correctly recognized photograph can be obtained without being affected by the displacement of the document itself caused by inputting the image twice and the change in the position of the photograph area. Image data of the area can be input and output.

【０００８】[0008]

【実施例】以下に、本発明の第一の実施例を図面を参照
して詳細に説明する。図１は、本発明の文書読み取り装
置の実施例をあらわすブロック図であり、図２は、本発
明の文書読み取り装置の第一の実施例をあらわすフロー
チャートである。画像入力部１１は、読み取りの対象の
文書上を走査し、当該文書上に記録されている文字、図
形あるいは写真等を光電変換により画像信号に変換し、
さらにこの画像信号を二値の画像データに変換する。こ
のとき、画像信号の二値の画像データへの変換は、単純
二値あるいは疑似多値変換である。加えて、画像入力部
１１は画像データの画像メモリ１２への転送も行う。そ
して、画像メモリ１２は、画像入力部１１より転送され
てきた画像データを格納しておくメモリであり、後述す
る領域識別部１３、文字認識部１４、結果出力部１５か
らアクセス可能である。領域識別部１３は、画像メモリ
１２内に格納されている画像データを文字、図形あるい
は写真等といったそれぞれの領域ごとに識別し、これら
の領域識別結果を画像入力部１１、文字認識部１４およ
び結果出力部１５へと転送する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, a first embodiment of the present invention will be described in detail with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of the document reading apparatus of the present invention, and FIG. 2 is a flowchart showing a first embodiment of the document reading apparatus of the present invention. The image input unit 11 scans a document to be read, converts characters, graphics, photographs, and the like recorded on the document into image signals by photoelectric conversion.
Further, this image signal is converted into binary image data. At this time, the conversion of the image signal into binary image data is a simple binary or pseudo multi-value conversion. In addition, the image input unit 11 transfers image data to the image memory 12. The image memory 12 is a memory for storing image data transferred from the image input unit 11, and is accessible from an area identification unit 13, a character recognition unit 14, and a result output unit 15, which will be described later. The area identification unit 13 identifies image data stored in the image memory 12 for each area such as a character, a figure, or a photograph, and outputs these area identification results to the image input unit 11, the character recognition unit 14, and the result. Transfer to the output unit 15.

【０００９】次に、文字認識部１４は、画像メモリ１２
内に格納されている画像データのうち、領域識別部１３
における識別処理による領域識別結果（この場合は、文
字領域）に対応する画像データから検出した文字を文字
コードとして認識し、この文字コードの結果出力部１５
への転送を行う。この結果、結果出力部１５では、文字
認識部１４から得た文字コードと、画像メモリ１２から
読み出した、領域識別部１３にて識別された領域識別結
果にもとづく図形領域および写真領域それぞれに対応す
る画像データとを任意のフォーマットの文書データに変
換し、出力装置１６へ出力する。ここでいう出力装置１
６とは、図示しないが、結果出力部１５から送られてき
た文書データを記憶しておく磁気ディスク装置や当該文
書データを印字するプリンタ等のことを示している。Next, the character recognizing unit 14 operates in the image memory 12.
Out of the image data stored in the
The character detected from the image data corresponding to the area identification result (in this case, the character area) by the identification processing in (1) is recognized as a character code, and the character code result output unit 15
Transfer to. As a result, the result output unit 15 corresponds to the character code obtained from the character recognizing unit 14 and the graphic area and the photograph area read from the image memory 12 and based on the area identification result identified by the area identifying unit 13. The image data is converted into document data of an arbitrary format and output to the output device 16. Output device 1 here
Reference numeral 6 denotes a magnetic disk device (not shown) for storing the document data sent from the result output unit 15, a printer for printing the document data, and the like.

【００１０】そして、図形データ読み出し部１８は、領
域識別部１３から転送された領域識別結果のうちの図形
領域に対応する画像データを画像メモリ１２からの読み
出しを行い、写真データ入力部１９は、領域識別部１３
から転送された領域識別結果のうちの写真領域をデジタ
ル変換することによって得た画像データの入力を行う。
また、制御部１７は、文書読み取り装置全体の各動作を
制御している。The graphic data reading section 18 reads out image data corresponding to the graphic area from the area identification result transferred from the area identifying section 13 from the image memory 12. Area identification unit 13
The image data obtained by digitally converting the photographic region in the region identification result transferred from the computer is input.
Further, the control unit 17 controls each operation of the entire document reading apparatus.

【００１１】つづいて、以上のように構成される文書読
み取り装置における第１の実施例を示す処理動作につい
て説明する。図３は、読み取った文書の一頁全体をあら
わす画像例であり、図４は、領域識別部による領域識別
結果をあらわす説明図である。まず最初に、画像入力を
行う（ステップＳ１）。具体的には、画像入力部１１に
よって、今回読み取りの対象である文書上に記録されて
いる一頁全体のすべての文字、図形あるいは写真等を、
光電変換により画像信号に変換し、さらに、この画像信
号を単純二値化変換により画像データを作成し、この画
像データを画像メモリ１２に格納しておく。このときの
文書一頁全体の画像例が、図３に示すとおりである。画
像入力が終了すると、領域識別部１３により、ステップ
Ｓ１で画像メモリに格納した画像データを図４に示すよ
うな文字領域４１、図形領域４２あるいは写真領域４３
に識別する領域識別処理を行う（ステップＳ２）。この
領域識別処理は、例えば、画像データの周辺分布ヒスト
グラム（幾何的な特徴をみる）を利用する方法等を用い
て行われる。その後、領域識別部１３は、領域識別結果
を画像入力部１１、文字認識部１２あるいは結果出力部
１５へそれぞれ転送する。Next, a description will be given of a processing operation according to the first embodiment in the document reading apparatus configured as described above. FIG. 3 is an image example showing an entire page of a read document, and FIG. 4 is an explanatory diagram showing an area identification result by an area identification unit. First, image input is performed (step S1). More specifically, the image input unit 11 can copy all the characters, graphics, photographs, etc. of the entire page recorded on the document to be read this time.
The image signal is converted into an image signal by photoelectric conversion, and the image signal is converted into image data by simple binarization conversion. The image data is stored in the image memory 12. FIG. 3 shows an example of an image of the entire page of the document at this time. When the image input is completed, the area identification unit 13 converts the image data stored in the image memory in step S1 into a character area 41, a graphic area 42, or a photograph area 43 as shown in FIG.
Is performed (step S2). This region identification processing is performed using, for example, a method that uses a peripheral distribution histogram of image data (seeing geometric characteristics). After that, the area identification unit 13 transfers the area identification result to the image input unit 11, the character recognition unit 12, or the result output unit 15, respectively.

【００１２】次に、文字認識部１４による文字認識処理
を行う（ステップＳ３）。文字認識部１４では、領域識
別部１３から転送された領域識別結果のうちの文字領域
４１に対応する画像メモリ１２内に格納された画像デー
タの中から行を検出し、さらに、当該行から文字を検出
する。そして、いま検出した文字一文字毎の画像データ
について再度画像メモリ１２を参照し、この一文字毎の
画像データを文字認識することで文字コードを得るので
ある。これらの文字コードは、全て結果出力部１５へと
転送される。つづいて、領域識別部１３から転送された
領域識別結果のうちの図形領域４２に対応する画像デー
タを画像メモリ１２から、図形データ読み出し部１８に
おいて読み出す（ステップＳ４）。その後、画像入力部
１１において、先にステップＳ１で画像入力した今回読
み取り対象である文書について、領域識別部１３から転
送された領域識別結果のうちの写真領域４３を疑似多値
変換することによって得た画像データを写真データ入力
部１９から入力し（ステップＳ５）、さらに、これらの
画像データ（図形、写真）を画像メモリ１２を介して結
果出力部１５へ転送する。Next, a character recognition process is performed by the character recognition unit 14 (step S3). The character recognizing unit 14 detects a line from the image data stored in the image memory 12 corresponding to the character area 41 in the area identification result transferred from the area identifying unit 13, and further detects a character from the line. Is detected. Then, the image data for each character detected at this time is referred to the image memory 12 again, and a character code is obtained by character recognition of the image data for each character. All of these character codes are transferred to the result output unit 15. Subsequently, the image data corresponding to the graphic area 42 in the area identification result transferred from the area identification section 13 is read from the image memory 12 by the graphic data reading section 18 (step S4). Thereafter, the image input unit 11 obtains the document to be read this time, which has been input in step S1 earlier, by performing pseudo multi-value conversion of the photograph area 43 in the area identification result transferred from the area identification unit 13. The received image data is input from the photograph data input unit 19 (step S5), and these image data (graphics and photographs) are transferred to the result output unit 15 via the image memory 12.

【００１３】最後に、結果出力部１５では、文字領域４
１についてはステップＳ３によって得られた文字コード
を、そして、図形領域４２、写真領域４３についてはそ
れぞれステップＳ４、ステップＳ５によって得られた画
像データを任意の文書フォーマットの文書データに変換
し、この文書データを出力装置１６へ出力することで処
理を終了する（ステップＳ６）。このように、文字領域
４１および図形領域４２については、単純二値変換によ
って画像入力し、写真領域４３については、疑似多値変
換によって画像入力するようにしたことで、高い文字認
識率および図形領域における高画質を保ったまま、写真
領域においても高画質の画像データを出力することがで
きる。Finally, in the result output unit 15, the character area 4
1 for the character code obtained in step S3, and for the graphic area 42 and the photograph area 43, the image data obtained in step S4 and step S5, respectively, are converted into document data in an arbitrary document format. The process ends by outputting the data to the output device 16 (step S6). As described above, the character area 41 and the graphic area 42 are image-input by the simple binary conversion, and the photograph area 43 is image-input by the pseudo multi-value conversion, so that the character recognition rate and the graphic area are high. It is possible to output high-quality image data even in a photographic area, while maintaining high image quality in.

【００１４】次に、本発明の第二の実施例を図面を用い
て詳細に説明する。第二の実施例を説明するにあたり、
本発明の文書読み取り装置をあらわすブロック図に示す
構成・動作等、第一の実施例の説明と重複する箇所につ
いては、説明を省略することにする。図５は、本発明の
文書読み取り装置の第二の実施例をあらわすフローチャ
ートであり、図６は、写真領域の拡大領域をあらわす説
明図であり、図７は、拡大された写真領域の画像データ
をあらわす説明図である。先に説明した、図２のステッ
プＳ４の図形領域４２に対応する画像データの読み出し
が終了後、画像入力部１１により、写真領域より広い領
域を疑似多値変換により画像入力を行う（ステップＳ５
１）。具体的には、ステップＳ１で画像入力した読み取
り対象文書について、領域識別部１３における領域識別
結果である写真領域６１より広い拡大領域６２（図６参
照）を、疑似多値変換により、図７に示す画像データ７
２として入力しているのである。当然、この画像データ
７２も画像メモリ１２へ格納される。Next, a second embodiment of the present invention will be described in detail with reference to the drawings. In describing the second embodiment,
The description of the same parts as those of the first embodiment, such as the configuration and operation shown in the block diagram showing the document reading apparatus of the present invention, will be omitted. FIG. 5 is a flowchart showing a second embodiment of the document reading apparatus of the present invention, FIG. 6 is an explanatory diagram showing an enlarged area of a photograph area, and FIG. 7 is a diagram showing image data of the enlarged photograph area. FIG. After the reading of the image data corresponding to the graphic area 42 in step S4 of FIG. 2 described above is completed, the image input unit 11 inputs an image of an area larger than the photograph area by pseudo multi-value conversion (step S5).
1). Specifically, an enlarged area 62 (see FIG. 6) wider than the photograph area 61, which is the area identification result of the area identification unit 13, of the document to be read input in step S1 is converted into a pseudo multi-valued image as shown in FIG. Image data 7 shown
It is entered as 2. Naturally, the image data 72 is also stored in the image memory 12.

【００１５】ここで、拡大領域の求め方の一例を紹介す
る。図６に示す拡大領域６２は、Ｘ方向（画像入力部１
１の主走査方向：左右方向）に、ｗ＝Ｗ／３０（Ｗは文書全体の画像のＸ方向：頁
幅）のｗ分だけ左右両方向に拡大し、Ｙ方向（画像入力部１
１の副走査方向：上下方向）に、ｈ＝Ｈ／３０（Ｈは文書全体の画像のＹ方向：頁高
さ）のｈ分だけ上下両方向に拡大した領域をあらわしてい
る。そして、前記ステップＳ５１で示す拡大領域６２の
画像入力が終了すると、領域識別部１３により、画像メ
モリ１２内に格納されている画像データ７２（図７参
照）から写真領域７１を識別する領域識別処理を行い
（ステップＳ５２）、ここでの識別処理結果を結果出力
部１５へ転送する。これを受けた結果出力部１５では、
領域識別部１３から送られた領域識別結果である写真領
域７１に対応する画像データを画像メモリ１２から読み
出し（ステップＳ５３）、以下第一の実施例と同様、ス
テップＳ５３にて読み出した画像データを任意の文書フ
ォーマットの文書データに変換し、この文書データを出
力装置１６へ出力することで全ての処理を終了する。Here, an example of a method of obtaining an enlarged area will be introduced. The enlarged area 62 shown in FIG.
1 in the main scanning direction: left / right direction, and w = W / 30 (W is the X direction of the image of the entire document: page width).
In the sub-scanning direction 1 (vertical direction), h = H / 30 (where H is the Y direction of the image of the entire document: page height) represents an area enlarged in both the vertical and horizontal directions by h. When the input of the image of the enlarged area 62 shown in step S51 is completed, the area identifying unit 13 identifies the photographic area 71 from the image data 72 (see FIG. 7) stored in the image memory 12. Is performed (step S52), and the identification processing result is transferred to the result output unit 15. In response to this, the output unit 15
The image data corresponding to the photograph area 71, which is the area identification result sent from the area identification unit 13, is read from the image memory 12 (step S53), and the image data read in step S53 is then read out in the same manner as in the first embodiment. By converting the document data into document data of an arbitrary document format and outputting the document data to the output device 16, all the processes are completed.

【００１６】このように、本発明の第２の実施例では、
写真領域の検出を行う際に、一旦、疑似多値変換して入
力した画像データを、再び領域識別部１３にて、さらに
広い領域の識別を行っている。これにより、一回目の画
像入力（ステップＳ１：単純二値変換による画像入力）
と二回目の画像入力（ステップＳ５１：疑似多値変換に
よる画像入力）を実施することによって生じる文書自体
のずれや、また、一回目と二回目の光電変換時に発生す
る画像データの変化による写真領域の位置の変化等に影
響されることなく、正しく認識された写真領域の画像デ
ータを入力することができる。最後に、上記第一の実施
例、第二の実施例では、画像データを入力する手段とし
て、疑似多値変換を例に説明してきたが、本発明は、こ
れに限られるものではなく、モノクロ多値変換やカラー
による再入力等であっても有効である。As described above, in the second embodiment of the present invention,
When detecting a photographic region, the image data once subjected to pseudo multi-value conversion and input is again identified by the region identification unit 13 for a wider region. Thereby, the first image input (step S1: image input by simple binary conversion)
And the second image input (step S51: image input by pseudo-multi-value conversion), the shift of the document itself, and the photographic area due to the change of the image data generated at the first and second photoelectric conversion. It is possible to input image data of a correctly recognized photographic area without being affected by a change in the position of the image data. Finally, in the first and second embodiments, pseudo multi-value conversion has been described as an example of a means for inputting image data. However, the present invention is not limited to this. It is effective even for multi-value conversion, color re-input, and the like.

【００１７】[0017]

【発明の効果】以上説明したように、本発明の文書読み
取り装置によれば、記録媒体上に記録された文字、図
形、写真等といったおのおのの領域に応じて、画像デー
タとして取り込む際の画像入力の手段を変えるようにし
たことで、文字領域の認識率の低下やモアレ縞等の画像
の劣化が発生することなく、高い文字認識率かつ高画質
の画像データを出力することができるようになった。ま
た、本発明の文書読み取り装置によれば、写真領域の検
出を行う際、領域識別され、一旦、入力処理された画像
データを、さらに広い領域について領域識別を行い、こ
れにより得られた画像データを写真領域の画像データと
して出力するようにしたので、二度の画像入力を行うこ
とによって生じる文書自体のずれや、写真領域の位置の
変化等に影響されることなく、正しく認識された写真領
域の画像データを入出力することができるようになっ
た。As described above, according to the document reading apparatus of the present invention, it is possible to input an image when capturing as image data in accordance with each area such as characters, figures, photographs, etc. recorded on a recording medium. By changing the means, it is possible to output image data with a high character recognition rate and high image quality without lowering the recognition rate of the character area or deteriorating the image such as moire fringes. Was. Further, according to the document reading apparatus of the present invention, when detecting a photograph area, the image data that has been area-identified and once subjected to input processing is area-identified for a wider area, and the image data obtained thereby is obtained. Is output as image data of the photographic area, so that the photographic area correctly recognized without being affected by the displacement of the document itself caused by inputting the image twice, a change in the position of the photographic area, and the like. Image data can be input and output.

[Brief description of the drawings]

【図１】本発明の文書読み取り装置の実施例をあらわす
ブロック図である。FIG. 1 is a block diagram illustrating an embodiment of a document reading apparatus according to the present invention.

【図２】本発明の文書読み取り装置の第一の実施例をあ
らわすフローチャートである。FIG. 2 is a flowchart showing a first embodiment of the document reading apparatus of the present invention.

【図３】読み取った文書の一頁全体をあらわす画像例で
ある。FIG. 3 is an image example showing an entire page of a read document.

【図４】領域識別部による領域識別結果をあらわす説明
図である。FIG. 4 is an explanatory diagram showing an area identification result by an area identification unit.

【図５】本発明の文書読み取り装置の第二の実施例をあ
らわすフローチャートである。FIG. 5 is a flowchart showing a second embodiment of the document reading apparatus of the present invention.

【図６】写真領域の拡大領域をあらわす説明図である。FIG. 6 is an explanatory diagram showing an enlarged area of a photograph area.

【図７】拡大された写真領域の画像データをあらわす説
明図である。FIG. 7 is an explanatory diagram showing image data of an enlarged photograph area.

[Explanation of symbols]

１１画像入力部１２画像メモリ１３領域識別部１４文字認識部１５結果出力部１６出力装置１７制御部 Reference Signs List 11 image input unit 12 image memory 13 area identification unit 14 character recognition unit 15 result output unit 16 output device 17 control unit

Claims

(57) [Claims]

An image input means for scanning a character, a figure, a photograph, or the like recorded on a recording medium, converting the image signal into an image signal, and further converting the image signal into a digital signal, and an image obtained by the image input means. An image memory for storing data; an area identification unit for identifying each area such as a character, a figure, and a photograph from the image data stored in the image memory; Character recognition means for extracting a line and characters in the line from the image data in the image memory corresponding to the character region in the region identification result, recognizing the image data of each character just cut out, and obtaining a character code A graphic data reading means for reading out the image data in the image memory corresponding to the graphic area from the area identification result as graphic data; The image data in the image memory corresponding to the photo region in the region identification result is converted into digital data by the image input unit to obtain photo data, and the photo data input unit for inputting the photo data; And an output unit for outputting the graphic data and the photograph data as document data , wherein the photograph data input unit has a larger area than the photograph area identified by the area identifying unit.
Area obtained by digital conversion in the image input means.
Inputting image data and storing the image data in the image memory;
And the image corresponding to the photo area identified here
Document reading characterized by using data as photographic data
Device.