JPH07121655A

JPH07121655A - Document reading device

Info

Publication number: JPH07121655A
Application number: JP5268753A
Authority: JP
Inventors: Tetsuo Nakamura; 哲夫中村
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1993-10-27
Filing date: 1993-10-27
Publication date: 1995-05-12
Anticipated expiration: 2015-02-28
Also published as: JP3014257B2

Abstract

PURPOSE:To attain the input of images of high picture quality in a photo area without deteriorating the recognition rate of a character area nor deteriorating the picture quality of a graphic area. CONSTITUTION:This document reading device consists of an image input part 11 which scans the characters, graphics or photos, etc., on a recording medium to acquire the image data, an image member 12 which stores the image data, and area identifying part 13 which discriminates the character, graphic and photo areas based on the image data, a result output part 15 which outputs the character codes, the graphic data and the photo data obtained from the image data corresponding to those areas discriminated by the part 13 through a character recognizing part 14, etc., as the document data, an output device 16 which is represented by a printer, etc., a graphic data reading part 18 which reads and supplies the image data based on the area recognizing result acquired by the part 13, a photo data input part 19, and a control part 17 which controls the whole of the device.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文書内に記載されてい
る文字、図形および写真等の読み取り処理を行う文書読
み取り装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document reading device for reading characters, figures, photographs, etc. described in a document.

【０００２】[0002]

【従来の技術】従来の、文字、図形あるいは写真等が混
在する文書の読み取りを行う文書読み取り装置は、ま
ず、入力部において文字認識を行うために文字、図形あ
るいは写真上を走査し、単純二値変換を行い画像データ
を入力する。ここで得られた画像データは、いったんメ
モリに格納される。続いて、識別部において前記メモリ
内に格納されている画像データに基づき、文字、図形あ
るいは写真等といったそれぞれの領域に識別する。そし
て、前記識別部で得られた識別結果のうち文字領域の部
分については、文字認識処理を行った後に文字コードを
出力し、残りの図形領域あるいは写真領域については、
単純二値の画像データとして出力している。2. Description of the Related Art A conventional document reading apparatus for reading a document in which characters, figures, photographs, etc. are mixed, first scans the characters, figures, or photographs in order to recognize the characters in an input section, and then simply scans them. Input the image data by converting the value. The image data obtained here is temporarily stored in the memory. Then, the identification section identifies each area such as a character, a figure, or a photograph based on the image data stored in the memory. Then, for the portion of the character area in the identification result obtained by the identification unit, the character code is output after performing the character recognition processing, and for the remaining graphic area or photograph area,
It is output as simple binary image data.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、上述し
た従来の文書読み取り装置には、次のような不都合な点
があった。従来の文書読み取り装置では、写真領域をも
文字領域あるいは図形領域と同じように単純二値変換に
より画像入力しているため、モアレ縞等の画像劣化が発
生していた。ここで、モアレ縞とは、網点画像のコピー
をとる際などに入る縦横の縞のことである。また、逆に
写真領域の画質を保つために、写真領域だけでなく文字
領域や図形領域をも疑似多値変換により画像入力してし
まうと、文字領域の認識率が低下し、図形領域の画質も
劣化させてしまっていた。However, the conventional document reading device described above has the following disadvantages. In the conventional document reading apparatus, since the image is input by the simple binary conversion in the same manner as the character area or the graphic area in the photo area, image deterioration such as moire fringes occurs. Here, the moire fringes are vertical and horizontal stripes that are included when a halftone image is copied. On the contrary, in order to maintain the image quality of the photo area, if not only the photo area but also the character area and the graphic area are input by the pseudo multi-value conversion, the recognition rate of the character area decreases and the image quality of the graphic area decreases. Had also deteriorated.

【０００４】本発明は、このような従来の不都合な点を
解決するためになされたもので、文字領域の認識率の低
下や図形領域の画質の劣化を招くことなく、写真領域に
おいても高画質の画像入力を行えるような文書読み取り
装置を提供することを目的とする。The present invention has been made in order to solve such inconveniences of the prior art, and the high image quality is achieved even in the photographic area without lowering the recognition rate of the character area and the image quality of the graphic area. It is an object of the present invention to provide a document reading device capable of inputting the image.

【０００５】[0005]

【課題を解決するための手段】本発明の文書読み取り装
置は、記録媒体上に記録された文字、図形あるいは写真
等を走査し画像信号に変換し、さらに該画像信号をデジ
タル変換する画像入力手段と、この画像入力手段の処理
により得られるの画像データを格納しておく画像メモリ
と、画像メモリ内に格納されている画像データから文
字、図形あるいは写真等のそれぞれの領域を識別処理す
る領域識別手段と、このようにして、領域識別手段にて
得られた領域識別結果のうち、文字領域に対応する前記
画像メモリ内の前記画像データから行および該行内の文
字を切り出し、いま切り出した文字一つずつの画像デー
タを認識し、文字コードを得る文字認識手段と、前記領
域識別結果のうち、図形領域に対応する前記画像メモリ
内の前記画像データを図形データとして読み出す図形デ
ータ読み出し手段と、前記領域識別結果のうち、写真領
域に対応する前記画像メモリ内の前記画像データを、前
記画像入力手段においてデジタル変換して写真データと
して入力する写真データ入力手段と、前記文字コード
と、前記図形データおよび前記写真データとを任意のフ
ォーマットの文書データとして出力する出力手段とを備
えたことを特徴とするものである。The document reading device of the present invention is an image input means for scanning characters, figures or photographs recorded on a recording medium, converting them into image signals, and further converting the image signals into digital signals. And an image memory for storing the image data obtained by the processing of the image input means, and an area identification for identifying each area such as a character, a figure or a photograph from the image data stored in the image memory. Means and the area identification result thus obtained by the area identifying means, a line and a character in the line are cut out from the image data in the image memory corresponding to the character area, and the character Character recognition means for recognizing each image data to obtain a character code, and the image data in the image memory corresponding to a graphic area in the area identification result Graphic data read-out means for reading out as graphic data, and photograph data input means for digitally converting the image data in the image memory corresponding to the photograph area out of the area identification result in the image input means to input it as photograph data. And output means for outputting the character code, the graphic data, and the photograph data as document data in an arbitrary format.

【０００６】また、本発明の文書読み取り装置は、前記
写真データ入力手段が、前記領域識別手段にて識別され
た写真領域よりも広い領域を、前記画像入力手段におい
てデジタル変換して得た画像データを入力し、前記画像
メモリに格納し、さらに、該画像データ内の写真領域を
前記領域識別手段にて識別し、ここで識別された写真領
域に対応する画像データを写真データとすることを特徴
とするものである。Further, in the document reading apparatus of the present invention, image data obtained by digitally converting, in the image input means, an area wider than the photographic area identified by the area identifying means by the photographic data input means. Is input and stored in the image memory, and the photographic area in the image data is identified by the area identifying means, and the image data corresponding to the identified photographic area is used as photographic data. It is what

【０００７】[0007]

【作用】本発明の文書読み取り装置によれば、記録媒体
上に記録された文字、図形、写真等といったおのおのの
領域に応じて、画像データとして取り込む際の画像入力
の手段を変えるようにする。これにより、文字領域の認
識率の低下やモアレ縞等の画像の劣化が発生することな
く、高い文字認識率かつ高画質の画像データを入出力す
ることが可能になる。また、本発明の文書読み取り装置
によれば、写真領域の検出を行う際、領域識別され、一
旦、入力処理された画像データを、さらに広い領域につ
いて領域識別を行い、これにより得られた画像データを
写真領域の画像データとして出力するようにすること
で、二度の画像入力を行うことによって生じる文書自体
のずれや、写真領域の位置の変化等に影響されることな
く、正しく認識された写真領域の画像データを入出力す
ることができるようになる。According to the document reading apparatus of the present invention, the means for inputting an image at the time of capturing as image data is changed according to each area such as a character, a figure, or a photograph recorded on the recording medium. As a result, it is possible to input and output image data having a high character recognition rate and high image quality without a reduction in the recognition rate of the character area or deterioration of the image such as moire fringes. Further, according to the document reading apparatus of the present invention, when detecting a photographic area, the image data that has been subjected to area identification and once subjected to input processing is subjected to area identification for a wider area, and image data obtained by this Is output as the image data of the photo area, the correctly recognized photo is not affected by the deviation of the document itself caused by inputting the image twice and the change of the position of the photo area. It becomes possible to input and output the image data of the area.

【０００８】[0008]

【実施例】以下に、本発明の第一の実施例を図面を参照
して詳細に説明する。図１は、本発明の文書読み取り装
置の実施例をあらわすブロック図であり、図２は、本発
明の文書読み取り装置の第一の実施例をあらわすフロー
チャートである。画像入力部１１は、読み取りの対象の
文書上を走査し、当該文書上に記録されている文字、図
形あるいは写真等を光電変換により画像信号に変換し、
さらにこの画像信号を二値の画像データに変換する。こ
のとき、画像信号の二値の画像データへの変換は、単純
二値あるいは疑似多値変換である。加えて、画像入力部
１１は画像データの画像メモリ１２への転送も行う。そ
して、画像メモリ１２は、画像入力部１１より転送され
てきた画像データを格納しておくメモリであり、後述す
る領域識別部１３、文字認識部１４、結果出力部１５か
らアクセス可能である。領域識別部１３は、画像メモリ
１２内に格納されている画像データを文字、図形あるい
は写真等といったそれぞれの領域ごとに識別し、これら
の領域識別結果を画像入力部１１、文字認識部１４およ
び結果出力部１５へと転送する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A first embodiment of the present invention will be described below in detail with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of a document reading device of the present invention, and FIG. 2 is a flow chart showing a first embodiment of the document reading device of the present invention. The image input unit 11 scans a document to be read, converts characters, figures, photographs, etc. recorded on the document into an image signal by photoelectric conversion,
Further, this image signal is converted into binary image data. At this time, the conversion of the image signal into binary image data is simple binary or pseudo multi-level conversion. In addition, the image input unit 11 also transfers image data to the image memory 12. The image memory 12 is a memory for storing the image data transferred from the image input unit 11, and can be accessed by the area identification unit 13, the character recognition unit 14, and the result output unit 15, which will be described later. The area identification unit 13 identifies the image data stored in the image memory 12 for each area such as a character, a figure, or a photograph, and the area identification result is obtained by the image input unit 11, the character recognition unit 14, and the result. Transfer to the output unit 15.

【０００９】次に、文字認識部１４は、画像メモリ１２
内に格納されている画像データのうち、領域識別部１３
における識別処理による領域識別結果（この場合は、文
字領域）に対応する画像データから検出した文字を文字
コードとして認識し、この文字コードの結果出力部１５
への転送を行う。この結果、結果出力部１５では、文字
認識部１４から得た文字コードと、画像メモリ１２から
読み出した、領域識別部１３にて識別された領域識別結
果にもとづく図形領域および写真領域それぞれに対応す
る画像データとを任意のフォーマットの文書データに変
換し、出力装置１６へ出力する。ここでいう出力装置１
６とは、図示しないが、結果出力部１５から送られてき
た文書データを記憶しておく磁気ディスク装置や当該文
書データを印字するプリンタ等のことを示している。Next, the character recognizing unit 14 operates the image memory 12
Of the image data stored in the area identification unit 13
A character detected from the image data corresponding to the area identification result (character area in this case) by the identification processing in step S1 is recognized as a character code, and the result output unit 15 of this character code is recognized.
Transfer to. As a result, the result output unit 15 corresponds to each of the graphic area and the photograph area based on the character code obtained from the character recognition unit 14 and the area identification result read out from the image memory 12 and identified by the area identifying unit 13. The image data and the document data are converted into document data of an arbitrary format and output to the output device 16. Output device 1 here
Although not shown, 6 indicates a magnetic disk device for storing the document data sent from the result output unit 15, a printer for printing the document data, and the like.

【００１０】そして、図形データ読み出し部１８は、領
域識別部１３から転送された領域識別結果のうちの図形
領域に対応する画像データを画像メモリ１２からの読み
出しを行い、写真データ入力部１９は、領域識別部１３
から転送された領域識別結果のうちの写真領域をデジタ
ル変換することによって得た画像データの入力を行う。
また、制御部１７は、文書読み取り装置全体の各動作を
制御している。Then, the graphic data reading section 18 reads out from the image memory 12 the image data corresponding to the graphic area of the area identification result transferred from the area identifying section 13, and the photographic data input section 19 Area identification unit 13
The image data obtained by digitally converting the photographic area of the area identification result transferred from is input.
The control unit 17 also controls each operation of the entire document reading apparatus.

【００１１】つづいて、以上のように構成される文書読
み取り装置における第１の実施例を示す処理動作につい
て説明する。図３は、読み取った文書の一頁全体をあら
わす画像例であり、図４は、領域識別部による領域識別
結果をあらわす説明図である。まず最初に、画像入力を
行う（ステップＳ１）。具体的には、画像入力部１１に
よって、今回読み取りの対象である文書上に記録されて
いる一頁全体のすべての文字、図形あるいは写真等を、
光電変換により画像信号に変換し、さらに、この画像信
号を単純二値化変換により画像データを作成し、この画
像データを画像メモリ１２に格納しておく。このときの
文書一頁全体の画像例が、図３に示すとおりである。画
像入力が終了すると、領域識別部１３により、ステップ
Ｓ１で画像メモリに格納した画像データを図４に示すよ
うな文字領域４１、図形領域４２あるいは写真領域４３
に識別する領域識別処理を行う（ステップＳ２）。この
領域識別処理は、例えば、画像データの周辺分布ヒスト
グラム（幾何的な特徴をみる）を利用する方法等を用い
て行われる。その後、領域識別部１３は、領域識別結果
を画像入力部１１、文字認識部１２あるいは結果出力部
１５へそれぞれ転送する。Next, the processing operation of the first embodiment of the document reading apparatus configured as described above will be described. FIG. 3 is an example of an image showing the entire one page of the read document, and FIG. 4 is an explanatory diagram showing the area identification result by the area identification unit. First, an image is input (step S1). Specifically, by the image input unit 11, all characters, figures, photographs, etc. of the entire one page recorded on the document to be read this time,
It is converted into an image signal by photoelectric conversion, further, image data is created by simple binarization conversion of this image signal, and this image data is stored in the image memory 12. An example of an image of the entire page of the document at this time is as shown in FIG. When the image input is completed, the area identification unit 13 causes the image data stored in the image memory in step S1 to display the character area 41, the graphic area 42 or the photograph area 43 as shown in FIG.
A region identification process for identifying is performed (step S2). This area identification processing is performed using, for example, a method using a peripheral distribution histogram of image data (see geometrical characteristics). After that, the area identification unit 13 transfers the area identification result to the image input unit 11, the character recognition unit 12, or the result output unit 15, respectively.

【００１２】次に、文字認識部１４による文字認識処理
を行う（ステップＳ３）。文字認識部１４では、領域識
別部１３から転送された領域識別結果のうちの文字領域
４１に対応する画像メモリ１２内に格納された画像デー
タの中から行を検出し、さらに、当該行から文字を検出
する。そして、いま検出した文字一文字毎の画像データ
について再度画像メモリ１２を参照し、この一文字毎の
画像データを文字認識することで文字コードを得るので
ある。これらの文字コードは、全て結果出力部１５へと
転送される。つづいて、領域識別部１３から転送された
領域識別結果のうちの図形領域４２に対応する画像デー
タを画像メモリ１２から、図形データ読み出し部１８に
おいて読み出す（ステップＳ４）。その後、画像入力部
１１において、先にステップＳ１で画像入力した今回読
み取り対象である文書について、領域識別部１３から転
送された領域識別結果のうちの写真領域４３を疑似多値
変換することによって得た画像データを写真データ入力
部１９から入力し（ステップＳ５）、さらに、これらの
画像データ（図形、写真）を画像メモリ１２を介して結
果出力部１５へ転送する。Next, character recognition processing is performed by the character recognition unit 14 (step S3). The character recognition unit 14 detects a line from the image data stored in the image memory 12 corresponding to the character area 41 of the area identification result transferred from the area identification unit 13, and further detects a character from the line. To detect. Then, the character code is obtained by referring to the image memory 12 again for the detected image data of each character, and recognizing the image data of each character. All of these character codes are transferred to the result output unit 15. Subsequently, the image data corresponding to the graphic area 42 in the area identification result transferred from the area identification unit 13 is read from the image memory 12 by the graphic data reading unit 18 (step S4). Then, the image input unit 11 obtains the document to be read this time, which has been image-inputted in step S1, by subjecting the photographic region 43 of the region identification result transferred from the region identification unit 13 to pseudo multi-value conversion. The obtained image data is input from the photograph data input unit 19 (step S5), and further, these image data (graphics, photographs) are transferred to the result output unit 15 via the image memory 12.

【００１３】最後に、結果出力部１５では、文字領域４
１についてはステップＳ３によって得られた文字コード
を、そして、図形領域４２、写真領域４３についてはそ
れぞれステップＳ４、ステップＳ５によって得られた画
像データを任意の文書フォーマットの文書データに変換
し、この文書データを出力装置１６へ出力することで処
理を終了する（ステップＳ６）。このように、文字領域
４１および図形領域４２については、単純二値変換によ
って画像入力し、写真領域４３については、疑似多値変
換によって画像入力するようにしたことで、高い文字認
識率および図形領域における高画質を保ったまま、写真
領域においても高画質の画像データを出力することがで
きる。Finally, in the result output section 15, the character area 4
The character code obtained in step S3 for 1 and the image data obtained in steps S4 and S5 for the graphic area 42 and the photograph area 43 are converted into document data of an arbitrary document format, The process ends by outputting the data to the output device 16 (step S6). As described above, the character region 41 and the graphic region 42 are image-inputted by the simple binary conversion, and the photograph region 43 is image-inputted by the pseudo multi-valued conversion. It is possible to output high-quality image data even in the photographic area while maintaining the high image quality in the above.

【００１４】次に、本発明の第二の実施例を図面を用い
て詳細に説明する。第二の実施例を説明するにあたり、
本発明の文書読み取り装置をあらわすブロック図に示す
構成・動作等、第一の実施例の説明と重複する箇所につ
いては、説明を省略することにする。図５は、本発明の
文書読み取り装置の第二の実施例をあらわすフローチャ
ートであり、図６は、写真領域の拡大領域をあらわす説
明図であり、図７は、拡大された写真領域の画像データ
をあらわす説明図である。先に説明した、図２のステッ
プＳ４の図形領域４２に対応する画像データの読み出し
が終了後、画像入力部１１により、写真領域より広い領
域を疑似多値変換により画像入力を行う（ステップＳ５
１）。具体的には、ステップＳ１で画像入力した読み取
り対象文書について、領域識別部１３における領域識別
結果である写真領域６１より広い拡大領域６２（図６参
照）を、疑似多値変換により、図７に示す画像データ７
２として入力しているのである。当然、この画像データ
７２も画像メモリ１２へ格納される。Next, a second embodiment of the present invention will be described in detail with reference to the drawings. In explaining the second embodiment,
The description of the portions, such as the configuration and operation shown in the block diagram showing the document reading apparatus of the present invention, which are the same as those in the first embodiment, will be omitted. FIG. 5 is a flowchart showing a second embodiment of the document reading apparatus of the present invention, FIG. 6 is an explanatory view showing an enlarged area of the photograph area, and FIG. 7 is image data of the enlarged photograph area. It is an explanatory view showing. After the reading of the image data corresponding to the graphic area 42 in step S4 of FIG. 2 described above is completed, the image input unit 11 inputs an image by a pseudo multi-value conversion in an area wider than the photograph area (step S5).
1). Specifically, with respect to the reading target document image-inputted in step S1, an enlarged area 62 (see FIG. 6) wider than the photograph area 61, which is the area identification result in the area identification unit 13, is displayed in FIG. Image data 7
It is input as 2. Of course, this image data 72 is also stored in the image memory 12.

【００１５】ここで、拡大領域の求め方の一例を紹介す
る。図６に示す拡大領域６２は、Ｘ方向（画像入力部１
１の主走査方向：左右方向）に、ｗ＝Ｗ／３０（Ｗは文書全体の画像のＸ方向：頁
幅）のｗ分だけ左右両方向に拡大し、Ｙ方向（画像入力部１
１の副走査方向：上下方向）に、ｈ＝Ｈ／３０（Ｈは文書全体の画像のＹ方向：頁高
さ）のｈ分だけ上下両方向に拡大した領域をあらわしてい
る。そして、前記ステップＳ５１で示す拡大領域６２の
画像入力が終了すると、領域識別部１３により、画像メ
モリ１２内に格納されている画像データ７２（図７参
照）から写真領域７１を識別する領域識別処理を行い
（ステップＳ５２）、ここでの識別処理結果を結果出力
部１５へ転送する。これを受けた結果出力部１５では、
領域識別部１３から送られた領域識別結果である写真領
域７１に対応する画像データを画像メモリ１２から読み
出し（ステップＳ５３）、以下第一の実施例と同様、ス
テップＳ５３にて読み出した画像データを任意の文書フ
ォーマットの文書データに変換し、この文書データを出
力装置１６へ出力することで全ての処理を終了する。Here, an example of how to obtain the enlarged area will be introduced. The enlarged region 62 shown in FIG. 6 is in the X direction (image input unit 1
1 in the main scanning direction: left / right direction, w = W / 30 (W is the X direction of the image of the entire document: page width) is enlarged in the left and right directions by w, and the Y direction (image input unit 1
1 in the sub-scanning direction: up and down), h = H / 30 (H is Y direction of the image of the entire document: page height) and an area enlarged in both up and down directions by h is shown. Then, when the image input of the enlarged area 62 shown in step S51 is completed, the area identifying unit 13 identifies the photograph area 71 from the image data 72 (see FIG. 7) stored in the image memory 12. Is performed (step S52), and the identification processing result here is transferred to the result output unit 15. Upon receiving this, the result output unit 15
The image data corresponding to the photographic area 71, which is the area identification result sent from the area identification unit 13, is read from the image memory 12 (step S53), and the image data read in step S53 is read as in the first embodiment. All the processing is completed by converting the document data into an arbitrary document format and outputting the document data to the output device 16.

【００１６】このように、本発明の第２の実施例では、
写真領域の検出を行う際に、一旦、疑似多値変換して入
力した画像データを、再び領域識別部１３にて、さらに
広い領域の識別を行っている。これにより、一回目の画
像入力（ステップＳ１：単純二値変換による画像入力）
と二回目の画像入力（ステップＳ５１：疑似多値変換に
よる画像入力）を実施することによって生じる文書自体
のずれや、また、一回目と二回目の光電変換時に発生す
る画像データの変化による写真領域の位置の変化等に影
響されることなく、正しく認識された写真領域の画像デ
ータを入力することができる。最後に、上記第一の実施
例、第二の実施例では、画像データを入力する手段とし
て、疑似多値変換を例に説明してきたが、本発明は、こ
れに限られるものではなく、モノクロ多値変換やカラー
による再入力等であっても有効である。Thus, in the second embodiment of the present invention,
When the photographic area is detected, the image data once subjected to pseudo multi-value conversion is input to the area identifying unit 13 again to identify a wider area. As a result, the first image input (step S1: image input by simple binary conversion)
And a second image input (step S51: image input by pseudo multi-value conversion), which causes a deviation of the document itself, and a photographic area due to a change in image data generated at the first and second photoelectric conversions. It is possible to input correctly recognized image data of a photographic area without being affected by a change in the position of the. Lastly, in the first and second embodiments, the pseudo multi-value conversion has been described as an example of the means for inputting the image data, but the present invention is not limited to this, and the monochrome It is effective even for multi-value conversion and re-input by color.

【００１７】[0017]

【発明の効果】以上説明したように、本発明の文書読み
取り装置によれば、記録媒体上に記録された文字、図
形、写真等といったおのおのの領域に応じて、画像デー
タとして取り込む際の画像入力の手段を変えるようにし
たことで、文字領域の認識率の低下やモアレ縞等の画像
の劣化が発生することなく、高い文字認識率かつ高画質
の画像データを出力することができるようになった。ま
た、本発明の文書読み取り装置によれば、写真領域の検
出を行う際、領域識別され、一旦、入力処理された画像
データを、さらに広い領域について領域識別を行い、こ
れにより得られた画像データを写真領域の画像データと
して出力するようにしたので、二度の画像入力を行うこ
とによって生じる文書自体のずれや、写真領域の位置の
変化等に影響されることなく、正しく認識された写真領
域の画像データを入出力することができるようになっ
た。As described above, according to the document reading apparatus of the present invention, an image is input when capturing it as image data according to each area such as a character, a figure, or a photograph recorded on a recording medium. By changing the means, it becomes possible to output image data with high character recognition rate and high image quality without deterioration of recognition rate of character areas and deterioration of images such as moire fringes. It was Further, according to the document reading apparatus of the present invention, when detecting a photographic area, the image data that has been subjected to area identification and once subjected to input processing is subjected to area identification for a wider area, and image data obtained by this Since it is output as image data of the photo area, the correctly recognized photo area is not affected by the deviation of the document itself caused by inputting the image twice and the change of the position of the photo area. It became possible to input and output the image data of.

[Brief description of drawings]

【図１】本発明の文書読み取り装置の実施例をあらわす
ブロック図である。FIG. 1 is a block diagram showing an embodiment of a document reading device of the present invention.

【図２】本発明の文書読み取り装置の第一の実施例をあ
らわすフローチャートである。FIG. 2 is a flow chart showing a first embodiment of the document reading device of the present invention.

【図３】読み取った文書の一頁全体をあらわす画像例で
ある。FIG. 3 is an example of an image showing an entire page of a read document.

【図４】領域識別部による領域識別結果をあらわす説明
図である。FIG. 4 is an explanatory diagram showing an area identification result by an area identification unit.

【図５】本発明の文書読み取り装置の第二の実施例をあ
らわすフローチャートである。FIG. 5 is a flowchart showing a second embodiment of the document reading device of the present invention.

【図６】写真領域の拡大領域をあらわす説明図である。FIG. 6 is an explanatory diagram showing an enlarged area of a photograph area.

【図７】拡大された写真領域の画像データをあらわす説
明図である。FIG. 7 is an explanatory diagram showing image data of an enlarged photographic area.

[Explanation of symbols]

１１画像入力部１２画像メモリ１３領域識別部１４文字認識部１５結果出力部１６出力装置１７制御部 11 Image Input Section 12 Image Memory 13 Area Identification Section 14 Character Recognition Section 15 Result Output Section 16 Output Device 17 Control Section

Claims

[Claims]

1. An image input means for scanning a character, a figure, a photograph or the like recorded on a recording medium to convert the image signal into an image signal, and further digitally converting the image signal, and an image obtained by the processing of the image input means. An image memory for storing data, an area identifying means for identifying each area of characters, figures, photographs, etc. from the image data stored in the image memory, and the area identifying means. A character recognizing unit that obtains a character code by cutting out a line and a character in the line from the image data in the image memory corresponding to a character region in the region identification result, and recognizing the image data of each character just cut out. And graphic data reading means for reading, as graphic data, the image data in the image memory corresponding to a graphic area in the area identification result. Of the area identification result, the image data in the image memory corresponding to the photograph area is digitally converted into photograph data by the image input means, and the photograph data input means for inputting the photograph data, the character code And a output unit for outputting the graphic data and the photo data as document data.

2. The photographic data input means inputs image data obtained by digitally converting an area wider than the photographic area identified by the area identifying means by the image input means, and stores the image data in the image memory. 2. The document according to claim 1, wherein the document data is stored, and the photo region in the image data is identified by the region identification means, and the image data corresponding to the photo region identified here is taken as the photo data. Reader.