JP2005260597A

JP2005260597A - Digital camera and program

Info

Publication number: JP2005260597A
Application number: JP2004069595A
Authority: JP
Inventors: Takeshi Endo; 剛遠藤
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2004-03-11
Filing date: 2004-03-11
Publication date: 2005-09-22
Anticipated expiration: 2024-03-11
Also published as: JP4315024B2

Abstract

<P>PROBLEM TO BE SOLVED: To save on work of a user by accurately determining image quality in document electronization using a digital camera. <P>SOLUTION: A control part 110 of a digital camera 100 segments a character part to specify a character size and specifies an edge part of the character part. The control part 110 calculates a rate of gradational change in the edge part and specifies the readable level of the character on the basis of the calculated rate of gradational change and the character size. Further, the control part 110 determines whether the document is illegible according to a relation of the specified readable level to a prescribed threshold. The control part 110 controls an outputting part 140 to output prescribed notification information corresponding to a determination result, notifying the user of the notification information. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、デジタルカメラに関し、特に、文書の電子化に好適なデジタルカメラに関する。 The present invention relates to a digital camera, and more particularly to a digital camera suitable for digitizing a document.

従来、用紙などに記された文書を電子化するには、フラットベッドスキャナなどで読み込むことが一般的であったが、近年、高画素のデジタルカメラが低価格で入手できるようになったことにより、スキャナやメモの代わりとしてデジタルカメラが利用されつつある。撮影した画像をデジタルデータとして記録することができるデジタルカメラの特徴を活かし、用紙に印刷されている文書や、街中の看板などに記されている文字などをデジタルカメラで撮影することで、文書情報を手軽に電子化することができる。また、フラットベッドスキャナなどと異なり、デジタルカメラは携帯性にすぐれているため、対象とする文書の形態にかかわらず、いつでもどこでも文書の電子化を実施することができる。また、デジタルデータで記録することができるので、所定の文字認識処理などにより、利便性の高いテキストデータへの変換なども容易におこなうことができる。 Conventionally, in order to digitize documents on paper etc., it was common to read them with a flatbed scanner, etc., but in recent years, high-pixel digital cameras have become available at low prices. Digital cameras are being used instead of scanners and memos. Taking advantage of the features of a digital camera that can record the captured image as digital data, document information can be obtained by shooting a document printed on paper or characters on a signboard in the city with a digital camera. Can be easily digitized. In addition, unlike a flatbed scanner or the like, a digital camera is excellent in portability, so that it is possible to digitize a document anytime and anywhere regardless of the form of the target document. Further, since it can be recorded as digital data, it can be easily converted into convenient text data by a predetermined character recognition process or the like.

しかしながら、対象との距離などが常に一定であるフラットベッドスキャナなどとは異なり、デジタルカメラによる撮影では、撮影条件が撮影毎に異なるため、文字認識処理で必要とされる画像品質を常に得ることは困難である。特に、用紙に印刷されている文書を撮影する場合はマクロ撮影となることが多く、手ブレやピンボケなどが生じやすい。手ブレやピンボケなどが発生している画像は画像品質が低く、文字認識処理の認識精度を低下させる結果となる。 However, unlike flatbed scanners where the distance to the target is always constant, shooting with digital cameras has different shooting conditions for each shooting, so it is always possible to obtain the image quality required for character recognition processing. Have difficulty. In particular, when shooting a document printed on paper, macro shooting is often performed, and camera shake and blurring tend to occur. An image in which camera shake, blurring, or the like occurs has low image quality, resulting in a decrease in recognition accuracy of the character recognition process.

したがって、デジタルカメラを用いて文書情報を記録する場合には、従来、（１）同じ撮影対象を複数枚撮影し、最も画像品質のよいものを撮影後に選択したり、あるいは、（２）撮影毎に画面で画像品質を確認し、画像品質が悪い場合には撮り直しする、などの作業によって対応していた。（１）の方法では、画像品質の低い画像データまでも記録しておかなければならないため、メモリ領域を無駄に占有してしまい、本当に必要な画像の記録に支障をきたすことがある。また、デジタルカメラに備えられている画面（液晶表示装置など）は解像度が低いことが多く、画像品質を確認するためには拡大表示やスクロールをおこなわなければならない。したがって、（２）の方法では確認のための作業時間が長くなってしまい、撮影効率を低下させてしまう。また、いずれの方法もユーザ（撮影者）がおこなう作業であるため、ユーザの労力を要し、負担となっていた。 Therefore, when document information is recorded using a digital camera, conventionally, (1) a plurality of images of the same object to be imaged are selected and the best image quality is selected after shooting, or (2) each image is captured. In other cases, the image quality was confirmed on the screen, and if the image quality was poor, it was retaken. In the method (1), even image data with low image quality must be recorded, so that the memory area is occupied unnecessarily, which may hinder the recording of a really necessary image. In addition, a screen (such as a liquid crystal display device) provided in a digital camera often has a low resolution, and an enlarged display or scrolling must be performed in order to check the image quality. Therefore, in the method (2), the work time for confirmation becomes long and the photographing efficiency is lowered. In addition, since both methods are operations performed by the user (photographer), the user's labor is required and is a burden.

所定の画像品質であるか否かを画像処理によって自動的に判断する技術（例えば、特許文献１、２）も提案されているが、従来の手法では、判断基準が「文字の大きさ」のみ、あるいは、「画像のブレ」のみ、となっている。このような方法では、例えば、画像のブレはあっても文字が十分大きい場合、画像品質が低いと判断されて記録されない場合がある。すなわち、文字が十分大きい場合には、ブレがあっても文字認識に十分な画像品質となる場合があるにもかかわらず「画像品質が低い」と判断されてしまう。一方、画像のブレはなくとも文字が小さい場合、文字認識に必要な画像品質とならない場合があるが、ブレのみに基づいて「画像品質が高い」と判断され、画像品質の低い画像が記録されてしまう場合もある。すなわち、上記従来技術によっては、画像品質を正確に判別することができず、その結果、有用な画像が撮影できたにもかかわらず撮り直しを強いられたり、画像品質の低い画像を記録してしまうなどの不都合が生じ、効率的ではなかった。
特開平１０−２４７２２０号公報特開平０８−０６３５４７号公報 Techniques (for example, Patent Documents 1 and 2) for automatically determining whether or not a predetermined image quality is obtained by image processing have also been proposed, but in the conventional method, the determination criterion is only “character size”. Or, only “image blurring” is available. In such a method, for example, if the characters are sufficiently large even though the image is blurred, it may be determined that the image quality is low and not recorded. That is, when the character is sufficiently large, it is determined that “image quality is low” even though there is a case where the image quality is sufficient for character recognition even if there is a blur. On the other hand, if the characters are small even if there is no image blurring, the image quality required for character recognition may not be achieved, but based on the blurring only, the image quality is judged to be high and an image with low image quality is recorded. There is also a case. That is, depending on the prior art, the image quality cannot be accurately determined. As a result, even though a useful image can be captured, it is forced to re-shoot, or an image with low image quality is recorded. Inconveniences such as endlessness occurred, and it was not efficient.
JP-A-10-247220 Japanese Patent Laid-Open No. 08-063547

本発明は上記実状に鑑みてなされたもので、文書撮影をおこなう場合に画像品質を正確に判定して通知することができるデジタルカメラおよびプログラムを提供することを目的とする。 The present invention has been made in view of the above circumstances, and an object of the present invention is to provide a digital camera and a program capable of accurately determining and notifying image quality when photographing a document.

上記目的を達成するため、本発明に係るデジタルカメラは、
入射光をデジタルデータに変換して画像データを取得するデジタルカメラにおいて、
前記画像データの特徴を検出する特徴検出手段と、
前記特徴検出手段が検出した特徴に基づいて、当該画像データに示される文字の可読性レベルを判定する判定手段と、
前記判定手段が判定した可読性レベルに基づいて、所定の動作を実行する実行手段と、を備え、
前記特徴検出手段は、前記画像データにおける階調変化率を検出する階調検出手段をさらに備え、
前記判定手段は、前記階調検出手段が検出した階調変化率に基づいて前記文字の可読性レベルを判定し、
前記実行手段は、前記判定手段が判定した可読性レベルに基づいて、所定の通知情報を出力する出力手段をさらに備える、
ことを特徴とする。 In order to achieve the above object, a digital camera according to the present invention provides:
In a digital camera that acquires image data by converting incident light into digital data,
Feature detection means for detecting the features of the image data;
Determination means for determining the readability level of the character indicated in the image data based on the feature detected by the feature detection means;
Execution means for executing a predetermined operation based on the readability level determined by the determination means,
The feature detection means further comprises a gradation detection means for detecting a gradation change rate in the image data,
The determination means determines the readability level of the character based on the gradation change rate detected by the gradation detection means,
The execution unit further includes an output unit that outputs predetermined notification information based on the readability level determined by the determination unit.
It is characterized by that.

上記デジタルカメラは、
前記画像データに示される文字のエッジを検出するエッジ検出手段をさらに備えていることが望ましく、この場合、
前記階調検出手段は、前記エッジ検出手段が検出したエッジ部分の階調変化率を検出することが望ましい。 The above digital camera
It is desirable to further comprise an edge detection means for detecting the edge of the character shown in the image data, in this case,
The gradation detecting means preferably detects the gradation change rate of the edge portion detected by the edge detecting means.

上記デジタルカメラにおいて、
前記特徴検出手段は、前記文字のサイズを特定する文字サイズ特定手段をさらに備えていることが望ましく、この場合、
前記判定手段は、前記階調検出手段が検出した階調変化率と、前記文字サイズ特定手段が特定した文字サイズとに基づいて、当該文字の可読性レベルを判定することが望ましい。 In the above digital camera,
The feature detection unit preferably further includes a character size specifying unit that specifies the size of the character. In this case,
The determination unit preferably determines the readability level of the character based on the gradation change rate detected by the gradation detection unit and the character size specified by the character size specification unit.

上記デジタルカメラは、
所定の閾値と前記判定手段が判定した可読性レベルとに基づいて、前記文字の判読可否を判別する判別手段をさらに備えていることが望ましく、この場合、
前記出力手段は、前記判別手段によって判読不可と判別された場合に、その旨を示す通知情報を出力することが望ましい。 The above digital camera
It is desirable to further include a determination unit that determines whether or not the character can be read based on a predetermined threshold and the readability level determined by the determination unit.
Preferably, the output means outputs notification information indicating that when it is determined by the determination means that reading is impossible.

上記デジタルカメラにおいて、
前記判定手段は、前記画像データ上の複数箇所で可読性レベルを判定し、
前記判別手段は、前記複数箇所で判読可否を判別し、
前記出力手段は、前記複数箇所のいずれかにおいて判読不可と判別された場合に、前記通知情報を出力してもよい。 In the above digital camera,
The determination means determines a readability level at a plurality of locations on the image data,
The determination means determines whether or not the plurality of places can be read,
The output means may output the notification information when it is determined that reading is impossible at any of the plurality of locations.

上記デジタルカメラは、
前記特徴検出手段が検出する特徴を示す情報と可読性レベルとが対応付けられた判定情報を記憶する記憶手段をさらに備えていることが望ましく、この場合、
前記判定手段は、前記記憶手段に記憶された判定情報に基づいて、可読性レベルを判定することが望ましい。 The above digital camera
It is desirable to further comprise storage means for storing determination information in which the information indicating the feature detected by the feature detection means is associated with the readability level.
It is desirable that the determination unit determines a readability level based on determination information stored in the storage unit.

上記デジタルカメラは、
ユーザからの指示を入力する入力手段をさらに備えていることが望ましく、この場合、
前記記憶手段は、前記入力手段に入力されたユーザの指示に応じて、前記判定情報を更新してもよい。 The above digital camera
It is desirable to further include an input means for inputting an instruction from the user.
The storage unit may update the determination information in accordance with a user instruction input to the input unit.

また、
前記文字サイズ特定手段は、前記入力手段に入力されたユーザの指示に応じて、文字サイズを特定してもよい。 Also,
The character size specifying unit may specify a character size in accordance with a user instruction input to the input unit.

上記デジタルカメラにおいて、
前記階調検出手段は、前記画像データにおける階調変化率の分布を検出する階調分布検出手段をさらに備えていてもよく、この場合、
前記判定手段は、前記階調分布検出手段が検出した階調変化率の分布に基づいて、前記文字の可読性レベルを判定してもよい。 In the above digital camera,
The gradation detection means may further comprise gradation distribution detection means for detecting a distribution of gradation change rate in the image data.
The determination unit may determine the readability level of the character based on a distribution of gradation change rates detected by the gradation distribution detection unit.

この場合、
前記判定手段は、前記階調変化率の分布に基づき、最も出現頻度の高い階調変化率を前記文字における階調変化率として可読性レベルを判定することが望ましい。 in this case,
It is preferable that the determination unit determines the readability level based on the distribution of the gradation change rate, with the gradation change rate having the highest appearance frequency as the gradation change rate in the character.

上記デジタルカメラは、
前記画像データに示される文字領域を抽出する文字領域抽出手段をさらに備えていることが望ましく、この場合、
前記特徴検出手段は、前記文字領域抽出手段が抽出した文字領域内での特徴を検出することが望ましい。 The above digital camera
It is desirable to further comprise a character area extraction means for extracting the character area shown in the image data, in this case,
It is desirable that the feature detection means detects a feature in the character area extracted by the character area extraction means.

上記目的を達成するため、本発明の第２の観点にかかるプログラムは、
入射光をデジタルデータに変換して画像データを取得するデジタルカメラを制御するコンピュータに、
取得した画像データに示される文字を検出するステップと、
少なくとも検出された文字における階調変化率を検出するステップと、
検出された文字のサイズを特定するステップと、
検出された階調変化率および／または特定された文字サイズに基づいて、前記文字の可読性レベルを判定するステップと、
判定された可読性レベルに応じた所定の通知情報を出力するステップと、
を実行させることを特徴とする。 In order to achieve the above object, a program according to the second aspect of the present invention is:
To a computer that controls a digital camera that converts incident light into digital data and acquires image data,
Detecting characters shown in the acquired image data;
Detecting a gradation change rate in at least the detected character;
Identifying the size of the detected characters;
Determining a readability level of the character based on the detected gradation change rate and / or the specified character size;
Outputting predetermined notification information according to the determined readability level;
Is executed.

本発明によれば、撮影した画像中の階調変化率に基づいて、当該画像に示される文字の可読性を判定して通知するので、画像品質を正確に判定して通知することができる。この結果、撮影時の省力化や撮影効率の向上を図ることができるとともに、メモリ領域の浪費などを防止することができる。 According to the present invention, since the readability of the character shown in the image is determined and notified based on the gradation change rate in the captured image, the image quality can be accurately determined and notified. As a result, it is possible to save labor at the time of shooting and improve shooting efficiency, and to prevent the memory area from being wasted.

本発明にかかる実施形態を、以下図面を参照して説明する。本実施形態では、ＣＣＤ（Charge Coupled Device：電荷結合素子）などの撮像素子を用いて画像データを取得するデジタルカメラ（撮像装置）によって文字や文書を撮影する場合（以下、「文書撮影」とする）を例に以下説明する。本実施形態において「文書撮影」とは、例えば、用紙や看板などに記されている文字をデジタルカメラで撮影し、スキャナなどによる取り込みやメモなどの代用として文字情報を記録することをいうものとする。また、文書撮影を目的として取得された画像を「文書画像」とする。このような文書画像は所定の文字認識処理に供せられることでテキストデータなどに変換される。すなわち、デジタルカメラで文書を撮影することで、文書情報の電子化を図るものである。 Embodiments according to the present invention will be described below with reference to the drawings. In this embodiment, when a character or document is photographed by a digital camera (imaging device) that acquires image data using an imaging device such as a CCD (Charge Coupled Device) (hereinafter referred to as “document photographing”). ) As an example. In this embodiment, “document shooting” refers to, for example, shooting characters written on paper or a signboard with a digital camera and recording character information as a substitute for taking in by a scanner or a memo. To do. In addition, an image acquired for the purpose of taking a document is referred to as a “document image”. Such a document image is converted into text data by being subjected to a predetermined character recognition process. That is, document information is digitized by photographing a document with a digital camera.

本実施形態に係るデジタルカメラの構成を図１、図２を参照して説明する。図１は本実施形態にかかるデジタルカメラ１００のシステム構成（内部構成）を示すブロック図であり、図２はデジタルカメラ１００の外観例を示す図（図２（ａ）は正面図、図２（ｂ）は背面図）である。図示するように、本実施形態にかかるデジタルカメラ１００は、制御部１１０と、撮像部１２０と、入力部１３０、出力部１４０と、画像記録部１５０と、記憶部１６０と、を備える。 The configuration of the digital camera according to this embodiment will be described with reference to FIGS. FIG. 1 is a block diagram showing a system configuration (internal configuration) of a digital camera 100 according to the present embodiment, FIG. 2 is a diagram showing an example of the appearance of the digital camera 100 (FIG. 2A is a front view, FIG. b) is a rear view). As illustrated, the digital camera 100 according to the present embodiment includes a control unit 110, an imaging unit 120, an input unit 130, an output unit 140, an image recording unit 150, and a storage unit 160.

制御部１１０は、例えば、ＣＰＵ（Central Processing Unit：中央演算処理装置）などから構成され、デジタルカメラ１００の各部を制御する。ここで、制御部１１０は、記憶部１６０に格納された動作プログラムを実行することで後述する各処理が実現される。なお、制御部１１０は、動作時に必要なデータや動作プログラムなどを展開（ロード）するための記憶領域を備えているものとする。この記憶領域（以下「ワークエリア」とする）は、例えば、レジスタやキャッシュメモリ、および、ＲＡＭ（Random Access Memory）などから構成される。 The control unit 110 includes, for example, a CPU (Central Processing Unit) and controls each unit of the digital camera 100. Here, the control part 110 implement | achieves each process mentioned later by running the operation | movement program stored in the memory | storage part 160. FIG. Note that the control unit 110 includes a storage area for developing (loading) data and operation programs necessary for operation. This storage area (hereinafter referred to as “work area”) is configured by, for example, a register, a cache memory, and a RAM (Random Access Memory).

撮像部１２０は、制御部１１０の制御によって撮像動作をおこなうものであり、本実施形態では、図２に示すように、レンズユニット１２１、測距・測光部１２２、撮像素子１２３、などを備える。 The imaging unit 120 performs an imaging operation under the control of the control unit 110. In the present embodiment, as illustrated in FIG. 2, the imaging unit 120 includes a lens unit 121, a distance measurement / photometry unit 122, an imaging element 123, and the like.

レンズユニット１２１は、レンズ群や絞り羽根などから構成される光学的部材や、光学的部材を駆動する駆動部などから構成される。駆動部は、制御部１１０の制御により光学的部材を駆動する。例えば、設定された絞りとなるよう絞り羽根を駆動する。また、レンズユニット１２１がＡＦ（Auto Focus：オートフォーカス）機能を備えている場合は、撮影対象に合焦するよう光学的部材を駆動し、ズーム機能を備えている場合には、ユーザのズーム操作に応じて光学的部材を駆動する。 The lens unit 121 includes an optical member including a lens group and diaphragm blades, and a drive unit that drives the optical member. The drive unit drives the optical member under the control of the control unit 110. For example, the diaphragm blades are driven to achieve the set diaphragm. In addition, when the lens unit 121 has an AF (Auto Focus) function, the optical member is driven to focus on an object to be photographed. The optical member is driven accordingly.

測距・測光部１２２は、例えば、赤外線照射部や距離センサ、および、光センサなどから構成され、デジタルカメラ１００から撮影対象（被写体）までの距離や撮影空間の光量などを測定する。 The distance measurement / photometry unit 122 includes, for example, an infrared irradiation unit, a distance sensor, and an optical sensor, and measures a distance from the digital camera 100 to a shooting target (subject), a light amount in a shooting space, and the like.

撮像素子１２３は、例えば、ＣＣＤなどから構成され、レンズユニット１２１を介して入光した可視光（入射光）を光電変換により電荷に変換して蓄積する。ここで撮像素子１２３は、所定のＡＤ変換回路を備えるものとし、変換された電荷の電荷量に基づいてデジタルデータに変換する。すなわち、変換されたデジタルデータは、入射光から得られる「画像データ」を構成することとなる。 The image pickup device 123 is configured by, for example, a CCD, and converts visible light (incident light) incident through the lens unit 121 into electric charge by photoelectric conversion and accumulates it. Here, the imaging device 123 is provided with a predetermined AD conversion circuit, and converts it into digital data based on the amount of the converted charge. That is, the converted digital data constitutes “image data” obtained from incident light.

入力部１３０は、ユーザからの指示を受け付けるボタンなどから構成され、ユーザによって操作される。入力部１３０は、操作に応じた所定の信号を制御部１１０に送出することで、ユーザの指示を制御部１１０に入力する。本実施形態では、入力部１３０は、少なくとも、シャッタボタン１３１と操作入力部１３２とを備える。 The input unit 130 includes buttons that accept instructions from the user and is operated by the user. The input unit 130 inputs a user instruction to the control unit 110 by sending a predetermined signal corresponding to the operation to the control unit 110. In the present embodiment, the input unit 130 includes at least a shutter button 131 and an operation input unit 132.

シャッタボタン１３１は、ユーザによる押下に応じて上下方向に可動するボタンなどから構成され、ユーザに押下されることによって、撮影開始などを指示する信号（以下、「シャッタ信号」とする）を制御部１１０に送出する。なお、シャッタボタン１３１は、いわゆる「半押し」状態と「全押し」状態の２つの状態に応じて異なる信号を制御部１１０に送出するものとする。ここで「半押し」とはシャッタボタン１３１の可動ストロークの半ばまで押された状態をいい、「全押し」とはシャッタボタン１３１が完全に押し込まれた状態をいうものとする。 The shutter button 131 includes a button that can be moved up and down in response to a press by the user, and a signal that instructs the start of shooting (hereinafter referred to as a “shutter signal”) when pressed by the user. 110. Note that the shutter button 131 sends different signals to the control unit 110 depending on two states, a so-called “half-pressed” state and a “full-pressed” state. Here, “half-pressed” refers to a state where the shutter button 131 is pressed halfway, and “full-pressed” refers to a state where the shutter button 131 is fully pressed.

通常のカメラと同様、本実施形態にかかるデジタルカメラ１００においても、「半押し」で測距・測光およびピント合わせ（合焦）など（以下、「撮像準備動作」とする）をおこない、「全押し」で撮像・記録（以下、「撮像動作」とする）をおこなう。したがって、シャッタボタン１３１が半押しされたときには、その旨を示すシャッタ信号（以下、「半押し信号」とする）が制御部１１０に送出され、制御部１１０はこれに応じて合焦動作の開始などを実行する。また、シャッタボタン１３１が全押しされたときには、その旨を示すシャッタ信号（以下、「全押し信号」とする）が制御部１１０に送出され、制御部１１０は撮像・記録動作の開始などを実行する。すなわち、制御部１１０は、シャッタボタン１３１から送出された信号の受信を契機に撮像のための各動作を開始する。 As with a normal camera, the digital camera 100 according to this embodiment performs distance measurement / photometry and focusing (focusing) (hereinafter referred to as “imaging preparation operation”) by “half-pressing”. “Press” to capture and record (hereinafter referred to as “imaging operation”). Therefore, when the shutter button 131 is half-pressed, a shutter signal indicating that effect (hereinafter referred to as “half-press signal”) is sent to the control unit 110, and the control unit 110 starts the focusing operation accordingly. And so on. When the shutter button 131 is fully pressed, a shutter signal indicating that effect (hereinafter referred to as a “full press signal”) is sent to the control unit 110, and the control unit 110 executes the start of an imaging / recording operation. To do. That is, the control unit 110 starts each operation for imaging upon reception of a signal transmitted from the shutter button 131.

操作入力部１３２は、例えば、所定の操作ボタンやダイヤル、カーソルキー、などから構成され、ユーザの操作により、種々の設定を変更する指示などが入力される。本実施形態においては、撮影モードをはじめとする各種モードの変更・設定に用いられる他、文書画像における文字の判読可否の基準となる可読性レベル（詳細後述）の設定に用いられる。また、本実施形態では、撮影モードとして「文書撮影モード」が用意されるものとし、デジタルカメラ１００で文書撮影をおこなう場合は、操作入力部１３２を操作して文書撮影モードにする。 The operation input unit 132 includes, for example, predetermined operation buttons, dials, cursor keys, and the like, and instructions for changing various settings are input by user operations. In the present embodiment, it is used for changing / setting various modes including a photographing mode, and for setting a readability level (details will be described later) as a reference for determining whether or not characters can be read in a document image. In this embodiment, it is assumed that “document shooting mode” is prepared as the shooting mode. When the digital camera 100 performs shooting of a document, the operation input unit 132 is operated to set the document shooting mode.

出力部１４０は、ユーザに対して通知される種々の情報を出力するものであり、本実施形態では、画像を出力する表示部１４１と、音声を出力する報音部１４２と、点灯や点滅などの発光をおこなうインジケータ部１４３、を備える。表示部１４１、報音部１４２、インジケータ部１４３は、図２（ｂ）に示すように、デジタルカメラ１００の背面側に構成される。すなわち、デジタルカメラ１００を用いた撮影時に、出力された情報を撮影者（ユーザ）に伝達可能な位置に構成されることが望ましい。 The output unit 140 outputs various types of information notified to the user. In the present embodiment, the display unit 141 that outputs an image, the sound report unit 142 that outputs sound, lighting, flashing, and the like. The indicator part 143 which performs this light emission is provided. The display unit 141, the sound report unit 142, and the indicator unit 143 are configured on the back side of the digital camera 100 as shown in FIG. In other words, it is desirable that the information output at the time of photographing using the digital camera 100 be configured to be transmitted to the photographer (user).

表示部１４１は、例えば、液晶表示装置などから構成され、画像情報を表示出力する。本実施形態では、撮像素子１２３によって得られた画像や画像記録部１５０に記録された画像を表示する他、設定変更をおこなうためのメニュー画面、および、設定状態やユーザに対する警告を示す情報（例えば、文字やアイコン）などが表示される。本実施形態では、可読性レベルに応じたユーザへの警告や案内を示す画像や文字情報が表示される。なお、表示部１４１がタッチパネルなどから構成される場合には、操作入力部１３２と同等の入力装置として機能させてもよい。 The display unit 141 is composed of, for example, a liquid crystal display device, and displays and outputs image information. In the present embodiment, in addition to displaying an image obtained by the image sensor 123 and an image recorded in the image recording unit 150, a menu screen for changing the setting, and information indicating a setting state and a warning to the user (for example, , Characters and icons) are displayed. In the present embodiment, an image or character information indicating a warning or guidance to the user according to the readability level is displayed. Note that when the display unit 141 includes a touch panel or the like, the display unit 141 may function as an input device equivalent to the operation input unit 132.

報音部１４２は、例えば、スピーカなどから構成され、所定の音声情報を出力する。ここでは、デジタルカメラ１００による撮影時の各種動作（例えば、合焦、シャッタ押下）に応じた音声（アラーム音など）が出力される他、本実施形態では、可読性レベルに応じたユーザへの警告や案内を示す音声などが出力される。 The sound report unit 142 includes, for example, a speaker and outputs predetermined sound information. Here, sound (alarm sound, etc.) corresponding to various operations (for example, focusing, shutter pressing) at the time of shooting by the digital camera 100 is output, and in this embodiment, a warning to the user according to the readability level is provided. And voices indicating guidance are output.

インジケータ部１４３は、例えば、ＬＥＤ（Light Emitting Diode：発光ダイオード）などから構成され、デジタルカメラ１００による撮影時の各種動作（例えば、合焦、シャッタ押下）に応じて所定の色や明滅パターンで発光される他、本実施形態では、可読性レベルに応じたユーザへの警告時などにも発光する。 The indicator unit 143 includes, for example, an LED (Light Emitting Diode) and emits light with a predetermined color or blinking pattern according to various operations (for example, focusing, shutter pressing) at the time of shooting with the digital camera 100. In addition, in the present embodiment, light is emitted also at the time of warning to the user according to the readability level.

なお、可読性レベルに応じた警告や案内の出力は、表示部１４１、報音部１４２、インジケータ部１４３のすべてによって行われてもよく、あるいは、これらのいずれかによって行われてもよい。また、いずれの出力装置を用いるかはユーザによって任意に選択・設定可能であるものとする。 Note that the output of the warning or guidance according to the readability level may be performed by all of the display unit 141, the sound report unit 142, and the indicator unit 143, or may be performed by any of these. It is assumed that which output device is used can be arbitrarily selected and set by the user.

画像記録部１５０は、例えば、フラッシュメモリなどの記憶装置を備えるメモリカードなどから構成され、撮像素子１２３のＡＤ変換回路によって変換されたデジタルデータを記録することで、撮像部１２０の撮像動作によって得られた画像データを記録する。 The image recording unit 150 is configured by, for example, a memory card including a storage device such as a flash memory, and is obtained by an imaging operation of the imaging unit 120 by recording digital data converted by the AD conversion circuit of the imaging element 123. The recorded image data is recorded.

記憶部１６０は、例えば、フラッシュメモリなどの記憶装置から構成され、制御部１１０が実行するプログラムや、各処理に必要なデータ（以下、「処理データ」とする）などを記憶する。ここでは、撮像準備動作および撮像動作のための各種制御を実行するプログラム（以下、「撮像プログラム」とする）の他、本実施形態では、撮像素子１２３によって得られた画像データに基づいた特徴を検出し、画像データに示される文字の可読性を判定するためのプログラムが格納される。より詳細には、以下のようなプログラムが格納される。
（１）「文字検出プログラム」：画像データに文字が示されているかを判別し、文字領域を抽出するプログラム
（２）「エッジ検出プログラム」：画像データに示される文字のエッジを検出するプログラム
（３）「文字サイズ特定プログラム」：画像データに示されている文字のサイズを特定するプログラム
（４）「階調変化率検出プログラム」：画像データ上の階調変化率を検出するプログラム
（５）「可読性レベル判定プログラム」：画像データに示されている文字のサイズや階調変化率に基づいて、文字の可読性レベル（詳細後述）を判定するプログラム
（６）「判読可否判定プログラム」：判定された可読性レベルに基づいて、画像データに示されている文字が判読可能か否かを判定するプログラム
（７）「通知情報出力プログラム」：判定された可読性レベルに基づいて所定の通知情報を出力するプログラム The storage unit 160 includes, for example, a storage device such as a flash memory, and stores a program executed by the control unit 110, data necessary for each process (hereinafter referred to as “process data”), and the like. Here, in addition to a program for executing various controls for an imaging preparation operation and an imaging operation (hereinafter referred to as an “imaging program”), in this embodiment, features based on image data obtained by the imaging element 123 are included. A program for detecting and determining the readability of the characters shown in the image data is stored. More specifically, the following program is stored.
(1) “Character detection program”: a program for determining whether or not a character is shown in image data and extracting a character area (2) “Edge detection program”: a program for detecting an edge of a character indicated in image data ( 3) “Character size identification program”: a program that identifies the size of the character indicated in the image data (4) “Gradation change rate detection program”: a program that detects the gradation change rate in the image data (5) “Readability level determination program”: a program for determining a character readability level (details will be described later) based on the character size and gradation change rate indicated in the image data (6) “readability determination program”: determined (7) “Notification information output program for determining whether or not the characters shown in the image data are legible based on the readability level : Program that outputs a predetermined notification information based on the readability levels determined

このようなプログラムを制御部１１０が実行することで、制御部１１０は以下のような機能を実現する。
（１）「撮像機能」：撮像部１２０を動作させ、レンズユニット１２１から入光した入射光に基づく画像データを取得する機能
（２）「文字検出機能」：取得された画像データに示される文字部分を検出する機能
（３）「特徴検出機能」：取得された画像データの特徴を検出する機能
（４）「可読性レベル判定機能」：検出された画像データの特徴に基づいて、画像データに示される文字の可読性レベル（詳細後述）を判定する機能
（５）「判読可否判定機能」：判定された可読性レベルに基づいて、画像データに示される文字が判読可能か否かを判定する機能
（６）「出力機能」：判定された可読性レベルに応じて、ユーザへの通知情報を出力する機能 When the control unit 110 executes such a program, the control unit 110 realizes the following functions.
(1) “Imaging function”: a function for operating the imaging unit 120 and acquiring image data based on incident light incident from the lens unit 121. (2) “Character detecting function”: a character indicated in the acquired image data. Function for detecting part (3) “Feature detection function”: Function for detecting feature of acquired image data (4) “Readability level determination function”: indicated in image data based on detected feature of image data (5) “Readability determination function”: a function for determining whether or not the characters shown in the image data can be read based on the determined readability level (6) ) "Output function": A function to output notification information to the user according to the determined readability level

上記「特徴検出機能」は、画像データ上での階調変化率などを、当該画像データの「特徴」として検出する。また「出力機能」により、ユーザが撮像しようとしている文字の可読性レベルに応じた所定の「通知情報」（音声、画像、など）が出力部１４０から出力される。 The “feature detection function” detects a gradation change rate on image data as a “feature” of the image data. In addition, by the “output function”, predetermined “notification information” (sound, image, etc.) corresponding to the readability level of the character that the user is about to capture is output from the output unit 140.

なお、本実施形態では、制御部１１０がプログラムを実行することによるソフトウェア処理で上記各機能を実現するが、例えば、これらの各機能をそれぞれ専門的に処理する回路等（いわゆるＡＳＩＣ（Application Specific Integrated Circuit））をデジタルカメラ１００に構成することにより、ハードウェア処理によって上記各機能が実現されてもよい。 In the present embodiment, each function is realized by software processing by the control unit 110 executing a program. For example, a circuit that specially processes each function (so-called ASIC (Application Specific Integrated)). By configuring the circuit)) in the digital camera 100, the above functions may be realized by hardware processing.

また、記憶部１６０には、「処理データ」として、例えば、適正露出値（絞り値）やシャッタスピードなどの組み合わせなどといった撮影時に必要となる諸設定を示す情報（以下、「撮影パラメータ」という）などが記憶される他、本実施形態では、撮像された画像データに示される文字の可読性レベル（画像品質）を判定するための「可読性レベル判定テーブル」が記憶される。この「可読性レベル判定テーブル」の例を図３（ａ）に示す。 In the storage unit 160, information indicating various settings necessary for shooting such as a combination of an appropriate exposure value (aperture value), a shutter speed, and the like (hereinafter referred to as “shooting parameter”) as “process data”. In this embodiment, a “readability level determination table” for determining the readability level (image quality) of characters indicated in the captured image data is stored. An example of this “readability level determination table” is shown in FIG.

図示するように、「可読性レベル判定テーブル」には、「文字サイズ」と「階調変化率」とに応じた「可読性レベル」が対応付けられて記録される。ここで「文字サイズ」とは、撮像素子１２３によって得られた画像データで示される文字のサイズを示すものであり、文字の縦横ドット数に基づいて規定される。すなわち、撮像素子１２３によって得られる画像データは、複数のピクセル（ドット）から構成されるラスタ画像であるため、その画像中に示される文字を構成するドット数によって文字サイズを規定する。本実施形態では、各文字の横方向ドット数を用いることとする。 As shown in the drawing, in the “readability level determination table”, “readability level” corresponding to “character size” and “gradation change rate” is recorded in association with each other. Here, the “character size” indicates the size of the character indicated by the image data obtained by the image sensor 123, and is defined based on the number of vertical and horizontal dots of the character. That is, since the image data obtained by the image sensor 123 is a raster image composed of a plurality of pixels (dots), the character size is defined by the number of dots constituting the character shown in the image. In the present embodiment, the number of horizontal dots of each character is used.

また、文字サイズを示すドット数の範囲によって、文字サイズを「Ａ」、「Ｂ」、「Ｃ」、「Ｄ」、「Ｅ」の５段階のレベル（以下、「文字サイズレベル」とする）に区分する。本実施形態では、ドット数範囲が「〜８」の場合を「レベルＡ」とし、以下、「レベルＢ」＝「９〜１６」、「レベルＣ」＝「１７〜３２」、「レベルＤ」＝「３３〜６４」、「レベルＥ」＝「６５〜」、と区分する。すなわち、各レベルによって文字サイズの大小関係を表すと「Ａ＜Ｂ＜Ｃ＜Ｄ＜Ｅ」となる。なお、この区分数やドット数範囲の設定は任意であり、ユーザによって設定可能であってもよい。また、文字サイズを規定することができるのであれば任意の単位を採用してもよい。例えば、各文字の縦方向ドット数を用いてもよく、あるいは、横方向ドット数と縦方向ドット数の和もしくは積を用いてもよい。 Further, depending on the range of the number of dots indicating the character size, the character size is classified into five levels of “A”, “B”, “C”, “D”, and “E” (hereinafter referred to as “character size level”). Divide into In the present embodiment, the case where the dot number range is “˜8” is referred to as “level A”, and “level B” = “9-16”, “level C” = “17-32”, “level D” hereinafter. = “33 to 64” and “Level E” = “65 to”. In other words, the size relationship between the character sizes is expressed by “A <B <C <D <E”. Note that the setting of the number of divisions and the range of the number of dots is arbitrary and may be set by the user. Further, any unit may be adopted as long as the character size can be defined. For example, the number of vertical dots of each character may be used, or the sum or product of the number of horizontal dots and the number of vertical dots may be used.

また、「階調変化率」とは、画像のボケ具合を判定する指標となるものであり、「階調変化量／ドット数」で表す。本実施形態では、デジタルカメラ１００によって文字を撮像する文書撮影をおこなうが、撮影時にピントが合っていない場合（いわゆる「ピンボケ」）や手ブレが生じた場合、撮影した画像が不鮮明になり文字部分の判読が困難となってしまう。すなわち、ピンボケや手ブレの場合には、画像がぼやけたり、ぶれたりするが、このとき、画像上の文字のエッジ部分（文字部分と背景との境界部分）などにいわゆる「ボケ足」が発生する。本実施形態では、この「ボケ足」の程度（画像品質）を示す指標として「階調変化率」を用いる。 The “gradation change rate” is an index for determining the degree of blurring of an image, and is represented by “gradation change amount / number of dots”. In the present embodiment, the digital camera 100 shoots a document that captures characters, but if the image is out of focus (so-called “out-of-focus”) or camera shake occurs, the captured image becomes unclear and the character portion Is difficult to interpret. In other words, in the case of out-of-focus or camera shake, the image may be blurred or blurred. At this time, so-called “blurred feet” occur at the edge of the character on the image (the boundary between the character and the background). To do. In the present embodiment, “gradation change rate” is used as an index indicating the degree of “blurred foot” (image quality).

例えば、図４（ａ）に示すような白地に黒で表される文字（「Ｃ」）を画像表示する場合、ピンボケや手ブレがまったくない状態での文字のエッジ部（図中の矩形領域ＲＡ）を拡大すると、図４（ｂ）に示すように、「黒」のピクセル（ドット）でエッジ部が構成され、ボケ足が生じていない。一方、ピンボケや手ブレが生じると、図４（ｃ）に示すように、下地の白と文字の黒との間の中間階調のピクセルでエッジ部が構成される。この中間階調ピクセルがボケ足となり、輪郭がぼやけた文字となって判読性が低くなる。そして、ボケ足の幅が広い程、判読性もより低くなる。 For example, when a character (“C”) expressed in black on a white background as shown in FIG. 4A is displayed as an image, the edge portion of the character (rectangular region in the figure) without any blur or camera shake. When (RA) is enlarged, as shown in FIG. 4 (b), the edge portion is composed of “black” pixels (dots), and no blurring occurs. On the other hand, when out-of-focus or camera shake occurs, as shown in FIG. 4C, an edge portion is formed by pixels of intermediate gradation between the white background and the black character. This halftone pixel becomes blurred and becomes a character with a blurred outline, resulting in poor legibility. And the wider the blur foot, the lower the legibility.

このような場合の階調変化を図５を参照して説明する。ここでは、上記のような文字「Ｃ」を２５６階調（「黒」の階調を「０」、「白」の階調を「２５５」とする）で画像表示した場合の１次元方向ピクセル（例えば、図５（ａ）の「線分Ｘ−Ｘ’」上のピクセル）の階調変化をみる。 The gradation change in such a case will be described with reference to FIG. Here, the one-dimensional pixel when the character “C” as described above is displayed as an image with 256 gradations (the gradation of “black” is “0” and the gradation of “white” is “255”). A change in gradation is observed (for example, a pixel on “line segment XX ′” in FIG. 5A).

まず、ピンボケや手ブレが全くない状態での階調変化を図５（ｂ）に示す。ここでは、横軸に線分Ｘ−Ｘ’上のドット（ピクセル）をとり、縦軸に各ドットの階調をとる（図５（ｃ）においても同様）。ボケ足が生じていない場合、エッジ部での階調変化は、背景の「２５５」から文字部の「０」に急激に変化する。すなわち、文字のエッジ部においては、背景のピクセルから文字部分方向に１ドットずれると、２５６階調変化する（階調変化量）。この場合の階調変化率は、「２５６（階調変化量）／１（ドット数）＝２５６」となる。すなわち、ピンボケや手ブレが全く生じていない状態での階調変化率は、背景と文字部分との間の階調差と一致することになる。 First, FIG. 5B shows a gradation change in a state where there is no blurring or camera shake. Here, the horizontal axis represents dots (pixels) on the line segment X-X ′, and the vertical axis represents the gradation of each dot (the same applies to FIG. 5C). When no blurring occurs, the gradation change at the edge portion changes abruptly from “255” in the background to “0” in the character portion. That is, at the edge portion of the character, when one dot is shifted from the background pixel in the direction of the character portion, 256 gradation changes (gradation change amount). The gradation change rate in this case is “256 (tone change amount) / 1 (number of dots) = 256”. That is, the gradation change rate in a state where there is no out-of-focus or camera shake coincides with the gradation difference between the background and the character portion.

一方、エッジ部にボケ足が生じている場合は、図５（ｃ）に示すように、背景の白から文字の黒までなだらかに階調が変化する。すなわち、背景の白と文字の黒との間に中間階調のピクセルが存在してボケ足となる（図５（ｃ）においてハッチングで示す部分）。ここで、中間階調を構成するピクセルの数（ドット数）が「ボケ足の幅」ということになる。例えば、中間階調ピクセルのドット数が「８」である場合の階調変化率は「２５６／８＝３２」となる。また、ドット数が「３２」である場合の階調変化率は「２５６／３２＝８」となる。 On the other hand, when the blur is generated at the edge portion, as shown in FIG. 5C, the gradation gradually changes from white in the background to black in the character. In other words, there is an intermediate gradation pixel between the white background and the black character, resulting in blurring (the hatched portion in FIG. 5C). Here, the number of pixels (number of dots) constituting the intermediate gradation is the “blurred foot width”. For example, the gradation change rate when the number of dots of the intermediate gradation pixel is “8” is “256/8 = 32”. Further, the gradation change rate when the number of dots is “32” is “256/32 = 8”.

すなわち、本実施形態における「階調変化率」は、背景と文字部との階調差を最大値とし、ボケ足の幅が広くなるほど減少する数値である。したがって、階調変化率の値の大小を相対的にみることにより、当該画像のボケ具合を判定することができる。つまり、階調変化率の値が小さいほど画像のボケ具合が大きいことを示す。なお、上述したように、文字の判読性の良否にはボケ足の幅が影響するため、ボケ足のドット数のみからでも画像のボケ具合をある程度みることができる。しかし、撮影対象によっては、背景と文字部分との階調差が小さい場合（例えば、背景や文字がカラーである場合など）もあり、さらに、一つの画像内に、階調差が大きい部分と小さい部分とが混在する場合もある（例えば、背景や文字部分が複数色の色彩で構成されている場合など）。このような場合においても、画像全体のボケ具合を適正に判定するため、ボケ足のドット数と階調変化量（階調差）との関係から導き出す階調変化率を用いる。 That is, the “gradation change rate” in the present embodiment is a numerical value that decreases with an increase in the width of the blurred foot, with the gradation difference between the background and the character portion being the maximum value. Therefore, it is possible to determine the degree of blur of the image by relatively viewing the magnitude of the gradation change rate. That is, the smaller the gradation change rate value, the greater the degree of image blur. As described above, since the width of the blur foot affects the quality of the legibility of the characters, the degree of blur of the image can be seen to some extent only from the number of dots of the blur foot. However, there are cases where the gradation difference between the background and the character portion is small (for example, when the background or characters are in color) depending on the object to be photographed. There may be a case where a small part is mixed (for example, a case where a background or a character part is composed of a plurality of colors). Even in such a case, the gradation change rate derived from the relationship between the number of dots on the blurred foot and the gradation change amount (gradation difference) is used in order to appropriately determine the degree of blur of the entire image.

図３（ａ）に示す「可読性レベル判定テーブル」では、階調変化率を示す数値の範囲に応じて、階調変化率を「Ａ’」、「Ｂ’」、「Ｃ’」、「Ｄ’」、「Ｅ’」の５段階のレベル（以下、「階調変化率レベル」とする）に区分する。階調変化率レベルは、階調変化率の所定の数値範囲ごとに設定される。本実施形態では、数値範囲が「〜５」の場合を「レベルＡ’」とし、以下、「レベルＢ’」＝「６〜１０」、「レベルＣ’」＝「１１〜２０」、「レベルＤ’」＝「２１〜４０」、「レベルＥ’」＝「４１〜」、と区分する。すなわち、各レベルによって階調変化率の大小関係を表すと「Ａ’＜Ｂ’＜Ｃ’＜Ｄ’＜Ｅ’」となる。なお、「階調変化率」は、画像上のドット数を要素とするため、デジタルカメラ１００の性能（例えば、解像度や画素数など。以下、「画像性能」とする）が影響する。したがって、「可読性レベル判定テーブル」には、デジタルカメラ１００の画像性能などに応じて各段階を規定する数値範囲が設定されるものとする。ただし、区分数や数値範囲の設定は任意であり、ユーザによって設定可能であってもよい。 In the “readability level determination table” shown in FIG. 3A, the gradation change rates are set to “A ′”, “B ′”, “C ′”, “D” according to the range of numerical values indicating the gradation change rate. It is divided into five levels of “′” and “E” (hereinafter referred to as “gradation change rate level”). The gradation change rate level is set for each predetermined numerical range of the gradation change rate. In the present embodiment, the case where the numerical value range is “˜5” is “level A ′”, and hereinafter, “level B ′” = “6-10”, “level C ′” = “11-20”, “level” D ′ ”=“ 21-40 ”and“ level E ′ ”=“ 41- ”. That is, the level change of the gradation change rate is represented by each level, “A ′ <B ′ <C ′ <D ′ <E ′”. Note that since the “gradation change rate” is based on the number of dots on the image, the performance of the digital camera 100 (for example, the resolution, the number of pixels, etc., hereinafter referred to as “image performance”) is affected. Therefore, a numerical value range that defines each stage is set in the “readability level determination table” according to the image performance of the digital camera 100 and the like. However, the setting of the number of sections and the numerical value range is arbitrary and may be set by the user.

そして「可読性レベル判定テーブル」には、文字サイズと階調変化率とに対応付けられた「可読性レベル」が記録される（図３（ａ））。この「可読性レベル」は、デジタルカメラ１００による文書撮影で取得された文書画像データに示される文字の判読性を示す情報である。本実施形態では、「１」〜「９」の数値によってレベルを示すこととし、数値が小さいほど可読性が低く、数値が大きいほど可読性が高いことを示すものとする。なお、本実施形態における「判読」とは、取得した文書画像を表示等させたときにユーザ（人間）が目視により文字を認識することをいう他、所定の文字認識処理においてコンピュータが文字を認識することをいうものとする。 In the “readability level determination table”, the “readability level” associated with the character size and the gradation change rate is recorded (FIG. 3A). This “readability level” is information indicating the legibility of the characters shown in the document image data acquired by document shooting by the digital camera 100. In the present embodiment, the level is indicated by numerical values “1” to “9”, and the lower the numerical value, the lower the readability, and the higher the numerical value, the higher the readability. Note that “reading” in the present embodiment means that a user (human) recognizes a character by visual observation when the acquired document image is displayed, and the computer recognizes the character in a predetermined character recognition process. It means to do.

ここで、文字の可読性は、文字のサイズと画像のボケ具合によってレベル分けされる。すなわち、文字サイズが小さく、かつ、ボケ具合が大きい（階調変化率の値が小さい場合）ほど可読性は低いため、最も小さい文字サイズを含んだ文字サイズレベルである「文字サイズレベル：Ａ」で、かつ、最も小さい階調変化率を含んだ階調変化率レベルである「階調変化率レベル：Ａ’」（以下、「Ａ−Ａ’レベル」というように表記する。この場合、ハイフンの前が文字サイズレベルを示し、ハイフンの後が階調変化率レベルを示す。）に対応する可読性レベルを、最低レベルとなる「１」に設定する。一方、文字サイズが大きく、かつ、ボケ具合が小さい（階調変化率の値が大きい場合）ほど可読性は高いため、最も大きい文字サイズと最も大きい階調変化率を含んでいる「Ｅ−Ｅ’レベル」に対応する可読性レベルを、最高レベルとなる「９」に設定する。 Here, the readability of characters is classified into levels according to the size of characters and the degree of image blur. That is, the smaller the character size and the greater the degree of blur (when the gradation change rate value is smaller), the lower the readability. Therefore, the character size level including the smallest character size is “character size level: A”. The tone change rate level including the smallest tone change rate is expressed as “tone change rate level: A ′” (hereinafter referred to as “AA ′ level”. In this case, the hyphen The readability level corresponding to the character size level before and the tone change rate level after the hyphen is set to “1” which is the lowest level. On the other hand, as the character size is larger and the degree of blur is smaller (when the gradation change rate value is larger), the readability is higher. Therefore, “EE” includes the largest character size and the largest gradation change rate. The readability level corresponding to “level” is set to “9” which is the highest level.

ここで、可読性レベルには所定の閾値を設定する。本実施形態では、「１」〜「９」の中間値となる「５」を閾値とする。この閾値は、文書画像に示される文字が判読可能であるか否かを判定するための基準として用いる。本実施形態では、可読性レベルの数値が閾値未満である場合は判読不可とし、閾値以上である場合は判読可能とする。すなわち、可読性レベル「５」は、文字が判読できる最低限度のレベルであることを示す。このような閾値として設定された可読性レベルがいずれであるかを示す情報が、「可読性レベル判定テーブル」とともに記憶部１６０に記憶されるものとする。 Here, a predetermined threshold is set for the readability level. In the present embodiment, “5” that is an intermediate value between “1” to “9” is set as a threshold value. This threshold value is used as a reference for determining whether or not the characters shown in the document image are legible. In this embodiment, when the numerical value of the readability level is less than the threshold value, it is not readable, and when it is equal to or higher than the threshold value, it is readable. That is, the readability level “5” indicates the minimum level at which characters can be read. Information indicating which readability level is set as such a threshold is stored in the storage unit 160 together with the “readability level determination table”.

このような閾値となる可読性レベル「５」を、例えば、文字サイズレベルおよび階調変化率レベルがともに中位となる「Ｃ−Ｃ’レベル」に設定する。そして、「Ｃ−Ｃ’レベル」を基点として、階調変化率レベルが一段下がるごとに可読性レベルを「１」マイナスし、一段上がるごとに可読性レベルを「１」プラスすることで、「Ｃ−Ａ’レベル」、「Ｃ−Ｂ’レベル」、「Ｃ−Ｃ’レベル」「Ｃ−Ｄ’レベル」「Ｃ−Ｅ’レベル」に対応する可読性レベルが設定される。このようにして設定される可読性レベルは、例えば、「Ｃ−Ａ’レベル：３」、「Ｃ−Ｂ’レベル：４」、「Ｃ−Ｃ’レベル：５」「Ｃ−Ｄ’レベル：６」「Ｃ−Ｅ’レベル：７」、となる（図３（ａ））。 For example, the readability level “5” serving as the threshold is set to “C-C ′ level” in which both the character size level and the gradation change rate level are medium. Then, with the “CC ′ level” as a base point, the readability level is decremented by “1” every time the gradation change rate level is lowered by one level, and the readability level is incremented by “1” every time the gray level change rate level is raised by “C−”. Readability levels corresponding to “A ′ level”, “CB ′ level”, “CC ′ level”, “CD ′ level”, and “CE ′ level” are set. The readability level set in this way is, for example, “CA ′ level: 3”, “CB ′ level: 4”, “CC ′ level: 5”, “CD ′ level: 6”. "CE 'level: 7" (FIG. 3A).

そして、このように設定された各可読性レベルのそれぞれを基点として、文字サイズレベルが一段下がるごとに可読性レベルを「１」マイナスし、一段上がるごとに可読性レベルを「１」プラスすることで、すべての可読性レベルが設定される（図３（ａ））。このような設定とすることで、文字サイズが小さくても、ボケ具合が小さければ、判読可能となる場合があるとともに、ボケ具合が大きくても、文字サイズが大きければ判読可能となる場合もある。すなわち、文字サイズとボケ具合との相対評価によって、判読可否が決定される。したがって、文字サイズのみ、あるいは、ボケ具合のみによって判読可否を決定する場合に比べ、より正確かつ柔軟に判読可否の決定をすることができる。 Then, with each of the readability levels set in this way as a base point, every time the character size level decreases by one step, the readability level is decremented by “1”, and every time the character size level increases by one, the readability level is incremented by “1”. Is set (FIG. 3A). With this setting, even if the character size is small, it may be legible if the degree of blur is small, and even if the degree of blur is large, it may be legible if the text size is large. . That is, the legibility is determined by the relative evaluation of the character size and the degree of blur. Therefore, it is possible to determine whether or not reading can be performed more accurately and flexibly than when determining whether or not reading is possible based on only the character size or only the degree of blur.

なお、このような設定方法は一例であり、任意の方法によって可読性レベルを設定することができる。また、上記のような方法で設定した可読性レベルをデフォルト値とし、ユーザにより適宜変更可能としてもよい。この場合、例えば、ユーザの視力や好みなどに応じて変更してもよい他、文書画像の用途に応じて変更できるようにしてもよい。例えば、文書画像中の文字を、所定の文字認識プログラムなどでテキストデータとして抽出するような場合には、当該文字認識プログラムの認識精度などに応じて可読性レベルを設定してもよい。ユーザによる可読性レベルの設定は、デジタルカメラ１００の操作入力部１３２などを操作することで行われるものとする。また、上記のような文字認識プログラムなどに応じて設定するような場合には、デジタルカメラ１００と当該プログラムとの連携（協働）によって、可読性レベルが自動的に設定されるようにしてもよい。 Such a setting method is an example, and the readability level can be set by any method. Further, the readability level set by the method as described above may be a default value and may be changed as appropriate by the user. In this case, for example, it may be changed according to the user's visual acuity or preference, or may be changed according to the use of the document image. For example, when characters in a document image are extracted as text data by a predetermined character recognition program or the like, the readability level may be set according to the recognition accuracy of the character recognition program. It is assumed that the user's readability level is set by operating the operation input unit 132 of the digital camera 100 or the like. Further, in the case of setting according to the character recognition program as described above, the readability level may be automatically set by cooperation (cooperation) between the digital camera 100 and the program. .

また記憶部１６０には、出力部１４０が出力する通知情報が記憶される。本実施形態では、後述する処理によって判定される可読性レベルや判読可否に応じた通知情報が出力される。したがって、本実施形態では、例えば、判読可否の判定結果や可読性レベルを示す情報などと、出力される通知情報とが対応付けられた、図３（ｂ）に示すような「通知情報テーブル」が記憶部１６０に記憶される。「通知情報」には、所定のメッセージを示すテキストデータが記録される他、必要に応じてアイコンなどの画像情報が記録される。 The storage unit 160 stores notification information output from the output unit 140. In this embodiment, notification information according to the readability level determined by the process described later and the legibility is output. Therefore, in the present embodiment, for example, there is a “notification information table” as shown in FIG. 3B in which information indicating the determination result of readability or information indicating the readability level is associated with the output notification information. It is stored in the storage unit 160. In “notification information”, text data indicating a predetermined message is recorded, and image information such as an icon is recorded as necessary.

また、「通知情報テーブル」には、設定されている通知方式がいずれであるかを示す情報も記録される。ここで「通知方式」とは、通知情報の出力形態であり、デジタルカメラ１００が有する出力部１４０の構成に応じて用意される。本実施形態では、例えば、「メッセージ表示」、「アイコン表示」、「インジケータ発光」、「音声通知」、「アラーム通知」、などが用意される。これらの通知方式は、ユーザによって任意に選択されるものであり、操作入力部１３２などの操作によって、ユーザが所望する通知方式が選択・設定される。各通知方式の詳細は後述する。 In the “notification information table”, information indicating which notification method is set is also recorded. Here, the “notification method” is an output form of notification information, and is prepared according to the configuration of the output unit 140 of the digital camera 100. In this embodiment, for example, “message display”, “icon display”, “indicator emission”, “voice notification”, “alarm notification”, and the like are prepared. These notification methods are arbitrarily selected by the user, and the notification method desired by the user is selected and set by the operation of the operation input unit 132 or the like. Details of each notification method will be described later.

デジタルカメラ１００には、上記の構成の他、デジタルカメラとして必要な構成や機能、および、その他の付加的な構成や機能が、必要に応じて備えられているものとする。 In addition to the above-described configuration, the digital camera 100 is provided with a configuration and functions necessary as a digital camera and other additional configurations and functions as necessary.

次に、上記のように構成されたデジタルカメラ１００の動作を図面を参照して以下説明する。後述する各処理は、制御部１１０が記憶部１６０に格納されている動作プログラムを実行することで実現される。 Next, the operation of the digital camera 100 configured as described above will be described below with reference to the drawings. Each process to be described later is realized by the control unit 110 executing an operation program stored in the storage unit 160.

実施形態にかかる「文書撮影処理」を図６に示すフローチャートを参照して説明する。ここでは、用紙や看板などに記された文字（以下、「文書」とする）をデジタルカメラ１００で撮影し、当該文書の内容を示す画像データとして保存する。保存された画像データは、例えば、所定の文字認識プログラムで処理されることにより、テキストデータなどに変換して抽出することで、従来のメモやスキャナによる取り込みの代用として利用される。このような目的で文書を撮影することを「文書撮影」とし、文書撮影によって得られる画像を「文書画像」とする。この「文書撮影処理」は、撮影モードとして「文書撮影モード」が選択された状態で、制御部１１０がシャッタボタン１３１からシャッタ信号を受信することを契機に開始される。 The “document photographing process” according to the embodiment will be described with reference to the flowchart shown in FIG. Here, characters (hereinafter referred to as “document”) written on a sheet of paper or a signboard are photographed by the digital camera 100 and stored as image data indicating the contents of the document. For example, the stored image data is processed by a predetermined character recognition program to be converted into text data or the like and extracted, thereby being used as a substitute for capturing with a conventional memo or scanner. Shooting a document for such a purpose is referred to as “document shooting”, and an image obtained by document shooting is referred to as a “document image”. This “document shooting process” is started when the control unit 110 receives a shutter signal from the shutter button 131 in a state where the “document shooting mode” is selected as the shooting mode.

シャッタボタン１３１からは、まず半押し信号が制御部１１０に送出される。制御部１１０は、シャッタボタン１３１から半押し信号を受信したことを契機に撮影準備動作を開始する（ステップＳ１０１）。ここでは、測距・測光部１２２による測距・測光動作に基づいて、適正なシャッタスピードや絞りが決定されるとともに、レンズユニット１２１が駆動されて文書に対するピント合わせ（合焦）などが行われる。なお、ユーザがシャッタボタン１３１を半押し状態にしている間、半押し信号がシャッタボタン１３１から制御部１１０に送出され続けるものとする。 First, a half-press signal is sent from the shutter button 131 to the control unit 110. The control unit 110 starts a shooting preparation operation when receiving a half-press signal from the shutter button 131 (step S101). Here, an appropriate shutter speed and aperture are determined based on the distance measurement / photometry operation by the distance measurement / photometry unit 122, and the lens unit 121 is driven to focus (focus) on the document. . It is assumed that the half-press signal continues to be sent from the shutter button 131 to the control unit 110 while the user is pressing the shutter button 131 halfway.

撮影準備動作により文書に合焦されると、撮像素子１２３がレンズユニット１２１を介して入光する入射光を受光し、受光した素子の電荷を、決定されたシャッタスピードなどに応じて蓄積（チャージ）する。撮像素子１２３は、蓄積された電荷を光電変換することにより、電荷量に応じたデジタルデータに変換し、画像データ（文書画像）を取得する（ステップＳ１０２）。ここで取得された画像データに基づく画像は、例えば、表示部１４１に表示される。 When the document is focused by the shooting preparation operation, the image pickup device 123 receives incident light that enters through the lens unit 121, and accumulates (charges) the charge of the received device according to the determined shutter speed or the like. ) The image sensor 123 performs photoelectric conversion on the accumulated charge, thereby converting the accumulated charge into digital data corresponding to the amount of charge, and obtains image data (document image) (step S102). An image based on the image data acquired here is displayed on the display unit 141, for example.

制御部１１０は、ステップＳ１０２で取得した画像データに基づいて、文書画像に示されている文字が判読可能か否かを判定する「可読性判定処理」を実行する（ステップＳ２００）。ここでは、文書画像に示されている文字について、サイズやボケ具合を検出することで、可読性レベル（画像品質）を特定し、特定された可読性レベルに応じて、文書画像として記録される文書の判読が可能か否かを判定する。この「可読性判定処理」（ステップＳ２００）の詳細は後述する。 Based on the image data acquired in step S102, the control unit 110 executes “readability determination processing” for determining whether or not the characters shown in the document image are legible (step S200). Here, the readability level (image quality) is specified by detecting the size and the degree of blurring of the characters shown in the document image, and the document recorded as the document image according to the specified readability level. Judge whether or not interpretation is possible. Details of the “readability determination process” (step S200) will be described later.

制御部１１０は、「可読性判定処理」（ステップＳ２００）の判定結果に応じて、所定の通知情報を出力する「通知情報出力処理」を実行する（ステップＳ５００）。ここでは、「可読性判定処理」（ステップＳ２００）の判定結果に応じた通知情報が出力部１４０から出力される。すなわち、「判読不可」である場合には、その旨を示すとともに、撮影条件の変更を促す情報などを出力し、「判読可能」である場合には、そのまま撮影可能である旨を示す情報を出力する。この「通知情報出力処理」（ステップＳ５００）の詳細は後述する。 The control unit 110 executes “notification information output processing” for outputting predetermined notification information according to the determination result of “readability determination processing” (step S200) (step S500). Here, the notification information corresponding to the determination result of the “readability determination process” (step S200) is output from the output unit 140. In other words, when it is “unreadable”, information indicating that it is also displayed and information that prompts the user to change the shooting conditions is output. When it is “readable”, information indicating that the image can be taken is displayed. Output. Details of the “notification information output process” (step S500) will be described later.

通知情報出力処理によって所定の通知情報を出力すると、制御部１１０は、ユーザによる撮影指示の有無を判別する（ステップＳ１０３）。ここでは、シャッタボタン１３１からの全押し信号の到着の有無により撮影指示を判別する。例えば、ステップＳ５００で出力された通知情報が「判読可能」を示す場合、ユーザは当該文書の撮影を決定し、シャッタボタン１３１を全押しする。この結果、シャッタボタン１３１から全押し信号が制御部１１０に送出される。すなわち、半押し信号から全押し信号に切り替わる。制御部１１０は、シャッタボタン１３１から全押し信号を受信すると、ユーザからの撮影指示であると判別し（ステップＳ１０３：Ｙｅｓ）、ステップＳ１０２で取得された画像データを画像記録部１５０に記録して（ステップＳ１０４）、処理を終了する。 When the predetermined notification information is output by the notification information output process, the control unit 110 determines whether there is a photographing instruction from the user (step S103). Here, the shooting instruction is determined based on whether or not the full-press signal has arrived from the shutter button 131. For example, when the notification information output in step S500 indicates “readable”, the user decides to capture the document and presses the shutter button 131 fully. As a result, a full press signal is sent from the shutter button 131 to the control unit 110. That is, the half-press signal is switched to the full-press signal. When the control unit 110 receives a full-press signal from the shutter button 131, the control unit 110 determines that the instruction is a shooting instruction from the user (step S103: Yes), and records the image data acquired in step S102 in the image recording unit 150. (Step S104), the process ends.

一方、例えば、ステップＳ５００で出力された通知情報が「判読不可」を示す場合、ユーザは、撮影条件を変更するためにシャッタボタン１３１の押下をやめ、半押し状態が解除される。この結果、制御部１１０に送出されていた半押し信号が途絶える。これに基づき、制御部１１０はユーザからの撮影指示がないと判別し（ステップＳ１０３：Ｎｏ）、そのまま処理を終了し、例えば、次の撮影準備動作を待機する。 On the other hand, for example, when the notification information output in step S500 indicates “unreadable”, the user stops pressing the shutter button 131 to change the shooting condition, and the half-pressed state is released. As a result, the half-press signal sent to the control unit 110 is interrupted. Based on this, the control unit 110 determines that there is no shooting instruction from the user (step S103: No), ends the processing as it is, and waits for the next shooting preparation operation, for example.

上記のような「文書撮影処理」によれば、文書撮影の対象としている文書が判読可能であるか否かを文字サイズと階調変化率とに基づいて判定しているので、正確に画像品質を判定して、通知することができる。したがって、ユーザは画像品質の低い画像を無駄に記録をすることがない。この結果、無駄な撮影動作や記憶領域の消費などを防止することができ、撮影時あるいは後処理における省力化や撮影効率の向上を図ることができる。 According to the “document shooting process” as described above, whether or not the document that is the target of document shooting is legible is determined based on the character size and the gradation change rate. Can be determined and notified. Therefore, the user does not wastefully record an image with low image quality. As a result, it is possible to prevent useless shooting operations and consumption of storage areas, and to save labor and improve shooting efficiency during shooting or post-processing.

すなわち、撮影時に何度も撮り直す必要がない他、何枚も撮影した画像から判読可能なものを選択する作業などをする必要がなくなり、ユーザの労力を軽減させることができる。また、デジタルカメラに備えられている表示装置で画像品質を判別することは困難であるが、上記処理では、撮像素子１２３で取得された画像データを用いて、制御部１１０が判読可能か否かを判定して通知するので、正確に画像品質を判定することができる。 That is, it is not necessary to re-shoot many times at the time of shooting, and it is not necessary to select a legible one from the images that have been shot, and the user's labor can be reduced. In addition, although it is difficult to determine the image quality with the display device provided in the digital camera, in the above process, whether or not the control unit 110 can read using the image data acquired by the image sensor 123. Therefore, the image quality can be accurately determined.

本実施形態では、判読可能であるか否かを判定する「可読性判定処理」（ステップＳ２００）において、画像データに示される文書の文字サイズや画像のボケ具合に基づいて判読可能か否かを判定することで、正確かつ適正な判定を行う。この「可読性判定処理」（ステップＳ２００）の詳細を図面を参照して以下説明する。まず、可読性を判定する方法の一例である「可読性判定処理（１）」を図７に示すフローチャートを参照して説明する。 In the present embodiment, in the “readability determination process” (step S200) for determining whether or not it is legible, it is determined whether or not it is legible based on the character size of the document indicated in the image data and the degree of blur of the image. By doing so, an accurate and appropriate determination is made. Details of the “readability determination processing” (step S200) will be described below with reference to the drawings. First, “readability determination processing (1)”, which is an example of a method for determining readability, will be described with reference to a flowchart shown in FIG.

「可読性判定処理（１）」では、取得された文書画像の特徴を検出する「特徴検出処理」（ステップＳ２１０）が実行される。この「特徴検出処理」では、文書画像に示されている文字のサイズと画像中の階調変化率を、当該文書画像の特徴として検出する。「特徴検出処理」の詳細を図８に示すフローチャートを参照して説明する。 In “readability determination processing (1)”, “feature detection processing” (step S210) for detecting the characteristics of the acquired document image is executed. In this “feature detection process”, the character size and the gradation change rate in the image shown in the document image are detected as features of the document image. The details of the “feature detection process” will be described with reference to the flowchart shown in FIG.

制御部１１０は、ステップＳ１０２で取得された画像データから、文字部分の切り出しをおこなう。ここでは、周知技術により文字切り出しをおこなうことができるが、例えば、まず背景色を検出した後、この背景色とは異なる色のピクセルで構成されている部分を文字部分として切り出す。 The control unit 110 cuts out a character portion from the image data acquired in step S102. Here, characters can be cut out by a known technique. For example, after a background color is first detected, a portion composed of pixels having a color different from the background color is cut out as a character portion.

例えば、ステップＳ１０２で取得された画像データが、図９（ａ）に示すような、白地に黒い文字が表されている文書画像である場合、制御部１１０は、例えば、画像の四隅や縁部付近を構成しているピクセルの色に基づいて背景色を「白」と特定した後に、「黒」のピクセルが密集している部分を文字部分として特定する（ステップＳ２１１）。この場合、制御部１１０は、背景色の階調と文字部分の階調を検出し、検出されたそれぞれの階調を示す情報をワークエリアに保持する（ステップＳ２１２）。 For example, when the image data acquired in step S102 is a document image in which black characters are represented on a white background as shown in FIG. 9A, the control unit 110, for example, the four corners or edges of the image After the background color is specified as “white” based on the color of the pixels constituting the vicinity, a portion where “black” pixels are densely specified is specified as a character portion (step S211). In this case, the control unit 110 detects the gradation of the background color and the gradation of the character portion, and holds information indicating each detected gradation in the work area (step S212).

また、制御部１１０は、文字を構成している各ピクセルの位置（座標）を検出し、検出された位置を示す情報（以下、「位置情報」とする）をワークエリアに保持する（ステップＳ２１３）。制御部１１０はさらに、検出した位置情報に基づいて文字の範囲ＣＡを特定し（図９（ｂ））、当該範囲を含む領域を切り出す（ステップＳ２１４、図９（ｃ））。ここでは、文書画像に示される文字の内の１文字を切り出してもよく、複数文字を切り出してもよい。理解を容易にするため、１文字を切り出す場合を例に以下説明する。 In addition, the control unit 110 detects the position (coordinates) of each pixel constituting the character, and holds information indicating the detected position (hereinafter referred to as “position information”) in the work area (step S213). ). Further, the control unit 110 specifies a character range CA based on the detected position information (FIG. 9B), and cuts out a region including the range (step S214, FIG. 9C). Here, one of the characters shown in the document image may be cut out, or a plurality of characters may be cut out. In order to facilitate understanding, an example in which one character is cut out will be described below.

制御部１１０は、切り出した文字のサイズを特定する（ステップＳ２１５）。ここでは、切り出し時に検出した当該文字を構成するピクセルの座標に基づいて、当該文字の横方向ドット数を検出し、検出したドット数を当該文字のサイズとする。制御部１１０は、検出されたドット数を示す情報（以下、「文字サイズ情報」とする）をワークエリアに保持する（ステップＳ２１６）。 The control unit 110 specifies the size of the cut out character (step S215). Here, the number of dots in the horizontal direction of the character is detected based on the coordinates of the pixels constituting the character detected at the time of clipping, and the detected number of dots is set as the size of the character. The control unit 110 holds information indicating the number of detected dots (hereinafter referred to as “character size information”) in the work area (step S216).

次に制御部１１０は、切り出した文字の位置情報に基づいて、当該文字のエッジ部分を検出する（ステップＳ２１７）。ここでは、図５（ａ）に示すＸ−Ｘ’線分のように、切り出された領域から１次元方向にピクセルを抽出し、背景の階調から文字の階調の階調に変化している部分を当該文字のエッジ部分として検出する。すなわち、抽出された１次元方向のピクセルの内、背景色の階調を示すピクセルの隣のピクセルが、当該背景色の階調と異なる階調となっている箇所と、文字色の階調を示すピクセルの隣のピクセルが、当該文字色の階調と異なる階調となっている箇所を特定する。そして、特定された２箇所間のピクセルを当該文字のエッジ部分とする。ここで検出するエッジ部分は１箇所でもよく、複数箇所でもよい。理解を容易にするため、１箇所のエッジ部分を検出する場合を例に以下説明する。 Next, the control unit 110 detects an edge portion of the character based on the position information of the extracted character (step S217). Here, as in the XX ′ line segment shown in FIG. 5A, pixels are extracted in a one-dimensional direction from the cut out region, and the gradation of the background changes to the gradation of the character gradation. Is detected as an edge portion of the character. That is, among the extracted one-dimensional pixels, the pixel adjacent to the pixel indicating the background color gradation has a gradation different from the gradation of the background color, and the character color gradation. A location where a pixel adjacent to the pixel to be shown has a gradation different from the gradation of the character color is specified. And let the pixel between the specified two places be the edge part of the said character. The edge part detected here may be one place or a plurality of places. In order to facilitate understanding, a case where one edge portion is detected will be described below as an example.

制御部１１０は、検出したエッジ部分での階調変化率を求めるために、エッジ部分のドット数と階調変化量を求める。ここでは、ステップＳ２１７のエッジ検出において特定した２箇所間のピクセル数をエッジ部のドット数として検出し、検出したドット数を示す情報（以下、「ドット数情報」とする）をワークエリアに保持する。制御部１１０はまた、背景色の階調と文字色の階調との差分を求め、階調変化量とする。制御部１１０は、求めた階調変化量を示す情報（以下、「階調変化量情報」とする）をワークエリアに保持する（ステップＳ２１８）。 The control unit 110 obtains the number of dots in the edge part and the gradation change amount in order to obtain the gradation change rate at the detected edge part. Here, the number of pixels between the two locations specified in the edge detection in step S217 is detected as the number of dots in the edge portion, and information indicating the detected number of dots (hereinafter referred to as “dot number information”) is held in the work area. To do. The control unit 110 also obtains a difference between the gradation of the background color and the gradation of the character color and sets it as the gradation change amount. The control unit 110 holds information indicating the obtained gradation change amount (hereinafter referred to as “gradation change amount information”) in the work area (step S218).

制御部１１０は、ステップＳ２１８で取得した「ドット数情報」と「階調変化量情報」を用いて、ステップＳ２１７で検出したエッジ部分の階調変化率を求める（ステップＳ２１９）。ここでは、制御部１１０が「階調変化量／ドット数」を演算することで、当該エッジ部での階調変化率を求め、求めた階調変化率を示す情報（以下、「階調変化率情報」とする）をワークエリアに保持する。階調変化率を求めると、制御部１１０は「特徴検出処理」を終了し、図７に示す「可読性判定処理（１）」のフローに戻る。 The control unit 110 uses the “dot number information” and the “gradation change amount information” acquired in step S218 to obtain the gradation change rate of the edge portion detected in step S217 (step S219). Here, the control unit 110 calculates “gradation change amount / number of dots” to obtain a gradation change rate at the edge portion, and information indicating the obtained gradation change rate (hereinafter, “tone change”). Rate information ”) in the work area. When the gradation change rate is obtained, the control unit 110 ends the “feature detection process” and returns to the flow of the “readability determination process (1)” illustrated in FIG.

次に制御部１１０は、「特徴検出処理」での検出結果に応じて、当該文書画像に示される文字の可読性レベルを特定するための「可読性レベル特定処理」を実行する（ステップＳ２３０）。「可読性レベル特定処理」の詳細を、図１０に示すフローチャートを参照して説明する。 Next, the control unit 110 executes a “readability level specifying process” for specifying the readability level of the characters shown in the document image in accordance with the detection result in the “feature detection process” (step S230). The details of the “readability level specifying process” will be described with reference to the flowchart shown in FIG.

制御部１１０は、記憶部１６０の「可読性レベル判定テーブル」（図３（ａ））を参照し、「特徴検出処理」のステップＳ２１６で取得した文字サイズ情報に示される文字サイズに対応する文字サイズレベルを特定する（ステップＳ２３１）。同様に、「特徴検出処理」のステップＳ２１９で取得した階調変化率情報に示される階調変化率に対応する階調変化率レベルを特定する（ステップＳ２３２）。 The control unit 110 refers to the “readability level determination table” (FIG. 3A) in the storage unit 160, and the character size corresponding to the character size indicated in the character size information acquired in step S216 of the “feature detection process”. A level is specified (step S231). Similarly, a gradation change rate level corresponding to the gradation change rate indicated in the gradation change rate information acquired in step S219 of the “feature detection process” is specified (step S232).

制御部１１０は、ステップＳ２３１で特定した文字サイズレベルと、ステップＳ２３２で特定した階調変化率レベルとに対応した可読性レベルを「可読性レベル判定テーブル」で特定する（ステップＳ２３３）。制御部１１０は、可読性レベルを特定すると、特定した可読性レベルを示す情報（以下、「可読性レベル情報」とする）をワークエリアに保持し（ステップＳ２３４）、「可読性レベル特定処理」を終了して、図７に示す「可読性判定処理（１）」のフローに戻る。 The control unit 110 specifies the readability level corresponding to the character size level specified in step S231 and the gradation change rate level specified in step S232 using the “readability level determination table” (step S233). When specifying the readability level, the control unit 110 stores information indicating the specified readability level (hereinafter referred to as “readability level information”) in the work area (step S234), and ends the “readability level specifying process”. Returning to the flow of “readability determination processing (1)” shown in FIG.

制御部１１０は、記憶部１６０に記憶されている「可読性レベル判定テーブル」を参照し、「可読性レベル特定処理」（図１０）のステップＳ２３４で取得した可読性レベル情報に示されている可読性レベルと閾値とを比較することで、文書画像に示されている文字が判別可能か否かを判別する（ステップＳ２５０）。ここでは、「可読性レベル特定処理」で特定された可読性レベルが閾値以上であるか否かを判別する。本実施形態の例では、閾値となる可読性レベルが「５」であるので、特定された可読性レベルに示される数値が「５」以上であるか否かが判別される。 The control unit 110 refers to the “readability level determination table” stored in the storage unit 160, and the readability level indicated in the readability level information acquired in step S234 of “readability level specifying process” (FIG. 10). By comparing with the threshold value, it is determined whether or not the character shown in the document image can be determined (step S250). Here, it is determined whether or not the readability level specified in the “readability level specifying process” is equal to or greater than a threshold value. In the example of the present embodiment, since the readability level serving as the threshold is “5”, it is determined whether or not the numerical value indicated by the specified readability level is “5” or more.

特定された可読性レベルが閾値以上である場合（ステップＳ２５０：Ｙｅｓ）、制御部１１０は、文書画像に示される文字の判読が可能であると判別する（ステップＳ２５１）。一方、特定された可読性レベルが閾値未満である場合（ステップＳ２５０：Ｎｏ）、制御部１１０は、文書画像に示される文字の判読が不可能であると判別する（ステップＳ２５２）。 If the identified readability level is equal to or higher than the threshold (step S250: Yes), the control unit 110 determines that the characters shown in the document image can be read (step S251). On the other hand, when the specified readability level is less than the threshold (step S250: No), the control unit 110 determines that the characters shown in the document image cannot be read (step S252).

制御部１１０は、ステップＳ２５１またはステップＳ２５２での判別結果を示す情報（以下、「判読可否情報」とする）をワークエリアに保持し（ステップＳ２５３）、図６に示す「文書撮影処理」のフローに戻る。 The control unit 110 holds information indicating the determination result in step S251 or step S252 (hereinafter referred to as “readability information”) in the work area (step S253), and the flow of “document photographing process” illustrated in FIG. Return to.

「文書撮影処理」においては、ステップＳ２５３で取得された判読可否情報に示される判別結果に応じて、所定の通知情報が出力される（ステップＳ５００）。ここで、判別結果が「判読可能」であれば、その旨を示す通知情報が出力部１４０から出力され、「判読不可」であれば、撮影条件の変更などを促す通知情報が出力される。 In the “document photographing process”, predetermined notification information is output according to the determination result indicated in the legibility information acquired in step S253 (step S500). Here, if the determination result is “readable”, notification information indicating that is output from the output unit 140, and if it is “unreadable”, notification information that prompts the user to change the shooting condition is output.

上記のような「可読性判定処理（１）」によれば、文書を撮影した文書画像から文字部分を切り出し、切り出した文字のサイズを検出するとともに、文字のエッジ部分の階調変化率を求めることで、文書画像のボケ具合を判定する。そして、検出された文字サイズと階調変化率とに基づいて、当該文書画像の可読性レベルを特定し、特定された可読性レベルと所定の閾値とを比較することで、文書画像に示される文字の判読可否が判定される。 According to the “readability determination processing (1)” as described above, a character portion is cut out from a document image obtained by photographing a document, the size of the cut-out character is detected, and the gradation change rate of the edge portion of the character is obtained. Then, the degree of blur of the document image is determined. Then, based on the detected character size and gradation change rate, the readability level of the document image is specified, and the specified readability level is compared with a predetermined threshold value to thereby determine the character of the character indicated in the document image. Judgment of legibility is made.

上記「可読性判定処理（１）」では、切り出した文字のエッジ部分における階調変化率によって画像のボケ具合を判定しているので、当該文字の判読性を的確に判定することができる。しかし、「可読性判定処理（１）」では、白黒文書などといった、背景と文字部分との階調差が大きい文書（以下、「モノクロ文書」とする）では適切な判定をすることができるが、文書中の文字が複数の色彩で構成されていたり、背景が複数の色彩で構成されている場合（以下、「カラー文書」とする）、場所によって文字と背景色との階調差にバラツキがあるため、特定の文字について判定された可読性レベルのみでは、画像全体で判読可能か否かを正確に判定できない場合がある。あるいは、文書を構成する文字によってサイズが異なるような場合も、特定の文字について判定された可読性レベルのみでは正確に判読可否を判定することができない。すなわち、ある文字についての可読性レベルに基づいて「判読可」と判定されても、当該文字については判読できるが他の文字は判読できない場合が生じうる。このような背景と文字との階調差が異なる場合や複数の異なる文字サイズが混在する場合であっても適切に判読性を判定するための処理例を「可読性判定処理（２）」として図１１に示すフローチャートを参照して以下説明する。 In the “readability determination process (1)”, the degree of blurring of the image is determined based on the gradation change rate at the edge portion of the cut out character, so that the legibility of the character can be determined accurately. However, in the “readability determination process (1)”, an appropriate determination can be made for a document having a large gradation difference between the background and the character portion (hereinafter referred to as “monochrome document”), such as a black and white document. When the characters in the document are composed of multiple colors or the background is composed of multiple colors (hereinafter referred to as “color document”), the gradation difference between the characters and the background color varies depending on the location. For this reason, it may not be possible to accurately determine whether or not the entire image can be read only by the readability level determined for a specific character. Alternatively, even when the size differs depending on the characters constituting the document, it is not possible to accurately determine whether or not the characters can be read only with the readability level determined for the specific characters. That is, even if it is determined as “readable” based on the readability level of a certain character, there may occur a case where the character can be read but other characters cannot be read. A process example for appropriately determining the legibility even when the background and characters have different gradation differences or when a plurality of different character sizes are mixed is shown as “readability determination process (2)”. This will be described below with reference to the flowchart shown in FIG.

この「可読性判定処理（２）」では、図１２（ａ）に示すような、異なる色彩の文字から構成される文書（カラー文書）を撮影して文書画像（カラー文書画像）を取得した場合を例に説明する。また、取得された文書画像に複数の領域を設定し、各領域毎に文字サイズと階調変化率を検出する。ここでは、例えば、図１２（ｂ）に示すような複数（９つ）の領域を文書画像に設定し、各領域に所定のインデックス（ここでは、「１」〜「９」の連続番号とする）を割り当てる。 In this “readability determination process (2)”, a document image (color document image) obtained by photographing a document (color document) composed of characters of different colors as shown in FIG. Explained as an example Also, a plurality of areas are set in the acquired document image, and the character size and gradation change rate are detected for each area. Here, for example, a plurality (nine) areas as shown in FIG. 12B are set in the document image, and predetermined indexes (here, “1” to “9” are assigned consecutive numbers) in each area. ).

制御部１１０はまず、割り当てられている連続番号に基づいて、設定されている領域の１つを選択する（ステップＳ２６１）。すなわち、領域に割り当てられている連続番号を示すカウンタを初期値の「１」に設定し、この値に対応する領域を選択する。 First, the control unit 110 selects one of the set areas based on the assigned serial number (step S261). That is, the counter indicating the serial number assigned to the area is set to the initial value “1”, and the area corresponding to this value is selected.

制御部１１０は、ステップＳ２６１で選択した領域について、上述した「特徴検出処理」（ステップＳ２１０、図８）を実行し（ステップＳ２６２）、「文字の切り出し」、「文字サイズの特定」、「エッジ部の検出」、「階調変化率の検出」、「可読性レベルの判定」、「判読可否の判定」、「判定結果の記録（ワークエリアでの保持）」をおこなう。 The control unit 110 executes the above-described “feature detection process” (step S210, FIG. 8) for the region selected in step S261 (step S262), “character segmentation”, “character size specification”, “edge” Part detection "," detection of gradation change rate "," determination of readability level "," determination of readability ", and" recording of determination result (holding in work area) ".

制御部１１０は、すべての領域について判読可否の判定がされたか否かを判別し（ステップＳ２６３）、未判定領域がある場合（ステップＳ２６３：Ｎｏ）には、カウンタを「＋１」し（ステップＳ２６４）、当該カウンタ値に対応する領域について「特徴検出処理」を実行する（ステップＳ２６２）。 The control unit 110 determines whether or not all areas have been determined to be legible (step S263). If there is an undetermined area (step S263: No), the control unit 110 increments the counter by “+1” (step S264). ) The “feature detection process” is executed for the area corresponding to the counter value (step S262).

一方、すべての領域について判読可否の判定が完了した場合（ステップＳ２６３：Ｙｅｓ）、すべての領域についての判読可否結果を参照し、いずれかの領域において「判読不可」とする判定結果が存在するか否かを判別する（ステップＳ２６５）。 On the other hand, when the determination of readability for all areas is completed (step S263: Yes), the determination result for all areas is referred to, and whether there is a determination result of “unreadable” in any area. It is determined whether or not (step S265).

制御部１１０は、いずれかの領域で「判読不可」となった場合（ステップＳ２６５：Ｙｅｓ）、当該文書画像全体について「判読不可」と判定する（ステップＳ２６）。一方、すべての領域において「判読可」となった場合（ステップＳ２６５：Ｎｏ）、当該文書画像全体について「判読可」と判定して（ステップＳ２６７）、図６に示す「文書撮影処理」のフローに戻る。 When it becomes “illegible” in any region (step S265: Yes), the controller 110 determines that the entire document image is “illegible” (step S26). On the other hand, if “readable” is obtained in all the areas (step S265: No), it is determined that the entire document image is “readable” (step S267), and the flow of “document photographing process” shown in FIG. Return to.

「文書撮影処理」においては、ステップＳ２６６またはＳ２６７の判定結果に基づいて、所定の通知情報が出力される（ステップＳ５００）。 In the “document photographing process”, predetermined notification information is output based on the determination result in step S266 or S267 (step S500).

上記のような「可読性判定処理（２）」によれば、文書画像を複数の領域に分割し、各領域毎に判読可否を判定した上で、画像全体での判読可否を判定するので、文字と背景との階調差が均一でない場合や異なる文字サイズが混在する場合であっても、画像全体の判読可否を正確に判定することができる。 According to the “readability determination process (2)” as described above, the document image is divided into a plurality of areas, and whether or not the entire image is readable is determined after determining whether or not each area is readable. Even when the gradation difference between the image and the background is not uniform or when different character sizes are mixed, it is possible to accurately determine whether or not the entire image can be read.

上記「可読性判定処理（１）」および「可読性判定処理（２）」における文字サイズの特定は、文書画像の文字部分のドット数に基づいておこなったが、ユーザによる任意入力により文字サイズを特定してもよい。例えば、規定サイズ（ＪＩＳ規格の「Ａ４判」など）の用紙に印刷されている文書を撮影することで文書撮影をおこなう場合において、印刷されている文字の文字サイズ（例えば、「ポイント」などの既定の単位で示される文字サイズ。以下、「既定サイズ」とする）が予め分かっている際には、当該既定サイズなどをユーザが操作入力部１３２などを操作して入力する。 The character size in the above “readability determination process (1)” and “readability determination process (2)” is determined based on the number of dots in the character portion of the document image. May be. For example, when shooting a document by shooting a document printed on paper of a prescribed size (JIS standard “A4 size”, etc.), the character size of the printed character (eg, “point”, etc.) When the character size indicated in a predetermined unit (hereinafter referred to as “default size”) is known in advance, the user inputs the default size by operating the operation input unit 132 or the like.

この場合、デジタルカメラ１００の画角全体と撮影対象の用紙全体とが一致するように撮影するものとする。すなわち、デジタルカメラ１００と文書との距離を調整したり、ズーム機能などを用いることで、用紙全体が収まるように撮影する。ここで、デジタルカメラ１００の画像性能（解像度や画素数など）、用紙のサイズ（「Ａ４判」など）、および、既定サイズの単位（「ポイント」など）が規定のものであるため、それぞれの対応関係に基づいて、入力された規定サイズをラスタ画像におけるドット数に変換することができる。 In this case, it is assumed that shooting is performed so that the entire angle of view of the digital camera 100 matches the entire sheet to be shot. That is, shooting is performed so that the entire sheet can be accommodated by adjusting the distance between the digital camera 100 and the document or using a zoom function or the like. Here, the image performance (such as resolution and the number of pixels), the paper size (such as “A4 size”), and the default size unit (such as “point”) of the digital camera 100 are specified, so Based on the correspondence, the input specified size can be converted into the number of dots in the raster image.

すなわち、用紙サイズ、規定サイズ、、画像サイズ、などと、文字サイズ（ドット数）とを予め対応付けた、図１３に示すような「文字サイズ対応テーブル」を記憶部１６０に記憶させておく。ここで「文字サイズ対応テーブル」は、デジタルカメラ１００の画像性能に応じて設定されるものとする。ユーザは、操作入力部１３２などを操作し、撮影前に既定サイズと用紙サイズとを入力する。制御部１１０は、入力された既定サイズと用紙サイズとに対応するドット数を「文字サイズ対応テーブル」から取得し、当該文書を撮影して得られる文書画像における「文字サイズ」として特定する。制御部１１０は、このように特定した文字サイズと、別途取得する階調変化率とに基づいて、「可読性レベル判定テーブル」から可読性レベルを特定する。 That is, a “character size correspondence table” as shown in FIG. 13 in which the paper size, the prescribed size, the image size, and the like are associated in advance with the character size (number of dots) is stored in the storage unit 160. Here, the “character size correspondence table” is set according to the image performance of the digital camera 100. The user operates the operation input unit 132 and the like to input a default size and a paper size before shooting. The control unit 110 acquires the number of dots corresponding to the input default size and paper size from the “character size correspondence table”, and specifies it as the “character size” in the document image obtained by photographing the document. The control unit 110 specifies the readability level from the “readability level determination table” based on the character size specified in this way and the gradation change rate acquired separately.

また、上記「可読性判定処理（１）」および「可読性判定処理（２）」では階調変化率と文字サイズとの関係から可読性レベルを特定して判読可否を判定したが、文字サイズを考慮せずに判読可否を判定してもよい。この場合の処理例を「可読性判定処理（３）」として図１４に示すフローチャートを参照して説明する。 In the “readability determination process (1)” and “readability determination process (2)”, the readability level is determined by specifying the readability level from the relationship between the gradation change rate and the character size. It may be determined whether or not it is legible. A processing example in this case will be described as “readability determination processing (3)” with reference to the flowchart shown in FIG.

この「可読性判定処理（３）」では、文書画像の複数箇所で階調変化率を検出し、階調変化率の出現頻度に基づいて画像全体のボケ具合を判定して、判読可否を判定する。この「可読性判定処理（３）」でも、「可読性判定処理（２）」の場合と同様に、文書画像に複数の領域が割り当てられているものとする（図１２（ｂ）参照）。 In this “readability determination process (3)”, the gradation change rate is detected at a plurality of locations in the document image, and the degree of blurring of the entire image is determined based on the appearance frequency of the gradation change rate to determine whether or not the image can be read. . In this “readability determination process (3)”, it is assumed that a plurality of areas are assigned to the document image as in the case of “readability determination process (2)” (see FIG. 12B).

制御部１１０は、割り当てられている連続番号に基づいて、設定されている領域の１つを選択する（ステップＳ２７１）。すなわち、割り当てられている連続番号を示すカウンタを初期値の「１」に設定し、この値に対応する領域を選択する。 The control unit 110 selects one of the set areas based on the assigned serial number (step S271). That is, the counter indicating the assigned serial number is set to the initial value “1”, and an area corresponding to this value is selected.

制御部１１０は、ステップＳ２７１で選択した領域について、上述した「特徴検出処理」（ステップＳ２１０、図８）と同様の処理を実行し、「文字の切り出し」、「エッジ部の検出」、「階調変化率の検出」、をおこなう。 The control unit 110 executes the same process as the above-described “feature detection process” (step S210, FIG. 8) for the region selected in step S271, and performs “character segmentation”, “edge detection”, Detection of key change rate ".

すなわち、各領域における１次元方向のピクセル群から、階調が変化している部分のピクセルを抽出して、当該部分の階調変化率（階調変化量／ドット数）を取得する（ステップＳ２７２）。ここで、各領域において求める階調変化部分の箇所は任意であるものとする。 That is, the pixel of the portion where the gradation is changed is extracted from the pixel group in the one-dimensional direction in each region, and the gradation change rate (tone change amount / number of dots) of the portion is acquired (step S272). ). Here, it is assumed that the portion of the gradation change portion obtained in each region is arbitrary.

制御部１１０は、すべての領域についての階調変化率が取得されたか否かを判別し（ステップＳ２７３）、未判定領域がある場合（ステップＳ２７３：Ｎｏ）には、カウンタを「＋１」し（ステップＳ２７４）、当該カウンタ値に対応する領域について階調変化率を取得する（ステップＳ２７２）。 The control unit 110 determines whether or not the gradation change rate has been acquired for all the regions (step S273). If there is an undetermined region (step S273: No), the control unit 110 increments the counter by “+1” (step S273: No). In step S274, the gradation change rate is acquired for the area corresponding to the counter value (step S272).

一方、すべての領域で階調変化率が取得された場合（ステップＳ２７３：Ｙｅｓ）、制御部１１０は、求められた階調変化率の出現率を示すヒストグラムを作成する（ステップＳ２７５）。ここで作成されるヒストグラムの例を図１５に示す。図１５に示すヒストグラムは、文書画像に設定された複数領域のそれぞれから取得された階調変化率を合算することで、文書画像全体におけるおよその階調変化率の分布を示すものである。ここで、ヒストグラムの横軸は取得された階調変化率を示し、縦軸は階調変化率の出現頻度を示す。横軸の階調変化率の取りうる値は「０〜∞」とし、目安となるインデックスとして、例えば、「可読性レベル判定テーブル」（図１５）で設定した階調変化率レベルに応じた値を設定する（図１５の例では、「５」、「１０」、「２０」、「４０」）。 On the other hand, when the gradation change rate is acquired in all regions (step S273: Yes), the control unit 110 creates a histogram indicating the appearance rate of the obtained gradation change rate (step S275). An example of the histogram created here is shown in FIG. The histogram shown in FIG. 15 shows an approximate distribution of gradation change rates in the entire document image by adding the gradation change rates acquired from each of a plurality of regions set in the document image. Here, the horizontal axis of the histogram indicates the acquired gradation change rate, and the vertical axis indicates the appearance frequency of the gradation change rate. The value that the gradation change rate on the horizontal axis can take is “0 to ∞”, and a value corresponding to the gradation change rate level set in the “readability level determination table” (FIG. 15), for example, is used as a standard index. It is set (in the example of FIG. 15, “5”, “10”, “20”, “40”).

このようなヒストグラムには、画像全体のボケ具合に応じて、出現頻度のピークが現れる。ここで、階調変化率の最小値と最高値（すなわち、「０」と「∞」）付近にもピーク部が現れるが、このピークは、同じ階調値が一様に広く存在する部分、すなわち、背景部分と文字部分に対応するものであるので、「０」と「∞」付近のピークを除外したピーク部分を検出する（ステップＳ２７６）。 In such a histogram, an appearance frequency peak appears according to the degree of blur of the entire image. Here, a peak portion also appears near the minimum value and the maximum value (that is, “0” and “∞”) of the gradation change rate, and this peak is a portion where the same gradation value exists uniformly and widely, That is, since it corresponds to the background portion and the character portion, the peak portion excluding the peaks near “0” and “∞” is detected (step S276).

このようなピーク部分は、当該画像において最頻出の階調変化率を示すものであることから、このピーク部分の階調変化率の程度をみることで、当該画像全体のボケ具合を判定することができる。例えば、画像全体のボケ具合が高い場合は、図１５（ａ）に示すようなヒストグラムとなり、ボケ具合が低い場合は図１５（ｂ）に示すようなヒストグラムとなる。上述したように、階調変化率は、ボケ足の幅が広いほど（すなわち、ボケ具合が高い程）値が小さくなるため、ピーク部分に対応する階調変化率が高いほど画像のボケ具合が低く、ピーク部分に対応する階調変化率が低いほど画像のボケ具合が高くなる。したがって、ピーク部分が出現している位置が、ヒストグラムの横軸上のどこにあるかをみることで、画像のボケ具合を判定できる。この場合、ヒストグラムの横軸上に閾値を設定し、ピーク部分の出現位置が、当該閾値より右か左かを検出することで、判読可否を判定する。より詳細には、ピーク部分に対応する階調変化率が閾値以上であるか否かを判別する（ステップＳ２７７）。 Since such a peak portion indicates the most frequent gradation change rate in the image, the degree of blur of the entire image can be determined by checking the degree of the gradation change rate of the peak portion. Can do. For example, when the degree of blur of the entire image is high, the histogram is as shown in FIG. 15A, and when the degree of blur is low, the histogram is as shown in FIG. As described above, the gradation change rate becomes smaller as the width of the blur foot becomes wider (that is, the higher the degree of blur), so the higher the gradation change rate corresponding to the peak portion, the more the degree of blur of the image. The lower the gradation change rate corresponding to the peak portion, the higher the degree of image blur. Therefore, the degree of blurring of the image can be determined by looking at where the position where the peak portion appears on the horizontal axis of the histogram. In this case, a threshold is set on the horizontal axis of the histogram, and it is determined whether or not interpretation is possible by detecting whether the appearance position of the peak portion is right or left of the threshold. More specifically, it is determined whether or not the gradation change rate corresponding to the peak portion is greater than or equal to a threshold value (step S277).

ここで、本処理においては、閾値となる階調変化率を、例えば、「４０」に設定する。本処理では文字サイズを考慮しないため、上記「可読性判定処理（１）」および「可読性判定処理（２）」で設定された閾値よりも高めの数値を設定することで、文字サイズが小さい場合でも適切に判読可否を判定できるようにする。なお、閾値の設定は任意である。 Here, in this processing, the gradation change rate serving as a threshold is set to “40”, for example. Since the character size is not considered in this process, even if the character size is small by setting a numerical value higher than the threshold value set in the “readability determination process (1)” and “readability determination process (2)” above. Appropriately determine whether or not it can be read. The threshold value can be set arbitrarily.

制御部１１０は、検出したピーク部分に対応する階調変化率が閾値以上である場合（ステップＳ２７７：Ｙｅｓ）は、当該文書画像における文字について「判読可」と判定し（ステップＳ２７８）、ピーク部分に対応する階調変化率が閾値未満である場合には（ステップＳ２７７：Ｎｏ）、「判読不可」と判定して（ステップＳ２７９）、図６に示す「文書撮影処理」のフローに戻る。 When the gradation change rate corresponding to the detected peak portion is greater than or equal to the threshold (step S277: Yes), the control unit 110 determines that the character in the document image is “readable” (step S278), and the peak portion. If the gradation change rate corresponding to is less than the threshold value (step S277: No), it is determined as “unreadable” (step S279), and the flow returns to the “document photographing process” flow shown in FIG.

「文書撮影処理」においては、ステップＳ２７９またはＳ２８０の判定結果に基づいて、所定の通知情報が出力される（ステップＳ５００）。 In the “document photographing process”, predetermined notification information is output based on the determination result of step S279 or S280 (step S500).

上記のような「可読性判定処理（３）」によれば、文字サイズを考慮しなくとも、画像全体のボケ具合から画像品質を正確に判定して、判読可否を判定することができる。 According to the “readability determination process (3)” as described above, it is possible to accurately determine the image quality from the degree of blur of the entire image without considering the character size, and determine whether or not the image can be read.

制御部１１０は、上記「可読性判定処理（１）〜（３）」のいずれかを実行するように構成されてもよいが、「可読性判定処理（１）〜（３）」のいずれかを選択的に実行してもよい。この場合、例えば、ユーザからの指示によって処理を選択することができる。すなわち、「可読性判定処理（１）」はモノクロ文書における判読可否の判定に好適であるため、撮影対象としている文書がモノクロ文書であることが明らかな場合には、ユーザが操作入力部１３２などを操作することによって「可読性判定処理（１）」が実行されるよう指定する。同様に、撮影対象がカラー文書であることが明らかな場合には、「可読性判定処理（２）」が実行されるよう指定する。また、例えば、看板などに記された大きな文字を撮影した場合など、文字サイズが十分に大きいことが明らかな場合には、文字サイズを考慮しない「可読性判定処理（３）」を指定する。 The control unit 110 may be configured to execute any one of the “readability determination processes (1) to (3)”, but selects any one of the “readability determination processes (1) to (3)”. May be executed automatically. In this case, for example, the process can be selected by an instruction from the user. That is, since the “readability determination process (1)” is suitable for determining whether or not a monochrome document can be read, if it is clear that the document to be photographed is a monochrome document, the user can use the operation input unit 132 or the like. It is specified that the “readability determination process (1)” is executed by the operation. Similarly, if it is clear that the subject to be photographed is a color document, the “readability determination process (2)” is designated to be executed. For example, when it is clear that the character size is sufficiently large, such as when a large character written on a signboard is photographed, “readability determination processing (3)” that does not consider the character size is designated.

このような文書の種類や文字サイズを、制御部１１０が判定することで、「可読性判定処理（１）〜（３）」のいずれを適用するかを制御部１１０が決定してもよい。すなわち、好適な「可読性判定処理」が自動的に選択・適用される。この場合、文書画像全体の背景色と文字色とを認識することで、「モノクロ文書」であるか「カラー文書」であるかを判定し、「可読性判定処理（１）」または「可読性判定処理（２）」のいずれかを適用する。また、最初に文字の切り出しとサイズ特定をおこない、文字サイズレベルが最大レベル（図３（ａ）の「可読性レベル判定テーブル」における「レベルＥ」）である場合には、「可読性判定処理（３）」を実行するようにしてもよい。このように「可読性判定処理（１）〜（３）」が選択的に実行されてもよい他、制御部１１０が「可読性判定処理（１）〜（３）」のすべてを実行し、各処理の判定結果に基づいて、判読可否の最終判定をおこなうようにしてもよい。 The control unit 110 may determine which one of the “readability determination processes (1) to (3)” is applied by determining the type and character size of the document. That is, a suitable “readability determination process” is automatically selected and applied. In this case, by recognizing the background color and character color of the entire document image, it is determined whether the document is a “monochrome document” or “color document”, and the “readability determination process (1)” or “readability determination process” is performed. (2) "applies. In addition, when character extraction and size identification are first performed and the character size level is the maximum level (“level E” in the “readability level determination table” in FIG. 3A), “readability determination processing (3 ) "May be executed. As described above, the “readability determination processes (1) to (3)” may be selectively executed, and the control unit 110 executes all of the “readability determination processes (1) to (3)”. Based on the determination result, it may be possible to make a final determination as to whether or not readability is possible.

上記のような「可読性判定処理」によって判読可否が判定されると、「通知情報出力処理」（ステップＳ５００）によって判定結果に応じた所定の通知情報が出力される。「通知情報出力処理」の詳細を図１６に示すフローチャートを参照して説明する。 When the readability is determined by the “readability determination process” as described above, predetermined notification information corresponding to the determination result is output by the “notification information output process” (step S500). Details of the “notification information output process” will be described with reference to the flowchart shown in FIG.

制御部１１０は、上記「可読性判定処理」のいずれかによって得られた判定結果が、「判読不可」であるか「判読可」であるかを判別する（ステップＳ５０１）。 The control unit 110 determines whether the determination result obtained by any of the above “readability determination processing” is “unreadable” or “readable” (step S501).

判定結果が「判読不可」である場合（ステップＳ５０１：Ｙｅｓ）、制御部１１０は記憶部１６０にアクセスし、「通知情報テーブル」（図３（ｂ））から、判定結果が「判読不可」に対応する通知情報を取得するとともに（ステップＳ５０２）、当該通知情報の通知方式（出力方式）がいずれであるかを識別する（ステップＳ５０３）。 When the determination result is “unreadable” (step S501: Yes), the control unit 110 accesses the storage unit 160, and the determination result is changed to “unreadable” from the “notification information table” (FIG. 3B). Corresponding notification information is acquired (step S502), and the notification method (output method) of the notification information is identified (step S503).

制御部１１０は、取得した通知情報を、識別した通知方式で出力部１４０から出力する（ステップＳ５０４）。ここでは、通知方式として、例えば、「メッセージ表示」、「アイコン表示」、「インジケータ発光」、「音声通知」、「アラーム通知」などが用意されるものとする。これらの通知方式は、ユーザの所望に応じて任意に選択される。すなわち、操作入力部１３２などを操作することで、通知方式をいずれにするかが予め設定される。なお、これらの通知方式のうち、１つが選択されてもよく、あるいは、複数が組み合わされて選択されてもよい。 The control unit 110 outputs the acquired notification information from the output unit 140 using the identified notification method (step S504). Here, for example, “message display”, “icon display”, “indicator light emission”, “voice notification”, “alarm notification”, and the like are prepared as notification methods. These notification methods are arbitrarily selected according to the user's desire. In other words, the notification method is set in advance by operating the operation input unit 132 or the like. One of these notification methods may be selected, or a plurality may be selected in combination.

「メッセージ表示」が選択されている場合、例えば、現在の撮影条件で撮影しても文字が判読できない旨や、撮影条件の変更を促すメッセージが、図１７（ａ）に示すように、文字情報として表示部１４１に通知情報として表示される。すなわち、このようなメッセージを示すテキストデータが記憶部１６０の「通知情報テーブル」から取得されて、表示部１４１に表示される。 When “message display” is selected, for example, a message indicating that characters cannot be read even if shooting is performed under the current shooting conditions, or a message prompting the change of shooting conditions is displayed as text information as shown in FIG. As notification information on the display unit 141. That is, text data indicating such a message is acquired from the “notification information table” in the storage unit 160 and displayed on the display unit 141.

「アイコン表示」が選択されている場合、例えば、文字の判読ができないことを示す所定のアイコン（絵文字）が、図１７（ｂ）に示すように、表示部１４１に通知情報として表示される。 When “icon display” is selected, for example, a predetermined icon (pictogram) indicating that the character cannot be read is displayed as notification information on the display unit 141 as shown in FIG.

「インジケータ発光」が選択されている場合、例えば、文字の判読ができないことを示す所定の発光パターンで、インジケータ部１４３が点灯または点滅する。ここで、発光パターンとして、例えば、発光色や明滅パターンなどが予め規定される。例えば、インジケータ部１４３が「緑」と「赤」の２色のＬＥＤから構成されている場合、いずれの色をどのように明滅させるか（明滅速度など）が規定されており、デジタルカメラ１００における種々の検出事象（例えば、合焦、バッテリ切れ、など）とが予め対応付けられる。文字の判読ができないことを示す発光パターンとして、例えば、「赤色ＬＥＤの点滅」などとすることができる。この場合、規定されている発光パターンが通知情報として機能する。 When “indicator light emission” is selected, for example, the indicator portion 143 is lit or blinks with a predetermined light emission pattern indicating that the character cannot be read. Here, as the light emission pattern, for example, a light emission color or a blinking pattern is defined in advance. For example, in the case where the indicator unit 143 is composed of LEDs of two colors of “green” and “red”, which color is to be blinked and how (flashing speed, etc.) is defined. Various detection events (for example, focusing, battery exhaustion, etc.) are associated in advance. As a light emission pattern indicating that the character cannot be read, for example, “flashing red LED” can be used. In this case, the prescribed light emission pattern functions as notification information.

「音声通知」が選択されている場合、例えば、現在の撮影条件で撮影しても文字が判読できない旨や、撮影条件の変更を促すメッセージが報音部１４２から出力される。この場合、「メッセージ表示」の場合に取得されるテキストデータが取得され、所定の音声合成手法により、当該テキストデータで示されるメッセージが、あたかも人が発話しているように報音部１４２から出力される。この場合、出力された音声が通知情報として機能する。 When “sound notification” is selected, for example, a message that the text cannot be read even if shooting is performed under the current shooting conditions, or a message that prompts the user to change the shooting conditions is output from the sound report unit 142. In this case, the text data acquired in the case of “message display” is acquired, and the message indicated by the text data is output from the sound report unit 142 as if a person is speaking by a predetermined speech synthesis method. Is done. In this case, the output voice functions as notification information.

「アラーム通知」が選択している場合、例えば、文字の判読ができないことを示す所定の報音パターンでアラーム音が報音部１４２から出力される。この場合の報音パターンは、例えば、音色や音程、あるいは、音の長さなどから規定され、デジタルカメラ１００における種々の検出事象（例えば、合焦、バッテリ切れ、など）と予め対応付けられる。したがって、「文字の判読不可」という事象に対応付けられた報音パターンでアラーム音が報音部１４２から出力される。この場合、規定されている報音パターンが通知情報として機能する。 When “alarm notification” is selected, for example, an alarm sound is output from the sound report unit 142 with a predetermined sound pattern indicating that the character cannot be read. In this case, the report sound pattern is defined by, for example, tone color, pitch, or sound length, and is associated with various detection events (for example, in-focus, battery exhaustion, etc.) in the digital camera 100 in advance. Therefore, the alarm sound is output from the sound report unit 142 in the sound sound pattern associated with the event “character cannot be read”. In this case, a prescribed sound report pattern functions as notification information.

制御部１１０は、上記のように通知情報を出力すると、図６に示す「文書撮影処理」のフローに戻る。 When the notification unit outputs the notification information as described above, the control unit 110 returns to the “document photographing process” flow shown in FIG.

一方、判定結果が「判読可」である場合（ステップＳ５０１：Ｎｏ）、上記のような通知情報を出力せずに「文書撮影処理」（図６）のフローに戻る。 On the other hand, when the determination result is “readable” (step S501: No), the process returns to the “document photographing process” (FIG. 6) without outputting the notification information as described above.

上記のような「通知情報出力処理」によれば、文書撮影時の撮影準備動作時に、撮影後の文字が判読可能か否か、および、撮影条件変更の是非などを種々の方式によってユーザ（撮影者）に通知する。したがって、ユーザは、現在の撮影条件での撮影続行の是非をその場で判断することができる。この結果、無駄な撮影を回避することができ、メモリ領域の浪費を防止することができるとともに、ユーザの労力の軽減や撮影効率の向上を図ることができる。 According to the “notification information output process” as described above, during the shooting preparation operation at the time of document shooting, whether or not the characters after shooting can be read and whether or not the shooting conditions should be changed by various methods (shooting by the user). Person). Therefore, the user can determine on the spot whether or not to continue shooting under the current shooting conditions. As a result, useless shooting can be avoided, the waste of the memory area can be prevented, the user's labor can be reduced, and shooting efficiency can be improved.

上記「通知情報出力処理」では、判定結果が「判読可」の場合には通知情報を出力しないものとしたが、判定結果が「判読可」である場合にその旨を示す通知情報を出力するようにしてもよい。 In the above “notification information output process”, the notification information is not output when the determination result is “readable”, but the notification information indicating that is output when the determination result is “readable”. You may do it.

さらに、「可読性レベル判定テーブル」を用いて判定する場合においては、その判定の根拠となる可読性レベルに応じたアドバイス情報を通知情報として出力してもよい。例えば、可読性レベルが閾値より１段低いために「判読不可」と判定された場合には、（１）「撮影対象に近づいたり、ズーム機能を利用することで文字サイズを大きくする」、（２）「三脚等を利用して手ブレを防止する」、（３）「フラッシュなどを使用することで光量を増大させる」、などの対応策を講じれば「判読可」になり得る旨を示すアドバイス情報を出力するようにしてもよい。 Further, in the case of determining using the “readability level determination table”, advice information corresponding to the readability level that is the basis of the determination may be output as notification information. For example, if the readability level is determined to be “unreadable” because it is one step lower than the threshold value, (1) “get close to the shooting target or use the zoom function to increase the character size”, (2 ) Advice indicating that it can be “readable” by taking measures such as “Use a tripod to prevent camera shake”, (3) “Use a flash to increase the amount of light”, etc. Information may be output.

さらに、このような対応策の選択を、当該可読性レベルの根拠となった「文字サイズレベル」と「階調変化率レベル」とに基づいて制御部１１０が決定してもよい。図３（ａ）に示す「可読性レベル判定テーブル」の場合、「Ａ−Ｄ’レベル」の可読性レベルが、閾値（「５」）より１段低い「４」となっているが、この「Ａ−Ｄ’レベル」の場合、階調変化率レベルは十分であるにもかかわらず、文字サイズが小さいために可読性レベルが「４」となり、その結果「判読不可」と判定されてしまう。このような場合には、文字サイズを大きくするための対応策（上記（１）など）をアドバイス情報として出力する。一方、文字サイズが十分大きいにもかかわらず、階調変化率が低い（ボケ具合が大きい）ために「判読不可」と判定された場合には、ピンボケや手ブレを抑制するための対応策（上記（２）または（３）など）をアドバイス情報として出力する。 Further, the selection of the countermeasure may be determined by the control unit 110 based on the “character size level” and the “gradation change rate level” that are the basis of the readability level. In the case of the “readability level determination table” shown in FIG. 3A, the readability level of “AD ′ level” is “4”, which is one step lower than the threshold (“5”). In the case of “−D ′ level”, although the gradation change rate level is sufficient, the readability level is “4” because the character size is small, and as a result, it is determined as “unreadable”. In such a case, a countermeasure for increasing the character size (such as (1) above) is output as advice information. On the other hand, if the character size is sufficiently large, but the gradation change rate is low (the degree of blurring is large) and it is determined as “unreadable”, a countermeasure for suppressing blurring and camera shake ( (2) or (3)) is output as advice information.

また、判定結果が「判読可」である場合でも、例えば、可読性レベルが閾値より１段上であるために「判読可」となった場合には、撮影条件を変更することで、より確実に判読可能な画像を得ることができる旨を示すアドバイス情報を出力してもよい。 Further, even when the determination result is “readable”, for example, when the readability level is “one legible” because the readability level is one level higher than the threshold value, it is more reliable by changing the shooting condition. Advice information indicating that a legible image can be obtained may be output.

以上説明したように、本発明にかかる上記実施形態によれば、当該文書画像の画像品質を正確に判定し、文書画像を記録する前に判読可否をユーザに通知することができる。この結果、無駄な撮影動作を回避することができ、撮影効率の向上やユーザの労力軽減を図ることができる。また、無駄な記録を回避することができるので、メモリ領域などの撮影資源の浪費を防止することができる。 As described above, according to the above-described embodiment of the present invention, it is possible to accurately determine the image quality of the document image and notify the user whether or not the document image can be read before recording the document image. As a result, useless shooting operations can be avoided, and shooting efficiency can be improved and user effort can be reduced. Moreover, since useless recording can be avoided, it is possible to prevent waste of photographing resources such as a memory area.

上記実施形態で取得される文書画像は、文字認識処理にてテキストデータなどに変換することが好適であるが、このような文字認識処理の実行形態は任意である。例えば、パーソナルコンピュータなどのコンピュータ装置で文字認識処理を実行することができる。この場合、画像記録部１５０に記録された文書画像データをコンピュータ装置に転送するが、転送方法は任意であり、例えば、デジタルカメラ１００に備えられている所定のインタフェース（例えば、ＵＳＢ（Universal Serial Bus））などを介してコンピュータ装置と接続することで文書画像データを転送することができる。また、画像記録部１５０が脱着可能なメモリカードから構成されている場合には、コンピュータ装置がメモリカードから文書画像データを読み取ることで転送させることができる。 The document image acquired in the above embodiment is preferably converted into text data or the like by character recognition processing, but the execution form of such character recognition processing is arbitrary. For example, the character recognition process can be executed by a computer device such as a personal computer. In this case, the document image data recorded in the image recording unit 150 is transferred to the computer apparatus, but the transfer method is arbitrary. For example, a predetermined interface (for example, USB (Universal Serial Bus) provided in the digital camera 100 is used. The document image data can be transferred by connecting to a computer device via a) or the like. Further, when the image recording unit 150 is constituted by a removable memory card, the computer device can transfer the document image data by reading it from the memory card.

このような文字認識処理をデジタルカメラ１００で実行してもよい。この場合、デジタルカメラ１００の記憶部１６０に文字認識処理をおこなうためのプログラム（文字認識プログラム）を格納し、制御部１１０がこの文字認識プログラムを実行することで、文字認識処理をおこなう。デジタルカメラ１００で文字認識処理を実行した場合、撮影したその場でテキストデータへの変換などを行うことができる。デジタルカメラ１００で変換されたテキストデータなどは、画像記録部１５０に記録したり、コンピュータ装置などに転送することができる。 Such a character recognition process may be executed by the digital camera 100. In this case, a program (character recognition program) for performing character recognition processing is stored in the storage unit 160 of the digital camera 100, and the control unit 110 executes this character recognition program to perform character recognition processing. When character recognition processing is executed by the digital camera 100, conversion to text data can be performed on the spot where the image is taken. Text data converted by the digital camera 100 can be recorded in the image recording unit 150 or transferred to a computer device or the like.

上記実施形態では、判読可否の判定結果に応じて通知情報を出力するようにしたが、判定結果に応じてデジタルカメラ１００の撮影動作を自動的に制御して、ユーザをアシストするようにしてもよい。例えば、階調変化率は十分であるが、文字サイズが小さいために「判読不可」と判定された場合に、制御部１１０がレンズユニット１２１を駆動し、ズーム機能によって文字サイズが大きくなるよう制御したり、光量不足により手ブレが生じる場合に自動的にフラッシュが発光するように制御してもよい。 In the above embodiment, the notification information is output according to the determination result of legibility. However, the photographing operation of the digital camera 100 is automatically controlled according to the determination result to assist the user. Good. For example, if the gradation change rate is sufficient but the character size is small and it is determined that the character cannot be read, the control unit 110 drives the lens unit 121 to control the character size to be increased by the zoom function. Or when the camera shake occurs due to insufficient light quantity, the flash may be automatically emitted.

また、動作プログラムの更新により機能の拡張を図ることができるデジタルカメラに上述した各処理を実現するプログラムを適用することで、既存のデジタルカメラを上記実施形態にかかるデジタルカメラ１００として機能させることができる。この場合、上記実施形態にかかる動作プログラムを当該デジタルカメラにインストールし、当該デジタルカメラの制御部が実行する。これにより、上述の各処理が実行され、文書撮影時の画像品質を正確に判定して判読可否の判定および通知をおこなうことができる。このような動作プログラムを配布する方法は任意であり、例えば、ＣＤ−ＲＯＭやメモリカードなどの記録媒体に格納して配布可能であることはもとより、インターネットなどの通信媒体を介して配布することもできる。 In addition, by applying a program for realizing each process described above to a digital camera whose functions can be expanded by updating the operation program, the existing digital camera can function as the digital camera 100 according to the embodiment. it can. In this case, the operation program according to the above embodiment is installed in the digital camera, and is executed by the control unit of the digital camera. As a result, the above-described processes are executed, and it is possible to accurately determine the image quality at the time of document shooting and to determine and notify whether or not the document can be read. A method for distributing such an operation program is arbitrary. For example, the operation program can be distributed by being stored in a recording medium such as a CD-ROM or a memory card, and can also be distributed via a communication medium such as the Internet. it can.

以上説明したように、本発明を上記実施形態の如く適用することで、文書を撮影する場合に、画像中の階調変化率に基づいて画像品質を正確に判定して通知することができる。この結果、撮影時の省力化や撮影効率の向上を図ることができるとともに、メモリ領域の浪費などを防止することができる。 As described above, by applying the present invention as in the above embodiment, when a document is photographed, it is possible to accurately determine and notify the image quality based on the gradation change rate in the image. As a result, it is possible to save labor at the time of shooting and improve shooting efficiency, and to prevent the memory area from being wasted.

本発明の実施形態に係るデジタルカメラのシステム構成を示す図である。1 is a diagram illustrating a system configuration of a digital camera according to an embodiment of the present invention. 本発明の実施形態にかかるデジタルカメラの外観を示す図であり、（ａ）は正面図、（ｂ）は背面図である。It is a figure which shows the external appearance of the digital camera concerning embodiment of this invention, (a) is a front view, (b) is a rear view. 図１に示す記憶部に記録される情報の例を示す図であり、（ａ）は「可読性レベル判定テーブル」の例を示し、（ｂ）は「通知情報テーブル」の例を示す。FIG. 2 is a diagram illustrating an example of information recorded in a storage unit illustrated in FIG. 1, where (a) illustrates an example of a “readability level determination table” and (b) illustrates an example of a “notification information table”. 本発明の実施形態にかかるデジタルカメラで撮像される画像の画像品質を説明するための図であり、（ａ）は文書画像の例を示し、（ｂ）ピンボケや手ブレがない場合の文書画像におけるピクセルの様子を示し、（ｃ）は（ｂ）ピンボケや手ブレがある場合の文書画像におけるピクセルの様子を示す。It is a figure for demonstrating the image quality of the image imaged with the digital camera concerning embodiment of this invention, (a) shows the example of a document image, (b) Document image when there is no blur and camera shake (C) shows the state of the pixels in the document image when there is a blur or camera shake. 本発明の実施形態にかかるデジタルカメラで撮像される画像における階調変化率を説明するための図であり、（ａ）は文書画像の例を示し、（ｂ）ピンボケや手ブレがない場合の階調変化の様子を示し、（ｃ）は（ｂ）ピンボケや手ブレがある場合の階調変化の様子を示す。It is a figure for demonstrating the gradation change rate in the image imaged with the digital camera concerning embodiment of this invention, (a) shows the example of a document image, (b) When there is no blurring or camera shake (C) shows the state of gradation change when there is a blur or camera shake. 本発明の実施形態にかかる「文書撮影処理」を説明するためのフローチャートである。It is a flowchart for demonstrating the "document photographing process" concerning embodiment of this invention. 図６に示す「文書撮影処理」で実行される「可読性判定処理」の一例を説明するためのフローチャートである。FIG. 7 is a flowchart for explaining an example of a “readability determination process” executed in the “document photographing process” shown in FIG. 6. 図７に示す「可読性判定処理（１）」で実行される「特徴検出処理」を説明するためのフローチャートである。It is a flowchart for demonstrating the "feature detection process" performed by the "readability determination process (1)" shown in FIG. 図８に示す「特徴検出処理」で実行される文字領域の切り出しを説明するための図であり、（ａ）は対象となる文書画像の例を示し、（ｂ）は特定された文字領域の範囲を示し、（ｃ）は切り出された文字の例を示す。FIGS. 9A and 9B are diagrams for explaining character region segmentation executed in the “feature detection process” illustrated in FIG. 8, in which FIG. 8A illustrates an example of a target document image, and FIG. The range is shown, and (c) shows an example of the extracted character. 図７に示す「可読性判定処理（１）」で実行される「可読性レベル特定処理」を説明するためのフローチャートである。It is a flowchart for demonstrating the "readability level specific process" performed by the "readability determination process (1)" shown in FIG. 図６に示す「文書撮影処理」で実行される「可読性判定処理」の他の例を説明するためのフローチャートである。7 is a flowchart for explaining another example of “readability determination processing” executed in “document photographing processing” shown in FIG. 6. 図１１に示す「可読性判定処理（２）」の処理を説明するための図であり、（ａ）は処理対象となる文書画像の例を示し、（ｂ）は処理をおこなう領域の例を示す。FIG. 12 is a diagram for explaining the process of “readability determination process (2)” illustrated in FIG. 11, where (a) illustrates an example of a document image to be processed, and (b) illustrates an example of an area to be processed. . 「可読性判定処理（１）」または「可読性判定処理（２）」における文字サイズの特定を任意入力でおこなう場合に用いられる「文字サイズ対応テーブル」の例を示す図である。It is a figure which shows the example of the "character size corresponding | compatible table" used when specifying the character size in "readability determination processing (1)" or "readability determination processing (2)" by arbitrary input. 図６に示す「文書撮影処理」で実行される「可読性判定処理」の他の例を説明するためのフローチャートである。7 is a flowchart for explaining another example of “readability determination processing” executed in “document photographing processing” shown in FIG. 6. 図１４に示す「可読性判定処理（３）」で作成されるヒストグラムの例を示す図であり、（ａ）はボケ具合が高い画像で取得されるヒストグラムの例を示し、（ｂ）はボケ具合が低い画像で取得されるヒストグラムの例を示す。It is a figure which shows the example of the histogram created by the "readability determination process (3)" shown in FIG. 14, (a) shows the example of the histogram acquired with an image with high blur condition, (b) is the blur condition. The example of the histogram acquired with an image with low is shown. 図６に示す「文書撮影処理」で実行される「通知情報出力処理」を説明するためのフローチャートである。7 is a flowchart for explaining a “notification information output process” executed in the “document photographing process” shown in FIG. 6. 図１６に示す「通知情報出力処理」で出力される通知情報の例を示す図であり、（ａ）は通知方式が「メッセージ表示」である場合の通知情報の表示例を示し、（ｂ）は通知方式が「アイコン表示」である場合の通知情報の表示例を示す。It is a figure which shows the example of the notification information output by the "notification information output process" shown in FIG. 16, (a) shows the example of a display of notification information in case a notification system is "message display", (b) Shows a display example of notification information when the notification method is “icon display”.

Explanation of symbols

１００…デジタルカメラ、１１０…制御部、１２０…撮像部、１２１…レンズユニット、１２２…測距・測光部、１２３…撮像素子、１３０…入力部、１３１…シャッタボタン、１３２…操作入力部、１４０…出力部、１４１…表示部、１４２…報音部、１４３…インジケータ部、１５０…画像記録部、１６０…記憶部 DESCRIPTION OF SYMBOLS 100 ... Digital camera, 110 ... Control part, 120 ... Imaging part, 121 ... Lens unit, 122 ... Ranging and photometry part, 123 ... Imaging element, 130 ... Input part, 131 ... Shutter button, 132 ... Operation input part, 140 ... output unit, 141 ... display unit, 142 ... reporting unit, 143 ... indicator unit, 150 ... image recording unit, 160 ... storage unit

Claims

In a digital camera that acquires image data by converting incident light into digital data,
Feature detection means for detecting the feature of the image data;
Determination means for determining the readability level of the character indicated in the image data based on the feature detected by the feature detection means;
Execution means for executing a predetermined operation based on the readability level determined by the determination means,
The feature detection means further comprises a gradation detection means for detecting a gradation change rate in the image data,
The determination means determines the readability level of the character based on the gradation change rate detected by the gradation detection means,
The execution unit further includes an output unit that outputs predetermined notification information based on the readability level determined by the determination unit.
A digital camera characterized by that.

An edge detecting means for detecting an edge of the character indicated in the image data;
The gradation detection means detects a gradation change rate of the edge portion detected by the edge detection means;
The digital camera according to claim 1.

The feature detecting means further comprises character size specifying means for specifying the size of the character,
The determination means determines the readability level of the character based on the gradation change rate detected by the gradation detection means and the character size specified by the character size specification means.
The digital camera according to claim 1, wherein the digital camera is a digital camera.

A determination unit that determines whether or not the character can be read based on a predetermined threshold and the readability level determined by the determination unit;
The output means outputs notification information indicating that when it is determined by the determination means that it cannot be read,
The digital camera according to claim 1, wherein the digital camera is a digital camera.

The determination means determines a readability level at a plurality of locations on the image data,
The determination means determines whether or not the plurality of places can be read,
The output means outputs the notification information when it is determined that it is unreadable at any of the plurality of locations.
The digital camera according to claim 4.

A storage unit for storing determination information in which information indicating the feature detected by the feature detection unit is associated with a readability level;
The determination unit determines a readability level based on determination information stored in the storage unit.
The digital camera according to any one of claims 1 to 5, wherein:

It further comprises an input means for inputting an instruction from the user,
The storage means updates the determination information in accordance with a user instruction input to the input means.
The digital camera according to claim 6.

The character size specifying means specifies a character size in accordance with a user instruction input to the input means;
The digital camera according to claim 7.

The gradation detection means further comprises gradation distribution detection means for detecting distribution of gradation change rate in the image data,
The determination unit determines a readability level of the character based on a distribution of gradation change rates detected by the gradation distribution detection unit;
The digital camera according to claim 1, wherein the digital camera is a digital camera.

The determination means determines the readability level based on the distribution of the gradation change rate, with the gradation change rate having the highest appearance frequency as the gradation change rate in the character.
The digital camera according to claim 9.

A character area extracting means for extracting a character area shown in the image data;
The feature detection means detects a feature in the character area extracted by the character area extraction means;
The digital camera according to claim 1, wherein the digital camera is a digital camera.

To a computer that controls a digital camera that converts incident light into digital data and acquires image data,
Detecting characters shown in the acquired image data;
Detecting a gradation change rate in at least the detected character;
Identifying the size of the detected characters;
Determining a readability level of the character based on the detected gradation change rate and / or the specified character size;
Outputting predetermined notification information according to the determined readability level;
A program characterized by having executed.