JP2012073749A

JP2012073749A - Image formation device and image formation program

Info

Publication number: JP2012073749A
Application number: JP2010217169A
Authority: JP
Inventors: Kenichi Katsura; 健一桂
Original assignee: Kyocera Mita Corp
Current assignee: Kyocera Document Solutions Inc
Priority date: 2010-09-28
Filing date: 2010-09-28
Publication date: 2012-04-12

Abstract

PROBLEM TO BE SOLVED: To smoothly generate a high-quality searchable file. SOLUTION: An image formation device includes: an OCR analysis unit 153 for extracting character information from a target image of predetermined image quality; a panel 13 for displaying output images of the character information and the target image; and an image processing unit 152 for generating, on the basis of the character information and the target image, searchable PDF data of the target image in which the character information can be detected. The panel 13 displays an output image of character information extracted from the target image of changed image quality and an output image of the target image. The image processing unit 152 generates the searchable PDF on the basis of the character information extracted from the target image of changed image quality and the output image of the target image.

Description

The present invention relates to an image forming apparatus and an image forming program that generate file data such as PDF, and more particularly to an image forming apparatus and an image forming program that generate a searchable file to which invisible text data is attached.

In recent years, when a document is converted into file data such as PDF (Portable Document Format) or XPS (XML Paper Specification), character information contained in the document is increasingly added so that a desired character string can be searched for. Such files include searchable PDF and searchable XPS (hereinafter collectively referred to as searchable files).
As shown in FIG. 9, a searchable file relies on OCR (Optical Character Reader) technology, which extracts text data from raster data obtained by binarizing image data (JPEG data or the like), in order to extract character information from the document. Generating a searchable file therefore requires appropriate resolution conversion so that character objects are pattern-matched accurately.
That is, the resolution corresponds to the character recognition level, and the character recognition accuracy can be adjusted by increasing or decreasing the resolution (see, for example, Patent Document 1).

As shown in FIG. 10, a searchable file hides the text layer made up of text data extracted by OCR processing, and stacks this text layer and the image data layer in a layered structure.
For this reason, the user cannot tell at a glance whether the character strings embedded in a searchable file are correct; the accuracy cannot be determined without actually running a character string search in a viewer or the like.
To address this problem, a character recognition method has been proposed that includes a step of displaying the character string extracted by OCR processing and a step of correcting erroneously extracted characters (see, for example, Patent Document 2).

JP 2006-172055 A
JP 2004-318760 A

However, in the character recognition method described in Patent Document 2, the characters themselves are corrected manually by the user, so secondary errors may occur.
In the character recognition device described in Patent Document 1, such errors cannot occur, but various keywords and a score for each keyword must be set in advance in order to judge the OCR recognition accuracy, which is cumbersome. Furthermore, there is no means of setting the reading resolution to an arbitrary value and checking the result beforehand, so the device lacks flexibility and is inconvenient for the user.

The present invention has been made in view of the above circumstances. It provides an image forming apparatus and an image forming program with which, when generating a searchable file, the user can set the character recognition level arbitrarily and check the accuracy of character extraction, and with which a high-quality searchable file can be generated smoothly by repeating these processes.

To achieve the above object, an image forming apparatus of the present invention includes: character information extraction means for extracting character information from a target image having a predetermined image quality; display means for displaying the character information and an output image of the target image; file data generation means for generating, based on the character information and the target image, file data of the target image in which the character information can be detected; and character accuracy changing means for changing the image quality of the target image in accordance with an input operation. The display means displays the character information extracted from the target image with the changed image quality together with the output image of the target image, and the file data generation means generates the file data based on the character information extracted from the target image with the changed image quality and on the target image.

Further, an image forming program of the present invention causes a computer of an image forming apparatus to function as: character information extraction means for extracting character information from a target image having a predetermined image quality; display means for displaying the character information and an output image of the target image; file data generation means for generating, based on the character information and the target image, file data of the target image in which the character information can be detected; and character accuracy changing means for changing the image quality of the target image in accordance with an input operation. The program causes the display means to display the character information extracted from the target image with the changed image quality together with the output image of the target image, and causes the file data generation means to generate the file data based on the character information extracted from the target image with the changed image quality and on the target image.

According to the image forming apparatus and the image forming program of the present invention, a high-quality searchable file can be generated smoothly.

FIG. 1 is a block diagram showing the configuration of an MFP (image forming apparatus) according to an embodiment of the present invention.
FIG. 2 is an explanatory diagram of the image processing performed when image data is input.
FIG. 3 is an explanatory diagram of the image processing performed in the step of extracting text data and displaying a preview.
FIG. 4 is an explanatory diagram of the image processing performed when the character recognition level is reset.
FIG. 5 is a diagram showing a preview image of a document and a preview image of a character string.
FIG. 6 is a diagram showing the adjustment bar for the character recognition level.
FIG. 7 is an explanatory diagram of the image processing performed when a searchable PDF is generated.
FIG. 8 is a flowchart of the image processing procedure according to an embodiment of the present invention.
FIG. 9 is a first explanatory diagram showing the structure of a searchable file.
FIG. 10 is a second explanatory diagram showing the structure of a searchable file.

Hereinafter, an embodiment of the present invention will be described with reference to the drawings.
FIG. 1 is a block diagram showing the configuration of an MFP (image forming apparatus) according to an embodiment of the present invention.
As shown in FIG. 1, the MFP 10 of this embodiment includes a communication interface 11, a display interface 12, a panel 13, a scanner device 14, and a control unit 15.

The communication interface 11 is a network interface for exchanging necessary data and signals with a personal computer 20 (hereinafter referred to as the PC 20) via a network such as a LAN (Local Area Network). For example, it receives image data of a document (image data such as JPEG or MMR) output from the PC 20 when a utility or a Web browser is run on the PC 20.

The display interface 12 is an input/output interface that exchanges various data with the panel 13. For example, it passes a searchable PDF creation request output from the panel 13 to the control unit 15, and passes a generated preview image to the panel 13.

The panel 13 is an operation panel made up of, for example, a touch panel unit that combines a touch panel with a color LCD (Liquid Crystal Display). It displays various screens as well as operation buttons with which the user enters operation commands by touch. For example, the panel 13 displays a preview image of a document and a preview image of a character string extracted from the document, or displays operation buttons and other user interface elements for requesting creation of a searchable PDF (see FIGS. 5 and 6).

The scanner device 14 is a scanner, such as an image sensor mounted as a scanner unit, that reads a document on which images and characters are drawn and acquires image data (image data such as JPEG or MMR).
The scanner device 14 outputs the acquired document image data to the control unit 15 (specifically, to the data transmission/reception unit 151 described later).

The control unit 15 is a microcomputer made up of a CPU, a ROM that stores control programs and data, a RAM that functions as work memory, I/O ports, and the like. The control unit 15 executes the stored control programs to control each part of the MFP 10.
For example, the control unit 15 performs image processing such as converting image data received via the communication interface 11, or image data input via the scanner device 14, into raster data.
The raster data obtained by image processing is normally used as an output image for print processing. Specifically, a toner image based on the raster data is formed on a photosensitive drum and is first transferred to a transfer belt. The toner image on the transfer belt is then transferred to a sheet and fixed on the sheet by a transfer roller, which completes the print processing (the photosensitive drum, transfer belt, and transfer roller are not shown).

The control unit of the MFP according to an embodiment of the present invention will now be described in detail.
As shown in FIG. 1, the control unit 15 includes, as functional blocks, a data transmission/reception unit 151, an image processing unit 152, an OCR analysis unit 153, and a font rasterizer 154.
The data transmission/reception unit 151 passes necessary data and signals between the control unit 15 and the other components.
For example, it outputs image data input from the communication interface 11 or the scanner device 14 to the image processing unit 152, and outputs raster data input from the image processing unit 152 to the communication interface 11 or the display interface 12.

The image processing unit 152 generates an output image (raster data) by decomposing the input image data into pixels according to a predetermined output image quality (resolution) and converting them into binary data (output image generation means).
As necessary, the image processing unit 152 can also generate the output image after changing the output image quality, for example by changing the resolution or image size, or by applying rotation or color conversion.
The image processing unit 152 outputs the generated output image to the OCR analysis unit 153.
Further, the image processing unit 152 converts the output image into PDF data and combines the text data extracted by the OCR analysis unit 153 with the PDF data to generate a PDF file (searchable PDF) (file data generation means).
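The conversion of image data into binary raster data at a chosen resolution can be illustrated with a minimal Python sketch. This is not the processing described in the specification: the function names `downsample` and `binarize`, the block-averaging method, and the fixed threshold of 128 are all assumptions made for illustration.

```python
from typing import List

Gray = List[List[int]]  # rows of 8-bit grayscale pixels (0 = black, 255 = white)


def downsample(gray: Gray, factor: int) -> Gray:
    """Lower the resolution by averaging each factor x factor block (assumed method)."""
    rows = len(gray) // factor
    cols = len(gray[0]) // factor
    out: Gray = []
    for r in range(rows):
        row = []
        for c in range(cols):
            block = [
                gray[r * factor + dr][c * factor + dc]
                for dr in range(factor)
                for dc in range(factor)
            ]
            row.append(sum(block) // len(block))
        out.append(row)
    return out


def binarize(gray: Gray, threshold: int = 128) -> Gray:
    """Map each pixel to 1 (dark, treated as ink) or 0 (light, treated as paper)."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]
```

A lower resolution merges neighboring pixels before thresholding, which is one way to see why fine strokes can disappear and recognition accuracy can drop when the output image quality is reduced.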

The OCR analysis unit 153 analyzes the output image and extracts text data (character information extraction means). Specifically, the OCR analysis unit 153 picks out from the raster data the pixels that make up each character and attaches attribute information such as the corresponding character code, size, and position.
The font rasterizer 154 generates a preview image of the character string by converting the text data into raster data. The generated preview image of the character string is sent to the PC 20 or the panel 13 and displayed (see FIG. 5).
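The attribute information attached to each recognized character (character code, size, position) can be pictured as a small record type. The following Python sketch is only an illustration: the class `RecognizedChar`, its field names, and the line-grouping rule in `to_plain_text` are hypothetical and do not come from the specification.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RecognizedChar:
    code: str    # character code, held here as a one-character string
    size: float  # assumed: glyph height in pixels
    x: int       # assumed: left edge in the raster, in pixels
    y: int       # assumed: top edge in the raster, in pixels


def to_plain_text(chars: List[RecognizedChar], line_tolerance: int = 5) -> str:
    """Group characters into lines by vertical position, then read left to right."""
    lines: List[List[RecognizedChar]] = []
    current: List[RecognizedChar] = []
    for ch in sorted(chars, key=lambda c: (c.y, c.x)):
        # Start a new line when the character sits clearly below the current line.
        if current and abs(ch.y - current[0].y) > line_tolerance:
            lines.append(current)
            current = []
        current.append(ch)
    if current:
        lines.append(current)
    return "\n".join(
        "".join(c.code for c in sorted(line, key=lambda c: c.x)) for line in lines
    )
```

Keeping the position with each character is what later allows the hidden text to be placed over the matching region of the page image.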

The PC 20 is electrically connected to the MFP 10 via a network such as a LAN (Local Area Network).
A Web browser or other PC utility with a bitmap image preview function is installed on the PC 20, and this PC utility allows various kinds of information to be exchanged with the MFP 10.

Next, the image processing of an embodiment of the present invention will be described in detail. Specifically, the following are described in turn: i) image processing at the time of image data input, ii) image processing in the step of extracting text data and displaying a preview, iii) image processing at the time of resetting the character recognition level, and iv) image processing at the time of generating a searchable PDF.

i) Image processing at the time of image data input
FIG. 2 is an explanatory diagram of the image processing performed at the time of image data input in an embodiment of the present invention.
Specifically, as shown in FIG. 2, this section describes the image processing performed from the input of image data until the output image is output to the OCR analysis unit 153.

As shown in FIG. 2, image data of a document is first input via the PC 20 or the scanner device 14.
For example, the PC 20 outputs a searchable PDF creation request signal and the document image data to the communication interface 11 via a Web browser or other utility capable of exchanging information with the MFP 10. The communication interface 11 outputs the input signal and data to the data transmission/reception unit 151.
On the other hand, when a searchable PDF is generated from document image data read by the scanner device 14, the panel 13 outputs the searchable PDF creation request signal to the data transmission/reception unit 151, and the scanner device 14 outputs the document image data to the data transmission/reception unit 151.

The data transmission/reception unit 151 outputs the input searchable PDF creation request signal and document image data to the image processing unit 152.
The image processing unit 152 binarizes the input document image data according to a predetermined output image quality (resolution) and obtains an output image consisting of raster data.
The image processing unit 152 generates and holds a preview image of the document based on the obtained output image, and outputs the output image to the OCR analysis unit 153.

ii) Image processing in the step of extracting text data and displaying a preview
FIG. 3 is an explanatory diagram of the image processing performed from the extraction of text data from the output image until the preview is displayed.

First, the OCR analysis unit 153 extracts character information from the output image, converts it into text data such as plain text or rich text, and outputs the text data to the image processing unit 152.
Next, the image processing unit 152 uses the font rasterizer 154 to convert the input text data into raster data, and obtains the result as a preview image of the character string. The image processing unit 152 also holds the input text data.
The image processing unit 152 then outputs the document preview image it has been holding and the character string preview image to the data transmission/reception unit 151. At this point, the image processing unit 152 can also process the two into a single preview image in which the document preview and the character string preview are placed side by side for comparison, and output that to the data transmission/reception unit 151.
The image processing unit 152 then outputs this preview image, together with the text data input from the OCR analysis unit 153, to the data transmission/reception unit 151.

The data transmission/reception unit 151 outputs the input preview image and text data to the communication interface 11 or the display interface 12.
The communication interface 11 transmits the input preview image and text data to the PC 20, and the display interface 12 outputs the input preview image and text data to the panel 13.
The PC 20 or the panel 13 that has received the preview image displays the document preview image and the character string preview image (display means). In addition to the character string preview image, other character information such as the number of recognized characters, the language, and the processing time can also be displayed.
The user can therefore compare the two displayed preview images and easily check whether the characters extracted by the OCR analysis unit 153 are correct.

iii) Image processing at the time of resetting the character recognition level
FIG. 4 is an explanatory diagram of the image processing performed when the character recognition level is reset.
The character recognition level is reset at the user's discretion, for example when the user checks the preview image and judges that there are too many character errors to accept.
As shown in FIGS. 5 and 6, the user resets the character recognition level through a user interface such as operation buttons or an adjustment bar, while referring to the preview screen displayed on a monitor (not shown) connected to the PC 20 or in the display area of the panel 13.

Specifically, the user selects the "recognition level readjustment" button displayed in the display area shown in FIG. 5. When this button is selected, the character recognition level adjustment bar (operation unit) shown in FIG. 6 is displayed (operation unit display means). With this adjustment bar, the balance between character recognition accuracy (extraction accuracy) and performance (processing speed) can be adjusted by a button operation or a slide operation (character accuracy changing means). For example, in FIG. 6 the user can select the left side of the level selection button to set the accuracy high, in which case the performance is set low. Selecting the right side of the level selection button sets the performance high but the accuracy low.
When the character recognition level has been reset and the "OK" button is selected, the setting information for the reset character recognition level is output.

The character recognition level setting information is a parameter relating to output image quality. Specifically, in response to the resetting of the character recognition level, the image processing unit 152 changes the output image quality, such as the resolution, used when converting image data into an output image (raster data) (character accuracy changing means). In other words, when the recognition accuracy is set higher than the original recognition accuracy, a correspondingly higher-resolution output image is generated, and when it is set lower than the original recognition accuracy, a correspondingly lower-resolution output image is generated.
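The relationship between the recognition level and the output resolution can be sketched as a simple lookup. The five-step scale and the dpi values in `LEVEL_TO_DPI` below are hypothetical; the specification states only that a higher recognition accuracy corresponds to a higher resolution.

```python
# Hypothetical scale: level 1 favors speed, level 5 favors accuracy.
LEVEL_TO_DPI = {1: 150, 2: 200, 3: 300, 4: 400, 5: 600}


def resolution_for_level(level: int) -> int:
    """Return the output resolution for a recognition level, clamping out-of-range input."""
    lowest, highest = min(LEVEL_TO_DPI), max(LEVEL_TO_DPI)
    clamped = max(lowest, min(highest, level))
    return LEVEL_TO_DPI[clamped]
```

Clamping keeps a slider value outside the supported range from causing an error; raising an exception instead would be an equally reasonable design.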

The character recognition level setting information may also be determined according to the type of document. For example, a pre-scan function of the scanner device 14 or the like can be used to determine the type of document, such as a black-and-white text document, a color text document, or a document with mixed images, and the output image quality (resolution) can be set or changed according to the determined type.
Besides resolution, other image quality parameters such as image size and color conversion can also be used as character recognition level setting information.
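Choosing the output image quality from the detected document type could look like the following Python sketch. The type names, the `OutputQuality` fields, and every default value are assumptions for illustration; the specification does not give concrete settings.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OutputQuality:
    resolution_dpi: int
    color_mode: str  # assumed values: "mono" or "color"


# Hypothetical defaults per document type detected by a pre-scan.
DEFAULTS_BY_DOCUMENT_TYPE = {
    "black_text": OutputQuality(300, "mono"),
    "color_text": OutputQuality(300, "color"),
    "mixed_image": OutputQuality(400, "color"),
}


def quality_for_document(doc_type: str) -> OutputQuality:
    """Look up the starting output quality for a detected document type."""
    try:
        return DEFAULTS_BY_DOCUMENT_TYPE[doc_type]
    except KeyError:
        raise ValueError(f"unknown document type: {doc_type!r}") from None
```

Grouping resolution and color handling in one record reflects the point that parameters other than resolution can also serve as recognition level settings.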

The specific flow of image processing is as follows.
When the character recognition level is reset by operating a mouse or the like attached to the PC 20, the PC 20 transmits the setting information for the reset character recognition level to the communication interface 11, and the communication interface 11 outputs this setting information to the data transmission/reception unit 151.
On the other hand, when the character recognition level is reset by a key operation on the panel 13, the panel 13 outputs the setting information for the reset character recognition level to the display interface 12, and the display interface 12 outputs this setting information to the data transmission/reception unit 151.
The data transmission/reception unit 151 then outputs the setting information for the reset character recognition level to the image processing unit 152.

Based on the setting information for the reset character recognition level, the image processing unit 152 converts the document image data into an output image and outputs it to the OCR analysis unit 153.
For example, when a resolution higher than the original resolution is set in resetting the character recognition level, the image processing unit 152 converts the document image data into raster data at the newly set resolution and outputs it to the OCR analysis unit 153.
As in the image processing in ii) above, the OCR analysis unit 153 extracts text data from the output image, the font rasterizer 154 converts it into raster data to generate a preview image of the character string, and the image processing unit 152 outputs the preview image, together with the text data, to the PC 20 or the panel 13.
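The repeated cycle of recognizing, previewing, and readjusting can be summarized in a short Python sketch. The callables `recognize` and `user_accepts` are stand-ins for the OCR pass at a given level and for the user's judgment of the preview; both names, and the idea of trying levels from a supplied sequence, are assumptions for illustration.

```python
from typing import Callable, Iterable, Tuple


def preview_loop(
    recognize: Callable[[int], str],
    user_accepts: Callable[[str], bool],
    levels: Iterable[int],
) -> Tuple[int, str]:
    """Re-run recognition at each level until the previewed text is accepted.

    Returns the last level tried and the text extracted at that level.
    """
    last = None
    for level in levels:
        text = recognize(level)      # stand-in for rasterize + OCR at this level
        last = (level, text)
        if user_accepts(text):       # stand-in for the user checking the preview
            break
    if last is None:
        raise ValueError("no recognition levels were supplied")
    return last
```

In the apparatus the user picks each new level interactively rather than from a fixed sequence, but the control flow is the same: the text held at the moment of acceptance is the text that goes into the file.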

iv) Image processing at the time of generating a searchable PDF
FIG. 7 is an explanatory diagram of the image processing performed when a searchable PDF is generated in an embodiment of the present invention.
As shown in FIG. 7, in response to a predetermined operation, the PC 20 or the panel 13 outputs an execution signal for creating the searchable PDF file. This operation is performed by selecting the "execute" key displayed on the preview screen shown in FIG. 5.
When the file creation execution signal is output via the PC 20, the communication interface 11 outputs it to the data transmission/reception unit 151. When the file creation execution signal is output via the panel 13, the display interface 12 outputs it to the data transmission/reception unit 151.
The data transmission/reception unit 151 outputs the input file creation execution signal to the image processing unit 152.

On receiving the file creation execution signal, the image processing unit 152 converts the output image of the document that it holds at that time into PDF data, and generates a searchable PDF by combining that PDF data with the text data it holds at that time (that is, the text data extracted by the OCR analysis unit 153 at the most recently set character recognition level). When the searchable PDF is generated, the text layer formed from the text data is hidden, and this text layer and the PDF data are combined in layers.
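The specification does not say how the text layer is hidden. One common mechanism in the PDF format is text rendering mode 3 (neither fill nor stroke), set with the `Tr` operator, which leaves text searchable but not drawn. The Python sketch below builds such a content-stream fragment; the function names, the font resource name `F1`, and the coordinates are hypothetical, and a simple font of this kind covers only basic Latin text, so Japanese text would need a composite font setup that is not shown here.

```python
def escape_pdf_string(s: str) -> str:
    """Escape the characters that are special inside a PDF literal string."""
    return s.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")


def invisible_text_ops(
    text: str, x: float, y: float, font_name: str = "F1", font_size: float = 12.0
) -> str:
    """Build content-stream operators that place text at (x, y) without painting it."""
    return "\n".join(
        [
            "BT",                                  # begin text object
            f"/{font_name} {font_size:g} Tf",      # select font and size
            "3 Tr",                                # rendering mode 3: invisible
            f"{x:g} {y:g} Td",                     # move to the text position
            f"({escape_pdf_string(text)}) Tj",     # show (but do not paint) the text
            "ET",                                  # end text object
        ]
    )
```

A fragment like this would be placed in the same page content as the page image, with each string positioned over the region where the OCR step found it.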

The user can also generate a searchable PDF after correcting the text data via the PC 20 or the panel 13.
In this case, the user deletes, adds, or changes the relevant characters by a keyboard operation on the PC 20 or a key operation on the panel 13. Then, in response to a predetermined operation, the PC 20 or the panel 13 outputs the corrected text data and a file creation execution signal to the data transmission/reception unit 151.
On receiving the file creation execution signal, the data transmission/reception unit 151 extracts the corrected text data input together with the execution signal. The output image of the document held at that time is then converted into PDF data, and the extracted corrected text data is combined with that PDF data to generate a searchable PDF.

Next, an image processing method according to an embodiment of the present invention will be described with reference to the drawings.
FIG. 8 is a flowchart showing an image processing method according to an embodiment of the present invention.
As shown in FIG. 8, the MFP 10 first receives a searchable PDF file creation request and the image data of a document (S1). Specifically, in response to a predetermined user operation on the PC 20, a request signal for creating a searchable PDF file and the image data of the document are output, and the data transmission/reception unit 151 receives them via the communication interface 11. Alternatively, the data transmission/reception unit 151 receives, via the display interface 12, a request signal for creating a searchable PDF file that is output in response to a predetermined user operation on the panel 13, and also receives the image data of the document read by the scanner device 14.
The data transmission/reception unit 151 outputs the received file creation request signal and the image data of the document to the image processing unit 152.

On receiving the file creation request signal, the image processing unit 152 adjusts the image data of the document received together with the request signal to a predetermined output image quality (predetermined resolution) to create an output image (raster data). Based on this output image, it also generates a preview image of the document (S2).
The image processing unit 152 outputs the generated output image of the document to the OCR analysis unit 153.
The OCR analysis unit 153 extracts text data from the output image of the document (S3) and outputs it to the image processing unit 152.
The image processing unit 152 then converts the text data into raster data using the font rasterizer 154 and generates a preview image of the character string (S4).
Next, the image processing unit 152 sends the generated preview image of the document, the preview image of the character string, and the text data to the PC 20 or the panel 13 (S5). The image processing unit 152 also retains the text data.

Next, the PC 20 or the panel 13 displays the received preview image of the document and the preview image of the character string (S6). At this time, other character information, such as the number of recognized characters, the language, and the processing time, may be displayed together with the preview images.
The user can thus check the accuracy of character extraction by the OCR analysis unit 153 by looking at the preview images.
If the character recognition level is reset (S7: YES), the image processing unit 152 acquires the setting information for the reset character recognition level and, based on this setting information, converts the image data of the document into raster data to generate an output image (S8). The generated output image of the document is output to the OCR analysis unit 153, and the processing from step S3 onward is performed again.

On the other hand, if the character recognition level has not been reset (S7: NO), it is determined whether the text data has been corrected (S9), that is, whether the user has corrected the text data manually.
If the text data has been corrected (S9: YES), the data transmission/reception unit 151 receives the corrected text data from the PC 20 or the panel 13 and outputs it to the image processing unit 152 (S10).
The image processing unit 152 then combines the PDF data of the output image with the corrected text data to generate a searchable PDF (S11).
If the text data has not been corrected (S9: NO), the image processing unit 152 combines the PDF data of the output image with the retained text data to generate a searchable PDF (S11).
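The control flow of steps S2 to S11 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation of the embodiment: the UserAction class, the function name create_searchable_file, and the four callables (rasterize, ocr, preview, combine) are hypothetical stand-ins for the image processing unit 152, the OCR analysis unit 153, the preview on the PC 20 or the panel 13, and the PDF combining step.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class UserAction:
    """Hypothetical summary of what the user did on the preview screen (S6)."""
    new_level: Optional[int] = None       # set when the recognition level is reset (S7: YES)
    corrected_text: Optional[str] = None  # set when the text was corrected by hand (S9: YES)


def create_searchable_file(image, level, rasterize, ocr, preview, combine):
    """Run the S2-S11 loop; all four callables are supplied by the caller."""
    output = rasterize(image, level)           # S2: output image at the initial quality
    while True:
        text = ocr(output)                     # S3: extract text data
        action = preview(output, text)         # S4-S6: show previews, collect user input
        if action.new_level is not None:       # S7: YES
            level = action.new_level
            output = rasterize(image, level)   # S8: regenerate the output image
            continue                           # back to S3
        if action.corrected_text is not None:  # S9: YES
            text = action.corrected_text       # S10: use the corrected text data
        return combine(output, text)           # S11: generate the searchable file


if __name__ == "__main__":
    # Stub units: the user raises the level once, then accepts the result.
    actions = [UserAction(new_level=3), UserAction()]
    result = create_searchable_file(
        "scan", 1,
        rasterize=lambda img, lvl: (img, lvl),
        ocr=lambda out: f"text@{out[1]}",
        preview=lambda out, text: actions.pop(0),
        combine=lambda out, text: (out, text),
    )
    print(result)
```

Passing the units in as callables keeps the loop independent of any particular OCR engine or PDF writer, which matches the way the description separates the units from the flow.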

As described above, with the MFP 10 according to the embodiment of the present invention, when generating a searchable file the user can change the character recognition level (output image quality) and have OCR processing performed again, and can also check the character extraction accuracy of that OCR processing.
By enabling such recursive OCR processing, a highly accurate searchable file can be generated.

In addition, an adjustment bar is displayed for setting the ratio between the character information extraction accuracy and the character information extraction speed of the OCR analysis unit 153, and OCR processing can be performed on the output image quality changed according to the ratio determined by the setting of this adjustment bar.
A searchable PDF with a certain level of accuracy can therefore be generated while keeping a balance with speed performance.
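One way to picture the adjustment bar is as a linear mapping from the slider position to the resolution used for the output image. The Python sketch below assumes such a mapping; the function name and the 150 dpi and 600 dpi limits are illustrative assumptions, since the embodiment states only that the output image quality is changed according to the ratio that is set.

```python
def resolution_for_ratio(accuracy_ratio, min_dpi=150, max_dpi=600):
    """Map a slider position to an OCR input resolution in dpi.

    accuracy_ratio is 0.0 for "fastest" and 1.0 for "most accurate".
    The dpi limits are assumed values, not taken from the embodiment.
    """
    if not 0.0 <= accuracy_ratio <= 1.0:
        raise ValueError("accuracy_ratio must be between 0.0 and 1.0")
    # Linear interpolation between the fastest and the most accurate setting.
    return round(min_dpi + (max_dpi - min_dpi) * accuracy_ratio)


if __name__ == "__main__":
    for position in (0.0, 0.5, 1.0):
        print(position, resolution_for_ratio(position))
```

A higher resolution gives the OCR step more pixels per character at the cost of processing time, which is the trade-off the adjustment bar exposes to the user.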

Furthermore, because characters can also be corrected directly by hand, the device can be used flexibly: for example, corrections affecting the whole document can be made by resetting the character recognition level (output image quality), while the user makes small, pinpoint corrections manually.

The image forming apparatus of the present invention has been described above with reference to preferred embodiments. However, the image forming apparatus according to the present invention is not limited to the embodiments described above, and it goes without saying that various modifications can be made within the scope of the present invention.
For example, the MFP 10 according to the embodiment has been described as generating a searchable PDF, but a searchable file may be generated using other document data that is useful in combination with text data. For example, a searchable XPS consisting of a combination of XPS data and text data can be generated.
In the embodiment described above, the searchable PDF is generated by the MFP 10. However, the PC 20 may include all the necessary components so that the searchable PDF is generated by the PC 20 alone.

The present invention can be suitably used for image forming apparatuses such as copying machines, printers, and multifunction peripherals.

10 MFP (image forming apparatus)
13 Panel
14 Scanner device
15 Control unit
152 Image processing unit
153 OCR analysis unit
154 Font rasterizer
20 PC (host computer)

Claims

An image forming apparatus comprising:
character information extraction means for extracting character information from a target image of a predetermined image quality;
display means for displaying the character information and an output image of the target image;
file data generation means for generating, based on the character information and the target image, file data of the target image in which the character information can be detected; and
character accuracy changing means for changing the image quality of the target image in response to an input operation,
wherein the display means displays the character information extracted from the target image of the changed image quality and the output image of the target image, and
the file data generation means generates the file data based on the character information extracted from the target image of the changed image quality and the target image.

The image forming apparatus according to claim 1, further comprising output image generation means for binarizing the target image based on the predetermined image quality or the changed image quality,
wherein the character information extraction means extracts the character information from the binarized target image.

The image forming apparatus according to claim 1, further comprising operation unit display means for displaying an operation unit for setting a ratio between the character information extraction accuracy and the character information extraction speed of the character information extraction means,
wherein the character accuracy changing means changes the image quality of the target image based on the ratio determined by the setting of the operation unit.

The image forming apparatus according to claim 1, wherein the character accuracy changing means changes the image quality of the target image according to the type of the target image.

An image forming program for causing a computer of an image forming apparatus to function as:
character information extraction means for extracting character information from a target image of a predetermined image quality;
display means for displaying the character information and an output image of the target image;
file data generation means for generating, based on the character information and the target image, file data of the target image in which the character information can be detected; and
character accuracy changing means for changing the image quality of the target image in response to an input operation,
wherein the program causes the display means to display the character information extracted from the target image of the changed image quality and the output image of the target image, and
causes the file data generation means to generate the file data based on the character information extracted from the target image of the changed image quality and the target image.