JP2013236234A

JP2013236234A - Image processing apparatus, image processing system, and program

Info

Publication number: JP2013236234A
Application number: JP2012107117A
Authority: JP
Inventors: Hiroyuki Sakuyama; 宏幸作山
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2012-05-08
Filing date: 2012-05-08
Publication date: 2013-11-21

Abstract

PROBLEM TO BE SOLVED: To improve efficiency in creating data obtained by adding, to an image, data having undergone image processing.SOLUTION: An image processing apparatus comprises: generation means for generating a binary image from an image; compression means for compressing the binary image generated by the generation means and an image being the generation source of the binary image; transmission means which transmits the binary image compressed by the compression means to an external apparatus having image processing means to perform predetermined image processing; reception means which receives, from the external apparatus, information on the binary image obtained by performing the predetermined image processing on the binary image transmitted by the transmission means, by the image processing means; and file generation means which generates an image file based on information on the binary image having undergone the predetermined image processing and received by the reception means and the image being the generation source compressed by the compression means.

Description

本発明は、画像を処理する画像処理装置、画像処理システム及びプログラムに関する。 The present invention relates to an image processing apparatus, an image processing system, and a program for processing an image.

従来、カラー画像を読み取ってＯＣＲ（Optical Character Recognition）処理する装置において、１枚のカラー画像から白黒二値画像を生成して、白黒二値画像に対して処理を行っていた。 Conventionally, in a device that reads a color image and performs OCR (Optical Character Recognition) processing, a monochrome binary image is generated from one color image, and the monochrome binary image is processed.

例えば、原画像を二値画像とカラー画像とに分離し、画像処理（例えば圧縮）をしたカラー画像と、二値画像に対してＯＣＲ処理を施した結果とを合成する技術がある（例えば特許文献１参照）。 For example, there is a technique of separating an original image into a binary image and a color image, and combining a color image obtained by image processing (for example, compression) and a result obtained by performing OCR processing on the binary image (for example, a patent) Reference 1).

ここで、従来の画像処理装置（例えばＭＦＰ（Multifunction Peripheral））には、スキャンした原画像にＯＣＲ（文字認識）処理を施し、原画像を圧縮したデータに文字認識後の文字データのレイヤを加え、特定のファイルフォーマットでラップしてファイルとして出力する機能がある。 Here, a conventional image processing apparatus (for example, MFP (Multifunction Peripheral)) performs OCR (character recognition) processing on the scanned original image, and adds a character data layer after character recognition to the compressed data of the original image. There is a function to wrap in a specific file format and output as a file.

典型的なファイルフォーマットはＰＤＦであり，この場合のファイルはサーチャブルＰＤＦと呼ばれることが多い。以降では、簡単のためこのファイルを「ＯＣＲ＿ＰＤＦ」とも呼ぶ。 A typical file format is PDF, and the file in this case is often called a searchable PDF. Hereinafter, for simplicity, this file is also referred to as “OCR_PDF”.

ＯＣＲ＿ＰＤＦの作成及びユーザへの送信過程については、従来技術にもあるように、以下の手順が実行される。
（Ａ１）カラーの原画像から二値画像を生成
（Ａ２）カラー画像を圧縮
（Ａ３）二値画像に対してＯＣＲ処理
（Ａ４）圧縮したカラー画像と、ＯＣＲ処理結果とをファイルフォーマットでラップしてＯＣＲ＿ＰＤＦを生成
（Ａ５）ＯＣＲ＿ＰＤＦをユーザのＰＣ（Personal Computer）に転送
つまり、上記処理は、主に以下の３つの処理を含む。
・原画像の圧縮
・ＯＣＲ処理
・ファイルフォーマットのラップ処理
この３つの処理時間の比は次の通りである。
ファイルフォーマットラップ処理＜原画像の圧縮＜＜ＯＣＲ処理
ＯＣＲ処理は、一般的な画像処理と比べて処理負荷が大きいため、ＯＣＲ＿ＰＤＦを作成するのに、ＯＣＲ処理の処理時間が支配的になってしまう。 Regarding the creation of OCR_PDF and the transmission process to the user, the following procedure is executed as in the prior art.
(A1) Generate binary image from color original image (A2) Compress color image (A3) OCR process for binary image (A4) Wrapped color image and OCR process result in file format Then, OCR_PDF is generated (A5) OCR_PDF is transferred to the user's PC (Personal Computer) That is, the above process mainly includes the following three processes.
-Original image compression-OCR processing-File format wrap processing The ratio of these three processing times is as follows.
File format wrap processing << original image compression << OCR processing Since OCR processing has a larger processing load than general image processing, the processing time of OCR processing becomes dominant in creating OCR_PDF. .

そのため、ＭＦＰではなく、ネットワーク上のサーバなどを用いてＯＣＲ処理を行い、ＯＣＲ処理時間を削減することができる。また、サーバでは、ＯＣＲ処理に限らず、処理負荷が大きい画像処理を行わせてもよい。 Therefore, OCR processing can be performed using a server on the network instead of the MFP, and the OCR processing time can be reduced. Further, the server is not limited to OCR processing, and may perform image processing with a large processing load.

このとき、サーバで処理負荷が大きい画像処理を行わせる場合、サーバに余計な処理をさせない、テータ転送量を考慮する、など画像処理後のデータを画像に加えたデータ（例えばＯＣＲ＿ＰＤＦ）を効率良く作成することが要求される。 At this time, when image processing with a heavy processing load is performed on the server, data (for example, OCR_PDF) obtained by adding data after image processing to the image, such as not allowing the server to perform extra processing or considering the amount of data transfer, is efficiently used. It is required to create.

そこで、本発明は上記問題点に鑑みてなされたものであり、画像処理後のデータを画像に加えたデータを作成する際の効率性を向上させることができる画像処理装置、画像処理システム及びプログラムを提供することを目的とする。 Therefore, the present invention has been made in view of the above problems, and an image processing apparatus, an image processing system, and a program capable of improving the efficiency in creating data obtained by adding data after image processing to an image The purpose is to provide.

本発明の一態様の画像処理装置は、画像から二値画像を生成する生成手段と、前記生成手段で生成された二値画像と該二値画像の生成元の画像とを圧縮する圧縮手段と、前記圧縮手段で圧縮された二値画像を、所定の画像処理を施す画像処理手段を有する外部装置に送信する送信手段と、前記送信手段で送信した二値画像に対して前記画像処理手段により所定の画像処理が施された二値画像の情報を前記外部装置から受信する受信手段と、前記受信手段で受信した前記所定の画像処理が施された二値画像の情報と、前記圧縮手段で圧縮された生成元の画像とに基づいた画像ファイルを生成するファイル生成手段とを備える。 An image processing apparatus according to an aspect of the present invention includes a generation unit that generates a binary image from an image, a compression unit that compresses the binary image generated by the generation unit and the image from which the binary image is generated. Transmitting the binary image compressed by the compression unit to an external apparatus having an image processing unit for performing predetermined image processing; and the binary image transmitted by the transmission unit by the image processing unit. A receiving unit that receives information of a binary image that has undergone predetermined image processing from the external device; information of the binary image that has been subjected to the predetermined image processing received by the receiving unit; and the compression unit. File generating means for generating an image file based on the compressed source image.

本発明によれば、画像処理後のデータを画像に加えたデータを作成する際の効率性を向上させることができる。 ADVANTAGE OF THE INVENTION According to this invention, the efficiency at the time of creating the data which added the data after image processing to the image can be improved.

実施例１における画像処理システムの一例を示す図。1 is a diagram illustrating an example of an image processing system in Embodiment 1. FIG. 実施例１における画像処理装置のハードウェアの一例を示すブロック図。FIG. 2 is a block diagram illustrating an example of hardware of the image processing apparatus according to the first embodiment. 実施例１におけるサーバのハードウェアの一例を示すブロック図。FIG. 2 is a block diagram illustrating an example of server hardware in the first embodiment. 実施例１における画像処理装置の機能の一例を示すブロック図。FIG. 3 is a block diagram illustrating an example of functions of the image processing apparatus according to the first embodiment. 実施例１におけるサーバの機能の一例を示すブロック図。FIG. 3 is a block diagram illustrating an example of server functions according to the first embodiment. ファイルフォーマットの一例を示す図。The figure which shows an example of a file format. 段組を説明するための図。The figure for demonstrating a column. 行位置認識を説明するための図。The figure for demonstrating line position recognition. 文字位置認識を説明するための図。The figure for demonstrating character position recognition. 文字オブジェクトの要素の一例を示す図。The figure which shows an example of the element of a character object. 訳語オブジェクトの具体例を示す図。The figure which shows the specific example of a translation object. サーバに送信する二値画像の形式の一例を示す図。The figure which shows an example of the format of the binary image transmitted to a server. サーバから送信されるＯＣＲ処理後のテキストデータ形式の一例を示す図。The figure which shows an example of the text data format after the OCR process transmitted from a server. 実施例１における画像処理の一例を示すフローチャート。5 is a flowchart illustrating an example of image processing in the first embodiment. 実施例２における画像処理装置の機能の一例を示すブロック図。FIG. 9 is a block diagram illustrating an example of functions of an image processing apparatus according to a second embodiment. 実施例２におけるサーバの機能の一例を示すブロック図。FIG. 9 is a block diagram illustrating an example of server functions according to the second embodiment. 実施例２における画像処理の一例を示すフローチャート。10 is a flowchart illustrating an example of image processing according to the second embodiment.

まず、処理負荷が大きい画像処理を、ＭＦＰではなく、処理能力の高いサーバに処理させる場合の典型例について説明する。この典型例の処理手順は以下の通りである。なお、以降では、画像処理としてＯＣＲを例に説明するが、この限りではない。
（Ｂ１）ＭＦＰが、カラーの原画像をスキャンし、スキャンされたカラー画像を圧縮
（Ｂ２）ＭＦＰが、圧縮後のカラー画像をサーバに送信
（Ｂ３）サーバが、圧縮されたカラー画像を伸長し、二値画像を生成
（Ｂ４）サーバが、二値画像にＯＣＲ処理を施す
（Ｂ５）サーバが、伸長後のカラー画像を圧縮
（Ｂ６）サーバが、圧縮後のカラー画像とＯＣＲ結果とをファイルフォーマットでラップしてＯＣＲ＿ＰＤＦを生成
（Ｂ７）サーバが、ＯＣＲ＿ＰＤＦをユーザのＰＣに送信
しかし、上記の典型例には、次の４つの問題点がある。
・（Ｂ１）において、ＯＣＲ対象の二値画像を精度良く生成するために、その原画像のカラー画像をなるべく低い圧縮率で圧縮する必要がある。圧縮率が低ければ低いほど、圧縮後のデータサイズは大きくなる。
・（Ｂ２）において、（Ｂ１）の結果、サーバに転送するデータ量が多くなる。
・（Ｂ３）において、二値画像生成のために、圧縮されたカラー画像を伸長する必要がある。
・（Ｂ５）において、ＯＣＲ＿ＰＤＦのファイルサイズを実用的な小ささにするために、カラー画像の再度の圧縮が必要となる。（Ｂ２）で圧縮されたカラー画像自体は、ＯＣＲ＿ＰＤＦで使えない場合が多い。 First, a typical example of a case where image processing with a large processing load is processed by a server having a high processing capacity instead of the MFP will be described. The processing procedure of this typical example is as follows. In the following, OCR is described as an example of image processing, but this is not a limitation.
(B1) The MFP scans the original color image and compresses the scanned color image. (B2) The MFP transmits the compressed color image to the server. (B3) The server decompresses the compressed color image. The binary image is generated (B4). The server performs OCR processing on the binary image. (B5) The server compresses the decompressed color image. (B6) The server stores the compressed color image and the OCR result as a file. Wrapping in format to generate OCR_PDF (B7) Server sends OCR_PDF to user's PC However, the above typical example has the following four problems.
In (B1), in order to generate a binary image to be subjected to OCR with high accuracy, it is necessary to compress the color image of the original image with a compression rate as low as possible. The lower the compression rate, the larger the data size after compression.
In (B2), the amount of data transferred to the server increases as a result of (B1).
In (B3), it is necessary to decompress the compressed color image in order to generate a binary image.
In (B5), in order to make the file size of OCR_PDF practically small, it is necessary to compress the color image again. In many cases, the color image itself compressed in (B2) cannot be used in OCR_PDF.

そこで、上記の問題点を解消し、画像処理後のデータを画像に加えたデータ（例えばＯＣＲ＿ＰＤＦ）を生成する処理を効率良く行う各実施例について、以下、図面に基づいて説明する。 Therefore, each embodiment that solves the above-described problems and efficiently generates data (for example, OCR_PDF) in which data after image processing is added to an image will be described with reference to the drawings.

［実施例１］
＜システム＞
図１は、実施例１における画像処理システムの一例を示す図である。図１に示す画像処理システム１は、画像処理装置（例えばＭＦＰ）１０、ＰＣ２０、サーバ２０、及びサーバ４０等を有する。接続されるサーバの数は、２つに限られず、１つや３つ以上であってもよい。 [Example 1]
<System>
FIG. 1 is a diagram illustrating an example of an image processing system according to the first embodiment. An image processing system 1 illustrated in FIG. 1 includes an image processing apparatus (for example, MFP) 10, a PC 20, a server 20, a server 40, and the like. The number of servers connected is not limited to two and may be one or three or more.

画像処理装置１０は、原稿をスキャナにより読み取ってカラーの画像を生成する。画像処理装置１０は、カラー画像から白黒の二値画像を生成し、二値画像を例えばサーバ２０に送信する。このとき、二値画像が複数のページを有する場合は、ページ毎に異なるサーバに送信してもよい。 The image processing apparatus 10 reads a document with a scanner and generates a color image. The image processing apparatus 10 generates a black and white binary image from the color image, and transmits the binary image to, for example, the server 20. At this time, when the binary image has a plurality of pages, it may be transmitted to a different server for each page.

画像処理装置１０は、二値画像に対してＯＣＲ処理が実行された処理結果の情報をサーバ２０から受信する。画像処理装置１０は、受信した処理結果の情報と、圧縮したカラー画像とを合成し、画像ファイルを生成する。画像処理装置１０は、画像ファイルをユーザのＰＣ２０に送信してもよい。 The image processing apparatus 10 receives information on a processing result obtained by executing the OCR process on the binary image from the server 20. The image processing apparatus 10 combines the received processing result information and the compressed color image to generate an image file. The image processing apparatus 10 may transmit the image file to the user's PC 20.

サーバ２０、３０は、例えば処理能力の高い情報処理装置であり、処理負荷が大きい画像処理を実行することができる。ここでの画像処理は、例えばＯＣＲ処理などである。サーバ２０、３０は、画像処理装置１０から受信した二値画像を伸長し、伸長後の二値画像に対してＯＣＲ処理を施す。サーバ２０、３０は、ＯＣＲ後のデータを画像処理装置１０に送信する。次に、各装置の詳細について説明する。 The servers 20 and 30 are, for example, information processing devices with high processing capabilities, and can execute image processing with a large processing load. The image processing here is, for example, OCR processing. The servers 20 and 30 expand the binary image received from the image processing apparatus 10 and perform OCR processing on the expanded binary image. The servers 20 and 30 transmit the data after OCR to the image processing apparatus 10. Next, details of each device will be described.

＜ハードウェア＞
図２は、実施例１における画像処理装置１０のハードウェアの一例を示すブロック図である。図２に示す例では、画像処理装置１０は、コントローラ１１、スキャナ１２、プリンタ１３、モデム１４、操作パネル１５、ネットワークインタフェース１６、及びＳＤカードスロット１７等のハードウェアを有する。画像処理装置１０は、例えば複合機等である。 <Hardware>
FIG. 2 is a block diagram illustrating an example of hardware of the image processing apparatus 10 according to the first embodiment. In the example illustrated in FIG. 2, the image processing apparatus 10 includes hardware such as a controller 11, a scanner 12, a printer 13, a modem 14, an operation panel 15, a network interface 16, and an SD card slot 17. The image processing apparatus 10 is, for example, a multifunction machine.

コントローラ１１は、ＣＰＵ（Central Processing Unit）１１１、ＲＡＭ（Random Access Memory）１１２、ＲＯＭ（Read Only Memory）１１３、ＨＤＤ（Hard Disk Drive）１１４、及びＮＶＲＡＭ（Non Volatile RAM）１１５等を有する。 The controller 11 includes a CPU (Central Processing Unit) 111, a RAM (Random Access Memory) 112, a ROM (Read Only Memory) 113, an HDD (Hard Disk Drive) 114, an NVRAM (Non Volatile RAM) 115, and the like.

ＲＯＭ１１３には、各種のプログラムやプログラムによって利用されるデータ等が記憶されている。ＲＡＭ１１２は、プログラムをロードするための記憶領域や、ロードされたプログラムのワーク領域等として用いられる。また、ＲＡＭ１１２は、例えばスキャナ１２により読み取られた画像や、圧縮された画像を記憶したり、ＯＣＲ後のデータ（例えばテキストデータ）を記憶したりする。 The ROM 113 stores various programs and data used by the programs. The RAM 112 is used as a storage area for loading a program, a work area for the loaded program, and the like. The RAM 112 stores, for example, an image read by the scanner 12, a compressed image, or data after OCR (for example, text data).

ＣＰＵ１１１は、ＲＡＭ１１２にロードされたプログラムを処理することにより、上述した機能を実現する。ＨＤＤ１１４には、プログラムやプログラムが利用する各種のデータ等が記憶される。ＮＶＲＡＭ１１５には、各種の設定情報等が記憶される。 The CPU 111 realizes the above-described functions by processing a program loaded in the RAM 112. The HDD 114 stores a program and various data used by the program. The NVRAM 115 stores various setting information and the like.

スキャナ１２は、原稿から原画像を読み取るためのハードウェアである。例えば原画像は、カラーの画像データである。プリンタ１３は、印刷データを印刷用紙に印刷するためのハードウェアである。モデム１４は、電話回線に接続するためのハードウェアであり、ＦＡＸ通信による画像データの送受信を実行するために用いられる。 The scanner 12 is hardware for reading an original image from a document. For example, the original image is color image data. The printer 13 is hardware for printing print data on printing paper. The modem 14 is hardware for connecting to a telephone line, and is used to execute transmission / reception of image data by FAX communication.

操作パネル１５は、ユーザからの入力の受け付けを行うためのボタン等の入力手段や、液晶パネル等の表示手段等を備えたハードウェアである。ネットワークインタフェース１６は、ＬＡＮ等のネットワーク（有線又は無線の別は問わない）に接続するためのハードウェアである。ネットワークインタフェース１６は、データの送受信を行う各サーバやユーザのＰＣ２０のアドレスを保持する。 The operation panel 15 is hardware including input means such as buttons for accepting input from the user, display means such as a liquid crystal panel, and the like. The network interface 16 is hardware for connecting to a network such as a LAN (whether wired or wireless). The network interface 16 holds the address of each server that transmits and receives data and the PC 20 of the user.

ＳＤカードスロット１７は、ＳＤカード８０に記憶されたプログラムを読み取るために利用される。すなわち、画像処理装置１０では、ＲＯＭ１１３に記憶されたプログラムだけでなく、ＳＤカード８０に記憶されたプログラムもＲＡＭ１１２にロードされ、実行されうる。 The SD card slot 17 is used for reading a program stored in the SD card 80. That is, in the image processing apparatus 10, not only the program stored in the ROM 113 but also the program stored in the SD card 80 can be loaded into the RAM 112 and executed.

なお、他の記録媒体（例えば、ＣＤ−ＲＯＭ又はＵＳＢ（Universal Serial Bus）メモリ等）によってＳＤカード８０が代替されてもよい。すなわち、ＳＤカード８０の位置付けに相当する記録媒体の種類は、所定のものに限定されない。この場合、ＳＤカードスロット１７は、記録媒体の種類に応じたハードウェアによって代替されればよい。 The SD card 80 may be replaced by another recording medium (for example, a CD-ROM or a USB (Universal Serial Bus) memory). That is, the type of recording medium corresponding to the positioning of the SD card 80 is not limited to a predetermined one. In this case, the SD card slot 17 may be replaced by hardware corresponding to the type of recording medium.

図３は、実施例１におけるサーバ３０のハードウェアの一例を示すブロック図である。図３に示すように、サーバ３０は、ＣＰＵ３１、ＲＡＭ３２、ＲＯＭ３３、ＨＤＤ３４、ネットワークインタフェース３５、表示部３６、及び外部記録装置Ｉ／Ｆ部３７等を有する。これら各構成は、バスを介して相互にデータ送受信可能に接続されている。 FIG. 3 is a block diagram illustrating an example of hardware of the server 30 according to the first embodiment. As shown in FIG. 3, the server 30 includes a CPU 31, a RAM 32, a ROM 33, an HDD 34, a network interface 35, a display unit 36, an external recording device I / F unit 37, and the like. These components are connected to each other via a bus so as to be able to transmit and receive data.

ＲＯＭ３３には、各種のプログラム（例えばＯＣＲ処理プログラム）やプログラムによって利用されるデータ等が記憶されている。また、ＲＯＭ３３は、例えば認識辞書メモリを有し、ＯＣＲ処理で使用する認識辞書データを記憶する。 The ROM 33 stores various programs (for example, an OCR processing program), data used by the programs, and the like. The ROM 33 has, for example, a recognition dictionary memory, and stores recognition dictionary data used in OCR processing.

ＲＡＭ３２は、プログラムをロードするための記憶領域や、ロードされたプログラムのワーク領域等として用いられる。また、ＲＡＭ３２には、例えばＯＣＲ後のデータ（例えばテキストデータ）や画像処理装置１０から受信したデータを記憶したりする。 The RAM 32 is used as a storage area for loading a program, a work area for the loaded program, and the like. Further, the RAM 32 stores, for example, data after OCR (for example, text data) and data received from the image processing apparatus 10.

ＣＰＵ３１は、ＲＡＭ３２にロードされたプログラムを処理することにより、上述した機能を実現する。ＨＤＤ３４には、プログラムやプログラムが利用する各種のデータ等が記憶される。 The CPU 31 realizes the above-described functions by processing a program loaded in the RAM 32. The HDD 34 stores programs and various data used by the programs.

ネットワークインタフェース３５は、ネットワークを介して接続された通信機能を有する機器とサーバ３０とのインターフェースである。ネットワークは、有線及び／又は無線回線などのデータ伝送路により構築されたＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）などである。ネットワークインタフェース３５は、データの送受信を行う画像処理装置１０のアドレスを保持する。 The network interface 35 is an interface between the server 30 and a device having a communication function connected via a network. The network is a LAN (Local Area Network), a WAN (Wide Area Network), or the like constructed by a data transmission path such as a wired and / or wireless line. The network interface 35 holds the address of the image processing apparatus 10 that transmits and receives data.

表示部３６は、例えばＬＣＤ（Liquid Crystal Display）等により構成され、ＣＰＵ３１から入力される表示データに応じた表示が行われる。 The display unit 36 is configured by, for example, an LCD (Liquid Crystal Display) or the like, and performs display according to display data input from the CPU 31.

外部記録装置Ｉ／Ｆ部３７は、ＵＳＢ（Universal Serial Bus）などのデータ伝送路を介して接続された記録媒体３８（例えば、フラッシュメモリなど）とサーバ３０とのインターフェースである。 The external recording device I / F unit 37 is an interface between the server 30 and a recording medium 38 (for example, a flash memory) connected via a data transmission path such as a USB (Universal Serial Bus).

また、記録媒体３８に、所定のプログラムを格納し、この記録媒体３８に格納されたプログラムは外部記録装置Ｉ／Ｆ部３７を介してサーバ３０にインストールされ、インストールされた所定のプログラムはサーバ３０により実行可能となる。 A predetermined program is stored in the recording medium 38, and the program stored in the recording medium 38 is installed in the server 30 via the external recording device I / F unit 37, and the installed predetermined program is stored in the server 30. Can be executed.

また、サーバ４０やＰＣ２０は、サーバ３０と同様のハードウェアを有するため、その説明を省略する。 Further, since the server 40 and the PC 20 have the same hardware as the server 30, the description thereof is omitted.

＜機能＞
次に、画像処理装置１０の機能について説明する。図４は、実施例１における画像処理装置１０の機能の一例を示すブロック図である。画像処理措置１０は、二値画像生成手段２０１、圧縮手段２０３、指定手段２０５、送信手段２０７、受信手段２０９、及びファイル生成手段２１１を有する。なお、図示しないが、画像処理装置１０は、ＭＦＰとしての一般的な機能も有する。 <Function>
Next, functions of the image processing apparatus 10 will be described. FIG. 4 is a block diagram illustrating an example of functions of the image processing apparatus 10 according to the first embodiment. The image processing unit 10 includes a binary image generation unit 201, a compression unit 203, a designation unit 205, a transmission unit 207, a reception unit 209, and a file generation unit 211. Although not shown, the image processing apparatus 10 also has a general function as an MFP.

また、二値画像生成手段２０１、圧縮手段２０３、指定手段２０５、ファイル生成手段２１１は、例えばＣＰＵ１１１やワークメモリとしてのＲＡＭ１１２などにより実現されうる。また、送信手段２０７や受信手段２０９は、例えばネットワークインタフェース１６などにより実現されうる。 The binary image generating unit 201, the compressing unit 203, the specifying unit 205, and the file generating unit 211 can be realized by, for example, the CPU 111 or the RAM 112 as a work memory. The transmission unit 207 and the reception unit 209 can be realized by the network interface 16 or the like, for example.

二値画像生成手段２０１は、スキャナ１２により読み取られた画像から二値画像を生成する。例えば、読み取られた画像はカラー画像（以下、原画像とも呼ぶ）であり、二値画像は白黒の二値画像である。二値画像生成手段２０１は、原画像と、生成した二値画像とを圧縮手段２０３に出力する。 The binary image generation unit 201 generates a binary image from the image read by the scanner 12. For example, the read image is a color image (hereinafter also referred to as an original image), and the binary image is a monochrome binary image. The binary image generation unit 201 outputs the original image and the generated binary image to the compression unit 203.

圧縮手段２０３は、二値画像生成手段２０１で生成された二値画像と、この二値画像の生成元の画像とを圧縮する。圧縮手段２０３は、原画像に対して、例えば周知のＪＰＥＧ（Joint Photographic Exerts Group）により圧縮を行えばよい。また、圧縮手段２０３は、二値画像に対して、例えばＭＲ（Modified Read）、ＭＨ（Modified Huffman）、ＭＭＲ（Modified Modified Read）などのロスレスの圧縮を行えばよい。 The compression unit 203 compresses the binary image generated by the binary image generation unit 201 and the image from which the binary image is generated. The compression unit 203 may compress the original image using, for example, the well-known JPEG (Joint Photographic Experts Group). The compression unit 203 may perform lossless compression such as MR (Modified Read), MH (Modified Huffman), and MMR (Modified Modified Read) on the binary image.

圧縮手段２０３は、圧縮した二値画像を指定手段２０５に出力し、圧縮した原画像をファイル生成手段２１１に出力する。 The compression unit 203 outputs the compressed binary image to the designation unit 205, and outputs the compressed original image to the file generation unit 211.

指定手段２０５は、圧縮された二値画像に対して、ＯＣＲ処理の対象言語を指定する。例えば、指定手段２０５は、ＯＣＲ処理の対象言語をユーザに指定させ、指定された対象言語を圧縮された二値画像のヘッダに付与し、この二値画像を送信手段２０７に出力する。圧縮された二値画像は、圧縮後の二値画像ともいう。 The designation unit 205 designates a target language for OCR processing for the compressed binary image. For example, the designation unit 205 causes the user to designate the target language for the OCR process, assigns the designated target language to the header of the compressed binary image, and outputs the binary image to the transmission unit 207. The compressed binary image is also referred to as a binary image after compression.

なお、指定手段２０５は、必ずしも必要な機能ではない。指定手段２０５がない場合は、圧縮手段２０３は、圧縮された二値画像を送信手段２０７に出力する。 Note that the designation unit 205 is not necessarily a necessary function. When there is no designation unit 205, the compression unit 203 outputs the compressed binary image to the transmission unit 207.

送信手段２０７は、圧縮手段２０３で圧縮された二値画像を、所定の画像処理を施す画像処理手段を有する装置としてのサーバ３０に送信する。また、送信手段２０７は、圧縮された二値画像が複数のページを有する場合、各ページに対して複数のサーバから送信先を選択してもよい。送信手段２０７は、例えばページ毎に異なる送信先を選択してもよい。 The transmission unit 207 transmits the binary image compressed by the compression unit 203 to the server 30 as an apparatus having an image processing unit that performs predetermined image processing. Further, when the compressed binary image has a plurality of pages, the transmission unit 207 may select a transmission destination from a plurality of servers for each page. For example, the transmission unit 207 may select a different transmission destination for each page.

なお、送信手段２０７は、予め送信先が設定されていてもよいし、ユーザによりその都度送信先を選択させるようにしてもよい。送信先の選択手段は、送信手段２０７とは別の機能としてもよい。 Note that the transmission unit 207 may set a transmission destination in advance, or may allow the user to select a transmission destination each time. The transmission destination selection unit may have a function different from that of the transmission unit 207.

受信手段２０９は、サーバ３０又は各サーバから、送信手段２０７で送信した二値画像に対して、後述する画像処理手段により所定の画像処理が施された二値画像の情報を受信する。受信手段２０９は、受信された情報をファイル生成手段２１１に出力する。受信された二値画像の情報は、例えばＯＣＲ処理後のテキストデータである。 The receiving unit 209 receives, from the server 30 or each server, binary image information obtained by performing predetermined image processing on the binary image transmitted by the transmitting unit 207 by an image processing unit described later. The receiving unit 209 outputs the received information to the file generating unit 211. The received binary image information is, for example, text data after OCR processing.

ファイル生成手段２１１は、受信手段２０９で受信した所定の画像処理が施された二値画像の情報と、圧縮手段２０３で圧縮された生成元の画像とに基づいた画像ファイルを生成する。ファイル生成手段２１１は、受信された情報と圧縮された原画像とを１つのファイルフォーマットにする。 The file generation unit 211 generates an image file based on the binary image information subjected to the predetermined image processing received by the reception unit 209 and the generation source image compressed by the compression unit 203. The file generation unit 211 converts the received information and the compressed original image into one file format.

１つのファイルフォーマットとは、例えば、以下のファイルをいう。
・レイヤ分離された画像フォーマットに、現像の文字位置とＯＣＲテキストデータの文字位置とが同じになるように重ね合わせた画像ファイル
・ＯＣＲテキストデータを現像の次のページに添付した画像ファイル
・ＯＣＲテキストデータをコメントとして原画像に追加した画像ファイル
また、画像ファイルは、例えば上述したＯＣＲ＿ＰＤＦである。 One file format refers to the following files, for example.
-An image file in which the character position of development and the character position of OCR text data are superimposed on the image format separated into layers-Image file with OCR text data attached to the next page of development-OCR text An image file in which data is added to the original image as a comment. The image file is, for example, the above-described OCR_PDF.

画像処理装置１０は、生成した画像ファイルを、ユーザのＰＣ２０に送信したりしてもよい。 The image processing apparatus 10 may transmit the generated image file to the user's PC 20.

以上により、画像処理装置１０は、二値画像に対するＯＣＲ処理を短縮させることができる。また、圧縮された二値画像のサイズはカラー画像と比べてかなり小さいので、サーバへのデータ転送量を減らすことができる。また、ＯＣＲ処理の対象言語を指定する場合は、ＯＣＲの精度を維持することができる。 As described above, the image processing apparatus 10 can shorten the OCR processing on the binary image. Further, since the size of the compressed binary image is considerably smaller than that of the color image, the data transfer amount to the server can be reduced. In addition, when the target language for OCR processing is designated, the accuracy of OCR can be maintained.

ＯＣＲ処理では、自動で対象言語を認識することもできるため、対象言語を指定しなくてもよく、この場合は二値画像だけを送信すればよい。 Since the target language can be automatically recognized in the OCR process, it is not necessary to specify the target language. In this case, only the binary image needs to be transmitted.

次に、サーバの機能について説明する。図５は、実施例１におけるサーバ３０の機能の一例を示すブロック図である。図５に示すサーバ３０は、受信手段３０１、伸長手段３０３、画像処理手段３０５、及び送信手段３０７を有する。なお、サーバ３０は、図示しないがサーバとしての一般的な機能も有する。 Next, the function of the server will be described. FIG. 5 is a block diagram illustrating an example of functions of the server 30 according to the first embodiment. The server 30 illustrated in FIG. 5 includes a reception unit 301, an expansion unit 303, an image processing unit 305, and a transmission unit 307. The server 30 also has a general function as a server (not shown).

また、伸長手段３０３や画像処理手段３０５は、例えばＣＰＵ３１及びワークメモリとしてのＲＡＭ３２などにより実現されうる。受信手段３０１や送信手段３０７は、例えばネットワークインタフェース３５などにより実現されうる。 The decompressing unit 303 and the image processing unit 305 can be realized by the CPU 31 and the RAM 32 as a work memory, for example. The receiving unit 301 and the transmitting unit 307 can be realized by the network interface 35, for example.

受信手段３０１は、画像処理装置１０から、圧縮された二値画像を受信する。受信手段３０１は、圧縮された二値画像を伸長手段３０３に出力する。 The receiving unit 301 receives a compressed binary image from the image processing apparatus 10. The receiving unit 301 outputs the compressed binary image to the decompressing unit 303.

伸長手段３０３は、圧縮された二値画像を取得し、圧縮された二値画像を伸長する。伸長の方法は、画像処理装置１０の圧縮に対応する伸長方法である。伸長手段３０３は、伸長後の二値画像を画像処理手段３０５に出力する。 The decompression unit 303 acquires a compressed binary image and decompresses the compressed binary image. The expansion method is an expansion method corresponding to the compression of the image processing apparatus 10. The decompression unit 303 outputs the binary image after decompression to the image processing unit 305.

画像処理手段３０５は、伸長後の二値画像を取得し、伸長後の二値画像に対して画像処理を行う。この画像処理は、例えばＯＣＲ処理である。画像処理手段３０５は、画像処理した二値画像の情報を送信手段３０７に出力する。二値画像の情報とは、例えばＯＣＲ処理後のテキストデータである。 The image processing unit 305 acquires the decompressed binary image and performs image processing on the decompressed binary image. This image processing is, for example, OCR processing. The image processing unit 305 outputs information on the binary image subjected to the image processing to the transmission unit 307. The binary image information is, for example, text data after OCR processing.

送信手段３０７は、画像処理された二値画像の情報（例えばＯＣＲ処理後のテキストデータ）を画像処理装置１０に送信する。 The transmission unit 307 transmits information on the binary image subjected to image processing (for example, text data after OCR processing) to the image processing apparatus 10.

以上により、サーバ側では、カラー画像の圧縮、伸長を行う必要がないため、不必要な処理を行う必要がない。 As described above, since it is not necessary to compress and decompress a color image on the server side, it is not necessary to perform unnecessary processing.

＜ファイルフォーマット＞
次に、実施例１におけるファイルフォーマットについて説明する。図６は、ファイルフォーマットの一例を示す図である。図６に示す例は、ＯＣＲ＿ＰＤＦのファイルフォーマットの一例である。 <File format>
Next, the file format in Example 1 will be described. FIG. 6 is a diagram illustrating an example of a file format. The example shown in FIG. 6 is an example of an OCR_PDF file format.

図６に示すファイルフォーマットには、固有の値であるマジックナンバを有するヘッダに続き、画像データのレイヤ、ＯＣＲ処理後のテキストデータのレイヤが格納されている。 The file format shown in FIG. 6 stores a layer of image data and a layer of text data after OCR processing following a header having a magic number that is a unique value.

画像データのレイヤは、解像度と圧縮後の画像符号そのものを含む。実施例１では、画像圧縮方式として周知のＪＰＥＧを採用する。原稿が複数ページの場合には、画像データのレイヤ部分にページ分だけの「解像度と圧縮後の画像符号」のペアが含まれ、テキストレイヤ部分も複数ページ分のデータを含む。 The layer of image data includes the resolution and the compressed image code itself. In the first embodiment, a well-known JPEG is adopted as an image compression method. When the document has a plurality of pages, a pair of “resolution and image code after compression” for the page is included in the layer portion of the image data, and the text layer portion also includes data for a plurality of pages.

なお、後述するが、テキストレイヤには、ページごとに解像度と文字の座標（画素位置）が与えられている。そのため、適切な表示装置を用いれば、画像データ中の特定の文字と同じ位置に、テキストレイヤ中のテキストを重ねて表示することが可能になる。このように、画像データ中の文字とテキストとを同じ位置に重ねて表示する場合には、一般にはテキストの色は透明にすることが多い。 As will be described later, the resolution and character coordinates (pixel position) are given to the text layer for each page. Therefore, if an appropriate display device is used, the text in the text layer can be superimposed and displayed at the same position as the specific character in the image data. As described above, when characters and text in image data are displayed in an overlapping manner at the same position, generally the color of the text is often transparent.

＜ＯＣＲ処理＞
次に、サーバ側のＯＣＲ処理について説明する。以下に、文字認識処理の手順を説明する。
（Ｃ１）段組認識
（Ｃ２）段組位置認識
（Ｃ３）行認識
（Ｃ４）行位置認識
（Ｃ５）文字位置認識
（Ｃ６）文字認識
つまり、画像処理手段３０５は、初めに段組認識を実行する。段組認識は、主走査方向（紙面水平方向）及び副走査方向（紙面垂直方向）共に、二値画像の文字が存在しない部分（白領域）が一定間隔以上存在した場合を段の区切りとして認識する。 <OCR processing>
Next, server-side OCR processing will be described. Below, the procedure of a character recognition process is demonstrated.
(C1) Column recognition (C2) Column position recognition (C3) Line recognition (C4) Line position recognition (C5) Character position recognition (C6) Character recognition That is, the image processing means 305 first executes column recognition. To do. Column recognition is recognized as a step break when there is a portion (white area) where there are no characters in the binary image in both the main scanning direction (horizontal direction on the paper) and the sub-scanning direction (vertical direction on the paper surface). To do.

次に、画像処理手段３０５は、段組位置認識を実行する。段組位置認識は、二値画像の画像情報に、画素単位で座標を与え、認識された段組が画像中に占める座標を求める。 Next, the image processing unit 305 executes column position recognition. In column position recognition, coordinates are given in pixel units to image information of a binary image, and the coordinates occupied by the recognized column set in the image are obtained.

図７は、段組を説明するための図である。図７に示す原稿中の「1.0 Introduction」で始まる段組を例にすると、図７に示すように「1.0 Introduction」で始まる段組が完全に含まれる長方形の領域（例えば外接矩形）が求められる。 FIG. 7 is a diagram for explaining columns. Taking the column starting with “1.0 Introduction” in the document shown in FIG. 7 as an example, a rectangular area (for example, circumscribed rectangle) completely including the column starting with “1.0 Introduction” is obtained as shown in FIG. .

画像処理手段３０５は、その外接矩形の領域の左上の座標（ｂｓｘ，ｂｓｙ）と右下の座標（ｂｅｘ，ｂｅｙ）とを認識する。ここでの座標は、画素位置であり、スキャンで生成された画素数そのものを用い、スキャン時の解像度とは無関係である。 The image processing unit 305 recognizes the upper left coordinates (bsx, bsy) and the lower right coordinates (bex, bey) of the circumscribed rectangle area. The coordinate here is a pixel position, uses the number of pixels itself generated by scanning, and is independent of the resolution at the time of scanning.

次に、画像処理手段３０５は、行認識を実行する。行認識は、認識された段組ごとに行い、副走査方向に、二値画像の文字が存在しない部分（白領域）が一定間隔以上存在した場合を行の区切りとして認識する。 Next, the image processing unit 305 executes line recognition. Line recognition is performed for each recognized column, and a case where a portion (white area) where no character exists in the binary image exists in the sub-scanning direction at a predetermined interval or more is recognized as a line break.

次に、画像処理手段３０５は、行位置認識を実行する。行位置認識は、段組位置認識と同様にして、二値画像の画像情報に、画素単位で座標を与え、認識された行が画像中に占める座標を求める。 Next, the image processing unit 305 executes line position recognition. In the same way as the column position recognition, the line position recognition gives coordinates in pixel units to the image information of the binary image, and obtains the coordinates occupied by the recognized lines in the image.

例えば、図７に示した原稿中の「AAA Corporation」で始まる行を例にする。図８は、行位置認識を説明するための図である。図８に示すように「AAA Corporation」で始まる行が完全に含まれる長方形の領域（例えば外接矩形）が求められる。 For example, a line beginning with “AAA Corporation” in the document shown in FIG. 7 is taken as an example. FIG. 8 is a diagram for explaining line position recognition. As shown in FIG. 8, a rectangular area (for example, a circumscribed rectangle) that completely includes a line beginning with “AAA Corporation” is obtained.

画像処理手段３０５は、その外接矩形の領域の左上の座標（ｌｓｘ，ｌｓｙ）と右下の座標（ｌｅｘ，ｌｅｙ）を認識する。 The image processing unit 305 recognizes the upper left coordinates (lsx, lsy) and the lower right coordinates (lex, ley) of the circumscribed rectangle area.

次に、画像処理手段３０５は、文字位置認識を実行する。文字位置認識は、認識された行ごとに行い、副走査方向に、二値画像の文字が存在しない部分（白領域）が一定間隔以上存在した場合を文字の区切りとして認識する。 Next, the image processing unit 305 executes character position recognition. Character position recognition is performed for each recognized line, and a case where a portion (white region) where no character exists in the binary image exists in a sub-scanning direction is recognized as a character delimiter.

次に、画像処理手段３０５は、行位置認識と同様にして、区切られた文字が画像中に占める座標を求める。例えば、図８に示した原稿中の「AAA Corporation」で始まる行を例にする。 Next, the image processing unit 305 obtains the coordinates occupied by the divided characters in the image in the same manner as the line position recognition. For example, the line starting with “AAA Corporation” in the document shown in FIG. 8 is taken as an example.

図９は、文字位置認識を説明するための図である。図９に示すように「A」の１文字だけが完全に含まれる長方形の領域（例えば外接矩形）が求められる。 FIG. 9 is a diagram for explaining character position recognition. As shown in FIG. 9, a rectangular area (for example, a circumscribed rectangle) completely including only one character “A” is obtained.

画像処理手段３０５は、その外接矩形の領域の左上の座標（ｃｓｘ，ｃｓｙ）と右下の座標（ｃｅｘ，ｃｅｙ）を認識する。 The image processing unit 305 recognizes the upper left coordinates (csx, csy) and the lower right coordinates (cex, cey) of the circumscribed rectangle area.

最後に、画像処理手段３０５は、文字認識を実行する。文字認識アルゴリズムは、公知のテンプレートマッチングを利用する。例えば、認識辞書メモリには、言語種ごとに、文字認識に使用するテンプレートデータ（認識辞書データとも呼ぶ）が格納されている。画像処理手段３０５は、後述するターゲット言語と一致する認識辞書データを用いる。 Finally, the image processing unit 305 executes character recognition. The character recognition algorithm uses known template matching. For example, the recognition dictionary memory stores template data (also referred to as recognition dictionary data) used for character recognition for each language type. The image processing unit 305 uses recognition dictionary data that matches a target language described later.

＜テキストレイヤの生成＞
文字認識処理の後には、テキストレイヤ生成処理が行われる。テキストレイヤは、複数の文字オブジェクトの集合を有し、１つの文字オブジェクトは図１０に示す要素を含む。 <Text layer generation>
After the character recognition process, a text layer generation process is performed. The text layer has a set of a plurality of character objects, and one character object includes the elements shown in FIG.

図１０は、文字オブジェクトの要素の一例を示す図である。図１０に示す例では、文字オブジェクトは、文字開始識別子と文字終了識別子との間に挟まれた、オペランドとオペコードからで構成される。 FIG. 10 is a diagram illustrating an example of elements of a character object. In the example shown in FIG. 10, the character object is composed of an operand and an opcode sandwiched between a character start identifier and a character end identifier.

図１１は、訳語オブジェクトの具体例を示す図である。図１１に示す例では、
Ｔｆ：フォントオペレータ
ｒｇ：色オペレータ
Ｔｄ：配置オペレータ
Ｔｊ：テキストオペレータ
である。 FIG. 11 is a diagram illustrating a specific example of the translated word object. In the example shown in FIG.
Tf: Font operator rg: Color operator Td: Placement operator Tj: Text operator

図１１に示すＴｆに対しては、フォント番号とフォントサイズが指定される。例えば、図１１に示す「／Ｆ１」（フォント番号１番）は明朝体を指し、「／Ｆ２」（フォント番号２番）はゴシック体を示すとする。 A font number and a font size are designated for Tf shown in FIG. For example, “/ F1” (font number 1) shown in FIG. 11 indicates the Mincho style, and “/ F2” (font number 2) indicates the Gothic style.

図１１に示す「８」は、フォントサイズである。フォントサイズの算出には、文字位置の座標に加え、スキャン時の解像度が用いられる。 “8” shown in FIG. 11 is a font size. In calculating the font size, the resolution at the time of scanning is used in addition to the coordinates of the character position.

図１１に示す「ｒｇ」に対しては、文字語の色としてＲＧＢ各々の画素値が順に指定可能である。ただし、実施例においては、文字色を透明にしたいため、その場合は「−１ −１ −１」を指定する規約となっている。 For “rg” shown in FIG. 11, RGB pixel values can be specified in order as the character word color. However, in the embodiment, since it is desired to make the character color transparent, in this case, the convention is to designate “−1 −1 −1”.

図１１に示す「Ｔｄ」に対しては、文字を配置する外接矩形の左上の画素の座標（図９で言えば（ｃｓｘ，ｃｓｙ））が指定される。座標の原点は、例えば画像の左上とする。 For "Td" shown in FIG. 11, the coordinates of the upper left pixel of the circumscribed rectangle in which the character is placed ((csx, csy) in FIG. 9) are designated. The origin of the coordinates is, for example, the upper left of the image.

図１１に示す「Ｔｊ」に対しては、（）内に、文字として配置するテキストが指定可能である。図１１に示す例では、「Ｒ」という文字が指定されている。 For “Tj” shown in FIG. 11, text to be arranged as characters can be specified in (). In the example shown in FIG. 11, the letter “R” is designated.

なお、文字オブジェクトが複数ある場合には、続く文字オブジェクト内でフォント又は色についてのオペランド及びオペコードを省略することが可能である。省略された場合は、直前の文字オブジェクトの値が適用される。 When there are a plurality of character objects, it is possible to omit operands and opcodes for fonts or colors in the following character objects. If omitted, the value of the previous character object is applied.

＜二値画像のデータ形式＞
図１２は、サーバに送信する二値画像の形式の一例を示す図である。図１２に示す二値画像のヘッダは、原画像の横画素数、縦画素数、解像度、ターゲット言語（対象言語）、頁番号を含む。また、図１２に示す二値画像の圧縮方式は、例えばＭＭＲである。 <Data format of binary image>
FIG. 12 is a diagram illustrating an example of a format of a binary image transmitted to the server. The binary image header shown in FIG. 12 includes the number of horizontal pixels, the number of vertical pixels, the resolution, the target language (target language), and the page number of the original image. The binary image compression method shown in FIG. 12 is, for example, MMR.

このようなデータの形式の二値画像が、画像処理装置１０からサーバ３０に送信される。また、ターゲット言語については必ずしも指定する必要はない。 A binary image in such a data format is transmitted from the image processing apparatus 10 to the server 30. Moreover, it is not always necessary to specify the target language.

＜ＯＣＲ処理後のテキストデータ形式＞
図１３は、サーバから送信されるＯＣＲ処理後のテキストデータ形式の一例を示す図である。図１３に示すヘッダは、原画像の横画素数、縦画素数、解像度、ターゲット言語、頁番号を含む。また、ヘッダに続く部分の形式は、図１０や図１１で説明したデータ形式である。 <Text data format after OCR processing>
FIG. 13 is a diagram illustrating an example of a text data format after OCR processing transmitted from the server. The header shown in FIG. 13 includes the number of horizontal pixels, the number of vertical pixels, the resolution, the target language, and the page number of the original image. Further, the format of the part following the header is the data format described with reference to FIGS.

よって、図１３に示したテキストデータは、図６に示すテキストレイヤそのものとして用いることができる。また、原画像が複数のページを有する場合には、テキストレイヤ中に、ページ分だけのテキストデータが含まれる。 Therefore, the text data shown in FIG. 13 can be used as the text layer shown in FIG. Further, when the original image has a plurality of pages, text data for only the pages is included in the text layer.

＜動作＞
次に、実施例１における画像処理システム１の動作について説明する。図１４は、実施例１における画像処理の一例を示すフローチャートである。 <Operation>
Next, the operation of the image processing system 1 in the first embodiment will be described. FIG. 14 is a flowchart illustrating an example of image processing according to the first embodiment.

図１４に示すステップＳ１０１で、二値画像生成手段２０１は、スキャナ１２により読み取られたカラーの画像から二値画像を生成する。 In step S 101 illustrated in FIG. 14, the binary image generation unit 201 generates a binary image from the color image read by the scanner 12.

ステップＳ１０２で、圧縮手段２０３は、カラー画像及び二値画像に対して圧縮処理を行う。 In step S102, the compression unit 203 performs compression processing on the color image and the binary image.

ステップＳ１０３で、送信手段２０７は、圧縮後の二値画像を、画像処理手段を有する装置としてのサーバ２０に送信する。また、送信手段２０７は、圧縮後の二値画像が複数のページを有する場合、各ページに対して複数のサーバから送信先を選択してもよい。 In step S103, the transmission unit 207 transmits the binary image after compression to the server 20 as an apparatus having an image processing unit. Further, when the binary image after compression has a plurality of pages, the transmission unit 207 may select a transmission destination from a plurality of servers for each page.

ステップＳ１０４で、受信手段３０１は、画像処理装置１０から、圧縮後の二値画像を受信する。伸長手段３０３は、圧縮後の二値画像を受信手段３０１から取得し、圧縮後の二値画像を伸長する。画像処理手段３０５は、伸長後の二値画像を取得し、伸長後の二値画像に対して画像処理を実行する。この画像処理は、例えばＯＣＲ処理である。 In step S 104, the receiving unit 301 receives a binary image after compression from the image processing apparatus 10. The decompressing unit 303 obtains the compressed binary image from the receiving unit 301 and decompresses the compressed binary image. The image processing unit 305 acquires the decompressed binary image and executes image processing on the decompressed binary image. This image processing is, for example, OCR processing.

ステップＳ１０５で、送信手段３０７は、画像処理後の二値画像の情報（例えばＯＣＲ処理後のテキストデータ）を画像処理装置１０に送信する。 In step S 105, the transmission unit 307 transmits binary image information after image processing (for example, text data after OCR processing) to the image processing apparatus 10.

ステップＳ１０６で、受信手段２０９は、サーバ３０から、画像処理後の二値画像の情報を受信する。ファイル生成手段２１１は、受信手段２０９により受信された情報と圧縮後のカラー画像とに基づき画像ファイルを生成する。 In step S 106, the receiving unit 209 receives information about the binary image after image processing from the server 30. The file generation unit 211 generates an image file based on the information received by the reception unit 209 and the compressed color image.

ステップＳ１０７で、画像処理装置１０は、生成した画像ファイルを、ユーザのＰＣ２０に送信する。 In step S107, the image processing apparatus 10 transmits the generated image file to the user's PC 20.

なお、ステップＳ１０３で、二値画像を送信する際に、ユーザにより指定された対象言語をヘッダに含めてもよい。 In step S103, the target language specified by the user may be included in the header when the binary image is transmitted.

以上、実施例１によれば、画像処理後のデータを画像に加えたデータを作成する際の効率性を向上させることができる。例えば、画像処理装置１０では、ボトルネックとなるＯＣＲ処理をサーバに行わせるためＯＣＲ処理時間を軽減することができる。 As described above, according to the first embodiment, it is possible to improve the efficiency when creating data obtained by adding data after image processing to an image. For example, the image processing apparatus 10 can reduce the OCR processing time because the server performs the OCR processing that becomes a bottleneck.

また、二値画像の圧縮後のサイズは、カラー画像に比べて小さいのでトラフィックを圧迫しなくすむ。また、二値画像の圧縮はロスレスなので、伸長後の二値画像は劣化が生じない。また、圧縮された二値画像の送信先を、例えばページ毎に変えることで、ＯＣＲ処理時間をさらに短縮することができる。また、対象言語を指定することで、ＯＣＲ処理の精度を落とさなくてすむ。 Further, the compressed size of the binary image is smaller than that of the color image, so that traffic is not required to be compressed. In addition, since binary image compression is lossless, the decompressed binary image does not deteriorate. Further, by changing the transmission destination of the compressed binary image for each page, for example, the OCR processing time can be further shortened. Also, by specifying the target language, it is not necessary to reduce the accuracy of the OCR process.

［実施例２］
次に、実施例２における画像処理システムについて説明する。実施例２では、画像処理装置側の処理量を少なくするため、ラップ処理や画像ファイルの送信処理もサーバ側に行わせる。 [Example 2]
Next, an image processing system according to the second embodiment will be described. In the second embodiment, in order to reduce the processing amount on the image processing apparatus side, the server side also performs wrap processing and image file transmission processing.

＜システム＞
実施例２における画像処理システムの構成は、図１に示す画像処理システム１と同様であるため、同じ符号を用いる。実施例２では、画像処理装置１０は、読み取った原画像から二値画像を生成し、原画像と二値画像とを圧縮する。画像処理装置１０は、圧縮後の原画像と圧縮後の二値画像とをサーバ３０に送信する。 <System>
The configuration of the image processing system in the second embodiment is the same as that of the image processing system 1 shown in FIG. In the second embodiment, the image processing apparatus 10 generates a binary image from the read original image, and compresses the original image and the binary image. The image processing apparatus 10 transmits the original image after compression and the binary image after compression to the server 30.

サーバ３０は、受信した圧縮後の二値画像に対して伸長処理を行い、伸長後の二値画像に対して画像処理を実行する。サーバ３０は、画像処理された二値画像の情報と、受信した圧縮後の原画像とをラップして画像ファイルを生成する。サーバ３０は、生成された画像ファイルを、ユーザのＰＣ２０に送信する。 The server 30 performs decompression processing on the received binary image after compression, and executes image processing on the binary image after decompression. The server 30 wraps the image-processed binary image information and the received compressed original image to generate an image file. The server 30 transmits the generated image file to the user's PC 20.

＜ハードウェア＞
実施例２における画像処理装置１０、各サーバのハードウェアは、それぞれ図２、３に示すハードウェアと同様であるため、同じ符号を用いる。 <Hardware>
The hardware of the image processing apparatus 10 and each server in the second embodiment is the same as that shown in FIGS.

＜機能＞
図１５は、実施例２における画像処理装置１０の機能の一例を示すブロック図である。図１５に示す機能のうち、図４に示す機能と同様のものは同じ符号を付す。以降では、実施例１と異なる機能を主に説明する。 <Function>
FIG. 15 is a block diagram illustrating an example of functions of the image processing apparatus 10 according to the second embodiment. Among the functions shown in FIG. 15, the same functions as those shown in FIG. Hereinafter, functions different from those of the first embodiment will be mainly described.

なお、二値画像生成手段２０１や圧縮手段４０１は、例えばＣＰＵ１１１やワークメモリとしてのＲＡＭ１１２などにより実現されうる。送信手段４０３は、例えばネットワークインタフェース１６などにより実現されうる。 The binary image generation unit 201 and the compression unit 401 can be realized by the CPU 111 or the RAM 112 as a work memory, for example. The transmission unit 403 can be realized by the network interface 16 or the like, for example.

圧縮手段４０１は、圧縮後のカラー画像についても、圧縮後の二値画像同様、サーバに送信するため、送信手段４０３に出力する。 The compression unit 401 outputs the compressed color image to the transmission unit 403 in order to transmit it to the server in the same manner as the binary image after compression.

送信手段４０３は、圧縮後のカラー画像、圧縮後の二値画像ともに、予め設定されているサーバ３０に送信する。これにより、画像処理装置１０側では、処理負荷を低減することができる。 The transmission unit 403 transmits both the compressed color image and the compressed binary image to the server 30 set in advance. Thereby, the processing load can be reduced on the image processing apparatus 10 side.

図１６は、実施例２におけるサーバ３０の機能の一例を示すブロック図である。図１６に示す機能のうち、図５に示す機能と同様のものは同じ符号を付す。以下では、実施例１と異なる機能を主に説明する。実施例２におけるサーバ４０もサーバ３０と同様の機能を有する。 FIG. 16 is a block diagram illustrating an example of functions of the server 30 according to the second embodiment. Among the functions shown in FIG. 16, the same functions as those shown in FIG. Hereinafter, functions different from those of the first embodiment will be mainly described. The server 40 in the second embodiment also has the same function as the server 30.

なお、伸長手段３０３、画像処理手段３０５、及びファイル生成手段５０３は、例えばＣＰＵ３１及びワークメモリとしてのＲＡＭ３２などにより実現されうる。また、受信手段５０１や送信手段５０５は、例えばネットワークインタフェース３５などにより実現されうる。 The decompression unit 303, the image processing unit 305, and the file generation unit 503 can be realized by, for example, the CPU 31 and the RAM 32 as a work memory. The receiving unit 501 and the transmitting unit 505 can be realized by the network interface 35, for example.

受信手段５０１は、画像処理装置１０から圧縮後のカラー画像と、圧縮後の二値画像とを受信する。受信手段５０１は、圧縮後のカラー画像をファイル生成手段５０３に出力し、圧縮後の二値画像を伸長手段３０３に出力する。 The receiving unit 501 receives a compressed color image and a compressed binary image from the image processing apparatus 10. The receiving unit 501 outputs the compressed color image to the file generating unit 503, and outputs the compressed binary image to the decompressing unit 303.

ファイル生成手段５０３は、基本的には実施例１のファイル生成手段３１１と同様の処理を行う。ファイル生成手段５０３は、画像処理後の二値画像の情報（例えばＯＣＲ処理後のテキストデータ）と、圧縮後のカラー画像とをファイルフォーマットでラップし、１つの画像ファイルを生成する。ファイル生成手段５０３は、生成した画像ファイルを送信手段５０５に出力する。 The file generation unit 503 basically performs the same processing as the file generation unit 311 of the first embodiment. The file generation unit 503 generates one image file by wrapping binary image information after image processing (for example, text data after OCR processing) and a color image after compression in a file format. The file generation unit 503 outputs the generated image file to the transmission unit 505.

送信手段５０５は、ファイル生成手段５０３から取得した画像ファイルを、ユーザのＰＣ２０に送信する。これにより、カラー画像を伸長する必要がなく、また、カラー画像の再圧縮も必要ない。また、サーバに送信するカラー画像は、圧縮率を上述した典型例よりも高めに設定することができる。 The transmission unit 505 transmits the image file acquired from the file generation unit 503 to the user's PC 20. This eliminates the need for decompressing the color image and does not require recompression of the color image. Further, the color image transmitted to the server can be set to a higher compression rate than the above-described typical example.

＜動作＞
次に、実施例２における画像処理システムの動作について説明する。図１７は、実施例２における画像処理の一例を示すフローチャートである。 <Operation>
Next, the operation of the image processing system in the second embodiment will be described. FIG. 17 is a flowchart illustrating an example of image processing according to the second embodiment.

図１７に示すステップＳ２０１で、二値画像生成手段２０１は、スキャナ１２により読み取られたカラーの画像から二値画像を生成する。 In step S 201 shown in FIG. 17, the binary image generation unit 201 generates a binary image from the color image read by the scanner 12.

ステップＳ２０２で、圧縮手段４０１は、カラー画像及び二値画像に対して圧縮処理を行う。 In step S202, the compression unit 401 performs compression processing on the color image and the binary image.

ステップＳ２０３で、送信手段４０３は、圧縮後のカラー画像と圧縮後の二値画像とを、画像処理手段を有する装置としてのサーバ３０に送信する。 In step S203, the transmission unit 403 transmits the compressed color image and the compressed binary image to the server 30 as an apparatus having an image processing unit.

ステップＳ２０４で、受信手段５０１は、画像処理装置１０から、圧縮後のカラー画像及び圧縮後の二値画像を受信する。伸長手段３０３は、圧縮後の二値画像を受信手段５０１から取得し、圧縮された二値画像を伸長する。画像処理手段３０５は、伸長後の二値画像を取得し、伸長後の二値画像に対して画像処理を実行する。この画像処理は、例えばＯＣＲ処理である。 In step S 204, the receiving unit 501 receives a compressed color image and a compressed binary image from the image processing apparatus 10. The decompression unit 303 acquires the compressed binary image from the reception unit 501 and decompresses the compressed binary image. The image processing unit 305 acquires the decompressed binary image and executes image processing on the decompressed binary image. This image processing is, for example, OCR processing.

ステップＳ２０５で、ファイル生成手段５０３は、画像処理後の二値画像の情報と圧縮後のカラー画像とに基づき画像ファイルを生成する。 In step S205, the file generation unit 503 generates an image file based on the binary image information after image processing and the compressed color image.

ステップＳ２０６で、送信手段５０５は、生成した画像ファイルを、ユーザのＰＣ２０に送信する。 In step S206, the transmission unit 505 transmits the generated image file to the user's PC 20.

なお、ステップＳ２０３で、二値画像を送信する際に、ユーザにより指定された対象言語をヘッダに含めてもよい。 In step S203, the target language specified by the user may be included in the header when the binary image is transmitted.

以上、実施例２によれば、画像処理後のデータを画像に加えたデータを作成する際の効率性を向上させることができる。例えば、画像処理装置１０では、ボトルネックとなるＯＣＲ処理や画像ファイル生成処理などをサーバに行わせるため、画像処理装置１０の処理を削減することができる。実施例２は、画像処理装置１０が廉価版のＭＦＰなどの場合に好適である。 As described above, according to the second embodiment, it is possible to improve efficiency when creating data obtained by adding data after image processing to an image. For example, the image processing apparatus 10 can reduce the processing of the image processing apparatus 10 by causing the server to perform OCR processing and image file generation processing that are bottlenecks. The second embodiment is suitable when the image processing apparatus 10 is an inexpensive MFP or the like.

［変形例］
各実施例に記載の画像処理装置やサーバで実行されるプログラムは、インストール可能な形式又は実行可能な形式のファイルでＣＤ−ＲＯＭ、フレキシブルディスク（ＦＤ）、ＣＤ−Ｒ、ＤＶＤ（Digital Versatile Disk）等のコンピュータで読み取り可能な記録媒体に記録されて提供される。 [Modification]
The program executed by the image processing apparatus and server described in each embodiment is a CD-ROM, flexible disk (FD), CD-R, DVD (Digital Versatile Disk) in an installable or executable file. Or the like recorded on a computer-readable recording medium.

また、各実施例の画像処理装置やサーバで実行されるプログラムを、インターネット等のネットワークに接続されたコンピュータ上に格納し、ネットワーク経由でダウンロードさせることにより提供するように構成してもよい。また、各実施例の画像処理装置やサーバで実行されるプログラムをインターネット等のネットワーク経由で提供または配布するように構成してもよい。 Further, the program executed by the image processing apparatus and server of each embodiment may be stored on a computer connected to a network such as the Internet and provided by being downloaded via the network. Further, the program executed by the image processing apparatus or server of each embodiment may be provided or distributed via a network such as the Internet.

また、各実施例の画像処理装置やサーバで実行されるプログラムを、ＲＯＭ等に予め組み込んで提供するように構成してもよい。 In addition, the program executed by the image processing apparatus or server of each embodiment may be provided by being incorporated in advance in a ROM or the like.

各実施例の画像処理装置やサーバで実行されるプログラムは、前述した各手段を含むモジュール構成となっている。実際のハードウェアとしては、ＣＰＵがＲＯＭなどからプログラムを読み出して実行することにより上記各手段のうち１又は複数の各手段がＲＡＭ上にロードされ、１又は複数の各手段がＲＡＭ上に生成されるようになっている。 A program executed by the image processing apparatus and the server according to each embodiment has a module configuration including the above-described units. As actual hardware, when the CPU reads a program from a ROM or the like and executes it, one or more of the above means are loaded onto the RAM, and one or more of the respective means are generated on the RAM. It has become so.

なお、本発明は、上記実施例そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化することができる。また、上記実施例に開示されている複数の構成要素の適宜な組み合わせにより、種々の発明を形成することができる。例えば、実施例に示される全構成要素からいくつかの構成要素を削除してもよい。 In addition, this invention is not limited to the said Example as it is, A component can be deform | transformed and embodied in the range which does not deviate from the summary in an implementation stage. Moreover, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the above embodiments. For example, some components may be deleted from all the components shown in the embodiments.

１０画像処理装置
１６ネットワークインタフェース
２０ＰＣ
３０、４０サーバ
３１ＰＣＵ
３２ＲＡＭ
３６ネットワークインタフェース
１１１ＣＰＵ
１１２ＲＡＭ
２０１二値画像生成手段
２０３、４０１圧縮手段
２０５指定手段
２０７、４０３送信手段
２０９受信手段
２１１、５０３ファイル生成手段
３０１、５０１受信手段
３０３伸長手段
３０５画像処理手段
３０７、５０５送信手段 10 image processing device 16 network interface 20 PC
30, 40 Server 31 PCU
32 RAM
36 Network interface 111 CPU
112 RAM
201 binary image generation means 203, 401 compression means 205 designation means 207, 403 transmission means 209 reception means 211, 503 file generation means 301, 501 reception means 303 decompression means 305 image processing means 307, 505 transmission means

特開２００４−１０４４３５号公報JP 2004-104435 A

Claims

Generating means for generating a binary image from the image;
Compression means for compressing the binary image generated by the generation means and the image from which the binary image is generated;
Transmitting means for transmitting the binary image compressed by the compression means to an external device having image processing means for performing predetermined image processing;
Receiving means for receiving, from the external device, information on a binary image obtained by performing predetermined image processing on the binary image transmitted by the transmitting means by the image processing means;
Image processing comprising: file generation means for generating an image file based on the binary image information subjected to the predetermined image processing received by the reception means and the generation source image compressed by the compression means apparatus.

The image processing apparatus according to claim 1, wherein the information of the binary image subjected to the predetermined image processing is text data after OCR processing performed by the image processing unit.

The image processing apparatus according to claim 2, further comprising designation means for designating a target language for OCR processing with respect to the compressed binary image.

The transmission means includes
The image processing apparatus according to claim 1, wherein when the compressed binary image has a plurality of pages, a transmission destination is selected from the plurality of apparatuses for each page.

An image processing system comprising a first device and a second device connected via a network,
The first device includes:
Generating means for generating a binary image from the image;
Compression means for compressing the binary image generated by the generation means and the image from which the binary image is generated;
Transmission means for transmitting the binary image compressed by the compression means to the second device;
Receiving means for receiving, from the second device, information on a binary image that has undergone predetermined image processing;
File generating means for generating an image file based on the information of the binary image subjected to the predetermined image processing received by the receiving means and the source image compressed by the compressing means;
The second device includes:
Receiving means for receiving the compressed binary image;
Image processing means for performing predetermined image processing on the binary image obtained by expanding the compressed binary image;
An image processing system comprising: transmission means for transmitting information on the binary image subjected to the predetermined image processing to the first device.

A generating step for generating a binary image from the image;
A compression step of compressing the generated binary image and an image from which the binary image is generated;
A transmission step of transmitting the compressed binary image to an external device having image processing means for performing predetermined image processing;
A receiving step of receiving, from the external device, binary image information obtained by performing predetermined image processing on the transmitted binary image by the image processing unit;
A program for causing a computer to execute a file generation step of generating an image file based on the received binary image information subjected to the predetermined image processing and the compressed generation source image.