JP2000172785A

JP2000172785A - Device and method for recognizing character and computer readable memory

Info

Publication number: JP2000172785A
Application number: JP10344605A
Authority: JP
Inventors: Kitahiro Kaneda; 北洋金田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1998-12-03
Filing date: 1998-12-03
Publication date: 2000-06-23

Abstract

PROBLEM TO BE SOLVED: To provide a character recognizing device capable of improving character recognition precision and also total throughput and to provide a character recognizing method and a computer readable memory. SOLUTION: A attribute information reading part 4 reads the attribute information of image data inputted from an image inputting part 2. A character recognition controlling part 6 decides a dictionary of character recognition corresponding to the read attribute information. A character recognizing part 10 performs character recognition to the inputted image data by using the decided character recognition dictionary.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、入力された画像デ
ータに対し、文字認識を行う文字認識装置及びその方
法、コンピュータ可読メモリに関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition apparatus and method for performing character recognition on input image data, and a computer readable memory.

【０００２】[0002]

【従来の技術】従来の文字認識装置、特に手書き文字の
認識は、不特定多数の筆記者による文字を認識対象とし
ているので、その全てを満足させるような文字認識用辞
書の作成は事実上不可能であった。従って、現状は統計
的に抽出された手書き文字サンプルに基づき文字認識用
辞書を作成させるか、あるいは筆記者を限定して文字認
識用辞書を作成するか等の妥協的な手法で、手書き文字
の認識を行っている。また、活字文字認識に関しては、
ある程度フォントを限定して文字認識用辞書を作成して
いる。2. Description of the Related Art In a conventional character recognition apparatus, particularly in the recognition of handwritten characters, since the characters of an unspecified number of writers are to be recognized, it is virtually impossible to create a character recognition dictionary that satisfies all of them. It was possible. Therefore, the current situation is to create a character recognition dictionary based on a statistically extracted sample of handwritten characters, or to create a character recognition dictionary by limiting the writer, etc. Recognition. Regarding type recognition,
The dictionary for character recognition is created by limiting fonts to some extent.

【０００３】また、文字認識後の後処理を行う後処理部
では、文字認識結果の単語のつながりをチェックして文
字認識の誤りを訂正していた。ここで用いられる単語
は、単語照合用辞書にあらかじめ登録されており、チェ
ック毎に呼び出される。また、一般に単語照合用辞書に
は、あらゆる種類の原稿に対応するよう単語が登録され
ている。A post-processing unit that performs post-processing after character recognition checks the connection between words in the character recognition result and corrects character recognition errors. The words used here are registered in the word collation dictionary in advance, and are called for each check. Generally, words are registered in the word collation dictionary so as to correspond to all types of manuscripts.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上記従
来の文字認識装置では、認識対象に最適な文字認識用辞
書の絞り込みができないため、一定以上の精度を出すこ
とが困難であった。また、筆記者、あるいはフォントを
限定した場合、その使い勝手が大幅に悪化してしまうこ
とも事実である。However, in the above-mentioned conventional character recognition apparatus, it is difficult to narrow down a character recognition dictionary optimal for a recognition target, so that it is difficult to obtain a certain level of accuracy. In addition, when the writer or the font is limited, the usability is greatly deteriorated.

【０００５】また、文字認識装置の後処理部では、単語
照合用辞書が汎用的な作りとなっているため、冗長度が
高く、それに伴い処理速度、精度の低下を招いていた。In the post-processing unit of the character recognition device, the dictionary for word collation is made versatile, so that the degree of redundancy is high and the processing speed and accuracy are reduced accordingly.

【０００６】本発明は上記の問題点に鑑みてなされたも
のであり、文字認識精度、かつトータルスループットを
向上することができる文字認識装置及びその方法、コン
ピュータ可読メモリを提供することを目的とする。SUMMARY OF THE INVENTION The present invention has been made in consideration of the above problems, and has as its object to provide a character recognition apparatus and method capable of improving character recognition accuracy and total throughput, and a computer readable memory. .

【０００７】[0007]

【課題を解決するための手段】上記の目的を達成するた
めの本発明による文字認識装置は以下の構成を備える。
即ち、入力された画像データに対し、文字認識を行う文
字認識装置であって、前記入力された画像データの属性
情報を読み取る読取手段と、前記読取手段で読み取られ
た属性情報に対応する文字認識用辞書を決定する決定手
段と、前記決定手段で決定された文字認識辞書を用い
て、前記入力された画像データに対し文字認識を行う文
字認識手段とを備える。A character recognition apparatus according to the present invention for achieving the above object has the following arrangement.
That is, a character recognizing device that performs character recognition on input image data, comprising: reading means for reading attribute information of the input image data; and character recognition corresponding to the attribute information read by the reading means. Determining means for determining a dictionary for use; and character recognition means for performing character recognition on the input image data using the character recognition dictionary determined by the determining means.

【０００８】また、好ましくは、前記画像データは、Ｆ
ｌａｓｈＰｉｘフォーマットの画像データである。[0008] Preferably, the image data is F
This is image data in a flashPix format.

【０００９】また、好ましくは、前記属性情報は、前記
入力された画像データの作成者を示す情報である。[0009] Preferably, the attribute information is information indicating a creator of the input image data.

【００１０】また、好ましくは、前記決定手段は、前記
作成者毎の文字認識用辞書を記憶する記憶手段を備え、
前記記憶手段を参照して、前記読取手段で読み取られた
属性情報に対応する文字認識用辞書を決定する。[0010] Preferably, the determination means includes a storage means for storing a character recognition dictionary for each creator,
A character recognition dictionary corresponding to the attribute information read by the reading unit is determined with reference to the storage unit.

【００１１】また、好ましくは、前記属性情報は、前記
入力された画像データの種類を示す情報である。Preferably, the attribute information is information indicating a type of the input image data.

【００１２】また、好ましくは、前記決定手段は、前記
入力された画像データの種類毎の文字認識用辞書を記憶
する記憶手段を備え、前記記憶手段を参照して、前記読
取手段で読み取られた属性情報に対応する文字認識用辞
書を決定する。Preferably, the determining means includes a storage means for storing a character recognition dictionary for each type of the input image data, and the reading means is read by the reading means with reference to the storing means. A character recognition dictionary corresponding to the attribute information is determined.

【００１３】上記の目的を達成するための本発明による
文字認識装置は以下の構成を備える。即ち、入力された
画像データに対し、文字認識を行う文字認識装置であっ
て、前記入力された画像データに対し文字認識を行う文
字認識手段と、前記入力された画像データの属性情報を
読み取る読取手段と、前記読取手段で読み取られた属性
情報に基づいて、前記入力された画像データの種類を特
定する特定手段と、前記特定手段で特定された種類にも
とづいて、前記文字認識手段で得られる文字認識結果に
対し後処理を行う後処理手段とを備える。A character recognition device according to the present invention for achieving the above object has the following configuration. That is, a character recognition device that performs character recognition on input image data, a character recognition unit that performs character recognition on the input image data, and a reading device that reads attribute information of the input image data. Means for specifying the type of the input image data based on the attribute information read by the reading means; and the character recognition means based on the type specified by the specifying means. Post-processing means for performing post-processing on the character recognition result.

【００１４】また、好ましくは、前記画像データは、Ｆ
ｌａｓｈＰｉｘフォーマットの画像データである。Preferably, the image data is F
This is image data in a flashPix format.

【００１５】また、好ましくは、前記後処理手段は、画
像データの種類毎に文字認識結果中の単語と照合する単
語照合用辞書を記憶する記憶手段と、前記記憶手段を参
照し、前記特定手段で特定された種類に対応する単語照
合用辞書を決定する決定手段とを備え、前記決定手段で
決定された単語照合用辞書を用いて、前記文字認識結果
に対し後処理を行う。Preferably, the post-processing means stores a word collation dictionary for collating words in a character recognition result for each type of image data, and the identification means refers to the storage means. Determining means for determining a word matching dictionary corresponding to the type specified in the step (a), and performing post-processing on the character recognition result using the word matching dictionary determined by the determining means.

【００１６】上記の目的を達成するための本発明による
文字認識方法は以下の構成を備える。即ち、入力された
画像データに対し、文字認識を行う文字認識方法であっ
て、前記入力された画像データの属性情報を読み取る読
取工程と、前記読取工程で読み取られた属性情報に対応
する文字認識用辞書を決定する決定工程と、前記決定工
程で決定された文字認識辞書を用いて、前記入力された
画像データに対し文字認識を行う文字認識工程とを備え
る。A character recognition method according to the present invention for achieving the above object has the following configuration. That is, a character recognition method for performing character recognition on input image data, comprising: a reading step of reading attribute information of the input image data; and a character recognition corresponding to the attribute information read in the reading step. And a character recognition step of performing character recognition on the input image data using the character recognition dictionary determined in the determination step.

【００１７】上記の目的を達成するための本発明による
文字認識方法は以下の構成を備える。即ち、入力された
画像データに対し、文字認識を行う文字認識方法であっ
て、前記入力された画像データに対し文字認識を行う文
字認識工程と、前記入力された画像データの属性情報を
読み取る読取工程と、前記読取工程で読み取られた属性
情報に基づいて、前記入力された画像データの種類を特
定する特定工程と、前記特定工程で特定された種類にも
とづいて、前記文字認識工程で得られる文字認識結果に
対し後処理を行う後処理工程とを備える。A character recognition method according to the present invention for achieving the above object has the following configuration. That is, a character recognition method for performing character recognition on input image data, wherein a character recognition step for performing character recognition on the input image data, and reading for reading attribute information of the input image data A step of specifying the type of the input image data based on the attribute information read in the reading step; and a step of obtaining the character recognition step based on the type specified in the specifying step. And a post-processing step of performing post-processing on the character recognition result.

【００１８】上記の目的を達成するための本発明による
コンピュータ可読メモリは以下の構成を備える。即ち、
入力された画像データに対し、文字認識を行う文字認識
のプログラムコードが格納されたコンピュータ可読メモ
リであって、前記入力された画像データの属性情報を読
み取る読取工程のプログラムコードと、前記読取工程で
読み取られた属性情報に対応する文字認識用辞書を決定
する決定工程のプログラムコードと、前記決定工程で決
定された文字認識辞書を用いて、前記入力された画像デ
ータに対し文字認識を行う文字認識工程のプログラムコ
ードとを備える。A computer readable memory according to the present invention for achieving the above object has the following configuration. That is,
A computer-readable memory storing a character recognition program code for performing character recognition on the input image data, wherein a program code for a reading step for reading attribute information of the input image data; and Character recognition for performing character recognition on the input image data by using a program code of a determination step of determining a character recognition dictionary corresponding to the read attribute information and a character recognition dictionary determined in the determination step Process code.

【００１９】上記の目的を達成するための本発明による
コンピュータ可読メモリは以下の構成を備える。即ち、
入力された画像データに対し、文字認識を行う文字認識
のプログラムコードが格納されたコンピュータ可読メモ
リであって、前記入力された画像データに対し文字認識
を行う文字認識工程のプログラムコードと、前記入力さ
れた画像データの属性情報を読み取る読取工程のプログ
ラムコードと、前記読取工程で読み取られた属性情報に
基づいて、前記入力された画像データの種類を特定する
特定工程のプログラムコードと、前記特定工程で特定さ
れた種類にもとづいて、前記文字認識工程で得られる文
字認識結果に対し後処理を行う後処理工程のプログラム
コードとを備える。A computer readable memory according to the present invention for achieving the above object has the following configuration. That is,
A computer readable memory storing a character recognition program code for performing character recognition on input image data, wherein a program code for a character recognition step for performing character recognition on the input image data is provided. A program code for a reading step for reading attribute information of the input image data, a program code for a specifying step for specifying the type of the input image data based on the attribute information read in the reading step, and the specifying step And a program code of a post-processing step for performing post-processing on the character recognition result obtained in the character recognition step based on the type specified in the above.

【００２０】[0020]

【発明の実施の形態】以下、図面を参照して本発明の好
適な実施形態を詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Preferred embodiments of the present invention will be described below in detail with reference to the drawings.

【００２１】まず、以下の実施形態で用いる画像の画像
フォーマットの１つであるＦｌａｓｈＰｉｘフォーマッ
トについて説明する。First, a FlashPix format which is one of image formats of an image used in the following embodiments will be described.

【００２２】ＦｌａｓｈＰｉｘTM（ＦｌａｓｈＰｉｘ
は、米国ＥａｓｔｍａｎＫｏｄａｋ社の登録商標）フ
ァイルフォーマットでは、画像ヘッダ部に格納されてい
る属性情報および画像データをさらに構造化し、ファイ
ルとして管理する。この構造化した画像ファイルを図
９、図１０に示す。FlashPix ™ (FlashPix
In the file format (registered trademark of Eastman Kodak Company, USA), attribute information and image data stored in an image header portion are further structured and managed as a file. This structured image file is shown in FIGS.

【００２３】図９、図１０はＦｌａｓｈＰｉｘフォーマ
ットの画像ファイルの構成を示す図である。FIG. 9 and FIG. 10 are diagrams showing the structure of a FlashPix format image file.

【００２４】画像ファイル内の各プロパティやデータに
は、ＭＳ−ＤＯＳのディレクトリとファイルに相当す
る、ストレージとストリームによってアクセスする。図
９、図１０において、影付き部分がストレージで、影な
し部分がストリームである。画像データや属性情報は、
ストリーム部分に格納される。画像データは異なる解像
度で階層化されており、それぞれの解像度の画像データ
をＳｕｂｉｍａｇｅと呼び、それぞれＲｅｓｏｌｕｔｉ
ｏｎ０，１，…ｎで示してある。各解像度の画像データ
に対して、その画像データを読み出すために必要な情報
がＳｕｂｉｍａｇｅｈｅａｄｅｒに、また、実際の画
像データがＳｕｂｉｍａｇｅｄａｔａに格納される。Each property and data in the image file is accessed by storage and stream corresponding to the directory and file of MS-DOS. 9 and 10, the shaded portion is the storage and the unshaded portion is the stream. Image data and attribute information
Stored in the stream part. The image data is hierarchized at different resolutions, and the image data of each resolution is called a Subimage, and each is a Resolution.
on0, 1,... n. For image data of each resolution, information necessary for reading out the image data is stored in the Subimage header, and actual image data is stored in the Subimage data.

【００２５】プロパティセットとは、属性情報をその使
用目的、内容に応じて分類して定義したもので、Ｓｕｍ
ｍａｒｙｉｎｆｏ．Ｐｒｏｐｅｒｔｙｓｅｔ、Ｉ
ｍａｇｅｉｎｆｏ．Ｐｒｏｐｅｒｔｙｓｅｔ、Ｉ
ｍａｇｅｃｏｎｔｅｎｔｓＰｒｏｐｅｒｔｙｓｅ
ｔ、ＥｘｔｅｎｓｉｏｎｌｉｓｔＰｒｏｐｅｒｔｙ
ｓｅｔがある。＜各プロパティセットの説明＞Ｓｕｍｍａｒｙｉｎｆ
ｏ．Ｐｒｏｐｅｒｔｙｓｅｔは、ＦｌａｓｈＰｉｘ
特有のものではなく、Ｍｉｃｒｏｓｏｆｔ社のストラク
チャードストレージでは必須のプロパティセットであ
る。このプロパティセット内には、画像ファイルのタイ
トル・題名・著者・サムネール画像等を格納する。A property set is defined by classifying attribute information according to its purpose of use and its contents.
Mary info. Property set, I
image info. Property set, I
mage contentsPropertyse
t, Extension list Property
There is a set. <Explanation of each property set> Summery inf
o. Property set is FlashPix
It is not unique and is an essential property set in Microsoft's structured storage. In this property set, the title, title, author, thumbnail image, etc. of the image file are stored.

【００２６】ＩｍａｇｅｃｏｎｔｅｎｔｓＰｒｏｐ
ｅｒｔｙｓｅｔは、画像データの格納方法を記述する
属性である（図１３）。この属性には、画像データの階
層数、最大解像度の画像データの幅、高さや、それぞれ
の解像度の画像データについての幅、高さ、色の構成、
あるいはＪＰＥＧ圧縮を用いる際の量子化テーブル・ハ
フマンテーブルの定義を記述する。Image contents Prop
The erty set is an attribute that describes a method of storing image data (FIG. 13). This attribute includes the number of layers of the image data, the width and height of the image data of the maximum resolution, the width, height, color configuration of the image data of each resolution,
Alternatively, the definition of the quantization table / Huffman table when using JPEG compression is described.

【００２７】Ｉｍａｇｅｉｎｆｏ．Ｐｒｏｐｅｒｔ
ｙｓｅｔは、画像データを使用する際に利用できるさ
まざまな情報、例えば、以下に示すような画像がどのよ
うにして取り込まれ、どのように利用可能であるかの情
報を格納する。[0027] Image info. Property
y set stores various information that can be used when using image data, for example, information on how an image as shown below is captured and how it can be used.

【００２８】・デジタル画像データの取り込み方法／あ
るいは生成方法に関する情報（ＦｉｌｅＳｏｕｒｃ
ｅ）・著作権に関する情報（Ｉｎｔｅｌｌｅｃｔｕａｌｐ
ｒｏｐｅｒｔｙ）・画像の内容（画像中の人物、場所など）に関する情報
（Ｃｏｎｔｅｎｔｄｅｓｃｒｉｐｔｉｏｎ）・撮影に使われたカメラに関する情報（Ｃａｍｅｒａ
ｉｎｆｏｒｍａｔｉｏｎ）・撮影時のカメラのセッティング（露出、シャッタース
ピード、焦点距離、フラッシュ使用の有無など）に関す
る情報（ＰｅｒＰｉｃｔｕｒｅｃａｍｅｒａｓｅ
ｔｔｉｎｇｓ）・デジタルカメラ特有解像度やモザイクフィルタに関す
る情報（Ｄｉｇｉｔａｌｃａｍｅｒａｃｈａｒａｃ
ｔｅｒｉｚａｔｉｏｎ）・フィルムのメーカ名、製品名、種類（ネガ／ポジ、カ
ラー／白黒）に関する情報（Ｆｉｌｍｄｅｓｃｒｉｐ
ｔｉｏｎ）・オリジナルが書物や印刷物である場合の種類やサイズ
に関する情報（Ｏｒｉｇｉｎａｌｄｏｃｕｍｅｎｔ
ｓｃａｎｄｅｓｃｒｉｐｔｉｏｎ）・スキャン画像の場合、使用したスキャナやソフト、操
作した人に関する情報（Ｓｃａｎｄｅｖｉｃｅ）ＥｘｔｅｎｓｉｏｎｌｉｓｔＰｒｏｐｅｒｔｙｓ
ｅｔは、上記ＦｌａｓｈＰｉｘの基本仕様に含まれない
情報を格納する。Information on a method of capturing and / or generating digital image data (File Source)
e) Copyright information (Intellectual p
information) (information about the content of the image (person, place, etc. in the image)) (Content description) ・ Information about the camera used for shooting (Camera)
Information) Information on camera settings (exposure, shutter speed, focal length, use of flash, etc.) at the time of shooting (Per Picture camera series)
(tings) ・ Information on digital camera specific resolution and mosaic filter (Digital camera charac)
Information about film manufacturer name, product name, and type (negative / positive, color / black and white) (Film desrip)
Information about the type and size when the original is a book or printed matter (Original document)
(scan description)-In the case of a scanned image, information on a scanner, software, and a person who operated the device (Scan device) Extension list Property s
et stores information not included in the basic specifications of the FlashPix.

【００２９】図１０のＦｌａｓｈＰｉｘＩｍａｇｅ
ｖｉｅｗｏｂｊｅｃｔは、画像を表示する際に用いる
ビューイングパラメータと画像データをあわせて格納す
る画像ファイルである。ビューイングパラメータとは、
画像の回転、拡大／縮小、移動、色変換、フィルタリン
グの処理を画像表示の際に適応するために記憶しておく
処理係数のセットである。FlashPix Image of FIG.
The view object is an image file that stores viewing parameters and image data used when displaying an image. What are viewing parameters?
This is a set of processing coefficients stored for adapting the processing of image rotation, enlargement / reduction, movement, color conversion, and filtering when displaying an image.

【００３０】Ｓｏｕｒｃｅ／ＲｅｓｕｌｔＦｌａｓｈ
Ｐｉｘｉｍａｇｅｏｂｊｅｃｔは、ＦｌａｓｈＰｉ
ｘフォーマットの画像データの実体であり、Ｓｏｕｒｃ
ｅＦｌａｓｈＰｉｘｉｍａｇｅｏｂｊｅｃｔは必
須、ＲｅｓｕｌｔＦｌａｓｈＰｉｘｉｍａｇｅｏ
ｂｊｅｃｔはオプションである。ＳｏｕｒｃｅＦｌａ
ｓｈＰｉｘｉｍａｇｅｏｂｊｅｃｔはｍオリジナル
の画像データを、ＲｅｓｕｌｔＦｌａｓｈＰｉｘｉ
ｍａｇｅｏｂｊｅｃｔはビューイングパラメータを使
って画像処理した結果の画像データを格納する。Source / Result Flash
Pix image object is FlashPi
x-format image data.
eFlashPix image object is required, Result FlashPix image o
bject is optional. Source Fla
shPix image object converts m original image data to Result FlashPix i
The image object stores image data obtained as a result of image processing using viewing parameters.

【００３１】Ｓｏｕｒｃｅ／Ｒｅｓｕｌｔｄｅｓｃ．
Ｐｒｏｐｅｒｔｙｓｅｔは、上記画像データの識別
のためのプロパティセットであり、画像ＩＤ、変更禁止
のプロパティセット、最終更新日時等を格納する。Source / Result desc.
The property set is a property set for identifying the image data, and stores an image ID, a property set for which change is prohibited, a last update date and time, and the like.

【００３２】Ｔｒａｎｓｆｏｒｍｐｒｏｐｅｒｔｙ
ｓｅｔは、回転、拡大／縮小、移動のためのＡｆｆｉｎ
ｅ変換係数、色変換マトリクス、コントラスト調整値、
フィルタリング係数を格納する。[0032] Transform property
set is Affin for rotation, scaling / movement
e conversion coefficient, color conversion matrix, contrast adjustment value,
Stores filtering coefficients.

【００３３】次に、画像データの取り扱いについて説明
する。Next, handling of image data will be described.

【００３４】図１１は解像度の異なる複数の画像データ
から構成される画像ファイルの例を示す図である。FIG. 11 is a diagram showing an example of an image file composed of a plurality of image data having different resolutions.

【００３５】図１１において、最大解像度の画像データ
は、列×行がＸ０×Ｙ０で構成されており、その次に大
きい画像データは、列×行がＸ０／２×Ｙ０／２であ
り、それ以降順次、列×行ともに１／２づつ縮小し、列
×行ともに６４画素以下あるいは等しくなるまで繰り返
す。In FIG. 11, the maximum resolution image data has a column × row of X0 × Y0, and the next largest image data has a column × row of X0 / 2 × Y0 / 2. Thereafter, both the column and the row are sequentially reduced by １／, and the process is repeated until both the column and the row are equal to or smaller than 64 pixels.

【００３６】このように、複数の解像度に階層化した結
果、画像データの属性情報として「１つの画像ファイル
中の階層数」やそれぞれの階層の画像データに対して、
ヘッダ情報と画像データが必要となる。１つの画像デー
タ中の階層の数や最大解像度の画像データの幅、高さ、
あるいはそれぞれの解像度の画像の幅、高さ色構成、圧
縮方式等に関する情報は、図１３に示したＩｍａｇｅ
ｃｏｎｔｅｎｔｓＰｒｏｐｅｒｔｙｓｅｔ中に記述
される。As described above, as a result of hierarchization into a plurality of resolutions, as the attribute information of the image data, “the number of layers in one image file” and the image data of each layer
Header information and image data are required. The number of layers in one image data, the width and height of the image data of the maximum resolution,
Alternatively, information on the width, height, color configuration, compression method, and the like of an image at each resolution can be obtained from Image shown in FIG.
contents Described in the property set.

【００３７】更に、各解像度の画像データは、図１２に
示すように６４×６４のタイルに分割される。画像デー
タの左上部から順次６４×６４のタイルに分割をする
と、画像データによっては右端および下端のタイルの一
部に空白が生ずる場合がある。この場合は、それぞれ最
右端画像データまたは最下端画像データを繰り返し挿入
することで、６４×６４画素を構築する。ＦｌａｓｈＰ
ｉｘフォーマットでは、それぞれのタイル中の画像デー
タをＪＰＥＧ圧縮、シングルカラー、非圧縮のいずれか
の方法で格納する。Further, the image data of each resolution is divided into 64 × 64 tiles as shown in FIG. When the image data is divided into 64 × 64 tiles sequentially from the upper left, blanks may occur in some of the right and lower end tiles depending on the image data. In this case, 64 × 64 pixels are constructed by repeatedly inserting the rightmost image data or the bottommost image data. FlashP
In the ix format, image data in each tile is stored in one of JPEG compression, single color, and non-compression.

【００３８】ＪＰＥＧ圧縮は、ＩＳＯ／ＩＥＣＪＴＣ１
／ＳＣ２９により国際標準化された画像圧縮方式であ
り、方式自体の説明はここでは省略する。このようにタ
イル分割された画像データは、Ｓｕｂｉｍａｇｅｄａ
ｔａストリーム中に格納され、タイルの総数、個々のタ
イルのサイズ、データの開始位置、圧縮方法はすべて、
Ｓｕｂｉｍａｇｅｈｅａｄｅｒ（図１４）に格納され
る。シングルカラーとは、１つのタイルがすべて同じ色
で構成されている場合にのみ、個々の画素の値を記録す
ることなく、そのタイルの色を１色で表現する方式であ
る。この方法は特に、コンピュータグラフィックスによ
り生成された画像で有効である。［実施形態１］図１は本発明の実施形態１の文字認識装
置に適用可能な情報処理装置の構成を示すブロック図で
ある。JPEG compression is based on ISO / IECJTC1.
/ SC29 is an image compression system internationally standardized, and the description of the system itself is omitted here. The image data divided in this way is described in Subimage data.
ta, the total number of tiles, the size of each tile, the starting position of the data, and the compression method are all
It is stored in the Subimage header (FIG. 14). The single color is a method in which the color of a tile is represented by one color without recording the value of each pixel only when all the tiles are composed of the same color. This method is particularly useful for images generated by computer graphics. [First Embodiment] FIG. 1 is a block diagram showing a configuration of an information processing apparatus applicable to a character recognition apparatus according to a first embodiment of the present invention.

【００３９】図１において、ＣＰＵ１０１はメインバス
１０７を介して情報処理装置２００全体の制御を実行す
るとともに、情報処理装置２００の外部に接続される入
力装置１１１（例えば、イメージスキャナ、記憶装置、
ネットワーク回線を介して接続される他の情報処理装
置、電話回線を介して接続されるファクシミリ等）を入
力Ｉ／Ｆ（インタフェース）１０４を介して制御する。
また、情報処理装置２００の外部に接続される出力装置
１１２（例えば、プリンタ、モニタ、ネットワーク回線
を介して接続される他の情報処理装置、電話回線を介し
て接続されるファクシミリ等）を出力Ｉ／Ｆ１０５を介
して制御する。また、ＣＰＵ１０１は、ＫＢＤＩ／Ｆ
（キーボードインタフェース）１０６を介して入力部
（例えば、ポインティングデバイス１２３やキーボード
１２４やペン１２５）から入力された指示に従って、画
像の入力、画像処理、色変換処理、画像の出力制御等の
一連の処理を実行する。更に、入力装置１１１より入力
された画像データや、ポインティングデバイス１２３や
キーボード１２４やペン１２５を用いて作成された画像
データを表示する表示部１０９をビデオＩ／Ｆ（インタ
フェース）１０８を介して制御する。In FIG. 1, a CPU 101 controls the entire information processing apparatus 200 via a main bus 107, and also has an input device 111 (for example, an image scanner, a storage device,
Other information processing devices connected via a network line, a facsimile connected via a telephone line, etc.) are controlled via an input I / F (interface) 104.
In addition, an output device 112 (for example, a printer, a monitor, another information processing device connected via a network line, a facsimile connected via a telephone line, etc.) connected to the outside of the information processing device 200 is output to the output I. / F105. In addition, the CPU 101 determines whether the KBDI / F
A series of processes such as image input, image processing, color conversion processing, image output control, etc., according to an instruction input from an input unit (for example, the pointing device 123, the keyboard 124, or the pen 125) via the (keyboard interface) 106. Execute Further, a display unit 109 that displays image data input from the input device 111 and image data created using the pointing device 123, the keyboard 124, and the pen 125 is controlled via a video I / F (interface) 108. .

【００４０】ＲＯＭ１０２は、ＣＰＵ１０１の各種制御
を実行する各種制御プログラムを記憶している。ＲＡＭ
１０３は、ＣＰＵ１０１によりＯＳや本発明を実現する
ための制御プログラムを含むその他の制御プログラムが
ロードされ実行される。また、制御プログラムを実行す
るために用いられる各種作業領域、一時待避領域として
機能する。また、入力装置１１１より入力された画像デ
ータや、ポインティングデバイス１２３やキーボード１
２４やペン１２５を用いて作成された画像データを、一
旦、保持するＶＲＡＭ（不図示）が構成されている。The ROM 102 stores various control programs for executing various controls of the CPU 101. RAM
The CPU 103 loads and executes an OS and other control programs including a control program for implementing the present invention by the CPU 101. In addition, it functions as various work areas used for executing the control program and a temporary save area. In addition, image data input from the input device 111, the pointing device 123, the keyboard 1
A VRAM (not shown) is configured to temporarily hold image data created using the pen 24 and the pen 125.

【００４１】次に、実施形態１の文字認識装置の機能構
成について、図２を用いて説明する。Next, the functional configuration of the character recognition device according to the first embodiment will be described with reference to FIG.

【００４２】図２は本発明の実施形態１の文字認識装置
の機能構成を示す図である。FIG. 2 is a diagram showing a functional configuration of the character recognition device according to the first embodiment of the present invention.

【００４３】図２において、２はＦｌａｓｈＰｉｘ画像
を入力する画像入力部である。４は入力されたＦｌａｓ
ｈＰｉｘ画像の属性情報を読み取る属性情報読取部であ
る。６は属性情報読取部４で読み取られた属性情報より
文字認識を制御する文字認識制御部である。８は画像入
力部２において入力されたＦｌａｓｈＰｉｘ画像の画像
本体に文字認識に適する処理を施す画像処理部である。
１０は文字認識を行う文字認識部である。In FIG. 2, reference numeral 2 denotes an image input unit for inputting a FlashPix image. 4 is the input Flash
The attribute information reading unit reads attribute information of the hPix image. Reference numeral 6 denotes a character recognition control unit that controls character recognition based on the attribute information read by the attribute information reading unit 4. Reference numeral 8 denotes an image processing unit that performs processing suitable for character recognition on the image body of the FlashPix image input by the image input unit 2.
Reference numeral 10 denotes a character recognition unit that performs character recognition.

【００４４】尚、文字認識制御部６には、ＦｌａｓｈＰ
ｉｘ画像の属性情報で示される原稿作成者毎の文字認識
用辞書と、汎用文字認識用辞書を記憶している。Note that the character recognition control unit 6 has a FlashP
A dictionary for character recognition for each document creator indicated by attribute information of the ix image and a dictionary for general character recognition are stored.

【００４５】次に動作について説明する。Next, the operation will be described.

【００４６】画像入力部２から入力されたＦｌａｓｈＰ
ｉｘ画像は、その属性情報が属性情報読取部４に、Ｆｌ
ａｓｈＰｉｘ画像本体の実体データは画像処理部８に入
力される。属性情報読取部４では、原稿画像の作成者を
示す原稿作成者情報の読み取りを行う。文字認識制御部
６では、読み取られた原稿作成者情報によって文字認識
用辞書を選択する。一方、画像処理部８では、Ｆｌａｓ
ｈＰｉｘ画像本体の実体データに対し２値化処理を施
す。２値化された画像データは、文字認識部１０に入力
され、文字認識制御部６において選択された文字認識用
辞書を用い、文字認識が実行される。FlashP input from the image input unit 2
ix image, the attribute information of which is stored in the attribute information reading unit 4
The entity data of the AshPix image body is input to the image processing unit 8. The attribute information reading section 4 reads document creator information indicating the creator of the document image. The character recognition control section 6 selects a character recognition dictionary based on the read document creator information. On the other hand, in the image processing unit 8,
Binary processing is performed on the entity data of the hPix image body. The binarized image data is input to the character recognition unit 10, and character recognition is performed using the character recognition dictionary selected by the character recognition control unit 6.

【００４７】次に、実施形態１の文字認識制御部６で実
行される処理について、図３を用いて説明する。Next, the processing executed by the character recognition control unit 6 of the first embodiment will be described with reference to FIG.

【００４８】図３は本発明の実施形態１の文字認識制御
部で実行される処理を示すフローチャートである。FIG. 3 is a flowchart showing processing executed by the character recognition control unit according to the first embodiment of the present invention.

【００４９】ステップＳ２０２で、属性情報読取部４に
おいて取得された原稿作成者情報を読み込む。ステップ
Ｓ２０４で、ステップＳ２０２で読み込まれた原稿作成
者情報より原稿作成者が指定されているか否かを判定す
る。作成者が指定されている場合（ステップＳ２０４で
ＹＥＳ）、ステップＳ２０６へ進み、原稿作成者により
あらかじめ設定されている手書き文字認識用辞書を選択
する。一方、作成者が指定されていない場合（ステップ
Ｓ２０４でＮＯ）、ステップＳ２０８へ進み、汎用手書
き文字認識用辞書を選択する。In step S202, the document creator information obtained by the attribute information reading section 4 is read. In step S204, it is determined whether or not the original creator is specified from the original creator information read in step S202. If the creator has been designated (YES in step S204), the flow advances to step S206 to select a handwritten character recognition dictionary set in advance by the document creator. On the other hand, if the creator has not been specified (NO in step S204), the flow advances to step S208 to select a general-purpose handwritten character recognition dictionary.

【００５０】次に、文字認識部１０で実行される処理に
ついて、図４を用いて説明する。Next, the processing executed by the character recognition unit 10 will be described with reference to FIG.

【００５１】図４は本発明の実施形態１の文字認識部で
実行される処理を示すフローチャートである。FIG. 4 is a flowchart showing the processing executed by the character recognition unit according to the first embodiment of the present invention.

【００５２】ステップＳ３０２で、画像処理部８により
２値化された画像データを読み込む。ステップＳ３０４
で、文字認識制御部６により選択された手書き文字認識
用辞書を読み込む。ステップＳ３０６で、ステップＳ３
０２、ステップＳ３０４で読み込まれた画像データ、手
書き文字認識用辞書を用いて、文字認識を行う。In step S302, the image data binarized by the image processing section 8 is read. Step S304
Then, the dictionary for handwritten character recognition selected by the character recognition control unit 6 is read. In step S306, step S3
02, character recognition is performed using the image data read in step S304 and the handwritten character recognition dictionary.

【００５３】以上説明したように、実施形態１によれ
ば、ＦｌａｓｈＰｉｘ画像の属性情報を活用することに
より、その文章に最適な文字認識用辞書を選択すること
ができる。つまり、文字認識精度と、使い勝手を大幅に
向上させることができる。［実施形態２］実施形態１では、入力された画像データ
の原稿作成者を特定することで、手書き文字認識用辞書
を選択する構成としたが、これに限定されるものではな
い。例えば、図５に示したＩｍａｇｅｉｎｆｏ．ｐ
ｒｏｐｅｒｔｙｓｅｔのＯｒｉｇｉｎａｌＤｏｃｕ
ｍｅｎｔＳｃａｎＤｅｓｃｒｉｐｔｉｏｎＧｒｏ
ｕｐより、原稿の種類を特定し、文字認識用辞書を選択
する構成にしても良い。この場合は、特に、活字文字認
識に有効である。［実施形態３］図６は本発明の実施形態３の文字認識装
置の機能構成を示す図である。As described above, according to the first embodiment, by utilizing the attribute information of the FlashPix image, it is possible to select an optimal character recognition dictionary for the text. That is, the character recognition accuracy and usability can be significantly improved. [Second Embodiment] In the first embodiment, the configuration is such that the dictionary for handwritten character recognition is selected by specifying the document creator of the input image data. However, the present invention is not limited to this. For example, Image info. p
Original Docu of the property set
Ment Scan Description Gro
The configuration may be such that the type of the document is specified from up, and a dictionary for character recognition is selected. In this case, it is particularly effective for print character recognition. Third Embodiment FIG. 6 is a diagram showing a functional configuration of a character recognition device according to a third embodiment of the present invention.

【００５４】図６において、１０２はＦｌａｓｈＰｉｘ
画像を入力する画像入力部である。１０４は入力された
ＦｌａｓｈＰｉｘ画像の属性情報を読み取る属性情報読
取部である。１０６属性情報読取部１０４で読み取られ
た属性情報より文字認識の後処理を制御する後処理制御
部である。１０８は入力されたＦｌａｓｈＰｉｘ画像本
体の実体データに文字認識を行う文字認識部である。１
１０は文字認識部１０８より得られる文字認識結果の単
語照合を行う後処理部である。In FIG. 6, reference numeral 102 denotes FlashPix.
An image input unit for inputting an image. Reference numeral 104 denotes an attribute information reading unit that reads attribute information of the input FlashPix image. 106 is a post-processing control unit that controls post-processing of character recognition based on the attribute information read by the attribute information reading unit 104. Reference numeral 108 denotes a character recognition unit that performs character recognition on the input entity data of the FlashPix image. 1
Reference numeral 10 denotes a post-processing unit that performs word matching on the character recognition result obtained by the character recognition unit 108.

【００５５】尚、後処理制御部１０６には、Ｆｌａｓｈ
Ｐｉｘ画像の属性情報が示す原稿の種類毎の単語照合用
辞書と、汎用単語照合用辞書を記憶している。Note that the post-processing control unit 106 has a
A dictionary for word matching for each document type indicated by the attribute information of the Pix image and a dictionary for general-purpose word matching are stored.

【００５６】次に動作について説明する。Next, the operation will be described.

【００５７】画像入力部１０２から取得されたＦｌａｓ
ｈＰｉｘ画像本体の実体データは文字認識部１０８に入
力される。また、ＦｌａｓｈＰｉｘ画像の属性情報は、
属性情報読取部１０４に入力される。後処理制御部１０
６では、認識対象原稿の種類を読み取られた属性情報を
元に判断し、それに応じて文字認識後処理部１１０内の
単語照合用辞書を選択する。一方、文字認識部１０８で
は、画像入力部１０２より入力されたＦｌａｓｈＰｉｘ
画像本体の実体データに対し文字認識を行う。文字認識
後処理部１１０では、文字認識部１０８によって得られ
る文字認識結果に対し、後処理制御部１０６で選択され
た単語照合用辞書を元に単語照合を施し、文字認識の誤
りを訂正する。Flas acquired from the image input unit 102
The entity data of the hPix image body is input to the character recognition unit 108. The attribute information of the FlashPix image is
The information is input to the attribute information reading unit 104. Post-processing control unit 10
In step 6, the type of the document to be recognized is determined based on the read attribute information, and a word collation dictionary in the character recognition post-processing unit 110 is selected accordingly. On the other hand, in the character recognition unit 108, the FlashPix input from the image input unit 102
Character recognition is performed on the entity data of the image body. The character recognition post-processing unit 110 performs word matching on the character recognition result obtained by the character recognition unit 108 based on the word matching dictionary selected by the post-processing control unit 106, and corrects character recognition errors.

【００５８】次に、実施形態３の後処理制御部１０６で
実行される処理について、図７を用いて説明する。Next, the processing executed by the post-processing control unit 106 in the third embodiment will be described with reference to FIG.

【００５９】図７は本発明の実施形態３の後処理制御部
で実行される処理を示すフローチャートである。FIG. 7 is a flowchart showing the processing executed by the post-processing control section of the third embodiment of the present invention.

【００６０】ステップＳ２１２で、属性情報読取部１０
４において取得されたＦｌａｓｈＰｉｘ画像の属性情報
を読み込む。ステップＳ２１４で、ステップＳ２１２で
読み込まれた属性情報のうち、例えば、図５に示したＩ
ｍａｇｅｉｎｆｏ．ＰｒｏｐｅｒｔｙｓｅｔのＯ
ｒｉｇｉｎａｌＤｏｃｕｍｅｎｔＳｃａｎＤｅｓ
ｃｒｉｐｔｉｏｎＧｒｏｕｐを読み取り、原稿の種類
を判定する。原稿の種類が一般用と判定された場合、ス
テップＳ２１８へ進み、汎用単語照合用辞書を選択す
る。一方、原稿の種類が特定分野の原稿と判定された場
合、それに最適な特定分野向けの単語照合用辞書を選択
する。In step S212, the attribute information reading unit 10
The attribute information of the FlashPix image acquired in step 4 is read. In step S214, of the attribute information read in step S212, for example, the I shown in FIG.
image info. O of Property set
original Document Scan Des
The “Cryption Group” is read to determine the type of the document. If it is determined that the type of the original is for general use, the process proceeds to step S218, where a general-purpose word collation dictionary is selected. On the other hand, when the type of the document is determined to be a document in a specific field, the most appropriate word matching dictionary for the specific field is selected.

【００６１】次に、文字認識後処理部１１０で実行され
る処理について、図８を用いて説明する。Next, the processing executed by the character recognition post-processing unit 110 will be described with reference to FIG.

【００６２】図８は本発明の実施形態３で実行される文
字認識後処理部で実行される処理を示すフローチャート
である。FIG. 8 is a flowchart showing a process executed by the character recognition post-processing unit executed in the third embodiment of the present invention.

【００６３】ステップＳ３１２で、文字認識部１０８か
ら得られる文字認識結果を読み込む。ステップＳ３１４
で、後処理制御部１０２により選択された単語照合用辞
書を読み込む。ステップＳ３１６で、ステップＳ３１４
で選択された単語照合用辞書を元に、ステップＳ３１２
で読み込まれた文字認識結果の誤りを訂正する。In step S312, the character recognition result obtained from character recognition unit 108 is read. Step S314
Then, the dictionary for word matching selected by the post-processing control unit 102 is read. In step S316, step S314
Based on the word collation dictionary selected in step S312,
Corrects the error in the character recognition result read by.

【００６４】以上説明したように、実施形態３によれ
ば、ＦｌａｓｈＰｉｘ画像の属性情報を活用することに
より、その文章に最適な単語照合用辞書を選択すること
ができるようになる。そのため、後処理精度、ひいては
文字認識全体の精度を向上することができる。［実施形態４］実施形態３では、ＦｌａｓｈＰｉｘ画像
の属性情報のうち、Ｉｍａｇｅｉｎｆｏ．ｐｒｏｐ
ｅｒｔｙｓｅｔのＯｒｉｇｉｎａｌＤｏｃｕｍｅｎ
ｔＳｃａｎＤｅｓｃｒｉｐｔｉｏｎＧｒｏｕｐを
読み取り、原稿の種類を特定する構成としていたが、こ
れに限定されるものではない。例えば、Ｉｍａｇｅｉ
ｎｆｏ．ｐｒｏｐｅｒｔｙｓｅｔのＣｏｎｔｅｎｔ
ＤｅｓｃｒｉｐｔｉｏｎＧｒｏｕｐから原稿の種類
を類推する構成にしても良い。［実施形態５］実施形態３では、後処理の単語照合用辞
書を一般用と、特定分野向けに分けていたが、これに限
定されるものではなく、様々な特定分野向けの単語照合
用辞書を用意しても良い。As described above, according to the third embodiment, by utilizing the attribute information of the FlashPix image, it is possible to select the most suitable word matching dictionary for the sentence. Therefore, it is possible to improve post-processing accuracy and, consequently, accuracy of overall character recognition. [Fourth Embodiment] In the third embodiment, in the attribute information of the FlashPix image, Image info. prop
erty set Original Documen
The configuration is such that the t Scan Description Group is read and the type of the document is specified, but the present invention is not limited to this. For example, Image i
nfo. Content of property set
The configuration may be such that the type of the document is inferred from the Description Group. [Embodiment 5] In the third embodiment, the dictionary for word matching in post-processing is divided into a general dictionary and a dictionary for a specific field. However, the present invention is not limited to this. May be prepared.

【００６５】以上説明した各実施形態では、処理対象の
画像としてＦｌａｓｈＰｉｘフォーマットのＦｌａｓｈ
Ｐｉｘ画像を用いたが、これに限定されるものではな
い。作成者の情報や、原稿の種類を特定できる情報等の
本発明を実現できる情報を有するフォーマットの画像で
あれば、どのような画像でも良い。In each of the embodiments described above, a FlashPix format Flash is used as an image to be processed.
Although the Pix image was used, the present invention is not limited to this. Any image may be used as long as the image has a format that has information that can implement the present invention, such as information of a creator and information that can specify the type of a document.

【００６６】尚、本発明は、複数の機器（例えばホスト
コンピュータ、インタフェース機器、リーダ、プリンタ
など）から構成されるシステムに適用しても、一つの機
器からなる装置（例えば、複写機、ファクシミリ装置な
ど）に適用してもよい。Even if the present invention is applied to a system including a plurality of devices (for example, a host computer, an interface device, a reader, a printer, etc.), a device including one device (for example, a copying machine, a facsimile machine) Etc.).

【００６７】また、本発明の目的は、前述した実施形態
の機能を実現するソフトウェアのプログラムコードを記
録した記憶媒体を、システムあるいは装置に供給し、そ
のシステムあるいは装置のコンピュータ（またはＣＰＵ
やＭＰＵ）が記憶媒体に格納されたプログラムコードを
読出し実行することによっても、達成されることは言う
までもない。Another object of the present invention is to provide a storage medium storing a program code of software for realizing the functions of the above-described embodiments to a system or an apparatus, and to provide a computer (or CPU) of the system or apparatus.
And MPU) read and execute the program code stored in the storage medium.

【００６８】この場合、記憶媒体から読出されたプログ
ラムコード自体が前述した実施形態の機能を実現するこ
とになり、そのプログラムコードを記憶した記憶媒体は
本発明を構成することになる。In this case, the program code itself read from the storage medium implements the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention.

【００６９】プログラムコードを供給するための記憶媒
体としては、例えば、フロッピディスク、ハードディス
ク、光ディスク、光磁気ディスク、ＣＤ−ＲＯＭ、ＣＤ
−Ｒ、磁気テープ、不揮発性のメモリカード、ＲＯＭな
どを用いることができる。As a storage medium for supplying the program code, for example, a floppy disk, hard disk, optical disk, magneto-optical disk, CD-ROM, CD
-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

【００７０】また、コンピュータが読出したプログラム
コードを実行することにより、前述した実施形態の機能
が実現されるだけでなく、そのプログラムコードの指示
に基づき、コンピュータ上で稼働しているＯＳ（オペレ
ーティングシステム）などが実際の処理の一部または全
部を行い、その処理によって前述した実施形態の機能が
実現される場合も含まれることは言うまでもない。When the computer executes the readout program code, not only the functions of the above-described embodiment are realized, but also the OS (Operating System) running on the computer based on the instruction of the program code. ) May perform some or all of the actual processing, and the processing may realize the functions of the above-described embodiments.

【００７１】更に、記憶媒体から読出されたプログラム
コードが、コンピュータに挿入された機能拡張ボードや
コンピュータに接続された機能拡張ユニットに備わるメ
モリに書込まれた後、そのプログラムコードの指示に基
づき、その機能拡張ボードや機能拡張ユニットに備わる
ＣＰＵなどが実際の処理の一部または全部を行い、その
処理によって前述した実施形態の機能が実現される場合
も含まれることは言うまでもない。Further, after the program code read from the storage medium is written into a memory provided in a function expansion board inserted into the computer or a function expansion unit connected to the computer, based on the instructions of the program code, It goes without saying that the CPU included in the function expansion board or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【００７２】[0072]

【発明の効果】以上説明したように、本発明によれば、
文字認識精度、かつトータルスループットを向上するこ
とができる文字認識装置及びその方法、コンピュータ可
読メモリを提供できる。As described above, according to the present invention,
A character recognition device and method capable of improving character recognition accuracy and total throughput, and a computer-readable memory can be provided.

[Brief description of the drawings]

【図１】本発明の実施形態１の文字認識装置に適用可能
な情報処理装置の構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of an information processing device applicable to a character recognition device according to a first embodiment of the present invention.

【図２】本発明の実施形態１の文字認識装置の機能構成
を示す図である。FIG. 2 is a diagram illustrating a functional configuration of the character recognition device according to the first embodiment of the present invention.

【図３】本発明の実施形態１の文字認識制御部で実行さ
れる処理を示すフローチャートであるFIG. 3 is a flowchart illustrating a process executed by a character recognition control unit according to the first embodiment of the present invention.

【図４】本発明の実施形態１の文字認識部で実行される
処理を示すフローチャートである。FIG. 4 is a flowchart illustrating processing executed by a character recognition unit according to the first embodiment of the present invention.

【図５】ＦｌａｓｈＰｉｘフォーマットに定義されてい
るＩｍａｇｅｉｎｆｏ．Ｐｒｏｐｅｒｔｙｓｅｔ
項目を示す図である。FIG. 5 shows an image info. Image defined in the FlashPix format. Property set
It is a figure showing an item.

【図６】本発明の実施形態３の文字認識装置の機能構成
を示す図である。FIG. 6 is a diagram illustrating a functional configuration of a character recognition device according to a third embodiment of the present invention.

【図７】本発明の実施形態３の後処理制御部で実行され
る処理を示すフローチャートである。FIG. 7 is a flowchart illustrating a process executed by a post-processing control unit according to a third embodiment of the present invention.

【図８】本発明の実施形態３で実行される文字認識後処
理部で実行される処理を示すフローチャートである。FIG. 8 is a flowchart illustrating a process executed by a character recognition post-processing unit executed in a third embodiment of the present invention.

【図９】ＦｌａｓｈＰｉｘフォーマットの画像ファイル
の構成を示す図である。FIG. 9 is a diagram illustrating a configuration of a FlashPix format image file.

【図１０】ＦｌａｓｈＰｉｘフォーマットの画像ファイ
ルの構成を示す図である。FIG. 10 is a diagram showing a configuration of a FlashPix format image file.

【図１１】解像度の異なる複数の画像データから構成さ
れる画像ファイルの例を示す図である。FIG. 11 is a diagram showing an example of an image file composed of a plurality of image data having different resolutions.

【図１２】タイル分割を説明するための図である。FIG. 12 is a diagram for explaining tile division.

【図１３】ＩｍａｇｅｃｏｎｔｅｎｔｓＰｒｏｐｅ
ｒｔｙＳｅｔを説明するための図である。FIG. 13: Image contents Prop
It is a figure for explaining rty Set.

【図１４】Ｓｕｂｉｍａｇｅｈｅａｄｅｒを説明する
ための図である。FIG. 14 is a diagram for explaining a Subimage header.

[Explanation of symbols]

１０１ＣＰＵ１０２ＲＯＭ１０３ＲＡＭ１０４入力Ｉ／Ｆ１０５出力Ｉ／Ｆ１０６ＫＢＤＩ／Ｆ１０７メインバス１０８ビデオＩ／Ｆ１０９表示部１１１入力装置１１２出力装置１２３ポインティングデバイス１２４キーボード１２５ペン２００情報処理装置２、１０２画像入力部４、１０４属性情報読取部６文字認識制御部８画像処理部１０、１０８文字認識部１０６後処理制御部１１０文字認識後処理部 101 CPU 102 ROM 103 RAM 104 Input I / F 105 Output I / F 106 KBDI / F 107 Main bus 108 Video I / F 109 Display unit 111 Input device 112 Output device 123 Pointing device 124 Keyboard 125 Pen 200 Information processing device 2, 102 Image input unit 4, 104 Attribute information reading unit 6 Character recognition control unit 8 Image processing unit 10, 108 Character recognition unit 106 Post-processing control unit 110 Character recognition post-processing unit

Claims

[Claims]

1. A character recognition device for performing character recognition on input image data, comprising: a reading unit that reads attribute information of the input image data; and a character recognition device that corresponds to the attribute information read by the reading unit. Determining means for determining a character recognition dictionary to be performed, and character recognition means for performing character recognition on the input image data using the character recognition dictionary determined by the determining means. Recognition device.

2. The character recognition apparatus according to claim 1, wherein the image data is FlashPix format image data.

3. The character recognition device according to claim 1, wherein the attribute information is information indicating a creator of the input image data.

4. The deciding means comprises a storage means for storing a character recognition dictionary for each creator, and referring to the storage means, a character recognition dictionary corresponding to the attribute information read by the reading means. The character recognition device according to claim 3, wherein a dictionary is determined.

5. The apparatus according to claim 1, wherein the attribute information is information indicating a type of the input image data.
The character recognition device according to 1.

6. The image processing apparatus according to claim 1, wherein the determining unit includes a storage unit that stores a character recognition dictionary for each type of the input image data, and stores the attribute information read by the reading unit with reference to the storage unit. The character recognition device according to claim 5, wherein a corresponding character recognition dictionary is determined.

7. A character recognition device for performing character recognition on input image data, comprising: character recognition means for performing character recognition on the input image data; and attribute information of the input image data. Reading means for reading the image data; specifying means for specifying the type of the input image data based on the attribute information read by the reading means; and the character recognizing means based on the type specified by the specifying means. And a post-processing means for performing post-processing on the character recognition result obtained in (1).

8. The character recognition device according to claim 7, wherein the image data is FlashPix format image data.

9. The post-processing unit includes: a storage unit that stores a word collation dictionary for collating with words in character recognition results for each type of image data; Determining means for determining a word matching dictionary corresponding to the type, and performing post-processing on the character recognition result using the word matching dictionary determined by the determining means. 8. The character recognition device according to 7.

10. A character recognition method for performing character recognition on input image data, comprising: a reading step of reading attribute information of the input image data; and a method corresponding to the attribute information read in the reading step. And a character recognition step of performing character recognition on the input image data using the character recognition dictionary determined in the determination step. Recognition method.

11. The method according to claim 11, wherein the image data is a FlashPix.
The character recognition method according to claim 10, wherein the data is image data in a format.

12. The character recognition method according to claim 10, wherein the attribute information is information indicating a creator of the input image data.

13. The character recognition dictionary according to claim 12, wherein the determining step determines a character recognition dictionary corresponding to the attribute information read in the reading step from the character recognition dictionary for each creator. Character recognition method.

14. The character recognition method according to claim 10, wherein the attribute information is information indicating a type of the input image data.

15. The determining step includes a storing step of storing a character recognition dictionary for each type of the input image data, and referring to the storing step, the attribute information read in the reading step is referred to. The character recognition method according to claim 14, wherein a corresponding character recognition dictionary is determined.

16. A character recognition method for performing character recognition on input image data, comprising: a character recognition step of performing character recognition on the input image data; and attribute information of the input image data. A reading step of reading the image data; a specifying step of specifying a type of the input image data based on the attribute information read in the reading step; and a character recognition step based on the type specified in the specifying step. And a post-processing step of performing post-processing on the character recognition result obtained in (1).

17. The method according to claim 17, wherein the image data is a FlashPix.
17. The character recognition method according to claim 16, wherein the data is image data in a format.

18. The post-processing step determines a word matching dictionary corresponding to the type specified in the specifying step from a word matching dictionary that matches a word in a character recognition result for each type of image data. 17. The character recognition method according to claim 16, further comprising a determining step, wherein post-processing is performed on the character recognition result using the word matching dictionary determined in the determining step.

19. A computer readable memory storing a character recognition program code for performing character recognition on input image data, the program code for a reading step of reading attribute information of the input image data, A program code of a determining step of determining a character recognition dictionary corresponding to the attribute information read in the reading step; and using the character recognition dictionary determined in the determining step, a character is input to the input image data. A program code for a character recognition step for performing recognition.

20. A computer readable memory storing a character recognition program code for performing character recognition on input image data, wherein a program for a character recognition step for performing character recognition on the input image data is provided. A program code for a reading step for reading attribute information of the input image data; and a program code for a specifying step for specifying the type of the input image data based on the attribute information read in the reading step. And a program code of a post-processing step of performing post-processing on a character recognition result obtained in the character recognition step based on the type specified in the specifying step.