JPH1069494A

JPH1069494A - Image retrieval method and device therefor

Info

Publication number: JPH1069494A
Application number: JP8228504A
Authority: JP
Inventors: Hirotaka Shiiyama; 弘隆椎山
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1996-08-29
Filing date: 1996-08-29
Publication date: 1998-03-10

Abstract

PROBLEM TO BE SOLVED: To provide an image retrieval method and the device capable of highly accurately retrieving a character string at a low cost and at a high speed. SOLUTION: A retrieval character string is developed to a corresponding character string image and prescribed filtering is performed to the developed character string image (S24). Then, the filtered character string image is segmented, segmented respective parts are character-recognized and plural recognition character candidates including the retrieval character string are acquired (S3). Then, the acquired plural recognition character candidates are combined and possible different recognition character string candidates are generated (S44). By the OR conditions of the recognition character string candidates, character string recognition is performed (S5).

Description

DETAILED DESCRIPTION OF THE INVENTION

【発明の属する技術分野】本発明は、画像検索方法とそ
の装置、特に、所定の文字列を対象画像から検索する画
像検索方法とその装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image search method and apparatus, and more particularly to an image search method and apparatus for searching a target image for a predetermined character string.

【従来の技術】従来、ＯＣＲを行なう場合には、誤認識
対策のため、認識率がある程度低い認識候補までを明示
して、人がそれを手入力で選択、あるいは、訂正してい
た。2. Description of the Related Art Conventionally, when performing OCR, in order to prevent erroneous recognition, recognition candidates with a low recognition rate are specified, and a person manually selects or corrects the candidates.

【発明が解決しようとする課題】しかしながら、この手
作業が必要であるが故に、文字の全自動認識処理化を難
しいものとしていた。これを解決すべく、この作業を機
械的に行なう研究も行われており、単語辞書や知識ベー
スを用いた誤認識訂正の例があるが、これを実現するに
は高い計算能力を有するコンピュータを用いなければ、
実用化に耐えない等の多くの障壁がある。また、認識候
補を１つに絞るために、一旦、誤訂正をしてしまうと元
に戻す手続きが面倒になる問題をはらんでいる。本発明
は、上記従来例に鑑みてなされたもので、高精度の文字
列検索を安価に行うことができる画像検索方法とその装
置を提供することを目的とする。However, the necessity of this manual work makes it difficult to realize full automatic recognition processing of characters. In order to solve this, research has been conducted to perform this work mechanically, and there are examples of misrecognition and correction using word dictionaries and knowledge bases. If not used,
There are many barriers, such as not being practical. Also, in order to narrow down the number of recognition candidates to one, there is a problem that once erroneous correction is performed, a procedure for restoring the recognition candidate becomes troublesome. The present invention has been made in view of the above conventional example, and has as its object to provide an image search method and an image search method capable of performing high-accuracy character string search at low cost.

【課題を解決するための手段】上記目的を達成するた
め、本発明の画像検索方法とその装置は以下の構成を備
える。即ち、検索文字列を対応する文字列イメージに展
開する展開工程と、前記展開工程で展開された文字列イ
メージに所定のフィルタリングを行うフィルタリング工
程と、前記フィルタリング工程でフィルタリングされた
文字列イメージを独立部分にセグメンテーションし、セ
グメンテーションされた各部分の文字認識を行い、認識
文字列候補を獲得する認識工程と、前記認識工程で獲得
された認識文字列候補の違いに基づいて、組み合わせ可
能な別の認識文字列候補を生成する生成工程と、前記生
成工程で生成された前記組み合わせ可能な別の認識文字
列候補と前記認識工程で獲得された認識文字列候補の論
理和条件で所定の文字列の検索を行う文字列検索工程と
を備える。また、別の発明は、検索文字列を対応する文
字列イメージに展開する展開手段と、前記展開手段で展
開された文字列イメージに所定のフィルタリングを行う
フィルタリング手段と、前記フィルタリング手段でフィ
ルタリングされた文字列イメージを独立部分にセグメン
テーションし、セグメンテーションされた各部分の文字
認識を行い、認識文字列候補を獲得する認識手段と、前
記認識手段で獲得された認識文字列候補の違いに基づい
て、組み合わせ可能な別の認識文字列候補を生成する生
成手段と、前記生成手段で生成された前記組み合わせ可
能な別の認識文字列候補と前記認識手段で獲得された認
識文字列候補の論理和条件で所定の文字列の検索を行う
文字列検索手段とを備える。また、別の発明は、コンピ
ュータプログラム製品であって、コンピュータ読み取り
可能なプログラムコード手段を有するコンピュータ使用
可能な媒体を備え、前記コンピュータプログラム製品
は、検索文字列を対応する文字列イメージに展開する、
コンピュータ読み取り可能な第１プログラムコード手段
と、前記第１プログラムコード手段で展開された文字列
イメージに所定のフィルタリングを行う、コンピュータ
読み取り可能な第２プログラムコード手段と、前記第２
プログラムコード手段でフィルタリングされた文字列イ
メージを独立部分にセグメンテーションし、セグメンテ
ーションされた各部分の文字認識を行い、認識文字列候
補を獲得する、コンピュータ読み取り可能な第３プログ
ラムコード手段と、前記第３プログラムコード手段で獲
得された認識文字列候補の違いに基づいて、組み合わせ
可能な別の認識文字列候補を生成する、コンピュータ読
み取り可能な第４プログラムコード手段と、第４プログ
ラムコード手段で生成された前記組み合わせ可能な別の
認識文字列候補と第３プログラムコード手段で獲得され
た認識文字列候補の論理和条件で所定の文字列の検索を
行う第５プログラムコード手段とを備える。In order to achieve the above object, an image search method and apparatus according to the present invention have the following arrangement. That is, an expansion step of expanding the search character string into a corresponding character string image, a filtering step of performing predetermined filtering on the character string image expanded in the expansion step, and a character string image filtered in the filtering step are performed independently. Segmentation into parts, performing character recognition of each of the segmented parts, and a recognition step of obtaining a recognized character string candidate, and another recognition that can be combined based on a difference between the recognized character string candidates obtained in the recognition step. A generating step of generating a character string candidate, and searching for a predetermined character string based on a logical sum condition of the another recognizable character string candidate generated in the generating step and the recognized character string candidate obtained in the recognition step Performing a character string search step. Further, another invention is a development means for developing a search character string into a corresponding character string image, a filtering means for performing a predetermined filtering on the character string image developed by the development means, and a filtering operation performed by the filtering means. Segmenting the character string image into independent parts, performing character recognition on each of the segmented parts, and a recognition unit that obtains a recognized character string candidate, based on the difference between the recognized character string candidates obtained by the recognition unit, combining Generating means for generating another possible recognition character string candidate; and a predetermined logical sum condition of the another combination character string candidate generated by the generation means and the recognition character string candidate obtained by the recognition means. And character string search means for searching for the character string. Another invention is a computer program product, comprising a computer usable medium having computer readable program code means, wherein the computer program product expands a search character string into a corresponding character string image,
Computer-readable first program code means, computer-readable second program code means for performing predetermined filtering on the character string image developed by the first program code means, and the second program code means;
Computer-readable third program code means for segmenting the character string image filtered by the program code means into independent parts, performing character recognition on each of the segmented parts, and obtaining recognized character string candidates; Computer-readable fourth program code means for generating another recognizable character string candidate that can be combined based on the difference between the recognized character string candidates obtained by the program code means, and a fourth program code means generated by the fourth program code means A fifth program code unit that searches for a predetermined character string based on a logical sum condition of the another combination of recognized character strings that can be combined and the recognized character string candidate obtained by the third program code unit.

【発明の実施の形態】以下、本発明の実施の形態の文字
認識／文字列検索処理構成の詳細な説明を図１を参照し
て行う。図１を参照して、本発明の実施の形態の文字認
識／文字列検索処理構成は、情報処理装置８、外部記憶
装置４、キーボード／マウス５、表示装置６、イメージ
スキャナ１を備える。情報処理装置８は、ＣＰＵ７がそ
の全体の制御を行う。ＣＰＵ７は、外部記憶装置４に格
納された各種処理ソフトウエアを予めＲＡＭにロードし
ておき、それを順次読み出し、解釈し、実行する。この
各種処理ソフトウエアには、ＯＣＲ処理ソフトウェア
２、文書検索ソフトウェア３を含む。イメージスキャナ
１は、文書９をスキャンしてイメージ情報を獲得し、情
報処理装置８に転送する。表示装置１は、情報処理装置
８で処理された各種の結果や、キーボード／マウス５か
ら入力されたコマンド／データを表示する。キーボード
／マウス５は、各種コマンドや、検索語・検索条件など
のデータを入力する。外部記憶装置４は、ＲＡＭにロー
ドされてＣＰＵ７によって実行されるＯＣＲ処理ソフト
ウエアと文書検索ソフトウエアの各プログラム（１
０）、その文字認識／文書検索処理のための認識辞書１
５を格納している。また、外部記憶装置４は、イメージ
スキャナ１から取り込まれた入力イメージデータ（１
１）、ＯＣＲソフトウエア２による認識結果のテキスト
情報（１３）、また、文書検索ソフトウエア３の実行に
よる検索用ファイル（１４）を格納する。この検索用フ
ァイル（１４）は、テキスト情報に基づいて作成され
る。また、外部記憶装置４には、キーボード５から入力
する検索指示文字列の各文字に対応する基準イメージパ
タンである辞書イメージデータ（１２）を格納してい
る。図２Ａは、本実施の形態の情報処理装置８での文書
検索のためのインデックス情報作成のための処理手順を
示すフローチャートである。また、図２Ｂは、作成され
たインデックス情報を用いて文字検索を行う文字検索処
理手順を示すフローチャートである。まず、図２Ａを参
照して、文書検索のためのインデックス情報作成のため
の処理手順を説明する。ステップＳ１１では、検索対象
画像をイメージスキャナ１から入力し、外部記憶装置４
に格納された後、情報処理装置８のＲＡＭにロードす
る。ステップＳ１２では、ＲＡＭにロードされた検索対
象画像について、文字認識を行う。そして、最も確から
しい認識文字列を得る。ステップＳ１３では、ステップ
Ｓ１１で入力した入力画像とこれに含まれるステップＳ
１２で認識された文字列と、その入力画像とその認識さ
れた文字列の関係を記述した目次データを検索用ファイ
ル（図１、１４）に格納する。この目次データは例え
ば、「画像」という文字列がどの登録文書画像群にはい
っていたかを検索するためのデータであり、これを用い
ることである文字列を含む文書画像を高速に知ることが
できる。以上で、登録時の処理の説明を終了する。次
に、図２Ｂを参照して、作成されたインデックス情報を
用いて文字検索を行う文字検索処理を説明する。ステッ
プ１からステップＳ４までの処理は、入力した検索文字
列に基づいて、ステップ５での文字列検索処理のための
検索論理和条件を求めるための処理手順を示す。そし
て、ステップＳ５とステップＳ６では、求められた検索
論理和条件に基づいて、入力検索文字列に最も近い文字
列を確定し、図２Ａのフローチャートで生成したインデ
ックス情報を検索して、対応する文書画像を表示する。
尚、ＯＣＲ処理ソフトウエア２は、ステップ１からステ
ップＳ４と、ステップ１１からステップ１２の処理手順
を記述している。また、文書検索ソフトウエア１は、ス
テップ５からステップＳ６の処理手順を記述している。
以下、図２の処理手順を各ステップ毎に説明する。ステ
ップＳ１では、キーボード５から、検索指示文字列を入
力する。ステップＳ２では、入力した検索指示文字列の
各文字に対応する基準のイメージパタンデータを辞書イ
メージデータ１２から抽出して、入力した検索指示文字
列に対応する文字列イメージパターンを生成する。尚、
ここでは、各文字イメージ間隔や画像のぼかし処理のぼ
かし量などの画像処理パラメータは、所定の値を用いた
り、キーボード５から入力することで決定する。ステッ
プＳ３では、ステップＳ２で生成された文字列イメージ
を各文字に分割処理を行う。そして、分割された文字単
位に、所定の特徴の特徴量を抽出し、その特徴量に基づ
いて、認識辞書を検索し、認識候補を抽出する。尚、こ
のステップでは、入力した検索指示文字列を少なくとも
認識候補の１つとして含ませる。尚、ここで、特徴の種
類は、例えば、方向特徴や濃淡特徴などが考えられる
が、本発明は、この特徴に制限されるものではない。ス
テップＳ４は、ステップＳ３で生成された認識候補に基
づいて、考えられる各文字を組み合わせて、認識候補と
しての複数の文字列を生成する。そして、ステップＳ５
での検索処理のための、文字列のＯＲ条件を生成する。
以上の処理が、ＯＣＲソフトウエア２での処理である。
次に、文書検索ソフトウエア３での処理であるステップ
Ｓ５、Ｓ６を説明する。ステップＳ５では、ステップＳ
４で生成された文字列のＯＲ条件で、それら文字列に近
い文字列をステップＳ１３までの処理で生成された、文
書画像とそれに含まれる文字列を含むインデックス情報
のうちの文字列からサーチする。そして、その検索結果
（文書画像、文字列など）を表示装置６に表示する。例
えば、入力した検索対象画像とその検索結果に対応する
文字列の位置にアンダーラインを引くなどで、検索され
た位置を表示する。次に、図３を参照して、ステップＳ
１からステップＳ４での検索ＯＲ条件生成処理でのポイ
ントを具体的に説明する。まず、キーボード５から、検
索指示文字列として、例えば、 "マルチ" を入力する（参照番号１０００）。次に、この検索指示
文字列の各文字に対応するイメージパタンを辞書イメー
ジデータから取り込み、その文字列のイメージパタンを
生成する。ここで生成する文字列イメージパタンを生成
するための複数のパラメータが用意されており、それら
を、辞書イメージデータから取り込んだイメージパタン
に作用させて、文字列イメージパタンを生成する。その
パラメータには、各文字イメージ間隔や画像のぼかし処
理のぼかし量などの画像処理パラメータを含む。これら
のパラメータには、外部記憶装置４に予め格納された所
定の値を用いたり、キーボード５から入力することで決
定する。次に、ステップＳ３で、ＲＡＭに格納された文
書イメージの一連の文字認識処理（１００２）を開始す
る。そして、“ル”の位置に関して、文字領域の切り出
し、続いて、切り出された領域での文字認識をおこなっ
たところ、 “ノ”、“レ” の２文字として、認識されたとする。“ル”の様な文字
に関しては、文字の切り出しエラーが起き易く、例え
ば、“ル”を“ノ”“レ”に誤認識し易い。この認識処
理で、第１の認識候補：マノレチまた、第２の認識候補：アノＬチの認識候補（１００３）が得られたとする。次に、ステ
ップＳ４では、得られた第１と第２の認識文字を比較し
て、異なっている文字部分を検出し、その文字部分を入
れ替えることにより、別の認識候補文字列を生成する。
上述の例では、第１と第３文字の２文字が異なっている
ため、可能な文字列の組み合わせは４通りとなり、上述
の２つの候補プラス２つの文字列の組み合わせが可能で
ある。即ち、ａ．マノレチｂ．アノＬチｃ．アノレチｄ．アノＬチである。これら４通りの文字列（１００４）は、期待の
文字列が「マルチ」であったので、いずれも誤認識文字
列候補となる。しかし、文字認識の過程では、これらの
認識候補が選択されることはさけられなく、正しい認識
は難しくなる。そこで、本発明の実施の形態では、逆の
発想で、文字認識の過程で選択される可能性のある誤認
識候補文字列も、期待認識候補である「マルチ」に対す
る認識候補として認め、これら誤認識候補文字列と期待
認識候補の論理和条件を取る（１００５）ことによっ
て、総合的な認識を行う方法を提供する。従って、通常
の全文検索では、「マルチ」のような他の文字分離の可
能性がある検索指定文字列に対しては、文書画像の検索
で誤検索を起こしやすかったが、上述の誤認識候補文字
列を含む総合的な認識を行うことにより、「マルチ」の
文字列を認識結果として選択できる。ステップＳ４まで
の処理で、入力文字列に対応する検索ＯＲ条件が生成さ
れた。この検索ＯＲ条件を満足する文字列をステップＳ
１３までの処理で生成された、文書画像とそれに含まれ
る文字列を含むインデックス情報のうちの文字列からサ
ーチし、その検索結果（文書画像、文字列など）を表示
装置６に表示することができる。次に、図４を参照し
て、ステップＳ２の詳細な処理手順を説明する。ステッ
プＳ２０では、入力した検索指示文字列に対応する文字
列イメージパターンを生成するための１つのパラメータ
として、曖昧度を設定する。この曖昧度は、外部記憶装
置に予め格納されている曖昧度のディフォルト値でもよ
いし、また、キーボード５から入力してもよい。ここ
で、この曖昧度は、検索対象の文書画像の解像度が悪か
ったり、ノイズをおおく含んでいる場合に予想される文
字認識処理の精度の低下を助けるために導入する。即
ち、検索対象の文書画像の質が悪い時は、曖昧度が高い
とし、質がよい場合は、曖昧度が低いとする。ユーザ
は、検索対象の文書画像の質をおおよそ判断して、対応
する曖昧度を設定する。設定された曖昧度に応じて、入
力した検索指示文字列の各文字に対応する文字イメージ
パターン（フォント）のサイズや文字ピッチサイズを設
定する。この設定は、曖昧度が大きくなれば、文字フォ
ントを小さくし、また、文字ピッチを狭める。また、曖
昧度が小さくなれば、文字フォントを大きくし、文字ピ
ッチを広くする。次に、ステップＳ２１では、入力した
検索指示文字列の各文字に対応する基準のイメージパタ
ンデータを辞書イメージデータ１２から抽出して、ステ
ップＳ２０で設定された文字フォントサイズと文字ピッ
チに対応する文字列パターンを生成する。この場合、基
準のイメージパタンデータ（フォント）は、認識が不得
意な文字フォントを用いる。ステップＳ２２では、ステ
ップＳ２１で生成した文字列イメージパターンに対し
て、曖昧度に対応するノイズを重畳する。即ち、曖昧度
が大きければ、強いノイズとし、逆に、曖昧度が小さけ
れば、弱いノイズとする。このノイズの強度は、例え
ば、強いノイズとしては、多くのノイズドットを用い
て、文字列イメージパターンに対してランダムに重畳す
る。逆に、弱いノイズとしては、少ないノイズドットを
用いて、文字列イメージパターンに対してランダムに重
畳することで、文字列イメージパターンに対するノイズ
のレベルを調整可能である。ステップＳ２３では、ステ
ップＳ２２で処理された文字列イメージパターンに対し
て、ぼかし処理を施す。このぼかし量も、ステップＳ２
２と同様、曖昧度が大きいときは、ぼかし量を大きく、
逆に、曖昧度が小さい時は、ぼかし量を小さくすればよ
い。以上の手順で、ステップＳ２での、入力した検索指
示文字列の各文字に対応する文字列イメージパターンを
生成する処理が実現可能となる。以上説明したように、
本発明の実施の形態では、検索文字列、例えば、「マル
チメ」をＯＣＲが誤認識を起こし易い状態になるように
ビットマップ展開し、これをＯＣＲすることにより、
「マノレチ」という文字列が誤認識傾向文字列群の中の
一つに得られ、検索指定文字列とこれらの誤認識傾向文
字列群をＯＲの関係で、入力文書画像を検索することに
より、検索性能を上げることができる。尚、本発明の実
施の形態では、先に示した文書画像検索以外にも、デー
タベースを用いた検索にも応用できる。更に、ＯＭＲや
バーコードによる光学読み取り装置の一部情報の認識尤
度が低い場合への応用も同様の手法で行なえる。本発明
に係る実施の形態では、ＯＣＲ固有の癖を、検索指定文
字列をビットマップ展開しこれをＯＣＲすることによっ
て得たが、誤認識を行ない易い文字、文字列の正解−誤
認識テーブルを用いて、ＯＣＲ固有の癖を誤認識傾向文
字列に反映させてもよいことは言うまでもない。なお、
本発明は、複数の機器（例えばホストコンピュータ，イ
ンタフェイス機器，リーダ，プリンタなど）から構成さ
れるシステムに適用しても、一つの機器からなる装置
（例えば、複写機，ファクシミリ装置など）に適用して
もよい。また、本発明の目的は、前述した実施形態の機
能を実現するソフトウェアのプログラムコードを記録し
た記憶媒体を、システムあるいは装置に供給し、そのシ
ステムあるいは装置のコンピュータ（または、ＣＰＵや
ＭＰＵ）が記憶媒体に格納されたプログラムコードを読
出し実行することによっても、達成されることは言うま
でもない。この場合、記憶媒体から読出されたプログラ
ムコード自体が前述した実施形態の機能を実現すること
になり、そのプログラムコードを記憶した記憶媒体は本
発明を構成することになる。プログラムコードを供給す
るための記憶媒体としては、例えば、フロッピディス
ク，ハードディスク，光ディスク，光磁気ディスク，Ｃ
Ｄ−ＲＯＭ，ＣＤ−Ｒ，磁気テープ，不揮発性のメモリ
カード，ＲＯＭなどを用いることができる。また、コン
ピュータが読出したプログラムコードを実行することに
より、前述した実施形態の機能が実現されるだけでな
く、そのプログラムコードの指示に基づき、コンピュー
タ上で稼働しているＯＳ（オペレーティングシステム）
などが実際の処理の一部または全部を行い、その処理に
よって前述した実施形態の機能が実現される場合も含ま
れることは言うまでもない。さらに、記憶媒体から読出
されたプログラムコードが、コンピュータに挿入された
機能拡張ボードやコンピュータに接続された機能拡張ユ
ニットに備わるメモリに書込まれた後、そのプログラム
コードの指示に基づき、その機能拡張ボードや機能拡張
ユニットに備わるＣＰＵなどが実際の処理の一部または
全部を行い、その処理によって前述した実施形態の機能
が実現される場合も含まれることは言うまでもない。本
発明を上記記憶媒体に適用する場合、その記憶媒体に
は、先に説明したフローチャートに対応するプログラム
コードを格納することになるが、簡単に説明すると、図
５のメモリマップ例に示す各モジュールを記憶媒体に格
納することになる。すなわち、少なくとも、検索文字列
を入力する「検索文字列入力モジュール」、入力した検
索文字列に基づいて、検索ＯＲ条件を生成する「検索Ｏ
Ｒ条件生成モジュール」、検索対象文書画像を入力する
「検索対象画像入力モジュール」、検索文字列に基づい
て生成された文字列イメージパタンと入力した検索対象
文書画像の文字認識を行う「文字認識モジュール」、生
成された検索ＯＲ条件を満足する文字列を検索対象文書
画像の文字列から獲得する「文字列検索モジュール」、
検索結果を表示する「表示モジュール」の各モジュール
のプログラムコードを記憶媒体に格納すればよい。以上
説明したように、本発明の実施の形態によれば、ＯＣＲ
の誤認識が系統的な場合に、検索指定文字列の文字並び
に即した誤認識の傾向を表す文字列群を得て、これを、
本来の検索指定文字列とＯＲの関係にして検索すること
により、ＯＣＲの認識率が低い場合にでも、検索漏れの
少ない文書画像検索が行なえる。また、前もって誤認識
を行ない易い文字、文字列の正解−誤認識テーブルを作
成する必要が無く、また、検索指定文字列の文字の並び
に即した誤認識傾向が得られるため、個々の文字に対し
て、正解−誤認識テーブルを適応する場合よりも精度の
高い誤認識傾向が得られる。本実施の形態は、検索指定
語に関する技術拡張であるため、従来から存在するＯＣ
Ｒを用いた文書画像検索システムに対して、検索条件入
力段の前に本発明に係る実施の形態を適応するだけで、
ＯＣＲの誤認識に強い検索が可能となる。これらのメリ
ットにより、人手によるＯＣＲ誤認識訂正作業をなく
し、文書画像入力からＯＣＲを経てイメージデータ・テ
キストデータ蓄積まで無人で行なえるシステムを構築で
き、且つ、コンピュータに対し負荷を掛けず、処理速度
の低下の心配が無く、比較的安価なシステムを実現でき
る。 [第２の実施の形態]上述の検索ＯＲ条件に基づいて全体
として確からしい文字列を検索する一例を示したが、こ
れに限定されることはなく、検索ＯＲ条件のいづれか１
つを満足する文字列を検索して、その文字列が含まれる
文書画像を検索結果としてもよい。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A detailed description will be given of a character recognition / character string search processing configuration according to an embodiment of the present invention with reference to FIG. Referring to FIG. 1, the character recognition / character string search processing configuration according to the embodiment of the present invention includes an information processing device 8, an external storage device 4, a keyboard / mouse 5, a display device 6, and an image scanner 1. The CPU 7 controls the entire information processing device 8. The CPU 7 loads various kinds of processing software stored in the external storage device 4 into the RAM in advance, sequentially reads, interprets, and executes the software. The various kinds of processing software include OCR processing software 2 and document search software 3. The image scanner 1 obtains image information by scanning the document 9 and transfers the image information to the information processing device 8. The display device 1 displays various results processed by the information processing device 8 and commands / data input from the keyboard / mouse 5. The keyboard / mouse 5 inputs various commands and data such as search words and search conditions. The external storage device 4 includes programs (1) of OCR processing software and document search software which are loaded into the RAM and executed by the CPU 7.
0), a recognition dictionary 1 for character recognition / document search processing
5 is stored. Further, the external storage device 4 stores the input image data (1
1) The text information (13) of the recognition result by the OCR software 2 and a search file (14) by executing the document search software 3 are stored. This search file (14) is created based on text information. The external storage device 4 stores dictionary image data (12), which is a reference image pattern corresponding to each character of the search instruction character string input from the keyboard 5. FIG. 2A is a flowchart showing a processing procedure for creating index information for document search in information processing apparatus 8 of the present embodiment. FIG. 2B is a flowchart illustrating a character search processing procedure for performing a character search using the created index information. First, a processing procedure for creating index information for document search will be described with reference to FIG. 2A. In step S11, a search target image is input from the image scanner 1, and the external storage device 4
Is loaded into the RAM of the information processing device 8. In step S12, character recognition is performed on the search target image loaded in the RAM. Then, the most likely recognized character string is obtained. In step S13, the input image input in step S11 and the step S
Then, the character string recognized in step 12 and the index data describing the relationship between the input image and the recognized character string are stored in the search file (FIGS. 1, 14). This table of contents data is, for example, data for searching in which registered document image group a character string "image" was entered, and a document image including the character string using this can be quickly known. . This is the end of the description of the processing at the time of registration. Next, a character search process for performing a character search using the created index information will be described with reference to FIG. 2B. The processing from step 1 to step S4 shows a processing procedure for obtaining a search logical sum condition for the character string search processing in step 5 based on the input search character string. In steps S5 and S6, a character string closest to the input search character string is determined based on the obtained search logical OR condition, and the index information generated in the flowchart of FIG. Display an image.
The OCR processing software 2 describes the processing procedure from step 1 to step S4 and from step 11 to step 12. Further, the document search software 1 describes the processing procedure from step 5 to step S6.
Hereinafter, the processing procedure of FIG. 2 will be described for each step. In step S1, a search instruction character string is input from the keyboard 5. In step S2, reference image pattern data corresponding to each character of the input search instruction character string is extracted from the dictionary image data 12, and a character string image pattern corresponding to the input search instruction character string is generated. still,
Here, image processing parameters such as each character image interval and the amount of blurring of the image blurring process are determined by using a predetermined value or by inputting from the keyboard 5. In step S3, the character string image generated in step S2 is divided into individual characters. Then, a feature amount of a predetermined feature is extracted for each divided character, and a recognition dictionary is searched based on the feature amount to extract a recognition candidate. In this step, the input search instruction character string is included as at least one of the recognition candidates. Here, the type of the feature may be, for example, a directional feature or a shading feature, but the present invention is not limited to this feature. In step S4, based on the recognition candidates generated in step S3, possible characters are combined to generate a plurality of character strings as recognition candidates. Then, step S5
Generates an OR condition of a character string for the search processing in.
The above processing is processing in the OCR software 2.
Next, steps S5 and S6 which are processes in the document search software 3 will be described. In step S5, step S5
In the OR condition of the character strings generated in step 4, a character string close to those character strings is searched for from the character string in the index information including the document image and the character string included in the document image generated in the processing up to step S13. . Then, the search results (document images, character strings, etc.) are displayed on the display device 6. For example, the searched position is displayed by underlining the position of the input search target image and the character string corresponding to the search result. Next, referring to FIG.
The points in the search OR condition generation processing from 1 to S4 will be specifically described. First, for example, "multi" is input as a search instruction character string from the keyboard 5 (reference numeral 1000). Next, an image pattern corresponding to each character of the search instruction character string is fetched from the dictionary image data, and an image pattern of the character string is generated. A plurality of parameters for generating a character string image pattern to be generated here are prepared. These parameters are applied to an image pattern taken from dictionary image data to generate a character string image pattern. The parameters include image processing parameters such as a character image interval and a blur amount of the image blur processing. These parameters are determined by using predetermined values stored in the external storage device 4 in advance or by inputting from the keyboard 5. Next, in step S3, a series of character recognition processing (1002) of the document image stored in the RAM is started. Then, it is assumed that a character area is cut out with respect to the position of “R”, and then character recognition is performed in the cut out area, and the character is recognized as two characters “NO” and “RE”. With respect to a character such as "ru", a character cutout error is likely to occur, and for example, "ru" is likely to be erroneously recognized as "no" or "re". In this recognition process, it is assumed that a recognition candidate (1003) of a first recognition candidate: Manorechi and a second recognition candidate: Anoruchi are obtained. Next, in step S4, the obtained first and second recognition characters are compared, a different character portion is detected, and another character candidate character string is generated by replacing the character portion.
In the above example, since the first and third characters are different from each other, there are four possible combinations of character strings, and the combination of the above two candidates plus two character strings is possible. That is, a. Manoreti b. Ano Lchi c. Anoreti d. Ano Lchi. These four types of character strings (1004) are erroneously recognized character string candidates because the expected character string is “multi”. However, in the process of character recognition, it cannot be avoided that these recognition candidates are selected, and correct recognition becomes difficult. Therefore, in the embodiment of the present invention, with the reverse idea, a misrecognition candidate character string that may be selected in the character recognition process is also recognized as a recognition candidate for the expected recognition candidate “multi,” and these misrecognition candidates are recognized. By taking the logical OR condition of the recognition candidate character string and the expected recognition candidate (1005), a method of performing comprehensive recognition is provided. Therefore, in the ordinary full-text search, for a search designated character string such as “multi” which may possibly separate characters, an erroneous search is likely to occur in the search for a document image. By performing comprehensive recognition including a character string, a “multi” character string can be selected as a recognition result. In the processing up to step S4, a search OR condition corresponding to the input character string has been generated. A character string that satisfies the search OR condition is set
It is possible to search from the character string of the index information including the document image and the character string included in the document image generated by the processing up to 13 and display the search result (document image, character string, etc.) on the display device 6. it can. Next, a detailed processing procedure of step S2 will be described with reference to FIG. In step S20, the degree of ambiguity is set as one parameter for generating a character string image pattern corresponding to the input search instruction character string. This ambiguity may be a default value of the ambiguity stored in the external storage device in advance, or may be input from the keyboard 5. Here, this degree of ambiguity is introduced to help lower the accuracy of the character recognition process that is expected when the resolution of the document image to be searched is low or contains a lot of noise. That is, when the quality of the document image to be searched is poor, the degree of ambiguity is determined to be high, and when the quality is good, the degree of ambiguity is determined to be low. The user roughly determines the quality of the document image to be searched and sets the corresponding ambiguity. The size of the character image pattern (font) and the character pitch size corresponding to each character of the input search instruction character string are set according to the set ambiguity. This setting reduces the character font and the character pitch when the degree of ambiguity increases. When the degree of ambiguity is reduced, the character font is increased and the character pitch is increased. Next, in step S21, reference image pattern data corresponding to each character of the input search instruction character string is extracted from the dictionary image data 12, and a character corresponding to the character font size and character pitch set in step S20 is extracted. Generate a column pattern. In this case, as the reference image pattern data (font), a character font that is not good at recognition is used. In step S22, noise corresponding to the degree of ambiguity is superimposed on the character string image pattern generated in step S21. That is, if the degree of ambiguity is large, the noise is determined to be strong, and if the degree of ambiguity is small, the noise is determined to be weak. The intensity of this noise is, for example, superimposed randomly on a character string image pattern using many noise dots as strong noise. Conversely, as weak noise, the noise level for the character string image pattern can be adjusted by randomly superimposing the character string image pattern using a small number of noise dots. In step S23, a blurring process is performed on the character string image pattern processed in step S22. This blur amount is also determined in step S2.
Similar to 2, when the degree of ambiguity is large, the blur amount is large,
Conversely, when the degree of ambiguity is small, the blurring amount may be reduced. With the above procedure, the process of generating the character string image pattern corresponding to each character of the input search instruction character string in step S2 can be realized. As explained above,
In the embodiment of the present invention, a search character string, for example, “multi-me” is bitmap-developed so that the OCR is likely to cause erroneous recognition.
A character string “Manorechi” is obtained as one of the misrecognition tendency character strings, and the input document image is searched by ORing the search designated character string and these misrecognition tendency character strings in an OR relationship. Search performance can be improved. In the embodiment of the present invention, in addition to the document image search described above, the present invention can be applied to a search using a database. Further, application to a case where the likelihood of recognition of some information of the optical reading device by the OMR or the barcode is low can be performed by the same method. In the embodiment according to the present invention, the peculiarity of the OCR is obtained by bitmap-expanding the search designated character string and performing the OCR. It goes without saying that the characteristic peculiar to the OCR may be used to reflect the character string that tends to be misrecognized. In addition,
The present invention can be applied to a system including a plurality of devices (for example, a host computer, an interface device, a reader, a printer, etc.) but also to an apparatus (for example, a copying machine, a facsimile device, etc.) including one device. May be. Further, an object of the present invention is to provide a storage medium storing a program code of software for realizing the functions of the above-described embodiments to a system or an apparatus, and a computer (or CPU or MPU) of the system or apparatus to store the storage medium. It goes without saying that the present invention is also achieved by reading and executing the program code stored in the medium. In this case, the program code itself read from the storage medium implements the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention. Examples of a storage medium for supplying the program code include a floppy disk, hard disk, optical disk, magneto-optical disk, and C
A D-ROM, a CD-R, a magnetic tape, a nonvolatile memory card, a ROM, and the like can be used. When the computer executes the readout program code, not only the functions of the above-described embodiments are realized, but also an OS (Operating System) running on the computer based on the instruction of the program code.
It goes without saying that a case where the functions of the above-described embodiments are implemented by performing some or all of the actual processing, and the processing performs the functions of the above-described embodiments. Further, after the program code read from the storage medium is written into a memory provided on a function expansion board inserted into the computer or a function expansion unit connected to the computer, the function expansion is performed based on the instruction of the program code. It goes without saying that a CPU or the like provided in the board or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments. When the present invention is applied to the storage medium, the storage medium stores program codes corresponding to the above-described flowcharts. However, in brief, each module shown in the memory map example of FIG. Is stored in the storage medium. That is, at least a “search character string input module” for inputting a search character string and a “search O for generating a search OR condition based on the input search character string”
"R condition generation module", "search target image input module" for inputting a search target document image, "character recognition module for performing character recognition of a character string image pattern generated based on a search character string and input search target document image""A character string search module for acquiring a character string that satisfies the generated search OR condition from the character string of the search target document image,
What is necessary is just to store the program code of each module of the “display module” for displaying the search result in the storage medium. As described above, according to the embodiment of the present invention, the OCR
If the misrecognition of is systematic, obtain the characters of the search specified character string and a group of character strings indicating the tendency of misrecognition in accordance with this,
By performing the search in the relationship between the original search designation character string and the OR, even when the OCR recognition rate is low, a document image search with few omissions in search can be performed. In addition, there is no need to create a correct answer-misrecognition table for characters and character strings that are likely to be misrecognized in advance, and a tendency of misrecognition that matches the characters of the search designated character string is obtained. As a result, a more accurate misrecognition tendency can be obtained than when the correct-misrecognition table is applied. Since the present embodiment is a technical extension related to a search designation word, a conventionally existing OC
For a document image search system using R, simply applying the embodiment according to the present invention before the search condition input stage,
A search that is resistant to OCR misrecognition becomes possible. These advantages eliminate the need for manual OCR misrecognition and correction work, enable the construction of an unmanned system from input of document images to storage of image data and text data via OCR, and without imposing a load on the computer and processing speed. A relatively inexpensive system can be realized without fear of a decrease. [Second Embodiment] An example of searching for a likely character string as a whole based on the above-described search OR condition has been described. However, the present invention is not limited to this, and any one of the search OR conditions may be used.
A character string that satisfies the above conditions may be searched, and a document image including the character string may be used as a search result.

【発明の効果】以上説明したように本発明によれば、高
精度の文字列検索を高速、かつ、安価に行うことができ
る。As described above, according to the present invention, a high-precision character string search can be performed at high speed and at low cost.

[Brief description of the drawings]

【図１】本発明の実施の形態のハードウエア構成図であ
る。FIG. 1 is a hardware configuration diagram of an embodiment of the present invention.

【図２Ａ】本発明の実施の形態の文書画像とそれに含ま
れる文字列情報を獲得する処理手順を示すフローチャー
トである。FIG. 2A is a flowchart illustrating a processing procedure for acquiring a document image and character string information included in the document image according to the embodiment of this invention.

【図２Ｂ】本発明の実施の形態の文字列検索処理手順を
示すフローチャートである。FIG. 2B is a flowchart illustrating a character string search processing procedure according to the embodiment of this invention.

【図３】本発明の実施の形態の検索ＯＲ条件生成工程を
説明するための図である。FIG. 3 is a diagram for explaining a search OR condition generation step according to the embodiment of this invention.

【図４】ステップＳ２の詳細な処理手順を示したフロー
チャートである。FIG. 4 is a flowchart showing a detailed processing procedure of step S2.

【図５】コンピュータ読み取り可能な所定の記録媒体に
格納された各プログラムモジュールのレイアウトを示す
図である。FIG. 5 is a diagram showing a layout of each program module stored in a predetermined computer-readable recording medium.

[Explanation of symbols]

１イメージスキャナ２ＲＡＭ（ＯＣＲ処理ソフトウェア）３ＲＡＭ（文書検索ソフトウェア）４外部記憶装置５キーボード６表示装置７ＣＰＵ DESCRIPTION OF SYMBOLS 1 Image scanner 2 RAM (OCR processing software) 3 RAM (document search software) 4 External storage device 5 Keyboard 6 Display device 7 CPU

─────────────────────────────────────────────────────
────────────────────────────────────────────────── ───

【手続補正書】[Procedure amendment]

【提出日】平成８年９月９日[Submission date] September 9, 1996

【手続補正１】[Procedure amendment 1]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】発明の詳細な説明[Correction target item name] Detailed description of the invention

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【０００２】[0002]

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、この手
作業が必要であるが故に、文字の全自動認識処理化を難
しいものとしていた。これを解決すべく、この作業を機
械的に行なう研究も行われており、単語辞書や知識ベー
スを用いた誤認識訂正の例があるが、これを実現するに
は高い計算能力を有するコンピュータを用いなければ、
実用化に耐えない等の多くの障壁がある。However, the necessity of this manual work makes it difficult to realize full automatic recognition processing of characters. In order to solve this, research has been conducted to perform this work mechanically, and there are examples of misrecognition and correction using word dictionaries and knowledge bases. If not used,
There are many barriers, such as not being practical.

【０００４】また、認識候補を１つに絞るために、一
旦、誤訂正をしてしまうと元に戻す手続きが面倒になる
問題をはらんでいる。本発明は、上記従来例に鑑みてな
されたもので、高精度の文字列検索を安価に行うことが
できる画像検索方法とその装置を提供することを目的と
する。[0004] Further, in order to narrow down the number of recognition candidates to one, there is a problem that once erroneous correction is performed, a procedure for restoring the erroneous correction becomes troublesome. The present invention has been made in view of the above conventional example, and has as its object to provide an image search method and an image search method capable of performing high-accuracy character string search at low cost.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するた
め、本発明の画像検索方法とその装置は以下の構成を備
える。即ち、検索文字列を対応する文字列イメージに展
開する展開工程と、前記展開工程で展開された文字列イ
メージに所定のフィルタリングを行うフィルタリング工
程と、前記フィルタリング工程でフィルタリングされた
文字列イメージを独立部分にセグメンテーションし、セ
グメンテーションされた各部分の文字認識を行い、認識
文字列候補を獲得する認識工程と、前記認識工程で獲得
された認識文字列候補の違いに基づいて、組み合わせ可
能な別の認識文字列候補を生成する生成工程と、前記生
成工程で生成された前記組み合わせ可能な別の認識文字
列候補と前記認識工程で獲得された認識文字列候補の論
理和条件で所定の文字列の検索を行う文字列検索工程と
を備える。In order to achieve the above object, an image search method and apparatus according to the present invention have the following arrangement. That is, an expansion step of expanding the search character string into a corresponding character string image, a filtering step of performing predetermined filtering on the character string image expanded in the expansion step, and a character string image filtered in the filtering step are performed independently. Segmentation into parts, performing character recognition of each of the segmented parts, and a recognition step of obtaining a recognized character string candidate, and another recognition that can be combined based on a difference between the recognized character string candidates obtained in the recognition step. A generating step of generating a character string candidate, and searching for a predetermined character string based on a logical sum condition of the another recognizable character string candidate generated in the generating step and the recognized character string candidate obtained in the recognition step Performing a character string search step.

【０００６】また、別の発明は、検索文字列を対応する
文字列イメージに展開する展開手段と、前記展開手段で
展開された文字列イメージに所定のフィルタリングを行
うフィルタリング手段と、前記フィルタリング手段でフ
ィルタリングされた文字列イメージを独立部分にセグメ
ンテーションし、セグメンテーションされた各部分の文
字認識を行い、認識文字列候補を獲得する認識手段と、
前記認識手段で獲得された認識文字列候補の違いに基づ
いて、組み合わせ可能な別の認識文字列候補を生成する
生成手段と、前記生成手段で生成された前記組み合わせ
可能な別の認識文字列候補と前記認識手段で獲得された
認識文字列候補の論理和条件で所定の文字列の検索を行
う文字列検索手段とを備える。Further, another invention is a developing means for developing a search character string into a corresponding character string image, a filtering means for performing a predetermined filtering on the character string image developed by the developing means, and a filtering means. Recognition means for segmenting the filtered character string image into independent parts, performing character recognition on each of the segmented parts, and obtaining recognized character string candidates;
Generating means for generating another recognizable character string candidate that can be combined based on the difference between the recognized character string candidates obtained by the recognizing means, and another recognizable character string candidate that can be generated by the generating means And a character string search means for searching for a predetermined character string based on a logical OR condition of the recognized character string candidates obtained by the recognition means.

【０００７】また、別の発明は、コンピュータプログラ
ム製品であって、コンピュータ読み取り可能なプログラ
ムコード手段を有するコンピュータ使用可能な媒体を備
え、前記コンピュータプログラム製品は、検索文字列を
対応する文字列イメージに展開する、コンピュータ読み
取り可能な第１プログラムコード手段と、前記第１プロ
グラムコード手段で展開された文字列イメージに所定の
フィルタリングを行う、コンピュータ読み取り可能な第
２プログラムコード手段と、前記第２プログラムコード
手段でフィルタリングされた文字列イメージを独立部分
にセグメンテーションし、セグメンテーションされた各
部分の文字認識を行い、認識文字列候補を獲得する、コ
ンピュータ読み取り可能な第３プログラムコード手段
と、前記第３プログラムコード手段で獲得された認識文
字列候補の違いに基づいて、組み合わせ可能な別の認識
文字列候補を生成する、コンピュータ読み取り可能な第
４プログラムコード手段と、第４プログラムコード手段
で生成された前記組み合わせ可能な別の認識文字列候補
と第３プログラムコード手段で獲得された認識文字列候
補の論理和条件で所定の文字列の検索を行う第５プログ
ラムコード手段とを備える。Another aspect of the invention is a computer program product comprising a computer usable medium having computer readable program code means, wherein the computer program product converts a search character string into a corresponding character string image. Computer-readable first program code means to be expanded, computer-readable second program code means to perform predetermined filtering on the character string image expanded by the first program code means, and the second program code Computer-readable third program code means for segmenting the character string image filtered by the means into independent parts, performing character recognition on each of the segmented parts, and obtaining recognized character string candidates, and the third program. Computer-readable fourth program code means for generating another recognizable character string candidate that can be combined based on the difference between the recognized character string candidates obtained by the program code means, and the fourth program code means generated by the fourth program code means. A fifth program code unit for searching for a predetermined character string based on a logical sum condition of another combination of recognized character strings that can be combined with the recognized character string candidate obtained by the third program code unit.

【０００８】[0008]

【発明の実施の形態】以下、本発明の実施の形態の文字
認識／文字列検索処理構成の詳細な説明を図１を参照し
て行う。図１を参照して、本発明の実施の形態の文字認
識／文字列検索処理構成は、情報処理装置８、外部記憶
装置４、キーボード／マウス５、表示装置６、イメージ
スキャナ１を備える。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A detailed description will be given of a character recognition / character string search processing configuration according to an embodiment of the present invention with reference to FIG. Referring to FIG. 1, the character recognition / character string search processing configuration according to the embodiment of the present invention includes an information processing device 8, an external storage device 4, a keyboard / mouse 5, a display device 6, and an image scanner 1.

【０００９】情報処理装置８は、ＣＰＵ７がその全体の
制御を行う。ＣＰＵ７は、外部記憶装置４に格納された
各種処理ソフトウエアを予めＲＡＭにロードしておき、
それを順次読み出し、解釈し、実行する。この各種処理
ソフトウエアには、ＯＣＲ処理ソフトウェア２、文書検
索ソフトウェア３を含む。イメージスキャナ１は、文書
９をスキャンしてイメージ情報を獲得し、情報処理装置
８に転送する。The CPU 7 controls the entire information processing apparatus 8. The CPU 7 loads various processing software stored in the external storage device 4 into the RAM in advance,
It is read, interpreted and executed sequentially. The various kinds of processing software include OCR processing software 2 and document search software 3. The image scanner 1 obtains image information by scanning the document 9 and transfers the image information to the information processing device 8.

【００１０】表示装置１は、情報処理装置８で処理され
た各種の結果や、キーボード／マウス５から入力された
コマンド／データを表示する。キーボード／マウス５
は、各種コマンドや、検索語・検索条件などのデータを
入力する。外部記憶装置４は、ＲＡＭにロードされてＣ
ＰＵ７によって実行されるＯＣＲ処理ソフトウエアと文
書検索ソフトウエアの各プログラム（１０）、その文字
認識／文書検索処理のための認識辞書１５を格納してい
る。The display device 1 displays various results processed by the information processing device 8 and commands / data input from the keyboard / mouse 5. Keyboard / Mouse 5
Inputs various commands and data such as search terms and search conditions. The external storage device 4 is loaded into the RAM and
Each program (10) of OCR processing software and document search software executed by the PU 7 and a recognition dictionary 15 for character recognition / document search processing are stored.

【００１１】また、外部記憶装置４は、イメージスキャ
ナ１から取り込まれた入力イメージデータ（１１）、Ｏ
ＣＲソフトウエア２による認識結果のテキスト情報（１
３）、また、文書検索ソフトウエア３の実行による検索
用ファイル（１４）を格納する。この検索用ファイル
（１４）は、テキスト情報に基づいて作成される。ま
た、外部記憶装置４には、キーボード５から入力する検
索指示文字列の各文字に対応する基準イメージパタンで
ある辞書イメージデータ（１２）を格納している。The external storage device 4 stores input image data (11), O
Text information of recognition result by CR software 2 (1
3) In addition, a search file (14) by executing the document search software 3 is stored. This search file (14) is created based on text information. The external storage device 4 stores dictionary image data (12), which is a reference image pattern corresponding to each character of the search instruction character string input from the keyboard 5.

【００１２】図２Ａは、本実施の形態の情報処理装置８
での文書検索のためのインデックス情報作成のための処
理手順を示すフローチャートである。また、図２Ｂは、
作成されたインデックス情報を用いて文字検索を行う文
字検索処理手順を示すフローチャートである。まず、図
２Ａを参照して、文書検索のためのインデックス情報作
成のための処理手順を説明する。FIG. 2A shows an information processing apparatus 8 according to this embodiment.
9 is a flowchart showing a processing procedure for creating index information for document search in the first embodiment. Also, FIG.
9 is a flowchart illustrating a character search processing procedure for performing a character search using created index information. First, a processing procedure for creating index information for document search will be described with reference to FIG. 2A.

【００１３】ステップＳ１１では、検索対象画像をイメ
ージスキャナ１から入力し、外部記憶装置４に格納され
た後、情報処理装置８のＲＡＭにロードする。ステップ
Ｓ１２では、ＲＡＭにロードされた検索対象画像につい
て、文字認識を行う。そして、最も確からしい認識文字
列を得る。ステップＳ１３では、ステップＳ１１で入力
した入力画像とこれに含まれるステップＳ１２で認識さ
れた文字列と、その入力画像とその認識された文字列の
関係を記述した目次データを検索用ファイル（図１、１
４）に格納する。この目次データは例えば、「画像」と
いう文字列がどの登録文書画像群にはいっていたかを検
索するためのデータであり、これを用いることである文
字列を含む文書画像を高速に知ることができる。In step S11, a search target image is input from the image scanner 1, stored in the external storage device 4, and then loaded into the RAM of the information processing device 8. In step S12, character recognition is performed on the search target image loaded in the RAM. Then, the most likely recognized character string is obtained. In step S13, the input file input in step S11, the character string included in the image recognized in step S12, and table of contents data describing the relationship between the input image and the recognized character string are stored in a search file (FIG. 1). , 1
4). This table of contents data is, for example, data for searching in which registered document image group a character string "image" was entered, and a document image including the character string using this can be quickly known. .

【００１４】以上で、登録時の処理の説明を終了する。
次に、図２Ｂを参照して、作成されたインデックス情報
を用いて文字検索を行う文字検索処理を説明する。ステ
ップ１からステップＳ４までの処理は、入力した検索文
字列に基づいて、ステップ５での文字列検索処理のため
の検索論理和条件を求めるための処理手順を示す。そし
て、ステップＳ５とステップＳ６では、求められた検索
論理和条件に基づいて、入力検索文字列に最も近い文字
列を確定し、図２Ａのフローチャートで生成したインデ
ックス情報を検索して、対応する文書画像を表示する。The description of the registration process has been completed.
Next, a character search process for performing a character search using the created index information will be described with reference to FIG. 2B. The processing from step 1 to step S4 shows a processing procedure for obtaining a search logical sum condition for the character string search processing in step 5 based on the input search character string. In steps S5 and S6, a character string closest to the input search character string is determined based on the obtained search logical OR condition, and the index information generated in the flowchart of FIG. Display an image.

【００１５】尚、ＯＣＲ処理ソフトウエア２は、ステッ
プ１からステップＳ４と、ステップ１１からステップ１
２の処理手順を記述している。また、文書検索ソフトウ
エア１は、ステップ５からステップＳ６の処理手順を記
述している。以下、図２の処理手順を各ステップ毎に説
明する。ステップＳ１では、キーボード５から、検索指
示文字列を入力する。Note that the OCR processing software 2 executes steps 1 to S4 and steps 11 to 1
2 is described. Further, the document search software 1 describes the processing procedure from step 5 to step S6. Hereinafter, the processing procedure of FIG. 2 will be described for each step. In step S1, a search instruction character string is input from the keyboard 5.

【００１６】ステップＳ２では、入力した検索指示文字
列の各文字に対応する基準のイメージパタンデータを辞
書イメージデータ１２から抽出して、入力した検索指示
文字列に対応する文字列イメージパターンを生成する。
尚、ここでは、各文字イメージ間隔や画像のぼかし処理
のぼかし量などの画像処理パラメータは、所定の値を用
いたり、キーボード５から入力することで決定する。In step S2, reference image pattern data corresponding to each character of the input search instruction character string is extracted from the dictionary image data 12, and a character string image pattern corresponding to the input search instruction character string is generated. .
Here, the image processing parameters such as the character image interval and the blur amount of the image blur processing are determined by using a predetermined value or input from the keyboard 5.

【００１７】ステップＳ３では、ステップＳ２で生成さ
れた文字列イメージを各文字に分割処理を行う。そし
て、分割された文字単位に、所定の特徴の特徴量を抽出
し、その特徴量に基づいて、認識辞書を検索し、認識候
補を抽出する。尚、このステップでは、入力した検索指
示文字列を少なくとも認識候補の１つとして含ませる。
尚、ここで、特徴の種類は、例えば、方向特徴や濃淡特
徴などが考えられるが、本発明は、この特徴に制限され
るものではない。In step S3, the character string image generated in step S2 is divided into characters. Then, a feature amount of a predetermined feature is extracted for each divided character, and a recognition dictionary is searched based on the feature amount to extract a recognition candidate. In this step, the input search instruction character string is included as at least one of the recognition candidates.
Here, the type of the feature may be, for example, a directional feature or a shading feature, but the present invention is not limited to this feature.

【００１８】ステップＳ４は、ステップＳ３で生成され
た認識候補に基づいて、考えられる各文字を組み合わせ
て、認識候補としての複数の文字列を生成する。そし
て、ステップＳ５での検索処理のための、文字列のＯＲ
条件を生成する。以上の処理が、ＯＣＲソフトウエア２
での処理である。次に、文書検索ソフトウエア３での処
理であるステップＳ５、Ｓ６を説明する。In step S4, based on the recognition candidates generated in step S3, possible characters are combined to generate a plurality of character strings as recognition candidates. Then, OR the character strings for the search processing in step S5.
Generate a condition. The above processing is the OCR software 2
It is processing in. Next, steps S5 and S6 which are processes in the document search software 3 will be described.

【００１９】ステップＳ５では、ステップＳ４で生成さ
れた文字列のＯＲ条件で、それら文字列に近い文字列を
ステップＳ１３までの処理で生成された、文書画像とそ
れに含まれる文字列を含むインデックス情報のうちの文
字列からサーチする。そして、その検索結果（文書画
像、文字列など）を表示装置６に表示する。例えば、入
力した検索対象画像とその検索結果に対応する文字列の
位置にアンダーラインを引くなどで、検索された位置を
表示する。In step S5, a character string close to the character string is indexed by the OR condition of the character string generated in step S4, including the document image generated by the processing up to step S13 and the character string included therein. Search from the character string of Then, the search results (document images, character strings, etc.) are displayed on the display device 6. For example, the searched position is displayed by underlining the position of the input search target image and the character string corresponding to the search result.

【００２０】次に、図３を参照して、ステップＳ１から
ステップＳ４での検索ＯＲ条件生成処理でのポイントを
具体的に説明する。まず、キーボード５から、検索指示
文字列として、例えば、 "マルチ" を入力する（参照番号１０００）。Next, with reference to FIG. 3, the points in the search OR condition generation processing in steps S1 to S4 will be specifically described. First, for example, "multi" is input as a search instruction character string from the keyboard 5 (reference numeral 1000).

【００２１】次に、この検索指示文字列の各文字に対応
するイメージパタンを辞書イメージデータから取り込
み、その文字列のイメージパタンを生成する。ここで生
成する文字列イメージパタンを生成するための複数のパ
ラメータが用意されており、それらを、辞書イメージデ
ータから取り込んだイメージパタンに作用させて、文字
列イメージパタンを生成する。そのパラメータには、各
文字イメージ間隔や画像のぼかし処理のぼかし量などの
画像処理パラメータを含む。これらのパラメータには、
外部記憶装置４に予め格納された所定の値を用いたり、
キーボード５から入力することで決定する。Next, an image pattern corresponding to each character of the search instruction character string is fetched from the dictionary image data, and an image pattern of the character string is generated. A plurality of parameters for generating a character string image pattern to be generated here are prepared. These parameters are applied to an image pattern taken from dictionary image data to generate a character string image pattern. The parameters include image processing parameters such as a character image interval and a blur amount of the image blur processing. These parameters include
Using a predetermined value stored in the external storage device 4 in advance,
It is determined by inputting from the keyboard 5.

【００２２】次に、ステップＳ３で、ＲＡＭに格納され
た文書イメージの一連の文字認識処理（１００２）を開
始する。そして、“ル”の位置に関して、文字領域の切
り出し、続いて、切り出された領域での文字認識をおこ
なったところ、 “ノ”、“レ” の２文字として、認識されたとする。“ル”の様な文字
に関しては、文字の切り出しエラーが起き易く、例え
ば、“ル”を“ノ”“レ”に誤認識し易い。Next, in step S3, a series of character recognition processing (1002) of the document image stored in the RAM is started. Then, it is assumed that a character area is cut out with respect to the position of “R”, and then character recognition is performed in the cut out area, and the character is recognized as two characters “NO” and “RE”. With respect to a character such as "ru", a character cutout error is likely to occur, and for example, "ru" is likely to be erroneously recognized as "no" or "re".

【００２３】この認識処理で、第１の認識候補：マノレチまた、第２の認識候補：アノＬチの認識候補（１００３）が得られたとする。In this recognition processing, it is assumed that a recognition candidate (1003) of a first recognition candidate: Manorechi and a second recognition candidate: Anoruchi are obtained.

【００２４】次に、ステップＳ４では、得られた第１と
第２の認識文字を比較して、異なっている文字部分を検
出し、その文字部分を入れ替えることにより、別の認識
候補文字列を生成する。上述の例では、第１と第３文字
の２文字が異なっているため、可能な文字列の組み合わ
せは４通りとなり、上述の２つの候補プラス２つの文字
列の組み合わせが可能である。即ち、ａ．マノレチｂ．アノＬチｃ．アノレチｄ．アノＬチである。Next, in step S4, the obtained first and second recognition characters are compared, a different character portion is detected, and the character portion is replaced, so that another recognition candidate character string is obtained. Generate. In the above example, since the first and third characters are different from each other, there are four possible combinations of character strings, and the combination of the above two candidates plus two character strings is possible. That is, a. Manoreti b. Ano Lchi c. Anoreti d. Ano Lchi.

【００２５】これら４通りの文字列（１００４）は、期
待の文字列が「マルチ」であったので、いずれも誤認識
文字列候補となる。しかし、文字認識の過程では、これ
らの認識候補が選択されることはさけられなく、正しい
認識は難しくなる。そこで、本発明の実施の形態では、
逆の発想で、文字認識の過程で選択される可能性のある
誤認識候補文字列も、期待認識候補である「マルチ」に
対する認識候補として認め、これら誤認識候補文字列と
期待認識候補の論理和条件を取る（１００５）ことによ
って、総合的な認識を行う方法を提供する。These four character strings (1004) are erroneously recognized character string candidates because the expected character string is "multi". However, in the process of character recognition, it cannot be avoided that these recognition candidates are selected, and correct recognition becomes difficult. Therefore, in the embodiment of the present invention,
In the reverse idea, an erroneous recognition candidate character string that may be selected during the character recognition process is also recognized as a recognition candidate for the expected recognition candidate “multi,” and the logic of the erroneous recognition candidate character string and the expected recognition candidate is recognized. By taking the sum condition (1005), a method for comprehensive recognition is provided.

【００２６】従って、通常の全文検索では、「マルチ」
のような他の文字分離の可能性がある検索指定文字列に
対しては、文書画像の検索で誤検索を起こしやすかった
が、上述の誤認識候補文字列を含む総合的な認識を行う
ことにより、「マルチ」の文字列を認識結果として選択
できる。ステップＳ４までの処理で、入力文字列に対応
する検索ＯＲ条件が生成された。Therefore, in a normal full-text search, "multi"
It is easy to cause erroneous search in the search of document image for the search specified character string that has the possibility of character separation like, but perform comprehensive recognition including the erroneous recognition candidate character string described above Thus, the character string “multi” can be selected as the recognition result. In the processing up to step S4, a search OR condition corresponding to the input character string has been generated.

【００２７】この検索ＯＲ条件を満足する文字列をステ
ップＳ１３までの処理で生成された、文書画像とそれに
含まれる文字列を含むインデックス情報のうちの文字列
からサーチし、その検索結果（文書画像、文字列など）
を表示装置６に表示することができる。次に、図４を参
照して、ステップＳ２の詳細な処理手順を説明する。A character string that satisfies the search OR condition is searched from the character string in the index information including the document image generated by the processing up to step S13 and the character string included therein, and the search result (document image , Strings, etc.)
Can be displayed on the display device 6. Next, a detailed processing procedure of step S2 will be described with reference to FIG.

【００２８】ステップＳ２０では、入力した検索指示文
字列に対応する文字列イメージパターンを生成するため
の１つのパラメータとして、曖昧度を設定する。この曖
昧度は、外部記憶装置に予め格納されている曖昧度のデ
ィフォルト値でもよいし、また、キーボード５から入力
してもよい。ここで、この曖昧度は、検索対象の文書画
像の解像度が悪かったり、ノイズをおおく含んでいる場
合に予想される文字認識処理の精度の低下を助けるため
に導入する。即ち、検索対象の文書画像の質が悪い時
は、曖昧度が高いとし、質がよい場合は、曖昧度が低い
とする。In step S20, the degree of ambiguity is set as one parameter for generating a character string image pattern corresponding to the input search instruction character string. This ambiguity may be a default value of the ambiguity stored in the external storage device in advance, or may be input from the keyboard 5. Here, this degree of ambiguity is introduced to help lower the accuracy of the character recognition process that is expected when the resolution of the document image to be searched is low or contains a lot of noise. That is, when the quality of the document image to be searched is poor, the degree of ambiguity is determined to be high, and when the quality is good, the degree of ambiguity is determined to be low.

【００２９】ユーザは、検索対象の文書画像の質をおお
よそ判断して、対応する曖昧度を設定する。設定された
曖昧度に応じて、入力した検索指示文字列の各文字に対
応する文字イメージパターン（フォント）のサイズや文
字ピッチサイズを設定する。この設定は、曖昧度が大き
くなれば、文字フォントを小さくし、また、文字ピッチ
を狭める。また、曖昧度が小さくなれば、文字フォント
を大きくし、文字ピッチを広くする。The user roughly determines the quality of the document image to be searched and sets the corresponding ambiguity. The size of the character image pattern (font) and the character pitch size corresponding to each character of the input search instruction character string are set according to the set ambiguity. This setting reduces the character font and the character pitch when the degree of ambiguity increases. When the degree of ambiguity is reduced, the character font is increased and the character pitch is increased.

【００３０】次に、ステップＳ２１では、入力した検索
指示文字列の各文字に対応する基準のイメージパタンデ
ータを辞書イメージデータ１２から抽出して、ステップ
Ｓ２０で設定された文字フォントサイズと文字ピッチに
対応する文字列パターンを生成する。この場合、基準の
イメージパタンデータ（フォント）は、認識が不得意な
文字フォントを用いる。Next, at step S21, reference image pattern data corresponding to each character of the input search instruction character string is extracted from the dictionary image data 12, and is converted to the character font size and character pitch set at step S20. Generate the corresponding string pattern. In this case, as the reference image pattern data (font), a character font that is not good at recognition is used.

【００３１】ステップＳ２２では、ステップＳ２１で生
成した文字列イメージパターンに対して、曖昧度に対応
するノイズを重畳する。即ち、曖昧度が大きければ、強
いノイズとし、逆に、曖昧度が小さければ、弱いノイズ
とする。このノイズの強度は、例えば、強いノイズとし
ては、多くのノイズドットを用いて、文字列イメージパ
ターンに対してランダムに重畳する。逆に、弱いノイズ
としては、少ないノイズドットを用いて、文字列イメー
ジパターンに対してランダムに重畳することで、文字列
イメージパターンに対するノイズのレベルを調整可能で
ある。In step S22, noise corresponding to the degree of ambiguity is superimposed on the character string image pattern generated in step S21. That is, if the degree of ambiguity is large, the noise is determined to be strong, and if the degree of ambiguity is small, the noise is determined to be weak. The intensity of this noise is, for example, superimposed randomly on a character string image pattern using many noise dots as strong noise. Conversely, as weak noise, the noise level for the character string image pattern can be adjusted by randomly superimposing the character string image pattern using a small number of noise dots.

【００３２】ステップＳ２３では、ステップＳ２２で処
理された文字列イメージパターンに対して、ぼかし処理
を施す。このぼかし量も、ステップＳ２２と同様、曖昧
度が大きいときは、ぼかし量を大きく、逆に、曖昧度が
小さい時は、ぼかし量を小さくすればよい。以上の手順
で、ステップＳ２での、入力した検索指示文字列の各文
字に対応する文字列イメージパターンを生成する処理が
実現可能となる。In step S23, a blurring process is performed on the character string image pattern processed in step S22. As in the case of step S22, the blur amount may be increased when the degree of ambiguity is large, and may be decreased when the degree of ambiguity is small. With the above procedure, the process of generating the character string image pattern corresponding to each character of the input search instruction character string in step S2 can be realized.

【００３３】以上説明したように、本発明の実施の形態
では、検索文字列、例えば、「マルチメ」をＯＣＲが誤
認識を起こし易い状態になるようにビットマップ展開
し、これをＯＣＲすることにより、「マノレチ」という
文字列が誤認識傾向文字列群の中の一つに得られ、検索
指定文字列とこれらの誤認識傾向文字列群をＯＲの関係
で、入力文書画像を検索することにより、検索性能を上
げることができる。As described above, in the embodiment of the present invention, a search character string, for example, "multi-me" is bitmap-developed so that OCR is likely to cause erroneous recognition, and this is subjected to OCR. , Is obtained as one of the misrecognition tendency character strings, and the input document image is searched for the search designated character string and the misrecognition tendency character strings in an OR relationship. , Search performance can be improved.

【００３４】尚、本発明の実施の形態では、先に示した
文書画像検索以外にも、データベースを用いた検索にも
応用できる。更に、ＯＭＲやバーコードによる光学読み
取り装置の一部情報の認識尤度が低い場合への応用も同
様の手法で行なえる。本発明に係る実施の形態では、Ｏ
ＣＲ固有の癖を、検索指定文字列をビットマップ展開し
これをＯＣＲすることによって得たが、誤認識を行ない
易い文字、文字列の正解−誤認識テーブルを用いて、Ｏ
ＣＲ固有の癖を誤認識傾向文字列に反映させてもよいこ
とは言うまでもない。In the embodiment of the present invention, in addition to the document image search described above, the present invention can be applied to a search using a database. Further, application to a case where the likelihood of recognition of some information of the optical reading device by the OMR or the barcode is low can be performed by the same method. In the embodiment according to the present invention, O
A CR peculiar habit was obtained by developing a search designated character string into a bitmap and performing OCR on the character string.
It goes without saying that the CR peculiarity may be reflected in the misrecognition tendency character string.

【００３５】なお、本発明は、複数の機器（例えばホス
トコンピュータ，インタフェイス機器，リーダ，プリン
タなど）から構成されるシステムに適用しても、一つの
機器からなる装置（例えば、複写機，ファクシミリ装置
など）に適用してもよい。また、本発明の目的は、前述
した実施形態の機能を実現するソフトウェアのプログラ
ムコードを記録した記憶媒体を、システムあるいは装置
に供給し、そのシステムあるいは装置のコンピュータ
（または、ＣＰＵやＭＰＵ）が記憶媒体に格納されたプ
ログラムコードを読出し実行することによっても、達成
されることは言うまでもない。The present invention can be applied to a system including a plurality of devices (for example, a host computer, an interface device, a reader, a printer, etc.), but can be applied to a single device (for example, a copier, a facsimile). Device). Further, an object of the present invention is to provide a storage medium storing a program code of software for realizing the functions of the above-described embodiments to a system or an apparatus, and a computer (or CPU or MPU) of the system or apparatus to store the storage medium. It goes without saying that the present invention is also achieved by reading and executing the program code stored in the medium.

【００３６】この場合、記憶媒体から読出されたプログ
ラムコード自体が前述した実施形態の機能を実現するこ
とになり、そのプログラムコードを記憶した記憶媒体は
本発明を構成することになる。プログラムコードを供給
するための記憶媒体としては、例えば、フロッピディス
ク，ハードディスク，光ディスク，光磁気ディスク，Ｃ
Ｄ−ＲＯＭ，ＣＤ−Ｒ，磁気テープ，不揮発性のメモリ
カード，ＲＯＭなどを用いることができる。In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention. Examples of a storage medium for supplying the program code include a floppy disk, hard disk, optical disk, magneto-optical disk, and C
A D-ROM, a CD-R, a magnetic tape, a nonvolatile memory card, a ROM, and the like can be used.

【００３７】また、コンピュータが読出したプログラム
コードを実行することにより、前述した実施形態の機能
が実現されるだけでなく、そのプログラムコードの指示
に基づき、コンピュータ上で稼働しているＯＳ（オペレ
ーティングシステム）などが実際の処理の一部または全
部を行い、その処理によって前述した実施形態の機能が
実現される場合も含まれることは言うまでもない。When the computer executes the readout program code, not only the functions of the above-described embodiment are realized, but also the OS (Operating System) running on the computer based on the instruction of the program code. ) May perform some or all of the actual processing, and the processing may realize the functions of the above-described embodiments.

【００３８】さらに、記憶媒体から読出されたプログラ
ムコードが、コンピュータに挿入された機能拡張ボード
やコンピュータに接続された機能拡張ユニットに備わる
メモリに書込まれた後、そのプログラムコードの指示に
基づき、その機能拡張ボードや機能拡張ユニットに備わ
るＣＰＵなどが実際の処理の一部または全部を行い、そ
の処理によって前述した実施形態の機能が実現される場
合も含まれることは言うまでもない。Further, after the program code read from the storage medium is written into a memory provided on a function expansion board inserted into the computer or a function expansion unit connected to the computer, based on the instructions of the program code, It goes without saying that the CPU included in the function expansion board or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【００３９】本発明を上記記憶媒体に適用する場合、そ
の記憶媒体には、先に説明したフローチャートに対応す
るプログラムコードを格納することになるが、簡単に説
明すると、図５のメモリマップ例に示す各モジュールを
記憶媒体に格納することになる。すなわち、少なくと
も、検索文字列を入力する「検索文字列入力モジュー
ル」、入力した検索文字列に基づいて、検索ＯＲ条件を
生成する「検索ＯＲ条件生成モジュール」、検索対象文
書画像を入力する「検索対象画像入力モジュール」、検
索文字列に基づいて生成された文字列イメージパタンと
入力した検索対象文書画像の文字認識を行う「文字認識
モジュール」、生成された検索ＯＲ条件を満足する文字
列を検索対象文書画像の文字列から獲得する「文字列検
索モジュール」、検索結果を表示する「表示モジュー
ル」の各モジュールのプログラムコードを記憶媒体に格
納すればよい。When the present invention is applied to the storage medium, the storage medium stores program codes corresponding to the above-described flowcharts. Each module shown will be stored in a storage medium. That is, at least a “search character string input module” for inputting a search character string, a “search OR condition generation module” for generating a search OR condition based on the input search character string, and a “search for inputting a search target document image” "Target image input module", "Character recognition module" that performs character recognition of a character string image pattern generated based on a search character string and an input search target document image, and searches for a character string that satisfies the generated search OR condition The program codes of the "character string search module" obtained from the character string of the target document image and the "display module" for displaying the search result may be stored in the storage medium.

【００４０】以上説明したように、本発明の実施の形態
によれば、ＯＣＲの誤認識が系統的な場合に、検索指定
文字列の文字並びに即した誤認識の傾向を表す文字列群
を得て、これを、本来の検索指定文字列とＯＲの関係に
して検索することにより、ＯＣＲの認識率が低い場合に
でも、検索漏れの少ない文書画像検索が行なえる。ま
た、前もって誤認識を行ない易い文字、文字列の正解−
誤認識テーブルを作成する必要が無く、また、検索指定
文字列の文字の並びに即した誤認識傾向が得られるた
め、個々の文字に対して、正解−誤認識テーブルを適応
する場合よりも精度の高い誤認識傾向が得られる。As described above, according to the embodiment of the present invention, when the OCR erroneous recognition is systematic, the characters of the search designated character string and the character string group indicating the tendency of the erroneous recognition are obtained. Then, by performing this search in the relationship between the original search designation character string and OR, even when the OCR recognition rate is low, a document image search with few omissions in search can be performed. In addition, correct characters and character strings that are easily misrecognized in advance
Since there is no need to create a misrecognition table and a tendency of misrecognition that matches the sequence of characters in the search designated character string is obtained, the accuracy of each character is higher than when the correct-misrecognition table is applied. High misrecognition tendency is obtained.

【００４１】本実施の形態は、検索指定語に関する技術
拡張であるため、従来から存在するＯＣＲを用いた文書
画像検索システムに対して、検索条件入力段の前に本発
明に係る実施の形態を適応するだけで、ＯＣＲの誤認識
に強い検索が可能となる。これらのメリットにより、人
手によるＯＣＲ誤認識訂正作業をなくし、文書画像入力
からＯＣＲを経てイメージデータ・テキストデータ蓄積
まで無人で行なえるシステムを構築でき、且つ、コンピ
ュータに対し負荷を掛けず、処理速度の低下の心配が無
く、比較的安価なシステムを実現できる。 [第２の実施の形態]上述の検索ＯＲ条件に基づいて全体
として確からしい文字列を検索する一例を示したが、こ
れに限定されることはなく、検索ＯＲ条件のいづれか１
つを満足する文字列を検索して、その文字列が含まれる
文書画像を検索結果としてもよい。Since the present embodiment is a technical extension related to a search designation word, the embodiment according to the present invention is applied to a conventional document image search system using OCR before a search condition input stage. A search that is resistant to OCR misrecognition becomes possible only by adapting. These advantages eliminate the need for manual OCR misrecognition and correction work, and enable the construction of a system that enables unattended operation from document image input to storage of image data and text data via OCR. A relatively inexpensive system can be realized without fear of a decrease. [Second Embodiment] An example of searching for a likely character string as a whole based on the above-mentioned search OR condition has been described. However, the present invention is not limited to this.
A character string that satisfies the above conditions may be searched, and a document image including the character string may be used as a search result.

【００４２】[0042]

Claims

[Claims]

An expansion step of expanding a search character string into a corresponding character string image; a filtering step of performing predetermined filtering on the character string image expanded in the expansion step; and a character string filtered in the filtering step Segmentation of the image into independent parts, performing character recognition of each segmented part, and a recognition step of obtaining a recognized character string candidate, based on the difference between the recognized character string candidates obtained in the recognition step, can be combined A generation step of generating another recognized character string candidate; and a predetermined character under a logical OR condition of the another combinable character string candidate generated in the generating step and the recognized character string candidate obtained in the recognition step. A character string search step of searching for a column.

2. The method according to claim 1, wherein the character string image corresponding to the search character string is a character string image having a relatively low recognition rate.

3. The method according to claim 2, wherein the predetermined character string is included in a search target image, and the predetermined filtering type is determined based on an ambiguity of the search target image. 2. The image search method according to 1.

4. The method according to claim 3, wherein the ambiguity of the search target image is high if the image quality of the search target image is poor, and low if the image quality is good. .

5. The image according to claim 4, wherein in the expanding step, the character string image corresponding to the search character string based on the ambiguity is expanded into a character string image having a character pitch and a character size. retrieval method.

6. The development step, wherein, if the degree of ambiguity is large, the character string image corresponding to the search character string is developed into a character string image having a small character pitch and a small character size. 5. The image search method according to 5.

7. The filtering method according to claim 1, wherein a filter that superimposes strong noise is used when the degree of ambiguity of the search target image is large.
Image search method described in.

8. The image search method according to claim 1, wherein, as the predetermined filtering type, a filter that increases blurring is used when the ambiguity of the search target image is large.

9. Expanding means for expanding a search character string into a corresponding character string image, filtering means for performing predetermined filtering on the character string image expanded by the expanding means, and a character string filtered by the filtering means Segmenting the image into independent parts, performing character recognition on each of the segmented parts, and a recognition unit that obtains a recognized character string candidate, based on the difference between the recognized character string candidates obtained by the recognition unit, can be combined. Generating means for generating another recognized character string candidate; and a predetermined character based on a logical sum condition of the another recognizable character string candidate generated by the generating means and the recognized character string candidate obtained by the recognizing means. An image search apparatus comprising: a character string search unit that searches for a column.

10. The image search apparatus according to claim 9, wherein the character string image corresponding to the search character string is a character string image having a relatively low recognition rate.

11. The method according to claim 11, wherein the predetermined character string is included in a search target image, and the predetermined filtering type is determined based on an ambiguity of the search target image. 10. The image search device according to 9.

12. The image search apparatus according to claim 11, wherein the ambiguity of the search target image is high when the image quality of the search target image is poor, and low when the image quality is good. .

13. The image according to claim 12, wherein the expansion unit expands the character string image corresponding to the search character string based on the ambiguity into a character string image having a character pitch and a character size. Search device.

14. The character string image corresponding to the search character string having a narrow character pitch and a small character size if the degree of ambiguity is large. 14. The image search device according to claim 13.

15. The type of the predetermined filtering is:
The image search apparatus according to claim 9, wherein a filter that superimposes strong noise is used if the degree of ambiguity of the search target image is large.

16. The predetermined filtering type is:
10. The image search device according to claim 9, wherein a filter for increasing blur is used when the degree of ambiguity of the search target image is large.

17. A computer program product comprising a computer usable medium having computer readable program code means, said computer program product developing a search string into a corresponding string image. Possible first program code means, computer-readable second program code means for performing predetermined filtering on the character string image developed by the first program code means, and filtered by the second program code means Computer-readable third program code means for segmenting the character string image into independent parts, performing character recognition on each of the segmented parts, and obtaining recognized character string candidates, and the third program code means Computer-readable fourth program code means for generating another recognizable character string candidate based on the difference between the recognized character string candidates obtained in the above step, and the combination possible generated by the fourth program code means A computer program product comprising: a fifth program code means for searching for a predetermined character string based on a logical sum condition of another recognized character string candidate and a recognized character string candidate obtained by the third program code means. .

18. The image search method according to claim 1, wherein the recognition step includes at least the search character string in the obtained recognized character string candidates.

19. The image search apparatus according to claim 9, wherein the recognition unit includes at least the search character string in the obtained recognized character string candidates.