JP4229521B2

JP4229521B2 - Character recognition method and apparatus

Info

Publication number: JP4229521B2
Application number: JP14105399A
Authority: JP
Inventors: 潤二柏岡; 聡直井
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1999-05-21
Filing date: 1999-05-21
Publication date: 2009-02-25
Anticipated expiration: 2019-05-21
Also published as: JP2000331122A

Description

【０００１】
【発明の属する技術分野】
近年、入力周辺機器として文字認識技術の需要が増加している。本発明は、この文字認識に使用される文字認識方法および装置に関し、特に本発明は、手書き文字と活字が混在する文書において、より高い認織精度で高速に認識結果を得ることができる文字認識方法及び装置に関するものである。
【０００２】
【従来の技術】
手書き文字と活字が混在する文書の文字認識においては、通常、手書き文字認識と活字認識の２つの処理を行い、両者の認識結果を比較して信頼度の高い結果を採用していた。この方法は、認識のために手書き文字認識と活字認識を行う必要があり処理時間を要する。
そこで、予め認識の対象となる文字（列）が手書き文字か活字かを判別し、その結果に応じて手書き文字認識もしくは活字認識を行う方法が提案されている。この方法によれば、文字認識処理を２度行う必要がないので処理時間を短縮することができる。
【０００３】
手書き文字／活字判別法としては、例えば、文書や帳票からスキャナ等により文書画像を得て、該文書画像から文字列を抽出し、さらに文字列に含まれる文字を抽出し、全文字についてピッチ、中心位置のずれなどの文字の配置の規則性を求め、文字列の文字の配置に規則性があるときに活字文字列とし、配置に規則性がないときに文字列を手書き文字列と判別する方法が用いられる。
【０００４】
【発明が解決しようとする課題】
文字列中の文字の配置の規則性を、文字列中の全文字の文字の高さ方向への中心位置の規則性により表す方法としては、例えば前記したように、文字列から文字を切り出し、高さ方向の文字の中心位置を求めて、中心位置のバラツキが小さい場合は活字、バラツキが大きい場合は手書き文字と判別方法が用いられる。
この方法は、漢字又は数字の場合の多くについては、活字では文字の大きさが一定かつ規則的に配置されているため、中心位置にバラツキが少なく、手書き文字との判別に利用可能な特徴ではある。
【０００５】
しかし、図１４に示すようにカタカナの場合にみられる文字に濁点が含まれる文字列の場合、濁点により文字の中心位置が上にずれ（例えば図中の「ジ」）、活字でも文字の中心位置のバラツキが大きくなる。
また、文字列に小文字が含まれると、文字の中心位置が、下の方向にずれるため、中心位置のバラツキが活字でも大きくなる（図中の「ョ」）。
以上の理由から、カタカナ、平仮名など、濁点、小文字をもつ字種を対象に含む場合、従来方法を適用することはできなかった。
【０００６】
また、前記した手書き／活字判別法は、対象がレイアウト情報の既知な帳票でも、レイアウト情報を手書き／活字判別処理に利用していなかった。
すなわち、予めレイアウト情報が登録されている帳票においては、例えば、金額欄などの項目はこれに続く文字記入領域に数字が書かれることが明白であるが、従来方法では、レイアウト情報から判別できる文字記入欄に書かれる字種に関する情報を利用せずに手書き／活字判別を行っており、手書き文字／活字の判別精度が悪かった。
また、帳票においては、金額欄などの項目に続く文字記入欄は一つの帳票内では手書き文字または活字にほとんどが統一されて文字が記入されるが、この情報も利用しておらず、各文字記入欄毎の手書き／活字判別結果をそのまま利用していた。このため、手書き文字／活字の判別精度を向上させることが難しかった。
本発明は上記した事情を考慮してなされたものであって、文字認識処理における手書き／活字判別の精度を向上させ、文字認識による処理時間を短縮するともに文字認識精度を向上させることである。
【０００７】
【課題を解決するための手段】
図１は本発明の原理図であり、同図に示すように本発明においては次のようにして上記課題を解決する。
図１に示すように、文字列を抽出し、手書き／活字判別処理を行い、その結果に基づき手書き文字認識処理もしくは活字認識処理を行う。
上記手書き／活字判別処理において、次のようにして手書き文字と活字の判別を行う。(a) 文字列から濁点を含めずに文字を抽出し、抽出した全文字の中心位置を求め、その規則性から、手書き文字と活字の判別を行う。
これにより、濁点を含む活字において、中心位置のバラツキが大きくなる問題を解決できる。
(b) 文字列に含まれる小文字を含めずに抽出した全文字の中心位置を求め、その規則性をから、手書き文字と活字の判別を行う。
これにより、小文字を含む活字において、中心位置のバラツキが大きくなる問題を解決できる。
上記、小文字を含めずに文字の中心位置の規則性を求める方法としては、例えば、文字列から基準とする１文字を抽出し、この文字に対して大きさが小さく、かつ、この基準とする文字の下方に位置する文字を除去して、文字の中心位置の規則性を求める方法を用いることができる。
また、文字列中の文字の高さ方向の中心位置の規則性を手書き／活字判別に利用するに際し、文字列に含まれる文字数が少ない場合は、手書き／活字の判別を保留するようにしてもよい。これにより文字数がすくない場合でも、手書き／活字の判別を誤ることがなくなる。なお、手書き／活字の判別を保留した場合には、文字認識処理において、手書き文字認識処理、活字認識処理を両方を行い信頼性の高い結果を採用する。
【０００８】
【発明の実施の形態】
図２は本発明が適用されるシステムの構成例を示す図である。同図において、１１はＣＰＵであり、ＣＰＵ１１にはバス１０が接続されており、バス１０には、ＲＡＭ１２、ＲＯＭ１３、通信インタフェース１４、外部記憶装置１５、記憶媒体読み取り装置１６、画像読み取りを行うスキャナ１７、ディスプレイ、キーボード等の入出力装置１８等が接続されている。ＲＯＭ１３、ＲＡＭ１２にはＯＳや本実施例の処理を行うためのプログラム、データ等が格納される。
スキャナ１７は、文字認識の対象となる文字等が記録された画像を読み取る。また、入出力装置１８は、キーボードやマウス等の入力デバイス、ディスプレイモニタ、プリンタ等の出力デバイスを含み、本発明のデータ処理を行うためのデータを入力したり、処理されたデータを出力する。
【０００９】
本発明の文字認識処理を行うためのプログラム、データ等は、ハードディスク等の記憶装置１５あるいはＣＤ−ＲＯＭ、フロッピーディスク等の記録媒体に記録されており、データ処理実行時、記録装置１５から、あるいは、記録媒体読み取り装置１７により記録媒体から読み取られＲＡＭ１２に格納され、ＣＰＵ１１により実行される。また、バス１０には通信インタフェース１４が接続されており、通信回線を介して、データやプログラムの授受を行うことができる。
【００１０】
図３は本発朋の実施例の文字認識処理の処理フローを示す図である。
同図に示すように、スキャナ１７等の画像入力手段により文書又は帳票の画像が獲得され、獲得された画像に対してノイズ除去、傾き補正等の前処理がなされる（ステップＳ１）。ついで、画像から一文字列を抽出して（ステップＳ２）、本発明に係わる手書き／活字の識別が行なわれる（ステップＳ３）。そして、抽出された文字列が手書き文字として判別された場合には、フラグを１に設定し、また活字として判別された場合にはフラグを２に設定する。さらに、手書き文字か活字かの判別不可能の場合には、フラグを３に設定する（ステップＳ４）。
【００１１】
文字認識処理においては上記フラグを参照し、次のように文字認識を行う（ステップＳ５〜Ｓ６）。
▲１▼ 手書き文字と判別された場合には、手書き文字認識処理を行う。
▲２▼ 活字として判別された場合には、活字認識処理を行う。
▲３▼ 手書き文字、活字の判別が不可能な場合には、手書き文字認識処理と活字認識処理を行い、より信頼性が高い方を採用する。
上記のように、予め手書き文字か活字であるかを判別し、文字認識処理を行うことにより、文字認識処理の信頼性を向上させることができるとともに、手書き文字認識処理と活字認識処理の両方の処理を行うことなく文字認識を行うことができるので処理の高速化を図ることができる。
【００１２】
次に上記文字認識処理における手書き／活字判別処理について本発明の実施例を説明する。
図４は本発明の実施例の手書き活字判別の処理フローを示す図である。
図４に示すように、まず、抽出した文字列の文字から連結成分を取り出す（ステップＳ１）。例えば、抽出された文字列が前記図１４に示した「モジニンシキショリ」の場合、最初の文字「モ」は文字を構成する全画素が連結しているので図４に示すように１つの連結成分として抽出される。また、例えば、次の文字「ジ」については、５つの連結成分が抽出される。
次に抽出した連結成分から濁点を除去し（ステップＳ２）、連結成分を統合する（ステップＳ３）。すなわち、濁点除去処理で残った連結成分を、適当な条件の元に統合し、文字を構成する単位に分離する。統合条件は、例えば、連結成分の外接矩形同士を比較し、重なりがある場合などに統合する。
図１４の例の場合には図４に示すように「ジ」の濁点が除去され、残りの連結成分が統合される。
【００１３】
ついで、ステップＳ４において、小文字を除去する。図１４の例の場合には、小文字の「ョ」が除去される。
ステップＳ５において、残りの文字数を、予め定められた閾値と比較し、残り文字数が閾値より小さい場合には、ステップＳ６に行き、手書き／活字の判別ができないとしてフラグを３に設定して処理を終わる。
【００１４】
すなわち、抽出した文字数が少ない場合は、後述する文字の中心位置のバラツキの値に信頼性が乏しいため、判別を誤る可能性を有する。従って、抽出した文字数が少ない場合は、判別を保留して、前記▲３▼のように手書き／活字の両認識処理を行い、信頼性の高い方を採用する。
また、残り文字数が閾値より大きい場合には、ステップＳ７に行き、文字の中心位置の規則性を算出する。すなわち、図５に示すように各文字を分離して抽出し、各文字の高さ方向の中心位置を求める。そして、各文字の中心位置のバラツキを分散などにより数値化する。
次いでステップＳ８において、中心位置のバラツキの大小に応じて手書き文字か活字かを判別する。すなわち、中心位置のバラツキが大きい場合には手書き文字列であるとしてフラグを１に設定し、バラツキが小さい場合には、活字の文字列であるとしてフラグを２に設定する。
【００１５】
図６は上記濁点の除去方法を示す処理フローである。濁点の除去方法としては、図６の処理フローに示すように、まず、ステップＳ１において、所定値より上方にあり、かつ小さい連結成分を抽出する。例えば図７に示すように「ジ」の場合には、所定値より大きな連結成分ｃ，ｄ，ｅが抽出される。
次に、ステップＳ２において、抽出された連結成分の相互間の距離を算出する。図７の例の場合には連結成分ｃ，ｄ，ｅの相互の距離を算出する。
ステップＳ３において、連結成分間の距離が閾値より小さいものがあるかを調べる。そして、連結成分間の距離が閾値より小さいものがない場合には濁点がないとして処理を終了する。また、連結成分間の距離が閾値より小さい場合には、ステップＳ４に行き、距離が閾値より小さい連結成分のペアを濁点として除去する。図７の場合には連結成分ｄ，ｅの距離が閾値より小さいので濁点として除去される。その結果、図８に示すように濁点候補１，２が除去される。
【００１６】
以上のように濁点を除去したのち、前記したように連結成分を統合し、文字を構成する単位に分離する。
この処理により分離抽出された文字から小文字を除去するための処理フローを図９に示す。
まず、ステップＳ１において基準文字を抽出し、ステップＳ２において基準文字の高さ、下座標を抽出する。
【００１７】
例えば、最初の１文字目（図１０の横書き文字列では一番左の文字）の文字を基準文字とし、その高さと高さ方向への下座標を抽出する。なお、その他、文字列の内の高さが最大の文字を基準文字としてもよい。
ステップＳ３において、基準文字と各文字の下座標を比較し、下座標の差が所定値以下の文字の高さと基準文字の高さの差（もしくは比）を算出する。
そして残りの各文字につき、高さが基準文字の高さに比して小さく、下座標が基準文字の下座標に近いものを小文字として除去する（ステップＳ４）。例えば図１０の例の場合には、「ョ」が除去される。
以下、前記したように、小文字を除去して残った文字に対して、文字の中心位置の規則性を、文字の高さ方向の中心位置の座標の分散などにより数値化して求め、この数値の大きさにより、手書き文字か、活字かを判別する。
【００１８】
以上のように本実施例においては、濁点、小文字を除去した文字列中の文字の中心位置の規則性により手書き／活字判別を行っているので、カタカナ、平仮名など、濁点、小文字をもつ字種を対象に含む場合であっても、手書き／活字判別処理を精度よく行うことができる。
なお、本実施例に示した文字の中心位置の規則性による判別処理を単独で用いて手書き／活字の判別を行ってもよいが、この特徴量に加えて、文字列から他の特徴量を複数抽出して、これらの特徴と合わせて総合的に判別することも可能である。その場合、判別の方法は、特徴量の大小による判別だけではなく、判別分析などの方法も用いることも可能である。
【００１９】
次にレイアウト情報を手書き／活字判別処理に利用した例について説明する。
この場合の処理フローを図１１に示し、また、この場合の処理概念図を図１２に示す。まず、図１１のステップＳ１において、画像データに変換された帳票に対し、帳票のレイアウトを解析し、ステップＳ２において、項目とそれに続く文字記入領域にグループに分けする。
グループ分けは、帳票の列毎にグループを作成するなどの方法を用いることができる。例えば、図１２の場合には、項目として「銀行名」、「支店名」、「口座番号」、「金額」等の項目文字とそれに続く、文字記入欄にグループ分けされる。
ステップＳ３において、帳票の項目領域に対して文字認識を行い、項目文字を認識する。
【００２０】
次いでステップＳ４において、文字認識の結果得られた情報から、文字記入領域の字種を判定する。例えば、項目領域に「金額」とかかれている場合には、この項目と同じグループに属する文字記入欄の字種を数字と推定する。字種の推定方法としては、例えば、図１１に示すように、項目とそれに続く文字記入欄の字種の対応データを参照し文字記入領域の字種を判定することができる。
ステップＳ５において、上記判定結果から得られた字種に限定して、特徴量を抽出し、手書き文字／活字判別処理を行う。特徴量としては、第１の実施例に示した文字列中に含まれる文字の中心位置のバラツキや、文字の高さのバラツキなどを用いることができる。
また判別方法としては、特徴量を予め設定した閾値を比較して、バラツキの量が閾値以下なら活字、閾値以上なら手書き文字と判別できる。また、特徴量を複数用いて、判別分析などにより特徴空間上で総合的に判別することも可能である。
【００２１】
以上のように、帳票のレイアウト情報を利用し、項目領域の文字認識結果に基づき、文字記入欄の字種を限定して手書き／活字判別処理を行っているので、手書き文字／活字判別処理精度を向上させることができる。
なお、上記では、未知の帳票に対しては、項目領域の文字を認識した結果により、この項目と同一のグループに属する文字記入領域の文字の字種を限定して手書き／活字判別を行うようにしたが、帳票のレイアウト情報と文字記入領域の字種情報が予め登録されている場合は、まず処理対象の帳票のレイアウト情報を抽出し、レイアウト情報を用いて帳票照合を行い、登録されている帳票と一致した場合には、登録されている帳票の文字記入欄についての字種情報を用いて、文字記入欄毎に字種を限定して手書き／活字判別を行うことも可能である。
【００２２】
次にレイアウト情報を手書き／活字判別処理に利用した他の例について説明する。処理フローを図１３に示す。
図１３のステップＳ１において、画像データに変換された帳票に対して、帳票の構造を解析し、レイアウト情報を取得する。
次に、ステップＳ２において、帳票の文字記入領域をグループ分けする。グループ分けは、帳票の列毎にグループを作成するなどの方法を用いることができる。例えば、前記図１２に示した帳票の場合には、項目「銀行名」の文字記入欄（グループ１）、項目「支店名」の文字記入欄（グループ２）、…、項目「金額」（グループ４）の文字記入欄にグループ分けする。
【００２３】
そして、ステップＳ３において、各文字記入欄の文字列毎に手書き／活字判別を行う。すなわち、図１２の場合、グループ１〜グループ４について文字記入欄の手書き／活字判別を行う。
ステップＳ４において、グループ毎に、手書き／活字判別結果を集計し、ステップＳ５において、上記集計結果で、数の多い方の判別結果を正解とし、数の少ない方の判別結果を数の多い方の判別結果に修正する。
例えば、図１２の例では、グループ１について、手書き文字に判別した結果が多く、活字に判別した結果が少ない場合には、そのグループは手書き文字であるとして、活字と判別した結果を手書き文字の判別結果に修正する。また、逆に活字に判別した結果が多く、手書き文字に判別した結果が少ない場合には、手書き文字と判別した結果を活字の判別結果に修正する。同様に各グループ１〜４について、判別結果を修正する。
以上のように、帳票のレイアウト情報を利用し、各グループの手書き／活字判別結果を集計し、集計結果に基づき同じグループに属する文字記入欄の手書き／活字判別処理結果を修正しているので、第２の実施例と同様、手書き文字／活字判別処理精度を向上させることができる。
【００２４】
【発明の効果】
以上説明したように、本発明においては以下の効果を得ることができる。
（１）文字列から濁点を含めずに求めた文字の中心位置の規則性を抽出して、手書き文字と活字の判別を行うため、濁点を含む可能性のあるカタカナ、平仮名などの任意の文字列の手書き／活字判別が可能となる。
（２）文字列に含まれる小文字を含めずに文字の中心位置の規則性を抽出して、手書き文字と活字の判別を行うため、小文字を含む可能性のあるカタカナ、平仮名などの任意の文字列の手書き／活字判別が可能となる。
また、対象とする文字列の文字数が少ない場合は、判別保留（リジェクト）の判別結果を出力し、手書き、活字の両認識処理に認識を委ねることにより、文字認識の精度を損なうことがない。
【図面の簡単な説明】
【図１】本発明の原理図である。
【図２】本発明が適用されるシステムの構成例を示す図である。
【図３】本発朋の実施例の文字認識処理の処理フローを示す図である。
【図４】本発明の実施例の手書き活字判別の処理フローを示す図である。
【図５】本実施例の手書き／活字判別処理の概念を説明する図である。
【図６】濁点の除去方法を示す処理フローである。
【図７】濁点の除去方法を説明する図である。
【図８】濁点の除去処理の概念を説明する図である。
【図９】小文字を除去するための処理フローを示す図である。
【図１０】小文字除去処理の概念を説明する図である。
【図１１】レイアウト情報を手書き／活字判別処理に利用した例の処理フローである。
【図１２】図１１の例の処理概念を説明する図である。
【図１３】レイアウト情報を手書き／活字判別処理に利用した他の例の処理フローである。
【図１４】従来方法で問題となる濁点、小文字を含むカタカナ活字の文字例である。
【符号の説明】
１１ＣＰＵ
１０バス
１２ＲＡＭ
１３ＲＯＭ１３
１４通信インタフェース
１５外部記憶装置
１６記憶媒体読み取り装置
１７スキャナ
１８入出力装置[0001]
BACKGROUND OF THE INVENTION
In recent years, demand for character recognition technology as an input peripheral device has increased. The present invention relates to a character recognition method and apparatus used for character recognition, and in particular, the present invention relates to character recognition capable of obtaining a recognition result at a high speed with higher recognition accuracy in a document in which handwritten characters and printed characters are mixed. It relates to a method and a device.
[0002]
[Prior art]
In character recognition of a document in which handwritten characters and type letters are mixed, usually, two processes of handwritten character recognition and type recognition are performed, and the recognition results of both are compared to adopt a result with high reliability. This method requires processing time because it requires recognition of handwritten characters and type recognition for recognition.
In view of this, a method has been proposed in which it is determined in advance whether a character (sequence) to be recognized is a handwritten character or a type, and handwritten character recognition or type recognition is performed according to the result. According to this method, since it is not necessary to perform the character recognition process twice, the processing time can be shortened.
[0003]
As a handwritten character / type discrimination method, for example, a document image is obtained from a document or form by a scanner or the like, a character string is extracted from the document image, characters included in the character string are further extracted, pitches for all characters, Obtain the regularity of the character arrangement such as the shift of the center position, and when the character arrangement of the character string is regular, it is regarded as a printed character string, and when the arrangement is not regular, the character string is determined as a handwritten character string The method is used.
[0004]
[Problems to be solved by the invention]
As a method of expressing the regularity of the arrangement of characters in the character string by the regularity of the center position in the height direction of the characters of all characters in the character string, for example, as described above, the characters are cut out from the character string, The center position of the character in the height direction is obtained. If the variation in the center position is small, the type is used, and if the variation is large, the discrimination method is a handwritten character.
This method is characterized by the fact that in many cases in the case of kanji or numbers, the size of the characters in the type is constant and regularly arranged, so there is little variation in the center position, and it can be used for discrimination from handwritten characters. is there.
[0005]
However, as shown in FIG. 14, in the case of a character string in which the character seen in katakana includes a cloud point, the center position of the character is shifted upward due to the cloud point (for example, “G” in the drawing), The position variation becomes large.
If the character string includes lowercase letters, the center position of the character shifts in the downward direction, so that the variation in the center position becomes large even in the typed characters (“ョ” in the figure).
For the above reasons, the conventional method could not be applied to the case where the character type having dakuten and lowercase letters such as katakana and hiragana was included.
[0006]
In the handwriting / type discrimination method described above, the layout information is not used for handwriting / type discrimination processing even if the target is a form whose layout information is known.
That is, in a form in which layout information is registered in advance, for example, it is clear that an item such as an amount field is written with a number in the character entry area that follows this, but in the conventional method, characters that can be distinguished from the layout information The handwriting / type discrimination is performed without using the information on the character type written in the entry column, and the discrimination accuracy of the handwritten / type is poor.
In addition, in the form, the text entry field following the item such as the amount field is mostly unified with handwritten characters or printed characters in one form, but this information is not used and each character is entered. The handwriting / type discrimination results for each entry field were used as they were. For this reason, it has been difficult to improve the discrimination accuracy of handwritten characters / types.
The present invention has been made in consideration of the above circumstances, and is to improve the accuracy of handwriting / type discrimination in character recognition processing, shorten the processing time by character recognition, and improve the character recognition accuracy.
[0007]
[Means for Solving the Problems]
FIG. 1 is a diagram illustrating the principle of the present invention. As shown in FIG. 1, the present invention solves the above problem as follows.
As shown in FIG. 1 , a character string is extracted, handwriting / type discrimination processing is performed, and handwriting recognition processing or type recognition processing is performed based on the result.
In the handwriting / type discrimination process, a handwritten character and type are discriminated as follows. (a) Characters are extracted from the character string without including a cloud point, the center positions of all the extracted characters are obtained, and handwritten characters and printed characters are discriminated from their regularity.
Thereby, it is possible to solve the problem that the variation of the center position becomes large in the type including the muddy point.
(b) The center positions of all characters extracted without including lowercase letters included in the character string are obtained, and handwritten characters and type characters are discriminated based on their regularity.
Thereby, it is possible to solve the problem that the variation in the center position becomes large in a type including lower case letters.
As a method for obtaining the regularity of the center position of the character without including the lower case letters, for example, one character as a reference is extracted from the character string, the size is small with respect to the character, and the reference is used. It is possible to use a method of obtaining the regularity of the center position of the character by removing the character located below the character.
In addition, when the regularity of the center position in the height direction of the character in the character string is used for handwriting / printing discrimination, if the number of characters included in the character string is small, the handwriting / printing discrimination may be suspended. Good. As a result, even when the number of characters is small, there is no mistake in handwriting / printing discrimination. When the handwriting / type discrimination is suspended, both the handwritten character recognition process and the type recognition process are performed in the character recognition process, and a highly reliable result is adopted.
[0008]
DETAILED DESCRIPTION OF THE INVENTION
FIG. 2 is a diagram showing a configuration example of a system to which the present invention is applied. In the figure, reference numeral 11 denotes a CPU, and a bus 10 is connected to the CPU 11. The bus 10 includes a RAM 12, a ROM 13, a communication interface 14, an external storage device 15, a storage medium reading device 16, and a scanner for reading an image. 17, an input / output device 18 such as a display and a keyboard is connected. The ROM 13 and RAM 12 store an OS, a program for performing the processing of this embodiment, data, and the like.
The scanner 17 reads an image in which characters and the like that are targets for character recognition are recorded. The input / output device 18 includes input devices such as a keyboard and a mouse, and output devices such as a display monitor and a printer, and inputs data for performing data processing according to the present invention and outputs processed data.
[0009]
The program, data, and the like for performing the character recognition processing of the present invention are recorded on a storage device 15 such as a hard disk or a recording medium such as a CD-ROM or a floppy disk. Then, it is read from the recording medium by the recording medium reading device 17, stored in the RAM 12, and executed by the CPU 11. In addition, a communication interface 14 is connected to the bus 10, and data and programs can be exchanged via a communication line.
[0010]
FIG. 3 is a diagram showing a processing flow of character recognition processing according to the embodiment of the present invention.
As shown in the figure, an image of a document or a form is acquired by an image input means such as a scanner 17, and preprocessing such as noise removal and inclination correction is performed on the acquired image (step S1). Next, one character string is extracted from the image (step S2), and handwriting / printing according to the present invention is identified (step S3). When the extracted character string is determined as a handwritten character, the flag is set to 1, and when it is determined as a printed character, the flag is set to 2. Further, when it is impossible to discriminate between handwritten characters and printed characters, the flag is set to 3 (step S4).
[0011]
In the character recognition process, the flag is referred to and character recognition is performed as follows (steps S5 to S6).
(1) When the handwritten character is determined, a handwritten character recognition process is performed.
(2) When it is determined as a type, type recognition processing is performed.
(3) When it is impossible to discriminate between handwritten characters and characters, a handwritten character recognition process and a character recognition process are performed, and the one with higher reliability is adopted.
As described above, it is possible to improve the reliability of the character recognition process by determining whether the character is a handwritten character or a character in advance and performing the character recognition process. Since character recognition can be performed without performing processing, the processing speed can be increased.
[0012]
Next, an embodiment of the present invention will be described with respect to the handwriting / type discrimination process in the character recognition process.
FIG. 4 is a diagram showing a processing flow of handwritten type discrimination according to the embodiment of the present invention.
As shown in FIG. 4, first, a connected component is extracted from the characters of the extracted character string (step S1). For example, if the extracted character string is “modification” shown in FIG. 14, the first character “MO” is connected to all the pixels constituting the character, so that as shown in FIG. Extracted as a connected component. For example, for the next character “di”, five connected components are extracted.
Next, the muddy point is removed from the extracted connected components (step S2), and the connected components are integrated (step S3). That is, the connected components remaining in the muddy point removal process are integrated under appropriate conditions and separated into units constituting characters. As the integration condition, for example, circumscribed rectangles of connected components are compared and integrated when there is an overlap.
In the case of the example in FIG. 14, as shown in FIG. 4, the “d” cloud point is removed and the remaining connected components are integrated.
[0013]
In step S4, lowercase letters are removed. In the case of the example in FIG. 14, the lowercase “yo” is removed.
In step S5, the remaining number of characters is compared with a predetermined threshold value. If the remaining number of characters is smaller than the threshold value, the process goes to step S6 and the flag is set to 3 because the handwriting / printing type cannot be determined. End.
[0014]
That is, when the number of extracted characters is small, there is a possibility of erroneous determination because the reliability of the variation value of the center position of the characters described later is poor. Therefore, when the number of extracted characters is small, the determination is suspended, and both handwriting / printing recognition processing is performed as described in (3) above, and the one with higher reliability is adopted.
If the number of remaining characters is larger than the threshold, the process goes to step S7 to calculate the regularity of the character center position. That is, as shown in FIG. 5, each character is separated and extracted, and the center position in the height direction of each character is obtained. Then, the variation in the center position of each character is digitized by dispersion or the like.
Next, in step S8, it is determined whether the character is a handwritten character or a printed character according to the variation in the center position. That is, if the variation in the center position is large, the flag is set to 1 because it is a handwritten character string, and if the variation is small, the flag is set to 2 because it is a character string.
[0015]
FIG. 6 is a processing flow showing a method for removing the muddy point. As a muddy point removal method, as shown in the process flow of FIG. 6, first, in step S1, a small connected component that is above a predetermined value and is extracted is extracted. For example, as shown in FIG. 7, in the case of “di”, connected components c, d, e larger than a predetermined value are extracted.
Next, in step S2, the distance between the extracted connected components is calculated. In the case of the example in FIG. 7, the mutual distance between the connected components c, d, e is calculated.
In step S3, it is checked whether there is an object whose distance between connected components is smaller than a threshold value. Then, if there is no distance between connected components that is smaller than the threshold value, the process is terminated because there is no cloud point. If the distance between the connected components is smaller than the threshold value, the process goes to step S4 to remove the connected component pair whose distance is smaller than the threshold value as a muddy point. In the case of FIG. 7, since the distance between the connected components d and e is smaller than the threshold value, it is removed as a muddy point. As a result, the muddy point candidates 1 and 2 are removed as shown in FIG.
[0016]
After removing the cloud point as described above, the connected components are integrated as described above and separated into units constituting characters.
FIG. 9 shows a process flow for removing lowercase letters from characters separated and extracted by this process.
First, in step S1, a reference character is extracted, and in step S2, the height and lower coordinates of the reference character are extracted.
[0017]
For example, the first character (the leftmost character in the horizontally written character string in FIG. 10) is used as a reference character, and the height and the lower coordinate in the height direction are extracted. In addition, the character with the maximum height in the character string may be used as the reference character.
In step S3, the reference character is compared with the lower coordinates of each character, and the difference (or ratio) between the height of the character whose difference in the lower coordinates is equal to or less than a predetermined value and the height of the reference character is calculated.
For each remaining character, a character whose height is smaller than the height of the reference character and whose lower coordinate is close to the lower coordinate of the reference character is removed as a lower case letter (step S4). For example, in the example of FIG.
Hereinafter, as described above, the regularity of the center position of the character with respect to the remaining character after removing the lowercase letters is obtained by quantifying the character by, for example, the distribution of the coordinates of the center position in the height direction of the character. Depending on the size, it is determined whether it is handwritten or typed.
[0018]
As described above, in this embodiment, handwriting / type discrimination is performed based on the regularity of the center position of the character in the character string from which the dakuten and lowercase characters are removed. Can be performed with high accuracy even if it is included in the target.
In addition, although the discrimination processing based on the regularity of the center position of the character shown in the present embodiment may be used alone, handwriting / printing discrimination may be performed. It is also possible to extract a plurality and collectively determine these features. In this case, as a discrimination method, not only discrimination based on the magnitude of the feature amount but also a method such as discriminant analysis can be used.
[0019]
Next, an example in which layout information is used for handwriting / type discrimination processing will be described.
Shows a process flow in this case is shown in FIG. 11 also shows the processing conceptual diagram of this case is shown in FIG. 12. First, in step S1 of FIG. 11, the layout of the form is analyzed with respect to the form converted into image data, and in step S2, the item and the subsequent character entry area are divided into groups.
For grouping, a method such as creating a group for each column of a form can be used. For example, in the case of FIG. 12, the items are grouped into item characters such as “bank name”, “branch name”, “account number”, “money amount”, and subsequent character entry fields.
In step S3, character recognition is performed on the item area of the form to recognize the item character.
[0020]
Next, in step S4, the character type of the character entry area is determined from the information obtained as a result of character recognition. For example, when “amount” is written in the item area, the character type of the character entry column belonging to the same group as this item is estimated as a number. As a method of estimating the character type, for example, as shown in FIG. 11, the character type of the character entry area can be determined by referring to the correspondence data of the character type in the item and the subsequent character entry field.
In step S5, the feature amount is extracted only for the character type obtained from the determination result, and the handwritten character / type determination process is performed. As the feature amount, variations in the center position of characters included in the character string shown in the first embodiment, variations in character height, and the like can be used.
Further, as a determination method, a threshold value in which a feature amount is set in advance is compared, and if the amount of variation is equal to or less than the threshold value, it can be determined as a printed character, and if it is equal to or greater than the threshold value, it can be determined as handwritten characters. It is also possible to use a plurality of feature amounts to make a comprehensive discrimination on the feature space by discriminant analysis or the like.
[0021]
As described above, the handwriting / type discrimination processing is performed by limiting the character type in the character entry field based on the character recognition result of the item area using the layout information of the form. Can be improved.
In the above, for unknown forms, the character type of the character entry area belonging to the same group as this item is limited based on the result of recognizing the characters in the item area, and handwriting / type discrimination is performed. However, if the layout information of the form and the character type information of the character entry area are registered in advance, the layout information of the form to be processed is extracted first, the form is collated using the layout information, and registered. In the case of a match with a registered form, it is possible to perform handwriting / type discrimination by limiting the character type for each character entry field using the character type information for the character entry field of the registered form.
[0022]
Next , another example in which layout information is used for handwriting / type discrimination processing will be described. The processing flow is shown in FIG.
In step S1 of FIG. 13, the form structure is analyzed for the form converted into image data, and layout information is acquired.
Next, in step S2, the character entry areas of the form are grouped. For grouping, a method such as creating a group for each column of a form can be used. For example, in the case of the form shown in FIG. 12, the text entry field (group 1) of the item “bank name”, the text entry field (group 2) of the item “branch name”,..., The item “amount” (group) Group them into 4) character entry fields.
[0023]
In step S3, handwriting / type discrimination is performed for each character string in each character entry field. That is, in the case of FIG. 12, handwriting / type discrimination in the character entry column is performed for groups 1 to 4.
In step S4, the handwritten / type discrimination results are totaled for each group, and in step S5, the discrimination result of the larger number is the correct answer and the discrimination result of the lesser number is the larger one. Correct the discrimination result.
For example, in the example of FIG. 12, when there are many results for group 1 that are determined as handwritten characters and there are few results that are determined as printed characters, it is determined that the group is a handwritten character, and the results determined as printed characters are Correct the discrimination result. On the contrary, when there are many results determined as a type and few results as a handwritten character, the result determined as a handwritten character is corrected to a result of type determination. Similarly, the discrimination result is corrected for each of the groups 1 to 4.
As described above, using the layout information of the form, the handwriting / type discrimination results of each group are totaled, and the handwriting / type discrimination processing results of the character entry fields belonging to the same group are corrected based on the tabulation results. As in the second embodiment, it is possible to improve the accuracy of handwritten character / type discrimination processing.
[0024]
【The invention's effect】
As described above, Oite the present invention can obtain the following effects.
(1) Since the regularity of the center position of the character obtained without including the cloud point is extracted from the character string to distinguish between handwritten characters and printed characters, any characters such as katakana and hiragana that may contain a cloud point It is possible to distinguish between handwritten / printed columns.
(2) Since the regularity of the center position of the character is not included in the character string and the regularity of the center position of the character is extracted to distinguish between handwritten characters and printed characters, any characters such as katakana and hiragana that may contain lowercase characters It is possible to distinguish between handwritten / printed columns.
In addition, when the number of characters in the target character string is small, the determination result of the determination hold (reject) is output, and the recognition is left to both handwriting and type recognition processing, so that the character recognition accuracy is not impaired.
[Brief description of the drawings]
FIG. 1 is a principle diagram of the present invention.
FIG. 2 is a diagram illustrating a configuration example of a system to which the present invention is applied.
FIG. 3 is a diagram illustrating a processing flow of character recognition processing according to an embodiment of the present invention.
FIG. 4 is a diagram showing a processing flow of handwritten type discrimination according to an embodiment of the present invention.
FIG. 5 is a diagram for explaining a concept of handwriting / type discrimination processing according to the embodiment.
FIG. 6 is a processing flow showing a method for removing muddy spots.
FIG. 7 is a diagram illustrating a method for removing muddy points.
FIG. 8 is a diagram for explaining the concept of muddy point removal processing.
FIG. 9 is a diagram showing a processing flow for removing lowercase letters.
FIG. 10 is a diagram for explaining the concept of lowercase removal processing.
FIG. 11 is a processing flow of an example in which layout information is used for handwriting / type discrimination processing .
12 is a diagram for explaining the processing concept of the example of FIG . 11 ;
FIG. 13 is another example of a processing flow in which layout information is used for handwriting / type discrimination processing .
FIG. 14 is an example of katakana type characters including dakuten and lower case, which are problematic in the conventional method.
[Explanation of symbols]
11 CPU
10 buses 12 RAM
13 ROM13
14 Communication Interface 15 External Storage Device 16 Storage Medium Reading Device 17 Scanner 18 Input / Output Device

Claims

A character string is extracted from a document image obtained by converting the document into image data, the center position in the height direction of each character in the character string is obtained, and the regularity of the center position in the height direction of all characters A character recognition method for determining whether a character string is a handwritten character or a type, and performing character recognition based on the determination result,
A character recognition method characterized by discriminating a handwritten character from a printed character based on the regularity of the center position of the character obtained without including a cloud point included in the character string.

Extracting a character string from a document image obtained by converting the document into image data, obtaining a center position in the height direction of each character in the character string, and by regularity of the center position in the height direction of all characters, A character recognition method for determining whether the character string is a handwritten character or a type, and performing character recognition based on the determination result,
A character recognition method characterized by discriminating between handwritten characters and printed characters based on the regularity of the center position of the characters obtained without including lowercase letters included in the character string.

A character string extracting means for extracting a character string from a document image obtained by converting the document into image data;
A handwriting / type discrimination unit that obtains a center position in the height direction of each character in the character string and discriminates whether the character string is a handwritten character or a type by regularity of the center position in the height direction of all characters;
A character recognition apparatus comprising character recognition means for performing character recognition based on the determination result, wherein the handwriting / printing type determination means obtains a center position of each character by removing a cloud point from the character in the character string. The second means for obtaining the center position of each character by removing lowercase letters from the character string, the regularity of the center position of the character obtained by the first and second means, and the size of the variation A character recognition apparatus comprising a third means for determining whether a character string is a handwritten character or a printed character.